Figure - PMC

Skip to main content

An official website of the United States government

Here's how you know

Here's how you know

Official websites use .gov
A .gov website belongs to an official government organization in the United States.

Secure .gov websites use HTTPS
A lock ( ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.

View full-text article in PMC

. 2021 Oct 1;39(1):msab291. doi: 10.1093/molbev/msab291

Search in PMC
Search in PubMed
View in NLM Catalog
Add to search

© The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

PMC Copyright notice

Fig. 1. — Schematic of the MK regression. The MK regression consists of two components: a generalized linear model and a McDonald–Kreitman-based likelihood function. First, I assume that, in a site-wise manner, the rate of adaptive evolution ( $ω_{a}$ ) at a functional site is a linear combination of local genomic features followed by an exponential transformation, in which regression coefficient β_i indicates the effect of the ith feature on adaptive evolution. Similarly, I assume that the probability of observing a SNP ( $P_{func}$ ) at the same functional site is another linear combination of the same set of genomic features, followed by a logistic transformation. Second, in the McDonald–Kreitman-based likelihood function, I combine $ω_{a}$ and $P_{func}$ at every functional site with two neutral parameters, $D_{neut}$ and $P_{neut}$ , to calculate the probability of observed divergence and polymorphism data given model parameters. $D_{neut}$ and $P_{neut}$ denote the expected number of substitutions and the probability of observing a SNP at a neutral site, respectively. $D_{func}$ denotes the expected number of substitutions at a functional site.