Figure - PMC


Proc Natl Acad Sci U S A. 2010 Jan 19;107(5):2147–2152. doi: 10.1073/pnas.0909000107

Fig. 3. Cumulative probability distribution of SNP nucleotide diversity between HuRef and the reference sequence in 5,000-bp regions. (A) The distributions from simulated demographic models conditioned on the presence of a polymorphic insertion (demographic models shown in blue). (B) The unconditional distributions of the demographic models (shown in gray). The orange line is the observed distribution in regions surrounding polymorphic insertions, while the red line is the observed distribution of 2,432 randomly chosen genomic regions. The best-fitting demographic model is the maximum likelihood estimate among all three-parameter demographic models considered, with a large ancient population size of N_A = 18,500 starting at t = 1.2 Mya (see Materials and Methods). Because genealogies that contain polymorphic mobile elements are ancient, the best-fitting model is clearly differentiated from the constant population-size model in A. In contrast, the two models are nearly indistinguishable in B, demonstrating that the unconditional distribution of nucleotide diversity contains relatively little information about ancient population history: only very large changes in ancient population size (N_A = 50,000) produce a noticeable effect. For the constant population-size model, the effective population size is n = 9,244, which is the effective population size for HuRef and the reference sequence based on genome-wide estimates of nucleotide diversity (23). The best-fitting model is significantly more likely than the constant population-size model (P = 2.5 × 10⁻¹⁶, likelihood-ratio test). The differences in the observed distributions for regions surrounding polymorphic insertions and regions chosen at random are highly significant (P ≪ 10⁻³⁰, χ²; Table S1). Nucleotide diversity is also stochastically greater in regions surrounding polymorphic insertions than in regions chosen at random (P ≪ 10⁻³⁰, Mann-Whitney U).
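
As a rough illustration of the two-sample comparison described in the caption, the Python sketch below builds an empirical cumulative distribution and computes a Mann-Whitney U statistic with a one-sided P value from the normal approximation. The diversity values are made-up toy numbers, not data from the study, and the function names are hypothetical. It is a sketch of the general technique only and should not be read as the authors' analysis.

```python
import math
from bisect import bisect_right


def empirical_cdf(values):
    """Return F, where F(x) is the fraction of values less than or equal to x."""
    ordered = sorted(values)
    n = len(ordered)

    def cdf(x):
        return bisect_right(ordered, x) / n

    return cdf


def mann_whitney_u(sample_a, sample_b):
    """U statistic for sample_a: count of pairs (a, b) with a > b; ties count 0.5."""
    u = 0.0
    for a in sample_a:
        for b in sample_b:
            if a > b:
                u += 1.0
            elif a == b:
                u += 0.5
    return u


def one_sided_p_normal(u, n_a, n_b):
    """Approximate P(U >= u) under the null hypothesis.

    Uses the large-sample normal approximation without a tie correction,
    so it is only indicative for samples as small as the toy data below.
    """
    mean = n_a * n_b / 2.0
    sd = math.sqrt(n_a * n_b * (n_a + n_b + 1) / 12.0)
    z = (u - mean) / sd
    return 0.5 * math.erfc(z / math.sqrt(2.0))


# Toy per-region diversity values (hypothetical, for illustration only).
near_insertions = [0.0012, 0.0018, 0.0009, 0.0025, 0.0015]
random_regions = [0.0006, 0.0010, 0.0004, 0.0012, 0.0008]

cdf_random = empirical_cdf(random_regions)
u_stat = mann_whitney_u(near_insertions, random_regions)
p_value = one_sided_p_normal(u_stat, len(near_insertions), len(random_regions))

print("F_random(0.0010) =", cdf_random(0.0010))
print("U =", u_stat)
print("one-sided P (normal approx.) =", p_value)
```

A U statistic well above half the number of pairs indicates that values in the first sample tend to be larger, which is the sense in which the caption calls one distribution "stochastically greater" than the other.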