Skip to main content
. 2018 Oct 12;210(4):1429–1452. doi: 10.1534/genetics.118.301502

Figure 1.

Figure 1

Visual representation of the haplotype- and MLG-based statistics. For all panels, the total area of the orange or blue squares within a panel represents the value of the statistics. Hard sweep scenarios are in orange, and soft sweeps are in blue. (A) Under a hard sweep (left), a single haplotype rises to high frequency, p1, so the probability of sampling two copies of that haplotype is p12. Choosing p1 as the largest frequency yields H1 (dark orange area), while pooling p1+p2 as the largest frequency yields H12 (total orange area). Under a soft sweep (right), pooling the largest haplotype frequencies results in a large shaded area, and therefore H12 has a similar value for both hard and soft sweeps. (B) Under Hardy-Weinberg equilibrium, a single high-frequency haplotype produces a single high-frequency MLG (frequency q1). Pooling frequencies up to q3 has little effect on the value of the statistic, thus G1, G12, and G123 have similar values. When two haplotypes exist at high frequency, three MLGs exist at high frequency. Under a soft sweep, pooling the largest two MLGs (G12) may provide greater resolution of soft sweeps than not pooling (G1), and pooling the largest three creates a statistic (G123) truly analogous to H12.