Population Stratification and Unreported Variants, Related to Figures 2A–2C and STAR Methods
Top: Population stratification. Maximum allele frequency difference as a function of population differentiation. Blue lines are LOESS fits after excluding populations with 10 or fewer samples. Deletions (Left), Insertions (Centre), Duplications (Right). Bottom: Variants not present in 1000G or SGDP. Continental-specific (red) or population-specific (green) variants (n > 2) in the HGDP that are not found in the 1000G or SGDP SV callsets, binned by allele frequency. The same variant can be present in both distributions.