Table 1. Data sets included in the combined data set and their sample sizes.
Sample Size |
||||||
---|---|---|---|---|---|---|
Data Set Name | Original Data Set | MS5795a | MS5547b | MS5435c | Obtained From | Reference(s) |
HGDP-CEPH | 1048 | 1046 | 966 | 947 | Rosenberg laboratory | Rosenberg et al. 2002, 2005; Ramachandran et al. 2005 |
Native American | 436 | 418 | 363 | 338 | Rosenberg laboratory | Wang et al. 2007 |
Latino | 249 | 246 | 244 | 241 | Dataset S1 of | Wang et al. 2008 |
Wang et al. (2008) | ||||||
Jewish | 80 | 79 | 79 | 77 | Rosenberg laboratory | Kopelman et al. 2009 |
Asian Indian | 432 | 430 | 430 | 430 | Rosenberg laboratory | Rosenberg et al. 2006 |
Chha Gaam Patel (CGP)d | 249 (203) | 203 | 185 | 180 | Rosenberg laboratory | Pemberton et al. 2012 |
Pacific Islander | 936 | 847 | 756 | 709 | F. Friedlaender and | Friedlaender et al. 2008 |
J. Friedlaender | ||||||
African | 2561 | 2526 | 2524 | 2513 | Supplement of Tishkoff et al. (2009) | Tishkoff et al. 2009 |
MS5879a | MS5631b | MS5519c | ||||
Chimpanzee | 84 | 84 | 84 | 84 | Dataset S1 of | Becquet et al. 2007 |
Becquet et al. (2007) |
MS5795 and MS5879 represent the complete combined-human and combined-human–chimpanzee data sets, respectively, and they include intra-population relative pairs.
MS5547 and MS5631 are constructed from MS5795 and MS5879, respectively, by the removal of a member of every intra-population first-degree relative pair (Table S21).
MS5435 and MS5519 are constructed from MS5547 and MS5631, respectively, by the removal of a member of every intra-population second-degree relative pair (Table S22).
Some Gujarati individuals were included in both the Rosenberg et al. (2006) and Pemberton et al. (2012) studies. The number of unique individuals included from the CGP data set is given in parentheses.