Extended Data Table 1:
Population | Country | Language family | Date | Sample size | Data type | Reference |
---|---|---|---|---|---|---|
| ||||||
Shum Laka | Cameroon | ~8000–3000 BP | 4/1/1 | 1240k/DG/SG | This paper | |
Ancient Malawi HG | Malawi | ~8100–2500 BP | 7* | 1240k | [22] | |
Mota | Ethiopia | ~4500 BP | 1 | SG | [23] | |
Ancient South African HG | South Africa | ~2000 BP | 3† | SG | [21,22] | |
Taforalt | Morocco | ~15,000–14,000 BP | 6 | 1240k | [26] | |
Altai Neanderthal | Russia | ~120,000 BP | 1 | DG | [78] | |
Aghem | Cameroon | NC | Present | 28 | HO | This paper |
Bafut | Cameroon | NC | Present | 11 | HO | This paper |
Baka | Cameroon | NC | Present | 2 | DG | [20] |
Bakoko | Cameroon | NC | Present | 1 | HO | This paper |
Bakola | Cameroon | NC | Present | 2 | DG | [20] |
Bangwa | Cameroon | NC | Present | 2 | HO | This paper |
Bedzan | Cameroon | NC | Present | 2 | DG | [20] |
Fulani | Cameroon | NC | Present | 2 | DG | [20] |
Lemande | Cameroon | NC | Present | 2 | DG | [25] |
Mada | Cameroon | AA | Present | 2 | DG | [20] |
Mbo | Cameroon | NC | Present | 21 | HO | This paper |
Ngumba | Cameroon | NC | Present | 2 | DG | [20] |
Tikar | Cameroon | NC | Present | 2 | DG | [20] |
Agaw | Ethiopia | AA | Present | 2 | DG | [20] |
Aka (Biaka) | Central African Republic | NC | Present | 20/2 | HO/DG | [22,25] |
Chewa | Malawi | NC | Present | 11 | HO | [22] |
Dinka | Sudan | NS | Present | 7/4 | HO/DG | [22,25] |
French | France | IE | Present | 3 | DG | [25] |
Hadza | Tanzania | KS | Present | 5(2)/1 | HO/DG | [22,25] |
Han | China | ST | Present | 4 | DG | [25] |
Herero | Namibia | NC | Present | 2 | DG | [25] |
Khoesan | Namibia | KS | Present | 22 | HO | [22] |
Mbuti | DR Congo | NC, NS | Present | 10/4 | HO/DG | [22,25] |
Mende | Sierra Leone | NC | Present | 8/2 | HO/DG | [22,25] |
Mursi | Ethiopia | NS | Present | 2 | DG | [20] |
Sandawe | Tanzania | KS | Present | 22 | HO | [22] |
Somali | Kenya | AA | Present | 13 | HO | [22] |
Yoruba | Nigeria | NC | Present | 70/3 | HO/DG | [22,25] |
List of populations used in analyses in the study. Data types are in-solution targeted SNP capture (1240k), whole-genome sequence with pseudo-haploid genotype calls (SG), high-coverage whole-genome sequence with diploid genotype calls (DG), and Human Origins SNP array (HO). For some populations, we used different sample sets for different analyses, indicated by slashes; Human Origins array genotyped individuals were used for PCA and for f-statistics testing differential relatedness to Shum Laka (Fig. 3B, Extended Data Fig. 3B). For Hadza, we used five individuals with Human Origins data for PCA and two of those five individuals for admixture graph modeling. HG, hunter-gatherers; AA, Afroasiatic; IE, Indo-European; KS, Khoesan; NC, Niger-Congo; NS, Nilo-Saharan; ST, Sino-Tibetan.
Individuals from Hora, Chencherere, and Fingira.
Individuals from Ballito Bay (A and B) and St. Helena Bay.