Abstract
Few African Americans have been able to trace family lineages back to ancestors who died before the 1870 United States Census, the first in which all Black people were listed by name. We analyze 27 individuals from Maryland’s Catoctin Furnace African American Cemetery (1774–1850), identifying 41,799 genetic relatives among consenting research participants in 23andMe, Inc.’s genetic database. One of the highest concentrations of close relatives is in Maryland, suggesting that descendants of the Catoctin individuals remain in the area. We find that many of the Catoctin individuals derived African ancestry from the Wolof and Kongo groups and European ancestry from Great Britain and Ireland. This study demonstrates the power of joint analysis of historical DNA and large datasets generated through direct-to-consumer ancestry testing.
One Page Summary:
Introduction:
Genetic analysis of historical individuals has the potential to help restore knowledge of people whose stories were omitted from written records. During the late 18th and early 19th centuries, Catoctin Furnace in Maryland relied on a workforce of enslaved individuals to operate the iron furnace and carry out domestic and agricultural tasks. Despite the role that Catoctin Furnace played in early United States history (including supplying munitions to combatants during the Revolutionary War), relatively little is known about the African Americans who labored there or their descendants compared to the furnace’s later predominantly white workforce.
Rationale:
We produced whole-genome data for 27 individuals buried in the Catoctin Furnace African American cemetery and compared them to approximately 9.3 million consenting research participants genotyped by 23andMe, Inc. to address the following questions: (i) How were the Catoctin individuals related to each other? (ii) What were the sources of their African and European ancestry? (iii) Where in the United States do their genetic relatives live today, including their direct descendants? (iv) What can their genomes reveal about their health?
Results:
We identified five genetic families, consisting of biological mothers, children, and siblings, among the Catoctin individuals. In most cases, biological family members were buried in close proximity. All but one of the Catoctin individuals had primarily African ancestry, with variable amounts of European ancestry. To learn more about their ancestry we developed a new approach to detect identical-by-descent segments of the genome shared between the Catoctin individuals and 23andMe research participants. Identical-by-descent segments of DNA are shared by two or more people because they have been inherited from a recent common ancestor. We identified 41,799 close and distant relatives of the Catoctin individuals among 23andMe research participants (Fig 0). Within Africa, we found the highest rates of genetic sharing between Catoctin individuals and research participants who self-identified as belonging to the Wolof or Kongo ethnolinguistic groups. Within Europe, we observed the highest rates of genetic sharing with research participants that have ties to Great Britain and Ireland.
Within the United States, we observed elevated rates of sharing with participants from the South, largely reflecting distant connections to 23andMe research participants with Sub-Saharan African ancestry (possibly tracing back to shared common ancestors in Africa). When we considered genetic relatives who share the most identical DNA with the Catoctin individuals, we observed the highest rates of sharing in Maryland, suggesting that at least some descendants stayed in the region following the furnace’s transition away from enslaved and paid African American labor. Finally, we found that some of the Catoctin individuals carried risk factors for sickle cell anemia and G6PD deficiency, genetic diseases that are common in African Americans today.
Conclusion:
These results demonstrate the power of joint analysis of DNA from historical individuals and the extremely large datasets generated through direct-to-consumer ancestry testing, and serve as a model for obtaining direct insights into the genome-wide genetic ancestry of enslaved people in the historical United States.
One Sentence Summary
Identifying genetic connections between historical African Americans from Catoctin Furnace and present-day people.
The vast majority of the ~45 million self-identified Black and/or African American individuals living in the United States (US) descend from approximately 456,600 enslaved Africans who were forcibly transported to the US from Africa during the transatlantic slave trade between 1501 and 1867 (1, 2). However, African Americans often have little information about these ancestors or their African origins due to a history of inhumane treatment of the enslaved and their descendants, which included marginalization and the obfuscation of family histories (3). In this study, we demonstrate that when combined with genome-wide data from a sufficiently large and diverse genetic database, DNA from historical individuals provides a means for restoring knowledge of familial connections between contemporary peoples and their historical relatives. Specifically, we report on the DNA of enslaved and free African Americans from Catoctin Furnace, Maryland, who lived, worked, died, and were buried there in the late 18th and early 19th centuries.
As early as December 1768, a tract of land was acquired in order to build an iron works at the foot of Catoctin Mountain near present-day Thurmont, Maryland (4). The furnace was in blast by 1776, producing pig iron, tools, household items, and munitions used during the Revolutionary War. At least 271 enslaved and an unknown number of free African Americans worked at Catoctin within and outside the furnace, as ore miners, colliers, forgemen, fillers, teamsters, and woodcutters, as well as in domestic and agrarian roles in the furnace owners’ households and plantations (5). In the second quarter of the 19th century, the furnace’s labor force switched primarily to wage labor and a primarily white workforce (6). Gradually, the contribution of African Americans in this early industrial complex was largely forgotten. The Catoctin Furnace African American cemetery, near an old ore pit, was excavated in 1979–1980 in advance of highway construction (Fig. 1A) (7–10). The Maryland State Highway Administration transferred stewardship of the recovered remains of deceased humans to the Smithsonian Institution, where curator Dr. J. Lawrence Angel conducted preliminary forensic anthropological investigations (11).
Fig. 1. Burial context, genetic kinship, and ancestry of Catoctin individuals.
(A) Map showing the location of Catoctin Furnace and burials within the cemetery. Burial locations of the five genetic families are circled. The rectangle in the upper right shows a portion of the cemetery with unexcavated burials identified through ground penetrating radar (map adapted from (12), prepared by Robert Wanner). (B) Individuals, labeled according to burial ID, are grouped into families based on genetic relationships. Genetic sex, mt and Y haplogroups are indicated by marker shape, fill, and outline color, respectively. The type of genetic relationship is indicated by connector linestyle. Marker fill pattern indicates individuals with one or more copies of an allele associated with sickle cell anemia or glucose-6-phosphate-dehydrogenase (G6PD) deficiency. (C) Ancestry proportions assigned to each individual from representative African (YRI), European (GBR) and Indigenous American (Pima) populations drawn from the public dataset according to the qpAdm software. Error bars indicate one standard error. Asterisks (*) indicate cases where damage restricted data were analyzed. Hash symbols (#) and plus signs (+) indicate models with p-values <0.01 or ancestry proportion estimates that fall more than three standard errors outside the range of 0–1, respectively.
The Catoctin Furnace Historical Society, Inc. (CFHS) was initially founded to save the Catoctin Furnace village and its archaeological and architectural heritage from this highway construction (4). In recent years, its mission expanded to include restorative justice, highlighting the critical role that enslaved and free African Americans played in the furnace’s history and in the growth of industrial wealth and power in the young US. In 2015, a grant from the Maryland Heritage Areas Authority supported further scientific analysis of individuals buried in the cemetery. The first phase of the project involved historical documentary research and osteological re-analysis that refined previous assessments of demography and bone and dental pathology, with testing for stable carbon and nitrogen isotopes, and trace elements to shed light on the life histories of the individuals (12). The project’s second phase used DNA to explore their biogeographic ancestries and relationships to one another, details of which were summarized in a technical note (13) and are expanded upon here. These data address a critical component of CFHS’s mission, pursued jointly with the African American Resources Cultural and Heritage Society (AARCH) of Frederick County, to return knowledge to the African American community and identify descendants of Catoctin’s enslaved and free workers.
Advances in ancient DNA (aDNA) technology have made it possible to use genetic data as a tool for restoring knowledge of enslaved and historically marginalized peoples whose stories were often omitted from or disregarded in written records. Studies of the New York African Burial Ground (14, 15), the Anson Street ancestors (16, 17) and others (18, 19) used a combination of anthropological and biomolecular tools to provide insight into the identity and life history of enslaved individuals through the study of their remains. However, their ability to localize the African origins of these individuals was limited by the exclusive use of mitochondrial (mt) DNA (20) and/or reliance on comparisons with data from publicly available reference datasets (21, 22). This study shows how deeper insights into the precise ancestral origins and genetic legacy of enslaved and free African Americans, such as those buried at Catoctin Furnace, can be obtained from genome-wide aDNA data when compared to a reference database containing genetic data from millions of living people, like the one maintained by 23andMe, Inc.
We recovered genome-wide aDNA from all 27 of the Catoctin individuals who were selected for sampling, targeting ~1.2 million single nucleotide polymorphisms (SNPs) using a capture-based approach (23–26), which we combined with imputation to further increase the amount of available genetic information (Table S1). By comparing the DNA of the Catoctin individuals with genotype data from 9,255,493 participants in the 23andMe cohort, all of whom consented to participate in research, we were able to learn about their biogeographic ancestries, genetic relationships with one another, and to provide insight into their genetic legacy by identifying identical-by-descent (IBD) connections with living relatives.
Community Engagement and Ethical Considerations
This research analyzed data from deceased individuals who were unable to directly consent to participate in this study, as well as from millions of research participants (including those genotyped by 23andMe) who actively consented to participate in research. The ties between present-day African Americans, their ancestors within the US, and their ancestors in Africa were forcibly severed by the transatlantic slave trade, the centuries-long institution of slavery, and generational systemic racist practices that have endured after the abolition of race-based slavery (27), as illustrated by Frederick Douglas’ famous words: “Genealogical trees do not flourish among slaves” (28). Our objective is to contribute to the restoration of memories of a past community whose legacy was intentionally obscured and to create an avenue for living people to learn about their ancestors. We followed guidelines for the ethical analysis of the DNA of historical and living people (29, 30), including consultation with stakeholder groups, as emphasized in recent discussions on the future of studies involving the remains of African Americans (31–35). Based on interactions with stakeholders, we believe there is interest among African Americans and the public to harness aDNA to learn about historical connections to people who lived in the past, and to leverage this technology to develop accurate methods to identify genetic relationships, many of which were previously unknown. Equally important is the need to communicate the results of these analyses with descendants and others in a sensitive and accurate manner.
In the case of Catoctin Furnace, the goals of CFHS, developed in partnership with AARCH, include identifying descendants and widening the community of stakeholders. To date, researchers at CFHS have traced the lineages of two African Americans (one enslaved and one free) who labored at Catoctin Furnace to their living descendants by utilizing historical documents and genealogical data. Members of CFHS, AARCH, and the recently discovered descendants all expressed support for the use of genetic approaches that could identify a larger descendant community. Based on conversations with these stakeholders about their research interests, we sought not only to identify living genetic relatives of the Catoctin individuals, but also to conduct analyses to shed light on their life stories, such as identifying family relationships shared between the Catoctin individuals, exploring their African and European origins, and identifying biologically significant variants. In collaboration with CFHS and AARCH we also held a series of public events to directly return research results at various phases of the project (36). One of the ways in which stakeholders chose to honor the legacy of enslaved individuals at Catoctin during these events was through the act of reading the names of individuals that could be abstracted from a variety of sources, including land records, probate inventories, church records, diaries, and freedom-seeker ads (although not associated with specific burials) (5). We include these names in the supplementary text (36). Establishing family connections to living people through genetics contributes to remembering and honoring those buried at Catoctin.
In this study, we show that the joint analysis of DNA extracted from the remains of deceased humans and millions of living people in a re-contactable research cohort (i.e., a cohort in which participants can be asked follow-up questions or receive results), like the one maintained by 23andMe, makes it possible to recover previously unknown connections between present-day people and historical individuals from sites like Catoctin Furnace. Not all members of the Catoctin stakeholder community have a known genetic connection to Catoctin. While this study is responsive to community requests to use genetic approaches to identify descendants, future studies applying these methods should be cautious about the danger of contributing to the biologization of notions of identity, as genetic connections represent one of many ways in which people may feel connected to historical individuals. A full ethics statement is provided in (36). Additional ethical considerations involving the co-analysis of aDNA and data from private genetic databases are discussed in (37).
Multiple families buried at Catoctin Furnace
Among the 27 Catoctin individuals, we identified five distinct genetic families (labeled A–E) that are primarily composed of mothers, children, and siblings; in this study, the term “genetic family” refers to a group of people who are closely related biologically. Similarly, specific relationship terms (e.g., mother, son, or daughter) are used in a biological sense and based on genetic sex as inferred by the presence or absence of sex chromosomes; thus, they may not reflect the actual kin-based relationships, biological presentation, or gender identities recognized by the Catoctin individuals. Fifteen Catoctin individuals could be assigned to one of the five genetic families, while the remaining 12 individuals appear genetically unrelated, except for the individual from Burial 28, whose coverage was insufficient to confidently assign to Family B. Some unrelated individuals share mt or Y-chromosome haplogroups, which may indicate more distant relationships that fall outside the limits of our resolution. We used information about genetic sex and mt and Y haplogroups to further resolve these family groupings (Fig. 1B). Close genetic relatives tended to be buried near one another (Fig. 1A), while individuals who were buried separately from their genetic families were typically more distantly related. For instance, Family A consists of a mother (Burial 3) and two sons (Burials 1 and 2), interred side-by-side, in addition to a 2nd–3rd-degree relative (Burial 24) who was buried separately and whose exact relationship to the other individuals is unresolved. While genetic relatedness had a role in burial patterning, other factors, such as temporal context and cultural and religious practices likely contributed as well.
Sex-biased reproduction
The European ancestry of enslaved African Americans originated largely through a process whereby white men reproduced with Black women through rape. This gender-based sexual violence contributed to the brutal systematic enslavement of African Americans and frequently produced children born into slavery (22, 38). This pattern of behavior, a form of sex-biased admixture, is reflected in the distribution of the mt and Y haplogroups observed among the Catoctin individuals. Three of the 16 Catoctin males have Y haplogroups that are broadly associated with West Eurasian ancestry (Fig. S1). These include subclades of the R1a and R1b haplogroups which are common throughout Europe, indicating that their paternal lineages likely trace back to a fully European ancestor. In contrast, only one individual (Burial 32) has a European-associated mt haplogroup (J1b1a1a) (Fig. S2). This individual is an outlier with respect to ancestry as they also have a European-associated Y haplogroup (R1a1a1b1a3b) and over 50% European ancestry. Among other possible causes, their spatially separated grave, located in the northwestern edge of the burial ground, may reflect their distinct ancestral origins or lack of relatedness within the community represented in the cemetery.
Variable proportions of African, European, and Indigenous American-related ancestry in Catoctin individuals
All individuals (except the individual from Burial 32) were assigned a majority of African ancestry by qpAdm, using a model designed to estimate each individual’s African, European, and Indigenous American ancestry proportions (Fig. 1C, Table S2–3). Over one quarter of individuals (n=7) could be modeled as having no detectable European admixture (i.e., the amount of ancestry assigned to the European source population is within a single standard error of zero). This is in sharp contrast to nearly all present-day self-identifying African Americans who typically have at least some European-derived ancestry (e.g., there is an average of 24% European ancestry among 23andMe research participants who self-identify as African American (39)). Several individuals could be modeled as having Indigenous American ancestry, but in all cases, estimates were within three standard errors of zero. It is therefore uncertain based on these analyses alone whether these estimates represent true Indigenous American ancestry.
We next imputed genotypes across the whole genome for each Catoctin individual using the software GLIMPSE (40) with an approach optimized for low-coverage, capture-based aDNA data. To ensure that the imputation process would not bias results, we tested the performance of the Templated Positional Burrows–Wheeler Transform (41) (TPBWT) IBD detection tool and 23andMe’s Ancestry Composition (42) tool on a dataset composed of 48 high-coverage ancient individuals who were downsampled to varying degrees (36). Further, we compared the results of ADMIXTURE (Fig. S3), PCA (Fig. S4), and qpAdm (Fig. S5) analyses using the imputed and non-imputed datasets to ensure that the imputation process did not substantially bias ancestry estimates. We observed a bias towards estimating excess European-related ancestry in the lowest coverage individuals when using the imputed dataset relative to the non-imputed dataset; however, this did not appear to substantially impact individuals with >0.5x coverage, which is the group we focused on for subsequent analyses.
Using the imputed dataset, we applied 23andMe’s Ancestry Composition tool (42) to infer local ancestry along the chromosomes of the Catoctin individuals. Each region of their genomes was assigned to one of six previously determined broad ancestry categories: Sub-Saharan African, European, East Asian & Indigenous American, Western Asian & North African, Melanesian, and Central & South Indian (Fig. S6, Table S4). These assignments are correlated with qpAdm estimates (Fig. S7) and provide support for the identification of Indigenous American ancestry in several Catoctin individuals. For instance, we inferred low levels of Indigenous American ancestry in two brothers (Burials 1 and 2) from Family A, but not in their mother (Burial 3) (Fig. 2), suggesting that their un-sampled father had some Indigenous American ancestry.
Fig. 2. Ancestry composition chromosome paintings for members of Family.
A. Chromosome paintings demonstrating the biogeographic ancestry assigned to portions of the autosomal chromosomes for four related Catoctin individuals assigned to Family A—a mother, two sons, and their 2nd–3rd-degree relative. A likely pedigree for Family A (top left) describes their relationship to one another, although we note that the true relationship of the individuals from Burials 1, 2, and 3 to the individual from Burial 24 is uncertain. Across the genome, ancestry is assigned to one of six ancestry components defined using 23andMe reference populations: Sub-Saharan African (purple), West Asian & North African (dark blue), European (dark teal), East Asian & Indigenous American (orange), Melanesian (yellow), and Central & South Asian (green). Unassigned regions are shown in white.
Identity-by-Descent (IBD)
To learn more about the biogeographic ancestry and genetic legacy of the Catoctin individuals, we searched for identical-by-descent (IBD) segments of the genome–that is, long segments of DNA that are identical in two or more people because they have been inherited from a recent common ancestor. We searched for IBD shared between each of the Catoctin individuals and ~9.3 million 23andMe research participants. We identified 55,342 IBD segments shared between the historical Catoctin individuals and 41,799 research participants, ranging up to 60 cM in length (Table 1, Figs. S8–9, Tables S5–6). We calculated the total IBD shared between each pair of individuals to estimate their most likely genetic relationship; however, we caution that we are likely underestimating the true amount of DNA shared between these individuals, particularly among close relatives (36). In Box 1, we discuss how the relationships between Catoctin individuals and research participants with whom they share DNA can be interpreted (Table S7), noting that not all present-day individuals who share DNA with Catoctin individuals are direct descendants. In fact, most connections are likely between collateral relatives—relatives who are neither direct ancestors nor descendants of one another, but instead both descend from a common ancestor who lived generations before the Catoctin individuals. Further, many of the most distant relatives that we identified may not share a common ancestor who lived in the Americas. Instead, their connection may trace back to an individual who lived in Africa or Europe prior to their descendants’ arrival in the Americas, either willingly or as part of the transatlantic slave trade.
Table 1.
IBD shared between 22 Catoctin individuals and 23andMe participants
23andMe Participants included in group | Number of 23andMe participan ts in group | Proportion of 23andMe participants in group that share IBD with Catoctin Individuals | Number of IBD segments detected | Median Length of IBD segments detected (cM)* | Maximu m length of IBD segments detected (cM)* | Median total IBD in 23andMe participants with IBD detected (cM)* | Maximum total IBD shared with 23andMe participant s (cM)* |
---|---|---|---|---|---|---|---|
All participants | 9,255,493 | 0.45% | 55,342 | 10 | 60 | 9.9 | 280 |
Atlantic African cohort | 3,304 | 2.27% | 85 | 7.4 | 20 | 7.4 | 30 |
European cohort | 226,384 | 0.22% | 519 | 7.6 | 20 | 7.6 | 25 |
US cohort | 2,993,165 | 0.51% | 17,854 | 9.7 | 55 | 9.6 | 280 |
Members of US cohort with ≥5% Sub-Saharan African ancestry | 192,880 | 4.25% | 10,675 | 11 | 55 | 11 | 280 |
Members of US cohort with ≥99% European Ancestry | 1,896,655 | 0.26% | 5,123 | 8.0 | 30 | 8.0 | 40 |
Summary statistics describing frequencies and amounts of IBD shared between the 22 Catoctin individuals with >0.5x coverage and subsets of the 23andMe cohort. Participants were included in the US cohort either [1] if all four of their grandparents were born in the US, or [2] if they were born in the US and their grandparent birth location information was either unavailable or their grandparents were born in multiple countries. Similarly, members of the Atlantic African and European cohorts were determined using grandparent or participant birth location, with the additional requirement of ≥95% Sub-Saharan African ancestry or ≥99% European, respectively. Individual level results for each Catoctin individual are available in Table S5.
Values are rounded based on the magnitude of IBD sharing as follows: values >100 cM are rounded to the nearest ten, values between 30–100 cM are rounded to the nearest five, values between 10–30 cM are rounded to the nearest integer, and values <10 cM are rounded to one decimal place).
Box 1. Interpreting DNA shared between historical and present-day individuals.
The Catoctin Furnace African American Cemetery was in use between 1774–1850 (midpoint: 1812), while the average birth year of 23andMe research participants with genetic connections to Catoctin is 1974 (Table S22). The average separation between the death of the historical individuals and birth of present-day genetic relatives is ~162 years (although it is possible that as few as 70 years may have elapsed since the date of death of some of the latest Catoctin burials and the oldest research participants). The average age of reproduction among humans is 28–30 years, therefore the most likely number of generations separating the Catoctin individuals from their living relatives is about five to six generations (Fig. B1a).
The vast majority of the genetic relatives that we detected are not direct descendants of the Catoctin individuals. Instead, they are collateral relatives (i.e., relatives who are not direct ancestors or descendants) (Fig. B1b). To determine the likelihood that someone shares a direct descendant relationship with a Catoctin individual, we must consider the age of the two individuals and the amount of identical DNA (or IBD) that they share. Relatives who share very little IBD, but who lived close together in time (as is the case for most of the connections we identified), are less likely to share a direct descendant/ancestor relationship.
Fig. B1.
(A) A timeline showing the years in which the African American Cemetery at Catoctin Furnace was active and a histogram of birth years of research participants who share IBD with Catoctin (Table S23). (B) Examples of relationships that could be shared between individuals that were born five generations apart, with varying degrees of genetic separation. Median amount of IBD is reported for pairs of present-day and historical individuals (with 2x coverage) (Table S7).
Genetic connections to present-day Africans
We examined IBD segments shared between Catoctin individuals and members of the African cohort (i.e., 23andMe research participants with ≥95% Sub-Saharan African ancestry who indicated that either they or all four of their grandparents were born in Africa) to identify the geographical regions in Africa with which these present-day people are associated. We observed the highest rates of IBD sharing between the Catoctin individuals and participants linked to Senegal, Gambia, Angola, and Democratic Republic of the Congo (DRC) (Fig. 3A, Fig. S10, and Table S8), confirming via randomization testing that we would be unlikely to detect the same number of IBD connections (n=10) with an identically sized sample (n=166) of randomly selected individuals from the Atlantic African cohort (i.e., research participants in the African cohort with ties to specific Atlantic African countries, defined in (36)) (p-value<0.001) (Table S9).
Fig. 3. Genetic connections to the Catoctin individuals among members of the African cohort.
(A) The proportion of 23andMe participants in the African cohort who share IBD with Catoctin. Geographic coordinates are rounded to the nearest integer, and only coordinates that have at least 25 associated participants after 80% downsampling are shown. Marker size indicates the number of participants associated with each coordinate and color indicates the proportion of participants who share IBD with the Catoctin individuals. (B) IBD network demonstrating Catoctin individuals’ connections to members of the African cohort who share less than 700 cM of IBD with one another (N = 2,807). IBD clusters (represented by circles) are filled according to members’ average local African ancestry and arranged by average pairwise IBD sharing between clusters using a Force Atlas graph layout. Catoctin individuals, displayed as squares, are projected based on their average IBD shared with each cluster (shown as lines).
Many African ethnolinguistic groups occupy wide geographic ranges that cross present-day national borders. We therefore also determined the amount of IBD each Catoctin individual shared with genetic clusters in Atlantic Africa that correspond to the self-identified ethnolinguistic groups of research participants (Fig. 3B, Table S10). Of the 15 Catoctin individuals with detectable IBD connections to Africa, seven share a connection with only a single cluster. Six of these individuals have high proportions of Sub-Saharan African ancestry (>90%). In contrast, it is less common for present-day research participants with four grandparents born in the US and ≥50% Sub-Saharan African ancestry to have a connection to only a single cluster (36). Among the Catoctin individuals with connections to Atlantic Africans, the most commonly observed connections are to genetic groups associated with the Wolof, Mandinka, and Kongo (whose present-day geographic distribution is described in Table S11). Overall, the Catoctin individuals share relatively more connections with Senegambian groups, like Wolof and Mandinka, than do research participants with four grandparents born in the US and ≥50% Sub-Saharan African ancestry (Fig. 3, Supplementary Text S5).
Genetic connections to present-day Europeans
Next, we explored the rate of IBD sharing with members of the European cohort (i.e., 23andMe research participants with ≥99% European ancestry who indicated that either they or all four of their grandparents were born in Europe) to learn about the origins of the European-related ancestry that we observed in over half of the Catoctin individuals. We detect the highest rates of IBD among participants associated with Great Britain and Ireland (Fig. 4A, Table S12). Randomization testing confirms that we would be unlikely to detect the same number of IBD connections (n=467) with an identically sized sample (n=101,262) of randomly selected individuals from the European cohort (p-value<0.001) (Table S9). Much of the Irish-related signal is driven by connections to the individual from Burial 15, an adolescent male belonging to Family C, who, when projected onto a graph network of clusters that correspond to European geography again shares the most IBD with clusters of participants associated with the northern, western, and southeastern regions of the island of Ireland (Fig. 4B). Multiple other Catoctin individuals (in particular, those from Burials 2, 10, and 34) share broader connections with participants from Great Britain and Ireland (Fig. 4B, Fig. S11, Table S13).
Fig. 4. Genetic connections to the Catoctin individuals among members of the European cohort.
(A) Proportion of 23andMe participants in the European cohort who share IBD with Catoctin. Geographic coordinates are rounded to the nearest integer, and only coordinates that have at least 25 associated participants after 80% downsampling are shown. Marker size corresponds to the number of participants associated with each location, while the color indicates the proportion of participants who share IBD with the Catoctin individuals. (B) IBD network indicating Catoctin individuals’ connections to the 23andMe participants in the European cohort who share less than 700 cM with one another. Clusters are labeled by the geographic region with which the majority of cluster members are associated using ISO2 country abbreviations and when appropriate, prefixes to indicate the cardinal directions. Clusters are arranged by the average pairwise IBD sharing between clusters using a Force Atlas graph layout, with outlines that indicate participant’s average local European ancestry. Catoctin individuals, displayed as squares, are projected based on the average IBD shared with each European cluster (shown as lines).
Distant and close relatives in the United States
When we considered IBD sharing between research participants in the US cohort (i.e., research participants who indicated that either they or all four of their grandparents were born in the US) and each of the Catoctin individuals separately, each historical individual exhibited a unique pattern of IBD sharing with respect to geography. However, when we considered the Catoctin individuals together, we observed the highest rates of sharing between Catoctin and research participants from the southern US (Fig. 5A, Fig. S12, Table S14). This signal resembles the geographic distribution of 23andMe research participants with Sub-Saharan African ancestry in the US (Fig. S13) and is therefore plausibly driven by a higher rate of IBD sharing in genomic regions with Sub-Saharan African ancestry. To address this source of bias, we restricted our analysis to research participants in the US cohort with ≥5% Sub-Saharan African ancestry (Fig. 5B, Table S15). This filtering strategy increases the rate of IBD sharing from 0.45% of all US participants to 4.25% (Table 1).
Fig. 5. Geographic distribution of distant and close relatives of the Catoctin individuals among members of the US cohort.
(A) Proportion of 23andMe research participants in the US cohort who share IBD with Catoctin. Only coordinates representing at least 25 participants after 80% downsampling are shown. Marker size corresponds to the number of participants associated with each coordinate, while color indicates the proportion of participants with shared IBD. Marker outlines indicate the number of participants at each coordinate who share ≥30 cM of IBD with one or more Catoctin individuals. The same information is shown for (B) participants with ≥5% Sub-Saharan African ancestry and (C) participants with ≥99% European ancestry. (D) IBD network of the closest present-day relatives of Catoctin individuals among 23andMe research participants. Circles represent modularity clusters consisting of close Catoctin relatives (sharing ≥30 cM of IBD) along with their relatives (sharing ≥100 cM with a close relative of a Catoctin individual). Clusters are outlined according to their average ancestry and arranged by the average pairwise IBD sharing between clusters using a Force Atlas layout. Catoctin individuals, displayed as squares, are projected based on the average IBD shared with each familial group (shown as lines).
For participants included in this filtered dataset, we continued to observe elevated rates of IBD sharing with Catoctin in the southern US (including Maryland) (Fig. 5B). We confirmed via randomization test that we would be unlikely to detect the same number of IBD connections (n=2,034) with an identically sized sample (n=42,132) of randomly sampled individuals from the US cohort with ≥5% Sub-Saharan African ancestry (p-value <0.001) (Table S9). In contrast, when we filtered to include only participants in the US cohort with ≥99% European ancestry to focus on the genetic legacy of the admixed Catoctin individuals’ European ancestors along lineages with little to no African admixture, we did not detect any clear geographic patterns (Fig. 5C, Table S12).
When we focused on “close relatives” (referred to here as pairs of individuals who share at least 30 cM of IBD with a Catoctin individual, reflecting a relationship that is predicted to be 9th-degree or closer), we observed particularly pronounced connections to Maryland, identifying a total of 30 close relatives in the state (Table S16). A randomization test confirmed that we would be unlikely to identify 30 or more close relatives among an identically sized sample (n=19,972) of randomly selected individuals (p-value<0.001) (Table S17). These results suggest that at least some descendants of the Catoctin individuals or their family members remain in the Maryland area.
Next, we considered population substructure among the close relatives of the Catoctin individuals in the US using an IBD network approach. We analyzed 443 close relatives of the Catoctin individuals (≥30 cM of IBD) along with 4,385 of those participants’ closest relatives who share ≥100 cM of IBD—many of whom we hypothesize could be genetically related to the Catoctin individuals, even if we did not detect IBD sharing between them due to the high false negative rate of our approach—and identified 123 familial groups (Fig. 5D, Table S18). The positions of these familial clusters within the IBD network appear to be primarily correlated with the relative proportions of European and Sub-Saharan African ancestry detected among each cluster’s members, likely reflecting the ancestry of the individual IBD segments shared with the Catoctin individuals. In most cases, the familial groups do not appear to be correlated with geography. However, notably, all members of familial group 36 who provided location information have ties to Maryland, providing further evidence that Catoctin descendants and close relatives remained in or returned to Maryland after emancipation.
While elevated rates of IBD sharing are particularly evident in the Maryland region, we also identified other regions of the US with an enrichment of Catoctin close relatives. For example, the maximum amount of IBD (280 cM) is observed among a set of individuals from Southern California. This amount of sharing (corresponding to ~4% of the autosomal genome) is consistent with a 5th-degree relationship to the individual from Burial 35 based on maximum likelihood predictions (although the actual relationship may differ by a few degrees in either direction). Individuals with this amount of IBD sharing are likely either direct descendants of those buried at Catoctin or direct descendants of very close relatives of the Catoctin individuals (given that the Catoctin individuals likely lived at least five generations before most of the research participants included in this study) (Box 1, Table S7).
To reconstruct pedigrees describing the connections shared between the Catoctin individuals and their closest genetic relatives in the 23andMe cohort, we used a modified version of the tool Bonsai (36, 43). The informed consent process for participation in 23andMe research requires strict protection of research participant anonymity, which means full pedigrees cannot be published. Instead, we highlight the ways in which 8,721 independent pedigrees (636 of which contain more than one research participant) are connected to Catoctin Family A (Fig. 6, Table S19). We found no cases where the most likely connection was via a direct descendant of individuals from Burials 1, 2, or 24, consistent with our expectations since these individuals died during childhood. The most probable path for most (83%) of the pedigrees we considered connects through an ancestor of the unsampled father of individuals from Burials 1 and 2 (referred to as un-genoyped individual f in Fig. 6). We find that pedigrees that include research participants who have at least twice as much Indigenous American ancestry as Sub-Saharan African ancestry were significantly more likely to connect through this unsampled individual than through other Catoctin individuals (p < 10−6; calculated via a permutation test with 106 replicates) (Supplementary Text S6), consistent with our earlier prediction that this unsampled individual had some Indigenous American ancestry.
Fig. 6. Connections between Catoctin Family A and modern pedigrees.
The pedigree for Catoctin Family A is shown with blue-shaded individuals connected by gray lines. Open gray diamonds indicate un-genotyped individuals, some of whom must have existed (i.e., b, c, f, i, and j), whereas others (i.e., a, d, e, g, h, k) are inferred to have existed. The large purple triangle represents all present-day pedigrees composed of 23andMe research participants, and the probability distribution of how these pedigrees connect to the historical pedigree is inferred. Orange dots indicate all possible points of connection of present-day pedigrees to the historical pedigree. Numbers in ovals give the percentage of present-day pedigrees whose most likely connection was to a given point on the historical pedigree. Numbers in orange diamonds indicate the average degree of a lineage connecting to a particular point. Pie charts show the average European (blue), Sub-Saharan African (purple), and Indigenous American (yellow) ancestry (normalized to sum to one) of individuals in pedigrees whose most likely point of connection was through the respective lineage leading to the present day.
Most inferred connections extend upward in pedigrees generated for Families A, C and D (meaning that they connect through an ancestor of one or more members of the family, and therefore do not involve a direct descent relationship) (Figs. 6 and S14, Table S19). This is consistent with expectations, as individuals who lived in the last few hundred years are expected to have far more living collateral relatives than direct descendants. Notably, lineages that extend upwards from the Catoctin pedigrees tend to involve research participants who have more European ancestry than those lineages that extend downward. This suggests that a relatively large fraction of the connections that we identified involve ancestors of members of Family A with European ancestry, which in part may reflect biases in the 23andMe cohort where European ancestry is overrepresented.
These results demonstrate the power of our IBD-based approach to identify connections between historical and present-day individuals. In the future, by obtaining additional informed consent from research participants it may be possible to present more complex pedigrees that include direct descendants of historical individuals using this approach.
Biologically significant variants
We considered sites in the Catoctin individuals’ genomes that might shed light on their physical traits and health (Table S20, Fig. S15). However, we caution that these results are based on low coverage data, and further work is required to conclusively infer genotypes at these positions.
For three Catoctin individuals, we identified copies of only the causal A allele at the genetic position (rs334/i3003137) which is associated with protection against malaria in the heterozygous form and sickle cell disease—a red blood cell disorder that causes pain and increased susceptibility to infection and early mortality (44)—in individuals with two copies of the allele. Although we only have limited coverage at this position for each of these three individuals (range: 1–5x coverage), making it impossible to confidently determine their genotypes, we note that two individuals (Burials 17 and 19) are siblings who died during early childhood. If they had sickle cell disease, this may have contributed to their early mortality.
We also observed four individuals with at least one copy of the causal T allele at another genetic position (rs1050828) that provides protection against malaria in genetically female individuals who carry one copy of this allele on only one of their X chromosomes. In genetic males, who are hemizygous, and in genetic females who are homozygous for the T allele at this position, it is associated with G6PD deficiency (45). Both variants occur at elevated rates in populations with ancestry from Sub-Saharan Africa where malaria has historically been endemic (46). Understanding how the frequency of these alleles may have changed in African American populations over time is a question of great interest (47, 48), which may be informed by ancient DNA data as more historical African Americans are sequenced.
Discussion
This study demonstrates the power of genetic analyses to uncover previously unknown information about the family structure and ancestry of historical individuals, and to connect them with living relatives. For Catoctin Furnace, this research is a critical step toward identifying a larger descendant community, one of the main goals expressed by stakeholders from CFHS, AARCH, and the two Catoctin descendant families recently identified using historical documents and genealogical data. By developing an approach to identify genetic connections between historical individuals and their present-day relatives, DNA can be a significant means through which these relationships can be re-constructed.
At least 15 of the Catoctin individuals can be clustered into five genetic families, providing insight into the social structure of African Americans at this early industrial site. Historical records suggest that iron works enslavers often kept families together to benefit from a sustained knowledge transfer between generations. It was also believed that this practice minimized the likelihood of revolt or escape caused by family separation (49–51). The majority of the genetic connections we identified were between first-degree relatives, usually mothers and children. No fathers were identified, nor did we find families with representation of three or more generations. There were at least 11 individuals who appeared unrelated to others buried in the cemetery. These results may therefore indicate that, in practice, families did not remain together at this iron works. One possible explanation is that partners may have been sought outside of the Catoctin village. However, additional consideration must be given to how the types of genetic relationships we observed may be biased by incomplete excavation combined with burial patterning at the site, which was likely influenced by personal choice and/or imposed religious practices. For instance, from first-hand accounts recorded in journals and diaries, we know that some African American funerals at Catoctin were conducted by Moravian ministers (52, 53). The Moravians prescribed burial protocols in which families were not interred together, instead, they were buried in “choirs,” separated by marital status, age, and gender (54). This tradition introduces the possibility that married men were buried elsewhere in the cemetery, and therefore not sampled as part of this study (54). It was only through the co-analysis of aDNA and archival Moravian diaries that this unique aspect of burial and demography at Catoctin could be considered.
The genetically inherited conditions identified in this study (sickle cell anemia and G6PD deficiency) provide additional insight into the health and well-being of the Catoctin African Americans. Although these results are tentative and require more refined analyses, they offer insight and potential future avenues by which to explore the remains of deceased humans recovered from archaeological contexts.
At the population level, the Catoctin individuals have diverse ancestry, with clear genetic links to Africa, Europe, and the Americas. Most individuals have primarily Sub-Saharan African ancestry, with the strongest ties to present-day peoples in Senegambia and West Central Africa, a region that primarily encompasses present-day Angola and the Democratic Republic of the Congo. These findings correlate with historical records that show that slave ships originating from Senegambia (particularly before the end of the 18th century) and West Central Africa accounted for the highest disembarkation rates in Maryland over the course of the transatlantic slave trade, while lower disembarkation rates were recorded from intervening coastal regions and the Caribbean (Table S21) (55). However, records of the intra-American slave trade, which was responsible for the arrival of many enslaved African Americans to Maryland, are more limited, making it challenging to infer what the most likely major sources of African ancestry in Maryland might have been based on historical records alone. According to the transatlantic slave trade database, ships departing from Senegambia and West Central Africa account for 15.8% and 20.8% of overall disembarkations in North America (55). Notably, the most common sources of African ancestry among the Catoctin individuals do not align with larger trends in the greater Chesapeake region, where the highest rate of disembarkation was from the Bight of Biafra–home to people of Igbo and Yoruba ancestry (55).
The Catoctin individuals were less likely to have genetic connections to multiple distinct African ethnolinguistic groups than research participants with substantial (≥50%) Sub-Saharan African ancestry and longstanding ties to the US (i.e., four grandparents born in the US), likely reflecting admixing between individuals with diverse African ancestries that occurred among enslaved African Americans and their descendants over time, following forced migration to the Americas (22, 56).
Due to the overrepresentation of research participants with European (particularly British and Irish) ancestry in the 23andMe cohort, we have even more power to localize the European ancestry of the Catoctin individuals. Eight individuals exhibit connections to Great Britain and/or Ireland. Differing frequencies of European associated maternal and paternal haplogroups observed at Catoctin indicate that their European ancestry was likely introduced through a sex-biased process almost certainly driven by rape of enslaved women as part of the gender-based sexual violence inherent in the US’s system of chattel slavery (38). We also observed a single European-associated mt haplogroup (in an individual who we estimate to have over 50% European ancestry). This may be one of numerous examples present in the historical record of enslaved or free Black men having children with white women (often indentured servants) (57).
Another objective of this study was to explore the possibility of identifying direct descendants using DNA. We identified 41,799 research participants with genetic connections to the Catoctin individuals. In many cases, it was possible to construct detailed genetic pedigrees that directly link 23andMe research participants to the historical individuals from Catoctin, either as direct descendants or, most commonly, as collateral relatives with a shared common ancestor. While most of these connections are very distant, we identified over 500 relatives who share ≥30 cM of IBD, reflecting a maximum likelihood estimate of 9th-degree relationship or closer.
It has been suggested that Catoctin’s enslaved workers were sold and transported to more southern states when the furnace transitioned to white wage labor (4). During the early 19th century large numbers of enslaved individuals were sold from Mid-Atlantic states and transported south (58). While we observe strong connections to Maryland that suggest that this was unlikely to be the fate of Catoctin’s entire enslaved population, we do observe small clusters of close relatives throughout the US, including in the South.
The methodological approach we present in this study can be applied to the remains of deceased humans from other sites and contexts, offering a new scientific tool for individuals and descendant communities seeking greater knowledge of their ancestors, as well as archaeologists, bioarchaeologists, historians, genealogists. Museums and universities that steward the remains of deceased humans now have an additional means by which to identify past individuals and potentially link them to biological descendants. This work can currently only be done in partnership with organizations with access to massive, genetically diverse, re-contactable research cohorts, such as those maintained by genetic-ancestry companies. These partnerships will require conversations on the ethical implications and consequences of this work, particularly how to avoid reinforcing the biologization of identity (37). For researchers, this study represents advances in scientific methodology, but the impact may be even greater for those seeking connections to their past.
Materials and Methods Summary
We sampled aDNA from the temporal bones of 27 Catoctin individuals, using a minimally destructive cranial based drilling approach when sampling from intact skulls (59). We generated double stranded, partially uracil-DNA glycosylase (UDG) treated DNA libraries (60–64). Before sequencing, we enriched the libraries for DNA aligning to the mitochondrial genome and 1.2 million positions in the nuclear genome using a capture-based approach (23–26). Following bioinformatic processing, all 27 DNA samples were deemed suitable for analysis, although the data for two individuals was subjected to damage restriction based on slightly elevated MT (26) or X-chromosome (65) contamination rates. For each individual, we inferred genetic sex (66) and uniparental haplogroups (67) and estimated the proportion of African, European and Indigenous American-related ancestry by comparing to publicly available datasets (68–71). Diploid genotypes were imputed using GLIMPSE (72) with the 1000 Genomes project phase 3 dataset (68) as a reference panel. After filtering and rephasing the data (73), we searched for IBD (41) between the imputed Catoctin individuals and 9,255,493 participants (elsewhere referred to as the “23andMe cohort”) who had been genotyped by 23andMe, Inc., a consumer personal genetics company, and provided informed consent to participate in research by July 28th, 2020. Summary statistics were generated describing these IBD connections for cohorts of research participants that were created based on each research participant’s genetic ancestry (as determined by the tool Ancestry Composition) and answers to 23andMe survey questions about their birth and grandparent birth locations. Genetic pedigrees were reconstructed using a modified version of the Bonsai pedigree inference algorithm (43). Finally, we counted the number of unique DNA sequences that overlap biologically significant positions in the genome for each Catoctin individual.
Supplementary Material
Materials and Methods
Supplementary Text S1 - Extended Ethics Statement
Supplementary Text S2 - The Return of Names
Supplementary Text S3 - Testing the application of IBD detection methods to imputed low coverage ancient DNA using simulated data
Supplementary Text S4 - Testing the performance of Ancestry Composition on imputed low coverage ancient DNA using simulated data
Supplementary Text S5 - Comparison of IBD Network results of Catoctin individuals with present-day African Americans
Supplementary Text S6 - Genetic connections to African Among Catoctin individuals and 23andMe participants in the US cohort with at least 50% Sub-Saharan African ancestry
Figs. S1 to S16
References 74–124
Tables S1 to S24, S1.1, S3.2 to S3.7, and S4.1
Fig. 0. Genetic connections to individuals from Catoctin Furnace African American Cemetery in Maryland.
A timeline highlighting major events in the history of Catoctin Furnace and the birth years of research participants in the 23andMe cohort, presented alongside maps showing the proportion of 23andMe research participants associated with geographic coordinates in the United States, Sub-Saharan Africa, and Europe who share genetic connections to the Catoctin individuals.
Acknowledgments:
We acknowledge the individuals from Catoctin Furnace whose DNA we studied and recognize that they could not directly consent to participate in this study. We hope to honor the people of Catoctin and their contributions by telling their story and highlighting their ongoing legacy in the form of descendant communities. The genetic research was authorized by the Smithsonian Institution’s National Museum of Natural History Department of Anthropology Collections Advisory Committee. For nearly 50 years, the Catoctin Furnace Historical Society, Inc. (CFHS) has been at the forefront of stewardship, research, and interpretation of the African American Cemetery located in Catoctin village. The CFHS has been a driving force in the effort to link the remains of deceased individuals who are yet to be fully identified to a descendant community, with the ultimate goal of using genealogical and historical research as well as DNA to establish definitive genealogies. Inquiries from those interested in learning more about the history of Catoctin Furnace, and from possible descendants interested in exploring their connections to the people who worked at the site, are welcome to contact CFHS at https://catoctinfurnace.org/.We thank the CFHS, the African American Resources Cultural and Heritage Society of Frederick, Maryland (AARCH), and members of descendant families for support of this research. We thank the 23andMe research participants and others who consented to participate in research for enabling this study.
Funding:
The ancient DNA analysis was funded by NIH grant HG012287; by John Templeton Foundation grant 61220; and by the Howard Hughes Medical Institute. In 2016, a $14,000 Maryland Heritage Areas Authority grant was awarded to CFHS to fund forensic research for the African American cemetery. The Rice Family Endowment for Forensic Anthropology provided critical support for Smithsonian Skeletal Biology Program contributions.
Footnotes
Competing Interests:
É.H., S.Mi., W.F., K.B., E.J., H.L.G., S.A.E, J.M. and members of the 23andMe Research Team are employees of and have stock, stock options, or both, in 23andMe. The remaining authors declare no competing interests.
Data Availability:
Unaligned and aligned sequences for the 27 Catoctin individuals were originally reported in Harney et al. (13) and are available from the European Nucleotide Archive under accession number PRJEB52230. Genotype files for pseudo-haploid and imputed versions of the dataset are available at https://reich.hms.harvard.edu/datasets. There are restrictions to the availability of 23andMe genotype data due to 23andMe consent and privacy guidelines. 23andMe agrees that the publication coauthors will rerun the comparison of historical genomic data against customer genetic data upon request by other academic and nonprofit researchers on reasonable terms to enable the results of the research activities to be replicated for at least seven years after publication or for as long as the coauthors are employed by, or otherwise affiliated with 23andMe in a capacity that allows them to rerun the analysis. Wherever possible, supplementary tables were included that report the summary statistics that were used to create figures that involved 23andMe datasets. Unless another comparable anonymizing approach was specified, these summary statistics were generated with the requirement that in all reported results, any 23andMe research participant must be indistinguishable from at least four other research participants included in the dataset. When meaningful, we repeated analyses performed on the 23andMe dataset using only the 1000 genomes and/or African American imputation panel datasets and reported these results.
References
- 1.Eltis D, Richardson D, Atlas of the Transatlantic Slave Trade (Yale University Press, 2010). [Google Scholar]
- 2.U.S. Census Bureau QuickFacts: United States, (available at https://www.census.gov/quickfacts/fact/table/US/RHI225219#RHI225219).
- 3.Williams HA, Help me to find my people: The African American search for family lost in slavery (Univ of North Carolina Press, 2012). [Google Scholar]
- 4.Anderson EY, Catoctin Furnace: Portrait of an Iron Making Village (Arcadia Publishing, 2013). [Google Scholar]
- 5.Catoctin Furnace Historical Society, Compiled by donated labor for the Catoctin Furnace Historical Society. Primary manager, Elizabeth Anderson Comer., Access database of confirmed and potential members of the Catoctin Furnace community. (2015). [Google Scholar]
- 6.Frederick County Account of Sales Liber GME 9, Folio 353–361. LIst of names appears on page 355.
- 7.Thomas RA, Mellin GS, Payne T, Burnston S, McCarthy JP, “Archaeological Investigations at Catoctin Furnace, Frederick County, Maryland” (Mid-Atlantic Archaeological Research, Inc., 1980). [Google Scholar]
- 8.Townsend A, Thomas R, Orr K, “Specifications and Research Designs. The Catoctin Furnace Archaeological Mitigation Project. U.S. Route 15 from Putman Road to Route 77.” (Prepared for the State Highway Administration, Baltimore, Md., Thurmont, MD, 1979). [Google Scholar]
- 9.Orr K, Orr R, “Interim Report of the Catoctin Furnace Archaeological Mitigation Project Contract F522-152-770” (Prepared for the Maryland State Highway Administration, Baltimore, 1980). [Google Scholar]
- 10.Burnston S, Thomas RA, “Archaeology Data Recovery at Catoctin Furnace Cemetery” (Prepared for Orr and Son, Consulting Archaeologists, 1981). [Google Scholar]
- 11.Kelley A, J. L. JO, The workers of Catoctin furnace. Maryland Archeology. 19, 2–17 (1983). [Google Scholar]
- 12.Bruwelheide KS, Owsley DW, Barca KG, France CAM, Little N, Comer EA, Restoring Identity to People and Place: Reanalysis of Human Skeletal Remains from a Cemetery at Catoctin Furnace, Maryland. Hist Arch. 54, 110–137 (2020). [Google Scholar]
- 13.Harney É, Olalde I, Bruwelheide K, Barca KG, Curry R, Comer E, Rohland N, Owsley D, Reich DE, Technical Report on Ancient DNA analysis of 27 African Americans from Catoctin Furnace, Maryland. bioRxiv (2022), p. 2022.06.12.495320, , doi: 10.1101/2022.06.12.495320. [DOI] [Google Scholar]
- 14.Blakey ML, Rankin-Hill LM, Howard University, United States, General Services Administration, The skeletal biology of the New York African Burial Ground (Howard University Press : in association with the U.S. General Services Administration, Washington, D.C., 2009). [Google Scholar]
- 15.Clinton CK, Jackson FLC, Historical overview, current research, and emerging bioethical guidelines in researching the New York African burial ground. American Journal of Physical Anthropology (2021), doi: 10.1002/ajpa.24171. [DOI] [PubMed] [Google Scholar]
- 16.Fleskes RE, Ofunniyin AA, Gilmore JK, Poplin E, Abel SM, Bueschgen WD, Juarez C, Butler N, Mishoe G, Cabana GS, Ancestry, health, and lived experiences of enslaved Africans in 18th century Charleston: An osteobiographical analysis. American Journal of Physical Anthropology (2021). [DOI] [PubMed] [Google Scholar]
- 17.Fleskes RE, Cabana GS, Gilmore JK, Juarez C, Karcher E, Oubré L, Mishoe G, Ofunniyin AA, Schurr TG, Community-engaged ancient DNA project reveals diverse origins of 18th-century African descendants in Charleston, South Carolina. Proceedings of the National Academy of Sciences. 120, e2201620120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Barquera R, Lamnidis TC, Lankapalli AK, Kocher A, Hernández-Zaragoza DI, Nelson EA, Zamora-Herrera AC, Ramallo P, Bernal-Felipe N, Immel A, Bos K, Acuña-Alonzo V, Barbieri C, Roberts P, Herbig A, Kühnert D, Márquez-Morfín L, Krause J, Origin and Health Status of First-Generation Africans from Early Colonial Mexico. Current Biology. 30, 2078–2091.e11 (2020). [DOI] [PubMed] [Google Scholar]
- 19.Fleskes RE, Bruwelheide KS, West FL, Owsley DW, Griffith DR, Barca KG, Cabana GS, Schurr TG, Ancient DNA and bioarchaeological perspectives on European and African diversity and relationships on the colonial Delaware frontier. American Journal of Physical Anthropology. 170, 232–245 (2019). [DOI] [PubMed] [Google Scholar]
- 20.Salas A, Richards M, Lareu M-V, Scozzari R, Coppa A, Torroni A, Macaulay V, Carracedo Á, The African Diaspora: Mitochondrial DNA and the Atlantic Slave Trade. The American Journal of Human Genetics. 74, 454–465 (2004). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bentley AR, Callier S, Rotimi CN, Diversity and inclusion in genomic research: why the uneven progress? J Community Genet. 8, 255–266 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Micheletti SJ, Bryc K, Ancona Esselmann SG, Freyman WA, Moreno ME, Poznik GD, Shastri AJ, Beleza S, Mountain JL, Agee M, Aslibekyan S, Auton A, Bell R, Clark S, Das S, Elson S, Fletez-Brant K, Fontanillas P, Gandhi P, Heilbron K, Hicks B, Hinds D, Huber K, Jewett E, Jiang Y, Kleinman A, Lin K, Litterman N, McCreight J, McIntyre M, McManus K, Mozaffari S, Nandakumar P, Noblin L, Northover C, O’Connell J, Petrakovitz A, Pitts S, Shelton J, Shringarpure S, Tian C, Tung J, Tunney R, Vacic V, Wang X, Zare A, Genetic Consequences of the Transatlantic Slave Trade in the Americas. The American Journal of Human Genetics. 107, 265–277 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mathieson I, Lazaridis I, Rohland N, Mallick S, Patterson N, Roodenberg SA, Harney E, Stewardson K, Fernandes D, Novak M, Genome-wide patterns of selection in 230 ancient Eurasians. Nature. 528, 499–503 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Haak W, Lazaridis I, Patterson N, Rohland N, Mallick S, Llamas B, Brandt G, Nordenfelt S, Harney E, Stewardson K, Massive migration from the steppe was a source for Indo-European languages in Europe. Nature. 522, 207–211 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Fu Q, Hajdinjak M, an OT, Constantin S, Mallick S, Skoglund P, Patterson N, Rohland N, Lazaridis I, Nickel B, Viola B, Prüfer K, Meyer M, Kelso J, Reich D, Pääbo S, An early modern human from Romania with a recent Neanderthal ancestor. Nature. 524, 216–219 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Fu Q, Mittnik A, Johnson PLF, Bos K, Lari M, Bollongino R, Sun C, Giemsch L, Schmitz R, Burger J, Ronchitelli AM, Martini F, Cremonesi RG, Svoboda J, Bauer P, Caramelli D, Castellano S, Reich D, Pääbo S, Krause J, A Revised Timescale for Human Evolution Based on Ancient Mitochondrial Genomes. Current Biology. 23, 553–559 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Gates HL, Life Upon These Shores: Looking at African American History, 1513–2008 (Alfred A. Knopf, 2011). [Google Scholar]
- 28.Douglass F, My Bondage and My Freedom (Oxford University Press, 2019). [Google Scholar]
- 29.Wagner JK, Colwell C, Claw KG, Stone AC, Bolnick DA, Hawks J, Brothers KB, Garrison NA, Fostering Responsible Research on Ancient DNA. The American Journal of Human Genetics. 107, 183–195 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Alpaslan-Roodenberg S, Anthony D, Babiker H, Bánffy E, Booth T, Capone P, Deshpande-Mukherjee A, Eisenmann S, Fehren-Schmitz L, Frachetti M, Fujita R, Frieman CJ, Fu Q, Gibbon V, Haak W, Hajdinjak M, Hofmann KP, Holguin B, Inomata T, Kanzawa-Kiriyama H, Keegan W, Kelso J, Krause J, Kumaresan G, Kusimba C, Kusimba S, Lalueza-Fox C, Llamas B, MacEachern S, Mallick S, Matsumura H, Morales-Arce AY, Matuzeviciute GM, Mushrif-Tripathy V, Nakatsuka N, Nores R, Ogola C, Okumura M, Patterson N, Pinhasi R, Prasad SPR, Prendergast ME, Punzo JL, Reich D, Sawafuji R, Sawchuk E, Schiffels S, Sedig J, Shnaider S, Sirak K, Skoglund P, Slon V, Snow M, Soressi M, Spriggs M, Stockhammer PW, Szécsényi-Nagy A, Thangaraj K, Tiesler V, Tobler R, Wang C-C, Warinner C, Yasawardene S, Zahir M, Ethics of DNA research on human remains: five globally applicable guidelines. Nature (2021), doi: 10.1038/s41586-021-04008-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Dunnavant J, Justinvil D, Colwell C, Craft an African American Graves Protection and Repatriation Act. Nature. 593, 337–340 (2021). [DOI] [PubMed] [Google Scholar]
- 32.Jackson F, Clinton C, Graves J, Genomics of African American remains — limits must not compound inequity. Nature. 596, 35–35 (2021). [DOI] [PubMed] [Google Scholar]
- 33.Benn Torres J, Anthropological perspectives on genomic data, genetic ancestry, and race. American Journal of Physical Anthropology. 171, 74–86 (2020). [DOI] [PubMed] [Google Scholar]
- 34.Lans AM, Decolonize this collection: Integrating black feminism and art to re-examine human skeletal remains in museums. Feminist Anthropology. 2, 130–142 (2021). [Google Scholar]
- 35.Austin RM, Zuckerman M, Honap TP, Lee H, Ward GK, Warinner C, Sankaranarayanan K, Hofman CA, Remembering St. Louis Individual—structural violence and acute bacterial infections in a historical anatomical collection. Commun Biol. 5, 1–10 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. Detailed information is provided in the supplementary materials.
- 37.Harney É, Sirak K, Sedig J, Micheletti S, Curry R, Ancona Esselmann S, Reich D, Ethical considerations when co-analyzing ancient DNA and data from private genetic databases. AJHG (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Baptist EE, “Cuffy,”“fancy maids,” and “one-eyed men”: rape, commodification, and the domestic slave trade in the United States. The American Historical Review. 106, 1619–1650 (2001). [Google Scholar]
- 39.Bryc K, Durand EY, Macpherson JM, Reich D, Mountain JL, The Genetic Ancestry of African Americans, Latinos, and European Americans across the United States. The American Journal of Human Genetics. 96, 37–53 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hui R, D’Atanasio E, Cassidy LM, Scheib CL, Kivisild T, Evaluating genotype imputation pipeline for ultra-low coverage ancient genomes. Sci Rep. 10, 18542 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Freyman WA, McManus KF, Shringarpure SS, Jewett EM, Bryc K, the 23andMe Research Team, A. Auton, Fast and robust identity-by-descent inference with the templated positional Burrows-Wheeler transform. Molecular Biology and Evolution (2020), doi: 10.1093/molbev/msaa328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Durand EY, Do CB, Wilton PR, Mountain JL, Auton A, Poznik GD, Macpherson JM, bioRxiv, doi: 10.1101/2021.01.19.427308. [DOI] [Google Scholar]
- 43.Jewett EM, McManus KF, Freyman WA, Auton A, Bonsai: An efficient method for inferring large human pedigrees from genotype data. The American Journal of Human Genetics. 108, 2052–2070 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Pauling L, Itano HA, Singer SJ, Wells IC, Sickle Cell Anemia, a Molecular Disease. Science. 110, 543–548 (1949). [DOI] [PubMed] [Google Scholar]
- 45.Beutler E, G6PD Deficiency. Blood. 84, 3613–3636 (1994). [PubMed] [Google Scholar]
- 46.Esoh K, Wonkam A, Evolutionary history of sickle-cell mutation: implications for global genetic medicine. Human Molecular Genetics. 30, R119–R128 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Jin W, Xu S, Wang H, Yu Y, Shen Y, Wu B, Jin L, Genome-wide detection of natural selection in African Americans pre- and post-admixture. Genome Res. 22, 519–527 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Bhatia G, Tandon A, Patterson N, Aldrich MC, Ambrosone CB, Amos C, Bandera EV, Berndt SI, Bernstein L, Blot WJ, Bock CH, Caporaso N, Casey G, Deming SL, Diver WR, Gapstur SM, Gillanders EM, Harris CC, Henderson BE, Ingles SA, Isaacs W, De Jager PL, John EM, Kittles RA, Larkin E, McNeill LH, Millikan RC, Murphy A, Neslund-Dudas C, Nyante S, Press MF, Rodriguez-Gil JL, Rybicki BA, Schwartz AG, Signorello LB, Spitz M, Strom SS, Tucker MA, Wiencke JK, Witte JS, Wu X, Yamamura Y, Zanetti KA, Zheng W, Ziegler RG, Chanock SJ, Haiman CA, Reich D, Price AL, Genome-wide Scan of 29,141 African Americans Finds No Evidence of Directional Selection since Admixture. The American Journal of Human Genetics. 95, 437–444 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Dew CB, David Ross and the Oxford Iron Works: A Study of Industrial Slavery in the Early Nineteenth-Century South. The William and Mary Quarterly. 31, 189–224 (1974). [Google Scholar]
- 50.Dew CB, Bond of Iron: Master and Slave at Buffalo Forge (W. W. Norton & Company, 1995). [Google Scholar]
- 51.Lewis RL, Coal, Iron, and Slaves: Industrial Slavery in Maryland and Virginia, 1715–1865. (Greenwood Press, Westport, CT, 1979). [Google Scholar]
- 52.Moravian Missionary Letters – 1766–1787, Moravian Archives, Bethlehem, Pennsylvania. Translated by Quinter Edward, 2021–2022.
- 53.Diaries of Moravian clergy – 1798–1850, Moravian Archives, Bethlehem, Pennsylvania. Translated by Quinter Edward, 2021–2022.
- 54.Gordon SP, Fishing for a Few: Moravians on the Eighteenth-Century Pennsylvania Frontier. Pennsylvania History: A Journal of Mid-Atlantic Studies. 88, 319–350 (2021). [Google Scholar]
- 55.Eltis D, A brief overview of the Trans-Atlantic Slave Trade. Voyages: The trans-atlantic slave trade database: 1700–1810 (2007) (available at https://slavevoyages.org/). [Google Scholar]
- 56.Thornton JK, A Cultural History of the Atlantic World, 1250–1820 (Cambridge University Press, 2012). [Google Scholar]
- 57.Heywood LM, Thornton JK, Central Africans, Atlantic Creoles, and the Foundation of the Americas, 1585–1660 (Cambridge University Press, 2007). [Google Scholar]
- 58.Maryland: A History of Its People (JHU Press, 1986). [Google Scholar]
- 59.Sirak KA, Fernandes DM, Cheronet O, Novak M, Gamarra B, Balassa T, Bernert Z, Cséki, Dani J, Gallina JZ, Kocsis-Buruzs G, Kővári I, László O, Pap I, Patay R, Petkes Z, Szenthe G, Szeniczey T, Hajdu T, Pinhasi R, A minimally-invasive method for sampling human petrous bones from the cranial base for ancient DNA analysis. BioTechniques. 62, 283–289 (2017). [DOI] [PubMed] [Google Scholar]
- 60.Dabney J, Knapp M, Glocke I, Gansauge M-T, Weihmann A, Nickel B, Valdiosera C, García N, Pääbo S, Arsuaga J-L, Meyer M, Complete mitochondrial genome sequence of a Middle Pleistocene cave bear reconstructed from ultrashort DNA fragments. PNAS. 110, 15758–15763 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Korlević P, Gerber T, Gansauge M-T, Hajdinjak M, Nagel S, Aximu-Petri A, Meyer M, Reducing microbial and human contamination in DNA extractions from ancient bones and teeth. BioTechniques. 59, 87–93 (2015). [DOI] [PubMed] [Google Scholar]
- 62.Rohland N, Glocke I, Aximu-Petri A, Meyer M, Extraction of highly degraded DNA from ancient bones, teeth and sediments for high-throughput sequencing. Nature Protocols. 13, 2447–2461 (2018). [DOI] [PubMed] [Google Scholar]
- 63.Rohland N, Harney E, Mallick S, Nordenfelt S, Reich D, Partial uracil–DNA–glycosylase treatment for screening of ancient DNA. Philosophical Transactions of the Royal Society B: Biological Sciences. 370, 20130624 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Gansauge M-T, Aximu-Petri A, Nagel S, Meyer M, Manual and automated preparation of single-stranded DNA libraries for the sequencing of DNA from ancient biological remains and other sources of highly degraded DNA. Nature Protocols. 15, 2279–2300 (2020). [DOI] [PubMed] [Google Scholar]
- 65.Korneliussen TS, Albrechtsen A, Nielsen R, ANGSD: analysis of next generation sequencing data. BMC bioinformatics. 15, 1–13 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Skoglund P, Storå J, Götherström A, Jakobsson M, Accurate sex identification of ancient human remains using DNA shotgun sequencing. Journal of Archaeological Science. 40, 4477–4482 (2013). [Google Scholar]
- 67.Weissensteiner H, Pacher D, Kloss-Brandstätter A, Forer L, Specht G, Bandelt H-J, Kronenberg F, Salas A, Schönherr S, HaploGrep 2: mitochondrial haplogroup classification in the era of high-throughput sequencing. Nucleic acids research. 44, W58–W63 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Auton A, Abecasis GR, Altshuler DM, Durbin RM, Abecasis GR, Bentley DR, Chakravarti A, Clark AG, Donnelly P, Eichler EE, Flicek P, Gabriel SB, Gibbs RA, Green ED, Hurles ME, Knoppers BM, Korbel JO, Lander ES, Lee C, Lehrach H, Mardis ER, Marth GT, McVean GA, Nickerson DA, Schmidt JP, Sherry ST, Wang J, Wilson RK, Gibbs RA, Boerwinkle E, Doddapaneni H, Han Y, Korchina V, Kovar C, Lee S, Muzny D, Reid JG, Zhu Y, Wang J, Chang Y, Feng Q, Fang X, Guo X, Jian M, Jiang H, Jin X, Lan T, Li G, Li J, Li Y, Liu S, Liu X, Lu Y, Ma X, Tang M, Wang B, Wang G, Wu H, Wu R, Xu X, Yin Y, Zhang D, Zhang W, Zhao J, Zhao M, Zheng X, Lander ES, Altshuler DM, Gabriel SB, Gupta N, Gharani N, Toji LH, Gerry NP, Resch AM, Flicek P, Barker J, Clarke L, Gil L, Hunt SE, Kelman G, Kulesha E, Leinonen R, McLaren WM, Radhakrishnan R, Roa A, Smirnov D, Smith RE, Streeter I, Thormann A, Toneva I, Vaughan B, Zheng-Bradley X, Bentley DR, Grocock R, Humphray S, James T, Kingsbury Z, Lehrach H, Sudbrak R, Albrecht MW, Amstislavskiy VS, Borodina TA, Lienhard M, Mertes F, Sultan M, Timmermann B, Yaspo M-L, Mardis ER, Wilson RK, Fulton L, Fulton R, Sherry ST, Ananiev V, Belaia Z, Beloslyudtsev D, Bouk N, Chen C, Church D, Cohen R, Cook C, Garner J, Hefferon T, Kimelman M, Liu C, Lopez J, Meric P, O’Sullivan C, Ostapchuk Y, Phan L, Ponomarov S, Schneider V, Shekhtman E, Sirotkin K, Slotta D, Zhang H, McVean GA, Durbin RM, Balasubramaniam S, Burton J, Danecek P, Keane TM, Kolb-Kokocinski A, McCarthy S, Stalker J, Quail M, Schmidt JP, Davies CJ, Gollub J, Webster T, Wong B, Zhan Y, Auton A, Campbell CL, Kong Y, Marcketta A, Gibbs RA, Yu F, Antunes L, Bainbridge M, Muzny D, Sabo A, Huang Z, Wang J, Coin LJM, Fang L, Guo X, Jin X, Li G, Li Q, Li Y, Li Z, Lin H, Liu B, Luo R, Shao H, Xie Y, Ye C, Yu C, Zhang F, Zheng H, Zhu H, Alkan C, Dal E, Kahveci F, Marth GT, Garrison EP, Kural D, Lee W-P, Fung Leong W, Stromberg M, Ward N, Wu J, Zhang M, Daly MJ, DePristo MA, Handsaker RE, Altshuler DM, Banks E, Bhatia G, del Angel G, Gabriel SB, Genovese G, Gupta N, Li H, Kashin S, Lander ES, McCarroll SA, Nemesh JC, Poplin RE, Yoon SC, Lihm J, Makarov V, Clark AG, Gottipati S, Keinan A, Rodriguez-Flores JL, Korbel JO, Rausch T, Fritz MH, Stütz AM, Flicek P, Beal K, Clarke L, Datta A, Herrero J, McLaren WM, Ritchie GRS, Smith RE, Zerbino D, Zheng-Bradley X, Sabeti PC, Shlyakhter I, Schaffner SF, Vitti J, Cooper DN, Ball EV, Stenson PD, Bentley DR, Barnes B, Bauer M, Keira Cheetham R, Cox A, Eberle M, Humphray S, Kahn S, Murray L, Peden J, Shaw R, Kenny EE, Batzer MA, Konkel MK, Walker JA, MacArthur DG, Lek M, Sudbrak R, Amstislavskiy VS, Herwig R, Mardis ER, Ding L, Koboldt DC, Larson D, Ye K, Gravel S, The 1000 Genomes Project Consortium, Corresponding authors, Steering committee, Production group, Baylor College of Medicine, BGI-Shenzhen, Broad Institute of MIT and Harvard, Coriell Institute for Medical Research, E. B. I. European Molecular Biology Laboratory, Illumina, Max Planck Institute for Molecular Genetics, McDonnell Genome Institute at Washington University, US National Institutes of Health, University of Oxford, Wellcome Trust Sanger Institute, Analysis group, Affymetrix, Albert Einstein College of Medicine, Bilkent University, Boston College, Cold Spring Harbor Laboratory, Cornell University, European Molecular Biology Laboratory, Harvard University, Human Gene Mutation Database, Icahn School of Medicine at Mount Sinai, Louisiana State University, Massachusetts General Hospital, McGill University, N. National Eye Institute, A global reference for human genetic variation. Nature. 526, 68–74 (2015).26432245 [Google Scholar]
- 69.Bergström A, McCarthy SA, Hui R, Almarri MA, Ayub Q, Danecek P, Chen Y, Felkel S, Hallast P, Kamm J, Blanché H, Deleuze J-F, Cann H, Mallick S, Reich D, Sandhu MS, Skoglund P, Scally A, Xue Y, Durbin R, Tyler-Smith C, Insights into human genetic variation and population history from 929 diverse genomes. Science. 367, eaay5012 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Mallick S, Li H, Lipson M, Mathieson I, Gymrek M, Racimo F, Zhao M, Chennagiri N, Nordenfelt S, Tandon A, Skoglund P, Lazaridis I, Sankararaman S, Fu Q, Rohland N, Renaud G, Erlich Y, Willems T, Gallo C, Spence JP, Song YS, Poletti G, Balloux F, van Driem G, de Knijff P, Romero IG, Jha AR, Behar DM, Bravi CM, Capelli C, Hervig T, Moreno-Estrada A, Posukh OL, Balanovska E, Balanovsky O, Karachanak-Yankova S, Sahakyan H, Toncheva D, Yepiskoposyan L, Tyler-Smith C, Xue Y, Abdullah MS, Ruiz-Linares A, Beall CM, Di Rienzo A, Jeong C, Starikovskaya EB, Metspalu E, Parik J, Villems R, Henn BM, Hodoglugil U, Mahley R, Sajantila A, Stamatoyannopoulos G, Wee JTS, Khusainova R, Khusnutdinova E, Litvinov S, Ayodo G, Comas D, Hammer MF, Kivisild T, Klitz W, Winkler CA, Labuda D, Bamshad M, Jorde LB, Tishkoff SA, Watkins WS, Metspalu M, Dryomov S, Sukernik R, Singh L, Thangaraj K, Pääbo S, Kelso J, Patterson N, Reich D, The Simons Genome Diversity Project: 300 genomes from 142 diverse populations. Nature. 538, 201–206 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Mallick S, Micco A, Mah M, Ringbauer H, Lazaridis I, Olalde I, Patterson N, Reich D, The Allen Ancient DNA Resource (AADR): A curated compendium of ancient human genomes (2023), p. 2023.04.06.535797, , doi: 10.1101/2023.04.06.535797. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Rubinacci S, Ribeiro DM, Hofmeister RJ, Delaneau O, Efficient phasing and imputation of low-coverage sequencing data using large reference panels. Nat Genet. 53, 120–126 (2021). [DOI] [PubMed] [Google Scholar]
- 73.Loh P-R, Palamara PF, Price AL, Fast and accurate long-range phasing in a UK Biobank cohort. Nature Genetics. 48, 811–816 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Pinhasi R, Fernandes DM, Sirak K, Cheronet O, Isolating the human cochlea to generate bone powder for ancient DNA analysis. Nature Protocols. 14, 1194–1205 (2019). [DOI] [PubMed] [Google Scholar]
- 75.Pinhasi R, Fernandes D, Sirak K, Novak M, Connell S, Alpaslan-Roodenberg S, Gerritsen F, Moiseyev V, Gromov A, Raczky P, Anders A, Pietrusewsky M, Rollefson G, Jovanovic M, Trinhhoang H, Bar-Oz G, Oxenham M, Matsumura H, Hofreiter M, Optimal Ancient DNA Yields from the Inner Ear Part of the Human Petrous Bone. PLOS ONE. 10, e0129102 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Mathieson I, Alpaslan-Roodenberg S, Posth C, Szécsényi-Nagy A, Rohland N, Mallick S, Olalde I, Broomandkhoshbacht N, Candilio F, Cheronet O, Fernandes D, Ferry M, Gamarra B, Fortes GG, Haak W, Harney E, Jones E, Keating D, KrauseKyora B, Kucukkalipci I, Michel M, Mittnik A, Nägele K, Novak M, Oppenheimer J, Patterson N, Pfrengle S, Sirak K, Stewardson K, Vai S, Alexandrov S, Alt KW, Andreescu R, Antonović D, Ash A, Atanassova N, Bacvarov K, Gusztáv MB, Bocherens H, Bolus M, Boroneanţ A, Boyadzhiev Y, Budnik A, Burmaz J, Chohadzhiev S, Conard NJ, Cottiaux R, Čuka M, Cupillard C, Drucker DG, Elenski N, Francken M, Galabova B, Ganetsovski G, Gély B, Hajdu T, Handzhyiska V, Harvati K, Higham T, Iliev S, Janković I, Karavanić I, Kennett DJ, Komšo D, Kozak A, Labuda D, Lari M, Lazar C, Leppek M, Leshtakov K, Vetro DL, Los D, Lozanov I, Malina M, Martini F, McSweeney K, Meller H, Menđušić M, Mirea P, Moiseyev V, Petrova V, Price TD, Simalcsik A, Sineo L, Šlaus M, Slavchev V, Stanev P, Starović A, Szeniczey T, Talamo S, Teschler-Nicola M, Thevenet C, Valchev I, Valentin F, Vasilyev S, Veljanovska F, Venelinova S, Veselovskaya E, Viola B, Virag C, Zaninović J, Zäuner S, Stockhammer PW, Catalano G, Krauß R, Caramelli D, Zariņa G, Gaydarska B, Lillie M, Nikitin AG, Potekhina I, Papathanasiou A, Borić D, Bonsall C, Krause J, Pinhasi R, Reich D, The genomic history of southeastern Europe. Nature. 555, 197–203 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Behar DM, Van Oven M, Rosset S, Metspalu M, Loogväli E-L, Silva NM, Kivisild T, Torroni A, Villems R, A “Copernican” reassessment of the human mitochondrial DNA tree from its root. The American Journal of Human Genetics. 90, 675–684 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Li H, Durbin R, Fast and accurate long-read alignment with Burrows–Wheeler transform. Bioinformatics. 26, 589–595 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Skoglund P, Northoff BH, Shunkov MV, Derevianko AP, Pääbo S, Krause J, Jakobsson M, Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal. Proceedings of the National Academy of Sciences. 111, 2229–2234 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Olalde I, Mallick S, Patterson N, Rohland N, Villalba-Mouco V, Silva M, Dulias K, Edwards CJ, Gandini F, Pala M, Soares P, Ferrando-Bernal M, Adamski N, Broomandkhoshbacht N, Cheronet O, Culleton BJ, Fernandes D, Lawson AM, Mah M, Oppenheimer J, Stewardson K, Zhang Z, Arenas JMJ, Moyano IJT, Salazar-García DC, Castanyer P, Santos M, Tremoleda J, Lozano M, Borja PG, Fernández-Eraso J, Mujika-Alustiza JA, Barroso C, Bermúdez FJ, Mínguez EV, Burch J, Coromina N, Vivó D, Cebrià A, Fullola JM, García-Puchol O, Morales JI, Oms FX, Majó T, Vergès JM, Díaz-Carvajal A, Ollich-Castanyer I, López-Cachero FJ, Silva AM, Alonso-Fernández C, de Castro GD, Echevarría JJ, Moreno-Márquez A, Berlanga GP, Ramos-García P, Ramos-Muñoz J, Vila EV, Arzo GA, Arroyo ÁE, Lillios KT, Mack J, Velasco-Vázquez J, Waterman A, Enrich L. B. de L., Sánchez MB, Agustí B, Codina F, de Prado G, Estalrrich A, Flores ÁF, Finlayson C, Finlayson G, Finlayson S, Giles-Guzmán F, Rosas A, González VB, Atiénzar GG, Pérez MSH, Llanos A, Marco YC, Beneyto IC, López-Serrano D, Tormo MS, Valera AC, Blasco C, Liesau C, Ríos P, Daura J, Michó M. J. de P., Diez-Castillo AA, Fernández RF, Farré JF, Garrido-Pena R, Gonçalves VS, Guerra-Doce E, Herrero-Corral AM, JuanCabanilles J, López-Reyes D, McClure SB, Pérez MM, Foix AO, Borràs MS, Sousa AC, Encinas JMV, Kennett DJ, Richards MB, Alt KW, Haak W, Pinhasi R, Lalueza-Fox C, Reich D, The genomic history of the Iberian Peninsula over the past 8000 years. Science. 363, 1230–1234 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Harney É, Patterson N, Reich D, Wakeley J, Assessing the performance of qpAdm: a statistical tool for studying population admixture. Genetics. 217 (2021), doi: 10.1093/genetics/iyaa045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Alexander DH, Novembre J, Lange K, Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Patterson N, Price AL, Reich D, Population Structure and Eigenanalysis. PLOS Genetics. 2, e190 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 84.Narasimhan VM, Patterson N, Moorjani P, Rohland N, Bernardos R, Mallick S, Lazaridis I, Nakatsuka N, Olalde I, Lipson M, Kim AM, Olivieri LM, Coppa A, Vidale M, Mallory J, Moiseyev V, Kitov E, Monge J, Adamski N, Alex N, Broomandkhoshbacht N, Candilio F, Callan K, Cheronet O, Culleton BJ, Ferry M, Fernandes D, Freilich S, Gamarra B, Gaudio D, Hajdinjak M, Harney É, Harper TK, Keating D, Lawson AM, Mah M, Mandl K, Michel M, Novak M, Oppenheimer J, Rai N, Sirak K, Slon V, Stewardson K, Zalzala F, Zhang Z, Akhatov G, Bagashev AN, Bagnera A, Baitanayev B, Bendezu-Sarmiento J, Bissembaev AA, Bonora GL, Chargynov TT, Chikisheva T, Dashkovskiy PK, Derevianko A, Dobeš M, Douka K, Dubova N, Duisengali MN, Enshin D, Epimakhov A, Fribus AV, Fuller D, Goryachev A, Gromov A, Grushin SP, Hanks B, Judd M, Kazizov E, Khokhlov A, Krygin AP, Kupriyanova E, Kuznetsov P, Luiselli D, Maksudov F, Mamedov AM, Mamirov TB, Meiklejohn C, Merrett DC, Micheli R, Mochalov O, Mustafokulov S, Nayak A, Pettener D, Potts R, Razhev D, Rykun M, Sarno S, Savenkova TM, Sikhymbaeva K, Slepchenko SM, Soltobaev OA, Stepanova N, Svyatko S, Tabaldiev K, Teschler-Nicola M, Tishkin AA, Tkachev VV, Vasilyev S, Velemínský P, Voyakin D, Yermolayeva A, Zahir M, Zubkov VS, Zubova A, Shinde VS, Lalueza-Fox C, Meyer M, Anthony D, Boivin N, Thangaraj K, Kennett DJ, Frachetti M, Pinhasi R, Reich D, The formation of human populations in South and Central Asia. Science. 365 (2019), doi: 10.1126/science.aat7487. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 85.Van Dorp L, Lowes S, Weigel JL, Ansari-Pour N, López S, Mendoza-Revilla J, Robinson JA, Henrich J, Thomas MG, Nunn N, Genetic legacy of state centralization in the Kuba Kingdom of the Democratic Republic of the Congo. Proceedings of the National Academy of Sciences. 116, 593–598 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Jackson BA, Wilson JL, Kirbah S, Sidney SS, Rosenberger J, Bassie L, Alie JA, McLean DC, Garvey WT, Ely B, Mitochondrial DNA genetic diversity among four ethnic groups in Sierra Leone. American journal of physical anthropology. 128, 156–163 (2005). [DOI] [PubMed] [Google Scholar]
- 87.Henn BM, Gignoux CR, Jobin M, Granka JM, Macpherson J, Kidd JM, Rodríguez-Botigué L, Ramachandran S, Hon L, Brisbin A, Hunter-gatherer genomic diversity suggests a southern African origin for modern humans. Proceedings of the National Academy of Sciences. 108, 5154–5162 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E, Fast unfolding of communities in large networks. Journal of statistical mechanics: theory and experiment. 2008, P10008 (2008). [Google Scholar]
- 89.Bastian M, Heymann S, Jacomy M, “Gephi: an open source software for exploring and manipulating networks” in (2009).
- 90.Enslaved faces from African American cemetery in Catoctin, Maryland recreated - The Washington Post, (available at https://www.washingtonpost.com/history/2021/07/09/african-american-cemetery-catoctin-enslaved-faces/).
- 91.Roberts A, Historical Society unveils facial reconstructions of African-Americans once enslaved in Thurmont. The Frederick News-Post, (available at https://www.fredericknewspost.com/news/economy_and_business/tourism/historical-society-unveils-facial-reconstructions-of-african-americans-once-enslaved-in-thurmont/article_57a8171b-f3e2-52c8-bb8d-a6fc6d189ef2.html). [Google Scholar]
- 92.Fullerton SM, Lee SS-J, Secondary uses and the governance of de-identified data: Lessons from the human genome diversity panel. BMC Medical Ethics. 12, 16 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Resnik DB, The Human Genome Diversity Project: Ethical Problems and Solutions. Politics and the Life Sciences. 18, 15–23 (1999). [DOI] [PubMed] [Google Scholar]
- 94.O’Connell J, Yun T, Moreno M, Li H, Litterman N, Kolesnikov A, Noblin E, Chang P-C, Shastri A, Dorfman EH, Shringarpure S, Auton A, Carroll A, McLean CY, A population-specific reference panel for improved genotype imputation in African Americans. Commun Biol. 4, 1–9 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Ebel ER, Reis F, Petrov DA, Beleza S, Historical trends and new surveillance of Plasmodium falciparum drug resistance markers in Angola. Malaria Journal. 20, 175 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Sweeney L, k-ANONYMITY: A MODEL FOR PROTECTING PRIVACY. Int. J. Unc. Fuzz. Knowl. Based Syst 10, 557–570 (2002). [Google Scholar]
- 97.Birney E, Inouye M, Raff J, Rutherford A, Scally A, The language of race, ethnicity, and ancestry in human genetic research (2021), , doi: 10.48550/arXiv.2106.10041. [DOI] [Google Scholar]
- 98.Wagner JK, Yu J-H, Ifekwunigwe JO, Harrell TM, Bamshad MJ, Royal CD, Anthropologists’ views on race, ancestry, and genetics. American Journal of Physical Anthropology. 162, 318–327 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Mathieson I, Scally A, What is ancestry? PLOS Genetics. 16, e1008624 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Agyemang C, Bhopal R, Bruijnzeels M, Negro, Black, Black African, African Caribbean, African American or what? Labelling African origin populations in the health arena in the 21st century. Journal of Epidemiology & Community Health. 59, 1014–1018 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 101.AP says it will capitalize Black but not white | AP News, (available at https://apnews.com/article/entertainment-cultures-race-and-ethnicity-us-news-ap-top-news-7e36c00c5af0436abc09e051261fff1f).
- 102.Coop G, Genetic similarity versus genetic ancestry groups as sample descriptors in human genetics (2023), , doi: 10.48550/arXiv.2207.11595. [DOI] [Google Scholar]
- 103.Lewis ACF, Molina SJ, Appelbaum PS, Dauda B, Di Rienzo A, Fuentes A, Fullerton SM, Garrison NA, Ghosh N, Hammonds EM, Jones DS, Kenny EE, Kraft P, Lee SS-J, Mauro M, Novembre J, Panofsky A, Sohail M, Neale BM, Allen DS, Getting genetic ancestry right for science and society. Science. 376, 250–252 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Miyagi M, Guthman EM, Sun SD-K, Transgender rights rely on inclusive language. Science. 374, 1568–1569 (2021). [DOI] [PubMed] [Google Scholar]
- 105.Brück J, Ancient DNA, kinship and relational identities in Bronze Age Britain. Antiquity. 95, 228–237 (2021). [Google Scholar]
- 106.Ebenesersdóttir SS, Sandoval-Velasco M, Gunnarsdóttir ED, Jagadeesan A, Guðmundsdóttir VB, Thordardóttir EL, Einarsdóttir MS, Moore KHS, Sigurðsson Á, Magnúsdóttir DN, Jónsson H, Snorradóttir S, Hovig E, Møller P, Kockum I, Olsson T, Alfredsson L, Hansen TF, Werge T, Cavalleri GL, Gilbert E, Lalueza-Fox C, Walser JW, Kristjánsdóttir S, Gopalakrishnan S, Árnadóttir L, Magnússon ÓÞ, Gilbert MTP, Stefánsson K, Helgason A, Ancient genomes from Iceland reveal the making of a human population. Science. 360, 1028–1032 (2018). [DOI] [PubMed] [Google Scholar]
- 107.Olalde I, Brace S, Allentoft ME, Armit I, Kristiansen K, Booth T, Rohland N, Mallick S, Szécsényi-Nagy A, Mittnik A, Altena E, Lipson M, Lazaridis I, Harper TK, Patterson N, Broomandkhoshbacht N, Diekmann Y, Faltyskova Z, Fernandes D, Ferry M, Harney E, de Knijff P, Michel M, Oppenheimer J, Stewardson K, Barclay A, Alt KW, Liesau C, Ríos P, Blasco C, Miguel JV, García RM, Fernández AA, Bánffy E, Bernab-Brea M, Billoin D, Bonsall C, Bonsall L, Allen T, Büster L, Carver S, Navarro LC, Craig OE, Cook GT, Cunliffe B, Denaire A, Dinwiddy KE, Dodwell N, Ernée M, Evans C, Kuchařík M, Farré JF, Fowler C, Gazenbeek M, Pena RG, Haber-Uriarte M, Haduch E, Hey G, Jowett N, Knowles T, Massy K, Pfrengle S, Lefranc P, Lemercier O, Lefebvre A, Martínez CH, Olmo VG, Ramírez AB, Maurandi JL, Majó T, McKinley JI, McSweeney K, Mende BG, Modi A, Kulcsár G, Kiss V, Czene A, Patay R, Endrődi A, Köhler K, Hajdu T, Szeniczey T, Dani J, Bernert Z, Hoole M, Cheronet O, Keating D, Velemínský P, Dobeš M, Candilio F, Brown F, Fernández RF, Herrero-Corral A-M, Tusa S, Carnieri E, Lentini L, Valenti A, Zanini A, Waddington C, Delibes G, Guerra-Doce E, Neil B, Brittain M, Luke M, Mortimer R, Desideri J, Besse M, Brücken G, Furmanek M, Hałuszko A, Mackiewicz M, Rapiński A, Leach S, Soriano I, Lillios KT, Cardoso JL, Pearson MP, Włodarczak P, Price TD, Prieto P, Rey P-J, Risch R, Rojo Guerra MA, Schmitt A, Serralongue J, Silva AM, Smrčka V, Vergnaud L, Zilhão J, Caramelli D, Higham T, Thomas MG, Kennett DJ, Fokkens H, Heyd V, Sheridan A, Sjögren K-G, Stockhammer PW, Krause J, Pinhasi R, Haak W, Barnes I, Lalueza-Fox C, Reich D, The Beaker phenomenon and the genomic transformation of northwest Europe. Nature. 555, 190–196 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 108.“Allen Ancient Genome Diversity Project / John Templeton Ancient DNA Atlas.”
- 109.Lazaridis I, Nadel D, Rollefson G, Merrett DC, Rohland N, Mallick S, Fernandes D, Novak M, Gamarra B, Sirak K, Connell S, Stewardson K, Harney E, Fu Q, Gonzalez-Fortes G, Jones ER, Roodenberg SA, Lengyel G, Bocquentin F, Gasparian B, Monge JM, Gregg M, Eshed V, Mizrahi A-S, Meiklejohn C, Gerritsen F, Bejenaru L, Blüher M, Campbell A, Cavalleri G, Comas D, Froguel P, Gilbert E, Kerr SM, Kovacs P, Krause J, McGettigan D, Merrigan M, Merriwether DA, O’Reilly S, Richards MB, Semino O, Shamoon-Pour M, Stefanescu G, Stumvoll M, Tönjes A, Torroni A, Wilson JF, Yengo L, Hovhannisyan NA, Patterson N, Pinhasi R, Reich D, Genomic insights into the origin of farming in the ancient Near East. Nature. 536, 419–424 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Llorente MG, Jones ER, Eriksson A, Siska V, Arthur KW, Arthur JW, Curtis MC, Stock JT, Coltorti M, Pieruccini P, Stretton S, Brock F, Higham T, Park Y, Hofreiter M, Bradley DG, Bhak J, Pinhasi R, Manica A, Ancient Ethiopian genome reveals extensive Eurasian admixture in Eastern Africa. Science (2015), doi: 10.1126/science.aad2879. [DOI] [PubMed] [Google Scholar]
- 111.Damgaard P. de B., Martiniano R, Kamm J, Moreno-Mayar JV, Kroonen G, Peyrot M, Barjamovic G, Rasmussen S, Zacho C, Baimukhanov N, Zaibert V, Merz V, Biddanda A, Merz I, Loman V, Evdokimov V, Usmanova E, Hemphill B, Seguin-Orlando A, Yediay FE, Ullah I, Sjögren K-G, Iversen KH, Choin J, de la Fuente C, Ilardo M, Schroeder H, Moiseyev V, Gromov A, Polyakov A, Omura S, Senyurt SY, Ahmad H, McKenzie C, Margaryan A, Hameed A, Samad A, Gul N, Khokhar MH, Goriunova OI, Bazaliiskii VI, Novembre J, Weber AW, Orlando L, Allentoft ME, Nielsen R, Kristiansen K, Sikora M, Outram AK, Durbin R, Willerslev E, The first horse herders and the impact of early Bronze Age steppe expansions into Asia. Science (2018), doi: 10.1126/science.aar7711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Gamba C, Jones ER, Teasdale MD, McLaughlin RL, Gonzalez-Fortes G, Mattiangeli V, Domboróczki L, Kővári I, Pap I, Anders A, Whittle A, Dani J, Raczky P, Higham TFG, Hofreiter M, Bradley DG, Pinhasi R, Genome flux and stasis in a five millennium transect of European prehistory. Nat Commun. 5, 5257 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Gokhman D, Lavi E, Prüfer K, Fraga MF, Riancho JA, Kelso J, Pääbo S, Meshorer E, Carmel L, Reconstructing the DNA Methylation Maps of the Neandertal and the Denisovan. Science. 344, 523–527 (2014). [DOI] [PubMed] [Google Scholar]
- 114.Günther T, Malmström H, Svensson EM, Omrak A, Sánchez-Quinto F, Kılınç GM, Krzewińska M, Eriksson G, Fraser M, Edlund H, Munters AR, Coutinho A, Simões LG, Vicente M, Sjölander A, Sellevold BJ, Jørgensen R, Claes P, Shriver MD, Valdiosera C, Netea MG, Apel J, Lidén K, Skar B, Storå J, Götherström A, Jakobsson M, Population genomics of Mesolithic Scandinavia: Investigating early postglacial migration routes and high-latitude adaptation. PLOS Biology. 16, e2003703 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Ferrando-Bernal M, Morcillo-Suarez C, de-Dios T, Gelabert P, Civit S, Díaz-Carvajal A, Ollich-Castanyer I, Allentoft ME, Valverde S, Lalueza-Fox C, Mapping co-ancestry connections between the genome of a Medieval individual and modern Europeans. Scientific Reports. 10, 6843 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 116.Margaryan A, Lawson DJ, Sikora M, Racimo F, Rasmussen S, Moltke I, Cassidy LM, Jørsboe E, Ingason A, Pedersen MW, Korneliussen T, Wilhelmson H, Buś MM, de Barros Damgaard P, Martiniano R, Renaud G, Bhérer C, Moreno-Mayar JV, Fotakis AK, Allen M, Allmäe R, Molak M, Cappellini E, Scorrano G, McColl H, Buzhilova A, Fox A, Albrechtsen A, Schütz B, Skar B, Arcini C, Falys C, Jonson CH, Błaszczyk, Pezhemsky D, Turner-Walker G, Gestsdóttir H, Lundstrøm I, Gustin I, Mainland I, Potekhina I, Muntoni IM, Cheng J, Stenderup J, Ma J, Gibson J, Peets J, Gustafsson J, Iversen KH, Simpson L, Strand L, Loe L, Sikora M, Florek M, Vretemark M, Redknap M, Bajka M, Pushkina T, Søvsø M, Grigoreva N, Christensen T, Kastholm O, Uldum O, Favia P, Holck P, Sten S, Arge SV, Ellingvåg S, Moiseyev V, Bogdanowicz W, Magnusson Y, Orlando L, Pentz P, Jessen MD, Pedersen A, Collard M, Bradley DG, Jørkov ML, Arneborg J, Lynnerup N, Price N, Gilbert MTP, Allentoft ME, Bill J, Sindbæk SM, Hedeager L, Kristiansen K, Nielsen R, Werge T, Willerslev, Population genomics of the Viking world. Nature. 585, 390–396 (2020). [DOI] [PubMed] [Google Scholar]
- 117.Sikora M, Pitulko VV, Sousa VC, Allentoft ME, Vinner L, Rasmussen S, Margaryan A, de Barros Damgaard P, de la Fuente C, Renaud G, Yang MA, Fu Q, Dupanloup I, Giampoudakis K, Nogués-Bravo D, Rahbek C, Kroonen G, Peyrot M, McColl H, Vasilyev SV, Veselovskaya E, Gerasimova M, Pavlova EY, Chasnyk VG, Nikolskiy PA, Gromov AV, Khartanovich VI, Moiseyev V, Grebenyuk PS, Fedorchenko AY, Lebedintsev AI, Slobodin SB, Malyarchuk BA, Martiniano R, Meldgaard M, Arppe L, Palo JU, Sundell T, Mannermaa K, Putkonen M, Alexandersen V, Primeau C, Baimukhanov N, Malhi RS, Sjögren K-G, Kristiansen K, Wessman A, Sajantila A, Lahr MM, Durbin R, Nielsen R, Meltzer DJ, Excoffier L, Willerslev E, The population history of northeastern Siberia since the Pleistocene. Nature. 570, 182–188 (2019). [DOI] [PubMed] [Google Scholar]
- 118.Kivisild T, Saag L, Hui R, Biagini SA, Pankratov V, D’Atanasio E, Pagani L, Saag L, Rootsi S, Mägi R, Metspalu E, Valk H, Malve M, Irdt K, Reisberg T, Solnik A, Scheib CL, Seidman DN, Williams AL, Tambets K, Metspalu M, Patterns of genetic connectedness between modern and medieval Estonian genomes reveal the origins of a major ancestry component of the Finnish population. The American Journal of Human Genetics. 108, 1792–1806 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Huff CD, Witherspoon DJ, Simonson TS, Xing J, Watkins WS, Zhang Y, Tuohy TM, Neklason DW, Burt RW, Guthery SL, Woodward SR, Jorde LB, Maximum-likelihood estimation of recent shared ancestry (ERSA). Genome Res. 21, 768–774 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.THOMAS A, SKOLNICK MH, LEWIS CM, Genomic mismatch scanning in pedigrees. Mathematical Medicine and Biology: A Journal of the IMA. 11, 1–16 (1994). [DOI] [PubMed] [Google Scholar]
- 121.Wohns AW, Wong Y, Jeffery B, Akbari A, Mallick S, Pinhasi R, Patterson N, Reich D, Kelleher J, McVean G, A unified genealogy of modern and ancient genomes (2021), p. 2021.02.16.431497, , doi: 10.1101/2021.02.16.431497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Gouveia MH, Borda V, Leal TP, Moreira RG, Bergen AW, Kehdy FSG, Alvim I, Aquino MM, Araujo GS, Araujo NM, Furlan V, Liboredo R, Machado M, Magalhaes WCS, Michelin LA, Rodrigues MR, Rodrigues-Soares F, Sant Anna HP, Santolalla ML, Scliar MO, Soares-Souza G, Zamudio R, Zolini C, Bortolini MC, Dean M, Gilman RH, Guio H, Rocha J, Pereira AC, Barreto ML, Horta BL, Lima-Costa MF, Mbulaiteye SM, Chanock SJ, Tishkoff SA, Yeager M, Tarazona-Santos E, Origins, Admixture Dynamics, and Homogenization of the African Gene Pool in the Americas. Molecular Biology and Evolution. 37, 1647–1656 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Jackson FLC, So many Nigerians: why is Nigeria overrepresented as the ancestral genetic homeland of Legacy African North Americans? The American Journal of Human Genetics. 108, 202–208 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Ko A, Nielsen R, Composite likelihood method for inferring local pedigrees. PLOS Genetics. 13, e1006963 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Materials and Methods
Supplementary Text S1 - Extended Ethics Statement
Supplementary Text S2 - The Return of Names
Supplementary Text S3 - Testing the application of IBD detection methods to imputed low coverage ancient DNA using simulated data
Supplementary Text S4 - Testing the performance of Ancestry Composition on imputed low coverage ancient DNA using simulated data
Supplementary Text S5 - Comparison of IBD Network results of Catoctin individuals with present-day African Americans
Supplementary Text S6 - Genetic connections to African Among Catoctin individuals and 23andMe participants in the US cohort with at least 50% Sub-Saharan African ancestry
Figs. S1 to S16
References 74–124
Tables S1 to S24, S1.1, S3.2 to S3.7, and S4.1
Data Availability Statement
Unaligned and aligned sequences for the 27 Catoctin individuals were originally reported in Harney et al. (13) and are available from the European Nucleotide Archive under accession number PRJEB52230. Genotype files for pseudo-haploid and imputed versions of the dataset are available at https://reich.hms.harvard.edu/datasets. There are restrictions to the availability of 23andMe genotype data due to 23andMe consent and privacy guidelines. 23andMe agrees that the publication coauthors will rerun the comparison of historical genomic data against customer genetic data upon request by other academic and nonprofit researchers on reasonable terms to enable the results of the research activities to be replicated for at least seven years after publication or for as long as the coauthors are employed by, or otherwise affiliated with 23andMe in a capacity that allows them to rerun the analysis. Wherever possible, supplementary tables were included that report the summary statistics that were used to create figures that involved 23andMe datasets. Unless another comparable anonymizing approach was specified, these summary statistics were generated with the requirement that in all reported results, any 23andMe research participant must be indistinguishable from at least four other research participants included in the dataset. When meaningful, we repeated analyses performed on the 23andMe dataset using only the 1000 genomes and/or African American imputation panel datasets and reported these results.