Skip to main content
Wiley Open Access Collection logoLink to Wiley Open Access Collection
. 2025 Jul 15;89(5):264–273. doi: 10.1111/ahg.70007

Power and Limitations of Inferring Genetic Ancestry

Nancy Bird 1, Turi King 2, Garrett Hellenthal 1,
PMCID: PMC12336972  PMID: 40665632

ABSTRACT

Background

The recent emergence of technologies that capture and analyse genetic variation patterns obtained from a person's DNA sample has led to numerous academic and commercial endeavours to infer individuals' ancestries. In theory, a person's genome contains a wealth of readily accessible information regarding their ancestors, despite only some of our ancestors contributing to the DNA we carry. This makes genetic tests an attractive alternative to the painstaking reconstruction of family trees or directly contacting long‐lost relations, particularly when, unless there are notable individuals in the tree, historical and genealogical records tend to diminish in frequency with each generation. However, while powerful, there are limits to what genetic data can unearth, as well as important assumptions underlying these analyses.

Methods

This review describes some of the early history and latest advances in techniques and data used to infer ancestry using genetics, highlighting both the power and limitations of current studies.

Conclusion

While genetics is a powerful means of exploring aspects of people's ancestry, a stronger focus on conveying uncertainty will allow both academics and non‐academics to avoid the ever‐present risks of over‐interpretation.

1. Background

The term ancestry is problematic for a number of reasons. For example, it can mean our genealogical ancestors, i.e. those individuals whose identities we can trace using historical and genealogical documents. Many people conflate the terms ancestry and ethnicity. In particular, ethnicity can be considered to be a more fluid term based on how an individual or individuals claim identity with a particular group based on a shared culture, language, religion or history and can therefore change within a person's lifetime. Genetic ancestry is something else again as, due to how our DNA is inherited, not all of our genealogical ancestors are represented in the DNA that we carry. Therefore, genetic ancestry refers only to the subset of paths through your genealogical pedigree that you have inherited genetic material (see Mathieson and Scally 2020 for a review of different definitions of ancestry).

In this review, we will confine ourselves to genetic ancestry. We discuss recent advances in studies of genetic ancestry, while also highlighting some of the limitations and assumptions of these studies. These limitations include the widely‐used practice of categorising individuals into distinct groups, e.g. using genetics or ethnic labels, which—rather than having any biological relevance—is done for convenience to assist analyses and interpretations. The potential for such classifications to propagate harmful ideas regarding “biological race”, along with the increasing visibility of genetic data to the general public through e.g. DTC tests, underscores how effective communication of the field's limitations, in addition to its potential, are essential.

2. Data Types and Initial Studies

Due to the simplicity of their inheritance patterns, initial ancestry analyses focused on studying mitochondrial DNA (mtDNA), which is passed from mother to offspring, and the non‐recombining portion of the Y chromosome (NRY), which is passed from father to son. In 1987, Cann et al., using low‐resolution RFLP analysis, examined the mtDNA types of 147 individuals, and from this, inferred that the common ancestor of the research participants was a woman who lived probably in Africa some 200,000 years ago (Cann et al. 1987). On publication, it generated international headlines and within months, Cann had dubbed this individual ‘Mitochondrial Eve’: she was known as the ‘mother’ of all living humans.

As mtDNA analysis developed, focusing first on sequencing the two hypervariable regions HVS1 and HVS2, further mtDNA types, known as haplogroups, were determined. By 1996, geneticist Bryan Sykes had set up a commercial DNA testing company ‘Oxford Ancestors’, offering the paying public an analysis of their mtDNA type. Further interest in his company, and mtDNA testing in terms of genetic ancestry, was boosted by the publication of his book ‘The Seven Daughters of Eve’ which, riffing on that early paper, artificially split mtDNA types into seven groups, which he gave the names of fictional regional ‘daughters’ (Richards et al. 1996; Sykes 2001). Here the use of genealogical terms and names fuelled the public imagination and was the start of the direct‐to‐consumer (DTC) genetic ancestry testing industry we see today.

Testing of mtDNA types for genetic ancestry was followed swiftly by a similar analysis of Y chromosomes. Here, again, Bryan Sykes was one of the early providers of such testing for the public, first using a set of 10 short‐tandem‐repeat (STR) markers, short repetitive stretches of DNA sequence, to predict a Y chromosome haplogroup—he also gave ‘clan’ names and determined the testee's ‘ancient ancestral father’. Here the terminology is used to conjure up a closer relationship that exists—using the term ‘father’ for someone who would have lived many generations ago, some millennia in the past.

Analyses of mtDNA and NRY data thus heralded the start of the field of genetic genealogy. Of course, due to the inheritance pattern of these two segments of DNA, women could only take an mtDNA test, whereas men can take both a Y chromosome and a mitochondrial test. Another company, Family Tree DNA, soon offered similar tests, progressing over time to larger sets of STRs on the Y chromosome to type. These could then be used to test within groups of men suspected to share a common ancestor, for example, due to sharing the same surname, to determine groups descended from a common ancestor within a genealogical timeframe or to prove or disprove family relationships.

While some mtDNA and NRY analyses have proven useful scientifically (King et al. 2006; King et al. 2007; King and Jobling 2009; King et al. 2014), despite some of the over‐interpretations mentioned above, recent ancestry studies instead analyse the autosomal chromosomes, which comprise the vast majority of our genome. Key to studying genetic ancestry using autosomes is the process of recombination. Effectively, when an autosomal chromosome is passed from a parent to their offspring, each recombination event will bifurcate the chromosome into segments inherited from a different grandparent on that parent's side. Stacking these recombinations up over generations, a person's autosomal genome can be depicted as a mosaic of segments inherited from different ancestors who lived at some point in the past. Therefore, an important consequence of recombination is that a person's autosomes contain information from each of the many ancestors that passed genetic material through the generations to that person. In contrast, each of the NRY and mtDNA contain information from only one of these ancestors and hence represent a very incomplete picture of their progenitors, despite some early out‐sized claims of their importance (Figure 1).

FIGURE 1.

FIGURE 1

Mitochondrial DNA (blue fill) and Y chromosomes (orange outline) are each inherited from a single ancestor matrilineally or patrilineally, respectively. They contain no genetic information from any other ancestors (grey figures with black outlines). The autosomes are represented by patterned rectangles, with one autosomal chromosome pair depicted per person. One chromosome in a pair is inherited from each parent, and the process of recombination affects the amount of DNA inherited from each ancestor at each generation. Relative to the mtDNA and NRY, the autosomal chromosome pair contains genetic material from many of an individual's ancestors, hence providing a more complete picture of their genetic history.

Even across the autosomes, many of the people in one's distant family tree are expected to contribute little or no DNA to that person. For example, while each of a person's four grandparents is expected to contribute to 1/4th of their autosomal genome, ten generations (~250–300 years) ago, a person can have at most 2^10 = 1024 ancestors. Thus each of these more distant ancestors is expected to contribute to <1/1000th of their autosomal genome, with a ∼58% chance of no detectable relationship at all. At fourteen generations ago (∼420 years), this probability goes up to >95% (Donnelly 1983; Ralph and Coop 2013). For this reason, our genome is necessarily only a snapshot of a fraction of a person's complete set of ancestors and carries no information about the ancestors that did not contribute any DNA to this particular descendant.

3. Inferring Shared Recent Ancestry Using Autosomal Data

Due to recombination, individuals who are more closely related are expected to share longer segments of autosomal DNA with each other. For example, grandparents are expected to share longer segments with their grandchildren than great‐grandparents are with their great‐grandchildren. Similarly, cousins are expected to share longer segments than second cousins, as cousins' shared segments are inherited through a common grandparent and second cousins' shared segments are inherited through a common great‐grandparent. Given the mutation rate per meiosis in humans is small, such segments inherited from a shared recent ancestor are expected to contain the same (or very similar) genetic sequence data, with most studies today analysing biallelic single‐nucleotide polymorphism (SNP) positions where a given segment can carry one of two possible alleles (genetic variants). Based on this logic, several programs e.g. hap‐IBD, ancIBD and ibd‐cluster (Browning and Browning 2024; Ringbauer et al. 2024; Zhou et al. 2020) try to identify long segments (e.g. >2Mb) for which two individuals share matching SNP data, as these may indicate so‐called identical‐by‐descent (IBD) segments that both have inherited from the same recent ancestor. The total amount and lengths of IBD sharing genome‐wide can then be used to infer the two individuals' degree of kinship (Seidman et al. 2020). Indeed some DNA testing companies return results showing the inferred amount (measured in centimorgans) of genome‐wide DNA shared between a customer and others in the reference database (Guerrini et al. 2022). Using currently available genetic variation datasets in modern humans, IBD‐inference programs typically can reliably estimate kinship up to a 6th degree (Ramstetter et al. 2017).

Identifying close relatives in this manner is of enormous interest in many applications, such as DTC services, forensics and reconstructing pedigrees using genetic data from archaeological sites (Cassidy et al. 2025; Tillmar and Kling 2025). However, most academic and commercial studies of ancestry instead use collections of unrelated people, where sampled individuals have little inferred IBD (i.e. long segment) sharing. This is because unrelated people share more distant ancestry that is often not well‐captured by other non‐genetic records. To learn about a person's ancestry, several programs such as Chromopainter, RFmix, MOSAIC and supervised ADMIXTURE (Alexander and Lange 2011; Lawson et al. 2012; Maples et al. 2013; Salter‐Townshend and Myers 2019) compare the genetic data from a target individual of unknown ancestry to that in a set of reference individuals for which ancestry is presumed known. The aim of such analyses is to identify the person within the reference set that a target individual is most closely related to in a genomic region, i.e., this represents the reference person for which the target individual shares a more recent ancestor than they do with any other person in the reference set. Due to historical recombination as described above, this “closest relative” changes multiple times along each autosomal chromosome of a target individual (Figure 1). Typically, and intuitively, the closest relative in a genetic region is inferred to be the reference individual who carries the longest stretch of SNP data matching that of the target individual in the genetic region (Lawson et al. 2012). While this is a powerful approach to inferring genetic ancestry, other commonly used approaches perform matching using summaries of SNP data, e.g., projecting people with unknown ancestry onto principal components extracted from reference sample data (Herrando‐Pérez et al. 2021; Patterson et al. 2006; Price et al. 2006).

4. Limitations of Autosomal‐Based Ancestry Inference

In addition to the methodology used for inference, this approach to determining a person's ancestry depends greatly on the reference set of individuals that the target is compared to. Typically the reference set consists of people sampled from particular geographic regions and/or people who have provided ethnicity or birthplace information. Reference individuals are then classified into different groups based on this geographic and/or ethnicity information. Sometimes genetic clustering algorithms are employed in an attempt to ensure these reference groups are relatively genetically homogeneous. A target individual that matches someone in a particular reference group is assumed to have at least some ancestors who were closely related to people from that reference group. While this makes logical sense, interpreting what this close relatedness means is not straightforward. Even the best‐matching reference individuals may be genetically distant from their true ancestors, which can occur if no good representatives for a person's ancestors exist in the reference set. As an example, many available reference datasets contain a larger number of people of European descent relative to those of non‐European descent (Adebamowo et al. 2022; Fatumo et al. 2022). In such cases, more precise matches, i.e., matching to reference individuals in fine‐scale geographic regions, are expected when inferring the ancestry of someone of European descent relative to someone with few recent European ancestors.

Furthermore, matching a person's genetic variation patterns to those of people in a particular reference group does not mean that the person's ancestors were ever a member of that current ethnic group or ever lived in that geographic region. This is an assumption, as genetics contains no geocoding or ethnicity information, and the genetic ancestry of people in a certain region can dramatically change through time with migration and drift. Conversely, a target individual may not match a reference group even if they have ancestors who are from the ethnic group or geographic region represented by that reference group. This can occur if the reference individuals representing that ethnic group or geographic region do not well reflect the complete genetic diversity of that group or region. For example, a reference set of British individuals that consists only of people sampled from a small geographic area within Wales may not be matched by someone else with recent British ancestry. Instead, this person may share more best matches with reference individuals sampled from France who happen to have a greater amount of genetic diversity, resulting in misleading inference. In addition to these issues, sample size bias can occur, whereby geographic regions and/or ethnic groups represented by more reference individuals are a priori more likely to be matched unless careful steps are taken to avoid this (e.g. ensuring all reference groups have the same number of people).

5. Recent Advances in Acquiring and Analysing Genetic Variation Data

Notwithstanding these complications, both the genetic data acquired and the methods used to analyse these data have made continuous strides. Of particular note, large‐scale whole‐genome‐sequencing (WGS) datasets, which consist of high‐quality variant calls at all (millions of) SNPs within thousands to hundreds of thousands of individuals, are becoming increasingly available (All of Us Research Program Genomics Investigators 2024; Bycroft et al. 2018; Milani et al. 2024; Nagai et al. 2017; Walters et al. 2023). Such datasets enable finding which individuals share variants that are very rarely found in the rest of the population, such as doubletons (a genetic variant found in only two individuals within a sample), which are indicative of very recent ancestor sharing (Mathieson and McVean 2014). Emerging technologies are increasingly accurate at interrogating parts of the genome that were previously difficult or impossible to capture, enabling telomere‐to‐telomere assembly that includes better assessment of (e.g.) copy‐number‐variants and ribosomal DNA patterns that may further improve ancestry inference (Aganezov et al. 2022; Miga and Eichler 2023; Nurk et al. 2022).

In addition, the technology to reliably sequence DNA from ancient human remains (aDNA) is continually improving (Liu et al. 2022). Consequently, recent years have seen a massive increase in the amount of available aDNA representing worldwide geographic regions over different time periods (Akbari et al. 2024; Mallick et al. 2024). In many cases, relative to present‐day samples, aDNA can offer a better genetic representative of the ancestral groups that a target individual descends from. When using present‐day reference samples to learn whether a target individual has ancestors from genetically different (e.g. geographically distant) groups, a major challenge is that many people alive today themselves descend from the intermixing of genetically different groups. DNA samples from individuals that predate these intermixing events can circumvent this issue. For example, ancient genomes from Jomon hunter‐gatherers have been useful for understanding patterns of ancestry in modern Japanese people who are admixed between this ancient population and continental East Asian groups. Similarly, genomes from Viking age individuals have been used to investigate their genetic legacy in modern‐day Europeans, and present‐day Ethiopians’ genomes have been shown to have differing amounts of ancestry related to a 4500‐year‐old ancient Ethiopian forager (Watanabe and Ohashi 2023; Yamamoto et al. 2024; Margaryan et al. 2020; López et al. 2021). Of course, the complexity of human history has demonstrated that aDNA samples often descend from intermixed groups too, and challenges in acquiring reliable data from particular geographic regions and time periods can still hinder sampling the “best” ancestral proxies.

Historical samples (from within the last millennia) are increasingly being published, including those from famous individuals, such as Beethoven, King Richard III of England and Lakota Sioux leader Sitting Bull (King et al. 2014; Moltke et al. 2021; Begg et al. 2023). Some DTC companies, such as 23andMe, DNA Consultants and MyTrueAncestry now allow individuals to compare their DNA to certain historical individuals and ancient DNA samples.  However, such mtDNA or Y chromosome matches are likely to be on timescales that are not genealogically relevant. Moreover, given the doubling of ancestors in each generation backwards in time, the importance of a genetic match to any specific historical individual on the autosomes may carry little significance (Chang 1999; Derrida et al. 2000; Ralph and Coop 2013). Indeed any autosomal DNA sharing between a customer and a historical individual detected by these companies will almost certainly be small and due to them sharing some distant common genetic ancestor(s), rather than the customer directly descending from that person. https://www.forbes.com/sites/jenniferraff/2019/04/09/genetic-astrology-when-ancient-dna-meets-ancestry-testing/#6dcb5b496c69

High‐quality WGS data have also stimulated the advance of approaches that infer full (genetic) genealogies relating thousands of people back in time through their shared ancestors e.g., Relate, tsinfer, argWEAVER (Hubisz et al. 2019; Kelleher et al. 2019; Speidel et al. 2019; Wong et al. 2024). These approaches leverage the full set of variants that samples carry across the genome as a clock to infer the number of generations ago each ancestor lived. In principle, the complete genealogy of sampled genomes represents the best obtainable information from genetic variation data. Computational considerations currently require a number of simplifying assumptions that make inferred genealogies deviate from the true ones to an unknown degree, while also limiting the number of samples that can be analysed jointly. However, these techniques are continually improving, and currently show unprecedented power to resolve aspects of demographic and evolutionary history (Lewanski et al. 2024; Wong et al. 2024; Speidel et al. 2019; Lewanski et al. 2024; Wong et al. 2024).

6. Inferring Genetic Structure and Demographic History Using DNA

Along with these continuing advances, recent genetic analyses have provided new insights into human demography, including genetic links among people. Such research has demonstrated a notable correlation between genetic variation patterns and geography, both at continent‐level scales as expected (Rosenberg et al. 2002) but also within many countries (Byrne et al. 2020; Elliott et al. 2022; López et al. 2021). Some of this genetic structure is remarkably fine‐scaled. For example, when incorporating sufficient reference samples, genetic variation data from arrays with hundreds of thousands of SNPs can fairly reliably distinguish people from different counties within England (Hu et al. 2025) and from different ethnic groups living <20 km apart within Cameroon (N. Bird et al. 2023). This likely reflects how people historically on the whole tend to intermix with people that live near them, making it possible that such clusters of individuals share several recent ancestors who lived in or near that area.

As noted above, new techniques to infer genetic genealogies using WGS data have enabled more precise inference of the time when two people first share a common ancestor at locations throughout the genome (DeHaas et al. 2024; Kelleher et al. 2019; Speidel et al. 2019; Wong et al. 2024). This in turn enables better inference when populations split from one another. Such approaches have been used to infer the date of the initial ‘out‐of‐Africa’ dispersal of anatomically modern humans that have left present‐day descendants, as well as splits between non‐Africans in Europe and Asia and between different groups within Africa (Bergström et al. 2020). aDNA can also be used to infer splits between ancient groups, such as between different hunter‐gatherer groups in West Eurasia (Speidel et al. 2021). It is important to note that split times tend to represent an ‘average’ (midpoint) split for the lineages and splits between populations are often more gradual, perhaps over many thousands of years. In addition, there is increasing evidence that in some cases assuming a tree‐like model of splits oversimplifies human history, and a more structured model with multiple populations changing connectedness through time may be closer to the truth (Ragsdale et al. 2023).

These approaches can also be used to estimate changes in the effective population size of a group over time (Li and Durbin 2011; Schiffels and Durbin 2014; Steinrücken et al. 2019; Terhorst et al. 2017). For example, time periods where lots of people from a sampled group are inferred to share common ancestors can indicate population bottlenecks that occurred during those times. Conversely, time periods where there are few shared ancestors suggest population expansions or the formation of population structure (Hilgers et al. 2025). The humans who first migrated out of Africa experienced a significant bottleneck in population size which can be still detected today using such concepts. In contrast, within the past 10,000 years, the majority of human populations have experienced major growth, although some African hunter‐gatherer groups appear to be exceptions (Bergström et al. 2020; Fan et al. 2023).

Other approaches that infer more recent (i.e. within the last 3000 years) changes in effective population size are also available such as IBDNe and GONE (Browning et al. 2018; Browning and Browning 2015; Novo et al. 2023; Tournebize et al. 2022). Recent bottlenecks have been inferred in many present‐day populations, for example, Andamanese Islanders, Ashkenazi Jews, Finnish and Hispanic populations. New methods also infer effective population size changes in ancient individuals, with papers reporting a population expansion in Vikings during the Iron Age and in Medieval Britain before the Black Death (Fournier et al. 2023; Huang et al. 2025).

In addition to these insights, the process of recombination can be leveraged to learn about if and when populations intermixed (i.e., admixed) in the past. In particular, when two or more groups intermix, their descendants’ autosomes will be a mixture of segments inherited from ancestors representing each admixing group (Falush et al. 2003). Since the rate of recombination per each meiosis event is well characterised in humans (Auton et al. 2015; Palsson et al. 2025), the size of contiguous segments inherited from each ancestral group carries information on precisely how many generations ago the groups intermixed. This idea is leveraged by several available programs, such as ROLLOFF, ALDER, fastGLOBETROTTER, MOSAIC and DATES (Chintalapati et al. 2022; Hellenthal et al. 2014; Loh et al. 2013; Moorjani et al. 2011; Salter‐Townshend and Myers 2019; Wangkumhang et al. 2022).

By dating admixture and combining this with historical or archaeological sources, we can learn more about events in human history, such as migrations. In Europe, the spread of agricultural practices was accompanied by admixture between migrating farmers and local hunter‐gatherers, and the inferred dates of these mixing events reveal the complex nature of the expansion, with farmers migrating along multiple routes (Chintalapati et al. 2022). A similar, complex, picture is found when inferring admixture dates between Bantu‐speaking agriculturalists and local hunter‐gatherers in sub‐Saharan Africa (N. Bird et al. 2023; Fortes‐Lima et al. 2024; Tallman et al. 2023). Admixture events are also often inferred to overlap with periods of historical trade or large empires/states, which likely precipitate the meeting of people from different geographical regions (Brielle et al. 2023; Hellenthal et al. 2014; Schlebusch and Jakobsson 2018; Wangkumhang et al. 2022). Incorporating Y chromosome and mtDNA data can also help determine whether some of these migrations were sex‐biased (Bajić et al. 2018; Wood et al. 2005).

7. Ethical Considerations

There are several important ethical considerations around the use of genetic variation data. First, it is inherently identifiable and can contain information about sensitive details, such as an individual's physical traits, susceptibility to disease and genetic ancestry (Fox 2020; Joly and Dalpe 2022; Mohammed Yakubu and Chen 2020; Thomas et al. 2024). Such data therefore can pose security and privacy concerns without suitable protective measures in place. Importantly, an individual's genetic data has identifiable information about their family too. Publicly available genetic databases, such as GEDmatch, allow the identification of specific individuals using long‐range familial relationships. One study projected that around 60% of U.S. individuals of European descent could be re‐identified through distant family DNA in a public database (Erlich et al. 2018). This technique has been successfully used by law enforcement to identify suspects in criminal cases, but also has the potential for misuse (Greytak et al. 2019; Kennett 2019; Kuru 2025). In addition, there are concerns over the collection of genetic data from people without adequate consent, and the unethical use of genetic databases for surveillance (D'Amato et al. 2024; Lipphardt et al. 2021; Moreau 2019; Normile 2021; Wessel 2019; Whitmore et al. 2023).

As with studies linking genetic variants to traits such as disease, genomes with diverse representations from different worldwide groups are required for research into genetic ancestry, especially in regions that have been historically underrepresented in genomic research (Fatumo et al. 2022). Recently, there have been increased calls for a more diverse representation of populations in genomic databases (Martin et al. 2017; Popejoy and Fullerton 2016). However, resource, capacity and specialist knowledge inequalities between the Global North and other regions of the world mean that such genomes are often sequenced and analysed in the former, potentially leading to imbalanced relationships (Ali et al. 2021; Atutornu et al. 2022). Related to this, criticisms regarding ‘parachute’ research and allegations of research malpractice have fostered a distrust of genetic research in several communities (Claw et al. 2018; Nordling 2018, 2017; TallBear 2013). An ideal solution is the creation of equal and meaningful partnerships between collaborators, where all contributions to a project are considered equally valid. Guidelines also emphasise the importance of community engagement, effectively communicating research findings and capacity building and training endeavours (Adame 2021; Haelewaters et al. 2021; Martin et al. 2022; Sawchuk et al. 2024).

The rapid increase in sequencing genetic material from ancient samples brings its own ethical issues (Alpaslan‐Roodenberg et al. 2021; Kowal et al. 2023). One issue centres on consent; while the deceased cannot provide consent, there may be descendants or culturally affiliated communities who can be consulted before the research begins. Such community consultation and engagement is also important to discuss the potential negative consequences aDNA ancestry research could bring (Bardill et al. 2018; Cortez et al. 2021; Gibbon 2020; Sawchuk et al. 2024). Additional ethical issues arise from the fact that human remains must be damaged to extract DNA from them (Prendergast and Sawchuk 2018), and the possibility of identifying living descendants from ancient and historical DNA samples (Harney et al. 2023).

Finally, an ongoing ethical concern with research into genetic ancestry is its potential for reinforcing and propagating outdated ideas of ethnic and racial ‘essentialism’ (K. A. Bird and Carlson 2024). The use of genetic ancestry categories, widely used by researchers for convenience, can be misinterpreted by non‐experts to reflect discrete, genetically homogenous ‘pure’ sources, ignoring the complex demographic histories and intermixing discussed in this review (Lewis et al. 2022; Saini 2019). The criteria used to select some reference samples may add to this problem. For example, one of the earliest collections of DNA from worldwide populations, the Human Genome Diversity Panel (HGDP), which is used in many genetic ancestry studies today, focused specifically on collecting DNA from ‘populations of anthropological interest’, i.e., those that were assumed not to have experienced significant migrations or admixture since the 16th century (Cavalli‐Sforza 2005). Such subtleties, and the limitations of methods for understanding genetic ancestry discussed above, can be difficult to communicate, or even be deliberately misinterpreted, with the increased availability of DTC tests seeming to exacerbate these issues (Carlson and Harris 2020; Kampourakis and Peterson 2023; Panofsky et al. 2020). Some research suggests scientific racism and eugenics‐related ideas have reached more mainstream political discourse in recent years, and scientists are becoming increasingly aware of how their work can be misappropriated (Wojcik 2025; Carlson et al. 2022). There have been calls for scientists to represent the complex truth of genetic ancestry more accurately, for example, by emphasising the continuous nature of genetic variation (e.g. with sampling strategies and plots), avoiding arbitrary ancestry categories and communicating limitations clearly (Birney et al. 2021; Committee on the Use of Race, Ethnicity, and Ancestry as Population Descriptors in Genomics Research et al. 2023; Coop 2022; Lewis et al. 2023).

8. Conclusion and Future Perspectives

The field of genetic ancestry studies has developed significantly since the early days of analysing mtDNA and Y chromosome diversity. New techniques that leverage the rich information in autosomal genetic variation data are enabling inference of distant kinship relationships and complex demographic processes such as population splits, changes in population size and admixture events. This coupled with the continuing increase in available data, both from modern and ancient individuals and from historically underrepresented groups, is allowing us to examine genetic history in even greater detail. As data and computational techniques in this field are still proliferating rapidly, the near future should bring increasingly more precise insights in this area. However, limitations still remain. Genetic ancestry information is complex, typically difficult to capture and often even more challenging to communicate to the general public. As with other fields, perhaps a stronger focus on conveying uncertainty will allow both academics and non‐academics to avoid the ever‐present risks of over‐interpretation. This in turn may lend more credence to reports highlighting the interesting facets of ancestry that genetics can reveal with high confidence.

Author Contributions

N.B., T.K. and G.H. wrote the manuscript. N.B. generated the figure.

Funding: NB and GH are funded by the Wellcome Trust.

Data Availability Statement

The authors have nothing to report.

References

  1. Adame, F. 2021. “Meaningful Collaborations Can End “Helicopter Research”.” Nature . 10.1038/d41586-021-01795-1. [DOI] [PubMed]
  2. Adebamowo, C. A. , Adeyemo A., Ashaye A., et al. 2022. “Polygenic Risk Scores for CARDINAL Study.” Nature Genetics 54: 527–530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Aganezov, S. , Yan S. M., Soto D. C., et al. 2022. “A Complete Reference Genome Improves Analysis of Human Genetic Variation.” Science 376, no. 6588: eabl3533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Akbari, A. , Barton A. R., Gazal S., et al. 2024. “Pervasive Findings of Directional Selection Realize the Promise of Ancient DNA to Elucidate Human Adaptation.” Preprint, bioRxiv, September 15. 10.1101/2024.09.14.613021. [DOI]
  5. Alexander, D. H. , and Lange K.. 2011. “Enhancements to the ADMIXTURE Algorithm for Individual Ancestry Estimation.” BMC Bioinformatics 12: 246. 10.1186/1471-2105-12-246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ali, J. , Cohn B., Mwaka E., et al. 2021. “A Scoping Review of Genetics and Genomics Research Ethics Policies and Guidelines for Africa.” BMC Medical Ethics 22, no. 1: 39. 10.1186/s12910-021-00611-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. All of Us Research Program Genomics Investigators . 2024. “Genomic Data in the All of Us Research Program.” Nature 627, no. 8003: 340–346. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Alpaslan‐Roodenberg, S. , Anthony D., Babiker H., et al. 2021. “Ethics of DNA Research on Human Remains: Five Globally Applicable Guidelines.” Nature 599, no. 7883: 41–46. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Atutornu, J. , Milne R., Costa A., Patch C., and Middleton A.. 2022. “Towards Equitable and Trustworthy Genomics Research.” EBioMedicine 76, no. 103879: 103879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Auton, A. , Abecasis G. R., Altshuler D. M., et al. 2015. “A Global Reference for Human Genetic Variation.” Nature 526, no. 7571: 68–74. 10.1038/nature15393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bajić, V. , Barbieri C., Hübner A., et al. 2018. “Genetic Structure and Sex‐biased Gene Flow in the History of Southern African Populations.” American Journal of Physical Anthropology 167, no. 3: 656–671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Bardill, J. , Bader A. C., Garrison N. A., et al. 2018. “Advancing the Ethics of Paleogenomics.” Science 360, no. 6387: 384–385. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Begg, T. J. A. , Schmidt A., Kocher A., et al. 2023. “Genomic Analyses of Hair From Ludwig van Beethoven.” Current Biology: CB 33, no. 8: 1431–1447.e22. [DOI] [PubMed] [Google Scholar]
  14. Bergström, A. , McCarthy S. A., Hui R., et al. 2020. “Insights Into Human Genetic Variation and Population History From 929 Diverse Genomes.” Science 367, no. 6484: eaay5012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Bird, K. A. , and Carlson J.. 2024. “Typological Thinking in Human Genomics Research Contributes to the Production and Prominence of Scientific Racism.” Frontiers in Genetics 15: 1345631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Bird, N. , Ormond L., Awah P., et al. 2023. “Dense Sampling of Ethnic Groups Within African Countries Reveals Fine‐Scale Genetic Structure and Extensive Historical Admixture.” Science Advances 9, no. 13: eabq2616. 10.1126/sciadv.abq2616. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Birney, E. , Inouye M., Raff J., Rutherford A., and Scally A.. 2021. “The Language of Race, Ethnicity, and Ancestry in Human Genetic Research.” arXiv. http://arxiv.org/abs/2106.10041.
  18. Brielle, E. S. , Fleisher J., Wynne‐Jones S., et al. 2023. “Entwined African and Asian Genetic Roots of Medieval Peoples of the Swahili Coast.” Nature 615, no. 7954: 866–873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Browning, S. R. , and Browning B. L.. 2015. “Accurate Non‐Parametric Estimation of Recent Effective Population Size From Segments of Identity by Descent.” American Journal of Human Genetics 97, no. 3: 404–418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Browning, S. R. , and Browning B. L.. 2024. “Biobank‐Scale Inference of Multi‐Individual Identity by Descent and Gene Conversion.” American Journal of Human Genetics 111, no. 4: 691–700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Browning, S. R. , Browning B. L., Daviglus M. L., et al. 2018. “Ancestry‐Specific Recent Effective Population Size in the Americas.” PLoS Genetics 14, no. 5: e1007385. 10.1371/journal.pgen.1007385. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Bycroft, C. , Freeman C., Petkova D., et al. 2018. “The UK Biobank Resource With Deep Phenotyping and Genomic Data.” Nature 562, no. 7726: 203–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Byrne, R. P. , van Rheenen W., van den Berg L. H., Veldink J. H., and McLaughlin R. L.. 2020. “Dutch Population Structure Across Space, Time and GWAS Design.” Nature Communications 11, no. 1: 1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Cann, R. L. , Stoneking M., and Wilson A. C.. 1987. “Mitochondrial DNA and Human Evolution.” Nature 325, no. 6099: 31–36. [DOI] [PubMed] [Google Scholar]
  25. Carlson, J. , and Harris K.. 2020. “Quantifying and Contextualizing the Impact of bioRxiv Preprints Through Automated Social Media Audience Segmentation.” PLoS Biology 18, no. 9: e3000860. 10.1371/journal.pbio.3000860. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Carlson, J. , Henn B. M., Al‐Hindi D. R., and Ramachandran S.. 2022. “Counter the Weaponization of Genetics Research by Extremists.” Nature 610: 444–447. [DOI] [PubMed] [Google Scholar]
  27. Cassidy, L. M. , Russell M., Smith M., et al. 2025. “Continental Influx and Pervasive Matrilocality in Iron Age Britain.” Nature 637, no. 8048: 1136–1142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Cavalli‐Sforza, L. L. 2005. “The Human Genome Diversity Project: Past, Present and Future.” Nature Reviews. Genetics 6, no. 4: 333–340. [DOI] [PubMed] [Google Scholar]
  29. Chang, J. T. 1999. “Recent Common Ancestors of all Present‐Day Individuals.” Advances in Applied Probability 31, no. 4: 1002–1026. [Google Scholar]
  30. Chintalapati, M. , Patterson N., and Moorjani P.. 2022. “The Spatiotemporal Patterns of Major Human Admixture Events During the European Holocene.” Elife 11: e77625. 10.7554/ELIFE.77625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Claw, K. G. , Anderson M. Z., Begay R. L., et al. 2018. “A Framework for Enhancing Ethical Genomic Research With Indigenous Communities.” Nature Communications 9, no. 1: 2957. 10.1038/s41467-018-05188-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Committee on the Use of Race, Ethnicity, and Ancestry as Population Descriptors in Genomics Research, Board on Health Sciences Policy, Committee on Population, Health and Medicine Division, Division of Behavioral and Social Sciences and Education, & National Academies of Sciences, Engineering, and Medicine . 2023. Using Population Descriptors in Genetics and Genomics Research: A New Framework for an Evolving Field. National Academies Press. 10.17226/26902. [DOI] [PubMed] [Google Scholar]
  33. Coop, G. 2022. “Genetic Similarity and Genetic Ancestry Groups.” ArXiv.
  34. Cortez, A. D. , Bolnick D. A., Nicholas G., Bardill J., and Colwell C.. 2021. “An Ethical Crisis in Ancient DNA Research: Insights From the Chaco Canyon Controversy as a Case Study.” Journal of Social Archaeology 21, no. 2: 157–178. [Google Scholar]
  35. D'Amato, M. E. , Joly Y., Lynch V., Machado H., Scudder N., and Zieger M.. 2024. “Ethical Considerations for Forensic Genetic Frequency Databases: First Report Conception and Development.” Forensic Science International. Genetics 71, no. 103053: 103053. [DOI] [PubMed] [Google Scholar]
  36. DeHaas, D. , Pan Z., and Wei X.. 2025. “Enabling Efficient Analysis of Biobank‐Scale Data With Genotype Representation Graphs.” Nature Computational Science 5: 112–124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Derrida, B. , Manrubia S. C., and Zanette D. H.. 2000. “On the Genealogy of a Population of Biparental Individuals.” Journal of Theoretical Biology 203, no. 3: 303–315. [DOI] [PubMed] [Google Scholar]
  38. Donnelly, K. P. 1983. “The Probability That Related Individuals Share Some Section of Genome Identical by Descent.” Theoretical Population Biology 23, no. 1: 34–63. [DOI] [PubMed] [Google Scholar]
  39. Elliott, K. S. , Haber M., Daggag H., et al. 2022. “Fine‐Scale Genetic Structure in the United Arab Emirates Reflects Endogamous and Consanguineous Culture, Population History, and Geography.” Molecular Biology and Evolution 39, no. 3: msac039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Erlich, Y. , Shor T., Pe'er I., and Carmi S.. 2018. “Identity Inference of Genomic Data Using Long‐Range Familial Searches.” Science 362, no. 6415: 690–694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Falush, D. , Stephens M., and Pritchard J. K.. 2003. “Inference of Population Structure Using Multilocus Genotype Data: Linked Loci and Correlated Allele Frequencies.” Genetics Society of America 164: 1567–1587. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Fan, S. , Spence J. P., Feng Y., et al. 2023. “Whole‐Genome Sequencing Reveals a Complex African Population Demographic History and Signatures of Local Adaptation.” Cell 186, no. 5: 923–939.e14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Fatumo, S. , Chikowore T., Choudhury A., Ayub M., Martin A. R., and Kuchenbaecker K.. 2022. “A Roadmap to Increase Diversity in Genomic Studies.” Nature Medicine 28, no. 2: 243–250. 10.1038/s41591-021-01672-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Fortes‐Lima, C. A. , Burgarella C., Hammarén R., et al. 2024. “The Genetic Legacy of the Expansion of Bantu‐Speaking Peoples in Africa.” Nature 625, no. 7995: 540–547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Fournier, R. , Tsangalidou Z., Reich D., and Palamara P. F.. 2023. “Haplotype‐Based Inference of Recent Effective Population Size in Modern and Ancient DNA Samples.” Nature Communications 14, no. 1: 7945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Fox, K. 2020. “The Illusion of Inclusion—The “All of Us” Research Program and Indigenous Peoples' DNA.” New England Journal of Medicine 383, no. 5: 411–413. [DOI] [PubMed] [Google Scholar]
  47. Gibbon, V. E. 2020. “African Ancient DNA Research Requires Robust Ethics and Permission Protocols.” Nature Reviews. Genetics 21, no. 11: 645–647. [DOI] [PubMed] [Google Scholar]
  48. Greytak, E. M. , Moore C., and Armentrout S. L.. 2019. “Genetic Genealogy for Cold Case and Active Investigations.” Forensic Science International 299: 103–113. [DOI] [PubMed] [Google Scholar]
  49. Guerrini, C. J. , Robinson J. O., Bloss C. C., et al. 2022. “Family Secrets: Experiences and Outcomes of Participating in Direct‐to‐Consumer Genetic Relative‐Finder Services.” American Journal of Human Genetics 109, no. 3: 486–497. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Haelewaters, D. , Hofmann T. A., and Romero‐Olivares A. L.. 2021. “Ten Simple Rules for Global North Researchers to Stop Perpetuating Helicopter Research in the Global South.” PLoS Computational Biology 17, no. 8: e1009277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Harney, É. , Sirak K., Sedig J., et al. 2023. “Ethical Considerations When Co‐Analyzing Ancient DNA and Data From Private Genetic Databases.” American Journal of Human Genetics 110, no. 9: 1447–1453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Hellenthal, G. , Busby G. B. J., Band G., et al. 2014. “A Genetic Atlas of human Admixture History.” Science 343, no. 6172: 747–751. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Herrando‐Pérez, S. , Tobler R., and Huber C. D.. 2021. “Smartsnp, an R Package for Fast Multivariate Analyses of Big Genomic Data.” Methods in Ecology and Evolution 12, no. 11: 2084–2093. [Google Scholar]
  54. Hilgers, L. , Liu S., Jensen A., et al. 2025. “Avoidable False PSMC Population Size Peaks Occur Across Numerous Studies.” Current Biology: CB 35, no. 4: 927–930.e3. [DOI] [PubMed] [Google Scholar]
  55. Hu, S. , Ferreira L. A. F., Shi S., et al. 2025. “Fine‐Scale Population Structure and Widespread Conservation of Genetic Effect Sizes Between Human Groups Across Traits.” Nature Genetics 57, no. 2: 379–389. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Huang, Y. , Carmi S., and Ringbauer H.. 2025. “Estimating Effective Population Size Trajectories From Time‐Series Identity‐by‐Descent Segments.” Genetics 229, no. 3: iyae212. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Hubisz, M. J. , Williams A. L., and Siepel A.. 2019. “Mapping Gene Flow Between Ancient Hominins Through Demography‐Aware Inference of the Ancestral Recombination Graph.” PLoS Genetics 16, no. 8: e1008895. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Joly, Y. , and Dalpe G.. 2022. “Genetic Discrimination Still Casts a Large Shadow in 2022.” European Journal of Human Genetics 30, no. 12: 1320–1322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Kampourakis, K. , and Peterson E. L.. 2023. “The Racist Origins, Racialist Connotations, and Purity Assumptions of the Concept of “Admixture” in Human Evolutionary Genetics.” Genetics 223, no. 3: iyad002. 10.1093/genetics/iyad002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Kelleher, J. , Wong Y., Wohns A. W., Fadil C., Albers P. K., and McVean G.. 2019. “Inferring Whole‐Genome Histories in Large Population Datasets.” Nature Genetics 51, no. 9: 1330–1338. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Kennett, D. 2019. “Using Genetic Genealogy Databases in Missing Persons Cases and to Develop Suspect Leads in Violent Crimes.” Forensic Science International 301: 107–117. [DOI] [PubMed] [Google Scholar]
  62. King, T. E. , Ballereau S. J., Schürer K. E., and Jobling M. A.. 2006. “Genetic Signatures of Coancestry Within Surnames.” Current Biology 16, no. 4: 384–388. 10.1016/j.cub.2005.12.048. [DOI] [PubMed] [Google Scholar]
  63. King, T. E. , Fortes G. G., Balaresque P., et al. 2014. “Identification of the Remains of King Richard III.” Nature Communications 5, no. 1. 10.1038/ncomms6631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. King, T. E. , Fortes G. G., Balaresque P., et al. 2014. “Identification of the Remains of King Richard III.” Nature Communications 5, no. 1: 5631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. King, T. E. , and Jobling M. A.. 2009. “What's in a Name? Y Chromosomes, Surnames and the Genetic Genealogy Revolution.” Trends in Genetics 25, no. 8: 351–360. 10.1016/j.tig.2009.06.003. [DOI] [PubMed] [Google Scholar]
  66. King, T. E. , Parkin E. J., Swinfield G., et al. 2007. “Africans in Yorkshire? The Deepest‐rooting Clade of the Y Phylogeny Within an English Genealogy.” European Journal of Human Genetics 15, no. 3: 288–293. 10.1038/sj.ejhg.5201771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Kowal, E. , Weyrich L. S., Argüelles J. M., et al. 2023. “Community Partnerships Are Fundamental to Ethical Ancient DNA Research.” HGG Advances 4, no. 2: 100161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Kuru, T. 2025. “Investigative Genetic Genealogy in Europe: Why the “Manifestly Made Public by the Data Subject” Legal Basis Should be Avoided.” Computer Law and Security Report 56, no. 106106: 106106. [Google Scholar]
  69. López, S. , Tarekegn A., Band G., et al. 2021. “Evidence of the Interplay of Genetics and Culture in Ethiopia.” Nature Communications 12, no. 1: 1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Lawson, D. J. , Hellenthal G., Myers S., and Falush D.. 2012. “Inference of Population Structure Using Dense Haplotype Data.” PLoS Genetics 8, no. 1: e1002453. 10.1371/journal.pgen.1002453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Lewanski, A. L. , Grundler M. C., and Bradburd G. S.. 2024. “The Era of the ARG: an Introduction to Ancestral Recombination Graphs and Their Significance in Empirical Evolutionary Genomics.” PLoS Genetics 20, no. 1: e1011110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Lewis, A. C. F. , Molina S. J., Appelbaum P. S., et al. 2022. “Getting Genetic Ancestry Right for Science and Society.” Science 376, no. 6590: 250–252. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Lewis, A. C. F. , Molina S. J., Appelbaum P. S., et al. 2023. “An Ethical Framework for Research Using Genetic Ancestry.” Perspectives in Biology and Medicine 66, no. 2: 225–248. [DOI] [PubMed] [Google Scholar]
  74. Li, H. , and Durbin R.. 2011. “Inference of Human Population History From Individual Whole‐Genome Sequences.” Nature 475, no. 7357: 493–496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Lipphardt, V. , Surdu M., Ellebrecht N., Pfaffelhuber P., Wienroth M., and Rappold G. A.. 2021. “Europe's Roma People Are Vulnerable to Poor Practice in Genetics.” Nature 599, no. 7885: 368–371. [DOI] [PubMed] [Google Scholar]
  76. Liu, Y. , Bennett E. A., and Fu Q.. 2022. “Evolving Ancient DNA Techniques and the Future of Human History.” Cell 185, no. 15: 2632–2635. [DOI] [PubMed] [Google Scholar]
  77. Loh, P. R. , Lipson M., Patterson N., et al. 2013. “Inferring Admixture Histories of Human Populations Using Linkage Disequilibrium.” Genetics 193, no. 4: 1233–1254. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Mallick, S. , Micco A., Mah M., et al. 2024. “The Allen Ancient DNA Resource (AADR) a Curated Compendium of Ancient Human Genomes.” Scientific Data 11, no. 1: 182. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Maples, B. K. , Gravel S., Kenny E. E., and Bustamante C. D.. 2013. “RFMix: A Discriminative Modeling Approach for Rapid and Robust Local‐Ancestry Inference.” American Journal of Human Genetics 93: 278–288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Margaryan, A. , Lawson D. J., Sikora M., et al. 2020. “Population Genomics of the Viking World.” Nature 585, no. 7825: 390. [DOI] [PubMed] [Google Scholar]
  81. Martin, A. R. , Stroud R. E., Abebe T., et al. 2022. “Increasing Diversity in Genomics Requires Investment in Equitable Partnerships and Capacity Building.” Nature Genetics 54, no. 6: 740–745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Martin, A. R. , Gignoux C. R., Walters R. K., et al. 2017. “Human Demographic History Impacts Genetic Risk Prediction Across Diverse Populations.” The American Journal of Human Genetics 100, no. 4. 10.1016/j.ajhg.2017.03.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Mathieson, I. , and McVean G.. 2014. “Demography and the Age of Rare Variants.” PLoS Genetics 10, no. 8: e1004528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Mathieson, I. , and Scally A.. 2020. “What Is Ancestry?.” PLoS Genetics 16, no. 3: e1008624. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Miga, K. H. , and Eichler E. E.. 2023. “Envisioning a New Era: Complete Genetic Information From Routine, Telomere‐to‐Telomere Genomes.” American Journal of Human Genetics 110, no. 11: 1832–1840. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Milani, L. , Alver M., Laur S., et al. 2024. “From Biobanking to Personalized Medicine: The Journey of the Estonian Biobank.” Preprint, medRxiv, September 24. 10.1101/2024.09.22.24313964. [DOI]
  87. Mohammed Yakubu, A. , and Chen Y.‐P. P.. 2020. “Ensuring Privacy and Security of Genomic Data and Functionalities.” Briefings in Bioinformatics 21, no. 2: 511–526. [DOI] [PubMed] [Google Scholar]
  88. Moltke, I. , Korneliussen T. S., Seguin‐Orlando A., et al. 2021. “Identifying a Living Great‐grandson of the Lakota Sioux Leader Tatanka Iyotake (Sitting Bull).” Science Advances 7, no. 44: eabh2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Moorjani, P. , Patterson N., Hirschhorn J. N., et al. 2011. “The History of African Gene Flow Into Southern Europeans, Levantines, and Jews.” PLoS Genetics 7, no. 4: e1001373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Moreau, Y. 2019. “Crack Down on Genomic Surveillance.” Nature 576, no. 7785: 36–38. [DOI] [PubMed] [Google Scholar]
  91. Nagai, A. , Hirata M., Kamatani Y., et al. 2017. “Overview of the BioBank Japan Project: Study Design and Profile.” Journal of Epidemiology 27, no. 3S: S2–S8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Nordling, L. 2018. “African Scientists Call for More Control of Their Continent's Genomic Data.” Nature, April 18. 10.1038/d41586-018-04685-1. [DOI]
  93. Nordling, L. 2017. “San People of Africa Draft Code of Ethics for Researchers.” Science, March 17. https://www.science.org/content/article/san‐people‐africa‐draft‐code‐ethics‐researchers.
  94. Normile, D. 2021. “Genetic Papers Containing Data From China's Ethnic Minorities Draw Fire.” Science, August 10. https://www.science.org/content/article/genetic‐papers‐containing‐data‐china‐s‐ethnic‐minorities‐draw‐fire.
  95. Novo, I. , Ordás P., Moraga N., Santiago E., Quesada H., and Caballero A.. 2023. “Impact of Population Structure in the Estimation of Recent Historical Effective Population Size by the Software GONE.” Genetics, Selection, Evolution: GSE 55, no. 1: 86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Nurk, S. , Koren S., Rhie A., et al. 2022. “The Complete Sequence of a Human Genome.” Science 376, no. 6588: 44–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Palsson, G. , Hardarson M. T., Jonsson H., et al. 2025. “Complete Human Recombination Maps.” Nature 639: 700–707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Panofsky, A. , Dasgupta K., and Iturriaga N.. 2020. “How White Nationalists Mobilize Genetics: From Genetic Ancestry and Human Biodiversity to Counterscience and Metapolitics.” American Journal of Physical Anthropology 175, no. 2: 387–398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Patterson, N. , Price A. L., and Reich D.. 2006. “Population Structure and Eigenanalysis.” PLoS Genetics 2, no. 12: 2074–2093. [DOI] [PMC free article] [PubMed] [Google Scholar]
  100. Popejoy, A. B. , and Fullerton S. M.. 2016. “Genomics Is Failing on diversity.” (Vol. 538, Issue no. 7624). In Nature. 10.1038/538161a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Prendergast, M. E. , and Sawchuk E.. 2018. “Boots on the Ground in Africa's Ancient DNA “Revolution”: Archaeological Perspectives on Ethics and Best Practices.” Antiquity 92, no. 363: 803–815. 10.15184/aqy.2018.70. [DOI] [Google Scholar]
  102. Price, A. L. , Patterson N. J., Plenge R. M., Weinblatt M. E., Shadick N. A., and Reich D.. 2006. “Principal Components Analysis Corrects for Stratification in Genome‐Wide Association Studies.” Nature Genetics 38, no. 8: 904–909. [DOI] [PubMed] [Google Scholar]
  103. Ragsdale, A. P. , Weaver T. D., Atkinson E. G., et al. 2023. “A Weakly Structured Stem for Human Origins in Africa.” Nature 617, no. 7962: 755–763. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Ralph, P. , and Coop G.. 2013. “The Geography of Recent Genetic Ancestry Across Europe.” PLoS Biology 11, no. 5: e1001555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Ramstetter, M. D. , Dyer T. D., Lehman D. M., et al. 2017. “Benchmarking Relatedness Inference Methods With Genome‐Wide Data From Thousands of Relatives.” Genetics 207, no. 1: 75–82. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Richards, M. , Côrte‐Real H., Forster P., et al. 1996. “Paleolithic and Neolithic Lineages in the European Mitochondrial Gene Pool.” American Journal of Human Genetics 59, no. 1: 185–203. [PMC free article] [PubMed] [Google Scholar]
  107. Ringbauer, H. , Huang Y., Akbari A., et al. 2024. “Accurate Detection of Identity‐by‐Descent Segments in Human Ancient DNA.” Nature Genetics 56, no. 1: 143–151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Rosenberg, N. A. , Pritchard J. K., Weber J. L., et al. 2002. “Genetic Structure of Human Populations.” Science 298, no. 5602: 2381–2385. [DOI] [PubMed] [Google Scholar]
  109. Saini, A. 2019. Superior: The Return of Race Science. Beacon Press. [Google Scholar]
  110. Salter‐Townshend, M. , and Myers S.. 2019. “Fine‐Scale Inference of Ancestry Segments Without Prior Knowledge of Admixing Groups.” Genetics 212, no. 3: 869–889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Sawchuk, E. A. , Sirak K. A., Manthi F. K., et al. 2024. “Charting a Landmark‐Driven Path Forward for Population Genetics and Ancient DNA Research in Africa.” American Journal of Human Genetics 111, no. 7: 1243–1251. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Schiffels, S. , and Durbin R.. 2014. “Inferring Human Population Size and Separation History From Multiple Genome Sequences.” Nature Genetics 46, no. 8: 919–925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  113. Schlebusch, C. M. , and Jakobsson M.. 2018. “Tales of Human Migration, Admixture, and Selection in Africa.” Annual Review of Genomics and Human Genetics 19, no. 1: 405–428. [DOI] [PubMed] [Google Scholar]
  114. Seidman, D. N. , Shenoy S. A., Kim M., et al. 2020. “Rapid, Phase‐Free Detection of Long Identity‐by‐Descent Segments Enables Effective Relationship Classification.” American Journal of Human Genetics 106, no. 4: 453–466. [DOI] [PMC free article] [PubMed] [Google Scholar]
  115. Speidel, L. , Cassidy L., Davies R. W., Hellenthal G., Skoglund P., and Myers S. R.. 2021. “Inferring Population Histories for Ancient Genomes Using Genome‐Wide Genealogies.” Molecular Biology and Evolution 38, no. 9: 3497–3511. 10.1093/molbev/msab174. [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Speidel, L. , Forest M., Shi S., and Myers S. R.. 2019. “A Method for Genome‐Wide Genealogy Estimation for Thousands of Samples.” Nature Genetics 51, no. 9: 1321–1329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Steinrücken, M. , Kamm J., Spence J. P., and Song Y. S.. 2019. “Inference of Complex Population Histories Using Whole‐Genome Sequences From Multiple Populations.” Proceedings of the National Academy of Sciences of the United States of America 116, no. 34: 17115–17120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Sykes, B. 2001. The Seven Daughters of Eve: The Science That Reveals Our Genetic Ancestry. WW Norton. [Google Scholar]
  119. TallBear, K. 2013. Native American DNA: Tribal Belonging and the False Promise of Genetic Science. University of Minnesota Press. [Google Scholar]
  120. Tallman, S. , Sungo M., Das D., Saranga S., and Beleza S.. 2023. “Whole Genomes From Angola and Mozambique Inform About the Origins and Dispersals of Major African Migrations.” Nature Communications 14, no. 1: 7967. [DOI] [PMC free article] [PubMed] [Google Scholar]
  121. Terhorst, J. , Kamm J. A., and Song Y. S.. 2017. “Robust and Scalable Inference of Population History From Hundreds of Unphased Whole Genomes.” Nature Genetics 49, no. 2: 303–309. 10.1038/ng.3748. [DOI] [PMC free article] [PubMed] [Google Scholar]
  122. Thomas, M. , Mackes N., Preuss‐Dodhy A., Wieland T., and Bundschus M.. 2024. “Assessing Privacy Vulnerabilities in Genetic Data Sets: Scoping Review.” JMIR Bioinformatics and Biotechnology 5, no. 1: e54332. [DOI] [PMC free article] [PubMed] [Google Scholar]
  123. Tillmar, A. , and Kling D.. 2025. “Comparative Study of Statistical Approaches and SNP Panels to Infer Distant Relationships in Forensic Genetics.” Genes 16, no. 2: 114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  124. Tournebize, R. , Chu G., and Moorjani P.. 2022. “Reconstructing the History of Founder Events Using Genome‐Wide Patterns of Allele Sharing Across Individuals.” PLoS Genetics 18, no. 6: e1010243. 10.1371/journal.pgen.1010243. [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. Walters, R. G. , Millwood I. Y., Lin K., et al. 2023. “Genotyping and Population Characteristics of the China Kadoorie Biobank.” Cell Genomics 3, no. 8: 100361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  126. Wangkumhang, P. , Greenfield M., and Hellenthal G.. 2022. “An Efficient Method to Identify, Date, and Describe Admixture Events Using Haplotype Information.” Genome Research 32, no. 8: 1553–1564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  127. Watanabe, Y. , and Ohashi J.. 2023. “Modern Japanese Ancestry‐derived Variants Reveal the Formation Process of the Current Japanese Regional Gradations.” Iscience 26, no. 3: 106130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  128. Wessel, L. 2019. “Scientists Concerned Over US Plans to Collect DNA Data From Immigrants.” Nature, October 7. 10.1038/d41586-019-02998-3. [DOI] [PubMed]
  129. Whitmore, L. , McCauley M., Farrell J. A., et al. 2023. “Inadvertent Human Genomic Bycatch and Intentional Capture Raise Beneficial Applications and Ethical Concerns With Environmental DNA.” Nature Ecology & Evolution 7, no. 6: 873–888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  130. Wojcik, G. L. 2025. “Eugenics Is on the Rise Again: human Geneticists Must Take a Stand.” Nature 641, no. 8061: 37–38. [DOI] [PubMed] [Google Scholar]
  131. Wong, Y. , Ignatieva A., Koskela J., Gorjanc G., Wohns A. W., and Kelleher J.. 2024. “A General and Efficient Representation of Ancestral Recombination Graphs.” Genetics 228, no. 1: iyae100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  132. Wood, E. T. , Stover D. A., Ehret C., et al. 2005. “Contrasting Patterns of Y Chromosome and mtDNA Variation in Africa: Evidence for Sex‐biased Demographic Processes.” European Journal of Human Genetics: EJHG 13, no. 7. 10.1038/sj.ejhg.5201408. [DOI] [PubMed] [Google Scholar]
  133. Yamamoto, K. , Namba S., Sonehara K., et al. 2024. “Genetic Legacy of Ancient Hunter‐gatherer Jomon in Japanese Populations.” Nature Communications 15, no. 1: 9780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  134. Zhou, Y. , Browning S. R., and Browning B. L.. 2020. “A Fast and Simple Method for Detecting Identity‐by‐Descent Segments in Large‐Scale Data.” American Journal of Human Genetics 106, no. 4: 426–437. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The authors have nothing to report.


Articles from Annals of Human Genetics are provided here courtesy of Wiley

RESOURCES