Skip to main content
Human Genetics and Genomics Advances logoLink to Human Genetics and Genomics Advances
. 2023 Jan 13;4(2):100178. doi: 10.1016/j.xhgg.2023.100178

Guidelines for genetic ancestry inference created through roundtable discussions

Jennifer K Wagner 1,2,3,4,5,6,16,, Joon-Ho Yu 7,8, Duana Fullwiley 9, CeCe Moore 10, James F Wilson 11, Michael J Bamshad 12,13,15, Charmaine D Royal 14,15; Genetic Ancestry Inference Roundtable Participants, on behalf of the
PMCID: PMC9926022  PMID: 36798092

Summary

The use of genetic and genomic technology to infer ancestry is commonplace in a variety of contexts, particularly in biomedical research and for direct-to-consumer genetic testing. In 2013 and 2015, two roundtables engaged a diverse group of stakeholders toward the development of guidelines for inferring genetic ancestry in academia and industry. This report shares the stakeholder groups’ work and provides an analysis of, commentary on, and views from the groundbreaking and sustained dialogue. We describe the engagement processes and the stakeholder groups’ resulting statements and proposed guidelines. The guidelines focus on five key areas: application of genetic ancestry inference, assumptions and confidence/laboratory and statistical methods, terminology and population identifiers, impact on individuals and groups, and communication or translation of genetic ancestry inferences. We delineate the terms and limitations of the guidelines and discuss their critical role in advancing the development and implementation of best practices for inferring genetic ancestry and reporting the results. These efforts should inform both governmental regulation and self-regulation.

Keywords: genetic ancestry, population descriptors, race and ethnicity, ELSI


Two roundtables engaging a diverse group of stakeholders enabled the development of guidelines for inferring genetic ancestry in academia and industry. This report describes the engagement process and the resulting guidelines, which can inform continued improvement of practices and assist ongoing scholarly, industry, and policy discussions.

Introduction

Genetic ancestry inference is commonly used by researchers (e.g., to control for population stratification, explore population history, etc.), sought by individuals via direct-to-consumer (DTC) genetic testing companies to learn more about their genealogical history, geographic origins, and population affiliations, and contemplated by precision health initiatives as a means to “return value” or otherwise engage patient-participants. Both research findings and individual DTC results are often featured in the print and broadcast media. Genetic ancestry inference is thus used across a wide variety of settings, with different goals, assumptions, methods, and claims of accuracy and validity. It is therefore not surprising that there has been considerable disagreement among diverse stakeholders about the merits, limitations, utility, and impact of genetic ancestry inference. Consequently, efforts by any individual stakeholder group to advocate for standards or regulation have met with little success. The need for standards and other guidance has never been greater, as the popularity of DTC genetic ancestry testing continues to expand,1 members of Congress have expressed concern,2 controversial applications of genetic ancestry information continue to be explored (e.g., forensic or investigative genetic genealogy;3,4,5,6,7,8 immigration decisions;9,10 political posturing;11,12 medical school admissions;13 entitlement eligibility;14,15 and social identity and belonging16,17,18,19), and society has little choice but to confront whether, how, and when it is appropriate to rely upon anyone’s genetic ancestry information.

In 2008 the American Society of Human Genetics (ASHG) convened a task force to study issues emerging from genetic ancestry inference in academia and industry and to explore a constructive response to the need for more transparency and discussion about what genetic ancestry inference entails, as well as how it works and what it can and cannot do. The ASHG Board of Directors approved an ASHG position statement developed by the task force and the Society’s Social Issues Committee on ancestry inference that called attention to motivations, technical accuracy and reliability of results, health implications, societal and personal implications, proprietary databases, communication of limitations and potential impacts, public and personal education, interdisciplinary collaboration, and mechanisms for accountability (https://www.ashg.org/wp-content/uploads/2008/11/Statement-20081311-ASHGAncestryTesting.pdf) and approved an ASHG White Paper that provided recommendations aimed at addressing these issues.20

One of the recommendations of the task force was to convene a diverse group of stakeholders and develop guidelines for genetic ancestry inference that would inform research and commercial applications of ancestry testing. The long-term goal was to develop the precursors to best practices for inferring genetic ancestry and guidelines for the uses and reporting of genetic ancestry testing results. A planning committee was formed consisting of two Chairpersons (C.D.R. and M.J.B.), representatives of human population geneticists, the DTC ancestry testing industry, scholars in ethical and social implications of human genetics, and consumers of ancestry testing (e.g., genetic genealogists), and it organized a “roundtable” to develop consensus statements and guidelines for genetic ancestry inference. A first meeting (i.e., Roundtable I) was held in Washington, DC in September 2013. There were 65 participants representing seven stakeholder groups: population geneticists, DTC ancestry testing companies, social and behavioral scientists, humanists, clinicians, community representatives, and consumer groups including genealogical associations/societies. The participants generated 22 “consensus” statements—defined as statements accepted by a majority but not all participants—about the uses, methods, and outcomes of genetic ancestry inference (Table 1). Critical dissent and constructive criticisms were encouraged to ensure that all stakeholders had a “voice” and that multiple perspectives—even those that were highly dissenting—were considered. Reports from those in the minority were solicited. Several statements were edited for clarity and grammar in subsequent email exchanges among participants. A second roundtable meeting (i.e., Roundtable II) was held in Durham, NC in May 2015 to compose, based upon these 22 consensus statements, a set of guidelines (Table 2) for ancestry inference that would serve as a template for the development of best practices. There were 49 participants in this second meeting, including 40 attendees from the first roundtable meeting.

Table 1.

Stakeholder group views: Consensus statements and voting summary

Consensus statements Voting results (% yes/% no; no. of votes)
  • 1.

    Genetic ancestry inference uses regions of the genome that inform us about individual, genealogical, population, and/or geographical ancestry, by comparison of similarities to a reference sample(s). There are many different types of ancestry tests that use Y DNA, mtDNA, autosomal, and/or X chromosome markers. Information about parentage, kinship, and identification can be captured by ancestry tests and vice versa. This information can also be used for forensic applications.

90/10; n = 48
  • 2.

    Whether an application of genetic ancestry testing is appropriate or not is context-dependent.

90/10; n = 48
  • 3.

    Individuals may pursue genetic ancestry testing and should be informed of the potential impact of their results on themselves, their family members, and their communities.

85/15; n = 48
  • 4.

    Genetic ancestry may inform our understanding of disease risk and guide treatment options in research and clinical settings.

90/10; n = 48
  • 5.

    It is recommended that genetic ancestry tests alone not be used to associate people with culturally, legally, or socially defined groups.

78/22; n = 51
  • 6.

    Genetic ancestry results alone are not a valid basis for denying individuals’ or peoples’ human rights, responsibilities, or social benefits.

73/27; n = 48
  • 7.

    Ancestry testing should be conducted with standards (to be defined) and the data/methods/models used should be transparent, ideally publicly available, to promote accuracy and facilitate interpretation.

94/6; n = 53
  • 8.

    A reference sample is composed of individuals sampled from contemporary populations that are assumed to share recent common ancestry with an ancestral group or lineage.

87/13; n = 53
  • 9.

    How representative a contemporary population is of an ancestral group depends in part on the age and history of the ancestral group.

82/18; n = 49
  • 10.

    There are multiple sources of genetic information about ancestral populations including existing populations, ancient remains, and statistically inferred group.

96/4; n = 51
  • 11.

    Development of minimum standards and best practices for genetic ancestry inference is needed.

92/8; n = 51
  • 12.

    Researchers and providers of ancestry tests should identify limitations of their genetic ancestry inference and make these clear to the general public, research participants, and consumers.

89/11; n = 53
  • 13.

    Genetic ancestry inference is a tool for individuals to ask personally meaningful questions and discover new information about themselves, their family, and their ancestors.

96/4; n = 48
  • 14.

    Many benefits and harms of genetic ancestry inference are dependent on personal experiences and perceptions of identity, society’s perception of an individual, and community context.

92/8; n = 50
  • 15.

    There are multiple perceived individual benefits to genetic ancestry inference including satisfying curiosity, assisting in identification, inclusion as a member of a group, discovery of kinship and ancestry information, potentially conferring status, connection to historical and evolutionary context, and facilitating investigation of health risks and treatment options.

96/4; n = 47
  • 16.

    For communities, including researchers, benefits include understanding heritage and group history, as well as facilitating investigation of population disease burden. More broadly, engagement with ancestry testing can potentially promote a broader understanding of genetics and human evolution.

96/4; n = 49
  • 17.

    Genetic ancestry testing enhances our understanding of human history and advances scientific research by the large volume of data being generated and crowd-sourcing of analysis.

90/10; n = 49
  • 18.

    Genetic ancestry testing generates collaborative relationships and cross-fertilizes ideas between researchers, industry and non-scientists; creates more training and employment opportunities for individuals; engages under-represented communities in science; benefits development of academic and industrial enterprises; and promotes science and history education, citizen science, and bioethical innovations.

84/16; n = 50
  • 19.

    Genetic ancestry testing provides information and knowledge. Interpretation and application of data from genetic ancestry tests can, depending on context, lead to benefit or harm.

86/14; n = 50
  • 20.

    The frequency and magnitude of both benefits and harms of genetic ancestry testing are largely unknown and require systematic investigation.

82/18; n = 49
  • 21.

    Potential group harms of genetic ancestry testing include stigmatization; challenging origin narratives and/or religious beliefs; erroneous or inconsistent results undermining the credibility of science and medicine; reinforcing biologically deterministic notions of race and ethnicity; and a false sense that one population is more evolved than another.

78/22; n = 49
  • 22.

    Ancestry testing has the potential to personalize (e.g., emotional distress) the violence of historical atrocities (e.g., slavery in the U.S.).

78/22; n = 50

Table 2.

Summary of voting results for guidelines

Guideline Vote Confidence level (extremely, very, somewhat, not at all)
  • 1.

    Application of genetic ancestry inference

for: 26 (76%) 6 11 9 0
against: 5 (15%) 2 1 1 1
abstain: 3 (9%)
  • 2.

    Assumptions and confidence/laboratory and statistical methods

for: 28 (82%) 9 15 4 0
against: 3 (9%) 0 2 0 1
abstain: 3 (9%)
  • 3.

    Terminology and population identifiers

for: 28a (82%) 13 11 3 0
against: 4 (12%) 1 1 1 1
abstain: 2 (6%)
  • 4.

    Impact on individuals and groups

for: 29a (85%) 9 13 6 0
against: 3 (9%) 1 0 1 1
abstain: 2 (6%)
  • 5.

    Communication/translation of genetic ancestry inferences

for: 29 (85%) 12 7 10 0
against: 2 (6%) 0 1 0 1
abstain: 3 (9%)
a

One “for” response did not indicate level of confidence for guidelines 3 and 4.

Here we report on these efforts before discussing emergent issues and offering suggestions for next steps and insights that could inform both governmental regulation and self-regulation.

“Consensus” statements (Roundtable I)

The planning committee produced a set of eight draft consensus statements and related questions focused on eight key areas: (1) the application of genetic ancestry inference; (2) assumptions underlying genetic ancestry inference; (3) laboratory and statistical methods; (4) estimating confidence and communicating uncertainty; (5) terminology and population identifiers; (6) assessing and addressing individual and group benefits; (7) assessing and addressing individual and group harms; and (8) translating genetic ancestry findings for the media and public. These statements were distributed to attendees of Roundtable I prior to the meeting and served as starting points for discussion. Roundtable I was structured into eight sessions of small group discussions followed by negotiation and adjudication, and each attendee was allotted one vote in a general session of all attendees. Each small group consisted of representatives from each stakeholder category. Their discussions revolved around questions provided to them about each draft consensus statement. They were instructed to accept the statement as is, edit the statement, or develop a new consensus statement.

Roundtable I resulted in an expansion of the eight draft consensus statements to 22 (Table 1), reflecting robust discussion and reactions to the questions that accompanied the draft statements. Clearly, it was often the case that discussion identified that original draft statements did not adequately capture the complexity of the questions posed and therefore required further elaboration. In several instances a consensus statement was deemed obvious, and therefore unnecessary, to one stakeholder group but was simultaneously judged by another stakeholder group to be unobvious and important. Eighty-five percent or more of attendees agreed with 15 statements, and no statement received less than 73% of positive votes (Table 1). Dissent from the consensus statements was captured by minority reports that were solicited from attendees at the conclusion of Roundtable I. In general, these reports reflected continued ambivalence about wording, technical accuracy, and necessity of a consensus statement.

A reflective critique of Roundtable I revealed to the planning committee that the design might have unintentionally constrained the meeting’s potential for achieving its goal: the development of guidelines through consensus. Had the initial small group discussions been organized as an opportunity to first reach consensus within stakeholder groups and later work toward consensus between and among stakeholder groups in the full discussion, the process might have more readily identified the most serious gaps in understanding of the issues and underlying bases for divergent opinions. Instead, some participants made compromises to their positions within their small group discussion only to see those points rehashed in the full group discussion. Thus, this process was not as efficient, or satisfying to participants, as it otherwise might have been, and there is a possibility that the votes tallied on the consensus statements at the end of a protracted discussion reflected an indifference or disengagement of some participants rather than genuine support for any particular statement. Accordingly, the approach to Roundtable II was changed to improve the process and maximize the utility of the progress that was made in Roundtable I.

Guidelines (Roundtable II)

Based on the consensus statements, minority reports, and feedback from and self-critique of Roundtable I, the planning committee produced a set of draft guidelines in six areas: (1) application of genetic ancestry inference; (2) assumptions and confidence underlying genetic ancestry inference; (3) laboratory and statistical methods; (4) terminology and population identifiers; (5) assessing and addressing impact on individuals and groups; and (6) communication/translation of genetic ancestry inferences. These draft guidelines were distributed to attendees of Roundtable II prior to the meeting and served as starting points for the discussion.

Roundtable II was structured differently from Roundtable I to facilitate conversation both within and between the stakeholder groups (social scientists/humanists/community representatives, genetic genealogists, ancestry testing companies, and geneticists). Specifically, in Roundtable II there were four sessions of stakeholder group discussions followed by four sessions of mixed-stakeholder groups and then a review in a plenary session with all attendees. As with Roundtable I, groups were instructed to accept each guideline as is, edit the guideline, or develop a new guideline. Roundtable II resulted in reduction of the six draft guidelines to five. More than 80% of attendees agreed with four guidelines, and no guideline received less than 76% of positive votes (Table 2). To capture dissent from the guidelines, minority reports were solicited from attendees at the conclusion of Roundtable II. No minority reports were received. Below we present each guideline accompanied by a summary of comments concerning the importance of the guideline, our reasons for including the guideline, further explanations that are needed, limitations of the guideline, and suggestions for best practices relevant to the guideline.

1. Application of genetic ancestry inference

GUIDELINE

Genetic ancestry testing can be used in myriad ways, for example, to estimate individual ancestry proportions, infer genealogical relationships among individuals, characterize population history, and adjust for population stratification in biomedical research. Researchers, study participants, ancestry testing companies, and consumers should be aware that many aspects of ancestry inference continue to change because of changing reference populations, type and number of marker sets, methods of analysis, and understandings of human history. As the process of ancestry inference evolves, interpretations may change. Genetic ancestry information is only one line of evidence for establishing or denying an individual’s or group’s eligibility for particular entitlements (e.g., affirmative action in education, citizenship, employment, etc.). Where they exist, other lines of evidence should also be considered.

This guideline is important for several reasons. While genetic ancestry inference is a moving target, consumers often think that “a test is a test” and that it offers definitive, static answers. This guideline recognizes the role that genetic ancestry inference does or might play in how individuals think about their racial or ethnic background and in informing individuals’ quests for entitlements as well as underscoring the many variables that contribute to why ancestry testing is capable of offering only partial information due, for example, to the limited type and number of markers tested, as well as the populations and time frame sampled. Genetic ancestry is not synonymous with genealogical ancestry, as an individual does not inherit genetic material from all of one’s distant ancestors, and the inheritance of genetic material from a particular distant ancestor need not be a prerequisite to, or preclude, a social affinity with that distant ancestor’s culture. The guideline explains why the nature of genetic ancestry information might be subject to redefinition, reclassification, or other changes. In this way, this guideline aims to prevent overstating results and, in the case of DTC companies, overselling information. The guideline further helps a broad readership understand the distinction between acting on information that is an “inference” versus treating the information as a “fact.” Genetic ancestry information (or “results”) is subject to change, but many people might not, for a variety of reasons, understand how, why, or what that means. Rather than indicating a problem, changes in an individual’s “results” over time, and even differences across DTC services, could reflect uncertainty in science, which at its best is a self-correcting way of knowing the world. For example, changes in an individual’s results could be attributed to analytical methods having improved or discoveries having occurred after the information was initially generated and/or communicated to the consumer or user. That the “result” is not a definitive or permanent answer is important when thinking about the appropriateness of decision-making based upon that information. Such decisions include, for example, setting a threshold proportional ancestry that must be met for group membership or acquiring citizenship based on “results” suggesting affiliation with one country that later does or could turn out to be less likely than affiliation with another country.

This guideline is included to provide context to the changing nature of the field to consumers and researchers alike; to acknowledge the potential benefits and harms to individuals and groups; to emphasize the wide-ranging applications of genetic ancestry testing; and to stress that caution should be taken when any entity communicates results. Genetic ancestry inference affects the public’s confidence in genetic testing in general and vice versa, even if the methods, analysis, and resulting information gleaned for the analysis and application are quite different. As noted by Nelson (2016, at 81), “DNA spillover occurs when an individual’s experience with one domain of genetic analysis informs his or her understanding of other forms of it or authorizes its use in another domain.”21 So, it is important to make assumptions underlying genetic ancestry inference transparent and understandable upfront and to distinguish the features of this type of genetic test so that genetic ancestry inference does not unjustifiably undermine or, alternatively, unnecessarily bolster the public’s confidence in or willingness to consider or trust other types of genetic information (e.g., results of genetic testing for Mendelian conditions).

Further explanation is needed for this guideline to clarify why test interpretations continue to change, and should be expected to continue to change, to provide examples of when ancestry inference has been used for determining eligibility for entitlements, and to elucidate points of tension across stakeholder groups regarding whether and when genetic ancestry information is ever appropriate without corroborating non-genetic evidence.

There are limitations to this guideline. Some might view it as a prelude to guidelines rather than a guideline itself. Additionally, it might not be clear or obvious to consumers how changes in testing or reference samples would impact interpretation. Current interpretations—particularly interpretations indicating admixture proportions—are not static and will change in the future in foreseeable and unforeseeable ways. Perhaps the most glaring limitation of this guideline is that it calls out four groups (i.e., researchers, study participants, ancestry testing companies, and consumers) and, in so doing, apparently overlooks how users (who can be, but are not necessarily, the purchasers or test-takers), bystanders, and others actively make decisions based on the genetic ancestry information, are passively influenced by the information, or unknowingly are affected—directly and indirectly—by what the four identified groups know, believe, and do.

This guideline lends itself to a few practical next steps for those engaged in genetic ancestry inference (which were not themselves addressed as part of the roundtable discussions). First, it is important to monitor DTC testing websites for updated information and compile that information for easy reference. While a particular actor—whether a specific individual or entity—to perform this task is not inherently obvious, an independent, impartial, and trustworthy actor (such as the International Society of Genetic Genealogy, ISOGG) would be appropriate. Second, we encourage the creation and use of a model disclosure form throughout the DTC genetic ancestry industry (similar to other consumer protection disclosure forms, such as “your credit score and understanding your credit score”) that includes the material information necessary to understand the genetic ancestry testing. Third, it is preferred that the research and commercial providers make educational materials available and discuss this information prior to or as an integral part of the informed consent and purchasing processes.

2. Assumptions and confidence/laboratory and statistical methods

GUIDELINE

The assumptions, type and number of genetic markers, and reference populations used for ancestry inference should be explicitly described. An easily interpretable assessment of confidence, accuracy, and limitations should accompany quantitative claims about genetic ancestry inference. Steps should be taken to adequately convey the amount and sources of uncertainty in the inferences about genetic ancestry, whether in a research or commercial setting. As a means of proficiency testing, providers of genetic ancestry inference should analyze samples against a standard reference population panel (e.g., 1000 Genomes) that represents human global diversity to the extent possible and should communicate the results. Scientifically acceptable standards should be established for validation of results. Individuals should be informed of the availability of their raw data or lack thereof.

The goal of this guideline is to offer transparency about how results are determined and to establish standards so that results can be replicated or validated. For example, there are many different ways to look at data as rich as human genetic variation, and a model or approach appropriate for one purpose and context could be misguided for another purpose or context depending upon the assumptions and constraints of that particular model or approach. This guideline is important to enable researchers as well as consumers to make informed decisions about genetic ancestry tests and to enable consumers and other users to interpret the test results within the proper context of the test’s limitations. Methods are not uniform from one researcher or DTC provider to another or from one type of test to another. Without disclosure of these material details, it is difficult for a person (whether study participant or consumer) to obtain a second opinion (that is truly an apples-to-apples comparison) or make sense of the information. This guideline, by encouraging the disclosure of this information, will aid in the identification of the “snake oil salesmen” among those offering genetic ancestry analyses to the public. This guideline is included to avoid misinterpretation and overinterpretation of genetic ancestry test or inference results.

Further explanation is needed for this guidance, as it does not define validation or what specific measures are expected or steps to be taken. First, the guideline does not articulate how proficiency testing would best or most reasonably be achieved. Establishing an accepted proficiency testing approach would require substantial work (Box 1). Additionally, the guideline does not stipulate what information about reference populations is necessary for meaningful disclosure. At the very least, transparency about reference populations used should include their size, geographic location, level of population inclusivity, and how the markers used in genetic ancestry panels were selected, validated, and sometimes excluded to optimize a workable model of homogeneity in the reference populations being compared.

Box 1. Preliminary factors to consider when designing “proficiency testing” for genetic ancestry inferences.
  • How is validation being defined and, relatedly, what specific measures are expected?
    • How do genetic ancestry inferences for each test company compare to one another?
    • How do genetic ancestry inferences for each test company compare to a “gold standard” (e.g., published literature) or benchmark (e.g., a set of agreed upon DNA samples, such as a subset of 1000 Genomes Project, or an agreed upon simulated dataset)?
  • Does the proficiency testing process require companies to take a proficiency test from DNA to interpretation, or can the companies interpret existing genotypes?
    • If the former, the numbers need to be limited to avoid overburdening small companies (which would disproportionately feel the associated costs).
    • For proficiency testing to be effective, the process needs to have support and participation across the industry (from DNA ancestry testing companies with large and small market shares alike).
    • It seems feasible for those engaged in genetic ancestry inference to select a subset of 1000 Genomes Project samples that include diverse representation of samples with substantial admixture and samples from different geographic areas.
    • Companies would need to be blinded to the DNA IDs of the benchmark sample set to avoid foreseeable gaming of the proficiency testing system (e.g., by just looking up the 1000 Genomes Project publications).

Moreover, this guideline needs further explanation as to what is meant by “should communicate the results” and what is intended by “an easily interpretable assessment.” Without stating the perspective by which this should be judged (e.g., applying a “reasonable person standard” such as a hypothetical non-expert consumer), there might be widely varying ways of complying. One could envision an assessment being easily interpretable from one geneticist to another as satisfying the underlying purpose of this guideline; however, without more explanation, some might interpret this guideline as requiring a type of communication that goes beyond typical consumer protection requirements for formatting and text to address complex matters of both data literacy and scientific literacy in addition to basic literacy.

Limitations of this guideline are (1) that thresholds for acceptability, validity, and the like are not stated or given but would be needed for interpretation by consumers and (2) that there are many different types and sources of uncertainty, which are difficult to quantify. This guideline lends itself to at least two genetic ancestry inference practice suggestions. Establishing and sustaining a centralized resource where regular updates of these measures could be compiled and displayed in one trusted location for comparisons by consumers, study participants, and others would be meritorious. Additionally, interoperability practices should be developed and encouraged. Establishing a standardized file format for returning data to participants would enable another DTC provider or researcher to examine genetic ancestry inferences using different methods and offer “second opinions.”

3. Terminology and population identifiers

GUIDELINE

Those inferring genetic ancestry should clearly explain how they are using the terms pertaining to race, ethnicity, nationality, and other population identifiers. Those inferring ancestry should provide clear information on what reference samples are used to represent populations and what criteria are used for inclusion in a reference sample (e.g., self-identified ancestry, all grandparents from that population, etc.). Reference populations used to draw ancestral inferences should be labeled using specific geographic, cultural, and temporal identifiers.

This guideline is important because genetic ancestry inference has the simultaneous potential (1) to disconnect, alienate, and estrange individuals from groups and identities with which they are affiliated based on other types of evidence as well as (2) to connect and reconnect individuals, families, and communities with groups and identities to which they have always felt themselves to belong, or, on the other hand, to provide a bridge to peoples with whom they have no current affiliation. These disconnections, new connections, and reconnections could have real consequences in terms of how individuals and institutions understand cultural identities and define social rights and obligations as well as political responsibilities. This guideline underscores that inaccurate or even sometimes colloquial association of various terms with “ancestry” has the potential to lead consumers and the broader public to incorrect conclusions about socially ascribed populations, especially traditionally marginalized populations. In the absence of consistent terminology, and an acknowledgment that populations have complex histories of mixing and migration that should also be noted alongside their present-day appellations, consumers are sometimes confused about differences in tests that produce varying results. Increasing popularity of DTC genetic testing offers an opportunity to engage and educate the public, and common terminology and identifiers would be useful. This guideline emphasizes that accurate population labeling should include geographic, cultural, and temporal identifiers to acknowledge that these aspects of human identity are also important and that they are not static or fixed but, rather, have changed and will continue to change over time.

Identifiers are not uniform across geopolitical boundaries and do not remain constant over time. The instability of identifiers in the United States Census is well known, and common practices to aggregate and disaggregate distinct concepts of ethnicity, country of origin, and nationality into one or more Office of Management and Budget (OMB) race categories or to synthesize global data further complicate the matter (e.g., OMB,22,23 Institute of Medicine,22,23,24,25,26,27,28 Morning,22,23,24,25,26,27,28 Villarroel et al.,22,23,24,25,26,27,28 Popejoy et al.,22,23,24,25,26,27,28 Byeon et al.22,23,24,25,26,27,28). Terminology used with genetic ancestry inference must be as clear, accurate, and precise as possible to promote understanding. For example, rather than describing a DNA sample or data with merely a regional geographic descriptor (e.g., West African), it might be useful to have additional information such as self-identification (e.g., Yoruba), place of collection (e.g., Ibadan, Nigeria), and language or dialect. It would also be helpful if, to the extent practicable, samples and reference populations were accompanied by a researched description providing further reading for users on the historical emergence of each reference population. This step would help to preclude tendencies that “fossilize” groups as static, which is especially needed to broaden conceptions of non-Western and marginalized peoples. Such descriptions might include dynamics that have shaped groups’ complex histories and current social practices. This is to emphasize Ogundiran’s point regarding the history of the Yoruba: that identity making is a process (see, e.g., Ogundiran,29,30,31 Law,29,30,31 and Matory29,30,31). This guideline is included to ensure that consumers can make informed choices about different genetic ancestry tests and better understand why test results might differ between platforms. Moreover, its inclusion reflects the roundtable participants’ recognition that misinterpretation and overgeneralizations, enabled by loose and inconsistent terminology unmoored from history, can contribute to misunderstandings of other people, social discrimination, and racism that must be acknowledged, prevented, and mitigated.

Further explanation of the underlying rationale for this guideline is needed to articulate why social labels can be unintentionally confusing and their use potentially harmful in the context of genetic ancestry inference. For example, DTC genetic ancestry inference reporting a presence or absence of African ancestry or Native American ancestry can disrupt one’s self-identity. Additionally, further explanation is needed as to what specific geographic, cultural, or temporal identifiers are appropriate, how often those labels should be revisited, and how their appropriateness is to be assessed.

While this guideline has the notable limitation that enforcement of such practices and disclosures would be challenging, the genetics community’s support of this practice could create sufficient normative pressure to encourage its implementation. Additionally, this guideline could encourage best practices if an authoritative, trustworthy entity were to maintain and host a compilation of nomenclature regarding this information for available DTC genetic ancestry tests and prominent research efforts.

Subsequent to the roundtable discussions, the National Academies of Sciences, Engineering, and Medicine (NASEM) took steps toward developing and recommending best practices for use of population descriptors not only in genetic ancestry inferences but also in human genetics and genomics research more broadly. In 2021, NASEM established an ad hoc committee tasked with examining population descriptors (such as race, ethnicity, and genetic ancestry) in genomics research and offering “best practices” including recommendations for “culturally responsive methods and common data elements” that could enable the harmonization of population descriptors used in genomics research within and beyond the United States.32

4. Impact on individuals and groups

GUIDELINE

Individuals pursuing ancestry inference should be informed of potential impacts on themselves, family members, and communities. They should also be informed of the potential for planned and unplanned access to their genetic ancestry information by law enforcement, government agencies, commercial entities, and other third parties. Genetic ancestry inference is an international activity; thus, people should be aware of the different regulations regarding privacy and data security across countries. Genetic ancestry inference can lead to the discovery of unexpected results that conflict with existing beliefs or knowledge. Potential positive and negative impacts should be laid out prior to testing or participation in research. Individuals should consider discussions with family members prior to undertaking genetic ancestry inference because it may have implications for relatives. Those inferring genetic ancestry should facilitate the study of potential individual- or group-level impacts to inform the development and implementation of best practices for communicating potential impacts of ancestry inference. The potential outcomes should be broadly disseminated.

This guideline’s importance is to protect consumers and study participants from the complex web of third-party interests and varying regulations that intersect with genetic ancestry testing and information. It is intended to enable consumers to make informed decisions about genetic ancestry tests and sharing their data directly with others, whether within an ancestry testing company’s website or more broadly through online publication elsewhere. The guideline also is important in its recognition that there are potential benefits and risks of harm that can occur at the individual level and at familial, community, or broader social levels. For example, for groups in the United States that have been oppressed and had their “roots,” cultures, and actual ancestors purposefully eradicated, actively erased, or historically concealed from them, the acquisition of genetic ancestry information could affect not only the connections that individuals make with the past but also the viability of connections that individuals pursue in the present and future. Unexpected outcomes from DTC genetic testing are not infrequent, and, in most cases, these outcomes involve kinship rather than ancestral origins (e.g., Law30). Additionally, this guideline is important because ancestry inference results might be unexpected or disruptive to an individual’s or group’s understanding of themselves. Individuals and families should be aware of the limitations to genetic privacy and understand that their DNA samples or data might be used in other contexts, such as law enforcement or other governmental agencies (e.g., Guerrini et al.3). (While law enforcement agencies’ use of resources made available by DTC genetic ancestry testing was a possibility at the time the roundtables were held, it was not until years after the roundtables that news of investigative genetic genealogy to catch California’s Golden State Killer brought widespread media attention to the issue [e.g., Zhang,33,34 Speakers et al.33,34)

This guideline reflects the roundtable participants’ recognition (1) of variation in perceived and potential benefits and risks associated with genetic ancestry testing that might impact either individuals or socially-ascribed groups and (2) that it is an ethical responsibility to communicate these possibilities to the potential test-takers prior to the DNA collection and analysis. The issue of third-party uses of ancestry data and the variation in regulations governing data security and privacy increase the potential risk of harm to consumers and the need for protections. Related to this is the pervasive commercial interest in the digital age that requires consumer confidence in mechanisms for privacy and consensual use of data perceived as personal information. Following the roundtables of 2013 and 2015, considerable privacy law scholarship and data protection policy developments have materialized in the genetics community35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50 and society more broadly. These developments, discussed further below, have ongoing implications for the responsible collection, storage, and use of genetic and genomic data.

Notably, this guideline is an aggregate of several related but distinct components that the roundtable participants conceptualized under the rubric of potential risks and benefits. The broad scope or apparent randomness of this guideline’s aggregated components might limit its effectiveness. A reordering of the topics or offering of subcategories might be helpful to clarify the meaning. Additionally, the impacts (real and perceived; possible and actualized) are emerging continuously and might not be immediately apparent, obvious, or noticed by those inferring genetic ancestry or seeking genetic ancestry inferences. Another potential limitation of this guideline is that privacy and third-party data use issues are broader than the ancestry-specific context, so there is potential for these recommendations to be subsumed within existing best practices regarding genetic information more generally.

There are several potential next steps prompted by this guideline. Firstly, the research community, the DTC ancestry testing companies, and/or another entity should compile and provide analyses of applicable laws and regulations regarding access and use of genetic ancestry information from the state and federal levels in the United States and international jurisdictions. Secondly, the potential benefits and risks for groups of people should be presented to prospective study participants and consumers in addition to potential benefits and risks for the individual. Thirdly, the final two sentences of this guideline impose a directive that geneticists and social scientists work collaboratively and, moreover, suggest that when genetic ancestry inference is being conducted as research it would be good practice to integrate pre- and post-assessment of outcomes and enable ongoing communications with participants (in other words, a longitudinal and more engaged approach is preferred). Another good practice is to identify conspicuously (i.e., clearly and prominently) the major issues for those having their genetic ancestry inferred and also to promote an informed consent or informed purchase process that prompts the individual (whether study participant or consumer) (1) to review the aforementioned personal, familial, social, data, and oversight considerations and (2) to communicate with family members about their intention prior to obtaining the genetic ancestry information.

5. Communication/translation of genetic ancestry inferences

GUIDELINE

Genetic ancestry inference should be communicated in ways that are as accurate and understandable as possible for the end user and public at large. To avoid misunderstanding there should be transparency about genetic ancestry inference assumptions and techniques. Claims must be supported by evidence that is clearly communicated. Effective translation of genetic ancestry inferences for the public requires acknowledgment of historical, social and cultural contexts and constructs of ancestry, race, and ethnicity. The public should be educated on the topic of genetic ancestry inference. Those engaging in genetic ancestry inference should promote education and inform the public.

This guideline is relevant to advertising, research publications, press releases, and other communication of genetic ancestry inference. Reasonable expectations of “end users” should include not only test-takers or roots-seekers and their immediate family members but also, for example, policymakers who might consider genetic ancestry inferences and the use of the results in litigation and administrative proceedings; the press and social media users whose understandings or misunderstandings have the potential for easy amplification and influence; members of the legal profession (such as attorneys and judges) who might proffer or encounter results in novel contexts; and educators. The guideline is important to minimize misrepresentations of genetic ancestry testing and inference and to protect consumers of genetic ancestry testing from bogus claims. Genetic ancestry information is often presented as exciting, revealing, and/or a challenge to existing knowledge, and this guideline is important by underscoring the need to not oversell or overstate results to the public. This guideline urges researchers, companies, and users to avoid embellishing claims through hypothetical storytelling used for dramatic effect, including such claims in press releases, news media, film, television, and other media.

The recommendations regarding communication and translation in this guideline stem from a fundamental difference of opinion regarding the impact of genetic ancestry inference/estimation. One area of agreement among the roundtables’ diverse stakeholders was the need for greater transparency and education, which hopefully will enable consumers and prospective study participants (whom researchers might offer opportunities to receive genetic ancestry information as a study benefit) to make informed decisions about whether to obtain an ancestry test. Communication and education are shared responsibilities that can be fulfilled in several ways but are dependent upon available resources. This guideline underscores the importance of ongoing efforts to improve communication and education while acknowledging the tensions commonly observed between those in marketing departments seeking to puff products and services with colorful commentary to increase sales and interest and the perceived “dull” or black-and-white scientific state of knowledge. Table 3 provides a comprehensive, but not necessarily exhaustive, list of various types of information that can be learned from genetic ancestry testing, which could inform the development or enhancement of educational tools for consumers and other users of genetic ancestry information.

Table 3.

What can an individual learn from genetic ancestry testing?

Many things can be learned, with the strongest evidence coming from combining across genetic systems and with non-genetic evidence; however, DNA often captures information not recorded elsewhere
a info DNA profile/genotype
data indels, short tandem repeats (STRs), SNPs from fragment analysis, gene chips (arrays), or sequencing (Sanger or next-generation)
b info Paternity/maternity tests
data autosomal STRs or genome-wide chips
limits availability of samples
adv parents and offspring share exactly half their autosomal genomes identical-by-descent, so inference is straightforward given sufficient information (marker variability or number)
c info Genomic kinship in any ancestral line (out to ∼fourth cousins)
data high-density autosomal genotypes/whole-genome sequence
eg identification of cryptic relatives51
eg complex pedigree reconstruction52
eg confirmation of illegitimate descents from Kaiser Wilhelm I of Germany 1797–1888, from autosomal genomic sharing (http://www.agnessecretprincess.com/)
limits cannot tell exact degree of cousinship; cannot detect many relationships beyond ∼ fifth cousins; some deeper relationships share considerably more DNA than average—high variance in sharing
adv recovers relationships in lineages other than the patriline and matriline
d info Genetic relationships in the male line within last ∼1,000 years (descent from common patrilineal ancestor)
data high-resolution Y chromosome genotypes/sequence
eg confirmation Sally Hemmings’s child was fathered by Thomas Jefferson53
eg shared descent of majority of Cohanim from patrilineal ancestor54
eg confirmation of Napoleon’s relics from patrilineal relative55
eg descent of ∼90% of males with surname Attenborough from a common male line ancestor56
limits no information about other ancestral lineages; confidence in closeness of relationship varies by information content of markers
adv records relationships over tens of generations not captured elsewhere in genome
e info Genetic relationships in the female line within last ∼1,000 years (descent from common matrilineal ancestor)
data high-resolution mtDNA genotypes/sequence
eg identity of skeleton of King Richard III, from match to distant matrilineal relatives57
eg identity of remains of Tsar Nicholas II of Russia and family in Ekaterinburg, from match to Prince Philip58,59,60
eg many examples (http://www.isogg.org/wiki/Success_stories)
limits no information about other ancestral lineages; relatively slow mutation rate means common ancestor can be >1,000 years ago
adv records relationships over tens of generations not captured elsewhere in genome; this lineage is often poorly detailed in paper records
f info Prehistoric ancestry of male line (inferred origins 1–100 kya)
data high-resolution Y chromosome genotypes/sequence
eg discovery of African male line ancestry for Yorkshire family61
eg evidence for Norse Viking male line ancestry in men across the British Isles carrying R1a-S44362,63,64,65
limits only one ancestral line; confidence in inference varies as information content of markers, size, and coverage of databases and actual phylogeographic patterns—some haplotypes are geographically very circumscribed, whereas others are more widespread
adv strong population genetic structure can allow fine-scale ancestry inference; retains information over many more generations than autosomal DNA
g info Prehistoric ancestry of female line (inferred origins 1–100 kya)
data high-resolution mtDNA genotypes/sequence
eg Native American matrilineal ancestry in majority of Colombian and other Mestizos in Latin America66
eg Princess Diana of Wales’s matrilineal Indian roots (https://www.thetimes.co.uk/article/revealed-the-indian-ancestry-of-william-ldvsjmc9w59)
eg matrilineal East Asian ancestry in a subset of Ashkenazi Jews67
limits only one ancestral line; confidence in inference varies as information content of markers, size and coverage of databases and actual phylogeographic patterns—weaker population genetic structure than Y means inference typically less fine-scale
adv retains information over many more generations than autosomal DNA; paper records very rare for deep matrilineal genealogy
h info Overall continental ancestry composition/components (%)
data high-density autosomal genotypes/whole-genome sequence
eg Native American genomic ancestry in Afro-Caribbean and Mestizo populations68,69,70
eg African ancestry in European Americans varies by state71
eg European ancestry in the Greenlanders72
limits large variation in amount of DNA inherited from different ancestors; beyond about seven generations genetic ancestors do not include all paper genealogy ancestors, only reflects proportion of pedigree ancestry further back; the confidence of particular percentage ancestry estimates is unclear
adv individual segments chronicle ancestry not recorded elsewhere in genome; DNA can reveal overall ancestry mix, irrespective of pedigree ancestors
i info Within continental ancestry composition/components (%)
data high-density autosomal genotypes/whole-genome sequence
eg inference of genetic differentiation between inhabitants of the English counties of Devon and Cornwall73 or Scottish cities of Aberdeen and Dundee74
eg inference of village of origin in European isolated populations75
eg Scandinavian, British, and East European ancestry among European Americans varies by state71
limits large variation in amount of DNA inherited from different ancestors; beyond about seven generations genetic ancestors do not include all paper genealogy ancestors, only reflects proportion of pedigree ancestry further back; the confidence of particular percentage ancestry estimates is unclear; more difficult for finer-scale inference, but see Gilbert et al.74 for potential future resolution
adv individual segments chronicle ancestry not recorded elsewhere in genome
j info Descent from specific historical ancestor
data high-resolution Y chromosome genotypes/sequence
eg identification of men patrilineally descended from Sir John Stewart of Bonkill c1245–1298, using marker S781, and Stewart YSNP haplotype; shared ancestry of many unpedigreed Stewarts from this lineage (http://www.s781.org/)
eg identification of men patrilineally descended from the lineage of John MacDonald of Islay, c1310–1386 (and thus likely Somerled) using marker YP326 and haplotype; shared ancestry of MacDonald clan chiefs and many unpedigreed MacDonalds from this lineage (https://clandonaldusa.org/index.php/dna-after-somerled)
eg identification of men descended from the lineage of Iain Cam MacGregor of Glenorchy, c1335–1390 using marker S690; shared ancestry of cadet branches of Clan Gregor and many unpedigreed MacGregors from this lineage (http://themacgregordnaproject.blogspot.com/2017/)
limits there are different degrees of certainty arising from particular genealogies available for discovery
adv very few people have 500+ year pedigrees, allows anyone to test for a connection to those who do; determines non-paternities
k info Parental consanguinity
data high-density autosomal genotypes/whole-genome sequence
eg inference of recent ancestral inbreeding76
eg inference of demographic history of ancestors77
eg inference of incest78
limits cannot infer exact degree of parental kinship
adv documents parental relationships in any ancestral line
Depending on the test taken, the genetic raw data may also provide non-ancestry related information, if subjected to an analysis to extract such information
l info Recreational genomic information
data high-density autosomal genotypes/whole-genome sequence
eg carrier status for recessive traits, e.g., red hair
eg genetic risk of complex trait, e.g., baldness, asparagus pee
limits complex trait prediction is as yet of limited accuracy for most individuals
adv in some cases, individuals will be able to estimate chances of offspring carrying traits
m info Health information
data high-density autosomal genotypes/whole-genome sequence
eg carrier status for recessive disease, e.g., cystic fibrosis
eg genetic risk of complex disease, e.g., breast cancer
eg genetic risk of adverse drug reaction
limits complex disease prediction is as yet of limited accuracy for most individuals
adv accurate prediction will empower precision medicine

Further explanation is needed for this guideline to emphasize that this is a rational recommendation based on disagreement among stakeholders about specific details. This guideline is vulnerable to subjective interpretation of what constitutes transparent communication and education. Subjective interpretation allows for flexibility necessary to garner broad support for the guideline and multiple ways with which to fulfill the responsibilities set by the guideline. However, a third party and/or non-partisan or bipartisan review of content and form of communications over time could alleviate the limitation.

This guideline lends itself to four potential next steps. Firstly, the next step toward establishing a set of best practices would be to define standards (e.g., what constitutes accuracy, transparency, acknowledgment of social factors, and education). Secondly, it would be prudent to take steps to avoid hype when reporting research findings and to take corrective actions when research findings or inferences reported by others are inflated or otherwise misinterpreted. Thirdly, it is appropriate for those engaged in genetic ancestry inference to treat every transaction and study participation as a “teachable moment.” Additionally, a roundtable discussion with communication scholars and science journalists might be useful to address the aforementioned limitation of the guideline.

Discussion

By way of a process of consensus building through two roundtables involving dozens of stakeholders, we have developed a set of broadly agreed upon guidelines that form the basis for more specific recommendations about the use of genetic ancestry inference. Development and operationalization of such recommendations will require substantially more discussion by stakeholders and beyond. These two roundtables stemmed from an earlier working group and identify a wide range of implications involving heterogeneous stakeholders. It is only through continued engagement that the details (of who else, how, and in what contexts these guidelines will be relevant) can even be established. At present, we hope that those conducting genetic ancestry inference, be they researchers or DTC testing companies, consider adopting and continuing to contribute to the refinement and implementation of these guidelines. Most important will be their leadership in operationalizing these guidelines with the support of scholars and policymakers. As with all guidelines, we expect them—like genetic ancestry inference itself—to change over time via stakeholder-engaged processes. Most specifically, several major governmental bodies and regulators have important roles to play in supporting and promulgating a number of key practices and shared resources.

As an attempt at full transparency, we have attested to the imprecision of both our processes to develop these guidelines and the guidelines themselves. Because genetic ancestry inference tests and practices are moving targets, we must balance precision of our guidelines with the need for those guidelines to be both responsive and durable. Again, only through empirical study and expanding to new issues from new stakeholders will we have a chance at maintaining or increasing any sense of precision in these and future guidelines.

We have highlighted several limits to our processes and have acknowledged both flaws in the approach taken and caveats with the resulting deliverables. For example, the structure of Roundtable I (small group discussions with diverse stakeholder representation in each small group followed by full discussion by all participants) provoked noticeable frustrations among participants, as compromises reached in small group discussions were reopened and issues with particular statements were rehashed and renegotiated among the full group discussions. This inefficiency likely influenced the tally of votes on “consensus” statements. Also illustrative is our admission of ways in which the guidelines from Roundtable II require further explanation and the details to be operationalized. Yet the fact that we anchor these guidelines with their limitations and dissents provides the important opportunity to both enhance future processes and most transparently continue to modify and hopefully improve these guidelines. For example, the evolution from Roundtable I to Roundtable II involved moving from a long list of fragmented statements with varying degrees of support to a smaller set of thematically coherent guidelines representing a process of consensus.

Since the roundtable discussions, scholarly work examining genetic ancestry inference, descriptions of human variation, and impacts of genetic ancestry testing have continued;79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94 industry practices for DTC genetic ancestry inference have changed (for a list of active and inactive DTC companies offering genetic ancestry testing, see https://isogg.org/wiki/List_of_DNA_testing_companies); and some genetic genealogy standards have emerged.95 Policy and scholarly developments—for example, regarding privacy harms,96 data justice and dataveillance concerns,97 attention brought to Indigenous data sovereignty,44 and calls in the United States for comprehensive (rather than sector-specific) approaches to personal data protection98 as well as modernized rules for data access and increased interoperability99,100,101,102—have substantial implications for the refinement and implementation of guidelines for genetic ancestry inference and addressing matters of genetic privacy. In addition to the changing state laws for genetic information privacy specifically (e.g., Florida’s Protecting DNA Privacy Act, H.B. 1189, enacted in 2021; Utah’s Genetic Information Privacy Act, S.B. 227, enacted in 2021; Wyoming’s Genetic Data Privacy Act, H.B. 86, enacted in 2022; and Kentucky’s Genetic Information Privacy Act, H.B. 502, enacted in 2022) and personal data protection generally (e.g., the California Privacy Rights Act, Proposition 24, approved by CA voters in 2020; the Colorado Privacy Act, S.B. 21-1980, enacted in 2021; and the Virginia Consumer Data Protection Act, H.B. 2307/S.B. 1392, enacted in 2021), agencies such as the Federal Trade Commission have been increasingly active in ensuring that data practices are not unfair or deceptive or unreasonably lax with regard to privacy and security measures. Those engaged in genetic ancestry inference—whether as members of the DTC industry or academic research—will need to pay careful attention to shifting data governance expectations of individuals and obligations imposed implicitly or explicitly by policymakers.

While this report reflects efforts undertaken several years ago, the issues remain salient. Moreover, the deliverables (i.e., the “consensus” statements and preliminary guidelines) from the impassioned roundtable discussions represent an ongoing need to address current scientific and societal challenges with genetic ancestry inference. These guidelines were never intended to serve as an endpoint but, rather, as a crucial step toward the development, implementation, and continued improvement of best practices. It is our hope that this report will assist in ongoing scholarly, industry, and policy discussions for the advancement of science and broad human benefit.

Diverse stakeholders in the roundtable discussions

In addition to the authors in the writing group, the participants in the roundtable discussions included Thomas W. Athey, Mathew Barber, Jessica Bardill, Terry Barton, Blaine Bettinger, Deborah Bolnick, Vence Bonham, Katherine Borges, Joann Boughman, Esteban Burchard, Carlos Bustamante, Jake Byrnes, Shannon Stewart Christmas, Mike Dougherty, Carla Easter, Alice M. Fairhurst, Peter Förster, Elise Friedman, Malia Fullerton, Jonathan Gitlin, Bennett Greenspan, Tim Janzen, Kathryn Johnston, Rick Kittles, Sandra Soo-Jin Lee, Sally Lehrman, Jeffrey C. Long, Rita Anne Lopez, Joe McInerney, Yolanda T. Moses, Joanna Mountain, Alondra Nelson, Jean-Chretien Norreel, John Novembre, Pilar N. Ossorio, Harry Ostrer, David Pike, Terry Powell, Jenny Reardon, Casey Ross-Pethernick, Laura Lyman Rodriguez, Charles Rotimi, Ali M. Salih, Pamela Sankar, Theodore Schurr, Alexandra Shields, Mark Shriver, Eugene J. Silva, Lindsay Smith, Megan Smolenyak, Bryan Sykes, Kim Tallbear, Noah Tamarkin, Ann Turner, Marilyn Vann, James Larry Vick, Spencer Wells, Ron Whitener, Jun J. Yang, and Laurie Zoloth.

Data and code availability

This effort did not generate or analyze datasets or code except for the tally of votes during the roundtables, which are included in the tables of this publication.

Acknowledgments

We thank Misha Angrist, Beth Balkite, Melissa Creary, William Darity, Patricia Furnish, Jayne Ifekwunigwe, Erich Jarvis, Sara Katsanis, Paul Magwene, Michael Williams, and Chantelle Wolpert for their intellectual contributions to the development of the guidelines. We are also grateful to members of the ASHG Professional Practice & Social Implications Committee (formerly known as the ASHG Social Issues Committee) and our anonymous reviewers for their constructive feedback on earlier drafts of the report. The Genetic Ancestry Inference Roundtables were supported by National Human Genome Research Institute/National Institutes of Health grants to the American Society of Human Genetics and CDR (grant #R13HG006823 and grant #R01HG006295, respectively) and by Duke University. J.K.W.’s contribution to this report was supported in part by grant #5R00HG006446 and J.-H.Y.’s contribution was supported in part by grant #5R00HG007076 from the National Human Genome Research Institute. The content of this report is solely the authors’ responsibility and does not necessarily reflect the official views of the American Society of Human Genetics, the National Human Genome Research Institute/National Institutes of Health, or any other institution or organization with which the authors are affiliated.

Declaration of interests

Four authors (J.K.W., J.-H.Y., M.J.B., and C.D.R.) are editors for Human Genetics and Genomics Advances, and two authors (M.J.B. and C.D.R.) are members of the NASEM ad hoc committee on the “Use of Race, Ethnicity, and Ancestry as Population Descriptors in Genomics Research.” C.C.M. serves as Chief Genetic Genealogist for Parabon NanoLabs, Inc. and as Lead Genetic Genealogist for the PBS TV series Finding Your Roots.

References

  • 1.Antonio Regalado A. MIT Technology Review; 2018. 2017 was the Year Consumer DNA Testing Blew up.https://www.technologyreview.com/s/610233/2017-was-the-year-consumer-dna-testing-blew-up/ [Google Scholar]
  • 2.Schumer C.E. 2017. Schumer Reveals: Popular at Home DNA Test Kits are Putting Consumer Privacy at Great Risks, as DNA Firms Could Sell Your Most Personal Info & Genetic Data to All-Comers; Senator Pushes Feds to Investigate & Ensure Fair Privacy Standards for All DNA Kits.https://www.schumer.senate.gov/newsroom/press-releases/schumer-reveals-popular-at-home-dna-test-kits-are-putting-consumer-privacy-at-great-risk-as-dna-firms-could-sell-your-most-personal-info-and-genetic-data-to-all-comers-senator-pushes-feds-to-investig [Google Scholar]
  • 3.Guerrini C.J., Robinson J.O., Petersen D., McGuire A.L. Should police have access to genetic genealogy databases? Capturing the Golden State Killer and other criminals using a controversial new forensic technique. PLoS Biol. 2018;16:e2006906. doi: 10.1371/journal.pbio.2006906. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Molteni M. Wired; 2018. The Future of Crime Fighting is Family Tree Genetics.https://www.wired.com/story/the-future-of-crime-fighting-is-family-tree-forensics/ [Google Scholar]
  • 5.Murphy H. New York Times; 2018. Genealogists Turn to Cousins’ DNA and Family Trees to Crack Five More Cold Cases.https://www.nytimes.com/2018/06/27/science/dna-family-trees-cold-cases.html [Google Scholar]
  • 6.Taylor D., Turner T. The Washington Post; 2019. How Your at-Home DNA Test Results Could Solve Cold Cases.https://www.washingtonpost.com/science/2019/01/10/how-your-at-home-dna-test-results-could-solve-cold-cases/?utm_term=.fbb52f8bb98c [Google Scholar]
  • 7.Guerrini C.J., Wickenheiser R.A., Bettinger B., McGuire A.L., Fullerton S.M. Four misconceptions about investigative genetic genealogy. J. Law Biosci. 2021;8:lsab001. doi: 10.1093/jlb/lsab001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Katsanis S.H. Pedigrees and perpetrators: uses of DNA and genealogy in forensic investigations. Annu. Rev. Genomics Hum. Genet. 2020;21:535–564. doi: 10.1146/annurev-genom-111819-084213. [DOI] [PubMed] [Google Scholar]
  • 9.Khandaker T. Vice News; 2018. Canada is Using Ancestry DNA Websites to Help Deport People.https://www.vice.com/en/article/wjkxmy/canada-is-using-ancestry-dna-websites-to-help-it-deport-people [Google Scholar]
  • 10.Oray V., Katsanis S.H. Ethical considerations for DNA testing as a proxy for nationality. Glob. Bioeth. 2021;32:51–66. doi: 10.1080/11287462.2021.1896454. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zhang S. The Atlantic; 2018. The First DNA Test as Political Stunt.https://www.theatlantic.com/science/archive/2018/10/elizabeth-warren-dna-test/572998/ [Google Scholar]
  • 12.Tallbear K. Twitter; 2018. Statement on Elizabeth Warren’s DNA Test.https://twitter.com/KimTallBear/status/1051906470923493377?s=20&t=MnvxP84TbDGPKsLE5HLl5Q [Google Scholar]
  • 13.Jasnik S. Inside Higher Ed; 2019. DNA Testing, Race, and an Admissions Lawsuit.https://www.insidehighered.com/admissions/article/2019/01/28/lawsuit-raises-questions-about-dna-testing-race-and-admissions#.XFGepRSuuqJ.twitter [Google Scholar]
  • 14.Johnson S. Daily Mail; 2019. Australians Claiming to be Aboriginal Will be Forced to Undergo DNA Testing to Prove it under a One Nation Proposal to Crack Down on Welfare Fraud.https://www.dailymail.co.uk/news/article-6792955/Australians-claiming-Aboriginal-forced-undergo-DNA-test-One-Nation-policy.html [Google Scholar]
  • 15.Lyte B. The Atlantic; 2016. Native Soil.https://www.theatlantic.com/politics/archive/2016/09/native-soil/501419/ [Google Scholar]
  • 16.Horowitz A.L., Saperstein A., Little J., Maiers M., Hollenbach J.A. Consumer (dis-) interest in Genetic Ancestry Testing: the roles of race, immigration, and ancestral certainty. New Genet. Soc. 2019;38:165–194. doi: 10.1080/14636778.2018.1562327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Panofsky A., Donovan J. Genetic ancestry testing among white nationalists: from identity repair to citizen science. Soc. Stud. Sci. 2019;49:653–681. doi: 10.1177/0306312719861434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Panofsky A., Dasgupta K., Iturriaga N. How White nationalists mobilize genetics: from genetic ancestry and human biodiversity to counterscience and metapolitics. Am. J. Phys. Anthropol. 2021;175:387–398. doi: 10.1002/ajpa.24150. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Tyler K. Genetic ancestry testing, whiteness and the limits of anti-racism. New Genet. Soc. 2021;40:216–235. [Google Scholar]
  • 20.Royal C.D., Novembre J., Fullerton S.M., Goldstein D.B., Long J.C., Bamshad M.J., Clark A.G. Inferring genetic ancestry: opportunities, challenges, and implications. Am. J. Hum. Genet. 2010;86:661–673. doi: 10.1016/j.ajhg.2010.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Nelson A. Beacon Press; 2016. The Social Life of DNA: Race, Reparations, and Reconciliation after the Genome. [DOI] [PubMed] [Google Scholar]
  • 22.Office of Management and Budget . Office of Management and Budget; 1997. Revisions to the Standards for the Classification of Federal Data on Race and Ethnicity; pp. 58782–58790. [Google Scholar]
  • 23.Office of Management and Budget . Office of Management and Budget; 2016. Review and Possible Limited Revision of OMB’s Statistical Policy Directive on Standards for Maintaining, Collecting, and Presenting Federal Data on Race and Ethnicity; pp. 67398–67401. [Google Scholar]
  • 24.IOM (Institute of Medicine) The National Academies Press; Washington, DC: 2009. Race, Ethnicity, and Language Data: Standardization for Health Care Quality Improvement. [PubMed] [Google Scholar]
  • 25.Morning A. Ethnic classification in global perspective: a cross-national survey of the 2000 census round. Popul. Res. Policy Rev. 2008;27:239–272. [Google Scholar]
  • 26.Villarroel N., Davidson E., Pereyra-Zamora P., Krasnik A., Bhopal R.S. Heterogeneity/granularity in ethnicity classifications project: the need for refining assessment of health status. Eur. J. Public Health. 2019;29:260–266. doi: 10.1093/eurpub/cky191. [DOI] [PubMed] [Google Scholar]
  • 27.Popejoy A.B., Ritter D.I., Crooks K., Currey E., Fullerton S.M., Hindorff L.A., Koenig B., Ramos E.M., Sorokin E.P., Wand H., et al. Clinical Genome Resource ClinGen Ancestry and Diversity Working Group ADWG The clinical imperative for inclusivity: race, ethnicity, and ancestry (REA) in genomics. Hum. Mutat. 2018;39:1713–1720. doi: 10.1002/humu.23644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Byeon Y.J.J., Islamaj R., Yeganova L., Wilbur W.J., Lu Z., Brody L.C., Bonham V.L. Evolving use of ancestry, ethnicity, and race in genetics research-A survey spanning seven decades. Am. J. Hum. Genet. 2021;108:2215–2223. doi: 10.1016/j.ajhg.2021.10.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Ogundiran A. Indiana University Press; 2020. The Yoruba: A New History; pp. 8–9. [Google Scholar]
  • 30.Law R. Ethnicity and the slave trade: ‘lucumi’ and ‘nago’ as ethnonyms in west Africa. Hist. Afr. 1997;24:205–219. [Google Scholar]
  • 31.Matory J.L. The English professors of Brazil: on the diasporic roots of the Yoruba nation. Comp. Stud. Soc. Hist. 1999;41:72–103. [Google Scholar]
  • 32.NASEM . 2021. Use of Race, Ethnicity, and Ancestry as Population Descriptors in Genomics Research.https://www.nationalacademies.org/our-work/use-of-race-ethnicity-and-ancestry-as-population-descriptors-in-genomics-research#sl-three-columns-a1d4ff8f-34f6-4989-b5cc-c6fd31efbdb7 [Google Scholar]
  • 33.Zhang S. The Atlantic; 2018. When a DNA Test Shatters Your Identity.https://www.theatlantic.com/science/archive/2018/07/dna-test-misattributed-paternity/562928/ [Google Scholar]
  • 34.Speakers H.J., Wagner J., Guerrini C., McGuire A. ELSI Virtual Forum; 2020. To Catch a Killer: Law Enforcement Uses of DNA Databases.https://vimeo.com/430540938 Available at. [Google Scholar]
  • 35.Contreras J.L., Knoppers B.M. The genomic commons. Annu. Rev. Genomics Hum. Genet. 2018;19:429–453. doi: 10.1146/annurev-genom-083117-021552. [DOI] [PubMed] [Google Scholar]
  • 36.Bonomi L., Huang Y., Ohno-Machado L. Privacy challenges and research opportunities for genomic data sharing. Nat. Genet. 2020;52:646–654. doi: 10.1038/s41588-020-0651-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.O'Doherty K.C., Shabani M., Dove E.S., Bentzen H.B., Borry P., Burgess M.M., Chalmers D., De Vries J., Eckstein L., Fullerton S.M., et al. Toward better governance of human genomic data. Nat. Genet. 2021;53:2–8. doi: 10.1038/s41588-020-00742-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Rehm H.L., Page A.J.H., Smith L., Adams J.B., Alterovitz G., Babb L.J., Barkley M.P., Baudis M., Beauvais M.J.S., Beck T., et al. GA4GH: international policies and standards for data sharing across genomic research and healthcare. Cell Genom. 2021;1:100029. doi: 10.1016/j.xgen.2021.100029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Wan Z., Hazel J.W., Clayton E.W., Vorobeychik Y., Kantarcioglu M., Malin B.A. Sociotechnical safeguards for genomic data privacy. Nat. Rev. Genet. 2022;23:429–445. doi: 10.1038/s41576-022-00455-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Hendricks-Sturrup R.M., Prince A.E.R., Lu C.Y. Direct-to-Consumer genetic testing and potential loopholes in protecting consumer privacy and nondiscrimination. JAMA. 2019;321:1869–1870. doi: 10.1001/jama.2019.3384. [DOI] [PubMed] [Google Scholar]
  • 41.Clayton E.W., Evans B.J., Hazel J.W., Rothstein M.A. The law of genetic privacy: applications, implications, and limitations. J. Law Biosci. 2019;6:1–36. doi: 10.1093/jlb/lsz007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Kaye J., Terry S.F., Juengst E., Coy S., Harris J.R., Chalmers D., Dove E.S., Budin-Ljøsne I., Adebamowo C., Ogbe E., et al. Including all voices in international data-sharing governance. Hum. Genomics. 2018;12:13. doi: 10.1186/s40246-018-0143-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Byrd J.B., Greene A.C., Prasad D.V., Jiang X., Greene C.S. Responsible, practical genomic data sharing that accelerates research. Nat. Rev. Genet. 2020;21:615–629. doi: 10.1038/s41576-020-0257-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Carroll S.R., Garba I., Figueroa-Rodríguez O.L., Holbrook J., Lovett R., Materechera S., Parsons M., Raseroka K., Rodriguez-Lonebear D., Rowe R., et al. The CARE principles for indigenous data governance. Data Sci. J. 2020;19:1–12. [Google Scholar]
  • 45.Tiffin N. Potential risks and solutions for sharing genome summary data from African populations. BMC Med. Genomics. 2019;12:152. doi: 10.1186/s12920-019-0604-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Wilkinson M.D., Dumontier M., Aalbersberg I.J.J., Appleton G., Axton M., Baak A., Blomberg N., Boiten J.W., da Silva Santos L.B., Bourne P.E., et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 2016;3:160018. doi: 10.1038/sdata.2016.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wilkinson M.D., Dumontier M., Jan Aalbersberg I., Appleton G., Axton M., Baak A., Blomberg N., Boiten J.W., da Silva Santos L.B., Bourne P.E., et al. Addendum: the FAIR Guiding Principles for scientific data management and stewardship. Sci. Data. 2019;6:6. doi: 10.1038/s41597-019-0009-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Phillips M., Molnár-Gábor F., Korbel J.O., Thorogood A., Joly Y., Chalmers D., Townend D., Knoppers B.M. Genomics: data sharing needs an international code of conduct. Nature. 2020;578:31–33. doi: 10.1038/d41586-020-00082-9. [DOI] [PubMed] [Google Scholar]
  • 49.McCartney A.M., Anderson J., Liggins L., Hudson M.L., Anderson M.Z., TeAika B., Geary J., Cook-Deegan R., Patel H.R., Phillippy A.M. Balancing openness with Indigenous data sovereignty: an opportunity to leave no one behind in the journey to sequence all of life. Proc. Natl. Acad. Sci. USA. 2022;119 doi: 10.1073/pnas.2115860119. e2115860119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Juengst E.T., Meslin E.M. Sharing with strangers: governance models for borderless genomic research in a territorial world. Kennedy Inst. Ethics J. 2019;29:67–95. doi: 10.1353/ken.2019.0000. [DOI] [PubMed] [Google Scholar]
  • 51.Henn B.M., Hon L., Macpherson J.M., Eriksson N., Saxonov S., Pe'er I., Mountain J.L. Cryptic distant relatives are common in both isolated and cosmopolitan genetic samples. PLoS One. 2012;7:e34267. doi: 10.1371/journal.pone.0034267. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Staples J., Ekunwe L., Lange E., Wilson J.G., Nickerson D.A., Below J.E. PRIMUS: improving pedigree reconstruction using mitochondrial and Y haplotypes. Bioinformatics. 2016;32:596–598. doi: 10.1093/bioinformatics/btv618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Foster E.A., Jobling M.A., Taylor P.G., Donnelly P., de Knijff P., Mieremet R., Zerjal T., Tyler-Smith C. Jefferson fathered slave's last child. Nature. 1998;396:27–28. doi: 10.1038/23835. [DOI] [PubMed] [Google Scholar]
  • 54.Thomas M.G., Skorecki K., Ben-Ami H., Parfitt T., Bradman N., Goldstein D.B. Origins of old testament priests. Nature. 1998;394:138–140. doi: 10.1038/28083. [DOI] [PubMed] [Google Scholar]
  • 55.Lucotte G., Thomasset T., Hrechdakian P. Haplogroup of the Y chromosome of napoleon the first. J. Mol. Biol. Res. 2011;1:12–19. [Google Scholar]
  • 56.King T.E., Jobling M.A. Founders, drift, and infidelity: the relationship between Y chromosome diversity and patrilineal surnames. Mol. Biol. Evol. 2009;26:1093–1102. doi: 10.1093/molbev/msp022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.King T.E., Fortes G.G., Balaresque P., Thomas M.G., Balding D., Maisano Delser P., Neumann R., Parson W., Knapp M., Walsh S., et al. Identification of the remains of king richard III. Nat. Commun. 2014;5:5631. doi: 10.1038/ncomms6631. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Gill P., Ivanov P.L., Kimpton C., Piercy R., Benson N., Tully G., Evett I., Hagelberg E., Sullivan K. Identification of the remains of the Romanov family by DNA analysis. Nat. Genet. 1994;6:130–135. doi: 10.1038/ng0294-130. [DOI] [PubMed] [Google Scholar]
  • 59.Ivanov P.L., Wadhams M.J., Roby R.K., Holland M.M., Weedn V.W., Parsons T.J. Mitochondrial DNA sequence heteroplasmy in the grand Duke of Russia georgij romanov establishes the authenticity of the remains of tsar nicholas II. Nat. Genet. 1996;12:417–420. doi: 10.1038/ng0496-417. [DOI] [PubMed] [Google Scholar]
  • 60.Coble M.D., Loreille O.M., Wadhams M.J., Edson S.M., Maynard K., Meyer C.E., Niederstätter H., Berger C., Berger B., Falsetti A.B., et al. Mystery solved: the identification of the two missing Romanov children using DNA analysis. PLoS One. 2009;4:e4838. doi: 10.1371/journal.pone.0004838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.King T.E., Parkin E.J., Swinfield G., Cruciani F., Scozzari R., Rosa A., Lim S.K., Xue Y., Tyler-Smith C., Jobling M.A. Africans in Yorkshire? The deepest-rooting clade of the Y phylogeny within an English genealogy. Eur. J. Hum. Genet. 2007;15:288–293. doi: 10.1038/sj.ejhg.5201771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Wilson J.F., Weiss D.A., Richards M., Thomas M.G., Bradman N., Goldstein D.B. Genetic evidence for different male and female roles during cultural transitions in the British Isles. Proc. Natl. Acad. Sci. USA. 2001;98:5078–5083. doi: 10.1073/pnas.071036898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Capelli C., Redhead N., Abernethy J.K., Gratrix F., Wilson J.F., Moen T., Hervig T., Richards M., Stumpf M.P.H., Underhill P.A., et al. A Y chromosome census of the British Isles. Curr. Biol. 2003;13:979–984. doi: 10.1016/s0960-9822(03)00373-7. [DOI] [PubMed] [Google Scholar]
  • 64.Goodacre S., Helgason A., Nicholson J., Southam L., Ferguson L., Hickey E., Vega E., Stefánsson K., Ward R., Sykes B. Genetic evidence for a family-based scandinavian settlement of shetland and orkney during the viking periods. Heredity. 2005;95:129–135. doi: 10.1038/sj.hdy.6800661. [DOI] [PubMed] [Google Scholar]
  • 65.Bowden G.R., Balaresque P., King T.E., Hansen Z., Lee A.C., Pergl-Wilson G., Hurley E., Roberts S.J., Waite P., Jesch J., et al. Excavating past population structures by surname-based sampling: the genetic legacy of the Vikings in northwest England. Mol. Biol. Evol. 2008;25:301–309. doi: 10.1093/molbev/msm255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Carvajal-Carmona L.G., Soto I.D., Pineda N., Ortíz-Barrientos D., Duque C., Ospina-Duque J., McCarthy M., Montoya P., Alvarez V.M., Bedoya G., Ruiz-Linares A. Strong Amerind/white sex bias and a possible Sephardic contribution among the founders of a population in northwest Colombia. Am. J. Hum. Genet. 2000;67:1287–1295. doi: 10.1016/s0002-9297(07)62956-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Tian J.Y., Wang H.W., Li Y.C., Zhang W., Yao Y.G., van Straten J., Richards M.B., Kong Q.P. A genetic contribution from the far east into Ashkenazi jews via the ancient silk road. Sci. Rep. 2015;5:8377. doi: 10.1038/srep08377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Via M., Gignoux C.R., Roth L.A., Fejerman L., Galanter J., Choudhry S., Toro-Labrador G., Viera-Vera J., Oleksyk T.K., Beckman K., et al. History shaped the geographic distribution of genomic admixture on the island of Puerto Rico. PLoS One. 2011;6:e16513. doi: 10.1371/journal.pone.0016513. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Moreno-Estrada A., Gravel S., Zakharia F., McCauley J.L., Byrnes J.K., Gignoux C.R., Ortiz-Tello P.A., Martínez R.J., Hedges D.J., Morris R.W., et al. Reconstructing the population genetic history of the Caribbean. PLoS Genet. 2013;9:e1003925. doi: 10.1371/journal.pgen.1003925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Moreno-Estrada A., Gignoux C.R., Fernández-López J.C., Zakharia F., Sikora M., Contreras A.V., Acuña-Alonzo V., Sandoval K., Eng C., Romero-Hidalgo S., et al. Human genetics. The genetics of Mexico recapitulates Native American substructure and affects biomedical traits. Science. 2014;344:1280–1285. doi: 10.1126/science.1251688. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Bryc K., Durand E.Y., Macpherson J.M., Reich D., Mountain J.L. The genetic ancestry of African Americans, Latinos, and European Americans across the United States. Am. J. Hum. Genet. 2015;96:37–53. doi: 10.1016/j.ajhg.2014.11.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Moltke I., Fumagalli M., Korneliussen T.S., Crawford J.E., Bjerregaard P., Jørgensen M.E., Grarup N., Gulløv H.C., Linneberg A., Pedersen O., et al. Uncovering the genetic history of the present-day Greenlandic population. Am. J. Hum. Genet. 2015;96:54–69. doi: 10.1016/j.ajhg.2014.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Leslie S., Winney B., Hellenthal G., Davison D., Boumertit A., Day T., Hutnik K., Royrvik E.C., Cunliffe B., Wellcome Trust Case Control Consortium 2. Lawson D.J., et al. The fine-scale genetic structure of the British population. Nature. 2015;519:309–314. doi: 10.1038/nature14230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Gilbert E., O'Reilly S., Merrigan M., McGettigan D., Vitart V., Joshi P.K., Clark D.W., Campbell H., Hayward C., Ring S.M., et al. The genetic landscape of Scotland and the Isles. Proc. Natl. Acad. Sci. USA. 2019;116:19064–19070. doi: 10.1073/pnas.1904761116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.O'Dushlaine C., McQuillan R., Weale M.E., Crouch D.J.M., Johansson A., Aulchenko Y., Franklin C.S., Polašek O., Fuchsberger C., Corvin A., et al. Genes predict village of origin in rural Europe. Eur. J. Hum. Genet. 2010;18:1269–1270. doi: 10.1038/ejhg.2010.92. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Leutenegger A.L., Sahbatou M., Gazal S., Cann H., Génin E. Consanguinity around the world: what do the genomic data of the HGDP-CEPH diversity panel tell us? Eur. J. Hum. Genet. 2011;19:583–587. doi: 10.1038/ejhg.2010.205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Kirin M., McQuillan R., Franklin C.S., Campbell H., McKeigue P.M., Wilson J.F. Genomic runs of homozygosity record population history and consanguinity. PLoS One. 2010;5:e13996. doi: 10.1371/journal.pone.0013996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Schaaf C.P., Scott D.A., Wiszniewska J., Beaudet A.L. Identification of incestuous parental relationships by SNP-based DNA microarrays. Lancet. 2011;377:555–556. doi: 10.1016/S0140-6736(11)60201-8. [DOI] [PubMed] [Google Scholar]
  • 79.Korf B.R. Describing human populations: an evolving picture in human genetics research. Am. J. Hum. Genet. 2021;108:2207. doi: 10.1016/j.ajhg.2021.11.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Popejoy A.B., Crooks K.R., Fullerton S.M., Hindorff L.A., Hooker G.W., Koenig B.A., Pino N., Ramos E.M., Ritter D.I., Wand H., et al. Clinical genetics lacks standard definitions and protocols for the collection and use of diversity measures. Am. J. Hum. Genet. 2020;107:72–82. doi: 10.1016/j.ajhg.2020.05.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Callier S.L. The use of racial categories in precision medicine research. Ethn. Dis. 2019;29:651–658. doi: 10.18865/ed.29.S3.651. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Roth W.D., Yaylacı Ş., Jaffe K., Richardson L. Do genetic ancestry tests increase racial essentialism? Findings from a randomized controlled trial. PLoS One. 2020;15:e0227399. doi: 10.1371/journal.pone.0227399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Blanchard J.W., Outram S., Tallbull G., Royal C.D.M. We don't need a swab in our mouth to prove who we are": identity, resistance, and adaptation of genetic ancestry testing among native American communities. Curr. Anthropol. 2019;60:637–655. doi: 10.1086/705483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Rubanovich C.K., Zhang W., Bloss C.S. Direct-to-Consumer genetic ancestry testing in clinical encounters: perspectives from psychotherapy cases. JMIR Ment. Health. 2020;7:e23596. doi: 10.2196/23596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Rubanovich C.K., Taitingfong R., Triplett C., Libiger O., Schork N.J., Wagner J.K., Bloss C.S. Impacts of personal DNA ancestry testing. J. Community Genet. 2021;12:37–52. doi: 10.1007/s12687-020-00481-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Hazel J.W., Hammack-Aviran C., Brelsford K.M., Malin B.A., Beskow L.M., Clayton E.W. Direct-to-consumer genetic testing: prospective users' attitudes toward information about ancestry and biological relationships. PLoS One. 2021;16:e0260340. doi: 10.1371/journal.pone.0260340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Marcon A.R., Rachul C., Caulfield T. The consumer representation of DNA ancestry testing on YouTube. New Genet. Soc. 2021;40:133–154. [Google Scholar]
  • 88.Wright E.A., Wagner J.K., Shriver M.D., Fernandez J.R., Jablonski N.G. Practical and ethical considerations of using personal DNA tests with middle-school-aged learners. Am. J. Hum. Genet. 2019;104:197–202. doi: 10.1016/j.ajhg.2019.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Batai K., Hooker S., Kittles R.A. Leveraging genetic ancestry to study health disparities. Am. J. Phys. Anthropol. 2021;175:363–375. doi: 10.1002/ajpa.24144. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Chisholm R.L. Is genetic ancestry a tool to combat health disparities? Cell. 2021;184:1964–1965. doi: 10.1016/j.cell.2021.03.038. [DOI] [PubMed] [Google Scholar]
  • 91.Suarez-Pajes E., Díaz-de Usera A., Marcelino-Rodríguez I., Guillen-Guio B., Flores C. Genetic ancestry inference and its application for the genetic mapping of human diseases. Int. J. Mol. Sci. 2021;22:6962. doi: 10.3390/ijms22136962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Rotimi C.N., Adeyemo A.A. From one human genome to a complex tapestry of ancestry. Nature. 2021;590:220–221. doi: 10.1038/d41586-021-00237-2. [DOI] [PubMed] [Google Scholar]
  • 93.Borrell L.N., Elhawary J.R., Fuentes-Afflick E., Witonsky J., Bhakta N., Wu A.H.B., Bibbins-Domingo K., Rodríguez-Santana J.R., Lenoir M.A., Gavin J.R., III, et al. Race and genetic ancestry in medicine—a time for reckoning with racism. N. Engl. J. Med. 2021;384:474–480. doi: 10.1056/NEJMms2029562. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Lewis A.C.F., Molina S.J., Appelbaum P.S., Dauda B., Di Rienzo A., Fuentes A., Fullerton S.M., Garrison N.A., Ghosh N., Hammonds E.M., et al. Getting genetic ancestry right for science and society. Science. 2022;376:250–252. doi: 10.1126/science.abm7530. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Genetic Genealogy Standards Committee . 2015. Genetic Genealogy Standards.http://www.geneticgenealogystandards.com/ [Google Scholar]
  • 96.Keats Citron D., Solove D.J. Vol. 102. B.U. L. Review; 2022. (Privacy Harms). [Google Scholar]
  • 97.Taylor L. What is data justice? The case for connecting digital rights and freedoms globally. Big Data Soc. 2017;4 205395171773633. [Google Scholar]
  • 98.Price W.N., 2nd, Kaminski M.E., Minssen T., Spector-Bagdady K. Shadow health records meet new data privacy laws. Science. 2019;363:448–450. doi: 10.1126/science.aav5133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Schuler K. 2021. Federal Data Privacy Regulation is on the Way — That’s a Good Thing.https://iapp.org/news/a/federal-data-privacy-regulation-is-on-the-way-thats-a-good-thing/ [Google Scholar]
  • 100.National Institutes of Health . 2020. NIH Final Policy for Data Management and Sharing.https://grants.nih.gov/grants/guide/notice-files/NOT-OD-21-013.html [Google Scholar]
  • 101.Office of the National Coordinator for Health Information Technology (ONC) 21st century cures Act: interoperability, information blocking, and the ONC health IT certification program. Fed. Reg. 2020;85:25642–25961. [Google Scholar]
  • 102.Evans B.J. HIPAA's individual right of access to genomic data: reconciling safety and civil rights. Am. J. Hum. Genet. 2018;102:5–10. doi: 10.1016/j.ajhg.2017.12.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This effort did not generate or analyze datasets or code except for the tally of votes during the roundtables, which are included in the tables of this publication.


Articles from Human Genetics and Genomics Advances are provided here courtesy of Elsevier

RESOURCES