Head
The Report by M. Gymrek et al. (“Identifying personal genomes by surname inference,” 18 January, p. 321) demonstrates that DNA samples can be combined with surname and other data to re-identify seemingly anonymous records. The study adds to the literature showing that large, publicly available data sets can be leveraged to infer personal information (1) and identify unique individuals (2). In 2005, a 15-year-old tracked down his sperm donor father using a similar approach (3). This demonstration may undermine individual research subjects’ confidence that their DNA can be shared in a way that is not re-identifiable. To counteract this effect, we must augment imperfect technical safeguards with measures that make such re-identification socially, legally, and economically unacceptable.
Society can work in many ways to mitigate risks and maintain a climate of confidence that will continue to encourage research subject participation. Subjects must be informed of the risk that their DNA sequence will be identified with their names. Data stewards must make a good faith effort to protect DNA-based records, including continuing to de-identify the data (e.g., remove explicit identifiers). Systems for audit of access and use of the data should be routine, and data should be protected from anonymous access. Data use agreements should provide institutional and legal remedies when societal boundaries and robust research norms of respect for the privacy of individuals are violated.
The diversity of human genomes guarantees that each person will harbor markers for higher-than-average risk for some adverse outcomes. Thus, further risk mitigation should include regulatory approaches such as expansion of protections currently provided under the Genetic Information Nondiscrimination Act and other anti-discrimination laws, which currently protect against some forms of discrimination, but not others, such as long-term care or life insurance.
References
- 1.El Emam K, Jonker E, Arbuckle L, Malin B. PLoS One. 2011;6:e28071. doi: 10.1371/journal.pone.0028071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Jobling M. Trends Genet. 2001;17:353. doi: 10.1016/s0168-9525(01)02284-3. [DOI] [PubMed] [Google Scholar]
- 3.Sample I. Teenager finds sperm donor dad on Internet. The Guardian. 2005 Nov 2; www.guardian.co.uk/science/2005/nov/03/genetics.news.