Proceedings of the National Academy of Sciences of the United States of America. 2022 Feb 14;119(7):e2200329119. doi: 10.1073/pnas.2200329119

Correction for Toczydlowski et al., Poor data stewardship will hinder global genetic diversity surveillance

PMCID: PMC8851486  PMID: 35165153

GENETICS Correction for “Poor data stewardship will hinder global genetic diversity surveillance,” by Rachel H. Toczydlowski, Libby Liggins, Michelle R. Gaither, Tanner J. Anderson, Randi L. Barton, Justin T. Berg, Sofia G. Beskid, Beth Davis, Alonso Delgado, Emily Farrell, Maryam Ghoojaei, Nan Himmelsbach, Ann E. Holmes, Samantha R. Queeno, Thienthanh Trinh, Courtney A. Weyand, Gideon S. Bradburd, Cynthia Riginos, Robert J. Toonen, and Eric D. Crandall, which published August 17, 2021; 10.1073/pnas.2107934118 (Proc. Natl. Acad. Sci. U.S.A. 118, e2107934118).

The authors note that “Eight percentage values associated with the amount of metadata present after searching outside of INSDC (e.g., in associated papers) need to be revised.” The authors state that these numbers appear in the manuscript text and in Fig. 2B. The authors also note that, “Six values in the supplementary material were typos and previously did not match the same values reported in the main text (which were and remain correct).”

Fig. 2.

Most genomic-level sequence data in the INSDC lack critical metadata. (A) Status of metadata in the INSDC for wild and domesticated individuals (BioSamples, n = 327,577). Gray hashed box indicates datasets (BioProjects) with more than four wild individuals that lacked latitude/longitude and are addressed in B (n = 493). (B) Status of metadata for records inside hashed box in A after augmenting with metadata from associated publications. Left of black diamonds = present in INSDC.

In the abstract, “86%” should instead appear as “87%,” “33%” should instead appear as “39%,” and “39%” should instead appear as “51%.” On page 1, right column, first full paragraph, line 6, “14%” should instead appear as “13%.” On page 1, right column, first full paragraph, line 11, “41%” should instead appear as “40%.” On page 2, left column, second full paragraph, lines 11–13, “40% had collection years, and 33% had both (39% if any type of location data were considered)” should instead appear as “51% had collection years, and 39% had both (51% if any type of location data were considered).” The online version has been corrected.

In the SI Appendix, page 4, first paragraph, line 4, “233,644” should instead appear as “233,639.” In the SI Appendix, page 4, second full paragraph, line 10, “14%” should instead appear as “15%.” In the SI Appendix, page 6, line 35, “380,416 sequences from 327,582 BioSamples” should instead appear as “380,410 sequences from 327,577 BioSamples.” In the SI Appendix, page 6, lines 40–41, “268,384 sequences from 233,644 BioSamples” should instead appear as “268,378 sequences from 233,639 BioSamples.” The SI Appendix has been corrected online.

The corrected Fig. 2 and its legend appear below. The online version has been corrected.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences
