Skip to main content
PhytoKeys logoLink to PhytoKeys
. 2017 Oct 12;(88):119–122. doi: 10.3897/phytokeys.88.14607

Data Release: DNA barcodes of plant species collected for the Global Genome Initiative for Gardens Program, National Museum of Natural History, Smithsonian Institution

Jose D Zúñiga 1, Morgan R Gostel 1,2, Daniel G Mulcahy 1, Katharine Barker 1, Asia Hill 3, Maryam Sedaghatpour 4, Samantha Q Vo 5, Vicki A Funk 2, Jonathan A Coddington 1
PMCID: PMC5672136  PMID: 29118648

Abstract Abstract

The Global Genome Initiative has sequenced and released 1961 DNA barcodes for genetic samples obtained as part of the Global Genome Initiative for Gardens Program. The dataset includes barcodes for 29 plant families and 309 genera that did not have sequences flagged as barcodes in GenBank and sequences from officially recognized barcoding genetic markers meet the data standard of the Consortium for the Barcode of Life. The genetic samples were deposited in the Smithsonian Institution’s National Museum of Natural History Biorepository and their records were made public through the Global Genome Biodiversity Network’s portal. The DNA barcodes are now available on GenBank.

Keywords: DNA barcoding, GenBank, land plants

Introduction

The Global Genome Initiative (GGI) is a Smithsonian Institution program to collect, organize, share, and study genomic samples of non-human species. The mission of GGI is to preserve and understand Earth’s genomic biodiversity. In pursuit of this mission, GGI aims to collect and preserve genome-quality tissue samples from all major lineages of life on Earth; foster biodiversity genomics research by generating DNA barcodes for dark taxa (i.e., those with no genetic data in online repositories); and promote the use of new technologies to study genomics across the tree of life. GGI supports the Global Genome Biodiversity Network (GGBN), an international network of institutions interested in the preservation of non-human genomic samples (Seberg et al. 2016). Members of GGBN can make their DNA and tissue collections discoverable on GGBN’s data portal (http://www.ggbn.org/ggbn_portal/), ensuring transparent access and visibility to the genetic resources to the research community.

The Global Genome Initiative for Gardens Program (GGI-Gardens) is a GGI-funded effort to collect and preserve genetic material from the plant Tree of Life that is not yet represented in any of GGBN’s partner institutions, and that are currently found in living collections around the globe. In its first phase, the program targeted living plant collections in the Washington, DC area and collected more than 1,800 genome-quality tissues from 209 families, 1007 genera and 1631 species. Moving forward, GGI-Gardens is focused on expanding its partnerships internationally to continue sample and preserve genomic biodiversity from all families and genera, and, potentially, species of plants on Earth.

The genetic samples collected to date have been deposited in the Smithsonian Institution’s National Museum of Natural History Biorepository (http://naturalhistory.si.edu/rc/biorepository) and are available upon request to researchers across the globe (regulations on sampling leaf material can be found here). All corresponding specimen vouchers have been accessioned in the United States National Herbarium (US) or other recognized, partner herbaria. The GGI-Gardens protocol (Gostel et al. 2016) and US National Herbarium best practices (Funk et al. 2017) have been published to facilitate the establishment of voucher programs at partner institutions.

GGI’s barcoding strategy data-mines GenBank to detect taxonomic groups that do not have sequences flagged as barcodes, thus allowing GGI to focus sequencing efforts on lineages that are not represented in this repository. Using this method, GGI selected more than 500 plant genera from GGI-Gardens collections and generated sequences for four genetic markers according to the DNA barcode data standard (Consortium for the Barcode of Life 2005). As a result, all sequences from officially accepted barcoding regions (two of the four markers targeted, see below) have been labeled with the keyword “BARCODE” in GenBank. All samples were determined at least to genus by the time of publication of this release paper by staff at the living collection where they were collected. Our intentions are to make these data publicly available, to contribute to the DNA Barcode library to assist further research, and to make the presence of these genomic-quality tissues known and available for the academic community for genomic research and education purposes via a documented application process. All DNA barcode sequences were submitted to GenBank as part of the GGI-Gardens BioProject (ID: PRJNA389125), which is included in Global Genome Initiative’s DNA Barcoding umbrella BioProject (ID: PRJNA384793).

Data resources and contents of the dataset

Data are deposited in GenBank under accession numbers MF348326-MF350286 (see supplementary file 1 for the full list of accession numbers). A total of 1961 sequences have been submitted to GenBank representing 160 families and 521 plant genera, including 29 families and 309 genera that previously did not have sequences flagged as barcodes in this data repository. Two of the four genetic markers sequenced, rbcL and matK, have been officially recognized as barcoding regions for land plants (CBOL Plant Working Group 2009). The other two loci targeted in this study, the nuclear ribosomal internal transcribed spacer (nrITS) and the plastid psbA-trnH intergenic spacer, are commonly used for barcoding in angiosperms (Kress et al. 2005, Kress and Erickson 2009, Hollingsworth et al. 2016).

Acknowledgements

All laboratory procedures and computer work were conducted in the Laboratories for Analytical Biology facilities at the National Museum of Natural History in Washington, DC and at the Museum Support Center in Suitland, MD. The authors wish to thank Natalia Agudelo and Gabriel Johnson for their continuous support in the lab, Kathryn Faulconer, Sarah Gabler, Kadiera Ingram, Carol Kelloff, Monica Marcelli, Jacob Suissa and Kristen Van Nest for their participation in the collection and curation of the genetic samples and the associated voucher specimens, and Niamh Redmond and Michael Trizna for their advice on post-sequencing data management. This material is based upon work supported by the Global Genome Initiative and Smithsonian Institution Barcoding Network. Any paper(s) resulting directly from this specimen-processing project should reference support from the Smithsonian Institution’s DNA Barcode Network and the Laboratories of Analytical Biology, National Museum of Natural History, Smithsonian Institution.

Citation

Zúñiga JD, Gostel MR, Mulcahy DG, Barker K, Hill A, Sedaghatpour M, Vo SQ, Funk VA, Coddington JA (2017) Data Release: DNA barcodes of plant species collected for the Global Genome Initiative for Gardens Program, National Museum of Natural History, Smithsonian Institution. PhytoKeys 88: 119–122. https://doi.org/10.3897/phytokeys.88.14607

Supplementary materials

Supplementary material 1

List of samples collected for the Global Genome Initiative for Gardens project selected for DNA barcoding, with GenBank accession numbers and genetic sample identification numbers. All the sequences are included in the GGI-Gardens BioProject.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.

Jose D. Zúñiga, Morgan R. Gostel, Daniel G. Mulcahy, Katharine Barker, Asia Hill, Maryam Sedaghatpour, Samantha Q. Vo, Vicki A. Funk, Jonathan A. Coddington

Data type: Microsoft Word Document (.docx)

Explanation note: List of samples collected for the Global Genome Initiative for Gardens project selected for DNA barcoding, with GenBank accession numbers and genetic sample identification numbers.

phytokeys-88-119-s001.doc (999.5KB, doc)

References

  1. Consortium for the Barcode of Life (2005) Data Standards for BARCODE Records in INSDC (BRIs). http://www.barcodeoflife.org/sites/default/files/DWG_data_standards-Final.pdf
  2. CBOL Plant Working Group (2009) A DNA barcode for land plants. Proceedings of the National Academy of Sciences of the United States of America 106(31): 12794–12797. https://doi.org/10.1073/pnas.0905845106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Funk VA, Gostel MR, Devine A, Kelloff CL, Wurdack KJ, Tuccinardi C, Radosavljevic A, Peters M, Coddington JA. (2017) Guidelines for collecting vouchers and tissues intended for genomic work (Smithsonian Institution): Botany best practices. Biodiversity Data Journal 5: e11625. https://doi.org/10.3897/BDJ.5.e11625 [DOI] [PMC free article] [PubMed]
  4. Gostel MR, Kelloff CL, Wallick K, Funk VA. (2016) A workflow to preserve genome-quality tissue samples from plants in botanical gardens and arboreta. Applications in Plant Sciences 4: 1600039. https://doi.org/10.3732/apps.1600039 [DOI] [PMC free article] [PubMed]
  5. Kress WJ, Wurdack KJ, Zimmer EA, Weigt LA, Janzen DH. (2005) Use of DNA barcodes to identify flowering plants. Proceedings of the National Academy of Sciences of the United States of America 102(23): 8369–8374. https://doi.org/10.1073/pnas.0503123102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Kress WJ, Erickson DL. (2009) A two-locus global DNA barcode for land plants: The coding rbcL gene complements the non-coding trnH-psbA spacer region. PLoS ONE. https://doi.org/10.1371/journal.pone.0000508 [DOI] [PMC free article] [PubMed]
  7. Hollingsworth PM, Li D, van der Bank M, Twyford AD. (2016) Telling plant species apart with DNA: form barcodes to genomes. Philosophical Transactions of the Royal Society B 371: 20150338. https://doi.org/10.1098/rstb.2015.0338 [DOI] [PMC free article] [PubMed]
  8. Seberg O, Droege G, Barker K, Coddington JA, Funk VA, Gostel MR, Petersen G, Smith PP. (2016) Global Genome Biodiversity Network: saving a blueprint of the Tree of Life–a botanical perspective. Annals of Botany 118: 393–399. https://doi.org/10.1093/aob/mcw121 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material 1

List of samples collected for the Global Genome Initiative for Gardens project selected for DNA barcoding, with GenBank accession numbers and genetic sample identification numbers. All the sequences are included in the GGI-Gardens BioProject.

This dataset is made available under the Open Database License (http://opendatacommons.org/licenses/odbl/1.0/). The Open Database License (ODbL) is a license agreement intended to allow users to freely share, modify, and use this Dataset while maintaining this same freedom for others, provided that the original source and author(s) are credited.

Jose D. Zúñiga, Morgan R. Gostel, Daniel G. Mulcahy, Katharine Barker, Asia Hill, Maryam Sedaghatpour, Samantha Q. Vo, Vicki A. Funk, Jonathan A. Coddington

Data type: Microsoft Word Document (.docx)

Explanation note: List of samples collected for the Global Genome Initiative for Gardens project selected for DNA barcoding, with GenBank accession numbers and genetic sample identification numbers.

phytokeys-88-119-s001.doc (999.5KB, doc)

Articles from PhytoKeys are provided here courtesy of Pensoft Publishers

RESOURCES