Terriglobus albidus strain ORNL is a heterotrophic soil acidobacterium isolated from the rhizosphere of an Eastern cottonwood tree (Populus deltoides) in Tennessee. Its 6.4-Mb chromosome was completely sequenced using PacBio long reads, and it encodes 5,010 proteins and 53 RNAs.
ABSTRACT
Terriglobus albidus strain ORNL is a heterotrophic soil acidobacterium isolated from the rhizosphere of an Eastern cottonwood tree (Populus deltoides) in Tennessee. Its 6.4-Mb chromosome was completely sequenced using PacBio long reads, and it encodes 5,010 proteins and 53 RNAs.
ANNOUNCEMENT
Acidobacteria are one of the most diverse and abundant bacterial phyla, with members colonizing most soils, in which they represent, on average, one-fifth of the microbial community (1, 2). Among the 14 currently recognized classes in the phylum (3), the class Acidobacteriia encompasses the vast majority of the several dozen cultivated members and genomic sequences. The other classes have few isolates and are only represented by culture-independent sequence data from single cells and metagenomes. Here, we report the complete genome sequence of Terriglobus albidus ORNL, isolated from a Populus deltoides rhizosphere sample in Oak Ridge, Tennessee. Single bacterial cells from a root-associated soil sample were randomly deposited by flow cytometry sorting (4, 5) on Reasoner’s 2A (R2A) agar and incubated at 28°C to form colonies, followed by identification by small subunit (SSU) rRNA gene amplicon sequencing (5). Based on ClustalW 2.1 sequence alignment, one colony had 1,459 out of 1,461 nucleotides (nt) identical (99.8%) to those of the sequence of T. albidus Ac26B10, a species isolated and described from semiarid savannah soil in Namibia (6). Therefore, our isolate is a strain of T. albidus, designated ORNL.
T. albidus ORNL was grown in liquid R2A agar for 5 days. Genomic DNA was extracted and purified using a Qiagen DNeasy kit and a Zymo Research DNA Clean & Concentrator kit, followed by shearing with g-Tubes (Covaris, Woburn, MA) to a 10-kb average fragment size. A library was prepared with an SMRTbell template prep kit v1.0 (Pacific Biosciences, Menlo Park, CA) and sequenced on a Pacific Biosciences Sequel instrument. Sequence quality-based filtering and assembly were performed using Hierarchical Genome Assembly Process 4 (HGAP4) software implemented in the PacBio SMRTLink v7 pipeline with a target genome size of 5 Mbp (based on other Terriglobus genomes), a minimum confidence of 40, a minimum coverage of 50, and the other options set as defaults. A total of 78,854 filtered subreads (N50 length, 8,759 nt) were assembled into a single polished contig 6,405,582 nt long, with a mean coverage of 553-fold and a G+C content of 58.5%. Using Geneious v11 (7), we determined that the ends of the contig spanned an open reading frame encoding the same potential protein. ClustalW v2.1 was used to align the extracted contig ends with the corresponding sequencing reads. An identical region was identified, indicating a closed circular genome, and the duplicated sequence was manually removed from one of the ends. Gene prediction and functional annotation were performed using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v4.8 (8), which identified 5,010 protein coding sequences, 47 tRNAs, 1 rRNA operon, and 3 noncoding RNAs (ncRNAs). A metabolic model was built using KBase (9) and is accessible together with a Rapid Annotations using Subsystems Technology (RAST) annotation at https://narrative.kbase.us/narrative/ws.44746.obj.1. The genome of Terriglobus albidus will help in identifying genes and processes associated with the evolution of plant-microbe associations.
Data availability.
The Terriglobus albidus ORNL genome sequence has been deposited in GenBank under accession number CP042806. The version described in this paper is the first version, CP042806.1. The PacBio reads have been deposited in the SRA database under accession number SRR10037938.
ACKNOWLEDGMENTS
We thank the Genomic Resource Center at the University of Maryland School of Medicine for the genomic library preparation and sequencing.
This research was funded by the U.S. DOE Office of Biological and Environmental Research, Genomic Science Program as part of the Plant Microbe Interfaces Scientific Focus Area (http://pmi.ornl.gov). Oak Ridge National Laboratory is managed by UT-Battelle, LLC, for the U.S. Department of Energy under contract DE-AC05-00OR22725.
REFERENCES
- 1.Janssen PH. 2006. Identifying the dominant soil bacterial taxa in libraries of 16S rRNA and 16S rRNA genes. Appl Environ Microbiol 72:1719–1728. doi: 10.1128/AEM.72.3.1719-1728.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kielak AM, Barreto CC, Kowalchuk GA, van Veen JA, Kuramae EE. 2016. The ecology of Acidobacteria: moving beyond genes and genomes. Front Microbiol 7:744. doi: 10.3389/fmicb.2016.00744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Parks DH, Chuvochina M, Waite DW, Rinke C, Skarshewski A, Chaumeil PA, Hugenholtz P. 2018. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat Biotechnol 36:996. doi: 10.1038/nbt.4229. [DOI] [PubMed] [Google Scholar]
- 4.Hamilton-Brehm SD, Vishnivetskaya TA, Allman SL, Mielenz JR, Elkins JG. 2012. Anaerobic high-throughput cultivation method for isolation of thermophiles using biomass-derived substrates. Methods Mol Biol 908:153–168. doi: 10.1007/978-1-61779-956-3_15. [DOI] [PubMed] [Google Scholar]
- 5.Utturkar SM, Cude WN, Robeson MS Jr, Yang ZK, Klingeman DM, Land ML, Allman SL, Lu TY, Brown SD, Schadt CW, Podar M, Doktycz MJ, Pelletier DA. 2016. Enrichment of root endophytic bacteria from Populus deltoides and single-cell-genomics analysis. Appl Environ Microbiol 82:5698–5708. doi: 10.1128/AEM.01285-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Pascual J, Wust PK, Geppert A, Foesel BU, Huber KJ, Overmann J. 2015. Terriglobus albidus sp. nov., a member of the family Acidobacteriaceae isolated from Namibian semiarid savannah soil. Int J Syst Evol Microbiol 65:3297–3304. doi: 10.1099/ijsem.0.000411. [DOI] [PubMed] [Google Scholar]
- 7.Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647–1649. doi: 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Arkin AP, Cottingham RW, Henry CS, Harris NL, Stevens RL, Maslov S, Dehal P, Ware D, Perez F, Canon S, Sneddon MW, Henderson ML, Riehl WJ, Murphy-Olson D, Chan SY, Kamimura RT, Kumari S, Drake MM, Brettin TS, Glass EM, Chivian D, Gunter D, Weston DJ, Allen BH, Baumohl J, Best AA, Bowen B, Brenner SE, Bun CC, Chandonia J-M, Chia J-M, Colasanti R, Conrad N, Davis JJ, Davison BH, DeJongh M, Devoid S, Dietrich E, Dubchak I, Edirisinghe JN, Fang G, Faria JP, Frybarger PM, Gerlach W, Gerstein M, Greiner A, Gurtowski J, Haun HL, He F, Jain R, Joachimiak MP, Keegan KP, Kondo S, Kumar V, Land ML, Meyer F, Mills M, Novichkov PS, Oh T, Olsen GJ, Olson R, Parrello B, Pasternak S, Pearson E, Poon SS, Price GA, Ramakrishnan S, Ranjan P, Ronald PC, Schatz MC, Seaver SMD, Shukla M, Sutormin RA, Syed MH, Thomason J, Tintle NL, Wang D, Xia F, Yoo H, Yoo S, Yu D. 2018. KBase: the United States Department of Energy Systems Biology Knowledgebase. Nat Biotechnol 36:566–569. doi: 10.1038/nbt.4163. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The Terriglobus albidus ORNL genome sequence has been deposited in GenBank under accession number CP042806. The version described in this paper is the first version, CP042806.1. The PacBio reads have been deposited in the SRA database under accession number SRR10037938.