Abstract
Microbial diversity in geothermal waters of the Unkeshwar hot springs in Maharashtra, India, was studied using 16S rRNA amplicon metagenomic sequencing. Taxonomic analysis revealed the presence of Bacteroidetes, Proteobacteria, Cyanobacteria, Actinobacteria, Archeae, and OD1 phyla. Metabolic function prediction analysis indicated a battery of biological information systems indicating rich and novel microbial diversity, with potential biotechnological applications in this niche.
GENOME ANNOUNCEMENT
Deccan basaltic geothermal hot springs are rich in sulfur and yet are unexploited for microbial ecology (1). The geographical location of Unkeshwar is latitude 19°34′ to 19°40′N and 78°22′ to 78°34′E longitude, with water temperatures ranging from 42°C to 60°C, located in Maharashtra, India.
Replicate water samples were collected during December 2012 in sterile containers, filtered through 0.22-µm-pore-size filters (Merck Millipore, India), and DNA extraction was performed using the RNA PowerSoil total RNA isolation kit (Mo Bio Laboratories, Inc., Carlsbad, CA, USA), according to the manufacturer’s protocol. DNA was enriched by the Multiple Annealing and Loop-Based Amplification Cycles (MALBAC) protocol (2) and then amplified by using primers spanning the V3 to V4 region of the 16S rRNA gene (3).
Paired-end sequencing of the library was performed on an Illumina MiSeq platform using 2 × 251-bp chemistry. The quality parameters for the obtained sequences were checked using the FASTQ quality filter (Phred quality [Q] <20). The resulting good-quality sequences were then overlapped into single longer reads using SeqPrep QIIME (4). Chimeras were removed using the program UCHIME, and all nonchimeric sequences were taken for picking operational taxonomic units (OTUs) using the program Uclust, with a threshold of 97% similarity (5, 6). A representative sequence was identified for each OTU and aligned against a Greengenes core set of sequences using the PyNAST program (7, 8) Taxonomic classification was performed using the RDP Classifier (9) and Greengenes (7) OTU databases. Further, alpha diversity was determined by calculating Shannon, Chao1, and observed species metrics (4). The rarefaction curve was generated for each of the metrics, and metric calculations were performed using the QIIME software (8).
A total of 1,360,637 raw reads were obtained, and 873,631 reads were considered for analysis (after filtering), from which a total of 6,684 OTUs were detected. They were checked for singleton OTUs (i.e., OTU has single reads), and 4,935 singletons were identified and removed. A total of 1,749 OTUs were used for taxonomy classification showing the dominant phyla of Bacteroidetes, Proteobacteria, Cyanobacteria, and Actinobacteria. Around 80% of the reads were assigned to Bacteroidetes and the other 20% to all other phyla. Two of the reads were also assigned to Archaea belonging to the Cenarchaeum genus (phylum Thaumarchaeota). Interestingly, some of the reads mapped to the OD1 phylum, which is known for small genome size (0.7 to 1.2Mbp) and large inventories of novel proteins (10). The rest of the OTUs were mapped to other and unknown phyla. The microbial diversity within the samples was also calculated by Shannon, Chao1, and observed species metrics used to measure the estimated observed OTU abundances, accounting for both richness and evenness.
To determine the accuracy of functional predictions using a Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt), inferences were compared (11) to understand survival strategies and adaptation in extreme niches. The results revealed a wider range of genetic diversity involved in various essential processes, like genetic (translation, transcription, and repair) and environmental information signaling and processing, cellular processes (cell growth and death, cell communication, cell motility, transport, and catabolism), signal transduction and metabolism (of carbohydrates, amino acids, lipids, terpenoids, polyketides, cofactors, vitamins, xenobiotics, energy, and proteins and biosynthesis of secondary metabolites) and organismal systems.
Nucleotide sequence accession number.
The sequence reads obtained in this study were deposited in the Sequence Read Archive (SRA) accession no. SRX1499015.
ACKNOWLEDGMENTS
We thank the Director of the CSIR-National Chemical Laboratory, Pune, for providing infrastructure facilities. G.T.M. thanks the Council of Scientific and Industrial Research (CSIR) for fellowship support and the Academy of Scientific and Innovative Research (AcSIR), New Delhi, India. M.S.D. gratefully acknowledges funding support from Science and Engineering Research Board (SERB), New Delhi, India (project no. EMR 2014/000483). We acknowledge SciGenom Labs, India, for their help in the generation of sequences.
Footnotes
Citation Mehetre GT, Paranjpe A, Dastager SG, Dharne MS. 2016. Investigation of microbial diversity in geothermal hot springs in Unkeshwar, India, based on 16S rRNA amplicon metagenome sequencing. Genome Announc 4(1):e01766-15. doi:10.1128/genomeA.01766-15.
REFERENCES
- 1.Pathak AP, Rathod MG. 2014. Cultivable bacterial diversity of terrestrial thermal spring of Unkeshwar, India. J Biochem Tech 5:814–818. [Google Scholar]
- 2.De Bourcy CF, De Vlaminck I, Kanbar JN, Wang J, Gawad C, Quake SR. 2014. A quantitative comparison of single-cell whole genome amplification methods. PLoS One 9:e105585. doi: 10.1371/journal.pone.0105585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Takahashi S, Tomita J, Nishioka K, Hisada T, Nishijima M. 2014. Development of a prokaryotic universal primer for simultaneous analysis of Bacteria and Archaea using next-generation sequencing. PLoS One 9:e105592. doi: 10.1371/journal.pone.0105592. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kuczynski J, Stombaugh J, Walters WA, González A, Caporaso JG, Knight R. 2012. Using QIIME to analyze 16S rRNA gene sequences from microbial communities. Curr Protoc Bioinformatics Chapter 10:Unit 10.7. doi: 10.1002/9780471729259.mc01e05s27. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lozupone CA, Stombaugh J, Gonzalez A, Ackermann G, Wendel D, Vázquez-Baeza Y, Jansson JK, Gordon JI, Knight R. 2013. Meta-analyses of studies of the human microbiota. Genome Res 23:1704–1714. doi: 10.1101/gr.151803.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.D’Argenio V, Casaburi G, Precone V, Salvatore F. 2014. Comparative metagenomics analysis of human gut microbiome composition using two different bioinformatic pipelines. Biomed Res Int 2014:325340. doi: 10.1155/2014/325340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Andersen GL. 2006. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol 72:5069–5072. doi: 10.1128/AEM.03006-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Caporaso JG, Bittinger K, Bushman FD, DeSantis TZ, Andersen GL, Knight R. 2010. PyNAST: a flexible tool for aligning sequences to a template alignment. Bioinformatics 26:266–267. doi: 10.1093/bioinformatics/btp636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wang Q, Garrity GM, Tiedje JM, Cole JR. 2007. Naive Bayesian classifier for rapid assignment of rRNA sequences into the new bacterial taxonomy. Appl Environ Microbiol 73:5261–5267. doi: 10.1128/AEM.00062-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Kantor RS, Wrighton KC, Handley KM, Sharon I, Hug LA, Castelle CJ, Thomas BC, Banfield JF. 2013. Small genomes and sparse metabolisms of sediment-associated bacteria from four candidate phyla. mBio 4(5):e00708-13. doi: 10.1128/mBio.00708-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Staley C, Gould TJ, Wang P, Phillips J, Cotner JB, Sadowsky MJ. 2014. Core functional traits of bacterial communities in the upper Mississippi River show limited variation in response to land cover. Front Microbiol 5:414. doi: 10.3389/fmicb.2014.00414. [DOI] [PMC free article] [PubMed] [Google Scholar]
