Abstract
Bacterial community of the Khuangcherapuk cave sediment was assessed by Illumina amplicon sequencing. The metagenome comprised of 533,120 raw reads with an average base quality (Phred score) 36.75 and G + C content is 57.61%. A total of 18 bacterial phyla were detected with following abundant genus — Mycobacterium (21.72%), Rhodococcus (7.09%), Alteromonas (1.42%), Holomonas (0.7%) and Salinisphaera (0.20%). Majority portion of the sequences (68%) is unclassified at the genus level indicating the possibilities for the presence of novel species in this cave. This study reports the cave bacterial diversity from the biodiversity hotspot region of Eastern Himalayas. Metagenome sequence data are available at NCBI under the Bioproject database with accession no. SRP056890.
Keywords: Khuangcherapuk cave, Illumina, Metagenome
Specifications | |
---|---|
Organism/cell line/tissue | Illumina-based analysis of bacterial community in Khuangcherapuk cave of Mizoram, Northeast India |
Sex | Not applicable |
Sequencer or array type | Illumina |
Data format | Analyzed |
Experimental factors | Environmental sample |
Experimental features | V3 hypervariable region of 16S rDNA was sequenced using paired end Illumina Mi-Seq technology and the sequence was analyzed using QIIME data analysis package. |
Consent | Not applicable |
Sample source location | Sediment sample, Khuangcherapuk cave, Mizoram, Northeast India |
1. Direct link to deposited data
2. Experimental design, materials and methods
Most of the diversity study on cave microbiology is based on the cultivation approach which can determine less than 1% of the microbes [1] With the advancement of culture independent technique using next generation sequencing or clone library construction, it is now possible to analyze the entire population in the community as well as their functional potentiality in extreme environments [2], [3], [4] Therefore, the present research was intended to analyze the bacterial community using Illumina based metagenomic approach in Khuangcherapuk cave, which is devoid of any light source and thrives under energetically unfavorable and nutrient-poor conditions.
Samples were collected during February 2014 from the Khuangcherapuk Cave (23°41′30″ N, 92°37′5″E), Ailawng village, Mizoram, Northeast India. The cave is 162 m long with a vertical range of 10 m depth and is considered as the biggest cave in Mizoram. Ten individual composite sediment samples were collected from different places of the cave floor and DNA was extracted using the Fast DNA spin kit (MP Biomedical, Solon, OH, USA). The extracted DNA was purified twice using 0.5% low melting point agarose gel and mixed to prepare a composite sample.
The V3 hypervariable region of the 16S rRNA gene was amplified using F 341/R518 primer combination (5′-CCTACGGGAGGCAGCAG-3′; 5′-ATTACCGCGGCTGCTGG-3′). Amplicon metagenomic sequencing was performed using the Illumina Mi-Seq platform and the analysis and annotation of output data were carried out by QIIME data analysis package [5]. Raw sequences were filtered based on base quality score, average base content per read and GC distribution in the reads. Reads that did not cluster with other sequences i.e. singletons (abundances < 2) were removed. Chimeras were also removed using UCHIME program [6] The pre-processed consensus V3 sequences were finally grouped into operational taxonomic units (OTUs) using the clustering program UCLUST at a similarity threshold of 0.97 [7] All the pre-processed reads were used to identify the OTUs using QIIME program and the representative sequences were aligned against the Greengenes core set reference database using PyNAST program [8]. Representative sequence for each OTU was classified using RDP classifier and Greengenes OTU database.
The output file comprised 161 MB data with a total of 533,120 raw reads having 57.61% GC content. A total of 18 bacterial phyla were detected in our analysis. The most dominant prokaryotic phylum was Actinobacteria (64.07%), a broad class of high G + C, Gram-positive bacteria commonly found in caves and soils [9]. In this phylum, 34.26% reads were classified under the genus Mycobacterium. Other dominant phyla were Firmicutes (17.06%), Proteobacteria (16.43%), Bacteroidetes (1.75%) and Chloroflexi (0.02%) (Fig. 1B). At the family level, Mycobacteriaceae (21.95%) was dominant followed by Bacillaceae (17.04%), Sphingomonadaceae (9.74%), Alteromonadaceae (1.53%), Salinisphaeraceae (0.44%), Xanthomonadaceae (0.39%), Flavobacteriaceae (0.18%) and Moraxellaceae (0.005%). The leading genera were Mycobacterium (21.72%), Rhodococcus (7.09%), Alteromonas (1.42%), Holomonas (0.7%) and Salinisphaera (0.20%) (Supplementary Fig. 1, Supplementary Fig. 2). Among the identified species Rhodococcus fascians was present in high numbers which is reported to participate in Calcite Biomineralization process [10] Our data provides the first scientific report on diverse group of bacteria, using Illumina sequencing method, from the unexplored Khuangcherapuk cave located in a lesser known Northeastern Indian region. The most dominated phylum in this study was actinomycetes which are known to produce valuable secondary metabolites useful for biotechnological applications. This study also detected a huge number of unclassified bacteria which might be representative of novel species.
Fig. 1.
Taxonomy classification of reads at phylum level (A), OTUs at phylum level (B) for the sample. Only top 10 enriched class categories are shown in the figure.
3. Nucleotide sequence accession number
Metagenome sequence data are available at NCBI accession no. SRP056890.
The following are the supplementary data related to this article.
Bacterial community structure from phylum to species level of Khuangcherapuk cave metagenome can be visualized in this file using Krona visualization tool.
Neighbor-joining tree based on the V3 region of 16S rRNA genes classified at the genus level (Newick format).
4. Competing interests
The authors declare that there are no competing interests.
Acknowledgements
This research was funded by a grant from the Bioinformatics Infrastructure Facility sponsored by Department of Biotechnology, Govt. of India, New Delhi.
References
- 1.Amann R.I., Binder B.J., Olson R.J., Chisholm S.W., Devereux R., Stahl D.A. Combination of 16S rRNA targeted oligonucleotide probes with flow-cytometry for analyzing mixed microbial populations. Appl. Environ. Microbiol. 1990;56:1919–1925. doi: 10.1128/aem.56.6.1919-1925.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Mangrola A.V., Dudhagara P., Koringa P., Joshi C.G., Patel R.K. Shotgun metagenomic sequencing based microbial diversity assessment of Lasundra hot spring, India. Genomics Data. 2015;4:73–75. doi: 10.1016/j.gdata.2015.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Xie W., Wang F., Guo L., Chen Z., Sievert S.M., Meng J., Huang G., Li Y., Yan Q., Wu S., Wang X., Chen S., He G., Xiao X., Xu A. Comparative metagenomics of microbial communities inhabiting deep-sea hydrothermal vent chimneys with contrasting chemistries. ISME J. 2011;5(3):414–426. doi: 10.1038/ismej.2010.144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Ghelani A., Patel R., Mangrola A., Dudhagara P. Cultivation-independent comprehensive survey of bacterial diversity in Tulsi Shyam hot springs, India. Genomics Data. 2015;4:54–56. doi: 10.1016/j.gdata.2015.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Caporaso J.G., Kuczynski J., Stombaugh J., Bittinger K., Bushman F.D., Costello E.K., Fierer N., Pena A.G., Goodrich J.K., Gordon, J.I. QIIME allows analysis of high-throughput community sequencing data. Nat. Methods. 2010;7:335–336. doi: 10.1038/nmeth.f.303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Edgar UCHIME improves sensitivity and speed of chimera detection. Bioinformatics. 2011;27(16):2194–2200. doi: 10.1093/bioinformatics/btr381. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Edgar R.C. Search and clustering orders of magnitude faster than BLAST. Bioinformatics. 2010;26:2460–2461. doi: 10.1093/bioinformatics/btq461. [DOI] [PubMed] [Google Scholar]
- 8.DeSantis T.Z., Hugenholtz P., Larsen N., Rojas M., Brodie E.L., Keller K., Huber T., Dalevi D., Hu P., Anderson G.L. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl. Environ. Microbiol. 2016;72:5069–5072. doi: 10.1128/AEM.03006-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Goodfellow M., Williams S.T. Ecology of actinomycetes. Annu. Rev. Microbiol. 1983;37:189–216. doi: 10.1146/annurev.mi.37.100183.001201. [DOI] [PubMed] [Google Scholar]
- 10.Rusznyák A., Akob D.M., Nietzsche S., Eusterhues K., Totsche K., Neu T., Frosch T., Popp P., Keiner R., Geletneky J., Katzschmannn L., Schule E.D., Küsel K. Calcite biomineralization by bacterial isolates from the recently discovered pristine karstic herrenberg cave. Environ. Microbiol. 2012;78:1157–1167. doi: 10.1128/AEM.06568-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Bacterial community structure from phylum to species level of Khuangcherapuk cave metagenome can be visualized in this file using Krona visualization tool.
Neighbor-joining tree based on the V3 region of 16S rRNA genes classified at the genus level (Newick format).