Skip to main content
Data in Brief logoLink to Data in Brief
. 2019 Nov 11;28:104791. doi: 10.1016/j.dib.2019.104791

Metagenomic data on bacterial diversity profiling of Arabian sea sediment by amplicon sequencing

Harisree P Nair 1, Sarita G Bhat 1,
PMCID: PMC6909178  PMID: 31871968

Abstract

This data is about the microbial community genome analysis of Arabian sea sediment by Illumina sequencing by targeting the hypervariable region V3 of 16S rRNA gene. The data analysis revealed the existence of numerous unknown sequences, indicating a large unexploited bacterial diversity in the area. The raw sequence data used for analysis is available in NCBI under the Sequence Read Archive (SRA) with the BioProject No. PRJNA397165 and SRA accession number SRP125840.

Keywords: Community genome, Arabian sea, 16S rRNA, Marine


Specifications Table

Subject area Biology
More specific subject area Marine Metagenomics
Type of data Figures
How data was acquired Illumina MiSeq platform
Data format Raw and analyzed
Experimental factors Arabian Sea sediment of 96 m depth were collected, representing the most productive epipelagic zone
Experimental features Metagenomic DNA extraction and amplicon sequencing of V3 region of 16S rRNA gene
Data source location Arabian Sea (9°59′10.9968″ N; 75° 39′ 26.4564″ E)
Data accessibility The sequencing data is available in NCBI under the Sequence Read Archive (SRA) with the BioProject No. PRJNA397165 and SRA accession number SRP125840. The direct URL to data ishttps://www.ncbi.nlm.nih.gov/sra/?term=SRP125840
Value of the Data
  • The data provides insights into the hidden microbial diversity of Arabian sea sediments which can utilized as a treasure trove of novel biomolecules

  • The sequencing data is publicly available for comparative studies of microbial diversity in global oceans.

  • The scientific community is informed through the study about the existence of several unidentified sequences indicating the high incidence of novel yet-to-be cultured bacteria in the Arabian sea epipelagic sediments

1. Data

The largest habitable space for living organisms, particularly microorganisms is the marine realm, covering 70% of the planet surface. These microbial communities are key players in marine ecosystem maintenance [1]. Study of marine microbial biodiversity is of great significance, for it enables understanding biogeochemical cycles prevailing in the area. To harness these enormous genetic diversities in toto, metagenomic procedures can be applied. However, advances in next-generation sequencing methods have accelerated the large-scale exploration of taxonomic diversity of bacterial communities from diverse environments [[2], [3], [4]]. The complete focus is on presenting the taxonomic profile of bacterial communities of Arabian sea sediment.

6309 Operational Taxonomic Units (OTUs) were identified from the sequencing data and segregated into diverse taxonomic level of bacterial domains, which were classified into 43 bacterial phyla including 18 formally described bacterial phyla (Fig. 1) and 25 candidate phyla (Fig. 2).

Fig. 1.

Fig. 1

Taxonomic distribution of OTUs at phylum level from Arabian sea sediment.

Fig. 2.

Fig. 2

Taxonomic classification of OTUs at candidate phylum level from Arabian sea sediment.

Phylum Proteobacteria with 2932 OTUs was most abundant (at 46.47% of the total diversity). 16.45% (1038 OTUs) represented novel yet to be cultured organisms in Arabian Sea sediments awaiting discovery. 476 OTUs belonged to Acidobacteria, 369 OTUs to Chloroflexi, 283 OTUs to Bacteroidetes, while 182 OTUs from Actinobacteria and Gemmatimonadetes were also identified. Firmicutes, Nitrospirae, Spirochaetes, Planctomycetes, Chlorobi, Fusobacteria, Tenericutes, Cyanobacteria, Verrucomicrobia, Fibrobacteres, Deinococcus-Thermus and Elusimicrobia contributed less than 2% of the total identified OTUs.

2. Experimental design, materials, and methods

Marine sediments were collected from eastern Arabian Sea (9°59′10.9968″ N; 75° 39′ 26.4564″ E) onboard the research vessel FORV Sagar Sampada (Cruise No: 305) during August 2012 using grab at a depth of 96 m. Community DNA was isolated by modifying the classical method by utilizing liquid nitrogen for grinding sediment sample [5,6]. The V3 hypervariable region of 16S rRNA gene was amplified using 341F 5′-CCTACGGGAGGCAGCAG-3′ and 518R 5′-ATTACCGCGGCTGCTGG-3′ primer pairs with appropriate dilution of metagenomic DNA as template. Purified PCR product was used for a second PCR reaction which attached Illumina sequencing adapters and dual-index barcodes to the amplicon target. Sequencing reactions (151 bp × 2 paired end reads) were performed using the MiSeq platform (Illumina, Inc., CA, USA) following manufacturer's instructions. Raw sequencing data obtained were quantity filtered and processed using QIIME [7].

Acknowledgments

Authors acknowledge the financial support from University Grants Commission, Govt. of India through Major research project vide F.No. 41/527/2012 (SR).

Conflict of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  • 1.Fuhrman J.A. Microbial community structure and its functional implications. Nature. 2009;459:193–199. doi: 10.1038/nature08058. [DOI] [PubMed] [Google Scholar]
  • 2.Qiu Z., Li N., Lu X., Zheng Z., Zhang M., Qiao X. Characterization of microbial community structure and metabolic potential using Illumina MiSeq platform during the black garlic processing. Food Res. Int. 2018;106:428–438. doi: 10.1016/j.foodres.2017.12.081. [DOI] [PubMed] [Google Scholar]
  • 3.Chen C., Peng X., Huang S., Wang Y., Liao S., Wei Y. Data on microbial community composition of sludge from high altitude wastewater treatment plants determined by 16S rRNA gene sequencing. Data Brief. 2019;23:103739. doi: 10.1016/j.dib.2019.103739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Marshall P.G., Karst S.M., Nielsen P.H., Jørgensen B.B. Metagenomes from deep Baltic Sea sediments reveal how past and present environmental conditions determine microbial community composition. Mar. Genomics. 2018;37:58–68. doi: 10.1016/j.margen.2017.08.004. [DOI] [PubMed] [Google Scholar]
  • 5.Volossiouk T., Robb E.J., Nazar R.N. Direct DNA extraction for PCR-mediated assays of soil organisms. Appl. Environ. Microbiol. 1995;61(11):3972–3976. doi: 10.1128/aem.61.11.3972-3976.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Nair H.P., Vincent H., Bhat S.G. Evaluation of five in situ lysis protocols for PCR amenable metagenomic DNA from mangrove soils. Biotechnol. Rep. 2014;4:134–138. doi: 10.1016/j.btre.2014.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Caporaso J.G., Kuczynski J., Stombaugh J., Bittinger K., Bushman F.D., Costello E.K., Fierer N., Pena A.G., Goodrich J.K., Gordon J.I., Huttley G.A., Kelley S.T., Knights D., Koenig J.E., Ley R.E., Lozupone C.A., McDonald D., Muegge B.D., Pirrung M., Reeder J., Sevinsky J.R., Turnbaugh P.J., Walters W.A., Widmann J., Yatsunenko T., Zaneveld J., Knight R. QIIME allows analysis of high-throughput community sequencing data. Nat. Methods. 2010;7(5):335–336. doi: 10.1038/nmeth.f.303. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES