Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2019 Oct 17;8(42):e01001-19. doi: 10.1128/MRA.01001-19

Draft Genome Sequences of 10 Bacterial Strains Isolated from an Abandoned Coal Mine in Southeast Kansas

Brady Steinbock a, Rachel Bechtold a,*, Joseph L Sevigny b, Devin Thomas b, W Kelley Thomas b, Anuradha Ghosh a,
Editor: Irene L G Newtonc
PMCID: PMC6797535  PMID: 31624170

Here, we report 10 bacterial strains isolated from an abandoned coal mine in southeast Kansas to determine their potential for bioremediation through comparison of the genome sizes and distribution patterns of unique metabolic genes. The selected strains belong to the genera Arthrobacter, Jeotgalibacillus, Kocuria, Microbacterium, Pantoea, Rhodococcus, Vibrio, Brevibacterium, and Paenibacillus.

ABSTRACT

Here, we report 10 bacterial strains isolated from an abandoned coal mine in southeast Kansas to determine their potential for bioremediation through comparison of the genome sizes and distribution patterns of unique metabolic genes. The selected strains belong to the genera Arthrobacter, Jeotgalibacillus, Kocuria, Microbacterium, Pantoea, Rhodococcus, Vibrio, Brevibacterium, and Paenibacillus.

ANNOUNCEMENT

For over half a century, sizeable areas of land throughout the Midwest were mined for various underground resources, one of the most notable being coal. Once mined, the lands were left for years. Some areas were partially reclaimed following the Surface Mining Control and Reclamation Act of 1977. The major environmental pollutant that became established in those reclaimed as well as abandoned areas was acid mine drainage (AMD) (1). Pyrite, which is usually found in anaerobic environments, is exposed to atmospheric oxygen and water after coal mines are abandoned, leading to the formation of sulfuric acid and iron. The iron precipitates at the shores and beds of mined land, and sulfuric acid runoff dissolves heavy metals into ground and surface water. Naturally occurring acidophilic bacteria in mined land can increase the oxidation rate of pyrite, while other microbes may offer a solution to the detrimental effects of AMD (2). In the present study, whole-genome sequencing (WGS) analysis provided insight into the presence of genes related to stress, resistance, and metabolism. Thus, application of these isolates is not limited to restoration of AMD sites but also could be exploited for bioremediation of heavy metals and recalcitrant chemicals in contaminated sites.

Soil samples were collected from a depth of 5 cm at five specific sites in the partially reclaimed soil as well as from an AMD site. Up to 60 bacterial isolates were characterized using physiological and biochemical tests (3). Out of these, 10 strains were selected for WGS analysis based on their metabolic diversity. Selected pure cultures were grown overnight in tryptic soy broth (BD Difco, Franklin Lakes, NJ), and whole genomic DNA was isolated with a GenElute bacterial genomic DNA kit (Sigma-Aldrich Corporation, Natick, MA) following the manufacturer’s protocol. Finally, a NanoDrop Lite (Thermo Fisher Scientific, Waltham, MA) analysis was performed on each isolated DNA for quantification and to ensure sample purity.

WGS was performed at the Hubbard Center for Genome Studies (University of New Hampshire, Durham, NH). A paired-end library was constructed using a Nextera DNA library preparation kit (Illumina, San Diego, CA) and sequenced with an Illumina HiSeq 2500 instrument to produce 250-bp paired-end reads. The total numbers of reads for all 10 strains are listed in Table 1. FASTQ files were trimmed for Nextera adapters and low-quality bases using Trimmomatic version 0.32 (4). For read trimming, trailing and leading bases were removed if the quality score was below 3. In addition, the reads were scanned using a 4-base sliding window and trimmed if the average quality dropped below 15. Trimmed sequencing reads were then assembled using the SPAdes pipeline version 3.5 (5) with default settings. QUAST version 4.6.0 (6) was used to assess the contiguity of the assemblies, and coverage statistics were calculated by mapping FASTQ reads to the assembled contigs with the Burrows-Wheeler Aligner MEM algorithm (BWA-MEM) with default settings. The assembled genomes were annotated via the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) (7), while the identity of these strains was determined by performing a BLASTn (8) search on 16S rRNA and translation initiation factor-2 (IF-2) gene sequences. The assembly metrics and annotated features are given in Table 1.

TABLE 1.

Accession numbers, assembly metrics, and annotated features of the sequenced environmental strains isolated from an abandoned coal mine in southeast Kansas

Bacterial species Strain GenBank accession no. Avg coverage (×) No. of contigs Total no. of reads Genome assembly size (bp) N50 value (bp) G+C content (%) No. of coding genes No. of rRNAs No. of tRNAs No. of ncRNAsa
Vibrio vulnificus S-C3 SMZZ00000000 447 31 11,287,962 5,792,187 849,468 46.67 5,330 24 81 6
[Brevibacterium] frigoritolerans F-D2 SMZY00000000 1,408 21 12,709,518 4,568,742 1,243,018 40.87 4,411 21 77 5
Microbacterium oleivorans F-B2 SMZX00000000 577 5 7,893,216 3,092,005 1,655,501 46.55 2,911 5 45 3
Paenibacillus dendritiformis F-A1 SMZW00000000 340 33 10,817,884 6,777,904 735,989 53.98 5,788 21 80 4
Paenibacillus amylolyticus S-B4 SMZV00000000 267 21 8,167,626 7,067,323 3,885,192 45.81 6,008 27 80 4
Rhodococcus qingshengii S-E5 SMZU00000000 431 45 12,222,178 6,609,607 474,786 38.17 6,021 32 115 7
Kocuria rosea S-A3 SMZT00000000 576 18 9,751,028 3,953,419 565,760 72.67 3,503 5 48 3
Pantoea ananatis F-C2 SMZS00000000 239 29 6,247,448 4,805,071 405,717 53.64 4,346 13 71 20
Jeotgalibacillus sp. S-D1 SMZR00000000 558 49 9,813,060 3,978,119 790,309 41.33 3,789 18 77 5
Arthrobacter nitrophenolicus S-A1 SMZQ00000000 479 43 8,918,834 4,614,804 320,201 65.99 4,168 8 50 3
a

ncRNAs, noncoding RNAs.

Data availability.

The draft genome sequences of these environmental strains, as well as the accession numbers for both the assembly and raw reads, are available at DDBJ/ENA/GenBank under the BioProject number PRJNA523266, and the described accession numbers are listed in Table 1.

ACKNOWLEDGMENTS

This project was partly funded by Pittsburg State University Graduate and Continuing Studies and the Kansas IDeA Network of Biomedical Research Excellence (K-INBRE) (grant P20 GM103418, National Institute of General Medical Sciences). The whole-genome sequencing was supported by the New Hampshire IDeA Network of Biomedical Research Excellence (NH-INBRE) (grant P20 GM103506, National Institute of General Medical Sciences).

We declare no conflicts of interest.

REFERENCES

  • 1.Johnson DB, Hallberg KB. 2003. The microbiology of acidic mine waters. Res Microbiol 154:466–473. doi: 10.1016/S0923-2508(03)00114-1. [DOI] [PubMed] [Google Scholar]
  • 2.Kolmert Å, Johnson DB. 2001. Remediation of acidic waste waters using immobilised, acidophilic sulfate‐reducing bacteria. J Chem Technol Biotechnol 76:836–843. doi: 10.1002/jctb.453. [DOI] [Google Scholar]
  • 3.Bechtold R. 2017. Bacterial diversity at an abandoned coal mine in southeast Kansas. Master’s thesis. Pittsburg State University, Pittsburg, Kansas: https://digitalcommons.pittstate.edu/etd/335/. [Google Scholar]
  • 4.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Nurk S, Bankevich A, Antipov D, Gurevich AA, Korobeynikov A, Lapidus A, Prjibelski AD, Pyshkin A, Sirotkin A, Sirotkin Y, Stepanauskas R, Clingenpeel SR, Woyke T, Mclean JS, Lasken R, Tesler G, Alekseyev MA, Pevzner PA. 2013. Assembling single-cell genomes and mini-metagenomes from chimeric MDA products. J Comput Biol 20:714–737. doi: 10.1089/cmb.2013.0084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The draft genome sequences of these environmental strains, as well as the accession numbers for both the assembly and raw reads, are available at DDBJ/ENA/GenBank under the BioProject number PRJNA523266, and the described accession numbers are listed in Table 1.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES