Draft Genome Sequence of a Tepidicella baoligensis Strain Isolated from an Oil Reservoir

Shan Hong; Guan Wang; Jiliang Yu; Jing You; Yanfen Xue

doi:10.1128/MRA.00796-20

. 2020 Sep 24;9(39):e00796-20. doi: 10.1128/MRA.00796-20

Draft Genome Sequence of a Tepidicella baoligensis Strain Isolated from an Oil Reservoir

Shan Hong ^a, Guan Wang ^b, Jiliang Yu ^b, Jing You ^b, Yanfen Xue ^a,^✉

Editor: Frank J Stewart^c

PMCID: PMC7516150 PMID: 32972939

We report the draft genome sequence of Tepidicella baoligensis strain B18-50^T, isolated from a high-temperature oil well in Baolige Oilfield, China. The estimated genome is 2.87 Mb, with 2,653 protein-coding sequences.

ABSTRACT

ANNOUNCEMENT

Tepidicella species are slightly thermophilic Gram-negative rods that are distributed in geothermal areas. These organisms do not assimilate carbohydrates or polyols but use organic acids as carbon and energy sources and oxidize thiosulfate to sulfate in the presence of an assimilable carbon source (1, 2). Tepidicella baoligensis strain B18-50^T was isolated from the production liquid of a high-temperature oilfield in China (2). The whole-genome sequence will help in understanding its ecological function in the oil reservoir environment.

Strain B18-50^T was isolated from oil well production liquid from Baolige Oilfield (44°86′N, 115°81′E) in China. The liquid sample was diluted with sterilized enrichment medium (EN), plated onto EN agar, and incubated at 50°C (2). A pure strain was obtained by repeated streaking on the same medium. Strain B18-50^T was routinely cultured at 45°C in modified Degryse 162 medium (3) supplemented with 10 g liter⁻¹ succinate, 1.0 g liter⁻¹ yeast extract (Oxoid), and 1.0 g liter⁻¹ tryptone (Oxoid). Genomic DNA was prepared from an overnight culture by using the phenol-chloroform extraction method (4). The quality and concentration of DNA were determined using a Quantus fluorometer with the Quant-iT PicoGreen double-stranded DNA (dsDNA) assay kit (Thermo Fisher Scientific, USA).

DNA samples were sheared into 400- to 500-bp fragments using a Covaris M220 focused acoustic shearer following the manufacturer’s protocol. The Illumina paired-end (PE) libraries were prepared from the sheared fragments using the NEXTflex rapid DNA-Seq kit (Bioo Scientific, USA) and sequenced in 150-bp paired-end mode using the Illumina HiSeq X Ten platform at Majorbio Bio-Pharm Technology, Inc. (Shanghai, China). The sequencing generated 3,449,944 pairs of raw reads totaling 1,041,883,088 bp, giving approximately 362× coverage. The reads were quality trimmed with Trimmomatic v0.36 (5) and assembled using SOAPdenovo v2.04 (https://github.com/aquaskyline/SOAPdenovo2) (6). The resultant assembly totaled 2,874,470 bp with 47 scaffolds, an N₅₀ value of 232,753 bp, and a GC content of 65.2%.

The genomic scaffolds were analyzed using the I-Sanger Cloud platform from Shanghai Majorbio. Glimmer v3.02 (http://ccb.jhu.edu/software/glimmer/index.shtml) (7) was used for coding DNA sequence (CDS) prediction, tRNA-scan-SE v2.0 (http://trna.ucsc.edu/software/) (8) was used for tRNA prediction, and Barrnap v0.8 (https://github.com/tseemann/barrnap) was used for rRNA prediction. The predicted CDSs were annotated from the nonredundant (NR), Swiss-Prot, Pfam, GO, COG, and KEGG databases using the sequence alignment tools BLAST+ v2.3.0 (9), Diamond v0.8.35 (10), and HMMER v3.1b2 (11). A total of 2,709 CDS genes in addition to 48 tRNAs and 5 rRNAs were annotated for the draft genome sequence, of which 2,653 are protein coding genes. Default parameters were used for all software.

Data availability.

This whole-genome shotgun sequencing project has been deposited in DDBJ/ENA/GenBank under the accession number JACCDB000000000; the raw reads are available under SRA accession number SRR12179937. This announcement represents the first version of the genome.

ACKNOWLEDGMENTS

This work was funded by the National Key R&D Program of China (2018YFA0902101) and the China National Petroleum Corporation (2017E-1507-2).

REFERENCES

1.França L, Rainey FA, Nobre MF, da Costa MS. 2006. Tepidicella xavieri gen. nov., sp. nov., a betaproteobacterium isolated from a hot spring runoff. Int J Syst Evol Microbiol 56:907–912. doi: 10.1099/ijs.0.64193-0. [DOI] [PubMed] [Google Scholar]
2.You J, Li Y, Hong S, Wang J, Yu J, Mu B, Ma X, Xue Y. 2019. Tepidicella baoligensis sp. nov., a novel member of betaproteobacterium isolated from an oil reservoir. Curr Microbiol 76:410–414. doi: 10.1007/s00284-018-1604-z. [DOI] [PubMed] [Google Scholar]
3.Degryse E, Glansdorff N, Piérard A. 1978. A comparative analysis of extreme thermophilic bacteria belonging to the genus Thermus. Arch Microbiol 117:189–196. doi: 10.1007/BF00402307. [DOI] [PubMed] [Google Scholar]
4.Mesapogu S, Jillepalli CM, Arora DK. 2013. Microbial DNA extraction, purification, and quantitation, p 1–16. In Arora DK, Das S, Sukumar M (ed), Analyzing microbes: manual of molecular biology techniques. Springer-Verlag, Berlin, Germany. [Google Scholar]
5.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu S-M, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam T-W, Wang J. 2012. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1:18. doi: 10.1186/2047-217X-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Buchfink B, Xie C, Huson DH. 2015. Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. doi: 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]
11.Eddy SR. 2011. Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195. doi: 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

[B1] 1.França L, Rainey FA, Nobre MF, da Costa MS. 2006. Tepidicella xavieri gen. nov., sp. nov., a betaproteobacterium isolated from a hot spring runoff. Int J Syst Evol Microbiol 56:907–912. doi: 10.1099/ijs.0.64193-0. [DOI] [PubMed] [Google Scholar]

[B2] 2.You J, Li Y, Hong S, Wang J, Yu J, Mu B, Ma X, Xue Y. 2019. Tepidicella baoligensis sp. nov., a novel member of betaproteobacterium isolated from an oil reservoir. Curr Microbiol 76:410–414. doi: 10.1007/s00284-018-1604-z. [DOI] [PubMed] [Google Scholar]

[B3] 3.Degryse E, Glansdorff N, Piérard A. 1978. A comparative analysis of extreme thermophilic bacteria belonging to the genus Thermus. Arch Microbiol 117:189–196. doi: 10.1007/BF00402307. [DOI] [PubMed] [Google Scholar]

[B4] 4.Mesapogu S, Jillepalli CM, Arora DK. 2013. Microbial DNA extraction, purification, and quantitation, p 1–16. In Arora DK, Das S, Sukumar M (ed), Analyzing microbes: manual of molecular biology techniques. Springer-Verlag, Berlin, Germany. [Google Scholar]

[B5] 5.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B6] 6.Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu S-M, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam T-W, Wang J. 2012. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1:18. doi: 10.1186/2047-217X-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B7] 7.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B8] 8.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] 9.Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10.Buchfink B, Xie C, Huson DH. 2015. Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. doi: 10.1038/nmeth.3176. [DOI] [PubMed] [Google Scholar]

[B11] 11.Eddy SR. 2011. Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195. doi: 10.1371/journal.pcbi.1002195. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Draft Genome Sequence of a Tepidicella baoligensis Strain Isolated from an Oil Reservoir

Shan Hong

Guan Wang

Jiliang Yu

Jing You

Yanfen Xue

Roles

ABSTRACT

ANNOUNCEMENT

Data availability.

ACKNOWLEDGMENTS

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Draft Genome Sequence of a Tepidicella baoligensis Strain Isolated from an Oil Reservoir

Shan Hong

Guan Wang

Jiliang Yu

Jing You

Yanfen Xue

Roles

ABSTRACT

ANNOUNCEMENT

Data availability.

ACKNOWLEDGMENTS

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases