Microbiology Resource Announcements. 2024 Feb 1;13(3):e01065-23. doi: 10.1128/mra.01065-23

Draft genome sequence of Dietzia sp. strain CH92 isolated from oil reservoir

Wei Xiang 1,2,#, Quan Zhang 1,#, Jianxin Wang 2,3, Yanfen Xue 2, Bo Yu 2
Editor: J Cameron Thrash 4
PMCID: PMC10927634  PMID: 38299820

ABSTRACT

We report the draft genome sequence of Dietzia sp. strain CH92, isolated from a high-temperature oil well in the Baolige oilfield, China. The estimated genome size is 3.73 Mb, with 3,479 protein-coding sequences.

KEYWORDS: Dietzia sp., hydrocarbons, degradation

ANNOUNCEMENT

Dietzia species are aerobic, Gram-positive bacteria distributed in various environments, including crude oil reservoirs (1, 2). These organisms can use carbohydrates, organic acids, and hydrocarbons as carbon and energy sources (2). Dietzia sp. strain CH92 was isolated from oil-well production liquid from the Baolige oilfield (44°86′N, 115°81′E) in China. The whole-genome sequence will help clarify its function and application potential.

Strain CH92 was isolated by the dilution plating method. The oil-well production liquid sample was collected from a sampling valve on the wellhead pipeline by completely filling 25 L plastic sampling bottles. The samples were transported to the laboratory and stored overnight at ambient temperature (about 15–20°C). Because the cell biomass in this water sample was low, we first concentrated it using a 0.22 µm hollow-fiber filter (MOF-1d; Motech Co. Ltd., Tianjin, China). The concentrated liquid was then diluted with sterilized enrichment medium (ENM), plated on ENM agar, and incubated at 45°C (3). The pure strain was obtained by repeated streaking on agar plates of the same medium. Strain CH92 was routinely cultured at 37°C in modified DSMZ 878 medium (4) supplemented with 10 g L−1 succinate or 1% (wt/vol) n-hexadecane as the carbon source. Genomic DNA was prepared from an overnight culture grown in the modified DSMZ 878 medium supplemented with succinate, using the TIANamp Bacteria DNA kit (TIANGEN, China). The quality and concentration of the DNA were determined using a Quantus Fluorometer with the Quant-iT PicoGreen dsDNA Assay Kit (Thermo Fisher Scientific, USA).

DNA samples were sheared into 400–500 bp fragments using a Covaris M220 Focused Acoustic Shearer following the manufacturer’s protocol. Illumina paired-end libraries were prepared from the sheared fragments using the NEXTflex Rapid DNA-Seq Kit (Bioo Scientific, USA) and sequenced in 150 bp paired-end mode on the Illumina HiSeq X Ten platform at Majorbio Bio-Pharm Technology Inc. (Shanghai, China). Sequencing generated 5,876,037 pairs of raw reads totaling 1,774,563,174 bp, giving approximately 473× coverage. The reads were quality-trimmed with Trimmomatic v.0.36 (5) and assembled using SOAPdenovo v2 (6). The resulting assembly totaled 3,729,715 bp in 32 contigs, with an N50 value of 636,736 bp and a GC content of 71.40%.
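The coverage and N50 figures above follow from simple arithmetic, which the minimal Python sketch below illustrates. It is not the authors' pipeline. The function names are hypothetical, and the contig lengths passed to `n50` are made-up toy values, since the individual contig lengths are not given in the text. Dividing the reported raw base total by the assembly size gives roughly 476×, while assuming exactly 150 bases per read gives roughly 473×, so the reported value appears to be based on the nominal read length.

```python
def depth_of_coverage(total_bases, genome_size):
    """Average depth = sequenced bases / genome (or assembly) length."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return total_bases / genome_size


def n50(contig_lengths):
    """Length of the contig at which the running sum of lengths, taken
    from longest to shortest, first reaches half of the assembly size."""
    if not contig_lengths:
        raise ValueError("need at least one contig")
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length


# Figures reported in the text.
READ_PAIRS = 5_876_037
RAW_BASES = 1_774_563_174
ASSEMBLY_BP = 3_729_715

# Depth from the reported raw base total (about 475.8).
depth_from_raw_bases = depth_of_coverage(RAW_BASES, ASSEMBLY_BP)

# Depth assuming exactly 2 x 150 bases per read pair (about 472.6).
depth_from_nominal = depth_of_coverage(READ_PAIRS * 2 * 150, ASSEMBLY_BP)

# Hypothetical contig lengths, used only to exercise n50().
toy_n50 = n50([500, 300, 200, 100])  # 300
```

Both depth values are computed against the assembly length. Using a true genome size, if it were known, would shift them slightly.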

The genomic contigs were analyzed using the I-Sanger Cloud Platform from Shanghai Majorbio in March 2021. Glimmer v3.02 (http://ccb.jhu.edu/software/glimmer/index.shtml) (7) was used for coding DNA sequence (CDS) prediction, tRNAscan-SE v2.0 (http://trna.ucsc.edu/software/) (8) was used for tRNA prediction, and Barrnap v0.8 (https://github.com/tseemann/barrnap) was used for rRNA prediction. The predicted CDSs were annotated against the NR, Swiss-Prot, Pfam, GO, COG, and KEGG databases using the sequence alignment tools BLAST+ v2.3.0 (9), Diamond v0.8.35 (10), and HMMER v3.1b2 (11). A total of 3,479 CDSs, 50 tRNAs, and 5 rRNAs were annotated in the draft genome sequence. Default parameters were used for all software. The annotation is also available on figshare at https://figshare.com/articles/online_resource/The_genome_annotation_with_predicted_functions_for_each_and_every_gene_of_i_Dietzia_i_sp_strain_CH92/24967986.
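Feature totals such as the CDS, tRNA, and rRNA counts above can be tallied from an annotation table. The Python sketch below assumes the annotation is available as GFF3 text. The source does not state which file formats the I-Sanger platform exports, so both the format and the function name are assumptions, and the example records are invented for illustration.

```python
from collections import Counter


def count_feature_types(gff3_text):
    """Count GFF3 records by feature type (column 3), skipping comment
    lines, blank lines, and lines that do not have nine tab-separated fields."""
    counts = Counter()
    for line in gff3_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) != 9:
            continue
        counts[fields[2]] += 1
    return counts


# Invented records, not taken from the CH92 annotation.
EXAMPLE_GFF3 = "\n".join([
    "##gff-version 3",
    "contig1\tdemo\tCDS\t10\t900\t.\t+\t0\tID=cds1",
    "contig1\tdemo\tCDS\t950\t1800\t.\t-\t0\tID=cds2",
    "contig1\tdemo\ttRNA\t2000\t2075\t.\t+\t.\tID=trna1",
    "contig2\tdemo\trRNA\t1\t1520\t.\t+\t.\tID=rrna1",
])

example_counts = count_feature_types(EXAMPLE_GFF3)
```

Counting by feature type in this way makes it easy to cross-check reported totals against a deposited annotation file.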

ACKNOWLEDGMENTS

This work was funded by the National Key R&D Program of China (2018YFA0902100).

B.Y., Conceptualization, Project administration, Resources, Writing—review and editing; Q.Z. and W.X., Formal analysis, Investigation, Methodology; J.W., Data curation, Methodology; Y.X., Project administration, Resources, Supervision, Writing—original draft.

Contributor Information

Bo Yu, Email: yub@im.ac.cn.

J. Cameron Thrash, University of Southern California, USA.

DATA AVAILABILITY

This whole-genome shotgun sequencing project has been deposited in DDBJ/ENA/GenBank under the accession number JAVHXC000000000; the raw reads are available under SRA accession number SRR24848572. This announcement represents the first version of the genome.

REFERENCES

  • 1. Olowo-Okere A, Ibrahim YKE, Lo CI, Olayinka BO, Yimagou EK, Yacouba A, Mohammed Y, Nabti LZ, Ragueh AA, Lupande D, Raoult D, Rolain J-M, Diene SM. 2022. Bhargavaea massiliensis sp. nov. and Dietzia massiliensis sp. nov., novel bacteria species isolated from human urine samples in Nigeria. Curr Microbiol 79:18. doi: 10.1007/s00284-022-02838-0
  • 2. Pukall R. 2014. The family Dietziaceae, p 327–338. In Rosenberg E, DeLong EF, Lory S, Stackebrandt E, Thompson F (ed), The Prokaryotes: Actinobacteria, 4th edn. Springer, Berlin.
  • 3. Xiang W, Liang Y, Hong S, Wang G, You J, Xue YF, Ma YH. 2022. Degradation of long-chain n-alkanes by a novel thermal-tolerant Rhodococcus strain. Arch Microbiol 204:259. doi: 10.1007/s00203-022-02872-3
  • 4. Degryse E, Glansdorff N, Piérard A. 1978. A comparative analysis of extreme thermophilic bacteria belonging to the genus Thermus. Arch Microbiol 117:189–196. doi: 10.1007/BF00402307
  • 5. Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170
  • 6. Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, et al. 2012. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. Gigascience 1:18. doi: 10.1186/2047-217X-1-18
  • 7. Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009
  • 8. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955
  • 9. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. doi: 10.1186/1471-2105-10-421
  • 10. Buchfink B, Xie C, Huson DH. 2015. Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60. doi: 10.1038/nmeth.3176
  • 11. Eddy SR. 2011. Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195. doi: 10.1371/journal.pcbi.1002195

