ABSTRACT
We report a draft genome sequence of Rhodococcus erythropolis IEGM 746 isolated from oil-polluted soil from an oil-extracting enterprise, Udmurt Republic, Russia. This strain was able to degrade ketoprofen, a commonly used nonsteroidal anti-inflammatory drug. Using the obtained sequence, putative genes encoding enzymes for ketoprofen degradation were revealed.
ANNOUNCEMENT
Ketoprofen [KTP; C16H14O3; CAS 22071-15-4; 2-(3-benzoylphenyl)propanoic acid] is a commonly used nonsteroidal anti-inflammatory drug. Entering the environment, this biologically active compound becomes an emerging micropollutant with significant toxicological effects against biota (1, 2). Biological degradation plays the key role in removal of ketoprofen from the environment (3–5); however, this process is not fully elucidated. The Rhodococcus erythropolis strain IEGM 746 was isolated from oil-polluted soil near an oil-extracting enterprise, Udmurt Republic, Russia, and deposited in the Regional Specialized Collection of Alkanotrophic Microorganisms (acronym IEGM, number 768 WDCM, http://www.iegmcol.ru/strains/rhodoc/eryth/r_eryth746.html). This strain degraded up to 38% ketoprofen at its initial concentration of 100 mg/L in a mineral salt medium supplemented with 0.1% n-hexadecane as a cosubstrate in 14 days (6).
DNA was isolated from the R. erythropolis IEGM 746 cells preliminarily grown in LB broth at 28°C and 160 rpm for 28 h using the MagMAX DNA Multi-Sample Ultra 2.0 kit (Thermo Fisher Scientific) and a KingFisher Flex automatic station (Thermo Fisher Scientific) following the manufacturer’s protocol. A paired-end library (2 × 100 nucleotides [nt]) was produced using Illumina DNA Prep and sequenced on an Illumina NovaSeq 6000 instrument. A total number of 18,564,421 reads was obtained. Demultiplexing of the sequenced reads was performed with Illumina bcl2fastq (2.20). Adapters were trimmed with Skewer (version 0.2.2) (7). The quality of the FASTQ files was analyzed with FastQC (version 0.11.5-cegat) (8). The data were assembled with SPAdes v. 3.14.1 (9) at the scaffold level. Assembly statistics were computed using QUAST 5.2.0 (10). The assembly consisted of 78 contigs with a total sequence length of 6,772,728 bp, an N50 value of 516,745 bp, a GC content of 68.02%, and coverage of 275×. Average nucleotide identity (ANI) was calculated using the online tool https://www.ezbiocloud.net/tools/ani (11), and the highest OrthoANIu values of 95.38% and 98.74% were obtained for R. erythropolis (NCBI: txid1833) and Rhodococcus qingshengii (NCBI: txid334542) species consequently. Currently, R. qingshengii is a synonym of R. erythropolis (https://lpsn.dsmz.de/species/rhodococcus-qingshengii), and we consider the IEGM 746 strain as belonging to the R. erythropolis species. The annotation of coding sequences (CDS) was performed using PGAP 5.3 (annotation method “best-placed reference protein set,” GeneMarkS-2+) (12) and RAST 2.0 (annotation scheme RASTtk) (13). Default parameters were used for all software unless otherwise specified.
A total of 6,303 CDS, 6,249 CDS with protein, 3 rRNAs (5S, 16S, and 23S), and 52 tRNAs were found in the R. erythropolis IEGM 746 genome. Among CDS, those coding for monooxygenases (66 CDS), dioxygenases (28 CDS), proteins with decarboxylase activity (21 CDS), peroxidases (10 CDS), multicopper oxidases (5 CDS), vanillate O-demethylase oxidoreductases (EC 1.14.13.-) (3 CDS), and dehydrogenases (367 CDS) were revealed. Products of these genes can participate in ketoprofen degradation via oxidation, decarboxylation, or demethylation reactions.
Data availability.
This Whole-Genome Shotgun project has been deposited in DDBJ/ENA/GenBank under the accession numbers JAJNDF010000001 to JAJNDF010000078. The version described in this paper is the first version, JAJNDF000000000.1. Raw sequence reads were deposited in the Sequence Read Archive with accession number SRR17030444 under BioSample accession number SAMN23420022 and BioProject accession number PRJNA783162.
ACKNOWLEDGMENTS
The work was supported by the Ministry of Science and Higher Education (state task АААА-А19-119112290008-4) and the Russian Science Foundation (project number 21-14-00132).
The draft genome was sequenced and assembled at CeGaT GmbH, Tuebingen, Germany (www.cegat.de).
Contributor Information
Anastasiia V. Krivoruchko, Email: nast@iegm.ru.
J. Cameron Thrash, University of Southern California
REFERENCES
- 1.Borecka M, Siedlewicz G, Haliński ŁP, Sikora K, Pazdro K, Stepnowski P, Białk-Bielińska A. 2015. Contamination of the southern Baltic Sea waters by the residues of selected pharmaceuticals: method development and field studies. Mar Pollut Bull 94:62–71. doi: 10.1016/j.marpolbul.2015.03.008. [DOI] [PubMed] [Google Scholar]
- 2.Alkimin GD, Soares AMVM, Barata C, Nunes B. 2020. Evaluation of ketoprofen toxicity in two freshwater species: effects on biochemical, physiological and population endpoints. Environ Pollut 265:114993. doi: 10.1016/j.envpol.2020.114993. [DOI] [PubMed] [Google Scholar]
- 3.Xu J, Chen W, Wu L, Chang AC. 2009. Adsorption and degradation of ketoprofen in soils. J Environ Qual 38:1177–1182. doi: 10.2134/jeq2008.0347. [DOI] [PubMed] [Google Scholar]
- 4.Almeida B, Oehmen A, Marques R, Brito D, Carvalho G, Barreto Crespo MT. 2013. Modelling the biodegradation of non-steroidal anti-inflammatory drugs (NSAIDs) by activated sludge and a pure culture. Bioresour Technol 133:31–37. doi: 10.1016/j.biortech.2013.01.035. [DOI] [PubMed] [Google Scholar]
- 5.Ismail MM, Essam TM, Ragab YM, Mourad FE. 2016. Biodegradation of ketoprofen using a microalgal–bacterial consortium. Biotechnol Lett 38:1493–1502. doi: 10.1007/s10529-016-2145-9. [DOI] [PubMed] [Google Scholar]
- 6.Bazhutin GA, Polygalov MA, Tyumina EA, Tyan SM, Ivshina IB. 2022. Cometabolic bioconversion of ketoprofen by Rhodococcus erythropolis IEGM 746. Lect Notes Netw Syst 342:404–410. doi: 10.1007/978-3-030-89477-1_40. [DOI] [Google Scholar]
- 7.Jiang H, Lei R, Ding SW, Zhu S. 2014. Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads. BMC Bioinformatics 15:182. doi: 10.1186/1471-2105-15-182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Andrews S. 2010. FastQC, a quality control tool for high throughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
- 9.Nurk S, Bankevich A, Antipov D, Gurevich A, Korobeynikov A, Lapidus A, Prjibelsky A, Pyshkin A, Sirotkin A, Sirotkin Y, Stepanauskas R, McLean J, Lasken R, Clingenpeel SR, Woyke T, Tesler G, Alekseyev MA, Pevzner PA. 2013. Assembling genomes and mini-metagenomes from highly chimeric reads, p 158–170. In Deng M, Jiang R, Sun F, Zhang X (ed), Research in computational molecular biology, RECOMB 2013. Springer, Berlin, Germany. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Yoon SH, Ha SM, Lim JM, Kwon SJ, Chun J. 2017. A large-scale evaluation of algorithms to calculate average nucleotide identity. Antonie Van Leeuwenhoek 110:1281–1286. doi: 10.1007/s10482-017-0844-4. [DOI] [PubMed] [Google Scholar]
- 12.Li W, O’Neill KR, Haft DH, DiCuccio M, Chetvernin V, Badretdin A, Coulouris G, Chitsaz F, Derbyshire MK, Durkin AS, Gonzales NR, Gwadz M, Lanczycki CJ, Song JS, Thanki N, Wang J, Yamashita RA, Yang M, Zheng C, Marchler-Bauer A, Thibaud-Nissen F. 2021. RefSeq: expanding the Prokaryotic Genome Annotation Pipeline reach with protein family model curation. Nucleic Acids Res 49:D1020–D1028. doi: 10.1093/nar/gkaa1105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. doi: 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
This Whole-Genome Shotgun project has been deposited in DDBJ/ENA/GenBank under the accession numbers JAJNDF010000001 to JAJNDF010000078. The version described in this paper is the first version, JAJNDF000000000.1. Raw sequence reads were deposited in the Sequence Read Archive with accession number SRR17030444 under BioSample accession number SAMN23420022 and BioProject accession number PRJNA783162.
