Abstract
The genome sequence for Rhodococcus sp. 4J2A2, a newly described desiccation-tolerant strain that removes aromatic hydrocarbons, is reported here. The genome is estimated to be around 7.5 Mb in size, with an average G+C content of 60.77% and a predicted number of protein-coding sequences of 6,354.
GENOME ANNOUNCEMENT
Rhodococcus sp. 4J2A2 is a desiccation-tolerant Gram-positive bacterium belonging to the Actinobacteria phylum and the Nocardiaceae family, and it was isolated from the Nerium oleander rhizosphere (1). The genome sequences of other desiccation-tolerant microorganisms have been reported (2, 3), including that of the recently described new species Arthrobacter siccitolerans 4J27 (4). These microorganisms produce different compounds, substances known as xeroprotectants (5), in response to changes in osmotic conditions and water activity (1). These compounds, produced to protect essential biomolecules and cell integrity, allow the cell to tolerate extremely low concentrations of water and other chemical insults (6, 7).
Species of the genus Rhodococcus have been described as efficient removers of pollutants, particularly aromatic organic compounds (8). Here, we report the whole-genome sequence of Rhodococcus sp. 4J2A2, obtained with pyrosequencing technology implemented in the 454 Life Science-Roche platform with a combined approach based on shotgun and 8-kb mate pair sequencing (Lifesequencing SL, Valencia, Spain) (9). This technology was used to obtain a total of 222,955 reads. The average read length for the shotgun sequencing approach was 661 nucleotides, rendering 123,125 sequences. The average read length for the mate pair sequencing strategy was 419.72 bases. The total number of sequenced bases was 123,294,461, representing a sequencing depth of around 16×. For de novo assembly, Newbler Assembler version 2.6 was used with default parameters. This assembly yielded 60 contigs, 45 of which were >500 bp. The N50 of the contig assembly was 375,535 bp, and the largest contig was 890,470 bp. Mate pair information indicated that most of these contigs were ordered in four scaffolds, the largest comprising 4,858,281 bp. The estimated genome size of 7.5 Mb was deduced from this combination of scaffolds and contigs. Gap-spanning clones and PCR products were used to attempt gap closure, and putative coding sequences were predicted. Genes were annotated with a pipeline implemented at Lifesequencing, and protein-coding sequences (CDS) were predicted with Glimmer (10–12), RNAmmer (13), tRNAscan (14, 15), and BLAST (16, 17) in combination. Most of the contigs used to obtain complete genomic information for Rhodococcus 4J2A2 are contained on four scaffolds, with an average G+C content of 60.77%. The genome was found to contain 6,354 protein-coding genes, 4 rRNA operons, and 51 tRNA genes.
On the basis of this genome sequence, we propose the presence of pathways for the catabolism of chloroalkanes and chloroalkenes, such as cis- and trans-dichloropropene, trichloroethane, and tetrachloroethene via pyruvate, glyoxylate, dicarboxylate, and methane metabolism. We also propose the presence of pathways for the metabolism of aromatic hydrocarbons, such as toluene, xylene, benzoate, and phthalate, and polycyclic hydrocarbons, such as fluorene, anthracene, phenanthrene, pyrene, benzo[a]pyrene, and naphthalene, and some of its derivate compounds.
The complete genome sequence of Rhodococcus sp. 4J2A2 will contribute to the development of biotechnological applications in the field of bioremediation, particularly for the removal of pollutants in arid regions (6, 18).
Nucleotide sequence accession numbers.
The complete genome sequence of Rhodococcus sp. 4J2A2 has been deposited in the TBL/EMBL/GenBank databases under the accession numbers CEDU01000001 to CEDU01000060.
ACKNOWLEDGMENTS
This research was funded by the Ministry of Science and Innovation of Spain and the Andalusian regional government (Spain) as part of research projects P11-RNM-7844 and CTM2009-09270. M. Manzanera received a Ramón y Cajal research grant from the Ministry of Science and Innovation and support from European Regional Development Funds (EU).
We thank K. Shashok for improving the use of English in the manuscript.
Footnotes
Citation Manzanera M, García-Fontana C, Vílchez JI, González-López J. 2015. Genome sequence of Rhodococcus sp. 4J2A2, a desiccation-tolerant bacterium involved in biodegradation of aromatic hydrocarbons. Genome Announc 3(3):e00592-15. doi:10.1128/genomeA.00592-15.
REFERENCES
- 1.Narváez-Reinaldo JJ, Barba I, González-López J, Tunnacliffe A, Manzanera M. 2010. Rapid method for isolation of desiccation-tolerant strains and xeroprotectants. Appl Environ Microbiol 76:5254–5262. doi: 10.1128/AEM.00855-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Manzanera M, Vílchez JI, García-Fontana C, Calvo C, González-López J. 2015. Genome sequence of Leucobacter sp. 4J7B1, a plant-osmoprotectant soil microorganism. Genome Announc 3(3):00398-15 10.1128/genomeA.00398-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Manzanera M, Santa-Cruz-Calvo L, Vílchez JI, García-Fontana C, Silva-Castro GA, Calvo C, González-López J. 2014. Genome sequence of Arthrobacter siccitolerans 4J27, a xeroprotectant-producing desiccation-tolerant microorganism. Genome Announc 2(3):e00526-15. doi: 10.1128/genomeA.00526-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.SantaCruz-Calvo L, González-López J, Manzanera M. 2013. Arthrobacter siccitolerans sp. nov., a highly desiccation-tolerant, xeroprotectant-producing strain isolated from dry soil. Int J Syst Evol Microbiol 63:4174–4180. doi: 10.1099/ijs.0.052902-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Julca I, Alaminos M, González-López J, Manzanera M. 2012. Xeroprotectants for the stabilization of biomaterials. Biotechnol Adv 30:1641–1654. doi: 10.1016/j.biotechadv.2012.07.002. [DOI] [PubMed] [Google Scholar]
- 6.Vilchez S, Manzanera M. 2011. Biotechnological uses of desiccation-tolerant microorganisms for the rhizoremediation of soils subjected to seasonal drought. Appl Microbiol Biotechnol 91:1297–1304. doi: 10.1007/s00253-011-3461-6. [DOI] [PubMed] [Google Scholar]
- 7.Vílchez S, Tunnacliffe A, Manzanera M. 2008. Tolerance of plastic-encapsulated Pseudomonas putida KT2440 to chemical stress. Extremophiles 12:297–299. doi: 10.1007/s00792-007-0123-9. [DOI] [PubMed] [Google Scholar]
- 8.Kuyukina MS, Ivshina IB 2010. Application of Rhodococcus in bioremediation of contaminated environments, p 231–262. In Alvarez HM (ed), Biology of Rhodococcus, vol 16. Springer, Berlin, Germany.
- 9.Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Ho CH, Irzyk GP. 2005. Genome sequencing in microfabricated high-density picolitre reactors. Nature 437:376–380. doi: 10.1038/nature03959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. 1999. Improved microbial gene identification with Glimmer. Nucleic Acids Res 27:4636–4641. doi: 10.1093/nar/27.23.4636. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Salzberg SL, Delcher AL, Kasif S, White O. 1998. Microbial gene identification using interpolated Markov models. Nucleic Acids Res 26:544–548. doi: 10.1093/nar/26.2.544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679. doi: 10.1093/bioinformatics/btm009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lagesen K, Hallin P, Rødland EA, Staerfeldt HH, Rognes T, Ussery DW. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35:3100–3108. doi: 10.1093/nar/gkm160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Schattner P, Brooks AN, Lowe TM. 2005. The tRNAscan-SE, snoscan and snoGPS Web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res 33:W686–W689. doi: 10.1093/nar/gki366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.0955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403–410. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 17.Morgulis A, Coulouris G, Raytselis Y, Madden TL, Agarwala R, Schäffer AA. 2008. Database indexing for production MegaBLAST searches. Bioinformatics 24:1757–1764. doi: 10.1093/bioinformatics/btn322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Manzanera M, Vilchez S, Tunnacliffe A. 2004. Plastic encapsulation of stabilized Escherichia coli and Pseudomonas putida. Appl Environ Microbiol 70:3143–3145. doi: 10.1128/AEM.70.5.3143-3145.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]