Skip to main content
Scientific Data logoLink to Scientific Data
. 2019 Nov 25;6:280. doi: 10.1038/s41597-019-0289-x

Complete genome sequence of Sphingomonas paucimobilis AIMST S2, a xenobiotic-degrading bacterium

Suganniiya K Ravintheran 1, Sumitra Sivaprakasam 1, Stella Loke 2, Su Yin Lee 1, Ravichandran Manickam 1, Adibah Yahya 3, Lawrence Croft 4, Andrew Millard 5, Sivachandran Parimannan 1,, Heera Rajandas 1,
PMCID: PMC6877580  PMID: 31767854

Abstract

Complete genomes of xenobiotic-degrading microorganisms provide valuable resources for researchers to understand molecular mechanisms involved in bioremediation. Despite the well-known ability of Sphingomonas paucimobilis to degrade persistent xenobiotic compounds, a complete genome sequencing is lacking for this organism. In line with this, we report the first complete genome sequence of Sphingomonas paucimobilis (strain AIMST S2), an organophosphate and hydrocarbon-degrading bacterium isolated from oil-polluted soil at Kedah, Malaysia. The genome was derived from a hybrid assembly of short and long reads generated by Illumina HiSeq and MinION, respectively. The assembly resulted in a single contig of 4,005,505 bases which consisted of 3,612 CDS and 56 tRNAs. An array of genes involved in xenobiotic degradation and plant-growth promoters were identified, suggesting its’ potential role as an effective microorganism in bioremediation and agriculture. Having reported the first complete genome of the species, this study will serve as a stepping stone for comparative genome analysis of Sphingomonas strains and other xenobiotic-degrading microorganisms as well as gene expression studies in organophosphate biodegradation.

Subject terms: Sequencing, Genome, Environmental biotechnology, Environmental microbiology


Measurement(s) genome • sequence_assembly
Technology Type(s) DNA sequencing • genome assembly
Sample Characteristic - Organism Sphingomonas paucimobilis
Sample Characteristic - Environment oil contaminated soil
Sample Characteristic - Location Malaysia

Machine-accessible metadata file describing the reported data: 10.6084/m9.figshare.9971711

Background and Summary

Sphingomonas spp. are Gram-negative, oxidase positive and non-fermentative rods1. One of the best known species of the genus is Sphingomonas paucimobilis as it was originally said to be the only species described in human infection1,2. It is a non-spore forming strictly aerobic, yellow-pigmented bacteria that can survive in low nutrient environment1,3. S. paucimobilis is naturally found in diverse environments such as soil and water and also has been shown to have a wide range of xenobiotic-biodegradative abilities46. Previous studies had shown its’ ability to degrade various types of hydrocarbons and pesticides, specifically chlorpyrifos712. It is also well recognized for its potential for biofilm formation13. Despite the potential role of this bacterium in bioremediation, there is a lack of complete genome in the public domain which will allow for the identification of genes involved in the biodegradation of chlorpyrifos, a widely used organophosphate.

General features of S. paucimobilis strain AIMST S2 are summarized in Table 1. S. paucimobilis strain AIMST S2 was first isolated in an oil-contaminated soil sample from Kedah, Malaysia. Following enrichment in LB broth, this strain was acclimatized in M9 minimal medium supplemented with diesel (max. 1% v/v) and chlorpyrifos (max. 100 mg/L) in increasing concentrations, as the sole carbon source. Genomic DNA extraction was performed according to the GeneJet Genomic DNA purification kit’s protocol using a log-phase culture grown in Luria broth. The concentration and quality of extracted DNA was determined using Nanodrop, Qubit dsDNA BR assay and a 1% (v/w) agarose gel. The genomic DNA was then subjected to sequencing via Illumina HiSeq. 2500 and Oxford Nanopore. DNA sequencing was performed with both Illumina and Nanopore technologies as they yield short (~150 bases) and long reads (~10,000 bases), respectively, a combination of which has shown to improve hybrid genome assembly quality by providing accurate, complete genomes without gaps14.

Table 1.

General features of S. paucimobilis strain AIMST S2 based on MIGS mandatory information.

Items Description
Investigation type Bacteria
Project name Complete genome sequencing of S. paucimobilis AIMST S2
Latitude and longitude 5.663 N 100.505 E
Geographical location Malaysia
Collection date 19 December 2008
Isolation source Oil-contaminated soil
Estimated size 4,005,505 bp
Sequencing method Illumina HiSeq. 2500 & MinION
Assembly Hybrid genome assembly (Unicycler)
Assembly level Complete Genome
Genome representation Full
Genome coverage ~446.6×
Finishing strategy Sequencing & assembly

The complete genome sequence reported in this study will be useful for analysis of protein-coding gene families, identification of genomic islands, repeat regions, prophages, and structural rearrangements. Apart from that, the data from this study can be utilized for comparative genome analysis of strains belonging to the genus Sphingomonas and other xenobiotic-degrading microorganisms, as well as transcriptome studies of chlorpyrifos biodegradation.

An overview of the experimental design of the study is illustrated in Fig. 1 and a detailed account of the workflow is provided in the methodology.

Fig. 1.

Fig. 1

Overview of the experimental design of study.

Methods

Bacterial growth and genomic DNA extraction

S. paucimobilis was cultivated in LB broth and incubated at 37 °C until it attained an absorbance of ~0.7 at 600 nm. The log-phase culture was centrifuged at 10,000 × g for 10 minutes and the cell pellet was subjected to genomic DNA extraction according to the GeneJet Genomic DNA purification kit’s protocol (Thermo Fisher Scientific, Waltham, MA, USA). The concentration and quality of extracted DNA was determined using Nanodrop ™ Lite spectrophotometer (Thermo Scientific, Wilmington, DE, USA), Qubit dsDNA BR assay (Thermo Scientific, Wilmington, DE, USA) and 1% (v/w) agarose gel electrophoresis. The genomic DNA was then subjected to sequencing via Illumina HiSeq. 2500 and MinION.

Illumina Sequencing

DNA was fragmented using Covaris to a targeted size of 350 bp and upon adapter ligation, a library containing fragments of 470 bp was generated. The library size was determined using Bioanalyzer high sensitivity DNA chip (Agilent, CA, USA). Library was prepared using NEBNext Ultra DNA Library Prep Kit for Illumina (NEB, MA, USA) and paired-end sequenced.

Oxford Nanopore MinION Sequencing

Approximately 500 ng genomic DNA was used to build a DNA library using a Rapid Sequencing Kit (SQK-RAD004) (ONT, Oxford, UK) as described by the manufacturer. MinKNOW software version 2.0 (ONT, Oxford, UK) was used to perform a quality check on the flow cell before the DNA library was loaded. Sequencing was performed on MK1B (MIN-101B) MinION platform with a FLO-MIN 106 R9.4 (SpotON) flow cell according to the manufacturers’ instructions. Raw sequence reads were basecalled real time using MinKNOW, producing Fastq format data.

Hybrid genome assembly

The FastQ format data obtained from Illumina and MinION sequencing was subjected to genome assembly using Unicycler version 0.4.3 with default parameters.

Genome annotation

The assembly was annotated with Prokka15. Genome-wide COG functional annotation was performed using eggNOG mapper with DIAMOND mapping mode, which is available in version 4.5.116,17. Following this, the amino acid sequences were subjected to KEGG analysis via KAAS for pathway mapping. Prophages and genomic islands were also identified using PHASTER18 and IslandViewer 419.

Data Records

Sequencing raw reads obtained from Illumina and Nanopore MinION runs have been deposited in the NCBI Sequence Read Archive under SRP185601 (accessible at https://identifiers.org/ncbi/insdc.sra:SRP185601)20. All predicted genes and their functional annotations are provided in GenBank (Accession number: NZ_CP035765)21. The circular genome assembly for S. paucimobilis has been deposited in NCBI Assembly under GCA_003314795.222, and the whole project is at BioProject under PRJNA478628 (https://identifiers.org/bioproject:PRJNA478628).

Technical Validation

FaQCs was used to obtain the sequencing statistics and Q scores of Illumina short-reads, while Pauvr was used to obtain the same for MinION sequencing (Table 2). Illumina sequencing yielded paired-end reads of ~150 bases with more than 98% reads possessing Phred scores (Q scores) above 20 (Fig. 2a), when quality screening was performed with FaQCs. MinION reads were also of high quality, as shown in Fig. 2b.

Table 2.

Basic statistics of Illumina and MinION sequencing.

Illumina MinION
Number of reads 6,111,374 11,688
Mean Length 150 14,176
Maximum Length 150 112,765
N50 22,110
Number of reads >10,000 bp 6,296 (~54%)

Fig. 2.

Fig. 2

Phred analysis of Illumina and MinION reads for the Sphingomonas paucimobilis AIMST S2 strain genome. (a) Q scores for Illumina reads. (b) Q scores for MinION reads.

The hybrid genome assembly performed with the reads provided a complete, circular genome of S. paucimobilis, containing 4,005,505 bases, with an overall GC content of 65.73%. The sequencing coverage based on raw reads was 446.6×. A total of 3,612 coding sequences (CDS), 56 tRNAs, 1 tmRNA and 1 CRISPR array were identified. Three identical ribosomal operons were identified.

Figure 3 illustrates the circular genome of S. paucimobilis plotted using CGView23.

Fig. 3.

Fig. 3

Circular map of S. paucimobilis. (a) Circular representation of genome with basic features including CDS and tRNA distributions. (b) Circular representation of genome based on COG classification.

Several levels of validation were performed to refine the hybrid assembly and check for completeness and the quality of genes predicted. Pilon refines the assembly using short reads during the final stage of assembly in Unicycler, by detecting and correcting single base differences, small and large indels or block substitution events. The present hybrid assembly was polished twice by Pilon with no changes in the assembly, suggesting an accurate assembly.

The completeness of the genomic data was further assessed according to Watson and Warr (2019)24. A DIAMOND blast against the UniProt TREMBL database showed that 99.1% of the genes predicted in the genome had more than 90% coverage to its top hit, suggesting good quality assembly and annotation was generated.

Among these, approximately 32 genes were shown to be involved in xenobiotic degradation (Table 3).

Table 3.

Gene clusters involved in xenobiotic degradation.

Pathway Number of genes involved
Benzoate degradation 8
Aminobenzoate degradation 3
Chloroalkane & chloroalkene degradation 2
Chlorocyclohexane & chlorobenzene degradation 1
Xylene degradation 1
Ethylbenzene degradation 1
Styrene degradation 1
Caprolactam degradation 4
Atrazine degradation 3
Dioxin degradation 1
Drug metabolism - other enzymes 7

Interestingly, one of the key genes responsible for organophosphate biodegradation, glutathione S-transferase, gst was identified in the analysis. gst has previously been said to detoxify xenobiotics by catalyzing the nucleophilic conjugation of reduced tripeptide glutathione (GSH; γ-Glu-Cys-Gly) into hydrophobic and electrophilic substrates25,26.

Apart from genes involved in chlorpyrifos and other xenobiotic biodegradation, several genes related to plant-growth promoting factors were also identified in the genome. This includes several genes in auxin biosynthesis, alkaloid biosynthesis and nitrogen metabolism. Auxin plays a significant role in promoting stem elongation27,28, while alkaloid plays an important role in plants by preventing insects from eating them29. Genes involved in nitrogen metabolism like nitrate reductase, on the other hand, is responsible in reducing nitrate to nitrite for the production of protein in most crop plants, as nitrate is the predominant source of nitrogen in fertilized soils3032.

Characterization of the complete genome of S. paucimobilis, identification of potential chlorpyrifos-degrading gene, gst and an array of genes coding for plant-growth promoting factors opens an avenue to more studies on bioremediation and its’ potential use as an effective microorganism in bioremediation and agriculture.

Acknowledgements

We would like to acknowledge Malaysian Genomic Resource Centre (MGRC) for performing the Illumina sequencing.

Author contributions

The project and pipeline were conceived and designed by H.R. and S.P. DNA extraction was performed by S.R. Sequencing was performed by S.S., S.R. and S.L. Computational resource was provided by A.M. Data analysis was performed by S.R., L.C., A.M., H.R. and S.P. The manuscript was written and revised by S.S., H.R., S.P., L.S.Y., R.M. and A.M. The final manuscript was approved by all authors.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors jointly supervised this work: Sivachandran Parimannan and Heera Rajandas.

Contributor Information

Sivachandran Parimannan, Email: sivachandran@aimst.edu.my.

Heera Rajandas, Email: heraadaas@gmail.com.

References

  • 1.Ryan MP, Adley CC. Sphingomonas paucimobilis: a persistent Gram-negative nosocomial infectious organism. J. Hosp. Infect. 2010;75:153–157. doi: 10.1016/j.jhin.2010.03.007. [DOI] [PubMed] [Google Scholar]
  • 2.Martínez MA, Ovalle A. Sphingomonas paucimobilis. Rev Chilena Infectol. 2013;30:49–50. doi: 10.4067/S0716-10182013000100007. [DOI] [PubMed] [Google Scholar]
  • 3.Walayat S, Malik A, Hussain N, Lynch T. Sphingomonas paucimobilis presenting as acute phlebitis: A case report. IDCases. 2018;11:6–8. doi: 10.1016/j.idcr.2017.11.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Guo C, Dang Z, Wong Y, Tam NF. Biodegradation ability and dioxgenase genes of PAH-degrading Sphingomonas and Mycobacterium strains isolated from mangrove sediments. Int. Biodeter. Biodegr. 2010;64:419–426. doi: 10.1016/j.ibiod.2010.04.008. [DOI] [Google Scholar]
  • 5.Ayed L, Mahdhi A, Cheref A, Bakhrouf A. Decolorization and degradation of azo dye Methyl Red by an isolated Sphingomonas paucimobilis: Biotoxicity and metabolites characterization. Desalination. 2011;274:272–277. doi: 10.1016/j.desal.2011.02.024. [DOI] [Google Scholar]
  • 6.Che Noraini CH, Morad N, Norli I, Teng TT, Ogugbue CJ. Methylene blue degradation by Sphingomonas paucimobilis under aerobic conditions. Water Air Soil Pollut. 2012;223:5131–5142. doi: 10.1007/s11270-012-1264-8. [DOI] [Google Scholar]
  • 7.Li X, He J, Li S. Isolation of a chlorpyrifos-degrading bacterium, Sphingomonas sp. strain Dsp-2, and cloning of the mpd gene. Res. Microbiol. 2007;158:143–149. doi: 10.1016/j.resmic.2006.11.007. [DOI] [PubMed] [Google Scholar]
  • 8.Math RK, et al. Isolation of a novel gene encoding a 3,5,6-trichloro-2-pyridinol degrading enzyme from a cow rumen metagenomic library. Biodegradation. 2010;21:565–573. doi: 10.1007/s10532-009-9324-5. [DOI] [PubMed] [Google Scholar]
  • 9.Singh BK, Walker A, Morgan JAW, Wright DJ. Effects of soil pH on the biodegradation of chlorpyrifos and isolation of a chlorpyrifos-degrading bacterium. Appl. Environ. Microbiol. 2003;69:5198–5206. doi: 10.1128/AEM.69.9.5198-5206.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.El-Helow ER, Badawy MEI, Mabrouk MEM, Mohamed EAH, El-Beshlawy YM. Biodegradation of chlorpyrifos by a newly isolated Bacillus subtilis strain, Y242. Bioremediat. J. 2013;17:113–123. doi: 10.1080/10889868.2013.786019. [DOI] [Google Scholar]
  • 11.Anwar S, Liaquat F, Khan QM, Khalid ZM, Iqbal S. Biodegradation of chlorpyrifos and its hydrolysis product 3,5,6-trichloro-2-pyridinol by Bacillus pumilus strain C2A1. J. Hazard Mater. 2009;168:400–405. doi: 10.1016/j.jhazmat.2009.02.059. [DOI] [PubMed] [Google Scholar]
  • 12.Xu G, et al. Biodegradation of chlorpyrifos and 3,5,6-trichloro-2-pyridinol by a newly isolated Paracoccus sp. strain TRP. Int. Biodeter. Biodegr. 2008;62:51–56. doi: 10.1016/j.ibiod.2007.12.001. [DOI] [Google Scholar]
  • 13.Gulati P, Ghosh M. Biofilm forming ability of Sphingomonas paucimobilis isolated from community drinking water systems on plumbing materials used in water distribution. J Water Health. 2017;15:942–954. doi: 10.2166/wh.2017.294. [DOI] [PubMed] [Google Scholar]
  • 14.Todd SM, Settlage RE, Lahmers KK, Slade DJ. Fusobacterium genomics using MinION and Illumina sequencing enables genome completion and correction. mSphere. 2018;3:e00269–18. doi: 10.1128/mSphere.00269-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30:2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
  • 16.Huerta-Cepas J, et al. eggNOG 4.5: a hierarchical orthology framework with improved functional annotations for eukaryotic, prokaryotic and viral sequences. Nucleic Acids Res. 2016;44:D286–D293. doi: 10.1093/nar/gkv1248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Huerta-Cepas J, et al. Fast genome-wide functional annotation through orthology assignment by eggNOG-Mapper. Mol Biol Evol. 2017;34:2115–2122. doi: 10.1093/molbev/msx148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Arndt D, et al. PHASTER: a better, faster version of the PHAST phage search tool. Nucleic Acids Res. 2016;44:W16–W21. doi: 10.1093/nar/gkw387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bertelli C, et al. IslandViewer 4: expanded prediction of genomic islands for larger-scale datasets. Nucleic Acids Res. 2017;45:W30–W35. doi: 10.1093/nar/gkx343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.2019. NCBI Sequence Read Archive. SRP185601
  • 21.Ravintheran SK, 2019. Sphingomonas paucimobilis strain AIMST S2 chromosome, complete genome. GenBank. CP035765 [DOI] [PMC free article] [PubMed]
  • 22.2018. NCBI Assembly. GCA_003314795.2
  • 23.Grant JR, Stothard P. The CGView server: a comparative genomics tool for circular genomes. Nucleic Acids Res. 2008;36:W181–W184. doi: 10.1093/nar/gkn179. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Watson M, Warr A. Errors in long-read assemblies can critically affect protein prediction. Nat. Biotechnol. 2019;37:124. doi: 10.1038/s41587-018-0004-z. [DOI] [PubMed] [Google Scholar]
  • 25.Shen M, et al. Identification of glutathione S-transferase (GST) genes from a dark septate endophytic fungus (Exophiala pisciphila) and their expression patterns under varied metals stress. PLoS One. 2015;10:e0123418. doi: 10.1371/journal.pone.0123418. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Liu Y-J, Han X-M, Ren L-L, Yang H-L, Zeng Q-Y. Functional divergence of the glutathione S-transferase supergene family in Physcomitrella patens reveals complex patterns of large gene family evolution in land plants. Plant Physiol. 2013;161:773–786. doi: 10.1104/pp.112.205815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Davies, P. J. In Plant Hormones: Biosynthesis, Signal Transduction, Action! (ed. Davies, P. J.) Ch. 1 The Plant Hormones: Their Nature, Occurrence, and Functions. (Springer Netherlands, 2010).
  • 28.Santner A, Calderon-Villalobos LIA, Estelle M. Plant hormones are versatile chemical regulators of plant growth. Nat. Chem. Biol. 2009;5:301–307. doi: 10.1038/nchembio.165. [DOI] [PubMed] [Google Scholar]
  • 29.Steppuhn A, Gase K, Krock B, Halitschke R, Baldwin IT. Nicotine’s defensive function in nature. PLoS Biol. 2004;2:e217. doi: 10.1371/journal.pbio.0020217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Crawford NM, Guo F-Q. New insights into nitric oxide metabolism and regulatory functions. Trends Plant Sci. 2005;10:195–200. doi: 10.1016/j.tplants.2005.02.008. [DOI] [PubMed] [Google Scholar]
  • 31.Ho C-H, Lin S-H, Hu H-C, Tsay Y-F. CHL1 functions as a nitrate sensor in plants. Cell. 2009;138:1184–1194. doi: 10.1016/j.cell.2009.07.004. [DOI] [PubMed] [Google Scholar]
  • 32.Kaiser WM, Huber SC. Post-translational regulation of nitrate reductase: mechanism, physiological relevance and environmental triggers. J Exp. Bot. 2001;52:1981–1989. doi: 10.1093/jexbot/52.363.1981. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. 2019. NCBI Sequence Read Archive. SRP185601
  2. Ravintheran SK, 2019. Sphingomonas paucimobilis strain AIMST S2 chromosome, complete genome. GenBank. CP035765 [DOI] [PMC free article] [PubMed]
  3. 2018. NCBI Assembly. GCA_003314795.2

Articles from Scientific Data are provided here courtesy of Nature Publishing Group

RESOURCES