Complete Genome Sequence of Nocardiopsis exhalans Strain JCM 11759T, Isolated from Indoor Air of a Water-Damaged Private House in Finland

Dan Chen; Ping Mo; Baiyuan Li

doi:10.1128/mra.00930-22

. 2022 Nov 3;11(12):e00930-22. doi: 10.1128/mra.00930-22

Complete Genome Sequence of Nocardiopsis exhalans Strain JCM 11759^T, Isolated from Indoor Air of a Water-Damaged Private House in Finland

Dan Chen ^a, Ping Mo ^b, Baiyuan Li ^a,^✉

Editor: J Cameron Thrash^c

PMCID: PMC9753723 PMID: 36326500

ABSTRACT

The genus Nocardiopsis contains pharmaceutically and biotechnologically important species that produce a wide variety of secondary metabolites with a wide range of biological activities. Here, we report the complete genome sequence of Nocardiopsis exhalans JCM 11759^T for a better understanding of its metabolic characteristics and toxin synthesis pathway.

ANNOUNCEMENT

Nocardiopsis species are capable of producing diverse enzymes, compatible solutes, and surfactants that may allow them to prevail in multiple ecosystems (1, 2). Many Nocardiopsis species are known to produce a vast variety of bioactive compounds (3). Nocardiopsis exhalans JCM 11759^T was isolated from indoor air in a water-damaged private house in Finland in 2001 (4), and the strain name was validated by IJSEM in 2002 (5). This organism may produce toxins that pose a hazard to human health (4), but its genomic properties are still unknown.

The type strain, N. exhalans JCM 11759 (=DSM 44407 =NBRC 100346 =NRRL B-24123 =VTT E-062617), was purchased from the Japan Collection of Microorganisms (JCM). Cells were streaked onto tryptic soy agar (TSA) plates and incubated at 28°C for 7 days. An individual colony was inoculated into tryptic soy broth (TSB) and incubated at 28°C for 7 days with shaking (220 rpm). Genomic DNA was extracted using a TIANamp bacterial DNA kit (Tiangen, Beijing, China) according to the manufacturer’s protocol. The purity, concentration, and integrity of the DNA sample were checked using a NanoDrop One spectrophotometer (NanoDrop Technologies, Wilmington, DE), Qubit 3.0 fluorometer (Life Technologies, Carlsbad, USA), and 0.35% agarose gel electrophoresis. The DNA samples were simultaneously subjected to the NovaSeq 6000 platform (Illumina Inc., CA, USA) for short-read sequencing and the Nanopore PromethION platform (Oxford Nanopore Technologies, Oxford, UK) for long-read sequencing. Default parameters were used for all software unless otherwise specified. Briefly, the DNA was fragmented and end repaired, and sequencing adapters and barcode labels were added. Then, a short-read sequencing library (paired-end, 150-bp format) was prepared using the NEBNext Ultra II DNA library prep kit (NEB, USA), and a long-read sequencing library with no size selection was prepared using the SQK-LSK109 ligation kit (Oxford Nanopore Technologies). The Illumina library was quantified using a 2100 Bioanalyzer instrument (Agilent Technologies, Santa Clara, CA) and reverse transcriptase quantitative PCR (RT-qPCR). Finally, the libraries were sent to Wuhan Benagen Technology Company Ltd. (Wuhan, China) for sequencing.

The resulting long-read sequences (n = 52,523 reads; N₅₀, 25,024 bp) were assembled, with a coverage depth of 132×, using Flye v2.8.3 (6). The Illumina short reads (n = 9,330,736 reads), with a coverage depth of 182×, were used to correct the genome assembly using Pilon v1.22 (7). The Illumina raw sequencing reads were filtered into clean reads using the software SOAPnuke v1.4.0 (8) to discard reads with an N ratio of >10% or more than 50% low-quality bases (Q ≤ 5). Base calling of the Nanopore raw data was accomplished using Guppy v3.1.5, and sequences with a Q score of <7 were discarded. Adapter sequences were trimmed using Porechop v0.2.4 (https://github.com/rrwick/Porechop). The resultant contigs were checked for further joins and circularity using Circlator v1.1.3 (9). The assembly results showed that the two contigs were circular. Sequence comparison was performed using BLAST, and no overlap was found between the two contigs. Using PlasFlow v1.0 software (10), the two contigs were confirmed to be a chromosome and a plasmid. The genomic features were annotated using the NCBI Prokaryotic Genome Annotation Pipeline (PGAP) v6.1 (11). The genome features of N. exhalans JCM 11759^T are summarized in Table 1. The genome comprises 7,597,621 bp; it consists of one circular chromosome (7,506,724 bp) and one circular plasmid (90,897 bp). A total of 6,637 protein coding genes (CDSs), 60 tRNAs, 15 rRNAs, and 3 noncoding RNA genes were detected.

TABLE 1.

Genome features of N. exhalans JCM 11759^T

Name	Length (bp)	GenBank accession no.	GC content (%)	No. of CDSs^a	No. of rRNA operons	No. of tRNA genes
Chromosome	7,506,724	CP099837.1	69.7	6,539	15	60
Plasmid unnamed1	90,897	CP099838.1	69.5	98	0	0

Open in a new tab

CDSs, coding DNA sequences.

Data availability.

The complete genome sequences were deposited at NCBI GenBank under accession numbers CP099837 (genome) and CP099838 (plasmid). The raw reads were deposited in the Sequence Read Archive under the accession numbers SRR21429931 and SRR21429932.

ACKNOWLEDGMENTS

This work was supported by the Hunan Natural Science Foundation (grant 2021JJ40221), the National Natural Science Foundation of China (grant 32100151), the General Project of the Hunan Provincial Education Department (grants 20C0855 and 21C0515), and the Innovation Team of Microbial Technology at Hunan University of Arts and Science (number 202026).

Contributor Information

Baiyuan Li, Email: lby245239@126.com.

J. Cameron Thrash, University of Southern California.

REFERENCES

1.Bennur T, Kumar AR, Zinjarde S, Javdekar V. 2014. Nocardiopsis species as potential sources of diverse and novel extracellular enzymes. Appl Microbiol Biotechnol 98:9173–9185. doi: 10.1007/s00253-014-6111-y. [DOI] [PubMed] [Google Scholar]
2.Bennur T, Kumar AR, Zinjarde S, Javdekar V. 2015. Nocardiopsis species: incidence, ecological roles and adaptations. Microbiol Res 174:33–47. doi: 10.1016/j.micres.2015.03.010. [DOI] [PubMed] [Google Scholar]
3.Shi T, Wang Y-F, Wang H, Wang B. 2022. Genus Nocardiopsis: a prolific producer of natural products. Mar Drugs 20:374. doi: 10.3390/md20060374. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Peltola JS, Andersson MA, Kampfer P, Auling G, Kroppenstedt RM, Busse HJ, Salkinoja-Salonen MS, Rainey FA. 2001. Isolation of toxigenic Nocardiopsis strains from indoor environments and description of two new Nocardiopsis species, N. exhalans sp. nov. and N. umidischolae sp. nov. Appl Environ Microbiol 67:4293–4304. doi: 10.1128/AEM.67.9.4293-4304.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.International Journal of Systematic and Evolutionary Microbiology. 2002. Validation list no. 84. Validation of publication of new names and new combinations previously effectively published outside the IJSEM. Int J Syst Evol Microbiol 52:3–4. doi: 10.1099/00207713-52-1-3. [DOI] [PubMed] [Google Scholar]
6.Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]
7.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z, Zhang X, Wang J, Yang H, Fang L, Chen Q. 2018. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7:1–6. doi: 10.1093/gigascience/gix120. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Hunt M, Silva ND, Otto TD, Parkhill J, Keane JA, Harris SR. 2015. Circlator: automated circularization of genome assemblies using long sequencing reads. Genome Biol 16:294. doi: 10.1186/s13059-015-0849-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Krawczyk PS, Lipinski L, Dziembowski A. 2018. PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic Acids Res 46:e35. doi: 10.1093/nar/gkx1321. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

[B1] 1.Bennur T, Kumar AR, Zinjarde S, Javdekar V. 2014. Nocardiopsis species as potential sources of diverse and novel extracellular enzymes. Appl Microbiol Biotechnol 98:9173–9185. doi: 10.1007/s00253-014-6111-y. [DOI] [PubMed] [Google Scholar]

[B2] 2.Bennur T, Kumar AR, Zinjarde S, Javdekar V. 2015. Nocardiopsis species: incidence, ecological roles and adaptations. Microbiol Res 174:33–47. doi: 10.1016/j.micres.2015.03.010. [DOI] [PubMed] [Google Scholar]

[B3] 3.Shi T, Wang Y-F, Wang H, Wang B. 2022. Genus Nocardiopsis: a prolific producer of natural products. Mar Drugs 20:374. doi: 10.3390/md20060374. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B4] 4.Peltola JS, Andersson MA, Kampfer P, Auling G, Kroppenstedt RM, Busse HJ, Salkinoja-Salonen MS, Rainey FA. 2001. Isolation of toxigenic Nocardiopsis strains from indoor environments and description of two new Nocardiopsis species, N. exhalans sp. nov. and N. umidischolae sp. nov. Appl Environ Microbiol 67:4293–4304. doi: 10.1128/AEM.67.9.4293-4304.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B5] 5.International Journal of Systematic and Evolutionary Microbiology. 2002. Validation list no. 84. Validation of publication of new names and new combinations previously effectively published outside the IJSEM. Int J Syst Evol Microbiol 52:3–4. doi: 10.1099/00207713-52-1-3. [DOI] [PubMed] [Google Scholar]

[B6] 6.Kolmogorov M, Yuan J, Lin Y, Pevzner PA. 2019. Assembly of long, error-prone reads using repeat graphs. Nat Biotechnol 37:540–546. doi: 10.1038/s41587-019-0072-8. [DOI] [PubMed] [Google Scholar]

[B7] 7.Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B8] 8.Chen Y, Chen Y, Shi C, Huang Z, Zhang Y, Li S, Li Y, Ye J, Yu C, Li Z, Zhang X, Wang J, Yang H, Fang L, Chen Q. 2018. SOAPnuke: a MapReduce acceleration-supported software for integrated quality control and preprocessing of high-throughput sequencing data. Gigascience 7:1–6. doi: 10.1093/gigascience/gix120. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B9] 9.Hunt M, Silva ND, Otto TD, Parkhill J, Keane JA, Harris SR. 2015. Circlator: automated circularization of genome assemblies using long sequencing reads. Genome Biol 16:294. doi: 10.1186/s13059-015-0849-0. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B10] 10.Krawczyk PS, Lipinski L, Dziembowski A. 2018. PlasFlow: predicting plasmid sequences in metagenomic data using genome signatures. Nucleic Acids Res 46:e35. doi: 10.1093/nar/gkx1321. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] 11.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Complete Genome Sequence of Nocardiopsis exhalans Strain JCM 11759^T, Isolated from Indoor Air of a Water-Damaged Private House in Finland

Dan Chen

Ping Mo

Baiyuan Li

Roles

ABSTRACT

ANNOUNCEMENT

TABLE 1.

Data availability.

ACKNOWLEDGMENTS

Contributor Information

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Complete Genome Sequence of Nocardiopsis exhalans Strain JCM 11759T, Isolated from Indoor Air of a Water-Damaged Private House in Finland

Dan Chen

Ping Mo

Baiyuan Li

Roles

ABSTRACT

ANNOUNCEMENT

TABLE 1.

Data availability.

ACKNOWLEDGMENTS

Contributor Information

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Complete Genome Sequence of Nocardiopsis exhalans Strain JCM 11759^T, Isolated from Indoor Air of a Water-Damaged Private House in Finland