Closed Genome Sequences of Two Clostridium botulinum Strains Obtained by Nanopore Sequencing

Narjol Gonzalez-Escalona; Julie Haendiges; Jesse D Miller; Shashi K Sharma

2018 Sep 6;7(9):e01075-18. doi: 10.1128/MRA.01075-18

Editor: J Cameron Thrash

PMCID: PMC6256530 PMID: 30533938

ABSTRACT

Here we report the genome sequences of two toxin-producing Clostridium botulinum strains, one environmental sample (83F) and one clinical sample (CDC51232). The genomes were closed by a combination of long-read and short-read sequencing.

ANNOUNCEMENT

Clostridium botulinum is a Gram-positive, spore-forming anaerobic bacterium that produces botulinum neurotoxin (BoNT) (1). Ingestion of the potent BoNT causes a serious paralytic illness known as botulism in humans and is a critical concern for food safety. The neurotoxins produced by these organisms are serologically different, and seven serotypes have been described, designated by the letters A through G (2). Four of the seven serotypes, namely, A, B, E, and F, have been linked with human botulism, with most cases due to serotypes A and B (3).

The genomes of two bivalent toxin-producing C. botulinum strains (strains that carry genes for two botulinum toxins) were sequenced to improve preparedness for botulism outbreaks. The strains were grown, and the DNA was extracted, as reported previously (4). The long reads for each strain were generated with MinION sequencing (Oxford Nanopore Technologies, Oxford, UK). The sequencing libraries were prepared using the rapid sequencing kit RAD004 and run in a FLO-MIN106 (R9.4.1) flow cell for 48 h, according to the manufacturer’s instructions, yielding 230 to 290× average coverage. The transposase in the fragmentation mix of the RAD004 kit fragments the DNA randomly, and the resulting libraries contained fragments of >30 kb. The short reads for each strain were generated on the Illumina MiSeq platform with the MiSeq v3 kit using 2 × 250-bp paired-end chemistry (Illumina, San Diego, CA), according to the manufacturer’s instructions, at 160 to 180× coverage. The libraries were constructed with 100 ng of genomic DNA using the Nextera DNA Flex kit (Illumina) according to the manufacturer’s instructions. The genome of each strain was first assembled de novo from the nanopore data using Canu v1.6 (5) with default settings. A second, hybrid assembly was generated for each strain using SPAdes (6) with default settings and both the nanopore and MiSeq data. The Canu assemblies were error corrected with Pilon (7) using the MiSeq data. The final assembly (FA) was generated by comparing the SPAdes hybrid and Canu-polished assemblies using Mauve (8) and filling in the missing regions in the SPAdes assembly with the Canu-polished assembly. The FA sequences were annotated using the NCBI Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP, https://www.ncbi.nlm.nih.gov/genome/annotation_prok).
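The assembly steps described above can be sketched as a set of command lines. This is a minimal, hypothetical sketch and not the authors' actual commands: the file names, the `genomeSize=4m` estimate that Canu requires, the `pilon.jar` path, and the presence of a sorted, indexed BAM file of MiSeq reads aligned to the Canu contigs (the aligner is not stated in the text) are all assumptions. The functions only build argument lists; nothing is executed.

```python
from typing import List


def canu_cmd(nanopore_fastq: str, prefix: str, out_dir: str,
             genome_size: str = "4m") -> List[str]:
    """Canu (v1.x syntax) de novo assembly from raw nanopore reads.

    genome_size is an assumed estimate; Canu requires one to be given.
    """
    return ["canu", "-p", prefix, "-d", out_dir,
            f"genomeSize={genome_size}", "-nanopore-raw", nanopore_fastq]


def spades_hybrid_cmd(r1: str, r2: str, nanopore_fastq: str,
                      out_dir: str) -> List[str]:
    """SPAdes hybrid assembly from paired-end short reads plus nanopore reads."""
    return ["spades.py", "-1", r1, "-2", r2,
            "--nanopore", nanopore_fastq, "-o", out_dir]


def pilon_cmd(draft_fasta: str, miseq_bam: str, out_prefix: str,
              pilon_jar: str = "pilon.jar") -> List[str]:
    """Pilon polishing of a draft assembly with aligned paired-end reads.

    miseq_bam is assumed to be a sorted, indexed alignment of the MiSeq
    reads against draft_fasta, produced beforehand by an aligner of choice.
    """
    return ["java", "-jar", pilon_jar, "--genome", draft_fasta,
            "--frags", miseq_bam, "--output", out_prefix]


def build_pipeline(strain: str) -> List[List[str]]:
    """Return the three commands for one strain, using hypothetical file names."""
    ont = f"{strain}_nanopore.fastq"
    r1, r2 = f"{strain}_R1.fastq", f"{strain}_R2.fastq"
    canu_dir = f"{strain}_canu"
    # Canu writes its contigs to <out_dir>/<prefix>.contigs.fasta
    canu_contigs = f"{canu_dir}/{strain}.contigs.fasta"
    return [
        canu_cmd(ont, strain, canu_dir),
        spades_hybrid_cmd(r1, r2, ont, f"{strain}_spades"),
        pilon_cmd(canu_contigs, f"{strain}_miseq_vs_canu.bam",
                  f"{strain}_canu_pilon"),
    ]


if __name__ == "__main__":
    for cmd in build_pipeline("83F"):
        print(" ".join(cmd))
```

Each list could be passed to `subprocess.run(cmd, check=True)` on a system where the tools are installed. The Mauve comparison and the filling of missing regions are interactive steps and are not represented here.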

In silico multilocus sequence typing (MLST) analyses (https://pubmlst.org/bigsdb?db=pubmlst_cbotulinum_seqdef&page=sequenceQuery) showed that CDC51232 belonged to sequence type 7 (ST7) and 83F belonged to ST4. Whole-genome single-nucleotide polymorphism (SNP) analysis, performed as described previously (4), showed that the two genomes belonged to different lineages: CDC51232 belonged to lineage 2 and 83F to lineage 4, both of which contain mostly bivalent strains, as inferred from our previous study (4). Each sequenced strain carried two plasmids, although the sizes and sequences of the plasmids differed greatly between the strains (Table 1). Furthermore, although both isolates were bivalent C. botulinum strains, the location of the BoNT clusters differed between them. In strain CDC51232, the BoNT clusters (BoNTB and BoNTA4) were located on the larger plasmid, whereas in 83F, the BoNT clusters (BoNTB and BoNTA1) were located on the chromosome (Table 1).

TABLE 1. Metadata for the two C. botulinum strains reported in this study^a

CFSAN no.	Isolate name	Chromosome GenBank accession no. (size [bp])	Plasmid GenBank accession no. (size [bp])	Sequence Read Archive no.	Source	Serotype	Sequence type
CFSAN034200	CDC51232	CP031095 (270,024)	CP031096 (9,953)	SRR7530166	Clinical	AB	7
			CP031097 (3,922,194)	SRR7530167
CFSAN034202	83F	CP031098 (3,954,901)	CP031100 (57,676)	SRR7532471	Environmental	AB	4
			CP031099 (5,926)	SRR7532470

^a The GC content for each strain was 28.2%.
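The GC content reported above is a simple ratio: the number of G and C bases divided by the number of unambiguous bases. The following is a minimal sketch of that calculation on FASTA-formatted text. Whether the reported value was computed over all replicons or the chromosome alone is not stated, so pooling all records here is an assumption, and the toy sequences are invented for illustration.

```python
from typing import Dict, Iterable


def parse_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA-formatted text into {record_id: sequence}."""
    records: Dict[str, list] = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            # Use the first word of the header as the record ID
            current = header[0] if header else f"record{len(records) + 1}"
            records[current] = []
        elif current is not None:
            records[current].append(line.upper())
    return {name: "".join(parts) for name, parts in records.items()}


def gc_percent(sequences: Iterable[str]) -> float:
    """GC content (%) pooled over all sequences, counting only A, C, G, T."""
    gc = at = 0
    for seq in sequences:
        gc += seq.count("G") + seq.count("C")
        at += seq.count("A") + seq.count("T")
    total = gc + at
    if total == 0:
        raise ValueError("no unambiguous bases found")
    return 100.0 * gc / total


# Toy input (invented sequences, not data from the study)
toy = ">chrom demo\nATATATGC\nATAT\n>plasmid\nGCAT\n"
records = parse_fasta(toy)
print(round(gc_percent(records.values()), 1))  # prints 25.0
```

For a real assembly, the text of the downloaded FASTA file would replace `toy`.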

Data availability.

The GenBank and Sequence Read Archive accession numbers for the genome sequences of the two C. botulinum strains are listed in Table 1.

ACKNOWLEDGMENTS

This study was supported by funding from the MCMi Challenge grants program proposal no. 2018-646 and FDA Foods Program intramural funds. J.H. and J.D.M. were supported by the NSF International Applied Research Center.

REFERENCES

1. Gill DM. 1982. Bacterial toxins: a table of lethal amounts. Microbiol Rev 46:86–94.
2. Shapiro RL, Hatheway C, Swerdlow DL. 1998. Botulism in the United States: a clinical and epidemiologic review. Ann Intern Med 129:221–228.
3. Kubota T, Yonekura N, Hariya Y, Isogai E, Isogai H, Amano K-I, Fujii N. 1998. Gene arrangement in the upstream region of Clostridium botulinum type E and Clostridium butyricum BL6340 progenitor toxin genes is different from that of other types. FEMS Microbiol Lett 158:215–221. doi: 10.1111/j.1574-6968.1998.tb12823.x.
4. Gonzalez-Escalona N, Timme R, Raphael BH, Zink D, Sharma SK. 2014. Whole-genome single-nucleotide-polymorphism analysis for discrimination of Clostridium botulinum group I strains. Appl Environ Microbiol 80:2125–2132. doi: 10.1128/AEM.03934-13.
5. Koren S, Walenz BP, Berlin K, Miller JR, Bergman NH, Phillippy AM. 2017. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res 27:722–736. doi: 10.1101/gr.215087.116.
6. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021.
7. Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, Earl AM. 2014. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS One 9:e112963. doi: 10.1371/journal.pone.0112963.
8. Darling ACE, Mau B, Blattner FR, Perna NT. 2004. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14:1394–1403. doi: 10.1101/gr.2289704.
