ABSTRACT
Most efforts to understand the biology of Vibrio cholerae have focused on a single group, the pandemic-generating lineage harboring the strains responsible for all known cholera pandemics. Consequently, little is known about the diversity of this species in its native aquatic environment. To understand the differences in the V. cholerae populations inhabiting regions with a history of cholera cases and those lacking such a history, a comparative analysis of population composition was performed. Little overlap was found in lineage compositions between those in Dhaka, Bangladesh (where cholera is endemic), located in the Ganges Delta, and those in Falmouth, MA (no known history of cholera), a small coastal town on the United States east coast. The most striking difference was the presence of a group of related lineages at high abundance in Dhaka, which was completely absent from Falmouth. Phylogenomic analysis revealed that these lineages form a cluster at the base of the phylogeny for the V. cholerae species and were sufficiently differentiated genetically and phenotypically to form a novel species. A retrospective search revealed that strains from this species have been anecdotally found from around the world and were isolated as early as 1916 from a British soldier in Egypt suffering from choleraic diarrhea. In 1935, Gardner and Venkatraman unofficially referred to a member of this group as Vibrio paracholerae. In recognition of this earlier designation, we propose the name Vibrio paracholerae sp. nov. for this bacterium. Genomic analysis suggests a link with human populations for this novel species and substantial interaction with its better-known sister species.
IMPORTANCE Cholera continues to remain a major public health threat around the globe. Understanding the ecology, evolution, and environmental adaptation of the causative agent (Vibrio cholerae) and tracking the emergence of novel lineages with pathogenic potential are essential to combat the problem. In this study, we investigated the population dynamics of Vibrio cholerae in an inland locality, which is known as endemic for cholera, and compared them with those of a cholera-free coastal location. We found the consistent presence of the pandemic-generating lineage of V. cholerae in Dhaka, where cholera is endemic, and an exclusive presence of a lineage phylogenetically distinct from other V. cholerae lineages. Our study suggests that this lineage represents a novel species that has pathogenic potential and a human link to its environmental abundance. The possible association with human populations and coexistence and interaction with toxigenic V. cholerae in the natural environment make this potential human pathogen an important subject for future studies.
KEYWORDS: Vibrio cholerae, cholera, novel species, paracholera, population genomics
INTRODUCTION
Vibrio cholerae is the causative agent of cholera, a disease that has shaken human civilization in the past few centuries and continues to be a public health threat, especially to the developing world (1, 2). Its pathogenesis and epidemiology have been extensively studied, but the aquatic part of its life cycle is still not fully understood. Strikingly, few close relatives have been found for this species in recent years, most being initially classified as V. cholerae-like bacteria. One of them was the occasional human pathogen Vibrio mimicus, which was proposed as a new species in 1981 based on phenotypic characteristics (3). Later, genome-based studies established the molecular basis of its importance as a pathogen and close association and exchange of important virulence genes with V. cholerae (4, 5). Two other closely related novel species, Vibrio sp. (unofficially named Vibrio parilis) and Vibrio metoecus, were more recently isolated alongside V. cholerae from coastal waters (6, 7) and were found to exchange genetic material with their well-known sister species in aquatic environments (6, 8). Biological information on the close relatives of a dangerous environmental pathogen like V. cholerae is of significance because of their potential as emerging pathogens themselves and their interaction with V. cholerae in their natural habitats. Even though this diverse species is ubiquitous in tropical and temperate coastal waters worldwide, cholera is only caused by a specific lineage of V. cholerae in which the O1 antigen is ancestral, the pandemic-generating (PG) V. cholerae lineage. It is not clear whether aquatic V. cholerae maintains a significantly different population structure in areas where cholera is endemic and not endemic and if this structure is influenced by cooccurring species. This is a crucial gap in our understanding of the factors defining cholera endemicity and driving local and global biogeographic dispersal patterns of V. cholerae. It has recently become possible to investigate the details of the population structure of V. cholerae and its close relatives using a molecular marker based on the single-copy housekeeping gene vibriobactin utilization protein subunit B (viuB), which provides subspecies-level resolution (9). This method was used to study a cholera-free region on the east coast of the United States, the Oyster Pond ecosystem (Falmouth, MA), where differences in abundance of individual alleles in particular locations/habitats indicated potential adaptation to ecological conditions at the subspecies level (9, 10). A similar study was performed in V. cholerae populations in an inland location (Dhaka) in Bangladesh where cholera is endemic (9, 11).
Here, to understand the role played by subspecies population structure in disease, we compared the V. cholerae population from inland Bangladesh with that from the east coast of the United States. This revealed that distribution and abundance of major lineages of V. cholerae differ significantly in the two distinct ecosystems. Both globally distributed as well as locally adapted lineages of V. cholerae are found in the two environments studied. One of the most striking differences was the presence of several related lineages in Dhaka that form a divergent clade at the base of the V. cholerae species in a phylogenomic analysis and were completely absent in the coastal United States location. Genomic characterization of these lineages revealed that they form a novel species closely related to, but distinct from, V. cholerae. A revision of recent and decades-old historical isolates related to this novel species indicated that it has been found in similar environments to pandemic V. cholerae for decades and is associated with human infections ranging from septicemia to choleraic diarrhea.
RESULTS AND DISCUSSION
Pandemic-related strains increase total V. cholerae abundance in Dhaka and reduce local diversity.
One of the main differences between the V. cholerae populations from Oyster Pond (Falmouth, MA, USA) and Dhaka (Bangladesh) is, unsurprisingly, the abundance of the pandemic-generating (PG) lineage, which includes strains responsible for the current 7th pandemic. Water samples were previously collected biweekly from seven different sites in the water bodies surrounding Dhaka city for nine continuous months (from June 2015 to March 2016) and were also collected from Oyster Pond over the summers of 2008 and 2009 in Cape Cod on the United States east coast (10). Here, we compare the V. cholerae populations from these two areas to gain insights into the differences between a region that experiences strong seasonal variation where cholera is not endemic and a tropical area where cholera is endemic. High-throughput sequencing of viuB marker gene amplicons was used to analyze the subspecies composition of V. cholerae in these two populations. Amplicons of this gene were annotated following a previously established scheme (9) in which diversity within the V. cholerae species is measured based on relative abundance and distribution of viuB alleles. Each allele represents a V. cholerae lineage, the diversity of which is roughly equivalent to that of a clonal complex as traditionally defined by multilocus sequence typing (9). A single viuB allele (viuB-73) is found to be uniquely associated with the PG lineage. Abundance and distribution of viuB alleles in samples collected from the two locations were estimated from viuB amplicon sequencing data normalized by quantitative data of viuB gene copy numbers determined by quantitative PCR (qPCR) (12). The total abundance of V. cholerae in the two locations varied significantly (Kruskal-Wallis test, P < 0.01), being almost twice as high on average in Dhaka (2.30 × 105 gene copies/liter) than in Oyster Pond (1.25 × 105 gene copies/liter) (Fig. 1A). However, when PG V. cholerae O1 (viuB-73) was excluded (quantified independently of other lineages using qPCR of the rfbO1 gene), average abundance was very similar in the two locations (Kruskal-Wallis, P < 0.01). The PG lineage was the predominant genotype in Dhaka, with an average abundance of 1.4 × 105 rfbO1 gene copies/liter, whereas it was just a minor member of the population in Oyster Pond, with an average abundance of 1.5 × 104 gene copies/liter (Fig. 1A). qPCR analysis confirmed that PG V. cholerae O1 present in the Oyster Pond population were nontoxigenic (CTX negative) as opposed to the vast majority of PG V. cholerae O1 in Dhaka being toxigenic (CTX positive) (12). Similarity percentage (SIMPER) analysis based on Bray-Curtis dissimilarity suggests that the allele most responsible for the overall dissimilarities between Dhaka and Oyster Pond is indeed viuB-73. This allele was predominant throughout the 9-month sampling period in Dhaka (11), constituting around 60% of the total V. cholerae population on average, whereas its presence was stochastic in Oyster Pond, constituting around 5% of the total population (9).
Population structure indices (diversity and evenness) were significantly lower in Dhaka than in Oyster Pond (Kruskal-Wallis test, P < 0.01) (Fig. 1B). This indicates a more stable and diverse V. cholerae community structure in the coastal location and a less diverse community dominated by fewer alleles in inland Bangladesh (Dhaka), likely because of the dominance of viuB-73 in that environment. Dhaka’s aquatic reservoirs therefore seem to harbor a V. cholerae community highly dominated by the PG lineage that is most likely to be affected substantially by human activity. It is one of the most densely populated megacities in the world and has a long history of suffering from recurring cholera (13). Maintenance of the cholera-causing genotype (PG) in the environment could be the driving factor to shape the overall population of V. cholerae in Dhaka. The reduction of intraspecies diversity by PG V. cholerae in Dhaka where cholera is endemic could be attributed to the potential selective advantage of colonizing the human gut (14), which would result in a constant output to water reservoirs. Type six secretion-mediated killing could also lead to the reduction of diversity, giving an advantage to PG V. cholerae in a resource-limited competitive environment, where PG V. cholerae is a superior competitor to other V. cholerae lineages under certain conditions. For example, in competition assays, PG V. cholerae has been shown to outcompete other lineages at a higher temperature (37°C), whereas at a lower temperature (25°C), other environmental lineages could outcompete it (40). Environmental conditions (i.e., the lower salinity seen in Dhaka [Table S1 in the supplemental material]) could also give an advantage to PG strains over others, as they have been shown to be more prevalent in lower-salinity environments relative to other lineages (9). In Oyster Pond, V. cholerae was present in substantially higher abundance in a coastal pond and lagoon compared to nearby ocean waters, indicating a likely ecological barrier (9). Overall, it is plausible that the combination of human population density and environmental factors creates the conditions for the maintenance of a natural V. cholerae population dominated by the PG lineage in Dhaka.
A novel divergent lineage is endemic to inland Bangladesh.
Besides the PG lineage, the population composition of V. cholerae sampled over 6 to 9 months was strikingly different in Dhaka and Oyster Pond. This was determined by using the abundance and distribution data of individual viuB alleles from the two locations. Nonmetric multidimensional scaling (NMDS) was performed to compare the two communities, and statistical significance of community structure dissimilarity was evaluated using the analysis of similarity (ANOSIM) with a Bray-Curtis distance matrix. In the NMDS plot, samples from Dhaka and Oyster Pond clustered separately, and community structure dissimilarity was statistically significant (ANOSIM R = 0.75, P < 0.01) (Fig. 2). Only two major alleles were shared between these locations from a total of 13 viuB alleles in Dhaka and 15 alleles in Oyster Pond (each individual allele constituting at least 1% of the V. cholerae population). The most abundant alleles in Dhaka after viuB-73 were viuB-06, viuB-07, viuB-25, and viuB-05 (Fig. 3). Of these four alleles, three are exclusively found in Dhaka (viuB-05, viuB-06, and viuB-07) and are of particular interest. Together, they composed ∼15% of the average Dhaka V. cholerae population and have been found to display higher abundance in sites surrounded by a high human population density and levels of pollution (16).
To have more information on the lineages found in Dhaka, 23 V. cholerae strains isolated from the city during the study period were selected for whole-genome sequencing: 9 V. cholerae O1 isolates harboring the viuB-73 allele and 14 V. cholerae non-O1/O139 isolates displaying a diversity of viuB alleles. Four strains possessed viuB alleles 05, 06, 07, and 08 (EDC690, EDC716, EDC717, and EDC792) and were found to be part of a very long branch occupying a basal position in a global core genome phylogeny compared to the rest of the V. cholerae strains (hence termed long branch clade or LB) (Fig. 4). This phylogenetic group has not been described in any other studies, although other strains from public databases isolated from different parts of the world also belong to it (Table 1). Nine isolates were recovered from human clinical specimens across the United States and were reported to the CDC as part of the surveillance conducted under the Cholera and Other Vibrio Illness Surveillance (COVIS) program (17). Two more isolates originated from stool samples of diarrheal patients in Mozambique in 2008 (18), and one isolate was recovered from a diarrheal patient from Thailand in 1993 and was described as V. cholerae serogroup O155 (16). Seven additional isolates have been found to belong in the clade for a total of 23 as of August 2019 (Table 1).
TABLE 1.
Isolates | Origin | Yr of isolation | Source | Phylogenetic group | Genome accession no. |
---|---|---|---|---|---|
N16961 | Bangladesh | 1971 | Clin | PG Vibrio cholerae O1 | AE003852.1 |
2010EL1786 | Haiti | 2010 | Clin | PG Vibrio cholerae O1 | NC_016445.1 |
EDC 721b | Bangladesh | 2015 | Env | PG Vibrio cholerae O1 | WYCI00000000 |
MJ1236 | Bangladesh | 1994 | Clin | PG Vibrio cholerae O1 | NC_012668.1 |
MO10 | India | 1992 | Clin | PG Vibrio cholerae O1 | AAKF00000000 |
2740-80 | USA | 1980 | Env | PG Vibrio cholerae O1 | AAUT00000000 |
BX330286 | Australia | 1985 | Env | PG Vibrio cholerae O1 | ACIA00000000 |
O395 | India | 1965 | Clin | PG Vibrio cholerae O1 | CP001235, CP001236 |
95412 | Mexico | 1997 | Env | PG Vibrio cholerae O1 | NZ_KB662090.1 |
V52 | Sudan | 1968 | Clin | PG Vibrio cholerae O37 | AAKJ00000000 |
2012EL1759 | Haiti | 2012 | Env | PG Vibrio cholerae O1 | JNEW01000000 |
2012Env9 | Haiti | 2012 | Env | Non-PG Vibrio cholerae O1 | JSTH00000000.1 |
MZO-03 | Bangladesh | 2001 | Clin | Non-PG Vibrio cholerae non-O1/O139 | AAUU00000000 |
12129 | Australia | 1985 | Env | Non-PG Vibrio cholerae O1 | ACFQ00000000 |
1587 | Peru | 2000 | Clin | Non-PG Vibrio cholerae non-O1/O139 | AAUR00000000 |
CP1035 | Mexico | 2004 | Clin | Non-PG Vibrio cholerae non-O1/O139 | AJRM00000000 |
EDC 689b | Bangladesh | 2015 | Env | Non-PG Vibrio cholerae non-O1/O139 | WYCR00000000 |
EDC 715b | Bangladesh | 2016 | Env | Non-PG Vibrio cholerae non-O1/O139 | WYCJ00000000 |
EDC 800b | Bangladesh | 2016 | Env | Non-PG Vibrio cholerae non-O1/O139 | WYCA00000000 |
EM1676A | Bangladesh | 2011 | Env | Non-PG Vibrio cholerae non-O1/O139 | APFY00000000.1 |
2012ENV-92 | Haiti | 2012 | Env | Non-PG Vibrio cholerae non-O1/O139 | JSTJ00000000.1 |
HE 48 | Haiti | 2010 | Env | Non-PG Vibrio cholerae non-O1/O139 | AFOR00000000.1 |
VCC19 | Brazil | 1994 | Env | Vibrio paracholerae | ATEV00000000.2 |
877163 | Bangladesh | 2002 | Env | Vibrio paracholerae | LBNV00000000 |
EDC 792b | Bangladesh | 2016 | Env | Vibrio paracholerae | WYCC00000000 |
EDC 690b | Bangladesh | 2015 | Env | Vibrio paracholerae | WUWI00000000 |
EDC 716b | Bangladesh | 2015 | Env | Vibrio paracholerae | WYBZ00000000 |
EDC 717b | Bangladesh | 2015 | Env | Vibrio paracholerae | WYBY00000000 |
HE09 | Haiti | 2010 | Env | Vibrio paracholerae | AFOP00000000.1 |
HE16 | Haiti | 2010 | Env | Vibrio paracholerae | ALEB00000000.1 |
CISM300506 | Mozambique | 2008 | Clin | Vibrio paracholerae | NZ_MWFL00000000 |
CISM1163068 | Mozambique | 2012 | Clin | Vibrio paracholerae | NZ_MWFO01000150 |
49093 DA89 | Thailand | 1992 | Clin | Vibrio paracholerae | JIDQ00000000.1 |
SIO | USA | 2003 | Waste water | Vibrio paracholerae | MIPJ01000076 |
2017V-1144 | USA | 2017 | Stool | Vibrio paracholerae | QKKV00000000 |
2014V-1107 | USA | 2014 | Stool | Vibrio paracholerae | QKKP00000000 |
2017V-1105 | USA | 2017 | Wound | Vibrio paracholerae | QKKT00000000 |
2016V-1114 | USA | 2016 | Stool | Vibrio paracholerae | QKKS00000000 |
2016V-1111 | USA | 2016 | Stool | Vibrio paracholerae | QKKR00000000 |
2017V-1176 | USA | 2017 | Animal feed | Vibrio paracholerae | QKKW00000000 |
2016V-1091 | USA | 2016 | Stool | Vibrio paracholerae | QKKQ00000000 |
2017V-1110 | USA | 2017 | Wound | Vibrio paracholerae | QKKU00000000 |
87395 | UK | UK | UK | Vibrio paracholerae | APFL00000000.1 |
07-2425 | UK | UK | UK | Vibrio paracholerae | QKKO00000000 |
NCTC30 | Egypt | 1916 | Clin | Vibrio paracholerae | LS997868.1 |
Clin, clinical; Env, environmental; PG, pandemic-generating; UK, unknown.
Genomes sequenced for this study.
A sister species to Vibrio cholerae?
Comparative genome analysis suggests that LB isolates could represent a new species, which would be the closest relative of V. cholerae described to date. Based on the genome sequences, G+C content of the strains belonging to the LB clade was 46% to 48.1%, falling within the known range of the genus Vibrio. Overall, a representative strain (EDC-792) of the clade shared 2,883 genes with type strains from both V. cholerae (N16961) and V. metoecus (OP3H) and 2,996 genes with V. cholerae alone (Fig. S1). The genomes of 22 LB isolates were compared with a set of V. cholerae strains containing the same number (n = 22) of representatives from both pandemic and nonpandemic lineages (Table 1). This comparison revealed that genetic distance between LB strains and V. cholerae fall below or at the threshold of the species cutoff values. Indeed, digital DNA-DNA hybridization (dDDH) values ranged from 82% to 100% within the LB clade and from 69% to 70% with V. cholerae, whereas average nucleotide identity (ANI) values ranged from 97% to 100% within the group and from 95% to 96% with V. cholerae strains, respectively. dDDH values are considered to be the gold standard for species designation, and a value of ≤70% presents as an indication that the tested organism belongs to a different species than the type strain(s) used as a reference (19). ANI has been proposed as an alternative genomic statistic to dDDH, and the cutoff values of 95% to 96% have been used for species delineation (20). In this case, all the strains from the LB clade had a dDDH value of ≤69% and an ANI value of ≤96% compared with the V. cholerae type strain N16961. In large-scale taxonomic studies, dDDH has been shown to overpower ANI values for the purpose of species designation (21, 22), suggesting that dDDH values should be given more importance when other evidence (phylogenetic, phenotypic, and ecological) support a novel species designation. Thus, according to the current species definition (23), the LB clade meets the genotypic criteria to qualify as a candidate for a novel species designation. It also meets the phylogenetic criteria, as it represents a well-defined, well-supported monophyletic clade in the core genome phylogeny (Fig. 4). Although the 16S rRNA gene has been used for taxonomic designations, it does not provide sufficient resolution to differentiate the closely related species in the V. cholerae clade (24, 25). For example, V. mimicus and V. metoecus, two species closely related to V. cholerae, show >99% 16S rRNA gene sequence identity with V. cholerae strains across the clade (7). We also performed a four-gene multilocus sequence analysis (MLSA), which has been used in studies describing several novel bacterial species in recent times (25, 26). Similar to the higher-resolution core genome phylogeny, the LB clade was well supported as a monophyletic cluster in the MLSA phylogenetic tree (Fig. S2). From an ecological perspective, lineages within the LB clade have shown distinct and cohesive abundance patterns in natural ecosystems of Dhaka, Bangladesh (11). Abundance and distribution of viuB alleles corresponding to the clade were found to be tightly correlated with each other, whereas they were negatively correlated with the abundance of the viuB allele corresponding to the dominant PG V. cholerae lineage (11).
To confirm that the LB clade represents a novel species, its phenotypic traits were compared to those from the most closely related species V. cholerae and V. metoecus. Their ability to catabolize 190 different carbon sources and respond to 96 chemicals and antimicrobials was determined using Biolog phenotypic microarray (PM) plates. Four LB strains were examined, two of environmental origin from Bangladesh (EDC 690, EDC792) and two from clinical sources in the United States (2016V-1111, 2016V-1091). These strains were compared with four V. cholerae (N16961, V52, YB3B05, YB8E08) and four V. metoecus strains (082459, OP6B, OP4B, OP3H). Although the LB clade isolates resembled V. cholerae in most biochemical and growth characteristics, they clearly differed for some phenotypic characteristics (Table 2). All four LB strains tested could utilize α-cyclodextrin as a sole carbon source, whereas none of the tested V. cholerae strains could. Cyclodextrin utilization requires a specific category of amylases, which has not been reported in V. cholerae so far (27). In silico analysis revealed that LB strains possess a gene cluster (genes 03367 to 03379 in the NCTC30 genome NZ_LS997867) containing homologs of genes encoding cyclomaltodextrin glucanotransferase (amyM), ABC transporter MalK (malK), glycosidase MalE (malE), glucosamine N-acetyltransferase, cyclodextin-specific porin (cycA), cyclodextrin binding protein (cycB), cyclodextrin transport system permease (malF), cyclodextrin transport system permease (malD/malG), and neopullulanase (nplT) (Table 3). Only two (9%) V. cholerae strains in our data set (n = 22) and a similar percentage in the NCBI database possessed this cluster, whereas 100% of the LB clade strains (n = 22) harbored it. This gene cluster might be associated with the cyclodextrin degradation phenotype, as reported previously (28). LB isolates also differed clearly from V. cholerae in the utilization of pectin and monomethyl succinate (Table 2). In contrast with V. cholerae, 75% (3 out of 4) of LB isolates tested were found to be lacking the ability to utilize d-mannose, l-aspartic acid, and citric acid (Table 2). d-Mannose was found to be readily utilized by both V. cholerae and V. metoecus tested in this study, and previous literature reported that ∼80% of V. cholerae is capable of utilizing this sugar (3). The gene cluster encompassing manP to manA (VC1820 to VC1827 in the N16961 genome AE003852.1), including the well-known mannose-6 phosphate isomerase (manA) gene required for this process (29), was present in all the tested V. cholerae (n = 22) and V. metoecus (n = 22) strains, whereas it was found in only ∼40% (9/22) of LB strains (Table 3). Notably, three isolates that were unable to utilize mannose lack the gene cluster manP-manA, whereas the only isolate (2016V-1111) of the four tested isolates from the LB clade that could utilize mannose (Table 2) possesses the gene cluster in its genome. We could not find the genetic basis for the other phenotypic differences between LB isolates and V. cholerae. LB strains are similar to V. cholerae in N-acetyl-d-galactosamine d-glucuronic acid utilization tests and acetoin production, which differentiate both of the groups from V. metoecus (7). Resistance to 96 drugs or metals was also tested at different concentrations, and V. cholerae and LB isolates showed similar profiles in most tests, although three chemicals elicited differential responses by the two species. LB isolates were resistant to cadmium chloride, sodium selenite, and dichlofluanid in contrast to the sensitivity of the V. cholerae strains toward those chemicals (Table S2). Thus, phylogenetic, genotypic, phenotypic, and ecological data support the designation of a novel species for the LB clade. Very recently, genome sequencing efforts of a historical collection of isolates from cholera or cholera-like diseases have identified a strain isolated from a soldier convalescent in Egypt during the first World War (in 1916) as a divergent V. cholerae (30). This NCTC30 strain actually belongs to the LB clade found in this study (Fig. 5). Interestingly, NCTC30 was initially designated “Vibrio paracholerae” by Gardner and Venkatraman in 1935, and the disease caused was described as choleraic and termed as “paracholera” (15). To honor its history, we propose the name Vibrio paracholerae sp. nov. (EDC-792T) for this novel species. Inclusion of this novel species would revise the phylogeny of the V. cholerae species complex, making V. paracholerae sp. nov. the sister species to V. cholerae and V. metoecus as the sister species to V. mimicus (Fig. 5). This improved description of the species complex containing pandemic V. cholerae will provide a framework for understanding the emergence and evolution of human-pathogenic vibrios from their environmental common ancestor.
TABLE 2.
Phenotypic testa |
Vibrio paracholerae sp. nov.b |
Vibrio cholerae
c
,
d
|
Vibrio metoecus
d
,
e
|
|||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | |
α-Cyclodextrin | + | + | + | + | − | − | − | − | + | + | − | + |
Pectin | + | + | + | + | − | − | − | − | + | + | + | + |
Monomethyl succinate | − | − | − | − | + | + | + | + | − | − | − | − |
d-Mannose | − | − | + | − | + | + | + | + | + | + | + | + |
l-Aspartic acid | − | − | + | − | + | + | + | + | + | + | + | + |
Citric acid | − | − | + | − | + | + | + | + | + | + | + | + |
α-Keto glutaric acid | − | − | + | + | + | + | + | + | − | − | + | − |
N-Acetyl-d-galactosamine | − | − | − | − | − | − | − | − | + | + | + | + |
d-Glucuronic acid | − | − | − | − | − | − | − | − | + | + | + | + |
Acetoin production | + | + | + | + | + | + | + | + | − | − | − | − |
+, growth/positive test result; −, no growth/negative test result.
Tested strains: 1, EDC 792; 2, EDC 690; 3, 2016V-1111; 4, 2016V-1091.
Tested strains: 5, N16961; 6, V52; 7, YB3B05; 8, YB8E08.
Results for V. cholerae and V. metoecus strains were obtained from Kirchberger et al. (7).
Tested strains: 9, 082459; 10, OP6B; 11, OP4B; 12, OP3H.
TABLE 3.
Genomic island/gene cluster | Genomic position in reference genomesa |
Present in % strainsb |
Putative function | Reference | |||
---|---|---|---|---|---|---|---|
N16961 (AE003852.1) locus | NCTC30 (LS997868.1) locus | VC (n = 22) | VP (n = 22) | VM (n = 22) | |||
RND efflux pump gene cluster | Absent | 818–823 | 0 | 90 | 0 | Resistance to antimicrobials and heavy metals | This study |
GI-66 | Absent | 1923–1927 | 0 | 100 | 0 | Iron regulation | 43 |
Cyclo-maltodextrin operon | Absent | 3367–3375 | 9 | 100 | 68 | Cyclodextrin utilization | 27 |
VCA1102-1111 (N16961) | VCA_1102–VCA_1111 | Absent | 100 | 0 | 100 | Fatty acid biosynthesis, heme biliverdin, thermostable hemolysin | This study |
Tor operon | VC_1692–VC_1694, VC_1719–VC_1720 | Absent | 100 | 0 | 100 | Virulence gene regulation | 31 |
Glutathione-regulated potassium pump | VC_2606–VC_2607 | Absent | 100 | 0 | 100 | Potassium regulation | This study |
Beta lactamase | Absent | 3210 | 0 | 60 | 0 | Resistance to β-lactams | 30 |
manA-manP | VC_1820–VC_1827 | 1534–1543 | 68 | 22 | 100 | Utilization of mannose | 29 |
Reference genomes N16961 (V. cholerae) and NCTC30 (V. paracholerae sp. nov.) were used for determining locus positions of the gene clusters.
VC, Vibrio cholerae; VM, Vibrio metoecus; VP, Vibrio paracholerae sp. nov.
A potential threat to humans?
To be a successful disease-causing agent to humans, a bacterial pathogen of aquatic origin needs to have the ability to survive in the environment and colonize the human body. In Dhaka, where cholera is endemic, V. paracholerae sp. nov. has been found to exist abundantly in local water reservoirs. In one particular site, the number even surpassed that of PG V. cholerae, which was otherwise the most predominant lineage found in Dhaka (11). That site (Kamrangirchar) happened to be the most densely populated area among those sampled, indicating a possible link between human population and the prevalence of V. paracholerae sp. nov. Overall, ecological analysis of the population in Dhaka suggests that the environmental abundance and distribution of the lineages corresponding to V. paracholerae sp. nov. strains are correlated with human population density (11). Although data from other locations are required to confirm this correlation, this observation raises the possibility of adaptation to the human gut as an alternate niche and an important factor in the ecology of this novel species. Association of the members of the species with cholera-like cases, such as in the case of the historical strain NCTC30, and isolation from clinical/human stool samples from different parts of the world (Table 1; Table S4) would suggest their pathogenic potential to humans. To assess this potential, V. paracholerae sp. nov. strains were screened for the presence of known virulence-related genes and islands often found in V. cholerae (Table S3).
V. paracholerae sp. nov. strains lack CTX, VPI-1, and VPI-2 (Table S3; Fig. S3), three major elements known to be essential for V. cholerae to cause cholera (2). They also lack a cluster of genes (VC1692, VC1694, VC1719, and VC1720 in the N16961 genome) encoding proteins for the “Tor operon” required for trimethylamine N-oxide respiration in V. cholerae. The genes in this operon have been shown to be crucial for cholera toxin production, cytotoxicity, and intestinal colonization of V. cholerae in an infant mouse model (31). The Tor operon was found in 100% of V. cholerae (n = 22) and V. metoecus (n = 4) strains in our data set, which indicates that it was likely lost in the V. paracholerae sp. nov. phylogenetic branch. All the V. paracholerae sp. nov. strains in our data set possessed the RTX toxin gene cluster (Table S3), a virulence factor for V. cholerae known to have a role in interactions with eukaryotes (32). Interestingly, four V. paracholerae sp. nov. strains (22%) (including NCTC30) possess type three secretion system (T3SS) genes (Table S3), an established virulence factor for nonpandemic V. cholerae (33). The historical strain NCTC30, which was isolated from a choleraic patient, contains a T3SS genetically more similar to a V. parahaemolyticus T3SS than that of one found in a subset of V. cholerae (acquired within a VPI-2 insertion site locus of PG V. cholerae) (30). The region of V. paracholerae sp. nov. strains containing this T3SS is 97% identical at the nucleotide sequence level to the T3SS island found in V. parahaemolyticus, highlighting a recent acquisition from this human-pathogenic species (Fig. S4). However, NCTC30 does not have the mobile integrative conjugative element (ICE) SXT/RC39, whereas four V. paracholerae sp. nov. strains contain this element in their genomes (Table S3). SXT/RC39-related ICEs are found in most PG V. cholerae strains since 2001 and are believed to be involved in improved fitness in 7th pandemic El Tor V. cholerae (34, 35). ICEs found in V. paracholerae sp. nov. strains showed 95.7% to 96.4% nucleotide sequence identity in the conserved homologous regions to the first reported SXT element in V. cholerae SXTMO10, originally detected in V. cholerae O139 strain MO10 (36). However, ICEs found in V. paracholerae sp. nov. strains were more related to ICEVchMex1, which was first reported to be present in an environmental V. cholerae non-O1/O139 strain (36) based on the nucleotide sequence of the integrase (int) gene (Fig. S7). Additionally, 50% of the V. paracholerae sp. nov. strains also contained genes for the recently discovered cholix toxin, thought to be an important virulence factor for V. cholerae (37). Recently, a novel island has been found in the VPI-1 insertion site, designated GIVchS12, which contains CRISPR-Cas and type 6 secretion system (T6SS) modules (38). CRISPR-Cas is a defense mechanism that bacteria can utilize against unwanted lateral gene transfer by recognizing foreign DNA and cleaving it. CRISPR-Cas modules are usually unevenly distributed across taxa, but can potentially provide a fitness advantage to the bacteria in a mixed population (38, 39). Seven out of 22 strains (∼31%) contain a version of the GIVchS12 island in the region to the VPI-1 insertion site of PG V. cholerae (Fig. S5). The T6SS operon of Vibrio cholerae contains diverse arrays of toxic effector and cognate immunity genes, which are thought to play an important role in the environmental lifestyle and adaptation to the human host (40). The functional T6SS genetic element corresponds to three loci across the V. cholerae genome, termed aux-1, aux-2, and large cluster (41). Like V. cholerae and its close relatives V. mimicus and V. metoecus (40), V. paracholerae sp. nov. possesses all three chromosomally encoded T6SS loci and shares its effector and immunity gene pool with these species (Table S5). However, for a similar phylogenetic diversity, the diversity of the T6SS effector-immunity loci is much less in V. paracholerae sp. nov. than that in V. cholerae (40, 42). This surprising lack of diversity could represent an ecological adaption in V. paracholerae, which could be deriving a fitness advantage in specific niches from this restricted set of T6SS effector and immunity proteins.
Apart from the known virulence genes usually found in V. cholerae, gene content analysis revealed a few species-specific genetic traits in V. paracholerae sp. nov. Two genes were present in all 22 V. paracholerae sp. nov. strains, with no homolog found in any V. cholerae strains. These two genes encode a LysR family transcriptional regulator (WP_001924807.1) and an HAD-IB family hydrolase (WP_071179638.1). Both of these genes are part of a previously reported genomic island (GI-66) found in Vibrio albensis (43). This genomic island contains iron-related regulatory genes that can be significant in the regulation of iron scavenging in this group of organisms. Iron acquisition is thought to be an important aspect for the regulation of virulence as well as host selectivity/specificity (44). Most (90%) of V. paracholerae sp. nov. strains also harbored a novel resistance-nodulation-division (RND) efflux pump (Table 3) thought to be critical for intrinsic and induced antimicrobial resistance, virulence gene expression, colonization in an animal host, and environmental regulation of stress response (45). Efflux pumps have been proposed to be important for expelling bile out of the cell, and the resulting bile resistance would be key to overcoming this challenge inside the human gut (46). RND efflux pumps have specifically been found to confer increased bile resistance in other Gram-negative bacteria (47). The novel RND gene cluster is absent in both V. cholerae and V. metoecus, but homologs have been found in the halophilic bacteria V. cincinnatiensis (48) and a bile-associated isolate of V. fluvialis (49). Other than efflux pumps, cholera toxin transcriptional activator (ToxR) and outer membrane protein TolC have also been proposed to be crucial for bile resistance, and, like V. cholerae strains, all the V. paracholerae sp. nov. strains possess both genes. All these factors make V. paracholerae sp. nov. a potential candidate for a species associated with the human gut microbial population and underscore the importance of studying their biology in greater detail.
Interaction of Vibrio paracholerae sp. nov. with pandemic Vibrio cholerae could have impacted the ecology and evolution of both species.
Horizontal gene transfer (HGT) among species sharing an ecological niche can have a major impact on their evolution (50). As V. paracholerae sp. nov. coexists with V. cholerae in natural ecosystems (at least in Dhaka), it is expected that HGT could take place between these two groups. To assess the propensity of interspecies HGT, potential gene transfer events between two groups (V. cholerae and V. paracholerae sp. nov.) were inferred based on phylogenetic congruence of individual genes. Maximum likelihood (ML) trees were constructed for each of the core and accessory gene families present in at least two strains from each group. A gene transfer was hypothesized if a member of a group clustered with members of the other group in a clade, and the gene tree could not be partitioned into perfect clades, which must consist of all members from the same group and only of that group (8, 51). In our groups of 22 V. cholerae and 22 V. paracholerae sp. nov. strains, 216 HGT events were hypothesized, involving 82 gene families from V. cholerae to V. paracholerae sp. nov., but only 62 events from V. paracholerae sp. nov. to V. cholerae involving 33 gene families. All of the core genes transferred from V. paracholerae sp. nov. to V. cholerae were acquired by strains outside the PG group.
In the case of accessory genes, we could infer 82 potential transfer events from V. cholerae to V. paracholerae sp. nov. and 54 events from V. paracholerae sp. nov. to V. cholerae. Only four events involved strains belonging to the PG clade. Thus, gene transfer directionality was biased from V. cholerae to V. paracholerae sp. nov., V. paracholerae sp. nov. being the recipient of HGT in most cases. A lower rate of HGT toward V. cholerae was previously reported with the cooccurring V. metoecus, which has a lower abundance in the environment (8). This gene transfer bias could be attributed to the dominance of V. cholerae in an area where cholera is endemic, as it is generally more abundant than V. paracholerae sp. nov. and, therefore, more likely to be a DNA donor (12). Among the accessory genes transferred from V. cholerae to V. paracholerae sp. nov., there were proteins related to O antigen synthesis, the type six secretion system, iron regulation, chaperone and multidrug resistance, and putative metabolic functions. There are examples of a single gene or even a small set of nucleotides within a gene acquired via HGT impacting the ecology and pathogenicity of Vibrio species (50, 52). Thus, the HGT events in V. paracholerae sp. nov. underscore the possibility for species coexisting with PG V. cholerae to acquire virulence and fitness-related genes to become pathogenic to humans and/or develop novel ecological traits.
Gene transfer events have led to the rise of virulent V. cholerae before, the most striking example being the rise of V. cholerae O139. The latter emerged in Bangladesh and India in 1992 and has been hypothesized to have originated via genetic recombination of the O antigen region from a serogroup O22 strain to a serogroup O1 El Tor strain (53). After its emergence, V. cholerae O139 remained an important cause of widespread cholera epidemics in that region until 2004, along with V. cholerae O1 El Tor (53). Interestingly, among the accessory genes inferred as subject to HGT from V. paracholerae sp. nov. to V. cholerae, genes encoding UDP-glucose 4-epimerase (EC 5.1.3.2) and UDP-N-acetylgalactosaminyltransferase appear to be a transfer in an ancestor of O139 strain MO10 from the V. paracholerae sp. nov. clade (Fig. 6). Both of these genes are involved in O antigen biosynthesis and could be of significance in the emergence and evolution of V. cholerae O139 as a human pathogen and pandemic agent. Even though it will require further investigation to find out how and to what extent V. paracholerae sp. nov. as a species contributed to the emergence and evolution of V. cholerae O139, these transfer events could be considered examples of how interaction of this close relative with V. cholerae could impact the epidemiology of cholera. The O antigen region is a highly variable region in terms of gene content. Previously, O22 strains have been hypothesized as the source of the O139 regions not found is V. cholerae O1 El Tor. However, there are parts of the O antigen region in V. cholerae reference O139 strain MO10 (designated region IV by Yamasaki et al.) for which the origin could not be attributed to either O22 or V. cholerae O1 (54). Recently, closely related homologs of genes in region IV of V. cholerae O139 strains were found in V. fujianensis (55). This indicates that the origin of the MO10 O antigen region is most likely a result of multiple recombination events from various origins, forming the mosaic O antigen-encoding region in V. cholerae O139. V. paracholerae sp. nov. strains contain at least nine genes found in the V. cholerae O139 reference strain MO10 that are not found in V. cholerae O1 (Fig. S6). These observations suggest that V. paracholerae sp. nov. could have been an important player of the gene transfer dynamics that gave rise to the second pathogenic serogroup of V. cholerae.
Conclusions.
Culture-independent analysis below the species level in inland locations where cholera is endemic and in coastal locations where cholera is not endemic in distinct geographic settings identified differences in the population structures present in these environments. It revealed that human influences are likely to be a major factor shaping communities of that species in areas where cholera is endemic. In urban tropical Dhaka, found in inland Bangladesh, PG V. cholerae was abundant and continuously present, but was accompanied by members of a related, but phylogenetically distinct, clade, which could represent a novel species. The abundance of this putative species, “Vibrio paracholerae sp. nov.,” in Dhaka and its absence from Oyster Pond on the United States east coast indicates that it is not a ubiquitous member of aquatic communities. In addition to those identified from the COVIS program in the United States and from Mozambique and Thailand, a number of strains of Vibrio spp. have been very recently isolated from clinical cases in China and Korea, which would belong to this species according to the genome sequence similarities they share with strains analyzed here (Table S4). An indirect association of their abundance with human population density indicates that they could be adapted to the human gut in areas where cholera is endemic (11). They could therefore occasionally become pathogenic by acquiring pathogenicity gene clusters or cause opportunistic infections in vulnerable individuals. Its history, biology, genetic traits, and coexistence with a pathogenic sister species make it a risk as an emerging human pathogen. Its potential contribution to the evolution of new pathogenic variants of V. cholerae (such as PG lineage O139) and likely influence on their population structure highlights the importance of studying this novel species in the context of a globally distributed infectious disease.
Taxonomy.
Vibrio paracholerae (pa.ra.chol.er.ae. Gr. prep. para, alongside; L. gen. f. n. cholerae, bilious disease, referring to the isolation of the type strain alongside Vibrio cholerae, the causative agent of cholera). A Gram-negative, oxidase-positive, curved rod-shaped bacterium, roughly 1.25 to 2 mm in length and 0.4 mm in width. Exhibits motility by means of a single polar flagellum. Growth is observed at 30°C with salt concentrations in the range of 0% to 6.0% NaCl; no growth occurs in the presence of 10% NaCl. The ability to utilize α-cyclodextrin and pectin as well as the lack of ability to utilize monomethyl succinate differentiates the species from its closest relative V. cholerae (99% identity of the 16S rRNA gene). Forms yellow circular colonies on thiosulfate citrate bile salts sucrose (TCBS) agar (similar to those formed by V. cholerae) and circular colonies of creamy white color on tryptic soy broth (TSB) agar. Positive for carbon utilization from d-glucose, d-fructose, sucrose, maltose, d-galactose, maltotriose, d-mannitol, d-trehalose, d-glucose-6-phosphate, N-acetyl-d-glucosamine, glycerol, succinic acid, l-lactic acid, l-glutamic acid, fumaric acid, acetic acid, l-proline, d-alanine, l-asparagine, 2-deoxyadenosine, adenosine, inosine and l-serine, dextrin, gelatin, glycogen, d-glucosamine, and d-lactic acid methyl ester. Variation of response observed between strains in utilization of d-mannose, d-cellobiose, d-fructose-6-phosphate, d-psicose, α-d-lactose, l-aspartic acid, d-glucuronic acid, d-gluconic acid, d,l-α-glycerol phosphate, d,l-malic acid, d-ribose, Tween-20, thymidine, α-keto-glutaric acid, Tween-40, Tween-80, α-keto-butyric acid, uridine, l-glutamine, α-hydroxy-glutaric acid-γ-lactone, β-methyl-d-glucoside, citric acid, l-threonine, l-alanine, l-alanyl-glycine, glycine, histidine, methyl pyruvate, d-malic acid, glycyl-l-proline, and pyruvic acid. The proposed type strain EDC-792 was isolated from environmental water in Dhaka, Bangladesh, in 2016; the strain displays all the properties given above for the species. The proposed type strain (EDC-792 = DSM 112460T = CCUG 75319T) shows 95.6% ANI and 69% dDDH, respectively, to the type strain of the closest sister species, Vibrio cholerae (N16961). The oldest isolate of the species, NCTC30, described by Dorman et al. (30), is likely to have some phenotype-changing mutations that urged for a new type strain with typical properties present in all the known strains of the species.
MATERIALS AND METHODS
Sample collection and processing.
Environmental water samples were collected every 2 weeks between June 2015 and March 2016 from seven points along the water bodies surrounding Dhaka city, which is located in the central part of Bangladesh (23.8103°N, 90.4125°E). One-time water samples were collected from two natural coastal water bodies in Mathbaria (22.2920°N, 89.9580°E) and Kuakata (21.8210°N, 90.1214°E), which are geographically adjacent to the coast of the Bay of Bengal and approximately 200 km and 250 km southwest of Dhaka, respectively. One liter of water was collected from each site in sterile Nalgene bottles placed in an insulated plastic box and transported at ambient air temperature from the site of collection to the central laboratory of the International Center for Diarrheal Disease Research, Bangladesh (icddr,b), in Dhaka. Oyster Pond sampling was performed at the same spots and at approximately the same time of day in the pond and the nearby lagoon connected to the ocean in monthly intervals from June to October in 2008 and 2009 as described in Kirchberger et al. (10). Fifty liters of water was filtered through 0.22-μm Sterivex filters (Mo Bio Laboratories Inc., Carlsbad, CA, USA) for the collection of biomass. Genomic DNA was extracted from the biomass using the protocol described by Wright et al. (56).
Isolation and identification of isolates.
Bacterial isolates were recovered as described elsewhere (57). Briefly, water samples were enriched in alkaline peptone water (APW; Difco Laboratories, Detroit, MI) at 37°C for 6 to 8 h before plating. About 5 μl of enriched APW broth was streaked using an inoculating loop onto both thiosulfate citrate bile salts sucrose (TCBS) and taurocholate tellurite gelatin agar (TTGA) and incubated at 37°C for 18 to 24 h. Colonies with the characteristic appearance of V. cholerae were confirmed by standard biochemical and serological tests (and, in the case of the latter, by testing with polyvalent and monoclonal antibodies specific for V. cholerae O1 or O139) and, finally, by PCR.
Phenotypic tests.
For the comparison of phenotypic characteristics, Biolog phenotypic microarray plates PM1, PM2A, PM14A, PM16A, and PM18C were used (58). Overnight-cultured bacterial colonies were inoculated into Biolog IF-0a base medium to reach 85% turbidity, and a 1:200 dilution was aliquoted into IF-10b medium supplemented with dye mix A, as indicated by the manufacturer’s instructions. The mixture was then added into wells of Biolog PM1 and PM2A plates containing various carbon sources and PM14A, PM16A, and PM18C plates containing substrates of various antimicrobials and heavy metal salts. The incubation and monitoring of the growth of inocula were done for 96 h in the presence of a sole carbon source or the heavy metals; growth causes reduction of the dye, resulting in purple color formation.
Quantitative PCR.
Estimation of Vibrio cholerae number was done using qPCR following the protocol described elsewhere (12). The target probe for viuB, 5′-/56-FAM/TCATTTGGC/ZEN/CAGAGCATAAACCGGT/3IABkFQ/-3′, the forward primer, 5′-TCGGTATTGTCTAACGGTAT-3′, and the reverse primer, 5′-CGATTCGTGAGGGTGATA-3′, were used. The volume of the PCR reaction was 10 μl and contained 5 μl of 2× Dynamite qPCR master mix (MBSU, University of Alberta, Edmonton, Canada), 1 μl each of 500 nM primer-250 nM probe mix, 1 μl of molecular-grade water, and 2 μl of DNA template. Real-time quantitative PCR was performed under the following conditions: initial primer activation at 95°C for 2 min followed by 40 cycles of 95°C for 15 s and 60°C for 1 min in an Illumina Eco real-time PCR system.
Amplicon sequencing.
Amplicon sequencing of the viuB gene was performed following the method described elsewhere (9). To amplify 293 bp of the viuB region from DNA extracted from water samples, a touchdown PCR was performed using 0.5 μl each of 10 pmol forward and reverse primers (for viuB: viuB2f 5′-CCGTTAGACAATACCGAGCAC-3′ and viuB5r 5′-TTAGGATCGCGCACTAACCAC-3′), 0.4 μl of 10 mM dNTP mix (ThermoFisher), 0.4 μl of Phire Hot Start II DNA polymerase (ThermoFisher), 0.5 μl of molecular biology-grade bovine serum albumin (20 mg/ml, New England Biolabs), 5 μl of 5× Phire buffer, and 2 μl of template DNA. The PCR was performed as follows: initial denaturation at 98°C for 4 min, followed by 10 cycles of denaturation at 98°C for 10 s, annealing at 60°C for 6 s (reduced by 1°C per cycle), and extension at 72°C for 1 s, followed by 23 cycles of denaturation at 98°C for 10 s, annealing at 50°C for 6 s (reduced by 1°C per cycle), and extension at 72°C for 1 s, and a final extension at 72°C for 1 min. In preparation for sequencing, dual-indexed sequences were tagged using indices developed by Kozich et al. (59) as follows: 2 μl of preceding viuB PCR amplification reaction was used as the template for tagging PCR; initial denaturation at 98°C for 30 s, followed by two cycles of denaturation at 98°C for 10 s, annealing at 55°C for 6 s, and extension at 72°C for 1 s, and final extension at 72°C for 1 min. Eight tagging reactions were performed for each sample, and products were pooled and run on a 2% agarose gel in 1× Tris-acetate-EDTA buffer. The appropriate bands (428 bp) were cut out of the gel. PCR products were then purified using the Wizard SV gel and PCR clean-up system (Promega) according to the manufacturer’s instructions. Concentration of clean PCR products was then measured using a Qubit fluorometer (ThermoFisher) with a Qubit double-stranded DNA high-sensitivity assay kit (ThermoFisher), and products were pooled in equal concentrations (>10 ng/μl). The pooled samples were then concentrated using a Wizard SV gel and PCR clean-up system (Promega). Quality control of the pooled and concentrated sample was done using an Agilent 2100 Bioanalyzer. Sequencing was performed using Illumina MiSeq technology with a v3 (600 cycles) reagent kit.
Amplicon sequence analysis.
Demultiplexed raw reads from the sequencing run were processed in R (60) using the DADA2 pipeline 1.4.0 (61). First, 10 bp of forward and reverse reads were trimmed, and reads with a maximum expected error rate of >1 were discarded. Chimera detection implemented in DADA2 was then performed on pooled samples. To account for the possibility of real chimeras between protein-coding genes from closely related organisms (due to recombination or homoplastic mutations), chimeras were compared with a reference data set of viuB alleles found in 782 sequenced V. cholerae genomes (obtained from GenBank). Only viuB alleles composed of more than 1,000 reads found in multiple samples (with an average of 100,000 reads per sample) were considered for further analysis. Samples were rarefied to the level of the sample with the lowest reads using mothur 1.39.5 (62), and further analysis was performed in R, with statistical tests and distance calculations performed using the VEGAN 2.4-6 package (63). Bray-Curtis similarity was calculated based on relative read abundance of each allele in different samples in the Primer-E software suite and used for similarity percentage (SIMPER) and nonmetric multidimensional scaling (NMDS) analysis.
Whole-genome sequencing and core genome phylogeny.
The genomes of 23 strains from Dhaka belonging to various viuB genotypes were chosen for whole-genome sequencing as described in Orata et al. (8). Sequencing libraries were prepared from genomic DNA using the Nextera XT DNA library preparation kit (Illumina, San Diego, CA, USA) and sequenced using Illumina MiSeq sequencing platforms (2 × 250-bp paired-end reads). Quality control and de novo assembly of the reads were done using default parameters in CLC Genomics workbench 7 (Qiagen). Whole-genome alignment was performed using Mugsy v1.2.3 (64) with default parameters, and a maximum likelihood tree was built from this alignment using RaxML v8 (65) under the general time reversible (GTR) gamma model with 100 bootstrap replicates. Additional V. cholerae genomes were downloaded from GenBank. The maximum likelihood phylogenomic tree was constructed from the alignment of locally colinear blocks (2,094,734 bp) using a GTR gamma substitution model with 100 bootstrap replicates.
Comparative genomic analysis.
The genome sequences were annotated with RAST 2.0 (66). Genomic distances were calculated in Geneious (67). Core and accessory genes were determined with BPGA, finding orthologous protein-coding genes clustered into families based on a 30% amino acid sequence identity (68). Group-specific genes were clustered using a custom-made Python program. BLAST atlas of the genomes and genomic islands was carried out using GView server (https://server.gview.ca/). Effector and immunity genes in type 6 secretion system loci were typed as previously described in Kirchberger et al. (42).
Data availability.
All previously sequenced bacterial genomes and genomes sequenced in this study are available from the NCBI GenBank and the PubMLST databases. Table 1 and Table S5 in the supplemental material list all the accession numbers for the genomes used in this study. In addition, all genome sequences and relevant genomic and epidemiological data related to the isolates used are publicly available on PubMLST (https://pubmlst.org/vcholerae/).
ACKNOWLEDGMENTS
M.T.I., T.N., M.A., and Y.F.B. designed the experiments. M.T.I., T.N., and F-T.J. performed data collection. M.T.I., P.C.K., K.Y.H.L., F.D.O., and N.A.S.H. performed data analysis. M.T.I., K.Y.H.L., and F.D.O. performed bioinformatics analyses. M.S.I. performed genome sequencing and supplied meta data of isolates from the CDC. M.A. and C.L.T. provided isolates used in this study. M.T.I. wrote the manuscript. Y.F.B. and C.L.T. provided critical reviews on the manuscript. This work was supported by the Natural Sciences and Engineering Research Council (NSERC) of Canada (to Y.F.B.), the Integrated Microbial Biodiversity program of the Canadian Institute for Advanced Research (to Y.F.B.), federal appropriations to the Centers for Disease Control and Prevention through the Advanced Molecular Detection Initiative (to C.L.T.), and graduate student scholarships from Alberta Innovates-Technology Futures (to M.T.I., K.Y.H.L., F.D.O., and T.N.), NSERC (to K.Y.H.L. and T.N.), the University of Alberta Faculty of Graduate Studies and Research (Queen Elizabeth II Graduate Scholarship to K.Y.H.L. and T.N., Alberta Graduate Excellence Scholarship to N.A.S.H.), Alberta Advanced Education (to N.A.S.H.), and the Bank of Montréal Financial Group (to F.D.O.). M.A. from icddr,b acknowledges the governments of Bangladesh, Canada, Sweden, and the United Kingdom for providing core/unrestricted support. The funders had no role in study design, data collection, and interpretation or the decision to submit the work for publication. The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.
Footnotes
Supplemental material is available online only.
Contributor Information
Yann F. Boucher, Email: yboucher@ualberta.ca.
Knut Rudi, Norwegian University of Life Sciences.
REFERENCES
- 1.Ali M, Lopez AL, Ae You Y, Eun Kim Y, Sah B, Maskery B, Clemens J. 2012. The global burden of cholera. Bull World Health Organ 90:209–218. 10.2471/BLT.11.093427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kaper JB, MorrisJG, Jr., Levine MM. 1995. Cholera. Clin Microbiol Rev 8:48–86. 10.1128/CMR.8.1.48. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Davis BR, Fanning GR, Madden JM, Steigerwalt AG, Bradford HB, Smith HL, Brenner DJ. 1981. Characterization of biochemically atypical Vibrio cholerae strains and designation of a new pathogenic species, Vibrio mimicus. J Clin Microbiol 14:631–639. 10.1128/jcm.14.6.631-639.1981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hasan NA, Grim CJ, Haley BJ, Chun J, Alam M, Taviani E, Hoq M, Munk AC, Saunders E, Brettin TS, Bruce DC, Challacombe JF, Detter JC, Han CS, Xie G, Nair GB, Huq A, Colwell RR. 2010. Comparative genomics of clinical and environmental Vibrio mimicus. Proc Natl Acad Sci U S A 107:21134–21139. 10.1073/pnas.1013825107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Neogi SB, Chowdhury N, Awasthi SP, Asakura M, Okuno K, Mahmud ZH, Islam MS, Hinenoya A, Nair GB, Yamasaki S. 2019. Novel cholera toxin variant and ToxT regulon in environmental Vibrio mimicus isolates: potential resources for the evolution of Vibrio cholerae hybrid strains. Appl Environ Microbiol 85:e01977-18. 10.1128/AEM.01977-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Haley BJ, Grim CJ, Hasan NA, Choi SY, Chun J, Brettin TS, Bruce DC, Challacombe JF, Detter JC, Han CS, Huq A, Colwell RR. 2010. Comparative genomic analysis reveals evidence of two novel Vibrio species closely related to V. cholerae. BMC Microbiol 10:154. 10.1186/1471-2180-10-154. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kirchberger PC, Turnsek M, Hunt DE, Haley BJ, Colwell RR, Polz MF, Tarr CL, Boucher Y. 2014. Vibrio metoecus sp. nov., a close relative of Vibrio cholerae isolated from coastal brackish ponds and clinical specimens. Int J Syst Evol Microbiol 64:3208–3214. 10.1099/ijs.0.060145-0. [DOI] [PubMed] [Google Scholar]
- 8.Orata FD, Kirchberger PC, Méheust R, Jed Barlow E, Tarr CL, Boucher Y. 2015. The dynamics of genetic interactions between Vibrio metoecus and Vibrio cholerae, two close relatives co-occurring in the environment. Genome Biol Evol 7:2941–2954. 10.1093/gbe/evv193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Kirchberger PC, Orata FD, Nasreen T, Kauffman KM, Tarr CL, Case RJ, Polz MF, Boucher YF. 2020. Culture-independent tracking of Vibrio cholerae lineages reveals complex spatiotemporal dynamics in a natural population. Env Microbiol 22:4244–4256. 10.1111/1462-2920.14921. [DOI] [PubMed] [Google Scholar]
- 10.Kirchberger PC, Orata FD, Barlow EJ, Kauffman KM, Case RJ, Polz MF, Boucher Y. 2016. A small number of phylogenetically distinct clonal complexes dominate a coastal Vibrio cholerae population. Appl Environ Microbiol 82:5576–5586. 10.1128/AEM.01177-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Islam MT, Nasreen T, Liang KYH, Johura F-T, Kirchberger PC, Sultana M, Case RJ, Alam M, Boucher YF. Dynamic subspecies population structure of Vibrio cholerae in Dhaka, Bangladesh. Research Square https://www.researchsquare.com/article/rs-364004/v1. [DOI] [PubMed]
- 12.Nasreen T, Hussain NAS, Islam MT, Orata FD, Kirchberger PC, Case RJ, Alam M, Yanow SK, Boucher YF. 2020. Simultaneous quantification of Vibrio metoecus and Vibrio cholerae with its O1 serogroup and toxigenic subpopulations in environmental reservoirs. Pathogens 9:1053. 10.3390/pathogens9121053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Rafique R, Rashid MU, Monira S, Rahman Z, Mahmud MT, Mustafiz M, Saif-Ur-Rahman KM, Johura FT, Islam S, Parvin T, Bhuyian MS, Sharif MB, Rahman SR, Sack DA, Sack RB, George CM, Alam M. 2016. Transmission of infectious Vibrio cholerae through drinking water among the household contacts of cholera patients (CHoBI7 Trial). Front Microbiol 7:1635. 10.3389/fmicb.2016.01635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Shapiro BJ, Levade I, Kovacikova G, Taylor RK, Almagro-Moreno S. 2016. Origins of pandemic Vibrio cholerae from environmental gene pools. Nat Microbiol 2:16240. 10.1038/nmicrobiol.2016.240. [DOI] [PubMed] [Google Scholar]
- 15.Gardner AD, Venkatraman KV. 1935. The antigens of the cholera group of vibrios. J Hyg (Lond) 35:262–282. 10.1017/s0022172400032265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Bishop-Lilly KA, Johnson SL, Verratti K, Luu T, Khiani A, Awosika J, Mokashi VP, Chain PS, Sozhamannan S. 2014. Genome sequencing of 15 clinical Vibrio isolates, including 13 non-O1/non-O139 serogroup strains. Genome Announc 2:e00893-14. 10.1128/genomeA.00893-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Islam MT, Liang K, Im MS, Winkjer J, Busby S, Tarr CL, Boucher Y. 2018. Draft genome sequences of nine Vibrio sp. isolates from across the United States closely related to Vibrio cholerae. Microbiol Resour Announc 7:e00965-18. 10.1128/MRA.00965-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Garrine M, Mandomando I, Vubil D, Nhampossa T, Acacio S, Li S, Paulson JN, Almeida M, Domman D, Thomson NR, Alonso P, Stine OC. 2017. Minimal genetic change in Vibrio cholerae in Mozambique over time: multilocus variable number tandem repeat analysis and whole genome sequencing. PLoS Negl Trop Dis 11:e0005671. 10.1371/journal.pntd.0005671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Auch AF, von Jan M, Klenk HP, Goker M. 2010. Digital DNA-DNA hybridization for microbial species delineation by means of genome-to-genome sequence comparison. Stand Genomic Sci 2:117–134. 10.4056/sigs.531120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ciufo S, Kannan S, Sharma S, Badretdin A, Clark K, Turner S, Brover S, Schoch CL, Kimchi A, DiCuccio M. 2018. Using average nucleotide identity to improve taxonomic assignments in prokaryotic genomes at the NCBI. Int J Syst Evol Microbiol 68:2386–2392. 10.1099/ijsem.0.002809. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Meier-Kolthoff JP, Klenk H-P, Göker M. 2014. Taxonomic use of DNA G+C content and DNA–DNA hybridization in the genomic age. Int J Syst Evol Microbiol 64:352–356. 10.1099/ijs.0.056994-0. [DOI] [PubMed] [Google Scholar]
- 22.Jin Y, Zhou J, Zhou J, Hu M, Zhang Q, Kong N, Ren H, Liang L, Yue J. 2020. Genome-based classification of Burkholderia cepacia complex provides new insight into its taxonomic status. Biol Direct 15:6. 10.1186/s13062-020-0258-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Richter M, Rossello-Mora R. 2009. Shifting the genomic gold standard for the prokaryotic species definition. Proc Natl Acad Sci U S A 106:19126–19131. 10.1073/pnas.0906412106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Boyd EF, Carpenter MR, Chowdhury N, Cohen AL, Haines-Menges BL, Kalburge SS, Kingston JJ, Lubin JB, Ongagna-Yhombi SY, Whitaker WB. 2015. Post-genomic analysis of members of the family Vibrionaceae. Microbiol Spectr 3 10.1128/microbiolspec.VE-0009-2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Thompson FL, Gevers D, Thompson CC, Dawyndt P, Naser S, Hoste B, Munn CB, Swings J. 2005. Phylogeny and molecular identification of vibrios on the basis of multilocus sequence analysis. Appl Environ Microbiol 71:5107–5115. 10.1128/AEM.71.9.5107-5115.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Orata FD, Xu Y, Gladney LM, Rishishwar L, Case RJ, Boucher Y, Jordan IK, Tarr CL. 2016. Characterization of clinical and environmental isolates of Vibrio cidicii sp. nov., a close relative of Vibrio navarrensis. Int J Syst Evol Microbiol 66:4148–4155. 10.1099/ijsem.0.001327. [DOI] [PubMed] [Google Scholar]
- 27.Fiedler G, Pajatsch M, Böck A. 1996. Genetics of a novel starch utilisation pathway present in Klebsiella oxytoca. J Mol Biol 256:279–291. 10.1006/jmbi.1996.0085. [DOI] [PubMed] [Google Scholar]
- 28.Davis GH, Park RW. 1962. A taxonomic study of certain bacteria currently classified as Vibrio species. J Gen Microbiol 27:101–119. 10.1099/00221287-27-1-101. [DOI] [PubMed] [Google Scholar]
- 29.Sun T, Altenbuchner J. 2010. Characterization of a mannose utilization system in Bacillus subtilis. J Bacteriol 192:2128–2139. 10.1128/JB.01673-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Dorman MJ, Kane L, Domman D, Turnbull JD, Cormie C, Fazal MA, Goulding DA, Russell JE, Alexander S, Thomson NR. 2019. The history, genome and biology of NCTC 30: a non-pandemic Vibrio cholerae isolate from World War One. Proc Biol Sci 286:20182025. 10.1098/rspb.2018.2025. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lee KM, Park Y, Bari W, Yoon MY, Go J, Kim SC, Lee HI, Yoon SS. 2012. Activation of cholera toxin production by anaerobic respiration of trimethylamine N-oxide in Vibrio cholerae. J Biol Chem 287:39742–39752. 10.1074/jbc.M112.394932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Lin W, Fullner KJ, Clayton R, Sexton JA, Rogers MB, Calia KE, Calderwood SB, Fraser C, Mekalanos JJ. 1999. Identification of a Vibrio cholerae RTX toxin gene cluster that is tightly linked to the cholera toxin prophage. Proc Natl Acad Sci U S A 96:1071–1076. 10.1073/pnas.96.3.1071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Dziejman M, Serruto D, Tam VC, Sturtevant D, Diraphat P, Faruque SM, Rahman MH, Heidelberg JF, Decker J, Li L, Montgomery KT, Grills G, Kucherlapati R, Mekalanos JJ. 2005. Genomic characterization of non-O1, non-O139 Vibrio cholerae reveals genes for a type III secretion system. Proc Natl Acad Sci U S A 102:3465–3470. 10.1073/pnas.0409918102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Chun J, Grim CJ, Hasan NA, Lee JH, Choi SY, Haley BJ, Taviani E, Jeon YS, Kim DW, Lee JH, Brettin TS, Bruce DC, Challacombe JF, Detter JC, Han CS, Munk AC, Chertkov O, Meincke L, Saunders E, Walters RA, Huq A, Nair GB, Colwell RR. 2009. Comparative genomics reveals mechanism for short-term and long-term clonal transitions in pandemic Vibrio cholerae. Proc Natl Acad Sci U S A 106:15442–15447. 10.1073/pnas.0907787106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Mutreja A, Kim DW, Thomson N, Connor TR, Hee J, Lebens M, Niyogi SK, Kim EJ, Ramamurthy T, Chun J, Parkhill J, Dougan G. 2013. Evidence for multiple waves of global transmission within the seventh cholera pandemic. Nature 477:462–465. 10.1038/nature10392.Evidence. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Burrus V, Quezada-Calvillo R, Marrero J, Waldor MK. 2006. SXT-related integrating conjugative element in New World Vibrio cholerae. Appl Environ Microbiol 72:3054–3057. 10.1128/AEM.72.4.3054-3057.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Lugo MR, Merrill AR. 2015. The father, son and cholix toxin: the third member of the DT group mono-ADP-ribosyltransferase toxin family. Toxins (Basel) 7:2757–2772. 10.3390/toxins7082757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Labbate M, Orata FD, Petty NK, Jayatilleke ND, King WL, Kirchberger PC, Allen C, Mann G, Mutreja A, Thomson NR, Boucher Y, Charles IG. 2016. A genomic island in Vibrio cholerae with VPI-1 site-specific recombination characteristics contains CRISPR-Cas and type VI secretion modules. Sci Rep 6:36891. 10.1038/srep36891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Westra ER, van Houte S, Gandon S, Whitaker R. 2019. The ecology and evolution of microbial CRISPR-Cas adaptive immune systems. Philos Trans R Soc Lond B Biol Sci 374:20190101. 10.1098/rstb.2019.0101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hussain NA, Kirchberger PC, Case RJ, Boucher YF. 2021. Modular molecular weaponry plays a key role in competition within an environmental Vibrio cholerae population. Front Microbiol 12:671092. 10.3389/fmicb.2021.671092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Unterweger D, Miyata ST, Bachmann V, Brooks TM, Mullins T, Kostiuk B, Provenzano D, Pukatzki S. 2014. The Vibrio cholerae type VI secretion system employs diverse effector modules for intraspecific competition. Nat Commun 5:3549. 10.1038/ncomms4549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kirchberger PC, Unterweger D, Provenzano D, Pukatzki S, Boucher Y. 2017. Sequential displacement of type VI secretion system effector genes leads to evolution of diverse immunity gene arrays in Vibrio cholerae. Sci Rep 7:45133. 10.1038/srep45133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Choi SY, Rashed SM, Hasan NA, Alam M, Islam T, Sadique A, Johura FT, Eppinger M, Ravel J, Huq A, Cravioto A, Colwell RR. 2016. Phylogenetic diversity of Vibrio cholerae associated with endemic cholera in Mexico from 1991 to 2008. mBio 7:e02160. 10.1128/mBio.02160-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Payne SM, Mey AR, Wyckoff EE. 2016. Vibrio iron transport: evolutionary adaptation to life in multiple environments. Microbiol Mol Biol Rev 80:69–90. 10.1128/MMBR.00046-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bina XR, Provenzano D, Nguyen N, Bina JE. 2008. Vibrio cholerae RND family efflux systems are required for antimicrobial resistance, optimal virulence factor production, and colonization of the infant mouse small intestine. Infect Immun 76:3595–3605. 10.1128/IAI.01620-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Trastoy R, Manso T, Fernández-García L, Blasco L, Ambroa A, Pérez Del Molino ML, Bou G, García-Contreras R, Wood TK, Tomás M. 2018. Mechanisms of bacterial tolerance and persistence in the gastrointestinal and respiratory environments. Clin Microbiol Rev 31:e00023-18. 10.1128/CMR.00023-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Routh MD, Zalucki Y, Su CC, Long F, Zhang Q, Shafer WM, Yu EW. 2010. Efflux pumps of the resistance-nodulation-division family: a perspective of their structure, function, and regulation in gram-negative bacteria. Adv Enzymol Relat Areas Mol Biol 77:109–146. 10.1002/9780470920541.ch3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Brayton PR, Bode RB, Colwell RR, MacDonell MT, Hall HL, Grimes DJ, West PA, Bryant TN. 1986. Vibrio cincinnatiensis sp. nov., a new human pathogen. J Clin Microbiol 23:104–108. 10.1128/jcm.23.1.104-108.1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Zheng B, Jiang X, Cheng H, Guo L, Zhang J, Xu H, Yu X, Huang C, Ji J, Ying C, Feng Y, Xiao Y, Li L. 2017. Genome characterization of two bile-isolated Vibrio fluvialis strains: an insight into pathogenicity and bile salt adaption. Sci Rep 7:11827. 10.1038/s41598-017-12304-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Polz MF, Alm EJ, Hanage WP. 2013. Horizontal gene transfer and the evolution of bacterial and archaeal population structure. Trends Genet 29:170–175. 10.1016/j.tig.2012.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Schliep K, Lopez P, Lapointe FJ, Bapteste É. 2011. Harvesting evolutionary signals in a forest of prokaryotic gene trees. Mol Biol Evol 28:1393–1405. 10.1093/molbev/msq323. [DOI] [PubMed] [Google Scholar]
- 52.Levade I, Terrat Y, Leducq J-B, Weil AA, Mayo-Smith LM, Chowdhury F, Khan AI, Boncy J, Buteau J, Ivers LC, Ryan ET, Charles RC, Calderwood SB, Qadri F, Harris JB, LaRocque RC, Shapiro BJ. 2017. Vibrio cholerae genomic diversity within and between patients. Microb Genom 3:e000142. 10.1099/mgen.0.000142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Faruque SM, Sack DA, Sack RB, Colwell RR, Takeda Y, Nair GB. 2003. Emergence and evolution of Vibrio cholerae O139. Proc Natl Acad Sci U S A 100:1304–1309. 10.1073/pnas.0337468100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Yamasaki S, Shimizu T, Hoshino K, Ho S-T, Shimada T, Nair GB, Takeda Y. 1999. The genes responsible for O-antigen synthesis of Vibrio cholerae O139 are closely related to those of Vibrio cholerae O22. Gene 237:321–332. 10.1016/s0378-1119(99)00344-3. [DOI] [PubMed] [Google Scholar]
- 55.Huang Z, Yu K, Fang Y, Dai H, Cai H, Li Z, Kan B, Wei Q, Wang D. 2020. Comparative genomics and transcriptomics analyses reveal a unique environmental adaptability of Vibrio fujianensis. Microorganisms 8:555. 10.3390/microorganisms8040555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wright JJ, Lee S, Zaikova E, Walsh DA, Hallam SJ. 2009. DNA extraction from 0.22 μM Sterivex filters and cesium chloride density gradient centrifugation. J Vis Exp 31:1352. 10.3791/1352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Alam M, Hasan NA, Sadique A, Bhuiyan NA, Ahmed KU, Nusrin S, Nair GB, Siddique AK, Sack RB, Sack DA, Huq A, Colwell RR. 2006. Seasonal cholera caused by Vibrio cholerae serogroups O1 and O139 in the coastal aquatic environment of Bangladesh. Appl Environ Microbiol 72:4096–4104. 10.1128/AEM.00066-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Bochner BR, Gadzinski P, Panomitros E. 2001. Phenotype microarrays for high-throughput phenotypic testing and assay of gene function. Genome Res 11:1246–1255. 10.1101/gr.186501. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Kozich JJ, Westcott SL, Baxter NT, Highlander SK, Schloss PD. 2013. Development of a dual-index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Appl Environ Microbiol 79:5112–5120. 10.1128/AEM.01043-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.R Development Core Team. 2011. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. [Google Scholar]
- 61.Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. 2016. DADA2: high-resolution sample inference from Illumina amplicon data. Nat Methods 13:581–583. 10.1038/nmeth.3869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, Lesniewski RA, Oakley BB, Parks DH, Robinson CJ, Sahl JW, Stres B, Thallinger GG, Van Horn DJ, Weber CF. 2009. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol 75:7537–7541. 10.1128/AEM.01541-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Oksanen J, Kindt R, Legendre P, O’Hara B, Simpson GL, Solymos PM, Stevens MHH, Wagner H. 2008. The vegan package: community ecology package. http://vegan.r-forge.r-project.org/.
- 64.Angiuoli SV, Salzberg SL. 2011. Mugsy: fast multiple alignment of closely related whole genomes. Bioinformatics 27:334–342. 10.1093/bioinformatics/btq665. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313. 10.1093/bioinformatics/btu033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Aziz RK, Bartels D, Best A, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST server: rapid annotations using subsystems technology. BMC Genomics 9:75. 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. 2012. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics 28:1647–1649. 10.1093/bioinformatics/bts199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Chaudhari NM, Gupta VK, Dutta C. 2016. BPGA-an ultra-fast pan-genome analysis pipeline. Sci Rep 6:24373. 10.1038/srep24373. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Treangen TJ, Ondov BD, Koren S, Phillippy AM. 2014. The harvest suite for rapid core-genome alignment and visualization of thousands of intraspecific microbial genomes. Genome Biol 15:524. 10.1186/s13059-014-0524-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All previously sequenced bacterial genomes and genomes sequenced in this study are available from the NCBI GenBank and the PubMLST databases. Table 1 and Table S5 in the supplemental material list all the accession numbers for the genomes used in this study. In addition, all genome sequences and relevant genomic and epidemiological data related to the isolates used are publicly available on PubMLST (https://pubmlst.org/vcholerae/).