Skip to main content
Journal of Clinical Microbiology logoLink to Journal of Clinical Microbiology
. 2006 Dec 20;45(3):736–746. doi: 10.1128/JCM.01895-06

Vibrio cholerae Strain Typing and Phylogeny Study Based on Simple Sequence Repeats

Yael Danin-Poleg 1, Lyora A Cohen 1, Hanan Gancz 1,, Yoav Y Broza 1, Hanoh Goldshmidt 1, Elinor Malul 1, Lea Valinsky 2, Larisa Lerner 2, Meir Broza 3, Yechezkel Kashi 1,*
PMCID: PMC1829105  PMID: 17182751

Abstract

Vibrio cholerae is the etiological agent of cholera. Its natural reservoir is the aquatic environment. To date, practical typing of V. cholerae is mainly serological and requires about 200 antisera. Simple sequence repeats (SSR), also termed VNTR (for variable number of tandem repeats), provide a source of high genomic polymorphism used in bacterial typing. Here we describe an SSR-based typing method that combines the variation in highly mutable SSR loci, with that of shorter, relatively more stable mononucleotide repeat (MNR) loci, for accurate and rapid typing of V. cholerae. In silico screening of the V. cholerae genome revealed thousands of perfect SSR tracts with an average frequency of one SSR every 152 bp. A panel of 32 V. cholerae strains, representing both clinical and environmental isolates, was tested for polymorphism in SSR loci. Two strategies were applied to identify SSR variation: polymorphism of SSR tracts longer than 12 bp (L-SSR) assessed by capillary fragment-size analysis and MNR polymorphism assessed by sequencing. The nine L-SSR loci tested were all polymorphic, displaying 2 to 13 alleles per locus. Sequence analysis of eight MNR-containing loci (MNR-multilocus sequence typing [MLST]) provided information on both variations in the MNR tract itself, and single nucleotide polymorphism (SNP) in their flanking sequences. Phylogenetic analysis of the combined SSR data showed a clear discrimination between the clinical strains belonging to O1 and O139 serogroups, and the environmental isolates. Furthermore, discrimination between 27 strains of the 32 strains was achieved. SSR-based typing methods combining L-SSR and MNR-MLST were found to be efficient for V. cholerae typing.


Vibrio cholerae, a gram-negative bacterium, is the causal agent of the severe diarrheal disease cholera. Its natural reservoir is the aquatic environment (49, 84). Cholera pandemics are caused by specific serogroups of V. cholerae that are pathogenic only to humans (35, 84). Since 1817, V. cholerae has caused a number of pandemics. The first seven were presumed to be caused by O1 serogroup of V. cholerae (68). In October 1992, a new serogroup, defined O139, caused a severe outbreak of cholera in southeast India. Within 10 months, the O139 serogroup was disseminated all over the Indian subcontinent and soon thereafter spread to 11 neighboring countries, temporarily displacing the O1 serogroups (64). Since then both serogroups coexist and are responsible for large outbreaks. Genomic studies indicated that O139 epidemic strain arose by horizontal acquisition of unique DNA (15, 47).

Traditionally, V. cholerae classification is serological and requires about 200 antisera based on the somatic O antigen (72). Isolates of V. cholerae are divided into three major subgroups: O1, O139, and non-O1/non-O139, of which only the O1 and O139 serogroups are associated with cholera pandemics and epidemics. Non-O1, non-O139 serogroups are recognized as causative agents of sporadic and localized outbreaks (73). Pathogenic V. cholerae isolates carry virulence genes, such as the toxins genes ctxAB (25, 65, 68). The environmental V. cholerae strains from non-O1, non-O139 serogroups are a possible natural reservoir of potentially new emerging epidemic strains (67, 73). This assumption is supported by the finding that some of these environmental strains harbor virulence genes (23) and thus are likely to evolve into novel pathogenic strains by horizontal gene transfer (24, 25). The emergence of new pathogenic V. cholerae strains requires not only an efficient, rapid, and accurate identification tool but also a means for determining genetic relationships among environmental and clinical isolates.

Genome-based bacterial identification and typing is essential for several disciplines, including taxonomy, epidemiology, determining phylogenetic relationships, and the study of evolutionary mechanisms. It allows distinguishing among strains within a species to monitor epidemics and routes of contamination. Recent advances in biotechnology have resulted in the development of numerous methods for microorganism typing that differ in their sensitivity, rapidity, complexity, discriminatory power, reproducibility, labor-intensity, and cost (7, 60, 82, 85). Several DNA-based methods were used for typing of V. cholerae, such as tests for virulence genes, ribotyping, pulsed-field gel electrophoresis (PFGE), amplified fragment length polymorphism, enterobacterial repetitive intergenic consensus sequence-PCR, random amplification of polymorphic DNA, and multilocus sequence typing (MLST) (2, 17, 34, 44, 46, 67, 73, 87).

Simple sequence repeats (SSR), also termed variable number of tandem repeats (VNTR), are a class of short DNA sequence motifs that are tandemly repeated at a specific locus. The variation in SSR tracts, generally having several alleles, enables their use for strain typing in several bacterial species (30, 38, 41, 59, 80). SSR composed of mononucleotide repeats (MNR) in Escherichia coli were found to be abundant and highly polymorphic but stable at the strain level, making them a valuable tool for bacterial typing and phylogenetics studies (20, 30). Larger SSRs (L-SSR) consisting of two to 9-bp motifs were successfully used to analyze variation among strains by size determination of the PCR amplicon (39, 81). Thus, in the last few years multiple-locus SSR, also termed multiple-locus VNTR analysis (MLVA), has been increasingly recognized as the marker of choice for genotyping a number of pathogens such as Bacillus anthracis (39), Borrelia species (22), Salmonella enterica (48), and Enterococcus faecium (78).

MLST utilizing differences mainly in housekeeping genes is a rising DNA sequence-based method for bacterial strain typing (14, 44, 51). DNA sequencing uncovers far more variation per locus than any other method currently used for bacterial strain typing, and it provides a uniform platform for strain comparison analyzed in different laboratories and for database storage. By their basic nature, however, housekeeping genes diversify slowly and exhibit only limited sequence variation. Hence, it has been suggested to apply MLST analysis to additional highly polymorphic genomic regions, such as virulence genes, stress response genes, and intergenic regions (11, 16, 44, 61). Recently, MLST of loci harboring MNR at noncoding regions were shown in E. coli to contain high sequence variation, including single-nucleotide polymorphisms (SNPs) in the flanking area of the MNR (20). This approach was termed MNR-MLST. In the present study, the short highly informative MNR sequences proved to be more efficient than MLST of housekeeping genes for strain discrimination and for determining phylogenetic relationships in E. coli.

The present study presents an SSR typing method applied to V. cholerae. The two chromosomes of V. cholerae genome (32) were screened for the presence of SSR tracts. The selected SSR sites were used for the development of an SSR-based method that combines both analysis of length variation at large SSR (L-SSR) loci and MLST at MNR loci (MNR-MLST). This method combines the variation seen in highly mutable SSR loci with that of shorter, relatively more stable MNR loci for the accurate and rapid typing of V. cholerae isolates.

MATERIALS AND METHODS

Bacterial strains and growth conditions.

The present study includes 32 strains of V. cholerae (Table 1) : 16 isolates belonging to the O1 and O139 serogroups and 16 belonging to non-O1/non-O139, environmental, isolates. The environmental strains (Table 1) were isolated from Chironomidae egg masses or flying adults (9, 31), followed by serotyping (E. Arakawa, National Institute of Infectious Diseases, Tokyo, Japan) (31). All isolates were grown on Luria agar plates for 24 h at 37°C and were serologically tested for the agglutination of O1 or O139 polyclonal antibodies.

TABLE 1.

V. cholerae strains screened in the present study

Strain Serotype and identification Presence (+) or absence (−) of ctxA Source of isolatea Strain abbreviation
1 O1 Ogawa + Clinical, IMH* O1 Og-1 (+)
3 O139 + Clinical, Ramamurthy, Kolkata, India O139-3 (+)
5 O9 Env, WSP, Shafdan, Israel O9-5 (−)
6 O1 + Clinical, IMH* O1 −6 (+)
7 O1 Inaba 391-93 + Arakawa E O1 In-7 (+)
8 O1 Inaba 390-93 Arakawa E O1 In-8 (−)
9 O139 58-99 + Arakawa E O139-9 (+)
10 O139 470-00 Arakawa E O139-10 (−)
11 O141 475-00 + Arakawa E O141-11 (+)
18 O141 849-95 Arakawa E O141-18 (−)
20 O140 ZC33 Env, Zanzibar O140-20 (−)
21 O140 Z56B Env, Zanzibar O140-21 (−)
25 O2 K183 Env, Kishon River, Israel O2-25 (−)
27 O2 EB116 Env, Elroi Spring, Israel O2-27 (−)
28 O79 B145 Env, WSP, Tivon, Israel O79-28 (−)
29 O79 T145 Env, WSP, Tivon, Israel O79-29 (−)
30 O2 K2 Env, Lake Victoria, Kenya O2-30 (−)
32 O2 B27 Env, WSP, Shafdan, Israel O2-32 (−)
108 O1 Ogawa El-Tor + Clinical, Gaza 1994, Palestinian Aut* O1 Og ET-108 (+)
109 O1 Ogawa El-Tor 4 IMH* O1 Og ET-109 (−)
110 O22 Japan* O22-110 (−)
118 O1 Ogawa El-Tor 185 + Clinical, IMH* O1 Og ET-118 (+)
119 O1 Ogawa El-Tor 20 Clinical, Ruwanda, 1994* O1 Og ET-119 (−)
120 O1 Ogawa 41 + Clinical, Maryland, 1971* O1 Og-120 (+)
121 O1 Inaba 384157 + Clinical, Israel* O1 In-121 (+)
122 O1 Inaba El-Tor 155314 + Clinical, Gaza, 1972, Palestinian Aut* O1 In ET-122 (+)
123 O1 Inaba 35-A-3 + Clinical, Maryland* O1 In-123 (+)
124 O1 Inaba -6-11-61 + Clinical, IMH* O1 In-124 (+)
125 O10 ZC22 Env, Zanzibar O10-125 (−)
126 O10 ZC22B Env, Zanzibar O10-126 (−)
128 O49 ZC32 Env, Zanzibar O49-128 (−)
129 O37 ZC5Ab Env, Zanzibar O37-129 (−)
a

*, collection of The Israeli Ministry of Health, Central Laboratory, Jerusalem, Israel (IMH). Env (environmental), isolated from Chironomides egg mass and flying adults (8, 9, 31) (isolated from adults: O2 K2, O2 B27, and O79 B145). WSP, waste stabilization pond. E. Arakawa, Department of Bacteriology, National Institute of Infectious Diseases, Tokyo, Japan.

b

Presence of NAG-ST gene. This gene was tested only for the environmental isolates and the Arakawa isolates but was present only in the marked isolate (31).

Stability test of L-SSR loci.

Two clinical V. cholerae strains, O1 Og-1 and O1 Og ET-109, were grown and transferred daily for the four following days. From each of the strains, two single colonies were picked daily from an overnight grown plate and streaked onto fresh LB plate and grown at 37°C. Crude and fast DNA extraction was conducted from the 60 resulting plates (see below).

DNA preparation.

High-quality DNA extraction of pure cultures was used throughout the study (30, 76). Briefly, a loop full of V. cholerae colonies was washed with SSC buffer (0.15 M NaCl, 15 mM sodium citrate [pH 8.0]). The suspended cells were incubated overnight at 50°C with proteinase K solution (20 mg/ml in Tris-EDTA [TE]). Sodium dodecyl sulfate was added to a final concentration of 1%, followed by incubation at 55°C for 1 h, followed in turn by phenol-chloroform extraction and ethanol precipitation. The DNA was stored at −20°C.

For crude and fast DNA extraction, a loopful of cells was suspended in 0.5 ml of TE buffer, heated for 10 min at 80°C, and centrifuged for 10 min at maximum speed. The pellet was resuspended in 100 μl of TE buffer and heated for 5 min at 100°C. The supernatant was stored at −20°C for a few days.

Genome evaluation of SSR distribution.

The complete genomic sequence of V. cholerae (El-Tor N16961 [32]) was obtained from the National Center for Biotechnology Information database (http://www.ncbi.nlm.nih.gov/entrez/viewer.fcgi). The two chromosomes VC (chromosome 1, 2.9 Mb) and VCA (chromosome 2, 1.1 Mb) were screened for perfect SSR with minimal number of repeats greater than three, using the “SSR” computer program (30; http://www.technion.ac.il/∼anne/choice3.html).

Loci and primer selection.

The longest 17 SSR loci identified by the SSR computer program, in regions without similarity to phage or prophage sequences (from both chromosomes) were selected for the present study. These 17 sequences include nine L-SSR loci with core motifs of 3 to 9 bp and eight sites with MNR. Unique primers were designed to generate PCR products of 100 to 250 bp for the L-SSR loci and 200 to 350 bp for the MNR loci using the computer software Gene Runner (version 3.05; Hastings Software, Inc.). The locus name includes the chromosomal designation, the downstream gene-assigned number, and the repeat motif and length (see Table SA1 in the supplemental material). Each locus was tested for uniqueness throughout the V. cholerae genome by using NCBI BLAST against microbial genome (http://www.ncbi.nlm.nih.gov/sutils/genom_table.cgi).

PCR.

PCR mixtures contained 0.2 mM deoxynucleoside triphosphates, 10 μM concentrations of each primer; 0.25 U of Taq polymerase (SuperNova; JMR Holding, Kent, England), 1× buffer (1.5 mM MgCl2), and 50 ng of template DNA in a total volume of 25 μl. The reactions were carried out in a PTC-100 Thermocycler (MJ Research) as follows: 95ºC for 5 min; followed by five cycles of 45 s at 95°C, 45 s at Tm (see Table SA1 in the supplemental material), and 45 s at 72°C; followed by 20 cycles of 45 s at 95°C, 45 s at Tm of −5°C, and 45 s at 72°C; followed by a final 7 min at 72°C. All PCR amplification products were analyzed by agarose gel (1.5%) electrophoresis.

Multiplex PCR assay of L-SSR loci.

Primers of six SSR loci were mixed into two PCR multiplex cocktails. The PCR mixtures were as described above. Primer concentrations for each mix were as follows. Mix 1 included VC0437-(7)7, VC1650-(9)7, and VCA0283-(6)14 at final concentrations of 0.8, 0.6, and 1.4 μM, respectively; mix 2 included VC0147-(6)9, VCA0171-(6)23, and VC1457-(7)4 at final concentrations of 0.4 μM. The annealing temperatures were 60 and 62°C for mix 1 and mix 2, respectively.

Detection of specific genes by PCR.

Amplification of ctxA and ompW (terminal) loci (58) was performed, together with the amplification of bacterial-universal 16S-rRNA subunit (76) in a multiplex PCR assay, using a 65°C annealing temperature. nag (stn/sto) gene amplification was performed as described above, with annealing temperature of 60°C. The primers were NAG-F 5′-TCGCATTTAGCCAAACAGTAGAA-3′ and 194R 5′-GCTGGATTGCAACATATTTCGC-3′ (67), amplifying a predicted 172-bp fragment.

DNA sizing.

Sizing of L-SSR amplification products was performed by 2.5% agarose gel electrophoresis and by capillary electrophoresis on an ABI Prism 310 automated DNA sequencer, using Fluorofor-labeled primers. The amplified samples were loaded on the ABI analyzer, together with 10 μl of formamide and 0.6 μl of tetramethylrhodamine (TMR) MapMaker LOW (lot 030600; BioVentures, Inc.). The results were analyzed with Genescan 3.1 and Genotyper analysis software (Perkin-Elmer).

DNA sequencing.

PCR products were purified using a QIAquick PCR purification kit (QIAGEN). Portions (50 ng) of the purified product were sequenced on both strands using a BigDye terminator cycle sequencing kit (Perkin-Elmer Applied Biosystems) and loaded onto an ABI 310 automated sequencer. Analysis of the results was done by using the sequence analysis software (Perkin-Elmer).

PFGE.

Gel electrophoresis was performed according to the PulseNet standardized PFGE protocol for Escherichia coli subtyping (12) with minor modifications. Briefly, genomic DNA from all isolates were prepared in agarose plugs and digested with the restriction enzyme NotI, separated in a 1% agarose gel in 0.5× Tris-borate-EDTA at 14°C using a CHEF-DRIII apparatus (Bio-Rad). The pulse time ranged from 4 s to 8 s for 9 h and from 8 to 25 s for 11 h both at 6 V/cm. After visualization, the PFGE patterns were analyzed by using the BioNumerics software (Applied Maths). Profiles were compared by using the Dice coefficient, followed by UPGMA (unweighted pair-group method with arithmetic averages) clustering (tolerance, 1.0%).

Data and statistical analysis.

Two methods were used to assess the variation at MNR loci: allele analysis of sequence types and sequence comparison.

(i) Allele analysis.

A nonparametric analysis of allele variations was used for both L-SSR and MNR loci. Alleles refer to L-SSR size alleles, MNR alleles, or sequence types (ST) that include MNR polymorphism and SNPs. An additional allele (null allele) was counted where there was no amplification product (see Table 4). Two polymorphic MNR tracts were found at the VC0929-(G)8 and VCA0107-(T)8 loci. Thus, MNR microhaplotypes were considered as alleles (see Table 4). In addition, the VC1457-(7)4 locus was excluded form all analyses, since it was found to be duplicated in O1-124 (see Table SA2 in the supplemental material).

TABLE 4.

Genetic distances among V. cholerae strains calculated base on non-parametric analyses of L-SSR, MNR-MLST data

Strain group comparison Avg genetic distance ± SD (minimum, maximum)
L-SSR MNR-MLST Combined L-SSR + MNR
Among
    Clinical strainsa 0.59 ± 0.173 (0.125, 0.875) 0.073 ± 0.105 (0.00, 0.25) 0.332 ± 0.12 (0.063, 0.563)
    Environmental strainsb 0.396 ± 0.125 (0.00, 0.50) 0.632 ± 0.20 (0.00, 1.00) 0.524 ± 0.148 (0.00, 0.75)
Between
    Clinical and O37-129 0.766 ± 0.232 0.445 ± 0.129 0.555 ± 0.068
    Environmental and O37-129 0.397 ± 0.063 0.713 ± 0.107 0.605 ± 0.179
a

Strains of O1 and O139 ctxA (+) serogroups.

b

Strains of non-O1/non-O139 and O139 ctxA (−) serogroups.

The data for all genotypes were scored as present (“1”) or absent (“0”) for each of the alleles at a specific locus. Diversity index was calculated as 1 − ΣP2ij, where Pij is the frequency of the jth allele at the ith locus.

Genetic relations among strains were inferred from the L-SSR, MNR, and MNR-MLST data. SAS 8.02 software was used to calculate the Nei coefficient of association and to generate the corresponding matrix (SAS system for Windows, version 8.02; SAS Institute, Inc., Cary, NC). This matrix was used to create dendrograms based on the UPGMA method using MEGA 2.1 software (45).

(ii) Sequence comparison.

Two phylogenetic analyses were performed for the sequence data: (i) combined sequences of 29 strains at all eight MNR polymorphic loci and (ii) combined sequences of all 32 stains at seven loci. These analyses were carried out since no products were amplified for three strains (see Table SA3 in the supplemental material) at the VC0929-(G)8 locus. Multiple alignments of the sequences were performed by using Sequence Navigator (version 1.0.1; ABI) or CLUSTALX (33). The alignment files, in PIR format, were exported to the Gapcoder software (86) for analysis of the indel. The output files (NEXUS format), including the indel variations coded as presence or absence, were transformed to FASTA format via SEAVIEW software (28), following transformation of the presence or absence to A/C, respectively (0 = A, 1 = C). These files were converted to MEGA format and used to evaluate genetic relationships by the UPGMA method (MEGA 2.1) (45). Gaps were treated as missing data using the “pairwise deletion” option. Bootstrap confidence values were based on 1,000 simulated dendrograms.

Nucleotide accession numbers.

The GenBank accession numbers for the nucleotide sequence of the 32 strains at eight MNR loci determined in the present study were as follows: EF176922 to EF176953 for VC0332(A)9, EF176954 to EF176982 for VC0929(G)8, EF176983 to EF177014 for VC1132(A)9, EF177015 to EF177046 VC1490(A)9, EF177047 to EF177078 for VC1833(T)9, EF177079 to EF177110 for VCA0107(T)8, EF177111 to EF177142 for VCA0196(A)8, and EF177143 to EF177174 for VCA1063(T)8.

RESULTS

SSR distribution in V. cholerae genome.

A genome-wide screen of V. cholerae revealed thousands of perfect SSR tracts (Table 2). The scan was for MNR and for large SSR (L-SSR). MNR were defined as sequence repeats larger than 5 bp. L-SSR were defined as sequence repeats with a motif larger than 2 bp, having a repeat number larger than three. The majority of the SSR (68%) were found to be MNR. Most of the MNR tracts were short and did not exceed 11 bp. There were many L-SSR tracts (82) with SSR tracts longer than 12 bp (Table 2; also see Fig. SA1 in the supplemental material) that could serve as potential informative markers. All SSRs were evenly distributed and highly abundant showing an average frequency of one SSR every 152 bp in V. cholerae genome. On average, one L-SSR tract appears every 473 bp, and one MNR tract appears every 224 bp. The distribution of L-SSR with repeats of >3 bp among open reading frames (ORFs) (85.15%) and noncoding regions (13.4%) approximated the overall proportion of these regions in the genome (85.9 and 14.1%, respectively). In contrast, MNR of >5 bp are under-represented in ORFs (77.8%) and more abundant in noncoding regions (22.1%).

TABLE 2.

Number of SSR loci along the two chromosomes of V. cholerae (El-Tor N16961)

Chromosome and no. of repeats SSR core motif (bp)a
1 2 3 4 5 6 7 8 9
Chromosome 1 (VC)
    3 4,758 1,098 21 2 1 1 1 0
    4 331 36 0 0 0 2 (2) 0 0
    5 9,490 20 2 (1) 0 0 0 0 0 0
    6 2,852 1 0 0 0 0 0 0 0
    7 629 0 0 0 0 0 1 (1) 0 1 (1)
    8 112 (1) 0 0 0 0 0 0 0 0
    9 20 (4) 0 0 0 0 1 (1) 0 0 0
    10 2 0 0 0 0 0 0 0 0
    11 1 0 0 0 0 0 0 0 0
Chromosome 2 (VCA)
    3 1,711 409 5 0 0
    4 111 5 (1) 0 0 0
    5 3,524 8 0 0 0 0
    6 1,073 0 0 0 0 0
    7 233 0 0 0 0 0
    8 31 (3) 0 0 0 0 0
    9 1 0 0 0 0 0
    10 0 0 0 0 0 0
    11 1 0 0 0 0 0
    14 0 0 0 0 0 1 (1)
    23 0 0 0 0 0 1 (1)
a

Select loci for the study (total of 17) are given in parentheses.

Comparison of the two V. cholerae chromosomes revealed a similar SSR frequency: one SSR for every 150 bp in chromosome 1 and one SSR for every 155 bp in chromosome 2 (Table 2). However, the average appearance of L-SSR tracts with a minimal size of 12 bp is twofold higher in chromosome 1 than in chromosome 2 (24.1 and 10.9 per Mb, respectively). Similarly, the average appearance of an MNR of ≥8 is 1.5 higher in chromosome 1 (46.6 and 30 per Mb, respectively). In addition, the motif size of L-SSR is limited to 9 bp in chromosome 1 versus 6 bp in chromosome 2. However, L-SSR with the highest repeat numbers are found in chromosome 2.

The longest 17 SSR loci were chosen for further analysis. Nine loci had core motifs ranging from three to nine nucleotides (L-SSR), and eight loci had a mononucleotide core repeat (Tables 2 and 3). The L-SSR loci were selected from both coding and noncoding regions, whereas the MNR were selected from noncoding regions only.

TABLE 3.

Number of alleles and diversity index values at 17 SSR loci of 32 clinical and environmental V. cholerae isolates

Locus Core motif and no. of repeats in O1 El-Tora Repeat positionb (gene name or product) Repeat variation
Sequence variation (MNR-MLST)
No. of alleles DIc No. of ST DI Sequence length (bp)
L-SSR
    VC0147-(6)9 (AACAGA)9 Coding (cell division protein FtsY) 10 0.85
    VC0437-(7)7 (GACCCTA)7 Noncoding (GTP1/Obg family protein) 7 0.78
    VC0500-(7)4 (TTGTCGA)4 Coding (transposase OrfAB, subunit A, authentic frameshift) 2d 0.45
    VC1418-(3)5 (TGA)5 Coding (hypothetical protein) 2d 0.50
    VC1457-(7)4 (AAATCAA)4 Noncoding (cholera enterotoxin, A subunit) 5d 0.58
    VC1650-(9)7 (GATAATCCA)7 Coding (collagenase) 8 0.84
    VCA0171-(6)23 (TGCTGT)23 Coding (hypothetical protein) 13 0.91
    VCA0283-(6)14 (ACCAGA)14 Coding (hypothetical protein) 10d 0.68
    VCA1082-(3)4 (AAC)4 Coding (hypothetical protein) 2d 0.06
MNR
    VC0332-(A)9 (A)9, (G)5 Noncoding (uroporphyrinogen decarboxylase) 3 0.53 4 0.56 145
    VC0929-(G)8 (A)6, (T)5, (G)8 Noncoding (hypothetical protein)e 8d 0.77 11d 0.79 155
    VC1132-(A)9 (T)7, (A)7, (A)9 Noncoding (ATP phosphoribosyltransferase) 4 0.28 9 0.71 160
    VC1490-(A)9 (A)9 Noncoding (hypothetical protein) 1 4 0.53 178
    VC1833-(T)9f (T)9 Noncoding (quinolinate synthetase)e 4 0.54 9 0.59 193
    VCA0107-(T)8 (T)6, (T)8, (T)4 (G)4 Noncoding (hypothetical protein) 3 0.51 5 0.57 138
    VCA0196-(A)8 (A)8, (A)8, (T)7 Noncoding (hypothetical protein) 2 0.06 4 0.37 270
    VCA1063-(T)8 (T)5, (T)8, (T)5 Noncoding (ornithine decarboxylase, inducible) 3 0.56 6 0.68 116
a

Polymorphic MNR motifs are marked in boldface.

b

For repeats located in noncoding regions the downstream gene (ORF) is denoted.

c

DI, diversity index.

d

Number includes one null allele.

e

MNR located 3′ to the ORF.

f

This locus was previously published by our group (18).

Polymorphism.

High levels of polymorphism were found among the 32 V. cholerae isolates in the 17 tested SSR loci (Table 3). The strain panel included members of different serogroups from both clinical and environmental sources (Table 1), primarily tested for the presence of ctxA, ompW, and nag genes (Table 1). Two strategies were used for the identification of SSR polymorphism: sizing for the nine L-SSR loci with core motifs of 3 to 9 and sequencing of the eight MNR loci.

Size variation at L-SSR loci.

All nine L-SSR loci were polymorphic among the tested V. cholerae strains, with the number of alleles ranging from 2 to 13. The most polymorphic loci were those of 6-bp motifs with 10 alleles or more (Table 3). Rapid and accurate sizing was achieved by using both agarose gel and fluorescence-labeled capillary electrophoresis. Size variation of PCR amplification products differed according to the repeated motif size (see Table SA2 in the supplemental material), with the number of repeats ranging from 2 to 25. No correlation was found between the number of repeats and serological groups. Five of the loci had no amplification product (null allele) that was referred to as a single allele. Diversity indexes were high, ranging from 0.45 to 0.91, with one exception of 0.06 at VCA1082-(3)4. At the VCA1082-(3)4 locus, all strains had the same allele, except the O1 Og ET-109-negative [ET-109(−)] strain, which presented a null allele. All nine L-SSR markers have consistent amplification products in at least three separate amplifications. Two products were amplified consistently in V. cholerae O1 In-124 (+) strain at the VC1457-(7)4 L-SSR locus (see Table SA2 in the supplemental material). Efficient L-SSR genotyping was shown by multiplex PCRs, followed by capillary sizing. Two cocktails of three markers were chosen. Mix 1 consisted of VC0437-(7)7, VC1650-(9)7, and VCA0283-(6)14 primers labeled with Hex, Fam, and Hex, respectively, and mix 2 consisted of VC0147-(6)9, VCA0171-(6)23, and VC1457-(7)4 primers labeled with Fam, Hex, and Fam, respectively. The same alleles were observed in both multiplex and separate amplifications, supporting the use of the multiplex assay for high-throughput typing.

The consistency of L-SSR markers was verified in a stability experiment of 4-day colony transfers (“generations”) of two representative V. cholerae strains, O1 In-1 (+) and O1 Og ET-109 (−), each tested for the stability of two loci, VCA0283-(6)14 and VC0147-(6)9. In order to examine the stability of these markers each day, two single colonies were picked from an overnight grown plate and streaked onto a fresh LB plate. Crude DNA extraction proved to be sufficient for the assessment of variants from the 60 colonies. All colonies from a specific strain showed amplification of the expected alleles [23 and 24 repeats at VCA0283-(6)14 and 10 repeats at VC0147-(6)9 for O1 In-1 (+) and O1 Og ET-109 (−), respectively (see Table SA2 in the supplemental material)], supporting the appropriate consistency of L-SSR loci.

Sequence variation at MNR loci.

Sequence analysis of MNR harboring loci provided information for both MNR tract variations and SNPs at their flanking sequences (Fig. 1). Most of the loci contained few MNR tracts (with numbers of repeats greater than five), at least one of which was polymorphic (Table 3). MNR tracts showed high polymorphism in seven of the loci with up to eight alleles (including no amplification product as one of the alleles; see Table 4 and Table SA3 in the supplemental material), despite their short length. These variations were mostly due to insertion and/or deletions in the MNR sequence itself. The corresponding diversity indexes ranged from 0.06 to 0.77. Additional SNPs were found at the flanking sequences of the eight polymorphic MNR loci. These multiple sequence variations that combine both MNR and SNPs presented the highest levels of polymorphism with 4 to 11 ST for a locus (Table 3). Diversity indexes were high as well, ranging from 0.37 to 0.79. These findings of highly variable sequences at noncoding MNR loci regions indicated the potential of their use for V. cholerae typing.

FIG. 1.

FIG. 1.

Sequence alignment of representatives isolates of V. cholerae at the VC0929-(G)8 MNR locus presenting the 10 ST. No product was amplified in O140-20, O140-21, and O49-128 strains. MNR alleles are labeled with different underlines. SNPs at the flanking sequences of the MNR are represented in boldface. -, missing nucleotide; *, nonconserved basepair.

Phylogenetic analyses.

Genetic relations among 32 V. cholerae strains were inferred from three types of analysis in order to compare results for loci with different mutation rates.

(i) L-SSR analysis.

Phylogenetic analysis of the fragment size variation data at eight L-SSR loci (see Fig. SA2 in the supplemental material) enabled the partitioning of the 32 V. cholerae isolates to 27 different SSR types. We could discriminate among all O1 and O139 ctxA (+) isolates, such that each isolate presented a unique SSR type, whereas the 17 non-O1/non-O139 and O139 ctxA (−) isolates were divided into 12 different SSR types.

(ii) MNR-MLST.

Two strategies were used for the phylogenetic analysis of sequence variations (MNR polymorphism and SNP) at the MNR loci: (i) direct analysis of the aligned sequences-sequence comparison (Fig. 2a; also see Fig. SA3 in the supplemental material) and (ii) nonparametric ST (NST) analysis (see Fig. SA4 in the supplemental material).

FIG. 2.

FIG. 2.

FIG. 2.

Phylogenetic relationships and variation among V. cholerae isolates constructed by UPGMA cluster analysis based on three methods. (a and b) Sequence data of 29 isolates at eight MNR-MLST loci (a) and combined data of eight L-SSR loci and eight MNR-MLST loci (analyzed as sequence types) for 32 isolates (b). The combined analysis was based on 106 polymorphic points (16 loci times the number of alleles in each locus). (c) Dendrogram based on PFGE patterns of 29 isolates digested with NotI.

For sequence comparison, 29 of the 32 strains were typeable with eight MNR loci (1,355 bp, Fig. 2a), since three strains (O140-20, O140-21, and O49-128) gave no product at the VC0929-(G)8 locus (see Table SA3 in the supplemental material). Information from those eight loci enabled the partitioning of the 15 O1 and O139 ctxA (+) isolates to three ST and the 14 environmental and 0139 ctxA (−) strains to 10 ST (Fig. 2a). Alternatively, all strains were typeable by using only seven MNR loci (1,200 bp; see Fig. SA3 in the supplemental material).

In order to include all sequence variation information, an NST analysis, including the use of nonamplified product as one of the alleles, was carried out. The NST-based dendrogram showed similar results to the sequence comparison dendrogram but included all 32 strains (Fig. 2a; also see Fig. SA4 in the supplemental material). The NST analysis enabled the separation of the 15 O1 and O139 ctxA (+) isolates to three sequence types, and allows us to discriminate among 11 of the 17 environmental non-O1/O139 and O139 ctxA (−) strains.

(iii) Combined nonparametric analysis.

The nonparametric analysis enabled the inclusion of all source of variation data in one comprehensive analysis. Therefore, the data for eight L-SSR and eight MNR-MLST loci (NST) were combined and used for the inclusive phylogenetic analysis of all V. cholerae isolates (Fig. 2b). The resulting dendrogram discriminates among all O1 and O139 ctxA (+) isolates and among 12 of the 17 environmental non-O1/O139 and O139 ctxA (−) isolates.

In general, all analyses showed clear separation of the clinical strains from the environmental strains without relation to the isolation source or location (Fig. 2; also see Fig. SA2 to SA4 in the supplemental material). All O1 isolates ctxA (+ and −) and O139 ctxA (+) isolates were grouped together. The O139 ctxA (−) strain clustered with strains from the non-O1/non-O139 serogroups. However, the diversity, estimated by the genetic distance, among strains of the clinical or among the environmental groups was different as a result of the various analyses (Table 4). The MNR-MLST analysis showed low diversity among O1 and O139 ctxA (+) isolates compared to the diversity found among environmental isolates. In addition, the O37-129 isolate, which carries virulent potential, clustered closer to the clinical O1 and O139 ctxA (+) strains (Table 4 and Fig. 2a; see also the Discussion). The L-SSR, which is known to have higher mutation rate, showed high genetic distances across clinical O1 and O139 ctxA (+) isolates compared to the low diversity found by the MNR-MLST analysis. Interestingly, the L-SSR analysis showed higher genetic distances among the closely related clinical isolates than that found across environmental strains and O139 ctxA (−) (Table 4). The combination of both L-SSR and MNR-MLST variations showed higher genetic distances within the environmental strains. In addition, the combined analysis clearly separated the clinical and environmental groups (mainly by the MNR loci), while discriminating the closely related strains belonging to the O1 groups (by the L-SSR loci), thus yielding better prediction of the genetic relations among the tested strains (Fig. 2b).

PFGE.

PFGE analysis was used as a reference method to assess the genetic interrelationship among the 32 V. cholerae strains (Fig. 2c). Chromosomal DNAs from all isolates were digested in agarose plugs with the restriction enzyme NotI, resulting in 17 to 24 bands. A total of 31 V. cholerae strains were partitioned to into 28 PFGE types and 1 strain, O49-128, could not be typed by the PFGE methodology used here and constantly appeared as a smear on the gel. Two pairs of isolates O140-20-O140-21 and O79-28-O79-29 revealed the same PFGE pattern (data not shown). Each of the 15 O1 and O139 ctxA (+) isolates gave a unique PFGE type. O139 ctxA (−) clustered together with environmental strains. No clear correlation was found between the PFGE clusters and the serological groups or the major phylogenic groups identified by SSR analysis.

DISCUSSION

An optimal typing method should provide the ability to distinguish among both distantly and closely related strains. This could be achieved by multilocus analysis of polymorphic sites, with diverse mutation rates, found across the whole bacterial genome. The present study presents an SSR-based method that combines the variation seen in highly mutable L-SSR loci with that of shorter, relatively more stable MNR loci for V. cholerae typing and for inferring phylogenetic relations between strains.

SSRs were found to be highly abundant and evenly distributed in the genome sequence of V. cholerae (El-Tor N16961), as in other bacterial species (see, e.g., references 26, 27, 30, 41, 43, 80, and 81). MNR are the most abundant SSR in V. cholerae genome, in agreement with previous reports for E. coli and for Yersinia pestis (30, 43). The frequency of SSR (with repeats >3 and MNR of >5 bp) in the V. cholerae genome is similar to that found in the pathogenic E. coli O157:H7 EDL933 (one SSR every ∼150 bp). However, there are more and longer L-SSR tracts (longer than 12 bp) in the genome of E. coli O157:H7 (146 SSR tracts compared to 82 in V. cholerae). The hypothesis connecting part of the SSR to pathogenicity is supported by the more frequent appearance (per megabase) of L-SSR and of MNR (>8 bp) in chromosome 1 (Table 2), which carries more genes associated with pathogenicity than does chromosome 2 (32). Polymorphism at SSR tracts could have a functional role affecting both gene regulation and the expressed protein, such that this variation could become available for natural selection and subsequent evolution (36, 37, 42, 57). SSR polymorphism, found in regulatory regions of bacterial species, was found to be associated with variation in gene expression (i.e., on-off switches and levels), providing the dynamic response to environmental changes (4, 52, 56) [see the discussion of the VC1457-(7)4 locus below]. Most of the tested L-SSR are found in coding regions where different alleles could have various biological effects (Table 3) (41). Since most of the tested L-SSR consist of 3-, 6-, or 9-bp motifs, variations do not change the reading frame. In contrast to the L-SSR, MNR are 1.56-fold more abundant at noncoding regions. All tested MNR tracts were found to be located between 90 to 355 bp upstream of an ORF, regions that might harbor regulatory elements.

Polymorphisms at SSR loci were tested in a highly diverse genetic panel, including 32 V. cholerae strains, representing both clinical and environmental isolates. High polymorphism, but stability at the strain level, was found among the V. cholerae isolates at all tested SSR loci with corresponding high diversity indexes. There was a correlation, in general, between the number of alleles and the diversity indexes, mainly across the L-SSR loci (Table 3). High variation was previously found among O139 strains at the VCA0171-(6)23 locus by using polyacrylamide gel electrophoresis (83). In addition, variation of L-SSR tracts increases with the number of repeats, as found in other bacterial species (e.g., references 22 and 41).

The VC1457-(7)4 L-SSR locus is located at the promoter of the ctxA gene. This region originated from prophage islands and is known to exist in two or more copies in pathogenic O1 strains (3, 19, 53, 74). This could explain the presence of two amplification products (alleles) in the V. cholerae O1 In-124(+) strain. However, the consistency of a single amplification product in the other 31 isolates enables the use of this marker in cases where only one product is observed for a specific isolate. Furthermore, three to six repeats of the heptamer sequence at the VC1457-(7)4 locus were observed in the tested strain panel. This polymorphism should have direct effect on virulence since it was shown that the higher number of this tandem repeat is connected to a higher binding affinity of the ToxR, leading to higher toxin expression (55, 62).

MNR loci were found to be polymorphic and highly informative in V. cholerae. Variations of these loci were due to repeat variability in the MNR tract and due to the SNPs in the flanking sequences. Similar results were observed for MNR loci in E. coli (20, 30, 54) or in Mycobacterium avium subsp. paratuberculosis (1). We compared the variation found using MLST analyses of housekeeping or virulence genes to that of MNR loci. Both methods revealed the same number of alleles; however, the same degree of nucleotide variation was found on shorter segments (115 to 270 bp) of the MNR loci compared to a longer segment (560 to 1,100 bp) for the housekeeping, 16S-23S rRNA intergenic spacer regions or virulence genes (10, 13, 29, 44). The original MLST method is based on sequence variation at housekeeping genes that are conserved and therefore usually present in all strains but that have limited variation (21, 59, 69). On the other hand, the MNR loci that present high levels of polymorphism have the disadvantage of missing data from a portion of the strains due to nonamplified (“null”) alleles (20, 75) as a result of sequence mismatch (SNPs) at the priming sites, as well as insertions or deletions and genome rearrangements (32). We have solved the problem of “no data” from part of the tested strains by using a nonparametric sequence type NST analysis. According to the NST analysis, no product was assigned as an additional ST allele. The NST analysis provides 100% typeability of any strain at any selected locus by combining both sequence variations and product amplification results versus the commonly used sequence based methods that consider only sequence variations (51, 79). In addition, NST analysis enables the combination of data from various diagnostics methods, e.g., MNR-MLST, L-SSR and AILP (75), as well as the comparison of results from various typing methods.

The data sets were used for phylogenetic analyses in order to compare the results obtained by the different typing strategies. In general, the analyses showed that all O1 isolates, including ctxA (+) and ctxA (−) isolates, and all O139 isolates [ctxA (+)] were grouped together, indicating, like other studies (73), that O1 and O139 strains have the same clonal origin. The O139 ctxA-negative strain clustered together with strains from the non-O1/non-O139 serogroups by both L-SSR and MNR-MLST methods, as well as by the PFGE method (Fig. 2). Although clustered together with the environmental strains, the O139 ctxA (−) isolate presented a unique allele in one SSR locus [VC0147-(6)9] and unique ST in two MNR loci [VC0929-(G)8 and VC1833-(T)9]. Accordingly, this strain also presented a unique PFGE type, and its serology clearly belonged to the O139 serogroup. These results support previous findings suggesting that serogroup O139 does not evolve from a unique clone (21).

Low diversity was found among pathogenic, clinical [O1and O139 ctxA (+)] isolates by the MNR-MLST. Most of these strains presented the same ST. These results are an outcome of the low mutation rate found at MNR loci, indicating the power of MNR-MLST analysis for long-term phylogeny relation. Such information can also be use for simple differentiation between pathogenic and nonpathogenic isolates. Similarly, the majority of O1 (ctxAB-positive and -negative) and all O139 (ctxAB-positive) strains presented the same PCR-SSCP profile (I) (66).

In contrast to the MNR-MLST results, high variation was found in the present study among pathogenic O1 and O139 ctxA (+) strains by the L-SSR, as well as by the PFGE methods. Each of these strains presented a unique L-SSR type and a unique PFGE type, thus indicating that the L-SSR have higher mutation rates compared to MNR-MLST in V. cholerae. Similar results were found in E. coli, where MNR-MLST could not differentiate among O157 strains (20) opposed to the L-SSRs (VNTRs) that have higher mutational rate and therefore have higher resolution power of closely related O157 E. coli strains (41). Interestingly, in Bacillus anthracis the mutation rate for L-SSR and MNR was found to be the same (40). The PFGE method is frequently used for epidemiological studies on account of its generally high discriminatory power, specifically in V. cholerae (2, 50). Here, the PFGE partitioned the 31 V. cholerae strains from various serological groups into 28 PFGE types versus the L-SSR method that separated all 32 strains into 27 SSR types. The L-SSR discrimination was shown to be as useful as the PFGE analyses in V. cholerae, resembling previous studies in E. coli and in Enterococcus faecalis (41, 77). On the other hand, the L-SSR method has the benefits of correlation to serological groups and typeability of all strains. Thus, the MNR-MLST and L-SSR typing (also termed MLVA) methods should be more suitable for evolutionary studies by their nature, since they parametrically detect multiple sequence variation that accumulates slowly compared to the highly discriminative PFGE method (51). Thus, multilocus SSR typing is a good tool for epidemiological and phylogeny studies of V. cholerae, as demonstrated with other bacterial species (22, 39, 41, 48, 78).

An additional example that the MNR-MLST can better predict a remote phylogenetic relation is demonstrated by the analysis of O37-129 strain (Fig. 2a; also see Fig. SA3 and SA4 in the supplemental material). Previously, a strain belonging to the O37 serogroup was shown to be responsible for the cholera outbreak in the Sudan in 1968 (5, 6). Interestingly, the O37-129 in the present study was isolated in Zanzibar during an unidentified cholera outbreak in February and March 2002. This strain harbors the nag virulence gene (31) but not the ctxA gene. The pandemic potential of cholera strains might be based on genomic features that are not only the acquisition of the TCP and the CT genes alone (23, 63, 68). According to MNR-MLST analysis, the O37-129 environmental strain shows genetic similarity to the pathogenic O1 and O139 ctxA (+) strains; therefore, it has the potential to evolve to the next pandemic strain by acquisition of TCP and CT genes.

In conclusion, SSR-based typing that combines L-SSR and MNR-MLST is a practical tool for differentiation between V. cholerae strains. The genotyping results obtained by this method are simply and easily compared between different labs and could be analyzed by high-throughput methods (71). The method combines genome-wide coverage of loci with different mutation rates, which could serve as an efficient tool for phylogeny studies and for rapid bacterial typing. Furthermore, it could meet the challenge of correctly assessing the risk of newly emerging epidemic variants.

Supplementary Material

[Supplemental material]

Acknowledgments

We thank E. Arakawa, National Institute of Infectious Diseases, Tokyo, Japan; T. Ramamurthy, National Institute of Cholera and Enteric Diseases, Kolkata, India; and the Central Laboratories, Ministry of Health, Jerusalem, Israel, for providing bacterial strains.

This research was supported by the Grand Water Research Institute, Technion; The Israeli Water Commission; Israel Science Foundation grant 1005697; and NATO project CBD.MD.SFP 981456.

Footnotes

Published ahead of print on 20 December 2006.

Supplemental material for this article may be found at http://jcm.asm.org.

REFERENCES

  • 1.Amonsin, A., L. L. Li, Q. Zhang, J. P. Bannantine, A. S. Motiwala, S. Sreevatsan, and V. Kapur. 2004. Multilocus short sequence repeat sequencing approach for differentiating among Mycobacterium avium subsp. paratuberculosis strains. J. Clin. Microbiol. 42:1694-1702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Arakawa, E., T. Murase, S. Matsushita, T. Shimada, S. Yamai, T. Ito, and H. Watanabe. 2000. Pulsed-field gel electrophoresis-based molecular comparison of Vibrio cholerae O1 isolates from domestic and imported cases of cholera in Japan. J. Clin. Microbiol. 38:424-426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Basu, A., A. K. Mukhopadhyay, P. Garg, S. Chakraborty, T. Ramamurthy, S. Yamasaki, Y. Takeda, and G. B. Nair. 2000. Diversity in the arrangement of the CTX prophages in classical strains of Vibrio cholerae O1. FEMS Microbiol. Lett. 182:35-40. [DOI] [PubMed] [Google Scholar]
  • 4.Bayliss, C. D., K. M. Dixon, and E. R. Moxon. 2004. Simple sequence repeats (microsatellites): mutational mechanisms and contributions to bacterial pathogenesis. FEMS Immunol. Med. Microbiol. 40:11-19. [DOI] [PubMed] [Google Scholar]
  • 5.Beltran, P., G. Delgado, A. Navarro, F. Trujillo, R. K. Selander, and A. Cravioto. 1999. Genetic diversity and population structure of Vibrio cholerae. J. Clin. Microbiol. 37:581-590. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Bik, E. M., R. D. Gouw, and F. R. Mooi. 1996. DNA fingerprinting of Vibrio cholerae strains with a novel insertion sequence element: a tool to identify epidemic strains. J. Clin. Microbiol. 34:1453-1461. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Boerlin, P., E. Bannerman, F. Ischer, J. Rocourt, and J. Bille. 1995. Typing Listeria monocytogenes: a comparison of random amplification of polymorphic DNA with five other methods. Res. Microbiol. 146:35-49. [DOI] [PubMed] [Google Scholar]
  • 8.Broza, M., H. Gancz, M. Halpern, and Y. Kashi. 2005. Adult non-biting midges: possible windborne carriers of Vibrio cholerae non-O1 non-O139. Environ. Microbiol. 7:576-585. [DOI] [PubMed] [Google Scholar]
  • 9.Broza, M., and M. Halpern. 2001. Pathogen reservoirs: chironomid egg masses and Vibrio cholerae. Nature 412:40. [DOI] [PubMed] [Google Scholar]
  • 10.Byun, R., L. D. Elbourne, R. Lan, and P. R. Reeves. 1999. Evolutionary relationships of pathogenic clones of Vibrio cholerae by sequence analysis of four housekeeping genes. Infect. Immun. 67:1116-1124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Cai, S., D. Y. Kabuki, A. Y. Kuaye, T. G. Cargioli, M. S. Chung, R. Nielsen, and M. Wiedmann. 2002. Rational design of DNA sequence-based strategies for subtyping Listeria monocytogenes. J. Clin. Microbiol. 40:3319-3325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Center for Diseases Control and Prevention. 2000. Standardized molecular subtyping of food-borne bacterial pathogen by pulsed-field gel electrophoresis. Centers for Diseases Control and Prevention, Atlanta, GA.
  • 13.Chun, J., A. Huq, and R. R. Colwell. 1999. Analysis of 16S-23S rRNA intergenic spacer regions of Vibrio cholerae and Vibrio mimicus. Appl. Environ. Microbiol. 65:2202-2208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Clarke, S. C. 2002. Nucleotide sequence-based typing of bacteria and the impact of automation. Bioessays 24:858-862. [DOI] [PubMed] [Google Scholar]
  • 15.Colwell, R. R. 1996. Global climate and infectious disease: the cholera paradigm. Science 274:2025-2031. [DOI] [PubMed] [Google Scholar]
  • 16.Cooper, J. E., and E. J. Feil. 2004. Multilocus sequence typing-what is resolved? Trends Microbiol. 12:373-377. [DOI] [PubMed] [Google Scholar]
  • 17.Dalsgaard, A., O. Serichantalergs, A. Forslund, W. Lin, J. Mekalanos, E. Mintz, T. Shimada, and J. G. Wells. 2001. Clinical and environmental isolates of Vibrio cholerae serogroup O141 carry the CTX phage and the genes encoding the toxin-coregulated pili. J. Clin. Microbiol. 39:4086-4092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Danin-Poleg, Y., L. Somer, L. A. Cohen, E. Diamant, Y. Palti, and Y. Kashi. 2006. Towards the definition of pathogenic microbe. Int. J. Food Microbiol. 112:236-243. [DOI] [PubMed] [Google Scholar]
  • 19.Davis, B. M., K. E. Moyer, E. F. Boyd, and M. K. Waldor. 2000. CTX prophages in classical biotype Vibrio cholerae: functional phage genes but dysfunctional phage genomes. J. Bacteriol. 182:6992-6998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Diamant, E., Y. Palti, R. Gur-Arie, H. Cohen, E. M. Hallerman, and Y. Kashi. 2004. Phylogeny and strain typing of Escherichia coli, inferred from variation at mononucleotide repeat loci. Appl. Environ. Microbiol. 70:2464-2473. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Farfan, M., D. Minana-Galbis, M. C. Fuste, and J. G. Loren. 2002. Allelic diversity and population structure in Vibrio cholerae O139 Bengal based on nucleotide sequence analysis. J. Bacteriol. 184:1304-1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Farlow, J., D. Postic, K. L. Smith, Z. Jay, G. Baranton, and P. Keim. 2002. Strain typing of Borrelia burgdorferi, Borrelia aftelii, and Borrelia garinii by using multiple-locus variable-number tandem repeat analysis. J. Clin. Microbiol. 40:4612-4618. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Faruque, S. M., N. Chowdhury, M. Kamruzzaman, M. Dziejman, M. H. Rahman, D. A. Sack, G. B. Nair, and J. J. Mekalanos. 2004. Genetic diversity and virulence potential of environmental Vibrio cholerae population in a cholera-endemic area. Proc. Natl. Acad. Sci. USA 101:2123-2128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Faruque, S. M., and J. J. Mekalanos. 2003. Pathogenicity islands and phages in Vibrio cholerae evolution. Trends Microbiol. 11:505-510. [DOI] [PubMed] [Google Scholar]
  • 25.Faruque, S. M., I. B. Naser, M. J. Islam, A. S. Faruque, A. N. Ghosh, G. B. Nair, D. A. Sack, and J. J. Mekalanos. 2005. Seasonal epidemics of cholera inversely correlate with the prevalence of environmental cholera phages. Proc. Natl. Acad. Sci. USA 102:1702-1707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Field, D., and C. Wills. 1996. Long, polymorphic microsatellites in simple organisms. Proc. R. Soc. Lond. B Biol. Sci. 263:209-215. [DOI] [PubMed] [Google Scholar]
  • 27.Field, D., and C. Wills. 1998. Abundant microsatellite polymorphism in Saccharomyces cerevisiae, and the different distributions of microsatellites in eight prokaryotes and S. cerevisiae, result from strong mutation pressures and a variety of selective forces. Proc. Natl. Acad. Sci. USA 95:1647-1652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Galtier, N., M. Gouy, and C. Gautier. 1996. SEAVIEW and PHYLO_WIN: two graphic tools for sequence alignment and molecular phylogeny. Comput. Appl. Biosci. 12:543-548. [DOI] [PubMed] [Google Scholar]
  • 29.Garg, P., A. Aydanian, D. Smith, J. Glenn, G. B. Nair, and O. C. Stine. 2003. Molecular epidemiology of O139 Vibrio cholerae: mutation, lateral gene transfer, and founder flush. Emerg. Infect. Dis. 9:810-814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Gur-Arie, R., C. J. Cohen, Y. Eitan, L. Shelef, E. M. Hallerman, and Y. Kashi. 2000. Simple sequence repeats in Escherichia coli: abundance, distribution, composition, and polymorphism. Genome Res. 10:62-71. [PMC free article] [PubMed] [Google Scholar]
  • 31.Halpern, M., Y. B. Broza, S. Mittler, E. Arakawa, and M. Broza. 2004. Chironomid egg masses as a natural reservoir of Vibrio cholerae non-O1 and non-O139 in freshwater habitats. Microb. Ecol. 47:341-349. [DOI] [PubMed] [Google Scholar]
  • 32.Heidelberg, J. F., J. A. Eisen, W. C. Nelson, R. A. Clayton, M. L. Gwinn, R. J. Dodson, D. H. Haft, E. K. Hickey, J. D. Peterson, L. Umayam, S. R. Gill, K. E. Nelson, T. D. Read, H. Tettelin, D. Richardson, M. D. Ermolaeva, J. Vamathevan, S. Bass, H. Qin, I. Dragoi, P. Sellers, L. McDonald, T. Utterback, R. D. Fleishmann, W. C. Nierman, and O. White. 2000. DNA sequence of both chromosomes of the cholera pathogen Vibrio cholerae. Nature 406:477-483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jeanmougin, F., J. D. Thompson, M. Gouy, D. G. Higgins, and T. J. Gibson. 1998. Multiple sequence alignment with CLUSTAL X. Trends Biochem. Sci. 23:403-405. [DOI] [PubMed] [Google Scholar]
  • 34.Jiang, S. C., M. Matte, G. Matte, A. Huq, and R. R. Colwell. 2000. Genetic diversity of clinical and environmental isolates of Vibrio cholerae determined by amplified fragment length polymorphism fingerprinting. Appl. Environ. Microbiol. 66:148-153. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kaper, J. B., J. G. Morris, and M. M. Levine. 1995. Cholera. Clin. Microbiol. Rev. 8:48-86. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kashi, Y., D. King, and M. Soller. 1997. Simple sequence repeats as a source of quantitative genetic variation. Trends Genet. 13:74-78. [DOI] [PubMed] [Google Scholar]
  • 37.Kashi, Y., and D. G. King. 2006. Simple sequence repeats as advantageous mutators in evolution. Trends Genet. 22:253-259. [DOI] [PubMed] [Google Scholar]
  • 38.Keim, P., A. M. Klevytska, L. B. Price, J. M. Schupp, G. Zinser, K. L. Smith, M. E. Hugh-Jones, R. Okinaka, K. K. Hill, and P. J. Jackson. 1999. Molecular diversity in Bacillus anthracis. J. Appl. Microbiol. 87:215-217. [DOI] [PubMed] [Google Scholar]
  • 39.Keim, P., L. B. Price, A. M. Klevytska, K. L. Smith, J. M. Schupp, R. Okinaka, P. J. Jackson, and M. E. Hugh-Jones. 2000. Multiple-locus variable-number tandem repeat analysis reveals genetic relationships within Bacillus anthracis. J. Bacteriol. 182:2928-2936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Keim, P., M. N. Van Ert, T. Pearson, A. J. Vogler, L. Y. Huynh, and D. M. Wagner. 2004. Anthrax molecular epidemiology and forensics: using the appropriate marker for different evolutionary scales. Infect. Genet. Evol. 4:205-213. [DOI] [PubMed] [Google Scholar]
  • 41.Keys, C., S. Kemper, and P. Keim. 2005. Highly diverse variable number tandem repeat loci in the Escherichia coli O157:H7 and O55:H7 genomes for high-resolution molecular typing. J. Appl. Microbiol. 98:928-940. [DOI] [PubMed] [Google Scholar]
  • 42.King, D. G., M. Soller, and Y. Kashi. 1997. Evolutionary tuning knobs. Endeavour 21:36-40. [Google Scholar]
  • 43.Klevytska, A. M., L. B. Price, J. M. Schupp, P. L. Worsham, J. Wong, and P. Keim. 2001. Identification and characterization of variable-number tandem repeats in the Yersinia pestis genome. J. Clin. Microbiol. 39:3179-3185. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Kotetishvili, M., O. C. Stine, Y. Chen, A. Kreger, A. Sulakvelidze, S. Sozhamannan, and J. G. Morris, Jr. 2003. Multilocus sequence typing has better discriminatory ability for typing Vibrio cholerae than does pulsed-field gel electrophoresis and provides a measure of phylogenetic relatedness. J. Clin. Microbiol. 41:2191-2196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Kumar, S., K. Tamura, I. B. Jakobsen, and M. Nei. 2001. MEGA2: molecular evolutionary genetics analysis software. Bioinformatics 17:1244-1245. [DOI] [PubMed] [Google Scholar]
  • 46.Leal, N. C., M. Sobreira, T. C. Leal-Balbino, A. M. de Almeida, M. J. de Silva, D. M. Mello, L. M. Seki, and E. Hofer. 2004. Evaluation of a RAPD-based typing scheme in a molecular epidemiology study of Vibrio cholerae O1, Brazil. J. Appl. Microbiol. 96:447-454. [DOI] [PubMed] [Google Scholar]
  • 47.Li, M. R., M. Kotetishvili, Y. S. Chen, and S. Sozhamannan. 2003. Comparative genomic analyses of the vibrio pathogenicity island and cholera toxin prophage regions in nonepidemic serogroup strains of Vibrio cholerae. Appl. Environ. Microbiol. 69:1728-1738. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Lindstedt, B. A., T. Vardund, L. Aas, and G. Kapperud. 2004. Multiple-locus variable-number tandem-repeats analysis of Salmonella enterica subsp. enterica serovar Typhimurium using PCR multiplexing and multicolor capillary electrophoresis. J. Microbiol. Methods 59:163-172. [DOI] [PubMed] [Google Scholar]
  • 49.Lipp, E. K., A. Huq, and R. R. Colwell. 2002. Effects of global climate on infectious disease: the cholera model. Clin. Microbiol. Rev. 15:757-770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Mahalingam, S., Y. M. Cheong, S. Kan, R. M. Yassin, J. Vadivelu, and T. Pang. 1994. Molecular epidemiologic analysis of Vibrio cholerae O1 isolates by pulsed-field gel electrophoresis. J. Clin. Microbiol. 32:2975-2979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Maiden, M. C., J. A. Bygraves, E. Feil, G. Morelli, J. E. Russell, R. Urwin, Q. Zhang, J. Zhou, K. Zurth, D. A. Caugant, I. M. Feavers, M. Achtman, and B. G. Spratt. 1998. Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc. Natl. Acad. Sci. USA 95:3140-3145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Martin, P., K. Makepeace, S. A. Hill, D. W. Hood, and E. R. Moxon. 2005. Microsatellite instability regulates transcription factor binding and gene expression. Proc. Natl. Acad. Sci. USA 102:3800-3804. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Mekalanos, J. J. 1983. Duplication and amplification of toxin genes in Vibrio cholerae. Cell 35:253-263. [DOI] [PubMed] [Google Scholar]
  • 54.Metzgar, D., E. Thomas, C. Davis, D. Field, and C. Wills. 2001. The microsatellites of Escherichia coli: rapidly evolving repetitive DNAs in a non-pathogenic prokaryote. Mol. Microbiol. 39:183-190. [DOI] [PubMed] [Google Scholar]
  • 55.Miller, V. L., and J. J. Mekalanos. 1984. Synthesis of cholera toxin is positively regulated at the transcriptional level by toxR. Proc. Natl. Acad. Sci. USA 81:3471-3475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Moxon, E. R., P. B. Rainey, M. A. Nowak, and R. E. Lenski. 1994. Adaptive evolution of highly mutable loci in pathogenic bacteria. Curr. Biol. 4:24-33. [DOI] [PubMed] [Google Scholar]
  • 57.Moxon, E. R., and C. Wills. 1999. DNA microsatellites: agents of evolution? Sci. Am. 280:94-99. [DOI] [PubMed] [Google Scholar]
  • 58.Nandi, B., R. K. Nandy, S. Mukhopadhyay, G. B. Nair, T. Shimada, and A. C. Ghose. 2000. Rapid method for species-specific identification of Vibrio cholerae using primers targeted to the gene of outer membrane protein OmpW. J. Clin. Microbiol. 38:4145-4151. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Noller, A. C., M. C. McEllistrem, O. C. Stine, J. G. Morris, D. J. Boxrud, B. Dixon, and L. H. Harrison. 2003. Multilocus sequence typing reveals a lack of diversity among Escherichia coli O157:H7 isolates that are distinct by pulsed-field gel electrophoresis. J. Clin. Microbiol. 41:675-679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Olive, D. M., and P. Bean. 1999. Principles and applications of methods for DNA-based typing of microbial organisms. J. Clin. Microbiol. 37:1661-1669. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.O'Shea, Y. A., F. J. Reen, A. M. Quirke, and E. F. Boyd. 2004. Evolutionary genetic analysis of the emergence of epidemic Vibrio cholerae isolates on the basis of comparative nucleotide sequence analysis and multilocus virulence gene profiles. J. Clin. Microbiol. 42:4657-4671. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Pfau, J. D., and R. K. Taylor. 1996. Genetic footprint on the ToxR-binding site in the promoter for cholera toxin. Mol. Microbiol. 20:213-222. [DOI] [PubMed] [Google Scholar]
  • 63.Purdy, A., F. Rohwer, R. Edwards, F. Azam, and D. H. Bartlett. 2005. A glimpse into the expanded genome content of Vibrio cholerae through identification of genes present in environmental strains. J. Bacteriol. 187:2992-3001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Ramamurthy, T., S. Garg, R. Sharma, S. K. Bhattacharya, G. B. Nair, T. Shimada, T. Takeda, T. Karasawa, H. Kurazano, and A. Pal. 1993. Emergence of novel strain of Vibrio cholerae with epidemic potential in southern and eastern India. Lancet 341:703-704. [DOI] [PubMed] [Google Scholar]
  • 65.Raskin, D., J. Bina, and J. Mekalanos. 2004. Genomic and genetic analysis of Vibrio cholerae. ASM News 70:57-62. [Google Scholar]
  • 66.Reen, F. J., and E. F. Boyd. 2005. Molecular typing of epidemic and nonepidemic Vibrio cholerae isolates and differentiation of V. cholerae and V. mimicus isolates by PCR-single-strand conformation polymorphism analysis. J. Appl. Microbiol. 98:544-555. [DOI] [PubMed] [Google Scholar]
  • 67.Rivera, I. N. G., J. Chun, A. Huq, R. B. Sack, and R. R. Colwell. 2001. Genotypes associated with virulence in environmental isolates of Vibrio cholerae. Appl. Environ. Microbiol. 67:2421-2429. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Sack, D. A., R. B. Sack, G. B. Nair, and A. K. Siddique. 2004. Cholera. Lancet 363:223-233. [DOI] [PubMed] [Google Scholar]
  • 69.Salcedo, C., L. Arreaza, B. Alcala, L. de la Fuente, and J. A. Vazquez. 2003. Development of a multilocus sequence typing method for analysis of Listeria monocytogenes clones. J. Clin. Microbiol. 41:757-762. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Reference deleted.
  • 71.Shepard, J. R., Y. Danin-Poleg, Y. Kashi, and D. R. Walt. 2005. Array-based binary analysis for bacterial typing. Anal. Chem. 77:319-326. [DOI] [PubMed] [Google Scholar]
  • 72.Shimada, T., E. Arakawa, K. Itoh, T. Okitsu, A. Matsushima, Y. Asai, S. Yamai, T. Nakazato, G. B. Nair, M. J. Albert, and Y. Takeda. 1994. Extended serotyping scheme for Vibrio cholerae. Curr. Microbiol. 28:175-178. [Google Scholar]
  • 73.Singh, D. V., M. H. Matte, G. R. Matte, S. Jiang, F. Sabeena, B. N. Shukla, S. C. Sanyal, A. Huq, and R. R. Colwell. 2001. Molecular analysis of Vibrio cholerae O1, O139, non-O1, and non-O139 strains: clonal relationships between clinical and environmental isolates. Appl. Environ. Microbiol. 67:910-921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Smirnova, N. I., N. B. Cheldyshova, S. P. Zadnova, and V. V. Kutyrev. 2004. Molecular-genetic peculiarities of classical biotype Vibrio cholerae, the etiological agent of the last outbreak Asiatic cholera in Russia. Microb. Pathog. 36:131-139. [DOI] [PubMed] [Google Scholar]
  • 75.Somer, L., Y. Danin-Poleg, E. Diamant, R. Gur-Arie, Y. Palti, and Y. Kashi. 2005. Amplified intergenic locus polymorphism as a basis for bacterial typing of Listeria spp. and Escherichia coli. Appl. Environ. Microbiol. 71:3144-3152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Somer, L., and Y. Kashi. 2003. A PCR method based on 16S rRNA sequence for simultaneous detection of the genus Listeria and the species Listeria monocytogenes in food products. J. Food Prot. 66:1658-1665. [DOI] [PubMed] [Google Scholar]
  • 77.Titze-de-Almeida, R., R. J. Willems, J. Top, I. P. Rodrigues, R. F. Ferreira, H. Boelens, M. C. Brandileone, R. C. Zanella, M. S. Felipe, and A. van Belkum. 2004. Multilocus variable-number tandem-repeat polymorphism among Brazilian Enterococcus faecalis strains. J. Clin. Microbiol. 42:4879-4881. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Top, J., L. M. Schouls, M. J. Bonten, and R. J. Willems. 2004. Multiple-locus variable-number tandem repeat analysis, a novel typing scheme to study the genetic relatedness and epidemiology of Enterococcus faecium isolates. J. Clin. Microbiol. 42:4503-4511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Urwin, R., and M. C. J. Maiden. 2003. Multi-locus sequence typing: a tool for global epidemiology. Trends Microbiol. 11:479-487. [DOI] [PubMed] [Google Scholar]
  • 80.van Belkum, A., S. Scherer, L. van Alphen, and H. Verbrugh. 1998. Short-sequence DNA repeats in prokaryotic genomes. Microbiol. Mol. Biol. Rev. 62:275-293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.van Belkum, A., S. Scherer, W. van Leeuwen, D. Willemse, L. van Alphen, and H. Verbrugh. 1997. Variable number of tandem repeats in clinical strains of Haemophilus influenzae. Infect. Immun. 65:5017-5027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.van Belkum, A., M. Struelens, A. de Visser, H. Verbrugh, and M. Tibayrenc. 2001. Role of genomic typing in taxonomy, evolutionary genetics, and microbial epidemiology. Clin. Microbiol. Rev. 14:547-560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Vodop'ianov, S. O., I. P. Oleinikov, E. K. Goncharov, O. V. Duvanova, M. B. Mishan'kin, I. I. Suchkov, A. S. Vodop'ianov, M. V. Shishiianu, E. V. Shcherbakova, I. V. Kornienko, and B. N. Mishan'kin. 2002. Variable tandem repeat VcA of Vibrio cholerae. Mol. Biol. 36:1074-1079. (In Russian.) [PubMed] [Google Scholar]
  • 84.Wachsmuth, K., Ø. Olsvik, G. M. Evins, and T. Popovic. 1994. Molecular epidemiology of cholera, p. 357-370. In I. K. Wachsmuth, P. A. Blake, and Ø. Olsvik (ed.), Vibrio cholerae and cholera: molecular to global perspectives. ASM Press, Washington, DC.
  • 85.Wiedmann, M. 2002. Subtyping of bacterial food-borne pathogens. Nutr. Rev. 60:201-208. [DOI] [PubMed] [Google Scholar]
  • 86.Young, N. D., and J. Healy. 2003. GapCoder automates the use of indel characters in phylogenetic analysis. BMC Bioinformatics 4:6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Zo, Y. G., I. N. G. Rivera, E. Russek-Cohen, M. S. Islam, A. K. Siddique, M. Yunus, R. B. Sack, A. Huq, and R. R. Colwell. 2002. Genomic profiles of clinical and environmental isolates of Vibrio cholerae 01 in cholera-endemic areas of Bangladesh. Proc. Natl. Acad. Sci. USA 99:12409-12414. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

[Supplemental material]
jcm_45_3_736__1.pdf (125KB, pdf)

Articles from Journal of Clinical Microbiology are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES