Skip to main content
Infection and Immunity logoLink to Infection and Immunity
. 2002 Apr;70(4):1971–1983. doi: 10.1128/IAI.70.4.1971-1983.2002

Influence of Recombination and Niche Separation on the Population Genetic Structure of the Pathogen Streptococcus pyogenes

Awdhesh Kalia 1, Brian G Spratt 2, Mark C Enright 3, Debra E Bessen 1,*
PMCID: PMC127860  PMID: 11895961

Abstract

The throat and skin of the human host are the principal reservoirs for the bacterial pathogen Streptococcus pyogenes. The emm locus encodes structurally heterogeneous surface fibrils that play numerous roles in virulence, depending on the strain. Isolates harboring the emm pattern A-C marker exhibit a strong tendency to cause throat infection, whereas emm pattern D strains are usually recovered from impetigo lesions; as a group, emm pattern E organisms fail to display obvious tissue tropisms. The peak incidence for streptococcal pharyngitis and impetigo varies with season and locale, leading to wide spatial and temporal distances between throat and skin strains. To assess any impact of niche separation on genetic variation, the extent of recombinational exchange between emm pattern A-C, D, and E subpopulations was evaluated. Analysis of nucleotide sequence data for internal portions of seven housekeeping loci from 212 isolates provides evidence of extensive recombination between strains belonging to different emm pattern subpopulations. Furthermore, no fixed nucleotide differences were found between emm pattern A-C and D strains. Thus, despite some niche separation created by distinct epidemiological trends and innate tissue tropisms there is little evidence for neutral gene divergence between throat and skin strains. Maintenance of a relationship between emm pattern and tissue tropism in the face of underlying recombination suggests that tissue tropism is associated with emm or a closely linked gene.


Streptococcus pyogenes strains (commonly referred to as group A streptococci [GAS]) are among the most prevalent of pathogenic bacteria which afflict humans. They have a worldwide distribution, and humans are the only known biological host. S. pyogenes can cause a life-threatening illness, such as toxic shock syndrome, or trigger autoimmune disorders, such as rheumatic fever. However, most often S. pyogenes causes only mild infection at either the throat or skin, leading to pharyngitis or impetigo, respectively. These two tissue sites serve as the principal reservoir for organisms of this species.

One striking feature of group A streptococcal disease is the different degree to which throat and skin infections occur (8, 34). In many temperate regions of the world, the incidence of pharyngitis peaks in winter, whereas impetigo, although less common, peaks during the summer months. In contrast, for many tropical host populations, impetigo is hyperendemic year round, whereas throat infection, be it pharyngitis or asymptomatic carriage, ranges from very low to moderate levels. These distinct epidemiological trends in streptococcal disease can lead to wide spatial and temporal distances between organisms infecting the throat or the skin and thereby might limit the opportunities for lateral gene exchange.

The recognition of distinct throat and skin strains of S. pyogenes emerged from several decades of field epidemiology, and this concept is now widely accepted in the streptococcal field (1, 8, 10, 14, 17, 18, 31, 34, 35, 41, 47). M proteins form surface fibrils which contain the antigenic targets of a serological typing scheme; the type-specific determinants lie at the amino-terminal ends of the fibril tips (15, 24). Certain M-types were frequently isolated from cases of pharyngitis but were rarely isolated from impetigo lesions. Likewise, other M-types were found to be far more common in impetigo. From this arose the concept of distinct sets of throat and skin types. Using a related typing scheme based on variation in the emm genes which encode the M proteins, >150 emm types have been characterized according to nucleotide sequence heterogeneity at the 5′ ends (2, 20). Antibodies directed to the type-specific epitopes of M proteins are protective and thereby emm genes are under strong selection by the host immune response.

Closer examination of emm gene structure reveals four emm gene subfamilies, defined by nucleotide sequence divergence at the 3′ end (27). Analysis of the chromosomal content of emm subfamily genes, and their relative arrangement, led to the identification of five basic patterns (designated A through E), which account for ∼99% of all strains. A worldwide collection of genetically diverse isolates, recovered from cases of pharyngitis or impetigo over a time frame spanning nearly 50 years, revealed that isolates with emm patterns A, B, and C have a strong tendency to be recovered from the throat, whereas emm pattern D strains are most often isolated from impetigo lesions; as a group, pattern E strains display no obvious tissue site preference (7).

It has since been confirmed in population-based studies that emm pattern A-C isolates represent the historical throat types and pattern D isolates represent the skin types. Nearly all pharyngeal isolates (>99%) collected from hospitals in Rome were found to be of emm types typically giving rise to emm patterns A-C or E, whereas <1% were of emm types associated with pattern D (16). In a rural aboriginal community in tropical Australia where impetigo is hyperendemic, not a single case of GAS pharyngitis was detected during the 25-month surveillance period; of the total GAS isolates recovered, nearly 85% were of either emm pattern D or E, with relatively few emm pattern A-C strains found (5). An experimental model for streptococcal impetigo lends further support to a strong association between emm pattern D strains and superficial infection at the skin (44, 45). Although the differences in relative emm pattern distribution among throat and skin isolates are statistically significant, the associations between emm pattern A-C and the throat, and emm pattern D and the skin, are not absolute.

Sequence divergence among neutral (i.e., housekeeping) genes has proved useful for identifying discrete ecological populations (12). emm pattern A-C strains and pattern D strains can be regarded as distinct ecological populations because they exploit different niches. The emergence of a new ecological population from an ancestral population is believed to begin with some adaptive change that allows the exploitation of a new ecological niche. Subsequently, the periodic emergence of fitter variants (periodic selection) within the new ecological population results in the purging of neutral gene diversity within this population but not in the ancestral population (and vice versa). If recombination arising from lateral gene transfer between the new and ancestral populations remains sufficiently low, multiple rounds of periodic selection events are expected to lead to substantial neutral gene divergence between the two ecological populations, but not within a population. This is an early step in the evolution of new bacterial species (12).

In this report, the extent of genetic recombination and neutral gene divergence, both within and between the emm pattern subpopulations, is measured for seven housekeeping loci. No evidence for neutral gene divergence between strains with differing emm patterns or between isolates recovered from the throat versus skin was observed. The findings suggest that recombination between the two ecological populations is too high to allow for the emergence of neutral gene sequence clusters. In cases where neutral alleles are randomly distributed with respect to ecologically distinct populations, genetic variation that is strongly associated with the different ecological populations may be directly responsible for adaptation to the ecological niche, that is to say, emm gene products (or closely linked genes) may have a direct role in tissue tropism.

MATERIALS AND METHODS

Bacteria.

The 212 isolates of S. pyogenes (GAS) used in this study were described previously (19).

Genotyping.

Chromosomal DNA used as a template for PCR was prepared from freshly grown bacteria according to previously described methods (6).

Multilocus sequence typing (MLST) was determined as previously reported (19). In brief, internal fragments of the glucose kinase (gki), glutamine transporter protein (gtr), glutamate racemase (murI), DNA mismatch repair protein (mutS or hexA), transketolase (recP or tktB), xanthine phosphoribosyl transferase (xpt), and acetyl-CoA acetyltransferase (yqiL, not atoB as previously indicated) genes were amplified by PCR and subjected to nucleotide sequence determination. The relative positions of the seven housekeeping loci on the M1 genome (22) are as follows: SPy0140 (yqiL), SPy0361 (murI), SPy1136 (xpt), SPy1506 (gtr), SPy1529 (gki), SPy1676 (recP), and SPy2148 (mutS). For each locus, every different sequence was assigned a distinct allele number, and each isolate was defined by a series of seven integers (the allelic profile) corresponding to the alleles at the seven loci, in the order (alphabetical) gki-gtr-murI-mutS-recP-xpt-yqiL. Isolates with identical allelic profiles were assigned to the same sequence type (ST).

emm sequence typing was determined as reported previously (19). emm sequence typing is based on the 5′ end of the central emm gene within the emm chromosomal region (for map, see references 5 and 6). There is a very strong correspondence between the M type, as determined by serology, and the emm type that meets the stated definition (2, 20). Until validation is complete, new emm types are designated emmst, which stands for emm sequence type (20) and is not to be confused with ST, which refers to the MLST allelic profile. A complete listing of emm types found in association with S. pyogenes is maintained on the Internet (www.cdc.gov/ncidod/biotech/strep/strains.html).

The emm pattern was determined for at least one representative of each unique emm type-ST combination (n = 104) according to previously described methods (5, 6) involving a PCR-based mapping approach that uses oligonucleotide primers specific for each of the four major lineages that arise from phylogenetic trees based on the 3′ ends of emm genes (26, 27).

Computations and phylogenetic analysis.

A matrix of pair-wise differences in allelic profiles between strains was constructed by cluster analysis based on the proportion of loci having shared alleles using the unweighted pair-group method with arithmetic averages (UPGMA) and the percent disagreement distance measure (Statistica version 5.5; StatSoft, Tulsa, Okla.).

The 212 isolates of S. pyogenes were represented by 100 STs, or clones. In order to minimize any sampling bias that might arise from inclusion of single and double locus variants (i.e., clonal complexes), representative strains were selected at a linkage distance of 0.3, based on the UPGMA dendrogram (number of STs = 72). The 28 STs eliminated by linkage distance 0.3 calculations are as follows: 3, 4, 7, 8, 12, 16, 18, 21, 23, 26, 29, 38, 41, 45, 50, 58, 61, 66, 68, 69, 70, 74, 75, 77, 79, 84, 94, and 96. The 72 STs at a linkage distance of 0.3 were not chosen at random, but were selected based on being the most representative of the S. pyogenes strains. First, in the few instances where there was more than one emm type associated with an ST, an isolate of the ST was included in the set of 72. The second ranking criterion was that the ST which was most abundant among the set of 212 isolates was retained. The third criterion was that STs which had a single locus variant within the 100-ST set were retained. Finally, if all else was equal, the ST having a variant allele with the lowest allelic frequency among the set of 212 GAS isolates was removed. In all instances, the eliminated ST was of the same emm pattern as the other characterized descendants of that node, except for ST7 and ST61.

For analysis of the subset of isolates (N = 91) recovered from either the upper respiratory tract (URT) or impetigo lesions, the criteria stated above were also applied for selecting strains at a linkage distance of 0.3. In addition, the emm pattern was determined for all 91 of these isolates.

Phylogenetic trees of the nucleotide sequences from housekeeping loci were constructed using the maximum likelihood (ML) method available in the PAUP* package (version 4.0, beta 8). The initial start-up trees were obtained using the neighbor-joining method. ML trees were reconstructed using optimized values for the transition-to-transversion ratio (Ts/Tv) and the α parameter, which describes the extent of rate variation among nucleotide sites assuming a discrete gamma distribution with eight categories; both were estimated from the empirical data during tree constructions. Optimization of the desired evolutionary model of DNA substitution and the parameters was done using hierarchical likelihood ratio tests (29), with the aid of MODELTEST, version 3.0 (40).

For congruency analysis, the HKY85 + G model for DNA substitution was used for all tree constructions. An ML method was used to determine the extent of congruency between housekeeping gene trees (21, 28). For each housekeeping gene, the differences in log likelihood (Δ−ln L) were computed between the ML tree for the reference gene and the ML trees of the other housekeeping loci. The comparison trees were constructed so that their topologies were used with the sequence data from the reference tree, but with branch lengths optimized to maximize the likelihood of the second tree. To determine whether differences in log likelihood scores were significant, 200 random tree topologies were generated for each reference gene, and the likelihoods for the random trees were estimated by optimizing the branch lengths. The differences in likelihood between the reference tree and the random tree topologies are considered a null distribution of Δ−ln L values, as would be obtained when there is no more similarity in topology among gene trees than expected by chance. If the Δ−ln L values between ML trees of different loci are less than the 99th percentile of the null distribution for random tree topologies, then the ML trees are considered to be significantly different and thereby incongruent. A previous report found that this method was not adversely affected by very similar sequences that contained little phylogenetic signal (21).

Split decomposition analysis was performed using SplitsTree (version 3.1). Distance measures were derived using the optimized model of evolution as determined using the ML approach. Splits graphs were also constructed using uncorrected Hamming distance measures (30).

Nucleotide diversity (average pair-wise sequence divergence; π) and average nucleotide divergence between populations (Κ) were calculated using DnaSP (version 3.52).

RESULTS

Relationships between allelic profiles of housekeeping loci and S. pyogenes strain subpopulations.

The sequences of internal portions of seven housekeeping loci have been determined for 212 isolates of S. pyogenes, representing 78 emm types (19). From this set, 100 unique allelic profiles (STs) were resolved. Each ST was represented by isolates sharing the same emm type, with three exceptions, in which isolates of different emm types had the same allelic profile. For the 104 unique emm type-ST combinations, a matrix of pair-wise distances between strains was constructed, based on the proportion of loci having shared alleles, by cluster analysis using UPGMA (Fig. 1). Data on allelic profiles, emm type, and emm pattern for the S. pyogenes strains under study are posted on the Internet (http://www.mlst.net); most of the raw data has been previously published (6, 19). For nearly all emm types, the isolates tested share the same emm pattern grouping.

FIG. 1.

FIG. 1.

Cluster analysis of strains of defined emm pattern. The dendrogram was constructed using UPGMA and shows all unique emm type-ST combinations (n = 104). Labels at branch tips indicate emm type, followed by emm pattern group (A-C [ABC], D, or E); for cases where there is more than one ST associated with a given emm type, the string ends with a numerical assignment. In all instances, isolates of a given emm type share the same emm pattern grouping. Large asterisks denote nodes at linkage distances of ≤0.6, with descendants represented by six or more distinct clones (i.e., STs) and/or emm types. From the top of the dendrogram, the clusters are comprised of the following numbers of distinct emm types, STs, and emm patterns: cluster 1, six emm types and six STs (five of pattern A-C, one of pattern D); cluster 2, eight emm types and seven STs (six of pattern D, one of pattern E); cluster 3, seven emm types and nine STs (three of pattern A-C, six of pattern D); cluster 4, six emm types and six STs (one of pattern A-C, one of pattern D, four of pattern E).

Based on epidemiological studies in several different host populations, it was established that emm pattern serves as an imperfect marker for the tissue site preference of many strains of S. pyogenes. Thus, in a population-based sampling, a given strain or clone might be recovered from the skin of many hosts, but only occasionally from the throat, and thereby exhibit a strong preference for one tissue over the other. The dendrogram in Fig. 1 shows that there is a general lack of concordance between emm pattern and the genetic relatedness of strains inferred using allelic profiles of neutral housekeeping genes. Strains of all three emm pattern groupings are widely scattered along branch tips up and down the dendrogram. At a linkage distance of 0.3, 72 STs were resolved, only 19 of which were represented by more than one strain; these are clonal complexes, comprised of single and/or double locus variants. Only 2 of the clonal complexes were represented by strains of different emm patterns, whereas 12 clonal complexes contained distinct clones sharing an emm type; emm type often corresponds with clone or clonal complex and probably reflects recent diversification (19). However, moving deeper into the phylogeny, at a linkage distance of <0.6, four major genetically diverse clusters were evident, and each of these major clusters contains six or more clones. For each of these four clusters, strains belonging to different emm pattern subpopulations are represented. Thus, the findings on deeper phylogenetic relationships indicate that strains of an emm pattern group do not cluster together.

Of the 212 isolates of S. pyogenes, 91 were actually known to be recovered from either the URT or impetigo lesions. Forty-eight emm types were found among the throat and skin isolates. Also, 57 STs and 61 unique emm type-ST combinations are represented among this subset of strains. Of the 91 isolates, 55 (60.4%) originated from the URT and 36 (39.6%) from impetigo lesions. For the 55 URT isolates, the majority were of emm pattern A-C (37 [67.3%]), 3 (5.5%) were of emm pattern D, and 15 (27.3%) were of emm pattern E. For the 36 impetigo isolates, only a small minority were of emm pattern A-C (4 [11.1%]), 17 (47.2%) were of emm pattern D, and 15 (41.7%) were of emm pattern E. Thus, among this set of isolates, which were originally selected in large part based on diversity in emm type (19), the correlation was high between emm pattern A-C and URT infection and between emm pattern D and impetigo.

For the 61 unique emm type-ST combinations of URT and impetigo isolates, a matrix of pair-wise distances between allelic profiles was constructed and the similarity between strains was evaluated by cluster analysis (Fig. 2). Both throat and skin isolates were widely distributed throughout the dendrogram. Of the 17 emm type-STs represented by >2 isolates, 10 contained only URT isolates and 3 were limited to impetigo isolates, whereas 4 had a mixture of both URT and impetigo isolates. Three distinct, major clusters were evident, and each cluster contains six or more unique emm type-ST combinations separated by linkage distances of <0.6. One cluster consists of a mix of both throat and skin isolates, with seven unique emm type-STs and seven URT and eight impetigo isolates; both emm patterns A-C and D are represented. However, two clusters were homogenous with regard to the tissue site origin of strains. One cluster was represented by six unique emm type-STs and contained seven impetigo isolates that were all emm pattern D. Another cluster of eight unique emm type-STs was represented by 10 URT isolates, all of emm pattern A-C. Thus, some noticeable degree of clustering is evident for both URT and impetigo isolates.

FIG. 2.

FIG. 2.

Cluster analysis of strains of defined tissue site of isolation. The dendrogram shows all unique emm type-ST combinations (n = 61) of isolates known to be recovered from either the URT or impetigo lesions. Labels at branch tips indicate emm type, followed by emm pattern group (A-C [ABC], D, or E); for cases where there is more than one ST associated with a given emm type, the string ends with a numerical assignment. In all instances, isolates of a given emm type share the same emm pattern grouping. Large asterisks denote nodes at linkage distances of ≤0.6, with descendants represented by six or more distinct clones (i.e., STs) and/or emm types. The number of URT (○) and impetigo (▴) isolates for each emm type-ST combination is indicated to the left of the branch tip labels.

emm patterns and evolutionary divergence of housekeeping loci.

One shortcoming of UPGMA cluster analysis is that alleles are regarded as character states and the extent of nucleotide diversity among alleles is not considered. Also, data interpretation can be limited because many of the deeper branches in the UPGMA dendrogram are probably unreliable, as they represent comparisons between strains differing at all (or nearly all) of the seven housekeeping loci.

Another approach for determining whether the throat and skin strains of S. pyogenes comprise distinct evolutionary lineages is to directly assess the phylogenetic relationships inferred from sequences of the housekeeping gene sequences of each strain. Since the DNA fragments obtained for each locus are only about 450 bp, a phylogenetic tree generated by the ML method was constructed for concatenates of the seven housekeeping loci for known URT and impetigo isolates (Fig. 3A). The concatenated alleles are 3,134 bp in length; this length exceeds sequences tested in computer simulations, showing that longer sequences improve the accuracy of phylogenetic inference (43). Taxa were selected based on a linkage distance of 0.3 in the UPGMA dendrogram (Fig. 2). This truncation value eliminates from the analysis multiple isolates that differ from each other at only one or two of the seven housekeeping loci used in MLST and which are likely to have arisen following recent clonal diversification (19). Also, by evaluating slightly deeper phylogenetic relationships, any effects of sampling bias (e.g., multiple isolates of the same emm type) are minimized.

FIG. 3.

FIG. 3.

ML trees of concatenated housekeeping alleles. Unrooted, radial gene trees generated by the ML method, using the concatenated sequences of the seven housekeeping alleles for each strain (nucleotide sequence length = 3,134 bp), are shown. (A) Taxa for throat (n = 20) and impetigo (n = 22) isolates were selected at a linkage distance of 0.3; the most appropriate model for evolution was used (TrN + G + I). (B) Taxa for emm pattern A-C (n = 17) and D (n = 19) strains were selected at a linkage distance of 0.3; the most appropriate model for evolution was used (HKY85 + G + I). To determine the significance of the observed groupings, bootstrap analysis with 1,000 replicates was performed, using trees reconstructed by the neighbor-joining method to avoid excessive computational time while incorporating the same ML substitution parameters. Bootstrap values of >50% are indicated. The scale bars indicate the number of nucleotide substitutions per site. In panel A, strains for which there were descendants isolated from both tissue sites based on the dendrogram in Fig. 2 are indicated (∗); branch labels indicate throat (T) or skin (S) sources of each isolate.

A phylogenetic tree derived from the concatenated housekeeping alleles of URT and impetigo isolates is shown (Fig. 3A). Two branches supported by bootstrap values of >70% are evident. One strongly supported branch contains only emm pattern A-C throat-derived strains, whereas a second well-supported branch contains a mixture of throat and skin isolates (patterns A-C and D). Overall, there appears to be a lack of sharp division between throat and skin isolates.

A phylogenetic tree generated by the ML method was also constructed for concatenates of the seven housekeeping loci for emm pattern A-C and D isolates, which as a group display strong preferences for the throat and skin, respectively (Fig. 3B). Taxa were selected based on a linkage distance of 0.3 (Fig. 1). Two branches that were supported by bootstrap values of >70% contained a mix of emm pattern A-C and D isolates, consistent with the idea that certain pattern A-C and D strains might share a recent common ancestor. Furthermore, isolates represented by one of the well-supported branches were known to be recovered from both tissue sites (m3ABC from URT; m53D, m91D, and M93D2 from impetigo lesions). However, most branches of the ML tree were not well supported, and therefore, it is difficult to draw conclusions on evolutionary relationships between emm pattern A-C and D strains. Nonetheless, the tree provided no evidence for any sharp division between emm pattern A-C and D strains.

The weak support for most branches in the ML trees can be explained by the very low phylogenetic signal in the housekeeping alleles and/or a history of recombination. Very low levels of average pair-wise sequence divergence (π) were observed for alleles found within both the emm pattern A-C and D subpopulations (Table 1). This finding is similar to the result obtained for the entire S. pyogenes population, as analyzed in a prior report (21). Furthermore, the average nucleotide divergence (Κ) between the emm pattern A-C and D populations was also low. Of the 3,134 bp of total nucleotide sequence for the seven housekeeping genes, 129 nucleotide sites are polymorphic for the combined alleles found among emm pattern A-C and D strains selected at a linkage distance of 0.3. Of these, 50 polymorphic sites (38.8%) are shared among pattern A-C and D strains.

TABLE 1.

DNA divergence between emm pattern A-C and D subpopulationsa

Locus Size (bp) No. of polymorphic sites
Nucleotide diversity π
Κb No. of fixed differences No. of shared polymorphisms
Pattern A-C Pattern D Total Pattern A-C Pattern D
gki 498 23 21 30 0.01335 0.01087 0.01252 0 14
gtr 450 14 12 19 0.00704 0.0076 0.00784 0 7
murI 438 8 6 10 0.00543 0.00359 0.0046 0 4
mutS 405 10 11 14 0.0071 0.00589 0.0066 0 7
recP 459 14 15 21 0.0081 0.0092 0.00947 0 8
xpt 450 17 10 23 0.00689 0.00412 0.00568 0 4
yqiL 434 8 10 12 0.00538 0.0052 0.00554 0 6
a

Values are based on alleles found in strains selected at a linkage distance of 0.3.

b

K, average nucleotide divergence between two populations. The values were Jukes-Cantor corrected.

A hallmark of evolutionary divergence resulting from ecological separation is an accumulation of fixed differences at some nucleotide sites. However, when comparing alleles derived from emm pattern A-C and D strains, there were no sites at which the nucleotide present in one population was different from that in the other population (Table 1). Furthermore, there were no fixed nucleotide differences between the 55 URT- and 36 impetigo-derived isolates (data not shown). The complete absence of fixed nucleotide differences in housekeeping loci between emm pattern A-C and D strains and between URT and impetigo isolates suggests that any ecological separation between throat and skin strains of S. pyogenes has not been sufficient to lead to genetic differentiation between these populations. This finding is further supported by the ratio of between-population divergence (Κ) to the average within-population divergence (π) for emm pattern A-C and D strains (Table 1), which approaches unity for all loci (range, 1.016 to 1.071) (37).

Recombination between S. pyogenes strain subpopulations.

Recombination can disrupt phylogenetic relationships. One approach for determining the extent of recombination is to estimate the level of congruency between tree topologies for different genes. Congruent trees signify relatively low levels of recombination, whereas incongruent trees indicate that recombination has been sufficiently frequent to eliminate the phylogenetic signal.

The extent of congruence between tree topologies for all pair-wise comparisons of the seven housekeeping loci was determined by the ML method (Table 2). Congruence of gene tree topologies was determined for all 72 STs of S. pyogenes, selected at a linkage distance of 0.3 based on the UPGMA dendrogram (Fig. 1) and representing all three emm pattern groups. Out of a possible 42 pair-wise tree comparisons, there were three examples of significant congruence between trees, whereby the differences in likelihoods of the trees was less than the 99th percentile of the random distribution of random tree topologies (Fig. 4, gtr, recP, and xpt). However, for each instance, the comparisons fell just marginally outside of the 99th percentile of the random distribution of random tree topologies.

TABLE 2.

Statistical tests of congruence of loci

emm pattern (no. of isolates) Locus No. of alleles represented at a linkage distance of 0.3 −ln L of ML tree Δ−ln L of ML trees based on other genes 99th percentile Δ−ln L in random trees Locus(i) outside 99th percentile of random trees
All (72) gki 32 1,025.346 531.347-567.297 515.167
gtr 30 872.383 222.310-262.264 225.761 recP
murI 21 762.807 210.676-235.932 207.941
mutS 17 703.505 269.369-286.282 243.313
recP 33 1,122.478 506.298-619.705 517.758 gtr
xpt 25 847.913 140.672-159.500 141.088 murI
yqiL 20 750.401 199.448-223.575 197.275
A to C (17) gki 13 873.574 71.438-114.645 76.183 recP
gtr 12 720.084 37.995-51.586 31.425
murI 8 668.895 25.324-60.965 42.844 recP, yqiL
mutS 7 625.980 57.626-70.617 43.431
recP 10 750.122 53.041-83.228 53.537 gki
xpt 11 734.882 22.123-32.500 19.712
yqiL 8 656.719 22.407-61.750 33.289 murI
D (19) gki 11 850.621 69.594-103.789 64.627
gtr 10 716.289 60.351-72.304 48.135
murI 6 650.398 44.548-59.966 28.886
mutS 9 631.579 101.812-114.478 59.269
recP 11 758.422 84.895-133.028 74.051
xpt 10 689.388 29.208-37.413 26.375
yqiL 10 676.989 53.123-65.909 39.580
E (36) gki 17 890.048 236.325-274.259 211.935
gtr 17 779.819 109.024-140.679 118.992 recP
murI 14 702.981 133.632-151.467 111.555
mutS 12 653.179 123.346-134.129 102.204
recP 20 1,017.812 270.716-397.968 267.703
xpt 11 699.425 43.805-57.364 43.714
yqiL 13 688.612 103.707-120.363 78.513

FIG. 4.

FIG. 4.

ML analysis of gene tree congruence. The ML trees of each housekeeping locus are compared with the ML trees from the other six housekeeping loci. The differences in log likelihood scores (Δ−ln L) are shown between housekeeping loci (squares) and 200 trees of random topology (diamonds). The lower 99th percentile of the likelihood differences between the ML tree for each locus and the 200 random tree topologies is indicated (vertical line).

This finding of relatively low congruency among 72 STs is consistent with a previous report on S. pyogenes strains selected at a linkage distance of 0.55 (41 STs), where the congruence between the ML trees for the same seven housekeeping genes was no better than the congruence between each tree and trees of random topologies for all comparisons (21). The presence here of slight congruence in some comparisons is likely due to truncation at a linkage distance of 0.3, which will result in the inclusion of some isolates that have a relatively recent common ancestor. Therefore, rates of recombination are relatively high in the overall S. pyogenes population.

The extent of intragenic recombination within each of the housekeeping loci was assessed by split decomposition analysis using the alleles at each of the seven loci (Fig. 5). In splits graphs, conflicting phylogenetic signals are depicted by parallel edges, and the extent of both bifurcating and networked evolution can be illustrated. For the yqiL locus, all 22 alleles that have been identified among the 212 isolates of S. pyogenes are shown; a fit parameter of 99 indicates that near complete resolution of the relationships between alleles was achieved. The four alleles occupying the corners of the central network (yqiL01, yqiL04, yqiL10, and yqiL15) differ from one another at two nucleotide sites (109 and 237), and all four possible combinations of two nucleotides-two sites are found. Strains from all three emm pattern groupings are represented by alleles that occupy the central network. The non-networked branches that emanate from three of the central alleles represent housekeeping alleles found in strains corresponding to all emm patterns. The split decomposition findings for yqiL are indicative of intragenic recombination between alleles of strains belonging to different emm patterns.

FIG. 5.

FIG. 5.

FIG. 5.

Split decomposition analysis of housekeeping alleles. Splits graphs are shown for all known alleles of yqiL (A) and mutS (B) found in S. pyogenes. Representation by isolates of the three emm pattern groupings is indicated for each allele. For yqiL, nucleotide differences between the four alleles occupying the corners of the central network are indicated. Pair-wise distances between sequences were estimated using the HKY85 model of evolution incorporating ML optimized parameters. Nearly identical graphs were obtained using uncorrected Hamming distances. The scale bars indicate the number of nucleotide substitutions per site. A fit parameter of 100 indicates that all conflicts in phylogenetic signals are depicted in the graph. Splits graphs for the other five housekeeping loci are available from the authors upon request.

The splits graph for mutS also illustrates networked evolution; however, the relationships between alleles are more complex than those observed for yqiL and involve six polymorphic nucleotide sites (Fig. 5B). Strains representing the three emm pattern subpopulations have alleles within the central network. For the five remaining housekeeping loci, fit parameters ranged from 22 to 90.8 for splits graphs that included all known alleles; all loci showed networked evolution, except murI (data not shown; graphs are available from the authors upon request). Only after removal of the four longest branches of the murI splits graph was there resolution of central networks, with an improvement in the fit parameter from 37.5 to 93. All seven housekeeping loci showed evidence for intragenic recombination involving alleles that are distributed among different emm pattern groups.

Several alleles of yqiL and mutS are shared by strains belonging to different emm pattern groups (Fig. 5). The emm pattern distribution of shared alleles among strains selected at a linkage distance of 0.3 is summarized for all seven loci (Table 3). For the 212 isolates of S. pyogenes, a total of 197 alleles are found for the seven housekeeping loci. For the 72 STs selected at a linkage distance of 0.3, 180 unique alleles are represented. Of these, 64 (35.6%) are shared among the 72 strains that differ from each other by MLST at two or more loci. Of the 64 shared alleles, only 20 are restricted to strains of a single emm pattern. Twenty-four alleles are common to both emm pattern A-C and D strains.

TABLE 3.

Distribution of shared alleles among strains according to emm pattern group

Pattern distribution No. of alleles shared by multiple STs at a linkage distance of 0.3 (%)
A-C, D, and E 20 (31.3)
A-C and D only 4 (6.3)
A-C and E only 8 (12.5)
D and E only 12 (18.8)
A-C only 3 (4.7)
D only 3 (4.7)
E only 14 (21.9)
Total 64 (100)

Several shared alleles are distributed among strains of classical throat and skin types (5, 8, 17, 35), for example, gki02 (emm3 [A-C] and emm33, emm53, emm91, and emm93 [D]), gtr02 (emm1, emm6, and emm12 [A-C] and emm33, emm52, and emm93 [D]), murI08 (emm3, emm19, and emm24 [A-C] and emm52 and emm93 [D]), mutS05 (emm3, emm5, and emm6 [A-C] and emm91 and emm93 [D], recP02 (emm3 [A-C] and emm33, emm53, emm91, and emm93 [D], xpt03 (emm1, emm3, emm17, and emm18 [A-C] and emm91, emm93, and emmstD633 [D], and yqiL02 (emm3 and emm12 [A-C] and emm33, emm52, emm53, emm91, and emm93 [D]) (note that emm91 and emm93 were formerly PT5757 and Potter41, respectively).

The distribution of shared alleles among isolates known to be recovered from either the URT or impetigo lesions was also calculated. For the 91 URT and impetigo isolates (representing 57 STs), a total of 133 alleles were found for all seven housekeeping loci. For the 42 STs selected at a linkage distance of 0.3, 120 alleles are represented. Of these, 38 (31.7%) alleles are shared among the 42 strains that differ from each other at two or more loci. Of the 38 shared alleles, only eight are restricted to strains recovered from either the throat or skin. Thirty shared alleles (78.9%) are distributed among both URT and impetigo isolates.

In summary, for isolates that differ from all others at two or more housekeeping loci, the majority of shared alleles are distributed among strains belonging to different emm pattern groups or among isolates recovered from different tissue sites. It is highly probable that this distribution arose, in part, from horizontal gene movements involving housekeeping alleles and/or emm genes. The alternative hypothesis (mutation only) is less likely because it would require at least three independent events as follows: mutations in two or more housekeeping loci plus multiple changes in the emm type-emm pattern locus.

Recombination within emm pattern subpopulations.

Several lines of evidence support the notion that there is extensive recombination between strains of different emm pattern groups and between URT and impetigo isolates. The extent of congruence between tree topologies for all pair-wise comparisons of the seven housekeeping loci was also determined separately for each emm pattern grouping (Table 1). Taxa were selected based on a linkage distance of 0.3 in the UPGMA dendrogram of allelic profiles (Fig. 1). emm pattern D strains showed differences in log likelihood scores falling within the 99th percentile of the null distribution for trees of random topology at all loci (Table 2). Gene tree comparisons among emm pattern E strains revealed only one example where the difference in log likelihood was less than that of the random trees (for gtr, when optimized to the recP gene tree). Pattern A-C strains had five comparisons for which the likelihood differences were less than those with the random trees, although two of these comparisons fell just marginally outside of the 99th percentile of the random distribution (all plots are available from the authors upon request). Thus, for all emm patterns, most comparisons between gene trees showed no significant congruence. The slight differences in tree congruency among strains of the three emm pattern groups seem largely insignificant and may merely reflect the number of isolates that have a recent common ancestor retained in the data set by truncation at a genetic distance of 0.3.

In summary, these data indicate that there has been a history of recombination among housekeeping loci within each of the three emm pattern groupings. The findings for all three emm pattern groups of S. pyogenes are in sharp contrast to those of Escherichia coli, for which nearly all ML tree comparisons of seven housekeeping loci fall outside the 99th percentile of the random distribution, even for taxa selected at linkage distances of >0.3, indicating significantly higher levels of congruency (21).

DISCUSSION

The emm patterns serve as imperfect ecological markers for strains of S. pyogenes showing preference for host infection at either the throat or skin. Organisms having biological determinants for host tissue tropism are further separated from each other in time and space by the differential host and environmental risk factors for throat and skin infection, which provide the driving force behind distinct epidemiological trends (5). This raises a fundamental question: Is there a relationship between niche separation and evolutionary divergence among strains of S. pyogenes? If a positive correlation were found, the biological implication might be that emm pattern A-C and D strains--the so-called throat and skin strains, respectively--are at the beginning stages of speciation and are evolving along separate lineages.

An analysis of ecological isolation depends on the accurate classification of each strain as throat or skin tropic. Based on epidemiological findings, the emm patterns provide an approximation of throat- and skin-tropic strains (patterns A-C and D, respectively) and segregate those strains whose tissue site preference is less clear (emm pattern E). An alternate approach is to categorize each isolate based on the actual tissue site from which it was recovered. The drawback here is that strains having only weak tissue site preferences are included in the analysis. Despite the different strengths and weaknesses for each of the two classification schemes, their findings were highly concordant.

The data provide strong evidence for a lack of clustering of S. pyogenes strains according to emm pattern. The UPGMA dendrogram derived from MLST allelic profiles shows that strains of different emm pattern groups are widely distributed along the branches. This finding is further supported by the lack of evolutionary divergence between emm pattern A-C and D strains, as revealed by a phylogenetic tree constructed using the concatenated alleles of the seven housekeeping loci. For all housekeeping loci, splits graphs show networked evolution suggestive of intragenic recombination and involving strains representing all three emm pattern groups. Furthermore, recombination between strains of S. pyogenes has been sufficiently frequent over the long term to result in a lack of congruency between the gene trees for the seven housekeeping loci. Whereas emm type often equates with clone or clonal complex (19), the strains of the many emm types that comprise each emm pattern subpopulation are genetically distant. Thus, recombination has obscured the phylogenetic relationships between the emm pattern subpopulations, but in most cases, it has not been significant enough to mask the relationships between isolates of the same emm type. BURST analysis offers a new and powerful tool for estimating the ratio of recombination to mutation, based on recent clonal diversification; however, even with 212 isolates, the present S. pyogenes sample set is not sufficiently large to make these estimates (21). This is because the data set was chosen to represent diversity within the S. pyogenes population, and thus there were relatively few large clonal complexes of strains which are needed for BURST.

For isolates that were known to be derived from either the URT or impetigo lesions, the UPGMA dendrogram revealed a prominent cluster of throat-derived strains (all pattern A-C) and a second major cluster of skin-derived strains (all pattern D). One possible explanation for these two clusters is that different strains arose via clonal diversification that did not alter the emm pattern and the descendant strains retained their tissue tropisms. However, genetic recombination between throat and skin strains is clearly evident. A third major cluster contains a mix of both throat and skin strains, and furthermore, both throat and skin isolates are widely distributed throughout the dendrogram. There is a lack of evolutionary divergence between throat and skin isolates, as indicated by a phylogenetic tree constructed using the concatenated alleles of the seven housekeeping loci. When the entire set of URT- and impetigo-derived strains are considered, the vast majority of alleles shared by different strains are found among both throat and skin isolates. The majority of shared alleles are also found among strains of different emm patterns. Importantly, no fixed nucleotide differences were found in any of the housekeeping genes that distinguished throat isolates from skin isolates or emm pattern A-C strains from emm pattern D strains.

In summary, there is no clear evidence that genetic variation in housekeeping loci is tightly linked to emm pattern or to actual tissue site origin. Therefore, niche separation appears to have little or no effect on neutral gene divergence.

Our findings on the relative lack of sequence divergence among neutral genes of strains from different clonal complexes or different emm pattern subpopulations closely parallels findings on genes encoding extracellular putative virulence factors of S. pyogenes (42). In that study, for 11 concatenated genes exceeding 26 kb in length, derived from isolates of 12 emm types (emm types typical of emm patterns A-C and E only), with few exceptions the bootstrap confidence limits for the ancestral nodes were <70%. Furthermore, the topologies of individual gene trees were not congruent with the topology of the tree made with concatenated sequences; the findings of split decomposition and maximum χ2 analysis suggest that multiple recombination events contribute to allelic diversity. Also consistent with our data on the relationship between emm type and ST (19), restricted allelic variation is observed for extracellular virulence factor genes among strains of the same M type (42).

Humans are believed to be the sole natural host for S. pyogenes, and this bacterial species is not known to have an environmental reservoir. Horizontal gene movements between different strains of S. pyogenes are likely to be favored when the donor and recipient strains occupy the same human host tissue site at the same time. Presumably, this occurs during coinfection (or cocolonization) at either the throat or skin (5, 11). Long-term carriage at the URT might further increase the probability for coinfection with multiple strains. But even for strains with a strong tissue site preference, there are occasions when an organism ends up at the other tissue site. Secondary infection of the URT can arise following impetigo, although transfer of streptococci in the opposite direction (from throat to skin) is far less common (8, 23). Perhaps strains that lack tissue site preferences (e.g., emm pattern E) act as shuttles and facilitate the movement of genes between throat- and skin-specific strains (6). The primary vehicle(s) for lateral gene exchange between strains of S. pyogenes is not known for certain, but generalized transduction by bacteriophage is likely to play an important role (9).

Horizontal gene transfer and genetic recombination can lead to random associations between loci (linkage equilibrium), as found for the neutral housekeeping loci. Recombination between emm pattern A-C and D strains should result in the dissociation of emm pattern from the genes which confer tissue tropism, unless the emm genes themselves, or loci that are physically close to emm on the chromosome, directly contribute to tissue site preference. Therefore, the strong nonrandom association that is observed between emm pattern and tissue site provides compelling evidence that emm genes, or emm-linked genes, act as determinants of host tissue tropism. In fact, all physically distant loci that are in strong linkage disequilibrium with emm pattern A-C or D are strong candidates for having a tissue-specific role in GAS infection. Analysis of the population genetic structure of a bacterial species in combination with careful epidemiologic sampling can provide new insights on the molecular basis for pathogenicity and virulence.

The emm gene products (M and M-like proteins) are virulence factors, displaying a wide array of functions that include specific binding of several tissue and plasma proteins of the human host. In some instances, M protein-derived phenotypes are uniquely associated with emm pattern. For example, binding activity for immunoglobulin A is limited to emm pattern E (3). The high-affinity-binding site for human plasminogen (known as PAM [plasminogen-associated M protein]) is restricted to emm pattern D strains (46). Through the action of streptokinase, a plasminogen activator produced by S. pyogenes, bacterial-bound plasminogen can be converted to plasmin, a broad-spectrum protease. Conceivably, this M protein-based phenotype specifically enhances infection at the skin.

Structural domains within M and M-like proteins, whose functional significance is largely unknown, also correlate strongly with emm pattern. For example, emm pattern A through D strains have the class I antigenic epitope which is lacking in emm pattern E (class II) (4). The emm patterns themselves are defined based on differences within the 3′ region of emm genes, which encodes a peptidoglycan-spanning domain; the combination of domains for each emm pattern is unique but is shared by all strains of that emm pattern (26, 27). Four peptidoglycan-spanning domain forms are defined, and they differ markedly from one another in both amino acid sequence composition and length, suggesting that each domain form reflects an adaptation to a different cell wall structure. Those phenotypes, or combinations of phenotypes, that are specifically associated with either emm pattern A-C or D strains are strong candidates for having a tissue-specific function.

Perhaps the best proven example of a tissue-specific adaptive trait for S. pyogenes infection is the secreted cysteine proteinase activity due to SpeB (45). An epidemiological survey shows highest mean levels of SpeB activity for emm pattern D strains, whereas pattern A-C strains have significantly lower levels. Direct evidence for a role of SpeB in impetigo is provided by ΔspeB mutants, which show a loss in virulence and reproductive growth when tested in an experimental model for skin infection. Based on the genome of strain SF370 (M-type 1, emm pattern A-C), the emm and speB loci lie ∼13 kb apart on the chromosome (22). The genotypic basis for differential levels of SpeB activity is not precisely known, and since the regulation of SpeB activity is complex (13, 25, 32, 33, 36, 38, 39), there may be several different adaptive mutations which can lead to the “fitter” phenotype. The strong association between the emm pattern D genotype and the high SpeB activity phenotype is presumably maintained either by coinheritance due to chromosomal linkage, or alternatively, by strong coselective pressures. Loss of either trait is expected to result in a large loss of fitness for that isolate, because it would no longer be well-adapted to the skin, yet it would not necessarily be well-adapted to the throat either.

In a clonal population of bacteria, recombination rates are low and linkage disequilibrium is strong for all loci in the genome. High rates of recombination disrupt genetic linkages, leading to random associations between neutral housekeeping genes. Since S. pyogenes displays random associations between housekeeping alleles, irrespective of emm pattern grouping, it should be feasible to use predictive models to identify loci that have a tissue-specific adaptive function. Even against a background of randomly associated neutral genes, key adaptive loci will retain strong nonrandom associations because of epistatic effects. In-depth studies in population genomics should lead to the identification of the genotypes and phenotypes that are uniquely associated with infection at either the throat or skin. In addition to providing deeper insight on pathogenic mechanisms, tissue-specific adaptive loci are ideal targets for vaccination.

Acknowledgments

We thank Eddie Holmes for helpful suggestions on gene tree topology congruency measures.

This work was supported by grants from the Wellcome Trust (to B.G.S.), the National Institutes of Health (AI-28944, to D.E.B., and GM-60793, to D.E.B. and B.G.S.), the American Heart Association (grant-in-aid, to D.E.B.), and a Brown-Coxe Postdoctoral Fellowship (to A.K.). M.C.E. is a Royal Society University Research Fellow.

Editor: E. I. Tuomanen

REFERENCES

  • 1.Anthony, B. F., E. L. Kaplan, L. W. Wannamaker, and S. S. Chapman. 1976. The dynamics of streptococcal infections in a defined population of children: serotypes associated with skin and respiratory infections. Am. J. Epidemiol. 104:652-666. [DOI] [PubMed] [Google Scholar]
  • 2.Beall, B., R. Facklam, and T. Thompson. 1996. Sequencing emm-specific PCR products for routine and accurate typing of group A streptococci. J. Clin. Microbiol. 34:953-958. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Bessen, D., and V. A. Fischetti. 1990. A human IgG receptor of group A streptococci is associated with tissue site of infection and streptococcal class. J. Infect. Dis. 161:747-754. [DOI] [PubMed] [Google Scholar]
  • 4.Bessen, D., K. F. Jones, and V. A. Fischetti. 1989. Evidence for two distinct classes of streptococcal M protein and their relationship to rheumatic fever. J. Exp. Med. 169:269-283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Bessen, D. E., J. R. Carapetis, B. Beall, R. Katz, M. Hibble, B. J. Currie, T. Collingridge, M. W. Izzo, D. A. Scaramuzzino, and K. S. Sriprakash. 2000. Contrasting molecular epidemiology of group A streptococci causing tropical and non-tropical infections of the skin and throat. J. Infect. Dis. 182:1109-1116. [DOI] [PubMed] [Google Scholar]
  • 6.Bessen, D. E., M. W. Izzo, T. R. Fiorentino, R. M. Caringal, S. K. Hollingshead, and B. Beall. 1999. Genetic linkage of exotoxin alleles and emm gene markers for tissue tropism in group A streptococci. J. Infect. Dis. 179:627-636. [DOI] [PubMed] [Google Scholar]
  • 7.Bessen, D. E., C. M. Sotir, T. L. Readdy, and S. K. Hollingshead. 1996. Genetic correlates of throat and skin isolates of group A streptococci. J. Infect. Dis. 173:896-900. [DOI] [PubMed] [Google Scholar]
  • 8.Bisno, A. L., and D. Stevens. 2000. Streptococcus pyogenes (including streptococcal toxic shock syndrome and necrotizing fasciitis), p. 2101-2117. In G. L. Mandell, R. G. Douglas, and R. Dolin (ed.), Principles and practice of infectious diseases, 5th ed., vol. 2. Churchill Livingstone, Philadelphia, Pa. [Google Scholar]
  • 9.Broudy, T. B., V. Pancholi, and V. A. Fischetti. 2001. Induction of lysogenic bacteriophage and phage-associated toxin from group A streptococci during coculture with human pharyngeal cells. Infect. Immun. 69:1440-1443. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Carapetis, J., B. Currie, and E. Kaplan. 1999. Epidemiology and prevention of group A streptococcal infections: acute respiratory tract infections, skin infections, and their sequelae at the close of the twentieth century. Clin. Infect. Dis. 28:205-210. [DOI] [PubMed] [Google Scholar]
  • 11.Carapetis, J., D. Gardiner, B. Currie, and J. D. Mathews. 1995. Multiple strains of Streptococcus pyogenes in skin sores of aboriginal Australians. J. Clin. Microbiol. 33:1471-1472. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Cohan, F. M. 2001. Bacterial species and speciation. Syst. Biol. 50:513-524. [DOI] [PubMed] [Google Scholar]
  • 13.Collin, M., and A. Olsen. 2000. Generation of a mature streptococcal cysteine proteinase is dependent on cell wall-anchored M1 protein. Mol. Microbiol. 36:1306-1318. [DOI] [PubMed] [Google Scholar]
  • 14.Colman, G., A. Tanna, A. Efstratiou, and E. Gaworzewska. 1993. The serotypes of Streptococcus pyogenes present in Britain during 1980-1990 and their associations with disease. J. Med. Microbiol. 39:165-178. [DOI] [PubMed] [Google Scholar]
  • 15.Cunningham, M. W. 2000. Pathogenesis of group A streptococcal infections. Clin. Microbiol. Rev. 13:470-511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Dicuonzo, G., G. Gherardi, G. Lorino, S. Angeletti, M. DeCesaris, E. Fiscarelli, D. E. Bessen, and B. Beall. 2001. Group A streptococcal genotypes from pediatric throat isolates in Rome, Italy. J. Clin. Microbiol. 39:1687-1690. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Dillon, H., C. Derrick, and M. Dillon. 1974. M-antigens common to pyoderma and acute glomerulonephritis. J. Infect. Dis. 130:257-267. [DOI] [PubMed] [Google Scholar]
  • 18.Dillon, H. C. 1972. Streptococcal infections of the skin and their complications: impetigo and nephritis, p. 571-587. In L. W. Wannamaker and J. M. Matsen (ed.), Streptococci and streptococcal diseases. Academic Press, New York, N.Y.
  • 19.Enright, M. C., B. G. Spratt, A. Kalia, J. H. Cross, and D. E. Bessen. 2001. Multilocus sequence typing of Streptococcus pyogenes and the relationship between emm type and clone. Infect. Immun. 69:2416-2427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Facklam, R., B. Beall, A. Efstratiou, V. Fischetti, E. Kaplan, P. Kriz, M. Lovgren, D. Martin, B. Schwartz, A. Totolian, D. Bessen, S. Hollingshead, F. Rubin, J. Scott, and G. Tyrrell. 1999. Report on an international workshop: demonstration of emm typing and validation of provisional M-types of group A streptococci. Emerg. Infect. Dis. 5:247-253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Feil, E. J., E. C. Holmes, D. E. Bessen, M.-S. Chan, N. P. J. Day, M. C. Enright, R. Goldstein, D. Hood, A. Kalia, C. E. Moore, J. Zhou, and B. G. Spratt. 2001. Recombination within natural populations of pathogenic bacteria: short-term empirical estimates and long-term phylogenetic consequences. Proc. Natl. Acad. Sci. USA 98:182-187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ferretti, J. J., W. M. McShan, D. Ajdic, D. J. Savic, G. Savic, K. Lyon, C. Primeaux, S. Sezate, A. N. Suvorov, S. Kenton, H. S. Lai, S. P. Lin, Y. Qian, H. G. Jia, F. Z. Najar, Q. Ren, H. Zhu, L. Song, J. White, X. Yuan, S. W. Clifton, B. A. Roe, and R. McLaughlin. 2001. Complete genome sequence of an M1 strain of Streptococcus pyogenes. Proc. Natl. Acad. Sci. USA 98:4658-4663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Ferrieri, P., A. S. Dajani, L. W. Wannamaker, and S. S. Chapman. 1972. Natural history of impetigo. I. Site sequence of acquisition and familial patterns of spread of cutaneous streptococci. J. Clin. Investig. 51:2851-2862. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Fischetti, V. 2000. Surface proteins on gram-positive bacteria, p. 11-24. In V. A. Fischetti, R. P. Novick, J. J. Ferretti, D. A. Portnoy, and J. I. Rood (ed.), Gram-positive pathogens. ASM Press, Washington, D.C.
  • 25.Heath, A., V. J. DiRita, N. L. Barg, and N. C. Engleberg. 1999. A two-component regulatory system, CsrR-CsrS, represses expression of three Streptococcus pyogenes virulence factors, hyaluronic acid capsule, streptolysin S, and pyrogenic exotoxin B. Infect. Immun. 67:5298-5305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Hollingshead, S. K., T. Readdy, J. Arnold, and D. E. Bessen. 1994. Molecular evolution of a multi-gene family in group A streptococci. Mol. Biol. Evol. 11:208-219. [DOI] [PubMed] [Google Scholar]
  • 27.Hollingshead, S. K., T. L. Readdy, D. L. Yung, and D. E. Bessen. 1993. Structural heterogeneity of the emm gene cluster in group A streptococci. Mol. Microbiol. 8:707-717. [DOI] [PubMed] [Google Scholar]
  • 28.Holmes, E. C., R. Urwin, and M. C. J. Maiden. 1999. The influence of recombination on the population structure and evolution of the human pathogen Neisseria meningitidis. Mol. Biol. Evol. 16:741-749. [DOI] [PubMed] [Google Scholar]
  • 29.Huelsenbeck, J., and B. Rannala. 1997. Phylogenetic methods come of age: testing hypotheses in an evolutionary context. Science 276:227-232. [DOI] [PubMed] [Google Scholar]
  • 30.Huson, D. 1998. SplitsTree: a program for analyzing and visualizing evolutionary data. Bioinformatics 14:68-73. [DOI] [PubMed] [Google Scholar]
  • 31.Johnson, D. R., D. L. Stevens, and E. L. Kaplan. 1992. Epidemiological analysis of group A streptococcal serotypes associated with severe systemic infections, rheumatic fever, or uncomplicated pharyngitis. J. Infect. Dis. 166:374-382. [DOI] [PubMed] [Google Scholar]
  • 32.Li, Z., D. D. Sledjeski, B. Kreikemeyer, A. Podbielski, and M. D. Boyle. 1999. Identification of pel, a Streptococcus pyogenes locus that affects both surface and secreted proteins. J. Bacteriol. 181:6019-6027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Lyon, W., C. Gibson, and M. Caparon. 1998. A role for Trigger factor and an Rgg-like regulator in the transcription, secretion and processing of the cysteine protease of Streptococcus pyogenes. EMBO J. 17:6263-6275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Martin, D. R., and K. S. Sriprakash. 1996. Epidemiology of group A streptococcal disease in Australia and New Zealand. Rec. Adv. Microbiol. 4:1-40. [Google Scholar]
  • 35.Maxted, W. R. 1980. Disease association and geographical distribution of the M types of group A streptococci, p. 763-777. In S. E. Read and J. B. Zabriskie (ed.), Streptococcal diseases and the immune response. Academic Press, New York, N.Y.
  • 36.Molinari, G., M. Rohde, S. R. Talay, G. S. Chhatwal, S. Beckert, and A. Podbielski. 2001. The role played by the group A streptococcal negative regulator Nra on bacterial interactions with epithelial cells. Mol. Microbiol. 40:99-114. [DOI] [PubMed] [Google Scholar]
  • 37.Palys, T., L. K. Nakamura, and F. M. Cohan. 1997. Discovery and classification of ecological diversity in the bacterial world: the role of DNA sequence data. Int. J. Syst. Bacteriol. 47:1145-1156. [DOI] [PubMed] [Google Scholar]
  • 38.Podbielski, A., and B. Leonard. 1998. The group A streptococcal dipeptide permease (Dpp) is involved in the uptake of essential amino acids and affects the expression of cysteine protease. Mol. Microbiol. 28:1323-1334. [DOI] [PubMed] [Google Scholar]
  • 39.Podbielski, A., B. Pohl, M. Woischnik, C. Korner, K. Schmidt, E. Rozdzinski, and B. Leonard. 1996. Molecular characterization of group A streptococcal (GAS) oligopeptide permease (Opp) and its effect on cysteine protease production. Mol. Microbiol. 21:1087-1099. [DOI] [PubMed] [Google Scholar]
  • 40.Posada, D., and K. Crandall. 1998. MODELTEST: testing the model of DNA substitution. Bioinformatics 14:817-818. [DOI] [PubMed] [Google Scholar]
  • 41.Potter, E. V., M. Svartman, I. Mohammed, R. Cox, T. Poon-King, and D. P. Earle. 1978. Tropical acute rheumatic fever and associated streptococcal infections compared with concurrent acute glomerulonephritis. J. Pediatr. 92:325-333. [DOI] [PubMed] [Google Scholar]
  • 42.Reid, S. D., N. M. Green, J. K. Buss, B. Lei, and J. M. Musser. 2001. Multilocus analysis of extracellular putative virulence proteins made by group A Streptococcus: population genetics, human serologic response, and gene transcription. Proc. Natl. Acad. Sci. USA 98:7552-7557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Rosenberg, M. S., and S. Kumar. 2001. Incomplete taxon sampling is not a problem for phylogenetic inference. Proc. Natl. Acad. Sci. USA 98:10751-10756. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Scaramuzzino, D. A., J. M. McNiff, and D. E. Bessen. 2000. Humanized in vivo model for streptococcal impetigo. Infect. Immun. 68:2880-2887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Svensson, M. D., D. A. Scaramuzzino, U. Sjöbring, A. Olsen, C. Frank, and D. E. Bessen. 2000. Role for a secreted cysteine proteinase in the establishment of host tissue tropism by group A streptococci. Mol. Microbiol. 38:242-253. [DOI] [PubMed] [Google Scholar]
  • 46.Svensson, M. D., U. Sjöbring, and D. E. Bessen. 1999. Selective distribution of a high-affinity plasminogen binding site among group A streptococci associated with impetigo. Infect. Immun. 67:3915-3920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Wannamaker, L. W. 1970. Differences between streptococcal infections of the throat and of the skin. N. Engl. J. Med. 282:23-31. [DOI] [PubMed] [Google Scholar]

Articles from Infection and Immunity are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES