Abstract
The 16S-23S spacer regions of two ribosomal operons (rrnA and rrnE) have been sequenced in seven representatives of the Salmonella enterica subspecies. Isolated nucleotide substitutions were found at the same sites as in Escherichia coli but the number of polymorphic sites was much larger, as could be expected for a more heterogeneous species. Still, as in E. coli, most of the variation found was due to insertions and/or deletions affecting blocks of nucleotides generally located at equivalent regions of the putative secondary structure for both species. Isolated polymorphic sites generated phylogenetic trees generally consistent with the subspecies structure and the accepted relationships among the subspecies. However, the sequences of rrnE put subspecies I closer to E. coli K-12 than to the other S. enterica subspecies. The distribution of polymorphisms affecting blocks of nucleotides was much more random, and the presence of equivalent sequences in distantly related subspecies, and even in E. coli, could reflect relatively frequent horizontal transfer. The smallest 16S-23S spacers in other genera of the family Enterobacteriaceae were also sequenced. As expected, the level of variation was much larger. Still, the phylogenetic tree inferred is consistent with those of 16S rRNA or housekeeping genes.
Ribosomal operons have acquired paramount relevance for the study of bacterial evolution and phylogeny. The rRNA 16S and 23S genes are the most widely used molecular chronometers for inferring microbial phylogeny and have been instrumental in developing a comprehensive view of microbial phylogeny and systematics (14). In most prokaryotes the ribosomal genes form an operon with the order 16S-23S-5S and are cotranscribed in a single polycistronic RNA that has to be processed to give the RNA species present in the mature ribosome (8). The spacer between the 16S and 23S genes is often called the internal spacer region (ISR). These spacers contain regions with secondary structures (6) and sometimes, very often in gram-negative bacteria, tRNA genes. The number of ribosomal operons in bacteria varies between 1 and 11 (13). When there is more than one it is common that they are not identical (intercistronic heterogeneity), particularly regarding the ISR sequence (7, 9, 30). In Escherichia coli K-12 and Salmonella enterica LT2 there are seven operons located at equivalent loci in the genetic maps of the chromosomes of each strain (21). In three of them the ISR contains two tRNA genes (ISR2) for isoleucine and alanine (operons A, D, and H in K-12 and A, D, and B in LT2) with an average size of about 450 bp. The remaining four (operons B, C, E, and G in K-12 and C, E, G, and H in LT2) have a single tRNA gene for glutamic acid (ISR1) and are consequently smaller (about 350 bp) (9).
The role of the spacer regions in the maturation of the polycistronic rRNA is far from clear. Some regions have secondary structures with stretches of canonical base pairing that are substrates for RNase III cuts and separation of the gene products (26, 29). However, the final stages in maturation, which include the trimming of the rRNAs to their final size, take place in the assembling ribosomal subunit with ribonucleoproteins as substrate. Here the interaction among the precursor rRNA (containing stretches of the spacer still attached) and the ribosomal proteins seems to be essential for proper processing.
The variation found among relatively close taxa is known to be very high for the spacers of the rRNA operons (27). Even within the γ subclass of the class Proteobacteria, some species show hardly any similarity to E. coli. For example, Haemophilus influenzae has only about 22 nucleotides centered around the RNase III recognition site 3′ to the 16S rRNA gene and about 12 nucleotides at the equivalent site 5′ to the 23S gene which are highly similar to those of E. coli. The rest of the spacer (except for the tRNA gene and the antiterminator called box A [4]) is totally different in the two species. In species of Pseudomonas only the last two sequences remain similar (13). The extreme divergence in size and sequence of the spacers among different groups of prokaryotes, together with their location between highly conserved rRNA genes, makes them good markers for bacterial classification, and their use in rapid, and even automatic, identification has been suggested (3, 11, 13, 17, 20).
Recently (2), the ISR sequences of the seven operons of 12 representative strains of the E. coli reference (ECOR) collection were studied, a group of strains encompassing much of the phylogenetic diversity within this species (28, 31). The variation was, however, very restricted. In fact, at the level of single nucleotide substitutions, the degree of variation found for these spacer regions is not very different from that found for the 16S rRNA genes of the same strains (7, 23). The main source of heterogeneity (both intercistronic and interstrain) was the presence of insertions and/or deletions involving blocks of nucleotides. And most of these were already present in K-12 as intercistronic heterogeneity. There was a clear tendency toward homogenization (less intercistronic heterogeneity) in most strains. However, some apparently infrequent variations, such as the rsl sequence (6), maintained a widespread distribution, perhaps reflecting a horizontal transfer and recombination that counteracts the homogenization trend. These results represent an interesting problem of molecular evolution. First, if this sequence has indeed little functional restriction, as the great intergenic variation seems to indicate, greater variation should be expected. And second, there seems to be a sharp change in the rate of variation within and between species that contrasts with the relatively smooth gradient of variation found for the ribosomal genes.
To improve our understanding of the molecular evolution of these spacer regions, we have studied their sequence in representatives of S. enterica subspecies. Besides the location and tRNA genes present (21) nothing was known about the S. enterica ISRs. After E. coli, S. enterica is probably the bacterial species best known at the level of intraspecies variation and phylogeny (32). The structure of the subspecies seems to be very robust; very diverse types of data, from multilocus enzyme electrophoresis to sequencing of various genes, give results coherent with the accepted subspecies structure (5). Sequence data also point to a very preserved clonal structure within this species. We have also sequenced some tRNAGlu-containing spacers of other members of the family Enterobacteriaceae to scan the variation found for wider phylogenetic gaps.
MATERIALS AND METHODS
Strains.
The S. enterica strains utilized were obtained from different sources as described previously (35). The S. enterica subspecies and strains used were as follows: subspecies I, LT2 and M298; subspecies II, M495; subspecies IIIa, M313; subspecies IIIb, M316; subspecies IV, M318 and M320; subspecies V, M321; and subspecies VI, M324. Citrobacter freundii OS60 (19), Enterobacter aerogenes E482 (19), Klebsiella pneumoniae LD119 (19), Escherichia blattae ATCC 29907, Escherichia hermannii ATCC 33650, Escherichia vulneris ATCC 33821, and Escherichia fergusonii ATCC 35469 were kindly provided by J. G. Lawrence (Washington University).
PCR primers.
To specifically amplify the ISRs of operons rrnA and rrnE, two primers were designed located in genes upstream of both operons in E. coli K-12. For rrnA the primer TCTCAACAAGGGACGGACAA, which corresponds to nucleotide positions 20 to 39 in hemG of E. coli K-12, was used. For rrnE the primer used was TAATGGCATCGTCCTGGCCGC. This primer is derived from positions 1322 to 1342 of purH. Universal primers 16S14F and 23S1R have been described elsewhere (12).
DNA extraction and PCR amplification.
One colony from a fresh culture was resuspended in 200 μl of Instagene Purification Matrix (Bio-Rad) in Eppendorf tubes. DNA was extracted following the manufacturer’s recommendations, and the DNA concentration was estimated with a Gene Quant RNA/DNA calculator (Pharmacia LKB Biochrom). For PCR the DNA concentration was adjusted to 20 ng/μl. Amplification of specific rrn operons in S. enterica strains was carried out in two steps (nested PCR). Approximately 40 ng of chromosomal DNA was subjected to PCR amplification in a total volume of 50 μl containing 50 mM KCl, 10 mM Tris-HCl (pH 9.0), 1.5 mM MgCl, 0.1% Triton X-100, 0.2 mM (each) deoxyribonucleotide (dATP, dCTP, dGTP, and dTTP; Pharmacia LKB Biotechnology), 2 U of TaqPlus DNA polymerase (Stratagene), and 50 to 60 pmol of each primer. For the first step, rrn-specific primers were used in combination with 23S1R. The reaction mixtures were subjected to a “touch down” (10) thermal cycling regime which consisted of the following: 3 cycles of 94°C for 15 s, 65°C for 30 s, and 72°C for 3 min; 3 cycles of 94°C for 15 s, 62°C for 30 s, and 72°C for 3 min; 3 cycles of 94°C for 15 s, 59°C for 30 s, and 72°C for 3 min; 3 cycles of 94°C for 15 s, 56°C for 30 s, and 72°C for 3 min; and 23 cycles of 94°C for 15 s, 55°C for 30 s, and 72°C for 3 min; followed by an extension step at 72°C for 5 min. The amplicon from this reaction was diluted 1:10,000 and submitted again to PCR with the universal primers 16S14F and 23S1R as follows: 35 cycles of 94°C for 15 s, 62°C for 30 s, and 72°C for 1 min plus an extension step of 5 min at 72°C. For all other strains of Enterobacteriaceae only the second PCR protocol was carried out followed by cloning. All reactions were carried out in a PTC-100 Peltier effect thermocycler (MJ Research Inc.). The PCR products were separated in 1% agarose gels in Tris-acetate-EDTA buffer, stained with 0.5 μg of ethidium bromide, and visualized with UV.
Purification of the PCR products.
PCR products were purified with the QIAquick PCR Purification Kit (Qiagen) according to the manufacturer’s protocol. The DNA was recovered in 30 μl of water and diluted to a concentration between 0.3 and 0.5 μg/μl.
Cloning of Enterobacteriaceae PCR products.
For the Enterobacteriaceae other than S. enterica no operon-specific primers were used, and therefore all the operons were amplified simultaneously. The PCR product was cloned using the original TA Cloning Kit (Invitrogen) following the manufacturer’s recommendations. From each PCR product 20 colonies were picked and grown in Luria Bertani medium at 37°C for 18 h. Recombinant plasmids were extracted by using the QIAprep Spin Miniprep Kit (Qiagen) following the manufacturer’s recommendations. The purified plasmids were digested with EcoRI to separate the insert, and the product was run in agarose gels to determine insert size. For each strain one clone containing the smallest insert was selected for sequencing.
Sequencing of 16S-23S ISR.
Nucleotide sequences of PCR products and plasmids were determined by using the ABI PRISM Dye Terminator Cycle Sequencing Ready Reaction Kit (Perkin-Elmer) and an ABI PRISM 377 sequencer (Perkin-Elmer) according to the manufacturer’s indications. The 16S14F and 23S1R primers were used as sequencing primers in all cases. For every PCR product both strains were sequenced and every sequence was repeated at least twice.
Data analysis.
Sequences were aligned by using the PCGene program (IntelliGenetics Inc.). All other analyses were done using the MEGA (Molecular Evolutionary Genetics Analysis) 1.01 program obtained from the Institute of Molecular Evolutionary Genetics, The Pennsylvania State University, University Park, Pa.
Nucleotide sequence accession numbers.
S. enterica sequences determined here were assigned GenBank accession numbers AFO45563 and AFO46806 to AFO46822; representatives of other Enterobacteriaceae were assigned accession numbers AFO47420 to AFO47426.
RESULTS
We have sequenced the S. enterica rrnA and rrnE operons as defined by the upstream sequences found in E. coli K-12. However, except for LT2, we have no direct evidence about the location of the operons in the genomic map of the strains studied. In fact, for Salmonella typhi a highly plastic chromosomal organization has been previously described (22). In E. coli strains the general organization of the rrn operons in the genome seems to be quite conserved. Not only is the combination of four ISR1- and three ISR2-containing operons kept throughout the ECOR collection but the type of operon (as defined by the tRNA genes in the ISR) present at each locus is also highly conserved (2). However, in S. enterica there seems to be variation at this level. LT2 has an ISR1 present in rrnE and an ISR2 in rrnA. However, in M313 (subspecies IIIa) and M318 and M320 (both subspecies IV) the primer for rrnA gave operons with ISR1 present. It is possible that, in these three strains, a two-tRNA-containing operon was present at another locus to keep the four ISR1- and three ISR2-containing operon distribution found in E. coli and LT2. However, in all these strains the operon present at the rrnE locus was an ISR1 as well. When an ISR1 was retrieved with the primer for rrnA the sequence found was very similar to the rrnE sequence determined for the same strain. Specifically, in M313 and M318 the two ISR1 sequences differed by only 6 and 1 nucleotides, respectively. In M320 the ISR1 obtained with the primer for rrnA differed in 4 aligned nucleotides but lacked two of the insertions found in the ISR1 of rrnE (see below). That does not necessarily mean that the ISR1 spacers present at the putative rrnA locus derive from the rrnE operon, since, at least in E. coli, intercistronic heterogeneity in many strains is low (2) and any of the ISR1-containing operons could have a sequence very similar to that of rrnE. To simplify the terminology the ISRs obtained with the rrnE and rrnA primers are identified in the text and figures by the letters E and A, respectively, and the strain designation.
Variability among the ISR1 operons.
The alignment of the ISR1 operons (containing a single tRNA gene) is shown in Fig. 1. Figure 1 includes the nine sequences retrieved from the strains with the primers designed for rrnE as well as three sequences obtained with the primers for operon rrnA. Two types of sequence variation were found. One variation affected aligned residues only and was used to analyze level of divergence and to infer relationships. The other involves insertions and/or deletions or block substitutions and will be discussed separately (see below). The polymorphic nucleotides of ISR1 and their relation to the secondary structure are shown in Fig. 2. In the S. enterica ISR1s that were retrieved the maximum pairwise nucleotide divergence was 18% and the average divergence was 9.8%. This is about three times higher than that found for the rrnE operon in E. coli (2.6%) (2), perhaps reflecting the wider phylogenetic span (genetic variability) included in S. enterica (25). Although the number of polymorphic sites is much larger in the ISR1 spacers of S. enterica, the variation in this region, similar to that of E. coli, is rather restricted and is concentrated at some hypervariable sites, including those already detected in E. coli. For the 70 nucleotides downstream of the 16S rRNA gene, in addition to the 8 polymorphic sites which are also present in E. coli (2), S. enterica strains have 14 additional polymorphic sites (a total of 22 polymorphic sites).
A phylogenetic tree constructed by the neighbor-joining method and comparing the ISR1 sequences retrieved is shown in Fig. 3. The main clusters reflect the subspecies structure and are generally concordant with a tree of three housekeeping (HK) genes (5). Subspecies I, VI, II, IV, and IIIb form a cluster as do the five HK genes studied by Boyd et al. (5), and subspecies V and IIIa appear clearly distant from the rest, which is also the case for the HK gene tree. Some sequences found in S. enterica, such as those of E298 and ELT2, are more similar to rrnE of E. coli K-12 than they are to other S. enterica strains.
As found in E. coli rrn operons, there are polymorphisms clustered in hypervariable regions in which substitutions affecting blocks of nucleotides occur, sometimes with insertions and deletions of highly variable lengths. At positions 66 to 86 (Fig. 1 and 2), some E. coli ISR1-containing operons present the rsl sequence (6), an insertion of 106 nucleotides. In the S. enterica strains studied the last 10 nucleotides are highly polymorphic, although all the alternative sequences found were of equivalent length, similar to that of the E. coli ISR1 without the rsl sequence. The second of such variable regions is located downstream from the tRNAGlu gene and corresponds to an insertion found previously in rrnE and rrnC of ECOR 40 (2) which contains a perfect direct repeat of the last 11 bases of the tRNAGlu gene (Fig. 2). Sequences A318, E318, E320 (all representatives of subspecies IV), and ELT2 (subspecies I) have this insertion. In nucleotides 234 to 274 there is a secondary structure loop (Fig. 2) that has been shown to be hypervariable in E. coli (2, 12, 16), in which species there are at least three versions which have little similarity to one another except that each maintains the same size and general structure of the loop (2). In the strains of S. enterica studied the version found in ECOR 35 and ECOR 40 predominates; E321 has a version that is almost identical to version I (2) of E. coli, which is present in most operons and strains in this species. ISRs A318, E318, and E320 have yet another different and longer loop (box B in Fig. 2) that has not been found in E. coli. The secondary structure loop from nucleotide 307 to nucleotide 338 (Fig. 2) concentrates 18 polymorphisms with most strains diverging widely from the sequence found in E. coli and the two representatives of subspecies I. Finally, the region downstream from nucleotide 358 is also highly polymorphic. In E. coli there are two alternative versions of 17 and 8 nucleotides depending on the strains and operons (2). In S. enterica two ISRs, ELT2 and E298, had sequences very similar to the 17-mer version of E. coli (the most abundant in this species). Two ISRs, E495 and E316, had a large insertion of approximately 145 bp occupying this region. This insertion could form a long secondary structure loop (Fig. 2). The rest (E324, A313, E313, E321, A318, E318, E320, and A320) all have a stretch of 10 bp vaguely reminiscent of the octamer found in a few strains of E. coli.
Most polymorphisms, particularly those affecting clusters of nucleotides, are located outside the regions that are involved in secondary structure with nucleotide stretches upstream or downstream of the 16S or 23S gene, respectively, in the transcript (Fig. 2). This is understandable considering the role of these regions as RNase III targets and the requirements for concerted variation with regions distantly located in the operon.
Variability among the ISR2.
The rrnA-specific PCR primer generated amplicons with an ISR2 (containing tRNAIle and tRNAAla genes) only in six of the nine strains studied here. Therefore, the alignment of the sequences for the ISR2-containing operons shown in Fig. 1 comprises only 6 S. enterica sequences plus the sequence for the E. coli K-12 rrnA operon included for comparison. The ISR2-containing operons obtained have very little sequence diversity for the 150-odd bases that comprise the tRNAIle plus the 70 bp located upstream from this gene. This region is also very similar to the E. coli ISR2-containing operons. Downstream from the tRNAIle, variability is very high, and in fact, this stretch in all the S. enterica strains diverged considerably from that in E. coli, in which all the strains and operons studied have an identical sequence of 43 bp (2). In S. enterica this stretch, i.e., downstream from the tRNAile, is much larger in all strains (about 120 bp). Among the six strains of Salmonella studied, four (LT2 and M298, both subspecies I; M495, subspecies II; and M321, subspecies V) had nearly identical sequences (one nucleotide different) of 112 bp. M324 (subspecies VI) and M316 (subspecies IIIb) had stretches of 111 and 114 bp, respectively, but with widely divergent sequences. The three types of sequence have little in common except overall length and the first 5 and last 11 nucleotides. A316 is different throughout the whole sequence, while A324 differs mostly in the first 50 nucleotides of this stretch, being for the last 60-odd bases identical to that of ALT2 and the other sequences. Downstream of the tRNAAla there is little heterogeneity at the level of isolated nucleotide substitutions, and again the sequence is rather similar to that of E. coli. The hypervariable loop located near the 5′ end of the 23S gene (Fig. 2, box C) is also present in the ISR2 spacers. Here the variability is extremely high to the point that none of the sequences determined was identical to any of the others. ALT2, A298, and A316 have 20-mer stretches that are highly similar to the 19-mer stretch found in rrnE in E. coli K-12. A321 has a long loop similar to that which was also found in this region in E316 and E495 (both ISR1). A495 has a very short stretch of 7 bp, again with a completely different sequence from that found in the rrnA operon of E. coli K-12. Finally, A324 has still another sequence of 11 bp, which is similar to the sequence present in rrnA of K-12.
ISR1 spacers in other Enterobacteriaceae.
To get an impression of how the spacer sequence varies with increasing phylogenetic distance, we sequenced the smallest ISRs present in the ribosomal operons of seven strains of different species of Enterobacteriaceae. In fact operons belonging to C. freundii and Enterobacter aerogenes had spacers of 235 and 198 bp, respectively, that did not contain a tRNA gene and thus cannot be considered functionally equivalent to the other ribosomal operons. A tree which compares these sequences with those of rrnE of E. coli K-12 and S. enterica LT2 is shown in Fig. 4. The conserved positions for all the Enterobacteriaceae studied are shown in Fig. 1. Only the regions around the RNase III recognition site and box A are conserved for all of the species studied. Generally speaking, the clustering was similar to that developed from other gene comparisons (1, 19, 32). Escherichia fergusonii is a sister species to E. coli, and S. enterica appears as its closest relative in the ISR1 tree. These relationships are also evident in a comparison of HK genes. K. pneumoniae and E. vulneris form a cluster that also appears in a phylogenetic tree generated by comparing gapA and ompA genes (19). At the level of the ISR1 this cluster is already widely divergent. Klebsiella is the closest genus to E. coli and Salmonella by 16S rRNA sequence (1). E. blattae and E. hermannii are very divergent, which is consistent with the genus Escherichia being polyphyletic as has been shown by other authors (19).
DISCUSSION
The relative lack of functional restriction of the rRNA operon spacers would make us expect a rather high level of sequence variation. Still, some data in the literature hints at unexpected levels of conservation in other spacer sequences. For example, in the classical studies by Milkman and coworkers of the trp operon of E. coli (24, 34) there is information showing that the variation in the noncoding areas is similar to, or only slightly higher than, that in the coding ones; the main difference was that in the coding areas the variation seemed to concentrate on the third base of each codon (silent mutations). If the noncoding or spacer areas of the bacterial genomes really are devoid of function, something that is by no means certain, it could be considered an argument in favor of the neutral hypothesis for molecular evolution (18), i.e., if variation is found with similar frequencies in functional and nonfunctional regions it could be because most of it is neutral.
Altogether, the results of this work strengthen the view based on a previous work with E. coli (2) in the sense that, at the level of isolated nucleotide substitutions, the ISRs do not differ so much from the rRNA genes themselves. The main difference is the presence of relatively large insertions and/or deletions. These drastically alter the primary structure of the spacer sequence and in the long run can probably also affect the secondary structure and even the function of the different regions of the spacer DNA. This phenomenon is by no means exclusive to the ISR sequences. For example, the spacer between the genes rpsU and dnaG only varies in 15 of 108 alignable nucleotides between E. coli K-12 and S. enterica LT2 (33). However, in S. enterica there is an insertion of 120 bp that is not present in E. coli. Other bacterial species have been shown to differ due to insertion and/or deletion events affecting the central region of the rrn spacer (12, 36). Since it appears that the size of the spacer must vary within certain limits, it is foreseeable that the accumulation of insertions could eventually lead to deletions affecting other areas and thus to the gradual substitution of the overall spacer sequence, as is found when relatively wide phylogenetic distances are analyzed at this level.
The phylogeny generated by the ISR1 sequences is roughly consistent with that of 16S rRNA or housekeeping genes in Enterobacteriaceae, as has been shown for other bacterial groups (20). However, the distribution of clustered polymorphisms did not form a coherent picture with any of the relationship trees inferred from HK sequences or multilocus enzyme electrophoresis. Even the barrier between the two main clusters of S. enterica subspecies, i.e., subspecies IIIa and V and subspecies I, II, IIIb, IV, and VI, appears inconsistent with the distribution of some of the polymorphisms. For example, the long loop of box C (Fig. 2) is found in M316 (subspecies IIIb) and M495 (subspecies II) and a similar loop is present in the ISR2 of M321 (subspecies V). The same applies to the sequence located between the two tRNA genes in ISR2 of A321 (subspecies V) that is also present in a representative of subspecies II (A495) and in those of subspecies I (LT2 and A298). The presence of some of these sequences in relatively separated lineages of S. enterica and even E. coli hints at a relatively widespread horizontal transfer and recombination within and between species. In a previous work (2) a model was suggested in which two opposing forces act upon the rrn operons. Homogenization processes would lead to little intercistronic heterogeneity and conservation of the operon sequence. On the other hand, horizontal transfer from other strains (that homogenize different versions of the operon sequence), followed by recombination, acts as a continuous source of variation and intercistronic heterogeneity. The diversity found in the different operons and strains results from the equilibrium between these two opposed processes (2). In S. enterica recombination seems to be even more widespread, acting among different subspecies and perhaps even species (E. coli).
It is remarkable that in terms of the ISR1 sequences there is no sharp discrimination between E. coli and S. enterica, i.e., some sequences found in S. enterica are more similar to E. coli than they are to other Salmonella strains. The main difference between the two species was found in the inter-tRNA gene stretch of the ISR2 spacers. Otherwise it would be difficult to delimit both species as separate entities by ISR sequences. For example, E298 (subspecies I) has only 19 nucleotide differences from the rrnE gene of K-12, which is much less than the number of nucleotide differences found for the rrnE gene of some strains of E. coli (2) or those between the rrnE of E289 and other ISR1s of S. enterica strains. As shown in Fig. 1 the subspecies II (M495) and subspecies I (M298 and LT2) representatives have, over most of the ISR1, an E. coli-like sequence, although LT2 and E495 have Salmonella features like the sequence downstream of TRNAGlu found in ELT2 or the insertion of box C (Fig. 2) in E495. This mosaic-like structure is normally attributed to recombination, although the possibility of these features corresponding to ancestral traits cannot be totally disregarded.
On the other hand, the average (most frequent bases at polymorphic sites) sequence found in S. enterica is significantly different from the average sequence in E. coli. This can be interpreted as indicating that both species are linked by a continuum of diversity rather than having the sharp discontinuity expected for sexual and isolated populations (15).
ACKNOWLEDGMENTS
Financial support was from CICYT PM95-0111 to F.R.-V. and from the Australian Research Council to P.R.R. Part of the work was carried out while F.R.-V. was a Visiting Academic at the Department of Microbiology, The University of Sydney, supported by a Generalitat Valenciana scholarship.
We are grateful to K. Hernández for secretarial assistance and S. Ingham for graphics.
REFERENCES
- 1.Ahmad S, Weisburg W G, Jensen R A. Evolution of aromatic amino acid biosynthesis and application to the fine-tuned phylogenetic positioning of enteric bacteria. J Bacteriol. 1990;172:1051–1061. doi: 10.1128/jb.172.2.1051-1061.1990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Antón, A. I., A. J. Martínez-Murcia, and F. Rodríguez-Valera. Sequence diversity in the 16S-23S intergenic spacer region (ISR) of the rRNA operons in representatives of the Escherichia coli ECOR collection. J. Mol. Evol., in press. [DOI] [PubMed]
- 3.Barry T, Colleran G, Glennon M, Dunican L K, Gannon P. The 16S/23S ribosomal spacer region as target for DNA probes to identify eubacteria. PCR Methods Appl. 1991;1:51–56. doi: 10.1101/gr.1.1.51. [DOI] [PubMed] [Google Scholar]
- 4.Berg K L, Squires C, Squires C L. Ribosomal RNA operon anti-termination function of leader and spacer region box B-box A sequences and their conservation in diverse microorganisms. J Mol Biol. 1989;209:345–358. doi: 10.1016/0022-2836(89)90002-8. [DOI] [PubMed] [Google Scholar]
- 5.Boyd E F, Wang F-S, Whittam T S, Selander R K. Molecular genetic relationships of the salmonellae. Appl Environ Microbiol. 1996;62:804–808. doi: 10.1128/aem.62.3.804-808.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Brosius J, Dull T J, Sleeter D D, Noller H F. Gene organization and primary structure of a ribosomal RNA operon from Escherichia coli. J Mol Biol. 1981;148:107–127. doi: 10.1016/0022-2836(81)90508-8. [DOI] [PubMed] [Google Scholar]
- 7.Cilia V, Lafay B, Christen R. Sequence heterogeneities among 16S ribosomal RNA sequences and their effect on phylogenetic analyses at the species level. Mol Biol Evol. 1996;13:451–461. doi: 10.1093/oxfordjournals.molbev.a025606. [DOI] [PubMed] [Google Scholar]
- 8.Condon C, Philips J, Fu A, Squires C, Squires C L. Comparison of the expression of the seven ribosomal RNA operons in Escherichia coli. EMBO J. 1992;11:4175–4185. doi: 10.1002/j.1460-2075.1992.tb05511.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Condon C, Squires C, Squires C L. Control of rRNA transcription in Escherichia coli. Microbiol Rev. 1995;59:623–645. doi: 10.1128/mr.59.4.623-645.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Don R H, Cox P T, Wainwright B J, Baker K, Mattick J S. ‘Touchdown’ PCR to circumvent spurious priming during gene amplification. Nucleic Acids Res. 1991;19:4008. doi: 10.1093/nar/19.14.4008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.García-Martínez J, Martínez-Murcia A J, Rodríguez-Valera F, Zorraquino A. Molecular evidence supporting the existence of two major groups in uropathogenic Escherichia coli. FEMS Immunol Med Microbiol. 1996;14:231–244. doi: 10.1111/j.1574-695X.1996.tb00291.x. [DOI] [PubMed] [Google Scholar]
- 12.Gürtler V, Barrie H D. Typing of Staphylococcus aureus strains by PCR-amplification of variable-length 16S-23S rDNA spacer regions: characterization of spacer sequences. Microbiology. 1995;141:1255–1265. doi: 10.1099/13500872-141-5-1255. [DOI] [PubMed] [Google Scholar]
- 13.Gürtler V, Stanisich V A. New approaches to typing and identification of bacteria using the 16S-23S rDNA spacer region. Microbiology. 1996;142:3–16. doi: 10.1099/13500872-142-1-3. [DOI] [PubMed] [Google Scholar]
- 14.Gutell R R, Larsen N, Woese C R. Lessons from an evolving rRNA: 16S and 23S rRNA structures from a comparative perspective. Microbiol Rev. 1994;58:10–26. doi: 10.1128/mr.58.1.10-26.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Guttman D S, Dykhuizen D E. Clonal divergence in Escherichia coli as a result of recombination, not mutation. Science. 1994;266:1380–1383. doi: 10.1126/science.7973728. [DOI] [PubMed] [Google Scholar]
- 16.Harvey S, Hill C W, Squires C, Squires C L. Loss of the spacer loop sequence from the rrnB operon in the Escherichia coli K12 subline that bears the relA1 mutation. J Bacteriol. 1988;170:1235–1238. doi: 10.1128/jb.170.3.1235-1238.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Jensen M A, Webster J A, Straus N. Rapid identification of bacteria on the basis of polymerase chain reaction-amplified ribosomal DNA spacer polymorphisms. Appl Environ Microbiol. 1993;59:945–952. doi: 10.1128/aem.59.4.945-952.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kimura M. The neutral theory of molecular evolution. London, United Kingdom: Cambridge University Press; 1983. [Google Scholar]
- 19.Lawrence J G, Ochman H, Hartl D L. Molecular and evolutionary relationships among enteric bacteria. J Gen Microbiol. 1991;137:1911–1921. doi: 10.1099/00221287-137-8-1911. [DOI] [PubMed] [Google Scholar]
- 20.Leblond-Bourget N, Philippe H, Mangin I, Decaris B. 16S rRNA and 16S to 23S internal transcribed spacer sequence analyses reveal inter- and intraspecific Bifidobacterium phylogeny. Int J Syst Bacteriol. 1996;46:102–111. doi: 10.1099/00207713-46-1-102. [DOI] [PubMed] [Google Scholar]
- 21.Lehner A F, Harvey S, Hill C W. Mapping and spacer identification of rRNA operons of Salmonella typhimurium. J Bacteriol. 1984;160:682–686. doi: 10.1128/jb.160.2.682-686.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Liu S-H, Sanderson K E. Highly plastic chromosomal organization in Salmonella typhi. Proc Natl Acad Sci USA. 1996;93:10303–10308. doi: 10.1073/pnas.93.19.10303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Martinez-Murcia, A. J., A. I. Antón, and F. Rodríguez-Valera. Concerted evolution by homogenization of the rRNA multigene family may induce sequence convergences responsible for chronometric distortions and conflicting phylogenetic trees. Submitted for publication.
- 24.Milkman R, Bridges M M. Molecular evolution of the Escherichia coli chromosome. IV. Sequence comparisons. Genetics. 1993;133:455–468. doi: 10.1093/genetics/133.3.455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Nelson K, Whittam T S, Selander R K. Nucleotide polymorphism and evolution in the glyceraldehyde-3-phosphate dehydrogenase gene (gapA) in natural populations of Salmonella and Escherichia coli. Proc Natl Acad Sci USA. 1991;88:6667–6671. doi: 10.1073/pnas.88.15.6667. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Noller H, Nomura M. Ribosomes. In: Neidhardt F C, Curtiss III R, Ingraham J L, Lin E C C, Low K B, Magasanik B, Reznikoff W S, Riley M, Schaechter M, Umbarger H E, editors. Escherichia coli and Salmonella: cellular and molecular biology. 2nd ed. Washington, D.C: ASM Press; 1996. pp. 167–186. [Google Scholar]
- 27.Normand P, Ponsonnet C, Nesme X, Neyra M, Simonet P. ITS analysis of prokaryotes. In: Akkermans A D L, van Elsas J D, de Bruijn F J, editors. Molecular microbial ecology manual. Norwell, Mass: Kluwer Academic Publishers; 1996. pp. 1–12. [Google Scholar]
- 28.Ochman H, Selander R K. Standard reference strains of Escherichia coli from natural populations. J Bacteriol. 1984;157:690–693. doi: 10.1128/jb.157.2.690-693.1984. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Pace N R, Burgin A B. Processing and evolution of the rRNAs. In: Hill W E, Dahlberg A, Garrett R A, Moore P B, Schlessinger D, Warner J R, editors. The ribosome. Structure, function, & evolution. Washington, D.C: American Society for Microbiology; 1990. pp. 417–425. [Google Scholar]
- 30.Sechi L A, Daneo-Moore L. Characterization of intergenic spacers in two rrn operons of Enterococcus hirae ATCC 9790. J Bacteriol. 1993;175:3213–3219. doi: 10.1128/jb.175.10.3213-3219.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Selander R K, Cougant D A, Whittam T S. Genetic structure and variation in natural populations of Escherichia coli. In: Neidhardt F C, Ingraham J L, Low K B, Magasanik B, Schaechter M, Umbarger H E, editors. Escherichia coli and Salmonella typhimurium: cellular and molecular biology. Washington, D.C: American Society for Microbiology; 1987. pp. 1625–1648. [Google Scholar]
- 32.Selander R K, Li J, Boyd E F, Wang F-S, Nelson K. Bacterial diversity and systematics. New York, N.Y: Plenum Press; 1994. DNA sequence analysis of the genetic structure of populations of Salmonella enterica and Escherichia coli. In F. G. Priest, A. Ramos-Cormenzana, and B. J. Tindall (ed.) [Google Scholar]
- 33.Sharp P M. Determinants of DNA sequence divergence between Escherichia coli and Salmonella typhimurium: codon usage, map position, and concerted evolution. J Mol Evol. 1991;33:23–33. doi: 10.1007/BF02100192. [DOI] [PubMed] [Google Scholar]
- 34.Stolzfus A, Leslie J F, Milkman R. Molecular evolution of the Escherichia coli chromosome. I. Analysis of structure and natural variation in a previously uncharacterized region between trp and tonB. Genetics. 1988;120:345–358. doi: 10.1093/genetics/120.2.345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Thampapillai G, Lan R, Reeves P R. Molecular evolution in the gnd locus of Salmonella enterica. Mol Biol Evol. 1994;11:813–828. doi: 10.1093/oxfordjournals.molbev.a040165. [DOI] [PubMed] [Google Scholar]
- 36.Whiley R A, Duke B, Hardie J M, Hall L M C. Heterogeneity among 16S-23S rRNA intergenic spacers of species within the ‘Streptococcus miller group.’. Microbiology. 1995;141:1461–1467. doi: 10.1099/13500872-141-6-1461. [DOI] [PubMed] [Google Scholar]