Skip to main content
Infection and Immunity logoLink to Infection and Immunity
. 1999 Jan;67(1):230–236. doi: 10.1128/iai.67.1.230-236.1999

Type-Specific Contributions to Chromosome Size Differences in Escherichia coli

Christopher K Rode 1, Lyla J Melkerson-Watson 1, Amanda T Johnson 1, Craig A Bloch 1,2,*
Editor: P E Orndorff
PMCID: PMC96301  PMID: 9864220

Abstract

The Escherichia coli genome varies in size from 4.5 to 5.5 Mb. It is unclear whether this variation may be distributed finely throughout the genome or is concentrated at just a few chromosomal loci or on plasmids. Further, the functional correlates of size variation in different genome copies are largely unexplored. We carried out comparative macrorestriction mapping using rare-restriction-site alleles (made with the Tn10dRCP2 family of elements, containing the NotI, BlnI, I-CeuI, and ultra-rare-cutting I-SceI sites) among the chromosomes of laboratory E. coli K-12, newborn-sepsis-associated E. coli RS218, and uropathogenic E. coli J96. These comparisons showed just a few large accessory chromosomal segments accounting for nearly all strain-to-strain size differences. Of 10 sepsis-associated and urovirulence genes, previously isolated from the two pathogens by scoring for function, all were colocalized exclusively with one or more of the accessory chromosomal segments. The accessory chromosomal segments detected in the pathogenic strains from physical, macrorestriction comparisons may be a source of new virulence genes, not yet isolated by function.


The gram-negative bacterium Escherichia coli occurs commonly as a benign enteric commensal of mammals. Additionally, different types of E. coli characteristically cause different diseases (46), including the hemolytic-uremic syndrome (15, 19, 29, 49), urosepsis (1), and newborn sepsis/meningitis (20). Although recent determination of the entire nucleotide sequence from laboratory strain K-12 indicated 4,639 kb (6), estimates for natural isolates range from 4,660 to 5,300 kb (3). This indicates substantial size differences among genome copies of the various E. coli strains. How these differences originated and have persisted is unclear.

Genes for some enterobacterial virulence traits, especially those essential to one or another major pathogenic life cycle, may reside on specialized chromosomal elements, i.e., pathogenicity islands (7, 21, 23, 29, 35, 47); in contrast, others, notably antibiotic resistances, typically reside on plasmids (18). Chromosomal virulence traits may be both difficult to isolate by functional means (e.g., if their phenotypes can be scored only in interactions with mammalian hosts) and impossible to isolate by straightforward physical means (i.e., by plasmid preparations, given that genes for them occur integrated on the chromosome). They could be identified by the positional approach to gene discovery (13, 17), however, if the genes conferring them could be distinguished as local alterations to chromosome structure prior to functional analysis. To investigate the applicability of positional gene discovery for finding genes that contribute to E. coli pathogenesis, we mapped the components of chromosomal size differences among laboratory strain K-12 and two pathogenic strains, the uropathogen J96 and the newborn-sepsis-associated strain RS218. Further, we compared the locations of these large, accessory chromosomal segments with the locations of known virulence genes.

MATERIALS AND METHODS

Bacterial genetics techniques.

Bacterial strains were grown in LB with aeration or on solid LB or M9-glucose (34). Media were supplemented with kanamycin (50 μg/ml), spectinomycin (100 μg/ml), and/or chloramphenicol (15 μg/ml) as required. Cultures were incubated at 37°C, or at 30°C for P1 infections of RS218 and RS218-chimera cultures (10). Cells were stored long term by being suspended in LB-glycerol (80%/20%, vol/vol) and cooled to −80°C. Bacteriophage stocks were grown and stored as described by Sternberg and Maurer (45). Double-insertion mutants of strain MG1655 and single- and double-insertion mutants of strains RS218 and J96 were generated by transducing recipient strains with P1Δdamrev6 lysates of MG1655 insertion mutants (40). Genome structure, assessed by pulsed-field gel electrophoresis (PFGE) in at least six independent isolates from each transduction, was used to confirm P1 transduction fidelity and lack of transduction-associated rearrangements.

Genomic DNA biophysical techniques.

Genomic DNAs were purified from 5-ml overnight cultures of wild-type and insertionally mutagenized E. coli in a manner suitable for yielding macrorestriction fragments (50 to 1000 kb), as described previously (40). After digestion of agarose-embedded DNAs with I-SceI (Boehringer Mannheim, Indianapolis, Ind.) for 1 h, NotI (New England Biolabs, Beverly, Mass.) for 4 to 5 h, or BlnI (Panvera, Madison, Wis.) overnight, according to the manufacturers’ directions, and after reaction buffer decanting, agarose dots were melted (70°C) and gently pipetted with plastic 200-μl tips into sample wells in 1.2% agarose (PFGE approved; FastLane; FMC, Portland, Maine) gels for electrophoresis in 0.5× TBE buffer (0.045 M Tris borate, 0.045 M boric acid, 0.001 M EDTA) in a PFGE apparatus (Bio-Rad DR-III) according to the manufacturer’s instructions. Ramping of PFGE pulse times was determined as described elsewhere (5); ramping from 11 to 21 s over 13 h and from 50 to 56 s over 7 h was used to approximate log-linear separations between 150 and 350 kb. After electrophoresis of samples with Megabase I and/or II DNA standards (Gibco/BRL, Bethesda, Md.), fragments sizes were quantitated as described elsewhere (25).

RESULTS

Structural and functional correlates to genome size variation within species have only recently been attempted (3, 4). The components of genome size were investigated in three strains from different genealogical branches of E. coli (43). The strains analyzed were the nonpathogenic laboratory K-12 strain MG1655 (2), the newborn-sepsis strain RS218 (44), and the uropathogenic strain J96 (28). The chromosomes of these strains vary in length from 4,673 kb for strain MG1655 to 5,195 kb for strain RS218 to 5064 kb for strain J96 (Fig. 1). Also, strains RS218 and J96 carry plasmids of 110 and 113 kb, respectively. The additional ∼556 kb of chromosomal DNA in pathogenic strain RS218 and ∼455 kb of chromosomal DNA in pathogenic strain J96 relative to the nonpathogenic strain MG1655 may reside within pathogenicity islands (7, 23, 29, 35), i.e., chromosomal segments on which genes contributing to the virulence of the pathogenic strains reside. Additionally, “black hole” genomic deletions that enhance pathogenicity (32) also need to be considered. Macrorestriction maps of these chromosomes by NotI (22, 23, and 25 fragments in strains MG1655, RS218, and J96, respectively), BlnI (13, 17, and 13 fragments in strains MG1655, RS218, and J96, respectively), and I-CeuI (7 fragments in all strains) digestions are shown in Fig. 1. Further, through macrorestriction analyses we were able to map the positions in the different strains for a set of 20 rare-restriction-site alleles, made with Tn10dRCP2 insertion elements, which carry the rare-cutting polylinker 2 of rare restriction sites including NotI, BlnI, I-CeuI, and I-SceI.

FIG. 1.

FIG. 1

Locations of the identical set of rare-restriction-site insertions on linearized macrorestriction maps from three different copies of the E. coli circular chromosome. Flags marked by location indicate all 20 insertions on the MG1655 (25), RS218 (40), and J96 (41) maps. The location of each insertion in the MG1655 background was known from the NotI, BlnI, and/or I-CeuI restriction pattern changes that it caused (40); locations of insertions in the RS218 and J96 backgrounds were determined by the same procedure. For each insertion, the allele number and host strains from the three different backgrounds are given in Table 1; note that the actual host strains in Table 1 each carry only one insertion rather than all 20 as depicted here for conciseness.

Integrated macrorestriction mapping with transposons that carry rare restriction sites can distinguish accessory chromosomal segments from conserved chromosomal segments in the physical maps of different chromosomal copies (40); this requires determination of reference loci and of the physical distances separating those loci. Reference loci were determined, and gene order conservation was assessed in these three strains by introduction of Tn10dRCP2 insertions carrying the I-SceI restriction site (9). I-SceI is an ultra-rare-cutting megaendonuclease which recognizes an 18-bp nucleotide, TAGGGATAA↓CAGGGTAAT, generating 3′ cohesive ends (36). Statistically, the I-SceI restriction site occurs once in ∼6.9 × 1010 bp. Therefore, it is not surprising that this sequence does not occur in E. coli sequences of ∼5 Mb. The Tn10dRCP2 family of insertion elements occurs in three antibiotic resistance varieties. Previously, MG1655::Tn10dKanRCP2, MG1655::Tn10dSpcRCP2, and MG1655::Tn10dCamRCP2 insertion mutants were isolated (9). From this strain collection, eight MG1655::Tn10dKanRCP2, nine MG1655::Tn10dSpcRCP2, and three MG1655::Tn10dCamRCP2 strains (Table 1) were chosen to facilitate comparisons among the chromosomes of E. coli MG1655, RS218, and J96 and to localize chromosomal additions/deletions. These MG1655::Tn10dRCP2 mutants were chosen to give 20 I-SceI insertions separated from one another by approximately ∼250 kb (i.e., ∼5,000 kb of E. coli genome/20). By this attention to spacing, the ability to resolve chromosomal segment size in the three strain backgrounds between neighboring pairs of I-SceI fragments was optimized. This was because all of the I-SceI fragments generated by adjacent pairs of these evenly spaced insertions could be determined to equivalent accuracy with a single set of PFGE parameters designed to afford log-linear separations between 150 and 350 kb (5). The 20 Tn10dRCP2, I-SceI cleavage site landmarks were introduced around the chromosome within either the RS218 or the J96 strain background by P1 transduction (Table 1). The locations of the I-SceI insertions within each strain were mapped relative to that strain’s macrorestriction map based on the artificial NotI, BlnI, and/or I-CeuI site introduced on the Tn10dRCP2 element. Locations of the Tn10dRCP2 inserts in all three strain backgrounds are shown on a linearized schematic of the E. coli chromosome opened at the thrA gene at 0 min (Fig. 1). The clockwise order of the 20 insertions was maintained in all three backgrounds, indicating a lack of detectable inversions or translocations. This was despite the potential for inversion between even the different laboratory derivatives of strain K-12 (26, 38) but was indeed expected from general conservation of the E. coli genetic map throughout the species and the family Enterobacteriaceae (39).

TABLE 1.

E. coli strains used in this studya

Strain RS218 transductant J96 transductant Descriptionb
MG1655 K-12 prototype
RS218 Newborn meningitis prototype
J96 Pyelonephritis prototype
MG1655::Tn10dKanRCP2; Kmr
χM2008 χM3008 χM4008 zah-108, (316), [312], {319}
χM2020 χM3020 χM4020 zbh-120, (813), [836], {812}
χM2031 χM3031 χM4031 zch-131, (1267), [1439], {1385}
χM2064 χM3064 χM4064 purF-164, (2419), [2659], {2474}
χM2080 χM3080 χM4080 cys-180, (2875), [3158], {2922}
χM2095 χM3095 χM4095 zhc-195, (3363), [3701], {3640}
χM2107 χM3107 χM4107 zid-207, (3870), [4209], {4148}
χM2127 χM3127 χM4127 zji-227, (4504), [4941], {4895}
MG1655::Tn10dSpcRCP2; Spr
χM2002 χM3002 χM4002 car-102, (35), [35], {35}
χM2014 χM3014 χM4014 zbd-114, (575), [597], {573}
χM2026 χM3026 χM4026 zcc-126, (1038), [1130], {1031}
χM2046 χM3046 χM4046 zdh-146, (1773), [1922], {1803}
χM2061 χM3061 χM4061 zeh-161, (2226), [2400], {2281}
χM2070 χM3070 χM4070 zfh-170, (2671), [2914], {2723}
χM2088 χM3088 χM4088 zgf-188, (3124), [3457], {3401}
χM2099 χM3099 χM4099 zhi-199, (3607), [3945], {3889}
χM2121 χM3121 χM4121 zje-221, (4315), [4682], {4596}
MG1655::Tn10dCamRCP2; Cmr
χM2038 χM3038 χM4038 zdb-138, (1499), [1668], {1564}
χM2053 χM3053 χM4053 zed-153, (1985), [2161], {2012}
χM2115 χM3115 χM4115 zii-215, (4080), [4443], {4357}
MG1655 (Tn10dSpcRCP2) (Tn10dKanRCP2)
χM2207 χM3207 χM4207 car-102 (Spr) zah-108 (Kmr)
χM2208 χM3208 χM4208 zbd-114 (Spr) zbh-120 (Kmr)
χM2209 χM3209 χM4209 zcc-126 (Spr) zch-131 (Kmr)
χM2210 χM3210 χM4210 zeh-161 (Spr) purF-164 (Kmr)
χM2211 χM3211 χM4211 zfh-170 (Spr) cys-180 (Kmr)
χM2212 χM3212 χM4212 zgf-188 (Spr) zhc-195 (Kmr)
χM2213 χM3213 χM4213 zhi-199 (Spr) zid-207 (Kmr)
χM2214 χM3214 χM4214 zje-221 (Spr) zji-227 (Kmr)
MG1655 (Tn10dKanRCP2) (Tn10dSpcRCP2)
χM2215 χM3215 χM4215 zah-108 (Kmr) zbd-114 (Spr)
χM2216 χM3216 χM4216 zbh-120 (Kmr) zcc-126 (Spr)
χM2217 χM3217 χM4217 cys-180 (Kmr) zgf-188 (Spr)
χM2216 χM3216 χM4216 zbh-120 (Kmr) zcc-126 (Spr)
χM2217 χM3217 χM4217 cys-180 (Kmr) zgf-188 (Spr)
χM2218 χM3218 χM4218 purF-164 (Kmr) zfh-170 (Spr)
χM2219 χM3219 χM4219 zji-227 (Kmr) car-102 (Spr)
χM2220 χM3220 χM4220 zhc-195 (Kmr) zhi-199 (Spr)
MG1655 (Tn10dCamRCP2) (Tn10dSpcRCP2)
χM2221 χM3221 χM4221 zdb-138 (Cmr) zdh-146 (Spr)
χM2222 χM3222 χM4222 zii-215 (Cmr) zje-221 (Spr)
χM2223 χM3223 χM4223 zed-153 (Cmr) zeh-161 (Spr)
MG1655 (Tn10dSpcRCP2) (Tn10dCamRCP2)
χM2224 χM3224 χM4224 zdh-146 (Spr) zed-153 (Cmr)
MG1655 (Tn10dKanRCP2) (Tn10dCamRCP2)
χM2225 χM3225 χM4225 zch-131 (Kmr) zdb-138 (Cmr)
χM2226 χM3226 χM4226 zid-207 (Kmr) zii-215 (Cmr)
a

This study was the source for all strains listed except strain MG1655 (2), strain RS218 (44), strain J96 (28), and χM strains 2002, 2008, 2014, 2020, 2026, 2031, 2038, 2046, 2053, 2061, 2064, 2070, 2080, 2088, 2095, 2099, 2107, 2115, 2121, and 2127 (9). 

b

Locus designations were relative to established transposon insertions, determined as described previously (40), or were determined by auxanography (25). Physical map coordinates are clockwise from the position 25 kb counterclockwise to the (conserved) native NotI site nearest the car locus (25). The insertions’ coordinates refer to kilobases in the MG1655 (parentheses), RS218 (brackets), and J96 (braces) strain backgrounds; these were determined from the distances between insertions by I-SceI digestions from double mutants (Fig. 3B). 

The crossing of pairs of rare-restriction-site alleles between different E. coli strain backgrounds allows physical distance comparisons between pairs of corresponding points in different copies of the E. coli genome (9). Biophysical comparison of corresponding genome segments between strains MG1655 and pathogenic strains RS218 and J96 was carried out following introduction of pairs of Tn10dRCP2 insertion alleles. This was done by sequential P1 transductions; the distinct antibiotic resistances carried by the neighboring Tn10dRCP2 inserts in the set enabled construction of double mutants in the second step of this process (Table 1). In this way, the chromosome was divided into 20 contiguous and nonoverlapping intervals in each of the three strain backgrounds. A representative macrorestriction-PFGE analysis of corresponding double-insertion mutants, with strains χM2211, χM3211, and χM4211, is shown in Fig. 2. A genomic NotI digestion of strain χM2211 (MG1655 zcc-126::Tn10dSpcRCP2, zch-131::Tn10dKanRCP2) is shown (Fig. 2, lane 1). The NotI pattern serves to verify the Tn10dRCP2 insertions into native fragments JN and IN. This strain carrying a pair of Tn10dRCP2 inserts was detected by the loss of the JN and IN bands and the generation of four new subfragments (Fig. 2, lane 1). This same strain was also digested with I-SceI, resulting in a single band of 229 kb (Fig. 2, lane 2). The corresponding double mutants χM3211 (RS218 zcc-126::Tn10dSpcRCP2 zch-131::Tn10dKanRCP2) and χM4211 (J96 zcc-126::Tn10dSpcRCP2 zch-131::Tn10dKanRCP2) were also digested with NotI and I-SceI. For verification of the double insertion into the RS218 background, loss of CN and MN was sought (Fig. 2, lane 3); for verification in the J96 background, loss of DN and MN was sought (Fig. 2, lane 5). The different sizes of the genome interval, measured in unit fragments following I-SceI digestion, were determined to be 309 kb within χM3211 for RS218 (Fig. 2, lane 4) and 354 kb within χM4211 for J96 (Fig. 2, lane 6). These data indicate that both pathogenic E. coli strains RS218 and J96 contain added chromosomal segments within this interval, potentially containing virulence factors (see below).

FIG. 2.

FIG. 2

Determination of macrorestriction fragment length polymorphism on the E. coli chromosome. The macrorestriction digestion patterns of genomic DNAs from different double mutants, all bearing the same insertions (zcc-126 and zch-131) in the MG1655 background (lanes 1 and 2), the RS218 background (lanes 3 and 4), and the J96 background (lanes 5 and 6), are shown. Missing native fragments (white bars) and novel subfragments (black bars) generated by NotI restriction (lanes 1, 3, and 5) indicated the positions of the insertions in the different strain backgrounds relative to native sites (Fig. 1). (Open bars indicate the electrophoretic positions of 5-kb novel NotI subfragments from each background, invisible under the conditions shown.) Unique fragments generated by I-SceI restriction (lanes 2, 4, and 6) indicate by contrast, in readily comparable units allowed by the absence of native sites, the distances between the insertions in the different strain backgrounds (Fig. 3).

Similar comparative analyses were repeated for each of the 20 ∼250-kb genomic intervals. The results are summarized schematically in Fig. 3A relative to the MG1655 background. The sizes of the I-SceI intervals for the double-Tn10dRCP2 strains are given in Fig. 3B. An interval size difference of >7 kb between corresponding I-SceI fragments was taken to indicate substantial addition or deletion. Detected differences of ≤7 kb were considered to have resulted from small rearrangements including transposon and insertion sequence migrations. It is conceivable, however, and a general shortcoming of comparative mapping with rare-restriction-site insertions, that the differences of ≤7 kb could have reflected larger additions or deletions canceling out each other’s contributions to size within a given interval. The RS218 chromosome contained 10 unique segments relative to the MG1655 chromosome: zah to zbd (26 kb), zbh to zcc (69 kb), zcc to zch (80 kb), zdh to zed (27 kb), zeh to purF (66 kb), zfh to cys (40 kb); cys to zgf (50 kb), zid to zii (24 kb), zje to zji (70 kb), and zji to thrA (85 kb). The RS218 chromosome had one deletion of 20 kb relative to the MG1655 chromosome between zdb and zdh. The J96 chromosome had two deletions relative to strain MG1655: one of 35 kb within the same zdb-to-zdh interval (as in RS218), and a second of 53 kb between zch and zdb. The J96 chromosome had four unique segments relative to the MG1655 chromosome: zcc to zch (125 kb), zed to zeh (28 kb), cys to zgf (230 kb), and zje to zii (110 kb). Three of these J96 unique segments mapped to the same intervals as different unique segments in strain RS218: those at zcc to zch (22 to 27 min), cys to zgf (cys to 65 min), and zje to zii (94 to 98 min). These locations contain the virulence factors sfa (22 to 27 min in strains RS218 and J96) kpsA (64 min in strain RS218), hlyB/D and pap (64 min in strain J96), and hlyB/D, prs, and cnf (94 to 98 min in strain J96). Interestingly, some of the same virulence factors (hlyB/D and prs) were originally mapped to yet other chromosomal loci in uropathogenic strain 536 (12).

FIG. 3.

FIG. 3

Three-copy integrated macrorestriction map of the E. coli chromosome. Three identical sets of 20 rare-restriction-site insertions, each in a different copy of the chromosome (Fig. 1), were used; the different copies encoded genealogically distant E. coli types (i.e., strains MG1655, RS218, and J96). (A) Flags marked by location indicate the borders of the 20 chromosome intervals delimited by adjacent pairs of the insertions; the allele numbers of these insertions are given in Table 1. The distances between the insertions in each strain background were determined, as shown in Fig. 2, by PFGE of I-SceI digests from double mutants constructed by P1 transduction. Intervals containing differences of >7 kb between copies from the different backgrounds are shown as carrying putative copy-specific additions (tangential circles) or putative copy-specific deletions (gaps). The additions and deletions are labeled by size in kilobases and by “R” or “J” for RS218 or J96, respectively. Intervals containing fragment-size differences of ≤7 kb (potentially caused by small rearrangements including insertion-sequence and transposon migrations) are depicted as essentially undisrupted (without circles or gaps). Arrows indicate the known positions of J96 virulence factors (sfa [41]; hlyB/D and pap [47]; hlyB/D, prs, and cnf [47]) and of RS218 virulence factors (sfa [41]; kpsA [16]; ibe-10 [7]; ibeB [7]). (B) The sizes of I-SceI fragments from MG1655, RS218, and J96 double mutants are indicated for the corresponding chromosome intervals in the gridwork below the map. The additions and deletions represented schematically are reflected in the I-SceI fragment sizes labeled in bold.

DISCUSSION

Previously we have shown that insertions containing rare restriction sites can facilitate integrated genome mapping (40). In subsequent work we have shown that pairs of insertions containing the unique I-SceI restriction site allow purification of the genomic intervals that they flank (9). In this study, we combined integrated genome mapping with the isolation of genomic intervals between I-SceI insertions to identify chromosomal segments that distinguish pathogenic from nonpathogenic E. coli strains. Through comparisons of corresponding chromosomal segments, we anticipated finding evidence of insertions and/or deletions that contributed to genome size variation and perhaps to pathogenic traits. We identified 11 such chromosomal differences between strains RS218 and MG1655 (10 additions and 1 deletion) and 6 between strains J96 and MG1655 (4 additions and 2 deletions) of genomic segments of 15 kb or larger. These relatively few, large additions and deletions accounted for nearly all genome size differences.

The E. coli strains that we examined, MG1655, RS218, and J96, exhibit distinct modes of interaction with mammalian hosts. Strain MG1655 is a nonpathogenic derivative of E. coli K-12. Strains RS218 and J96 are pathogenic, especially in targeted subpopulations of the host. These strains also exhibit extensive (up to ∼500 kb) variation in chromosome size. Despite their extensive structural and functional divergence, overall chromosomal gene order is conserved among these three strains; i.e., the 20 I-SceI insertions are in the same order along all three chromosomes. In addition, the data indicated five different classes of genomic intervals: (i) seven intervals carrying genomic segments of the same length in all three strains, (ii) eight intervals carrying additional genomic segments in one or the other of the two pathogenic strains, (iii) three intervals carrying additional genomic segments in both pathogenic strains, (iv) one interval carrying a genomic deletion relative to strain MG1655 in one of the two pathogenic strains, and (v) one interval carrying different genomic deletions relative to strain MG1655 in both pathogenic strains (Fig. 3). The conservation of overall gene order and of many physical distances among the chromosomes of the three strains indicates a previously unimagined degree of structural identity-by-descent among them. They represent various instances of the E. coli chromosome in which different combinations of accessory components and/or deletions have been acquired.

Associated with a majority of the strain-specific chromosomal segments from the two pathogens were genes contributing to some of the key virulence traits distinguishing them. For instance, newborn-sepsis-associated strain RS218 carries genes putatively for penetration of epithelial basement membranes (sfa) (24), for immune evasion by molecular mimicry of a fetal brain antigen (kpsA) (22), and for penetration of the blood-brain barrier (ibe-10) (27). The coordinates for these virulence genes within the E. coli chromosome are as follows: sfa, 24 min (41); kpsA, 64 min (8, 16, 48); ibe-10, 87 min (7); and ibeB, 98 min (7). Herein we demonstrate the association of these virulence factors with the RS218-specific segments at zcc to zch (∼24 min), zeh to purF (∼47 min), cys to zgf (∼64 min), zid to zii (∼87 min), and zji to thrA (∼98 min), respectively. The uropathogenic strain J96 carries genes putatively for penetration of epithelial basement membrane (sfa) (37, 42), for ascendance of the host’s ureters (pap) (30), for disruption of eukaryotic cells by α-hemolysin (hly) and by cytotoxic necrotizing factor 1 (cnf) (11), and for adhesion to host tissues (prs) (31). The coordinates for these virulence genes within the E. coli chromosome are as follows: sfa at 24 min (41); hlyB/D and pap at 64 min (47); and hlyB/D, prs, and cnf at 94 min (47). Again, these J96 virulence factors are associated with J96-specific segments at zcc to zch (∼24 min), cys to zgf (∼64 min), and zje to zji (∼94 min), respectively. The acquisition of different strain-specific pathogenicity islands within the same genomic regions indicates that these loci are potential hot spots for evolution of pathogenic traits. Insertions of many known pathogenicity islands into the E. coli chromosome are at tRNA genes: at the phenylalanine gene pheV for the pap gene of J96 (47) and the kpsA gene of strain RS218 (16); and at the selenocysteine gene selC for the locus of enterocyte effacement element (33), and at the phenylalanine gene pheR for the prs and hly genes, of strain J96 (11). Further, strains RS218 and J96 both have large genomic deletions relative to strain MG1655 at zdb to zdh (∼31 min). Strain J96 has a second deletion occurring at zch to zdb (∼27 min). These deletions may constitute virulence black holes (loss of genes enhancing a strain’s virulence), like that recently reported for the evolution of Shigella spp. and enteroinvasive E. coli (32). This inverse complement to pathogenicity islands may also contribute to the evolution of these pathogens by enhancing the pathogen’s survival through the loss of chromosomal sequences.

Our findings of large unique components to the E. coli chromosome in pathogenic strains are consistent with the work of others showing that virulence factors tend to be clustered both on pathogenicity islands (23) and within particular branches of the E. coli tree (14). Further, the segments uniquely absent from the chromosomes of pathogenic strains are consistent with Maurelli et al.’s black-hole concept that loss of chromosomal components may be important in the evolution of pathogenesis (32). In the two pathogenic strains, the unique regions identified by physical chromosomal alignments relative to strain MG1655 were colocalized with known virulence genes, and it is possible that the unaccounted-for coding capacity of these regions may contain new unidentified virulence factors. Also, the unique chromosomal segments in the three strains accounted for most of the genome size differences among them, which may suggest an explanation for the correlation between genome size variation and conventional genetic distance in E. coli.

ACKNOWLEDGMENTS

This work was supported by grants R29-AI31419 and R01-AI40074 from the National Institutes of Health to C.A.B.

We thank S. Hanash for kind and enthusiastic support of this work, J. Adams for critically reviewing the manuscript, and Erin McDaid-Kelly and Janice Hatch for technical assistance in preparation of the manuscript.

REFERENCES

  • 1.Bacheller C D, Bernstein J M. Urinary tract infections. Med Clin North Am. 1997;81:719–730. doi: 10.1016/s0025-7125(05)70542-3. [DOI] [PubMed] [Google Scholar]
  • 2.Bachmann B J. Derivations and genotypes of some mutant derivatives of Escherichia coli K-12. In: Neidhardt F C, Ingraham J L, Low K B, Magasanik B, Schaechter M, Umbarger H E, editors. Escherichia coli and Salmonella typhimurium: cellular and molecular biology. Vol. 2. Washington, D.C: American Society for Microbiology; 1987. pp. 1190–1219. [Google Scholar]
  • 3.Bergthorsson U, Ochman H. Distribution of chromosome length variation in natural isolates of Escherichia coli. Mol Biol Evol. 1998;15:6–16. doi: 10.1093/oxfordjournals.molbev.a025847. [DOI] [PubMed] [Google Scholar]
  • 4.Bergthorsson U, Ochman H. Heterogeneity of genome sizes among natural isolates of Escherichia coli. J Bacteriol. 1995;177:5784–5789. doi: 10.1128/jb.177.20.5784-5789.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Birren B, Lai E. Pulsed field gel electrophoresis: a practical guide. San Diego, Calif: Academic Press, Inc.; 1993. [Google Scholar]
  • 6.Blattner F, Plunkett III G, Bloch C A, Perna N T, Riley M, Burland V, Collado-Vides J, Glassner J D, Rode C K, Mayhew G, Gregor J, Davis N W, Kirkpatrick H, Goeden M, Rose D, Mau R, Shao Y. The complete genome sequence of Escherichia coli. Science. 1997;277:1453–1462. doi: 10.1126/science.277.5331.1453. [DOI] [PubMed] [Google Scholar]
  • 7.Bloch C A, Huang S-H, Rode C K, Kim K S. Mapping of noninvasion TnphoA mutations on the Escherichia coli O18:K1:H7 chromosome. FEMS Microbiol Lett. 1996;144:171–176. doi: 10.1111/j.1574-6968.1996.tb08526.x. [DOI] [PubMed] [Google Scholar]
  • 8.Bloch C A, Rode C K. Pathogenicity island evaluation in Escherichia coli K1 by crossing with strain K-12. Infect Immun. 1996;64:3218–3223. doi: 10.1128/iai.64.8.3218-3223.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bloch C A, Rode C K, Obreque V H, Mahillon J. Purification of Escherichia coli chromosomal segments without cloning. Biochem Biophys Res Commun. 1996;223:104–111. doi: 10.1006/bbrc.1996.0853. [DOI] [PubMed] [Google Scholar]
  • 10.Bloch C A, Thorne G M, Ausubel F M. General method for site-directed mutagenesis in Escherichia coli O18ac:K1:H7: deletion of the inducible superoxide dismutase gene, sodA, does not diminish bacteremia in neonatal rats. Infect Immun. 1989;57:2141–2149. doi: 10.1128/iai.57.7.2141-2148.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Blum G, Falbo V, Caprioli A, Hacker J. Gene clusters encoding the cytotoxic necrotizing factor type 1, Prs-fimbriae and α-hemolysin form the pathogenicity island II of the uropathogenic Escherichia coli strain J96. FEMS Microbiol Lett. 1995;126:189–196. doi: 10.1111/j.1574-6968.1995.tb07415.x. [DOI] [PubMed] [Google Scholar]
  • 12.Blum G, Ott M, Lischewski A, Ritter A, Imrich H, Tschape H, Hacker J. Excision of large DNA regions termed pathogenicity islands from tRNA-specific loci in the chromosome of an Escherichia coli wild-type pathogen. Infect Immun. 1994;62:606–614. doi: 10.1128/iai.62.2.606-614.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Botstein D, White R L, Skolnick M, Davis R W. Construction of a genetic linkage map in Man using restriction fragment length polymorphism. Am J Hum Genet. 1980;32:314–331. [PMC free article] [PubMed] [Google Scholar]
  • 14.Boyd E F, Hartl D L. Chromosomal regions specific to pathogenic isolates of Escherichia coli have a phylogenetically clustered distribution. J Bacteriol. 1998;180:1159–1165. doi: 10.1128/jb.180.5.1159-1165.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Campos L C, Whittam T S, Gomes T A, Andrade J R, Trabulsi L R. Escherichia coli serogroup O111 includes several clones of diarrheagenic strains with different virulence properties. Infect Immun. 1994;62:3282–3288. doi: 10.1128/iai.62.8.3282-3288.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Cieslewicz M, Vimr E. Reduced polysialic acid capsule expression in Escherichia coli K1 mutants with chromosomal defects in kpsF. Mol Microbiol. 1997;26:237–249. doi: 10.1046/j.1365-2958.1997.5651942.x. [DOI] [PubMed] [Google Scholar]
  • 17.Collins F, Guyer M, Chakravarti A. Variations on a theme: cataloging human DNA sequence variation. Science. 1997;278:1580–1581. doi: 10.1126/science.278.5343.1580. [DOI] [PubMed] [Google Scholar]
  • 18.Falkow S. Infectious multiple drug resistance. London, England: Pion Limited; 1975. [Google Scholar]
  • 19.Feng P, Lampel K A, Karch H, Whittam T S. Genotypic and phenotypic changes in the emergence of Escherichia coli O157:H7. J Infect Dis. 1998;177:1750–1753. doi: 10.1086/517438. [DOI] [PubMed] [Google Scholar]
  • 20.Ferrieri, P. 1990. Neonatal susceptibility and immunity to major bacterial pathogens. Rev. Infect. Dis. 12(Suppl. 4):S394–S400. [DOI] [PubMed]
  • 21.Fetherston J D, Schuetze P, Perry R D. Loss of the pigmentation phenotype in Yersinia pestis is due to the spontaneous deletions of 102 kb of chromosomal DNA, which is flanked by a repetitive element. Mol Microbiol. 1992;6:2693–2704. doi: 10.1111/j.1365-2958.1992.tb01446.x. [DOI] [PubMed] [Google Scholar]
  • 22.Finne J, Leinonen M, Mäkelä P H. Antigenic similarities between brain components and bacteria causing meningitis. Lancet. 1983;ii:355–357. doi: 10.1016/s0140-6736(83)90340-9. [DOI] [PubMed] [Google Scholar]
  • 23.Hacker J, Blum-Oehler G, Muhldorfer I, Tschape H. Pathogenicity islands of virulent bacteria: structure, function, and impact on microbial evolution. Mol Microbiol. 1997;23:1089–1097. doi: 10.1046/j.1365-2958.1997.3101672.x. [DOI] [PubMed] [Google Scholar]
  • 24.Hacker J, Kestler H, Hoschutzky H, Jann K, Lottspeich F, Korhonen T K. Cloning and characterization of the S fimbrial adhesin II complex of an Escherichia coli O18:K1 meningitis isolate. Infect Immun. 1993;61:544–550. doi: 10.1128/iai.61.2.544-550.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Heath J D, Perkins J D, Sharma B, Weinstock G M. NotI genomic cleavage map of Escherichia coli K-12 strain MG1655. J Bacteriol. 1992;174:558–567. doi: 10.1128/jb.174.2.558-567.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Hill C W, Harnish B W. Inversions between ribosomal RNA genes of Escherichia coli. Proc Natl Acad Sci USA. 1981;78:7069–7072. doi: 10.1073/pnas.78.11.7069. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Huang S H, Wass C, Fu Q, Prasadarao N V, Stins M, Kim K S. Escherichia coli invasion of brain microvascular endothelial cells in vitro and in vivo: molecular cloning and characterization of invasion gene ibe10. Infect Immun. 1995;63:4470–4475. doi: 10.1128/iai.63.11.4470-4475.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Hull R A, Gill R E, Hsu P, Minshew B H, Falkow S. Construction and expression of recombinant plasmids encoding type 1 or d-mannose-resistant pili from a urinary tract infection Escherichia coli isolate. Infect Immun. 1981;33:933–936. doi: 10.1128/iai.33.3.933-938.1981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Knapp S, Hacker J, Jarchau T, Goebel W. Large, unstable inserts in the chromosome affect virulence properties of uropathogenic Escherichia coli O6 strain 536. J Bacteriol. 1986;168:22–30. doi: 10.1128/jb.168.1.22-30.1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lund B, Lindberg F, Marklund B I, Normark S. The PapG protein is the alpha-d-galactopyranosyl-(1-4)-beta-d-galactopyranose-binding adhesin of uropathogenic Escherichia coli. Proc Natl Acad Sci USA. 1987;84:5898–5902. doi: 10.1073/pnas.84.16.5898. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Marklund B-I, Tennent J M, Garcia E, Hamers A, Baga M, Lindberg F, Gaastra W, Normark S. Horizontal gene transfer of the Escherichia coli pap and prs pili operons as a mechanism for the development of tissue-specific adhesive properties. Mol Microbiol. 1992;6:2225–2242. doi: 10.1111/j.1365-2958.1992.tb01399.x. [DOI] [PubMed] [Google Scholar]
  • 32.Maurelli A T, Fernandez R E, Bloch C A, Rode C K, Fasano A. “Black holes” and bacterial pathogenicity: a large genomic deletion that enhances the virulence of “Shigella” spp. and enteroinvasive Escherichia coli. Proc Natl Acad Sci USA. 1998;95:3943–3949. doi: 10.1073/pnas.95.7.3943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.McDaniel T K, Jarvis K G, Donnenberg M S, Kaper J B. A genetic locus of enterocyte effacement conserved among diverse enterobacterial pathogens. Proc Natl Acad Sci USA. 1995;92:1664–1668. doi: 10.1073/pnas.92.5.1664. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Miller J H. Experiments in molecular genetics. Cold Spring Harbor, N.Y: Cold Spring Harbor Laboratory; 1972. [Google Scholar]
  • 35.Mills D M, Bajaj V, Lee C A. A 40kb chromosomal fragment encoding Salmonella typhimurium invasion genes is absent from the corresponding region of the Escherichia coli K-12 chromosome. Mol Microbiol. 1995;15:749–759. doi: 10.1111/j.1365-2958.1995.tb02382.x. [DOI] [PubMed] [Google Scholar]
  • 36.Monteilhet C, Perrin A, Thierry A, Colleau L, Dujon B. Purification and characterization of the in vitro activity of I-SceI, a novel and highly specific endonuclease encoded by a group I intron. Nucleic Acids Res. 1990;18:1407–1413. doi: 10.1093/nar/18.6.1407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Morschhauser J, Vetter V, Korhonen T, Uhlin B E, Hacker J. Regulation and binding properties of S fimbriae cloned from E. coli strains causing urinary tract infection and meningitis. Zentbl Bakteriol. 1993;278:165–176. doi: 10.1016/s0934-8840(11)80834-0. [DOI] [PubMed] [Google Scholar]
  • 38.Perkins J D, Heath J D, Sharma B R, Weinstock G M. XbaI and BlnI genomic cleavage maps of Escherichia coli K-12 strain MG1655 and comparative analysis of other strains. J Mol Biol. 1993;232:419–445. doi: 10.1006/jmbi.1993.1401. [DOI] [PubMed] [Google Scholar]
  • 39.Riley M, Krawiec S. Genome organization. In: Neidhardt F C, Ingraham J L, Low K B, Magasanik B, Schaechter M, Umbarger H E, editors. Escherichia coli and Salmonella typhimurium: cellular and molecular biology. Vol. 2. Washington, D.C: American Society for Microbiology; 1987. pp. 967–981. [Google Scholar]
  • 40.Rode C K, Obreque V H, Bloch C A. New tools for integrated genetic and physical analyses of the Escherichia coli chromosome. Gene. 1995;166:1–9. doi: 10.1016/0378-1119(95)00630-5. [DOI] [PubMed] [Google Scholar]
  • 41.Rode, C. K., L. Xhang, B. Foxman, and C. A. Bloch. Unpublished data.
  • 42.Schmoll T, Hoschutzky H, Morschhauser J, Lottspeich F, Jann K, Hacker J. Analysis of genes coding for the sialic acid-binding adhesin and two other minor fimbrial subunits of the S-fimbrial adhesin determinant of Escherichia coli. Mol Microbiol. 1989;3:1735–1744. doi: 10.1111/j.1365-2958.1989.tb00159.x. [DOI] [PubMed] [Google Scholar]
  • 43.Selander R K, Caugant D A, Whittam T S. Genetic structure and variation in natural populations of Escherichia coli. In: Neidhardt F C, Ingraham J L, Low K B, Magasanik B, Schaechter M, Umbarger H E, editors. Escherichia coli and Salmonella typhimurium: cellular and molecular biology. Vol. 2. Washington, D.C: American Society for Microbiology; 1987. pp. 1625–1647. [Google Scholar]
  • 44.Silver R P, Finn C W, Vann W F, Aaronson W, Schneerson R, Kretschmer P J, Garon C F. Molecular cloning of the K1 capsular polysaccharide genes of E. coli. Nature. 1981;289:696–698. doi: 10.1038/289696b0. [DOI] [PubMed] [Google Scholar]
  • 45.Sternberg N L, Maurer R. Bacteriophage-mediated generalized transduction in Escherichia coli and Salmonella typhimurium. Methods Enzymol. 1991;204:18–43. doi: 10.1016/0076-6879(91)04004-8. [DOI] [PubMed] [Google Scholar]
  • 46.Sussman M. Escherichia coli: mechanisms of virulence. New York, N.Y: Cambridge University Press; 1997. [Google Scholar]
  • 47.Swenson D, Bukanov N, Berg D, Welch R. Two pathogenicity islands in uropathogenic Escherichia coli J96: cosmid cloning and sample sequencing. Infect Immun. 1996;64:3736–3743. doi: 10.1128/iai.64.9.3736-3743.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Vimr E. Map position and genomic organization of the kps cluster for polysialic acid synthesis in Escherichia coli K1. J Bacteriol. 1991;173:1335–1338. doi: 10.1128/jb.173.3.1335-1338.1991. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Whittam T S, Wolfe M L, Wachsmuth I K, Orskov F, Orskov I, Wilson R A. Clonal relationships among Escherichia coli strains that cause hemorrhagic colitis and infantile diarrhea. Infect Immun. 1993;61:1619–1629. doi: 10.1128/iai.61.5.1619-1629.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Infection and Immunity are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES