Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2000 Aug 15;97(17):9671–9676. doi: 10.1073/pnas.97.17.9671

Identification of type II restriction and modification systems in Helicobacter pylori reveals their substantial diversity among strains

Qing Xu *, Richard D Morgan , Richard J Roberts , Martin J Blaser *,‡,§
PMCID: PMC16923  PMID: 10944229

Abstract

A total of 22 type II restriction endonucleases with 18 distinct specificities have been identified in six Helicobacter pylori strains. Among these 18 specificities are three completely new endonucleases, Hpy178III, Hpy99I, and Hpy188I, that specifically cleave DNA at TCNNGA, CGWCG, and TCNGA sites, respectively. The set of endonucleases identified in each strain varies, but all have four- or five-base recognition sequences. Among 16 H. pylori strains, examination of the DNA modification status at the recognition sites of 15 restriction endonucleases reveals that each strain has a substantially different complement of type II modification systems. We conclude that the type II restriction-modification systems in H. pylori are highly diverse between strains, a unique characteristic of H. pylori. The diverse methylation status of H. pylori chromosomal DNA may serve as a new typing system to discriminate H. pylori isolates for epidemiological and clinical purposes. This study also demonstrates that H. pylori is a rich source of type II restriction endonucleases.


Type II restriction-modification (R-M) systems are pairs of enzymes: one, a restriction endonuclease (RE); the other, a methyltransferase, with opposing intracellular activities (1, 2). The RE recognizes specific sequences in DNA and cleaves the DNA at a particular site whereas the cognate methyltransferase modifies DNA within the same recognition sequence, thereby preventing cleavage by the RE. By possessing these two opposing enzymes, bacteria may protect their own DNA and degrade foreign DNA, thus limiting the spread of invading DNA molecules within the bacterial population (36). In addition, DNA methylation may be involved in regulation of chromosomal DNA replication (7, 8) and gene expression (9, 10), transposon movement (11), or DNA mismatch repair (12).

Helicobacter pylori are bacteria that colonize the human stomach and increase the risk of development of peptic ulcer disease and gastric adenocarcinoma (1315). Analysis of the entire genomic sequence of H. pylori strains 26695 and J99 predicted that they possess an unusually high number (14 or 15) of type II R-M systems (16, 17). Comparison of these two strains showed that the two genomes are quite similar, with only 6–7% strain-specific genes (17); R-M systems are a major source of the strain differences. Furthermore, comparison of strains J166 and 26695 using a PCR-based subtractive hybridization method (18) showed that 7 of 18 J166-specific DNA clones had homology to the genes of R-M systems. In total, these studies demonstrated that, at the genomic level, R-M systems are quite diverse among the three strains examined. Whether this diversity is a general phenomenon among various H. pylori strains and which functional R-M systems are present in these strains remain as unanswered questions. Thus far, only the iceA-hpyIM locus in H. pylori, an ulcer-related NlaIII-like type II R-M system (19, 20), has been studied among various strains, which indicates that the M gene is conserved whereas the R gene (iceA1) is not. To investigate whether H. pylori strains indeed have diversified R-M systems, we sought to study the H. pylori R-M systems by identifying type II REs at the biochemical level in six strains. In complementary studies, we also have examined the DNA modification status at multiple RE recognition sites in a variety of H. pylori strains.

Materials and Methods

Bacterial Strains, Growth Conditions, and Reagents.

The bacterial strains used in this study are listed in Table 1. H. pylori strains were grown under conditions described (20). Digestion buffers were from New England Biolabs. All columns used for protein purification were obtained from Amersham Pharmacia, unless specifically indicated.

Table 1.

H. pylori strains used in this study

H. pylori strain Relevant genotype Origin Age/Sex* Ethnicity* Diagnosis*
60190 iceA1, cagA+, vacA s1a m1 U.K. NA NA NA
J166 iceA1, cagA+, vacA s1b m1 U.S. 56M Caucasian Duodenitis
CH4 iceA1, cagA+, vacA s1a m2 China NA Oriental NA
9366 iceA1, cagA+, vacA s2 m2 U.S. 57M Caucasian DU
A101 iceA1, cagA+, vacA s1a m2 U.S. 75M Caucasian NUD
26695 iceA1, cagA+, vacA s1a m1 U.K. NA NA NA
97690 iceA1, cagA+, vacA s1a m1 India 45M Caucasian NUD
HPK5 iceA1, cagA+, vacA s1a m1 Japan NA Oriental NA
J195 iceA2, cagA, vacA s2 m2 U.S. 44M Caucasian NUD
J178 iceA2, cagA+, vacA s1a m1 U.S. 61M African CA
J188 iceA2, cagA, vacA s2 m2 U.S. 54M Caucasian NUD
J262 iceA2, cagA, vacA s2 m2 U.S. 63M Caucasian NUD
8822 iceA2, cagA, vacA s2 m2 U.S. NA NA NA
97762 iceA2, cagA+, vacA s1a m1 India 62F Oriental NUD
98518 iceA2, cagA+, vacA s1a m1 India 30F Oriental NUD
J99 iceA2, cagA+, vacA s1b m1 U.S. 48M Caucasian DU
*

Patient information: DU, duodenal ulcer; NUD, non-ulcer dyspepsia; CA, gastric cancer; NA, data not available, reference strains. 

Restriction Endonuclease Assay.

Chromatography fractions were assayed for RE activity by incubation at 37°C in NEB buffer 4 (50 mM KOAc/20 mM Tris⋅OAc/10 mM Mg(OAc)2/1 mM DTT, pH 7.9), using phage λ, or φX174 DNAs as substrates, and were further examined by agarose gel electrophoresis. One unit of an RE is defined as the amount of the enzyme required to cleave completely 1 μg of φX174 DNA in 50 μl of NEB buffer 4 within 1 h at 37°C.

Purification of Restriction Endonucleases from Various H. pylori Strains.

Preparation of crude cell extracts of H. pylori.

H. pylori cells, grown in Brucella broth for 48 h, were harvested by centrifugation and were resuspended in sonication buffer (20 mM Tris⋅HCl/0.5 mM EDTA/1 mM DTT pH 7.5), then were sonicated until ≈50 mg of protein per gram of cells was released. The lysate was centrifuged, and the supernant was collected for column chromatography. Based on the characteristics of REs present in each strain, a different set of columns was used for the purpose of chromatography, as determined experimentally.

Chromatography of 60190 crude cell extract.

The crude extract made from 15 g of 60190 cells was first applied to a 20-ml heparin HyperD column (Biosepra, Marlborough, MA), which was eluted with a linear gradient of 0.05–1.0 M NaCl in a total volume of 200 ml. After assay, the fractions containing RE activity were pooled and applied to either 3-ml Heparin-TSK (Tosohaas, Philadelphia) or 6 ml Source Q-15 columns. The TSK column was then eluted with a 50-ml linear gradient of either 0.05–0.60 M or 0.20–0.85 M NaCl whereas the Source Q-15 column was eluted with a 100-ml linear gradient of 0.05–1.0 M NaCl. At this stage, all REs in 60190 were substantially purified and ready for characterization.

Chromatography of J178 crude cell extract.

The crude extract of ≈4 g of J178 cells was applied to a 5-ml Heparin Hitrap column, which was eluted with a 50-ml linear gradient of 0.12–1.0 M NaCl. After endonuclease assay, the eluted fractions were combined into two pools, 0.50–0.60 M and 0.65–0.85 M NaCl, which were applied to 3-ml TSK columns separately. The TSK columns were eluted with a 60-ml linear gradient of either 0.10–1.0 M or 0.15–1.0 M NaCl. For further purification, the TSK fractions between 0.35 and 0.48 M NaCl were pooled and applied to a 2-ml Mono Q column, which was eluted with a 40-ml linear gradient of 0.05–0.60 M NaCl.

Chromatography of J188 crude cell extract.

The crude extract of 8 g of J188 cells was applied to a 20-ml Heparin HyperD column, which was eluted under the same conditions as described above. Endonuclease assay on eluted fractions revealed the presence of Hpy188I and one other specificity (21). Purification of Hpy188I has been described (21). To purify the second RE, the RE-containing fractions were applied to a 7-ml Mono S column, which was eluted with a linear gradient of 0.10–0.80 M NaCl.

Chromatography of J99 crude cell extract.

The crude extract of 3.2 g of J99 cells was applied to a 20-ml Heparin HyperD column, which was eluted under the same conditions as described above. The bound REs were eluted from the column between 0.35 and 0.44 M, 0.48 and 58 M, and 0.9 and 1.2 M NaCl, respectively. The fractions in each of these ranges were pooled and applied to a 3-ml Heparin TSK column separately. The TSK columns each were eluted with a 45-ml linear gradient of 0.05–0.60 M, 0.10–0.70 M, or 0.50–1.5 M NaCl. A 7-ml hydroxylapatite (HAP) column was used to purify REs in Hyper D flow-through fractions and was eluted with a 70-ml linear gradient of 0.05–0.40 M KPO4 (pH 7.4) in the presence of 0.20 M NaCl.

Chromatography of CH4 crude cell extract.

The crude extract of 8 g of CH4 cells was applied to a 20-ml Heparin HyperD column and was eluted under the same conditions as described above. Four peaks of RE activities were detected among fractions eluted from the column. After dilution, each pool was applied to either 3-ml Heparin-TSK or 2-ml Mono S columns. The TSK column was eluted by using 55 ml of a linear gradient of 0.05–0.60 M, 0.12–0.60 M, or 0.15–0.70 M NaCl whereas the Mono S column was eluted with a 0.05–0.60 M NaCl linear gradient. After assay, RE-containing fractions were further purified over either 7-ml SP-Sepharose or 2-ml Mono Q columns, which were eluted by linear gradients of NaCl (0.05–0.80 M or 0.05–0.60 M).

Chromatography of J166 crude cell extract.

The crude extract of 5.5 g of J166 cells was applied to a 20-ml Heparin HyperD column and was eluted under the same conditions as described above. RE fractions eluted from the column were pooled. Further purification used a 3-ml Heparin TSK column, which was eluted with a 45-ml linear gradient of 0.10–0.70 M NaCl. The flow-through fraction was applied to a 7-ml HAP column, eluted with a linear gradient of 0.05–0.40 KPO4. The RE fractions from the HAP column were applied to a 2-ml Mono Q column, then to a 1-ml Heparin Hitrap column, which was eluted with a 30-ml gradient of 0.05–0.70 M NaCl.

Determination of Recognition Sequences of Purified Restriction Endonucleases.

The recognition sequences of purified REs were determined as described (21).

Determination of the DNA Modification Status of Various H. pylori Strains.

Chromosomal DNAs from H. pylori were prepared as described (20) and were digested with purified REs, under the conditions used for assay of these purified H. pylori REs. The digested DNAs were subjected to electrophoresis through 1% agarose gels.

Results

Purification and Identification of Restriction Endonucleases in H. pylori.

Endonucleases from 60190.

We first sought to study the type II R-systems in H. pylori strains, using column chromatography. To purify REs from strain 60190, a crude extract was applied to Heparin Hyper D, Heparin TSK, and Source Q-15 columns. A total of four different type II REs, termed HpyII, HpyIV, HpyV, and HpyVIII, were purified from strain 60190. HpyV, HpyII, and HpyVII were eluted from the Heparin TSK columns in the 0.07–0.15 M, 0.34–0.37 M, and 0.40–0.48 M NaCl fractions, respectively, whereas HpyIV was identified in the 0.29–0.32 M NaCl fractions eluted from the Source Q-15 column. Mapping experiments indicated that the recognition sequence of HpyII is GAAGA (isoschizomer of MboII); of HpyIV is GANTC (HinfI); of HpyV is TCGA (TaqI); and of HpyVIII is CCGG (HpaII) (Table 2). HpyV had the highest total activity (≈2.7 × 104 units), followed by HpyIV, HpyVIII, and HpyII (≈7,800, ≈3,400, and ≈2,500 units, respectively).

Table 2.

Restriction endonucleases isolated from six H. pylori strains

H. pylori strain Designation of endonuclease Recognition site Isoschizomer Units of activity/g cells
60190 HpyII GAAGA MboII 1.7  × 102
HpyIV GANTC HinfI 5.2  × 102
HpyV TCGA TaqI 1.8  × 103
HpyVIII CCGG HpaII 2.3  × 102
J178 Hpy178II GAAGA MboII 2.3  × 10
Hpy178III TCNNGA None 1.5  × 102
Hpy178VI GGATG FokI 2.3  × 102
Hpy178VII GGCC HaeIII 2.8  × 103
J188 Hpy188I* TCNGA None 1.3  × 103
Hpy188III TCNNGA Hpy178III 6.2  × 10
J99 Hpy99I CGWCG None 1.6  × 10
Hpy99II GTSAC Tsp45 I 8.0  × 102
Hpy99III GCGC HhaI 2.9  × 103
CH4 HpyCH4I CATG NlaIII 1.4  × 102
HpyCH4II CTNAG DdeI 1.6  × 102
HpyCH4III ACNGT Tsp4CI 1.4  × 103
HpyCH4IV ACGT MaeII 3.1  × 10
HpyCH4V TGCA CviRI 2.1  × 102
HpyCH4VI TCNNGA Hpy178III 4.5  × 10
J166 Hpy166I TCNGA Hpy188I 2.4  × 103
Hpy166II GTNNAC MjaIV 7.3
Hpy166III CCTC MnlI 5.4

Enzymes with new specificities are written in bold. 

*

Purification and cloning of this restriction system are described in ref. 21

Endonucleases from J178.

A crude extract of J178 was applied to Heparin Hitrap, Heparin TSK, and Mono Q columns, before four REs, designated Hpy178II, Hpy178III, Hpy178VI, and Hpy178VII, were completely separated. Hpy178III and Hpy178VI were eluted between 0.35 and 0.38 M or 0.27 and 0.30 M NaCl from the TSK columns, respectively, whereas Hpy178II was eluted around 0.16 M NaCl from the Mono Q column, and Hpy178VII was in the flow-through fractions of the column. Mapping experiments showed that Hpy178III was an RE with a specificity TCNNGA; Hpy178II, also present in H. pylori 60190, is an isoschizomer of MboII; Hpy178VI recognizes GGATG (FokI); and Hpy178VII recognizes GGCC (HaeIII) (Table 2). The complement of REs in J178 was different from that in 60190, and, as in strain 60190, the enzymatic levels of the REs isolated from J178 varied substantially. In J178, Hpy178VII was the most abundant RE detected, with ≈1.14 × 104 units, followed by Hpy178VI, Hpy178III, and Hpy178II (≈900, ≈600, and ≈90 units, respectively).

Endonucleases from J188.

A crude extract of J188 was applied to Heparin Hyper D and Mono S columns. In total, only two REs, Hpy188I and Hpy188III, were identified from strain J188. Hpy188I is an RE with the specificity TCNGA (21). Hpy188III was eluted around 0.22 M NaCl from the Mono S column. Mapping experiments indicated that Hpy188III recognizes TCNNGA, the phenotype also found in J178 (Table 2). Hpy188I was the most abundant RE (≈104 units), followed by Hpy188III (≈500 units).

Endonucleases from J99.

To identify type II REs from the sequenced strain J99, a crude cell extract was applied to Heparin Hyper D, Heparin TSK, and HAP columns. In total, three REs, designated Hpy99I, Hpy99II, and Hpy99III, were isolated from strain J99. Hpy99I was eluted from the TSK column between 0.35 and 0.40 M NaCl whereas Hpy99II and Hpy99III eluted at ≈0.54 M or ≈0.90 M NaCl, respectively. Mapping of their recognition sites indicated that Hpy99I is an RE with the specificity CGWCG; Hpy99II recognizes GTSAC (an isoschizomer of Tsp45I); and Hpy99III recognizes GCGC (HhaI) (Table 2). Based on the total activities isolated, Hpy99III was the most abundant RE (≈9,400 units), followed by Hpy99II (≈2,550 units) and Hpy99I (≈50 units).

Endonucleases from CH4.

In addition to strains from the U.S. and Europe, we also screened an Asian strain, CH4, for its RE activities. Heparin Hyper D, Heparin TSK, Mono S, SP-Sepharose, and Mono Q columns were used for purification. In total, six REs, designated HpyCH4I, HpyCH4II, HpyCH4III, HpyCH4IV, HpyCH4V, and HpyCH4VI, were isolated. HpyCH4I, HpyCH4IV, and HpyCH4V were eluted between 0.13 and 0.23 M, 0.42 and 0.44 M, and 0.52 and 0.59 M NaCl, from the TSK columns, whereas HpyCH4II and HpyCH4VI were eluted around 0.15 M and 0.33 M NaCl from the Mono Q column. Furthermore, HpyCH4III was eluted from the Mono S column between 0.35 and 0.42 M NaCl. Mapping of their recognition sites showed that HpyCH4I recognizes CATG (an isoschizomer of NlaIII); HpyCH4II recognizes CTNAG (DdeI); HpyCH4III recognizes ACNGT (Tsp4CI); HpyCH4IV recognizes ACGT (MaeII); HpyCH4V recognizes TGCA (CviRI); and HpyCH4VI recognizes TCNNGA, the specificity also identified in strains J178 and J188 (Table 2). HpyCH4III had the highest activity (≈1.1 × 104 units), followed by HpyCH4V, HpyCH4II, HpyCH4I, HpyCH4VI, and HpyCH4IV (≈1,700, ≈1,250, ≈1,100, ≈360, ≈250 units, respectively). Of these REs, five were not found in any other strain screened.

Endonucleases from J166.

A crude extract of J166 was applied to Heparin Hyper D, Heparin TSK, HAP, Mono Q, and Heparin Hitrap columns, which permitted separation of three REs, designated Hpy166I, Hpy166II, and Hpy166III. Hpy166I was detected in fractions between 0.36 and 0.50 M NaCl from the Heparin Hyper D column, while Hpy166II was eluted from Mono Q column at ≈0.22 M NaCl, whereas Hpy166II was eluted from the Hitrap column around 0.20 M NaCl. Mapping experiments showed that Hpy166I recognizes TCNGA, the specificity identified in strain J188 (21); Hpy166II recognizes GTNNAC (an isoschizomer of MjaIV); and Hpy166III recognizes CCTC (MnlI) (Table 2). Among the REs isolated, Hpy166I has the highest activity (≈1.3 × 104 units), followed by Hpy166III (≈40 units); and Hpy166II (≈30 units). Hpy166II and Hpy166III were not identified in any of the other five strains.

Examination of the DNA Modification Status in H. pylori Strains.

In total, we isolated 22 type II REs with 18 distinct specificities from six H. pylori strains. Each strain had a varying complement of REs, suggesting considerable variation in R-M systems among H. pylori strains. However, an alternative explanation is that variable expression of a common set of R-M genes may be occurring, as seems to be the case in Neisseria gonorrhoeae (22). To assess this possibility, we examined the modification status of 16 clinical isolates of H. pylori, including the six strains used above for RE screening. These strains differ in their genotypes at the iceA locus; iceA1 is a homolog of the NlaIII gene whereas iceA2 is unrelated (19). Eight strains each are iceA1 or iceA2 genotype. Chromosomal DNA was prepared from these strains and was digested with 15 of the 18 distinct type II RE specificities we purified (Fig. 1). The other three specificities (HpyV, Hpy99I, and Hpy166II) could not be used, because most activity was lost during storage. Substantially different patterns of chromosomal DNA resistance to digestion by the 15 REs were observed among these H. pylori strains. Only two of the REs tested, Hpy99II and HpyCH4I, failed to digest any of the DNAs, indicating that all chromosomal DNAs were modified at GCGC and CATG sites, and confirming our previous observation that hpyIM is conserved (20). Another 13 recognition sites examined are modified in some, but not all, of the strains (Fig. 1; Table 3). Modification at the recognition site of HpyVIII (CCGG) was present in 15 of the 16 DNAs tested. Only DNA from strain 26695 was digestible, indicating that all strains, except 26695, carried a methyltransferase with the specificity of M.HpyVIII. Modification at the Hpy166III site (CCTC) also was common, because 14 of the 16 DNAs were resistant to digestion by Hpy166III. In contrast, modification at GGATG, the Hpy178VI site, was the least common, present only in strain J178, from which Hpy178VI was isolated. Modification at GGCC and TCNGA, the sites of Hpy178VII and Hpy188I, also was uncommon. Two strains, 97690 and J178, were protected against Hpy178VII; and two other strains, J166 and J188, against Hpy188I. Modification at the remaining nine recognition sites occurs in an intermediate number of strains (6–11 of the 16 strains) (Table 3).

Figure 1.

Figure 1

Digestion of chromosomal DNA from 16 H. pylori strains by 15 different type II REs. The digests, resolved in a 1% agarose gel, are listed in the right panel. “+” indicates digestible by the RE whereas “−” indicates resistant to digestion. The name, recognition site, and prototype of each RE are listed in the left panel. The sources of the chromosomal DNA are listed at the top of the figure. The iceA genotypes of the DNA are also indicated.

Table 3.

Modification status of chromosomal DNA from 16 H. pylori strains

Modification site H. pylori strains examined
A101 J166 60190 9366 26695 97690 CH4 HPK5 J99 J178 J188 J195 J262 8822 97762 98518
GANTC + +* + + + + + + +
CCGG + + +* + + + + + + + + + + + +
GAAGA +* + + + +* + +
TCNNGA + + + +* +* +* + + + + +
GGATG +*
GGCC + +*
TCNGA +* +*
GTSAC + + + + + + +* + + + + + +
GCGC + + + + + + + + +* + + + + + + +
CATG + + + + + + +* + + + + + + + + +
CTNAG + + +* + + +
ACNGT + + +* + + +
ACGT + + +* + + + + + + +
TGCA + + +* + + + +
CCTC + +* + + + + + + + + + + + +

“+” indicates “modified,” and “−” indicates “not modified.” 

*

Restriction enzymes isolated from each of the six strains (J166, 60190, CH4, J99, J178, and J188) correspond to the methylation in their DNA. 

Strains from A101 to HPK5 have the iceA1 genotype; strains from J99 to 98518 have the iceA2 genotype. 

The complement of the M systems in each strain appears different from that of the 15 others (Table 3). Among the 16 chromosomal DNAs tested, those from strains 97690, 8822, and 97762 were modified to the highest degree, with 10 of the 15 RE sites modified. In contrast, the DNA from two strains, J166 and 26695, was modified at only six of the 15 sites tested. We further compared the modification status by plotting the number of specific recognition sites shared between individual strains (Table 4); the shared sites range from 5 to 14. Three pairs of strains, A101/97762, 97762/98518 and HPK5/8822, possessed the most similar M systems; their modification status at the 15 sites tested are the same for each pair except for one site TGCA or GAAGA, respectively. In contrast, strain J178 shares with strains 9366 or HPK5 modification at only 5 of the 15 sites tested. The geographical relationship of M systems was assessed among these strains from the U.S., Europe, and Asia. Strains originating from the same geographic area generally do not share more methylation patterns than geographically unrelated strains (Tables 1 and 4). Overall, our studies on the 16 H. pylori strains indicates that H. pylori has many different active M systems and confirms the diversity of R-M systems among strains.

Table 4.

Sharing of chromosomal DNA modification status among 16 H. pylori strains at 15 different recognition sequences

H. pylori strain J166 60190 9366 26695 97690 CH4 HPK5 J99 J178 J188 J195 J262 8822 97762 98518
A101 8 12 9 10 10 11 9 13 9 12 13 13 10 14 13
J166 9 12 9 7 8 10 10 6 11 8 8 9 9 10
60190 8 11 9 8 10 12 10 11 12 12 11 11 12
9366 10 10 11 13 11 5 10 9 11 12 10 11
26695 9 6 10 12 8 11 10 10 9 9 10
97690 10 10 8 10 9 12 10 11 9 8
CH4 11 9 7 8 11 11 12 12 11
HPK5 11 5 8 11 11 14 10 11
J99 7 12 11 13 10 12 13
J178 8 9 7 6 8 7
J188 10 12 9 11 12
J195 11 12 12 11
J262 12 12 13
8822 11 12
97762 14

This table indicates the number of sites at which each pair of strains had the same DNA modification status. 

Discussion

Our screening of six H. pylori strains reveals a total of 22 different type II REs (Table 2), which indicates that H. pylori is rich in type II restriction systems. The enzymatic activities of these REs are varied, but they are in a normal range, when compared with those in other bacteria. Each H. pylori strain tested had a different complement of REs, and, in each of these six strains, modification of additional sequences was observed. Furthermore, in a survey of 16 H. pylori strains for the presence of the 15 type II M systems, the modification profiles were found to be substantially different (Fig. 1; Table 3). Overall, these data demonstrate that R-M systems are highly diversified in H. pylori strains. Comparison of the complete genome sequences of strains 26695 and J99 shows that the two strains share 11 of the 15 type II R-M genes (17), and comparable differences between them were observed in our study. We found six functional M systems in 26695, and seven in J99. The two strains share five systems, including the M systems of Hpy99III and HpyCH4I, that are conserved among all strains tested. Our demonstration of highly diversified R-M systems at a functional level is consistent with studies at the genomic level in strains 26695, J99, and J166 (16, 17). Our previous studies of two H. pylori R-M systems (Hpy188I and HpyI) also demonstrate substantial interstrain variation (1921). First, the Hpy188I R-M genes are not present in 9 of 11 strains tested (21). Second, although hpyIM is functionally conserved in all strains tested (20), its cognate RE gene, iceA1, is present in less than half of the strains, whereas the entirely unrelated iceA2 occupies the same locus in other strains (19, 23).

The total numbers of REs identified in each of the six strains vary from two to six. Thus far, CH4 represents the H. pylori strain with the largest number of REs identified at the biochemical level. Although it was predicted that strain J99 has 15 type II R-M systems (17), we found activity of only three REs from this strain. One explanation may be that some RE activity was actually present in J99, but could not be detected; such low activity could be attributable to a low level of expression, or poor enzyme stability. In any event, the detected RE activities in each of the six strains varied considerably (Table 2). Another explanation is that some of these R-M systems are degenerate and no longer functional, which clearly is the case for iceA1 (23), an NlaIII homolog in H. pylori. Most iceA1 genes are prematurely truncated and are not predicted to encode a functional RE. Screening three iceA1 strains in this study revealed that only CH4 encoded a functional NlaIII-like RE, consistent with the fact that the iceA1 gene in CH4 is intact whereas those in 60190 and J166 are degenerate with frame-shift or nonsense mutations (23).

H. pylori plasmids have been reported previously and are also evident in strains CH4, 97762, and 98518 in this study (Fig. 1). R-M systems may be encoded by plasmids in bacteria (2426). However, cloning of the R-M genes from H. pylori, including the sequenced strains J99 and 26695 and CH4, which possesses an unusually high number of REs, shows that all of the genes are chromosomal (ref. 21; R.D.M., Q.X., and H. M. Kong, unpublished work). These data suggest that the H. pylori R-M systems are largely, if not exclusively, encoded chromosomally.

N. gonorrhoeae differs from other bacteria by possessing a high number of type II M systems and at least seven linked REs (22, 2730). The genes of Type II M systems in this organism are generally conserved among strains and always are expressed to keep the host genome modified. In contrast, the cognate RE genes sometimes are silent, possibly because of specific regulation of their gene expression. Thus, although R-M systems in N. gonorrhoeae are similar at the gene level, expression of their REs varies among different isolates (22). H. pylori resembles N. gonorrhoeae in possessing an unusually high number of R-M systems (16, 17), but differs in that different sets of R-M systems are present in each strain at the gene level. Thus, H. pylori is unique because of the unusually high numbers and diversity of its R-M systems.

It is not clear why H. pylori possesses so many different R-M systems. One major function of R-M systems in bacteria is to protect the host from bacteriophage infections. An advantage of having diversified R-M systems among a population of strains is that when one strain is successfully infected by a phage, other strains, possessing different R-M systems, can be protected. Thus, the entire bacterial population will not be destroyed. We speculate that R-M systems must play other important roles in this organism. Because H. pylori is naturally competent (31), possessing a large number of R-M systems may prevent exogenous DNA from integrating into the host genomic DNA. All of the REs identified in H. pylori have 4- or 5-bp recognition sequences, suggesting that efficient digestion of DNA is required for this organism. There are two highly conserved methyltransferases, M.Hpy99II and M.HpyI, among the 16 strains examined, modifying H.pylori DNA at GCGC and CATG sites, respectively. Strong conservation of these two methyltransferases in contrast to the diversified R-M systems in H. pylori suggests that GCGC and CATG-methylation may be involved in important cellular processes. For example, dam methylation (at GATC) is essential for regulation of chromosomal DNA replication and mismatch repair in other organisms (712). Modification at GATC has been found in most but not all of the 16 H. pylori strains examined (Q.X. and M. J. Blaser, unpublished work). It is conceivable that other methyltransferases, perhaps M.Hpy99II or M.HpyI, play a similar role in H. pylori.

Whereas H. pylori is cosmopolitan and highly prevalent in most populations (1315), fewer than 10% of people carrying the bacteria develop H. pylori-associated diseases. One explanation may be that H. pylori strains of particular genotypes are more virulent than other strains (32, 33). Typing systems based on genomic variability have been used to discriminate H. pylori isolates for epidemiological and clinical purposes (3437). Because diversified R-M systems in H. pylori are major strain-specific genes (17), the modification status of chromosomal DNA may be used to develop a new system for typing H. pylori isolates. In this study, all 16 strains examined had unique DNA modification profiles, demonstrating the potential discriminatory power of this typing system. A typing scheme based on DNA modification status may become a simple, reproducible, and discriminatory means of documenting diversity within H. pylori populations.

Finally, although numerous Type II REs with differing specificities are currently available, there is a continuing need for additional type II REs, because new enzymes recognizing novel sequences improve flexibility for genetic manipulation, and enable specialized applications (38). Finding REs with novel specificities is becoming increasingly difficult. However, because H. pylori has such an unusually high number of R-M systems that are diversified among strains, this organism could be an excellent source for the discovery of new type II REs. Thus far, nearly 30 type II REs with a total of 22 distinct specificities (39, 40), including those from this study, have been identified from H. pylori. Importantly, from a screen of six H. pylori strains, we found three novel type II REs, an unusually high number. It seems possible that more REs with novel specificities will be identified from H. pylori, as more screening efforts are made.

Acknowledgments

This work was supported in part by the Dissertation Enhancement Grant from the Vanderbilt Graduate School and National Institutes of Health Grants GM56534 and DK53707.

Abbreviations

R

restriction

M

modification

RE

restriction endonuclease

HAP

hydroxylapatite

Footnotes

This paper was submitted directly (Track II) to the PNAS office.

References

  • 1.Roberts R J, Halford S E. In: Nucleases. Linn S M, Lloyd S, Roberts R J, editors. Plainview, NY: Cold Spring Harbor Lab. Press; 1993. pp. 35–88. [Google Scholar]
  • 2.Wilson G G, Murray N E. Annu Rev Genet. 1991;25:585–627. doi: 10.1146/annurev.ge.25.120191.003101. [DOI] [PubMed] [Google Scholar]
  • 3.Arber W, Dussoix D. J Mol Biol. 1962;5:18–36. doi: 10.1016/s0022-2836(62)80058-8. [DOI] [PubMed] [Google Scholar]
  • 4.Dussoix D, Arber W. J Mol Biol. 1962;5:37–49. doi: 10.1016/s0022-2836(62)80059-x. [DOI] [PubMed] [Google Scholar]
  • 5.Kuhnlein U, Arber W. J Mol Biol. 1972;63:9–19. doi: 10.1016/0022-2836(72)90518-9. [DOI] [PubMed] [Google Scholar]
  • 6.Smith J D, Arber W, Kuhnlein U. J Mol Biol. 1972;63:1–8. doi: 10.1016/0022-2836(72)90517-7. [DOI] [PubMed] [Google Scholar]
  • 7.Messer W, Noyer-Weidner M. Cell. 1998;54:735–737. doi: 10.1016/s0092-8674(88)90911-7. [DOI] [PubMed] [Google Scholar]
  • 8.Smith D W, Garland A M, Herman G, Baker T A, Zyskind J W. EMBO J. 1985;4:1319–1326. doi: 10.1002/j.1460-2075.1985.tb03779.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Braaten B A, Nou X, Kaltenbach L S, Low D A. Cell. 1994;76:577–588. doi: 10.1016/0092-8674(94)90120-1. [DOI] [PubMed] [Google Scholar]
  • 10.Heithoff D M, Sinsheimer R L, Low D A, Mahan M J. Science. 1999;284:967–970. doi: 10.1126/science.284.5416.967. [DOI] [PubMed] [Google Scholar]
  • 11.Roberts D, Hoopes B C, McClure W R, Kleckner N. Cell. 1985;43:117–130. doi: 10.1016/0092-8674(85)90017-0. [DOI] [PubMed] [Google Scholar]
  • 12.Modrich P. J Biol Chem. 1989;264:6597–6600. [PubMed] [Google Scholar]
  • 13.Berg D E, Logan R P H. BioEssays. 1997;19:86–90. doi: 10.1002/bies.950190114. [DOI] [PubMed] [Google Scholar]
  • 14.Blaser M J, Parsonnet J. J Clin Invest. 1994;94:4–8. doi: 10.1172/JCI117336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Goodwin C S, Armstrong J A, Chilvers T, Peters M, Collins M D, Sly L, McConnell W, Harper W. Int J Syst Bacteriol. 1989;39:397–405. [Google Scholar]
  • 16.Tomb J F, White O, Kerlavage A R, Clayton R A, Sutton G, Fleischmann R D, Ketchum K A, Klenk H P, Gill S, Dougherty B A, et al. Nature (London) 1997;388:539–547. doi: 10.1038/41483. [DOI] [PubMed] [Google Scholar]
  • 17.Alm R A, Ling L L, Moir D T, King B L, Brown E D, Doig P C, Smith D R, Guild B C, deJonge B L, Carmel G, et al. Nature (London) 1999;397:176–180. doi: 10.1038/16495. [DOI] [PubMed] [Google Scholar]
  • 18.Akopyants N S, Fardkov A, Diatchenko L, Hill J E, Siebert P D, Lukyanov S, Sverdlov E D, Berg D E. Proc Natl Acad Sci USA. 1998;95:13108–13113. doi: 10.1073/pnas.95.22.13108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Peek R M, Jr, Thompson S A, Atherton J C, Blaser M J, Miller G G. Proc Assoc Am Phys. 1998;110:531–544. [PubMed] [Google Scholar]
  • 20.Xu Q, Peek R M, Miller G G, Blaser M J. J Bacteriol. 1997;179:6807–6815. doi: 10.1128/jb.179.21.6807-6815.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Xu Q, Stickel S, Roberts R J, Blaser M J, Morgan R D. J Biol Chem. 2000;275:17086–17093. doi: 10.1074/jbc.M910303199. [DOI] [PubMed] [Google Scholar]
  • 22.Stein D C, Gunn J S, Radlinska M, Piekarowicz A. Gene. 1995;157:19–22. doi: 10.1016/0378-1119(94)00649-d. [DOI] [PubMed] [Google Scholar]
  • 23.Figueiredo C, Quint W G V, Sanna R, Sablon E, Donahue J P, Xu Q, Miller G G, Peek R M, Blaser M J, van Doorn L J. Gene. 2000;246:59–68. doi: 10.1016/s0378-1119(00)00054-8. [DOI] [PubMed] [Google Scholar]
  • 24.Zakharova M V, Beletskaya I V, Kravetz A N, Pertzev A V, Mayorov S G, Solonin A S. Gene. 1998;208:177–182. doi: 10.1016/s0378-1119(97)00637-9. [DOI] [PubMed] [Google Scholar]
  • 25.Lubys A, Janulaitis A. Gene. 1995;157:25–29. doi: 10.1016/0378-1119(94)00795-t. [DOI] [PubMed] [Google Scholar]
  • 26.Wayne J, Holden M, Xu S Y. Gene. 1997;202:83–88. doi: 10.1016/s0378-1119(97)00457-5. [DOI] [PubMed] [Google Scholar]
  • 27.Korch C, Hagblom P, Normark S. J Bacteriol. 1983;142:1325–1332. doi: 10.1128/jb.155.3.1324-1332.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Norlander L, Davies J K, Hagblom P, Normark S. J Bacteriol. 1981;140:788–795. doi: 10.1128/jb.145.2.788-795.1981. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Piekarowicz A, Stein D C. Gene. 1995;157:101–102. doi: 10.1016/0378-1119(94)00531-v. [DOI] [PubMed] [Google Scholar]
  • 30.Ritchot N, Roy P H. Gene. 1990;76:103–106. doi: 10.1016/0378-1119(90)90120-g. [DOI] [PubMed] [Google Scholar]
  • 31.Suerbaum S, Smith J M, Bapumia K, Morelli G, Smith N H, Kunstmann E. Proc Natl Acad Sci USA. 1998;95:12619–12624. doi: 10.1073/pnas.95.21.12619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Cover T L, Glupczynski Y, Lage A P, Burette A, Tummuru M K, Perez-Perez G I, Blaser M J. J Clin Microbiol. 1995;33:1496–1500. doi: 10.1128/jcm.33.6.1496-1500.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.van Doorn L J, Quint W G V, Schneeberger P M, Tytgat G N J, de Boer W A. Lancet. 1997;350:71–72. doi: 10.1016/s0140-6736(05)66280-0. [DOI] [PubMed] [Google Scholar]
  • 34.Burucoa C, Lhomme V, Fauchere J L. J Clin Microbiol. 1999;37:4071–4080. doi: 10.1128/jcm.37.12.4071-4080.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kansau I, Raymond J, Bingen E, Courcoux P, Kalach N, Bergeret M, Braimi N, Dupont C, Labigne A. Res Microbiol. 1996;147:661–669. doi: 10.1016/0923-2508(96)84023-x. [DOI] [PubMed] [Google Scholar]
  • 36.Owen R J, Fraser J, Costas M, Morgan D, Morgan D R. J Clin Pathol. 1990;43:646–649. doi: 10.1136/jcp.43.8.646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Taylor N S, Fox J G, Akopyants N S, Berg D E, Thompson N, Shames B, Yan L, Fontham E, Janney F, Hunter F M, et al. J Clin Microbiol. 1995;33:918–923. doi: 10.1128/jcm.33.4.918-923.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Schiestl R H, Nicklen S, Petes T D. Proc Natl Acad Sci USA. 1991;88:7585–7589. doi: 10.1073/pnas.88.17.7585. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ivic A, Jakeman K J, Penn C W, Brown N L. FEMS Microbiol Lett. 1999;179:175–180. doi: 10.1111/j.1574-6968.1999.tb08724.x. [DOI] [PubMed] [Google Scholar]
  • 40.Repin V E, Puchkova L I, Ananko G G, Reshetnikov O V, Kurilovich S A. Mol Gen Microbiol Virusol. 1999;3:37–38. [PubMed] [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES