Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2015 May 19;5:9747. doi: 10.1038/srep09747

Expression and purification of a single-chain Type IV restriction enzyme Eco94GmrSD and determination of its substrate preference

Xinyi He 1,2,*, Victoria Hull 1,*, Julie A Thomas 3,, Xiaoqing Fu 1, Sonal Gidwani 1, Yogesh K Gupta 4, Lindsay W Black 3, Shuang-yong Xu 1,a
PMCID: PMC4437046  PMID: 25988532

Abstract

The first reported Type IV restriction endonuclease (REase) GmrSD consists of GmrS and GmrD subunits. In most bacteria, however, the gmrS and gmrD genes are fused together to encode a single-chain protein. The fused coding sequence for ECSTEC94C_1402 from E. coli strain STEC_94C was expressed in T7 Express. The protein designated as Eco94GmrSD displays modification-dependent ATP-stimulated REase activity on T4 DNA with glucosyl-5-hydroxymethyl-cytosines (glc-5hmC) and T4gt DNA with 5-hydroxymethyl-cytosines (5hmC). A C-terminal 6xHis-tagged protein was purified by two-column chromatography. The enzyme is active in Mg2+ and Mn2+ buffer. It prefers to cleave large glc-5hmC- or 5hmC-modified DNA. In phage restriction assays, Eco94GmrSD weakly restricted T4 and T4gt, whereas T4 IPI*-deficient phage (Δip1) were restricted more than 106-fold, consistent with IPI* protection of E. coli DH10B from lethal expression of the closely homologous E. coli CT596 GmrSD. Eco94GmrSD is proposed to belong to the His-Asn-His (HNH)-nuclease family by the identification of a putative C-terminal REase catalytic site D507-H508-N522. Supporting this, GmrSD variants D507A, H508A, and N522A displayed no endonuclease activity. The presence of a large number of fused GmrSD homologs suggests that GmrSD is an effective phage exclusion protein that provides a mechanism to thwart T-even phage infection.


Restriction endonucleases (REases) are a diverse group of DNA-cleaving enzymes that serve to protect bacteria against phage infection or invasion of mobile genetic elements (see reviews1,2). In order to overcome attack by REases, bacteriophages evolve elaborate modifications on their genomic DNA. Bacteria, in turn, develop new enzymes that can specifically target modified DNA. Modification-dependent REases such as McrBC, McrA, Mrr, GmrSD, and PvuRts1I are loosely grouped together and referred to as Type IV REases (recent review in Ref. 3). The modified bases on DNA are N6-methyladenine (N6mA) restricted by Mrr, 5-methylcytosine (5mC) restricted by McrBC and Mrr, 5-hydroxymethylcytosine (5hmC) restricted by PvuRts1I-family enzymes and McrBC, and glucosyl-5-hydroxymethyl-cytosine (glc-5hmC) restricted by AbaSI or GmrS/GmrD enzymes4,5,6,7,8,9,10. The first characterized GmrSD enzyme was found in E. coli strain CT596 and encoded by two adjacent genes gmrS and gmrD6,7. The GmrS and GmrD subunits were separately expressed as intein and chitin-binding domain (CBD) fusion proteins and cleaved off by intein cleavage. The reconstituted enzyme is active and specific for glc-5hmC-modified DNA, but it has poor activity on 5hmC- or 5mC-modified DNA. The reaction buffer for the reconstituted enzyme included UTP, Ca2+, and Mg2+6. It has been reported that T-even phages encapsidate a diverse set of internal proteins encoded at the ip1 locus that function to counteract the GmrSD nuclease activity during DNA injection7,11 (ip1, T4 phage inhibitor gene encoding IPI protein that is processed into encapsidated IPI* protein). Interestingly, a close homolog (UTI89_C2960 or UT enzyme) found in E. coli O18 K1 H7 UTI89 is a fused single-chain enzyme; the UT enzyme is insensitive to IPI* inhibition due to its altered amino acid (aa) sequence and specificity, but it does not restrict either T4 IPI*-deficient or wild-type (WT) T4 phage although it restricts many other T even-like phages such as T2 and T612. The enzymatic tools that differentially cleave 5hmC DNA are limited, since McrBC- and MspJI-family enzymes cleave both 5hmC and 5mC-modified DNA13,14. Structural studies that could determine the interactions that occur between GmrSD and its inhibitor protein, IPI*, have been hampered by the poor expression of the two chain GmrS/GmrD enzyme.

The flux of sequenced bacterial genomes has revealed that there are many GmrSD homologs in proteobacterial genomes. As with the UTI89_C2960 protein, in these homologs the gmrS and gmrD genes are fused together to form a single gene, which may encode a single-chain GmrSD enzyme. The goal of this work was to evaluate the endonuclease activity of such a single-chain GmrSD homolog found in the genome of E. coli strain STEC_94C and to develop methods for simple purification of the target protein. In addition, we studied the metal ion requirement and preferred substrate size for Eco94GmrSD, and identified a potential endonuclease catalytic site (a conserved nuclease motif Asp-His-Asn (D-H-N) in its C-terminus). We found that the single-chain enzyme is capable of cleaving 5hmC and glc-5hmC DNA in vitro. This property differs from the two-chain GmrS/GmrD enzyme complex that only cleaved glc-5hmC DNA. However, despite this difference in in vitro substrate sensitivity we found that the phage restriction activity of Eco94GmrSD is very similar to that of the two-chain GmrS/GmrD: Eco94GmrSD only weakly restricted WT T4 and T4gt (deficient in α-, β-glucosyltransferase (gt) phages), but strongly restricted T4Δip1 phage (about a million fold). The possible involvement of GmrSD-like enzyme in the bacterial immigration control region (ICR) is also discussed.

Results

The hypothetical protein ECSTEC94C_1402 (GenBank accession #: WP_000834395) from E. coli STEC_94C has 629 amino acid (aa) residues. It displays 93% aa sequence identity to the GmrSD fusion protein found in E. coli UTI89 (UTI89_C2960, EcoUTI89GmrSD or UT enzyme) (see sequence alignment in Supplementary Fig. S1). The ECSTEC94C_1402 gene is located on a 41.5 kb region of the ECSTEC94C genome diagnosed by “Phast” to be a prophage most similar to Shigella phage SfII (NC_021857). This similarity is mostly over the first 17 kb of the SfII genome (97% by blastn analysis), a region that encodes the major morphogenesis genes, although there are other shorter homologous regions. Notably, the Eco94GmrSD gene is immediately downstream of the genes predicted to form the phage tail fibers that are responsible for host adsorption. This is a morphogenic region known to evolve rapidly to adapt to changes in host cell receptors. Notably, Shigella phage SfII does not encode a GmrSD homolog at this position (or elsewhere). We hypothesize that the GmrSD gene being prophage-borne indicates it has some role in an evolutionary arms race between phage and host, supported by its in vitro and in vivo restriction properties (see below). EcoCT596GmrSD (CT enzyme) was also encoded by a curable prophage that restricted T4ip1 and rII mutants.

Three major differences were found when Eco94GmrSD sequence was compared to the prototype EcoCT596GmrS/GmrD as shown in Supplementary Fig. S1: 1) Eco94GmrSD lacks 3-aa residues Ser97-Leu98-Ala99; 2) Eco94GmrSD carries one additional amino acid difference (Arg313 in Eco94GmrSD vs Gln313 in the two-chain CT enzyme); 3) Eco94GmrSD contains 84-aa residues as a connector of the GmrS and GmrD subunits, which fused two subunits into a single-chain peptide. Recently, the gene sequence encoding the CT enzyme has been resequenced and updated as a single gene that restricts glc-5hmC containing T-even phages as cloned in a pBeloBac11 (single copy vector)(Genbank ID: AF493796_1). It is now apparent that the two-chain GmrSD originally cloned had suffered mutational events, but still retained activity. Subcloning of the ecoCT596gmrSD gene into higher copy plasmids was enabled by introduction of a stop codon which caused a truncation product GmrS, and re-initiation product GmrD with a small deletion between the gmrS and gmrD genes (see below)6.

We used two expression strategies to express ECSTEC94C_1402. One strategy was to clone its ORF in pET21b in fusion with a C-terminal 6xHis tag (Eco94GrmSD-6xHis) and purification through a nickel-NTA agarose column. Another method was to clone its ORF in fusion with an intein and CBD, the same strategy proved successful in expression of the two-chain GmrS/GmrD originally cloned from E. coli strain CT596 (see Supplementary information). The Eco94GrmSD enzyme purified by two different methods (6xHis-tagged protein via nickel column or intein-CBD tagged protein via chitin column) share nearly identical enzyme properties except that the His-tagged enzyme displays higher specific activity (see below).

Expression and purification of 6xHis-tagged single-chain GmrSD (GmrSD-6xHis) and endonuclease activity assay

In one expression strategy we cloned the single-chain eco94gmrSD gene into pET21b and purified it with a C-terminal 6xHis-tag (GmrSD-6xHis). This protein was insoluble if IPTG-induction was carried out at 37°C, however it expressed well at 16°C to 18°C in co-overexpression of GroEL/GroES protein from a compatible plasmid (data not shown). GmrSD-6xHis protein was purified by nickel-NTA agarose column chromatography, and further purified via a heparin column. Most of the GroEL protein was removed by the second step. Fig. 1 shows the partially purified GmrSD-6xHis enzyme (pooled fractions from heparin for activity assay) and its low enzyme activity on T4 (panel C, lanes 1–2), T4gt (lanes 4–5), and λ DNA (lanes 7–8). The endonuclease activity was strongly stimulated by addition of 1 mM ATP in digestion of T4 and T4gt DNA (lanes 10–11, 13–14); poor activity was detected on λ DNA (Dam+ Dcm+). In a control experiment, T4, T4gt, and λ DNAs were digested by MluCI (AATT) whose activity was not affected by cytosine modifications. The specific activity of the purified enzyme was estimated to be ~500 units/mg protein on T4 DNA (see unit definition in Methods). The final protein yield was estimated at 4 mg/L of IPTG/arabinose-induced cells. It appeared that the GmrSD enzyme displayed lower cleavage activity on T4gt DNA compared to T4 (less than 2-fold difference).

Figure 1. SDS-PAGE analysis of purified Eco94GrmSD-6xHis protein and agarose gel analysis of endonuclease activity assay.

Figure 1

(A). Purified C-terminal 6xHis-tagged GrmSD from a nickel column (Ni). The enzyme was purified from cell lysate of T7 Express [pET21b-gmrSD, pGro7]. L, protein ladder. (B). SDS-PAGE analysis of purified GmrSD fractions from a heparin column. (C). Digestion of T4 (glc-5hmC), T4gt (5hmC) and λ DNA (Dam+Dcm+) by purified GmrSD-6xHis in the presence (+) or absence (−) of 1 mM ATP. Lanes 3, 6, and 9, MluCI digested DNAs. One μl or 5 μl of GmrSD (~0.5 μg/μl, 0.14 or 0.70 μM) was used to digest 1 μg DNA in buffer 2 plus or minus 1 mM ATP.

Metal ion and dithiothreitol (DTT) requirement for GmrSD endonuclease activity

The purified Eco94GmrSD was tested for activity on T4 DNA in a basic buffer (50 mM NaCl, 10 mM Tris-HCl, pH 7.5, 1 mM DTT) supplemented with different divalent cations. Eco94GmrSD was active in digestion of T4 DNA when the basic buffer was supplemented with Mg2+ or Mn2+ (Fig. 2A, lanes 1–4). It was interesting to note that metal ions can modulate the relative endonuclease activity. At 1 mM of divalent cation, the enzyme is more active in the presence of Mn2+ than Mg2+. At 10 mM of the metal ion, GmrSD is more active in Mg2+ than Mn2+. The free Mg2+ concentration in E. coli cells was estimated at 1 to 2 mM15,16. The intracellular concentration of Mn2+ in bacteria was estimated at μM range (www.bionumbers.hms.harvard.edu). The Eco94GmrSD enzyme shows poor activity with other metal ions such as Ca2+, Co2+, Ni2+, or Zn2+ (Fig. 2A, lanes 5–9). We also compared GmrSD endonuclease activity in 10 μM, 0.1 and 1 mM of Co2+, Ni2+, or Zn2+ (since high concentration of transition metal ions may inhibit activity). Eco94GmrSD displayed very low activity in 0.1 mM Co2+ or Zn2+ (data not shown). GmrSD nuclease activity on T4 DNA was clearly detected in Mn2+ buffer (optimal concentration at 1 mM MnCl2). But this low nuclease activity was independent of ATP cofactor (see Supplementary Fig. S2A, lanes 1–4). It was somewhat unexpected that addition of 1 mM ATP could inhibit GmrSD activity in Mn2+ buffer (lanes 6–8). In a control digestion, GmrSD degraded T4 DNA into small fragments (100–300 bp) in NEB buffer 2 and 4 with 10 mM Mg2+. It was puzzling that the same ATP cofactor could have a positive stimulatory effect on GmrSD nuclease activity in Mg2+ buffer, but it exerts a negative inhibitory effect in Mn2+ buffer (at 0.1 to 0.5 mM). It is well known that HNH-family endonucleases are more promiscuous in metal ion cofactor requirement for catalytic activity17,18,19. Perhaps the negatively regulatory loop by ATP provides a safeguard to GmrSD star activity on unmodified DNA when GmrSD enzyme is “accidently” bound by Mn2+ ions. To see whether GmrSD enzyme displays any nuclease activity on λ DNA in Mn2+ buffer, λ DNA was digested by GmrSD in the absence or presence of 1 mM ATP. Supplementary Fig. S2B shows that GmrSD caused some λ DNA smearing as an indication of low nuclease activity. The supplement of 1 mM ATP appeared to inhibit nuclease activity at 0.1 to 0.5 mM Mn2+. In a control digestion, GmrSD enzyme shows no smearing in NEB buffer 2 and a low level of smearing in buffer 4. We speculate that GmrSD enzyme displays relaxed specificity (star activity) in Mn2+ buffer (since it partially cleaved non-glc-5hmC or non-5hmC DNA). This star activity is consistent with the observation that GmrSD over-expression in a RecA-deficient E. coli host was quite toxic (see below), probably caused by dsDNA breaks at star sites and the lack of RecA-mediated DNA recombination and repair.

Figure 2. Determination of divalent cation cofactor requirement and DNA substrate preference for GmrSD digestion.

Figure 2

(A). Metal ion cofactor requirement for GmrSD digestion. Divalent cations or EDTA are indicated on top of each lane. (B). Substrate preference and optimal substrate size for GmrSD digestion. PCR DNA substrates containing 5hmC or regular dC were generated by PCR using pBR322 template and digested by GmrSD endonuclease in the presence of 1 mM ATP in NEB buffer 2. The same DNA substrates were also digested by HpaII (CCGG) in NEB buffer 4 (to confirm modified DNA).

Eco94GmrSD enzyme gradually loses activity during storage at −20°C, however, its activity can be restored by addition of fresh DTT (data not shown). There are seven Cys residues in Eco94GmrSD enzyme and presumably oxidation of these Cys residues may contribute to lower activity during storage. The optimal temperature for Eco94GmrSD activity was determined to be 37°C (see Supplementary Fig. S3).

Preferred substrate and substrate size for the single-chain Eco94GmrSD

To study the substrate size preference we used PCR products that contain 5hmC incorporated during PCR by including 5hm-dCTP in PCR reactions. PCR products (3.8, 1.9, 1.0, 0.5, and 0.3 kb) containing 5hmC or unmodified dC were purified by spin columns and digested with Eco94GmrSD. 5hmC-modified PCR DNA substrates (3.8 kb, 1.9 kb, 1.0 kb) were efficiently digested; while modified PCR products in 0.3 and 0.5 kb were cleaved with reduced efficiency (Fig. 2B). PCR products (same sizes) with regular dC were poorly digested by Eco94GmrSD at the same enzyme concentration tested (Fig. 2B). This result is consistent with the substrate preference for modified DNA (T4gt) shown in Fig. 1. In a control experiment, 5hmC-modified PCR substrates were resistant to HpaII digestion and PCR DNAs with unmodified cytosine were digested by HpaII (Fig. 2B, right panel). A 60-bp PCR fragment containing 5hmC-N20-G (two 5hmC on the opposite strands separated by 20 bp) was partially cleaved by Eco94GmrSD; but the 60mer with 5hmC-N10-G (two 5hmC on the opposite strands separated by 10 bp) was not cleaved (Supplementary Fig. S4). We also cloned and sequenced some GmrSD cleavage products of T4gt and determined the cut sites (see Supplementary information and Table S1). The common feature of these cut sites was 5hmC N(17–23) G (two 5hmC on the opposite strands separated by 17–23 bp), where cleavage takes place mostly at the symmetric sites 5hmC N(9–11)↓N(9–11)G. Sequencing of more cleavage products are required to pinpoint the substrate preference and the effect of flanking sequence on cleavage efficiency of the modified 5hmC-containing DNA.

NTP- and dNTP-stimulated GmrSD endonuclease activity

In the previously published report, NTP stimulated the CT enzyme activity6. Therefore, we examined the endonuclease activity in the presence of NTP, dNTP, or non-hydrolysable γ-S-ATP. Fig. 3A and 3B show strong stimulation of endonuclease activity by addition of 0.1 to 1 mM ATP (but higher concentration of ATP at 10 mM inhibits activity). Stimulation of activity was also detected at 2 mM ATP concentration (data not shown). Supplement of 0.1, 0.5, and 1 mM CTP or UTP had a minimal effect. Addition of GTP (0.5–1 mM) also had a moderate effect on enzyme activity. Supplement of dATP (1–10 mM), or dTTP (1–10 mM) also strongly stimulated the endonuclease activity, while dCTP and dGTP have moderate effect. But addition of non-hydrolysable γ-S-ATP (1–2 mM) had no stimulatory effect on enzyme activity. We have not directly measured NTP hydrolysis in GmrSD cleavage reactions.

Figure 3. Stimulation of GmrSD endonuclease activity by supplement of NTP or dNTP in digestion of T4 DNA.

Figure 3

No stimulator effect on enzyme activity was detected by supplement of non-hydrolysable γ–S-ATP. NTP or dNTP concentrations were indicated on top of each lane.

Site-directed mutagenesis of a putative catalytic site in the C-terminal domain

In a protein homology search, Eco94GmrSD had a weak hit with His-metal finger nuclease family (conserved amino acid residues DHxxP). The putative endonuclease active site residues located near the C-terminus are D507-H508-N522-(N528-N535) with additional Asn/Gln/Lys residues in close proximity (conforming to DH-N-N catalytic site). The HNH (HNK or HNN) motif is found in Colicin nucleases, homing endonucleases, DNA repair enzymes, REases, DNA nicking enzyme, transposase, type II intron-encoded reverse transcriptase, and Cas920,21,22,23,24,25,26,27. To investigate the importance of these residues, six GmrSD variants D507A, H508A, C517A, N522A, N528A, and N535A were constructed by site-directed mutagenesis and the mutant proteins were purified by nickel column chromatography. Three inactive mutants (D507A, H508A, and N522A) and three partially active variants (C517A, N528A, and N535A) were further purified by heparin column and analyzed by SDS-polyacrylamide gel electrophoresis (SDS-PAGE) (Fig. 4). The protein yield and purity of D507A, H508A, C517A, N528A, and N535A were comparable to the WT enzyme, but the N522A variant showed reduction in protein yield and purity after heparin column. Fig. 4B and 4C shows that GmrSD variants D507A, H508A, and N522A are devoid of endonuclease activity. Variants C517A, N528A, and N535A are partially active (about ~25% to 50% of WT activity. Although the relative activity estimates were crude, C517, N528, and N535 could be ruled out as potential catalytic residues. The relative activity of WT and mutants are summarized in Table 1.

Figure 4. Analysis of partially purified WT Eco94GmrSD and mutant proteins D507A, H508A, C517A, N522A, N528A, N535A, E271A/E273A, E278A/K280A on SDS-PAGE and endonuclease activity assays for the mutant enzymes on T4 DNA.

Figure 4

(A). SDS-PAGE analysis of WT and mutant proteins. Left panel: WT and mutant proteins (D507A, H508A, N522A, and N535A) purified by nickel-NTA agarose and heparin HP columns (purified D507A showed aberrant migration). Right panel: WT, E271A/E273A, E278A/K280A, C517A and N528A proteins. (B and C). Endonuclease activity assay for WT and GmrSD variants D507A, H508A, C517A, N522A, N528A, N535A on T4 DNA. The amount of input protein was 0.5 μg, 1 μg, and 2 μg, respectively in digestion of 1 μg T4 DNA. For the double mutants E271A/E273A and E278A/K280A, the amount of input protein was 0.5 μg, 1 μg, 1.5 μg and 2 μg, respectively in digestion of 1 μg T4 DNA.

Table 1. Summary of endonuclease activity on T4 DNA and protein expression levels of WT and mutant forms of Eco94GmrSD.

Enzyme Endonuclease activity Protein expression level
WT +++ Active (100%) ++
C-terminus mutants (1 to 6)    
1) D507A Inactive (binding+/−) ++
2) H508A Inactive (binding+) ++
3) C517A + Partially active ++
4) N522A Inactive (binding+/−) +
5) N528A + Partially active ++
6) N535A + Partially active ++
Possible Eco94GmrSD endonuclease catalytic site: D507-H508-N522. Binding+, bound/shifted complexes detected at all four protein concentrations tested; binding+/−, bound/shifted complexes were detected only at high enzyme concentrations.

The mutagenesis results indicate that the critical amino acids of the GmrSD endonuclease catalytic site are likely residues D507, H508, and N522 (See Supplementary Fig. S5 for a model of the predicted active site). This catalytic site is similar to that found in I-HmuI and I-PpoI homing endonucleases and other HNH-family nucleases19,20,28. When a catalytic residue of a REase is mutated, the mutant protein is still capable of DNA binding and this can be detected by DNA mobility shift assay (DNA-REase complexes migrated slower than substrate DNA in native PAGE)29,30. Purified WT GmrSD, D507A, H508A, and N522A were used to bind a 266-bp PCR fragment containing 5hmC or dC. Two major bound complexes by the WT enzyme were detected and most of the substrate DNA was bound and shifted to the top of gel at high enzyme concentration in an EDTA buffer (no divalent cation, data not shown). It is known that divalent cations can modulate REase specificity: KpnI displays high specificity (low star activity) in Ca2+ buffer compared to its specificity in Mg2+ and Mn2+ buffers24. Similarly, divalent cations enhanced the binding specificity of EcoRV catalytic-deficient mutants31. Therefore, we examined DNA binding in a buffer with cofactors MgCl2 and ATP (binding at room temperature for 10 min to minimize cleavage activity). Fig. 5A shows that two bound complexes were detected on both dC and 5hmC substrates by the WT enzyme. D507A appeared to have reduced DNA binding affinity than the WT enzyme (Fig. 5B) (a large complex is not discernable due to a large DNA fragment present in the substrate DNA). Similar to the WT enzyme, H508A variant also caused gel shift of both 5hmC- and dC-DNAs in the binding assay and appeared to have enhanced binding activity since all the substrate was shifted to the loading well at 60:1 protein to DNA molar ratio (Fig. 5C, lanes 4 and 8). N522A variant protein appeared to have reduced DNA binding affinity to 5hmC DNA: a major bound complex was detected at 100, 250, and 500 ng protein in the gel shift assay for dC-PCR DNA (Fig. 5D, lanes 2–4), but only weak complex formation was detected for 5hmC-PCR DNA at high enzyme concentration (Fig. 5D, lane 8). To further confirm the DNA binding activity of D507A, H508A, and N522A mutant proteins, T4 MluCI (AATT) restriction fragments were used in the DNA mobility shift assay. The WT enzyme and H508A showed similar binding complexes except that at high enzyme concentration all the substrates were shifted up by H508A; D507A and N522A proteins displayed lower affinity and produced shifted/bound complex(s) only at high enzyme concentrations (data not shown). It was concluded that H508A variant is a binding-proficient and cleavage-deficient mutant that fits the definition of catalytic mutant of REases. The binding results on cleavage-deficient mutants D507A and N522A were not conclusive, but suggesting D507 and N522 may be involved in both binding (specificity determination) and catalysis. Further biochemical and structure analysis are needed to refine the roles of D507 and N522 residues.

Figure 5. DNA mobility shift assays for WT GmrSD (panel A) and its variants D507A (panel B), H508A (panel C), N522A (panel D) in the presence of 10 mM Mg2+ and 1 mM ATP.

Figure 5

Bound DNA was resolved in 10% TBE native gels. Two PCR substrates were used: 266-bp dC-DNA (unmodified) and 266-bp 5hmC DNA (modified). Arrows indicate bound complexes (shifted bands).

Restriction of phage by GmrSD endonuclease

The proposed biological role of GmrSD endonuclease is to serve as a phage exclusion protein (the resident prophage expressing GmrSD to restrict incoming phage with sugar-modified DNA). To counteract GmrSD restriction, T4-like phages evolved inhibitor proteins (internal proteins) such as encapsidated IPI* to inhibit GmrSD activity following T4 DNA ejection into the host cytoplasm6,12. To determine if Eco94GmrSD was capable of phage restriction, we tested it against WT T4, several T4 mutants and λ phage. We tested its phage plating efficiency using the T7 Express strain containing pET21-eco94gmrSD (under constitutive expression, no IPTG added) and used pET21b vector as a control. Consistent with the in vitro result of poor cleavage activity on λ DNA, Eco94GmrSD did not restrict λ phage (Table 2). Eco94GmrSD restricted T4 and T4gt by 15 to 20-fold (Table 2) and this lack of strong restriction could be attributed to the counter measure evolved by T4 phage. T4 phage co-eject inhibitor protein (IPI*, ~360 copies per viral capsid) into the host cytoplasm; this inhibitor protein can antagonize GmrSD nuclease activity and overcome the phage exclusion mechanism, leading to successful phage DNA replication and virus packaging12. Consistent with this explanation, Eco94GmrSD strongly restricted T4Δip1 (T4 mutant eG506, IPI*-deficient). T4Δip1 plating efficiency on Eco94GmrSD expressing strain under non-induced condition is in the range of 10−6 to 10−7. In a control experiment, DH10B cells expressing EcoCT596GmrSD from a single copy pBeloBAC plasmid restricted T4 and T4gt at 5 to 10-fold, and restricted T4 Δip1 phage at ~106-fold. Similarly, the phage restriction activity by phage spot test (10 μl of the diluted phage was spotted on a host cell lawn pre-plated with soft agar) is shown in Fig. 6. Consistent with the phage titers (EOP) in restriction assay, the expression of Eco94GmrSD endonuclease strongly restricted T4Δip1 in the phage spot test (Fig. 6, bottom panel).

Table 2. Restriction of phages by Eco94GmrSD endonuclease.

  PFU/ml on T7 Express [pET21]a PFU/ml on T7 Express [pET21-gmrSD]a How many fold of restriction by GmrSDb
T4 4.4 × 109 (±0.3 × 109) 3.0 × 108 (±0.2 × 108) 15-fold
T4gt 4.2 × 109 (±0.3 × 109) 2.1 × 108 (±0.2 × 108) 20-fold
T4 Δip1 (IPI*-deficient) 2.2 × 109 (±0.2 × 109) Less than 103 (No plaque at 100-fold dilution) More than 106-fold
λvir 2.8 × 109 (±0.4 × 109) 3.0 × 109 (±0.4 × 109) No restriction
T4 eG192 (IPII IPIII) 5.8 × 109 (±0.2 × 109) 2.7 × 108 (±0.5 × 108) 21-fold
a.PFU/ml are average of three plating numbers (SD± is shown in parenthesis).
b.Ratio of PFU/ml on vector strain and Eco94GmrSD expressing host.

Figure 6. Phage spot tests for T4, T4gt, T4Δip1 (IPI*-deficient), T4 eG192 IPI+ (ΔIPII ΔIPIII), and λvir on E. coli strains expressing EcoCT596GmrSD or Eco94GmrSD.

Figure 6

(A). Two strains DH10B carrying pBeloBAC vector or pBeloBAC-EcoCT596GmrSD were used for comparison. The difference in phage spot (plaques) formation is most evident at 106-fold dilution where EcoCT596GmrSD restricted T4 and T4gt at approximately 5-fold. T4Δip1 (IPI*-deficient) failed to form plaques on EcoCT596GmrSD-expressing strain (input phage ~2–3 × 105 pfu). (B). T7 Express [pET21] and T7 Express [pET21-Eco94GmrSD] strains were used for phage spot tests. Eco94GmrSD moderately restricted T4, T4gt, and T4 eG192, and it did not restrict λvir. T4Δip1 phage was strongly restricted by Eco94GmrSD (no plaque formation at 100-fold dilution, estimated phage input ~2–3 × 105 pfu).

Co-expression of ecoCT596gmrSD and ip1 (IPI*) genes to alleviate toxicity

In the native strain the ecoCT596gmrSD expression may be tightly regulated (or because of an unknown detoxification mechanism carried by the surrounding prophage-encoded gene products), E. coli CT596 cells show normal growth. In a heterologous host, RecA-deficient E. coli DH10B with a single copy plasmid carrying the ecoCT596gmrSD gene restricts T4-like glc-5hmC containing phages. However, subcloning of this gene into higher copy plasmids (pBR322-based ColE1 origin) was toxic: successful cloning was apparently enabled by introduction of a stop codon which caused a truncation product GmrS and reinitiation product GmrD with a small deletion between the gmrS and gmrD genes (Genbank ID: AF493796_1)6. (i.e. the two-chain GmrS and GmrD were the result of cloning artifact that still retains endonuclease activity on T4 DNA). Toxicity of the ecoCT596gmrSD gene in DH10B was reflected as less than ~10−6 survivors by even low level expression from vector pHERD20T; co-expression of the phage IPI* inhibitor protein eliminates this toxicity (see a schematic diagram in Fig. 7A, B), presumably as a result of IPI* neutralizing activity towards GmrSD REase. Table 3 summarizes the growth of different T4-related phages on cell lawns of E. coli DH10B expressing either CT596GmrSD or CT596GmrSD plus IPI*. The co-expression of IPI* prevented restriction of phages normally sensitive to EcoCT596GmrSD, e.g., T2. The ultimate purification of the single-chain EcoCT596GmrSD is needed to confirm its nuclease activity and cofactor requirement in vitro. Consistent with the toxicity of over-expressed CT596GmrSD in RecA-deficient E. coli cells, constitutive expression of Eco94GmrSD from pBR322 (with a strong ribosome binding site GGAGGT-N6-ATG start codon, under Tc promoter) was quite toxic to RecA minus E. coli cells, probably as the result of GmrSD star activity (relaxed nuclease activity on dC and 5mC DNA). The toxicity was reflected by two observations: 100 to 1000 fold-lower transformation efficiency of RecA-deficient cells and poor cell lawn formation during phage infection (data not shown).

Figure 7. Co-expression of CT596GmrSD and IPI* in the same host and the proposed mechanism of anti-restriction activity of IPI*.

Figure 7

(A). Scheme of pHERD20T plasmid construct used to express both CT596GmrSD and T4 IPI* genes and inhibition of GmrSD restriction by IPI* as shown by dual gene expression. The phage restriction activities by GmrSD are summarized in Table 3. (B). A schematic diagram of packaged internal protein IPI* (a.k.a. inhibitor protein) in T4 head and its inhibition of GmrSD restriction activity following DNA/IPI* ejection into host cells.

Table 3. Growth of different T4-related phages on cell lawns of E. coli DH10B expressing either CT596GmrSD or CT596GmrSD and IPI*.

Phage Phage has IPI* pHERD20Ta pHERD20T + CT596GmrSD pHERD20T + CT596GmrSD + IPI* (see Fig. 7)
T4 Yes + + +
T4ip1HA35 No + +
T4eG192 Yes + + +
T4eG506 No + +
RB15 No + + +
T2b No + +
RB49 (no 5hmC) No + + +
T4ip1KAI No + +
a.A control of host (DH10B) containing the expression vector alone. Cultures were induced with 0.4% arabinose. A series of spots containing 102, 104, 105 and 108 phage particles were examined for growth after overnight incubation. −, indicates no phage growth; +, indicates EOP (efficiency of phage plating) >0.6; +/− indicates EOP < 0.6.
b.The co-expression of IPI* prevented restriction of phages normally sensitive to GmrSD, e.g., T2.

Discussion

ATP/GTP stimulate endonuclease activity

Although GmrSD endonuclease activity is stimulate by ATP/GTP, there is no predicted ATPase/GTPase domain in the protein by NCBI BlastP analysis. Therefore Eco94GmrSD may carry a novel type of NTPase activity. ATP binding and/or hydrolysis may help with protein translocation, tracking along the DNA substrate, or allosteric activation of the enzyme. It is known that Type IV REase McrBC requires GTP hydrolysis for endonuclease activity, and SauUSI requires ATP hydrolysis for enzyme activity32. ATP and GTP also stimulate the endonuclease activity of BceSIV (GCWGC)33. More biochemical and structural studies of GmrSD enzyme are necessary to understand the molecular mechanism of NTP/dNTP stimulation of endonuclease activity.

In log phase E. coli cells cultured in LB broth the averaged ATP concentration was calculated to be 1.54 mM16. GmrSD endonuclease activity is stimulated by a range of ATP concentrations at 0.1–2 mM with the upper limit near the physiological concentration. Conversely, a high concentration of ATP (10 mM) inhibits GmrSD activity by some yet unknown mechanism.

Eco94GmrSD cut sites

The sequenced cut sites can be summarized as 5hmC N(17–23) G (two 5hmC in the opposite strands separated by 17–23 bp), where cleavage frequently takes place at the semi-symmetric sites 5hmC N(9–11)↓N(8–11)G. Sequencing a large number of cleavage sites (cleavage products) would be required to determine the preferred cut sites. The plasmid-borne modification-dependent REase PvuRts1I prefers to cleave a symmetric site at 5′-5hmC N(11–12)↓N(9–10) G-3′9. The crystal structure of PvuRts1I has been solved recently34,35. Based on the structure, PvuRts1I variants have been engineered to preferentially cleave 5hmC-modified DNA over glc-5hmC DNA35. AbaSI endonuclease, a member of the PvuRts1I-family, cleaves DNA containing 5hmC and glc-5hmC, but not DNA containing 5mC or dC. The best substrate for AbaSI cleavage is symmetrically modified 5hmC with a 22-bp spacer (5hmC N22 G), most likely cleaved by a homotetramer36.

Domain organization of Eco94GmrSD

EcoCT596GmrSD and Eco94GmrSD both contain two conserved protein domains DUF262 (Domain of Unknown Function 262) or pfam03235 (Protein family 03235) at the N-terminus, and DUF1524 (pfam07510) at the C-terminus. The DUF1524 family proteins (pfam07510) contain the conserved amino acid motif (D/E/H)HXXP, a motif found in His-metal nuclease superfamily. It is possible that the N-terminal DUF262 domain is involved in DNA recognition and the C-terminal DUF1524 domain is involved in Mg2+/Mn2+ ion binding and DNA cleavage. A similar domain organization exists in the Type IIS restriction enzymes MnlI and FokI whose N-termini are involved in DNA binding/recognition and C-termini have functions in nuclease catalytic activity28,37. In contrast, the N-terminus of AbaSI contains a Vsr-like nuclease domain with a single catalytic site and the C-terminal domain harbors the Sra-like 5hmC-binding domain36.

The differences in enzyme properties of the single-chain Eco94GmrSD and two-chain GmrS/GmrD complex

The major differences of the single-chain Eco94GmrSD and the two-chain GmrS/GmrD are: 1) Ca2+ is not required for Eco94GmrSD activity, (Ca2+ and Mg2+ required for the two-chain enzyme), Eco94GmrSD requires Mg2+ or Mn2+ as a cofactor for catalytic activity; 2) ATP, dATP, and dTTP strongly stimulate the activity of Eco94GmrSD, while UTP, GTP and CTP simulate the activity of the CT enzyme; 3) Eco94GmrSD displays endonuclease activity on 5hmC-modified T4gt or PCR DNA containing 5hmC, but the two-chain GmrS/GmrD has poor activity on 5hmC DNA (4 aa changes and 84-aa deletion may have contributed to this altered specificity); 4) GroEL/ES protein co-purified with Eco94GmrSD similar to the two-chain enzyme, but GroEL/ES proteins can be easily removed by a heparin column chromatography.

Potential application for Eco94GmrSD endonuclease

GmrSD endonuclease activity may be utilized for in vivo detection of 5mC conversion to 5hmC. For example, E. coli dinD::lacZ “endo-blue” indicator strain ER1992 is Dcm+, McrBC, Mrr, and McrA38 (dinD, DNA damage inducible gene D). The gmrSD gene could be cloned into pACYC184 plasmid under ParaB control (chloramphenicol resistant, CmR). Co-transformation and expression of plasmid (AmpR) carrying Tet family dioxygenase will likely covert C5mCWGG to C5hmCWGG in the presence of cofactors39. The C5hmCWGG modified sites are substrates for Eco94GmrSD endonuclease. Controlled low expression of GmrSD can cause dsDNA damage and induce host SOS response. The dinD::lacZ indicator strain will likely form dark blue colonies on X-gal, Amp, Cm plate. Thus, co-expression of GmrSD and DNA hydroxylase in a dinD::lacZ indicator strain could be used to screen functional DNA demethyase variants from cDNA expression library40.

Other gmrSD genes associated with Type I and IV restriction systems in the immigration control region (ICR)

Close homologs to GmrSD are found in some pathogenic E. coli strains and more diverged homologs in other bacterial genomes. Fig. 8 shows that in some E. coli the GmrSD genes are associated with the immigration control region (ICR) that carries Type I and Type IV Mrr restriction systems3,41. The Type IV restriction enzyme Mrr restricts methylated DNA with N6mA or 5mC modifications42. The 5mC and 5hmC-dependent McrBC endonuclease (E. coli K strain) is not present in this locus in these strains. For example, the avian pathogenic E. coli strain APEC O1 genome carries two GmrSD homologs, one which is more similar to Eco94GmrSD, the 604- aa APECO1_3911 (93% identity by BlastP) and a more diverged homolog, the 733- aa APECO1_2080 (24% identity by BlastP). Both proteins contain the conserved motifs of DUF262 and DUF1524 characteristic of these enzymes. Like Eco94GmrSD and UTI89GmrSD the APECO1_3911 is likely located on a prophage (its gene is next to the putative phage tail fiber gene APECO1_3910). APECO1_2080, however, is located in an ICR that encodes a putative DNA transposase, endoribonuclease, Type I specificity (hsdS), modification (hsdM), restriction (hsdR), Mrr, and a GTPase. It is possible that APECO1_3911 and APECO1_2080 enzymes are both maintained in the same bacterium to restrict/exclude T-even phages with differences in sugar modifications and/or the two enzymes may display different immunity to the diverse inhibitor proteins (ip1 locus encoded proteins IPI*) ejected by T4-like phages. Either or both of these functions would provide more fitness to this host than those with only one (or none) GmrSD in resisting phage infection.

Figure 8. Some putative E. coli gmrSD genes associated with putative DNA transposases, Type I R-M systems and Type IV restriction systems (Mrr) in bacterial immigration control region (ICR).

Figure 8

The “Gene cluster” function on the web server kegg.jp was used to generate the table listing GmrSD homologs and associated DNA transposases and Type I and IV restriction systems. All gene product abbreviations can be found in www.kegg.jp. GmrSD homologs (second column) in some E. coli strains: E. coli O1 K1 H7 (APEC) = APECO1_2080, E. coli O6 K15 H31 536 (UPEC) = ECP_4674, E. coli O18 K1 H7 UTI89 (UPEC) = UTI89_C5048, E. coli LF82 = LF82_736, E. coli clone D i14 = i14_4938, E. coli O6 K2 H1 CFT073 (UPEC) = c5421, E. coli clone D i2 = i02_4938, E. coli ABU 83972 = ECABU_c49760, E. coli O81 ED1a (commensal) = ECED1_5210, E. coli PMV-1 = ECOPMV1_04800, E. coli O7 K1 IAI39 (ExPEC) = ECIAI39_4815, E. coli IHE3034 = ECOK1_4852, E. coli UM146 = UM146_22465, E. coli O83 H1 NRG 857C = NRG857_21960, E. coli P12b = P12B_c4425, E. coli SMS-3-5 (environmental) = EcSMS35_4888, E. coli C ATCC 8739 = EcolC_3721, E. coli O157 H7 EDL933 (EHEC) = Z5943m, E. coli O7 K1 CE10 = CE10_5085. Note, the 5mC (and 5hmC)-dependent type IV restriction genes mcrB/mcrC are replaced by gmrSD gene in these genomes.

Methods

Bacterial strains, culture media, cloning vector, and DNA substrates

E. coli B strain T7 Express (C2566) (New England Biolabs, NEB) were used for gene cloning and protein expression. E. coli cells were grown in LB or phage broth (10 g tryptone, 5 g NaCl, 0.5 g MgCl2 in 1 L) supplemented with appropriate antibiotics (Amp at 100 μg/ml, Cm at 33 μg/ml, Km at 50 μg/ml). All restriction and modification enzymes, and DNA polymerases were from NEB. The IMPACT protein expression and purification system (with pTYB1 vector, NEB) was used for GmrSD expression43. The eco94gmrSD gene (GenBank ID WP_000834395, gene flanked by NdeI and XhoI sites) was synthesized by IDT and inserted in a pIDT (kanamycin resistant, KmR) vector. The NdeI-XhoI fragment was sub-cloned into pET21b in fusion with a C-terminal 6xHis tag (N-terminal 6xHis tag not tested) or pTYB1, which allows expression of target protein as a fusion to the intein-CBD tag (in the C-terminus of the target protein). T4, T4gt, and λvir phages were from Lise Raleigh's collection (NEB). T4 eG506 Δip1 (ip1 gene deletion mutant), and T4 eG192 IPI+ (control overlapping ΔIPII ΔIPIII deletion)44, IPI deficient ip1 missense mutations HA35 and KAI12 and E. coli strains DH10B containing pBeloBAC vector (CmR) or pBeloBAC-ecoCT596gmrSD (a.k.a. DL26)7 were from Lindsay W. Black's collection. Cells were grown to mid-log phase in phage broth plus Amp or Cm, concentrated 10-fold and used for phage plating assays or phage spot test. For phage spot tests on E. coli lawns, phage stock was diluted by 100-fold serial dilution and 10 μl of the diluted phage was spotted onto the cell lawn.

Protein purification

For enzyme purification from 2 L of IPTG-induced cells, Eco94GmrSD-6xHis was purified from fast flow nickel-NTA agarose columns (Qiagen). The eluted fractions (5 ml × 6) were analyzed by SDS-PAGE and fractions containing GmrSD were further purified by chromatography through a 5 ml HiTrap heparin HP column (GE Life Sciences). Pooled protein fractions were diluted in a low salt buffer (20 mM Tris-HCl, pH 7.5, 50 mM NaCl, 1 mM DTT, 1 mM EDTA, 20 mM NaCl, 5% glycerol) and loaded onto a heparin HP column using an AKTA FPLC system (GE Life Sciences). Elution was carried out using a salt gradient of elution buffer (50 mM to 1 M NaCl, 20 mM Tris-HCl, pH 7.5, 1 mM DTT, 1 mM EDTA, 5% glycerol). The eluted fractions corresponding to UV absorption peaks were analyzed by SDS-PAGE. Active enzyme fractions were pooled and processed for buffer exchange by running through an Amicon protein concentrator (Millipore). Protein was carefully recovered from the membrane by washing it a few times with a storage buffer (100 mM NaCl, 20 mM Tris-HCl, pH 7.5, 1 mM DTT, 50% glycerol) and the purified enzyme was stored at −20°C.

To determine the optimal temperature for GmrSD-intein-CBD fusion protein production, IPTG-induction (0.5 mM) was carried out at 16°C to 37°C for 4 h to overnight. The protein purification procedure was based on NEB's manual except that DTT-stimulated intein cleavage was carried out at 4°C for 48 h. The target protein was then eluted and analyzed by SDS-PAGE. Eco94GmrSD protein was further purified by chromatography through a heparin column as described above for the 6xHis-tagged version.

Site-directed mutagenesis of the putative active site residues

Site-directed mutagenesis of eco94gmrSD gene was carried out by PCR as described22. Mutant alleles were sequenced to confirm the desired mutation(s). Six single or double Eco94GmrSD mutants (in the putative endonuclease catalytic motif PD Xn E/D-X-K or PD Xn E/D-X-E) located at the N-terminus were constructed this way using pTYB1-eco94gmrSD: (1) D217A, (2) E228A/D230A, (3) D249A, (4) E260A/E262A, (5) E271A/E273A, (6) E278A/K280A. Additional six single GmrSD mutants (with C-terminal 6xHis tag) in the putative endonuclease catalytic motif D-H-N located at the C-terminus were also constructed using pET21b-eco94gmrSD: (7) D507A, (8) H508A, (9) C517A, (10) N522A, (11) N528A, (12) N535A. Eight mutants were purified by chromatography through nickel-NTA agarose columns. Three inactive mutants (D507A, H508A, N522A), three partially active mutant (C517A, N528A, and N535A), and two double mutants (E271A/E273A, E278A/K280A) were further purified by chromatography through HiTrap heparin HP column.

DNA binding assay (DNA mobility shift assay)

DNA mobility shift assay was carried out as described29. A 266-bp PCR fragment containing 5hmC or dC was used in the binding assays. For binding to glc-5hmC-modified DNA, T4 MluCI restriction fragments (100 to 500 bp mixture) were used in the DNA mobility shift assay. PCR DNA (10 ng) was incubated with 50 ng, 0.1 μg, 0.25 μg, 0.5 μg protein (the molar ratio of GmrSD protein to DNA was estimated at 6.0, 11.9, 29.7, and 59.5, assuming the active form of enzyme is a dimer with DNA) in 1× binding buffer (0.1 M NaCl, 10 mM Tris-HCl, pH 7.5, 1 mM DTT, 0.1 μg of λ carrier DNA) supplemented separately by 1) 5 mM EDTA, 2) 10 mM CaCl2, 3) 10 mM MgCl2, 4) 10 mM MgCl2 and 1 mM ATP, at room temperature for 10 min. Glycerol was added to a final concentration of 10% and the DNA-protein complex was loaded onto a pre-run TBE gel (10%, Life Technologies) and electrophoresis was carried out using 0.5× TBE buffer with gel box emerged in ice water. DNA was stained by SYBR Gold stain (Life Technologies) in 0.5× TBE for 15 min and DNA imaging was carried out on a Typhoon 9400 Imager (GE Life Sciences).

GmrSD enzyme activity assay

T4 (glc-5hmC), T4gt (5hmC), and λ DNA (Dam+ Dcm+) or 5hmC-modified PCR DNA were digested with purified GmrSD enzyme in NEB buffer 2 (50 mM NaCl, 10 mM Tris-HCl, 10 mM MgCl2, 1 mM DTT) supplemented with 1 mM ATP at 37°C for 1 h unless specified otherwise. To generate 5hmC-modified PCR DNA substrates (266 bp, 0.5 kb, 1.0 kb, 1.9 kb, 3.8 kb), 5hm-dCTP (Zymo Research) was incorporated into PCR DNA by Taq DNA polymerase during PCR reactions. As a control, similar PCR fragments were also generated using regular dNTP. To test the enzyme requirement for divalent cations, T4 DNA was digested in a basic buffer (50 mM NaCl, 10 mM Tris-HCl, pH 7.5, 1 mM DTT), and supplemented with different metal ions (MgCl2, MnCl2, CaCl2, CoCl2, NiSO4, ZnSO4) as indicated in each digestion. To test NTP stimulation of GmrSD activity, NTP (0.1, 0.5, and 1 mM), dNTP (1 and 10 mM), and γ–S-ATP (1 mM) were added to GmrSD digestions. One GmrSD endonuclease unit is defined as the amount of enzyme required for complete digestion of T4 DNA (170 kb) into fragments less than 500 bp in 1 h at 37°C in buffer 2 supplemented with 1 mM ATP. To examine the optimal temperature for GmrSD activity, T4 DNA was digested at 25°C to 65°C for 30 min in limited digestion.

Author Contributions

X.H. performed initial experiments on Eco94GmrSD enzyme purification using the IMPACT system, enzyme activity assays, and substrate preference. V.H. purified WT and mutant enzymes and performed site-directed mutagenesis of the putative catalytic site, mutant activity assay, and DNA binding assays on modified substrates. J.T. and L.W.B. contributed the work on expression, purification, and activity assays for UTI89_C2960 protein, cloning of the single chain EcoCT596GmrSD gene with or without IPI* gene and prophage analyses. F.X. produced PCR DNAs containing 5hmC or dC. S.-Y.X. performed GmrSD activity assays in the presence of various divalent cations/NTP/dNTP/temperatures, phage-plating assay, and phage spot test. Y.G. constructed the model of GmrSD catalytic site. S.G. purified the WT GmrSD protein for trial crystallography. X.H., V.H., J.A.T. and S.-Y.X. analyzed data. S.-Y.X. and L.W.B. wrote the manuscript.

Additional Information

How to cite this article: He, X. et al. Expression and purification of a single-chain Type IV restriction enzyme Eco94GmrSD and determination of its substrate preference. Sci. Rep. 5, 9747; doi: 10.1038/srep09747 (2015).

Supplementary Material

Supplementary Information

Supplement information

srep09747-s1.doc (4.1MB, doc)

Acknowledgments

We thank Richard Roberts, Lise Raleigh, and William Jack for critical comments. We are grateful to Lise Raleigh for providing phage strains. We appreciate the help from NEB's DNA and protein core sequencing labs for DNA and N-terminal protein sequencing. X.H. was partially supported by a scholarship from Ministry of Education of China. Additional funding for this project was provided by New England Biolabs, Inc. J.A.T. and L.W.B. were supported by NIH AID 011676 grant. We thank Don Comb and Jim Ellard for support.

References

  1. Loenen W. A., Dryden D. T., Raleigh E. A., Wilson G. G. & Murray N. E. Highlights of the DNA cutters: a short history of the restriction enzymes. Nucleic Acids Res 42, 3–19 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Arber W., Hattman S. & Dussoix D. On the host-controlled modification of bacteriophage lambda. Virology 21, 30–35 (1963). [DOI] [PubMed] [Google Scholar]
  3. Loenen W. A. & Raleigh E. A. The other face of restriction: modification-dependent enzymes. Nucleic Acids Research 42, 56–69 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Janosi L., Yonemitsu H., Hong H. & Kaji A. Molecular cloning and expression of a novel hydroxymethylcytosine-specific restriction enzyme (PvuRts1I) modulated by glucosylation of DNA. J Mol Biol 242, 45–61 (1994). [DOI] [PubMed] [Google Scholar]
  5. Dila D., Sutherland E., Moran L., Slatko B. & Raleigh E. A. Genetic and sequence organization of the mcrBC locus of Escherichia coli K-12. J. Bacteriol. 172, 4888–4900 (1990). [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bair C. L. & Black L. W. A type IV modification dependent restriction nuclease that targets glucosylated hydroxymethyl cytosine modified DNAs. J Mol Biol 366, 768–778 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bair C. L., Rifat D. & Black L. W. Exclusion of glucosyl-hydroxymethylcytosine DNA containing bacteriophages is overcome by the injected protein inhibitor IPI*. J Mol Biol 366, 779–789 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Wang H. et al. Comparative characterization of the PvuRts1I family of restriction enzymes and their application in mapping genomic 5-hydroxymethylcytosine. Nucleic Acids Res 39, 9294–9305 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Szwagierczak A. et al. Characterization of PvuRts1I endonuclease as a tool to investigate genomic 5-hydroxymethylcytosine. Nucleic Acids Res 39, 5149–5156 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Roberts R. J., Vincze T., Posfai J. & Macelis D. REBASE--a database for DNA restriction and modification: enzymes, genes and genomes. Nucleic Acids Res 38, D234–236 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Abremski K. & Black L. W. The function of bacteriophage T4 internal protein I in a restrictive strain of Escherichia coli. Virology 97, 439–447 (1979). [DOI] [PubMed] [Google Scholar]
  12. Rifat D., Wright N. T., Varney K. M., Weber D. J. & Black L. W. Restriction endonuclease inhibitor IPI* of bacteriophage T4: a novel structure for a dedicated target. J Mol Biol 375, 720–734 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Sutherland E., Coe L. & Raleigh E. A. McrBC: a multisubunit GTP-dependent restriction endonuclease. J. Mol. Biol. 225, 327–358 (1992). [DOI] [PubMed] [Google Scholar]
  14. Zheng Y. et al. A unique family of Mrr-like modification-dependent restriction endonucleases. Nucleic Acids Res 38, 5527–5534 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Alatossava T., Jutte H., Kuhn A. & Kellenberger E. Manipulation of intracellular magnesium content in polymyxin B nonapeptide-sensitized Escherichia coli by ionophore A23187. J Bacteriol 162, 413–419 (1985). [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Yaginuma H. et al. Diversity in ATP concentrations in a single bacterial cell population revealed by quantitative single-cell imaging. Sci Rep 4, 6522 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Vasu K. et al. Increasing cleavage specificity and activity of restriction endonuclease KpnI. Nucleic Acids Res (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Xu S. Y. & Gupta Y. K. Natural zinc ribbon HNH endonucleases and engineered zinc finger nicking endonuclease. Nucleic acids research (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Chan S. H., Opitz L., Higgins L., O'Loane D. & Xu S. Y. Cofactor requirement of HpyAV restriction endonuclease. PLoS One 5, e9071 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Shen B. W., Landthaler M., Shub D. A. & Stoddard B. L. DNA binding and cleavage by the HNH homing endonuclease I-HmuI. J Mol Biol 342, 43–56 (2004). [DOI] [PubMed] [Google Scholar]
  21. Shen B. W. et al. Unusual target site disruption by the rare-cutting HNH restriction endonuclease PacI. Structure 18, 734–743 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Xu S. Y. et al. Structure determination and biochemical characterization of a putative HNH endonuclease from Geobacter metallireducens GS-15. PLoS One 8, e72114 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Pommer A. J. et al. Homing in on the role of transition metals in the HNH motif of colicin endonucleases. The Journal of biological chemistry 274, 27153–27160 (1999). [DOI] [PubMed] [Google Scholar]
  24. Vasu K. et al. Increasing cleavage specificity and activity of restriction endonuclease KpnI. Nucleic Acids Res 41, 9812–9824 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Al-Attar S., Westra E. R., van der Oost J. & Brouns S. J. Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes. Biol Chem 392, 277–289 (2011). [DOI] [PubMed] [Google Scholar]
  26. Xu S. Y. & Gupta Y. K. Natural zinc ribbon HNH endonucleases and engineered zinc finger nicking endonuclease. Nucleic acids research 41, 378–390 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Enyeart P. J., Mohr G., Ellington A. D. & Lambowitz A. M. Biotechnological applications of mobile group II introns and their reverse transcriptases: gene targeting, RNA-seq, and non-coding RNA analysis. Mobile DNA 5, 2 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kriukiene E. Domain organization and metal ion requirement of the Type IIS restriction endonuclease MnlI. FEBS Lett 580, 6115–6122 (2006). [DOI] [PubMed] [Google Scholar]
  29. Xu S.-Y. & Schildkraut I. Isolation of BamHI variants with reduced cleavage activities. J. Biol. Chem. 266, 4425–4429 (1991). [PubMed] [Google Scholar]
  30. Pingoud A., Alves J., Flieb G. R., Rueter T. & Wolfes H. Site-directed mutagenesis of the EcoRI restriction endonuclease. Biol. Chem. Hoppe-Seyler 368, 1093 (1987). [Google Scholar]
  31. Jeltsch A. et al. DNA binding specificity of the EcoRV restriction endonuclease is increased by Mg2+ binding to a metal ion binding site distinct from the catalytic center of the enzyme. Biochemistry 34, 6239–6246 (1995). [DOI] [PubMed] [Google Scholar]
  32. Xu S. Y., Corvaglia A. R., Chan S. H., Zheng Y. & Linder P. A type IV modification-dependent restriction enzyme SauUSI from Staphylococcus aureus subsp. aureus USA300. Nucleic Acids Res 39, 5597–5610 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Xu S. Y. et al. Characterization of type II and III restriction-modification systems from Bacillus cereus strains ATCC 10987 and ATCC 14579. Journal of bacteriology 194, 49–60 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kazrani A. A., Kowalska M., Czapinska H. & Bochtler M. Crystal structure of the 5hmC specific endonuclease PvuRts1I. Nucleic Acids Res 42, 5929–5936 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Shao C., Wang C. & Zang J. Structural basis for the substrate selectivity of PvuRts1I, a 5-hydroxymethylcytosine DNA restriction endonuclease. Acta Crystallogr D Biol Crystallogr 70, 2477–2486 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Horton J. R. et al. Structure of 5-hydroxymethylcytosine-specific restriction enzyme, AbaSI, in complex with DNA. Nucleic Acids Res 42, 7947–7959 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Li L., Wu L. P. & Chandrasegaran S. Functional domains in FokI restriction endonuclease. Proc. Natl. Acad. Sci. USA 89, 4275–4279 (1992). [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Fomenkov A., Xiao J.-P., Dila D., Raleigh E. & Xu S.-Y. The ‘endo-blue method’ for direct cloning of restriction endonuclease genes in E. coli. Nucleic Acids Res. 22, 2399–2403 (1994). [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Tahiliani M. et al. Conversion of 5-methylcytosine to 5-hydroxymethylcytosine in mammalian DNA by MLL partner TET1. Science 324, 930–935 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Iyer L. M., Tahiliani M., Rao A. & Aravind L. Prediction of novel families of enzymes involved in oxidative and other complex modifications of bases in nucleic acids. Cell Cycle 8, 1698–1710 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Raleigh E. A. Restriction and modification in vivo by Escherichia coli K12. Methods Enzymol. 152, 130–141 (1987). [DOI] [PubMed] [Google Scholar]
  42. Waite-Rees P. A. et al. Characterization and expression of the Escherichia coli Mrr restriction system. J Bacteriol 173, 5207–5219 (1991). [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Chong S. et al. Utilizing the C-terminal cleavage activity of a protein splicing element to purify recombinant proteins in a single chromatographic step. Nucleic Acids Res 26, 5109–5115 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Black L. W. Bacteriophage T4 internal protein mutants: isolation and properties. Virology 60, 166–179 (1974). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information

Supplement information

srep09747-s1.doc (4.1MB, doc)

Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES