Skip to main content
Nucleic Acids Research logoLink to Nucleic Acids Research
. 2003 Apr 1;31(7):1913–1920. doi: 10.1093/nar/gkg276

Repair of hydrolytic DNA deamination damage in thermophilic bacteria: cloning and characterization of a Vsr endonuclease homolog from Bacillus stearothermophilus

Martin Laging 1,a, Eric Lindner 1,b, Hans-Joachim Fritz 1, Wilfried Kramer 1,c
PMCID: PMC152792  PMID: 12655008

Abstract

Hydrolytic deamination of 5-methyl cytosine in double stranded DNA results in formation of a T/G mismatch that—if left unrepaired—leads to a C→T transition mutation in half of the progeny. In addition to several mismatch-specific glycosylases that have been found in both pro- and eukaryotes to channel this lesion into base excision repair by removing the T from the mismatch, Vsr endonuclease from Escherichia coli has been described which initiates repair by an endonucleolytic strand incision 5′ to the mismatched T. We have isolated a gene coding for a homolog of E.coli Vsr endonuclease from the thermophilic bacterium Bacillus stearothermophilus H3 (Vsr.Bst) using a method that allows PCR amplification with degenerated primers of gene segments which code for only one highly conserved amino acid region. Vsr.Bst was produced heterologously in E.coli and purified to apparent homogeneity. Vsr.Bst specifically incises heteroduplex DNA with a preference for T/G mismatches. The selectivity of Vsr.Bst for the sequence context of the T/G mismatch appears less pronounced than for Vsr.Eco.

INTRODUCTION

The genetic material of cells is continually exposed to many exogenous and endogenous agents that can chemically modify the structure of bases and thus alter the informational content of DNA. If left unrepaired, the modified bases may induce mutations or block replication leading to cell death. A repair pathway found ubiquitously in both pro- and eukaryotes to deal with such lesions is base excision repair (1). The modified base is removed by a glycosylase, the strand is incised at the remaining abasic sugar by an AP-endonuclease activity, and after removal of the abasic sugar, the gap is filled by a DNA polymerase with subsequent ligation of the nick. A prerequisite for this repair to occur is detection of the modified base by the glycosylase. For almost all lesions such as alkylation, oxidative or hydrolytic damage this condition is met, since the reaction product is not a normal constituent of DNA. A special case, however, results from deamination of 5-methyl cytosine. Here the reaction product is thymine, a regular base in DNA. The only way to detect this thymine as a product of deamination is by virtue of its opposition to guanine on the complementary strand. Two examples of glycosylases removing thymine from T/G mismatches as an initiating step for base excision repair have been identified so far (2,3).

A different repair pathway counteracting the mutagenic effects of hydrolytic deamination of 5-methyl cytosine, which has been described in Escherchia coli, is very short patch (VSP) repair. VSP repair was first detected genetically as the determinant for an apparent excess of triple crossovers involving a particular set of markers in crosses of phage λ (46). The central enzyme in this pathway was found to be Vsr endonuclease. The gene for Vsr endonuclease overlaps with the gene for the Dcm cytosine DNA methyltransferase (7,8). Biochemical characterization demonstrated that Vsr is a sequence- and mismatch-specific endonuclease that incises at T/G oppositions on the 5′ side of the mismatched thymidine, if the sequence context of the mismatch is derived from certain subsets of the recognition sequence CCA/TGG of Dcm cytosine methyltransferase (9). This substrate specificity could also be deduced from genetic analyses (5,10,11) and is reflected in the genome structure of E.coli K12 (12,13). It seems very likely that the Vsr-catalyzed incision serves as a starting point for an excision repair, removing thymidine and reinserting cytidine, in particular since it was shown genetically that VSP repair requires the presence of DNA polymerase I with an intact 5′–3′ exonuclease (14). VSP repair in vivo is also greatly stimulated by the presence of functional mutS and mutL genes (6,11,15). It has been shown that DNA binding of Vsr endonuclease is enhanced by MutL protein (16).

Recently, the crystal structure of the enzyme alone and in complex with cleaved substrate DNA has been determined (17,18). This has provided further insight into substrate recognition and the DNA cleavage reaction catalyzed by the enzyme. The catalytic center consists of two strictly conserved (see below) aspartate residues (D51, D97), a threonine (T63) and a conserved histidine (H69) (17). In an alanine scanning mutagenesis of conserved residues, D51A and H69A showed no activity and D97A reduced activity (18). Upon binding of Vsr to DNA, the three aromatic residues F67, W68 and W86 intercalate into the DNA duplex directly beneath the mismatched T/G from the major groove side. The strict conservation of residues required for recognition of the T/G mismatch and endonuclease activity in all open reading frames with the capacity to encode proteins with sequence similarity to E.coli Vsr endonuclease that have been found in numerous bacterial species (compare Fig. 1) suggests that all these putative proteins function in repair of damage by hydrolytic deamination of 5-methyl cytosine. However, except for Vsr endonuclease from E.coli, none of these putative proteins has been characterized biochemically. Furthermore, all such open reading frames identified so far originate from mesophilic bacteria. The problem of hydrolytic deamination, however, is strongly aggravated at elevated temperature (19). As part of an extended project of this department to characterize enzymes counteracting hydrolytic DNA deamination damage in thermophilic organisms (3,20,21), we decided to isolate a Vsr homolog from a thermophilic bacterium. No thermophilic bacterium had been shown previously to contain a Vsr homolog. We selected Bacillus stearothermophilus as target organism, since 5-methyl cytosine modification of its DNA had been demonstrated (22,23). We also wanted to expand the knowledge of the biochemical properties of the Vsr family of endonucleases, which seems particularly important in order to understand the forces shaping genomic sequences, since in the E.coli genome the specificity of Vsr endonuclease clearly has a significant impact on the frequency of certain nucleotide strings (12,13).

Figure 1.

Figure 1

Multiple sequence alignment of proteins exhibiting strong similarity to E.coli Vsr and sequences of degenerated primer sets for isolation of vsr.Bst. (A) Multiple sequence alignment. Identical amino acids are shaded black, similar ones are shaded gray. Sequences used for definition of the degenerated primer sets are shown in the upper part; amino acids translated to degenerated primer sets are boxed. Sequences are from Arthrobacter luteus (Alu), B.subtilis (Bsu), E.coli (Eco, D90835), Nocardia aerocolonigenes (Nae, U09581), Haemophilus parainfluenzae (Hpa, L17342) and Xanthomonas oryzae (Xor, U06424). The sequence of the Vsr homolog from B.stearothermophilus (Bst, AJ318782) identified in this work is shown below these sequences. The eight amino acid sequence identified in the initial screen is marked by a box with dashed lines. Subsequently identified homologs found in finished and unfinished microbial genomes are shown below the Bst sequence. Sequences are from Sinorhizobium meliloti (Sme), Neisseria gonorrhoeae (Ngo1 und Ngo2), Dehalococcoides ethenogenes (Det), Neisseria meningitidis (Nme), Legionella pneumophila (Lpn), Salmonella typhimurium (Sty), Desulfovibrio vulgaris (Dvu), Salmonella enteritidis (Sen), Rhodobacter sphaeroides (Rsp), Klebsiella pneumoniae (Kpn), Salmonella paratyphi (Spa) and Caulobacter crescentus (Ccr). (B) Sequences of degenerated primer sets. The column ‘sequences’ indicates the number of different oligonucleotide sequences present in each mixture.

MATERIALS AND METHODS

Strains and media

Bacillus stearothermophilus H3 (source T. A. Trautner) was grown at 50°C in dYT medium (1% Bacto yeast extract, 1.6% Bacto tryptone, 0.5% NaCl) supplemented with beef extract (2 g/l; Gibco BRL). Escherichia coli K12 strain DH5α [F,φ80-dlacZΔM15, endA1, recA1, hsdR1 (rk- mk+), sup E44 thi-1, gyrA96(Nalr), relA1, Δ(lacZYA-argF) U169] (24) was used as host for cloning. Escherichia coli K12 strain BMH 71-18 [Δ(lac-proAB), supE, thi, (F′ lacIq lacZΔM15 proA+ proB+)] (source B. Müller-Hill) was used for protein overproduction. Escherichia coli was grown in either dYT or TB medium (1.2% Bacto yeast extract, 2.4% Bacto tryptone, 0.4% glycerine). For plasmid containing derivatives, ampicillin was added to a final concentration of 100 µg/ml.

Isolation of a Vsr homolog from B.stearothermophilus H3

For isolation of chromosomal DNA from B.stearothermo philus H3, cells from an overnight culture were harvested by centrifugation and resuspended in 1/4 vol 10% (w/v) sucrose, 50 mM EDTA, 50 mM Tris–HCl pH 8.0 containing 5 mg/ml lysozyme. After incubation for 20 min at room temperature, SDS and Proteinase K were added to final concentrations of 1% and 5 mg/ml, respectively, followed by incubation at 65°C for 30 min. After cooling to room temperature, ethidium bromide was added (1 mg/ml) and the mixture was extracted twice with phenol/chloroform (1:1). RNase A was added to the aqueous phase to a final concentration of 1 mg/ml and the mixture was incubated for 1 h at room temperature. The mixture was extracted once with phenol/chloroform (1:1) and DNA was precipitated with 1.5 vol 96% ethanol, washed with 75% ethanol and redissolved in TE buffer (1/10 vol of the starting culture) at 4°C overnight. The chromosomomal DNA was partially digested with Bsp143I. Ligation with the DNA cassette and PCR with six different sets of degenerated primers (see Fig. 1) was essentially carried out as described (25). In the first round, PCR conditions were 10 cycles of a touchdown PCR (starting annealing temperature: 60°C with a decrease of 0.5°C per cycle) followed by 35 cycles with an annealing temperature of 55°C. Products that resulted from amplification with a biotinylated cassette primer were enriched with streptavidin-coated magnetic particles and reamplified (30 cycles, annealing temperature: 50°C). These were cloned via BamHI/EcoRI into pBluescript II SK(–) (Stratagene). Inserts of 68 randomly chosen clones were sequenced. Four of them contained identical 25 bp inserts, which coded for a stretch of amino acids with homology to other Vsr homologs. Based on the sequence of this 25 bp fragment (5′-CGC ATG TGA AAT ACA TGG ACG GAT C-3′), two primers (5′VsrBst: 5′-GCT CTA GAA TAC ATG GAC GGA TC-3′; 3′VsrBst: 5′-GGA CTA GTT GTA TTT CAC ATG CG-3′) were synthesized for an inverted PCR. Chromosomal DNA from B.stearothermophilus was cleaved with several restriction endonucleases and ligated under conditions favoring circularization of the fragments to generate templates for the inverted PCR. The MunI-derived template yielded a 1.2 kb PCR product in the inverted PCR, which was subsequently cloned into pBluescript II SK(–). Sequencing of this fragment identified an open reading frame coding for a Vsr homolog (Fig. 1). The nucleotide sequence of the gene was determined by sequencing three PCR products derived from three independent amplification reactions with chromosomal DNA as template and later confirmed by sequencing of a chromosomal cosmid clone. The gene was amplified with its own RBS with the primers 5′BstVsr-Xba (5′-TGC TCT AGA TAT GGA GTT AAT GTT ATG GC-3′) and 3′BstVsr-Xho (5′-CCG CTC GAG ACT TTG AGA ATC TTT GGC-3′) and cloned into pET21d (Novagen) yielding the expression plasmid pET21d-vsr.Bst-h6. In this plasmid, the 3′ end of the vsr.Bst gene is fused in frame to a vector-derived sequence coding for six histidines. For construction of an alternate expression plasmid, the gene including the sequence coding for the hexahistidine tag was cloned as an XbaI/Eco0109I fragment from pET21d-vsr.Bst-h6 into XbaI/SmaI digested pASK75 (26) to yield pASK75-vsr.Bst-h6. Expres sion levels were comparable for both constructs; overproduction was then optimized for expression with pASK75-vsr.Bst-h6.

Overproduction and purification of Vsr.Bst-His6 protein

A 50 ml overnight culture of strain BMH 71-18 containing expression plasmid pASK75-vsr.Bst-h6 was used to inoculate 1 l TB medium containing ampicillin (100 µg/ml). The culture was grown with agitation at 37°C. After the culture had reached an OD600 of ∼0.8, vsr.Bst expression was induced by addition of 200 µg/l anhydrotetracycline. After induction, cells were incubated with agitation for another hour at 37°C, harvested by centrifugation, resuspended in 25 ml buffer A (25 mM HEPES–KOH, pH 7.6, 0.5 M NaCl, 5 mM β-mercaptoethanol) and lysed by passage through a French pressure cell (138 MPa). Cell debris was removed by centrifugation (Sorvall SS34, 12 000 r.p.m., 30 min, 4°C). The supernatant was briefly sonicated and recentrifuged as above. The resulting supernatant was applied to a column containing 5 ml Chelating Sepharose Fast Flow (Pharmacia Biotech) charged with Ni2+. The column was washed six times with 10 ml buffer A and developed with 15 ml buffer A containing 100 mM imidazole and three times 5 ml of buffer A containing 200, 300 and 500 mM imidazole, respectively. The latter three fractions were pooled and an equal volume buffer B (25 mM HEPES–KOH, pH 7.6, 5 mM β-mercaptoethanol) was added. The diluted solution was applied to a 7.854 ml Poros HS20 column (strong cation exchanger) or, alternatively, to a 7.854 ml Poros HE20 column (heparin) on a Vision™ Workstation BioCad® (Perseptive Biosystems). Flow rate was 25 ml/min. Columns were washed with 8 CV (column volumes) of buffer B and developed with 15 CV of a linear gradient from 0 to 1.5 M NaCl in buffer B. Elution of the Vsr.Bst on the HS20 column started at ∼900 mM NaCl with the peak at ∼1.2 M NaCl. The yield of Vsr.Bst-His6 was ∼5 mg as determined by UV spectroscopy with an ε280 of 30 480 l*mol–1*cm–1 calculated according to Pace et al. (27).

Preparation of oligonucleotide duplices

DNA substrates for cleavage assays and multiple substrate kinetics were prepared by hybridization of two oligonucleotides (see legend to Fig. 4A and Table 1 for sequences). For hybridization, 6 pmol fluorescein-labeled upper strand were mixed with 30 pmol of the corresponding lower strand in 30 µl SSC buffer (15 mM sodium citrate pH 7.2, 150 mM NaCl), heated to 80°C for 2 min and slowly cooled to room temperature. The concentration of double stranded substrate was adjusted to 10 fmol/µl by dilution with SSC buffer.

Figure 4.

Figure 4

Analysis of the sequence context preference of Vsr.Bst-His6 by multiple substrate kinetics. (A) Schematic representation of the substrates. The upper strand of all four substrate was labeled at the 5′ end with a fluorescein moiety. The variable part indicates the region, where sequence differences surrounding the T/G mispair were introduced to investigate possible influence of the sequence context. The upper strands differed in length. The 5′ overhang of a 43mer upper strand was 5′-GCT TGG GTA CTT-3′. Shortening this sequence stepwise by four bases leads to the overhang of a 39 and a 35meric upper strand. 31mers had no 5′ overhang and comprise the double stranded ‘core sequence’ common to all substrates, which for the upper strand was: 5′-GGC TTA TCT CCG XXX XXX TTA ATC TGT CGC A-3′. Lower strands were complementary to the 31meric upper strand. The positions marked with X are variable (compare with Table 1). Cleavage of the substrate will result in differently sized fluorescently labeled products depending on the length of the 5′ overhang. (B) Fluorescence readouts from a gel electrophoretic analysis of reaction products from a multiple substrate kinetics experiment. The times given on the right indicate the time points of sample withdrawal. (C) Example of a plot of amount of remaining substrate versus time. Each point is the mean of three independent measurements. Error bars are standard deviations.

Table 1. Dependence of cleavage efficiency of Bst.Vsr endonuclease at a T/G mismatch on the sequence context.

Substrate number Sequence context Relative rate constant Factor of decrease in efficiency
1 cTcggg 1.00 ± 0.03 1.0
2 aTcggt 0.94 ± 0.03 1.1
3 gaTagc 0.78 ± 0.03 1.3
4 tggTca 0.61 ± 0.04 1.6
5 ccTggg 0.54 ± 0.01 1.9
6 tcTcta 0.47 ± 0.01 2.1
7 aTcaat 0.40 ± 0.01 2.5
8 gcgTgc 0.36 ± 0.05 2.8
9 gTgcgc 0.34 ± 0.05 2.9
10 acTggt 0.29 ± 0.03 3.4
11 aTccct 0.23 ± 0.03 4.3
12 taTtag 0.02 ± 0.005 50.0

The ‘factor of decrease in efficiency’ is the reciprocal value of the ‘relative rate constant’.

Cleavage assay

Samples of 10 fmoles fluorescein-labeled substrate were mixed with 11.4 pmol Vsr.Bst in 20 µl 50 mM NaCl, 10 mM Tris–HCl pH 7.5, 10 mM MgCl2, 0.1 mg/ml BSA and incubated at 50°C for 15 min. The reaction was stopped by transfer of the mixture to a reaction tube containing 1/10 vol 0.5 M EDTA, pH 8.0 plus 1 mg/ml Proteinase K on ice. After incubation at 65°C for 20 min, the samples were analyzed on an A.L.F. sequencer (Pharmacia) as described (13,28).

Multiple substrate kinetics

Four fluorescein-labeled substrates (220 fmol each) were mixed in 220 µl 50 mM NaCl, 10 mM Tris–HCl pH 7.5, 10 mM MgCl2, 0.1 mg/ml BSA and preincubated at 50°C for 30 s. A first aliquot (20 µl) was removed (t = 0) and Vsr.Bst (114 pmol in 20 µl, pre-incubated at 50°C for 20 s) was added. Aliquots of 20 µl were removed after 20 and 40 s and after 1, 2, 5, 7, 10 and 20 min. Reactions were stopped as in the cleavage assay, analyzed on an A.L.F. sequencer and relative second order rate constants calculated as described (13,29). The sets of four substrates each were (in the order 31, 35, 39 and 43mer): substrates 1, 2, 3 and 4; substrates 5, 6, 3 and 4; substrates 7, 2, 8 and 9; substrates 10, 2, 11 and 12 (see Table 1).

RESULTS AND DISCUSSION

Isolation of a vsr homolog from B.stearothermophilus

As organism for isolation of a vsr homolog from thermophilic bacteria we chose B.stearothermophilus H3, which has a growth optimum of 55°C with a maximum of 70°C and has been demonstrated to contain 5-methyl cytosine as DNA modification (22,23). As indicated in Figure 1, degenerated primers were defined for the conserved D/H G/S CFWH motif and used to amplify DNA fragments (25) as detailed in the Materials and Methods. Among 68 sequenced fragments, four identical 25 bp fragments were identified, which coded for a stretch of eight amino acids with significant homology to the corresponding region of the putative Bacillus subtilis homolog (Fig. 1). Three of these were derived from a PCR with the DS1 set of degenerated primers and corresponded, as seen later, to the genomic sequence. The fourth sequence was derived from a PCR with the HS1 set and differed from the chromosomal sequence in two positions within the first six bases of the degenerated primer sequence. PCR primers were defined from these sequences for an inverted PCR with MunI cleaved and recircularized B.stearothermophilus chromosomal DNA as template. The DNA fragment obtained in the inverted PCR yielded enough sequence information for definition of primers to amplify a contiguous open reading frame coding for a protein of 140 amino acids with an Mr = 16 464 (EMBL Nucleotide Sequence Database; accession number AJ318782). As can be seen in Figure 1, the deduced amino acid sequence of this DNA fragment contains all residues conserved in other Vsr homologs as well. The isolated gene was hence called vsr.Bst.

Overproduction and purification of Vsr.Bst

An in-frame sequence coding for a hexahistidine tag was added to the 3′ end of vsr.Bst via PCR and the modified gene with its own (putative) Shine–Dalgarno sequence was cloned into the expression vectors pET21d and pASK75 (26) (see Materials and Methods for details of construction). Overexpression was comparable from both vectors (data not shown). Figure 2 documents an overexpression from the pASK75 derivative with subsequent purification of the protein. After addition of anhydrotetracycline for induction of the tet-promoter of pASK75, a pronounced new band appeared in crude cell extracts analyzed by SDS–PAGE (Fig. 2A, lane 2), whose size correlates well with the expected molecular weight of Vsr.Bst-His6 (Mr = 17.529). The corresponding protein bound almost quantitatively to Ni2+ loaded chelating Sepharose (Fig. 2B, lane 2) and was eluted with a step gradient of 100–500 mM imidazole (Fig. 2B, lanes 3–6). Fractions eluted with 200, 300 and 500 mM imidazole (Fig. 2B, lanes 4–6) were pooled and applied after dilution to a cation ion exchanger or, alternatively, to a heparin column (data not shown). In the resulting preparation (Fig. 2C, lane 1) the protein was apparently homogeneous.

Figure 2.

Figure 2

Overproduction and purification of Vsr.Bst-His6. (A) Total soluble protein from non-induced and induced cultures. Lane 1, extract from a culture before addition of anhydrotetracyclin; lane 2, extract from a culture 1 h after addition of anhydrotetracyclin. M, 10 kDa ladder. (B) Analysis of different purification steps from nickel affinity chromatography. Lane 1, cell extract before loading; lane 2, flowthrough; lane 3, eluate with 100 mM imidazole; lane 4, eluate with 200 mM imidazole; lane 5, eluate with 300 mM imidazole; lane 6, eluate with 500 mM imidazole; M, 10 kDa ladder. (C) Analysis of protein after cation exchange chromatography. Lane 1, pooled fractions of Vsr.Bst-His6; M, 10 kDa ladder. The position of Vsr.Bst-His6 (Mr = 17 529) is marked by an arrowhead. All analyses shown are SDS–polyacrylamide gels stained with Coomassie Brilliant Blue. The bands from the 10 kDa ladder correspond to Mr = 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120 and 200 k.

Mismatch-specific DNA incision by Vsr.Bst

The sequence similarity to E.coli Vsr suggests that the B.stearothermophilus protein fulfils a similar function, i.e. initiating repair at T/G mismatches by endonucleolytic incision. Homo- and heteroduplices were constructed by hybridization of oligonucleotides, one of which was fluorescently labeled (for sequences see Fig. 3A). The duplices were incubated with Vsr.Bst and as a control with Mig.MthI, a glycosylase removing thymine from T/G mismatches (3,20). The reaction products were analyzed by gel electrophoresis in a DNA sequencer (28). The results for a T/G heteroduplex and the corresponding C/G homoduplex are shown in Figure 3B. Whereas no changes can be observed after incubation of homoduplex with Vsr.Bst (compare mock reaction without addition of enzyme labeled ‘H2O’), a new species with an altered electrophoretic mobility can be observed after incubation of the T/G heteroduplex. The reaction product migrates similarly to that of Mig.MthI. Since the Mig.MthI reaction product needs to be liberated by alkaline strand scission of the depyrimidinated reaction product resulting from Mig.MthI treatment and thus contains a 3′ phosphate (3), the slightly slower mobility of the Vsr.Bst reaction product would be consistent with cleavage 5′ to the mismatched T with release of a fluorescently labeled reaction product with a 3′ hydroxyl end as it has been demonstrated for Vsr.Eco (9).

Figure 3.

Figure 3

Endonuclease activity of Vsr.Bst-His6 on homoduplex and T/G containing heteroduplex DNA. (A) Oligonucleotides for construction of substrates. For T/G containing heteroduplex DNA the 39mer upper strand was hybridized with the lower strand, for homoduplex DNA the 35mer upper strand with the lower strand. F indicates a fluorescein moiety. The nucleotides forming the T/G mismatch or the corresponding C/G base pair in homoduplex DNA are given in bold capitals. For a schematic representation of the resulting substrates see Figure 4A. (B) Fluorescence readouts from gel electrophoretic analysis of reaction products of heteroduplex DNA substrate (left panel) and homoduplex–DNA substrate (right panel). (Electrophoretic mobility increases from right to left.) Vsr.Bst, substrates treated with Vsr.Bst-His6; Mig.MthI, substrates treated with Mig.MthI glycosylase with subsequent NaOH treatment to cleave AP-sites; H2O, mock reactions with no enzyme added.

Sixteen substrates, each containing one of the possible base/base oppositions in the sequence context 5′-Fl-GGCTTA TCTCCGCXCGGGTTAATCTGTCGCA-3′ (from substrate 1, Table 1; only the sequence of the upper labeled strand is shown, X marks the variable position) were tested for cleavage by Vsr.Bst. No product was detected for any of the substrates except for the one containing the T/G mismatch (data not shown). This indicates that Vsr.Bst is quite specific for T/G mismatches, at least in the sequence context under investigation. In another sequence context (5′-Fl-GGGTACTTGGCT TATCTCCGAGGTCCTTAATCTGTCGCA; the sequence of labeled upper strand is shown; the potentially mismatched T is marked in bold), where T/A, T/G, T/C and T/T base/base oppositions were analyzed, some residual cleavage of T/T was observed (4% product formation versus 30% with T/G under identical reaction conditions). But again, no cleavage of T/A or T/C was detected (data not shown).

Several T/G processing enzymes have been shown to have equal or even stronger activity towards U/G mismatches (3,30,31), which would result from the hydrolytic deamination of unmodified cytosine. Thus, two substrates with identical sequence context, one containing a T/G and the other a U/G mismatch were compared with multiple substrate kinetics (see below). It was found that the T/G mismatch is processed 2.5 times faster than the U/G mismatch (data not shown). Cleavage at U/G due to contaminating uracil DNA glycosylase (Ung) activity from E.coli can be largely ruled out, since Ung is a monofunctional glycosylase (32) and would require a base catalyzed β-elimination by alkaline treatment for efficient strand cleavage in our assay (3,20). However, no additional product was observed by NaOH treatment of a U/G containing duplex incubated with Vsr.Bst (data not shown). It should be noted that the product resulting from alkaline strand scission has a different electrophoretic mobility than that produced by endonucleolytic incision (compare Fig. 3). The observed selectivity of T/G versus U/G is compatible with the assumption that the major cellular task of Vsr.Bst is the correction of DNA damage resulting from deamination of 5-methyl cytosine.

Sequence context selectivity of Vsr.Bst

If Vsr.Bst is involved in repair of hydrolytic deamination damage of 5-methyl cytosine, its sequence context selectivities can be expected to correlate with that of one or more cytosine DNA methyltransferases. To detect such target sequences, we tested cleavage of chromosomal DNA of B.stearothermophilus H3 with several methylation sensitive restriction enzymes (http://rebase.neb.com). Among the enzymes tested (AatII, AgeI, BamHI, Bsp143I, BssHII, Cfr10I, ClaI, EagI, Eco147I, FspI, Hin6I, HindIII, HpaII, MluI, MspI, NarI, PvuII, PstI, SacII, SalI, SmaI, SnaBI, XbaI and XhoI), chromosomal DNA was found to be refractory to cleavage by AgeI (ACCGGT), BssHII (GCGCGC), Cfr10I (RCCGGY) and XhoI (CTCGAG) (data not shown). This demonstrates that at least some sequences containing the tetranucleotides CCGG and CGCG are modified by 5′-methyl cytosine and may thus represent substrate sequences for Vsr.Bst. [Since the respective enzymes are inhibited by methylation at all cytosines (http://rebase.neb.com), no information on the possible location of the methylation in the B.stearothermophilus strain investigated here can be drawn from this experiment.]

To test the conjecture sketched above of a correlation between methylated sites and sequence selectivity of Vsr.Bst, the relative second order rate constants for Vsr.Bst cleavage at T/G mismatches in different sequence contexts was determined by multiple substrate kinetics (13,20,29). To this end, oligonucleotide hybrids as shown schematically in Figure 4A were constructed. The sequence contexts of the T/G mismatch as listed in Table 1 (corresponding to the region labeled ‘variable’ in Fig. 4A) were either derived from the putative substrate sequences or chosen arbitrarily. The rest of the double stranded region was identical in sequence for all substrates. The 5′ ends of the T containing strands were fluorescently labeled and formed single stranded overhangs that were variable in length to allow discrimination of cleavage products derived from different substrates by their gel electrophoretic mobility.

Since it could not be excluded that these overhangs might influence the efficiency of the enzyme, two substrates with identical sequence context, but differently sized single stranded overhangs, were compared. The substrate with the context cTcggg was prepared with no and with a 12 nt overhang and the substrate with the context aTcggt with 4 and 8 nt overhangs. The four substrates were compared by multiple substrate kinetics. The substrates with identical sequence context but different single stranded overhangs were processed with identical rates (data not shown). This demonstrates that the efficiency of the enzyme is not influenced by the length of the single stranded overhang to a measurable extent.

Relative rates for processing of T/G mismatches in different sequence contexts as shown in Table 1 were determined by analyzing four substrates per reaction as described above for the substrates with differently sized single stranded overhangs. An example of the analysis of such a cleavage reaction on a DNA sequencer is shown in Figure 4B. The percent remaining substrate was plotted versus reaction time and relative second order rate constants were calculated as described (13,29). An example of such a plot is shown in Figure 4C. The rate constants were then normalized to the most efficiently processed substrate (cTcggg). The results are shown in Table 1. The relative rate constant for this substrate and for the slowest substrate differ by a factor of 50. This demonstrates that Vsr.Bst has some selectivity for the sequence context of the T/G mismatch. However, the relative rate constants of the other 11 sequences tested differ only by a factor of less than 5. Since these 11 sequences are not obviously closely related, it can be deduced that Vsr.Bst does not exhibit a pronounced preference for a particular sequence context the T/G mismatch is occurring in. It can, of course, not be rigorously excluded that there might exist a much better substrate than those tested in this survey, in which case the activities observed here would have to be considered as minor side reactions. This seems unlikely, however, since the best substrates found here correlate quite well with the most plausible substrate range deduced from the analysis of resistance of chromosomal DNA to restriction enzyme cleavage.

Identification of all cytosine DNA methyltransferases from B.stearothermophilus H3 would certainly be the best guideline to judge potential substrate sequences for Vsr.Bst. So far, two cytosine DNA methyltransferases have been described for B.stearothermophilus H3: the multispecific phage cytosine DNA methyltransferase M.φBssHII, methylating ACGCGT (MluI), CCGCGG (SacII), RGCGCY (HaeII), RCCGGY (Cfr10I) and GCGCGC (BssHII) (22,23) (dominant methylation sites indicated in bold face, possible minor methylation sites in italics) and the authentic M.BssHII protecting GCGCGC (33). Although the substrate with the highest cleavage effiency is not derived from these recognition sequences, it seems conceivable—given the comparatively small differences of the relative second order rate constants for the different substrate sequences—that these sequences represent natural targets for Vsr.Bst after hydrolytic deamination of a 5-methyl cytosine.

The competition to be expected between Vsr.Bst and the postreplicative DNA mismatch repair might provide an additional tool for the search of potential substrate sites, once the genomic sequence of B.stearothermophilus is available. Mismatches arising during replication as DNA polymerase errors are processed by postreplicative DNA mismatch repair to increase replicational fidelity (34). Since we have identified a mutS and mutL homolog in B.stearo thermophilus (B.Fartmann, M.Laging and W.Kramer, unpublished), it is very likely that DNA mismatch repair is operating in this organism. Any T/G mismatch generated during replication could be processed by either DNA mismatch repair or Vsr.Bst. Whereas DNA mismatch repair removes the wrong nucleotide on the newly synthesized strand, Vsr.Bst would always initiate excision of the T. If the T/G mismatch arises by insertion of G opposite T on the template strand, action of Vsr endonuclease would fix the error. Competition of VSP repair with DNA mismatch repair has been demonstrated in E.coli (11) and its traces left during evolution can still be found imprinted into the E.coli genome (12). Thus, tetra-, penta or hexanucleotide sequences that are underrepresented in the B.stearothermophilus genome might be cytosine DNA methyltransferase recognition sites (or subsets thereof) that are subject to repair by Vsr.Bst after hydrolytic deamination of the 5-methyl cytosine.

Fraction of active enzyme

The cleavage assays were all carried out with a large molar excess of enzyme (∼1000-fold). Nevertheless, no quantitative cleavage of the substrate could be achieved. This could have two reasons: either only a very minor fraction of the enzyme is active or the association constant for formation of the enzyme–substrate complex is too low, which might not allow efficient formation of the complex at the substrate concentrations employed (0.5 nM) unless the enzyme concentration is high (0.57 µM). The upper limit for the amount of labeled substrate is set by the maximum detection capability of the DNA sequencer used. To discriminate between these two possibilities, we used mixtures with a fixed concentration of labeled substrate (1 nM) and varying concentrations of unlabeled substrate and determined product formation after 20 min at 50°C at a constant enzyme concentration (100 nM). The results are shown in Table 2. Already a 5-fold molar excess of the enzyme is sufficient to cleave most of the substrate. This rules out that the large molar excess used in the multiple substrate kinetics is necessary because most of the protein is enzymatically inactive. In an equimolar mixture, ∼50% of the substrate is processed and at a 2-fold excess of substrate ∼25%. This indicates that approximately two molecules of Vsr.Bst are necessary to cleave one substrate molecule under the reaction conditions used. If one assumes that Vsr.Bst has no or only little turnover, as it has been demonstrated for Vsr.Eco (35,36), at least 50% of the enzyme would be enzymatically active.

Table 2. Excess of Vsr.Bst required for efficient substrate cleavage.

Substrate concentration Molar excess of enzyme % product formation
1 nM 100 80
5 nM 20 84
20 nM 5 89
100 nM 1 47
200 nM 0.5 27
1 µM 0.1 8

Acknowledgments

ACKNOWLEDGEMENTS

We thank T. A. Trautner for the generous gift of B.stearothermophilus H3 strain. This work was supported through the DFG-Graduiertenkolleg ‘Chemische Aktivitäten von Mikroorganismen’.

REFERENCES

  • 1.Friedberg E.C., Walker,G.C. and Siede,W. (1995) DNA Repair and Mutagenesis. ASM Press, Washington, DC, pp. 135–190.
  • 2.Wiebauer K. and Jiricny,J. (1990) Mismatch-specific thymine DNA glycosylase and DNA polymerase beta mediate the correction of G.T mispairs in nuclear extracts from human cells. Proc. Natl Acad. Sci. USA, 87, 5842–5845. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Horst J.P. and Fritz,H.-J. (1996) Counteracting the mutagenic effect of hydrolytic deamination of DNA 5-methylcytosine residues at high temperature: DNA mismatch N-glycosylase Mig.Mth of the thermophilic archaeon Methanobacterium thermoautotrophicum THF. EMBO J., 15, 5459–5469. [PMC free article] [PubMed] [Google Scholar]
  • 4.Lieb M. (1983) Specific mismatch correction in bacteriophage lambda crosses by very short patch repair. Mol. Gen. Genet., 191, 118–125. [DOI] [PubMed] [Google Scholar]
  • 5.Lieb M., Allen,E. and Read,D. (1986) Very short patch mismatch repair in phage lambda: repair sites and length of repair tracts. Genetics, 114, 1041–1060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Lieb M. (1987) Bacterial genes mutL, mutS and dcm participate in repair of mismatches at 5-methylcytosine sites. J. Bacteriol., 169, 5241–5246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hanck T., Gerwin,N. and Fritz,H.-J. (1989) Nucleotide sequence of the dcm locus of Escherichia coli K12. Nucleic Acids Res., 17, 5844. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Sohail A., Lieb,M., Dar,M. and Bhagwat,A.S. (1990) A gene required for very short patch repair in Escherichia coli is adjacent to the DNA cytosine methylase gene. J. Bacteriol., 172, 4214–4221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Hennecke F., Kolmar,H., Bründl,K. and Fritz,H.-J. (1991) The vsr gene product of E.coli K-12 is a strand- and sequence-specific DNA mismatch endonuclease. Nature, 353, 776–778. [DOI] [PubMed] [Google Scholar]
  • 10.Lieb M. and Rehmat,S. (1995) Very short patch repair of T:G mismatches in vivo: importance of context and accessory proteins. J. Bacteriol., 177, 660–666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Zell R. and Fritz,H.-J. (1987) DNA mismatch-repair in Escherichia coli counteracting the hydrolytic deamination of 5-methyl-cytosine residues. EMBO J., 6, 1809–1815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Merkl R., Kröger,M., Rice,P. and Fritz,H.-J. (1992) Statistical evaluation and biological interpretation of non-random abundance in the E.coli K-12 genome of tetra- and pentanucleotide sequences related to VSP DNA mismatch repair. Nucleic Acids Res., 20, 1657–1662. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Gläsner W., Merkl,R., Schellenberger,V. and Fritz,H.-J. (1995) Substrate preferences of Vsr DNA mismatch endonuclease and their consequences for the evolution of the Escherichia coli K-12 genome. J. Mol. Biol., 245, 1–7. [DOI] [PubMed] [Google Scholar]
  • 14.Dzidic S. and Radman,M. (1989) Genetic requirements for hyper-recombination by very short patch mismatch repair: involvement of Escherichia coli DNA polymerase I. Mol. Gen. Genet., 217, 254–256. [DOI] [PubMed] [Google Scholar]
  • 15.Jones M., Wagner,R. and Radman,M. (1987) Mismatch repair of deaminated 5-methyl-cytosine. J. Mol. Biol., 194, 155–159. [DOI] [PubMed] [Google Scholar]
  • 16.Drotschmann K., Aronshtam,A., Fritz,H.-J. and Marinus,M.G. (1998) The Escherichia coli MutL protein stimulates binding of Vsr and MutS to heteroduplex DNA. Nucleic Acids Res., 26, 948–953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Tsutakawa S.E., Jingami,H. and Morikawa,K. (1999) Recognition of a TG mismatch: the crystal structure of very short patch repair endonuclease in complex with a DNA duplex. Cell, 99, 615–623. [DOI] [PubMed] [Google Scholar]
  • 18.Tsutakawa S.E., Muto,T., Kawate,T., Jingami,H., Kunishima,N., Ariyoshi,M., Kohda,D., Nakagawa,M. and Morikawa,K. (1999) Crystallographic and functional studies of very short patch repair endonuclease. Mol. Cell., 3, 621–628. [DOI] [PubMed] [Google Scholar]
  • 19.Lindahl T. and Nyberg,B. (1974) Heat-induced deamination of cytosine residues in deoxyribonucleic acid. Biochemistry, 13, 3405–3410. [DOI] [PubMed] [Google Scholar]
  • 20.Fondufe-Mittendorf Y.N., Härer,C., Kramer,W. and Fritz,H.-J. (2002) Two amino acid replacements change the substrate preference of DNA mismatch glycosylase Mig.MthI from T/G to A/G. Nucleic Acids Res., 30, 614–621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Starkuviene V. and Fritz,H.-J. (2002) A novel type of uracil-DNA glycosylase mediating repair of hydrolytic DNA damage in the extremely thermophilic eubacterium Thermus thermophilus. Nucleic Acids Res., 30, 2097–2102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Schumann J., Willert,J., Wild,C., Walter,J. and Trautner,T.A. (1995) M.BssHII: a new multispecific C5-DNA-methyltransferase. Gene, 157, 103–104. [DOI] [PubMed] [Google Scholar]
  • 23.Schumann J., Walter,J., Willert,J., Wild,C., Koch,D. and Trautner,T.A. (1996) M.BssHII, a multispecific cytosine-C5-DNA-methyltransferase with unusual target recognizing properties. J. Mol. Biol., 257, 949–959. [DOI] [PubMed] [Google Scholar]
  • 24.Hanahan D. (1983) Studies on transformation of Escherichia coli with plasmids. J. Mol. Biol., 166, 557–580. [DOI] [PubMed] [Google Scholar]
  • 25.Laging M., Fartmann,B. and Kramer,W. (2001) Isolation of segments of homologous genes with only one conserved amino acid region via PCR. Nucleic Acids Res., 29, e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Skerra A. (1994) Use of the tetracycline promoter for the tightly regulated production of a murine antibody fragment in Escherichia coli. Gene, 151, 131–135. [DOI] [PubMed] [Google Scholar]
  • 27.Pace C.N., Vajdos,F., Fee,L., Grimsley,G. and Gray,T. (1995) How to measure and predict the molar absorption coefficient of a protein. Protein Sci., 4, 2411–2423. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Gläsner W., Merkl,R., Schmidt,S., Cech,D. and Fritz,H.-J. (1992) Fast quantitative assay of sequence-specific endonuclease activity based on DNA sequencer technology. Biol. Chem. Hoppe Seyler, 373, 1223–1225. [DOI] [PubMed] [Google Scholar]
  • 29.Schellenberger V., Siegel,R.A. and Rutter,W.J. (1993) Analysis of enzyme specificity by multiple substrate kinetics. Biochemistry, 32, 4344–4348. [DOI] [PubMed] [Google Scholar]
  • 30.Neddermann P. and Jiricny,J. (1994) Efficient removal of uracil from G.U mispairs by the mismatch-specific thymine DNA glycosylase from HeLa cells. Proc. Natl Acad. Sci. USA, 91, 1642–1646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Gläsner W., Hennecke,F. and Fritz,H.-J. (1992) In Lilley,D.M.J., Heumann,H. and Suck,D. (eds), Structural Tools for the Analysis of Protein-Nucleic Acid Complexes. Birkhäuser Verlag, Basel, pp. 165–173.
  • 32.Lindahl T. (1980) Uracil-DNA glycosylase from Escherichia coli. Methods Enzymol., 65, 284–290. [DOI] [PubMed] [Google Scholar]
  • 33.Xu S., Xiao,J., Posfai,J., Maunus,R. and Benner,J.,II (1997) Cloning of the BssHII restriction-modification system in Escherichia coli: BssHII methyltransferase contains circularly permuted cytosine-5 methyltransferase motifs. Nucleic Acids Res., 25, 3991–3994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Modrich P. and Lahue,R. (1996) Mismatch repair in replication fidelity, genetic recombination and cancer biology. Annu. Rev. Biochem., 65, 101–133. [DOI] [PubMed] [Google Scholar]
  • 35.Turner D.P. and Connolly,B.A. (2000) Interaction of the E.coli DNA G:T-mismatch endonuclease (vsr protein) with oligonucleotides containing its target sequence. J. Mol. Biol., 304, 765–778. [DOI] [PubMed] [Google Scholar]
  • 36.Gonzalez-Nicieza R., Turner,D.P. and Connolly,B.A. (2001) DNA binding and cleavage selectivity of the Escherichia coli DNA G:T-mismatch endonuclease (vsr protein). J. Mol. Biol., 310, 501–508. [DOI] [PubMed] [Google Scholar]

Articles from Nucleic Acids Research are provided here courtesy of Oxford University Press

RESOURCES