Abstract
RNA transcribed from clustered regularly interspaced short palindromic repeats (CRISPRs) protects many prokaryotes from invasion by foreign DNA such as viruses, conjugative plasmids, and transposable elements. Cas3 (CRISPR-associated protein 3) is essential for this CRISPR protection and is thought to mediate cleavage of the foreign DNA through its N-terminal histidine-aspartate (HD) domain. We report here the 1.8 Å crystal structure of the HD domain of Cas3 from Thermus thermophilus HB8. Structural and biochemical studies predict that this enzyme binds two metal ions at its active site. We also demonstrate that the single-stranded DNA endonuclease activity of this T. thermophilus domain is activated not by magnesium but by transition metal ions such as manganese and nickel. Structure-guided mutagenesis confirms the importance of the metal-binding residues for the nuclease activity and identifies other active site residues. Overall, these results provide a framework for understanding the role of Cas3 in the CRISPR system.
Keywords: Crystal Structure, DNA, DNA-binding Protein, Metals, Mutagenesis Site-specific, CRISPR, Cas3, HD Domain, Nuclease
Introduction
Clustered regularly interspaced short palindromic repeat (CRISPR)2 loci are composed of short DNA repeat sequences separated by stretches of variable spacer sequences (1) that are derived from viral and plasmid DNA (2–5). CRISPR loci are located near clusters of CRISPR-associated (cas) genes that, together with the RNA transcribed from the CRISPR loci, mediate the resistance pathway (6, 7). Although cas gene clusters are extremely diverse in both gene number, sequence, and organization, they can be grouped into three types of CRISPR/Cas system (6). cas3 is the defining gene of the type I system, which is also the most widespread of the systems, being found in ∼95% of all genomes containing cas genes (6).
CRISPR/Cas systems operate in three stages: adaptation, expression, and interference. During adaptation, foreign DNA is recognized, processed, and integrated into CRISPR loci as new spacer sequences. In the expression stage, CRISPR loci are transcribed and processed into small RNAs (crRNA). In the final interference stage, crRNAs serve as the guide sequences in the silencing of target nucleic acid.
In the type I system, the cascade (CRISPR-associated complex for antiviral defense) catalyzes the processing of CRISPR transcripts into crRNA (7). Cascade is a 405-kDa complex comprised of five functionally essential Cas proteins (Cse1 (CRISPR subtype Escherichia coli 1), Cse2, Cas5, Cas6, and Cas7) (6–8). Following processing, the mature crRNA remains associated with cascade (7) and targets this complex to bind foreign dsDNA. Upon binding, cascade melts the dsDNA, allowing crRNA to form Watson-Crick base pairs with the complementary strand, whereas the noncomplementary strand is exposed as single-stranded DNA (ssDNA) (8). In the type II (9) and type III (10) systems, the ultimate fate of the targeted nucleic acid is cleavage by Cas nucleases. In the type I system, cascade lacks any detectable DNase activity (8). However, ssDNA endonuclease activity has recently been described for Streptococcus thermophilus cas3 (11). In vivo studies have shown that Cas3 is indispensible for the phage-resistant phenotype of E. coli (7). Moreover, the nuclease activity of Cas3 has been linked to the interference stage of the CRISPR/Cas response and is essential for CRISPR-dependent inhibition of biofilm formation in Pseudomonas aeruginosa (12). Thus, a model has been proposed whereby Cas3 functions by cleaving the ssDNA region exposed by cascade upon target DNA binding (8, 11).
In most cases, Cas3 proteins are composed of an N-terminal histidine-aspartate (HD) domain, followed by a C-terminal super family 2 helicase domain (13, 14). In some genomes, the HD and helicase domains are encoded as separate subunits and are then named Cas3″ and Cas3′, respectively (6). Throughout the manuscript, we will refer to the HD domain of Cas3 as Cas3HDdom and in accordance with Ref. 6, the HD domain subunit will be referred to as Cas3″. The HD domain catalyzes the ssDNA endonuclease activity of Cas3, and the helicase domain catalyzes the ATP-dependent unwinding of dsDNA and RNA-DNA duplexes (11). In contrast, Sulfolobus solfataricus Cas3″ has distinct substrate specificity as it cleaves double-stranded but not single-stranded DNA or RNA (15).
The HD domain superfamily of metal-dependent phosphohydrolases catalyzes both phosphomonoesterase and phosphodiesterase reactions for a broad range of substrates (16). All HD domains contain three highly conserved motifs (motifs I, II, and V) (16). Motif II contains the signature histidine-aspartate sequence, and motifs I and V contain conserved histidine and aspartate residues, respectively. Crystal structures of several HD domains reveal a canonical divalent metal ion-binding site formed by these residues (17–19). Two additional motifs, containing conserved histidine residues (motifs III and IV), are found in some superfamily members, including Cas3 and Cas3″ (16). The function of these additional motifs is unknown but likely involves metal ion binding.
As part of our investigation into the molecular basis of the CRISPR system, we undertook crystallographic and biochemical studies of the HD domain of Thermus thermophilus Cas3. We report here the crystal structure of this domain alone and with bound Ni2+. We show that this domain is a metal-dependent ssDNA endonuclease, active in the presence of transition metals but not Mg2+ or Ca2+, and that two metal ions are likely bound at the active site. We also characterize potential active site residues through structure-guided mutagenesis experiments.
EXPERIMENTAL PROCEDURES
Cloning and Mutagenesis
The 780-bp sequence encoding Cas3HDdom was amplified from the Thermus thermophilus HB8 cas3 gene (TTHB187) and cloned into the pHAT2 expression vector (20) between its NcoI and EcoRI restriction sites. The resulting plasmid encodes the Cas3HDdom polypeptide fused to an N-terminal His6 tag. Alanine mutations were introduced into the cas3HDdom gene by the QuikChange site-directed mutagenesis method (Stratagene). All mutations were verified by DNA sequencing.
Expression and Purification
Wild-type and mutant pHAT2-cas3HDdom plasmids were transformed into the T7 EXPRESS strain of E. coli (New England Biolabs). The cells were grown at 37 °C in Luria-Bertani medium to an A600 of ∼0.4. Expression was induced by the addition of 0.2 mm isopropyl 1-thio-β-d-galactopyranoside. Following overnight incubation at 20 °C, the cells were harvested by centrifugation. Cell pellets were lysed in lysis buffer (20 mm Tris-HCl, pH 8.0, 1 m NaCl, and 10% glycerol) and then clarified by centrifugation at 18,000 rpm for 30 min. The lysates were loaded on a 5-ml immobilized metal affinity column (Bio-Rad) charged with nickel sulfate. The column was washed with lysis buffer containing 20 mm imidazole, and the bound protein was then eluted with 250 mm imidazole. The elution was then loaded onto a HiLoad 26/60 S200 column (GE Healthcare) pre-equilibrated with gel-filtration buffer (20 mm Tris-HCl, pH 8.0, 1 mm EDTA, and 200 mm NaCl). Fractions containing Cas3HDdom were pooled, dialyzed against gel-filtration buffer lacking EDTA, and concentrated to ∼8 mg/ml using an Ultracel 10 K centrifugal filter unit (Millipore). Purified proteins were >95% pure as judged by SDS-PAGE and Coomassie staining.
Crystallization
Crystals of Cas3HDdom were obtained using the sitting-drop vapor-diffusion method by mixing 2 μl of Cas3HDdom at ∼8 mg/ml with 1 μl of solution containing 10 mm phosphate-citrate buffer at pH 4.2 and 10% PEG 300. Crystals were stabilized and cryo-protected in a solution containing 10 mm sodium acetate, pH 4.2, and 30% PEG 300, and then flash-frozen in liquid nitrogen. Platinum and osmium derivative crystals were obtained by soaking crystals for 90 min in a stabilization solution containing either 1 mm of K2PtCl4 or (NH4)2OsCl6.
Data Collection and Structure Determination
X-ray diffraction data were collected at beamline 9.2 at the Stanford Synchrotron Radiation Light Source and processed with either XDS (21) or HKL2000 (22). SHELX (23) was used to find the positions of osmium sites in the (NH4)2OsCl6 derivative crystal first. The phases derived from these osmium sites were then used to calculate a difference Fourier map to find the heavy atom positions in the platinum derivative. All phases were calculated using SOLVE and improved by solvent-flattening in RESOLVE (24). Model building was carried out in Coot (25), and the model was refined with PHENIX (26). Coordinates of the metal-free and metal-bound structures have been deposited with the Protein Data Bank (PDB) codes 3SK9 and 3SKD, respectively.
Analysis of Methanococcus jannaschii Cas3″
To access the structure of M. jannaschii Cas3″, we calculated difference electron density maps (2.3 Å resolution) from the deposited coordinates and structure factors (PDB code 3M5F) using the phenix.model_vs_data script (27). The Rwork/Rfree from this calculation is 22.9/26.3%, in good agreement with the published Rwork/Rfree of 23.1/26.0%.
Nuclease Assay
Cas3HDdom nuclease assays were performed as described previously (11). Magnesium chloride (Mg2+), manganese chloride (Mn2+), nickel sulfate (Ni2+), copper chloride (Cu2+), cobalt chloride (Co2+), calcium chloride (Ca2+), and zinc chloride (Zn2+) were used in metal ion substitution reactions at the indicated concentrations. All reactions were terminated with 20 mm EDTA. The products of reactions were separated by electrophoresis through 1% agarose gels and visualized by ethidium bromide staining.
Thermofluor Assay
The apparent melting temperature values of wild-type and mutant cas3HDdom were determined as described previously (28). Experiments were performed in a buffer containing 20 mm Tris-HCl, pH 8.0, and 200 mm NaCl. The final concentration of protein was 10 μm (ϵ = 55460 m−1 cm−1). Reactions were heated from 20 to 80 °C, and the fluorescence intensity was recorded at 0.2 °C intervals. Fluorescence intensities were plotted as a function of temperature, and the midpoint of the unfolding transition taken as an estimation of the melting temperature.
RESULTS
Crystal Structure of Cas3HDdom
Sequence analysis predicts that the Cas3HDdom spans residues 5–260 of T. thermophilus HB8 cas3. The DNA sequence encoding this region was cloned into the pHAT2 expression vector (20), expressed in E. coli, and purified to homogeneity by affinity and size-exclusion chromatography. Cas3HDdom was crystallized by vapor diffusion with PEG 300 as the precipitant. Divalent metal ions inhibited crystallization and therefore were not included in crystallization experiments. Crystals belong to the space group P4322 (a = b = 48.5 Å and c = 205.9 Å) and contain one Cas3HDdom molecule per asymmetric unit. The structure was determined by multiple isomorphous replacement and refined at 1.8 Å resolution to an Rwork of 16.8% and an Rfree of 19.5%. A representative section of unbiased electron density is shown in Fig. 1A. The final model displays good geometry with no Ramachandran outliers (Table 1) and contains all of the Cas3HDdom sequence except residues 81–101 and 183–188, for which electron density was not interpretable. No divalent metal ions were apparent in the electron density map. The tertiary structure of Cas3HDdom, composed of 10 α-helices and two β-strands, is illustrated in Fig. 1B. Overall, Cas3HDdom adopts a globular structure with a concave surface formed by the five conserved motifs of the HD superfamily (Fig. 1C) (16).
FIGURE 1.
Crystal structure of T. thermophilus Cas3HDdom. A, unbiased Fo − Fc electron density map contoured at 3σ. The residues, which are represented as sticks, were omitted from the map calculation. B, ribbon trace of the Cas3HDdom structure. The HD domain motifs are colored as follows: motif I (red), motif II (green), motif III (orange), motif IV (blue) and motif V (pink). The black sphere represents the cognate metal binding site. The N and C termini are labeled. C, Amino acid sequence conservation scores mapped onto the surface of two orthogonal views of the Cas3HDdom using CONSURF (41). The structure to the left is in the same orientation as shown in B. The conservation scale is drawn below the two views of the structure. The black sphere represents the cognate metal-binding site. D, Cas3HDdom metal ion-binding site. Residues colored by element are in the metal ion-bound configuration. Residues colored yellow are in the protein alone configuration. The Ni2+ (blue) and two water molecules (red) are represented as spheres. Anomalous difference electron density map contoured at 5σ (white mesh) reveals the site of the Ni2+ ion.
TABLE 1.
Data collection and processing statistics
| Native | Nickel soak | Platinum soak | Osmium soak | |
|---|---|---|---|---|
| Data collection | ||||
| Resolution (Å)a | 1.80 (1.90-1.80) | 2.00 (2.11-2.00) | 2.23 (2.31-2.23) | 2.29 (2.37-2.29) |
| Rsyma,b | 6.1 (49.5) | 15.4 (70.5) | 14.9 (93.6) | 6.5 (14.9) |
| I/σa | 22.7 (4.5) | 9.5 (3.2) | 8.2 (1.2) | 18.1 (9.4) |
| Redundancya | 11.7 (10.7) | 12.0 (11.5) | 3.8 (3.8) | 3.8 (3.8) |
| Completeness (%)a | 99.4 (96.4) | 99.9 (99.5) | 100 (100) | 99.6 (97.6) |
| Wilson B factor | 25.56 | |||
| Mean figure of merit | 0.51 | |||
| Heavy atom sites | 1 | 4 | 1 | |
| Refinement | ||||
| Rworkc | 16.69 | 16.52 | ||
| Rfreec | 19.54 | 20.62 | ||
| r.m.s.d. bond (Å)d | 0.016 | 0.018 | ||
| r.m.s.d. angle | 1.425° | 1.518° | ||
| No. of atoms | 1953 | 1957 | ||
| B-factors | 32.7 | 37.3 | ||
a The values in parentheses are for the highest resolution shell.
b Rsym is Σ|Io − I|/ΣIo, where Io is the intensity of an individual reflection, and I is the mean intensity for multiply recorded reflections.
c Rwork is ‖Fo − Fc‖/Fo, where Fo is an observed amplitude, and Fc is the calculated amplitude; Rfree is the same statistic calculated over a subset of the data that has not been used for refinement.
d r.m.s.d., root mean square deviation.
To investigate the molecular basis of divalent metal ion binding by Cas3HDdom, we measured diffraction data from crystals soaked in a stabilization solution containing 100 μm nickel sulfate. Soaks were performed at the crystallization pH of 4.2 because attempts to increase the pH severely reduced the quality of the x-ray diffraction data. Anomalous difference electron density maps, calculated from soaked data, contained a single strong peak (∼25σ) positioned at the canonical metal ion-binding site. The Ni2+ ion is coordinated, with octahedral geometry, by two water molecules and four conserved residues, within motifs I (His-24), II (His-69 and Asp-70), and V (Asp-205) (Fig. 1D). Superposition of the structures of Cas3HDdom with and without bound metal ion results in a root mean square deviation of 0.28 Å over 235 Cα atoms, demonstrating that the structure of Cas3HDdom remains largely unchanged upon metal ion binding. The only significant difference is seen at the metal ion-binding site where the side chain of His-69 rotates to coordinate the metal ion (Fig. 1D).
Comparison with Other HD Domain Proteins
As expected, a search of the structural database by DALI (29) shows that Cas3HDdom is related to other HD domains, including many unpublished structures deposited by structural genomics initiatives. The most closely related is the unpublished structure of a Cas3″ protein (MJ0384) from Methanocaldococcus jannaschii (PDB code 3M5F). These two proteins share only 14% amino acid identity, yet their structures align with a root mean square deviation of 3.2 Å over 142 Cα atoms (Z score of 7.9) (Fig. 2A). Although overall similar, there are some notable differences between the structures. First, two regions differ in topology. A 13-amino acid loop (residues 176–191) connects the seventh and eighth helices of Cas3HDdom, whereas in Cas3″, an insertion results in an additional α-helix connecting these helices (residues 168–202) via an alternate path (Fig. 2B). The C terminus of Cas3HDdom also ends with a helix-strand-strand arrangement that is absent in the Cas3″ structure (Fig. 2A). However, this difference may be the result of disorder in the electron density map of Cas3″. The last 30 residues of Cas3″ are not modeled, and the electron density describing the last helix (which contains motif V) lacks clear side chain features. The coordinates of many of the residues in this helix have been truncated in the model. The second difference between Cas3HDdom and Cas3″ is in the configuration of the metal ion binding sites. Two Ca2+ ions are modeled at the active site of M. jannaschii Cas3″ (Fig. 3A). Strikingly, neither of these metal ions is found at the canonical metal binding site of HD domains (site A in Fig. 3), perhaps due to the disorder observed in the electron density in the region of motif V. Instead, four conserved residues within motifs II, III, and IV coordinate one of the Ca2+ ions (site B in Fig. 3A). The significance of the position of the second Ca2+ ion (site C in Fig. 3A) is unclear as it is located 4.0 Å from the nearest protein atom or water molecule.
FIGURE 2.
Comparison of Cas3HDdom with other HD domains. A, structure of T. thermophilus Cas3HDdom (yellow and red) superimposed on the structure of M. jannaschii Cas3″ (white). The additional helix-strand-strand element in the structure of Cas3HDdom is colored red. The molecules are oriented as shown in B. B, side-by-side view of the structures of T. thermophilus Cas3HDdom (left) and M. jannaschii Cas3″ (right). The helical insertion found in M. jannaschii Cas3″ and its equivalent loop in T. thermophilus Cas3 are colored blue. The helices either side of this region are colored green. C, ribbon trace of the minimal core five α-helices of the HD domain taken from the structure of Cas3HDdom. Residues from the five HD domain motifs are colored as shown in B. A surface representation of Cas3HDdom is shown in the background.
FIGURE 3.
Metal ion-binding sites. The residues that form the metal ion-binding sites of M. jannaschii Cas3″ (A), T. thermophilus Cas3HDdom (B), and a protein of unknown function (PDB 2PQ7) from a Thermotogales species (C). The text color of residue labels indicates the motif that the residue belongs to as shown in Fig. 1B. Labeled metal ions are shown as green spheres. His-138 of T. thermophilus Cas3HDdom is modeled in two alternative conformers; for clarity, only one of these conformers is shown.
The Cas3″ residues interacting with the site B Ca2+ ion are conserved in Cas3HDdom (Fig. 3B), suggesting that Cas3HDdom could bind two metal ions. Yet, in the electron density maps generated from our Ni2+-soaking experiments, we fail to observe a metal ion at site B. The motif IV histidine residues (His-137 and His-138), which coordinate the site B metal ion in Cas3″, are oriented away from the binding site in the Cas3HDdom structure (Fig. 3B). However, these residues are in a loop that is constrained by crystal packing contacts and, as a result, may be unavailable for metal ion binding. We were unable to crystallize Cas3HDdom in the presence of divalent metal ions. In solution, Cas3HDdom may bind two metal ions, one at site A and the other at site B. At least five structures of HD domains have been determined (PDB codes 2PQ7, 3HC1, 3CCG, 2OGI, and 2O08) that have a metal ion bound at each of these sites (an example of which is shown in Fig. 3C). These five HD domains are of unknown function, but they all contain a conserved histidine residue in motif III and either one or two conserved histidine residues in motif IV.
The availability of the crystal structures of many HD superfamily members permits us to define the minimal fold of the HD domain. An inspection of the overlay of HD domain structures reveals a common core of five α-helices that, along with their connecting loops, house the five motifs that define the HD superfamily (Fig. 2C) (16). Beyond this core structure, different members of the HD superfamily have unique structural elements that presumably help specify the individual functions of each HD domain family.
Nuclease Activity of Cas3HDdom
S. thermophilus Cas3 has been shown to cleave ssDNA but not dsDNA in a Mg2+-dependent manner. To investigate the activity of the T. thermophilus Cas3HDdom, we incubated various concentrations of the protein in the presence of Mg2+ and ssDNA (M13mp18). The reactions were then analyzed by electrophoresis through agarose gels, and the DNA species was visualized with ethidium bromide (Fig. 4A). Under these conditions, no cleavage of ssDNA was detected. We also assayed cleavage of dsDNA (PvuII-linearized pUC19) and again observed no cleavage (Fig. 4B). As HD domains have been reported to utilize a variety of different divalent cations as cofactors (30–33), we evaluated the ability of several other divalent cations (Ca2+, Mn2+, Co2+, Ni2+, Cu2+, and Zn2+) to activate the nuclease activity of Cas3HDdom. Thus, cleavage reactions were repeated, substituting increasing concentrations of each of these metal ions for Mg2+. In these experiments, reaction products were detected as a smear on the agarose gel in the presence of low concentrations (20 μm) of Ni2+, Mn2+, Co2+, Cu2+, and Zn2+ (Fig. 5A). No cleavage was observed in either the absence of divalent metal ions, the presence of EDTA, or the presence of up to 20 mm Ca2+. We also tested and detected no cleavage of dsDNA in the presence of any of these metal ions (Fig. 5B).
FIGURE 4.
The nuclease activity of Cas3HDdom is not activated by Mg2+. A, activity of Cas3HDdom on ssDNA in the presence of Mg2+. Reaction mixtures containing 10 mm Tris-HCl, pH 7.5, 60 mm KCl, 10 mm MgCl2, 10% glycerol, and 4 nm circular single-stranded M13mp18, and the indicated amounts of Cas3HDdom were incubated for 2 h at 37 °C. Reactions with EDTA or no added metal served as controls. All reactions were quenched with 20 mm EDTA, and the products were then resolved by a 1% agarose gel and visualized by ethidium bromide staining. B, activity of Cas3HDdom on dsDNA in the presence of Mg2+. Assays were performed as above except that the reaction mixtures contained 4 nm pUC19 linearized with PvuII.
FIGURE 5.
The ssDNA endonuclease activity of Cas3HDdom is activated by transition metal ions. A, activity of Cas3HDdom in the presence of difference divalent metal ions. Reaction mixtures containing 10 mm Tris-HCl, pH 7.5, 60 mm KCl, 10% glycerol, 1 μm Cas3HDdom, and 4 nm circular single-stranded M13mp18 DNA, and the indicated amount of each divalent metal ion were incubated for 2 h at 37 °C. Reactions with EDTA, no added metal (NM) or no protein (C) served as controls. All reactions were quenched with 20 mm EDTA, and the products were then resolved by a 1% agarose gel and visualized by ethidium bromide staining. B, activity of Cas3HDdom on dsDNA in the presence of different divalent metal ions. Assays were performed as above except that reaction contained 4 nm pUC19 linearized with PvuII.
Metal Ion Binding Increases Thermal Stability of Cas3HDdom
To further characterize the interaction between Cas3HDdom and divalent cations, we performed a Thermofluor assay (28, 34, 35). This assay measures the change in the fluorescence signal of SYPRO orange dye as it interacts with a protein undergoing thermal unfolding. The fluorescence signal of the dye is quenched in an aqueous environment but becomes unquenched when exposed to the hydrophobic core of the protein upon unfolding. The midpoint of the unfolding transition is taken as an approximation of the melting temperature (Tm). This assay can assess ligand binding because ligands that bind more tightly to the folded form of the protein than to the unfolded form are likely to increase the apparent Tm of that protein (36). In the presence of EDTA, the apparent Tm of Cas3HDdom is 49.6 ± 0.2 °C (Fig. 6). In line with our activity data, the addition of 20 mm Mg2+ and 100 μm Ni2+ increases the apparent Tm of Cas3HDdom by ∼6 °C and ∼15 °C, respectively (Fig. 6).
FIGURE 6.
Effects of metal ions on the thermal stability of Cas3HDdom. Thermofluor assays were performed in the absence of metal or in the presence of either 20 mm Mg2+ or 100 μm Ni2+. The apparent melting temperatures are indicated by arrows. R.F.U., relative fluorescence units.
Mutational Analysis of Cas3HDdom
A series of point mutant proteins were generated in which putative active site residues (His-24, His-69, Asp-70, Lys-73, His-105, His-138, His-139 Ser-202, Ser-209, and Asp-205) or the surface-exposed aromatic residues that surround this site (Trp-102 and Phe-253) were replaced with alanine. The location of these mutations in Cas3HDdom is highlighted in Fig. 7A. Alanine mutants were expressed and purified in the same manner as the wild-type protein (Fig. 7B). To ensure that any potential defects observed in nuclease activity could not be attributed to global misfolding and to assess divalent metal ion binding, the apparent Tm of each alanine mutant was determined in the absence or presence of Ni2+ (Table 2). In the absence of Ni2+, nine of the 12 mutant proteins have Tm values similar to or greater than that of wild-type protein, indicating that these mutations did not destabilize the fold of Cas3HDdom. Three of the mutant proteins (K73A, H105A, and S209A) have a Tm lower than wild-type, suggesting that some destabilization did occur. For wild-type Cas3HDdom, the addition of Ni2+ results in an increase in apparent Tm (ΔTm) of 15 °C. The mutation of residues not implicated in metal ion binding (Lys-73, Trp-102, Ser-202, Ser-209, and Phe-253) results in similar or larger ΔTm values, supporting the evidence that these residues do not participate in metal ion binding. With the exception of His-105, the mutation of residues predicted to bind metal ions (His-24, His-69, Asp-70, His-138, His-139, and Asp-210) results in a decreased ΔTm (Table 2), suggesting that, in solution, these residues are involved in metal ion binding.
FIGURE 7.
Mutational analysis of Cas3HDdom. A, ribbon trace of Cas3HDdom (white). Residues selected for mutation are represented as sticks (red), and the Ni2+ ion as a sphere (blue). B, an SDS-PAGE of the purified mutants, stained with Coomassie Blue. C, ssDNA endonuclease activity of the mutants. Reaction mixtures containing 10 mm Tris-HCl, pH 7.5, 60 mm KCl, 10% glycerol, 100 nm Cas3HDdom, and 60 μm NiSO4, and 4 nm circular single-stranded M13mp18 DNA were incubated for 50 min at 37 °C. Reactions with EDTA or no Cas3HDdom (−) served as controls. The reaction products were resolved by a 1% agarose gel and visualized by ethidium bromide staining.
TABLE 2.
Melting temperatures of wild-type and mutant Cas3HDdom
| Melt temperature |
||
|---|---|---|
| 0 μm Ni2+ | 100 μm Ni2+ | |
| °C | ||
| WT | 49.6 ± 0.2 | 64.8 ± 0.6 |
| H24A | 53.6 ± 0.4 | 62.8 ± 0.4 |
| H69A | 49.8 ± 1.8 | 62.2 ± 0.8 |
| D70A | 52.6 ± 0.2 | 55.2 ± 0.4 |
| K73A | 45.0 ± 0.4 | 65.0 ± 0.6 |
| W102A | 49.0 ± 0.4 | 67.2 ± 0.4 |
| H105A | 45.8 ± 0.4 | 61.2 ± 0.4 |
| H137A | 50.8 ± 0.4 | 62.0 ± 0.2 |
| H138A | 49.2 ± 0.2 | 61.8 ± 0.4 |
| S202A | 51.2 ± 0.8 | 66.2 ± 0.4 |
| D205A | 50.6 ± 0.6 | 61.0 ± 0.4 |
| S209A | 47.2 ± 0.2 | 62.4 ± 0.4 |
| F253A | 49.6 ± 0.4 | 65.4 ± 0.6 |
We next tested each of the alanine mutants for ssDNA endonuclease activity using the M13mp18 phage cleavage assay in the presence of Ni2+ (Fig. 7C). Under these conditions, wild-type Cas3HDdom generated cleavage products that migrated as a tight smear on an agarose gel, whereas the mutation of residues predicted to bind metal ions abolished this nuclease activity. These results confirm that the activity we observed is not the result of a contaminating protein. The other alanine mutations abolished (K73A), suppressed (W102A and S209A), or had little or no affect (S202A and F252A) on the nuclease activity of Cas3HDdom.
DISCUSSION
Cas3 is functionally essential (7, 12) and is the signature gene of the type I CRISPR/Cas system (6). The HD nuclease domain of Cas3 is proposed to cleave the ssDNA revealed upon cascade binding to target DNA (11). Consistent with this hypothesis, we show that T. thermophilus Cas3HDdom cleaves ssDNA but not dsDNA. This result also establishes that the helicase domain of Cas3 does not alter the substrate specificity of its HD domain. Mg2+ activated the endonuclease activities of S. thermophilus Cas3 (11) and S. sulfataracus Cas3″ (15). In contrast, the transition metal ions Mn2+, Co2+, Ni2+, and Zn2+ activate the endonuclease activity of T. thermophilus Cas3HDdom but not Mg2+ or Ca2+ (Fig. 7C). It is also noteworthy that T. thermophilus Cas3HDdom appears much more active in the presence of transition metal ions, particularly Ni2+, than S. thermophilus Cas3 is in the presence of Mg2+ (11). More quantitative data will be needed to establish whether this is significant. Which metal ion, or ions, is used in vivo by T. thermophilus Cas3 remains to be determined. We cannot rule out the possibility that the in vitro requirement for transition metal ions could be a sign of a missing cofactor found in the cell. However, in T. thermophilus, the total intracellular concentration of Mn2+, Ni2+, and Zn2+ are ∼160, 150, and 550 μm, respectively, whereas the concentration of Co2+ is undetectable (37). Considering that only 20 μm of Mn2+ or Ni2+ is needed to activate Cas3HDdom in vitro, these two ions are the most likely in vivo candidates of the ions studied.
The crystal structure of T. thermophilus Cas3HDdom provides the first view of an HD domain with nuclease activity. Cas3HDdom adopts a globular structure with a concave surface that contains the active site and presumably binds substrate DNA. Comparison of the structure of the Cas3HDdom with bound Ni2+ and other HD domains with bound divalent metal ions suggests that Cas3HDdom binds two metal ions at its active site. In line with this, the mutation of residues predicted to form the metal ion binding sites results in proteins with smaller ΔTm values upon addition of Ni2+ compared with the wild-type protein. The D70A mutant has the smallest ΔTm value, consistent with the observation that among two metal-ion dependent enzymes, the most critical residue for metal binding is often an aspartate (38). His-105 is the only residue predicted to bind metal ion that, when mutated, has a ΔTm value comparable with that of wild-type protein (Table 2). However, this residue most likely is a metal ion ligand as it is highly conserved and coordinates metal binding in structures of other HD domains (PDB codes 2PQ7, 3HC1, 3CCG, 2OGI, and 2O08).
The metal-binding data and analysis presented here also imply that all HD domain proteins with histidine residues in motifs III and IV will have two metal ions at their active sites. Thus, unlike the HD domains characterized to date (17–19), proteins in this subset of HD domains, which include Cas3 and Cas3″, are likely to utilize a two metal-ion mechanism for catalysis (39, 40). The fact that Cas3HDdom appears to bind two metal ions in the absence of substrate is also somewhat distinct, as it is generally observed that metal ion binding by two-metal-ion dependent enzymes requires the presence of cognate substrate (38).
We used structure-guided mutagenesis to confirm both the importance of metal ion-binding residues to the activity of Cas3HDdom and to examine the role of other residues close to the metal ion-binding sites. The mutation of the residues predicted to bind metal ions, including His-105 (Fig. 3B), completely ablates the nuclease activity of Cas3HDdom under the conditions tested. These results, coupled with both our structural analysis and ΔTm data (Table 2), are consistent with two metal ions bound at the Cas3HDdom active site and establish the importance of these ions for nuclease activity. Mutation of the invariant Lys-73 produced an inert enzyme (Fig. 7C). Because of the proximity of this residue to the metal ion binding sites (Fig. 7A) and its positive charge, it is likely that it helps correctly position a phosphate group of the substrate for catalysis. Mutation of Trp-102 or Ser-209 also resulted in a protein with a reduced activity (Fig. 7C). The position of these residues within the substrate-binding cleft (Fig. 7A) suggests that they may play a role in substrate recognition.
Studies of S. solfataricus Cas3″ have shown that this enzyme has distinct substrate specificity compared with S. thermophilus and T. thermophilus Cas3, as it cleaves dsDNA but not ssDNA (15). Additionally, comparison of mutational studies between T. thermophilus cas3 (presented here) and S. solfataricus Cas3″ (15) suggests that the active site geometry of S. solfataricus Cas3″ may also be distinct. First, mutation of either His-69 or His-105 ablates Cas3HDdom nuclease activity (Fig. 7C). However, mutation of the corresponding residues in S. solfataricus Cas3″ results in a protein with near wild-type activity (15). Secondly, mutation of Glu-92 in S. solfataricus Cas3″, a motif III residue that is conserved in Cas3″ but not Cas3, inactivated nuclease activity. Inspection of the structure of M. jannaschii Cas3″ suggests that this glutamate may replace the second histidine residue of motif IV (His-143 in T. thermophilus and His-124 in M. jannaschii). This histidine forms part of metal ion site B (Fig. 3) and is essential for the nuclease activity of T. thermophilus Cas3HDdom (Fig. 7C). In the structure of M. jannaschii Cas3″, Glu-92 is in close proximity to site B, but its side chain is oriented away from the metal ion (Fig. 3A). S. solfataricus Cas3″ lacks the second histidine of motif IV. Thus, Glu-92 could substitute for this histidine in coordinating the metal ion, potentially explaining the importance of this residue for the nuclease activity of S. solfataricus Cas3″. The significance of this mutational data and the difference in substrate specificity awaits further studies of the Cas3 and Cas3″ proteins. However, these results may indicate that S. solfataricus Cas3″ has a different functional role or mechanism of action within the CRISPR response.
Acknowledgments
We thank Gabriel Brandt and Jennifer Kavran for critical reading of the manuscript and Jürgen Bosch for use of a real-time PCR machine.
This work was supported, in whole or in part, by National Institutes of Health Grant GM097330. This work was also supported by start-up funds provided by the Department of Biochemistry and Molecular Biology and the Johns Hopkins Malaria Research Institute.
The atomic coordinates and structure factors (codes 3SK9 and 3SKD) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).
- CRISPR
- clustered regularly interspaced short palindromic repeat(s)
- crRNA
- CRISPR RNA
- dsDNA
- double-stranded DNA
- ssDNA
- single-stranded DNA
- PDB
- Protein Data Bank
- HD domain
- histidine-aspartate domain.
REFERENCES
- 1. Nakata A., Amemura M., Makino K. (1989) J. Bacteriol. 171, 3553–3556 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Mojica F. J., Díez-Villaseñor C., García-Martínez J., Soria E. (2005) J. Mol. Evol. 60, 174–182 [DOI] [PubMed] [Google Scholar]
- 3. Pourcel C., Salvignol G., Vergnaud G. (2005) Microbiology 151, 653–663 [DOI] [PubMed] [Google Scholar]
- 4. Bolotin A., Quinquis B., Sorokin A., Ehrlich S. D. (2005) Microbiology 151, 2551–2561 [DOI] [PubMed] [Google Scholar]
- 5. Barrangou R., Fremaux C., Deveau H., Richards M., Boyaval P., Moineau S., Romero D. A., Horvath P. (2007) Science 315, 1709–1712 [DOI] [PubMed] [Google Scholar]
- 6. Makarova K. S., Haft D. H., Barrangou R., Brouns S. J., Charpentier E., Horvath P., Moineau S., Mojica F. J., Wolf Y. I., Yakunin A. F., van der Oost J., Koonin E. V. (2011) Nat. Rev. Microbiol. 9, 467–477 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Brouns S. J., Jore M. M., Lundgren M., Westra E. R., Slijkhuis R. J., Snijders A. P., Dickman M. J., Makarova K. S., Koonin E. V., van der Oost J. (2008) Science 321, 960–964 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Jore M. M., Lundgren M., van Duijn E., Bultema J. B., Westra E. R., Waghmare S. P., Wiedenheft B., Pul U., Wurm R., Wagner R., Beijer M. R., Barendregt A., Zhou K., Snijders A. P., Dickman M. J., Doudna J. A., Boekema E. J., Heck A. J., van der Oost J., Brouns S. J. (2011) Nat. Struct. Mol. Biol. 18, 529–536 [DOI] [PubMed] [Google Scholar]
- 9. Garneau J. E., Dupuis M. È., Villion M., Romero D. A., Barrangou R., Boyaval P., Fremaux C., Horvath P., Magadán A. H., Moineau S. (2010) Nature 468, 67–71 [DOI] [PubMed] [Google Scholar]
- 10. Hale C. R., Zhao P., Olson S., Duff M. O., Graveley B. R., Wells L., Terns R. M., Terns M. P. (2009) Cell 139, 945–956 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Sinkunas T., Gasiunas G., Fremaux C., Barrangou R., Horvath P., Siksnys V. (2011) EMBO J. 30, 1335–1342 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Cady K. C., O'Toole G. A. (2011) J. Bacteriol. 193, 3333–3445 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Haft D. H., Selengut J., Mongodin E. F., Nelson K. E. (2005) PLoS Comput. Biol. 1, e60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Makarova K. S., Grishin N. V., Shabalina S. A., Wolf Y. I., Koonin E. V. (2006) Biol. Direct 1, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Han D., Krauss G. (2009) FEBS Lett. 583, 771–776 [DOI] [PubMed] [Google Scholar]
- 16. Aravind L., Koonin E. V. (1998) Trends Biochem. Sci. 23, 469–472 [DOI] [PubMed] [Google Scholar]
- 17. Hogg T., Mechold U., Malke H., Cashel M., Hilgenfeld R. (2004) Cell 117, 57–68 [DOI] [PubMed] [Google Scholar]
- 18. Kondo N., Nakagawa N., Ebihara A., Chen L., Liu Z. J., Wang B. C., Yokoyama S., Kuramitsu S., Masui R. (2007) Acta Crystallogr. D. Biol. Crystallogr. 63, 230–239 [DOI] [PubMed] [Google Scholar]
- 19. Zimmerman M. D., Proudfoot M., Yakunin A., Minor W. (2008) J. Mol. Biol. 378, 215–226 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Peränen J., Rikkonen M., Hyvönen M., Kääriäinen L. (1996) Anal. Biochem. 236, 371–373 [DOI] [PubMed] [Google Scholar]
- 21. Kabsch W. (2010) Acta Crystallogr. D. Biol. Crystallogr. 66, 125–132 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Otwinowski Z., Minor W. (1997) Method Enzymol. 276, 307–326 [DOI] [PubMed] [Google Scholar]
- 23. Sheldrick G. M. (2008) Acta Crystallogr. A. 64, 112–122 [DOI] [PubMed] [Google Scholar]
- 24. Terwilliger T. (2004) J. Synchrotron. Radiat. 11, 49–52 [DOI] [PubMed] [Google Scholar]
- 25. Emsley P., Cowtan K. (2004) Acta Crystallogr. D. Biol. Crystallogr. 60, 2126–2132 [DOI] [PubMed] [Google Scholar]
- 26. Afonine P. V., Grosse-Kunstleve R. W., Adams P. D. (2005) Acta Crystallogr D. Biol. Crystallogr. 61, 850–855 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Afonine P. V., Grosse-Kunstleve R. W., Chen V. B., Headd J. J., Moriarty N. W., Richardson J. S., Richardson D. C., Urzhumtsev A., Zwart P. H., Adams P. D. (2010) J. Appl Crystallogr. 43, 669–676 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Vedadi M., Niesen F. H., Allali-Hassani A., Fedorov O. Y., Finerty P. J., Jr., Wasney G. A., Yeung R., Arrowsmith C., Ball L. J., Berglund H., Hui R., Marsden B. D., Nordlund P., Sundstrom M., Weigelt J., Edwards A. M. (2006) Proc. Natl. Acad. Sci. U.S.A. 103, 15835–15840 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Holm L., Sander C. (1993) J. Mol. Biol. 233, 123–138 [DOI] [PubMed] [Google Scholar]
- 30. Yakunin A. F., Proudfoot M., Kuznetsova E., Savchenko A., Brown G., Arrowsmith C. H., Edwards A. M. (2004) J. Biol. Chem. 279, 36819–36827 [DOI] [PubMed] [Google Scholar]
- 31. Proudfoot M., Kuznetsova E., Brown G., Rao N. N., Kitagawa M., Mori H., Savchenko A., Yakunin A. F. (2004) J. Biol. Chem. 279, 54687–54694 [DOI] [PubMed] [Google Scholar]
- 32. Seto D., Bhatnagar S. K., Bessman M. J. (1988) J. Biol. Chem. 263, 1494–1499 [PubMed] [Google Scholar]
- 33. An G., Justesen J., Watson R. J., Friesen J. D. (1979) J. Bacteriol. 137, 1100–1110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Lo M. C., Aulabaugh A., Jin G., Cowling R., Bard J., Malamas M., Ellestad G. (2004) Anal. Biochem. 332, 153–159 [DOI] [PubMed] [Google Scholar]
- 35. Pantoliano M. W., Petrella E. C., Kwasnoski J. D., Lobanov V. S., Myslik J., Graf E., Carver T., Asel E., Springer B. A., Lane P., Salemme F. R. (2001) J. Biomol. Screen 6, 429–440 [DOI] [PubMed] [Google Scholar]
- 36. Matulis D., Kranz J. K., Salemme F. R., Todd M. J. (2005) Biochemistry 44, 5258–5266 [DOI] [PubMed] [Google Scholar]
- 37. Kondo N., Nishikubo T., Wakamatsu T., Ishikawa H., Nakagawa N., Kuramitsu S., Masui R. (2008) Extremophiles 12, 217–223 [DOI] [PubMed] [Google Scholar]
- 38. Yang W., Lee J. Y., Nowotny M. (2006) Mol. Cell 22, 5–13 [DOI] [PubMed] [Google Scholar]
- 39. Freemont P. S., Friedman J. M., Beese L. S., Sanderson M. R., Steitz T. A. (1988) Proc. Natl. Acad. Sci. U.S.A. 85, 8924–8928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Beese L. S., Steitz T. A. (1991) EMBO J. 10, 25–33 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Armon A., Graur D., Ben-Tal N. (2001) J. Mol. Biol. 307, 447–463 [DOI] [PubMed] [Google Scholar]







