Skip to main content
Nature Portfolio logoLink to Nature Portfolio
. 2022 Mar 2;603(7900):343–347. doi: 10.1038/s41586-022-04470-1

Structural basis for mismatch surveillance by CRISPR–Cas9

Jack P K Bravo 1,#, Mu-Sen Liu 1,#, Grace N Hibshman 1,2, Tyler L Dangerfield 1,2, Kyungseok Jung 1, Ryan S McCool 1,2, Kenneth A Johnson 1,2,, David W Taylor 1,2,3,4,
PMCID: PMC8907077  PMID: 35236982

Abstract

CRISPR–Cas9 as a programmable genome editing tool is hindered by off-target DNA cleavage14, and the underlying mechanisms by which Cas9 recognizes mismatches are poorly understood57. Although Cas9 variants with greater discrimination against mismatches have been designed810, these suffer from substantially reduced rates of on-target DNA cleavage5,11. Here we used kinetics-guided cryo-electron microscopy to determine the structure of Cas9 at different stages of mismatch cleavage. We observed a distinct, linear conformation of the guide RNA–DNA duplex formed in the presence of mismatches, which prevents Cas9 activation. Although the canonical kinked guide RNA–DNA duplex conformation facilitates DNA cleavage, we observe that substrates that contain mismatches distal to the protospacer adjacent motif are stabilized by reorganization of a loop in the RuvC domain. Mutagenesis of mismatch-stabilizing residues reduces off-target DNA cleavage but maintains rapid on-target DNA cleavage. By targeting regions that are exclusively involved in mismatch tolerance, we provide a proof of concept for the design of next-generation high-fidelity Cas9 variants.

Subject terms: Enzyme mechanisms, Cryoelectron microscopy, DNA metabolism, Genetic engineering


Cryo-electron microscopy structures of Cas9 during mismatch cleavage provide insight into the mechanisms that control off-target effects of Cas9, which will aid in the future design of high-fidelity Cas9 variants with reduced off-target cleavage.

Main

For therapeutic applications of CRISPR–Cas9, off-target DNA cleavage must be minimized13. Although a variety of high-fidelity Cas9 variants with improved mismatch discrimination have been developed7,9, their enhanced specificity comes at the cost of severely reduced rates of on-target DNA cleavage5,11. Mismatches induce alternative Cas9 conformations12,13; however, the structures used to guide rational redesign of such variants were bound to on-target DNA and in inactive conformations14,15. To understand the molecular mechanisms that govern off-target recognition, here we used kinetic analysis to guide sample preparation for cryo-electron microscopy (cryo-EM) and obtained structural snapshots of Cas9 pre-cleavage activation intermediates in the presence of various guide RNA–DNA target strand (gRNA–TS) mismatches.

Kinetics of Cas9 on mismatched DNA

We measured the rates of target strand cleavage by Cas9 in the presence of contiguous triple nucleotide mismatches at different positions along the gRNA–TS duplex (Extended Data Fig. 1a, Extended Data Table 1). Compared to rapid on-target cleavage (around 1.0 s−1) the well-characterized protospacer adjacent motif (PAM)-distal 18–20 MM5,9,12,13 (three mismatches 18–20 bp distal from the PAM) caused a reduction in rate of around 40-fold. Other mismatches (6–8 MM, 9–11 MM and 15–17 MM) resulted in a greater-than-2,000-fold reduction in cleavage rates, with only 20% of the DNA cleaved after 2 h of incubation (Extended Data Fig. 1b).

Extended Data Fig. 1. Kinetic basis for mismatch discrimination by Cas9.

Extended Data Fig. 1

a, Schematic representation of mismatch constructs used for kinetic analysis. b, Time course of cleavage of on-target and mismatched DNA (10 nM) by Cas9. Magenta arrows correspond to time-points used to prepare cryo-EM samples. Aobs corresponds to amplitude of product formed (i.e. total cleavage). For 12–14 MM, target strand cleavage is shown with larger filled circles, while NTS cleavage is given with smaller open circles. For other mismatches we only show target strand cleavage. We previously reported NTS cleavage data for on-target28 and 18–20 MM substrates5.

Extended Data Table 1.

List of nucleotide sequences used in the study

graphic file with name 41586_2022_4470_Tab1_ESM.jpg

Notably, the 12–14 MM allowed Cas9 activation but with rates around 10-fold slower than those of the 18–20 MM. Although Cas9 cleavage is markedly slower for both 12–14 MM-and 18–20 MM-containing DNA than for on-target DNA, more than 80% of either substrate was cleaved within an hour of incubation with Cas9. This time frame for off-target cleavage poses problems for genome-editing applications, which typically occur on the time scale of days to weeks16.

Structures of Cas9 with mismatched DNA

To understand the structural basis for Cas9 activation of mismatched DNA, we vitrified Cas9 with 12–14 MM DNA after a 5-min reaction, in which only around 10% of DNA was cleaved (Extended Data Table 2). We determined a cryo-EM structure at a global resolutionof 3.6 Å (Fig. 1a, Extended Data Fig. 2, Extended Data Table 3). The target-strand-cleaving HNH endonuclease domain was not observed, indicating conformational heterogeneity before activation17,18. Of note, the distal end of the gRNA–TS duplex was in a linear conformation relative to the PAM-proximal DNA–DNA duplex—a state that differs from previously determined on-target DNA-bound Cas9 structures that depict a kinked duplex (around 70°)14,18, although this state is reminiscent of early R-loop formation intermediates19.

Extended Data Table 2.

Correlation between fraction of DNA cleaved and fraction of cryo-EM particles in linear or kinked duplex conformations

graphic file with name 41586_2022_4470_Tab2_ESM.jpg

Fig. 1. Mismatch-induced Cas9 conformational intermediates.

Fig. 1

a, Cryo-EM reconstructions of Cas9 in complex with various partially mismatched DNA substrates, determined at nominal resolutions ranging from 2.8 to 3.6 Å. Cryo-EM structures are coloured according to the domain map for Cas9. Nucleotides are coloured: target strand (TS), green; NTS, pink; and gRNA, red. The fraction of target strand DNA cleaved by Cas9 containing contiguous triple mismatches at the position and time point used for structural determination is shown above each structure. b, Domain organization of SpCas9. CTD, C-terminal domain. c, Models of Cas9 in complex with mismatched DNA substrates shown as isosurface representations. The angle between PAM-proximal and PAM-distal duplexes (θ) is shown. θ is equivalent to around 25º for all linear conformations observed.

Extended Data Fig. 2. Resolution estimates and orientation distributions of cryo-EM maps.

Extended Data Fig. 2

a, Unsharpened maps coloured according to local resolution. b, Gold-standard FSC curves for cryo-EM reconstructions. Resolutions were estimated at FSC=0.143. c, Euler diagrams showing orientation distributions of cryo-EM reconstructions.

Extended Data Table 3.

Cryo-EM data collection, refinement and validation statistics

graphic file with name 41586_2022_4470_Tab3_ESM.jpg

We then vitrified samples of Cas9 with 12–14 MM DNA after a 1-h incubation in which around 80% of the DNA was cleaved (Fig. 1b). Two distinct conformations were observed: a linear duplex conformation consistent with the 5-min structure of 12–14 MM and the kinked duplex conformation described above (Fig. 1a, c). The Cas9 conformations in the two 12–14 MM structures are identical (Fig. 2), but the PAM-distal gRNA–TS duplex end was shifted by around 30 Å and stably docked with REC3 (Fig. 2c). We propose that the linear duplex conformation corresponds to an early intermediate of Cas9, before HNH rearrangement and docking to cleave the DNA9,18. This is supported by recent structural analyses of catalytically dead Cas9 in complex with various R-loop formation intermediates, several of which exhibit linear gRNA–TS duplex conformations that are similar to our linear duplex structures20.

Fig. 2. Positions 12–14 of the gRNA–TS duplex occupy a blind spot for REC3 mismatch detection.

Fig. 2

a, b, Structures of 12–14 MM at 5 min (a) and 1 h (b) in linear and kinked conformations, respectively. The position of the 12–14 MM is shown as light green and light pink for the gRNA and the target strand, respectively. Models are shown as isosurface representations. c, Conformational change of the PAM-distal gRNA–TS duplex. The Cas9 protein structure is largely unchanged (root-mean-square deviation (RMSD) of less than 2 Å for equivalent C-alpha atoms), but the PAM-distal gRNA–TS duplex end undergoes a 30 Å conformational change, docking with REC3. d, Close-up view of positions 12–14, showing that because of the phase of the gRNA–TS duplex, REC3 makes no contacts with these base pairs. e, Schematic of interactions between REC3 and positions 9–17 of the gRNA–TS duplex. No interactions occur between Cas9 REC3 and positions 12–14 MM. Position 1 of the duplex is the first base of the target strand that hybridizes with the gRNA spacer.

Notably, positions 12–14 of the gRNA–TS make no direct contacts with the REC3 domain of Cas9 (Fig. 2). Although positions 9–11 and 15–17 make considerable contacts with REC3, the alignment of the gRNA–TS duplex leaves positions 12–14 without any engagement with this domain (Fig. 2d, e). Because REC3 has a critical role in sensing PAM-distal mismatches9, the 12–14 MM is likely to be able to evade mismatch discrimination by REC3 as it is positioned in a blind spot.

We reasoned that mismatches that prevent the PAM-distal gRNA–TS duplex from docking on REC3 would be unable to assume the kinked conformation, leading to considerably reduced DNA cleavage. To test this hypothesis, we determined a structure of Cas9 with 15–17 MM double-stranded DNA (dsDNA) substrate after 1 h of incubation with the enzyme (Fig. 1b). This mismatch inhibits cleavage by Cas9, but still permits DNA binding as measured by high-throughput profiling21. We observed only the linear duplex conformation (Fig. 1a, c). These structures support a model in which a linear duplex conformation precedes the canonical kinked duplex conformation that is required for activation, and mismatches that block formation of the kinked conformation escape DNA cleavage by Cas9.

The 18–20 mismatch supports Cas9 activation

We next sought to understand how certain mismatches can evade Cas9 discrimination to allow more efficient Cas9 activation and DNA cleavage relative to other mismatches. We examined Cas9 after incubation with 18–20 MM DNA at the 1-min time point at which around 65% of the DNA was cleaved (Extended Data Fig. 1b), to determine whether this more tolerated mismatch undergoes the same structural transition as that of 12–14 MM DNA. Consistent with the fraction of product formation, we observed a mixed population of particles including the linear (Fig. 1a, c) and the kinked duplex conformation. In the kinked duplex structure, we observed HNH docked at the target site scissile phosphate, indicating the fully active conformation. This arrangement of HNH is entirely consistent with the previously observed active Cas9 conformation12,18. These results suggest that the population of particles showing a linear conformation represents an early intermediate in the pathway, and that the kinking of the gRNA–TS duplex is linked to HNH docking.

We observed target strand cleavage between nucleotides 3 and 4 (Fig. 3, Extended Data Fig. 3) and non-target strand (NTS) cleavage at the canonical site three bases upstream from the PAM. We report a direct observation of an RuvC active site with the non-target strand bound in the product state (Fig. 3, Extended Data Fig. 3). R986 is in the ‘down’ conformation, stabilizing the two magnesium ions as predicted by molecular dynamics simulations22 (Fig. 3), whereas F916 wedges between the −2 and −3 bases through stacking interactions and positions the −3 position within the RuvC active site. These observations are in agreement with previous structural and mutagenesis studies23,24. Our structure suggests a histidine-mediated catalytic mechanism, consistent with two-metal-ion-dependent catalysis25 and supported by quantum-classical simulations26. Furthermore, our product state reveals that the two Mg2+ ions are around 4.2 Å from each other, in agreement with the product state of the histidine-mediated mechanism (Extended Data Fig. 3).

Fig. 3. Linkers L1 and L2 mediate the structural transition to the active state.

Fig. 3

a, Overview of the 18–20 MM active conformation. b, c, Detailed view of HNH (b) and RuvC (c) active sites. d, Docking of the L1 linker helix against the PAM-distal gRNA–TS duplex, shown as an isosurface representation. e, Interactions of L1 and L2 regions with the minor groove of the gRNA–TS duplex. HNH extending from L1 and L2 linkers has been removed for clarity and does not interact with this region of the gRNA–TS duplex.

Extended Data Fig. 3. Representative cryo-EM densities for 18–20 MM 1-min kinked (product) structure.

Extended Data Fig. 3

a, HNH active site, showing cleaved target strand. b, L1 linker docked on PAM-distal kinked gRNA–TS duplex. Two water molecules are involved within the network of interactions that stabilize the L1 helix conformation. c, RuvC active site, showing cleaved NTS, and positioning of two Mg2+ ions. d, RuvC DNA cleavage mechanism. This is a typical two-metal-ion mechanism as described in25 and agrees with QM/MM simulations for histidine-mediated activation26, and the proposed mechanisms of Cas12j and Cas12i41,42.

The fully active configuration requires marked conformational rearrangements, including an approximately 140° rotation of the HNH domain from the inactive state. Furthermore, our structures reveal the molecular mechanisms that underlie this rearrangement. The L1 and L2 linker domains tether HNH to the rest of Cas9 and are often missing from crystal structures, presumably owing to their intrinsic flexibility. However, in our active structure, we observe high-quality density for both L1 and L2. Notably, the L1 helix docks against the minor groove of the PAM-distal gRNA–TS duplex and forms an extended network of interactions, including multiple water-mediated hydrogen bonds with both strands (Fig. 3). As L1 docks on the minor groove, these interactions are gRNA–TS structure-specific rather than sequence-specific and can only occur when the PAM-distal duplex end is in the kinked conformation. This provides a structural basis for our observation that the kinked duplex conformation is an intermediate that precedes Cas9 activation and DNA cleavage. Comparisons of our model with Cas9 structures in inactive (Electron Microscopy Data Bank (EMDB) code EMD-3276) and active (EMD-0584) conformations confirmed that L1 docking against the gRNA–TS duplex is correlated with HNH rearrangement and Cas9 activation (Extended Data Fig. 4). Furthermore, our observation of L1 and L2 ‘locking’ HNH in an active conformation is supported by the slow rate of dissociation of Cas9 from target DNA after cleavage27.

Extended Data Fig. 4. Structural analysis of Cas9.

Extended Data Fig. 4

a, Left, comparison of Cas9 protein only between 12–14 MM 60 min linear (colour) and 12–14 MM 1-h kinked (grey) models. Right, comparison of Cas9 protein only active conformation (18–20 MM 1 min linear, colour) and kinked pre-active (12–14 MM 60 min kinked, grey) models. While there is no significant conformational change associated between transition from linear to kinked pre-active (root-mean standard deviation (RMSD) between equivalent Cα atoms of 1.904 Å), the change from kinked pre-active to active conformations is associated with a larger conformational change (4.647 Å, most of which occurs within the REC3 domain). b, Close-up view of REC3 conformational changes that occur upon activation, as viewed from one angle. REC3 moves forwards towards the kinked duplex by ~15 Å upon activation and HNH repositioning. c, Schematic representation of Cas9–nucleic acid contacts in the context of 18–20 MM. Residues mutated in SuperFi-Cas9 are denoted by an asterisk. d, Conformations of HNH domain (green) and L1 (gold) and L2 (purple) linkers in the context of Cas9 binary complex (i.e. with gRNA, PDB 4ZT0), Cas9–gRNA complex bound to dsDNA in an inactive conformation (PDB 5F9R), and in the active Cas9 18–20 MM structure presented in this work. Upon activation, HNH is repositioned at the target strand cleavage site, driven by large conformational changes in the L1 and L2 linkers. e, Comparison with the active Cas9 18–20 MM structure presented in this work and previously determined cryo-EM maps (transparent grey) of inactive (left, EMD-327614) and active (right, EMD-058418) Cas9 bound to on-target dsDNA. The inactive Cas9 has no density for L1 helix at the kinked distal-docked gRNA–TS site, whereas there is clear density for L1 at this site in the active Cas9 cryo-EM map. f, Mapping of residues mutated to alanine in selected high-fidelity Cas9 variants. EvoCas9 (yellow) – M495, Y515, R661, K526. Cas9-HF1 (red) – N497, R661, Q695, Q926. HypaCas9 (blue) – N692, M694, Q695, H698. Residues shared between Cas9-HF-1 and either EvoCas9 or HypaCas9 are shown as orange and purple, respectively.

Residue F916 stabilizes the NTS and is within the L2 linker domain; however, within the inactive Cas9 conformation, L2 is positioned more than 20 Å away from the RuvC active site. L1-facilitated positioning of HNH on the target strand enables relocation of L2, which in turn enables positioning of the NTS within the RuvC active site (Extended Data Fig. 4). This mechanism provides a structural explanation for the observed coupling of target strand and NTS cleavage, in which HNH docking precedes alignment of the NTS at the RuvC site for cleavage5,28. The HNH and RuvC cleavage reactions appear to occur simultaneously because the alignment is rate-limiting.

Although previous studies have noted the importance of L1 docking onto the gRNA–TS duplex for HNH repositioning23,29, our observation that a linear gRNA–TS duplex conformation induced by PAM-distal mismatches precludes L1 docking provides a structural explanation for why certain PAM-distal mismatched substrates are able to bind Cas9, while not triggering rapid DNA cleavage21.

The 18–20 mismatch reorders an RuvC loop

The 18–20 MM contains an unusual duplex conformation at the site of the mismatch. The C:C mismatch at position 18 on the target strand, TS(18), is stabilized by stacking interactions with adjacent Watson–Crick base pairs. However, the gRNA is otherwise distorted with gRNA position 2 (gRNA(2)) flipped out by around 180º so that gRNA(1) then intercalates between TS(19) and TS(20). TS(19) participates in water-mediated hydrogen bonds to Q1027, and TS(20) resumes base-pairing with NTS (Fig. 4, Extended Data Fig. 5).

Fig. 4. Stabilization of distorted 18–20 MM by the RuvC domain and improved fidelity of SuperFi-Cas9.

Fig. 4

a, Overall structure of the 18–20 MM active conformation viewed from the back. b, c, Magnified views of Cas9 interacting with the distal end of the duplex. Flipped gRNA base position 2 is accommodated by stacking interactions and hydrogen bonding with RuvC tyrosine side-chains, whereas a network of interactions (including a water-mediated hydrogen bond) stabilizes the stretched target strand configuration, which allows TS(20) to resume base-pairing with the NTS. d, Schematic of distorted PAM-distal gRNA–TS duplex. Red circles correspond to water molecules. e, Kinetics of on-target and off-target (18–20 MM) Mg2+-initiated cleavage by the 7-D Cas9 mutant (SuperFi-Cas9). f, g, Cleavage competition assay for wild-type Cas9 (f) and SuperFi-Cas9 (g). 25 nM of either Cas9 variant was mixed with 50 nM of each substrate and the cleaved DNA product was monitored. Discrimination in favour of the on-target DNA is defined by the ratio of amplitudes for on-target and off-target product formed.

Extended Data Fig. 5. Representative cryo-EM density for the RuvC loop.

Extended Data Fig. 5

Two different views are shown (a, b). Unsharpened and B-factor sharpened maps are shown for each view with the RuvC loop shown as dark magenta. Key residues involved in stabilizing this distorted conformation are labelled.

This unusual nucleic acid conformation is stabilized by RuvC and appears to facilitate the binding of this mismatch. The residues within RuvC that contact and stabilize this distorted configuration are absent in previous on-target structures14,15,18,30 (Extended Data Fig. 6), despite the overall similarity between our model and a previously determined active on-target Cas9 (Extended Data Fig. 7). This indicates that these resolved RuvC residues are involved only in mismatch binding and not in on-target activation (Fig. 4). Although this mechanism to accommodate certain mismatches may provide an essential mechanism for bacteria to restrict phage variants, it is counterproductive for the use of Cas9 in gene editing.

Extended Data Fig. 6. RuvC loop in on-target SpCas9 structures.

Extended Data Fig. 6

a, On-target inactive Cas9 bound to dsDNA (PDB 4UN3)15. RuvC loop is missing between 1013–1029. b, On-target inactive (primed – HNH rearranged and adjacent to target strand scissile phosphate) Cas9 bound to dsDNA (PDB 5F9R)14. RuvC loop has been built primarily as alanine ‘stub’ residues, but electron density is very poor and diffuse for this region. c, On-target inactive Cas9 bound to dsDNA (PDB 4OO8)43. RuvC loop is missing between 1017–1028. d, On-target active Cas9 bound to dsDNA in postcatalysis state18. RuvC loop is missing between 1001–1077. e, On-target active Cas9 bound to dsDNA in product state18. RuvC loop is missing between 1000–1075. In ac, electron density is displayed as a grey surface, and in d, e cryo-EM density is shown as a grey surface. In all structures, missing residues are depicted as a red dashed line with the RuvC loop in b shown as magenta. Position of RuvC loop is denoted by a black dashed box in the left panel for each model.

Extended Data Fig. 7. Comparison of Cas9 with previous structures.

Extended Data Fig. 7

a, Comparison of 18–20 MM kinked product state Cas9 with a selection of previously determined structures. RMSD between equivalent C-alpha atoms is shown. b, Alignment of HNH from the 18–20 MM kinked product state presented here (transparent grey) and the previously determined ‘post-catalysis’ state (PDB 6O0Y). The catalytically competent HNH conformation between these two structures is highly similar.

Previous rationally engineered variants ‘hyper-accurate Cas9’ (HypaCas9; N692A, M694A, Q695A and H698A mutations) and ‘high-fidelity Cas9’ (Cas9-HF1; N467A, R661A, Q695A and Q926A mutations) achieve somewhat higher fidelity at the expense of up to 100-fold reduced efficiency of on-target DNA cleavage5,8,9. The mutated residues are mainly located within the REC3 domain and make numerous interactions only with the kinked duplex end. Therefore, by abolishing interactions between REC3 and the PAM-distal duplex, these high-fidelity variants reduce the capacity of Cas9 to stabilize the kinked duplex configuration that is required for the docking of L1, and thereby reduce HNH repositioning and cleavage activity. Our data provide a structural explanation for why these high-fidelity Cas9 variants reduce the activation of Cas99 by off-target substrates, but also reduce on-target Cas9 activity.

To test the role of this loop for mismatch stabilization, we designed a 7-D mutant (in which all seven of the stabilizing residues in Fig. 4b are mutated to aspartic acid) and tested the effects of this mutant on DNA cleavage. Although this 7-D mutant cleaved on-target DNA at a similar rate to wild-type Streptococcus pyogenes Cas9 (SpCas9) (2 s−1), we observed that cleavage of 18–20 MM DNA was 500-fold slower (0.004 s−1) (Fig. 4e). This indicates that this loop is critical for stabilizing the distorted mismatch-induced PAM-distal duplex conformation, thereby allowing the duplex to adopt the kinked conformation that is prerequisite for Cas9 activation. We refer to our designed high-fidelity variant that retains wild-type on-target cleavage rates as ‘SuperFi-Cas9’.

Because enzyme specificity is a kinetic phenomenon that is not determined solely by the rates of the chemical reaction, we performed a direct competition assay, in which on-target and off-target (18–20 MM) dsDNA substrates were mixed simultaneously with enzyme and cleavage was monitored over time. Although wild-type Cas9 showed some preference for on-target substrates (a 1.55-fold specificity ratio favouring the on-target over 18–20 MM off-target DNA), SuperFi-Cas9 showed rapid cleavage of on-target DNA and minimal cleavage of 18–20 MM DNA (6.3-fold preference for on-target DNA) (Fig. 4f, g). The ability to discriminate between on- and off-target DNA substrates without compromising DNA cleavage efficiency appears to be unique to SuperFi-Cas911. Although further studies are needed to fully define the kinetic basis for the change in discrimination, our current data constitute a proof of concept and provide a rationale for engineering improved variants of Cas9 using our structure.

Discussion

Through kinetics-guided structural determination, we have described a gRNA–TS duplex conformational intermediate that precedes Cas9 activation (Fig. 5). Notably, we observe that the well-characterized and widespread off-target cleavage of DNA containing mismatches at the extreme PAM-distal end (positions 18–20 (refs.5,9,12,31,32)) is attributed to a unique mechanism that stabilizes a highly distorted duplex conformation, involving a domain loop in RuvC that penetrates the duplex. This region is missing in previously determined structures of Cas9, which suggests that it has a role solely in mismatch tolerance at these positions. Our results provide molecular insights into the underlying structural mechanisms that govern off-target effects of Cas9, and provide a molecular blueprint for the design of next-generation high-fidelity Cas9 variants that reduce off-target DNA cleavage while retaining efficient cleavage of on-target DNA.

Fig. 5. Model for Cas9 activation.

Fig. 5

During R-loop propagation (step 1), the gRNA–TS duplex adopts a linear conformation. After R-loop completion, the PAM-distal end of the linear duplex is captured by REC3 (steps 2 and 3). Mismatches in the PAM-distal region appear to prevent REC3 docking and thereby block subsequent steps of Cas9 activation. Once the kinked R-loop conformation has been formed, L1 and L2 linkers use the gRNA–TS duplex as a scaffold to position the HNH domain at the scissile phosphate of the target strand and to position the NTS in the RuvC site (step 4), which enables Cas9 to make a double-strand break (step 5). According to this model, mutations in the RuvC loop (corresponding to SuperFi-Cas9) inhibit formation of the kinked conformation and subsequent cleavage of the gRNA–TS duplex with mismatches at the PAM-distal end.

Methods

Protein expression and purification

SpCas9 was expressed and purified as described previously5.

Nucleic acid preparation

DNA duplexes (55 nt) were prepared from PAGE-purified oligonucleotides synthesized by Integrated DNA Technologies. DNA duplexes used in cleavage assays were prepared by mixing 6-FAM- or Cy3-labelled target strands with unlabelled non-target strands at a 1:1.15 molar ratio in annealing buffer (10 mM Tris-HCl pH 8, 50 mM NaCl and 1 mM EDTA), heating to 95 °C for 5 min, then cooling to room temperature over the course of 1 h. The sgRNA was purchased from Synthego and annealed in annealing buffer using the same protocol as for the duplex DNA substrates. The sequences of the synthesized oligonucleotides, including the positions of mismatches, are listed in Extended Data Table 1.

Kinetics

Buffer composition for kinetic reactions

Cleavage reactions were performed in 1× cleavage buffer (20 mM Tris-Cl, pH 7.5, 100 mM KCl, 5% glycerol and 1 mM DTT) at 37 °C.

DNA cleavage kinetics

The reaction of Cas9 with on- and off-target DNA was performed by preincubating Cas9.gRNA (28 nM active-site concentration of Cas9, 100 nM gRNA) with 10 nM DNA with a 6-FAM label on the target strand in the absence of Mg2+. The reaction was initiated by adding Mg2+ to 10 mM, then stopped at various times by mixing with 0.3 M EDTA (Extended Data Fig. 1). Products of the reaction were resolved and quantified using an Applied Biosystems DNA sequencer (ABI 3130xl)33. Data were fit using either a single or a double-exponential equation, as shown below.

Single exponential equation:

Y=A1eλ1t+C 1

in which Y represents the concentration of the cleavage product, A1 represents the amplitude and λ1 represents the observed decay rate (eigenvalue). The half-life was calculated as t1/2 = ln(2)/λ1.

Double exponential equation:

Y=A1eλ1tA2eλ2t+C 2

in which Y represents the concentration of the cleavage product, A1 represents the amplitude and λ1 represents the observed rate for the first phase. A2 represents the amplitude and λ2 represents the observed rate for the second phase.

Kinetic competition assay

Enzyme specificity is a kinetic phenomenon that is a function of all steps leading up to and including the first largely irreversible step in the pathway and it is common for mutants to introduce a change in specificity determining steps34. Therefore, we designed an assay to monitor relative rates of cleavage for on- and off-target DNA when the enzyme was presented with both substrates simultaneously. The competition assay was performed by mixing a solution of 25 nM (active site concentration) Cas9 and 100 nM sgRNA, in the presence of 10 mM Mg2+, with 50 nM on-target DNA and 50 nM off-target DNA, in which the DNA contained a 5′-6-FAM label or a 5′-Cy3 label on the target or off-target DNA, respectively. Time points were collected by mixing with 0.3 M EDTA and reaction products were resolved and quantified by capillary electrophoresis, as described above. On-target cleavage data were fit to a single exponential function and off-target cleavage data were fit to a double exponential function. Discrimination was calculated as the ratio of the total amplitude of on-target cleavage divided by the amplitude for off-target cleavage to derive the relative specificity constants for the on-target DNA compared to the off-target DNA.

Cryo-EM sample preparation, data collection and processing

Cas9 in complex with various mismatched DNA substrates was frozen at different time points, on the basis of kinetic analysis (Extended Data Fig. 1). A non-productive mismatch complex (15–17 MM, 1 h); a slow productive mismatch (12–14) at early (5 min) and late (1 h) time points; and a fast productive mismatch (18–20, 1 min) were chosen. MDCC-Cas9 was used for structure determination to couple structural analysis with ongoing kinetic studies monitoring changes in fluorescence. It has previously been shown that the kinetics of MDCC-Cas9 were indistinguishable from those of wild-type enzyme5. The cleavage reaction was triggered by mixing 10 µM DNA duplex preincubated with 10 mM MgCl2 and 8 µM MDCC-labelled Cas9: 8 µM gRNA was preincubated with 10 mM MgCl2, in reaction buffer (19 mM Tris-Cl, pH 7.5, 95 mM KCl, 4.75% glycerol and 5 mM DTT) at a 1:1 ratio. Four microlitres of sample was applied to glow-discharged holey carbon grids (C-flat 2/2, Protochips), blotted for 1 s with a blot force of 4 and rapidly plunged into liquid nitrogen-cooled ethane using an FEI Vitrobot MarkIV. Reactions were quenched through vitrification.

Data were collected on an FEI Titan Krios cryo-electron microscope equipped with a K3 Summit direct electron detector (Gatan). Images were recorded with SerialEM35 with a pixel size of 1.1 Å for 12–14 MM datasets, and 0.81 Å for 18–20 MM and 15–17 MM datasets, over a defocus range of −1.5 to −2.5 µm. During collection of the 12–14 MM 5-min time-point dataset, a preferred orientation was observed. To ameliorate this, a second dataset was collected at 30° tilt. Movies were recorded at 13.3 electrons per pixel per s for 6 s (80 frames) to give a total dose of 80 electrons per pixel. CTF correction, motion correction and particle picking were performed in real-time using cryoSPARC Live. Further data processing was performed with cryoSPARC v.3.236.

Multiple rounds of 3D classification within cryoSPARC yielded reconstructions of six distinct Cas9 complexes at resolutions ranging from 2.7 to 3.6 Å (Extended Data Table 3). To aid the separation of multiple Cas9 conformational states from within the same dataset, 3D variability analysis was performed within CryoSPARC. First and last frames from suitable eigenvector trajectory were then used as references for heterogeneous refinement (that is, reference-based 3D classification), and particles from resulting classes were refined using non-uniform refinement and used for final reconstructions37. Active Cas9 (Protein Data Bank (PDB) code: 6O0X) was rigid-body fitted into each map using ChimeraX38. Regions of the model not present in a given map were truncated, and flexible fitting was performed using Namdinator39. Further modelling was performed using Isolde40, and the models were ultimately subjected to real-space refinement as implemented in PHENIX.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.

Online content

Any methods, additional references, Nature Research reporting summaries, source data, extended data, supplementary information, acknowledgements, peer review information; details of author contributions and competing interests; and statements of data and code availability are available at 10.1038/s41586-022-04470-1.

Supplementary information

Reporting Summary (1.7MB, pdf)
Peer Review File (882.4KB, docx)

Acknowledgements

This work was supported in part by Welch Foundation grants F-1604 (to K.A.J.) and F-1938 (to D.W.T.), and by a Robert J. Kleberg, Jr. and Helen C. Kleberg Foundation Medical Research Grant (to D.W.T.). D.W.T is a CPRIT Scholar supported by the Cancer Prevention and Research Institute of Texas (RR160088). We thank G. Palermo and members of her group for discussions.

Extended data figures and tables

Author contributions

J.P.K.B. prepared samples for and performed cryo-EM, structure determination and modelling. M.-S.L. performed initial kinetic studies. K.J. purified SpCas9 and MDCC-Cas9 used for structure determination and kinetic analysis. R.S.M. assisted with preliminary analysis of the 12–14 MM 5-min structure. G.N.H. cloned, expressed and purified SuperFi-Cas9 mutants. G.N.H. and T.L.D. performed kinetic analysis of SuperFi-Cas9 versus wild-type enzyme. J.P.K.B., M.-S.L., D.W.T., T.L.D. and K.A.J. analysed and interpreted the data and wrote the manuscript. D.W.T. and K.A.J. supervised and secured funding for the studies.

Peer review

Peer review information

Nature thanks Daan Swarts, John van der Oost and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Data availability

The structures of 12–14 MM 5 min, 12–14 MM 60-min linear and 18–20 MM 1-min kinked active, and their associated atomic coordinates, have been deposited into the EMDB and the PDB with EMDB accession codes EMD-24833, EMD-24835 and EMD-24838 and PDB accession codes7S4U, 7S4V and 7S4X, respectively. Maps of 12–14 MM 60-min linear, 15–17 MM 60-min linear and 18–20 1-min linear have been deposited into the EMDB with accession codes EMD-23834, EMD-24836 and EMD-24837, respectively.

Competing interests

J.P.K.B., M.-S.L., G.N.H., T.L.D., K.A.J. and D.W.T. are inventors on a patent application based on this research titled ‘Methods and compositions for improved Cas9 specificity’ filed by the Board of Regents, The University of Texas System. The US Patent and Trademark Office (USPTO) has assigned US application no. 63/243,481 to this application, and the filing date of 13 September 2021. K.A.J. is the president of KinTek, which provided the chemical-quench flow instruments and the KinTek Explorer software used in this study.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Jack P. K. Bravo, Mu-Sen Liu

Change history

3/22/2022

A Correction to this paper has been published: 10.1038/s41586-022-04655-8

Contributor Information

Kenneth A. Johnson, Email: kajohnson@utexas.edu

David W. Taylor, Email: dtaylor@utexas.edu

Extended data

is available for this paper at 10.1038/s41586-022-04470-1.

Supplementary information

The online version contains supplementary material available at 10.1038/s41586-022-04470-1.

References

  • 1.Jinek M, et al. RNA-programmed genome editing in human cells. eLife. 2013;2:e00471. doi: 10.7554/eLife.00471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cong L, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Fu Y, et al. High-frequency off-target mutagenesis induced by CRISPR–Cas nucleases in human cells. Nat. Biotechnol. 2013;31:822–826. doi: 10.1038/nbt.2623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Doudna JA. The promise and challenge of therapeutic genome editing. Nature. 2020;578:229–236. doi: 10.1038/s41586-020-1978-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Liu M, et al. Engineered CRISPR/Cas9 enzymes improve discrimination by slowing DNA cleavage to allow release of off-target DNA. Nat. Commun. 2020;11:3576. doi: 10.1038/s41467-020-17411-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kim D, Luk K, Wolfe SA, Kim JS. Evaluating and enhancing target specificity of gene-editing nucleases and deaminases. Annu. Rev. Biochem. 2019;88:191–220. doi: 10.1146/annurev-biochem-013118-111730. [DOI] [PubMed] [Google Scholar]
  • 7.Slaymaker IM, Gaudelli NM. Engineering Cas9 for human genome editing. Curr. Opin. Struct. Biol. 2021;69:86–98. doi: 10.1016/j.sbi.2021.03.004. [DOI] [PubMed] [Google Scholar]
  • 8.Kleinstiver BP, et al. High-fidelity CRISPR–Cas9 nucleases with no detectable genome-wide off-target effects. Nature. 2016;529:490–495. doi: 10.1038/nature16526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Chen JS, et al. Enhanced proofreading governs CRISPR–Cas9 targeting accuracy. Nature. 2017;550:407–410. doi: 10.1038/nature24268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Slaymaker IM, et al. Rationally engineered Cas9 nucleases with improved specificity. Science. 2016;351:84–88. doi: 10.1126/science.aad5227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Kim N, et al. Prediction of the sequence-specific cleavage activity of Cas9 variants. Nat. Biotechnol. 2020;38:1328–1336. doi: 10.1038/s41587-020-0537-9. [DOI] [PubMed] [Google Scholar]
  • 12.Sternberg SH, Lafrance B, Kaplan M, Doudna JA. Conformational control of DNA target cleavage by CRISPR–Cas9. Nature. 2015;527:110–113. doi: 10.1038/nature15544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Singh D, et al. Mechanisms of improved specificity of engineered Cas9s revealed by single-molecule FRET analysis. Nat. Struct. Mol. Biol. 2018;25:347–354. doi: 10.1038/s41594-018-0051-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Jiang F, et al. Structures of a CRISPR–Cas9 R-loop complex primed for DNA cleavage. Science. 2016;351:867–871. doi: 10.1126/science.aad8282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Anders C, Niewoehner O, Duerst A, Jinek M. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature. 2014;513:569–573. doi: 10.1038/nature13579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Ran FA, et al. Genome engineering using the CRISPR–Cas9 system. Nat. Protoc. 2013;8:2281–2308. doi: 10.1038/nprot.2013.143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Dagdas YS, Chen JS, Sternberg SH, Doudna JA, Yildiz A. A conformational checkpoint between DNA binding and cleavage by CRISPR–Cas9. Sci. Adv. 2017;3:eaao0027. doi: 10.1126/sciadv.aao0027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhu X, et al. Cryo-EM structures reveal coordinated domain motions that govern DNA cleavage by Cas9. Nat. Struct. Mol. Biol. 2019;26:679–685. doi: 10.1038/s41594-019-0258-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Cofsky, J. C., Soczek, K. M., Knott, G. J., Nogales, E. & Doudna, J. A. CRISPR–Cas9 bends and twists DNA to read its sequence. Preprint at 10.1101/2021.09.06.459219 (2021). [DOI] [PMC free article] [PubMed]
  • 20.Pacesa, M. & Jinek, M. Mechanism of R-loop formation and conformational activation of Cas9. Preprint at 10.1101/2021.09.16.460614 (2021).
  • 21.Jones SK, et al. Massively parallel kinetic profiling of natural and engineered CRISPR nucleases. Nat. Biotechnol. 2021;39:84–93. doi: 10.1038/s41587-020-0646-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Palermo G. Structure and dynamics of the CRISPR–Cas9 catalytic complex. J. Chem. Inf. Model. 2019;59:2394–2406. doi: 10.1021/acs.jcim.8b00988. [DOI] [PubMed] [Google Scholar]
  • 23.Zhang Y, et al. Catalytic-state structure and engineering of Streptococcus thermophilus Cas9. Nat. Catal. 2020;3:813–823. doi: 10.1038/s41929-020-00506-9. [DOI] [Google Scholar]
  • 24.Jinek M, et al. Structures of Cas9 endonucleases reveal RNA-mediated conformational activation. Science. 2014;343:1247997. doi: 10.1126/science.1247997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Steitz TA, Steitz JA. A general two-metal-ion mechanism for catalytic RNA. Proc. Natl Acad. Sci. USA. 1993;90:6498–6502. doi: 10.1073/pnas.90.14.6498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Casalino L, Nierzwicki Ł, Jinek M, Palermo G. Catalytic mechanism of non-target DNA cleavage in CRISPR–Cas9 revealed by ab initio molecular dynamics. ACS Catal. 2020;10:13596–13605. doi: 10.1021/acscatal.0c03566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Aldag P, et al. Probing the stability of the SpCas9–DNA complex after cleavage. Nucleic Acids Res. 2021;49:12411–12421. doi: 10.1093/nar/gkab1072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Gong S, Yu HH, Johnson KA, Taylor DW. DNA unwinding is the primary determinant of CRISPR–Cas9 activity. Cell Rep. 2018;22:359–371. doi: 10.1016/j.celrep.2017.12.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sun W, et al. Structures of Neisseria meningitidis Cas9 complexes in catalytically poised and anti-CRISPR-inhibited states. Mol. Cell. 2019;76:938–952. doi: 10.1016/j.molcel.2019.09.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Nishimasu H, et al. Crystal structure of Cas9 in complex with guide RNA and target DNA. Cell. 2014;156:935–949. doi: 10.1016/j.cell.2014.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Tsai SQ, et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat. Biotechnol. 2015;33:187–198. doi: 10.1038/nbt.3117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kuscu C, Arslan S, Singh R, Thorpe J, Adli M. Genome-wide analysis reveals characteristics of off-target sites bound by the Cas9 endonuclease. Nat. Biotechnol. 2014;32:677–683. doi: 10.1038/nbt.2916. [DOI] [PubMed] [Google Scholar]
  • 33.Dangerfield TL, Huang NZ, Johnson KA. High throughput quantification of short nucleic acid samples by capillary electrophoresis with automated data processing. Anal. Biochem. 2021;629:114239. doi: 10.1016/j.ab.2021.114239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Johnson, K. A. Kinetic Analysis for the New Enzymology (KinTek, 2019).
  • 35.Mastronarde DN. Automated electron microscope tomography using robust prediction of specimen movements. J. Struct. Biol. 2005;152:36–51. doi: 10.1016/j.jsb.2005.07.007. [DOI] [PubMed] [Google Scholar]
  • 36.Punjani A, Rubinstein JL, Fleet DJ, Brubaker MA. CryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods. 2017;14:290–296. doi: 10.1038/nmeth.4169. [DOI] [PubMed] [Google Scholar]
  • 37.Punjani A, Zhang H, Fleet DJ. Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstruction. Nat. Methods. 2020;17:1214–1221. doi: 10.1038/s41592-020-00990-8. [DOI] [PubMed] [Google Scholar]
  • 38.Pettersen EF, et al. UCSF ChimeraX: structure visualization for researchers, educators, and developers. Protein Sci. 2021;30:70–82. doi: 10.1002/pro.3943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kidmose RT, et al. Namdinator—automatic molecular dynamics flexible fitting of structural models into cryo-EM and crystallography experimental maps. IUCrJ. 2019;6:526–531. doi: 10.1107/S2052252519007619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Croll TI. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr. D. 2018;74:519–530. doi: 10.1107/S2059798318002425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Pausch P, et al. DNA interference states of the hypercompact CRISPR–CasΦ effector. Nat. Struct. Mol. Biol. 2021;28:652–661. doi: 10.1038/s41594-021-00632-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Huang X, et al. Structural basis for two metal-ion catalysis of DNA cleavage by Cas12i2. Nat. Commun. 2020;11:5241. doi: 10.1038/s41467-020-19072-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Nishimasu H, et al. Crystal structure of Staphylococcus aureus Cas9. Cell. 2015;162:1113–1126. doi: 10.1016/j.cell.2015.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Reporting Summary (1.7MB, pdf)
Peer Review File (882.4KB, docx)

Data Availability Statement

The structures of 12–14 MM 5 min, 12–14 MM 60-min linear and 18–20 MM 1-min kinked active, and their associated atomic coordinates, have been deposited into the EMDB and the PDB with EMDB accession codes EMD-24833, EMD-24835 and EMD-24838 and PDB accession codes7S4U, 7S4V and 7S4X, respectively. Maps of 12–14 MM 60-min linear, 15–17 MM 60-min linear and 18–20 1-min linear have been deposited into the EMDB with accession codes EMD-23834, EMD-24836 and EMD-24837, respectively.


Articles from Nature are provided here courtesy of Nature Publishing Group

RESOURCES