Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Jul 14.
Published in final edited form as: Mol Cell. 2025 Apr 23;85(9):1730–1742.e9. doi: 10.1016/j.molcel.2025.03.024

Rapid two-step target capture ensures efficient CRISPR-Cas9-guided genome editing

Honglue Shi 1,2,#, Noor Al-Sayyad 3,4,#, Kevin M Wasko 1,5,#, Marena I Trinidad 1,6, Erin E Doherty 1,7, Kamakshi Vohra 1,7, Ron S Boger 1,8, David Colognori 1,7, Joshua C Cofsky 1,5,15, Petr Skopintsev 1,7, Zev Bryant 3,9,*, Jennifer A Doudna 1,2,5,7,10,11,12,13,14,16,*
PMCID: PMC12258621  NIHMSID: NIHMS2082181  PMID: 40273916

SUMMARY

RNA-guided CRISPR-Cas enzymes initiate programmable genome editing by recognizing a ~20-base-pair DNA sequence next to a short protospacer-adjacent motif (PAM). To uncover the molecular determinants of high-efficiency editing, we conducted biochemical, biophysical, and cell-based assays on Streptococcus pyogenes Cas9 (SpyCas9) variants with wide-ranging genome-editing efficiencies that differ in PAM-binding specificity. Our results show that reduced PAM specificity causes persistent non-selective DNA binding and recurrent failures to engage the target sequence through stable guide RNA hybridization, leading to reduced genome-editing efficiency in cells. These findings reveal a fundamental trade-off between broad PAM recognition and genome-editing effectiveness. We propose that high-efficiency RNA-guided genome editing relies on an optimized two-step target capture process, where selective but low-affinity PAM binding precedes rapid DNA unwinding. This model provides a foundation for engineering more effective CRISPR-Cas and related RNA-guided genome editors.

eTOC blurb

A two-step target capture process—specific but low-affinity PAM binding followed by rapid DNA unwinding—underpins efficient CRISPR-Cas9 editing. Shi et al. show that PAM-relaxed Cas9 variants like SpRY become kinetically trapped between these steps, reducing editing efficiency. These insights provide new principles for engineering improved RNA-guided genome editors.

Graphical Abstract

graphic file with name nihms-2082181-f0008.jpg

INTRODUCTION

The widespread utility of CRISPR-Cas9 for programmable genome editing in human cells, plants and other eukaryotes has propelled interest in understanding the determinants of efficient genome modification1,2. Cas9 recognizes target sequences within genomes using an ATP-independent process in which Cas9-bound guide RNAs base pair with a 20-nucleotide sequence within double-stranded DNA. This mechanism begins with Cas9 binding to a 2–4 base pair (bp) protospacer adjacent motif (PAM), triggering DNA unwinding and RNA-DNA hybridization to form an R-loop36 (Fig. 1A). While stable target capture requires RNA-DNA hybridization at the seed region (5–10 bps), efficient DNA cleavage necessitates complete R-loop formation with extensive RNA-DNA complementarity3,4,79 (Fig. 1A). The process of Cas9-mediated R-loop formation has been well characterized4,711, but the mechanism by which RNA-guided enzymes avoid entrapment by the vast excess of non-specific sequences in the genome remains unclear12,13.

Fig. 1. PAM-relaxed Cas9 variants display reduced genome editing efficiencies and bulk DNA cleavage kinetics on a canonical NGG PAM.

Fig. 1.

(A) A model of RNA-guided DNA targeting by Cas9. The two-step target capture process that differentiates PAM-relaxed Cas9 variants from WT SpyCas9 is highlighted. (B) The three SpyCas9 proteins used in this study with their respective PAM recognition: WT SpyCas9 (gray), SpG (pink) and SpRY (purple). (C) Quantifications of DNA double-stranded breaks (DSB) by droplet digital PCR (ddPCR) on the target site at 8 h post plasmid transfection (n=3 replicates). See also Fig. S1C for more data. (D) Quantifications of indels on the target site at 72 h post RNP nucleofection (n=3). See also Fig. S1E for more data. (E) Time-course analysis of average DNA cleavage products (n=3) for sgRNA2 in 5 mM Mg2+ at 24°C for TS (see Methods). The average rate constants, with the amplitudes from the observed double-exponential decay, are provided in figure legends. See also Fig. S2A for NTS cleavage and Fig. S2C-I for other sgRNAs and conditions. (F) Time-course analysis of the average fluorescence signal (n=3) in the 2AP assay for the same sequence in similar conditions as (E) (see Methods). For WT SpyCas9, catalytically dead Cas9 (dCas9) is used here. The rate constants of the observed mono-exponential decay are provided in figure legends. See also Fig. S2B for the 2AP-labeling position (15th nt from PAM) and intercept determination (using non-targeting sgRNA). All error bars in (C-F) represent the standard deviation of n = 3 replicates.

Although R-loop formation and DNA cleavage occur within seconds to minutes upon target DNA recognition in vitro11,14 and in cells15, genome edits take hours to days, indicating that other steps are rate-limiting. For S. pyogenes Cas9 (SpyCas9), the abundance of its NGG PAM in the genome (roughly every 16 dinucleotides) necessitates repeated PAM binding and release until the target is located12,13. Furthermore, the PAM requirement for target capture restricts SpyCas9 to modifying sequences next to an NGG motif3,4. To expand target access, efforts have focused on reducing or eliminating PAM specificity1619, yielding PAM-relaxed SpyCas9 variants including SpG and SpRY18 that differ by only a few point mutations in the PAM-interacting domain. These variants achieve “PAM-less” target recognition by enabling conformational flexibility and non-specific interactions with the DNA backbone20 — and provide a basis for investigating the role of PAM specificity and affinity in SpyCas9’s target capture, cleavage, and genome editing efficiencies. Previous studies have shown that PAM-relaxed variants are less competent at target unwinding20 and spend more time searching the genome21, but leave open questions about the mechanistic origins of these limitations.

Using cell-based assays as well as biochemical and single-molecule experiments, we show that compared to wild-type (WT) SpyCas9, the PAM-relaxed Cas9 variants SpG and SpRY are slower to identify target sequences due to non-specific DNA interactions. Even after arriving at the target, these variants unwind DNA more slowly because they become kinetically trapped in a stable initial binding complex and are less proficient at initiating R-loops. Efforts to accelerate unwinding and stabilize the R-loop at the target site only partially mitigate these effects, as off-target binding continues to limit overall efficiencies. Our findings support a model in which increased PAM specificity with limited DNA affinity can enhance genome editing efficiency. These results highlight the crucial trade-off between PAM flexibility and editing efficiency and show that the effectiveness of SpyCas9 for genome editing arises from its intrinsic PAM recognition properties.

RESULTS

Reduced editing efficiencies arise from enzymatic limitations before R-loop completion

PAM-relaxed variants of SpyCas9, SpG (PAM: NG) and SpRY (PAM: NR; R=A,G) (Fig. 1B), are less efficient genome editing enzymes relative to WT SpyCas9 (PAM: NGG)18,22. We used these activity differences as a basis for exploring the mechanism by which PAM recognition influences enzyme-mediated editing. To quantify genome editing outcomes, we expressed WT, SpG and SpRY Cas9 separately in HEK293T cells along with a single guide RNA (sgRNA) targeting an NGG-proximal sequence in the gene EMX1 (sgRNA1). Despite similar expression levels of Cas9 and sgRNA at early time points after plasmid transfection in each case (Fig. S1A, B), SpG and SpRY were 2–4 fold slower than WT SpyCas9 at both DNA cutting (Fig. 1C; Fig. S1C) and inducing genome edits (Fig. S1D). Furthermore, when defined amounts (10 pmol) of pre-assembled Cas9 ribonucleoproteins (RNPs) with similar active enzyme fractions (Fig. S1G) were nucleofected into HEK293T cells, WT SpyCas9 induced ~30% indels with sgRNA1 within 72 hours (h), while SpG and SpRY reached only 4% and 3%, respectively, at 72 h (Fig. 1D). This inefficiency becomes even more pronounced with different sgRNAs and higher RNP doses (Fig. S1E), consistent with guide-dependent effects reported in previous studies18,22,23. These results suggest that reduced efficiencies of PAM-relaxed variants result from their intrinsic enzymatic properties.

To determine the molecular basis of the reduced efficiencies of SpG and SpRY, we focused on key steps in Cas9’s targeting mechanism: PAM binding, R-loop formation and DNA cleavage (Fig. 1A). We first examined whether differences in R-loop formation or DNA cleavage kinetics accounted for the observed editing efficiency differences. Bulk DNA cleavage assays using dual-fluorescently labeled double-stranded DNA (dsDNA) substrates revealed two phases of DNA cleavage: a dominant fast phase governed by kfast, and a slower phase governed by kslow (Fig. 1E), consistent with prior studies14,24. WT SpyCas9 completed the fast phase of target strand (TS) cutting within one minute, while SpG and SpRY showed ~3.5-fold slower kfast (Fig. 1E), with similar results for non-target strand (NTS) cleavage (Fig. S2A). The reduced cleavage rate was guide-dependent, as previously reported18,22,23, and occurred under reduced Mg2+ concentration (0.2 mM) conditions that reflect intracellular Mg2+ levels in mammalian cells25,26 (Fig. S2CI). Using 2-aminopurine (2AP) labeled dsDNA to track Cas9-induced DNA unwinding27,28, we found that the apparent rate constants of R-loop formation (kobs) for SpG and SpRY were similarly ~3.5-fold slower, matching the kfast in the DNA cleavage assay (Fig. 1F; Fig. S2B). These results are consistent with the kinetic delays for SpG and SpRY occurring before R-loop completion, rather than from differences in DNA cleavage chemistry20.

SpRY is inefficient at forming an initial stable R-loop intermediate

Bulk biochemical assays here and in prior work20 have suggested that SpG and SpRY’s inefficiency arises before R-loop completion. To determine which substeps in R-loop formation are affected, we employed Gold Rotor Bead Tracking (AuRBT)29 to detect DNA structural transitions upon Cas9 binding at base-pair resolution. We immobilized a DNA tether, containing an NGG site next to a target sequence, and measured real-time changes in DNA twist (Δθ) during R-loop formation and collapse (Fig. 2A and Fig. S3A). In these experiments we focused on SpRY due to its most pronounced differences in kinetic behavior relative to WT SpyCas9. To prevent DNA cleavage during data collection, we introduced D10A and H840A mutations to create catalytically inactive dCas9 and dSpRY, which retain R-loop formation ability while allowing dynamic unwinding and rewinding at the DNA target11,14,30. After introducing 0.8 nM dCas9 or 4 nM dSpRY RNP into the channel, we observed stepwise changes in Δθ corresponding to transitions between R-loop states with differing numbers of bp unwound (Fig. 2B, C). Unlike previous work that focused on the effects of supercoiling on R-loop dynamics11, here we monitored equilibrium fluctuations between states on a relaxed DNA tether.

Fig. 2. SpRY disfavors the formation of the R-loop intermediate state in AuRBT.

Fig. 2.

(A) Schematic overview of the AuRBT experimental setup. dCas9 and dSpRY, both with similar active protein fractions (Fig. S1H), were used at concentrations of 0.8 nM and 4 nM, respectively. sgRNA3 contains a 20-bp match to the Target1 sequence flanking an NGG site. θ is the measured rotor bead angle. All AuRBT experiments were performed under F = 7 pN of tension. (B-C) Example traces from time-resolved measurements of DNA unwinding (Δθ) for (B) 0.8 nM dCas9 (gray) and (C) 4 nM dSpRY (purple). 250-ms averaged traces are shown in black. Diagonal arrows highlight zoomed-in regions of R-loop formation and collapse events through a discrete intermediate (right panels). Yellow lines represent idealized traces generated by the Steppi change-point analysis. (D) (Top) Scatter plots illustrating the unwinding lifetime and change in equilibrium twist (Δθ0) for merged Steppi-scored states (see Methods) across all binding events for 0.8 nM dCas9 (gray, left) and 4 nM dSpRY (purple, right), highlighting a closed DNA state “C”, an intermediate state “I” and an open DNA state “O”. The total collection time (Ttotal) and number of DNA tethers (n) are provided in the legend. (Bottom) The lifetime weighted population at each corresponding Δθ0 with bin size of 0.5 bp (see Methods). (E) Rates constants of transitions between C and I and between I and O for dCas9 (gray, upper) and dSpRY (purple, lower) showing the primary distinction between dCas9 and dSpRY lies in the rates associated with the transition between C and I (bold). The average unwinding (bp) in each state is provided. Note that these transition rate constants are measured at a fixed [RNP] and do not represent the bimolecular binding rate constants.

We identified R-loop states and characterized their kinetics using automated change-point detection (“Steppi”)31, scoring transitions between three different levels of DNA unwinding: a closed state (C) with no unwinding, an intermediate state (I) with ~8–10 bp unwound, and an open state (O) with ~20–21 bp unwound (Fig. 2D). This state structure was consistent between dSpRY and dCas9, and with previous measurements of dCas9 obtained using non-equilibrium twist ramping11, with differences in kinetics as expected given the distinct target sequence used here (Target 1: 5′-CTGCGTATTTCTACTCTGTT-3′). Complete R-loop formation and collapse proceeded through the transient I state, and we analyzed CIO transitions to obtain the corresponding rates kCI, kIC, kIO, and kOI. Compared to dCas9, dSpRY had a 7-fold slower kCI (despite being 5-fold higher in concentration), and a 5-fold faster kIC (Fig. 2E), indicating dSpRY is far less likely to form the R-loop intermediate, instead favoring the closed state. The subsequent R-loop propagation (IO) kinetics were similar between dSpRY and dCas9 (Fig. 2E). These results show that differences in R-loop formation between the two enzymes are confined to the earliest steps involved in unwinding the bps adjacent to the PAM site, leading to slow and unfavorable R-loop completion that could explain the observed reduction in SpRY’s DNA cleavage rate in bulk assays.

SpRY is kinetically trapped in a low-energy initial binding complex

To determine why SpRY is less efficient at melting DNA bps to enable guide RNA strand invasion and form the R-loop intermediate, we used AuRBT to dissect the target capture process (Fig. 1A). Because the CI transition involves both binding of the RNP to the target and DNA unwinding, kCI is expected to depend on the RNP concentration ([RNP]). For dCas9, kCI increases linearly with rising [RNP], as expected for moderate to weak PAM binding6,20,24,32 that is far from saturation within our experimental range (Fig. 3A). At low concentrations kCI also increases with [RNP] for SpRY, but with characteristically lower rates, reflecting the slow formation of the seed intermediate. Furthermore, kCI reaches a plateau at moderate [RNP], suggesting a rate-limiting step after initial binding for dSpRY (Fig. 3A). The remaining kinetic rates, which involve transitions between bound states, are approximately independent of RNP concentration as expected, with closely equivalent kinetics between the two enzymes for the late I ↔ O step and faster collapse of the intermediate kIC for dSpRY (Fig. 3B, S3FH). In our interpretations of these data, note that the total [RNP] introduced into the chamber is used as a close approximation to the free [RNP], justified by the large excess of RNP over DNA under these single-molecule conditions (see Note 1 in Methods).

Fig. 3. SpRY is trapped in a low-energy initial binding complex.

Fig. 3.

(A) kCI as a function of [RNP] for dCas9 (gray) and dSpRY (purple) using the same DNA and RNA sequences in Fig. 2A. Error bars were calculated assuming Poisson statistics. Solid lines are fit of kCI vs [RNP] (RNP concentration), assuming a linear model with kon,eff(dCas9) for dCas9 and a hyperbolic saturation model with initial affinity Kd,init(dSpRY) and maximal unwinding rate kopen(dSpRY) for dSpRY. The equations for each model are provided as insets. The faded dashed line has a slope kon,eff(dSpRY)=kopen(dSpRY)/Kd,init(dSpRY) and shows the linear dependence of kCI on dSpRY [RNP] in the low [RNP] regime. See also Note 1 in Methods for model selection, Table S1 for rate constants, Table S2 for fit details, Fig. S3BD for example raw traces, and Fig. S3FH for other rate constants (kIC, kIO and kOI). (B) A model illustrating distinct kinetics in the two steps of target capture between dCas9 and dSpRY. Rate constants in bold are derived from our AuRBT measurements, while those in gray are inferred from model constraints or prior measurements6,11,20,24,32. See also Note 1 in Methods for model details. (C) The free energy landscape of the kinetic model including Cfree, Cbound, I and O, assuming Kd,init(dCas9)=1μM for dCas9, kon,init(dCas9)=kon,init(dSpRY)=0.1nM1s1, and [RNP] = 100 nM. The “‡” symbol indicates a transition state. See also Note 1 in Methods and Table S3 for the chosen values to build the free energy diagram. (D) Schematic showing how a mismatch (MM) bubble next to an NGG PAM reduces the energetic cost of DNA unwinding. (E) Bar graph depicting the average fast-phase rate constant (kfast) for DNA cleavage with and without an MM bubble adjacent to an NGG PAM. The gray dashed line represents the detection limit of the assay. Error bars represent the standard deviation of n = 3 replicates. See also Fig. S2D, G, I and Fig. S3I for time-course data.

Based on these measurements, we developed a kinetic model (Fig. 3B; see Note 1 in Methods) that describes Cas9’s target capture as two key steps: initial binding CfreeCbound and subsequent DNA unwinding/R-loop intermediate formation CboundI. Here, the directly observed closed DNA conformation (state C) is kinetically separated into a free state Cfree where the target site is unoccupied, and a bound state Cbound where an enzyme at the target site is poised to initiate R-loop formation. In this model, dCas9 shows weaker initial binding Kd,init110μM6,20,24,32 but rapidly transitions to form the R-loop intermediate kopen>50s1. In contrast, dSpRY has much stronger initial binding Kd,init10nM but transitions to the intermediate over 800 times slower than dCas9kopen0.06s1, explaining the slower and hyperbolic kinetics of dSpRY. This kinetic model can be depicted as a free energy landscape (Fig. 3C; see Note 1 in Methods) that shows dSpRY initially occupying a low-energy binding state, and then encountering a high energy barrier towards subsequent DNA unwinding. Importantly, this low-energy state may represent an ensemble of binding modes by dSpRY rather than a single defined conformation. Based on this energy landscape, the primary cause for dSpRY’s inefficiency at forming an R-loop is its tendency to become kinetically trapped in this initial binding complex. In addition, the R-loop intermediate state in dSpRY is thermodynamically disfavored (Fig. 3C), likely due to weaker stabilizing interactions. The combined effects of the kinetic trap and unstable R-loop intermediate result in a ΔGCboundl that is approximately 8 kBT higher than that of dCas9. Note that our model shares several features with a previous model derived from bulk measurements20: both indicate that the overall formation of the R-loop is slower and more thermodynamically uphill in SpRY compared to WT SpyCas9. However, our model localizes these differences to the R-loop seed formation step, and also directly contradicts the previous assumption20 that the initial binding affinities of WT SpyCas9 and SpRY are equivalent.

Based on our model, we predict that reducing ΔGCboundl could enhance SpRY’s reduced cleavage rate in bulk assays. To test this, we introduced a small mismatch (MM) bubble next to the NGG PAM, aiming to lower the energy barrier for the CboundI transition by destabilizing the target duplex next to the NGG33,34 (Fig. 3D). In DNA cleavage assays, this MM bubble had little to no impact on WT SpyCas9’s cleavage rate but significantly improved SpG and SpRY’s rates, rendering their bulk cleavage activities to match WT SpyCas9 (Fig. 3E; Fig. S2D, G, I; Fig. S3I). This result supports our hypothesis that difficulties in unwinding the initial seed region contribute to the reduced efficiencies of SpG and SpRY, and suggests that lowering the energy barrier towards the R-loop intermediate can overcome these limitations.

Kinetic traps induce strong and promiscuous DNA binding

Given that Cas9 must navigate through many non-target sequences during genome surveillance, we asked if SpRY’s kinetic trapping also exacerbates non-specific binding. To address this, we investigated how SpRY interrogates non-specific sequences compared to WT SpyCas9.

Using AuRBT, we monitored the dynamics of target capture by introducing 50 nM RNP with a sgRNA containing only a 3-bp match next to an NGG PAM. For WT SpyCas9, we observed short-lived, small-magnitude spikes in Δθ, indicating transient DNA unwinding events (Fig. 4A, B). These states were consistently recorded across various sgRNA-DNA combinations (Fig. S4A) but disappeared when NGG was replaced with NCG (Fig. 4C, D; Fig. S4B) or when apo Cas9 was used (Fig. S4C). The amount of unwinding (~2 bps) was also consistent with predictions from cryo-EM structures (Fig. S4D) (PDB: 7S38)6, and the unwinding lifetime aligns with dwell times seen in single-molecule FRET studies of DNA binding (Fig. S4E)30. In sharp contrast, SpRY exhibited a prolonged Δθ baseline shift under the same conditions (Fig. 4E). This prolonged shift was [RNP]-dependent (Fig. 4F; Fig. S4F, G), independent of the NGG PAM (Fig. S4H), and saturates at moderate [RNP]. These observations suggest SpRY strongly interacts with non-specific sites, leading to a slight unwinding signal. Multiple off-target sites may contribute to the observed shift, where each SpRY binds tightly yet only poorly populates a partially unwound state, resulting in an ensemble-average unwinding of ~3.7 bp even at high SpRY concentrations.

Fig. 4. Promiscuous DNA binding by PAM-relaxed Cas9 variants.

Fig. 4.

(A, C, E) Example traces of time-resolved measurements of equilibrium twist change Δθ over time for (A and C) WT SpyCas9 (50 nM) and (E) SpRY (50 nM), each paired with sgRNA4.1, containing a 3-bp match to the Target2 sequence flanking (A and E) an NGG site or (C) an NCG site on the DNA tether. 250-ms averaged traces are shown in black. For (E), the vertical dashed lines (--) and dash-dot lines (−.) indicate the start and end, respectively, of the flow of SpRY RNP into the chamber. (B, D) Scatter plots of unwinding lifetime and Δθ0 for merged Steppi-scored states corresponding to experiments described in (A, C). The total collection time (Ttotal) and number of DNA tethers (n) are provided in the legend. See also Fig. S4AC for more data with different sequences and conditions. (F) Fit of average Δθ baseline shift and [RNP] for SpRY to a binding equation yielded an apparent Kd~14nM for sgRNA4.1 on the Target2 sequence flanking an NGG site. See also Fig. S4FH for raw traces. (G) Schematic overview of the permanganate DNA footprinting assay. The UREA-PAGE gel image used to generate the graph in (I) highlights KMnO4-dependent enriched bands for WT SpyCas9 (gray), SpG (pink) and SpRY (purple) in colored dashed boxes. The gel image was rendered in ImageLab 6.1 (BioRad) and cropped to exclude irrelevant neighboring lanes. (H) Sequence of the sgRNA (sgRNA3) and the dsDNA substrate with all the reactive thymines on the TS underlined. (I) Oxidation probabilities of thymines across the dsDNA substrate.

Fitting this [RNP]-dependent Δθ baseline shift for SpRY yielded an effective Kd,eff of ~14 nM (Fig. 4F). We obtained a similar result Kd,eff15nM by fitting an underlying baseline shift observed in the previous dSpRY experiments using full-match sgRNA (see Methods, Fig. S3E), indicating consistent non-specific binding behavior across conditions. The agreement between these measurements and dSpRY’s affinity for the initial binding complex (Kd,init10nM, from Fig. 3A) hints that a common non-specific binding mode may be responsible for both observations: the kinetic trap hindering target DNA capture also leads to prolonged, unproductive off-target interactions.

To visualize the locations of DNA binding and unwinding between these Cas9 variants in the absence of sgRNA sequence complementarity but in the presence of a canonical NGG PAM, we used a permanganate footprinting assay. Consistent with previous findings6, WT SpyCas9 induced bp melting adjacent to the NGG site, while the xPBA variant, lacking PAM-binding arginines, showed minimal signal (Fig. 4GI). In contrast, SpG and SpRY did not induce detectable bp melting near the NGG site but caused promiscuous thymine exposure at various locations (Fig. 4GI). Together, these results suggest that PAM-relaxed Cas9 variants tend to strongly and promiscuously bind DNA.

Relaxed PAM specificity increases non-specific DNA interference

Based on these observations of SpRY’s promiscuous and strong non-specific binding, we hypothesized that the DNA cleavage efficiency of PAM-relaxed Cas9 variants would be more significantly inhibited when exposed to excess non-specific DNA. We tested this by examining the DNA cleavage kinetics of these Cas9 variants in the presence of excess non-specific DNA (Fig. 5A), mimicking genomic conditions in cells. We repeated the DNA cleavage assay from Fig. 1E at 37°C with 100 nM RNP (sgRNA2) and 10 nM fluorescent target DNA substrate in the presence of ~8 nM (0.8×) 2.2-kb supercoiled pGGAselect DNA plasmids lacking complementarity to sgRNA2 as non-specific competitor. When combining the substantial plasmid concentration with the many potential binding sites per plasmid, we expected these conditions to provide an excess of non-specific sites with respect to RNP — unlike the dilute DNA conditions used in our single-molecule AuRBT experiments. The plasmid competitor did not affect WT SpyCas9’s cleavage rate, but the kfast for SpG and SpRY was approximately ~5-fold and ~15-fold slower, respectively, compared to conditions without competitor (Fig. 5B, C; Fig. S2D; Fig. S5A). In addition, SpRY exhibited a larger slow phase (30%) (Fig. 5B, C), suggesting that non-specific competitor interactions could contribute to the slow phase of DNA cleavage. The slower DNA cleavage by SpG and SpRY was not due to plasmid supercoiling, as similar inhibition was observed with a linear PCR product with identical DNA sequences (Fig. S5B).

Fig. 5. Inhibition of PAM-relaxed Cas9 variants cleavage activity by DNA competitors.

Fig. 5.

(A) Schematic overview of bulk DNA cleavage assay conducted in the presence of various DNA competitors. (B-E) Time-course analysis of average DNA cleavage products (n=3) for sgRNA2 at 10 mM Mg2+ and 37°C under different conditions: (B) without competitors (C) with a 0.8× 2.2-kb plasmid competitor (D) with a 40× 2.2-kb plasmid competitor, and (E) with a 40× salmon sperm DNA competitor. The plasmid competitor in (C and D) is estimated to contain approximately 251 NGG PAMs, 1070 NG PAMs, and 2230 NR PAMs. The average rate constants (kobs for a mono-exponential decay model; kfast, kslow for a double-exponential decay model) along with their amplitudes are provided in figure legends (n=3). See also Fig. S2D and Fig. S5A, C, D for NTS cleavage. Note that the time-course cleavage analysis in Fig. 5B is also shown in Fig. S2D; the analysis in Fig. 5C is also shown in Fig. S5A; the analysis in Fig. 5D is also shown in both Fig. 6A and Fig. S5C; and the analysis in Fig. 5E is also shown in Fig. S5D for direct comparison. (F) Bar graph depicting the average rate constant (kobs) of the mono-exponential decay for DNA cleavage in the presence of a short dsDNA competitor containing various numbers of 3-bp seed sequences flanking an NGG site (n=3). The gray dashed line represents the detection limit of the assay. See also Fig. S5G for time-course data. All error bars in (B-F) represent standard deviations of n = 3 replicates.

To further challenge the DNA cleavage kinetics, we increased the concentration of the plasmid competitor to 400 nM (40×), resulting in significantly greater inhibition. Under this condition, WT SpyCas9 showed a minimal 3.5-fold reduction in the kfast (Fig. 5D; Fig. S5C). However, with the data adequately fit to a mono-exponential model, the kobs for SpG was ~40-fold slower, and for SpRY, it was ~200-fold slower than the predominant kfast without competitor (Fig. 5D; Fig. S5C). We then replaced the plasmid competitor with purified salmon sperm DNA (~200–500 bps), resulting in both increased sequence diversity and possibly higher concentration of heterogeneous exposed ends, increasing non-specific binding sites. Repeating the cleavage assay at the same mass concentration (40×) showed that the kfast for WT SpyCas9 was ~5-fold slower, and the product amplitude was reduced to ~20% (Fig. 5E; Fig. S5D). For SpG and SpRY, no detectable DNA cleavage activity was observed (Fig. 5E; Fig. S5D). Substantial inhibition of SpG and SpRY was observed across different sgRNAs and different competitor concentrations (Fig. S5E), and at 0.2 mM Mg2+ (Fig. S5F). Electrophoretic Mobility Shift Assay (EMSA) further confirmed SpRY’s higher affinity for the linearized plasmid competitor DNA, as 0.5 μM SpRY RNP caused a band shift, while WT SpyCas9 showed no binding even at 16 μM (Fig. S6AB).

We then tested whether DNA cleavage inhibition correlated with the number of NGG-dependent off-target sites. Using a 1000-fold molar excess of 60-mer dsDNA with zero, two, five or ten off-target sites (3-bp match of sgRNA2 next to NGG), we found that the cleavage rate reductions correlated with the number of NGG-dependent off-target sites for WT SpyCas9 (Fig. 5F; Fig. S5G). In contrast, this correlation was less evident for SpG and minimal for SpRY (Fig. 5F; Fig. S5G), indicating that SpG and SpRY experience more generalized inhibition by short dsDNA competitors, irrespective of NGG presence. Taken together, these results suggest that increased PAM promiscuity in SpG and SpRY leads to widespread off-target interactions that prolong target identification and subsequent cleavage, correlating with their reduced editing efficiencies in cells.

Facilitating unwinding improves efficiency but does not resolve off-target interactions

The competition experiments demonstrated that even with rapid unwinding kinetics like those of WT SpyCas9, the presence of numerous PAM-containing off-target sites can still hinder cleavage efficiency. Given that SpG and SpRY possess broader PAM recognition and exhibit promiscuous DNA binding, we hypothesized that their inefficiencies could not be fully mitigated by simply accelerating on-target unwinding. Previously, by introducing a MM bubble next to an NGG PAM to lower the energy barrier for DNA unwinding ΔGCbound1, we restored SpG and SpRY’s on-target activity to WT SpyCas9 levels in bulk assays (Fig. 3D, E). Here, we repeated the competition assays using a target with an MM bubble and a 40× plasmid competitor. While WT SpyCas9 showed a modest cleavage rate increase (~1.7-fold), SpG and SpRY exhibited significant improvements in kobs (~5-fold), and the total product amplitude increased from ~15% to 80% with a MM bubble (Fig. 6A; Fig. S6C). However, despite these enhancements, SpRY’s rate remained 30-fold slower than WT under competitor conditions, indicating persistent off-target binding still hampers its efficiency as expected.

Fig. 6. Facilitating on-target unwinding is insufficient to overcome off-target interactions for PAM-relaxed Cas9 variants.

Fig. 6.

(A) Time-course analysis of average DNA cleavage products (n=3) for sgRNA2 at 10 mM Mg2+ and 37°C with a 40× plasmid competitor and an MM bubble. Error bars represent standard deviations of n = 3 replicates. See also Fig. S6C for NTS cleavage. Note that the time-course cleavage analysis without MM bubbles (top) is also shown in Fig. 5D and Fig. S5C for direct comparison, while the analysis with an MM bubble (bottom) is presented in Fig. S6C in which a double-exponential fitting is applied for SpG and SpRY. (B) EMSA analysis of target DNA binding by dCas9 or dSpRY, comparing conditions with and without competitor DNA, with or without an MM bubble. Gel images were rendered in ImageLab 6.1 (BioRad) and cropped to exclude irrelevant neighboring lanes. See also Fig. S6D for data on sgRNA1.

To further investigate, we performed EMSA assays on dCas9 and dSpRY binding to a target DNA with a MM bubble (Fig. 6B). In the presence of a 40× plasmid competitor, dSpRY’s on-target binding improved from 2% to 29% with a MM bubble while dCas9’s binding remained unaffected (~90%). However, with 40× salmon sperm DNA, dSpRY’s on-target binding was completely abolished, regardless of the MM bubble (Fig. 6B; Fig. S6D), highlighting that strong non-specific DNA binding remains a major barrier for SpRY. ChIP-seq data in human cells supported this, showing diminished on-target signals and higher non-specific binding for dSpRY compared to dCas9 (Fig. S6EG; see Note 2 in Methods). Therefore, while reducing the energy barrier for on-target unwinding significantly improves SpG and SpRY’s cleavage activity, it is insufficient to overcome the strong non-specific DNA binding that continues to hinder efficiency under competitive conditions.

DISCUSSION

Understanding the molecular basis for genome editing efficiency by CRISPR-Cas9 is important for both fundamental knowledge and practical improvement of editing tools used in clinical and agricultural applications. In this study we investigated the observed loss of editing efficacy in Cas9 variants derived from the highly effective SpyCas9 enzyme but lacking specificity for the target-adjacent PAM sequence5,6,20. PAM-relaxed variants, such as SpRY, are slower to engage correct target DNA sequences20 and bind off-target sequences more frequently in vitro20 and in bacteria21.

Our high-resolution single-molecule measurements resolve distinct kinetic substeps in target DNA recognition, providing details that were inaccessible to prior studies20,21. These data reveal that differences between SpRY and WT SpyCas9 are localized to early binding transitions, while later R-loop propagation is similar. Specifically, SpRY’s inefficiency stems from fundamental issues in the two-step target capture process — initial PAM binding followed by DNA unwinding. Two factors drive SpRY’s inefficiency (Fig. 7). First, it becomes kinetically trapped in a nonproductive binding state at non-specific sites, leading to a prolonged target search time. Second, even after reaching the target site, the interactions responsible for kinetic trapping also slow target DNA unwinding.

Fig. 7. A model explaining the reduced efficiency of PAM-relaxed Cas9 variants.

Fig. 7.

The reduced genome editing efficiencies of PAM-relaxed Cas9 variants (e.g. SpRY) are explained by two factors: (Top) PAM-relaxed Cas9 variants display prolonged target search times, largely due to kinetic trapping at non-specific binding sites, which reduces the pool of free RNPs available for on-target identification. (Bottom) Upon reaching the target site, these variants also become kinetically trapped in a low-energy initial binding complex and unwind DNA slowly.

In contrast, WT SpyCas9 excels due to its selective but low-affinity PAM binding, which rapidly transitions to unwinding and enables high-efficiency genome editing, including base and prime editing3537 and transcriptional regulation38,39. A previous theoretical study40 predicted that moderate PAM binding affinity optimizes the target search time, and simple theoretical modeling41 has also been used to argue for an optimal PAM dissociation rate: overly strong PAM binding stalls the enzyme at off-target sites, while overly weak PAM binding leads to premature dissociation at the target. WT SpyCas9’s rapid unwinding allows it to function effectively with moderate-to-low PAM affinity which optimizes target search, whereas SpRY’s orders-of-magnitude slower unwinding and stronger non-specific binding make it more prone to kinetic trapping. Based on these findings, we propose that increasing PAM specificity, while limiting initial binding affinity, accelerates genome editing.

Despite the potential utility of a single Cas enzyme that can target all possible PAM sequences while maintaining robust on-target genome editing activity, our data show that specific PAM binding is fundamental to RNA-guided genome manipulation. While it might seem intuitive that extending the length of PAM motifs could improve editing efficiency, previous studies have shown that longer PAM motifs, such as those recognized by SpyCas9 variants like EQR (PAM=NGAG) and VRER (PAM=NGCG), do not necessarily improve editing efficiency42. Indeed, both EQR and VRER demonstrated similar or even reduced DNA cleavage efficiencies relative to WT SpyCas9 in the presence of competitor DNA (Fig. S7). These results align with a recent theoretical study suggesting that two-base-pair PAMs optimize target search speed for Cas940. Together, these findings underscore the importance of carefully balancing PAM length and affinity when developing a comprehensive “PAM-catalog” of Cas9 or related RNA-guided enzymes43.

Furthermore, our findings suggest that optimizing ground-state PAM binding may be insufficient for developing more efficient genome editors. Future engineering efforts to improve genome editing efficiency should focus on reducing the energy barrier between initial DNA encounters and DNA unwinding, preventing enzyme stalling at intermediate states. This shift toward optimizing transition-state kinetics presents a possible strategy for advancing the efficiency of other RNA-guided genome editors.

In summary, these results highlight a two-step target capture mechanism ensuring high-efficiency Cas9-mediated genome editing and reveal the inherent limitations of PAM-relaxed enzymes. This work not only provides crucial insights into the kinetic bottlenecks that reduce editing efficiency but also offers a framework for future engineering strategies to overcome these challenges. Extending these studies to other CRISPR-Cas enzymes and ancestral RNA-guided endonucleases, such as IscB and TnpB, could reveal the central principles governing RNA-guided target search and recognition. Ultimately, these insights will drive the next generation of genome editing technologies, enhancing both precision and versatility in therapeutic and functional applications.

Limitations of the Study

This study provides valuable mechanistic insights into the trade-off of PAM relaxation; however, several limitations warrant consideration. First, while our single-molecule AuRBT approach effectively resolves distinct R-loop states and their transitions, it does not impose torsional stress on the DNA, limiting our ability to assess the influence of supercoiling. Second, while our analysis infers properties of an initial binding state, our single-molecule measurements do not directly monitor binding independent of unwinding. Future single-molecule work should extend our comparison of Cas9 and SpRY by using twist ramping and FluoRBT to investigate supercoiling dependence and correlate mechanical measurements with fluorescently labeled RNP binding. Third, our simple model does not directly consider factors such as enzyme–enzyme interactions, local 1D sliding, and the complexities of cellular environments (e.g., chromatin structure and molecular crowding). Fourth, we performed AuRBT experiments using only one example of a fully-matched guide RNA, although our bulk biochemical assays and prior work20 support similar conclusions with other guides. Future work should extend the model by incorporating additional factors into the theory and by comparing across multiple guide sequences. Finally, whether our kinetic model for SpyCas9 extends to other CRISPR-Cas enzymes remains to be determined. In some cases, reducing kinetic trapping may enable both PAM-less recognition and high efficiency. Future studies should explore whether similar or distinct mechanisms govern target search and editing efficiency across diverse CRISPR systems.

Resource Availability

Lead Contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Jennifer A. Doudna (doudna@berkeley.edu).

Materials Availability

Plasmids generated in this study will be deposited to Addgene upon publication. This study did not generate new unique reagents.

Data and Code Availability

  • NGS sequencing data have been deposited in the National Institutes of Health NCBI SRA under the BioProject ID: PRJNA1239632 and are publicly available as of date of publication. Single-molecule data have been deposited in the Stanford Digital Repository (https://doi.org/10.25740/yb505yt4817) as MATLAB .fig files and are publicly available as of the date of publication. All other source data — including biochemical analysis data points, uncropped gel images and processed single-molecule data used to generate figures — have been deposited at Mendeley (https://doi.org/10.17632/hzr4fnvc88.1) and are publicly available as of the date of publication.

  • Customized Python code for NGS sequencing analysis has been deposited online and made publicly available on Github (https://github.com/Doudna-lab/Cas9_Target_Search_ChIPseq). Customized MATLAB code for single-molecule data analysis has been deposited in the Stanford Digital Repository (https://doi.org/10.25740/yb505yt4817).

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

Experimental Model and Study Participant Details

Mammalian cell lines

Human Embryonic Kidney 293T (HEK293T) cells were obtained through the UC Berkeley Cell Culture Facility and were cultured at 37 °C with 5% CO2 in complete media consisting of Dulbecco’s Modified Eagle Medium (DMEM), 10% fetal bovine serum (FBS), and 100 U/ml penicillin/streptomycin. Cells are derived from a female donor.

Method Details

Plasmid construction

Mammalian expression plasmids were derived from pSpCas9(BB)-2A-Puro (PX459) V2.0, which encodes Cas9 with N- and C-terminal nuclear localization signals (NLS) (Tables S4S6) driven by a CAG promoter, and a sgRNA under control of a U6 promoter. The 2A sequence and puromycin resistance protein were removed using ligation cloning. SpG, SpRY, and catalytically dead mutants were generated via Gibson assembly. The U6 promoter, sgRNA sequence and protein sequences are detailed in Tables S5S6. For ChIP-seq experiments, the sgRNA or Cas9 expression cassettes were removed by restriction digestion and re-ligation to generate plasmids expressing solely Cas9 protein or a sgRNA separately.

Bacterial expression vectors for Cas9 and its variants were generated by assembling gBlocks (Integrated DNA Technologies) containing specific Cas9 mutations into custom pET-based vectors using Gibson assembly (Table S4). These vectors feature a T7 promoter, followed by an N-terminal His10-tag, maltose-binding protein (MBP), a tobacco etch virus (TEV) protease cleavage site, and the respective Cas9 variants (Table S6). Cas9 and its variants used in this study include WT SpyCas9, SpG, SpRY, EQR, VRER, xPBA, dCas9, dSpRY, as well as WT SpyCas9–2NLS, SpG-2NLS, and SpRY-2NLS (Table S7).

Nucleic acid preparations

sgRNAs used in biochemical and biophysical experiments were generated by in-vitro transcription, following the protocol described by Cofsky et al6. Transcription reactions contained 25 μg DNA template (Table S8) assembled by polymerase chain reaction (PCR), 20 mM of each nucleoside triphosphate, 30 mM Tris-HCl (pH 8.0), 25 mM MgCl2, 10 mM DTT, 0.01% (v/v) Triton X-100, 2 mM spermidine, and 100 μg/mL T7 RNA polymerase. Reactions were incubated at 37°C for 4 h, followed by purification by 8% denaturing urea polyacrylamide gel electrophoresis (UREA-PAGE). The gel containing RNA was crushed and soaked in DEPC-treated water overnight at 4°C, followed by six washes with DEPC water. sgRNA for cellular experiments were purchased from Integrated DNA Technologies, with chemical modifications at 3′- or 5′-ends for enhanced stability (details in Table S9) and resuspended in IDTE Buffer (10 mM Tris-HCl, pH 7.5; 0.1 mM EDTA). All sgRNA were annealed before use by heating at 80 °C for 2 min, then placing the reaction directly on ice.

All the DNA oligonucleotides used in this study were synthesized by Integrated DNA Technologies. All DNA primers were ordered with standard desalting (Tables S10S12), whereas both the ddPCR probes (Table S11) and the DNA oligonucleotides for biochemical assays (Table S13) were HPLC-purified. For the double-stranded DNA (dsDNA) substrates used in DNA cleavage assays and electrophoretic mobility shift assays (EMSA), the target strand (TS) was 5′-labeled with FAM, and the non-target strand (NTS) was 5′-labeled with Cy5. In the 2AP fluorescence assay, the NTS of the dsDNA substrate contained a single 2-aminopurine (2AP) modification within the target sequence, while the TS remained unlabeled. For the short dsDNA competitors used in competition cleavage assay, the competitor DNA was unlabeled. In the permanganate footprinting assay, the TS of the dsDNA substrate was 5′-labeled with FAM, while the NTS remained unlabeled. All the dsDNA substrates were annealed by heating a 1:1 NTS/TS molar (unless otherwise specified) mixture of complementary single-stranded oligonucleotides to 95°C followed by gradually cooling to 35°C over 45 min in DNA Annealing Buffer (10 mM Tris-HCl, pH 8.0; 100 mM NaCl; 1 mM EDTA). The plasmid (pGGAselect; sequence provided in Table S14) used in competition cleavage assay was ordered from New England Biolabs and prepared in-house using a HiSpeed Plasmid Mega EF kit (QIAGEN). Purified and sheared salmon sperm DNA (10 mg/mL, Invitrogen) with size of 200–500 bps was purchased from ThermoFisher Scientific, in which the sheared ends may comprise a range of structures and single strand overhangs.

The A260 absorbance of both DNA and RNA oligonucleotides was measured using a NanoDrop spectrophotometer (ThermoFisher Scientific). Concentrations were calculated based on previously reported extinction coefficients51. The sequences of all DNA and RNA oligonucleotides used in this study are listed in Tables S9S17.

Tissue culture and DNA transfection

HEK293T cells (UC Berkeley Cell Culture Facility) were cultured in DMEM (Corning) supplemented with 10% fetal bovine serum (Gibco) and 100 U/ml penicillin-streptomycin (Gibco). 5×106 cells were seeded into a 10 cm tissue culture dish (Corning) and transfected with 10 μg plasmid DNA using Lipofectamine 3000 according to the manufacturer’s instructions. Cells were harvested at 8, 16, 24, and 72 h post-transfection, followed by genomic DNA extraction.

Electroporation of ribonucleoprotein

100 pmol Cas9 (WT SpyCas9–2NLS, SpG-2NLS, or SpRY-2NLS) was diluted in Protein Storage Buffer (20 mM HEPES, pH 7.5; 150 mM KCl; 10% (v/v) glycerol; 1 mM TCEP) and mixed with an equal volume of 125 pmol sgRNA in IDTE Buffer (1:1.25 molar ratio) for a total of 5 μL RNP. RNP was allowed to complex at room temperature for 10 min prior to diluting as necessary in the same mixture of Protein Storage Buffer and IDTE Buffer (1:1 by volume). HEK293T cells were trypsinized, washed once with PBS, and resuspended in supplemented Lonza SF Cell Line Nucleofector Solution. 2×105 cells in 20 μL SF solution were mixed with 5 μL of pre-complexed RNP and 25 μL was transferred to the cuvette. Electroporation was performed with a Lonza 4D-Nucleofector 96-well Unit using the pulse code DS-150. 75 μL pre-warmed media was added to the cells immediately after electroporation and the contents of each cuvette were divided among three wells of a 96-well culture plate containing pre-warmed media. Cells were incubated for 72 h, at which point the three wells from each nucleofection were re-pooled prior to genomic extraction.

Genomic DNA extraction

HEK293T cells were trypsinized and pelleted at 300× g for genomic DNA extraction. Genomic DNA was extracted from HEK293T cells by resuspending live or snap-frozen cell pellets in QuickExtract DNA Extraction Solution (Biosearch Technologies) followed by incubation at 65°C for 15 min, 68°C for 15 min, then 98°C for 10 min.

Next-Generation Sequencing (NGS) library prep and sequencing

The extracted genomic DNA was then subjected to NGS library preparation through a two-step PCR process using Q5 High-Fidelity 2× Master Mix (New England Biolabs). The first PCR step amplified the genomic loci and attached adapter sequences (primers listed in Table S10) using the following protocol: an initial denaturation at 95°C for 30 sec; 25 cycles of 95°C for 10 sec, 60°C for 15 sec, and 72°C for 15 sec; followed by a final extension at 72°C for 2 min, then held at 4°C indefinitely. The second PCR step added Illumina index, P5, and P7 sequences using an initial denaturation at 95°C for 30 sec; 6 cycles of 95°C for 10 sec, 60°C for 15 sec, and 72°C for 15 sec; followed by a final extension at 72°C for 2 min, and then held at 4°C indefinitely. Each sample was pooled at equimolar concentrations into a library and sequenced on the Illumina NextSeq 1000/P1 platform (2×150 bp) to obtain at least 100,000 reads per sample.

The paired-end sequencing reads were first trimmed using the BBDuk tool in Geneious Prime (https://www.geneious.com/prime), setting a minimum quality threshold of 20 and a minimum length of 20 bps. Following trimming, the reads were merged with the BBmerge tool in Geneious Prime. These merged reads were then analyzed with CRISPResso2 (https://github.com/pinellolab/CRISPResso2) to quantify the indel rate with the following command, according to the methods described by Ma et al52, with adjustment of certain parameters: CRISPRess --fastq_r1 MERGED_READS --amplicon_seq AMPLICON_SEQUENCE --guide_seq GUIDE_SEQUENCE -n nhej -wc −3 -w 5 --plot_window_size 20 -o OUTPUT_FILE

Droplet digital PCR (ddPCR) for double-strand breaks

The ddPCR assay was designed following methods in previous studies53,54. Two ~150 bp amplicons were designed: Amplicon 1 spans the EMX1 target site, while Amplicon 2 is located about 200 bp downstream. A double-strand break (DSB) at the target site would prevent amplification of Amplicon 1, whereas Amplicon 2 would remain unaffected. Probes for the target and reference amplicons were ordered labeled with the fluorophores FAM and HEX, respectively.

The DSB-ddPCR reactions were assembled with ddPCR Supermix for Probes (No dUTP), 900 nM of each primer, 250 nM of each probe, and 15 ng of genomic DNA, quantified via Qubit dsDNA Quantification Assay (ThermoFisher Scientific). The sequences for all the primers and probes are provided in Table S11. Droplets were formed using a Bio-Rad QX200 Droplet Generator according to the manufacturer’s instructions. Thermal cycling was performed according to the following protocol: (95°C for 10:00; 40 cycles of 94°C for 0:30 followed by 63.3°C for 3:00; 97°C for 10:00; held at 4°C). Droplets were held at 4°C overnight following thermocycling and analyzed on a Bio-Rad X200 Droplet Reader the next day. Data was analyzed using the QX Manager Software. The percentage of alleles with double-strand breaks was calculated from the number of droplets that amplified the target amplicon (labeled with FAM) compared to those that amplified the reference amplicon (labeled with HEX). The equation utilized is as follows:

%DSB=100×1NtargetNreference

, where Ntarget and Nreference represents the number of droplets that amplified the target amplicon and the reference amplicon, respectively.

Western blot

Fresh or snap-frozen cell pellets were resuspended in RIPA Lysis and Extraction Buffer (ThermoFisher Scientific) containing 1× Protease Inhibitor Cocktail (ThermoFisher Scientific). Lysis was allowed to continue for 30 min, occasionally passing lysate through a syringe needle to thoroughly shear nuclei. Debris was removed by centrifugation at 20,000×g for 20 min at 4°C and the total protein concentration in the supernatant was measured using a Pierce BCA assay kit. 4 μg total protein was denatured at 95°C for 5 min and resolved by SDS-PAGE. Protein was transferred to a PVDF membrane. The membrane was blocked with Blocking Buffer (1× PBS; 0.1% (v/v) Tween-20, 5% (m/v) milk) for 1 h at room temperature, incubated with primary antibody in Blocking Buffer overnight at 4°C, washed three times with PBS/0.05% Tween-20 for 5 min each, incubated with dye-conjugated secondary antibody in Blocking Buffer for 1 h at room temperature and washed three times again with PBS/0.05% Tween-20 for 5 min each. Protein bands were visualized on an LI-COR Odyssey CLx with Image Studio v5.2 software using 700 nm and 800 nm channels.

Quantitative reverse transcription PCR (RT-qPCR) for sgRNA

At the indicated time points post-transfection, 1×106 cells were washed with PBS, resuspended in QIAzol, and stored at −80°C. RNA extraction was performed using an miRNeasy Mini Kit according to the manufacturer’s instructions. 1 μg total RNA per sample was used for reverse transcription using the High-Capacity cDNA Reverse Transcription Kit according to the manufacturer’s instructions. qPCR was performed using Power SYBR Green PCR Master Mix and primers specific for either the Cas9 sgRNA or β-actin mRNA (Table S12) on a CFX96 Touch Real-Time PCR Detection System.

ChIP-seq

5×106 HEK293T cells were seeded on a tissue-culture treated 10 cm plate the day before transfection. Cells were transfected using Lipofectamine 3000 with 7.5 μg of plasmid encoding dCas9 or dSpRY protein and 12.5 μg of plasmid encoding an EMX1-targeting sgRNA. Cells were passaged into a 15 cm plate one day post-transfection. At 72 h post-transfection, 50 million cells were harvested and fixed in 1% formaldehyde in PBS for 10 min. Fixation reaction was quenched with the addition of 125 mM glycine, cells were washed twice with ice-cold PBS, and pellets were flash-frozen in liquid nitrogen and stored at −80°C. Cells were thawed on ice, resuspended in Lysis Buffer 1 (50 mM HEPES, pH 7.5; 140 mM NaCl; 0.5% IGEPAL CA-630; 0.25% (v/v) Triton X-100; 10% (v/v) glycerol; 1 mM EDTA; supplemented with protease inhibitor cocktail, cOmplete EDTA-free, Roche) and incubated at 4°C with rotation for 10 min. Nuclei were pelleted, resuspended in Lysis Buffer 2 (10 mM HEPES, pH 7.5; 200 mM NaCl; 1 mM EDTA; 0.5 mM EGTA; 200 μg/mL RNase A; supplemented with protease inhibitor cocktail, cOmplete EDTA-free, Roche) and incubated at 4°C with rotation for 10 min. Nuclei were pelleted again and resuspended in 1.5 mL Sonication Buffer (10 mM HEPES, pH 7.5; 100 mM NaCl; 1 mM EDTA; 0.5 mM EGTA; 0.1% sodium deoxycholate; 0.5% N-Lauroylsarcosine; supplemented with protease inhibitor cocktail, cOmplete EDTA-free, Roche). Genomic DNA was sheared using a BioRuptor Pico (10 cycles, 30 s on, 30 s off). Anti-Cas9 antibody (Diagenode) was co-incubated with Protein A beads for ≥ 6 h prior to immunoprecipitation. Lysate was mixed with antibody-bound magnetic beads and rotated at 4°C overnight. Beads were washed 5 times with RIPA Buffer (50 mM HEPES, pH 7.5; 500 mM LiCl; 1 mM EDTA; 1% IGEPAL CA-630; 0.7% sodium deoxycholate), once with TEN Buffer (10 mM Tris-HCl, pH 8.0; 50 mM NaCl; 1 mM EDTA), and once with TE Buffer, with 3 min of rotation at 4°C between each wash, prior to elution in TES Buffer (50 mM Tris-HCl, pH 8.0; 10 mM EDTA; 1% SDS). 8 units of Proteinase K were added to samples and incubated at 55°C for 1 h. Reverse crosslinking was performed at 65°C overnight. DNA was purified using phenol-chloroform precipitation.

Sequencing libraries were generated with the NEBNext Ultra II DNA Library Prep Kit for Illumina according to the manufacturer’s instructions. DNA was purified using SPRIselect beads and library quality was confirmed on an Agilent 2100 Bioanalyzer prior to sequencing. Sequencing was performed on an Illumina NovaSeq 6000 using a 100×100 paired-end configuration and an S1 Reagent Kit (v1.5), achieving an average depth of ~160 million reads per sample. FASTQ files were aligned to GRCh38 using Bowtie 2 (version 2.5.2)45 with parameters ‘--no-discordant --local --no-mixed --maxins 1000’ to limit alignment artifacts for peak calling. Alignments were deduplicated, normalized to Counts Per Million (CPM) and converted into BigWig format using deepTools2 (version 3.5.1)46. The processed data was visualized in Integrative Genomics Viewer (IGV)47.

To assess the quality and reproducibility of the ChIP-seq experiment, pseudoreplicates were generated. Mapped reads from the original alignments were extracted and downsampled with seqtk (version 1.3) (https://github.com/lh3/seqtk) to create three pseudoreplicates, each with 50 million paired-end reads. Peak calling was conducted with MACS2 (version 2.2.9.1)48, using pseudoreplicates of the corresponding apo sample as background controls and a p-value threshold of 0.01. The resulting peaks were used to calculate the signal-to-noise ratio, defined as the ratio of reads within peaks to those outside. To further investigate noise, pseudoreplicate peak reproducibility was determined using the Irreproducible Discovery Rate (IDR) framework49 with the IDR package (version 2.0.4.2) (https://github.com/kundajelab/idr) at a false discovery rate of 0.05. Weak SNR (WT = 2.89, SpRY = 2.78) and low reproducibility (WT = 0.228%, SpRY = 0.151%) indicated suboptimal IP efficiency and background noise, rendering peak calling unfeasible. Instead, coverage was evaluated in genomic regions containing or lacking the seed region and PAM sequences—5′-AAGAANGG-3′ for WT and 5′-AAGAANRN-3′ for SpRY—specifically within a 25-bp range around the seed region and PAM. Coverage metrics, quantified as average reads per kilobase million (RPKM), were computed for designated regions in deduplicated bams (Picardtools version 2.21.9) (https://broadinstitute.github.io/picard/) and blacklisted regions of the genome were excluded, according to best practices55. Background subtraction was performed using averages from the corresponding apo replicates, ensuring only like-regions were adjusted. This process was facilitated by BEDTools (version 2.29.2)50 and a custom Python script.

Protein expression and purification

To express and purify Cas9 variants, we employed a modified protocol based on Cofsky et al6. Briefly, E. coli Rosetta (DE3) cells (Sigma-Aldrich), transformed with the appropriate bacterial expression plasmids, were cultured in Terrific Broth (TB) medium (ThermoFisher Scientific) containing ampicillin (0.1 mg/mL) and chloramphenicol (0.034 mg/mL). The cultures were initiated from a 1:80 dilution of an overnight starter culture and grown at 37°C. Once the optical density at 600 nm (OD600) reached 0.6–0.8, protein expression was induced with 0.5 mM isopropyl β-d-1-thiogalactopyranoside (IPTG) following a cold shock. Induction proceeded overnight at 16°C.

The next day, cells were pelleted by centrifugation and resuspended in Bacterial Lysis Buffer (20 mM HEPES, pH 7.5; 500 mM KCl; 10 mM imidazole; 10% (v/v) glycerol; 1 mM TCEP; supplemented with protease inhibitor cocktail, cOmplete EDTA-free, Roche). The cells were lysed by sonication and were then ultra-centrifuged. The supernatant was applied to Ni-NTA resin (QIAGEN), washed with Wash Buffer (20 mM HEPES, pH 7.5; 500 mM KCl; 30 mM imidazole; 5% (v/v) glycerol; 1 mM TCEP) and then eluted with 300 mM imidazole in the same buffer. Proteins were treated with TEV protease overnight at 4°C during dialysis in the Dialysis Buffer (20 mM HEPES, pH 7.5; 300 mM KCl; 30 mM imidazole; 5% (v/v) glycerol; 1 mM TCEP). Digested and dialyzed proteins were separated using a HisTrap column (Cytiva), and the Cas9-containing flow-through was collected. This was followed by further purification using a HiTrap Heparin HP affinity column (Cytiva), with protein elution performed via a KCl gradient from 300 mM to 1 M. The final purification step involved size-exclusion chromatography on a Superdex 200 Increase 10/300 GL column (Cytiva) in the Protein Storage Buffer (20 mM HEPES, pH 7.5; 150 mM KCl; 10% (v/v) glycerol; 1 mM TCEP). Purified Cas9 proteins were aliquoted, snap-frozen, and stored at −80°C.

The protein purification was validated using SDS-PAGE analysis (Fig. S1F) and DNA cleavage assay, which will be discussed shortly (Fig. S1I). For SDS-PAGE analysis, protein samples were prepared by mixing one volume of the sample with 0.25 volume of 5× SDS Loading Dye (250 mM Tris-HCl, pH 6.8; 75 mM EDTA; 30% (v/v) glycerol; 10% SDS supplemented with bromophenol blue), heated at 90°C for 2 min, and then 10 pmol was loaded onto 10% Mini-PROTEAN TGX Precast Protein Gels (Bio-Rad), with a PageRuler Prestained Protein Ladder (ThermoFisher Scientific). The gels were stained with Coomassie Staining Buffer (30% (v/v) ethanol; 10% (v/v) acetic acid; 1g R250-coomassie) followed by destaining (40% (v/v) ethanol; 10% (v/v) acetic acid) or with instant InstantBlue Coomassie Protein Stain (Abcam). The gels were imaged using a Bio-Rad ChemiDoc system.

DNA cleavage assay

For most DNA cleavage assay, Cas9 (or its variants) and sgRNA were co-incubated at a 1:1.25 molar ratio to form RNP complexes at 24°C or 37°C for 15 min in the 1.05× cleavage reaction buffers, each tailored to different Mg2+ ion concentration. The 10 mM Mg2+ Cleavage Buffer (1×) contained 20 mM HEPES (pH 7.5), 100 mM KCl, 10 mM MgCl2, 1% (v/v) glycerol, 0.5 mM DTT. The 5 mM Mg2+ Cleavage Buffer (1×) contained 20 mM Tris-HCl (pH 7.5), 100 mM KCl, 5 mM MgCl2, 5% (v/v) glycerol. The 0.2 mM Mg2+ Cleavage Buffer (1×) contained 20 mM HEPES (pH 7.5), 100 mM KCl, 0.2 mM MgCl2, 1% (v/v) glycerol, 0.5 mM DTT. The reaction was initiated by adding the RNP complex (19 volumes) to the target dsDNA substrate (1 volume). For competition cleavage assays, target dsDNA substrate and competitor DNA were pre-mixed before adding the RNP complex. The final concentrations were 10 nM dsDNA substrate, 100 nM Cas9, 125 nM sgRNA, 1× cleavage buffer, unless otherwise specified. For protein active fraction experiments, the reactions involved 100 nM dsDNA substrate, 100 nM Cas9 and 125 nM sgRNA, with the fraction of DNA cleaved after 2 h representing the active enzyme fraction (the active fractions ranging from 37% to 50% as shown in Fig. S1G). Cleavage reactions were conducted at either 24°C or 37°C for up to 2 h and quenched at different time points by adding an equal volume of 2× Cleavage Quenching Buffer (94% (v/v) formamide; 30 mM EDTA; 400 μg/mL heparin, supplemented with bromophenol blue). Samples were then heated at 90°C for 5 min and resolved on a 15% UREA-PAGE gel.

The gel was scanned using a Typhoon (Amersham, GE Healthcare) with excitation at 488 nm and a Cy2 emission filter (525BP20) for FAM-labeled oligonucleotide, or with excitation at 635 nm and a Cy5 emission filter (670BP30) for Cy5-labeled oligonucleotide. Band intensities were quantified using Bio-Rad ImageLab 6.1 software, and the resulting data were fitted to a mono-exponential decay or double-exponential decay using the curve_fit function in Scipy Python package:

Y=A×1ekobs×t(mono-exponential)Y=Afast×1ekfast×t+Aslow×1ekslow×t(double-exponential)

, where Y is the cleaved DNA product (%) at a given time, A (or Afast and Aslow) represents the amplitude of the exponential decay, kobs (or kfast and kslow) represents the rate constant of the exponential decay, t is time (min). All data were initially fitted using a double-exponential model. If the second phase was poorly defined or its amplitude was close to zero, a mono-exponential model was then applied. All the experiments were performed in biological triplicates. The fitting parameters and their errors were calculated as the average and their standard deviation across the replicates.

2-aminopurine fluorescence assay

Cas9 and sgRNA were assembled with Cas9 in more than a 2-fold molar excess over sgRNA in 5 mM Mg2+ Cleavage Buffer (1×), and incubated at 24°C for 30 min. The reaction was initiated by adding the RNP complex to 2AP-containing dsDNA substrate at a final molar ratio of 8:1 (RNP complex to DNA) at room temperature (24°C) in a black 384 well plate. The final concentrations in the reactions were 1 μM dsDNA substrate, 10 μM Cas9 RNP (defined by 10 μM sgRNA), 5 mM Mg2+ Cleavage Buffer (1×). Fluorescence emission (λem370nm, λex320nm) for each reaction was recorded every 20 seconds on a Cytation 5 plate reader (Biotek, software Gen v3.04). The fluorescence data over time were fitted to the monoexponential decay using the curve_fit function in Scipy Python package:

Y=Y0+Ymax×1ekobs×t

, where Y is the fluorescence signal at a given time, Ymax is the fitted fluorescence endpoint, Y0 is the average value from a control reaction where the RNP complex (with a non-targeting sgRNA, sgRNA8) was added to 2AP containing dsDNA (Fig. S2B), and kobs is the observed rate constant of the exponential decay. Each reaction was carried out in triplicate, and the average fluorescence values were used to fit the parameters, with their standard fitting errors reported.

Permanganate DNA footprinting assay

We adapted a similar protocol based on Cofsky et al6. Briefly, a dsDNA substrate with an NTS:TS ratio of 1.5:1, where the TS was 5′-labeled with FAM, was mixed with Cas9 RNP (Cas9 and sgRNA at 1:1.25 molar ratio) in a 1.1× Permanganate Reaction Buffer and incubated at 30°C for 15 min. The 1× Permanganate Reaction Buffer included 20 mM Tris-HCl (pH 7.9), 24 mM KCl, 5 mM MgCl2, 0.1 mg/mL BSA, 0.01% (v/v) Tween-20, 1 mM TCEP. The reaction was initiated by combining the RNP-DNA complex (9 volumes) with 50 mM KMnO4 (1 volume) and the mixture was incubated for 2 min at 30°C. The final concentrations in the reaction were 200 nM dsDNA substrate, 16 μM Cas9 RNP, 5 mM of KMnO4 and 1× Permanganate Reaction Buffer. The reaction was quenched by the addition of equal volume of 2× Permanganate Quench Solution (2 M β-mercaptoethanol; 30 mM EDTA).

The DNA from the quenched reaction was extracted using phenol:chloroform:isoamyl (PCI) and precipitated with 2.5 volumes of 100% ethanol at −20°C overnight. The ethanol precipitate was then washed twice using 70% ethanol and then resuspended in 70 μL of 10% (v/v) piperidine, followed by incubated at 90°C for 30 min. The samples were lyophilized using vacuum concentration, resuspended in 10 μL of 1× UREA-PAGE Loading Dye (50% (v/v) formamide; supplemented with bromophenol blue) and then analyzed on a 15% UREA-PAGE gel. The gels were imaged by Typhoon as described above.

The probability of cleavage occurring at a specific thymine i is defined as:

Pcleave,i=Vij=1nVj

, where Vi represents the volume of band corresponding to the cleavage at position i within a lane containing n bands (with band 1 being the smallest cleavage fragment and band n corresponding to the full length DNA). The oxidation probability of thymine i is then defined as:

Pox,i=Pcleave,i,+PMPcleave,i,PM

, where +PM refers to the experiment with 5 mM KMnO4 and -PM refers to the control experiment without permanganate. An extensive description of the analysis is provided in Cofsky et al56.

EMSA

Binding reactions were performed by incubating 100 nM catalytically inactive Cas9 RNP (dCas9 or dSpRY is incubated with sgRNA at 1:1.25 molar ratio) with 10 nM target dsDNA substrate in the 10 mM Mg2+ Cleavage Buffer for 30 min. For binding experiments involving competitor DNA, the competitor was premixed with the dsDNA substrate, as was done in the cleavage assay. The reactions were quenched by mixing with an equal volume of 2× Native Gel Loading Solution (10 mM Tris-HCl, pH 7.5; 50% (v/v) glycerol) and analyzed on a 6% non-denaturing PAGE gel. The gels were imaged by Typhoon as described above.

In binding experiments of competitor DNA (without target dsDNA substrate), the competitor DNA was added to a final concentration of 15 nM and mixed with the RNP complex in 10 mM Mg2+ Cleavage Buffer. The binding reactions were incubated at 37°C for 1 h, then mixed with 0.2 volumes of 6× Agarose Gel Loading Dye (Purple, no SDS, New England Biolabs) and analyzed on a 1% agarose gel containing SYBR Safe. The gels were imaged by a Bio-Rad ChemiDoc system.

Gold rotor bead tracking (AuRBT)

Reagents: Bottom-constrained DNA tethers were assembled by ligating restriction enzyme digested PCR products11,29,57. These products included a segment with multiple incorporated dUTP-digoxigenin (Roche) for creating a torsionally constrained attachment at the coverslip surface, and another segment using a 5’-modified PCR primer (Integrated DNA Technologies) for an unconstrained attachment at the magnetic bead. The sequence of interest, which contained the 20-bp target sequence adjacent to either an NGG site (PAM), or an NCG site (no PAM controls), was assembled by annealing two 5’ phosphorylated oligonucleotides (Integrated DNA Technologies). Detailed information on the tether construction, including building blocks, can be found in Fig. S3A and Table S15S17. Magnetic beads (ThermoFisher, Dynabeads MyOne Carboxylic Acid) were crosslinked with an antibody to Fluorescein/Oregon Green (ThermoFisher, A889) via EDC (ThermoFisher, 77149), as previously described58. These beads were prepared and stored at 4°C for up to two months.

Chambers:

Flow chambers were constructed by sandwiching custom laser-cut Nescofilm channels between a hole-punched vinyl coverslip and a glass coverslip spin coated with 0.1% nitrocellulose in isopentyl acetate (Ladd Research, 10800). On the day before experiments, streptavidin-coated gold nanospheres (20 μL original suspended volume per channel) with a nominal diameter of 60 nm (Cytodiagnostics, ACC-60–04-15) were washed twice in Au wash buffer (50 mM Tris-HCl, pH 8.0; 0.05% TWEEN-20), centrifuging at 2000 g for 5 minutes between washes. These were finally resuspended to their original suspended volume in blocking buffer (40 mM Tris-HCl, pH 8.0; 0.5 M NaCl; 0.2% TWEEN-20; 0.01% sodium azide; 5 mg/mL BSA). DNA tethers (~2 pM) and magnetic beads (1.5 μL original suspended volume per channel) were added to the gold nanospheres in blocking buffer and incubated overnight at 4°C on a rotator. Channels were also incubated overnight at 4°C with 12 μg/mL anti-digoxigenin in PBS buffer. On the day of experiments, channels were (1) incubated with blocking buffer supplemented with 0.25% w/v casein for 1 h, (2) incubated with DNA tethers and beads for 1 h, and (3) washed with approximately 15 channel volumes of binding buffer (40 mM Tris-HCl, pH 8.0; 0.5 M NaCl; 0.2% TWEEN-20; 0.01% sodium azide; 0.1 mg/mL BSA). Before imaging, channels were again washed with approximately 6 channel volumes of C9T imaging buffer (20 mM Tris-HCl, pH 7.5; 100 mM KCl; 5 mM MgCl2; 0.1 mM EDTA; 1 mM TCEP; 0.2 mg/mL BSA). All flow steps were controllably carried out with a syringe pump.

Microscopy:

Experiments were performed on a custom-built AuRBT microscope as previously described29,57. Briefly, this comprised a modified Nikon Ti-S inverted microscope and an 845-nm single-mode diode laser (Lumics LU0845M200) equipped with a polarization-maintaining fiber (Corning PM 780). For evanescent darkfield imaging of the rotor beads, a custom mount was used to hold small mirrors that couple light into and out of the objective. The totally internally reflected return beam was collected on a position-sensitive detector for the purpose of focal plane stabilization. Scattered light from rotor beads was imaged through an optical path splitter (Cairn Research, Optosplit III) onto a high-speed CMOS camera (Mikrotron, EoSens CL). Magnetic tweezers were implemented using permanent magnets mounted on motorized stages for rotation and translation. For all AuRBT measurements, the DNA was held under 7 pN of tension, and movies were recorded at a frame rate of 5 kHz at room temperature (22 ± 1°C).

Data collection:

To ensure a selected DNA molecule has made stable attachments, we first recorded the thermal fluctuations of the rotor bead in C9T buffer for at least 5 min. We also confirmed the correct attachments were made by rotating the magnets. Following this, we introduced the RNP complexes. These Cas9-sgRNA complexes were formed by incubating the Cas9 with a 1.25× molar excess of sgRNA for 10 min at 37°C in C9T Buffer. The RNP was diluted to the specified concentrations in C9T Buffer before being introduced into the channel. Data sets for each combination of Cas9 protein, DNA sequence, and sgRNA sequence were compiled across at least two imaging sessions. Detailed AuRBT trace statistics — including tether counts, unique chambers, tracking durations, and numbers of transition events — are provided in Tables S18S20.

Data processing:

AuRBT imaging data were processed following the methods previously described11,29. Briefly, each image is fit using a 2D Gaussian fitting function to extract peak intensity x and y positions. Residual drift was corrected by fitting an ellipse to each 1-s interval of xy trajectory data and then subtracting the centers. Finally, rotor bead angles were calculated by taking the inverse tangent of each corrected pair of (x,y) coordinates and then unwrapping the result. Rare zero crossings (erroneous jumps of integer rotations) caused by a bead trajectory passing through the origin and/or spurious shifts in angle caused by background diffusing particles were identified and manually corrected in MATLAB.

For analysis of rotor bead angle trajectories, we assume that any deviations from zero twist are due to RNP interactions leading to the unwinding of B-DNA, with helicity of 10.5 bp/turn. To analyze these unwinding events, we first filtered the data to 500 Hz and then employed the Steppi change-point analysis tool31, modeling the data as originating from an Ornstein-Uhlenbeck process. Global stiffness and coupling parameters were fixed by analyzing a portion of the trace before the introduction of any RNP. We also set the level slope parameter to zero. The only free model parameters were the mean angle of the rotor bead and the change point time.

For the data collected with WT SpyCas9 and sgRNAs containing a 3-bp match to the target DNA, we classified the DNA as “unwound” if the change in mean position of the rotor bead exceeded 1 bp; otherwise, the DNA is classified as “closed”. Consecutively scored events in the same state were merged. To faithfully reflect asymmetry in the raw scored data, we also merged states where the DNA was overwound by more than 1 bp. The extent of unwinding in these merged states was calculated as the weighted average of contributing states, with weights based on their lifetime, and the dwell times of these states were summed to obtain the total time spent in each state. The same analysis protocol was applied to no PAM control experiments. The average lifetime in the unwound state was calculated by fitting the dwell time distribution of states in the unwound cluster to a 100-ms left-censored double exponential using maximum likelihood estimation in MEMLET44. Trace statistics for these measurements are provided in Table S19.

Similar experiments were conducted with SpRY. For Target2, the sgRNA (sgRNA4.1) used consisted of up to 3-bp matching from the seed to the DNA below the rotor bead. After introducing SpRY:sgRNA4.1 to the tether, we observed a concentration-dependent and prolonged shift in the rotor bead’s mean angle, reflecting unwinding of the DNA helix (Fig. 4E, 4F). This unwinding was interpreted as SpRY interacting specifically or non-specifically with the DNA. The mean change in DNA twist was calculated by subtracting the initial mean rotor angle from the final mean rotor angle after SpRY introduction. These mean changes in DNA twist as a function of SpRY concentration were fit to a Langmuir model to estimate the dissociation constant KD and the saturating angle ΔΘsat

Δθ0[RNP]=ΔΘsat[RNP]KD+[RNP].

A similar analysis was also applied to the non-specific binding events observed in the dSpRY:sgRNA3 experiments on Target1 (to be introduced shortly), where sgRNA3 has 20-bp match to the target DNA (Fig. S3E). For the data analysis in these experiments, the mean change in DNA twist was instead calculated by subtracting the initial mean rotor angle before SpRY introduction from the final mean rotor angle, which was determined by the averaged positions of the closed states.

For the data collected with sgRNA3 (20-bp match to Target1), states were categorized into “closed” (C), “intermediate” (I), or “open” (O) clusters and then merged as previously described11. State boundaries were set to align with state definitions from previous work11. For experimental traces obtained using dSpRY, data were re-zeroed so that the predominant closed state corresponded to Δθ0=0 bp, accounting for minor unwinding from off-target binding that persisted throughout the data collection period (Fig. S3D). Transition rates between the three states were calculated by dividing the number of transitions from state i to state j by the total time spent in state i. Transitions are counted starting after the first transition event, typically CI. Although direct transitions between the closed and open states were rare, transitions among all three states were considered, and transition rate constants (kCI, kIC, kIO, kOI, kCO, and kOC) were calculated (Tables S1 and S18). Nonlinear least-squares fitting of a linear model for dCas9 and a hyperbolic saturation model for dSpRY was performed using MATLAB to fit kCI versus RNP concentration. Error bars were calculated assuming Poisson statistics.

Cartoons of free energy landscapes (Fig. 3C) were drawn with reference to the kinetic models shown in Fig. 3B11. Apparent equilibrium constants were computed using

Kij=kijkji

, where kij represents the transition rate from state i to state j. Estimated free energy differences were then computed using

ΔGij=kBTlnKij

, where kB represents the Boltzmann constant and T is temperature. The landscapes were constructed using a concentration of 100 nM RNP. Well positions for Cbound, I, and O states were set to the average bps unwound in each merged state cluster (0 bp, 10 bp, and 20 bp, respectively), and transition state locations were informed by previous work11. We arbitrarily set the locations of Cfree for illustrative purposes. Relative barrier heights were derived from estimated and calculated transition rates and represented as

kBTlnkij+C

, where C=7kBT is an arbitrary constant.

Quantification and Statistical Analysis

Statistical analyses for HEK293T editing experiments were performed using GraphPad Prism (version 10.4.1). ChIP-seq data and biochemical assays were analyzed with Python, and AuRBT experiments were analyzed using MATLAB (version 9.6.0). Replicate numbers, details of statistical tests, and descriptions of error bars are provided in the figure captions and method details. AuRBT trace quantifications and statistics — including tether counts, unique chambers, tracking durations, and numbers of transition events — are provided in Tables S18S20.

Supplementary Material

1
2

KEY RESOURCES TABLE

REAGENT or RESOURCE SOURCE IDENTIFIER
Antibodies
Anti-human β2-microglobulin (2M2)-APC BioLegend Cat#316312
Monoclonal ANTI-FLAG M2 antibody produced in mouse Millipore sigma Cat#F1804
CRISPR/Cas9 antibody Diagenode Cat#C15310258
Histone H3 (D1H2) XP Rabbit mAb Cell Signaling Technologies Cat#4499S
Fluorescein/Oregon Green Polyclonal Antibody ThermoFisher Cat#A-889
Anti-Digoxigenin from sheep MilliporeSigma (Roche) Cat#11333089001
Bacterial and virus strains
E. coli Rosetta 2 (DE3) Competent Cells EMD Millipore Cat#71397
Chemicals, peptides, and recombinant proteins
GTP Sigma-Aldrich G8877–1G
ATP Sigma-Aldrich A8937–1G
UTP Sigma-Aldrich U6625–1G
CTP Sigma-Aldrich C1506–1G
Spermidine Sigma-Aldrich S2626–1G
T7 RNA polymerase N/A N/A
Ni-NTA Superflow Qiagen Cat#30430
TEV protease N/A N/A
Mini-PROTEAN TGX Precast Protein Gel Bio-Rad Cat#4561033
Piperidine Sigma-Aldrich 411027–100ML
Potassium permanganate Sigma-Aldrich 223468–25G
dUTP-digoxigenin MilliporeSigma (Roche) Cat#11093088910
Streptavidin-coated gold nanospheres Cytodiagnostics Cat#ACC-60–04-15
Dynabead MyOne Carboxylic Acid ThermoFisher Cat#65012
Proteinase K NEB Cat#P8107S
Dynabeads Protein A for Immunoprecipitation Invitrogen Cat#10002D
Salmon Sperm DNA ThermoFisher AM9680
Dulbecco’s Modified Eagle Medium Corning Cat#10–013-CV
Fetal Bovine Serum VWR Cat#97068–085
Penicillin/streptomcyin Gibco Cat#15140122
Lipofectamine 3000 Transfection Reagent Invitrogen Cat#L3000015
QuickExtract DNA Extraction Solution Biosearch Technologies Cat#QE09050
Q5 High-Fidelity 2x Master Mix NEB Cat#M0492L
ddPCR Supermix for Probes (No dUTP) Bio-Rad Cat#1863024
RIPA Lysis and Extraction Buffer Fisher Scientific Cat#PI89900
Protease Inhibitor Cocktail (100X) Thermo Scientific Cat#78429
High-Capacity cDNA Reverse Transcription Kit Applied Biosystems Cat#4368814
Power SYBR Green PCR Master Mix Applied Biosystems Cat#4367659
SPRIselect Beckman Coulter Cat#B23317
cOmplete, Mini, EDTA-free Protease Inhibitor Cocktail Roche Cat#11836170001
Isopropyl β-d-1-thiogalactopyranoside (IPTG) Gold Biotechnology Cat#I2481C
Critical commercial assays
SF Cell Line 96-well Nucleofector Kit Lonza Cat#V4SC-2096
NEBNext Ultra II DNA Library Prep Kit for Illumina NEB Cat#E7645L
Qubit dsDNA Quantification Assay Invitrogen Cat#Q32854
Pierce BCA Protein Assay Kit Thermo Scientific Cat#23225
miRNeasy Mini Kit Qiagen Cat#217004
Deposited data
Amplicon sequencing data from genome editing experiments This study NCBI SRA BioProject ID: PRJNA1239632
ChIP-seq data This study NCBI SRA BioProject ID: PRJNA1239632
Unprocessed gel images & raw data from cell & single molecule experiments This study Mendeley data: https://doi.org/10.17632/hzr4fnvc88.1
Experimental models: Cell lines
Human embryonic kidney 293T cells UC Berkeley Cell Culture Facility N/A
Oligonucleotides
Fluorescently labeled DNA substrates IDT Table S13
Cas9 sgRNAs IDT Tables S5, S8, and S9
Primers and oligonucleotides for AuRBT tether construction IDT Tables S15 and S16
RT-qPCR primers IDT Table S12
ddPCR primers/probes IDT Table S11
NGS primers IDT Table S10
Recombinant DNA
Mammalian Cas9 or sgRNA expression plasmids This study, modified from Addgene#62988 Tables S4S7
Bacterial expression plasmids for Cas9 variants This study, modified from Addgene#179525 Tables S4, S6 and S7
pGGASelect NEB Cat#195714
Table S14
Software and algorithms
FlowJo v10.10.0 FlowJo https://www.flowjo.com/solutions/downloads/
Image Lab 6.1 Bio-Rad Laboratories, Inc. https://www.bio-rad.com/en-us/product/image-lab-software?ID=KRE6P5E8Z
MATLAB 9.6.0.1472908 (R2019a) Update 9 The MathWorks Inc. https://www.mathworks.com
MEMLET 44 https://michaelswoody.github.io/MEMLET/
Bowtie2 version 2.5.2 45 https://github.com/BenLangmead/bowtie2
deepTools2 version 3.5.1 46 https://github.com/deeptools/deepTools
IGV version 2.19.1 47 https://igv.org/
Seqtk version 1.3 N/A https://github.com/lh3/seqtk
MACS2 version 2.2.9.1 48 https://github.com/macs3-project/MACS
IDR version 2.0.4.2 49 https://github.com/kundajelab/idr
Picard version 2.21.9 Broad Institute https://github.com/broadinstitute/picard
BEDTools version 2.29.2 50 https://github.com/arq5x/bedtools2
Python version 3.9.12 Python Software Foundation https://www.python.org/
GraphPad Prism version 10.4.1 GraphPad https://www.graphpad.com/

HIGHLIGHTS.

  • Cas9 identifies target sequences in two steps: initial PAM binding and DNA unwinding.

  • A PAM-relaxed Cas9 is kinetically trapped after initial binding.

  • Kinetic traps slow target search and target unwinding, reducing editing efficiency.

  • Efficient editing is favored by specific yet weak PAM binding and fast unwinding.

Acknowledgements

We thank Mr. Peter Yoon and other members of the Doudna lab and the Bryant lab for helpful discussions. We would also like to acknowledge Ms. Netravathi Krishnappa (NGS Core Operations Manager and Sequencing Specialist, Center for Translational Genomics, Innovative Genomics Institute, UC Berkeley) and Dr. Suhua Feng (Associate Project Scientist, High-Throughput Sequencing Core, Broad Stem Cell Research Center, UCLA) for NGS. H.S. is an HHMI Fellow of The Jane Coffin Childs Fund for Medical Research. N.A. is a recipient of the National Science Foundation Graduate Research Fellowship. P.S. is supported by the Swiss National Science Foundation Mobility fellowship (P500PB_214418). E.E.D. is supported by a fellowship award from the National Institutes of Health (F32GM153031). J.C.C. is a fellow of the Helen Hay Whitney Foundation. J.A.D. is an Investigator of the Howard Hughes Medical Institute (HHMI). This project was supported by the National Institutes of Health (U01AI142817 to J.A.D. and R01GM106159 to Z.B.). Figure cartoons generated with biorender.com.

Declaration of interests

The Regents of the University of California have patents issued and pending for CRISPR technologies on which J.A.D. is an inventor. J.A.D. is a cofounder of Azalea Therapeutics, Caribou Biosciences, Editas Medicine, Evercrisp, Scribe Therapeutics_and Mammoth Biosciences. J.A.D. is a scientific advisory board member at Evercrisp, Caribou Biosciences, Scribe Therapeutics, Mammoth Biosciences, The Column Group and Inari. She also is an advisor for Aditum Bio. J.A.D. is Chief Science Advisor to Sixth Street, a Director at Johnson & Johnson, Altos and Tempus, and has a research project sponsored by Apple Tree Partners.

Footnotes

Declaration of generative AI and AI-assisted technologies

During the preparation of this work, the authors used GPT-4o to assist with text editing. After using this tool or service, the authors reviewed and edited the content as needed, and take full responsibility for the content of the publication.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Wang JY, and Doudna JA (2023). CRISPR technology: A decade of genome editing is only the beginning. Science 379, eadd8643. doi: 10.1126/science.add8643. [DOI] [PubMed] [Google Scholar]
  • 2.Pacesa M, Pelea O, and Jinek M. (2024). Past, present, and future of CRISPR genome editing technologies. Cell 187, 1076–1100. 10.1016/j.cell.2024.01.042. [DOI] [PubMed] [Google Scholar]
  • 3.Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, and Charpentier E. (2012). A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821. 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sternberg SH, Redding S, Jinek M, Greene EC, and Doudna JA (2014). DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature 507, 62–67. 10.1038/nature13011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Anders C, Niewoehner O, Duerst A, and Jinek M. (2014). Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature 513, 569–573. 10.1038/nature13579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Cofsky JC, Soczek KM, Knott GJ, Nogales E, and Doudna JA (2022). CRISPR-Cas9 bends and twists DNA to read its sequence. Nature structural & molecular biology 29, 395–402. 10.1038/s41594-022-00756-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Jiang FG, Taylor DW, Chen JS, Kornfeld JE, Zhou KH, Thompson AJ, Nogales E, and Doudna JA (2016). Structures of a CRISPR-Cas9 R-loop complex primed for DNA cleavage. Science 351, 867–871. 10.1126/science.aad8282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Bravo JPK., Liu MS., Hibshman GN., Dangerfield TL., Jung K., McCool RS., Johnson KA., and Taylor DW. (2022). Structural basis for mismatch surveillance by CRISPR-Cas9. Nature 603, 343–347. 10.1038/s41586-022-04470-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Pacesa M, Loeff L, Querques I, Muckenfuss LM, Sawicka M, and Jinek M. (2022). R-loop formation and conformational activation mechanisms of Cas9. Nature 609, 191–196. 10.1038/s41586-022-05114-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Szczelkun MD, Tikhomirova MS, Sinkunas T, Gasiunas G, Karvelis T, Pschera P, Siksnys V, and Seidel R. (2014). Direct observation of R-loop formation by single RNA-guided Cas9 and Cascade effector complexes. Proceedings of the National Academy of Sciences of the United States of America 111, 9798–9803. 10.1073/pnas.1402597111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Ivanov IE, Wright AV, Cofsky JC, Aris KDP, Doudna JA, and Bryant Z. (2020). Cas9 interrogates DNA in discrete steps modulated by mismatches and supercoiling. Proceedings of the National Academy of Sciences of the United States of America 117, 5853–5860. 10.1073/pnas.1913445117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Knight SC, Xie L, Deng W, Guglielmi B, Witkowsky LB, Bosanac L, Zhang ET, El Beheiry M, Masson JB, Dahan M, et al. (2015). Dynamics of CRISPR-Cas9 genome interrogation in living cells. Science 350, 823–826. 10.1126/science.aac6572. [DOI] [PubMed] [Google Scholar]
  • 13.Jones DL, Leroy P, Unoson C, Fange D, Curic V, Lawson MJ, and Elf J. (2017). Kinetics of dCas9 target search in Escherichia coli. Science 357, 1420–1424. 10.1126/science.aah7084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Gong S, Yu HH, Johnson KA, and Taylor DW (2018). DNA Unwinding Is the Primary Determinant of CRISPR-Cas9 Activity. Cell reports 22, 359–371. 10.1016/j.celrep.2017.12.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Liu Y, Zou RS, He S, Nihongaki Y, Li X, Razavi S, Wu B, and Ha T. (2020). Very fast CRISPR on demand. Science 368, 1265–1269. 10.1126/science.aay8204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Hu JH, Miller SM, Geurts MH, Tang W, Chen L, Sun N, Zeina CM, Gao X, Rees HA, Lin Z, and Liu DR (2018). Evolved Cas9 variants with broad PAM compatibility and high DNA specificity. Nature 556, 57–63. 10.1038/nature26155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nishimasu H, Shi X, Ishiguro S, Gao LY, Hirano S, Okazaki S, Noda T, Abudayyeh OO, Gootenberg JS, Mori H, et al. (2018). Engineered CRISPR-Cas9 nuclease with expanded targeting space. Science 361, 1259–1262. 10.1126/science.aas9129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Walton RT, Christie KA, Whittaker MN, and Kleinstiver BP (2020). Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants. Science 368, 290–296. 10.1126/science.aba8853. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Zhao L, Koseki SRT, Silverstein RA, Amrani N, Peng C, Kramme C, Savic N, Pacesa M, Rodriguez TC, Stan T, et al. (2023). PAM-flexible genome editing with an engineered chimeric Cas9. Nature communications 14, 6175. 10.1038/s41467-023-41829-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Hibshman GN, Bravo JPK, Hooper MM, Dangerfield TL, Zhang H, Finkelstein IJ, Johnson KA, and Taylor DW (2024). Unraveling the mechanisms of PAMless DNA interrogation by SpRY-Cas9. Nature communications 15, 3663. 10.1038/s41467-024-47830-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Olivi L, Bagchus C, Pool V, Bekkering E, Speckner K, Offerhaus H, Wu WY, Depken M, Martens KJA, Staals RHJ, and Hohlbein J. (2024). Live-cell imaging reveals the trade-off between target search flexibility and efficiency for Cas9 and Cas12a. Nucleic acids research 52, 5241–5256. 10.1093/nar/gkae283. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zhang W., Yin J., Zhang-Ding Z., Xin C., Liu M., Wang Y., Ai C., and Hu J. (2021). In-depth assessment of the PAM compatibility and editing activities of Cas9 variants. Nucleic acids research 49, 8785–8795. 10.1093/nar/gkab507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Yang C, Zhou Z, Sun X, Ju H, Yue X, Rao S, and Xue C. (2024). PAMless SpRY exhibits a preference for the seed region for efficient targeting. Cell reports 43, 114225. 10.1016/j.celrep.2024.114225. [DOI] [PubMed] [Google Scholar]
  • 24.Raper AT, Stephenson AA, and Suo Z. (2018). Functional Insights Revealed by the Kinetic Mechanism of CRISPR/Cas9. Journal of the American Chemical Society 140, 2971–2984. 10.1021/jacs.7b13047. [DOI] [PubMed] [Google Scholar]
  • 25.Raju B, Murphy E, Levy LA, Hall RD, and London RE (1989). A fluorescent indicator for measuring cytosolic free magnesium. The American journal of physiology 256, C540–548. 10.1152/ajpcell.1989.256.3.C540. [DOI] [PubMed] [Google Scholar]
  • 26.Fatholahi M, LaNoue K, Romani A, and Scarpa A. (2000). Relationship between Total and Free Cellular Mg2+ during Metabolic Stimulation of Rat Cardiac Myocytes and Perfused Hearts. Archives of biochemistry and biophysics 374, 395–401. 10.1006/abbi.1999.1619. [DOI] [PubMed] [Google Scholar]
  • 27.Jean JM, and Hall KB (2001). 2-Aminopurine fluorescence quenching and lifetimes: role of base stacking. Proceedings of the National Academy of Sciences of the United States of America 98, 37–41. 10.1073/pnas.98.1.37. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Li Y, Liu Y, Singh J, Tangprasertchai NS, Trivedi R, Fang Y, and Qin PZ (2022). Site-Specific Labeling Reveals Cas9 Induces Partial Unwinding Without RNA/DNA Pairing in Sequences Distal to the PAM. CRISPR J 5, 341–352. 10.1089/crispr.2021.0100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Lebel P, Basu A, Oberstrass FC, Tretter EM, and Bryant Z. (2014). Gold rotor bead tracking for high-speed measurements of DNA twist, torque and extension. Nature methods 11, 456–462. 10.1038/nmeth.2854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Singh D, Sternberg SH, Fei J, Doudna JA, and Ha T. (2016). Real-time observation of DNA recognition and rejection by the RNA-guided endonuclease Cas9. Nature communications 7, 12778. 10.1038/ncomms12778. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Wiggins PA (2015). An information-based approach to change-point analysis with applications to biophysics and cell biology. Biophysical journal 109, 346–354. 10.1016/j.bpj.2015.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mekler V, Minakhin L, and Severinov K. (2017). Mechanism of duplex DNA destabilization by RNA-guided Cas9 nuclease during target interrogation. Proceedings of the National Academy of Sciences 114, 5443. 10.1073/pnas.1619926114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Aboul-ela F, Koh D, Tinoco I Jr., and Martin FH (1985). Base-base mismatches. Thermodynamics of double helix formation for dCA3XA3G + dCT3YT3G (X, Y = A,C,G,D. Nucleic acids research 13, 4811–4824. 10.1093/nar/13.13.4811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Oberstrass FC, Fernandes LE, and Bryant Z. (2012). Torque measurements reveal sequence-specific cooperative transitions in supercoiled DNA. Proceedings of the National Academy of Sciences of the United States of America 109, 6106–6111. 10.1073/pnas.1113532109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Komor AC, Kim YB, Packer MS, Zuris JA, and Liu DR (2016). Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature 533, 420–424. 10.1038/nature17946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Gaudelli NM, Komor AC, Rees HA, Packer MS, Badran AH, Bryson DI, and Liu DR (2017). Programmable base editing of A*T to G*C in genomic DNA without DNA cleavage. Nature 551, 464–471. 10.1038/nature24644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Anzalone AV, Randolph PB, Davis JR, Sousa AA, Koblan LW, Levy JM, Chen PJ, Wilson C, Newby GA, Raguram A, and Liu DR (2019). Search-and-replace genome editing without double-strand breaks or donor DNA. Nature 576, 149–157. 10.1038/s41586-019-1711-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Qi LS, Larson MH, Gilbert LA, Doudna JA, Weissman JS, Arkin AP, and Lim WA. (2013). Repurposing CRISPR as an RNA-Guided Platform for Sequence-Specific Control of Gene Expression. Cell 152, 1173–1183. 10.1016/j.cell.2013.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Gilbert LA, Larson MH, Morsut L, Liu Z, Brar GA, Torres SE, Stern-Ginossar N, Brandman O, Whitehead EH, Doudna JA, et al. (2013). CRISPR-mediated modular RNA-guided regulation of transcription in eukaryotes. Cell 154, 442–451. 10.1016/j.cell.2013.06.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Lu Q, and Pigolotti S. (2024). Target search in the CRISPR/Cas9 system: facilitated diffusion with target cues. New J Phys 26, 043007. 10.1088/1367-2630/ad3841. [DOI] [Google Scholar]
  • 41.Cofsky JC (2021). CRISPR-Cas enzymes as DNA sculptors. (UC Berkeley). [Google Scholar]
  • 42.Kleinstiver BP, Prew MS, Tsai SQ, Topkar VV, Nguyen NT, Zheng Z, Gonzales AP, Li Z, Peterson RT, Yeh JR, et al. (2015). Engineered CRISPR-Cas9 nucleases with altered PAM specificities. Nature 523, 481–485. 10.1038/nature14592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Collias D, and Beisel CL (2021). CRISPR technologies and the search for the PAM-free nuclease. Nature communications 12, 555. 10.1038/s41467-020-20633-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Woody MS, Lewis JH, Greenberg MJ, Goldman YE, and Ostap EM (2016). MEMLET: An Easy-to-Use Tool for Data Fitting and Model Comparison Using Maximum-Likelihood Estimation. Biophysical journal 111, 273–282. 10.1016/j.bpj.2016.06.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Langmead B, and Salzberg SL (2012). Fast gapped-read alignment with Bowtie 2. Nature methods 9, 357–359. 10.1038/nmeth.1923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Ramirez F, Ryan DP, Gruning B, Bhardwaj V, Kilpert F, Richter AS, Heyne S, Dundar F, and Manke T. (2016). deepTools2: a next generation web server for deep-sequencing data analysis. Nucleic acids research 44, W160–165. 10.1093/nar/gkw257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Robinson JT, Thorvaldsdottir H, Winckler W, Guttman M, Lander ES, Getz G, and Mesirov JP (2011). Integrative genomics viewer. Nature biotechnology 29, 24–26. 10.1038/nbt.1754. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Zhang Y, Liu T, Meyer CA, Eeckhoute J, Johnson DS, Bernstein BE, Nusbaum C, Myers RM, Brown M, Li W, and Liu XS (2008). Model-based analysis of ChIP-Seq (MACS). Genome biology 9, R137. 10.1186/gb-2008-9-9-r137. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Qunhua L, James BB, Haiyan H, and Peter JB (2011). Measuring reproducibility of high-throughput experiments. The Annals of Applied Statistics 5, 1752–1779. 10.1214/11-AOAS466. [DOI] [Google Scholar]
  • 50.Quinlan AR, and Hall IM (2010). BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842. 10.1093/bioinformatics/btq033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Cavaluzzi MJ, and Borer PN (2004). Revised UV extinction coefficients for nucleoside-5’-monophosphates and unpaired DNA and RNA. Nucleic acids research 32, e13. 10.1093/nar/gnh015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Ma E, Chen K, Shi H, Stahl EC, Adler B, Trinidad M, Liu J, Zhou K, Ye J, and Doudna Jennifer A. (2022). Improved genome editing by an engineered CRISPR-Cas12a. Nucleic acids research, gkac1192. 10.1093/nar/gkac1192. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Rose JC, Stephany JJ, Valente WJ, Trevillian BM, Dang HV, Bielas JH, Maly DJ, and Fowler DM (2017). Rapidly inducible Cas9 and DSB-ddPCR to probe editing kinetics. Nature methods 14, 891–896. 10.1038/nmeth.4368. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Tsuchida CA, Brandes N, Bueno R, Trinidad M, Mazumder T, Yu B, Hwang B, Chang C, Liu J, Sun Y, et al. (2023). Mitigation of chromosome loss in clinical CRISPR-Cas9-engineered T cells. Cell 186, 4567–4582 e4520. 10.1016/j.cell.2023.08.041. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Amemiya HM, Kundaje A, and Boyle AP (2019). The ENCODE Blacklist: Identification of Problematic Regions of the Genome. Scientific reports 9, 9354. 10.1038/s41598-019-45839-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Cofsky JC., Karandur D., Huang CJ., Witte IP., Kuriyan J., and Doudna JA. (2020). CRISPR-Cas12a exploits R-loop asymmetry to form double-strand breaks. eLife 9, e55143. 10.7554/eLife.55143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Ivanov IE, Lebel P, Oberstrass FC, Starr CH, Parente AC, Ierokomos A, and Bryant Z. (2018). Multimodal Measurements of Single-Molecule Dynamics Using FluoRBT. Biophysical journal 114, 278–282. 10.1016/j.bpj.2017.11.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Gore J, Bryant Z, Stone MD, Nollmann M, Cozzarelli NR, and Bustamante C. (2006). Mechanochemical analysis of DNA gyrase using rotor bead tracking. Nature 439, 100–104. 10.1038/nature04319. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
2

Data Availability Statement

  • NGS sequencing data have been deposited in the National Institutes of Health NCBI SRA under the BioProject ID: PRJNA1239632 and are publicly available as of date of publication. Single-molecule data have been deposited in the Stanford Digital Repository (https://doi.org/10.25740/yb505yt4817) as MATLAB .fig files and are publicly available as of the date of publication. All other source data — including biochemical analysis data points, uncropped gel images and processed single-molecule data used to generate figures — have been deposited at Mendeley (https://doi.org/10.17632/hzr4fnvc88.1) and are publicly available as of the date of publication.

  • Customized Python code for NGS sequencing analysis has been deposited online and made publicly available on Github (https://github.com/Doudna-lab/Cas9_Target_Search_ChIPseq). Customized MATLAB code for single-molecule data analysis has been deposited in the Stanford Digital Repository (https://doi.org/10.25740/yb505yt4817).

  • Any additional information required to reanalyze the data reported in this paper is available from the lead contact upon request.

RESOURCES