Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2023 Apr 12;299(5):104700. doi: 10.1016/j.jbc.2023.104700

Binding to the conserved and stably folded guide RNA pseudoknot induces Cas12a conformational changes during ribonucleoprotein assembly

Sruthi Sudhakar 1,, Christopher L Barkau 2,, Ramadevi Chilamkurthy 2, Halle M Barber 3, Adrian A Pater 4, Sean D Moran 4, Masad J Damha 3, PI Pradeepkumar 1,, Keith T Gagnon 2,4,
PMCID: PMC10200996  PMID: 37059184

Abstract

Ribonucleoproteins (RNPs) comprise one or more RNA and protein molecules that interact to form a stable complex, which commonly involves conformational changes in the more flexible RNA components. Here, we propose that Cas12a RNP assembly with its cognate CRISPR RNA (crRNA) guide instead proceeds primarily through Cas12a conformational changes during binding to more stable, prefolded crRNA 5′ pseudoknot handles. Phylogenetic reconstructions and sequence and structure alignments revealed that the Cas12a proteins are divergent in sequence and structure while the crRNA 5′ repeat region, which folds into a pseudoknot and anchors binding to Cas12a, is highly conserved. Molecular dynamics simulations of three Cas12a proteins and their cognate guides revealed substantial flexibility for unbound apo-Cas12a. In contrast, crRNA 5′ pseudoknots were predicted to be stable and independently folded. Limited trypsin hydrolysis, differential scanning fluorimetry, thermal denaturation, and CD analyses supported conformational changes of Cas12a during RNP assembly and an independently folded crRNA 5′ pseudoknot. This RNP assembly mechanism may be rationalized by evolutionary pressure to conserve CRISPR loci repeat sequence, and therefore guide RNA structure, to maintain function across all phases of the CRISPR defense mechanism.

Keywords: CRISPR-cas12a, evolution, induced fit, molecular dynamics, ribonucleoprotein assembly


Conformational changes are the basis of protein and RNA folding from long biopolymers into biologically active molecules (1). The affinity, specificity, and stability of molecular interactions often scales with buried surface area (or number of molecular contacts) and the degree of surface complementarity (2). For RNA-guided enzymes, coevolution of protein and cognate RNA usually results in burial of significant portions of the RNA in the protein (3, 4, 5, 6, 7). Thus, conformational accommodations, primarily for the guide RNA, are an important mechanism for molding and shaping interactions. Archaeal box C/D ribonucleoprotein (RNP) complexes, for example, have been characterized to undergo substantial remodeling and induced fit during RNP assembly (4, 8, 9, 10, 11). Argonaute enzymes undergo minimal, although potentially important, conformational changes while the guide RNA is stretched and splayed through the middle of the protein (71213).

CRISPR were first observed in bacteria in 1987 (14) and 1989 (15) as conserved repeat sequences with regularly spaced interrupting sequences. However, the significance of CRISPR loci remained unclear. In 2002, when these repeats were observed in other prokaryotes and CRISPR associated (Cas) protein-coding genes were found nearby, a unifying name of CRISPR was coined for these loci (16). It was later found that CRISPR repeats contained sequences that correspond to bacteriophage, suggesting an origin for the repeats (17, 18, 19). In 2006, it was suggested that these repeats act as an adaptive immune system to protect against viral invaders (20). This hypothesis was subsequently shown to be correct (21, 22). The study of CRISPR function and evolution quickly followed, leading to characterization of diverse RNA-guided CRISPR-Cas systems (23).

CRISPR systems act in three phases: adaptation, expression, and interference. During adaptation and expression, foreign DNA sequences are incorporated into CRISPR loci and then expressed as RNA for assembly into CRISPR-Cas RNP effector complexes (24, 25). Cas12a enzymes originate from type V CRISPR systems comprising the genes cas12a, cas4, cas1, cas2, and the repeat array (25). The conserved Cas1/Cas2 complex functions in spacer acquisition (26, 27). The adaptation complex comprises dimers of Cas2, which bridge pairs of Cas1 dimers, and binds potential protospacers to coordinate their incorporation into the CRISPR array. Cas1 is thought to have originated from transposon-like elements (termed casposons) which encode Cas1 homologs. Diverse Cas1 homologs exhibit low sequence conservation but retain a similar structure across the Cas1 phylogeny (28). Recently it has been shown that the Cas1/Cas2 complex in the type I-B system of Haloarcula hispanica binds the upstream-most repeat and leader at motifs of the CRISPR locus in a sequence and spacing-specific manner to facilitate new spacer integration (29). Cas12a has been shown to process its own CRISPR RNA (crRNA), cleaving upstream of the repeat sequence, which is a stem–loop structure referred to as the 5′ handle. These steps all rely on recognition of the CRISPR array repeat sequence, either at the DNA or RNA level, which serves as a common thread across adaptation, expression, and interference phases of the CRISPR defense mechanism. This offers a rationale for strong conservation of repeat sequences in CRISPR array loci. In contrast, other components, like the Cas12a protein, may be permitted to undergo greater divergence so long as they still recognize CRISPR arrays or crRNA and execute their respective phases.

During the interference phase, RNA-guided CRISPR-Cas effector enzymes find and pair to complementary nucleic acids, most often DNA, and catalyze phosphodiester bond cleavage to induce degradation (25, 30, 31). CRISPR effector enzymes are composed of one or two RNA molecules and an endonucleolytic Cas protein (16, 32, 33). During interference, CRISPR–Cas complexes will first identify candidate targets by transient Cas protein binding to a protospacer adjacent motif (PAM) DNA sequence (31). The crRNA guide sequence will then pair to the complementary protospacer sequence adjacent to the PAM (34). The class 2, type V CRISPR systems use a single effector enzyme, Cas12a, and a single 42 nucleotide long crRNA guide (35). Cas12a appears to have low off-target effects in gene editing applications (36) and possesses a T-rich PAM sequence requirement (35). Interestingly, CRISPR–Cas12a complexes exhibit a nonsequence-specific, multiturnover single-stranded DNase activity, called trans activity, after sequence-specific cleavage of target DNA has occurred (37, 38). For this reason, Cas12a has been used to develop diagnostic assays, such as for COVID-19 detection (38). In addition, CRISPR–Cas12a is being adapted for human therapeutics (39, 40, 41).

Conformational changes that occur during CRISPR-Cas RNP assembly and their effects on enzyme activity have been investigated previously. The Doudna laboratory observed global structural changes during RNA binding to Cas9 (42, 43) that coincide with catalytic competence and have been supported by cryo-EM studies (44). Other studies have investigated RNP assembly and conformational dynamics for Cas9 (42, 45, 46). Several Cas enzyme crystal structures have been solved (5, 6, 47, 48, 49, 50, 51). Recent cryo-EM studies of Cas9 during precatalytic or target binding steps have revealed some new conformational states, yet these involve substrate DNA, which is difficult to uncouple from guide RNA-induced changes (44, 52). These structures provide important insight into CRISPR-Cas9 structure-function, however the full picture of RNP assembly dynamics may still be incomplete. Cas12a dynamics during RNP assembly with its cognate guide have been less explored, owing in part to the historical lack of available high-resolution structures of Cas12a in the absence of a guide RNA (53, 54). Cas12a dynamics studies have identified large-scale “open” and “closed” conformational states upon crRNA binding that appear to depend on the originating species of the CRISPR-Cas12a system (51, 53, 55). These studies focused on one Cas12a and large-scale global structural changes with target DNA substrate binding (55, 56).

In this study, we sought to investigate Cas12a RNP assembly through protein and RNA sequence and structural alignments, phylogenetics, molecular dynamics (MD) simulations, and spectroscopic and biochemical experiments. We find that Cas12a sequence diverged during evolution, although protein structure was relatively well-conserved in the RNP state. In contrast, the sequence and structure of the crRNA 5′ handle remained highly conserved. MD simulations of Acidaminococcus sp. strain BV3L6 (As) Cas12a, as well as Cas12a from Lachnospiraceae bacterium ND2006 (Lb) and Francisella tularensis subsp. novicida (Fn), identified substantial protein structural dynamics that were reduced upon RNA binding, especially in the Wedge domain involved in RNA and DNA binding. In contrast, the crRNA 5′ handle structure showed minimal variation between free and bound states. In-solution limited proteolysis followed by MS identified AsCas12a lysine residues with reduced accessibility after RNP assembly. Differential scanning fluorimetry showed improved thermal stability and folding cooperativity of AsCas12a as an RNP. For the crRNA 5′ handle, thermal denaturation and CD analyses suggested stable independent folding in the free state. Interdomain angle calculations from MD simulations predict little or no sampling of the RNP conformational state in apo-Cas12a, suggesting an induced fit over conformational selection model. Together, these results indicate that Cas12a protein undergoes substantial conformational rearrangements induced by binding to the largely prefolded crRNA 5′ pseudoknot structure. We propose that this assembly mechanism may be common for CRISPR-Cas systems due to the intrinsic need to conserve CRISPR loci repeat sequences and therefore crRNA structures.

Results

To understand RNP assembly across CRISPR-Cas12a systems, we began by performing a maximum likelihood phylogenetic analysis on an alignment of 147 publicly available Cas12a protein sequences. In our resulting consensus tree (Fig. 1A), most positions in the alignment were poorly conserved (∼39% on average at positions homologous to the AsCas12a homolog), leading to low fine-grain phylogenetic resolution. Three of the Cas12a proteins in the alignment have been structurally characterized previously, AsCas12a (6), FnCas12a (57), and LbCas12a (51). The structures of these homologs are shown in Figure 1B with their placement on the phylogenetic tree indicated. The resulting topology strongly suggests that among these three Cas12a homologs, FnCas12a and LbCas12a are more similar to each other than to AsCas12a. Overall, their sequence conservation is moderate based on alignment and BLAST (Fig. S1), with sequence identity/similarity being 40%/59% for LbCas12a to FnCas12a, 34%/50% for LbCas12a to AsCas12a, and 35%/51% for AsCas12a to FnCas12a (35, 51). Their structure, by contrast, is well-conserved, with pairwise RMSD values calculated at 2.56 Å for LbCas12a to FnCas12a, 2.20 Å for LbCas12a to AsCas12a, and 2.78 Å for AsCas12a to FnCas12a when assembled as RNP complexes (FnCas12a) or ternary complexes with guide RNA and target DNA (LbCas12a and AsCas12a) (Fig. 1C). These results suggest that the structure-function of Cas12a RNPs is conserved despite substantial protein sequence divergence.

Figure 1.

Figure 1

Sequence and structural relationships among Cas12a homologs.A, consensus maximum likelihood tree of available Cas12a protein sequences. Structurally characterized homologs are labeled in red. B, individual crystal structures of Lb, Fn, and AsCas12a corresponding to their location on the tree shown in panel A. Structural alignment of As (PDB ID: 5B43), Fn (PDB ID: 5NG6), and Lb (PDB ID: 5XUS) Cas12a proteins (C) and crRNAs (D). As, Acidaminococcus sp; Fn, Francisella tularensis subsp. Novicida; Lb, Lachnospiraceae bacterium.

RMSD values calculated between the crRNAs of each homolog revealed striking structural similarity (Fig. 1D), with the distances of AsCas12a to LbCas12a and FnCas12a crRNAs being 0.12 Å and 0.17 Å, respectively. The distance between LbCas12a and FnCas12a crRNAs was 0.21 Å. We also aligned repeat sequences from 28 type V CRISPR systems. In contrast to the Cas12a proteins, these sequences were highly conserved in the pseudoknot folding repeat region (35) (Fig. 2, A and B). The relative distance relationship observed among LbCas12a, FnCas12a, and AsCas12a proteins (Fig. 1A) was recapitulated in the topology of the nearest-neighbor tree of their cognate repeat crRNA sequences constructed by LocARNA (58). Secondary structure analysis of this alignment using LocARNA, which does not predict pseudoknots, revealed a conserved hairpin structure (35) (Fig. 2B) consistent with the core of the 5′ pseudoknot observed in solved Cas12a structures. The predicted structure is shown with the consensus sequence of the aligned repeats (Fig. 2C). These results suggest that the sequence and structure of the crRNA 5′ handle has remained remarkably stable.

Figure 2.

Figure 2

Conservation of Cas12a-associated repeat sequence and folding.A, nearest neighbor joining tree and (B) alignment of 28 type V CRISPR repeats. The LocARNA-predicted secondary structure (right) is indicated at the top of the sequence alignment. The 5′ leader sequence is highlighted in orange and the stem and loop of the predicted stem–loop structure is highlighted in green and blue, respectively. C, repeat consensus sequence shown with the secondary structure predicted for the alignment in (A) with the same color coding. D, superimposed image of the representative structures of the major cluster in the free pseudoknot (magenta) and AsCas12a-bound pseudoknot (green) in the AsCas12a RNP complex from 1 μs molecular dynamics simulation trajectories. RNP, ribonucleoprotein.

To gain insights into the structural dynamics of Cas12a-crRNA RNP assembly, MD studies were conducted. Three different systems were initially prepared–AsCas12a protein alone (apo-AsCas12a), the AsCas12a RNA pseudoknot alone (free pseudoknot), and AsCas12a bound to its cognate crRNA (AsCas12a RNP)–and then subjected to 1 μs of an unrestrained production run in the graphics processor unit accelerated version of Particle Mesh Ewald Molecular Dynamics (59, 60, 61) in AMBER 18 (https://ambermd.org/) (62).

The analysis of trajectories was carried out in two stages. The pseudoknot structural stability and flexibility were explored in the first stage and the protein structural stability and flexibility were examined in the second stage. The RMSD of the pseudoknot backbone atoms were calculated using the first simulation frame as reference (Fig. S2A). The RMSD plot indicates minimal deviations between the free pseudoknot alone and the pseudoknot in the RNP. This suggests that the simulations are equilibrated and the pseudoknot is stable throughout the 1 μs production run. The RMSD value is slightly higher in the case of free pseudoknot, mainly due to the fraying of the terminal nucleotides and the absence of interactions with the protein. This is further demonstrated by the RMS fluctuations (RMSF) values (Fig. S2B). The RMSF plot shows that the residues exhibit similar fluctuations, with nearly identical values in the core hairpin of the structure, indicating that the pseudoknot behaves similarly in the free and bound forms.

The end-to-end distances in the pseudoknot can be used to report structural rigidity and stability during the simulation. The terminal nucleotides were not used for the measurement to avoid possible fluctuations due to the terminal fraying of base pairs. When the end-to-end distances were calculated, the values showed negligible deviations in the free and bound forms (Fig. S2C). Thus, the rigidity of the pseudoknot is maintained in both free and bound forms. The 1 μs simulation trajectories were clustered into ten ensembles. The representative structures of the major cluster of the free and bound pseudoknot were then superimposed to visualize structural fluctuations (Fig. 2D). The structures superimposed with an RMSD value less than 2.5 Å, confirming that the pseudoknot does not undergo any significant structural changes when bound to the Cas12a protein.

In the second stage of analysis, the RMSD of the backbone atoms of AsCas12a protein were measured with respect to the initial frame of the simulation (Fig. S3A). The equilibration of the simulations was observed from the RMSD plots, which showed marginal differences in the protein structure alone without the crRNA and in the RNP complex. The RMSF plots indicate that several regions, which include residues ranging from 150 to 300 and 750 to 85 show significant deviations between the protein alone versus in the RNP complex (Fig. S3B). These regions include parts of the PI, wedge, and REC domains. A previous study reported increased occupancy of a “closed” high fluorescence energy transfer (FRET) state for FnCas12a upon crRNA binding, as indicated by REC and NUC domain distance measurements via single molecule FRET (55). The RMSF values for the protein in the RNP complex were less than the protein alone without the pseudoknot, indicating that the protein becomes more ordered in the presence of the crRNA. The distance between the Cα atoms of amino acids in close vicinity of the pseudoknot were calculated to identify any rearrangements in the protein that might help accommodate the crRNA pseudoknot. We selected K15-L807, R863-L1022, and H977-D966 distances as representative since they are all in the wedge and RuvC domains and make close contact with either the loop or the terminals of the pseudoknot (Fig. 3A). The distances varied between the systems, and the protein in RNP showed smaller distance fluctuations than the protein alone, especially for K15-L807 and R863-L1022, which suggests conformational sampling by AsCas12a protein to accommodate the crRNA pseudoknot during RNP assembly (Fig. 3, BD).

Figure 3.

Figure 3

Molecular dynamics simulations indicate that AsCas12a is structurally less dynamic and less solvent accessible when bound to its cognate crRNA.A, representative image illustrating the amino acids considered for distance measurements during molecular dynamic simulations. The distance between the Cα atoms of (B) K15 and L807, (C) R863 and L1022, and (D) H977 and D966 from 1 μs trajectories. The running averages of the distances are represented in the plot.

The 2D-RMSD values were calculated for each protein domain to detect the regions of high protein mobility. The values clearly showed that the REC domain and WED-III domain showed significant differences between the two systems. The 2D-RMSD values of the wedge domain in the protein without pseudoknot RNA displayed values as high as 10 Å, while the values remained <5 Å in the case of the RNP (Fig. S4, A and B). This confirms our previous observation from the RMSF values that the protein becomes more stabilized in the presence of the crRNA. The representative structures of the major clusters (Fig. 4, A and B) were visually inspected to validate the variations seen in the 2D-RMSD plots. The WED-III domain and REC domain rearranged to incorporate the RNA into the RNP complex. The pseudoknot rearranges the WED-III domain while the single-stranded guide region rearranges the REC domain in the RNP complex for enhanced binding.

Figure 4.

Figure 4

Molecular dynamics simulations predict distinct dynamics and domain motion for AsCas12a protein versus AsCas12a RNP. Representative snapshots of the major clusters of (A) AsCas12a protein system and (B) AsCas12a RNP complex from the 1 μs trajectories. The protein is shown in surface representation. The PI domain is represented in beige color, the wedge domain in yellow, RuvC domain in cyan, REC domain in gray, and the Nuc domain in magenta. The pseudoknot is represented as cartoon in red. Representative images of the principal motion of protein in (C) AsCas12a protein alone and (D) AsCas12a RNP complex. Domain colors are the same as in panels A and B except the bridge helix (BH) is shown in green. The arrows are marked according to the pseudo trajectory and do not indicate the amplitude of motion. The 3D perspective of the domain movements might not be clearly visible in the given representation. Density plots of the interdomain angles in (E) apo-AsCas12a protein alone or (F) AsCas12a RNP. The density plot represents the sampling of the angle and the maximum population is represented with the highest density. RNP, ribonucleoprotein.

To identify the major mode of domain motion (maximum variance), principal component (PC) analysis was carried out (Fig. 4, C and D). The translational motion was initially removed by fitting the trajectory to an average structure. The projection diagrams and pseudo trajectories from PC analysis were analyzed to obtain a pattern of the major movements resulting in the conformational preferences discussed in the cluster analysis. In the protein only system, the REC2 and the WED-III directions allow them to occupy some of the space where the RNA is usually bound. Compared to the RNP complex, the REC2 and WED-III domains show movement directed toward creating space to accommodate the crRNA pseudoknot and guide spacer sequence. The PI and REC1 domains in the RNP complex exhibit a circular motion with one part slightly moving away from the cleft, while the smaller helix moves toward the cleft, which would subsequently bind the DNA target for catalysis (56). This motion is not exactly the same in the absence of crRNA. The amplitude of motion shows minimal variations in domains other than the PI in the first PC. The histogram of the first and second PC modes of the RNP and protein alone (Fig. S5A) shows no convergence suggesting the variation in conformational preferences. Though the deviation is not high, the regions spanned by the PC mode 1 and mode 2 of the apo protein and RNP complex are different (Fig. S5B).

Solvent accessible surface area (SASA) of the protein was calculated from the trajectories of protein alone and the RNP complex simulations (Fig. S6). The ΔSASA was found to be ∼4000 Å2, with the RNP complex having a less value, suggesting the protein becomes more ordered and compact upon RNP assembly. The rearrangements in the protein to accommodate RNA were explored further by comparing the surface electrostatic potential images of the protein (Fig. S7). The positively charged regions are rearranged in the RNP complex, likely to ensure better interaction with the negatively charged backbone of the RNA.

To rule out potential bias by using a manually generated model of apo-AsCas12a from a DNA-bound ternary structure, MD simulations were also carried out on apo form deposited in the AlphaFold database. Though the RMSF values were slightly lower than the manually generated structure, the overall trend in the structural properties (RMSD, RMSF, distances, and SASA) remained the same in both models (Figs. S8–S11).

MD simulations on FnCas12a (PDB ID: 5NG6) and LbCas12a (PDB ID: 5ID6) RNP, apoprotein, and pseudoknot were also carried out to determine whether our observations for AsCas12a might apply more broadly to other CRISPR–Cas12a complexes. For both FnCas12a and LbCas12a, the RNA pseudoknot was stable throughout the simulations with similar values of RMSF (Fig. S12) and end-to-end distances (Fig. S13) in both the free and bound forms. Due to terminal and loop nucleotide fraying, the RMSD is slightly higher for the free forms in both cases (Fig. S14). The major clusters of the free and bound form of pseudoknot superimposed with an RMSD value <2.7 Å and <2.6 Å in FnCas12a (Fig. S15A) and LbCas12a (Fig. S15B) complexes, respectively. Similar to AsCas12a, the pseudoknot structure also remained stable in these complexes.

The protein alone was more dynamic than the RNP complex for both FnCas12a and LbCas12a, with a higher value for the latter. In the FnCas12a system, the RMSF plots revealed more deviations in the wedge, PI, and RuvC domains, similar to what was observed in AsCas12a (Fig. S16A). The domain designations are different in LbCas12a but greater deviations in the RMSF values were found in the oligonucleotide binding domain (OBD) and RuvC domain (Fig. S16B). These results show that the domains responsible for interacting with RNA exhibit variation in dynamics during RNP assembly. The distance analysis between the Cα atoms of the amino acids in the immediate vicinity of the RNA pseudoknot revealed that the Cas12a protein had undergone some rearrangement to accommodate the pseudoknot in both FnCas12a and LbCas12a complexes (Fig. S17). While the values are constant in the RNP complexes, variations in the apo form were observed that indicate a compaction of the protein upon RNA binding.

The superimposition of major clusters of Fn and Lb Cas12a in the apo and RNP complexes revealed changes in the domain indicated by the RMSF values. While the WED-III domain loop showed maximum variation in FnCas12a (Fig. S18A), the OBD, UK, and RuvC domains showed high fluctuation in LbCas12a (Fig. S18B). Irrespective of the Cas12a type, the rearrangement is nearly identical around the RNA pseudoknot. Further, the electrostatic surface potential images also confirmed the rearrangement of positively charged residues to better accommodate the negatively charged backbone of the RNA pseudoknot (Fig. S19). In FnCas12a and LbCas12a, the RNP assembly results in about a 5% reduction in the overall SASA (Fig. S20). Overall, MD studies of FnCas12a and LbCas12a led to results quite similar to those of AsCas12a.

To assess whether conformational changes associated with the Cas12a RNP assembly are a result of induced fit or conformational selection, two related but distinct binding modes, interdomain angles from AsCas12a simulations were sampled (63). The RNA pseudoknot and single-stranded portion of the guide RNA are near the Wedge, REC, RuvC, and NUC domains. Therefore, the orientation of these domains might differ in the apo form and RNP complex. The interdomain angles between WED-II: WED-III: RuvC (Fig. S21A) and REC2: RuvC: NUC (Fig. S21B) were calculated (RuvC-I and RuvC-II were considered together). The density plots with these angles as coordinates revealed that the sampling differs in the apo and RNP forms (Fig. 4, E and F). The apo form exhibits an open structure with a large interdomain angle, while the RNP exhibits a closed structure with a slight interdomain angle. The angle associated with the closed form is only seen in the RNP and not present in the sampling of the apo-protein. The cluster distribution of the apo and RNP also confirmed the same. The major clusters of the apo and RNP forms show high RMSD upon superimposing with each other and no representative structures from the clustering are similar in both of these forms (Fig. 4, A and B). Thus, the closed form is associated only with the RNP and the apo form does not exhibit the flexibility to adopt open/closed conformations. These results suggest that Cas12a RNP assembly proceeds through induced fit rather than conformational selection since the necessary RNP conformation is essentially never sampled in the apo form (63).

To experimentally investigate the structural flexibility of AsCas12a, we subjected the free protein or assembled RNP to limited trypsin proteolysis. The presence of crRNA appeared to reduce cleavage efficiencies at three sites (Fig. 5A). Mass spectrometric analysis of these bands revealed that they corresponded to protection from trypsin proteolysis at K739, K570, and K576 (Fig. S22). Of these, K570 and K576 are found in the OBD-II, which is involved in crRNA and target DNA binding (51) (Fig. 5B). To observe global effects on AsCas12a conformation and stability during RNP assembly, we performed differential scanning fluorimetry (DSF) on AsCas12a with and without crRNA. This method uses a dye specific for protein rather than UV absorbance at 280 nm, eliminating potentially confounding UV signal overlap from the RNA. AsCas12a alone exhibited a biphasic melt profile, suggesting at least two structures that largely unfolded independently (64). Their melting temperature (Tm) values were calculated at 43.10 °C ± 0.01 deg. C and 48.62 °C ± 0.02 deg. C. When crRNA was present, AsCas12a unfolded cooperatively as an apparent single structure with a Tm of 46.15 °C ± 0.04 deg. C (Fig. 5C). To assess the unfolding model that best fits the data, we performed fits to a two-state folded–unfolded model and to a three-state sequential folded-intermediate-unfolded model (Fig. S23). While this model fitting is not sufficient to provide a detailed understanding of the protein folding landscape, unfolding via a structural intermediate in the three-state model fit the data best. This suggests that significant structural or stability changes to Cas12a occurred upon RNP assembly.

Figure 5.

Figure 5

Cas12a undergoes conformational changes upon RNP assembly, whereas the crRNA pseudoknot is independently folded.A, limited trypsin proteolysis of AsCas12a with and without its cognate crRNA bound. Digestion products corresponding to differential trypsin accessibility are indicated to the right. B, amino acids corresponding to differentially accessible sites from (A), determined by MS, are highlighted in red spheres on the structure of AsCas12a. C, differential scanning fluorimetry of AsCas12a alone or bound to crRNA. D, sequence and predicted 2D structure of RNA and DNA used for thermal denaturation and CD analyses. E, UV-monitored thermal denaturation analysis and (F) CD spectra of the AsCas12a 5′ handle pseudoknot unbound to AsCas12a. DNA and RNA hairpins are comparative controls. Error is shown as SEM of three experimental replicates. RNP, ribonucleoprotein.

To determine the stability and folding of the crRNA pseudoknot in solution, we performed UV-monitored thermal denaturation and CD analyses. The RNA pseudoknot sequence is composed of the 19 nucleotides of the AsCas12a crRNA 5′ end. Control DNA and RNA hairpin sequences were created by replacing the three 5′ terminal nucleotides with cytidines to prevent pseudoknot formation (Fig. 5D). Thermal denaturation revealed that the pseudoknot unfolded similarly to the control, model hairpin RNA sequence, with similar Tm values of 61.8 °C ± 0.3 deg. C and 60.7 °C ± 0.3 deg. C, respectively (Fig. 5E). The slight increase in Tm for the pseudoknot may be attributable to additional base-pairing interactions in the structure (65). As expected, the DNA hairpin appeared to possess a less stable structure (65), with a Tm of 48.3 °C ± 0.6 deg. C. When we performed CD analyses, we observed nearly identical spectra for the pseudoknot RNA and hairpin RNA, characteristic of A-form structure (66), with a small shift in the 250 to 300 nm range (Fig. 5F). The DNA hairpin generated a distinct spectra indicative of B-form structure (66). These results point to a well-formed crRNA pseudoknot in solution independent of protein binding.

Discussion

Our results indicate that Cas12a proteins undergo conformational rearrangements to accommodate crRNA binding during RNP assembly. In contrast, the crRNA 5′ pseudoknot, the primary “handle” that Cas12a binds, appears to undergo very little structural alterations. The highly conserved nature of the crRNA repeat sequence, corresponding to the 5′ pseudoknot handle, and its interconnected role across adaptation, expression, and interference phases of CRISPR mechanism, implies that alterations to sequence or structure would likely be deleterious to the system. Cas12a proteins, on the other hand, appear to evolve much more rapidly, sustaining far more sequence changes, and to a lesser extent structural alterations, through evolution. In-solution studies demonstrated conformational changes of AsCas12a upon crRNA binding while the crRNA pseudoknot handle itself appeared quite stable alone, in agreement with MD simulations.

RNA-guided enzymes are a unique class of enzymes that share an intimate relationship with their cognate RNA partners. The protein component is typically inactive when not associated with its RNA guide (67). The guide RNA specifies the substrate through Watson–Crick hybridization and correctly positions it into the protein's active site (7, 68). The guide sequence can vary, allowing a single type of protein to catalyze the same reaction on countless targets of varying sequence. Since RNA-guided enzymes can be programmed to target different sequences, understanding the rules of RNP assembly is important for understanding how these systems evolved and how to better leverage them for biotechnology and biomedical applications.

For engineering Cas12a enzymes, it is important to not introduce modifications that can significantly perturb the folding or stability of the pseudoknot structure. A recent study investigated a diverse class of chemical modifications in the pseudoknot and indeed concluded that smaller modifications known to maintain RNA structure, such as 2′-fluororibose, were the most compatible (39). Likewise, targeted mutagenesis of Cas12a protein should avoid altering the dynamic flexibility or folding of the enzyme, such as increasing rigidity, in regions that must mediate guide RNA contact or else risk weaker binding affinity and poor RNP assembly or stability. Interestingly, unbiased Cas12a mutagenesis screens aimed at improved activity found that favorable mutations lie outside of the wedge domain, which was the most dynamic during our crRNA binding simulations (41, 69).

These findings for Cas12a may also extend to other CRISPR-Cas systems. The need to conserve CRISPR array repeat sequences across all phases of the CRISPR mechanism is a common feature for CRISPR-Cas systems. For Cas9 systems, the structure of Cas9 from Streptococcus pyogenes has a large number of unresolved residues in the absence of a bound guide RNA (43, 70). Upon comparison to Cas9 RNP or ternary structures, these regions become structured and appear to be heavily involved in RNA binding (5, 43, 48, 70). In addition, the binding of crRNA and trans-acting crRNA guides, as well as single-guide RNAs, to Cas9 have been proposed to modulate the conformational dynamics of Cas9 (4245, 46).

While our study cannot definitively distinguish between induced fit and conformational selection binding modes during RNP assembly, results are better explained by an induced fit model. In induced fit mechanisms, the ligand binds an “inactive” state of the receptor, usually “loosely” or at low affinity, but then small, sequential conformational changes can occur where the protein creates more ligand contacts. For assembly of RNP complexes, this usually involves induced fit of the RNA rather than the protein or else a mutual induced fit of both protein and RNA (9, 11, 71, 72). Induced fit can be considered a special case of conformational selection but the two are often viewed as competing models (73, 74). Conformational selection posits that the receptor or enzyme occupies all possible states at different likelihoods and toggles between sampling “active” and “inactive” states. Thus, the kinetics of sampling can become important. Only when the protein transiently samples the active state can the ligand bind, which shifts the population of enzyme toward an active form. This creates a “selection” for the active conformation (74).

Some criteria for distinguishing induced fit and conformational selection can be applied (73). First, the range of conformational states can be informative (74, 75). For AsCas12a, the residues and domains that interact with the 5′ handle appear to occupy a broad range of dynamic conformations. This would suggest the ability to accommodate many incremental changes that could tune its structure to fit the crRNA ligand (74, 76). Indeed, the protein appears to wrap around and protect the RNA after complex formation, which generates very low nanomolar binding affinity (77) (Fig. 4). Importantly, however, interdomain angle sampling from MD simulations revealed that the angles, and therefore conformational states, primarily occupied by the RNP form of AsCas12a are almost never sampled in simulations of the apo-AsCas12a. These results strongly argue against conformational selection as a likely mechanism of RNP assembly (63). Previous simulations of U1A protein binding to the hairpin II of U1 snRNA argued for induced fit based on the absence of the bound conformer when not assembled into the RNP. Only when the RNP complex was formed was the bound conformer observed and stable (75), similar to our interdomain angle analyses (Fig. 4, E and F).

Second, the timescale of conformational transitions, and therefore kinetics, is important (73, 76). If conformational transitions from an unbound/inactive to bound/active structure are infrequent and slow for Cas12a, then conformational selection could be argued. However, AsCas12a transitions in and out of the free and RNP conformational states are very rapid, on a nanosecond timescale, as observed by K15-L807 and H977-D966 residue distance measurements (Fig. 3). Third, although we did not explicitly focus on the role of the variable guide/spacer sequence of crRNA during binding, it is possible that this contributes an additional binding module. The necessity of distinct binding steps to produce high affinity binding of U1A to U1 hairpin II RNA has been used to support induced fit mechanisms (78). The first step was electrostatic-mediated binding, referred to as the “lure,” which could be partially fulfilled by the negatively charged phosphate backbone of the crRNA spacer sequence, followed by a more specific “lock” step, potentially fulfilled by the crRNA 5′ pseudoknot handle. The observation that modified 5′ extensions to the end of the Cas12a crRNA can improve enzyme activity may be partially attributable to improved binding affinity via electrostatic interactions with Cas12a (77, 79, 80). Thus, the entire crRNA may participate in stepwise conformational changes that would be indicative of induced fit binding.

In this study, we were limited by the number of Cas12a and crRNA sequences suitable for our analysis. Nonetheless, they were sufficient to generate phylogenetic trees that could assess relatedness and for interpretation of sequence conservation across species. A greater number of sequences could refine our analyses but are unlikely to change the overarching observation of relatively low protein sequence homology compared to high repeat sequence (5′ pseudoknot) conservation. For structural comparisons, we were also limited to the three available structures of Cas12a. However, these were fortuitously well-distributed across the inferred phylogenetic tree, which provided higher confidence that they were representative. Importantly, despite sequence divergence, the final folded structures of Cas12a proteins, especially around the guide RNA 5′ pseudoknot, were highly similar. The crRNA pseudoknot structures themselves were nearly identical. While this study focused more on AsCas12a, simulations using FnCas12a and LbCas12a were also in agreement with our conclusions. Initial model selection can also be a limitation. Our simulations began with manually generated apo-AsCas12a from the DNA-bound ternary structure using coordinate extraction. Beginning with an apo crystal structure could generate fine variations in the results, though we would not expect these to significantly alter the major conclusions from this study. Indeed, beginning with an alternative de novo predicted apo-AsCas12a structure from the AlphaFold database resulted in remarkably similar outcomes in our MD simulations.

We also limited our in-solution experimental studies to AsCas12a as a model system. It is likely that limited trypsin proteolysis of other Cas12a members would yield alternative cleavage patterns as these will depend on surface accessibility of lysine or arginine residues. However, we would still expect to observe altered peptide cleavage patterns. Finer mapping of AsCas12a may also be achieved with other peptide hydrolysis methods or proteases and would be expected to create a more complete picture of conformational changes during RNP assembly. Ultimately, other methodologies to map the conformational states of Cas12a and their occupancy level when free and bound to guide RNA, especially at very fast time scales during RNP assembly, would help further establish the molecular mode of binding. These could include experimental structural, thermodynamic, molecular distance, and kinetic measurements that could further support or refute induced fit mechanisms.

Experimental procedures

Cas12a sequence alignment, phylogenetic analysis, and structure alignment

Cas12a sequences were downloaded from UniProt (uniprot.org). Sequences less than 800 amino acids and duplicate taxa were removed before alignment with ClustalW (81). This alignment was used for maximum likelihood phylogenetic inference in MEGA11 with 500 bootstrap replicates and the JTT model of protein evolution (82). For simplicity, a consensus tree was produced with nodes having a bootstrap value less than 50 collapsed into polytomies. For comparing Cas12a RNP structures, Cas12a RNP or ternary complexes solved for Lachnospiraceae bacterium ND2006 (Lb) Cas12a (PDB ID: 5XUS) (51), F. tularensis subsp. novicida (Fn) Cas12a (PDB ID: 5NG6) (57), and Acidaminococcus sp. strain BV3L6 (As) Cas12a (PDB ID: 5B43) (6) were aligned in PyMOL (https://pymol.org/2/) (v1.5.0.5).

crRNA repeat alignment and secondary structure prediction

Type V repeat sequences were downloaded from CRISPRdb (https://crispr.i2bc.paris-saclay.fr/). These repeats were aligned and their secondary structure predicted with LocARNA (https://rna.informatik.uni-freiburg.de/LocARNA) (58). A consensus sequence for the alignment was generated with a cut-off of 55%. A visualization of the predicted secondary structure from LocARNA with this consensus sequence was constructed with forna (http://rna.tbi.univie.ac.at/forna/) (83).

MD structure preparation

FnCas12a, and LbCas12a

The crystal structure of the Cas12a complex (PDB ID: 5B43) (6) FnCas12a (PDB ID: 5NG6) (57) and LbCas12a (PDB ID: 5ID6) (51) were used for MD studies. The missing residues in the protein were filled by using the SWISS-MODEL (84) webserver. To generate the Cas12a RNP model, the DNA part of the crystal structure was manually removed. Similarly, the protein only structure was generated by removing the RNA part. The coordinates of the pseudoknot sequence were manually extracted and used for the preparation of the free pseudoknot RNA. The apo structure of AsCas12a from AlphaFold database (https://alphafold.ebi.ac.uk/entry/U2UMQ6) has also been utilized for the simulation. The Amber ff14SB (85) force field for protein, the ff99bsc0+χOL3 (86, 87, 88) force field for RNA, and TIP3P (89) model for water have been employed. The system was solvated using a 10 Å rectangular water box and Na+/Cl- ions were added to neutralize the complexes in the tleap module of AmberTools 19.

MD methodology

The MD protocol used in earlier studies by Palermo and co-workers was used with slight modifications for the protein and RNP systems (56). An initial minimization of 10,000 steps (5000 steps of steepest descent along with 5000 steps of conjugate gradient) with a restraint of 300 kcal/mol·Å2 on the biomolecules was carried out followed by another 10,000 steps (5000 steps of steepest descent along with 5000 steps of conjugate gradient) without any restraints. The heating was done in four stages where systems were heated up from 0 to 50 K and 50 to 100 K by running two simulations of 50 ps each (NVT) imposing harmonic restraints of 100 kcal/mol·Å2 on the protein and RNA. The temperature was then increased to 200 K in 100 ps (NVT) with a smaller restraint of 25 kcal/mol·Å2 and to 300 K in 500 ps without any restraints (NPT). One nanosecond equilibration followed by a test production run of 10 ns was carried out (NPT).

The MD simulation of the pseudoknot was carried out starting with 1000 steps of minimization (500 steps of steepest descent), followed by a 20 ps of equilibration with a restraint force of 25 kcal/mol·Å2 on the pseudoknot, at a temperature of 100 K (NVT). A further 2500 steps of unrestrained minimization with 1000 steps of steepest descent and 100 ps of unrestrained equilibration using a constant pressure periodic boundary of 1 atm was carried out where the temperature was increased from 100 K to 300 K. Finally, production run for 1 μs was performed on the protein, RNP complex, and pseudoknot at 300 K using the GPU accelerated version of Particle Mesh Ewald Molecular Dynamics of AMBER 18. The time step used in MD simulations was 2 fs and the cut off for long range electrostatic interaction 10 Å. The SHAKE algorithm was applied to the bonds containing hydrogens. Langevin dynamics was used for temperature control (90) and Berendsen barostat (91) was used for pressure control for all the simulations.

MD simulation analysis

All the analyses were carried out using the CPPTRAJ (92) module of the AmberTools 19 (https://ambermd.org/AmberTools.php), VMD (https://www.ks.uiuc.edu/Research/vmd/) (93), and PyMOL Molecular Graphics System (https://pymol.org/2/), version 2.0 (Schrodinger, LLC). The RMSD of the protein backbone and the RNA backbone were calculated using the first frame of the 1 μs simulation as the reference. All frames were considered for the calculation. RMSF analysis and the distance between the cα atoms of the amino acids were calculated using the CPPTRAJ module considering every frame of the simulation. The 2DRMSD studies were carried out using every 1000th frame of simulation. The command 2drms in CPPTRAJ was used for the calculation and only the C, N, O, CA, and CB backbone atoms were considered. The information saved was further plotted using gnuplot (http://www.gnuplot.info/). The surf command in CPPTRAJ was used to calculate the SASA, which uses the LCPO algorithm (94). The surface electrostatic potential calculation was carried out using the ABPS electrostatic plugin in PyMOL. This calculation was carried out using the major cluster snapshots of the systems. The trajectories were visualized in VMD and the images were rendered using PyMOL.

PC analysis

PC analysis was carried out using the whole 1 μs simulation of the MD trajectories. Only the Cα atoms were considered for the analysis. Initially, the translational and rotational motion of the protein is removed by fitting the trajectory to the average MD structure. Further coordinate covariance matrix, eigen values, and eigen vectors were calculated. A pseudo trajectory to visualize the first PC was also generated. The principal motions were visualized using the normal mode wizard plugin in the VMD software.

Cluster analysis

Average-linked hierarchical agglomerative clustering algorithm was used for the cluster analysis using CPPTRAJ. The rms distances of the protein backbone was used as the reference distance for the clustering procedure. The centroid of the cluster summary was extracted as the major cluster snapshot. The epsilon value for the analysis was 3 Å and every 15th frame was considered for the analysis.

Limited trypsin hydrolysis and MS of AsCas12a protein and RNP

Plasmid encoding His(6x)-MBP-AsCas12a was obtained from Addgene (79,007) (95) and protein prepared as described (39). crRNA targeting an EGFP sequence (crEGIP) (39) was synthesized and purified by Integrated DNA Technologies. Trypsin proteolysis was performed similarly as described (39). Briefly, 30 μg of AsCas12a was incubated in the presence and absence of crRNA (1:1.5 M ratio) at 23 °C for 5 min. The resulting samples were incubated with Trypsin-EDTA solution (0.05%, Invitrogen) at a mass ratio of 100:1 and incubated at 37 °C for 15 min. The reaction was stopped by the addition of SDS-PAGE loading buffer (New England Biolabs) and incubation at 95 °C for 5 min. The reaction products were analyzed by 12% SDS-PAGE and stained with Coomassie brilliant blue G-250 in 50% (v/v) methanol and 10% (v/v) acetic acid then destained in the same solution without dye. Differential bands of interest were gel extracted and shipped to the UT Southwestern Medical Center Proteomics Core facility for LC/MS/MS mass spectrometric identification using their Gel Band ID service. Data was analyzed using Proteome Discoverer 2.4 (https://www.thermofisher.com/us/en/home/industrial/mass-spectrometry/liquid-chromatography-mass-spectrometry-lc-ms/lc-ms-software/multi-omics-data-analysis/proteome-discoverer-software.html) and searched using the His(6x)-MBP-AsCas12a protein sequence.

Differential scanning fluorimetry of AsCas12a

Thermal-shift assays were performed using a Bio-Rad CFX96 instrument with the FRET channel. AsCas12a alone and AsCas12a:crRNA (1:1) complex were incubated at room temperature for 10 min in a final assay concentration of 1x DSF buffer (100 mM Hepes, pH 7.5, 100 mM NaCl, 1 mM MgCl2). SYPRO Orange Protein Gel Stain 5000× (Thermo Fisher Scientific) was freshly diluted to 171.25× in nuclease-free water and 2 μl was spotted into the wells of a Hard-Shell 96-well PCR Plate (Bio-Rad). The incubated solution was then added to wells to a final assay volume of 25 μl and AsCas12a or AsCas12a:crRNA (1:1) concentration of 3.3 μΜ. The plate was sealed with Microseal “B” PCR Plate Sealing Film (Bio-Rad) and samples were heated from 25 °C to 85 °C at a ramp rate of 1 °C/min. A single fluorescence was collected every 30 s. The melting temperature (Tm) was determined using wTSA-CRAFT (https://bioserv.cbs.cnrs.fr/TSA_CRAFT/) (96).

Temperature-dependent SYPRO Orange emission intensity curves were analyzed to extract midpoint temperatures (Tm) and Van’t Hoff enthalpies of folding (ΔH) using two-state (F ⇌ U) and sequential three-state (F ⇌ I ⇌ U) unfolding models as described (97, 98). Briefly total signals (St) in DSF curves were baseline corrected by extrapolating linear regions (99) (e.g., 25–30 °C and 75–80 °C) and fractional signals were fit to each model using unconstrained nonlinear regressions in MATLAB (MathWorks). For each complex, mean values and 95% confidence intervals from three independent trials are reported.

Nucleic acid thermal denaturation monitored by UV absorbance

Experiments were performed on a Varian Cary 100 Bio UV-Visible Spectrophotometer equipped with a Cary Temperature Controller using a 1 cm path length cuvette. Oligonucleotides were purchased from Integrated DNA Technologies. Samples were prepared at a concentration of 2 μM oligonucleotide in 10 mM sodium phosphate buffer, pH 7.0, and 2 mM MgCl2 with a final sample volume of 500 μl. Samples were slow-annealed by heating to 95 °C for 10 min, slowly cooling to 25 °C over 3 h, and then incubating them at 4 °C for 16 h. Absorbance values were acquired at 260 nm over the range of 5 to 95 °C at a ramp rate of 0.5 °C/min with a data collection interval of 0.5 °C. The background was subtracted using Varian software and measurements performed in triplicate. The maximum value of the first derivative curves were used to determine the Tm values. The average Tm of each oligonucleotide with the corresponding SD is reported along with the normalized absorbance spectra.

Nucleic acid CD

Experiments were performed on a Chirascan VX Spectrophotometer using a 1 mm path length cuvette. Oligonucleotide samples were prepared at a concentration of 20 μM with a final sample volume of 200 μl. The buffer and slow-annealing conditions were the same as for thermal denaturation experiments. Ellipticity spectra were recorded over the range of 320 to 200 nm with a bandwidth of 1 nm, a sampling rate of 0.5 s-per-point, and a data collection interval of 1 nm. Three acquisitions were obtained for each oligonucleotide to create an average spectrum, from which the background spectrum was subtracted. The final reported spectra were smoothed using Chirascan software.

Data availability

The datasets and computer code generated or used in this study are available upon request to P. I. P. Phylogenetic reconstruction datasets and sequences, as well as original MS data, are available upon request to K. T. G.

Supporting information

This article contains supporting information.

Conflict of interest

The authors declare that they have no conflicts of interest with the contents of this article.

Acknowledgments

We thank Dr Andrew Lemoff (University of Texas Southwestern Medical Center Proteomics Core) for assistance with mass spectrometric analyses and data interpretation. This research was enabled in part by support provided by Calcul Québec (www.calculquebec.ca) and Compute Canada (www.computecanada.ca).

Author contributions

S. S., C. L. B, R. C., H. M. B., A. A. P., S. D. M., M. J. D., and K. T. G. investigation; K. T. G. conceptualization; M. J. D. and K. T. G. methodology; S. S., C. L. B., M. J. D., P. I. P., and K. T. G. writing–original draft; S. D. M., M. J. D., and P. I. P. formal analysis; S. S., C. L. B., R. C., P. I. P., and K. T. G. visualization.

Funding and additional information

This work was supported by a National Institutes of Health grant (1R01GM135646-01) to K. T. G. and M. J. D., an IRCC IIT Bombay research award grant to P. I. P., a National Institutes of Health grant (R35GM119818) to S. D. M., and a Prime Minister’s Research Fellowship (PMRF) scheme for the PhD fellowship to S. S. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

Reviewed by members of the JBC Editorial Board. Edited by Craig Cameron

Contributor Information

P.I. Pradeepkumar, Email: pradeep@chem.iitb.ac.in.

Keith T. Gagnon, Email: ktgagnon@siu.edu.

Supporting information

Supporting Figures S1–S23
mmc1.pdf (8.8MB, pdf)

References

  • 1.Moran L.A., Horton H.R., Scrimgeour K.G., Perry M.D. Principles of Biochemistry. 5th Ed. Pearson; London, England: 2012. [Google Scholar]
  • 2.Jones S., Thornton J.M. Principles of protein-protein interactions. Proc. Natl. Acad. Sci. U. S. A. 1996;93:13–20. doi: 10.1073/pnas.93.1.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li L., Ye K. Crystal structure of an H/ACA box ribonucleoprotein particle. Nature. 2006;443:302–307. doi: 10.1038/nature05151. [DOI] [PubMed] [Google Scholar]
  • 4.Lin J., Lai S., Jia R., Xu A., Zhang L., Lu J., et al. Structural basis for site-specific ribose methylation by box C/D RNA protein complexes. Nature. 2011;469:559–563. doi: 10.1038/nature09688. [DOI] [PubMed] [Google Scholar]
  • 5.Nishimasu H., Ran F.A., Hsu P.D., Konermann S., Shehata S.I., Dohmae N., et al. Crystal structure of Cas9 in complex with guide RNA and target DNA. Cell. 2014;156:935–949. doi: 10.1016/j.cell.2014.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Yamano T., Nishimasu H., Zetsche B., Hirano H., Slaymaker I.M., Li Y., et al. Crystal Structure of Cpf1 in Complex with Guide RNA and Target DNA. Cell. 2016;165:949–962. doi: 10.1016/j.cell.2016.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Elkayam E., Kuhn C.D., Tocilj A., Haase A.D., Greene E.M., Hannon G.J., et al. The structure of human argonaute-2 in complex with miR-20a. Cell. 2012;150:100–110. doi: 10.1016/j.cell.2012.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Gagnon K.T., Zhang X., Agris P.F., Maxwell E.S. Assembly of the archaeal box C/D sRNP can occur via alternative pathways and requires temperature-facilitated sRNA remodeling. J. Mol. Biol. 2006;362:1025–1042. doi: 10.1016/j.jmb.2006.07.091. [DOI] [PubMed] [Google Scholar]
  • 9.Turner B., Melcher S.E., Wilson T.J., Norman D.G., Lilley D.M. Induced fit of RNA on binding the L7Ae protein to the kink-turn motif. RNA. 2005;11:1192–1200. doi: 10.1261/rna.2680605. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gagnon K.T., Zhang X., Qu G., Biswas S., Suryadi J., Brown B.A., et al. Signature amino acids enable the archaeal L7Ae box C/D RNP core protein to recognize and bind the K-loop RNA motif. RNA. 2010;16:79–90. doi: 10.1261/rna.1692310. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Suryadi J., Tran E.J., Maxwell E.S., Brown B.A., 2nd The crystal structure of the methanocaldococcus jannaschii multifunctional L7Ae RNA-binding protein reveals an induced-fit interaction with the box C/D RNAs, Biochemistry. 2005;44:9657–9672. doi: 10.1021/bi050568q. [DOI] [PubMed] [Google Scholar]
  • 12.Kong R., Xu L., Piao L., Zhang D., Hou T.J., Chang S. Exploring the RNA-bound and RNA-free human argonaute-2 by molecular dynamics simulation method. Chem. Biol. Drug Des. 2017;90:753–763. doi: 10.1111/cbdd.12997. [DOI] [PubMed] [Google Scholar]
  • 13.Pourjafar-Dehkordi D., Zacharias M. Binding-induced functional-domain motions in the argonaute characterized by adaptive advanced sampling. PLoS Comput. Biol. 2021;17 doi: 10.1371/journal.pcbi.1009625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Ishino Y., Shinagawa H., Makino K., Amemura M., Nakata A. Nucleotide sequence of the iap gene, responsible for alkaline phosphatase isozyme conversion in Escherichia coli, and identification of the gene product. J. Bacteriol. 1987;169:5429–5433. doi: 10.1128/jb.169.12.5429-5433.1987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Nakata A., Amemura M., Makino K. Unusual nucleotide arrangement with repeated sequences in the Escherichia coli K-12 chromosome. J. Bacteriol. 1989;171:3553–3556. doi: 10.1128/jb.171.6.3553-3556.1989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Jansen R., Embden J.D., Gaastra W., Schouls L.M. Identification of genes that are associated with DNA repeats in prokaryotes. Mol. Microbiol. 2002;43:1565–1575. doi: 10.1046/j.1365-2958.2002.02839.x. [DOI] [PubMed] [Google Scholar]
  • 17.Pourcel C., Salvignol G., Vergnaud G. CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies. Microbiology. 2005;151:653–663. doi: 10.1099/mic.0.27437-0. [DOI] [PubMed] [Google Scholar]
  • 18.Mojica F.J., Diez-Villasenor C., Garcia-Martinez J., Soria E. Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements. J. Mol. Evol. 2005;60:174–182. doi: 10.1007/s00239-004-0046-3. [DOI] [PubMed] [Google Scholar]
  • 19.Bolotin A., Quinquis B., Sorokin A., Ehrlich S.D. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. Microbiology. 2005;151:2551–2561. doi: 10.1099/mic.0.28048-0. [DOI] [PubMed] [Google Scholar]
  • 20.Makarova K.S., Grishin N.V., Shabalina S.A., Wolf Y.I., Koonin E.V. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol. Direct. 2006;1:7. doi: 10.1186/1745-6150-1-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Barrangou R., Fremaux C., Deveau H., Richards M., Boyaval P., Moineau S., et al. CRISPR provides acquired resistance against viruses in prokaryotes. Science. 2007;315:1709–1712. doi: 10.1126/science.1138140. [DOI] [PubMed] [Google Scholar]
  • 22.Brouns S.J., Jore M.M., Lundgren M., Westra E.R., Slijkhuis R.J., Snijders A.P., et al. Small CRISPR RNAs guide antiviral defense in prokaryotes. Science. 2008;321:960–964. doi: 10.1126/science.1159689. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Mojica F.J., Rodriguez-Valera F. The discovery of CRISPR in archaea and bacteria. FEBS J. 2016;283:3162–3169. doi: 10.1111/febs.13766. [DOI] [PubMed] [Google Scholar]
  • 24.Nuñez J.K., Kranzusch P.J., Noeske J., Wright A.V., Davies C.W., Doudna J.A. Cas1-Cas2 complex formation mediates spacer acquisition during CRISPR-Cas adaptive immunity. Nat. Struct. Mol. Biol. 2014;21:528–534. doi: 10.1038/nsmb.2820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Makarova K.S., Wolf Y.I., Alkhnbashi O.S., Costa F., Shah S.A., Saunders S.J., et al. An updated evolutionary classification of CRISPR-Cas systems. Nat. Rev. Microbiol. 2015;13:722–736. doi: 10.1038/nrmicro3569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Yosef I., Goren M.G., Qimron U. Proteins and DNA elements essential for the CRISPR adaptation process in Escherichia coli. Nucleic Acids Res. 2012;40:5569–5576. doi: 10.1093/nar/gks216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Nuñez J.K., Lee A.S., Engelman A., Doudna J.A. Integrase-mediated spacer acquisition during CRISPR-Cas adaptive immunity. Nature. 2015;519:193–198. doi: 10.1038/nature14237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Krupovic M., Béguin P., Koonin E.V. Casposons: mobile genetic elements that gave rise to the CRISPR-Cas adaptation machinery. Curr. Opin. Microbiol. 2017;38:36–43. doi: 10.1016/j.mib.2017.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wang R., Li M., Gong L., Hu S., Xiang H. DNA motifs determining the accuracy of repeat duplication during CRISPR adaptation inHaloarcula hispanica. Nucleic Acids Res. 2016;44:4266–4277. doi: 10.1093/nar/gkw260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Jinek M., Chylinski K., Fonfara I., Hauer M., Doudna J.A., Charpentier E. A Programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Mojica F.J., Díez-Villaseñor C., García-Martínez J., Almendros C. Short motif sequences determine the targets of the prokaryotic CRISPR defence system. Microbiology (Reading) 2009;155:733–740. doi: 10.1099/mic.0.023960-0. [DOI] [PubMed] [Google Scholar]
  • 32.Sander J.D., Joung J.K. CRISPR-Cas systems for editing, regulating and targeting genomes. Nat. Biotechnol. 2014;32:347–355. doi: 10.1038/nbt.2842. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Sapranauskas R., Gasiunas G., Fremaux C., Barrangou R., Horvath P., Siksnys V. The Streptococcus thermophilus CRISPR/Cas system provides immunity in Escherichia coli. Nucleic Acids Res. 2011;39:9275–9282. doi: 10.1093/nar/gkr606. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Sternberg S.H., Redding S., Jinek M., Greene E.C., Doudna J.A. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature. 2014;507:62–67. doi: 10.1038/nature13011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zetsche B., Gootenberg J.S., Abudayyeh O.O., Slaymaker I.M., Makarova K.S., Essletzbichler P., et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-cas system. Cell. 2015;163:759–771. doi: 10.1016/j.cell.2015.09.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Kim D., Kim J., Hur J.K., Been K.W., Yoon S.H., Kim J.S. Genome-wide analysis reveals specificities of Cpf1 endonucleases in human cells. Nat. Biotechnol. 2016;34:863–868. doi: 10.1038/nbt.3609. [DOI] [PubMed] [Google Scholar]
  • 37.Chen J.S., Ma E., Harrington L.B., Da Costa M., Tian X., Palefsky J.M., et al. CRISPR-Cas12a target binding unleashes indiscriminate single-stranded DNase activity. Science. 2018;360:436–439. doi: 10.1126/science.aar6245. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Broughton J.P., Deng X., Yu G., Fasching C.L., Servellita V., Singh J., et al. CRISPR-cas12-based detection of SARS-CoV-2. Nat. Biotechnol. 2020;38:870–874. doi: 10.1038/s41587-020-0513-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ageely E.A., Chilamkurthy R., Jana S., Abdullahu L., O’Reilly D., Jensik P.J., et al. Gene editing with CRISPR-Cas12a guides possessing ribose-modified pseudoknot handles. Nat. Commun. 2021;12:6591. doi: 10.1038/s41467-021-26989-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Wu S.S., Li Q.C., Yin C.Q., Xue W., Song C.Q. Advances in CRISPR/Cas-based gene therapy in human genetic diseases. Theranostics. 2020;10:4374–4382. doi: 10.7150/thno.43360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Zhang L., Zuris J.A., Viswanathan R., Edelstein J.N., Turk R., Thommandru B., et al. AsCas12a ultra nuclease facilitates the rapid generation of therapeutic cell medicines. Nat. Commun. 2021;12:3908. doi: 10.1038/s41467-021-24017-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Sternberg S.H., LaFrance B., Kaplan M., Doudna J.A. Conformational control of DNA target cleavage by CRISPR-Cas9. Nature. 2015;527:110–113. doi: 10.1038/nature15544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Jinek M., Jiang F., Taylor D.W., Sternberg S.H., Kaya E., Ma E., et al. Structures of Cas9 endonucleases reveal RNA-mediated conformational activation. Science. 2014;343 doi: 10.1126/science.1247997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Zhu X., Clarke R., Puppala A.K., Chittori S., Merk A., Merrill B.J., et al. Cryo-EM structures reveal coordinated domain motions that DNA cleavage by Cas9. Nat. Struct. Mol. Biol. 2019;26:679–685. doi: 10.1038/s41594-019-0258-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Lim Y., Bak S.Y., Sung K., Jeong E., Lee S.H., Kim J.S., et al. Structural roles of guide RNAs in the nuclease activity of Cas9 endonuclease. Nat. Commun. 2016;7 doi: 10.1038/ncomms13350. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Osuka S., Isomura K., Kajimoto S., Komori T., Nishimasu H., Shima T., et al. Real-time observation of flexible domain movements in CRISPR-cas9. EMBO J. 2018;37 doi: 10.15252/embj.201796941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Anders C., Niewoehner O., Duerst A., Jinek M. Structural basis of PAM-dependent target DNA recognition by the Cas9 endonuclease. Nature. 2014;513:569–573. doi: 10.1038/nature13579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Jiang F., Zhou K., Ma L., Gressel S., Doudna J.A. A Cas9-guide RNA complex preorganized for target DNA recognition. Science. 2015;348:1477–1481. doi: 10.1126/science.aab1452. [DOI] [PubMed] [Google Scholar]
  • 49.Jiang F., Taylor D.W., Chen J.S., Kornfeld J.E., Zhou K., Thompson A.J., et al. Structures of a CRISPR-Cas9 R-loop complex primed for DNA cleavage. Science. 2016;351:867–871. doi: 10.1126/science.aad8282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Nishimasu H., Cong L., Yan W.X., Ran F.A., Zetsche B., Li Y., et al. Crystal structure of Staphylococcus aureus Cas9. Cell. 2015;162:1113–1126. doi: 10.1016/j.cell.2015.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Dong D., Ren K., Qiu X., Zheng J., Guo M., Guan X., et al. The crystal structure of Cpf1 in complex with CRISPR RNA. Nature. 2016;532:522–526. doi: 10.1038/nature17944. [DOI] [PubMed] [Google Scholar]
  • 52.Cofsky J.C., Soczek K.M., Knott G.J., Nogales E., Doudna J.A. CRISPR-Cas9 bends and twists DNA to read its sequence. Nat. Struct. Mol. Biol. 2022;29:395–402. doi: 10.1038/s41594-022-00756-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Min K., Yoon H., Jo I., Ha N.C., Jin K.S., Kim J.S., et al. Structural insights into the apo-structure of Cpf1 protein from Francisella novicida. Biochem. Biophys. Res. Commun. 2018;498:775–781. doi: 10.1016/j.bbrc.2018.03.057. [DOI] [PubMed] [Google Scholar]
  • 54.Li W., Chan C., Zeng C., Turk R., Behlke M.A., Cheng X., et al. Rational design of small molecules to enhance genome editing efficiency by selectively targeting distinct functional states of CRISPR-cas12a. Bioconjug. Chem. 2020;31:542–546. doi: 10.1021/acs.bioconjchem.0c00062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Wörle E., Jakob L., Schmidbauer A., Zinner G., Grohmann D. Decoupling the bridge helix of Cas12a results in a reduced trimming activity, increased mismatch sensitivity and impaired conformational transitions. Nucleic Acids Res. 2021;49:5278–5293. doi: 10.1093/nar/gkab286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Saha A., Arantes P.R., Hsu R.V., Narkhede Y.B., Jinek M., Palermo G. Molecular dynamics reveals a DNA-induced dynamic switch triggering activation of CRISPR-cas12a. J. Chem. Inf. Model. 2020;60:6427–6437. doi: 10.1021/acs.jcim.0c00929. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Swarts D.C., van der Oost J., Jinek M. Structural basis for guide RNA processing and seed-dependent DNA targeting by CRISPR-cas12a. Mol. Cell. 2017;66:221–233. doi: 10.1016/j.molcel.2017.03.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Will S., Joshi T., Hofacker I.L., Stadler P.F., Backofen R. LocARNA-P: accurate boundary prediction and improved detection of structural RNAs. RNA. 2012;18:900–914. doi: 10.1261/rna.029041.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Götz A.W., Williamson M.J., Xu D., Poole D., Le Grand S., Walker R.C. Routine microsecond molecular dynamics simulations with AMBER on GPUs. 1. generalized born. J. Chem. Theory Comput. 2012;8:1542–1555. doi: 10.1021/ct200909j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Salomon-Ferrer R., Götz A.W., Poole D., Le Grand S., Walker R.C. Routine microsecond molecular dynamics simulations with AMBER on GPUs. 2. explicit solvent particle Mesh Ewald. J. Chem. Theory Comput. 2013;9:3878–3888. doi: 10.1021/ct400314y. [DOI] [PubMed] [Google Scholar]
  • 61.Le Grand S., Götz A.W., Walker R.C. SPFP: speed without compromise-a mixed precision model for GPU accelerated molecular dynamics simulations. Comput. Phys. Commun. 2013;184:374–380. [Google Scholar]
  • 62.Case D.A., Ben-Shalom I.Y., Brozell S.R., Cerutti D.S., Cheatham T.E., III, Cruzeiro V.W.D., et al. 2015. Amber 2015 (University of California, San Francisco)ambermd.org/doc12/Amber15.pdf Available from. [Google Scholar]
  • 63.Bucher D., Grant B.J., McCammon J.A. Induced fit or conformational selection? the role of the semi-closed state in the maltose binding protein. Biochemistry. 2011;50:10530–10539. doi: 10.1021/bi201481a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Gao K., Oerlemans R., Groves M.R. Theory and applications of differential scanning fluorimetry in early-stage drug discovery. Biophys. Rev. 2020;12:85–104. doi: 10.1007/s12551-020-00619-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Kankia B.I., Marky L.A. DNA, RNA, and DNA/RNA oligomer duplexes: a comparative study of their stability, heat, hydration, and Mg2+ binding properties. J. Phys. Chem. B. 1999;103:8759–8767. [Google Scholar]
  • 66.Kypr J., Kejnovska I., Renciuk D., Vorlickova M. Circular dichroism and conformational polymorphism of DNA. Nucleic Acids Res. 2009;37:1713–1725. doi: 10.1093/nar/gkp026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Liu J., Carmell M.A., Rivas F.V., Marsden C.G., Thomson J.M., Song J.J., et al. Argonaute2 is the catalytic engine of mammalian RNAi. Science. 2004;305:1437–1441. doi: 10.1126/science.1102513. [DOI] [PubMed] [Google Scholar]
  • 68.Landthaler M., Gaidatzis D., Rothballer A., Chen P.Y., Soll S.J., Dinic L., et al. Molecular characterization of human argonaute-containing ribonucleoprotein complexes and their bound target mRNAs. RNA. 2008;14:2580–2596. doi: 10.1261/rna.1351608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Liu X., Liu C., Zhou J., Lv X., He Y., Liu H., et al. Engineered FnCas12a with enhanced activity through directional evolution in human cells. J. Biol. Chem. 2021;296 doi: 10.1016/j.jbc.2021.100394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Jiang F., Doudna J.A. CRISPR-Cas9 structures and mechanisms. Annu. Rev. Biophys. 2017;46:505–529. doi: 10.1146/annurev-biophys-062215-010822. [DOI] [PubMed] [Google Scholar]
  • 71.Williamson J.R. Induced fit in RNA-protein recognition. Nat. Struct. Biol. 2000;7:834–837. doi: 10.1038/79575. [DOI] [PubMed] [Google Scholar]
  • 72.Guo X., Campbell F.E., Sun L., Christian E.L., Anderson V.E., Harris M.E. RNA-dependent folding and stabilization of C5 protein during assembly of the E. coli RNase P holoenzyme. J. Mol. Biol. 2006;360:190–203. doi: 10.1016/j.jmb.2006.05.001. [DOI] [PubMed] [Google Scholar]
  • 73.Chakraborty P., Di Cera E. Induced fit is a special case of conformational selection. Biochemistry. 2017;56:2853–2859. doi: 10.1021/acs.biochem.7b00340. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Boehr D.D., Nussinov R., Wright P.E. The role of dynamic conformational ensembles in biomolecular recognition. Nat. Chem. Biol. 2009;5:789–796. doi: 10.1038/nchembio.232. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Qin F., Chen Y., Wu M., Li Y., Zhang J., Chen H.F. Induced fit or conformational selection for RNA/U1A folding. RNA. 2010;16:1053–1061. doi: 10.1261/rna.2008110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Zhou H.X. From induced fit to conformational selection: a continuum of binding mechanism controlled by the timescale of conformational transitions. Biophys. J. 2010;98:L15–L17. doi: 10.1016/j.bpj.2009.11.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Nguyen L.T., Smith B.M., Jain P.K. Enhancement of trans-cleavage activity of Cas12a with engineered crRNA enables amplified nucleic acid detection. Nat. Commun. 2020;11:4906. doi: 10.1038/s41467-020-18615-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Katsamba P.S., Myszka D.G., Laird-Offringa I.A. Two functionally distinct steps mediate high affinity binding of U1A protein to U1 hairpin II RNA. J. Biol. Chem. 2001;276:21476–21481. doi: 10.1074/jbc.M101624200. [DOI] [PubMed] [Google Scholar]
  • 79.Lee K., Mackley V.A., Rao A., Chong A.T., Dewitt M.A., Corn J.E., et al. Synthetically modified guide RNA and donor DNA are a versatile platform for CRISPR-Cas9 engineering. Elife. 2017;6 doi: 10.7554/eLife.25312. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Park H.M., Liu H., Wu J., Chong A., Mackley V., Fellmann C., et al. Extension of the crRNA enhances Cpf1 gene editing in vitro and in vivo. Nat. Commun. 2018;9:3313. doi: 10.1038/s41467-018-05641-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Thompson J.D., Gibson T.J., Higgins D.G. Multiple sequence alignment using ClustalW and ClustalX. Curr. Protoc. Bioinformatics. 2003:2.3. 1–2.3. 22. doi: 10.1002/0471250953.bi0203s00. [DOI] [PubMed] [Google Scholar]
  • 82.Jones D.T., Taylor W.R., Thornton J.M. The rapid generation of mutation data matrices from protein sequences. Bioinformatics. 1992;8:275–282. doi: 10.1093/bioinformatics/8.3.275. [DOI] [PubMed] [Google Scholar]
  • 83.Kerpedjiev P., Hammer S., Hofacker I.L. Forna (force-directed RNA): simple and effective online RNA secondary structure diagrams. Bioinformatics. 2015;31:3377–3379. doi: 10.1093/bioinformatics/btv372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Waterhouse A., Bertoni M., Bienert S., Studer G., Tauriello G., Gumienny R., et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46:W296–W303. doi: 10.1093/nar/gky427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Maier J.A., Martinez C., Kasavajhala K., Wickstrom L., Hauser K.E., Simmerling C. ff14SB: Improving the accuracy of protein side chain and backbone parameters from ff99SB. J. Chem. Theory Comput. 2015;11:3696–3713. doi: 10.1021/acs.jctc.5b00255. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Pérez A., Marchán I., Svozil D., Sponer J., Cheatham T.E., III, Laughton C.A., et al. Refinement of the AMBER force field for nucleic acids: improving the description of α/γ conformers. Biophys. J. 2007;92:3817–3829. doi: 10.1529/biophysj.106.097782. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Zgarbová M., Otyepka M., Šponer J., Mládek A.t., Banáš P., Cheatham T.E., III, et al. Refinement of the Cornell et al. nucleic acids force field based on reference quantum chemical calculations of glycosidic torsion profiles. J. Chem. Theory Comput. 2011;7:2886–2902. doi: 10.1021/ct200162x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Banás P., Hollas D., Zgarbová M., Jurecka P., Orozco M., Cheatham T.E., III, et al. Performance of molecular mechanics force fields for RNA simulations: stability of UUCG and GNRA hairpins. J. Chem. Theory Comput. 2010;6:3836–3849. doi: 10.1021/ct100481h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Jorgensen W.L., Chandrasekhar J., Madura J.D., Impey R.W., Klein M.L. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926–935. [Google Scholar]
  • 90.Turq P., Lantelme F., Friedman H.L. Brownian dynamics: its application to ionic solutions. J. Chem. Phys. 1977;66:3039–3044. [Google Scholar]
  • 91.Berendsen H.J., Postma J.v., van Gunsteren W.F., DiNola A., Haak J.R. Molecular dynamics with coupling to an external bath. J. Chem. Phys. 1984;81:3684–3690. [Google Scholar]
  • 92.Roe D.R., Cheatham T.E., III PTRAJ and CPPTRAJ: software for processing and analysis of molecular dynamics trajectory data. J. Chem. Theory Comput. 2013;9:3084–3095. doi: 10.1021/ct400341p. [DOI] [PubMed] [Google Scholar]
  • 93.Humphrey W., Dalke A., Schulten K. VMD: visual molecular dynamics. J. Mol. Graph. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
  • 94.Weiser J., Shenkin P.S., Still W.C. Approximate atomic surfaces from linear combinations of pairwise overlaps (LCPO) J. Comput. Chem. 1999;20:217–230. [Google Scholar]
  • 95.Hur J.K., Kim K., Been K.W., Baek G., Ye S., Hur J.W., et al. Targeted mutagenesis in mice by electroporation of Cpf1 ribonucleoproteins. Nat. Biotechnol. 2016;34:807–808. doi: 10.1038/nbt.3596. [DOI] [PubMed] [Google Scholar]
  • 96.Lee P.H., Huang X.X., Teh B.T., Ng L.M. TSA-CRAFT: a free software for automatic and robust thermal shift assay data analysis. SLAS Discov. 2019;24:606–612. doi: 10.1177/2472555218823547. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Gray R.D., Buscaglia R., Chaires J.B. Populated intermediates in the thermal unfolding of the human telomeric quadruplex. J. Am. Chem. Soc. 2012;134:16834–16844. doi: 10.1021/ja307543z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Dignam J.D., Qu X., Chaires J.B. Equilibrium unfolding of Bombyx mori Glycyl-tRNA synthetase. J. Biol. Chem. 2001;276:4028–4037. doi: 10.1074/jbc.M006840200. [DOI] [PubMed] [Google Scholar]
  • 99.Santoro M.M., Bolen D.W. Unfolding free energy changes determined by the linear extrapolation method. 1. unfolding of phenylmethanesulfonyl .alpha.-chymotrypsin using different denaturants. Biochemistry. 1988;27:8063–8068. doi: 10.1021/bi00421a014. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Figures S1–S23
mmc1.pdf (8.8MB, pdf)

Data Availability Statement

The datasets and computer code generated or used in this study are available upon request to P. I. P. Phylogenetic reconstruction datasets and sequences, as well as original MS data, are available upon request to K. T. G.


Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES