SUMMARY
CRISPR-Cas is a prokaryotic adaptive immune system that provides sequence-specific defense against foreign nucleic acids. Here we report the structure and function of the effector complex of the Type III-A CRISPR-Cas system of Thermus thermophilus: the Csm complex (TtCsm). TtCsm is composed of five different protein subunits (Csm1–Csm5) with an uneven stoichiometry and a single crRNA of variable size (35–53 nt). The TtCsm crRNA content is similar to the Type III-B Cmr complex, indicating that crRNAs are shared among different subtypes. A negative stain EM structure of the TtCsm complex exhibits the characteristic architecture of Type I and Type III CRISPR-associated ribonucleoprotein complexes. crRNA-protein crosslinking studies show extensive contacts between the Csm3 backbone and the bound crRNA. We show that, like TtCmr, TtCsm cleaves complementary target RNAs at multiple sites. Unlike Type I complexes, interference by TtCsm does not proceed via initial base pairing by a seed sequence.
INTRODUCTION
The arsenal of prokaryotic defense mechanisms against mobile genetics elements (MGE), such as bacteriophages and (conjugative) plasmids, includes adaptive immunity that serves as a sequence-specific memory of prior infections (Barrangou and Marraffini, 2014; Gasiunas et al., 2014; Reeks et al., 2013; Terns and Terns, 2014; van der Oost et al., 2014). These systems are made up of arrays of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and CRISPR-associated (cas) genes that are present in approximately half of sequenced bacteria and most archaea (Grissa et al., 2007; Haft et al., 2005; Makarova et al., 2011). CRISPR-Cas systems are categorized into three major types (Types I, II, and III) on the basis of their specific Cas proteins (Koonin and Makarova, 2013; Makarova et al., 2011).
CRISPR arrays are short repeated sequences (24–50 bp) interspaced by similar-sized sequences with homology to MGE (spacers). The array is preceded by a leader sequence, which contains the promoter for transcription of the array. The spacers are acquired from the MGE and inserted in the chromosomal CRISPR array of the host by a process called “acquisition” that requires Cas1 and Cas2 proteins (Arslan et al., 2014; Barrangou et al., 2007; Yosef et al., 2012), although the requirement for these proteins in Type III systems has not been demonstrated so far. Transcription of the CRISPR array generally results in a long pre-crRNA. In Type I and Type III systems, this pre-crRNA is subsequently processed by a Cas6-type endoribonuclease into separate crRNAs, containing part(s) of the repeat and a single spacer sequence (Brouns et al., 2008; Carte et al., 2008). Type II systems use Cas9, the host factor RNase III, and a transactivating crRNA (tracrRNA) with complementarity to the repeat for crRNA maturation (Carte et al., 2014; Deltcheva et al., 2011). In some CRISPR-Cas systems, unknown nucleases trim the 5′ or 3′ ends of the crRNA (Deltcheva et al., 2011; Hatoum-Aslan et al., 2011; Richter et al., 2012; Scholz et al., 2013). In the interference stage, Cas protein(s) and the mature crRNA associate to form a ribonucleoprotein (RNP) complex that targets nucleic acid sequences complementary to the crRNA (the protospacer) for degradation by a trans-acting nuclease (Cas3) in Type I systems (Westra et al., 2012) or by intrinsic nuclease activity in Type II and Type III-B crRNP complexes (Gasiunas et al., 2012; Hale et al., 2009; Jinek et al., 2012). While Type I and Type II complexes target DNA, the Type III-B complex is the only CRISPR-Cas system characterized to date that targets RNA.
Despite these differences, recent studies have highlighted key similarities in the architecture of Type I (Cascade-like) and Type III complexes (Reeks et al., 2013; Rouillon et al., 2013; Spilman et al., 2013; Staals et al., 2013; Zhang et al., 2012), suggesting that these complexes have evolved from a common ancestor. These complexes share a “backbone” consisting of 4–6 copies of Cas7(-like) proteins and contain a smaller Cas5-like protein, which is thought to be involved in binding the 5′ end of the crRNA. An important distinction between Type I and Type III complexes concerns the large subunit positioned at the base of the backbone, which is Cas8 in most Type I systems and Cas10 in Type III systems (Reeks et al., 2013; van der Oost et al., 2014).
Thermus thermophilus HB8 is a convenient model organism to study CRISPR-Cas systems, since it has 11 CRISPR arrays (CRISPR8 is not considered a genuine CRISPR array, since it solely consists of a single repeat sequence and no spacers) and 4 different CRISPR-Cas systems: Type I-E, Type III-A, Type III-B, and an unclassified Type I system (Agari et al., 2010) (Figure 1). We previously characterized the RNA-targeting Cmr complex of the Type III-B system of T. thermophilus (Staals et al., 2013). As opposed to the well-characterized Type I-E system, the Type III-A system has not been extensively studied biochemically.
Figure 1. Schematic Representation of CRISPR Arrays and cas Genes on the Chromosome and Plasmid pTT27 of T. thermophilus HB8.
CRISPR arrays (1–12) are indicated in different grayscales, depending on the repeat type (I, II, or III). Cas (-related) genes belonging to a particular CRISPR-Cas subtype are colored in green (subtype III-A), blue (subtype III-B), or yellow (subtype I-E). Additional cas genes are indicated in white. For each of these CRISPR arrays, the bottom panel summarized the genomic location, the consensus repeat sequence, repeat type, and the number of spacers. The 5′ tag sequences, as found by our RNA-seq analysis, are underlined. The figure and legend are adapted from Staals et al. (2013).
The Csm operon of the Type III-A system encodes five Csm proteins (Csm1–Csm5) that form an RNP complex with the mature crRNA and sometimes an additional protein (called Csm6 or Csx1) (Hatoum-Aslan et al., 2013; Rouillon et al., 2013). The Cas7-like Csm3 forms the backbone of the complex and binds RNA in a sequence-independent fashion (Hrle et al., 2013; Koonin and Makarova, 2013; Rouillon et al., 2013). After primary cleavage of the pre-crRNA, guide maturation in the Type III-A system involves secondary trimming of the 3′ end by a ruler-like mechanism (Hatoum-Aslan et al., 2011), in which each Csm3 subunit binds and extends 6 nt segments of the mature crRNA and exposes unbound 3′ ends for cleavage by an unknown nuclease (Hatoum-Aslan et al., 2013).
The Type III-A system prevents autoimmunity (e.g., targeting the CRISPR array) by a self-discrimination versus non-self-discrimination mechanism based on complementarity with the 5′ repeat-derived fragment of the crRNA (Carte et al., 2008; Marraffini and Sontheimer, 2010), i.e., by “self-inactivation” (van der Oost et al., 2014). In contrast, Type I systems use a protospacer adjacent motif (PAM)-dependent targeting mechanism, which does not rely on base pairing (Westra et al., 2013), i.e., by “non-self-activation.”
Despite these initial genetic and structural insights, no in vitro activity of the Type III-A Csm complex has yet been reported. In this study, we investigated the structural and biochemical properties of the native Type III-A Csm complex isolated from T. thermophilus. Unexpectedly, and in contrast to previous findings, we show that Csm exhibits endoribonuclease activity using a cleavage mechanism similar to the Type III-B CRISPR-Cas complex Cmr.
RESULTS
Purification and Protein Composition of the TtCsm Complex
The csm genes of T. thermophilus HB8 are located in the vicinity of the CRISPR4 region on megaplasmid pTT27, comprising an operon composed of csm1, csm2, csm3, csm4, csm5, and csx1 (Figure 1). We constructed a recombinant T. thermophilus strain that produces the Csm5 protein fused to a (His)6 tag at its C terminus. The protein complex was purified to homogeneity using five subsequent column chromatography steps as described in the Supplemental Experimental Procedures available online. The purified protein complex was composed of five proteins (Figure 2A). We confirmed by mass-spectrometry-based analyses (not shown) that the proteins corresponded to TTHB147 (Csm1/Cas10), TTHB148 (Csm2), TTHB149 (Csm3), TTHB150 (Csm4), and TTHB151 (Csm5). Blue native polyacrylamide gel electrophoresis (BN-PAGE) analysis of the Csm complex revealed two major bands of 430–450 kDa, suggesting minor heterogeneity of the purified Csm complex (Figure 2B). We also constructed a recombinant T. thermophilus strain that produces the Csx1 protein fused with a (His)6 tag at its C terminus. Under the conditions we used, Csx1 did not copurify with the Csm complex.
Figure 2. Purification of the Native T. thermophilus Type III-A Csm Complex.
(A) SDS-PAGE analysis of the T. thermophilus Csm complex. A representative sample of the purified protein (2 μg) was analyzed on a 15% polyacrylamide gel (lane 2), followed by staining with Coomassie Brilliant Blue R-250. Each subunit is indicated. The Csm5 has a (His)6-tag at its C terminus. Lane 1, molecular-mass markers.
(B) BN-PAGE analysis of the T. thermophilus Csm complex. Two μg of the representative sample was analyzed on a 4%–16% linear polyacrylamide gradient gel in the presence of 0.02% Coomassie Brilliant Blue G-250 (lane 2). Lane 1, molecular-mass markers. Protein concentration used was determined by the Bradford method (Bradford, 1976), using bovine serum albumin as a standard.
crRNA Content of the TtCsm Complex
Denaturing gel analysis of the copurifying nucleic acids revealed that TtCsm binds crRNAs of variable lengths (Figure 3A). RNA sequencing (RNA-seq) was performed to determine the size distribution and nature of these sequences. RNA-seq confirmed that the most abundant crRNAs varied in length from 35 to 53 nt and were enriched for the 45 and 53 nt species. Although not as clear as in other studies of Type III systems (Hale et al., 2012; Hatoum-Aslan et al., 2013; Staals et al., 2013), there is a trend of 5 or 6 nt steps in the size distribution of crRNAs isolated from TtCsm (35-40-45 nt and 35-41-47-53 nt) (Figure 3B). Similar to our previous observation with crRNAs from the T. thermophilus Cmr complex, the majority of the Csm-bound crRNAs (71%) contained an 8 nt repeat-derived 5′ handle: 5′-AUUGCGAC (Staals et al., 2013). Csm-bound crRNAs retained either the complete or a truncated spacer region (39–42 nt), with the larger species containing a few nucleotides (3–6 nt) of the downstream repeat sequence.
Figure 3. RNA-Seq Analysis of TtCsm-Bound crRNAs.
(A) crRNAs were isolated by phenol-chloroform-isoamyl alcohol extraction and analyzed by denaturing PAGE (20% AA, 7M urea). Discontinuous gel lanes are indicated by dashed lines.
(B) Histogram of the size-distributions of the Csm-bound crRNAs.
(C) Histogram of the distribution of the Csm-bound crRNAs over the 11 CRISPR arrays.
(D) Mapping of the Csm-bound crRNAs on CRISPR4.
Overview of all mapped crRNAs and comparison of the crRNA content of the TtCsm and TtCmr complexes are provided in Figure S1.
The vast majority of the reads of the RNA-seq data set could be mapped to the genome of T. thermophilus HB8 (94.18%) and revealed that most crRNAs were derived from CRISPRs 1, 4, and 11 (84.73%), while CRISPRs 6, 7, 9, and 10 were highly underrepresented (0.13%) (Figure 3C; Figure S1A). This bias strongly correlates with the different classes of repeat sequences (Figure 1). In addition, major variation of Csm-bound crRNAs occurs among sequences derived from the same CRISPR array: e.g., crRNA 4.5 (spacer 5 from CRISPR array 4, 15.5%) was one of the most abundant guides in the complex, while levels of crRNA 4.4 (0.1%) were extremely low (Figure 3D). Strikingly, the observed bias in the Csm-bound crRNA population closely resembles that of the recently established Cmr-bound crRNAs (Staals et al., 2013) (Figure S1B), indicating that similar crRNAs can be shared among the effector complexes of different CRISPR-Cas systems within one host. Despite these similarities, TtCsm-bound crRNAs are somewhat longer than those found in the TtCmr complex (Juranek et al., 2012; Staals et al., 2013). This finding most likely suggests that crRNAs initially assemble with premature protein complexes that differ in number of backbone subunits (Csm3 in TtCsm, Cmr4 in TtCmr) and that trimming of their 3′ overhang results in the aforementioned 5–6 nt size differences. These data showed that the Csm complex binds crRNA species of multiple lengths with a conserved 8 nt 5′ handle from a subset of CRISPR arrays and spacers.
crRNA-Protein Interactions
To study the protein-crRNA interactions within the Csm complex, we used UV-induced protein-RNA crosslinking (Kramer et al., 2011; Schmidt et al., 2012). UV crosslinking was followed by enzymatic digestion of the protein and RNA moiety, enrichment of crosslinked peptide-RNA oligonucleotides and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis. Peptide-RNA oligonucleotides were identified with their crosslinked amino acid and nucleotide by dedicated database searches (Kramer et al., 2011). Using this approach, we found 12 peptides crosslinked to different mono-, di-, and trinucleotides from the crRNAs in the TtCsm complex (Table 1). For each of the five protein subunits (Csm1 to Csm5), at least one crosslinked peptide was identified. Remarkably, six different crosslinked peptides were identified in the Csm3 subunit. By inspection of the MS/MS fragment ion spectra, we identified different amino acids as crosslinking sites (Table 1; Figure S2). The crosslinking site on the RNA was always uracil, because this is the most UV-reactive nucleotide for this technique (Kramer et al., 2014). In most cases, the same peptide sequence was found to be crosslinked to di- and trinucleotide RNAs of various compositions. However, because the sample contained a mixture of natural guides isolated from T. thermophilus, unambiguous identification of the exact crosslinking site on the crRNAs was not possible.
Table 1.
List of RNA-Protein Crosslinks Identified in the TtCsm Complex
Protein | Peptide Sequence | RNA Moieties Observed Crosslinked to the Peptide |
---|---|---|
Csm1 | 371 RLHEALAR 378 | UUA |
Csm2 | 35 LKSSQFR 41 | U, U-H2O |
Csm3 | 21 IGMSRDQMAIGDLDNPVVR 39 | U, UU, UG |
Csm3 | 40 NPLTDEPYIPGSSLK 54 | U, U-H2O, UG, UA |
Csm3 | 91 IFGLAPENDER 101 | U, UU, UC, UG |
Csm3 | 136 GGLYTEIKQEVFIPR 150 | U, UU, UC, UG, UCG, UUC, UUG |
Csm3 | 151 LGGNANPR 158 | UA, UC, UG UGG, UCA, UUA |
Csm3 | 159 TTERVPAGAR 168 | U, UG, UGG, UUG |
Csm4 | 69 LPPVQVEETTLRK 81 | U, UG, UUA |
Csm4 | 126 TRVGVDR 132 | UC, UU |
Csm5 | 132 SPLGAYLPGSSVK 144 | U, UA, UG, UUA |
Csm5 | 255 MVLLAETFR 263 | U, U-H2O, UG |
Overview of peptide-RNA oligonucleotide crosslinks identified in TtCsm complex. The positions and sequence of the crosslinked peptides as identified by MS is shown and the crosslinked amino acids are underlined. For crosslinked peptides of Csm1 (positions 371–378) and Csm3 (positions 21–39), the crosslinked amino acid could not be unambiguously identified because of the lack of corresponding fragment ions in the MS/MS carrying a nucleotide moiety (for details, see Figure S2). For peptides crosslinked to mono-, di-, or trinucleotides shown in italics, the corresponding MS/MS spectra are given in Figure S2.
Stoichiometry of the TtCsm Complex
To investigate the architecture of the TtCsm complex, we determined the composition of the Csm protein complex using native mass spectrometry as performed previously (Jore et al., 2011; Staals et al., 2013; van Duijn et al., 2012). Denaturing and tandem MS analyses provided accurate mass measurements for each protein subunit of TtCsm. The measured masses of the individual subunits were consistent with the theoretical values on the basis of their amino acid sequence (Table S1). Analysis of the intact assembly by native MS revealed the presence of two major species. From their well-resolved charge state distributions, we accurately determined their masses as 426,998 ± 217 Da and 381,896 ± 261 Da (Figure 4A; Table S1), in agreement with the estimate from native gel electrophoresis (430–450 kDa; Figure 2B). Although we could measure the masses quite accurately, the stoichiometry of the Csm subunits could not be resolved unambiguously.
Figure 4. Subunit Composition of TtCsm.
(A) Native nanoelectrospray ionization mass spectrum of the native TtCsm complex. Two main well-resolved charge state distributions are present at high m/z values, corresponding to complexes of 427 kDa (blue) and 382 kDa (red).
(B) TtCsm (sub)complexes analyzed by native mass spectrometry. The subcomplexes were formed after in-solution dissociation with 30% DMSO v/v or 175 mM ammonium acetate acidified with acetic acid (pH 3.6–4). More in-depth calculations of the different (sub)complexes can be found in Table S1 and Table S2.
The two most abundant TtCsm complexes observed (427 and 382 kDa) most likely represent the intact Csm and a Csm subcomplex lacking Csm5, respectively. The measured mass difference between the two assemblies is 46.1 kDa (the mass of a Csm5 monomer is 44,286 Da). Previously, we used collision-induced dissociation on mass-selected ions of intact Cascade complexes (van Duijn et al., 2012). However, selection and activation did not result in substantial fragmentation of TtCsm, because of the exceptional intrinsic stability of the complex. As the TtCsm assembly could not be disrupted by tandem MS, we sought to further explore the TtCsm structure using a variety of in solution dissociation experiments on intact TtCsm. As previously described for the Sulfolobus solfataricus Csm complex (Rouillon et al., 2013), we lowered the pH of the solvent used for electrospray and also added organic modifiers to the spray solution.
From the results of all mass spectrometric experiments, we obtained a plethora of masses for subcomplexes of TtCsm formed by elimination of individual subunits (Table S2). Using these data, the stoichiometry of the complex was narrowed down to two possible solutions, Csm11Csm23Csm36Csm42Csm51crRNA1 (model 1; Figure 4B) or Csm11Csm23Csm32Csm44Csm52crRNA1 (model 2). Model 1 contains multiple copies of Csm3 and has an expected mass of 427,040 Da, whereas model 2 shows a more diverse stoichiometry and has an expected mass of 427,462 Da (Table S2). Both of these models have masses that are in reasonable agreement with the measured mass of 426,998.1 Da. Although, it is not possible to distinguish between these two models on the basis of the MS data alone, we favor model 1 for two reasons. First, model 1 suggests that Csm3 (rather than Csm4) is present in multiple copies, in better agreement with the abundances observed by SDS/PAGE (Figure 2A). Second, the structural similarities with other Type I and Type III CRISPR-Cas complexes (Reeks et al., 2013; van der Oost et al., 2014) suggest that the Cas7-like Csm3 protein forms the backbone of the complex, as previously proposed (Hrle et al., 2013; Rouillon et al., 2013).
Enzymatic Activity of the TtCsm Complex
The Csm complex of Staphylococcus epidermidis has previously been shown in vivo to provide resistance against conjugative plasmids (Hatoum-Aslan et al., 2014; Marraffini and Sontheimer, 2008), and it has been proposed to rely on a DNA-targeting mechanism. However, its cleavage activity has not yet been demonstrated in vitro. In the present study, we performed in vitro Csm activity assays with radiolabeled single-stranded DNA (ssDNA) and double-stranded DNA (dsDNA) oligos, as well as plasmid targets complementary to abundant crRNAs in the complex. Despite numerous attempts in the presence of potential cofactors, such as Mg2+, Mn2+, Fe2+, Cu2+, Co2+, Ni2+, and Zn2+ (data not shown) and different topologies of the DNA target (oligos and plasmids), no specific activity could be detected in these assays (Figures S3A and S3B).
This prompted us to investigate the ribonuclease capabilities of the complex. A 50 nt, 5′ radiolabeled ssRNA substrate complementary to crRNA 4.5 (one of the most abundant Csm-bound crRNAs; Figure 3D; Figure S1A) was incubated with the Csm complex in a buffer containing different cofactors (Mg2+, Mn2+, Zn2+, and Cu2+) followed by denaturing gel analysis. Surprisingly, we observed Mg2+- and Mn2+-stimulated endoribonuclease activity, while Zn2+ and Cu2+ did not stimulate specific activity (Figure S3C). For this reason, we performed all subsequent in vitro activity assays in the presence of Mg2+. Multiple degradation products of different sizes accumulated, with 21, 15, and 9 nt being the most predominant sizes observed.
Experiments with different concentrations of TtCsm (Figure 5A) or a low Mg2+ concentration (Figure S3D) showed that the larger degradation products were short-lived, which suggests that the target RNA is cleaved in a stepwise fashion starting at its 3′ end. To establish that this activity was Csm specific, we tested three RNA substrates (Figure S3D). The results showed that only RNA targets complementary to crRNAs loaded in the Csm complex (crRNAs 4.5 and 11.3) were degraded, while an unrelated target RNA with no complementary to Csm-bound crRNAs remained unaffected. These results demonstrated that the endoribonuclease activity in our assays was specific for Csm and not due to any copurifying contaminant nuclease. The degradation products of the 4.5 and 11.3 target RNAs had similar sizes, indicating a sequence-independent cleavage mechanism.
Figure 5. In Vitro Activity Assays with the TtCsm Complex.
(A) A 5′-labeled ssRNA target complementary to crRNA 4.5 was incubated with different concentration of the purified, endogenous TtCsm complex in a buffer containing 2 mM Mg2+. Samples were analyzed by denaturing PAGE (20% AA, 7 M urea), followed by phosphorimaging.
(B) The ssRNA target was incubated with 100 nM of the endogenous Csm or Cmr complex for the indicated amount of time. OH, alkaline hydrolysis ladder of the 50 nt RNA target.
(C) RNA targets (complementary to crRNA 4.5) with base-pair-disrupting mutations at the indicated positions (also see Figure 6A) were incubated in the absence (“−”) or presence (“+”) of TtCsm. In order to visualize more (transient) degradation products, the assay was performed with a lower (10 μM) Mg2+ concentration. “WT” refers to the unmodified “wild-type” target RNA.
(D) RNA targets (complementary to crRNA 4.5) with mutated, base-pair-disrupting regions at the indicated positions (also see Figure 6A) were incubated in the absence (“−”) or presence (“+”) of TtCsm in a buffer containing 10 μM Mg2+. Additional Csm activity assays with different RNA or DNA substrates and different cofactors are provided in Figure S3. Discontinuous gel lanes are indicated by the dashed line.
The pattern of cleavage products and their 6 nt periodicity had a striking resemblance to those observed with the Type III-B TtCmr complex, i.e., 33, 27, 21, and 15 nt (Figure S3D; Figure 5A) (Staals et al., 2013). These results suggested that TtCsm, like TtCmr, cleaves complementary RNAs with a 5′ ruler-like mechanism, cleaving its target RNA at 6 nt intervals measured from the 5′ end and progressing from the 3′ end. To confirm this, we performed a similar activity assay with either TtCsm or TtCmr and followed the cleavage activity in time (Figure 5B). Although TtCsm appeared to favor the formation of the smaller degradation products more quickly than TtCmr, both complexes indeed had similar cleavage patterns. In further support, activity assays with a 3′ end-labeled 50 nt ssRNA target (complementary to crRNA 4.5) with either TtCsm or TtCmr resulted in the accumulation of a degradation product of predominantly 12 nt (Figure S3E). This indicates that the cleavage of the target RNA by both complexes is initiated at the 3′ end, followed by 6 nt interspaced, periodic cleavage events progressing toward its 5′ end (the 3′ labeling reaction adds one additional nucleotide at the 3′ end). These results indicate that TtCsm cleaves complementary target RNAs with a 5′ ruler-like mechanism analogous to TtCmr.
Since complementarity to the Csm-bound crRNAs is required for activity (Figure S3D), we analyzed the determinants of target recognition in more detail, by testing activity on target RNAs with single base-pair-disrupting mismatches at positions 1 to 7 (Figure 5C, also see Figure 6A). Only the mismatch at position 5 hampered cleavage at the site directly downstream of it, as is reflected by the less abundant 39 nt band. Apart from that, it was found that single nucleotide mismatches did not affect degradation of these target RNAs, showing that perfect complementarity is not required at these positions. These results are in contrast with the stringent requirement for base-pairing interactions at positions 1–5 and 7–8 in the Type I-E Cascade complex, which make up the “seed sequence” in this system (Fineran et al., 2014; Semenova et al., 2011; Wiedenheft et al., 2011b). Experiments with mismatching nucleotides in the remaining regions of the target RNAs showed that base pairing in these regions was also dispensable for overall target degradation, suggesting that RNA targeting by TtCsm is rather flexible (Figure 5D). Interestingly, similar to the mutation at position 5, we observed that cleavage directly downstream of the mismatched region was affected, whereas cleavages at other sites proceeded normally. The results of these experiments are schematically summarized in Figure 6B.
Figure 6. Model of Cleavage Activity of the TtCsm Complex.
(A) Schematic representation of the cleavage activity of the TtCsm complex.
(B) Schematic representation of the impact of base-pair-disrupting mutations in regions of the target RNA on activity (also see Figure 5D). Cleavages observed in this study are indicated by dotted lines. Skipped cleavage sites are indicated with red crosses.
Nucleotides in red indicate repeat-derived sequences of the crRNA.
Structural Analysis of the TtCsm Complex
We used single-particle electron microscopy (EM) and 3D reconstruction of negatively stained TtCsm complexes to gain structural information about this crRNP. Raw EM micrographs showed mono-dispersed, elongated particles with a length of ~220 Å in the largest direction (Figure 7A). Using the automated data collection program Leginon (Suloway et al., 2009) and the Appion image-processing pipeline (Lander et al., 2009), we recorded ~420 micrographs and extracted a stack of ~60,000 individual particle images. Reference-free 2D alignment and classification produced class averages with striking features, clearly showing a wormlike architecture of two intertwined filaments with a base resembling a foot (Figure 7B). Using the structure of E. coli Cascade (EMDB-5314) (Wiedenheft et al., 2011a) low-pass filtered to 60 Å as an initial model, we performed iterative projection-matching refinement to generate a final 3D electron density map at 17 Å (using the 0.5 Fourier Shell Correlation criterion) (Figure 7C; Figure S4). The 3D structure of TtCsm resembles a “sea worm” similar to TtCmr (Staals et al., 2013), composed of two intertwined filaments that terminate in a foot-like base. We segmented the TtCsm structure on the basis of visual inspection of the map, our mass spectrometry analyses, and comparison with our previous segmentation of TtCmr. Of the two filaments, one is clearly larger and appears to be composed of identical, repeating subunits. This larger filament is likely composed of Csm3 on the basis of homology to the Cmr4 backbone subunit of TtCmr and our native mass spectrometry analyses (Figure 4). The smaller filament is likely composed of Csm2 subunits on the basis of a similar analysis. The head of the complex is most likely capped by Csm5 and the footlike base contains Csm1 at the toe on the basis of comparison with our previous Cmr structure and homology between these respective subunits (Figures S5A and S5B).
Figure 7. Molecular Architecture of the T. thermophilus Csm Complex.
(A) Raw micrograph of negatively stained TtCsm complexes. Scale bar, 100 nm.
(B) Representative reference-free 2D class averages of TtCsm complexes. The width of the boxes is ~400 Å.
(C) Working segmentation of the TtCsm complex reconstruction at 17 Å resolution highlighting the “sea worm” architecture. Segmented regions are colored and labeled as Csm1 (purple), Csm2 (red), Csm3 (alternating light blue and gray), Csm4 (green), and Csm5 (orange).
At the current resolution, it is difficult to unambiguously segment and dock atomic structures into the EM density; nevertheless, we constructed a model that is consistent with our biochemical results above. Using Cas7 from the E. coli Cascade crystal structure (Jackson et al., 2014), the Csm3 crystal structure from M. kandleri (35% identity) (Hrle et al., 2013), and the PHYRE automatic fold recognition server (Kelley and Sternberg, 2009), structures for TtCsm3 were generated that fit reasonably well into the larger of the two filaments, especially for the three Csm3 subunits near the head (Figure S5C). The three Csm3 subunits near the foot appear to have substantial heterogeneity and/ or adopt a different helical geometry. We hypothesize that an additional Csm3 backbone subunit in a subpopulation of the purified TtCsm sample (isolated from Thermus) may contribute to the observed heterogeneity. Additionally, the crRNA-protein crosslinking experiments described above suggest a likely path of the crRNA along the docked TtCsm3 homology structures. In this model the crRNA would bind along the ~25 Å wide channel located between the Csm3 and Csm2 filaments and engage the conserved thumb domain of Csm3 (Figures S5D–S5H).
Interestingly, the two Csm3 structure predictions show handlike characteristics similar to those attributed to Cas7 in the recent crystal structures of Cascade (Jackson et al., 2014; Mulepati et al., 2014; Zhao et al., 2014) (Figures S5D–S5H). This thumb domain is particularly evident in the PHYRE model (Figure S5D) based on Cas7 (Jackson et al., 2014), whose crRNA binding regions are known from the structure of the intact Cascade. We generated a multiple sequence alignment for Csm3 using ClustalO and showed that most of the proposed crosslinked residues are highly conserved (Figure S6). Further structural studies will be necessary to verify the accuracy of our model.
DISCUSSION
By studying the native Type III-A Csm complex, purified from T. thermophilus, we have revealed several important features of this CRISPR-Cas system, including its composition, structure, and activity in vitro.
The TtCsm complex consists of five Csm proteins (Csm1–Csm5) and one crRNA of variable sizes. Our RNA-seq analysis revealed that the Csm-bound crRNAs contain an invariable 5′ handle, 5′-AUUGCGAC, which is consistent with the primary, Cas6-mediated cleavage of the pre-crRNA at this position (Carte et al., 2008). The presence of the tag in the mature crRNA may play a role in initiating RNP complex formation, with the tag being specifically recognized and bound by one of the protein subunits of the complex. In comparison with other CRISPR-Cas complexes, it is most likely that the Cas5-like Csm4 subunit might perform this function (van der Oost et al., 2014; Wiedenheft et al., 2011a). The fact that mature Csm-bound crRNAs have a variable 3′ end hints at a processing step for trimming these crRNAs that appears to be a typical feature for Type III CRISPR-Cas complexes (Hale et al., 2009; Hatoum-Aslan et al., 2011; Rouillon et al., 2013; Staals et al., 2013; Zhang et al., 2012). Interestingly, we found more extensive 3′ end processing of Csm-bound crRNAs with a noncanonical 5′ handle (e.g., crRNAs from CRISPR6 and CRISPR7). This indicates that proper crRNP complex assembly (including recognition of the 5′ tag sequence) is directly coupled to mature 3′ end crRNA formation and suggests that 3′ processing occurs when bound within the surveillance complex. Hence, failure to properly assemble the complex (in the case of a noncanonical tag sequence) would result in crRNAs with more exposed 3′ ends that are susceptible to 3′ trimming (Brendel et al., 2014). These observations are consistent with a model where the sizes of mature crRNAs are determined by the dimensions of the complex. Indeed, TtCsm crRNAs (45/53 nt) were somewhat longer than the TtCmr-bound ones (40/46 nt), because of a complex with a slightly more extended backbone and therefore more protection of their 3′ ends (Figure S7). Our model where Csm3 forms the backbone of the complex (model 1) is therefore in good agreement with the previously reported 6 nt binding periodicity of this protein (Hatoum-Aslan et al., 2013).
The crRNA content of the TtCsm complex showed a remarkable bias for particular spacers, both in terms of variety and abundance. These results were strikingly similar to our previous observations with the TtCmr complex (Staals et al., 2013) (Figure S1B), suggesting that crRNAs can be shared among complexes from different CRISPR-Cas subtypes. In stark contrast, Sulfolobus solfataricus Csm and Cmr complexes interact with crRNAs derived from different CRISPR arrays (Rouillon et al., 2013; Zhang et al., 2012). This sorting phenomenon might be accounted for by the differences in repeat specificity of these complexes or the different repertoire of Cas6 paralogues in these species and the handover mechanisms of these paralogues (Niewoehner et al., 2014; Sokolowski et al., 2014). It is possible that bias in spacer selection occurs at the level of primary processing by Cas6. Indeed, in the Cascade-associated Cas6e/6f, single turnover processing of pre-crRNAs results in delivery to the appropriate crRNP complexes (Niewoehner et al., 2014; Reeks et al., 2013; van der Oost et al., 2014).
Type III-A CRISPR-Cas systems have been implicated in providing protection against plasmid conjugation and transformation in vivo (Hatoum-Aslan et al., 2014; Marraffini and Sontheimer, 2008, 2010). The results of these studies were interpreted as evidence for a DNA-targeting mechanism, although no confirmation of this activity has yet been obtained in vitro. Interestingly, the in vitro analyses presented here reveal that (under the tested conditions) the TtCsm complex harbors in vitro RNase rather than DNase activity. TtCsm catalyzes the cleavage of complementary target RNAs with a 5′ ruler-like endoribonuclease mechanism similar to that of Type III-B systems (Hale et al., 2009; Staals et al., 2013). This ruler mechanism defines six cleavage sites at the target RNA, each separated by 6 nt distances. Analogous to the Type I Cascade complex, the target RNA is most likely base pairing with the crRNA guide along the backbone of the TtCsm, a model that is supported by the specificity of the TtCsm complex for complementary RNA targets and the extensive crRNA-Csm3 crosslinks observed in this study. We therefore hypothesize that the Cas7-like subunits, constituting the backbone of these Type III complexes (Csm3 and Cmr4 in Type III-A and Type III-B, respectively), harbor the active sites. Our in vitro activity assay with partially mismatching target RNAs showed that adjacent active sites were impaired when base pairing of the guide’s upstream nucleotide(s) was disrupted. Nevertheless, these partially mismatching target RNAs were still degraded at the more distantly located active sites, which indicates that the TtCsm crRNA guides lack a defined seed sequence as is present in at least a subset of the Type I systems (Maier et al., 2013; Semenova et al., 2011; Wiedenheft et al., 2011b). Although a more detailed analysis on the boundaries of target recognition is still required, these results indicate that RNA targeting by Type III-A systems is quite flexible. This flexibility could also explain why target interrogation in Type III systems does not rely on a PAM (as in Type I systems), since target versus nontarget discrimination should be dispensable for RNA. Whether or not RNA targeting relies on other motifs outside of the protospacer region is an interesting task for further investigations.
The intriguing discrepancy between the apparent DNA-targeting activities of the Csm complex in vivo (Marraffini and Sontheimer, 2008) and its RNA targeting activity in vitro (this study) opens up the question of whether DNA, RNA, or both are the natural targets of the Type III-A system (and possibly of the III-B system). Several recent studies may provide pieces of the puzzle. (1) A Type III system of Sulfolobus islandicus has been reported to result in degradation of plasmid DNA. Interestingly, DNA interference appeared to be dependent on both transcription of the target sequence and on the presence of Csx1 (Csm6) (Deng et al., 2013). (2) Csx1/Csm6 are members of a highly variable protein family sharing a CARF domain (CRISPR-Cas-associated Rossmann fold) and are strongly associated with Type III CRISPR-Cas systems (Makarova et al., 2014). Furthermore, Csx1 of Pyrococcus furiosus has been demonstrated to associate with both dsDNA and dsRNA (Kim et al., 2013). (3) The large subunit of the Csm complex (Cas10/Csm1) of Staphylococcus epidermidis degrades single-stranded DNA and RNA in vitro (Ramia et al., 2014). (4) In S. epidermidis, deletion of the csm6 gene (encoding a Csx1 homolog) and mutations of conserved residues in the Palm polymerase domain of Cas10/Csm1 prevented CRISPR immunity in vivo, without affecting either complex formation or crRNA production, strongly suggesting their involvement in target degradation (Hatoum-Aslan et al., 2014). (5) Given the clustering of the gene encoding Csx1/Csm6 with the five genes encoding the Csm complex (Figure 1) (Makarova et al., 2014), it is possible the Type III-A system uses a similar (Csm6/Csx1-dependent and transcription-dependent) DNA-targeting mechanism as well. Indeed, during the revision of this manuscript, a new study by the Marraffini group showed that interference by the Type III-A Csm complex in S. epidermidis proceeds in a transcription-dependent fashion, which was shown to confer resistance against lytic viruses (Goldberg et al., 2014). These observations together with the in vitro RNase activities from this study strongly suggest a role of the Type III-A system in degrading transcriptionally active MGEs. Future in vivo and in vitro analyses are required to fully understand how this intriguing CRISPR-Cas variant functions to protect its host from MGE invasions.
EXPERIMENTAL PROCEDURES
Purification of the Csm Complex and Identification of the Csm Proteins
The TtCsm complex was from a (His)6-tagged Csm complex expressing T. thermophilus HB8 strain, as described in detail in the Supplemental Experimental Procedures.
RNA-Seq Analysis
crRNAs were purified and sequenced essentially as described previously (Staals et al., 2013), the details of which can be read in the Supplemental Experimental Procedures.
In Vitro Activity Assays
In vitro activity assays were performed essentially as described previously (Staals et al., 2013), details are described in the Supplemental Experimental Procedures.
UV Crosslinking and Identification of crRNA-Protein Interactions by LC-MS/MS
Protein-RNA crosslinking was performed using UV irradiation at 254 nm and the crosslinked peptides were enriched as described previously (Kramer et al., 2014; Schmidt et al., 2012). The sample was analyzed by LC-MS/MS essential according to Kramer et al. (2011). A more detailed description is provided in the Supplemental Experimental Procedures.
Native Mass Spectrometry
Native MS was performed as described in detail previously (van Duijn et al., 2012). Details about measurements of TtCsm subcomplexes and individual TtCsm proteins under denaturing conditions can be found in the Supplemental Experimental Procedures.
Single Particle Electron Microscopy and Analysis
TtCsm complexes diluted to ~25–50 nM were applied immediately to a glow-discharged continuous-carbon grid and then negatively stained with four consecutive droplets of 2% uranyl acetate. The sample was examined using a Technai-20 electron microscope equipped with a field emission gun and operated at 120 kV acceleration voltage. Image processing and 3D reconstruction were performed as described in the Supplemental Experimental Procedures.
Supplementary Material
Acknowledgments
The authors thank Virgis Siksnys (Vilnius) for sharing data before publication. We thank Aimi Osaki for construction of the recombinant T. thermophilus strain. R.H.J.S. was supported by the University of Otago’s Division of Health Sciences Career Development postdoctoral fellowships. This work was supported by an ALW grant (820.02.003 to J.v.d.O.) from the Netherlands Organization for Scientific Research (NWO) and by JSPS KAKENHI grant no.25440013 (to A.S.). A.B. and A.J.R.H. were supported by the Netherlands Proteomics Centre. E.N. and J.A.D. are Howard Hughes Medical Institute Investigators.
Footnotes
ACCESSION NUMBERS
The EM-derived density map of the TtCsm complex has been deposited in the Electron Microscopy Data Bank under accession number EMD-6122. The RNA-seq data set of the Csm-bound crRNAs has been deposited in the European Nucleotide Archive database under accession number PRJEB7461.
Supplemental Information includes Supplemental Experimental Procedures, seven figures, and three tables and can be found with this article online at http://dx.doi.org/10.1016/j.molcel.2014.10.005.
AUTHOR CONTRIBUTIONS
A.S., K. Sakamoto, T.S., N.D., S.Y., and J.v.d.O. contributed to the initial design of this study. K. Sakamoto, S.Y., and A.S. contributed to the production and purification of the Csm complex. T.S. and N.D. contributed to the MS identification of the Csm proteins. R.H.J.S., J.J.K., and P.J.S. contributed to RNA-seq analysis. R.H.J.S., Y.Z., K.V., N.N., and M.V. contributed to biochemical analyses. D.W.T., J.E.K., E.N., and J.A.D. contributed to electron microscopy and 3D structure determination. K. Sharma and H.U. contributed to MS-based protein-RNA crosslinking. A.B. and A.J.R.H. contributed to MS-based stoichiometry analysis. R.H.J.S., Y.Z., D.W.T., K. Sharma, A.B., A.S., and J.v.d.O., with input from all, contributed to the writing of manuscript.
References
- Agari Y, Sakamoto K, Tamakoshi M, Oshima T, Kuramitsu S, Shinkai A. Transcription profile of Thermus thermophilus CRISPR systems after phage infection. J Mol Biol. 2010;395:270–281. doi: 10.1016/j.jmb.2009.10.057. [DOI] [PubMed] [Google Scholar]
- Arslan Z, Hermanns V, Wurm R, Wagner R, Pul Ü. Detection and characterization of spacer integration intermediates in type I-E CRISPR-Cas system. Nucleic Acids Res. 2014;42:7884–7893. doi: 10.1093/nar/gku510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barrangou R, Marraffini LA. CRISPR-Cas systems: prokaryotes upgrade to adaptive immunity. Mol Cell. 2014;54:234–244. doi: 10.1016/j.molcel.2014.03.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barrangou R, Fremaux C, Deveau H, Richards M, Boyaval P, Moineau S, Romero DA, Horvath P. CRISPR provides acquired resistance against viruses in prokaryotes. Science. 2007;315:1709–1712. doi: 10.1126/science.1138140. [DOI] [PubMed] [Google Scholar]
- Bradford MM. A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal Biochem. 1976;72:248–254. doi: 10.1016/0003-2697(76)90527-3. [DOI] [PubMed] [Google Scholar]
- Brendel J, Stoll B, Lange SJ, Sharma K, Lenz C, Stachler AE, Maier LK, Richter H, Nickel L, Schmitz RA, et al. A complex of Cas proteins 5, 6, and 7 is required for the biogenesis and stability of clustered regularly interspaced short palindromic repeats (crispr)-derived rnas (crrnas) in Haloferax volcanii. J Biol Chem. 2014;289:7164–7177. doi: 10.1074/jbc.M113.508184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brouns SJ, Jore MM, Lundgren M, Westra ER, Slijkhuis RJ, Snijders AP, Dickman MJ, Makarova KS, Koonin EV, van der Oost J. Small CRISPR RNAs guide antiviral defense in prokaryotes. Science. 2008;321:960–964. doi: 10.1126/science.1159689. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carte J, Wang R, Li H, Terns RM, Terns MP. Cas6 is an endoribonuclease that generates guide RNAs for invader defense in prokaryotes. Genes Dev. 2008;22:3489–3496. doi: 10.1101/gad.1742908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carte J, Christopher RT, Smith JT, Olson S, Barrangou R, Moineau S, Glover CV, 3rd, Graveley BR, Terns RM, Terns MP. The three major types of CRISPR-Cas systems function independently in CRISPR RNA biogenesis in Streptococcus thermophilus. Mol Microbiol. 2014;93:98–112. doi: 10.1111/mmi.12644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deltcheva E, Chylinski K, Sharma CM, Gonzales K, Chao Y, Pirzada ZA, Eckert MR, Vogel J, Charpentier E. CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III. Nature. 2011;471:602–607. doi: 10.1038/nature09886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deng L, Garrett RA, Shah SA, Peng X, She Q. A novel interference mechanism by a type IIIB CRISPR-Cmr module in Sulfolobus. Mol Microbiol. 2013;87:1088–1099. doi: 10.1111/mmi.12152. [DOI] [PubMed] [Google Scholar]
- Fineran PC, Gerritzen MJ, Suárez-Diez M, Künne T, Boekhorst J, van Hijum SA, Staals RH, Brouns SJ. Degenerate target sites mediate rapid primed CRISPR adaptation. Proc Natl Acad Sci USA. 2014;111:E1629–E1638. doi: 10.1073/pnas.1400071111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gasiunas G, Barrangou R, Horvath P, Siksnys V. Cas9-crRNA ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc Natl Acad Sci USA. 2012;109:E2579–E2586. doi: 10.1073/pnas.1208507109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gasiunas G, Sinkunas T, Siksnys V. Molecular mechanisms of CRISPR-mediated microbial immunity. Cell Mol Life Sci. 2014;71:449–465. doi: 10.1007/s00018-013-1438-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goldberg GW, Jiang W, Bikard D, Marraffini LA. Conditional tolerance of temperate phages via transcription-dependent CRISPR-Cas targeting. Nature Published online August. 2014;31:2014. doi: 10.1038/nature13637. http://dx.doi.org/10.1038/nature13637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grissa I, Vergnaud G, Pourcel C. The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats. BMC Bioinformatics. 2007;8:172. doi: 10.1186/1471-2105-8-172. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haft DH, Selengut J, Mongodin EF, Nelson KE. A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS Comput Biol. 2005;1:e60. doi: 10.1371/journal.pcbi.0010060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hale CR, Zhao P, Olson S, Duff MO, Graveley BR, Wells L, Terns RM, Terns MP. RNA-guided RNA cleavage by a CRISPR RNA-Cas protein complex. Cell. 2009;139:945–956. doi: 10.1016/j.cell.2009.07.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hale CR, Majumdar S, Elmore J, Pfister N, Compton M, Olson S, Resch AM, Glover CV, 3rd, Graveley BR, Terns RM, Terns MP. Essential features and rational design of CRISPR RNAs that function with the Cas RAMP module complex to cleave RNAs. Mol Cell. 2012;45:292–302. doi: 10.1016/j.molcel.2011.10.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hatoum-Aslan A, Maniv I, Marraffini LA. Mature clustered, regularly interspaced, short palindromic repeats RNA (crRNA) length is measured by a ruler mechanism anchored at the precursor processing site. Proc Natl Acad Sci USA. 2011;108:21218–21222. doi: 10.1073/pnas.1112832108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hatoum-Aslan A, Samai P, Maniv I, Jiang W, Marraffini LA. A ruler protein in a complex for antiviral defense determines the length of small interfering CRISPR RNAs. J Biol Chem. 2013;288:27888–27897. doi: 10.1074/jbc.M113.499244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hatoum-Aslan A, Maniv I, Samai P, Marraffini LA. Genetic characterization of antiplasmid immunity through a type III-A CRISPR-Cas system. J Bacteriol. 2014;196:310–317. doi: 10.1128/JB.01130-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hrle A, Su AA, Ebert J, Benda C, Randau L, Conti E. Structure and RNA-binding properties of the type III-A CRISPR-associated protein Csm3. RNA Biol. 2013;10:1670–1678. doi: 10.4161/rna.26500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jackson RN, Golden SM, van Erp PB, Carter J, Westra ER, Brouns SJ, van der Oost J, Terwilliger TC, Read RJ, Wiedenheft B. Structural biology. Crystal structure of the CRISPR RNA-guided surveillance complex from Escherichia coli. Science. 2014;345:1473–1479. doi: 10.1126/science.1256328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jore MM, Lundgren M, van Duijn E, Bultema JB, Westra ER, Waghmare SP, Wiedenheft B, Pul U, Wurm R, Wagner R, et al. Structural basis for CRISPR RNA-guided DNA recognition by Cascade. Nat Struct Mol Biol. 2011;18:529–536. doi: 10.1038/nsmb.2019. [DOI] [PubMed] [Google Scholar]
- Juranek S, Eban T, Altuvia Y, Brown M, Morozov P, Tuschl T, Margalit H. A genome-wide view of the expression and processing patterns of Thermus thermophilus HB8 CRISPR RNAs. RNA. 2012;18:783–794. doi: 10.1261/rna.031468.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kelley LA, Sternberg MJ. Protein structure prediction on the Web: a case study using the Phyre server. Nat Protoc. 2009;4:363–371. doi: 10.1038/nprot.2009.2. [DOI] [PubMed] [Google Scholar]
- Kim YK, Kim YG, Oh BH. Crystal structure and nucleic acid-binding activity of the CRISPR-associated protein Csx1 of Pyrococcus furiosus. Proteins. 2013;81:261–270. doi: 10.1002/prot.24183. [DOI] [PubMed] [Google Scholar]
- Koonin EV, Makarova KS. CRISPR-Cas: evolution of an RNA-based adaptive immunity system in prokaryotes. RNA Biol. 2013;10:679–686. doi: 10.4161/rna.24022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kramer K, Hummel P, Hsiao HH, Luo X, Wahl M, Urlaub H. Mass-spectrometric analysis of proteins cross-linked to 4-thio-uracil- and 5-bromo-uracil-substituted RNA. Int J Mass Spectrom. 2011;304:184–194. [Google Scholar]
- Kramer K, Sachsenberg T, Beckmann BM, Qamar S, Boon KL, Hentze MW, Kohlbacher O, Urlaub H. Photo-cross-linking and high-resolution mass spectrometry for assignment of RNA-binding sites in RNA-binding proteins. Nat Methods. 2014;11:1064–1070. doi: 10.1038/nmeth.3092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lander GC, Stagg SM, Voss NR, Cheng A, Fellmann D, Pulokas J, Yoshioka C, Irving C, Mulder A, Lau PW, et al. Appion: an integrated, database-driven pipeline to facilitate EM image processing. J Struct Biol. 2009;166:95–102. doi: 10.1016/j.jsb.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maier LK, Lange SJ, Stoll B, Haas KA, Fischer S, Fischer E, Duchardt-Ferner E, Wöhnert J, Backofen R, Marchfelder A. Essential requirements for the detection and degradation of invaders by the Haloferax volcanii CRISPR/Cas system I-B. RNA Biol. 2013;10:865–874. doi: 10.4161/rna.24282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Makarova KS, Haft DH, Barrangou R, Brouns SJ, Charpentier E, Horvath P, Moineau S, Mojica FJ, Wolf YI, Yakunin AF, et al. Evolution and classification of the CRISPR-Cas systems. Nat Rev Microbiol. 2011;9:467–477. doi: 10.1038/nrmicro2577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Makarova KS, Anantharaman V, Grishin NV, Koonin EV, Aravind L. CARF and WYL domains: ligand-binding regulators of prokaryotic defense systems. Front Genet. 2014;5:102. doi: 10.3389/fgene.2014.00102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marraffini LA, Sontheimer EJ. CRISPR interference limits horizontal gene transfer in staphylococci by targeting DNA. Science. 2008;322:1843–1845. doi: 10.1126/science.1165771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marraffini LA, Sontheimer EJ. Self versus non-self discrimination during CRISPR RNA-directed immunity. Nature. 2010;463:568–571. doi: 10.1038/nature08703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mulepati S, Héroux A, Bailey S. Structural biology. Crystal structure of a CRISPR RNA-guided surveillance complex bound to a ssDNA target. Science. 2014;345:1479–1484. doi: 10.1126/science.1256996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Niewoehner O, Jinek M, Doudna JA. Evolution of CRISPR RNA recognition and processing by Cas6 endonucleases. Nucleic Acids Res. 2014;42:1341–1353. doi: 10.1093/nar/gkt922. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramia NF, Tang L, Cocozaki AI, Li H. Staphylococcus epidermidis Csm1 is a 3′ –5′ exonuclease. Nucleic Acids Res. 2014;42:1129–1138. doi: 10.1093/nar/gkt914. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reeks J, Naismith JH, White MF. CRISPR interference: a structural perspective. Biochem J. 2013;453:155–166. doi: 10.1042/BJ20130316. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richter H, Zoephel J, Schermuly J, Maticzka D, Backofen R, Randau L. Characterization of CRISPR RNA processingin Clostridium thermocellum and Methanococcus maripaludis. Nucleic Acids Res. 2012;40:9887–9896. doi: 10.1093/nar/gks737. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rouillon C, Zhou M, Zhang J, Politis A, Beilsten-Edmands V, Cannone G, Graham S, Robinson CV, Spagnolo L, White MF. Structure of the CRISPR interference complex CSM reveals key similarities with cascade. Mol Cell. 2013;52:124–134. doi: 10.1016/j.molcel.2013.08.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmidt C, Kramer K, Urlaub H. Investigation of protein-RNA interactions by mass spectrometry—Techniques and applications. J Proteomics. 2012;75:3478–3494. doi: 10.1016/j.jprot.2012.04.030. [DOI] [PubMed] [Google Scholar]
- Scholz I, Lange SJ, Hein S, Hess WR, Backofen R. CRISPR-Cas systems in the cyanobacterium Synechocystis sp PCC6803 exhibit distinct processing pathways involving at least two Cas6 and a Cmr2 protein. PLoS ONE. 2013;8:e56470. doi: 10.1371/journal.pone.0056470. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Semenova E, Jore MM, Datsenko KA, Semenova A, Westra ER, Wanner B, van der Oost J, Brouns SJ, Severinov K. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. Proc Natl Acad Sci USA. 2011;108:10098–10103. doi: 10.1073/pnas.1104144108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sokolowski RD, Graham S, White MF. Cas6 specificity and CRISPR RNA loading in a complex CRISPR-Cas system. Nucleic Acids Res. 2014;42:6532–6541. doi: 10.1093/nar/gku308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spilman M, Cocozaki A, Hale C, Shao Y, Ramia N, Terns R, Terns M, Li H, Stagg S. Structure of an RNA silencing complex of the CRISPR-Cas immune system. Mol Cell. 2013;52:146–152. doi: 10.1016/j.molcel.2013.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Staals RH, Agari Y, Maki-Yonekura S, Zhu Y, Taylor DW, van Duijn E, Barendregt A, Vlot M, Koehorst JJ, Sakamoto K, et al. Structure and activity of the RNA-targeting Type III-B CRISPR-Cas complex of Thermus thermophilus. Mol Cell. 2013;52:135–145. doi: 10.1016/j.molcel.2013.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suloway C, Shi J, Cheng A, Pulokas J, Carragher B, Potter CS, Zheng SQ, Agard DA, Jensen GJ. Fully automated, sequential tilt-series acquisition with Leginon. J Struct Biol. 2009;167:11–18. doi: 10.1016/j.jsb.2009.03.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Terns RM, Terns MP. CRISPR-based technologies: prokaryotic defense weapons repurposed. Trends Genet. 2014;30:111–118. doi: 10.1016/j.tig.2014.01.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van der Oost J, Westra ER, Jackson RN, Wiedenheft B. Unravelling the structural and mechanistic basis of CRISPR-Cas systems. Nat Rev Microbiol. 2014;12:479–492. doi: 10.1038/nrmicro3279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Duijn E, Barbu IM, Barendregt A, Jore MM, Wiedenheft B, Lundgren M, Westra ER, Brouns SJ, Doudna JA, van der Oost J, Heck AJ. Native tandem and ion mobility mass spectrometry highlight structural and modular similarities in clustered-regularly-interspaced shot-palindromic-repeats (CRISPR)-associated protein complexes from Escherichia coli and Pseudomonas aeruginosa. Mol Cell Proteomics. 2012;11:1430–1441. doi: 10.1074/mcp.M112.020263. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Westra ER, van Erp PB, Künne T, Wong SP, Staals RH, Seegers CL, Bollen S, Jore MM, Semenova E, Severinov K, et al. CRISPR immunity relies on the consecutive binding and degradation of negatively supercoiled invader DNA by Cascade and Cas3. Mol Cell. 2012;46:595–605. doi: 10.1016/j.molcel.2012.03.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Westra ER, Semenova E, Datsenko KA, Jackson RN, Wiedenheft B, Severinov K, Brouns SJ. Type I-E CRISPR-cas systems discriminate target from non-target DNA through base pairing-independent PAM recognition. PLoS Genet. 2013;9:e1003742. doi: 10.1371/journal.pgen.1003742. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiedenheft B, Lander GC, Zhou K, Jore MM, Brouns SJ, van der Oost J, Doudna JA, Nogales E. Structures of the RNA-guided surveillance complex from a bacterial immune system. Nature. 2011a;477:486–489. doi: 10.1038/nature10402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiedenheft B, van Duijn E, Bultema JB, Waghmare SP, Zhou K, Barendregt A, Westphal W, Heck AJ, Boekema EJ, Dickman MJ, Doudna JA. RNA-guided complex from a bacterial immune system enhances target recognition through seed sequence interactions. Proc Natl Acad Sci USA. 2011b;108:10092–10097. doi: 10.1073/pnas.1102716108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yosef I, Goren MG, Qimron U. Proteins and DNA elements essential for the CRISPR adaptation process in Escherichia coli. Nucleic Acids Res. 2012;40:5569–5576. doi: 10.1093/nar/gks216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang J, Rouillon C, Kerou M, Reeks J, Brugger K, Graham S, Reimann J, Cannone G, Liu H, Albers SV, et al. Structure and mechanism of the CMR complex for CRISPR-mediated antiviral immunity. Mol Cell. 2012;45:303–313. doi: 10.1016/j.molcel.2011.12.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao H, Sheng G, Wang J, Wang M, Bunkoczi G, Gong W, Wei Z, Wang Y. Crystal structure of the RNA-guided immune surveillance Cascade complex in Escherichia coli. Nature. 2014 doi: 10.1038/nature13733. Published online August 12, 2014. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.