Abstract
Prokaryotes have developed several strategies to defend themselves against foreign genetic elements. One of those defense mechanisms is the recently identified CRISPR/Cas system, which is used by approximately half of all bacterial and almost all archaeal organisms. The CRISPR/Cas system differs from the other defense strategies because it is adaptive, hereditary and it recognizes the invader by a sequence specific mechanism. To identify the invading foreign nucleic acid, a crRNA that matches the invader DNA is required, as well as a short sequence motif called protospacer adjacent motif (PAM). We recently identified the PAM sequences for the halophilic archaeon Haloferax volcanii, and found that several motifs were active in triggering the defense reaction. In contrast, selection of protospacers from the invader seems to be based on fewer PAM sequences, as evidenced by comparative sequence data. This suggests that the selection of protospacers has stricter requirements than the defense reaction. Comparison of CRISPR-repeat sequences carried by sequenced haloarchaea revealed that in more than half of the species, the repeat sequence is conserved and that they have the same CRISPR/Cas type.
Keywords: Haloferax volcanii, CRISPR/Cas, PAM, archaea, prokaryotic immune system, haloarchaea
The Prokaryotic Defense System
The CRISPR/Cas system is one of several defense systems that prokaryotes can use to prevent invasion by foreign genetic elements (for a more detailed description see recent reviews1-6). The function and significance of this system was only recently discovered, and it differs from other known defense systems because it is heritable, can adapt to new invaders and it is sequence specific. The system uses a set of proteins and short RNA molecules, termed Cas proteins and crRNA, respectively. The crRNAs are processed from a longer pre-crRNA that is encoded in the CRISPR locus, a peculiar series of short, directly repeated sequences between which are unique spacer sequences (Fig. 1). The latter sequences originate from previous (and unsuccessful) invading elements, which were degraded. This was accompanied by inserting a short piece of sequence into the CRISPR locus. Thus the CRISPR locus is a memory of previously encountered invaders to which the cell has adapted and is immune.
Immune defense proceeds in three stages: (1) adaptation, (2) expression and (3) interference. In the first stage, the nucleic acid of the invading element enters the cell, and is immediately recognized as a foreign element. A piece of the invader DNA, termed protospacer, is selected and then integrated into the CRISPR locus as a new spacer (Fig. 2). Note that the sequence is called protospacer as long as it is still part of the invader. As soon as it is integrated into the CRISPR locus it is called spacer. Selection as a new spacer depends on the presence of a certain neighboring sequence, the protospacer adjacent motif (PAM).7 This has been shown for CRISPR/Cas type I and type II systems.
This motif is not only important for spacer selection but also for accurately targeting the defense reaction.7-10
In the second stage of the defense reaction, the CRISPR locus is expressed, generating a pre-crRNA which is subsequently processed to short crRNAs, each of which is specific for a single invader (Fig. 1). Together with the Cas proteins, this crRNA recognizes the invader in the third stage of the defense reaction. The spacer sequence of the crRNA base pairs with the invader sequence from which it was derived, rendering the defense sequence specific.
CRISPR/Cas System of Hfx. volcanii
The CRISPR/Cas system of Haloferax consists of eight Cas proteins and three CRISPR RNAs, and phylogenetically they belong to the type I-B group of CRISPR/Cas systems.7,11 We could show that all three CRISPR RNAs are constitutively expressed and processed,11 indicating that although the strain has been in the laboratory for more than 30 y and probably did not encounter any invaders during that time, the defense system has remained active. Comparison of the spacer sequences of the three Haloferax CRISPR loci to sequences deposited in the public sequence databases showed only two matches. One spacer matched to the Haloferax genome within an annotated open-reading frame (HVO_0372) encoding a protein of unknown function. The 5′ part of the spacer was identical to the genomic sequence, but the 3′ part showed nine mismatches, which is probably sufficient to prevent autoimmune targeting by the CRISPR/Cas system. The second spacer was similar to an environmental sequence recovered from a salt lake in Australia (Lake Tyrrell) and differed at only four positions, distributed along the sequence. The 5′ part of the sequence matches perfectly, and in E. coli it has been shown that a perfect match in the 5′ sequence (termed the seed sequence) is essential for recognition and target degradation.10,12 It is likely then, that invading elements containing this sequence would be targeted by the CRISPR/Cas system.
The low number of spacer matches to known sequences probably reflects that relatively few haloarchaeal viruses have been isolated and sequenced. Another factor is that the DS2 strain was isolated from the Dead Sea in 1974. Viral populations would have changed in the 40 y since, making it unlikely that the original matching sequences would now be common enough to have been recovered and sequenced.
Several Motifs Direct Degradation
To investigate the Haloferax defense system, we developed a plasmid invader system similar to the one described for Sulfolobus.9 The invader plasmid contained a piece of invader DNA, and an adjacent motif marking the DNA as invader – the so-called PAM sequence. As invader DNA, we chose a spacer sequence included in the Haloferax CRISPR loci, and this was cloned into a Haloferax shuttle vector (Fig. 3A). The PAM sequences used by Haloferax (or by other haloarchaea) were not known before this study, and it was also unknown if they are located upstream or downstream of the protospacer. So we tested all possible di- and trinucleotide combinations (PAM sequences are generally 2–5 nucleotides long) (Fig. 3A). In addition, the plasmid contained a marker gene allowing growth without uracil, so only cells carrying the plasmid are able to grow on selective media. If the defense mechanism is active against the plasmid, then it is destroyed (together with the selection marker) and such cells cannot grow on selective medium, which results in a severe (about 100-fold) reduction of transformation efficiency. Using this approach six different trinucleotide sequences were identified that were active in triggering the defense response, which is currently the highest number of PAMs identified for a single CRISPR repeat group. In addition, we could show that this motif has to be located upstream of the protospacer sequence to activate the defense reaction (Fig. 3B).
While the majority of cells challenged with these six types of invader plasmids were unable to grow without uracil, a low level of background colonies were observed. When examples of these were analyzed, the majority was found to have mutations in, or complete deletions of the cas gene cluster, thereby inactivating the defense system and allowing the cell to maintain the plasmid.
Conservation of CRISPR/Cas Types in Haloarchaea
To gain more insight into the PAM sequences used for adaptation, we searched for other haloarchaea for which recent metagenomic data were available. We used the spacer sequences encoded in the Haloquadratum walsbyi CRISPR loci to look for matches in the databases. Eight matches were found and the PAM sequences obtained for them were in seven cases TTC, which is identical to one of the six PAM sequences we found experimentally for Haloferax. Like Haloferax, Hqr. walsbyi contains a CRISPR/Cas type I-B system, with CRISPR repeat sequences that are very similar to those of Haloferax (Fig. 4). Further comparison with other haloarchaea showed that those that are available in the CRISPR database (crispr.u-psud.fr/crispr/, July 2012) and which encode Cas proteins all belong to the type I-B CRISPR/Cas group. BLAST searches with the Haloferax repeat sequence show that in 20 of the 32 haloarchaeal genomes currently deposited in the NBCI database (www.ncbi.nlm.nih.gov/sutils/genom_table.cgi, August 2012) and in the JGI IMG database (http://www.jgi.doe.gov/), the repeat sequence is well conserved, with only one to five mismatches between the repeats from the different haloarchaeal organisms (Fig. 4). A similar observation has recently been made by Lynch et al.13 Since PAM sequences have been reported to be connected to the repeat sequence and to the CRISPR/Cas type3 it is reasonable to expect that these haloarchaeal sequences require the same PAM sequence. Whether this is true is a question to be addressed in future studies.
Speculation
In our study we analyzed the requirements for the CRISPR/Cas defense reaction and identified six different PAM sequences that were able to trigger this reaction. Such a high number of permissible PAM sequences could be advantageous when defending against related invading elements, as it tolerates individual mutations as well as clonal divergence, making the system more broadly effective. This makes sense because the prokaryotic defense mechanism will remain active against virus mutants that otherwise could avoid immune recognition. In contrast, the few data we collected in silico concerning PAM motifs for Haloquadratum walsbyi revealed only two PAM motifs. Since Hqr. walsbyi and Hfx. volcanii have very similar repeat sequences and belong to the same CRISPR/Cas type they might have similar PAM requirements. Taken together, the evidence suggests that the adaptation step is more restrictive in terms of its PAM requirements compared with the interference step.
In Streptococcus thermophilus (CRISPR/Cas type II), a similar recognition of several different PAM sequences on invader plasmids was observed.2 The PAM requirements of CRISPR/Cas type III systems have not yet been reported, but it has been shown in Staphylococcus epidermidis, that the protospacer adjacent sequence of this system must be different from the repeat sequence located upstream of the spacer in the CRISPR locus, so ensuring differentiation between self DNA (CRISPR locus) and the foreign genetic element.14
A clearer picture of how this immune reaction operates will be revealed as more data are collected, from all of the known CRISPR/Cas systems.
Acknowledgments
This work was supported by the Deutsche Forschungsgemeinschaft, in the frame of the priority program, “Unraveling the prokaryotic immune system” (FOR1680). We wish to thank all members of the Research Unit for helpful discussions.
Glossary
Abbreviations:
- CRISPR
Cas, PAM, crRNA
- NOTE
Author, please provide abbreviations
Footnotes
Previously published online: www.landesbioscience.com/journals/mge/article/22530
References
- 1.Barrangou R, Fremaux C, Deveau H, Richards M, Boyaval P, Moineau S, et al. CRISPR provides acquired resistance against viruses in prokaryotes. Science. 2007;315:1709–12. doi: 10.1126/science.1138140. [DOI] [PubMed] [Google Scholar]
- 2.Garneau JE, Dupuis ME, Villion M, Romero DA, Barrangou R, Boyaval P, et al. The CRISPR/Cas bacterial immune system cleaves bacteriophage and plasmid DNA. Nature. 2010;468:67–71. doi: 10.1038/nature09523. [DOI] [PubMed] [Google Scholar]
- 3.Al-Attar S, Westra ER, van der Oost J, Brouns SJ. Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes. Biol Chem. 2011;392:277–89. doi: 10.1515/bc.2011.042. [DOI] [PubMed] [Google Scholar]
- 4.Bhaya D, Davison M, Barrangou R. CRISPR-Cas systems in bacteria and archaea: versatile small RNAs for adaptive defense and regulation. Annu Rev Genet. 2011;45:273–97. doi: 10.1146/annurev-genet-110410-132430. [DOI] [PubMed] [Google Scholar]
- 5.Garrett RA, Vestergaard G, Shah SA. Archaeal CRISPR-based immune systems: exchangeable functional modules. Trends Microbiol. 2011;19:549–56. doi: 10.1016/j.tim.2011.08.002. [DOI] [PubMed] [Google Scholar]
- 6.Marchfelder A, Fischer S, Brendel J, Stoll B, Maier LK, Jäger D, et al. Small RNAs for defence and regulation in archaea. Extremophiles. 2012;16:685–96. doi: 10.1007/s00792-012-0469-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Makarova KS, Haft DH, Barrangou R, Brouns SJ, Charpentier E, Horvath P, et al. Evolution and classification of the CRISPR-Cas systems. Nat Rev Microbiol. 2011;9:467–77. doi: 10.1038/nrmicro2577. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mojica FJ, Díez-Villaseñor C, García-Martínez J, Almendros C. Short motif sequences determine the targets of the prokaryotic CRISPR defence system. Microbiology. 2009;155:733–40. doi: 10.1099/mic.0.023960-0. [DOI] [PubMed] [Google Scholar]
- 9.Gudbergsdottir S, Deng L, Chen Z, Jensen JV, Jensen LR, She Q, et al. Dynamic properties of the Sulfolobus CRISPR/Cas and CRISPR/Cmr systems when challenged with vector-borne viral and plasmid genes and protospacers. Mol Microbiol. 2011;79:35–49. doi: 10.1111/j.1365-2958.2010.07452.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Semenova E, Jore MM, Datsenko KA, Semenova A, Westra ER, Wanner B, et al. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. Proc Natl Acad Sci U S A. 2011;108:10098–103. doi: 10.1073/pnas.1104144108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Fischer S, Maier LK, Stoll B, Brendel J, Fischer E, Pfeiffer F, et al. An archaeal immune system can detect multiple protospacer adjacent motifs (PAMs) to target invader DNA. J Biol Chem. 2012;287:33351–63. doi: 10.1074/jbc.M112.377002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Cady KC, Bondy-Denomy J, Heussler GE, Davidson AR, O’Toole GA. The CRISPR/Cas Adaptive Immune System of Pseudomonas aeruginosa Mediates Resistance to Naturally Occurring and Engineered Phages. J Bacteriol. 2012;194:5728–38. doi: 10.1128/JB.01184-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lynch EA, Langille MG, Darling A, Wilbanks EG, Haltiner C, Shao KS, et al. Sequencing of seven haloarchaeal genomes reveals patterns of genomic flux. PLoS One. 2012;7:e41389. doi: 10.1371/journal.pone.0041389. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Marraffini LA, Sontheimer EJ. Self versus non-self discrimination during CRISPR RNA-directed immunity. Nature. 2010;463:568–71. doi: 10.1038/nature08703. [DOI] [PMC free article] [PubMed] [Google Scholar]