Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2023 Apr 14;21(4):e3002065. doi: 10.1371/journal.pbio.3002065

CRISPR-Cas effector specificity and cleavage site determine phage escape outcomes

Michael A Schelling 1, Giang T Nguyen 1, Dipali G Sashital 1,*
Editor: Jeremy J Barr2
PMCID: PMC10132644  PMID: 37058530

Abstract

CRISPR-mediated interference relies on complementarity between a guiding CRISPR RNA (crRNA) and target nucleic acids to provide defense against bacteriophage. Phages escape CRISPR-based immunity mainly through mutations in the protospacer adjacent motif (PAM) and seed regions. However, previous specificity studies of Cas effectors, including the class 2 endonuclease Cas12a, have revealed a high degree of tolerance of single mismatches. The effect of this mismatch tolerance has not been extensively studied in the context of phage defense. Here, we tested defense against lambda phage provided by Cas12a-crRNAs containing preexisting mismatches against the genomic targets in phage DNA. We find that most preexisting crRNA mismatches lead to phage escape, regardless of whether the mismatches ablate Cas12a cleavage in vitro. We used high-throughput sequencing to examine the target regions of phage genomes following CRISPR challenge. Mismatches at all locations in the target accelerated emergence of mutant phage, including mismatches that greatly slowed cleavage in vitro. Unexpectedly, our results reveal that a preexisting mismatch in the PAM-distal region results in selection of mutations in the PAM-distal region of the target. In vitro cleavage and phage competition assays show that dual PAM-distal mismatches are significantly more deleterious than combinations of seed and PAM-distal mismatches, resulting in this selection. However, similar experiments with Cas9 did not result in emergence of PAM-distal mismatches, suggesting that cut-site location and subsequent DNA repair may influence the location of escape mutations within target regions. Expression of multiple mismatched crRNAs prevented new mutations from arising in multiple targeted locations, allowing Cas12a mismatch tolerance to provide stronger and longer-term protection. These results demonstrate that Cas effector mismatch tolerance, existing target mismatches, and cleavage site strongly influence phage evolution.


CRISPR-Cas immune systems provide immunity against viruses using RNA-guided endonucleases like Cas9 and Cas12a. Viruses can evade these Cas endonucleases through the evolution of mutants that block cleavage by creating mismatches between the guiding RNA and the viral DNA. We show that although Cas12a can tolerate some mismatches, the presence and locations of these mismatches strongly influences the outcomes of further viral evolution.

Introduction

CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR associated) systems are adaptive immune systems found in bacteria and archaea [13]. These systems use ribonucleoprotein effector complexes to find and destroy foreign nucleic acids that have entered the cell. CRISPR effector complexes are guided by a CRISPR RNA (crRNA) to a nucleic acid target that is complementary to a section of the crRNA called the spacer. Bacteria can acquire new spacer sequences that allow them to mount an immune response against threats they have not previously encountered [4,5].

An important function of CRISPR-Cas systems is to prevent infection by bacteriophages, which can have significant impact on the composition of a bacterial population [69]. As a CRISPR effector complex requires a match between its crRNA and a target to engage in interference, selection occurs for phages with mutations in targeted genomic regions [1012]. Mutations in CRISPR targets create mismatches between the target and the crRNA that weaken the base-pairing interaction [1315], slowing or stopping target matching by Cas effectors [16] and allowing phages to safely multiply in the bacterial cell. Different CRISPR-Cas systems have DNA or RNA as a primary target and prevent infection at the cellular and population level [1722]. Target binding is more stringent in DNA targeting systems, mitigating highly damaging off-target cleavage of host DNA [23]. In these systems, a protospacer adjacent motif (PAM) next to the target is required to initiate base pairing [2427]. Complete base pairing is especially important in the region next to the PAM, called the seed region [2834]. Accordingly, mutations that allow phages to escape CRISPR immunity are often single mutations in the PAM or seed region [1012,35].

There have been multiple proposed but noncompeting mechanisms for this mutagenesis. Mutants may exist due to natural genetic variation in the population and these could be selected through CRISPR pressure and become dominant in the population over time [11,36]. Alternatively, escape mutations may be generated by Cas effector cleavage and subsequent error prone DNA repair [37]. It has been shown that cleavage by Cas effectors causes large deletions to appear in the genome of T4 phage, resulting in loss of the crRNA target sequence [38]. Lambda phage encoded Red recombinase has been implicated in generating mutations in CRISPR targets that allow escape [39]. It remains unclear to what degree each of these mutagenesis pathways contribute to phage escape under different conditions.

Escape mutations are evident in natural settings as bacterial CRISPRs often contain mismatched spacers to common mobile genetic elements and the genomes of phages [27,4043]. New spacers are added at the leader end of CRISPR arrays and these new spacers are more likely to match a target in a phage genome exactly [44,45]. Genomic evidence also shows that spacer sequences in a CRISPR array do not commonly develop mutations and are fixed once they are acquired [12,46]. Instead, spacers are lost from the array entirely when they lose effectiveness as mutations accumulate in targeted genomic elements.

Mismatched spacers may provide some benefit to the host. Spacers against mutated targets drive some Cas effectors towards primed spacer acquisition, in which new spacers are preferentially acquired from genomes targeted by the Cas effector [11,31,4750]. Mismatched crRNAs may also provide low-level immunity through continued target cleavage. Cas effectors tolerate mismatches between the crRNA and target, allowing cleavage of mutated targets [26,28,29,5155]. This lax specificity may partially prevent phage escape. The type V-A Cas12a effector has been shown to tolerate multiple mismatches between its guiding crRNA and the target in vitro [53,56,57] leading to rare off-target genome edits in cells [5761]. However, this mismatch tolerance varies depending on the crRNA sequence and type of mismatch. Multiple Cas12a variants also have the ability to nick double-stranded DNA targets with many (3–4) mismatches [53,62]. These in vitro observations raise the question of how the specificity of Cas12a affects its role in preventing infection by phage with target mutations.

To test this, we subjected bacteria expressing Cas12a and crRNAs with varying target mismatches to phage infection. We found unexpected discrepancies between the effect of crRNA mismatches on target cleavage in vitro and survival of bacteria upon phage infection. This led us to monitor mutant emergence in phage populations. Using high-throughput sequencing, we discovered enrichment of a large variety of mutations when the phage was targeted by different crRNAs with and without mismatches. Single crRNA mismatches, even those outside of the seed region, had a drastic effect on the ability of bacteria to survive phage exposure, demonstrating the importance of spacer diversity as mutations in target genomic regions propagate. crRNA mismatches increased the rate at which mutant phage arose in the population. Mismatches in the PAM-distal region led to mutations in proximity to the original mismatch, leading to highly deleterious combinations of PAM-distal mismatches. Although similar mismatches were also deleterious for Cas9 cleavage, similar mutants did not emerge when phage was challenged with Cas9-crRNA complexes bearing PAM-distal mutations, suggesting that PAM-distal cleavage by Cas12a may result in more phage escape via PAM-distal mutations. Overall, we find that phage populations evolve in different ways to resist CRISPR interference depending on Cas effector specificity, existing crRNA-target mismatches, the location of CRISPR targets in the phage genome, and the cleavage site of the Cas effector.

Results

crRNA mismatches throughout the spacer decrease phage protection provided by Cas12a

To investigate the effect of crRNA mismatches on phage immunity provided by Cas12a, we developed a heterologous type V-A CRISPR-Cas12a system in Escherichia coli. We expressed Cas12a from Francisella novicida and various pre-crRNAs from 2 different plasmids in E. coli K12 using a strong inducible promoter (PBAD) or a relatively weak constitutive promoter [49,63]. We infected these cells with lambda phage to measure the immunity provided by Cas12a (Fig 1A). We used λvir, a mutant of lambda phage that cannot engage in lysogeny and is locked into the lytic mode of replication. This eliminates CRISPR self-targeting that could occur if a target phage becomes a lysogen in the bacterial genome. We chose 2 lambda genomic targets: one target was in an intergenic region upstream of gene J and the other target was inside the coding region of gene L (Fig 1A). Both genes encode essential structural tail tip proteins. The Cas12a expression system exhibited a high level of protection for both promoters, with targeting crRNAs showing about 106 fold less phage infection than the non-targeting control (Fig 1B).

Fig 1. Effects of mismatched crRNAs on Cas12a-mediated phage defense.

Fig 1

(A) Schematic of crRNA expression and processing by FnCas12a and crRNA phage target locations. crRNA mismatches were introduced by mutating individual nucleotides in the spacer sequence. After expression of the pre-crRNA, Cas12a processes it into a guiding crRNA that partially matches the lambda phage genome targets upstream of gene J and in the coding region of gene L. See S1a for target and crRNA spacer sequences. (B) Measurement of phage protection provided by crRNAs with and without target mismatches. Spot assays were performed with bacteria expressing FnCas12a and a crRNA construct that either perfectly matches the lambda phage genome (perfect) or has a crRNA mismatch (MM) at a position in the spacer (position x, sequences shown in S1A Fig). A non-targeting crRNA construct (NT) was used as a negative control. Lambda phage was spotted on cells with 10-fold decreasing concentration at each spot going from left to right. Expression of FnCas12a and pre-crRNAs were controlled by a stronger inducible PBAD promoter or a weaker constitutive promoter. (C) Observed rate constants for in vitro cleavage by Cas12a armed with crRNAs containing target mismatches. Plasmids bearing target sequences for gene J or L were used to measure Cas12a cleavage. Mismatch positions or perfect crRNAs (P) are indicated on the horizontal axis. Data from 3 replicates are plotted. * P ≤ 0.05, ** P ≤ 0.01, *** P ≤ 0.001, ns = no significant difference compared to the perfect crRNA based on unpaired two-tailed t test. See S1B and S1C Fig and S1 Data for gels and quantification. See S1 Fig for crRNA and target sequences, representative gels, and fit data. (D) Growth curves for E. coli expressing mismatched crRNAs following phage infection. Bacteria containing the PBAD FnCas12a expression plasmid and various crRNA expression plasmids were inoculated in liquid culture and induced immediately. Lambda phage was added 1.5 h after inoculation and OD600 measurements were taken every hour. Bacteria expressed no cRNA, a crRNA with no mismatches to the target (perfect) or a crRNA with a mismatch at the indicated position (position x). A no phage condition was performed as a negative control. The average of 2 replicates is plotted, with error bars representing standard deviation. See S2 Data for quantified data.

We designed 4 mutant crRNAs with varying levels of in vitro cleavage defects (Fig 1C) and tested their effects on phage defense (Fig 1B). These mismatches spanned the target with 1 in the seed region, 1 in the mid-target region, and 2 in the PAM-distal region. We observed a strong defect for the seed mutant when we used the weaker promoter to express Cas12a. However, this defect was reduced upon Cas12a overexpression using the stronger promoter (Fig 1B), consistent with the defect being caused by reduced Cas12a targeting. Mid-target and PAM-distal mismatches caused almost no visible defects in protection for the gene J target and small defects for the gene L target when Cas12a expression was controlled by the stronger promoter. These results correlated with the cleavage defects measured in vitro for the corresponding mismatched crRNAs, where seed mismatches had stronger defects than other mismatches but still enabled complete cleavage (Figs 1C and S1 and S1 Data), consistent with previous results [53,54]. However, when Cas12a expression was controlled by the weaker promoter, we observed a large loss of protection for the mid-target mismatched cRNA targeting gene J, which had no significant loss of cleavage in our in vitro assay (Figs 1B, 1C, and S1 and S1 Data). These results suggest that factors outside of reduced targeting may affect Cas12a-mediated protection at low expression levels.

We then tested the effects of mismatched crRNAs in liquid culture when Cas12a was expressed from the stronger promoter. Cells containing a matching crRNA grew at the same rate as cells that were uninfected with phage, demonstrating complete Cas12a protection in the time frame tested (Fig 1D and S2 Data). In contrast, most mismatched crRNAs caused lysis to occur regardless of the mismatch location in the target. For the gene J target, a crRNA mismatch in the seed region caused lysis to begin 1 h after infection, similar to a culture bearing a non-targeting crRNA. This indicated that the seed mismatch was allowing nearly full phage escape, consistent with this mismatch causing the largest reduction of target cleavage in vitro (Fig 1C).

In contrast, the seed mismatched crRNA against gene L provided protection for several hours post infection, with lysis beginning 3 h post-infection (Fig 1D). Mismatches in the mid- or PAM-distal region offered protection until 4 or 5 hours following infection. Interestingly, the rate of cleavage for these crRNAs did not always correlate with the level of protection provided in liquid culture (Fig 1C and 1D). Some crRNA mismatches that caused small decreases or no significant effect on cleavage rates in vitro led to lysis of the culture (e.g., gene J position 8 and gene L position 15). These results indicate that loss of cleavage caused by crRNA mismatches did not completely account for loss of immunity.

Phage target mutations depend on location of existing mismatches

Our initial results showed that crRNA mismatches have less of an effect on solid media than in liquid culture when Cas12a is expressed from a strong promoter. We hypothesized that these differences were caused by phage mutation upon CRISPR immune pressure. Unlike on solid medium, phage mutants that arise can quickly and uniformly spread throughout the culture in a liquid medium. Thus, preexisting mismatches or mismatches that arise through imperfect DNA repair following Cas12a cleavage may accelerate the selection for escape mutants as they quickly spread throughout the population, causing lysis in liquid culture. These results overall suggested that loss of protection from crRNA mismatches is due in part to emergence of phage mutants that further disable CRISPR interference.

To test this hypothesis, we investigated mutations that arose in phage populations in response to CRISPR pressure by Cas12a effector complexes with or without preexisting crRNA mismatches (Fig 2A). We isolated phage from liquid cultures of E. coli expressing matching or mismatched crRNAs and PCR amplified the regions of the phage genome that were being targeted. High-throughput sequencing was then used to identify mutations in the target regions (S3 Data).

Fig 2. crRNA mismatches cause emergence of diverse lambda phage mutations.

Fig 2

(A) Schematic of workflow for determining the genetic diversity of phage exposed to interference by Cas12a. Phage samples were collected from liquid cultures at various time points and the target region was PCR amplified. Mutations were observed using MiSeq high-throughput sequencing of these amplicons. (B) Line graph tracking the fraction of phage with mutated target sequences over time. Samples were taken from liquid cultures at time points after phage infection. The “0 h” samples were taken directly after addition of phage to the culture tubes. The average of 2 replicates are plotted with error bars representing standard deviation. See S3 Data for quantified data. (C) Heat maps showing the location of enriched phage mutations in target regions at the 8 h time point for gene J or L targets. Z-scores for abundance of single-nucleotide variants, including nucleotide identity changes or deletions, were determined for each sample relative to the non-targeted control phage population. Experiments were performed using λvir phage (WT) and λvir phage with the red operon deleted (Δred). Enriched sequences indicate high Z-scores. Z-scores range from 0 (white) to 10.3 (darkest red). Single-nucleotide deletions are shown at adjacent position to the 3′ side. Positions with crRNA mismatches are labeled with solid black boxes in the heat map. A thin outline indicates that the majority of sequences contain single point mutations at these positions while a thick outline indicates that the majority of sequences contain multiple point mutations at these positions. See S3 Data for quantification of variant abundance.

We first quantified the percent of the phage population that had mutations in genomic regions targeted by Cas12a over time in liquid culture. We observed mutations within the targeted region of PCR amplicon sequences, but not outside of the target region. Phages targeted by a matching crRNA gradually developed target mutations over time, with about 20% of the phage population becoming mutated after the 8 h time course (Fig 2B and S3 Data). A crRNA mismatch at any of the positions we tested led to a large acceleration of mutant emergence causing the phage population to become almost entirely mutated after 4 h. Interestingly, phages exposed to bacteria expressing crRNAs with a seed mismatch also rapidly mutated, even though our in vitro results showed the original crRNA mismatches were highly deleterious for target cleavage (Fig 1C). For most individual replicates of our samples, we did not observe substantial variability in the distribution of mutations after the phage population became highly mutated (S2A Fig and S3 Data). We therefore chose to focus on the longest time point (8 h) for further analysis.

As previously shown in other CRISPR systems [10,35,64], phage populations targeted with a perfectly matching crRNA developed mutations in the seed region of the target (Fig 2C and S3 Data). When the sequences of the crRNA plasmids were changed to create mismatches between the crRNA and the phage genome target, the position of individual point mutations within the phage target became substantially more variable. A crRNA mismatch at position 3 for the crRNA targeting the region upstream of gene J caused 9 different individual point mutations to appear at 8 positions spread across the PAM and seed, none at position 3 as expected given the preexisting mismatch (Fig 2C). For the gene L target, a crRNA mismatch at position 3 only caused 2 different mutations to appear, with one of them being the predominant mutation seen when targeting with a matching crRNA. Mismatches in the mid-target region at position 8 also caused seed mutations to arise. Surprisingly, PAM-distal crRNA mismatches caused enrichment of PAM-distal mutations and prevented nearly all seed mutations from emerging. Most of the mutations present in liquid culture were also observed when sequencing phage from spot assays, although the distribution differed in some cases (S2B Fig and S4 Data). This indicates that the differences we observed between our solid media and liquid cultures experiments were caused by the increased mobility of phages in liquid culture and were unrelated to the types and location of mutations that could arise.

The genomic context of target sequences had a clear effect on the types of mutants that emerged (S2C Fig and S3 Data). For the matching crRNA targeting the region upstream of gene J, the most common mutation observed was a single-nucleotide deletion at position 6. The most common mutation for the gene L target was a single nucleotide substitution at position 2 which is a wobble base position in the codon. Similar to the other mismatched crRNA constructs targeting gene L, most mutations we observed were either silent or caused amino acid changes from valine, threonine or serine to alanine or from proline to leucine. No deletions were observed in the gene L target in any samples with crRNA mismatches, while deletions were observed in the gene J upstream target in samples with crRNA mismatches at positions 15 and 19. This difference in mutational variability reflects the more vulnerable target region of gene L where base substitutions are likely to change the amino acid sequence of the protein and single deletions will cause frame-shifts. The relatively weak constraints on viable mutations in the upstream region of gene J may enable more routes for escape from Cas12a targeting, resulting in the loss of protection at earlier time points (Fig 1D).

Phage mutations can arise following exposure to Cas12a

Our results indicate that mutations can arise rapidly in regions targeted by Cas12a when a preexisting mismatch is present between the crRNA and target. It is possible that Cas12a targeting selects mutant phages that are present in the population at the time of infection. We investigated whether the mutated phage we observed in our CRISPR active samples were present in negative control samples. While many of the single-nucleotide substitutions that were enriched following Cas12a targeting were present at very low levels in the control sample, we could not distinguish actual nucleotide variations from sequencing or PCR error (S3 Fig and S3 Data). However, we did observe that 2 out of 3 sequences containing single-nucleotide deletions that were consistently present in control samples were the only 2 deletion mutants that became highly enriched in the escaped mutant phage population (S2C and S4 Figs and S3 Data). These results suggest that deletion mutants that were enriched upon Cas12a-mediated selection were preexisting in the population.

To further test this, we designed crRNAs targeting non-essential regions in the lambda phage genome (S5 Fig and S5 Data). Cas12a-mediated defense against lambda phage using these crRNAs caused large deletions to appear based on recombination at microhomology sites, as has been previously observed [38] (S5A and S5B Fig). We used long-read sequencing to determine whether these regions of the wild-type lambda phage population contained the same deletions. Similar to the single-nucleotide deletions, microhomology-mediated deletions that were enriched upon Cas12a-mediated selection were among the most abundant mutations preexisting in the wild-type population (S5C Fig and S5 Data). Together, these results suggest that mutants that emerge upon Cas12a targeting may be selected from natural genetic variants in the population.

Our observation that enriched deletion mutations preexist in the wild-type population does not rule out the possibility that mutations may be actively acquired following Cas12a cleavage. Previous studies have suggested that DNA cleavage by Cas9 or Cas12a may induce mutation in the target site through DNA repair [35,38]. To test this possibility, we deleted the red operon from the λvir genome, which has previously been implicated in escape mutation enrichment following targeting by Cas9 [39]. We performed liquid culture infection assays using the Δred phage in cultures expressing Cas12a and panel of crRNAs described above (S6A Fig and S2 Data). Unlike with wild-type phage, infection with the Δred phage did not cause lysis in most cultures expressing a perfect or mismatched crRNA.

We next sequenced phage populations harvested from cultures 8 h after infection (Figs 2C and S6B and S3 Data). Consistent with the lack of lysis in many cultures, we did not observe mutants arising in cultures expressing either perfect crRNA, the PAM-distal mismatched crRNAs targeting gene J, nor the mid-target mismatched crRNA targeting gene L. For most of the remaining crRNAs, we observed mutations that were observed in the wild-type phage populations challenged with the same crRNA, although the number of mutants and distribution of these mutants varied between phage strains.

One notable exception where we observed new mutations arising in the Δred strain was for the phage challenged with the mid-target (position 8) mismatched crRNA targeting gene J (Fig 2C and S3 Data). Surprisingly, we observed a range of double mutations in the mid and PAM-distal region in Δred phage challenged with this crRNA (Figs 2C, S6B, and S6C and S3 Data). Importantly, all double mutants observed in individual replicates contained unique mutations and appeared to originate with a single point mutation (S6C Fig). These results strongly suggest an active mechanism of mutant generation unrelated to the red operon. Overall, our results provide evidence that both preexisting and actively acquired mutations may be selected during Cas12a-mediated immunity.

Multiple mismatches in the PAM-distal region allow phage escape from Cas12a

A striking result from our sequencing of mutant phage populations was the emergence of PAM-distal mutants upon challenge with crRNAs containing PAM-distal mismatches. Given that seed mutants appeared when other Cas12a crRNAs were used, these results suggested that multiple PAM-distal mismatches are at least as deleterious for Cas12a cleavage as a seed mismatch combined with a PAM-distal mismatch. It has been shown that pairs of mismatches are more deleterious for Cas12a cleavage when they are closer together, including when both are in the PAM-distal region [53,56]. Together, these results suggest that double mismatches in the PAM-distal region can lead to phage escape from Cas12a.

To test this, we added second PAM-distal crRNA mismatches to crRNAs targeting gene J that initially contained a single PAM-distal mismatch. We chose the second mismatch position based on phage mutants that appeared when exposed to the original mismatched crRNA (Figs 2C and S7A). Adding a second mismatch at position 14 to the crRNA that contained a mismatch at position 15 caused a small defect in phage protection (Fig 3A). Although this mismatch pair had small effects on the overall cleavage rate in vitro regardless of the mismatch type at position 14 (Figs 3B and S7B–S7C and S1 Data), we did observe a cleavage defect, in which the DNA was nicked by Cas12a through cleavage of only 1 strand (S7B Fig). This defect in second-strand cleavage may allow more phage infection, resulting in partial loss of phage defense on solid media (Fig 3A). Consistently, bacteria expressing a crRNA with a position 15 mismatch did not lyse in liquid culture (Fig 1D), despite the emergence of the position 14 mutation (Fig 2C). In contrast, adding any type of second mismatch at position 16 to a crRNA that contained the original mismatch at position 19 caused nearly complete loss of protection (Fig 3A) consistent with lysis observed for the position 19 mismatched crRNA in liquid culture (Fig 1D) and the dramatic loss of cleavage activity in vitro (Figs 3B and S7B–S7C and S1 Data).

Fig 3. Two PAM-distal mismatches are more deleterious than seed mismatches.

Fig 3

(A) Spot assays performed using E. coli expressing FnCas12a and a crRNA that perfectly matches the lambda phage gene J target (perfect) or has mismatches at the indicated positions. Three types of second mismatches were added and the type of the mismatch is indicated in parenthesis next to the position number. See S7A Fig for crRNA spacer sequences. (B) Observed rate constants for in vitro cleavage by Cas12a armed with crRNAs containing 2 target mismatches. Cleavage was measured for plasmid DNA containing a gene J target. The types of mismatches for the second mismatch are indicated. * P ≤ 0.05, **** P ≤ 0.0001, ns P > 0.05 compared to the perfectly matching crRNA based on unpaired two-tailed t test. See S7B and S7C Fig and S1 Data for gels, and quantified and fit data. (C) Phage spot assays for target mutant phages isolated upon challenge with Cas12a programmed with a position 15 mismatched crRNA targeting gene L. Spot assays were performed with E. coli expressing a non-targeting crRNA (NT), a crRNA that perfectly matched wild-type phage (Perfect), or the crRNA with a mismatch at position 15 (MM15). Phage mutations were in the seed (A2T) or PAM-distal (G17T) region. (D) Observed rate constants for in vitro Cas12a cleavage of plasmids bearing wild-type (WT), seed mutant (A2T), or PAM-distal mutant (G17T) gene L target sequences. Cas12a cleavage was measured for both the perfectly matched crRNA (P) or the MM15 crRNA (15). Significance was tested pairwise for all crRNA/target combination by unpaired two-tailed t test. * P ≤ 0.05, ** P ≤ 0.01, *** P ≤ 0.001, ns P > 0.05. Pairwise comparisons for which P value are not indicated had a P < 0.0001. See S8 Fig and S1 Data for crRNA and target sequences, gels, and quantified and fit data. (E) Schematic of competition assay. Two mutant phages, A2T and G17T, were mixed at approximately equal titers. E. coli expressing Cas12a and the position 15 mismatched crRNA were infected with a dilution series of the mutant phage mix. Lysates were harvested and the proportion of each mutant phage was determined by high-throughput sequencing. (F) Ratio of seed mutant (A2T) to PAM-distal mutant (G17T) following lysis of cultures infected with a dilution series of the mixed phage. E. coli expressed Cas12a and either a non-targeting (NT, red) or position 15 mismatched (MM15, blue) crRNA. “Mix” indicates the initial mixture of phage. The average of 3 replicates is shown, with error bars indicating standard deviation. See S6 Data for raw and quantified data.

We next investigated why PAM-distal mutations may be preferentially selected over PAM or seed mutations that may be highly deleterious to Cas12a cleavage on their own. Notably, while PAM and seed mutations were not highly enriched for wild-type phage challenged with PAM-distal mismatched crRNAs in liquid culture (Fig 2C), we did observe PAM and seed mutants when we assayed the phage population present in spot assays on solid media (S2B Fig). This difference may be due to competition between different mutant phages, in which phages bearing mutations that are more deleterious to Cas12a cleavage may outcompete less deleterious mutants. Such competition is more likely to occur in liquid media where phages are mobile.

To test this hypothesis, we isolated 2 mutant phages selected upon targeting with the position 15 mismatched crRNA (MM15) targeting gene L (see Methods). The mutant phages contained a single point mutation in either the seed (A2T) or the PAM-distal region (G17T) of the gene L target region. As expected, the seed mutant phage caused a far greater loss of protection than the PAM-distal mutant when the 2 mutant phages were used to infect bacteria expressing Cas12a and the perfectly matching crRNA in phage spotting assays (Fig 3C). In contrast, both mutants caused a similar loss of protection in cells expressing the MM15 crRNA.

We next tested the extent to which these target mutations cause Cas12a cleavage defects using both the perfect crRNA and the MM15 crRNA (Figs 3D and S8 and S1 Data). Cleavage of the A2T seed mutant target was reduced by 15 to 20× in comparison to the WT target, and we did not observe a significant difference in cleavage of this target by Cas12a bearing either the perfect or MM15 crRNA (Fig 3D). Strikingly, Cas12a cleavage was reduced by approximately 90× when the G17T target was cleaved with Cas12a bearing the MM15 crRNA. Importantly, this loss of cleavage upon introduction of 2 PAM-distal mismatches was significantly more deleterious than the cleavage defect observed when seed and PAM-distal mismatches were combined (MM15/A2T combination).

Together, our results strongly suggest that PAM-distal mutants emerge upon challenge with crRNAs bearing PAM-distal mismatches because 2 PAM-distal mismatches can be more deleterious to Cas12a cleavage than a seed and a PAM-distal mismatch. To directly test this, we performed a competition assay in liquid culture in which cells expressing either a non-targeting or the MM15 crRNA were coinfected with a mixture of the A2T and G17T mutant phages (Fig 3E and 3F). We performed the competition at decreasing phage concentrations, allowing competition to occur as the phages were propagated in the culture. Lysates were sampled after 8 h, the target region was PCR amplified, and the abundance of each mutant was determined by high-throughput sequencing (S6 Data).

Notably, although both the A2T and G17T substitutions are silent mutations, the A2T mutant slightly outcompeted the G17T mutant in cultures expressing the non-targeting crRNA (Fig 3F), suggesting that the A2T mutant phage may be slightly more fit than the G17T mutant. In contrast, when coinfection was performed in cultures expressing the MM15 crRNA, the G17T mutant became dominant in the phage population when cultures were infected with highly diluted phage mixtures. Overall, our results demonstrate that PAM-distal mutants emerge in liquid cultures expressing PAM-distal mismatched crRNA because of the highly deleterious effect of dual PAM-distal mismatches on Cas12a cleavage.

Emergence of PAM-distal escape mutants occurs for Cas12a but not Cas9

Our results show that some pairs of PAM-distal mismatches are deleterious enough to cause escape from CRISPR-Cas12a immunity. We also note that Cas12a cleaves in the PAM-distal region of the target [26], and that our and previous results suggest that mutations may arise during repair of DNA breaks induced by Cas endonuclease cleavage [38,39]. Thus, it is possible that Cas12a is uniquely prone to emergence of escape mutations in the PAM-distal region.

To test this hypothesis, we performed similar experiments for Streptococcus pyogenes Cas9 (SpCas9), which cleaves targets in the PAM-proximal region [55,65] and therefore may be more prone to emergence of mutations in the PAM and seed region. Similar to FnCas12a, in vitro cleavage assays using SpCas9 revealed that 2 PAM-distal mismatches cause a significantly larger defect than a seed and PAM-distal mismatch (Figs 4A and S9 and S1 Data). Surprisingly, for the target tested in our in vitro cleavage assays, the PAM-distal mutation was more deleterious than the seed mutation even when targeted by the perfect crRNA. These results suggest that PAM-distal mutations should be sufficient to cause escape from SpCas9-mediated immunity.

Fig 4. Cas9 challenge does not cause emergence of PAM-distal mutants.

Fig 4

(A) Observed rate constants for cleavage of a target plasmid bearing a wild type (WT), seed mutant (G2T) and PAM-distal mutant (A17T) gene L target sequence. Cas9 cleavage was measured for both the perfectly matched crRNA (P) or the MM15 crRNA (15). Significance was tested pairwise for all crRNA/target combination by unpaired two-tailed t test. * P ≤ 0.05, *** P ≤ 0.001, ns P > 0.05. Pairwise comparisons for which P value are not indicated had a P < 0.0001. See S9 Fig and S1 Data for crRNA and target sequences, gels, and quantified and fit data. (B) Growth curves of E. coli expressing Cas9 and sgRNAs bearing either a non-targeting sequence, the perfectly matching spacer sequence (perfect), or a spacer containing mismatch at the indicated position with respect to the PAM. Phage was added when the cells reached mid log phase at approximately 2 h after inoculation. The average of 3 replicates is plotted for each condition, with error bars representing standard deviation. See S2 Data for quantified data. (C) Heat maps showing the location of enriched phage mutations in target regions at the 8 h time point for gene J or L targets after Cas9-mediated selection. Z-scores for abundance of single-nucleotide variants, including nucleotide identity changes or deletions, were determined for each sample relative to the non-targeted control phage population. Enriched sequences indicate high Z-scores. Z-scores range from 0 (white) to 7.0 (darkest red). See S7 Data for quantification of variant abundance.

To test whether such mutants emerge, we performed phage challenge assays in E. coli expressing SpCas9 programmed with single-guide RNA (sgRNA) containing mismatches at the same positions relative to the PAM as those tested for FnCas12a (Figs 4B and S10A and S2 Data). Similar to FnCas12a, mismatches caused minimal defects in SpCas9-mediated phage defense on solid media (S10A Fig). In liquid media, delayed lysis occurred in all cultures, including those expressing perfect crRNAs (Fig 4B). To determine whether lysis occurred due to the emergence of phage mutants, we PCR amplified the target regions of phage collected from these lysates and sequenced the amplicons by high-throughput sequencing (S7 Data). Mutants emerged in all samples and target mutations were confined to the PAM and seed, although the positions of these mutations varied (Figs 4C and S10B). Seed and mid-target crRNA mismatches caused a shift away from the PAM and into the seed region. Unlike Cas12a, no PAM-distal mutants emerged for either target when challenged by Cas9 bearing PAM-distal mismatched sgRNAs. These results suggest that both Cas effector specificity and cut site may impact the location within targets at which escape mutations may emerge.

Phage targeted with mismatched spacers develop conditional escape mutations

Our results suggest that individual mismatches are often not sufficiently deleterious to allow phages to escape Cas12a targeting. Instead, the combination of the preexisting mismatch and an additional mutation in the target is necessary for escape to occur. It remains unclear to what extent these new mutations contribute to phage escape in the presence of a preexisting mismatch. We chose to pursue further experiments using the crRNA with a seed mismatch targeting gene J because although it was highly deleterious for cleavage in vitro (Fig 1C), it caused rapid phage mutation in liquid culture (Fig 2B). In addition, this mismatch caused the largest variety of mutants to arise for all the crRNAs we tested with mutations at nearly all positions in the seed region (Figs 2C and S2). We hypothesized that this target in an intergenic region was less restrictive of mutation, exacerbating the defect of this crRNA mismatch in vivo.

To test this hypothesis, we generated mutated phage populations using the seed mismatched crRNA targeting gene J. We first infected E. coli cells expressing the mismatched crRNA with lambda phage in liquid culture at a wide range of MOIs (Fig 5A). The phages were able to clear the culture at MOIs greater than 1.5 × 10−3 (S11A Fig and S2 Data). We then analyzed the genomic diversity of the phage population in the targeted region using high-throughput sequencing (S8 Data). The phage population retained the wild-type sequence of the target region at the 2 highest MOIs tested (0.15 and 0.075 MOI), indicating that the wild-type phage can overcome Cas12a-mediated immunity when the bacteria are exposed to enough phage particles (Fig 5B). At the lowest MOIs tested, 1.5 × 10−4 and 1.5 × 10−5, 99% of the phage population contained a single mutation at the first position of the protospacer. Mutations that arose were most varied at intermediate MOIs. These mutations were in the seed region or mid target region near the existing crRNA mismatch. The number of different mutations observed was also higher compared to the bacterial strain with a matching crRNA to the WT phage target.

Fig 5. Combined mismatches are necessary for complete phage escape.

Fig 5

(A) Schematic for experiment to test the impact of MOI on escape phage diversity. Cultures expressing Cas12a and the position 3 mismatched crRNA targeting gene J were infected with lambda phage at varied MOIs. Mutant phages in lysates were detected by high-throughput sequencing. (B) Heat map showing the position of phage mutations that arose when infecting bacteria expressing seed mismatch crRNA at different MOIs. Phage was harvested from liquid cultures containing bacteria expressing FnCas12a and a crRNA with a C-T mismatch at position 3. Phage was added to the culture at mid-log phase at a range of MOIs starting at 0.15 and serial 2-fold dilutions from 1/2 to 1/32 and an additional sample at an MOI of 1.5 × 10−3. Phage was harvested 5 h after infection. High-throughput sequencing was used to determine the percent of phages in each that had a mutation in the target region. The heat map shows the positions in the target that were enriched with mutations. These positions are colored darker red according to their Z-score relative to the control phage population. Z-scores range from 0 (white) to 10.1 (darkest red). The position of the initial crRNA mismatch is indicated in solid black. See S8 Data for quantification of variant abundance. (C) Spot assays using phage isolated from liquid cultures as described in (A) on bacteria expressing a matching crRNA. Phage harvested in (A) was 10-fold serial diluted and spotted on bacteria with a crRNA matching the wild-type lambda phage genome target (matching crRNA) or bacteria without a crRNA guiding Cas12a (no crRNA). Wild-type phage controls were spotted on these same bacterial strains. Phages harvested from the lowest MOI cultures were omitted due to their low titer which prevented visible plaque formation on the CRISPR active E. coli strain. See S11B Fig for full plates. (D) Schematic for experiment shown in panel (E). Wild-type or mutant phage populations were used for spot assays on plates with lawns of E. coli expressing Cas12a and either the perfect or the seed mismatched crRNA to determine whether the combination of the preexisting mismatch and newly acquired target mutations are necessary for complete escape from Cas12a targeting. (E) Spot assays using mutationally diverse phage on bacteria expressing crRNAs with and without mismatches. Phage was isolated from the liquid culture as described in (A) that was initially infected with phage diluted 1:8. Mutated phage and unmutated control phage (WT) were then used for spot assays on bacterial lawns expressing FnCas12a and a matching crRNA (perfect), a crRNA with the original seed mismatch, or no crRNA as negative control.

Next, we harvested phage from the cultures at all of the MOIs tested and compared protection against this mutant phage population by a crRNA that perfectly matched the wild-type target and a crRNA bearing the original seed mismatch used to generate the mutant population. Visible infection using these new phage lysates was first observed when using phage that was 48% mutated (Figs 5C and S11B). While the perfect crRNA still offered some level of protection against the mutated phage, the crRNA containing the mismatch resulted in complete loss of protection (Fig 5D and 5E). These results indicate that some mutations that emerge in the phage population are only significantly deleterious to Cas12a interference in the presence of the preexisting mismatch, revealing the importance of combined mismatches for phage escape.

Combining mismatched spacers increases level of protection

Our results indicated that loss of protection by Cas12a due to crRNA mismatches was only partially caused by loss of Cas12a cleavage due to the preexisting mismatch and that mutant emergence generating a second mismatch also contributed substantially to this loss of protection. These results imply that Cas12a mismatch tolerance should enable stronger and longer term protection under conditions where phage mutants are less likely to emerge. To further test this, we introduced both the gene J and L crRNAs into a CRISPR array for co-expression of both crRNAs (Fig 6A). It has been demonstrated previously that bacterial populations with diverse spacer content in their CRISPR arrays show robust immunity against phage at the population level as phage are unable to develop escape mutations in all of their targeted genomic regions [66,67]. Cas12a expressed along with multiple different cRNAs shows increased phage resistance compared to when a single crRNA is present, but the mechanism of this increased resistance is not clear [64]. We investigated this mechanism further in the context of our previous experiments with mismatched crRNAs. If the loss of protection due to a crRNA mismatch is caused only by a slowing of the rate of cleavage, then 2 different mismatched spacers should not provide more protection than 1 spacer repeated twice. Alternatively, if phage mutant emergence significantly contributes to loss of protection in the presence of a crRNA mismatch, 2 different mismatched spacers should provide better protection than a single-mismatched spacer repeated twice.

Fig 6. Multiple mismatched crRNAs provide more protection than individual mismatched crRNAs.

Fig 6

(A) Schematic of experiment in which 2 crRNAs bearing mismatches at position 3 are expressed from a CRISPR plasmid. The CRISPRs either have 2 identical spacers targeting gene J (J+J) or gene L (L+L), or have 2 different spacers targeting gene J and L (J+L). (B) Number of plaques formed on lawns of bacteria expressing multiple mismatched crRNAs. Plaque assays were performed using bacteria containing a plasmid expressing FnCas12a along with different crRNA expression plasmids. Plasmid expressed either the perfect crRNA (P) or the position 3 mismatched crRNA (3). For the perfect crRNAs, plasmids expressed either 1 (gene J or gene L alone) or 2 (J+L) spacers. For the position 3 mismatched crRNAs, each crRNA expression plasmid contains 2 spacers: both targeting gene J both targeting gene L and 1 spacer targeting each of gene J and gene L (J+L). (-) indicates a negative control in which no crRNA was expressed. Plaque forming units (pfu) was calculated using the number of plaques on each plate and the volume of phage lysate added. The average of multiple replicates (n = 2 to 4) is plotted with error bars representing standard deviation. Unpaired, two-tailed t tests were used to determine the statistical significance of each of the single spacer constructs compared to the double spacer construct, *** P ≤ 0.001, **** P ≤ 0.0001. See S2 Data for quantified data. (C) Growth curves using the same bacterial strains described in (A). Phage was added when the cells reached mid log phase at approximately 2 h after inoculation. The average of 3 replicates is plotted for each condition, with error bars representing standard deviation. See S2 Data for quantified data. (D) Spot assays used to measure the titer of phage over time in phage infection cultures. Spot assays were performed using 10-fold serial dilutions of phage harvested from cultures in (B) that infected bacterial strains with 2 mismatched spacers at different time points on lawns of CRISPR-inactive E. coli. (E) Sequences of both CRISPR targets in single phage plaques for phage harvested from E. coli cultures expressing a double spacer construct. The 2 crRNAs contained mismatches at positions highlighted in black. Target regions for the gene J and gene L target were sequenced for 6 individual plaques using Sanger sequencing. Target sequences are aligned to the WT sequence (top row) and mutations are underlined. See S12B Fig for chromatograms.

Consistent with the second possibility, the CRISPR construct with 2 unique mismatched spacers (hereafter referred to as double spacer construct) showed a significantly higher level of protection than either of the crRNA constructs with 2 copies of a single-mismatched spacer (hereafter referred to as single spacer construct) when measured by plaque assay (Fig 6B and S2 Data). Similar to liquid cultures with bacteria expressing a single copy of the crRNA, we observed faster lysis of the gene J targeting crRNA in comparison to the gene L targeting crRNA, consistent with the higher chance of escape mutant emergence against the gene J crRNA. Bacteria expressing the double spacer construct showed slowed growth between 1 and 2 h but recovered quickly and did not lyse over the time course tested (Fig 6C and S2 Data). Phage from these cultures was harvested over time and used to infect CRISPR inactive bacteria to determine the relative titers. Phage titers decreased over time in cultures expressing the double spacer construct, while the phage titer increased over time in cultures with cells expressing the single spacer constructs (Fig 6D).

Spotting these same phage lysates on CRISPR active cells showed no noticeable infection by lysate harvested from the double spacer culture, but moderate infection by the single spacer lysate (S12A Fig), suggesting that escape mutants did not emerge from bacteria expressing 2 different mismatched crRNAs. Consistently, sequencing of both target regions in individual plaques revealed mutations in only 1 of the 2 target regions (Figs 6E and S12B). Together with our previous results, these results suggest that loss of protection due to a crRNA mismatch is caused by a combination of loss of Cas12a targeting and the emergence of mutant phages that further block CRISPR interference.

Preexisting target mutations cause different CRISPR escape outcomes

We have shown that target mismatches artificially introduced by changing crRNA sequences accelerate phage escape and increase the diversity of mutations that appear. We wanted to determine if the same effect would appear if the crRNA-target mismatch was instead caused by a phage genome mutation. We first generated clonal phage populations with single target mutations by isolating individual plaques of mutant phage that emerged following exposure to Cas12a bearing various crRNAs (Fig 7A). Using a crRNA containing a seed mismatch, we isolated phages with mutations in the PAM (T-2C) or seed (C2A) (S13A and S13B Fig), while a crRNA containing a mismatch at position 19 allowed us to isolate 2 separate plaques containing phage with a mutation at position 16 (G16T) (S13C and S13D Fig). After propagating phage from these plaques, we challenged the mutant phages to CRISPR pressure by bacteria expressing crRNAs with a spacer matching the wild-type phage genome target. We observed that the phage with a seed region mutation caused rapid lysis of CRISPR active bacteria (Fig 7B and S2 Data). This indicated that the C2A mutation was a complete escape mutation. However, phage mutations in the PAM or PAM-distal region caused delayed lysis to occur. In particular, of the 2 G16T isolates, only one caused lysis to occur in some of the experimental replicates (Fig 7B and S2 Data).

Fig 7. Generation of double-mutant phage is driven by insufficiently deleterious mutations.

Fig 7

(A) Schematic of the process for generating and purifying single-mutant phage populations. Wild-type phage was used to challenge E. coli expressing a crRNA with a mismatch to the target in the phage genome in liquid culture. The resulting phage were isolated and used for a plaque assay on lawns of bacteria expressing the same mismatched crRNA. Single plaques were isolated and again used to challenge bacteria expressing a mismatched crRNA in liquid culture, further purifying and propagating single mutants. Finally, single-mutant phages were used to challenge bacteria expressing a perfectly matching crRNA in liquid culture to determine whether second mutations would appear. (B) Growth curves of bacteria expressing a perfectly matching crRNA challenged with wild-type phage and phage with various single target mutations. Locations of the single mutations in the target are labeled (PAM mutant, seed mutant, and PAM-distal mutant). Position and type of mutations are indicated in parenthesis. Three biological replicates are shown separately for each experimental condition. See S2 Data for quantified data. (C) Diagram of initial and selected mutations that appeared when a single-mutant phage was used to challenge bacteria expressing a perfectly matching crRNA. Initial mutants are single mutants that were generated and purified as described in (A). Sequences below arrows show phage mutants that appeared in different biological replicates (rep 1, 2, or 3) after initial mutant phage lysates were used to infect bacteria expressing a perfectly matching crRNA in liquid culture. Positions with ambiguous base calls are indicated with 2 bases (X/Y) at that position. Target sequences were interpreted from Sanger sequencing chromatograms (see S13 Fig). (D) Spot assays challenging bacteria expressing a perfectly matching crRNA with various single- and double-mutant phage lysates. WT phage or phages with the indicated target mutations were spotted on bacteria expressing a non-targeting crRNA (left column) and a perfectly matching crRNA (right column).

We harvested phage from the previous cultures and sequenced PCR amplicons of the phage genome targets using Sanger sequencing. In the seed mutant (C2A) phage cultures, the phage retained the same seed mutation and did not develop additional mutations (Figs 7C and S13B), further indicating that C2A is a bona fide escape mutation on its own. In phage with preexisting mutations in the PAM, mutations appeared at the edge of the seed region (Figs 7C and S13A). In phage with a preexisting mutation in the PAM-distal region at position 16, mutations appeared at positions 14 or 18 for phage harvested from cultures that lysed. Repeating the same experiment with PAM-distal mutants in the gene L target similarly caused further mutations to occur in the mid-target and PAM-distal regions (S13E–S13H Fig).

We proceeded with further experiments using only replicates in which a clonal phage population was generated based on an unambiguous Sanger sequencing chromatogram (S13A, S13B, and S13D Fig). Using these phages, we sought to verify that these second mutations were allowing CRISPR escape. We compared infection of bacteria expressing the matching crRNA by purified phage containing a single target mutation and phage with 2 target mutations. As expected, phage with the seed target mutation infected bacteria expressing the perfect crRNA at the same level as bacteria expressing a non-targeting crRNA (Fig 7D). Phage with single target mutations in the PAM or PAM-distal region infected bacteria expressing the perfect crRNA at a level close to wild-type phage, while phage with a second mutation infected 104 to 105 times more (Fig 7D). We conclude that target mutations that do not lead to significant CRISPR escape can accelerate the appearance of second mutations that allow complete escape.

Discussion

In order for Cas12a to be an effective immune effector, it must provide immunity from bacteriophage in diverse conditions. The physical environment controls which bacteria are exposed to which phages and the dispersal of new phage particles after infection and lysis [9,68]. We found that Cas12a overall provided more robust immunity on solid media than in liquid culture. The effect of seed crRNA mismatches using either media correlated with the deleterious effect of the crRNA mismatch on the rate of cleavage in vitro. Mid-target and PAM-distal mismatches, however, showed a much more drastic effect in liquid culture than defects observed in vitro or on solid media when Cas12a was expressed from a strong promoter, causing eventual lysis of the bacterial population, sometimes at a rate similar to seed mismatches. These results strongly indicate that the effect of crRNA mismatches varies depending on the environment where phage exposure may occur.

Although phage can escape CRISPR interference with a single mutation in the seed region [10,35], our results indicate that a seed mutant is sometimes not deleterious enough to allow complete escape. Instead, the mechanism of phage escape occurs through the emergence of mutations that further weaken CRISPR interference when a mismatch is present. This is supported by the rapid emergence of mutant phage we observed even when a highly deleterious seed crRNA mismatch was present. In addition, the number of different mutations that appeared increased when a crRNA mismatch was present and the distribution of these mutations greatly varied depending on the location of the mismatch. These results suggest that mismatches between the crRNA and target decrease phage protection by broadening the range of mutations that allow escape. This mechanism also explains the deleterious effect of mismatches at some positions outside of the seed region on immunity in liquid culture that does not appear during in vitro cleavage.

In natural settings, spacer-target mismatches are caused by phage genome mutation rather than CRISPR spacer mutation that would change the sequence of the crRNA [12,69]. Phages with a single target mutation might not be able to completely escape from certain CRISPR spacers, requiring selection of a second target mutation for full evasion of CRISPR interference [36]. This scenario may become more likely if the seed region, where mutations would normally arise, is located in a critical part of the genome where mutations are highly deleterious. Supporting this, we isolated phage with single mutations in both intergenic (gene J) and coding (gene L) regions that did not cause significant CRISPR escape that then developed second mutations that allowed full escape when exposed to the matching crRNA. In addition, CRISPR spacers with multiple mismatches become more abundant over time in natural populations [46]. Thus, the presence of mutations may drive further mutation in CRISPR targets over time.

When a PAM-distal crRNA mismatch or a PAM-distal target mutation was present, mutations arose in close proximity to the preexisting mismatch for Cas12a, but not for Cas9. Our in vitro cleavage results suggest that 2 PAM-distal mismatches may be more deleterious than individual seed mismatches or even combinations of seed and PAM-distal mismatches. Thus, phage mutations that result in multiple PAM-distal mismatches are more likely to be selected than PAM or seed mutants when a PAM-distal mismatch already exists.

The enrichment of PAM-distal mutations after exposure to Cas12a and not Cas9 may be influenced by their cut sites, which are in the PAM-distal region and downstream of the target for Cas12a and in the seed region for Cas9 [26,55,65]. Mutations may be more likely to arise around the cleavage site due to DNA repair that occurs after cleavage by Cas12a or Cas9. Notably, many of the crRNAs used in our study caused initial nicking, rather than complete double-strand cleavage, in in vitro cleavage assays (S1B, S7B, S8B, and S9B Figs). Such nicking events may result in recombination or other DNA repair mechanisms that result in alteration of the sequence around the cleavage site. Indeed, there is evidence that phage escape mutation is accelerated by error prone repair after cleavage by Cas effectors and mediated by phage recombinases, especially when a crRNA provides a low level of protection [38,39]. Lambda phage recombination enzymes in particular have been shown to increase the appearance of mutated phage in populations under CRISPR pressure [39]. Our results suggest that some mutants that arose upon Cas12a challenge preexisted in the population, especially for mutants involving single nucleotide or long deletions. However, we also observed strong evidence that mutants arose following Cas effector targeting, including in phage strains lacking Red recombination machinery. We speculate that preexisting mutations may be propagated in the phage population by lambda encoded recombinases, but that error prone repair following Cas effector cleavage is not dependent on Red recombination. Phage with target mutations in their genome that initially survive interference could be used as recombination substrates to pass along that mutation to other phages in the cell. This could account for the rapid increase in mutant phages we observe and the deleterious effect of phage recombination enzyme knockouts [39].

Overall, our results reveal that mismatches throughout the crRNA-target duplex can drastically decrease protection provided by Cas12a. However, there are fundamental differences between our heterologous system and natural CRISPR-Cas systems. Adaptation is an important part of CRISPR immunity. As phage mutate and become immune to targeting by existing spacers, new spacers from the phage genome can be added to the CRISPR array [11,70]. It remains to be investigated how mismatched spacers contribute to acquisition of new spacers in type V systems, especially using a primed mechanism as occurs in type I and type II systems.

Materials and methods

Expression plasmid construction

All primers and plasmids used in this study are listed in S1 Table. Cas12a and Cas9 expression plasmids were constructed using pACYCDuet-1. A gene expressing FnCas12a or SpCas9 was inserted downstream of a pBAD promoter in pACYCDuet-1 using Gibson assembly. Promoters were replaced with a previously described low copy P4 promoter [49] using 1 piece Gibson assembly. crRNA expression plasmids were constructed using pUC19. A pBAD promoter was inserted into pUC19 in the multiple cloning site with Gibson assembly. “Round the horn” PCR and ligation was used to add a mini CRISPR array with 1 or 2 spacers downstream of the pBAD promoter. The same method was used to replace mini CRISPR arrays with Cas9 sgRNA expression constructs.

Bacterial and phage strains

E. coli strain BW25113, a K12 derivative [71] was used for all experiments along with a modified lambda phage strain (λvir) [72] that is locked into the lytic lifestyle.

For all CRISPR interference assays, bacteria were transformed with Cas12a and crRNA expression plasmids by heat shock. Transformants were plated on LB plates containing ampicillin at 100 μg/mL and chloramphenicol at 25 μg/mL to select for plasmids pUC19 and pACYCDuet-1, respectively.

The red operon deletion strain was generated by using 20 μL of 1:10 diluted λvir stock to infect E. coli BW25113 cells harboring pUC19 plasmid with approximately 800 bp section of the lambda genome inserted in the multiple cloning site with the lambda red operon removed. The culture was grown in 2 mL LB media containing 25 μg/mL ampicillin and 10 mM MgSO4. After lysis, cell debris was removed by centrifugation and supernatant containing phage was isolated. Phages were then passaged twice through cultures containing E. coli harboring a pACYCDuet-1 FnCas12a expression plasmid and separate pUC19 plasmid allowing expression of a pre-crRNA with 2 spacers targeting different locations in the lambda red operon to select for phage with the operon deleted. Approximately 20 μL of previously isolated phage lysate was added to each subsequent culture. These 2 mL LB media cultures contained 100 μg/mL ampicillin, 25 μg/mL chloramphenicol, 20 mM arabinose, and 10 mM MgSO4. Phage lysates were harvested, and deletions were confirmed with PCR amplification of the lambda red operon flanking region and Sanger sequencing of the PCR product.

Phage spot assays

Overnight cultures were started using E. coli transformed with Cas12a and crRNA expression plasmids in LB media with ampicillin and chloramphenicol for selection. The next day, 20 μL of these overnight cultures were used to inoculate 2 mL cultures in LB media containing 100 μg/mL ampicillin, 25 μg/mL chloramphenicol, 20 mM arabinose, and 10 mM MgSO4. The cultures were grown in a shaking incubator at 200 rpm and 37°C until OD600 0.4 was reached, and 300 μL of cell culture was added to 3 mL 0.7% soft agar containing the same concentrations of ampicillin, chloramphenicol, arabinose, and MgSO4. The cell-soft agar mixture was vortexed for 5 s and spread onto an LB plate containing ampicillin and chloramphenicol. The plate was dried for 5 min. Lambda phage suspended in LB media with 10 mM MgSO4 was serially diluted in 7 steps with a 1:10 dilution each step, and 2 μL of each phage dilution was then spotted on top of the soft agar layer and the plate was dried for 10 min. Plates were incubated overnight at 30°C.

Liquid culture phage assays and growth curves

Overnight cultures were started using a single colony of E. coli with Cas12a and crRNA expression plasmids in LB media with ampicillin and chloramphenicol added for selection. The next day, these overnight cultures were used to inoculate cultures 1 to 100 in LB media containing 100 μg/mL ampicillin, 25 μg/mL chloramphenicol, 20 mM arabinose, and 10 mM MgSO4. The cultures were grown in a shaking incubator at 200 rpm and 37°C for 1.5 h until OD600 approximately 0.25 was reached.

Lambda phage was added at MOI 0.02, or as indicated in figure legends. For growth curves, OD readings were immediately taken after addition of the phage and this value represented the “0 hour” time point. For growth curves shown in Figs 1D, 5A, and S5A, OD was measured at 600 nm wavelength every 1 h in a WPA Biowave CD8000 Cell Density Meter if growing in culture tubes. For growth curves shown in Figs 6B and S4B, 150 μL cultures were grown in a TECAN infinite M Nano+ 96-well plate reader at 280 rpm and 37°C and OD measurements at 600 nm wavelength were measured every 10 min.

Phage plaque assays

E. coli bacterial cultures were prepared and induced the same way as in the phage spot assays. Approximately 50 μL of phage diluted between 1:1 and 1:106 in LB media was added to 300 μL of induced cell culture at OD600 0.4. This mixture was then added to 3 mL 0.7% soft agar containing ampicillin, chloramphenicol, arabinose, and MgSO4 as in the phage spot assays, and the mixture was vortexed for 5 s and spread onto an LB plate containing ampicillin and chloramphenicol. The plate was dried for 10 min and left to incubate at 37°C overnight. Plaques were counted the next morning.

Cas9 and Cas12a expression and purification

Cas9 and Cas12a proteins were expressed in E. coli (DE3) cells. LB broth supplemented with 50 μg/mL kanamycin was inoculated with overnight culture of the cells carrying the expression plasmid in 1:100 ratio. Protein expression was induced by adding 0.5 mM IPTG when the culture reached an OD600 of 0.5 to 0.6. The culture was incubated at 18°C overnight (about 16 h) with shaking.

Cas9 and Cas12a were purified by the following protocol, adapted from previous methods with modifications [53,73]. After harvesting, the cell pellets were resuspended in lysis buffer (20 mM Tris-HCl (pH 8.0), 500 mM NaCl, 5 mM imidazole, and 5% glycerol) supplemented with 1 mM PMSF. The resuspended cells were lysed by sonication and the lysate then was centrifuged to remove insoluble material. The clarified supernatant was transferred to a HisPur Ni-NTA resin (Thermo Fisher Scientific) column pre-equilibrated with lysis buffer without disturbing the pellets. The column was washed with 50 column volumes of lysis buffer, then washed again with 50 column volumes of wash buffer (20 mM Tris-HCl (pH 8.0), 500 mM NaCl, 15 mM imidazole, and 5% glycerol). Elution buffer (20 mM Tris-HCl (pH 8.0), 500 mM NaCl, 250 mM imidazole, and 5% glycerol) was applied to elute bound protein that was collected and cleaved with TEV protease in a 1:100 (w/w) ratio to remove MBP. TEV cleavage reaction was incubated overnight at 4°C during dialysis in dialysis buffer (10 mM HEPES-KOH (pH 7.5), 200 mM KCl, 1 mM DTT, and 5% glycerol). The protein was concentrated and then diluted with dilution buffer (20 mM HEPES-KOH (pH 7.5) and 5% glycerol) to a final concentration of 100 mM KCl. The protein was loaded on a HiTrap Heparin HP (GE Healthcare) column pre-equilibrated with Buffer A (20 mM HEPES-KOH (pH 7.5), 100 mM KCl, and 5% glycerol). The column was washed with 20% of Buffer B (20 mM HEPES-KOH (pH 7.5), 1 M KCl, and 5% glycerol). The protein was eluted with Buffer B by applying a gradient from 20% to 100% over a total volume of 60 mL. Peak fractions were collected and analyzed by SDS-PAGE. Fractions containing interested protein were combined and concentrated to 1 mL volume. The upper concentrator chamber was refilled with SEC buffer (20 mM HEPES-KOH (pH 7.5), 200 mM KCl, and 1 mM DTT) and then centrifuged to 1 mL volume (repeated this step 3 times) in order to exchange buffer. Finally, the concentrated proteins were aliquoted, flash-frozen in liquid nitrogen, and stored at −80°C.

crRNA and tracrRNA preparation

All crRNAs were in vitro transcribed using short oligonucleotides (IDT) consisting of a T7 promoter region and a template sequence. The oligonucleotides first pre-annealed T7 promoter sequence at 90°C for 2 min and then incubated a room temperate for 10 min. The 500 μL transcription reaction was performed in transcription buffer (40 mM Tris (pH 8.0), 38 mM MgCl2, 1 mM Spermidine (pH 8.0), 0.01% Triton X-100, 5 mM ATP, 5 mM CTP, 5 mM GTP, 5 mM UTP, and 5 mM DTT) with 0.5 μM pre-annealed DNA and excess of T7 RNA polymerase at 37°C for 4 h, and 2X RNA dye was added into the reaction, heated for 5 min at 95°C and then kept on ice. The reaction was run on 10% denaturing acrylamide gel for 3 h with 1× TBE buffer. The crRNA band was visualized under UV-light and was excised from the gel. The gel was crushed and soaked overnight in 1 mL nuclease-free water at 4°C with rocking. The gel tube was centrifuged 5 min at 2,000 X g and the supernatant was transferred to Costar Spin-X centrifuge tube filters (Sigma Aldrich). The tube filter was centrifuged at highest speed for 2 min to collect crRNA solution at the collection chamber. Ethanol precipitation was performed to concentrate crRNA.

tracrRNA were also in vitro synthesized as described above, however, the tracrRNA template was cloned into pUC19 plasmid with an EcoRI restriction site at the end of the template sequence. The tracrRNA plasmid was first linearized with EcoRI and then used as template for in vitro transcription without pre-annealing step.

In vitro cleavage assays

Cleavage assays were prepared in reaction buffer (20 mM HEPES (pH 7.5), 100 mM KCl, 1 mM MgCl2, 1 mM DTT, and 5% glycerol) with a final concentration of 50 nM FnCas12a or SpCas9. FnCas12a RNP complex was formed by incubating FnCas12a and crRNA at a 1:1.5 ratio at 37°C for 10 min. Cas9 RNP complex was formed by incubating Cas9:crRNA:tracRNA at a 1:1.5:1.5 ratio at 37°C for 10 min. To initiate cleavage, target plasmid was added to a final concentration of 15 ng/μL and a final reaction volume of 100 μL. The reaction was incubated at 37°C. Aliquots (10 μL) were quenched at 7, 15, 30, 60, 300, 900, and 1,800 s by adding 10 μL phenol-chloroform-isoamyl alcohol (25:24:1 v/v, Invitrogen). After phenol-chloroform extraction, DNA products were separated by electrophoresis on a 1% agarose gel and visualized with SYBR Safe (Invitrogen) staining. The densitometry of individual DNA bands was measured in ImageJ (https://imagej.nih.gov/ij/). The fraction cleaved was determined by dividing the total cleaved DNA (nicked and linearized DNA) by total DNA (nicked, linearized, and supercoiled DNA). Fraction cleaved was plotted versus time and fit to a first-order rate equation to determine an observed rate constant for cleavage (kobs). Rates were measured in triplicate.

High-throughput sequencing sample preparation

Phage samples were isolated from spots in spot assays at the highest phage dilution in which a cleared spot was observed to ensure a diverse population of mutant phages would be sampled. A 1 mL pipette tip was poked through a phage spot and the agar in the tip is suspended in 30 μL deionized H2O in a 1.5 mL microcentrifuge tube and incubated in a 48°C water bath for 20 min to melt the agar and dissolve the phage particles. The tubes were vortexed briefly and incubated in the water bath for another 10 min. The water/melted agar mixture that contains the phage was removed.

For phage samples from liquid culture, 50 μL of each culture was transferred to a 1.5 mL tube and bacteria were pelleted from the liquid culture by centrifuging at 15,000 rpm for 5 min. The supernatant containing phage particles was then removed.

Approximately 5 μL of phage solution was used as the template for a 25 cycle PCR reaction with primers containing Nextera adapters. These PCR products were cleaned up using the Promega Wizard PCR purification kit and used as a template for an 8 cycle PCR reaction to add barcodes for sample identification. Q5 DNA polymerase (New England Biolabs) was used for all adapter and barcode PCR reactions. Samples were pooled and gel purified using the Promega Wizard PCR purification kit. Gel purified samples were then submitted for MiSeq high-throughput sequencing. Conditions for MiSeq runs were Nextera DNA MiSEQ 150-Cycle which included two 75 base pair paired end reads. Adapter PCR primers were designed so both of the paired R1 and R2 reads overlapped in the entire protospacer region including the PAM.

Samples were prepared for PacBio sequencing by 35 cycle PCR amplification of phage samples isolated from liquid culture. PCR products were purified using the Promega Wizard PCR purification kit and submitted for PacBio sequencing.

High-throughput sequencing data processing

A script written in Python 3.8 was used to process fastq data files received from a MiSeq run. The script extracts target region sequences and determines if the target region contains a mutation relative to the wild-type target sequence. Base substitutions and deletions were classified along with the location of the substitution or deletion relative to the PAM sequence of the target. Mutations were also classified based on the type of mutation (A to C for example). A Microsoft Excel sheet was then created with the data using the “Xlsxwriter” Python package [74]. Z-score calculations and heat maps for each sample were created using Microsoft Excel. Z-scores for each position in the phage target regions were calculated using the average proportion of reads with mutations at each position in experimental samples x and control samples μ and the standard deviation of the proportion of mutations at all positions in all Samples σ.

Z=x_μ_σ (1)

A separate script written in Python 3.8 was used to process data files received from PacBio high-throughput sequencing and find deletions in the lambda phage genome. Sequences were extracted from fastq files and matched piecewise to the WT sequence of the genome region that was PCR amplified. Deletions are output as coordinates in the PCR amplified region and these coordinates were translated to the lambda phage genome to create the bar graph in Fig 7B.

Nucleotide diversity was calculated using the proportion of all pairs of sequences xi and xj in each sample and the number of nucleotide differences between each pair of sequences πij divided by the length of the PAM and protospacer region, which is 24 nt in length.

Diversity=124i=2nj=1i12xixjπij (2)

Find the scripts at https://github.com/alexsq2/lambda-phage-CRISPR-mutants.

Generation and purification of mutant phage

The gene L A2T mutant phage reported in Fig 3 was generated on solid media by isolation of single plaques. In this case, 300 μL of E. coli BW25113 transformed with the FnCas12a and MM15 crRNA expression plasmids at OD600 approximately 0.4 was added to 3 mL 0.7% agar containing 100 μg/mL ampicillin, 25 μg/mL chloramphenicol, 20 mM arabinose, 10 mM MgSO4, and 20 μL of undiluted WT phage lysate. Soft agar was vortexed for 5 s and poured onto LB agar plates containing the same concentrations of ampicillin and chloramphenicol. Plates were incubated at 37°C overnight. Phage was isolated from a single plaque by poking into the plaque with a 1 mL pipette tip and the agar was suspended in 20 μL deionized H2O in a 1.5 mL microcentrifuge tube and incubated in a 48°C water bath for approximately 20 min. Soft agar containing phage was transferred to a fresh 1.5 mL tube.

For mutants reported in Fig 7, overnight cultures were started using a single colony of E. coli with Cas12a and indicated crRNA expression plasmids in LB media with ampicillin and chloramphenicol added for selection. The next day, these overnight cultures were used to inoculate cultures 1 to 100 in LB media containing 100 μg/mL ampicillin, 25 μg/mL chloramphenicol, 20 mM arabinose, and 10 mM MgSO4. The cultures were grown in a shaking incubator at 200 rpm and 37°C for 1.5 h until OD600 approximately 0.4 was reached. Lambda phage was added at MOI 0.02. Cultures continued to grow in the shaking incubator for 5 h. Cultures were transferred to 1.5 mL tubes and centrifuged at 5,000 rpm for 5 min. Supernatant containing phage was transferred to a fresh 1.5 mL tube.

In both cases, phage was then diluted and used for phage plaque assays on lawns of bacteria expressing the same crRNA as in the previous infection to select against remaining WT phage. Single plaques were isolated and used to infect bacterial cultures again expressing the same crRNA under the same conditions as described above. Cultures were grown in a shaking incubator at 200 rpm and 37°C for 5 h. Approximately 5 μL of phage solution was then used as a template for a 35 cycle PCR reaction with Phusion polymerase to amplify the target region. Sanger sequencing was used to confirm the presence and purity of mutations in the target region.

Mutant phage competition assay

Single-mutant phages were purified as in “Generation and purification of mutant phage”. Two different mutant phage lysates were mixed in 1:2 and 2:1 ratios by titer and these mixes were serially diluted 1:10 in 7 dilution steps in LB media containing 10 mM MgSO4, and 3 μL of each dilution was added to 150 μL E. coli BW25113 cultures at OD600 0.4 with cells harboring a pACYCDuet-1 FnCas12a expression plasmid and separate pUC19 plasmid allowing expression of a pre-crRNA targeting the region of the lambda phage genome containing the mutation. These 150 μL LB media cultures contained 100 μg/mL ampicillin, 25 μg/mL chloramphenicol, 20 mM arabinose, and 10 mM MgSO4. Complete lysis was observed for all cultures at 8 h after infection and phage lysates were isolated by centrifugation and removal of the supernatant. Sequencing samples were prepared as in “High-throughput sequencing sample preparation” to determine the proportion of each mutant phage in each sample. Ratio of seed:PAM-distal mutants in the population were determined by dividing the number of reads for the seed mutant by the number of reads for the PAM-distal mutant for each sample.

Sanger sequencing of phage plaques or phage lysate

Phage was isolated from a single plaque by poking into the plaque with a 1 mL pipette tip and the agar was suspended in 20 μL deionized H2O in a 1.5 mL microcentrifuge tube and incubated in a 48°C water bath for approximately 20 min. Melted agar and H2O mixture containing phages was transferred to a 1.5 mL microcentrifuge tube.

Phage was also isolated from liquid cultures by transferring 1 mL of liquid culture to a 1.5 mL microcentrifuge tube and centrifuging at 15,000 rpm for 5 min. Supernatant containing phages was transferred to a clean 1.5 mL microcentrifuge tube.

Approximately 5 μL of phage solution was used as a template for a PCR reaction that amplifies the target region in the middle of approximately 800 base pair PCR product. One of the primers used for the PCR reaction was used for sequencing of the PCR product.

Supporting information

S1 Fig. Cleavage assays by FnCas12a with single mismatch crRNAs.

(A) Sequence of the target DNAs, perfectly matching crRNAs and single-mismatched crRNAs. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) using perfectly matching crRNA or single-mismatched crRNA by FnCas12a, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. (C) Quantification of cleaved products (linear and nicked fractions) from the time course cleavage. Averages of the cleaved fraction values were plotted versus time and fit to a first-order rate equation with error bars; n = 3 replicates. For values reported in Fig 1C, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for data quantification.

(EPS)

S2 Fig. Analysis of phage mutations that emerge following exposure to Cas12a-mediated interference with mismatched crRNAs.

(A) Line graphs showing the nucleotide diversity of phage target regions over time after exposure to bacteria cells expressing crRNAs with and without mismatches. Target regions are gene J or gene L and crRNAs either match the target region (perfect) or contain mismatches at position x. Nucleotide diversity is calculated using the proportion of each sequence in the sample and the number of nucleotide differences between each pair of sequences. Two individual replicates are shown for each condition. See S3 Data for quantified data. (B) Heat maps showing location of target mutations that arose due to CRISPR targeting by FnCas12a on a solid medium. Z-scores for abundance of single-nucleotide variants, including nucleotide identity changes or deletions, were determined for each sample relative to the non-targeted control phage population. Enriched sequences indicate high Z-scores. Z-scores range from 0 (white) to 7.7 (darkest red). Single-nucleotide deletions are shown at adjacent position to the 3′ side. Positions with crRNA mismatches are labeled with solid black boxes in the heat map. See S4 Data for variant abundance quantification. (C) Graphs showing single-nucleotide variations for mutated phage target sequences present at the 8 h time point for 2 individual replicates. Bar graph height shows the proportion of sequences in each sample with the mutation type at each position in the target. Deletions (Δ) are plotted at the first position where a mismatch occurs between the crRNA and the target. See S3 Data for variant abundance quantification.

(EPS)

S3 Fig. MiSeq sample counts and R1/R2 file overlap.

(A) Table showing absolute counts from MiSeq for each replicate of the 8 h time point for each experimental condition for E. coli infected with wild-type phage. Each count represents an extracted sequence in which R1 and R2 reads matched. The negative control (non-targeting crRNA) samples were run in a separate MiSeq run to maximize the number of reads and minimize barcode overlap with mutated samples, allowing for analysis of preexisting mutants in the wild-type population. (B) Bar charts showing mutated sequences at each position in the high-throughput sequencing reads of the negative control lambda phage population for the gene J and gene L region. The target region is highlighted with a red box. R1 and R2 reads do not overlap in the target region (no overlap) or overlap in the target region (target overlap). R1 reads are used for the target region in the no overlap condition. When R1 and R2 reads overlap, sequences in which the target region sequence does not agree for both the R1 and R2 reads are removed from analysis and are not shown in this figure. This measure was taken to ensure that variations observed in negative control samples arose solely from PCR errors or the natural variation of the population. While some variations were still observed with stringent R1/R2 overlap enforced, it is not possible to distinguish PCR errors from natural variation. No mutations were substantially enriched outside of the target region for any of the samples tested in this study. See S3 Data for R1 and R2 variant read counts.

(EPS)

S4 Fig. Single deletions enriched by CRISPR exposure.

Bar charts showing single-nucleotide deletions from the lambda phage gene J target (A) and gene L target (B) in phage that were exposed to cells expressing a non-targeting crRNA (CRISPR inactive) and cells expressing crRNAs with and without mismatches to the lambda phage gene J target (C). (A, B) Deletions are mapped along the target sequences for all time points and both biological replicates for the negative control samples. (C) CRISPR active samples shown for gene J target that contained deletion sequences that represented more than 1% of all reads. No such deletions were observed in the gene L target. Deletions were observed in the gene L coding region in phage in the wild-type population. The deletions could remain in genomes in the population as these genomes are packaged along with functional structural proteins in successfully infected cells. See S3 Data for quantification of single-nucleotide deletion abundance.

(EPS)

S5 Fig. Deletions in non-essential genomic regions that are selected following Cas12a targeting preexist in the phage population.

(A) PCR amplification of regions surrounding essential and non-essential genes targeted by Cas12a. Spot assays were performed using E. coli expressing crRNAs that match 2 non-essential (nin204 and nin146) and 2 essential regions (gene J and gene L) of the lambda phage genome. Controls were performed with a plasmid not encoding a crRNA. Phages were isolated from the phage spots and target and flanking regions of the phage genome were PCR amplified and run on an agarose gel. (B) Sanger sequencing chromatograms of phage genome deletions in non-essential regions targeted by Cas12a. Non-essential regions in the lambda phage genome were targeted with matching crRNAs on solid media. Phages were isolated and the target regions were PCR amplified. Single bands were gel purified and PCR amplified in a second round. These second PCR products were sequenced and the chromatograms were aligned to the WT lambda phage genome. Homology at each end of the deletions was identified and highlighted in blue. (C) Map of genomic deletions observed by PacBio sequencing of PCR amplicons from phage unexposed to CRISPR targeting. DNA from lambda phage unexposed to CRISPR targeting was used as a template for PCR reactions that amplified the same non-essential regions as in (A). This PCR product was sequenced with PacBio long-read sequencing and the obtained sequences were matched with the wild-type lambda genome sequence to identify any deletions present. These deletions are plotted on the chart relative to their position in the genome. The quantity of each deletion is identified by a color code. Deletions found by Sanger sequencing in CRISPR active samples are indicated with an asterisk (*). See S5 Data for sequences and quantification.

(EPS)

S6 Fig. Growth curves and mutant emergence for Δred phage infection.

(A) Growth curves of E. coli expressing FnCas12a and crRNAs bearing non-targeting, perfectly matching, or mismatched crRNAs infected with Δred phage. Phage was added when the cells reached mid log phase at approximately 2 h after inoculation. The average of 3 replicates is plotted for each condition, with error bars representing standard deviation. Large error bars indicate that not all replicate cultures lysed. See S2 Data for quantified data. (B) Graphs showing mutation type in Δred phage target sequences present at the 8 h time point. Bar graph height shows the proportion of sequences in each sample with the mutation type at each position in the target. Deletions (Δ) and multiple mutations are plotted at the first position where a mismatch occurs between the crRNA and the target. The average of 3 replicates is shown, with error bars representing standard deviation. See S3 Data for variant abundance quantification. (C) Comparison of target sequences of phage isolated from cultures in (A) containing cells expressing a crRNA targeting gene J with a mismatch at position 8. The WT target sequence is underlined. Three individual replicates are shown and the percent of each mutant sequence in the sample is listed. Mutated positions relative to the WT sequence are highlighted in orange.

(EPS)

S7 Fig. Cleavage assays by FnCas12a with double mismatch crRNAs.

(A) Sequence of the gene J target DNA, perfectly matching crRNA and double-mismatched crRNAs. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) using perfectly matching crRNA or double-mismatched crRNA by FnCas12a, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. The gel for the perfect crRNA is reproduced from S1B Fig. (C) Quantification of cleaved products from the time course cleavage. The average cleaved fraction was plotted versus time and fit to a first-order rate equation with error bars representing standard deviation; n = 3 replicates. For values reported in Fig 3B, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for quantification. Quantification for the perfect crRNA is also shown in S1C Fig.

(EPS)

S8 Fig. Cleavage assays by FnCas12a of wild-type and mutant target sequences.

(A) Sequences the perfectly matching crRNA, position 15 mismatched crRNA, and 3 gene L target sequences used for cleavage assays. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) bearing a wild-type (WT), A2T, or G17T-containing gene L sequence using perfectly matching crRNA or position 15 mismatched crRNA by FnCas12a, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. The gels for the perfect crRNA and 15 mismatched crRNA cleaving WT target are reproduced from S1B Fig. (C) Quantification of cleaved products from the time course cleavage. The average cleaved fraction was plotted versus time and fit to a first-order rate equation with error bars representing standard deviation; n = 3 replicates. For values reported in Fig 3D, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for quantification. Quantifications for perfect and MM15 crRNA cleaving WT target are also shown in S1C Fig.

(EPS)

S9 Fig. Cleavage assays by SpCas9 of wild-type and mutant target sequences.

(A) Sequences the perfectly matching crRNA, position 15 mismatched crRNA, and 3 gene L target sequences used for cleavage assays. Cleavage was performed using a crRNA-tracrRNA pair. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) bearing a wild-type (WT), G2T, or A17T-containing gene L sequence using perfectly matching crRNA or position 15 mismatched crRNA by SpCas9, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. (C) Quantification of cleaved products from the time course cleavage. The average cleaved fraction was plotted versus time and fit to a first-order rate equation with error bars representing standard deviation; n = 3 replicates. For values reported in Fig 4A, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for quantification.

(EPS)

S10 Fig. Phage protection by and mutant emergence from SpCas9 with sgRNA mismatches.

(A) Spot assays using lambda phage on lawns of bacteria expressing SpCas9 along with sgRNAs with and without mismatches. SgRNAs target gene J or gene L and contain mismatches at position X or match the target (perfect). (B) Graphs showing single-nucleotide variants in phage target sequences present at the 8 h time point following challenge by Cas9 bearing different sgRNAs. Bar graph height shows the proportion of sequences in each sample with the mutation type at each position in the target. Deletions (Δ) are plotted at the first position where a mismatch occurs between the crRNA and the target. The average of 3 replicates is shown, with error bars representing standard deviation. See S7 Data for variant abundance quantification.

(EPS)

S11 Fig. Mutant emergence at varied MOIs.

(A) Growth curves using cells expressing a crRNA targeting gene J with a mismatch in the seed region and infected with phage at different MOIs. Phage was added at the indicated MOIs when cells reached mid log phase and the OD600 of the culture was measured over time. Cultures at lower MOIs did not lyse and are omitted from the graph. See S2 Data for quantified data. (B) Spot assays estimating the titer of phage lysates exposed to interference by Cas12a armed with a seed mismatched crRNA. Full plates from Fig 4B, including lowest MOI samples which produced phages with low titers.

(EPS)

S12 Fig. Phage targeted by multiple spacers develops mutations in 1 or more targeted regions.

(A) Spot assays performed using lambda phage that previously infected E. coli in liquid culture expressing FnCas12a and 2 different crRNAs targeting gene J and L (lysate spacers J+L) or 2 of the same crRNA targeting gene L (lysate spacers L+L) both with mismatches in the seed region. Phage was harvested at different time points of the liquid culture (0, 1, 2, 3, and 5 h after infection). The previous phage lysates were spotted on cells expressing a crRNA that matches the gene L target in the lambda genome (gene L perfect). (B) Sanger sequencing chromatograms showing sequences of target regions of the genome in phage exposed to cells expressing 2 mismatched crRNAs targeting gene J and gene L, respectively. Phage from single plaques was isolated and both target regions were sequenced. Mutated bases are highlighted.

(EPS)

S13 Fig. Purified single-mutant and double-mutant chromatograms.

(A–H) Sanger sequencing chromatograms of single- and double-mutant phage lysates. Single-mutant phages were generated by exposure to crRNAs with mismatches (MM crRNA) at different positions (position X) and purified as shown in Fig 7A. Mutants were generated in the gene J and gene L CRISPR target. Purified single-mutant phage was used to challenge bacteria expressing a perfect crRNA and target regions were sequenced by Sanger sequencing to determine if second mutations appeared. Both mixed and clonal double-mutant populations were generated after this step. Mutated bases are highlighted.

(EPS)

S1 Table. Lists of plasmids, primers, and oligonucleotides used in this study.

(XLSX)

S1 Data. Quantification of percent cleaved over time for cleavage assays.

Contains data presented in Figs 2C, 3B, 3D, 4A, S1C, S7C, S8C, and S9C.

(XLSX)

S2 Data. Quantification of bacterial liquid growth curves and plaque assays.

Contains data presented in Figs 1D, 4B, 6B, 6C, 7B, S6A, and S11A.

(XLSX)

S3 Data. Quantification of high-throughput sequencing data for gene J and L targets in wild-type and Δred phage upon Cas12a challenge with perfect and mismatched crRNAs in liquid media.

Contains data presented in Figs 2B, 2C, S2A, S2C, S3, S4, and S6B.

(XLSX)

S4 Data. Quantification of high-throughput sequencing data for gene J and L targets upon Cas12a challenge with perfect and mismatched crRNAs on solid media.

Contains data presented in S2B Fig.

(XLSX)

S5 Data. Quantification of PacBio sequencing of nin146 and nin204 genomic regions following Cas12a challenge.

Contains data presented in S5C Fig.

(XLSX)

S6 Data. Quantification of phage competition between phages containing mutations in the seed or PAM-distal region of the gene L target.

Contains data presented in Fig 3F.

(XLSX)

S7 Data. Quantification of high-throughput sequencing data for gene J and L targets upon Cas9 challenge with perfect and mismatched crRNAs in liquid media.

Contains data presented in Figs 4C and S10B.

(XLSX)

S8 Data. Quantification of high-throughput sequencing data for gene J target upon phage infection at different MOIs.

Contains data presented in Fig 5B.

(XLSX)

S1 Raw Images. PDF file containing all raw gel images.

Original gel images for all images presented in the Supporting information figures or used for triplicate quantification of Cas12a or Cas9 cleavage rates. All gels were visualized using SyberSafe staining. When relevant, annotations list which samples were used for gel images in S1B, S5A, S7B, S8B or S9B Figs. Samples that were not included in the Supporting information figures contain replicates that were used for quantification of observed rate constants. All samples are ordered as labeled in the relevant Supporting information figure image.

(PDF)

Acknowledgments

We thank Michael Baker and Kevin Cavallin of the Iowa State DNA Facility for advice on MiSeq sample preparation and data processing. MiSeq sequencing was performed at the Iowa State DNA Facility and PacBio sequencing was performed by the DNA Sequencing Center of Brigham Young University.

Data Availability

All relevant data are within the paper and its Supporting Information files.” Scripts used to analyze high-throughput sequencing data are available at https://github.com/alexsq2/lambda-phage-CRISPR-mutants.

Funding Statement

Financial support for this research was provided by National Science Foundation award 1652661 (to D.G.S.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Grissa I, Vergnaud G, Pourcel C. The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats. BMC Bioinformatics. 2007;8:172. doi: 10.1186/1471-2105-8-172 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Godde JS, Bickerton A. The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal transfer among prokaryotes. J Mol Evol. 2006;62:718–729. doi: 10.1007/s00239-005-0223-z [DOI] [PubMed] [Google Scholar]
  • 3.Makarova KS, Wolf YI, Iranzo J, Shmakov SA, Alkhnbashi OS, Brouns SJJ, Charpentier E, et al. Evolutionary classification of CRISPR–Cas systems: a burst of class 2 and derived variants. Nat Rev Microbiol. 2020;18:67–83. doi: 10.1038/s41579-019-0299-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Sorek R, Lawrence CM, Wiedenheft B. CRISPR-Mediated Adaptive Immune Systems in Bacteria and Archaea. Annu Rev Biochem. 2013;82:237–266. doi: 10.1146/annurev-biochem-072911-172315 [DOI] [PubMed] [Google Scholar]
  • 5.Mohanraju P. Diverse evolutionary roots and mechanistic variations of the CRISPR-Cas systems. Science. 2016;353. doi: 10.1126/science.aad5147 [DOI] [PubMed] [Google Scholar]
  • 6.Johnke J. Multiple micro-predators controlling bacterial communities in the environment. Curr Opin Biotechnol. 2014;27:185–190. doi: 10.1016/j.copbio.2014.02.003 [DOI] [PubMed] [Google Scholar]
  • 7.Fuhrman JA. Marine viruses and their biogeochemical and ecological effects. Nature. 1999;399:541–548. doi: 10.1038/21119 [DOI] [PubMed] [Google Scholar]
  • 8.Wommack KE, Ravel J, Hill RT, Colwell RR. Hybridization Analysis of Chesapeake Bay Virioplankton. Appl Environ Microbiol. 1999;65:241–250. doi: 10.1128/AEM.65.1.241-250.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Avrani S, Wurtzel O, Sharon I, Sorek R, Lindell D. Genomic island variability facilitates Prochlorococcus–virus coexistence. Nature. 2011;474:604–608. doi: 10.1038/nature10172 [DOI] [PubMed] [Google Scholar]
  • 10.Deveau H, Barrangou R, Garneau JE, Labonté J, Fremaux C, Boyaval P, et al. Phage Response to CRISPR-Encoded Resistance in Streptococcus thermophilus. J Bacteriol. 2008;190:1390–1400. doi: 10.1128/JB.01412-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Datsenko KA. Molecular memory of prior infections activates the CRISPR/Cas adaptive bacterial immunity system. Nat Commun. 2012;3:945. doi: 10.1038/ncomms1937 [DOI] [PubMed] [Google Scholar]
  • 12.Sun CL. Phage mutations in response to CRISPR diversification in a bacterial population. Environ Microbiol. 2013;15:463–470. doi: 10.1111/j.1462-2920.2012.02879.x [DOI] [PubMed] [Google Scholar]
  • 13.Rossetti G. The structural impact of DNA mismatches. Nucleic Acids Res. 2015;43:4309–4321. doi: 10.1093/nar/gkv254 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Sugimoto N, Nakano M, Nakano S. Thermodynamics−Structure Relationship of Single Mismatches in RNA/DNA Duplexes. Biochemistry. 2000;39:11270–11281. doi: 10.1021/bi000819p [DOI] [PubMed] [Google Scholar]
  • 15.Borer PN, Dengler B, Tinoco I, Uhlenbeck OC. Stability of ribonucleic acid double-stranded helices. J Mol Biol. 1974;86:843–853. doi: 10.1016/0022-2836(74)90357-x [DOI] [PubMed] [Google Scholar]
  • 16.Feng H, Guo J, Wang T, Zhang C, Xing X. Guide-target mismatch effects on dCas9–sgRNA binding activity in living bacterial cells. Nucleic Acids Res. 2021;49:1263–1277. doi: 10.1093/nar/gkaa1295 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Meeske AJ, Nakandakari-Higa S, Marraffini LA. Cas13-induced cellular dormancy prevents the rise of CRISPR-resistant bacteriophage. Nature. 2019;1. doi: 10.1038/s41586-019-1257-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zheng Y. Endogenous Type I CRISPR-Cas: From Foreign DNA Defense to Prokaryotic Engineering. Front Bioeng Biotechnol. 2020:8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Chylinski K, Makarova KS, Charpentier E, Koonin EV. Classification and evolution of type II CRISPR-Cas systems. Nucleic Acids Res. 2014;42:6091–6105. doi: 10.1093/nar/gku241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Li Y. Cmr1 enables efficient RNA and DNA interference of a III-B CRISPR-Cas system by binding to target RNA and crRNA. Nucleic Acids Res. 2017;45:11305–11314. doi: 10.1093/nar/gkx791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Wang R, Li H. The mysterious RAMP proteins and their roles in small RNA-based immunity. Protein Sci. 2012;21:463–470. doi: 10.1002/pro.2044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kolesnik MV, Fedorova I, Karneyeva KA, Artamonova DN, Severinov KV. Type III CRISPR-Cas Systems: Deciphering the Most Complex Prokaryotic Immune System. Biochemistry (Mosc). 2021;86:1301–1314. doi: 10.1134/S0006297921100114 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Vercoe RB. Cytotoxic Chromosomal Targeting by CRISPR/Cas Systems Can Reshape Bacterial Genomes and Expel or Remodel Pathogenicity Islands. PLoS Genet. 2013;9:e1003454. doi: 10.1371/journal.pgen.1003454 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Mojica FJM, Díez-Villaseñor C, García-Martínez J, Almendros C. Short motif sequences determine the targets of the prokaryotic CRISPR defence system. Microbiology. 2009;155:733–740. doi: 10.1099/mic.0.023960-0 [DOI] [PubMed] [Google Scholar]
  • 25.Sternberg SH, Redding S, Jinek M, Greene EC, Doudna JA. DNA interrogation by the CRISPR RNA-guided endonuclease Cas9. Nature. 2014;507:62–67. doi: 10.1038/nature13011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Zetsche B. Cpf1 Is a Single RNA-Guided Endonuclease of a Class 2 CRISPR-Cas System. Cell. 2015;163:759–771. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Diversity Horvath P., Activity, and Evolution of CRISPR Loci in Streptococcus thermophilus. J Bacteriol. 2008;190:1401–1412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Pattanayak V. High-throughput profiling of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity. Nat Biotechnol. 2013;31:839–843. doi: 10.1038/nbt.2673 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Cong L. Multiplex Genome Engineering Using CRISPR/Cas Systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Künne T, Swarts DC, Brouns SJJ. Planting the seed: target recognition of short guide RNAs. Trends Microbiol. 2014;22:74–83. doi: 10.1016/j.tim.2013.12.003 [DOI] [PubMed] [Google Scholar]
  • 31.Semenova E. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence. Proc Natl Acad Sci U S A. 2011;108:10098–10103. doi: 10.1073/pnas.1104144108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Jiang W, Bikard D, Cox D, Zhang F, Marraffini LA. RNA-guided editing of bacterial genomes using CRISPR-Cas systems. Nat Biotechnol. 2013;31:233–239. doi: 10.1038/nbt.2508 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Maier L-K. Essential requirements for the detection and degradation of invaders by the Haloferax volcanii CRISPR/Cas system I-B. RNA Biol. 2013;10:865–874. doi: 10.4161/rna.24282 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Wiedenheft B. RNA-guided complex from a bacterial immune system enhances target recognition through seed sequence interactions. Proc Natl Acad Sci U S A. 2011;108:10092–10097. doi: 10.1073/pnas.1102716108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Tao P, Wu X, Rao V. Unexpected evolutionary benefit to phages imparted by bacterial CRISPR-Cas9. Sci Adv Dermatol. 2018;4:eaar4134. doi: 10.1126/sciadv.aar4134 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Chabas H. Variability in the durability of CRISPR-Cas immunity. Phil Trans R Soc B Biol Sci. 2019;374:20180097. doi: 10.1098/rstb.2018.0097 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Pereira-Gómez M, Sanjuán R. Effect of mismatch repair on the mutation rate of bacteriophage ϕX174. Virus Evol. 2015;1:vev010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Wu X, Zhu J, Tao P, Rao VB. Bacteriophage T4 Escapes CRISPR Attack by Minihomology Recombination and Repair. MBio;12:e01361–e01321. doi: 10.1128/mBio.01361-21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Hossain AA, McGinn J, Meeske AJ, Modell JW, Marraffini LA. Viral recombination systems limit CRISPR-Cas targeting through the generation of escape mutations. Cell Host Microbe. 2021;29:1482–1495.e12. doi: 10.1016/j.chom.2021.09.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Mojica FJM, Díez-Villaseñor C, García-Martínez J, Soria E. Intervening Sequences of Regularly Spaced Prokaryotic Repeats Derive from Foreign Genetic Elements. J Mol Evol. 2005;60:174–182. doi: 10.1007/s00239-004-0046-3 [DOI] [PubMed] [Google Scholar]
  • 41.Bolotin A, Quinquis B, Sorokin A, Ehrlich SD. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. Microbiology. 2005;151:2551–2561. doi: 10.1099/mic.0.28048-0 [DOI] [PubMed] [Google Scholar]
  • 42.Heidelberg JF, Nelson WC, Schoenfeld T, Bhaya D. Germ Warfare in a Microbial Mat Community: CRISPRs Provide Insights into the Co-Evolution of Host and Viral Genomes. PLoS ONE. 2009;4:e4169. doi: 10.1371/journal.pone.0004169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Semenova E, Nagornykh M, Pyatnitskiy M, Artamonova II, Severinov K. Analysis of CRISPR system function in plant pathogen Xanthomonas oryzae. FEMS Microbiol Lett. 2009;296:110–116. doi: 10.1111/j.1574-6968.2009.01626.x [DOI] [PubMed] [Google Scholar]
  • 44.Pourcel C, Salvignol G, Vergnaud G. CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies. Microbiology. 2005;151:653–663. doi: 10.1099/mic.0.27437-0 [DOI] [PubMed] [Google Scholar]
  • 45.Andersson AF, Banfield JF. Virus Population Dynamics and Acquired Virus Resistance in Natural Microbial Communities. Science. 2008;320:1047–1050. doi: 10.1126/science.1157358 [DOI] [PubMed] [Google Scholar]
  • 46.Sun CL, Thomas BC, Barrangou R, Banfield JF. Metagenomic reconstructions of bacterial CRISPR loci constrain population histories. ISME J. 2016;10:858–870. doi: 10.1038/ismej.2015.162 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Nussenzweig PM, McGinn J, Marraffini LA. Cas9 Cleavage of Viral Genomes Primes the Acquisition of New Immunological Memories. Cell Host Microbe. 2019;26:515–526.e6. doi: 10.1016/j.chom.2019.09.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Fineran PC. Degenerate target sites mediate rapid primed CRISPR adaptation. Proc Natl Acad Sci U S A. 2014;111:E1629–E1638. doi: 10.1073/pnas.1400071111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Xue C. CRISPR interference and priming varies with individual spacer sequences. Nucleic Acids Res. 2015;43:10831–10847. doi: 10.1093/nar/gkv1259 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Xue C, Whitis NR, Sashital DG. Conformational Control of Cascade Interference and Priming Activities in CRISPR Immunity. Mol Cell. 2016;64:826–834. doi: 10.1016/j.molcel.2016.09.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Cho SW. Analysis of off-target effects of CRISPR/Cas-derived RNA-guided endonucleases and nickases. Genome Res. 2014;24:132–141. doi: 10.1101/gr.162339.113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Hsu PD. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat Biotechnol. 2013;31:827–832. doi: 10.1038/nbt.2647 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Murugan K, Seetharam AS, Severin AJ, Sashital DG. CRISPR-Cas12a has widespread off-target and dsDNA-nicking effects. J Biol Chem. 2020. doi: 10.1074/jbc.RA120.012933 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Strohkendl I, Saifuddin FA, Rybarski JR, Finkelstein IJ, Russell R. Kinetic Basis for DNA Target Specificity of CRISPR-Cas12a. Mol Cell. 2018;71:816–824.e3. doi: 10.1016/j.molcel.2018.06.043 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Jinek M. A Programmable Dual-RNA–Guided DNA Endonuclease in Adaptive Bacterial Immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Jones SK. Massively parallel kinetic profiling of natural and engineered CRISPR nucleases. Nat Biotechnol. 2020:1–10. doi: 10.1038/s41587-020-0646-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kleinstiver BP. Genome-wide specificities of CRISPR-Cas Cpf1 nucleases in human cells. Nat Biotechnol. 2016;34:869–874. doi: 10.1038/nbt.3620 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Endo A, Masafumi M, Kaya H, Toki S. Efficient targeted mutagenesis of rice and tobacco genomes using Cpf1 from Francisella novicida. Sci Rep. 2016;6:38169. doi: 10.1038/srep38169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Alok A. The Rise of the CRISPR/Cpf1 System for Efficient Genome Editing in Plants. Front Plant Sci. 2020:11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Kim D. Genome-wide analysis reveals specificities of Cpf1 endonucleases in human cells. Nat Biotechnol. 2016;34:863–868. doi: 10.1038/nbt.3609 [DOI] [PubMed] [Google Scholar]
  • 61.Zhong Z. Plant Genome Editing Using FnCpf1 and LbCpf1 Nucleases at Redefined and Altered PAM Sites. Mol Plant. 2018;11:999–1002. doi: 10.1016/j.molp.2018.03.008 [DOI] [PubMed] [Google Scholar]
  • 62.Chen JS et al. CRISPR-Cas12a target binding unleashes indiscriminate single-stranded DNase activity. Science. 2018;360:436–439. doi: 10.1126/science.aar6245 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Phan PT, Schelling M, Xue C, Sashital DG. Fluorescence-based methods for measuring target interference by CRISPR-Cas systems. Methods Enzymol. 2019;616:61–85. doi: 10.1016/bs.mie.2018.10.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Liu Y et al. Covalent Modifications of the Bacteriophage Genome Confer a Degree of Resistance to Bacterial CRISPR Systems. J Virol. 2020;94:e01630–e01620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Gasiunas G, Barrangou R, Horvath P, Siksnys V. Cas9–crrna ribonucleoprotein complex mediates specific DNA cleavage for adaptive immunity in bacteria. Proc Natl Acad Sci. 2012;109. doi: 10.1073/pnas.1208507109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Barrangou R et al. CRISPR Provides Acquired Resistance Against Viruses in Prokaryotes. Science. 2007;315:1709–1712. doi: 10.1126/science.1138140 [DOI] [PubMed] [Google Scholar]
  • 67.van Houte S et al. The diversity-generating benefits of a prokaryotic adaptive immune system. Nature. 2016;532:385–388. doi: 10.1038/nature17436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Held NL, Whitaker RJ. Viral biogeography revealed by signatures in Sulfolobus islandicus genomes. Environ Microbiol. 2009;11:457–466. doi: 10.1111/j.1462-2920.2008.01784.x [DOI] [PubMed] [Google Scholar]
  • 69.Zoephel J. RNA-Seq analyses reveal CRISPR RNA processing and regulation patterns. Biochem Soc Trans. 2013;41. doi: 10.1042/BST20130129 [DOI] [PubMed] [Google Scholar]
  • 70.Pyenson NC, Marraffini LA. Co-evolution within structured bacterial communities results in multiple expansion of CRISPR loci and enhanced immunity. Elife. 2020;9:e53078. doi: 10.7554/eLife.53078 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Baba T et al. Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol Syst Biol. 2006;2:2006.0008. doi: 10.1038/msb4100050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Jacob F, Wollman EL. Processes of conjugation and recombination in Escherichia coli. I. Induction by conjugation or zygotic induction. Ann Inst Pasteur (Paris). 2006;91:486–510. [PubMed] [Google Scholar]
  • 73.Murugan K, Suresh SK, Seetharam AS, Severin AJ, Sashital DG. Systematic in vitro specificity profiling reveals nicking defects in natural and engineered CRISPR–cas9 variants. Nucleic Acids Res. 2021;49(7):4037–4053. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Creating Excel files with Python and XlsxWriter—XlsxWriter Documentation. https://xlsxwriter.readthedocs.io/index.html.

Decision Letter 0

Paula Jauregui, PhD

18 Jul 2022

Dear Dr. Sashital,

Thank you for submitting your manuscript entitled "CRISPR-Cas effector specificity and target mismatches determine phage escape outcomes" for consideration as a Research Article by PLOS Biology. I apologize for not sending this email earlier, due to the summer holiday season we are a bit slower than usual.

Your manuscript has now been evaluated by the PLOS Biology editorial staff and I am writing to let you know that we would like to send your submission out for external peer review.

However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. To this end, please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. After your manuscript has passed the checks it will be sent out for review. To provide the metadata for your submission, please Login to Editorial Manager (https://www.editorialmanager.com/pbiology) within two working days, i.e. by Jul 20 2022 11:59PM.

If your manuscript has been previously peer-reviewed at another journal, PLOS Biology is willing to work with those reviews in order to avoid re-starting the process. Submission of the previous reviews is entirely optional and our ability to use them effectively will depend on the willingness of the previous journal to confirm the content of the reports and share the reviewer identities. Please note that we reserve the right to invite additional reviewers if we consider that additional/independent reviewers are needed, although we aim to avoid this as far as possible. In our experience, working with previous reviews does save time.

If you would like us to consider previous reviewer reports, please edit your cover letter to let us know and include the name of the journal where the work was previously considered and the manuscript ID it was given. In addition, please upload a response to the reviews as a 'Prior Peer Review' file type, which should include the reports in full and a point-by-point reply detailing how you have or plan to address the reviewers' concerns.

During the process of completing your manuscript submission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF.

Feel free to email us at plosbiology@plos.org if you have any queries relating to your submission.

Kind regards,

Paula

Paula Jauregui, PhD,

Senior Editor

PLOS Biology

pjaureguionieva@plos.org

Decision Letter 1

Paula Jauregui, PhD

20 Sep 2022

Dear Dr. Sashital,

Thank you for your patience while your manuscript "CRISPR-Cas effector specificity and target mismatches determine phage escape outcomes" was peer-reviewed at PLOS Biology. It has now been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by several independent reviewers.

In light of the reviews, which you will find at the end of this email, we would like to invite you to revise the work to thoroughly address the reviewers' reports.

As you will see below, the reviewers agree that a major weakness of the manuscript is the lack of mechanistic insight. This is highlighted by the comments of reviewer #2 and #3. Addressing the reviewers' criticisms would require additional experimental work, improved clarity and depth in your discussion. Please address all the reviewers' comments.

Given the extent of revision needed, we cannot make a decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is likely to be sent for further evaluation by all or a subset of the reviewers.

We expect to receive your revised manuscript within 3 months. Please email us (plosbiology@plos.org) if you have any questions or concerns, or would like to request an extension.

At this stage, your manuscript remains formally under active consideration at our journal; please notify us by email if you do not intend to submit a revision so that we may withdraw it.

**IMPORTANT - SUBMITTING YOUR REVISION**

Your revisions should address the specific points made by each reviewer. Please submit the following files along with your revised manuscript:

1. A 'Response to Reviewers' file - this should detail your responses to the editorial requests, present a point-by-point response to all of the reviewers' comments, and indicate the changes made to the manuscript.

*NOTE: In your point-by-point response to the reviewers, please provide the full context of each review. Do not selectively quote paragraphs or sentences to reply to. The entire set of reviewer comments should be present in full and each specific point should be responded to individually, point by point.

You should also cite any additional relevant literature that has been published since the original submission and mention any additional citations in your response.

2. In addition to a clean copy of the manuscript, please also upload a 'track-changes' version of your manuscript that specifies the edits made. This should be uploaded as a "Revised Article with Changes Highlighted" file type.

*Re-submission Checklist*

When you are ready to resubmit your revised manuscript, please refer to this re-submission checklist: https://plos.io/Biology_Checklist

To submit a revised version of your manuscript, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' where you will find your submission record.

Please make sure to read the following important policies and guidelines while preparing your revision:

*Published Peer Review*

Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*PLOS Data Policy*

Please note that as a condition of publication PLOS' data policy (http://journals.plos.org/plosbiology/s/data-availability) requires that you make available all data used to draw the conclusions arrived at in your manuscript. If you have not already done so, you must include any data used in your manuscript either in appropriate repositories, within the body of the manuscript, or as supporting information (N.B. this includes any numerical values that were used to generate graphs, histograms etc.). For an example see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5

*Blot and Gel Data Policy*

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare them now, if you have not already uploaded them. Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Thank you again for your submission to our journal. We hope that our editorial process has been constructive thus far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Paula

---

Paula Jauregui, PhD,

Senior Editor

PLOS Biology

pjaureguionieva@plos.org

------------------------------------

REVIEWS:

Reviewer #1: CRISPR.

Reviewer #2: Phages-host interactions.

Reviewer #3: Phages and CRISPR.

Reviewer #1: CRISPR-Cas systems use RNA-guided nucleases to interfere against foreign genetic elements, like bacteriophage, in a sequence-specific manner. Because targeting is based on homology between the crRNA and the target sequence, phages frequently escape immunity by evolving mutations in the target site. The impact of these mutations on Cas protein recognition and cleavage depends on their location within the target sequence, PAM and seed mutations confer the greatest resistance to CRISPR interference, while PAM-distal mutations usually have minimal effects. Here the Sashital group discovered that mismatches in all areas of the spacer/target restore the ability of phages to overcome interference by the nuclease Cas12a, even if they do not affect target DNA cleavage in vivo or in vitro. The authors show that these mismatches promote the rise of phage mutants containing additional mismatches in the target region, which more completely evade CRISPR immunity. This is a very thorough paper, with nicely presented data that fully supports the conclusions. I commend the authors for their nice work, and support publication with very minor editorial revisions regarding the points below.

The data in Fig 2C show that the additional phage escape mutations that arise in the context of pre-existing crRNA:target mismatches correlate nicely with the site of the pre-existing mismatch. E.g. seed mismatches promote the accumulation of additional seed and PAM mutations, while PAM-distal mismatches accumulate additional PAM-distal mismatches. To me the observation that the observed escape mutations cluster near the prior mismatch is fascinating, and warrants a bit of additional discussion:

1. Does it not suggest that the additional escape mutations are not pre-existing in the phage population and might be instead introduced during repair after Cas12 targeting? Since PAM and seed mutations are strongly advantageous for the phage, and are sufficient to evade targeting on their own, I would expect these to accrue in all circumstances regardless of the location of the previous mismatch, but this is not what is observed. To me the data support a model in which these escape mutations are actively being introduced.

2. Cas12 is known to cleave ssDNA nonspecifically in response to target binding. Could collateral cleavage at the exposed ssDNA at the mismatch bubble be responsible for the accumulation of escape mutations?

In the mutation measurements by deep sequencing in Fig 2, it would be interesting to see the number of mutations in the rest of the phage genome (outside the target region). Could the authors present this data if they have it?

Line 96: Could the authors specify whether Cas12a was expressed from a plasmid or in the E. coli chromosome, and which promoter was used?

Line 115: Correlation between cleavage of mismatched target DNA and efficacy in EOP assays - There seems to be a discrepancy between the EOP assay and cleavage rate in the gene J target for some targets. In particular, the MM8 target is cleaved as well as P in vitro (and better than MM19), but in vivo MM8 is severely impaired in targeting and MM19 is just as susceptible as P.

Line 351: "perfect crRNA"

Reviewer #2: Schelling et al investigate the impacts of various spacer mismatches on the efficiency of phage targeting by a Type V-A CRISPR-Cas system and the corresponding phage mutations that arise in escapers at the targeted locus. The researchers use Cas12a from Francisella novicida (FnCas12a) expressed in E. coli and phage lambda as a model host-virus system. They demonstrate that mismatches between two representative spacers and their corresponding protospacers in phage lambda cause defects in immunity in a position-specific manner, with seed region mismatches being most detrimental to CRISPR immunity (Fig. 1). The authors go on to characterize phages remaining in culture after Cas12a challenge and show that the position of protospacer mutations in these 'escapers' differ depending upon the position of the mismatched nucleotide (Fig. 2), and that multiple PAM distal mutations can cause loss of immunity by Cas12a (Fig. 3). The multiplicity of infection (MOI) impacts the diversity of phage escaper mutations in the presence of a single mismatch, and combined mismatches are necessary to promote phage escape (Fig. 4). However, two different crRNAs containing mismatches are more effective against phage than individual mismatched crRNAs (Fig. 5). The authors go on to test the impact of protospacer mutations on the diversity of subsequent mutations by isolating individual phage escapers and challenging those with the WT crRNA (Fig. 6). The authors observed that a single mutation naturally-acquired in the seed region is sufficient to afford full protection, while PAM and PAM-distal mutations require the acquisition of additional mutations to allow for phage escape (Fig. 6). Finally, the authors examine the pre-existing mutations in the phage population and found deletions in non-essential genomic loci which get selected upon Cas12a targeting (Fig. 7).

Overall, the manuscript is well-written and reports on interesting observations relating to the evolutionary dynamics of a phage population when faced with 'imperfect' targeting by Cas12a. The effects of Cas12a on phage evolution is understudied; therefore, these findings are expected to advance our understanding of the biology and impacts of Type V CRISPR-Cas systems. While some of the findings are not surprising (such as the disproportional impact of seed region mutations on Cas12a function, and the presence of pre-existing deletions in nonessential genomic loci in the phage population), there are some compelling observations relating to the compounded inhibitory effect of multiple PAM-distal mutations on Cas12a function, and the diverse mutations that can arise in the presence of mismatches which vary according to the position of the mismatch. However, one major weakness in the study is the absence of definitive evidence explaining how crRNA mismatch position drives mutations in various locations in the protospacer. This and other specific points are listed below in order of importance.

1. Figure 7: The conclusion that the SNPs in escaper phages following Cas12a 'imperfect' targeting are pre-existing in the population (lines 403-406) is not convincing. The entire paper is focused on the acquisition of SNPs in the essential genes L and J, but when the authors look for pre-existing mutations in the phage population, they only look for deletions in non-essential loci. Is it possible that their origins differ? Recent studies have shown that DNA cleavage by Types I and II CRISPR-Cas systems drive mutagenesis in Lambda phage, and this is mainly driven by the phage-encoded repair machinery (Hossain et al, 2021). The conclusion would be more convincing if the authors repeated one of the targeting/mismatch assays in using a mutant phage variant lacking repair machinery and show that the frequency and pattern of mutations in escaper phages remains unchanged.

2. Figure 3C: The authors characterize Cas9 phage escapers in the presence of mismatched sgRNAs and were unable to identify PAM-distal mutations using Sanger sequencing. From this observation, the authors conclude that Cas12a may be unique in its susceptibility to PAM-distal mutations (lines 232-237). However, Sanger sequencing is far less sensitive than Illimuna (which was used for the Cas12a escaper characterization). Is it possible that PAM-distal mutants are present following Cas9 targeting, but rare? Can the authors follow-up with Illumina sequencing of Cas9 escapers to show a definitive difference between the two CRISPR types?

3. Figure 1: Early in the manuscript, the authors should provide more information about genes K and J. What do they encode? Are they essential for phage survival? The latter is discussed somewhere in the middle of the manuscript, but this information should be introduced along with the assay at the beginning since this greatly affects the types of mutations that can be acquired. Also, why were these specific genes chosen as representatives? Do the authors think that the positional effects of mismatches are generalizable for other spacer sequences when targeting essential genes?

4. Figure 2C and D: It is unclear whether the data represents one phage with several point mutations, or many phages, each carrying a smaller subset of the mutations. Can the authors provide more clarification to help guide the reader in the interpretation?

5. Figure 5 A, B: The perfect match crRNA control is missing.

Reviewer #3: Experiments described in this manuscript investigate the effects of CRISPR RNA (crRNA) mismatches on the ability of phages to escape targeting by the Cas12a CRISPR-Cas system of Francisella novicida.

The experimental set-up used here involves heterologous expression of Cas12a and crRNA in E. coli. The crRNA molecules expressed were designed to target phage lambda at two different positions. Experiments involved designing spacers with mismatches at several different positions across the spacer region. Many interesting observations arise from these experiments. In particular, the authors find that even though a mismatch has no effect on the ability of the Cas12 complex to cleave DNA in vitro, phages escaping CRISPR immunity arise much more readily when any mismatch is present. Escaper mutations in phages tend to be close to the site of the mismatch in the crRNA. Interestingly, a similar system testing Cas9 gave a very different result.

The experiments performed here are done rigorously and are clearly described. The results are definitely interesting and warrant publication. My problem with this manuscript with respect to publication in PLoS Biology is that I'm uncertain of the big picture conclusions emerging from this work. While the results are interesting and somewhat unexpected in some instances, it is difficult to draw mechanistic insight from these experiments. Since Cas9 behaved differently, I wonder what the difference is between the Cas12 and Cas9 mechanisms that could cause this. Is it possible that mismatches could affect the Cas12 non-specific nuclease activity that is induced upon target-binding? Why did the authors focus on Cas12? The authors should attempt to draw some more general conclusions from this work if it is possible.

Some minor comments:

Line 107-Authors should briefly discuss the effects of the point mutations in vitro. Are these the expected results. A reader who is unfamiliar with the system may be expecting larger effects.

Line 169-Authors need to refer to a figure here.

Figure 4 could be improved by providing schematics of the experiments. These experiments were somewhat complicated.

Decision Letter 2

Paula Jauregui, PhD

22 Feb 2023

Dear Dr. Sashital,

Thank you for your patience while we considered your revised manuscript "CRISPR-Cas effector specificity and cleavage site determine phage escape outcomes" for publication as a Research Article at PLOS Biology. This revised version of your manuscript has been evaluated by the PLOS Biology editors, the Academic Editor, and the original reviewers.

Based on the reviews, we are likely to accept this manuscript for publication, provided you address the following data and other policy-related requests.

1. DATA POLICY:

You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Note that we do not require all raw data. Rather, we ask that all individual quantitative observations that underlie the data summarized in the figures and results of your paper be made available in one of the following forms:

A) Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore).

B) Deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication.

Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in the following figure panels as they are essential for readers to assess your analysis and to reproduce it: Figures 1CD, 2BC, 3BDF, 4ABC, 5B, 6BC, 7B, S1C, and Supplementary Figures S2ABC, S3B, S4, S6AB, S7C, S8C, S9C, S10B, S11A.

NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

Please also ensure that figure legends in your manuscript include information on where the underlying data can be found, and ensure your supplemental data file/s has a legend.

Please ensure that your Data Statement in the submission system accurately describes where your data can be found.

2. BLOT AND GEL REPORTING REQUIREMENTS:

We require the original, uncropped and minimally adjusted images supporting all blot and gel results reported in an article's figures or Supporting Information files. We will require these files before a manuscript can be accepted so please prepare and upload them now. Please provide this for Supplementary Figures S5A, S7B, S8B, S9B.

Please carefully read our guidelines for how to prepare and upload this data: https://journals.plos.org/plosbiology/s/figures#loc-blot-and-gel-reporting-requirements

As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript.

We expect to receive your revised manuscript within two weeks.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

- a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list

- a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable)

- a track-changes file indicating any changes that you have made to the manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*Press*

Should you, your institution's press office or the journal office choose to press release your paper, please ensure you have opted out of Early Article Posting on the submission form. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please do not hesitate to contact me should you have any questions.

Sincerely,

Paula

---

Paula Jauregui, PhD,

Senior Editor,

pjaureguionieva@plos.org,

PLOS Biology

------------------------------------------------------------------

Reviewer remarks:

Reviewer #1: The additional experiments and textual changes made by the authors have substantially improved the study, I have no further queries and support publication.

Reviewer #2: The authors have done an excellent job addressing the reviewers' concerns through added experiments and discussion to better support their claims. I see no reason to further delay publication of this manuscript.

Reviewer #3: The authors have done a thorough job of addressing my concerns and those of the other reviewers. I believe that the paper is now suitable for publishing in PLoS Biology.

Decision Letter 3

Paula Jauregui, PhD

6 Mar 2023

Dear Dr Sashital,

Thank you for the submission of your revised Research Article "CRISPR-Cas effector specificity and cleavage site determine phage escape outcomes" for publication in PLOS Biology. On behalf of my colleagues and the Academic Editor, Jeremy Barr, I am pleased to say that we can in principle accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximise its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Sincerely, 

Paula

---

Paula Jauregui, PhD,

Senior Editor

PLOS Biology

pjaureguionieva@plos.org

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Cleavage assays by FnCas12a with single mismatch crRNAs.

    (A) Sequence of the target DNAs, perfectly matching crRNAs and single-mismatched crRNAs. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) using perfectly matching crRNA or single-mismatched crRNA by FnCas12a, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. (C) Quantification of cleaved products (linear and nicked fractions) from the time course cleavage. Averages of the cleaved fraction values were plotted versus time and fit to a first-order rate equation with error bars; n = 3 replicates. For values reported in Fig 1C, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for data quantification.

    (EPS)

    S2 Fig. Analysis of phage mutations that emerge following exposure to Cas12a-mediated interference with mismatched crRNAs.

    (A) Line graphs showing the nucleotide diversity of phage target regions over time after exposure to bacteria cells expressing crRNAs with and without mismatches. Target regions are gene J or gene L and crRNAs either match the target region (perfect) or contain mismatches at position x. Nucleotide diversity is calculated using the proportion of each sequence in the sample and the number of nucleotide differences between each pair of sequences. Two individual replicates are shown for each condition. See S3 Data for quantified data. (B) Heat maps showing location of target mutations that arose due to CRISPR targeting by FnCas12a on a solid medium. Z-scores for abundance of single-nucleotide variants, including nucleotide identity changes or deletions, were determined for each sample relative to the non-targeted control phage population. Enriched sequences indicate high Z-scores. Z-scores range from 0 (white) to 7.7 (darkest red). Single-nucleotide deletions are shown at adjacent position to the 3′ side. Positions with crRNA mismatches are labeled with solid black boxes in the heat map. See S4 Data for variant abundance quantification. (C) Graphs showing single-nucleotide variations for mutated phage target sequences present at the 8 h time point for 2 individual replicates. Bar graph height shows the proportion of sequences in each sample with the mutation type at each position in the target. Deletions (Δ) are plotted at the first position where a mismatch occurs between the crRNA and the target. See S3 Data for variant abundance quantification.

    (EPS)

    S3 Fig. MiSeq sample counts and R1/R2 file overlap.

    (A) Table showing absolute counts from MiSeq for each replicate of the 8 h time point for each experimental condition for E. coli infected with wild-type phage. Each count represents an extracted sequence in which R1 and R2 reads matched. The negative control (non-targeting crRNA) samples were run in a separate MiSeq run to maximize the number of reads and minimize barcode overlap with mutated samples, allowing for analysis of preexisting mutants in the wild-type population. (B) Bar charts showing mutated sequences at each position in the high-throughput sequencing reads of the negative control lambda phage population for the gene J and gene L region. The target region is highlighted with a red box. R1 and R2 reads do not overlap in the target region (no overlap) or overlap in the target region (target overlap). R1 reads are used for the target region in the no overlap condition. When R1 and R2 reads overlap, sequences in which the target region sequence does not agree for both the R1 and R2 reads are removed from analysis and are not shown in this figure. This measure was taken to ensure that variations observed in negative control samples arose solely from PCR errors or the natural variation of the population. While some variations were still observed with stringent R1/R2 overlap enforced, it is not possible to distinguish PCR errors from natural variation. No mutations were substantially enriched outside of the target region for any of the samples tested in this study. See S3 Data for R1 and R2 variant read counts.

    (EPS)

    S4 Fig. Single deletions enriched by CRISPR exposure.

    Bar charts showing single-nucleotide deletions from the lambda phage gene J target (A) and gene L target (B) in phage that were exposed to cells expressing a non-targeting crRNA (CRISPR inactive) and cells expressing crRNAs with and without mismatches to the lambda phage gene J target (C). (A, B) Deletions are mapped along the target sequences for all time points and both biological replicates for the negative control samples. (C) CRISPR active samples shown for gene J target that contained deletion sequences that represented more than 1% of all reads. No such deletions were observed in the gene L target. Deletions were observed in the gene L coding region in phage in the wild-type population. The deletions could remain in genomes in the population as these genomes are packaged along with functional structural proteins in successfully infected cells. See S3 Data for quantification of single-nucleotide deletion abundance.

    (EPS)

    S5 Fig. Deletions in non-essential genomic regions that are selected following Cas12a targeting preexist in the phage population.

    (A) PCR amplification of regions surrounding essential and non-essential genes targeted by Cas12a. Spot assays were performed using E. coli expressing crRNAs that match 2 non-essential (nin204 and nin146) and 2 essential regions (gene J and gene L) of the lambda phage genome. Controls were performed with a plasmid not encoding a crRNA. Phages were isolated from the phage spots and target and flanking regions of the phage genome were PCR amplified and run on an agarose gel. (B) Sanger sequencing chromatograms of phage genome deletions in non-essential regions targeted by Cas12a. Non-essential regions in the lambda phage genome were targeted with matching crRNAs on solid media. Phages were isolated and the target regions were PCR amplified. Single bands were gel purified and PCR amplified in a second round. These second PCR products were sequenced and the chromatograms were aligned to the WT lambda phage genome. Homology at each end of the deletions was identified and highlighted in blue. (C) Map of genomic deletions observed by PacBio sequencing of PCR amplicons from phage unexposed to CRISPR targeting. DNA from lambda phage unexposed to CRISPR targeting was used as a template for PCR reactions that amplified the same non-essential regions as in (A). This PCR product was sequenced with PacBio long-read sequencing and the obtained sequences were matched with the wild-type lambda genome sequence to identify any deletions present. These deletions are plotted on the chart relative to their position in the genome. The quantity of each deletion is identified by a color code. Deletions found by Sanger sequencing in CRISPR active samples are indicated with an asterisk (*). See S5 Data for sequences and quantification.

    (EPS)

    S6 Fig. Growth curves and mutant emergence for Δred phage infection.

    (A) Growth curves of E. coli expressing FnCas12a and crRNAs bearing non-targeting, perfectly matching, or mismatched crRNAs infected with Δred phage. Phage was added when the cells reached mid log phase at approximately 2 h after inoculation. The average of 3 replicates is plotted for each condition, with error bars representing standard deviation. Large error bars indicate that not all replicate cultures lysed. See S2 Data for quantified data. (B) Graphs showing mutation type in Δred phage target sequences present at the 8 h time point. Bar graph height shows the proportion of sequences in each sample with the mutation type at each position in the target. Deletions (Δ) and multiple mutations are plotted at the first position where a mismatch occurs between the crRNA and the target. The average of 3 replicates is shown, with error bars representing standard deviation. See S3 Data for variant abundance quantification. (C) Comparison of target sequences of phage isolated from cultures in (A) containing cells expressing a crRNA targeting gene J with a mismatch at position 8. The WT target sequence is underlined. Three individual replicates are shown and the percent of each mutant sequence in the sample is listed. Mutated positions relative to the WT sequence are highlighted in orange.

    (EPS)

    S7 Fig. Cleavage assays by FnCas12a with double mismatch crRNAs.

    (A) Sequence of the gene J target DNA, perfectly matching crRNA and double-mismatched crRNAs. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) using perfectly matching crRNA or double-mismatched crRNA by FnCas12a, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. The gel for the perfect crRNA is reproduced from S1B Fig. (C) Quantification of cleaved products from the time course cleavage. The average cleaved fraction was plotted versus time and fit to a first-order rate equation with error bars representing standard deviation; n = 3 replicates. For values reported in Fig 3B, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for quantification. Quantification for the perfect crRNA is also shown in S1C Fig.

    (EPS)

    S8 Fig. Cleavage assays by FnCas12a of wild-type and mutant target sequences.

    (A) Sequences the perfectly matching crRNA, position 15 mismatched crRNA, and 3 gene L target sequences used for cleavage assays. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) bearing a wild-type (WT), A2T, or G17T-containing gene L sequence using perfectly matching crRNA or position 15 mismatched crRNA by FnCas12a, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. The gels for the perfect crRNA and 15 mismatched crRNA cleaving WT target are reproduced from S1B Fig. (C) Quantification of cleaved products from the time course cleavage. The average cleaved fraction was plotted versus time and fit to a first-order rate equation with error bars representing standard deviation; n = 3 replicates. For values reported in Fig 3D, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for quantification. Quantifications for perfect and MM15 crRNA cleaving WT target are also shown in S1C Fig.

    (EPS)

    S9 Fig. Cleavage assays by SpCas9 of wild-type and mutant target sequences.

    (A) Sequences the perfectly matching crRNA, position 15 mismatched crRNA, and 3 gene L target sequences used for cleavage assays. Cleavage was performed using a crRNA-tracrRNA pair. (B) Representative agarose gels showing time course cleavage of negatively supercoiled plasmid (nSC) bearing a wild-type (WT), G2T, or A17T-containing gene L sequence using perfectly matching crRNA or position 15 mismatched crRNA by SpCas9, resulting in linear (li) and/or nicked (n) products. Time points at which the samples were collected were 7 s, 15 s, 30 s, 1 min, 2 min, 5 min, 15 min, and 30 min. All controls were performed under the same conditions as the longest time point for the experimental samples. (C) Quantification of cleaved products from the time course cleavage. The average cleaved fraction was plotted versus time and fit to a first-order rate equation with error bars representing standard deviation; n = 3 replicates. For values reported in Fig 4A, each individual replicate was fit, and kobs was reported as the average value for the 3 replicates. See S1 Data for quantification.

    (EPS)

    S10 Fig. Phage protection by and mutant emergence from SpCas9 with sgRNA mismatches.

    (A) Spot assays using lambda phage on lawns of bacteria expressing SpCas9 along with sgRNAs with and without mismatches. SgRNAs target gene J or gene L and contain mismatches at position X or match the target (perfect). (B) Graphs showing single-nucleotide variants in phage target sequences present at the 8 h time point following challenge by Cas9 bearing different sgRNAs. Bar graph height shows the proportion of sequences in each sample with the mutation type at each position in the target. Deletions (Δ) are plotted at the first position where a mismatch occurs between the crRNA and the target. The average of 3 replicates is shown, with error bars representing standard deviation. See S7 Data for variant abundance quantification.

    (EPS)

    S11 Fig. Mutant emergence at varied MOIs.

    (A) Growth curves using cells expressing a crRNA targeting gene J with a mismatch in the seed region and infected with phage at different MOIs. Phage was added at the indicated MOIs when cells reached mid log phase and the OD600 of the culture was measured over time. Cultures at lower MOIs did not lyse and are omitted from the graph. See S2 Data for quantified data. (B) Spot assays estimating the titer of phage lysates exposed to interference by Cas12a armed with a seed mismatched crRNA. Full plates from Fig 4B, including lowest MOI samples which produced phages with low titers.

    (EPS)

    S12 Fig. Phage targeted by multiple spacers develops mutations in 1 or more targeted regions.

    (A) Spot assays performed using lambda phage that previously infected E. coli in liquid culture expressing FnCas12a and 2 different crRNAs targeting gene J and L (lysate spacers J+L) or 2 of the same crRNA targeting gene L (lysate spacers L+L) both with mismatches in the seed region. Phage was harvested at different time points of the liquid culture (0, 1, 2, 3, and 5 h after infection). The previous phage lysates were spotted on cells expressing a crRNA that matches the gene L target in the lambda genome (gene L perfect). (B) Sanger sequencing chromatograms showing sequences of target regions of the genome in phage exposed to cells expressing 2 mismatched crRNAs targeting gene J and gene L, respectively. Phage from single plaques was isolated and both target regions were sequenced. Mutated bases are highlighted.

    (EPS)

    S13 Fig. Purified single-mutant and double-mutant chromatograms.

    (A–H) Sanger sequencing chromatograms of single- and double-mutant phage lysates. Single-mutant phages were generated by exposure to crRNAs with mismatches (MM crRNA) at different positions (position X) and purified as shown in Fig 7A. Mutants were generated in the gene J and gene L CRISPR target. Purified single-mutant phage was used to challenge bacteria expressing a perfect crRNA and target regions were sequenced by Sanger sequencing to determine if second mutations appeared. Both mixed and clonal double-mutant populations were generated after this step. Mutated bases are highlighted.

    (EPS)

    S1 Table. Lists of plasmids, primers, and oligonucleotides used in this study.

    (XLSX)

    S1 Data. Quantification of percent cleaved over time for cleavage assays.

    Contains data presented in Figs 2C, 3B, 3D, 4A, S1C, S7C, S8C, and S9C.

    (XLSX)

    S2 Data. Quantification of bacterial liquid growth curves and plaque assays.

    Contains data presented in Figs 1D, 4B, 6B, 6C, 7B, S6A, and S11A.

    (XLSX)

    S3 Data. Quantification of high-throughput sequencing data for gene J and L targets in wild-type and Δred phage upon Cas12a challenge with perfect and mismatched crRNAs in liquid media.

    Contains data presented in Figs 2B, 2C, S2A, S2C, S3, S4, and S6B.

    (XLSX)

    S4 Data. Quantification of high-throughput sequencing data for gene J and L targets upon Cas12a challenge with perfect and mismatched crRNAs on solid media.

    Contains data presented in S2B Fig.

    (XLSX)

    S5 Data. Quantification of PacBio sequencing of nin146 and nin204 genomic regions following Cas12a challenge.

    Contains data presented in S5C Fig.

    (XLSX)

    S6 Data. Quantification of phage competition between phages containing mutations in the seed or PAM-distal region of the gene L target.

    Contains data presented in Fig 3F.

    (XLSX)

    S7 Data. Quantification of high-throughput sequencing data for gene J and L targets upon Cas9 challenge with perfect and mismatched crRNAs in liquid media.

    Contains data presented in Figs 4C and S10B.

    (XLSX)

    S8 Data. Quantification of high-throughput sequencing data for gene J target upon phage infection at different MOIs.

    Contains data presented in Fig 5B.

    (XLSX)

    S1 Raw Images. PDF file containing all raw gel images.

    Original gel images for all images presented in the Supporting information figures or used for triplicate quantification of Cas12a or Cas9 cleavage rates. All gels were visualized using SyberSafe staining. When relevant, annotations list which samples were used for gel images in S1B, S5A, S7B, S8B or S9B Figs. Samples that were not included in the Supporting information figures contain replicates that were used for quantification of observed rate constants. All samples are ordered as labeled in the relevant Supporting information figure image.

    (PDF)

    Attachment

    Submitted filename: ReviewerResponse_submit.pdf

    Data Availability Statement

    All relevant data are within the paper and its Supporting Information files.” Scripts used to analyze high-throughput sequencing data are available at https://github.com/alexsq2/lambda-phage-CRISPR-mutants.


    Articles from PLOS Biology are provided here courtesy of PLOS

    RESOURCES