Abstract
CRISPR–Cas proteins are RNA-guided nucleases used to introduce double-stranded breaks (DSBs) at targeted genomic loci. DSBs are repaired by endogenous cellular pathways such as non-homologous end joining (NHEJ) and homology-directed repair (HDR). Providing an exogenous DNA template during repair allows for the intentional, precise incorporation of a desired mutation via the HDR pathway. However, rates of repair by HDR are often slow compared to the more rapid but less accurate NHEJ-mediated repair. Here, we describe comprehensive design considerations and optimized methods for highly efficient HDR using single-stranded oligodeoxynucleotide (ssODN) donor templates for several CRISPR–Cas systems including S.p. Cas9, S.p. Cas9 D10A nickase, and A.s. Cas12a delivered as ribonucleoprotein (RNP) complexes. Features relating to guide RNA selection, donor strand preference, and incorporation of blocking mutations in the donor template to prevent re-cleavage were investigated and were implemented in a novel online tool for HDR donor template design. These findings allow for high frequencies of precise repair utilizing HDR in multiple mammalian cell lines. Tool availability: https://www.idtdna.com/HDR
Subject terms: Homologous recombination, Non-homologous-end joining
Introduction
CRISPR–Cas systems have revolutionized genomics by enabling efficient and precise genome editing in a wide variety of biological systems, including eukaryotic cells1–5. Type II CRISPR–Cas systems require an RNA-guided DNA endonuclease and a target-specific guide RNA (gRNA) to generate a double-stranded break (DSB) at a desired genomic location, which must be flanked by a short protospacer adjacent motif (PAM). Streptococcus pyogenes Cas9 (S.p. Cas9) is one of the most commonly used CRISPR enzymes for genome editing. The native gRNA for Cas9 is hybridized from two RNA molecules: a CRISPR RNA (crRNA) and a universal, trans-activating crRNA (tracrRNA)6. The two components of the native gRNA can also be combined as a single, unimolecular structure to form a single-guide RNA (sgRNA)7. Association of Cas9 protein with a gRNA forms a ribonucleoprotein (RNP) complex, which surveys a dsDNA substrate and generates a DSB when its complementary target sequence with a PAM sequence 3’ of that target is recognized by an active Cas9 RNP complex8–10.
Recent reports have demonstrated that RNP delivery of nucleases has benefits over plasmid delivery as it enables a faster onset of action, reduces off-target cleavage, and eliminates the risk of random plasmid integration into the host genome9, 11, 12. At the same time, RNP delivery allows for the use of chemically modified gRNA with improved stability and reduced toxicity13. In addition, generating RNP complexes in vitro prior to delivery allows accurate control of the ratio of protein and gRNA to maximize RNP complexation efficiency. This also enables the formation of each gRNA:protein complex independently which mitigates the competition for Cas9 protein by other intracellular RNA molecules or by different gRNAs during multiplexing experiments14.
S.p. Cas9 contains two endonuclease domains (HNH and RuvC) that function together to generate a blunt DSB by each domain cleaving opposite DNA strands. Inactivating one of the two endonuclease domains results in Cas9 variants called “nickases”: the RuvC-inactive variant (Cas9 D10A) nicks the target (gRNA complementary) strand, while the HNH-inactive variant (Cas9 H840A) nicks the non-target (gRNA non-complementary) strand7. Cas9 nickases can be used with an individual guide to induce single DNA nicks and induce a repair pathway termed alternative-HDR15, 16. However, it is more common and often more efficient to perform genome editing at DSBs generated by using a nickase with a pair of gRNAs targeting opposite DNA strands in a “paired nicking strategy”17. It has been demonstrated that nickases allow for the reduction of off-target editing by ~ 50 to 1500 fold in comparison to Cas9 WT17–19. At the same time, the paired nicking strategy can facilitate highly robust editing in many model systems, including mammalian tissue culture, mouse zygotes, plants, yeast, and bacteria1, 17–26.
Cas12a enzymes are also RNA-guided double-stranded DNA nucleases (Type V CRISPR–Cas systems) that provide an alternative to the commonly used S.p. Cas9 nuclease with similar editing outcomes. Unlike S.p. Cas9, which recognizes an 5’-NGG-3’ PAM sequence, A.s. Cas12a recognizes a 5’-TTTV-3’ (V = A/G/C) PAM site which allows for a broadened range of targeting sites in AT-rich regions. Cas12a relies on a single, short (41–44 nt) gRNA and generates staggered DSBs with 5’ overhangs4. In addition, Cas12a has been shown to be advantageous in genome editing applications due to intrinsically high specificity which the potential for off-target cleavage27, 28. However, Cas12a also has non-specific single-stranded DNase (ssDNase) activity that is activated upon binding to the target DNA strand29. This could potentially impact the ability of Cas12a to mediate efficient HDR if the ssODN is degraded before it is able to act as a donor template.
To facilitate genome editing, CRISPR–Cas enzymes are used to generate a DSB at a genomic locus which can then be repaired by a variety of endogenous cellular repair pathways including non-homologous end joining (NHEJ) and homology-directed repair (HDR)30, 31. NHEJ is an imperfect process and commonly creates small insertions or deletions (indels), which can be exploited to introduce diverse, but reproducible genetic mutations or gene knockouts32. On the other hand, HDR is a process that can lead to precise sequence alterations at specified genomic locations but requires the use of a carefully designed HDR donor template that contains sequences homologous to the specific sequence flanking the cut site, defined as ‘homology arms’. However, the time course of repair by HDR is slow compared to the more rapid but less accurate NHEJ-mediated repair33. For small mutations or insertions, an ssODN can be used as the HDR donor template34–37. These substrates can be readily synthesized up to 200 nucleotides (nt) in length on solid-support oligonucleotide synthesis platforms, allowing for insertions up to 160 nt while preserving, at minimum, 20-nt homology arms at both the 5’- and 3’-ends of the ssODN.
Two distinct pathways for the incorporation of single-stranded donor templates at a DSB have been proposed—single-strand DNA incorporation (ssDI) and synthesis-dependent strand annealing (SDSA), with SDSA being preferentially utilized as the repair path for ssODN donor templates in the presence of a DSB38. Previous studies have examined design considerations for ssODNs when using CRISPR–Cas enzymes. The optimal length of homology arms has been reported to be as little as 30-nt in length on either side of the DSB, and it has been demonstrated that asymmetric donor oligos can improve HDR39, 40. HDR efficiency is highest when the intended edit is placed near the DSB and is greatly reduced at loci distal to this event35, 41. In addition, reports have indicated that there may be a preference for utilizing a donor oligo with sequences either complementary or non-complementary to the gRNA37, 42–44. CRISPR–Cas9 can also re-cut dsDNA after a desired repair outcome if the protospacer and PAM sequence remains unaltered, lowering perfect HDR efficiency. This outcome can be prevented by strategically incorporating blocking mutations into the donor template35, 45.
Other approaches to improving HDR are of great interest to the genome editing community. It has been previously reported that incorporating chemical modifications such as phosphorothioate (PS) linkages may improve HDR when using ssODN donors39, 46. Another route to improving HDR frequency is using chemical compounds that inhibit key DSB repair enzymes that play a role in the competing NHEJ pathway. Several chemical compounds have been reported to increase HDR47–49.
In this work, we thoroughly investigated design features for both S.p. Cas9 and A.s. Cas12a nucleases relating to gRNA selection, donor strand preference, the placement and composition of blocking mutations, and the number of blocking mutations that are required for maximum HDR efficiency. Altogether, this study presents a set of design considerations which can be applied to CRISPR editing experiments to maximize HDR efficiency and reduce time spent generating desired mutants. Our findings constitute an empirically defined ruleset for S.p. Cas9 and S.p. Cas9 D10A nickase which have been built into a novel bioinformatic tool for HDR donor template design. Further, we provide design recommendations for A.s. Cas12a nuclease, which has not yet been systematically studied in the same manner as Cas9.
Results
Cas9 donor strand preference and gRNA selection
The selection criteria for an ssODN donor template begins with identifying the targeting strand (T strand; ssODN that is complementary to the CRISPR–Cas9 gRNA) or the non-targeting strand (NT strand; ssODN that is non-complementary to the gRNA and contains the ‘NGG’ PAM sequence) as driving the most efficient repair via HDR. While some studies have suggested that there is a strand preference for the ssODN donor template, results have varied and no universal strand preference has been identified37, 42–44. To expand on the question of strand preference with WT Cas9 nuclease, particularly when delivered as an ribonucleoprotein (RNP) complex, HDR efficiency was tested at 254 genomic loci in Jurkat cells and 239 genomic loci in HAP1 cells. Donor ssODNs containing 40-nt homology arms were designed to insert a six base EcoRI restriction digest recognition site (‘GAATTC’) at the Cas9 cleavage site which canonically lies three bases in the 5’ direction of the PAM, as illustrated in Fig. 1A. ssODNs were delivered to Jurkat and HAP1 cells along with their respective gRNAs as Cas9 RNP complexes by nucleofection, and the editing frequencies were assessed by next generation sequencing (NGS). Perfect HDR, defined as the precise insertion of the EcoRI sequence at the canonical cut site and otherwise maintaining the WT sequence, was quantified, and comparisons were made between donor templates consisting of either the T strand, or the NT strand. In Jurkat cells there was no statistical difference (p > 0.05, paired t-test) in total editing when either the T or NT strand was used. However, a significant difference in editing efficiency (p < 0.0001, paired t-test) was observed in HAP1 cells where the mean editing was 80.2% when the NT strand was used and 67.8% when the T strand was used (Supplemental Fig. 1A). Others have indicated this result is driven by the T strand binding to the Cas9 RNP complex and reducing overall editing efficiency within the cellular environment36. As demonstrated in the top two panels of Fig. 1B, the strand that leads to higher frequencies of HDR varies depending on the genomic locus and cell type being used. HAP1 cells had HDR frequencies ranging from 0 to 51.1% with a significantly higher mean HDR frequency across all sites when the NT strand was used (20.6%) than the T strand (15.2%) (p < 0.0001, paired t-test), and this correlates with the difference in total editing efficiency in these two treatment groups noted above. In contrast, we observed significantly higher mean HDR frequencies in Jurkat cells when the T strand was used compared to the NT strand (11.3% vs 7.5%, respectively) (p < 0.0001, paired t-test). In addition, the bottom two panels of Fig. 1B show that although efficient total editing is required for HDR to occur, high editing does not always lead to high HDR-mediated insertion at each site tested. In HAP1 cells, 53% of the sites tested had > 90% total editing while 74% of the sites tested in Jurkat cells had > 90% total editing; yet, there is a broad range of HDR frequencies which varied from 0 to 60% among these highly edited loci for both the NT and T strands. Further studies to investigate the optimal selection of gRNAs which have the highest potential HDR frequency, including related studies to understand cell type-specific differences, are underway.
To investigate the balance between guide cleavage efficiency and distance of the desired HDR mutation to the cut site, we selected 13 guides flanking the stop codon of GAPDH to determine which guide led to the highest HDR insertion frequency of an EcoRI site just upstream of the ‘TAA’ stop codon. These guides had cut sites that ranged from 2 to 22 bases from the desired insertion position (Fig. 1C). The available guides in the nearby region included PAMs on both strands of the genomic DNA, and ssODNs for both the targeting and non-targeting strand were designed and tested in K562 and HEK293 cells for their ability to mediate the same desired HDR event. As shown in Fig. 1D in K562 cells, guides with low editing efficiency yielded low HDR insertion, even if the cut site was close to the desired insertion location. For example, the guide that generates a DSB two bases from the desired insertion (-2) had 32.1% total editing of which 12.6% was HDR insertion (NT strand). Similarly, the guide that directs Cas9 cleavage five bases from the desired insertion (-5) had 10.7% total editing and only 1.1% HDR insertion. In contrast, the guides that direct DSBs 14 and 6 bases from the desired insertion (−14, + 6) had 96.0% total editing of which 40.7% was HDR insertion (NT strand) and 87.8% total editing of which 39.7% was HDR insertion (NT strand), respectively. As determined by NGS, these guides had the highest total editing and HDR insertion rates, even though their cut sites were positioned further from the desired insertion. This case study indicates that guide efficiency is a critical factor for efficient HDR, and guide selection that is as close as possible to the desired HDR mutation is a secondary consideration. This was also observed in HEK293 cells, where a guide that cuts 6 bases from the insertion (+ 6: 97% total editing, 34% HDR) led to higher HDR than guides 2 or 5 bases from the insertion (−5: 34.7% total editing, 12.7% HDR; −2: 62.2% total editing, 22.4% HDR). This effect was less prominent using guides further from the desired insertion site (e.g. −14) in HEK293 cells, which may be due to differences in the available repair machinery and capacity for HDR in each cell type (Supplemental Fig. 1B).
We performed a similar experiment at a second genomic locus (TNPO3) in HEK293 cells to further examine factors influencing HDR (Supplemental Fig. 1C). The total editing was high for nearly all guides tested in this experiment. However, for a guide with a cut site 9 bases from the desired insertion location (−9), the total editing was 92.6%, and this site yielded reduced HDR efficiency of 7.0% compared to the guide that cut one base further from the desired insertion (−10) with 98.5% total editing that also gave an increased HDR insertion frequency of 24.2% with the NT strand, further supporting that guide activity can be more impactful on HDR efficiency than optimal positioning with respect to the cut site.
Cas9 D10A mediates efficient HDR distant from nick sites
In comparison to WT Cas9, which needs only one gRNA to cut both strands of the target DNA, Cas9 D10A and H840A nickases can be used with paired guides to generate a DSB to mediate genome editing. To facilitate a DSB, the guides must target opposite strands of the genomic DNA and can be positioned with their PAM sites oriented towards each other (PAM-in), or apart from each other (PAM-out) (Supplemental Figure S2A). Consistent with other reports utilizing Cas9 nickase variants expressed from a plasmid, we found a higher rate of indel formation when guide RNAs for either of the D10A and H840A nickases were designed in a PAM-out orientation, and the gRNAs must position the nickases with optimal spacing of the nick sites to mediate efficient editing (Supplemental Figure S2B)17, 21. Here, we designed a set of paired guides against the human HPRT1 gene with either PAM-out or PAM-in orientation and target nick sites separated by 18–130 bp. We specifically selected gRNAs that have > 40% editing efficiency (data not shown) when delivered with WT Cas9 as RNP into HEK-293 cells, to rule out the possibility that poor editing by the nickase RNP pair is the result of poor cleavage efficiency mediated by an individual gRNA. The optimal distance between the two nicks was 40–68 nt for Cas9 D10A, and 51–68 nt for Cas9 H840A. For the PAM-out pair with nicks 68-nt apart, the total editing was 86.3% with Cas9 D10A and 77.6% with Cas9 H840A. This was reduced to 29.8% and 2.7%, respectively, when the nicks were 85-nt apart. Similarly, for Cas9 H840A, the editing was 74.7% when the nicks were spaced 51-nt apart, and this was further reduced to 34.6% when the distance between the nicks was decreased to 46-nt. We have investigated distances smaller than 40-nt between the nicks in PAM-out orientation for Cas9 D10A and found that editing was poor for spacing < 35-nt (data not shown).
When Cas9 D10A nickase RNP complexes targeting both strands in the PAM-out orientation nick the genomic DNA, a DSB with 3’ overhangs is generated. Because both strands are targeted by one of the two gRNAs, there is no canonical ‘targeting’ and ‘non-targeting’ strand in nickase experiments; thus, they are referred to as top and bottom strands. The paired-guide, double nicking strategy does not generate a blunt-ended cut like WT Cas9, so we further explored the possibility of using Cas9 D10A to insert exogenous sequences between flanking nick sites at a location that would be otherwise considered sub-optimal for WT Cas9 designs using WT Cas9 complexed with either gRNA on its own. We designed ssODN donor templates for HPRT1 in which a 6-nt EcoRI restriction enzyme recognition site was introduced at different locations along a donor template (Fig. 2A). HDR events mediated by Cas9 D10A in HEK293 cells, as measured by the percentage of EcoRI digestion, ranged from 13 to 25% across the 51-nt region (Fig. 2B, top left panel). In contrast, WT Cas9-mediated HDR decreased dramatically as the intended insertion site moved away from the cleavage site (Fig. 2B, top middle and right panels), consistent with our earlier findings. At the position centered between the two cleavage sites (25-nt from left and 26-nt from the right), Cas9 D10A was able to induce a higher HDR insertion frequency than WT Cas9 with either of the individual gRNAs. As a comparison, neither WT Cas9 nor Cas9 H840A with the same gRNA pair demonstrated HDR insertion frequency as high as Cas9 D10A at all positions tested (Supplementary Figure S2C). In addition, at these sites WT Cas9 demonstrated a strong preference for the NT strand donor template (bottom strand for WT with left gRNA, and top strand for WT with right gRNA), especially at positions distant from the cut site, while Cas9 D10A did not show a strand preference. We performed the same experiment in K562 cells, which demonstrated robust HDR overall with WT Cas9. As expected, despite both Cas9 D10A and WT Cas9 showing a higher frequency of HDR in this cell line, the ability of Cas9 D10A to mediate higher HDR when moving away from cleavage sites was retained (Fig. 2B, bottom panels).
In order to verify that the above observations are not site specific, we conducted a similar experiment at a different locus (AAVS1, PAM-out design with 46-nt spacing). In addition to 5 insert positions at or between the two nick sites, we also included two positions 12-nt upstream or downstream to the left or right nick sites, respectively (Supplementary Figure S2D). Consistent with the results described above, Cas9 D10A outperformed WT Cas9 at the position centered between the cleavage sites (position “D”) in HEK293 cells (Supplementary Figure S2E). Insertions outside of the nick sites were not as efficient as ones placed between the nick sites. The observation that Cas9 D10A has no identifiable strand preference for the HDR donor template also held true at this site. For HDR experiments with Cas9 D10A, the highest editing efficiency occurred when paired gRNAs were in the PAM-out orientation with the nick sites spaced 40–68 nt apart. Overall, we observed that the desired mutation is best achieved when placed between the two nicks, and we did not observe a consistent strand preference. The use of paired gRNAs with Cas9 D10A nickase allows for HDR insertions at locations not accessible by WT Cas9 nuclease due to design limitations and may be advantageous over WT Cas9 in situations where there is a lack of efficient Cas9 guides near the intended HDR mutation.
Optimizing placement and number of blocking mutations with Cas9
Previous studies have demonstrated that incorporating blocking mutations within the donor oligo to prevent re-cleavage by Cas9 nuclease after a desired HDR event improves rates of HDR35, 45. However, these studies have been limited in the number of constructs tested and have not examined if there is a preference for transversions (e.g. G-to-C purine to pyrimidine conversion), or transitions (e.g. G-to-A purine to purine conversion) in the blocking mutations used. We aimed to further investigate this to define a ruleset for the placement and number of blocking mutation(s) required to maximize HDR efficiency. First, we designed an experiment to determine the effect of a single blocking mutation within the PAM or the seed region of Cas9, which is defined as the PAM-proximal 10–12 bases on the 3’ end of the guide7. Mismatches within the seed region and PAM are known to significantly reduce Cas9 binding and cleavage efficiency, so would be expected to confer the highest reduction in re-cleavage by Cas98, 50. Two genomic loci were selected and HDR ssODN donor templates (NT strand) were designed to generate a single base change 3’ of the PAM to serve as the desired HDR mutation which would not impact Cas9 re-cleavage, as it falls outside of the protospacer/PAM sequence. In addition to the desired HDR mutation, a single blocking mutation in the seed region of the guide or PAM was included, where each position tested was changed to every possible alternate base in a unique donor template to determine if any of the four DNA bases are preferred when utilizing blocking mutations (Fig. 3A). HDR ssODN donor templates were delivered along with their respective RNP complexes targeting two different genomic loci into HEK293 and K562 cells, and the rate of perfect HDR including both the desired HDR mutation and blocking mutation (where applicable) was assessed by NGS (Fig. 3B). As expected, the frequency of the desired HDR mutation (‘ctrl’) was low, at < 2% for all four conditions tested in the absence of any blocking mutation. Adding a blocking mutation in the second or third base of the ‘NGG’ PAM resulted in the greatest increase in HDR efficiency, with HDR reaching 8.0–17.8%. Blocking mutations placed around the Cas9 cleavage site and near the 3’ end of the guide were also highly effective, with the impact reduced as the position of the blocking mutation moved PAM-distal. No base was universally preferred over others in these experiments.
Next, we aimed to determine if a single blocking mutation was sufficient to prevent re-cleavage of the genomic DNA and maximize the frequency of HDR events, or if a combination of multiple blocking mutations would lead to an increase in HDR. Donor templates were designed with a single blocking mutation in the PAM, two blocking mutations in the PAM, and/or blocking mutations at two locations within the seed region. Various combinations of these mutations were delivered as ssODNs along with corresponding RNP complexes targeting four loci in Jurkat cells to assess their ability to mediate the same single base change as in the previous experiment, 3’ of the PAM via HDR. An example sequence showing the placement of blocking mutation(s) in the donor templates tested is provided in Supplemental Fig. 3A. As demonstrated by Supplemental Fig. 3B, donor templates containing two blocking mutations led to more robust improvement in HDR efficiency than donor templates containing a single blocking mutation, and this effect was greatest when the blocking mutations were within the PAM or nearer to the 3’ end of the guide. Incorporating three or four blocking mutations did not further enhance HDR efficiency over the best combination of two blocking mutations (2 PAM or 1 PAM + 1 seed).
We next investigated the impact of 2 PAM blocking mutations when a larger HDR mutation is inserted, such as an EcoRI restriction site, as well as examined the impact of blocking mutations when the HDR insertion was placed at various positions relative to the Cas9 cleavage site. To determine if blocking mutations are beneficial with a 6-nt insertion, we selected four gRNAs and designed donor templates to insert an EcoRI restriction digest recognition site at the Cas9 cleavage site. Donor templates included no blocking mutation (no PAM mutation) or a ’GG’ to ‘CC’ blocking mutation within the PAM sequence (PAM mutation). In addition, donor templates to insert the EcoRI sequence at varying locations relative to the Cas9 cleavage were designed; as a result, these donor templates would facilitate the 6-nt insertion as close as 3-nt from the Cas9 cleavage site and extending as far as 45-nt in both the 5’ and 3’ direction. (Supplemental Fig. 3C). All donor templates consisted of the NT-strand and maintained homology arms of 40-nt from both the EcoRI insertion location and the Cas9 cut site. The set of 24–32 ssODNs for the four targets were delivered along with their respective Cas9 RNP complex to Jurkat cells by nucleofection (N = 120 ssODNs tested across 4 sites). Additionally, donor templates and respective Cas9 RNP complexes for two of the targets were also delivered to HEK293 cells (N = 48). An EcoRI cleavage assay was used to determine the HDR frequencies, and results are shown in Supplemental Fig. 3D. Mutating the PAM from an ‘NGG’ to ‘NCC’ increased HDR at locations further from the cut site in the 3’ direction, downstream of the PAM and outside of the protospacer sequence, where the HDR insertion would not prevent Cas9 re-cleavage. When the EcoRI insertion was within the guide seed region or PAM, incorporating additional PAM mutations negatively impacted the HDR efficiency. This suggests that there is a limit to the number of additional mutations that should be added to prevent Cas9 re-cleavage, and if too many mutations are present, the HDR efficiency can be negatively affected.
This data represents a subset of HDR donor designs that we tested to fully elucidate a ruleset for the placement and number of blocking mutations required for various HDR mutation types. Using HDR efficiency results from Fig. 3B, we generated relative HDR efficiencies (i.e., HDR efficiency with varying blocking mutations divided by HDR efficiency without blocking mutations) and a position specific scoring matrix (PSSM) for each potential blocking mutation. The PSSM then represents the relative HDR improvement gained by mutating the HDR donor template at each position along the length of the Cas9 spacer sequence. Using the linear combination PSSM values representing blocking mutations in each HDR donor template, we calculated a blocking score for each of 374 donor template designs associated with 9 guides and delivered into three cell lines (HEK293, Jurkat, and Hepa1-6). We generated a model representing a non-linear correlation between blocking scores and HDR efficiency (Fig. 3C). A score of 1.97 approximately corresponds to mutating both G nucleotides in the Cas9 PAM. The model predicts blocking mutations with scores < 1.97 will have a positive impact on HDR rates; while blocking mutations with scores greater than 1.97 have a less certain, and perhaps detrimental, impact.
We embedded the PSSM, blocking score model, a guide-to-target mutation model, and other heuristics in a novel design tool for HDR donor templates. The combination of models allows the tool to recommend high quality paired HDR donor templates and guides. In addition, the design tool uses blocking scores to select blocking mutations that do not change the protein coding sequence (when transcript information is provided). We tested the design tool’s donor template recommendations using four unique target HDR mutations with or without the addition of silent blocking mutations (Fig. 3D), and we delivered the donor templates to HEK293, Hela and Jurkat cells. In every case except one in Jurkat cells, where the HDR rate was unchanged, the donor template designed with the addition of silent mutations yielded higher HDR events than donor template designs without blocking silent mutations (Fig. 3E).
HDR mutation location determines donor strand preference
Achieving efficient HDR at greater distances from the cut site using WT Cas9 to broaden the capabilities of CRISPR genome editing is a desirable feature. We aimed to investigate design considerations for HDR mutations that fall outside of the optimal editing window to determine if there is a donor strand preference. Additionally, we tested the benefit of incorporating more mutations along the ssODN repair track, which is defined as the portion of the donor template between the Cas9 cut site and mutation location. We designed ssODN donor templates to create an EcoRI insert 25-nt from the Cas9 cleavage site either on the PAM-containing side of the Cas9 cut (PAM-proximal) or on the non-PAM side of the Cas9 cut (PAM-distal) for three genomic loci. Donor templates were designed to have: (1) no mutation, (2) an ‘NGG’ to ‘NCC’ PAM mutation to prevent re-cleavage after HDR, (3) mutations placed along the repair track every 5th nt, either alone or (4) a combination of the PAM mutations with repair track mutations (Fig. 4A). Both the T and NT strands were tested to determine which donor templates facilitated the highest HDR incorporation of an insert outside of the previously established optimal placement. ssODN donor templates were delivered along with their respective Cas9 RNP complexes to HeLa cells by nucleofection, and the frequency of perfect HDR containing both the desired HDR mutation and any additional mutations was determined by NGS. The mean HDR rate for each ssODN design across three biological replicates for each of the three genomic loci tested is shown in Fig. 4B (for each ssODN design n = 9). For PAM-distal insertions, the NT strand donor templates containing repair track mutations gave a mean HDR frequency of 12.7% compared to 1.6% when the T strand with repair track mutations was used. In contrast, for PAM-proximal insertions, the T strand containing repair track and PAM mutations gave higher HDR than the NT strand with the same mutations (8.6% vs 0.8%, respectively). For PAM-proximal insertions, the repair track mutations marginally improved the HDR efficiency above incorporating a PAM mutation alone, increasing the HDR from 7.5 to 8.6% and not found to be a significant improvement (p > 0.05, paired t-test). However, for PAM-distal insertions, the repair track mutations significantly improved the frequency of HDR 3.4-fold over having only a PAM mutation (p < 0.01, paired t-test).
To further investigate the role of strand preference and repair track mutations in HDR improvement, we tested the impact on 6-nt insertions placed at suboptimal distances (> 15-nt) away from the Cas9 cleavage site. In this experiment, 12 loci from the set of 254 targets presented in Fig. 1B were selected as a subset of gRNAs. These gRNAs were selected as sites for HDR because they demonstrated one of three characteristics: no apparent strand preference, an obvious strand preference for the T strand, or an obvious strand preference for the NT strand in either Jurkat or HAP1 cells. Donor templates were designed following the same principles as the prior experiment, placing an EcoRI insertion either at the Cas9 cleavage site or 20 bases PAM-proximal or PAM-distal. Given the results shown in Fig. 4B that identify PAM-distal insertions with PAM mutations as mediating sub-optimal insertion frequencies, a single PAM mutation alone was subsequently tested for PAM-proximal insertions. In addition, repair track mutations were incorporated every 3–7 nt between the Cas9 cleavage site and the desired HDR mutation (Fig. 4C). These ssODN donor templates were delivered to Jurkat and HeLa cells along with their respective Cas9 RNP complexes by nucleofection, and the frequency of perfect HDR was determined by NGS with the mean HDR rate for each ssODN across the 12 genomic loci shown in Fig. 4D. Across all 12 sites tested, the NT strand gave higher HDR than the T strand for PAM-distal insertions; conversely, the T strand gave higher HDR than the NT strand for PAM-proximal insertions (Fig. 4D, Supplemental Fig. 4A) as had been seen previously. Similar to the previous experiment, repair track mutations in combination with a single PAM mutation for PAM-proximal insertions had a modest improvement in HDR rates over the single PAM mutation alone, increasing from 3.0 to 4.9% in Jurkat cells and 5.4% to 8.7% in Hela cells. The level of HDR improvement for the various mutation strategies had site-to-site variability (Supplemental Fig. 4B). However, the strand preference was universal to all sites tested, indicating that for PAM-proximal insertions, the T strand is consistently favored as the donor template, and for PAM-distal insertions, the NT strand consistently delivers the highest rate of HDR.
Optimized design rules for HDR with Cas12a
Cas12a is a type II CRISPR–Cas nuclease with several distinct differences to Cas9. Cas12a generates a DSB with 5’ overhangs, requires a ‘TTTV’ PAM, and enables editing in AT-rich genomes4. We designed experiments to characterize HDR design rules for A.s. Cas12a in a manner consistent with what has been described with Cas9. First, the optimal placement of an insertion was determined by designing donor templates for five sites in the HPRT1 gene. These donor templates were designed to insert an EcoRI restriction digest recognition site at varying positions relative to the guide sequence (Fig. 5A), ranging from 9 bases in the 5’ direction from the first base of the guide to 45 bases 3’ of the first base of the guide covering a span of 54 bases. The optimal HDR activity for this insert is not centered around the A.s. Cas12a cleavage site, canonically positioned as staggered between positinos18 and 23 bases from the PAM, as was the case for Cas9. Rather, the results demonstrate a strong preference for insertions between positions 12–16 of the guide (Fig. 5B).
Interestingly, there is an increase in EcoRI insertion frequency around position 24 even though this position falls outside of the protospacer region. We hypothesized this to be a result of imperfect HDR where an EcoRI site is inserted via HDR, but is met with Cas12a re-cleavage which then allows the insertion of other indels from NHEJ repair. To investigate this possibility, we performed NGS analysis of one of the five sites from Fig. 5B to examine the frequency of perfect HDR insertion relative to imperfect HDR insertion. At position 24, while the amount of EcoRI insertion was 6.4% by EcoRI cleavage (Supplemental Fig. 5A), the amount of perfect HDR when measured by NGS is < 1% and the imperfect HDR, which includes HDR insertion of an EcoRI site and additional indels, was 5.9% (Supplemental Fig. 5B). Thus, we confirmed by NGS that the optimal position for Cas12a-mediated perfect HDR is between positions 12–16 of the guide, and moving an insertion outside of the protospacer can give the desired insertion, but is complicated by undesired editing.
To investigate if Cas12a demonstrates a universal strand preference when an EcoRI insertion was optimally placed, a set of 15 Cas12a guide RNAs was selected, and donor templates were designed to insert an EcoRI restriction digest recognition site 16 bases 3’ of the PAM. Both the T and NT strand ssODN donor templates were delivered with their respective RNP complexes to Jurkat and HAP1 cells by nucleofection, and NGS was used to measure the frequency of total editing and perfect HDR. The combined results from the fifteen sites comparing T and NT strand donors in two cell lines is shown in Fig. 5C. Although there are differences in total editing across the 15 sites tested (varying from 30 to > 95% total editing), universally, the total editing was lower when the T strand was used. This is shown in Fig. 5C, top panel by the data points generally clustering below the line through the origin because of the increased total editing when delivered with the NT strand. As a result of the discrepancy observed in favor of the NT strand mediating increased total editing (top panel of 5C), the frequency of HDR was also higher when the NT strand was used as the donor template than when the T strand was used. This represents a statistically significant preference for the use of the NT strand as the donor template to achieve optimal results in HDR experiments using Cas12a nuclease.
The results from Fig. 5B and follow-up in Supplemental Fig. 5B suggest that blocking mutations could also be beneficial in Cas12a-mediated HDR. We designed experiments to investigate whether HDR could be improved at a position outside of the protospacer sequence by incorporating blocking mutations within the ssODN donor template. Donor templates with an EcoRI insertion optimally placed at position 15 of the guide, or sub-optimally at position 24 from the first base of the guide (outside of the protospacer sequence) were designed to include either no blocking mutation, a blocking mutation of the PAM sequence (TTTV to TVTV), or a blocking mutation within the protospacer sequence at various positions (Fig. 5D). These were tested as NT strand donor templates at two genomic loci within the HPRT1 gene and in two cell lines, Jurkat and Hela. When the EcoRI cleavage site was inserted within the guide sequence, there was no benefit to including blocking mutations to prevent further re-cleavage, likely because the EcoRI site disrupts subsequent cleavage events (Fig. 5E, left panel). However, when the EcoRI insertion was outside of the PAM/protospacer region, blocking mutations increased the rate of HDR from 0.7%, to 13.3% with a mutation in the PAM and up to 13.0% HDR with a mutation at position 14 of the guide (Fig. 5E, right panel). These results show that, similar to Cas9, blocking mutations are beneficial with Cas12a and can be used to broaden the available window for efficient HDR insertions.
Discussion
ssODN donor templates are routinely used to generate mutations or small insertions with CRISPR–Cas proteins. This is desirable for many applications including the generation of functional domains such as epitope tags or fluorescent proteins fused to endogenous genes for biological studies, creation of cell lines with a known mutation for disease modeling, and correction of a genetic disease for therapeutic applications36, 51, 52. However, the design of these donor templates remains challenging for researchers due to uncertainty about which CRISPR–Cas system should be applied, selection of gRNA(s) for each new HDR mutation location, and the most beneficial template strand to be used to achieve the highest frequency of HDR. In addition, the design process for ssODN donors can be time-consuming, particularly with the incorporation of silent blocking mutations to prevent re-cleavage yet maintain amino acid translation. We have thoroughly investigated design considerations for S.p. Cas9 nuclease, S.p. Cas9 D10A nickase and A.s. Cas12a nuclease and present optimized design considerations for each enzyme, including positioning of the gRNA(s) relative to the desired mutation, donor strand preference, and the incorporation of blocking mutations to improve desired HDR.
Initiating an HDR genome editing project requires consideration of which CRISPR–Cas enzyme to utilize. Our results support that this choice should be dependent on where the relative genomic location of the desired mutation(s) resides in relation to the available CRISPR–Cas PAMs/guides. If there is an ‘NGG’ PAM near the desired mutation (< 15 bases), and this guide is expected or known to edit efficiently, then the data suggests WT Cas9 can be used. If the available ‘NGG’ PAM sites are greater than 15 bases from the desired mutation, then the data presented in Fig. 2 indicates that a PAM-out paired guide design with Cas9 D10A nickase may confer higher HDR than WT Cas9, provided the mutation is placed between the two nick sites generated by Cas9 D10A. This can be particularly useful if additional blocking mutations are not desired or off-target DSBs are a concern. Alternatively, the results shown in Fig. 5 support the use of Cas12a when a ‘TTTV’ PAM site positions the HDR mutation between the 12–16th bases of a Cas12a protospacer; however, like S.p. Cas9, this window can be extended with the incorporation of blocking mutations. When multiple gRNA options are available for a desired HDR edit, screening (via directly testing editing efficiency or in silico screening for activity) serves to eliminate any inherently low activity guides to preserve the highest likelihood of efficient HDR.
The lack of efficient Cas9 guides available near a desired mutation is a significant limitation for many HDR experiments. As shown in Fig. 1C,D, the guide or guides closest to the desired HDR mutation can be sub-optimal in terms of cleavage efficiency or proximity. In Fig. 2, we show utility in a paired-guide/Cas9 nickase strategy to address this limitation of PAM proximity, but the requirement of having two guides with optimal spacing, activity, and orientation can still limit the design options and precludes this strategy for certain sites where there are no nickase designs available. Paix et al.44 demonstrated that incorporating additional mutations in the repair track between the cut site and desired HDR mutation location is another strategy to facilitate a wider region of donor integration. We observed this to be beneficial for PAM-distal HDR events as shown in Fig. 4. However, repair track mutations in combination with PAM mutations did not further increase HDR, and the literature suggests that the PAM mutation would likely fall outside the effective conversion zone for SDSA repair38. For PAM-proximal HDR mutations, repair track mutations were beneficial for some sites, but not all; however, the track mutations in tandem with the PAM mutations do show an additive benefit (Fig. 4B,D). Our results show that incorporating mutations within the repair track between the cut site and desired HDR mutation or within the PAM may improve the rate of successful HDR when optimally spaced Cas9 or Cas12a guide designs with respect to the intended mutation are not present.
The addition of blocking mutations has demonstrated improvement in HDR depending on the selected guide RNA and its relative positioning to the desired HDR mutation. Blocking mutations are beneficial when the desired HDR mutation does not prevent re-cleavage by the CRISPR–Cas nuclease. We have optimized the design of blocking mutations for use with Cas9 nuclease, including the placement and number of blocking mutations required, and this has been built into our HDR template design tool which facilitates simple donor template design. This work comprises the Alt-R HDR Design Tool which allows for gRNA selection for WT Cas9, balancing the distance from the cut to mutation and on- and off-target scores of available gRNAs, and Cas9 D10A nickase, where the gRNA orientation and distance between nick sites is considered. In addition, the tool provides the option to add silent blocking mutations using our empirically defined ruleset in a publicly-available user-friendly interface. In our study, we identified no bias favoring which mismatched base was used as the blocking mutation to prevent Cas9 re-cleavage, which imparts flexibility in designing appropriate silent blocking mutations so as to preserve the coding sequence. However, this could also indicate that the effect size is small or site-specific and with a larger data set, potential differences between alternate bases used for silent blocking mutations could be resolved. Further investigation into the optimized number and placement of blocking mutations with Cas12a is underway with the expectation that this will be built into a tool for Cas12a HDR donor template design.
After a CRISPR–Cas system and gRNA(s) have been selected and the donor template has been designed, homology arm lengths remains a variable for consideration. In previous work investigating HDR improvements in the cell lines mentioned above, asymmetric homology arms did not improve HDR beyond symmetrical homology arms when arm length was ≥ 30-nt from both the mutation location and the Cas9 cleavage site (data not shown). As such, one standard approach that could be employed is to design ssODN donor templates with 40-nt homology arms; however, the design tool allows for custom homology arm lengths to accommodate asymmetric designs, if desired.
A final donor template design consideration that we investigated was strand preference for the donor template. Cas9 D10A nickase did not demonstrate a strong strand preference as shown in both HEK293 and K562 cells in Fig. 2B, so testing both strands to determine which results in the highest HDR frequency may be prudent. However, for WT Cas9 we have shown that the preferred strand is strongly dependent upon where the desired HDR mutation is, relative to the Cas9 gRNA. Previous reports have demonstrated that when using ssODN donor templates with Cas9 nuclease the SDSA mechanism of repair is preferentially utilized, which consists of two steps38. After a DSB is generated, the ends are resected, generating 3’ overhangs which are then available for base pairing with the donor DNA. This donor DNA then serves as a template for 5’ to 3’ DNA synthesis. Although we observed no universal donor strand preference in the experiment outlined in Fig. 1B, the HDR insertion was placed directly at the Cas9 cleavage site where the SDSA model predicts high relative HDR regardless of the donor strand used. However, for insertions further from the Cas9 cleavage site there is a preference for the donor strand that contains 3’ sequence complementary to the overhangs generated during DSB repair44. To drive maximal HDR efficiency, the SDSA model for repair with ssONDs and data shown above agree with T strand usage for PAM-proximal insertions, and the same is true for PAM-distal mutations favoring the NT strand. For mutations directly at the cut site, we provide some evidence that the use of the T strand may reduce total editing with Cas9 which negatively impacts HDR, but this was not the case for both cell types tested. Using Cas12a, we observed a distinct reduction in total editing rates when the T strand was used universally. We hypothesize that the donor template binds to the RNP complex, reducing the intracellular RNP concentration available for genome editing, or activates the non-specific ssDNase activity of Cas12a. The NT strand conferred increased HDR for experiments with Cas12a over the T strand. However, the effect of HDR insertion placement has not been thoroughly investigated for Cas12a to determine if the T strand will be advantageous over the NT strand for PAM-proximal mutations in a manner consistent with our Cas9 observations.
We have studied design rules for A.s. Cas12a nuclease, which had not yet been systematically examined. Further, the ruleset for S.p. Cas9 and S.p. Cas9 D10A nickase have been incorporated into a novel bioinformatic tool for HDR donor template design. Taken altogether, these findings present design recommendations for achieving high frequency of precise repair outcomes utilizing HDR in mammalian cell lines.
Methods
Ribonucleoprotein complex formation
Cas9 gRNAs were prepared by mixing equimolar amounts of Alt-R™ crRNA and Alt-R tracrRNA (Integrated DNA Technologies, Coralville, IA, USA) in IDT Duplex Buffer (30 mM HEPES, pH 7.5, 100 mM potassium acetate; Integrated DNA Technologies), heating to 95 °C and slowly cooling to room temperature or using Alt-R sgRNA (Integrated DNA Technologies) hydrated in IDTE pH 7.5 (10 mM Tris, pH 7.5, 0.1 mM EDTA; Integrated DNA Technologies). Cas12a gRNAs consisted of Alt-R Cas12a crRNAs (Integrated DNA Technologies) hydrated in IDTE pH 7.5. RNP complexes were assembled by combining the CRISPR–Cas nuclease (Alt-R S.p. Cas9 Nuclease V3, Alt-R S.p. HiFi Cas9 Nuclease V3, Alt-R S.p. Cas9 D10A V3, Alt-R S.p. Cas9 H840A V3, Alt-R A.s. Cas12a V3, or Alt-R A.s. Cas12a Ultra; Integrated DNA Technologies) and the Alt-R gRNA at a 1:1 to 1.2:1 molar ratio of gRNA:protein and incubating at room temperature for 30 min. For paired nicking experiments, each RNP was formed separately, and two RNPs were mixed together at an equal molar ratio prior to adding to the cells at the time of transfection. The 20-nt target specific sequences of the gRNAs used in this study are listed in Supplementary Table 1.
HDR ssODN donor templates
Alt-R™ HDR Donor Oligos (Integrated DNA Technologies) used in this study consisted of either Alt-R modified (containing two phosphorothioate linkages at the ultimate and penultimate backbone linkage and an IDT proprietary end-blocking modification at 5’ and 3’ ends), PS modified (containing two phosphorothioate linkages at the ultimate and penultimate backbone linkage at 5’ and 3’ ends) or unmodified DNA. Donor oligos were hydrated using IDTE pH 7.5 (Integrated DNA Technologies). Sequences of the HDR oligos used in this study are listed in Supplementary Table 1.
Cell culture
HAP1, HEK293, HeLa, Jurkat E6-1, and K562 cells were purchased from ATCC® (Manassas, VA, USA), and maintained in DMEM (HEK293, and HeLa), RPMI-1640 (Jurkat) and IMDM (HAP1, K562) (ATCC), each supplemented with 10% fetal bovine serum (FBS) and 1% penicillin–streptomycin (Thermo Fisher Scientific, Carlsbad, CA, USA). HEK293 cells that constitutively express Cas9 nuclease (”HEK293-Cas9″) were generated by stable integration of a human-codon optimized S.p. Cas9 as well as the flanking 5’ and 3’ nuclear localizing sequences and 5’-V5 tag from the GeneArt CRISPR Nuclease Vector (Thermo Fisher Scientific). HEK293-Cas9 cells were maintained in DMEM supplemented with 10% FBS, 1% penicillin–streptomycin, and 500 μg/mL G418 (Thermo Fisher Scientific). Cells were incubated at 37 °C with 5% CO2 and passaged every 3 days. HAP1 cells were used for transfection at 50–70% confluency. HEK293 and HeLa cells were used for transfection at 70–90% confluency. Jurkat and K562 were used for transfection at 5–8 × 105 cells/mL density. After transfection, cells were grown for 48–72 h in total, after which genomic DNA was isolated using QuickExtract™ DNA Extraction Solution (Epicentre, Madison, WI, USA).
Delivery of genome editing reagents by lipofection
Lipofection was performed in 96-well plates. First, 25 µL of Opti-MEM® (Thermo Fisher Scientific) containing 1.2 µL (RNP delivery) or 0.75 µL (gRNA delivery) of Lipofectamine® RNAiMAX (Thermo Fisher Scientific) was combined with equal volume of Opti-MEM containing RNP or gRNA and HDR donor template (when present), and incubated at room temperature for 20 min. After lipoplex formation, 4.5 × 104 cells resuspended in 100 µL of DMEM + 10% FBS were added to the transfection complex which resulted in a final concentration of 10 nM RNP or gRNA and 3 nM HDR oligo on a per-well basis. Transfection plates were incubated at 37 °C and 5% CO2.
Delivery of genome editing reagents by nucleofection
Electroporation was performed using the Lonza™ Nucleofector™ 96-well Shuttle™ System (Lonza, Basel, Switzerland). For each nucleofection, cells were washed with 1× phosphate buffered saline (PBS) and resuspended in 20 µL of solution SF or SE (Lonza). Cell suspensions were combined with RNP complex(es), Alt-R Cas9 or Cpf1 (Cas12a) Electroporation Enhancer (Integrated DNA Technologies) and HDR donor template (if applicable). This mixture was transferred into one well of a Nucleocuvette™ Plate (Lonza) and electroporated using manufacturer’s recommended protocols (except for HEK293, which used protocol 96-DS-150). After nucleofection, 75 µL pre-warmed culture media was added to the cell mixture in the cuvette, mixed by pipetting, and 25 µL was transferred to a 96-well culture plate with 175 µL pre-warmed culture media. Transfection plates were incubated at 37 °C and 5% CO2.
T7 endonuclease I (T7EI) assay and restriction enzyme digestion
Genomic DNA was extracted after 48–72 h incubation using 50 µL Quick Extract™ DNA Extraction Solution (Lucigen, Middleton, WI, USA) following the manufacturer’s protocol. Genomic DNA was diluted threefold with nuclease-free water and 1.5 µL was PCR-amplified using 0.15 U KAPA HiFi HotStart DNA Polymerase (Kapa Biosystems, Wilmington, MA, USA) in a final volume of 10 µL. For HDR analysis using restriction enzyme digestion, 10 µL of the PCR product was incubated with 2 U of EcoRI-HF® in 1× CutSmart® Buffer (New England BioLabs, Ipswich, MA, USA) at 37 °C for 60 min. Total editing rate was measured using the Alt-R™ Genome Editing Detection Kit (T7EI) (Integrated DNA Technologies) following the manufacturer’s protocol. Cleavage products were separated on the Fragment Analyzer™ using the CRISPR Mutation Discovery Kit (Agilent Technologies, Santa Clara, CA, USA). Editing and HDR frequencies were calculated using the following formula: average molar concentration of the cut products/(average molar concentration of the cut products + molar concentration of the uncut product) × 100. PCR primers are listed in Supplementary Table 1.
Quantification of editing events by next-generation sequencing (NGS)
On-target editing and HDR efficiencies were also measured by NGS. Libraries were prepared using an amplification-based method as described previously 53. In short, the first round of PCR was performed using target specific primers, and the second round of PCR incorporates P5 and P7 Illumina adapters to the ends of the amplicons for universal amplification. Libraries were purified using Agencourt® AMPure® XP system (Beckman Coulter, Brea, CA, USA), and quantified with qPCR before loading onto the Illumina® MiSeq platform (Illumina, San Diego, CA, USA). Paired end, 150 bp reads were sequenced using V2 chemistry. Data were analyzed using a custom-built pipeline. Data was demultiplexed using Picard tools v2.9 (https://github.com/broadinstitute/picard). Forward and reverse reads were merged into extended amplicons (flash v1.2.11)54 before being aligned against the GRCh38 genomic reference (minimap2 v2.12)55. Reads were aligned to the target, favoring alignment choices with indels near the predicted cut site(s). At each target, editing was calculated as the percentage of total reads containing an indel within an 8 bp window of the cut site for Cas9 or a 9 bp window from the −3 position of the Cas12a PAM distal cut site. PCR primers are listed in Supplementary Table 1.
Statistical analysis
The data collected from experiments were analysed on GraphPad Prism 8 using two-tailed unpaired t-test to evaluate significance (*p < 0.05, **p < 0.01, ***p < 0.001, and ****p < 0.0001).
Supplementary Information
Acknowledgements
The authors thank Ashley Jacobi and Kim Lennox for editing the manuscript.
Author contributions
Conceptualization, M.S.S., B.T., J.W., R.T., S.Y., M.S.M. and G.R.R.; Software, G.K. and M.S.M.; Investigation, M.S.S., B.T., J.W., R.T. and S.Y.; Data Curation, M.S.S. and M.S.M.; Writing-Original Draft, M.S.S.; Writing-Review & Editing, M.S.S., B.T., J.W., R.T., S.Y., G.K., M.S.M. and G.R.R.; Supervision: G.R.R.
Data availability
The Alt-R HDR Design Tool is a free online tool that is available from the Integrated DNA Technologies website (https://www.idtdna.com/pages/tools/alt-r-crispr-hdr-design-tool). NGS data used for the figures and supplementary figures have been made available at SRA BioProject Accession # PRJNA638623.
Competing interests
This work was supported by internal funds from Integrated DNA Technologies, Inc. The authors are employed by Integrated DNA Technologies, Inc., (IDT) which offers reagents for sale similar to some of the compounds described in the manuscript.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-021-98965-y.
References
- 1.Cong L, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Mali P, et al. RNA-guided human genome engineering via Cas9. Science. 2013;339:823–826. doi: 10.1126/science.1232033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Wang H, La Russa M, Qi LS. CRISPR/Cas9 in genome editing and beyond. Annu. Rev. Biochem. 2016;85:227–264. doi: 10.1146/annurev-biochem-060815-014607. [DOI] [PubMed] [Google Scholar]
- 4.Zetsche B, et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR–Cas system. Cell. 2015;163:759–771. doi: 10.1016/j.cell.2015.09.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hur JK, et al. Targeted mutagenesis in mice by electroporation of Cpf1 ribonucleoproteins. Nat. Biotechnol. 2016 doi: 10.1038/nbt.3596. [DOI] [PubMed] [Google Scholar]
- 6.Deltcheva E, et al. CRISPR RNA maturation by trans-encoded small RNA and host factor RNase III. Nature. 2011;471:602–607. doi: 10.1038/nature09886. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Jinek M, et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Jiang F, Doudna JA. CRISPR–Cas9 structures and mechanisms. Annu. Rev. Biophys. 2017;46:505–529. doi: 10.1146/annurev-biophys-062215-010822. [DOI] [PubMed] [Google Scholar]
- 9.Kim S, Kim D, Cho SW, Kim J, Kim JS. Highly efficient RNA-guided genome editing in human cells via delivery of purified Cas9 ribonucleoproteins. Genome Res. 2014;24:1012–1019. doi: 10.1101/gr.171322.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Anders C, Jinek M. In vitro enzymology of Cas9. Methods Enzymol. 2014;546:1–20. doi: 10.1016/B978-0-12-801185-0.00001-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Liang X, et al. Rapid and highly efficient mammalian cell engineering via Cas9 protein transfection. J. Biotechnol. 2015;208:44–53. doi: 10.1016/j.jbiotec.2015.04.024. [DOI] [PubMed] [Google Scholar]
- 12.Liang Z, et al. Efficient DNA-free genome editing of bread wheat using CRISPR/Cas9 ribonucleoprotein complexes. Nat. Commun. 2017;8:14261. doi: 10.1038/ncomms14261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Hendel A, et al. Chemically modified guide RNAs enhance CRISPR–Cas genome editing in human primary cells. Nat. Biotechnol. 2015;33:985–989. doi: 10.1038/nbt.3290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mekler V, Minakhin L, Semenova E, Kuznedelov K, Severinov K. Kinetics of the CRISPR–Cas9 effector complex assembly and the role of 3'-terminal segment of guide RNA. Nucleic Acids Res. 2016;44:2837–2845. doi: 10.1093/nar/gkw138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Davis L, Maizels N. Homology-directed repair of DNA nicks via pathways distinct from canonical double-strand break repair. Proc. Natl. Acad. Sci. U S A. 2014;111:E924–932. doi: 10.1073/pnas.1400236111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Chen X, et al. In trans paired nicking triggers seamless genome editing without double-stranded DNA cutting. Nat. Commun. 2017;8:657. doi: 10.1038/s41467-017-00687-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ran FA, et al. Double nicking by RNA-guided CRISPR Cas9 for enhanced genome editing specificity. Cell. 2013;154:1380–1389. doi: 10.1016/j.cell.2013.08.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kocher T, et al. Cut and paste: Efficient homology-directed repair of a dominant negative KRT14 mutation via CRISPR/Cas9 nickases. Mol. Ther. 2017;25:2585–2598. doi: 10.1016/j.ymthe.2017.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Cho SW, et al. Analysis of off-target effects of CRISPR/Cas-derived RNA-guided endonucleases and nickases. Genome Res. 2014;24:132–141. doi: 10.1101/gr.162339.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Bothmer A, et al. Characterization of the interplay between DNA repair and CRISPR/Cas9-induced DNA lesions at an endogenous locus. Nat. Commun. 2017;8:13905. doi: 10.1038/ncomms13905. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Shen B, et al. Efficient genome modification by CRISPR–Cas9 nickase with minimal off-target effects. Nat. Methods. 2014;11:399–402. doi: 10.1038/nmeth.2857. [DOI] [PubMed] [Google Scholar]
- 22.Fauser F, Schiml S, Puchta H. Both CRISPR/Cas-based nucleases and nickases can be used efficiently for genome engineering in Arabidopsis thaliana. Plant J. 2014;79:348–359. doi: 10.1111/tpj.12554. [DOI] [PubMed] [Google Scholar]
- 23.Satomura A, et al. Precise genome-wide base editing by the CRISPR nickase system in yeast. Sci. Rep. 2017;7:2095. doi: 10.1038/s41598-017-02013-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Xu T, et al. Efficient genome editing in Clostridium cellulolyticum via CRISPR–Cas9 nickase. Appl. Environ. Microbiol. 2015;81:4423–4431. doi: 10.1128/AEM.00873-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Schiml S, Fauser F, Puchta H. The CRISPR/Cas system can be used as nuclease for in planta gene targeting and as paired nickases for directed mutagenesis in Arabidopsis resulting in heritable progeny. Plant J. 2014;80:1139–1150. doi: 10.1111/tpj.12704. [DOI] [PubMed] [Google Scholar]
- 26.Mikami M, Toki S, Endo M. Precision targeted mutagenesis via Cas9 paired nickases in rice. Plant Cell Physiol. 2016;57:1058–1068. doi: 10.1093/pcp/pcw049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Tsai SQ, et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR–Cas nucleases. Nat. Biotechnol. 2015;33:187–197. doi: 10.1038/nbt.3117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kim D, et al. Genome-wide analysis reveals specificities of Cpf1 endonucleases in human cells. Nat. Biotechnol. 2016 doi: 10.1038/nbt.3609. [DOI] [PubMed] [Google Scholar]
- 29.Chen JS, et al. CRISPR–Cas12a target binding unleashes indiscriminate single-stranded DNase activity. Science. 2018;360:436–439. doi: 10.1126/science.aar6245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Rouet P, Smih F, Jasin M. Introduction of double-strand breaks into the genome of mouse cells by expression of a rare-cutting endonuclease. Mol. Cell Biol. 1994;14:8096–8106. doi: 10.1128/mcb.14.12.8096. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Carroll D. Genome engineering with targetable nucleases. Annu. Rev. Biochem. 2014;83:409–439. doi: 10.1146/annurev-biochem-060713-035418. [DOI] [PubMed] [Google Scholar]
- 32.van Overbeek M, et al. DNA repair profiling reveals nonrandom outcomes at Cas9-mediated breaks. Mol. Cell. 2016;63:633–646. doi: 10.1016/j.molcel.2016.06.037. [DOI] [PubMed] [Google Scholar]
- 33.Mao Z, Bozzella M, Seluanov A, Gorbunova V. Comparison of nonhomologous end joining and homologous recombination in human cells. DNA Repair (Amst.) 2008;7:1765–1771. doi: 10.1016/j.dnarep.2008.06.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Chen F, et al. High-frequency genome editing using ssDNA oligonucleotides with zinc-finger nucleases. Nat. Methods. 2011;8:753–755. doi: 10.1038/nmeth.1653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Paquet D, et al. Efficient introduction of specific homozygous and heterozygous mutations using CRISPR/Cas9. Nature. 2016;533:125–129. doi: 10.1038/nature17664. [DOI] [PubMed] [Google Scholar]
- 36.Skarnes WC, Pellegrino E, McDonough JA. Improving homology-directed repair efficiency in human stem cells. Methods. 2019;164–165:18–28. doi: 10.1016/j.ymeth.2019.06.016. [DOI] [PubMed] [Google Scholar]
- 37.Wang Y, et al. Systematic evaluation of CRISPR–Cas systems reveals design principles for genome editing in human cells. Genome Biol. 2018;19:62. doi: 10.1186/s13059-018-1445-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kan Y, Ruis B, Takasugi T, Hendrickson EA. Mechanisms of precise genome editing using oligonucleotide donors. Genome Res. 2017;27:1099–1111. doi: 10.1101/gr.214775.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Liang X, Potter J, Kumar S, Ravinder N, Chesnut JD. Enhanced CRISPR/Cas9-mediated precise genome editing by improved design and delivery of gRNA, Cas9 nuclease, and donor DNA. J. Biotechnol. 2017;241:136–146. doi: 10.1016/j.jbiotec.2016.11.011. [DOI] [PubMed] [Google Scholar]
- 40.Richardson CD, Ray GJ, DeWitt MA, Curie GL, Corn JE. Enhancing homology-directed genome editing by catalytically active and inactive CRISPR–Cas9 using asymmetric donor DNA. Nat. Biotechnol. 2016;34:339–344. doi: 10.1038/nbt.3481. [DOI] [PubMed] [Google Scholar]
- 41.O'Brien AR, Wilson LOW, Burgio G, Bauer DC. Unlocking HDR-mediated nucleotide editing by identifying high-efficiency target sites using machine learning. Sci. Rep. 2019;9:2788. doi: 10.1038/s41598-019-39142-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Yang L, et al. Optimization of scarless human stem cell genome editing. Nucleic Acids Res. 2013;41:9049–9061. doi: 10.1093/nar/gkt555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Bialk P, Rivera-Torres N, Strouse B, Kmiec EB. Regulation of gene editing activity directed by single-stranded oligonucleotides and CRISPR/Cas9 systems. PLoS ONE. 2015;10:e0129308. doi: 10.1371/journal.pone.0129308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Paix A, et al. Precision genome editing using synthesis-dependent repair of Cas9-induced DNA breaks. Proc. Natl. Acad. Sci. U.S.A. 2017;114:E10745–E10754. doi: 10.1073/pnas.1711979114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Okamoto S, Amaishi Y, Maki I, Enoki T, Mineno J. Highly efficient genome editing for single-base substitutions using optimized ssODNs with Cas9-RNPs. Sci. Rep. 2019;9:4811. doi: 10.1038/s41598-019-41121-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Renaud JB, et al. Improved genome editing efficiency and flexibility using modified oligonucleotides with TALEN and CRISPR–Cas9 nucleases. Cell Rep. 2016;14:2263–2272. doi: 10.1016/j.celrep.2016.02.018. [DOI] [PubMed] [Google Scholar]
- 47.Maruyama T, et al. Increasing the efficiency of precise genome editing with CRISPR–Cas9 by inhibition of nonhomologous end joining. Nat. Biotechnol. 2015;33:538–542. doi: 10.1038/nbt.3190. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Riesenberg S, Maricic T. Targeting repair pathways with small molecules increases precise genome editing in pluripotent stem cells. Nat. Commun. 2018;9:2164. doi: 10.1038/s41467-018-04609-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Ma X, et al. Small molecules promote CRISPR–Cpf1-mediated genome editing in human pluripotent stem cells. Nat. Commun. 2018;9:1303. doi: 10.1038/s41467-018-03760-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Pattanayak V, et al. High-throughput profiling of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity. Nat. Biotechnol. 2013;31:839–843. doi: 10.1038/nbt.2673. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Antony JS, et al. Gene correction of HBB mutations in CD34(+) hematopoietic stem cells using Cas9 mRNA and ssODN donors. Mol. Cell Pediatr. 2018;5:9. doi: 10.1186/s40348-018-0086-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Dewari PS, et al. An efficient and scalable pipeline for epitope tagging in mammalian stem cells using Cas9 ribonucleoprotein. Elife. 2018 doi: 10.7554/eLife.35069. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Jacobi AM, et al. Simplified CRISPR tools for efficient genome editing and streamlined protocols for their delivery into mammalian cells and mouse zygotes. Methods. 2017;121–122:16–28. doi: 10.1016/j.ymeth.2017.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Magoc T, Salzberg SL. FLASH: Fast length adjustment of short reads to improve genome assemblies. Bioinformatics. 2011;27:2957–2963. doi: 10.1093/bioinformatics/btr507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Li H. Minimap2: pairwise alignment for nucleotide sequences. Bioinformatics. 2018;34:3094–3100. doi: 10.1093/bioinformatics/bty191. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The Alt-R HDR Design Tool is a free online tool that is available from the Integrated DNA Technologies website (https://www.idtdna.com/pages/tools/alt-r-crispr-hdr-design-tool). NGS data used for the figures and supplementary figures have been made available at SRA BioProject Accession # PRJNA638623.