Abstract
CRISPR/Cas9, base editors and prime editors comprise the contemporary genome editing toolbox. Many studies have optimized the use of CRISPR/Cas9, as the original CRISPR genome editing system, in substituting single nucleotides by homology directed repair (HDR), although this remains challenging. Studies describing modifications that improve editing efficiency fall short of isolating clonal cell lines or have not been validated for challenging loci or cell models. We present data from 95 transfections using a colony forming and an immortalized cell line comparing the effect on editing efficiency of donor template modifications, concentration of components, HDR enhancing agents and cold shock. We found that in silico predictions of guide RNA efficiency correlated poorly withactivity in cells. Using NGS and ddPCR we detected editing efficiencies of 5–12% in the transfected populations which fell to 1% on clonal cell line isolation. Our data demonstrate the variability of CRISPR efficiency by cell model, target locus and other factors. Successful genome editing requires a comparison of systems and modifications to develop the optimal protocol for the cell model and locus. We describe the steps in this process in a flowchart for those embarking on genome editing using any system and incorporate validated HDR-boosting modifications for those using CRISPR/Cas9.
Keywords: CRISPR, CRISPR/Cas9, genome editing, prime editing, homology directed repair (HDR), cell line, stem cells
1 Introduction
CRISPR-associated protein 9 (Cas9) genome editing systems have revolutionized biological research. The ability to target and modify genetic loci offers the potential to investigate the functional consequences of variants discovered through genome wide association studies (Gallagher and Chen-Plotkin, 2018; Chen et al., 2022) and to correct the >75,000 known human pathogenic variants listed in Clinvar (Landrum and Kattman, 2018). The original CRISPR/Cas9 system has been available for more than a decade and extensive research has characterized how CRISPR/Cas9 works and how this can be optimized for maximal efficiency, with over 18,000 publications related to the term “CRISPR” listed on PubMed. CRISPR/Cas9 components are widely commercially available as recombinant components and plasmids. It is possible to achieve efficiencies of up to 80% in human cells when utilizing CRISPR/Cas9 to disrupt the DNA sequence and generate gene knock-outs (Guo et al., 2018). However, when using CRISPR/Cas9 to substitute nucleotides (knock-in), the efficiency is far lower, often in the order of 1%, although efficiencies of up to ∼50% are reported following protocol modifications (Paquet et al., 2016; Kwart et al., 2017; Guo et al., 2018; Okamoto et al., 2019; Maurissen and Woltjen, 2020) (reviewed in Liu et al., 2019a). More recently, it was demonstrated in yeast that retron-based CRISPR/Cas9 could boost multiplexing knock-in experiments, opening new possibilities for human cells (Zhao et al., 2022). Nevertheless nucleotide substitution by CRISPR/Cas9 relies on the non-dominant DNA repair pathway, homology directed repair (HDR). CRISPR/Cas9’s reliance on DNA damage repair limits the efficiency of substituting nucleotides and can introduce off-target mutations.
Prime editing (PE) emerged in 2019 to address the low knock-in efficiency and propensity for off-target effects of CRISPR/Cas9 and is available in its latest iteration with PE4 and 5 (Anzalone et al., 2019; Chen et al., 2021). PE utilizes the guide RNA-directed “search” ability of CRISPR/Cas9 but builds on the accuracy of the “replace” ability by avoiding a double stranded break and reliance on cellular DNA repair. Base editors represent another alternative to CRISPR/Cas9 for single nucleotide substitution but are limited by their editing of bystander nucleotides making them unsuitable for repetitive nucleotides (Rees and Liu, 2018). Together, CRISPR/Cas9, PE and base editors comprise the main genome editing toolbox available to the contemporary researcher.
The efficiency of genome editing varies with the target locus, distance to the protospacer adjacent motif (PAM) and mismatch repair proficiency of the cell model (Chen et al., 2021; Ferreira da Silva et al., 2022). For example, the editing efficiency of CRISPR/Cas9 falls as distance from the PAM increases and stem cells show relative resistance to transfection compared to immortalized cell lines (Madsen and Semple, 2019). PE shows improved efficiency and reduced off-target changes compared to CRISPR/Cas9, however PE necessitates delivery as a plasmid, the design of more components (pegRNA spacer, extension and ngRNA for PE3b) and more extensive optimization compared to CRISPR/Cas9. PE efficiency is highly variable (up to 50-fold difference) depending on the genetic background of the cell model and component design (Anzalone et al., 2019). These factors make it likely that CRISPR/Cas9-mediated HDR will still be employed for repetitive target loci or cell models that are not amenable to PE, where a plasmid delivery is not desirable or where the rate of CRISPR/Cas9-mediated HDR is sufficiently high to avoid the need for extensive design and optimization processes. Selecting the right system for an experiment will depend on the target locus, intended substitution and cell model.
In this study we edited three single nucleotide loci implicated in the development of sarcomas using the CRISPR/Cas9 system. These loci are challenging to edit because they are repetitive, precluding the use of base editors, and distant from a PAM (>15 nucleotides), making editing by CRISPR/Cas9 or other Cas enzymes challenging. We applied several previously reported optimizing modifications across 95 transfections in induced pluripotent stem cells (iPSC), representing a colony forming model, and an adherent cell model to validate their utility in editing these challenging loci. Editing outcomes were characterized at each stage using contemporary technologies. While PE was not available at the time of our experiments, we have addressed how we would incorporate its use. Finally, we synthesize a flowchart that can be adapted to any subsequent CRISPR genome editing system.
2 Materials and Methods
2.1 Induced Pluripotent Stem Cells
2.1.1 Cell Culture
The human episomal line of induced pluripotent stem cells (A18945, Gibco; Thermo Fisher Scientific, Inc, Waltham, MA, United States ) was maintained in feeder-free culture on Geltrex (A1413202, Invitrogen, Thermo Fisher Scientific) and Essential 8 Flex (E8 Flex) medium (A28585, Gibco, Thermo Fisher Scientific) supplemented with 0.5% of Penicillin (10,000 U/ml). Cells were cultured at 37 °C in a humidified atmosphere with 5% CO2. Cells were passaged by incubation for 5 min at 37 °C with Dulbecco’s phosphate buffered saline (DPBS)-EDTA 0.5 mM pH 8.00 (Invitrogen, Thermo Fisher Scientific, 14190250 and 15575020).
2.1.2 CRISPR/Cas9 Editing of iPSC
All CRISPR/Cas9 components were purchased through IDT. Single-stranded oligodeoxynucleotides (ssODN) were ordered as Alt-R™ HDR Donor Oligos.
1. The gRNA was prepared by duplexing Alt-R® CRISPR-Cas9 crRNA and Alt-R® CRISPR-Cas9 tracrRNA (IDT, 1072532).
2. The ribonucleoprotein (RNP) was formed using 0.78 μl Alt-R® CRISPR-Cas9 gRNA, 1.02 μl Alt-R® S. p. Cas9 Nuclease V3 (IDT, 1081058) and 1.2 μl PBS (total 3 μl).
3. Cells were detached, counted and transfected using the Lonza™ P3 Primary Cell 4D-Nucleofector™ X (Lonza, V4XP-3024) and electroporation program CA137.
4. To prepare the electroporation mixture, 0.5 × 106 cells were resuspended in 20 μl Lonza electroporation buffer, 1 μl of RNP complex and 0.5 μl Alt-R™ HDR Donor Oligo.
5. Following transfection, cells were recovered: (i) at 37°C, (ii) at 32 °C for 24 h then at 37 °C, (iii) with Alt-R® CRISPR-Cas9 HDR enhancer (IDT, 1081072) in E8 Flex without antibiotics for 24 h, (iv) with DMSO at 1% in E8 Flex without antibiotics for 24 h.
2.1.3 Colony Picking
For all stressful steps, such as single cell dissociation and colony picking, E8 Flex medium with RevitaCell™ was used for 2 h before and until colonies formed. In all other steps E8 Flex was used without RevitaCell™.
Cells were detached with Accutase® (Innovative Cell Technologies, Inc, San Diego, CA AT104), counted and 700–1,000 cells per dish were plated. Medium-sized colonies were picked using a P200 pipette. The aspirated colony was transferred to a 96 well plate and triturated 10 times. When cells were 50–70% confluent they were split 1:2, half were seeded for genomic DNA extraction and half for subculture or freezing.
2.1.4 Mirror Plates for Freezing and for DNA Extraction
Colonies expanded in 96 well plates were washed with 100 μl DPBS and detached with 30 μl of Accutase®. A mirror plate for freezing was prepared containing 50 μl of 2X freezing medium (E8 Flex plus 20% DMSO). Cells were collected with 70 μl of E8 Flex and 50 μl were transferred to the mirror plate and stored at −80°C. The remaining 50 μl of suspension was kept for DNA extraction: the plate was spun at 1950 RCF for 30 min at 4 °C, the medium was removed, and the plate stored at −80°C. Upon thawing, 30–50 μl of Lucigen QuickExtract™ DNA Extraction Solution (LGC, Middlesex, United Kingdom, QE09050) was added, then triturated and heated at 65°C for 6 min then 98°C for 2 min and used directly for genotyping.
2.2 U-CH1 Chordoma Cell Line
2.2.1 Cell Culture
The human U-CH1 chordoma cell line (ATCC® CRL-3217™, www.chordomafoundation.org) was grown as previously described (Scheipl et al., 2016). Cell authentication was regularly performed by Short Tandem Repeat fingerprinting (Culture Collections, Public Health England, United Kingdom) (Supplementary Table 2).
2.2.2 CRISPR/Cas9 Editing
1. The gRNA was prepared by duplexing Alt-R® CRISPR-Cas9 crRNA and Alt-R® CRISPR-Cas9 tracrRNA (IDT, 1072532).
2. 3.9 μl Alt-R® CRISPR-Cas9 gRNA, 5.1 μl of 10 mg/ml (or 10 μg/μl = 50 μg = 300 pmol) Alt-R® S. p. Cas9 Nuclease V3 (IDT, 1081058) and 5.9 μl sterile DPBS (total 15 μl) were combined for the RNP.
3. Cells were detached, counted, and transfected using the Lonza Amaxa® Cell Line Nucleofector® Kit V (Lonza, Basel, Switzerland VCA-1003) using electroporation program A30. To prepare the electroporation mixture, 2 × 106 cells were resuspended in 60 μl Lonza electroporation buffer, with 10 μl of RNP complex (final Cas9 concentration ∼100 pmol) and 3 μl of modified Alt-R™ HDR Donor Oligo.
4. Following transfection, the cells were recovered in medium without antibiotics.
2.2.3 Analysis of editing outcomes using digital droplet polymerase chain reaction (ddPCR)
A common primer set and probes for each allele were designed: a hexachlorofluorescein (HEX) probe for the parental allele (A/T) and fluorescein amidite (FAM) probe for the edited allele (G/C). ddPCR assays were designed using primer3plus (Untergasser et al., 2007) and the BioRad Droplet Digital™ PCR Applications Guide. ddPCR experiments were carried out using the BioRad QX200 ddPCR supermix for probes (no dUTP) workflow, Automated Droplet Generator, BioRad Automated Droplet Generation Oil for Probes (BioRad, Hercules, California, United States; #1864110), Eppendorf vapo. protect thermocycler and QX200 Automated Droplet Reader. Results were analyzed using the BioRad QuantaSoft™ Analysis Pro Software using rare event detection.
2.2.4 Flow Cytometry Activated Cell Sorting (FACS)
Transfected U-CH1 cells were recovered for 36 h before single cell sorting into collagen-coated 96 well plates using a BD FACS Aria Fusion Cell Sorter™ (Becton Dickinson, Franklin Lakes, New Jersey, United States ) running FACSDiva Software version 6. Cells were incubated with TOPRO3+ (Garvey et al., 2016) before sorting to allow the exclusion of dead cells and the top 10% of ATTO-550 positive cells were selected.
2.2.5 Mirror Plate for DNA Extraction
Cells were washed with 100 μl DPBS and detached with 50 μl of Accutase® (Innovative Cell Technologies, AT104) at 37 °C for 10–15 min 50μl of medium was added, 50 μl were taken for genomic DNA extraction using Lucigen QuickExtract™ (LGC, QE09050) while the other 50 μl were replated for subculture.
2.3 Techniques Common to Both Cell Models
Regular testing was performed to exclude mycoplasma contamination using the EZ-PCR Mycoplasma Test Kit (K1-0210, Geneflow, Lichfield, Staffordshire, United Kingdom).
2.3.1 Genotyping Using Illumina MiSeq™ Next Generation Sequencing (NGS)
DNA was extracted using Zymo Column Extraction (Zymo Research, Irvine, California, United States , D3024) (for bulk transfections) or Lucigen QuickExtract™ DNA Extraction Solution (for picked colonies). PCR was performed with Kapa Hifi HotStart polymerase (for <500 base pair product) (Kapa Biosystems, Roche Molecular Systems, Inc, Pleasanton, California, United States , KR0370): 12.5 μl 2X KAPA HiFi HotStart ReadyMix, 0.75 μl 10 μM Forward Primer (with MiSeq™ adapter, Supplementary Table S3), 0.75 μl 10 μM Reverse Primer (with MiSeq™ adapter), 2 μl DNA and PCR-grade water up to 25 μl. 3 μl of DNA extracted in Lucigen was used for PCR. The PCR products were purified using the QIAquick PCR Purification Kit (QIAGEN Ltd, Manchester, England). MiSeq™ was performed in-house.
2.3.2 Analysis of Editing Outcomes Using MiSeq™ Data
FASTQ files were analyzed using Cas Analyser (Park et al., 2017) with the following parameters: Nuclease type = single nuclease, comparison range (R) = 40, Minimum frequency (n) = 1 and no optional wild type marker. Rates of unedited and edited outcomes (NHEJ ± substitution and HDR) were calculated by number of reads containing outcome/total number of reads. Indels were changes in sequence length compared to the reference sequence.
We defined the outcomes as follows:
• Wild type/unedited (90–100% of reads match the reference sequence)
• (Homozygous) knock-out (90–100% of reads show indels)
• Heterozygous knock-in (40–60% of the reads match the reference and 40–60% show the knock-in)
• Homozygous knock-in (90–100% of reads show the knock-in)
• Combined/mixed repair (40–60% of the reads show indels and 40–60% match the reference sequence or show the knock-in).
2.3.3 Genotyping Using Sanger Sequencing
PCR was performed: 12.5 μl AmpliTaq Gold™ 360 Master Mix (Thermo Fisher Scientific, 4398881), 0.5 μl 10 μM forward-reverse primer mix (Supplementary Table S3), 10 μl water plus 2 μl Lucigen QuickExtract™ DNA. PCR products were cleaned using the ExoSAP-IT™ Express PCR Product Cleanup Reagent (Applied Biosystems, Thermo Fisher Scientific, 15563677) and sent for Sanger sequencing (Source BioScience, Nottingham, United Kingdom).
2.3.4 Genotyping Using TaqMan™ qPCR
TaqMan™ genotyping was performed for rs2305089 (Applied Biosystems, 4351379): 5 μl TaqMan™ genotyping mastermix, 0.5 μl Taqman™ primer/probe mix (C__11223433_10), 3.5 μl water, 1 μl DNA. Results were analysed using the Genotyping application on the Thermo Fisher Connect™ cloud.
2.4 Data Analysis
All analysis and statistics were performed using R version 4.0.5 (2021–03–31) and GraphPad Prism version 8.0.0 for Windows (GraphPad Software, San Diego, California United States , www.graphpad.com). Cartoons were created with Biorender.com.
3 Results
3.1 Editing of TP53 SNVs in iPSC or Other Colony Forming Cell Model
We tested the ability of CRISPR/Cas9 modifications to introduce the germline pathogenic G245D and R248Q variants in TP53 which lie in proximity, distant from a PAM and in repetitive sequences, into iPSC (Figure 1A). iPSCs tolerate transfection and single cell sorting poorly and are expensive to maintain in culture making it important to characterize editing outcomes early in the workflow (Figure 1B). TP53 is expressed in iPSCs making it likely that the chromatin will be open allowing access of the RNP (Liu et al., 2019b).
3.2 Preliminary Checks of Cell Model
iPSCs accumulate genetic alterations during cell culture including SNVs at G245D, R248Q and other loci in TP53 (Amps et al., 2011; Merkle et al., 2017; Liu et al., 2019b). We therefore first ensured that a genetically pure population, free of TP53 SNVs, was utilized for transfection by picking 20 clonal sublines grown from the parental iPSC line; and checked five by Sanger sequencing which were found to be free of SNVs in the region around the G245D and R248Q loci. One of these sublines was selected for transfection (Supplementary Figure S1).
3.3 Design of Components Targeting TP53 in iPSCs
We designed 6 guide RNAs (gRNAs) for CRISPR/Cas9 using online tools E-CRISP Heigwer et al., 2014, CHOPCHOP Labun et al., 2019 and the IDT Alt-R™ CRISPR HDR Design Tool (https://eu.idtdna.com/pages/tools/alt-r-crispr-hdr-design-tool) (Supplementary Figure S2B). All gRNAs were comparable in their likelihood of on- and off-target effects when assessed in silico. When tested in cells, only one gRNA generated a double strand break (DSB) on Sanger sequencing (Supplementary Figure S2C and Figure 1A) and did so at >15 nucleotides from the SNVs limiting our choice of gRNA and editing efficiency (Paquet et al., 2016; Kwart et al., 2017) (Figure 1A and Supplementary Figure S2B).
Designing PE components is more complex. PegRNA design is a major factor determining editing efficiency and several primer binding sites (PBS) and reverse transcriptase (RT) template combinations are possible when designing the pegRNA but only a fraction of these will achieve optimal efficiency (Anzalone et al., 2019). Various tools have been developed to aid design including PEGfinder (Chow et al., 2020), PE-designer (Hwang et al., 2021) and PrimeDesign (Hsu et al., 2021). For the G245D locus in TP53, 19 pegRNA designs were generated using PrimeDesign which could be combined with >10 PE gRNAs (Hsu et al., 2021) (data not shown). However, the low tolerance of iPSC to multiple transfections discouraged the use of the PE system for this study.
3.4 Optimizing the HDR Efficiency of CRISPR/Cas9 in iPSCs
HDR-enhancing modifications to the CRISPR/Cas9 protocol have been extensively investigated (Supplementary Table S1). As we could not modify the cut-to-mutation distance, we tested if modifying ssODN design or post-transfection experimental conditions would improve the rate of HDR. We performed 75 individual transfections of iPSCs (Figures 1A,B) (Supplementary Table S1) which tolerated electroporation with 40–50% viability after transfection, and compared editing outcomes in the bulk populations using MiSeq (reviewed in Sledzinski et al., 2020).
The editing efficiency was variable, ranging from 0 to 12% for the knock-in without indels (accurate HDR) (Figures 2A,B). We corroborated previous reports (Richardson et al., 2016; Liang et al., 2017; O’Brien et al., 2019) that asymmetry of the homology arms improves accurate HDR (range 1–5%, mean 2.0%, 10 samples) as did addition of phosphorothioated nucleotides (range 1–4%, mean 2.3%, 4 samples) (Papaioannou et al., 2009; Gutierrez-Triana et al., 2018) and arms of equal length (range 2–3%, mean 2.5%, 2 samples) (Figure 2A). The introduction of a silent mutation of the PAM (Paquet et al., 2016; Okamoto et al., 2019), or the reverse complement (RC) of the asymmetric design (Richardson et al., 2016) (Figure 1A, Supplementary Table S1 and Supplementary Table S3) did not improve the HDR efficiency (Figure 2A), however only two experiments were performed for these conditions.
Next, using the asymmetric ssODN, we proceeded to modify post-transfection conditions by adding 1% dimethyl sulfoxide (DMSO) or the IDT Alt-R™ HDR Enhancer (Stratigopoulos et al., 2018) and culturing cells at 32°C (cold shock) (Guo et al., 2018) (Supplementary Table S1). The most effective protocol included cold shock and Alt-R™ HDR Enhancer, a finding consistent with previous studies (Skarnes et al., 2019; Di Stazio et al., 2021) (Figure 2B). We confirmed that DMSO increases HDR, making it a cost-effective alternative to commercial HDR enhancers (Stratigopoulos et al., 2018) (Figure 2B).
3.5 Isolating Cell Lines Reduces the Editing Efficiency Observed in the Bulk Population by NGS
We chose two transfected populations which showed a promising rate of accurate HDR (11 and 12%) for single cell line isolation (Figure 1B and Figure 2C). After colony picking and expansion, most colonies were repaired by NHEJ (77/100, 77%) or showed a mixed repair (13/100, 13%). The final editing efficiency was 1% despite the initially promising rate of HDR at the level of the bulk population.
Finally, quality assurance of the clonal lines was undertaken to assess the homogeneity/purity of the population. Using MiSeq we established that the clonal lines were pure populations free of off-target alterations in the ∼200 base pairs surrounding the variants. No off-target sites were predicted by the gRNA design tools.
Our data show that in silico predictions of gRNA efficiency correlated poorly with activity in our iPSC. Rates of HDR vary between transfections and can be boosted with asymmetric donors, with or without PT modifications, and HDR-enhancing modifications. The isolation of clonal cell lines from bulk populations resulted in a significant attrition of HDR efficiency.
3.6 Editing of TBXT in the U-CH1 Immortalized Cancer Cell Line
Many cell models grow as adherent cultures that tolerate single cell sorting and are relatively cheap to culture. The emphasis for these models is high throughput generation and genotyping of cell lines. U-CH1 is a cell model of the rare bone cancer, chordoma, and is associated with the G177D SNV in TBXT. We employed the U-CH1 chordoma cell line, which expresses TBXT at high levels and likely to be in euchromatin, to investigate the functional impact of the G177D SNV (Kelley et al., 2010; Pillay et al., 2012).
3.7 Design of CRISPR/Cas9 Components for the U-CH1 Chordoma Cell Line
We ensured a pure population free of SNVs in the region surrounding the G177D variant using MiSeq to avoid the time taken for single cell sorting (Supplementary Figure S4A).
Four candidate gRNAs were designed and assessed using the IDT CRISPR-Cas9 guide RNA design checker. Of the four gRNAs predicted to be effective in silico, two caused a DSB when tested in cells (Figures 3A,B and Supplementary Figure S4B–C).
We tested whether varying Cas9 (2.05 μl (17 pmol final concentration) versus 10.2 μl (404 pmol final concentration)), donor template concentration or combining gRNAs would improve the editing efficiency as measured by MiSeq (Lin et al., 2014). We established that the RNP complexed with a single gRNA, at recommended concentrations, showed the best performance (Figure 3C).
3.8 Screening of Transfected Bulk Populations Using Digital Droplet PCR
After transfection we screened the bulk population for HDR using digital droplet PCR (ddPCR) as a faster alternative to MiSeq before proceeding to single cell sorting, allowing us to maintain the cells in culture during screening as U-CH1 cells grow slowly (Figure 3B). The partitioning technology of ddPCR enables the detection of rare knock-in events at a frequency as low as 0.1% (Miyaoka et al., 2014, Miyaoka et al., 2018). Screening of 10 transfected populations by ddPCR showed an editing efficiency of 2–10%. The transfected cells tolerated single cell sorting by FACS 24–36 h after transfection (Figure 3B) then required 3–5 weeks to expand sufficiently for subculture into a “mirror” plate for genotyping. The Lucigen one-step DNA extraction protocol was utilized, allowing direct input of DNA into TaqMan™ qPCR or ddPCR for high throughput genotyping.
3.9 High Throughput Screening of Hundreds of U-CH1 Clones
We screened ∼500 transfected clonal lines and isolated seven cell lines which were free of indels: four wild type, two heterozygous knock-ins and one homozygous knock-in, giving an overall HDR efficiency of 0.6% (Figure 3D).
The relationship between HDR efficiency and cut-to-mutation distance (Paquet et al., 2016; Kwart et al., 2017) was again observed: the silent blocking mutation introduced into the ssODN was successfully edited more frequently than the G177D SNV and in a homozygous fashion, in contrast to the G177D SNV which was only edited in one allele.
Finally, we checked the isolated clones by sequencing 1,000 bp around the site of the edit by Sanger sequencing: we confirmed the presence of the introduced edit and ensured no off-target alterations were present (Supplementary Figure S6). No other off target sites were predicted by the design tools.
In summary, we show the results of screening hundreds of potential edited clones using high throughput technologies but highlight a significant attrition in efficiency between bulk populations and single cell line isolation. Varying the concentration of CRISPR/Cas9 components did not improve editing efficiency in U-CH1.
4 Discussion
When planning a genome editing experiment, the methods available to the scientist have expanded substantially over the past decade. Each system, CRISPR/Cas9, base editors and PE, offers solutions for introducing different alterations: CRISPR/Cas9 for knock-outs and knock-ins, base editors for substitutions in non-repetitive nucleotides and PE for substitutions, transversions and indels. As genome editing systems require increasingly complex components and modified systems emerge, preliminary experiments become increasingly important for informing final experimental design (Anzalone et al., 2019). We propose a flowchart that incorporates critical steps and tools utilized at each stage to edit colony-forming cells or adherent cell lines (Figure 4).
Bioinformatic platforms are indispensable for designing CRISPR/Cas9 components but the accuracy of predictions can be affected by the chromatin organization of the target locus and varies by cell model. This was confirmed by our results that show that in silico predictions of gRNAs efficiency did not show the same performance in cells. Even when experiments are repeated there was several-fold variation in the HDR efficiency, potentially related to factors such as inherited genetic variability and cell cycle phase, highlighting the importance of characterizing bulk populations before single cell line isolation.
Cut-to-mutation distance is a major factor in determining CRISPR/Cas9 HDR efficiency and zygosity of the resulting edit (Paquet et al., 2016; Kwart et al., 2017). This was confirmed in our experiments with the frequent homozygous knock-in of the silent PAM mutation, close to the cut site, compared to the heterozygous knock-in of the target locus, distant from the cut site. This may have accounted for the successful homozygous knock-in in U-CH1 where the mutation is closer to the cut site than in iPSCs. If a homozygous knock-in is required, gRNAs that cut close to the target locus are required. If these gRNAs are not effective in preliminary testing, it is worthwhile considering Cas9 nucleases with different PAM requirements or using PE. We were able to validate the beneficial effect of an asymmetric donor, HDR enhancer and cold shock, the combination of which increased the low baseline HDR efficiency several fold. Asymmetric donors are thought to influence annealing and release of strands being repaired (O'Brien et al., 2019) while HDR enhancers block NHEJ (Pinder, Maurissen, reviewed in Bischoff) to favor HDR and cold shock is thought to affect G2/M transition or persistence of the RNP (Guo, Maurissen). Tools that aid optimal design of ssODNs may be employed (O’Brien et al., 2019; Schubert et al., 2021).
Despite these optimizations boosting the HDR efficiency to a promising mean rate of 11.5% in the bulk population, a single cell line was isolated from transfected iPSCs with a similar picture in U-CH1. This may be related to the low overall editing efficiency, consistent over repeated transfections, compared to previous optimization studies. Although some studies showed impressive rates of HDR, it is important to note that they instead utilized a bulk population-based read out without isolating cell lines (Zhang et al., 2017; Skarnes et al., 2019; Di Stazio et al., 2021) or they edited the more robust HEK293 cell model (Richardson et al., 2016; Liang et al., 2017; Di Stazio et al., 2021). The significant attrition of editing efficiency is an important consideration when modest editing efficiencies are achieved.
Characterization of the factors affecting the efficiency of PE is less advanced but initial studies suggest optimizing the melting temperature of the PBS, using a dual pegRNA strategy (Lin et al., 2021) and disrupting the action of the mismatch repair pathway (Ferreira da Silva et al., 2022). Analysis of PE outcomes using MiSeq data is a newer concept and one tool is available, PE-Analyzer, with more likely to follow (Hwang et al., 2021). Sanger sequencing represents a cheap alternative for analyzing the composition of editing outcomes in bulk populations. Tools include Synthego’s ICE (Inference of CRISPR Edits) (Hsiau et al., 2019) and Tracking Indels by Decomposition (TIDE) (Brinkman et al., 2014). Sanger Sequencing tools designed for CRISPR/Cas9 have been applied to PE experiments highlighting their versatility (Ferreira da Silva et al., 2022).
Given the challenges of editing the loci described in our work, a more complex system such as PE might be worth exploring in future studies. PE could be compared to CRISPR/Cas9 using the methods described to determine whether the trade-off between the relative simplicity of CRISPR/Cas9 is balanced by increased efficiency. Compared to the extensive characterization of CRISPR/Cas9, the optimization of PE is a growing field. A significant advance is the manipulation of mismatch repair pathways in PE (Chen et al., 2021; Ferreira da Silva et al., 2022). At present pegRNA design is the major determinant of PE efficiency (Anzalone et al., 2019). Finding the most efficiency combination of PBS and RT designs is an important preliminary step.
For those who will attempt HDR by CRISPR/Cas9 based on its relative simplicity and the robustness of the CRISPR/Cas9 recombinant components compared to PE, if an acceptable efficiency is attained after clonal isolation, the modifications we have validated may be employed to boost efficiency.
5 Conclusion
For SNVs for which a cut site can be generated at < 15 nucleotides, it may be simpler and faster to use CRISPR/Cas9 with protocol modifications to achieve HDR. For more challenging loci, such as those presented in this study, PE could be considered but would require more extensive optimization. For all genome editing systems the efficiency varies with cell model and target locus amongst other factors. Preliminary testing will inform the choice of system and protocol modifications. We propose a flowchart which could be used to guide the planning of CRISPR/Cas9 experiments to edit SNVs (Figure 4).
Acknowledgments
We acknowledge Sunniyat Rahman, Samuel Weeks, Daniel Wetterskog, Yang Li, Iben Lyskjaer and Florian Merkle for their invaluable advice in setting up our protocols.
Data Availability Statement
The datasets presented in this study can be found online at https://www.ebi.ac.uk/ena with accession number PRJEB52736.
Author Contributions
Conceptualization, IU, SA, GB, AF, and LC; methodology, IU, SA, LL, and LC; formal analysis, IU, EH, and CC; investigation, IU, LL, and LC; writing—original draft preparation, IU and LC; writing—review and editing, LL, SA, EH, CC, AF, and LC; visualization, IU and LC; supervision, LC and AF; funding acquisition, IU, GB, AF, and LC. All authors have read and agreed to the published version of the manuscript.
Funding
This work was funded by the Tom Prince Trust (AF), The Pathological Society of Great Britain and Ireland (grant number TSGS041903) (IU and AF), Cancer Research UK (University College London (CRUK UCL) Centre Award C416/A25145) (AF, LC, and GB), the Royal National Orthopaedic Hospital NHS Trust R&D Department, Bone Cancer Research Trust, Chordoma Foundation. AF was supported by the National Institute for Health Research, the University College London Hospitals Biomedical Research Centre, and the Cancer Research United Kingdom University College London Experimental Cancer Medicine Centre. IU was supported by Chordoma United Kingdom. SA was supported by the Hardy Keinan fellowship.
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s Note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgeed.2022.932434/full#supplementary-material
References
- Amps K., Andrews P. W., Anyfantis G., Armstrong L., Avery S., Baharvand H., et al. (2011). Screening Ethnically Diverse Human Embryonic Stem Cells Identifies a Chromosome 20 minimal Amplicon Conferring Growth Advantage. Nat. Biotechnol. 29 (12), 1132–1144. 10.1038/NBT.2051 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Anzalone A. V., Randolph P. B., Davis J. R., Sousa A. A., Koblan L. W., Levy J. M., et al. (2019). Search-and-replace Genome Editing without Double-Strand Breaks or Donor DNA. Nature 576 (7785), 149–157. 10.1038/S41586-019-1711-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bio-Rad Laboratories, Inc. (2022). Bulletin 6407 Ver B, [Online]. Available at: http://www.biorad.com/webroot/web/pdf/lsr/literature/Bulletin_6407.pdf (Accessed February 01, 2022).
- Brinkman E. K., Chen T., Amendola M., van Steensel B. (2014). Easy Quantitative Assessment of Genome Editing by Sequence Trace Decomposition. Nucleic Acids Res. 42 (22), e168. 10.1093/nar/gku936 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen P. J., Hussmann J. A., Yan J., Knipping F., Ravisankar P., Chen P.-F., et al. (2021). Enhanced Prime Editing Systems by Manipulating Cellular Determinants of Editing Outcomes. Cell. 184 (22), 5635–5652.e29. 10.1016/J.CELL.2021.09.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen R., Yang Z., Liu J., Cai X., Huo Y., Zhang Z., et al. (2022). Functional Genomic Analysis Delineates Regulatory Mechanisms of GWAS-Identified Bipolar Disorder Risk Variants. Genome Med. 14 (1), 53. 10.1186/S13073-022-01057-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chow R. D., Chen J. S., Shen J., Chen S. (2020) ‘pegFinder: A pegRNA Designer for CRISPR Prime Editing’, bioRxiv, p. 2020.05.06.081612. 10.1101/2020.05.06.081612 [DOI] [Google Scholar]
- Di Stazio M., Foschi N., Athanasakis E., Gasparini P., d’Adamo A. P. (2021). Systematic Analysis of Factors that Improve Homologous Direct Repair (HDR) Efficiency in CRISPR/Cas9 Technique. PLoS ONE 16 (3), e0247603. March 2021. 10.1371/JOURNAL.PONE.0247603 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ferreira da Silva J., Oliveira G. P., Arasa-Verge E. A., Kagiou C., Moretton A., Timelthaler G., et al. (2022). Prime Editing Efficiency and Fidelity Are Enhanced in the Absence of Mismatch Repair. Nat. Commun. 13 (1). 10.1038/S41467-022-28442-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gallagher M. D., Chen-Plotkin A. S. (2018). The Post-GWAS Era: From Association to Function. Am. J. Hum. Genet. 102 (5), 717–730. 10.1016/J.AJHG.2018.04.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garvey C. M., Spiller E., Lindsay D., Chiang C.-T., Choi N. C., Agus D. B., et al. (2016). A High-Content Image-Based Method for Quantitatively Studying Context-dependent Cell Population Dynamics. Sci. Rep. 6, 29752. 10.1038/SREP29752 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guo Q., Mintier G., Ma-Edmonds M., Storton D., Wang X., Xiao X., et al. (2018). 'Cold Shock' Increases the Frequency of Homology Directed Repair Gene Editing in Induced Pluripotent Stem Cells. Sci. Rep. 8 (1), 2080. 10.1038/S41598-018-20358-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gutierrez-Triana J. A., Tavhelidse T., Thumberger T., Thomas I., Wittbrodt B., Kellner T., et al. (2018). Efficient Single-Copy HDR by 5' Modified Long dsDNA Donors. eLife 7, 39468. 10.7554/ELIFE.39468 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heigwer F., Kerr G., Boutros M. (2014). E-CRISP: Fast CRISPR Target Site Identification. Nat. Methods 11 (2), 122–123. 10.1038/NMETH.2812 [DOI] [PubMed] [Google Scholar]
- Hsiau T., Conant D., Rossi N., Maures T., Waite K., Yang J., et al. (2019, Inference of CRISPR Edits from Sanger Trace Data), bioRxiv, p. 251082. 10.1101/251082 [DOI] [PubMed] [Google Scholar]
- Hsu J. Y., Grünewald J., Szalay R., Shih J., Anzalone A. V., Lam K. C., et al. (2021). PrimeDesign Software for Rapid and Simplified Design of Prime Editing Guide RNAs. Nat. Commun. 12 (11), 1–6. 10.1038/s41467-021-21337-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hwang G.-H., Jeong Y. K., Habib O., Hong S.-A., Lim K., Kim J.-S., et al. (2021). PE-designer and PE-Analyzer: Web-Based Design and Analysis Tools for CRISPR Prime Editing. Nucleic Acids Res. 49 (W1), W499–W504. 10.1093/NAR/GKAB319 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kelley M. J., Brüderlein S., Sommer J. B., Meltzer P. S., Li S., Osada T., et al. (2010). Molecular Characterization of Putative Chordoma Cell Lines. Sarcoma 2010, 14. No Title. 10.1155/2010/630129 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kwart D., Paquet D., Teo S., Tessier-Lavigne M. (2017). Precise and Efficient Scarless Genome Editing in Stem Cells Using CORRECT. Nat. Protoc. 12 (2), 329–354. 10.1038/NPROT.2016.171 [DOI] [PubMed] [Google Scholar]
- Labun K., Montague T. G., Krause M., Torres Cleuren Y. N., Tjeldnes H., Valen E. (2019). CHOPCHOP V3: Expanding the CRISPR Web Toolbox beyond Genome Editing. Nucleic Acids Res. 47 (W1), W171–W174. 10.1093/NAR/GKZ365 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Landrum M. J., Kattman B. L. (2018). ClinVar at Five Years: Delivering on the Promise. Hum. Mutat. 39 (11), 1623–1630. 10.1002/HUMU.23641 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liang X., Potter J., Kumar S., Ravinder N., Chesnut J. D. (2017). Enhanced CRISPR/Cas9-mediated Precise Genome Editing by Improved Design and Delivery of gRNA, Cas9 Nuclease, and Donor DNA. J. Biotechnol. 241, 136–146. 10.1016/J.JBIOTEC.2016.11.011 [DOI] [PubMed] [Google Scholar]
- Lin Q., Jin S., Zong Y., Yu H., Zhu Z., Liu G., et al. (20212021). High-efficiency Prime Editing with Optimized, Paired pegRNAs in Plants. Nat. Biotechnol. 3939 (88), 923–927. 10.1038/s41587-021-00868-w [DOI] [PubMed] [Google Scholar]
- Lin S., Staahl B. T., Alla R. K., Doudna J. A. (2014). Enhanced Homology-Directed Human Genome Engineering by Controlled Timing of CRISPR/Cas9 Delivery. eLife 3, e04766. 10.7554/ELIFE.04766 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu M., Rehman S., Tang X., Gu K., Fan Q., Chen D., et al. (2019a). Methodologies for Improving HDR Efficiency. Front. Genet. 10 (JAN), 691. 10.3389/FGENE.2018.00691 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu M., Rehman S., Tang X., Gu K., Fan Q., Chen D., et al. (2019b). Methodologies for Improving HDR Efficiency. Front. Genet. 9 (JAN). 10.3389/FGENE.2018.00691 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Madsen R. R., Semple R. K. (2019). Luminescent Peptide Tagging Enables Efficient Screening for CRISPR-Mediated Knock-In in Human Induced Pluripotent Stem Cells. Wellcome Open Res. 4, 37. 10.12688/WELLCOMEOPENRES.15119.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Maurissen T. L., Woltjen K. (2020). Synergistic Gene Editing in Human iPS Cells via Cell Cycle and DNA Repair Modulation. Nat. Commun. 11 (1), 16643. 10.1038/S41467-020-16643-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Merkle F. T., Ghosh S., Kamitaki N., Mitchell J., Avior Y., Mello C., et al. (2017). Human Pluripotent Stem Cells Recurrently Acquire and Expand Dominant Negative P53 Mutations. Nature 545 (7653), 229–233. 10.1038/NATURE22312 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyaoka Y., Chan A. H., Judge L. M., Yoo J., Huang M., Nguyen T. D., et al. (2014). Isolation of Single-Base Genome-Edited Human iPS Cells without Antibiotic Selection. Nat. Methods 11 (3), 291–293. 10.1038/NMETH.2840 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyaoka Y., Mayerl S. J., Chan A. H., Conklin B. R. (2018). Detection and Quantification of HDR and NHEJ Induced by Genome Editing at Endogenous Gene Loci Using Droplet Digital PCR. Methods Mol. Biol. 1768, 349–362. 10.1007/978-1-4939-7778-9_20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Brien A. R., Wilson L. O. W., Burgio G., Bauer D. C. (2019). Unlocking HDR-Mediated Nucleotide Editing by Identifying High-Efficiency Target Sites Using Machine Learning. Sci. Rep. 9 (1), 39142. 10.1038/S41598-019-39142-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Okamoto S., Amaishi Y., Maki I., Enoki T., Mineno J. (2019). Highly Efficient Genome Editing for Single-Base Substitutions Using Optimized ssODNs with Cas9-RNPs. Sci. Rep. 9 (1), 41121. 10.1038/S41598-019-41121-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Papaioannou I., Disterer P., Owen J. S. (2009). Use of Internally Nuclease-Protected Single-Strand DNA Oligonucleotides and Silencing of the Mismatch Repair Protein, MSH2, Enhances the Replication of Corrected Cells Following Gene Editing. J. Gene Med. 11 (3), 267–274. 10.1002/JGM.1296 [DOI] [PubMed] [Google Scholar]
- Paquet D., Kwart D., Chen A., Sproul A., Jacob S., Teo S., et al. (2016). Efficient Introduction of Specific Homozygous and Heterozygous Mutations Using CRISPR/Cas9. Nature 533 (7601), 125–129. 10.1038/NATURE17664 [DOI] [PubMed] [Google Scholar]
- Park J., Lim K., Kim J.-S., Bae S. (2017). Cas-analyzer: An Online Tool for Assessing Genome Editing Results Using NGS Data. Bioinformatics 33 (2), 286–288. 10.1093/BIOINFORMATICS/BTW561 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pillay N., Plagnol V., Tarpey P. S., Lobo S. B., Presneau N., Szuhai K., et al. (2012). A Common Single-Nucleotide Variant in T Is Strongly Associated with Chordoma. Nat. Genet. 44 (11), 1185–1187. 10.1038/NG.2419 [DOI] [PubMed] [Google Scholar]
- Rees H. A., Liu D. R. (2018). Base Editing: Precision Chemistry on the Genome and Transcriptome of Living Cells. Nat. Rev. Genet. 19 (12), 770–788. 10.1038/S41576-018-0059-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richardson C. D., Ray G. J., DeWitt M. A., Curie G. L., Corn J. E. (2016). Enhancing Homology-Directed Genome Editing by Catalytically Active and Inactive CRISPR-Cas9 Using Asymmetric Donor DNA. Nat. Biotechnol. 34 (3), 339–344. 10.1038/NBT.3481 [DOI] [PubMed] [Google Scholar]
- Scheipl S., Barnard M., Cottone L., Jorgensen M., Drewry D.H., Zuercher W.J., et al. (2016). EGFR Inhibitors Identified as a Potential Treatment for Chordoma in a Focused Compound Screen. J. Pathol. 239 (3), 320–334. 10.1002/PATH.4729 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schubert M. S., Thommandru B., Woodley J., Turk R., Yan S., Kurgan G., et al. (20212021). Optimized Design Parameters for CRISPR Cas9 and Cas12a Homology-Directed Repair. Sci. Rep. 11 (11), 1–15. 10.1038/s41598-021-98965-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skarnes W. C., Pellegrino E., McDonough J. A. (2019). Improving Homology-Directed Repair Efficiency in Human Stem Cells. Methods 164-165 (165), 18–28. 10.1016/J.YMETH.2019.06.016 [DOI] [PubMed] [Google Scholar]
- Sledzinski P., Nowaczyk M., Olejniczak M. (2020). Computational Tools and Resources Supporting CRISPR-Cas Experiments. Cells 9 (5), 1288. 10.3390/CELLS9051288 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stratigopoulos G., De Rosa M. C., LeDuc C. A., Leibel R. L., Doege C. A. (2018). DMSO Increases Efficiency of Genome Editing at Two Non-coding Loci. PLoS ONE 13 (6), e0198637. 10.1371/JOURNAL.PONE.0198637 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Untergasser A., Nijveen H., Rao X., Bisseling T., Geurts R., Leunissen J.A.M. (2007). Primer3Plus, an Enhanced Web Interface to Primer3. Nucleic Acids Res. 35 (suppl. 2), W71–W74. 10.1093/nar/gkm306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang J.-P., Li X.-L., Li G.-H., Chen W., Arakaki C., Botimer G. D., et al. (2017). Efficient Precise Knockin with a Double Cut HDR Donor after CRISPR/Cas9-mediated Double-Stranded DNA Cleavage. Genome Biol. 18 (1). 10.1186/S13059-017-1164-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao B., Chen S.-A. A., Lee J., Fraser H. B. (2022). Bacterial Retrons Enable Precise Gene Editing in Human Cells. CRISPR J. 5 (1), 31–39. 10.1089/CRISPR.2021.0065 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The datasets presented in this study can be found online at https://www.ebi.ac.uk/ena with accession number PRJEB52736.