Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Jul 3.
Published in final edited form as: Cell Stem Cell. 2014 Jul 3;15(1):27–30. doi: 10.1016/j.stem.2014.04.020

Low incidence of off-target mutations in individual CRISPR-Cas9 and TALEN targeted human stem cell clones detected by whole-genome sequencing

Adrian Veres 1,2, Bridget S Gosis 1, Qiurong Ding 1, Ryan Collins 3, Ashok Ragavendran 3, Harrison Brand 3, Serkan Erdin 3, Michael E Talkowski 3,4,5, Kiran Musunuru 1,2,5,6
PMCID: PMC4082799  NIHMSID: NIHMS590712  PMID: 24996167

SUMMARY

Genome editing has attracted wide interest for the generation of cellular models of disease using human pluripotent stem cells and other cell types. CRISPR-Cas systems and TALENs can target desired genomic sites with high efficiency in human cells, but recent publications have led to concern about the extent to which these tools may cause off-target mutagenic effects that could potentially confound disease-modeling studies. Using CRISPR-Cas9 and TALEN targeted human pluripotent stem cell clones, we performed whole-genome sequencing at high coverage to assess the degree of mutagenesis across the entire genome. In both types of clones, we found that off-target mutations attributable to the nucleases were very rare. From this analysis, we suggest that while some cell types may be at risk for off-target mutations, the incidence of such effects in human pluripotent stem cells may be sufficiently low to not be a significant concern for disease modeling and other applications.


Clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) systems and transcription activator-like effector nucleases (TALENs) are recently developed genome-editing tools that target desired genomic sites in mammalian cells (Miller et al., 2011; Hockemeyer et al., 2011; Cong et al., 2013; Mali et al., 2013b; Cho et al., 2013; Jinek et al., 2013). The most commonly employed CRISPR-Cas system, derived from Streptococcus pyogenes, uses Cas9 nuclease that complexes with a guide RNA that hybridizes a 20-nucleotide DNA sequence (protospacer) immediately preceding an NGG motif (protospacer-associated motif, or PAM), resulting in a double-strand break (DSB) three basepairs (bp) upstream of the NGG (Jinek et al., 2012). TALENs bind as a pair on sequences surrounding a genomic site, positioning a dimer of FokI nuclease domains to generate a DSB at the site. The introduction of a DSB at a specified genomic site allows for modification of the site via either non-homologous end joining (NHEJ), which typically introduces an insertion or deletion (indel), or homology-directed repair (HDR), which can be exploited to knock in a point mutation or insert a desired sequence at the site.

One important application of genome-editing technology is disease modeling (Musunuru, 2013). The ability to generate isogenic wild-type and mutant clones for phenotypic comparison would enable rigorous functional genetic studies. However, both CRISPR-Cas9 and TALENs have been demonstrated to produce off-target effects, i.e., mutagenesis at sites in the genome other than the desired on-target site (Hockemeyer et al., 2011; Mussolino et al., 2011; Fu et al., 2013; Hsu et al., 2013; Mali et al., 2013a; Pattanayak et al., 2013; Cradick et al., 2013; Cho et al., 2014). These studies have largely focused on sites with high sequence similarity to the on-target site and have documented mutagenesis rates as high as 77% for CRISPR-Cas9 and 1% for TALENs at individual off-target sites. Relatively unexplored is whether CRISPR-Cas9 or TALENs produce off-target effects at sites with low sequence similarity to the on-target site. Although the nucleases might have poor affinity and have a low probability of generating a mutation at any given single site in the genome, they might nonetheless generate a sizeable number of nonspecific mutations across the billions of basepairs of the genome in any single cell. This would significantly confound the validity of disease-modeling studies that rely upon genome-edited clones.

To date, most studies of nuclease off-target effects have been performed in aggregated pools of transformed or immortalized cultured human cells, such as HEK 293T and K562 cells, that are not well suited for disease modeling. We therefore decided to study nuclease off-target effects generated in a “real-world” application of genome editing, centered on human pluripotent stem cell (hPSC) clones being actively used for biological studies (e.g., Ding et al., 2013a).

We assessed the degree of genome-wide off-target mutagenesis in hPSC clones targeted with either CRISPR-Cas9 or TALENs. We performed whole-genome sequencing at high coverage (60× target coverage) of ten cell lines, including nine clones we had previously generated with genome editing (Ding et al., 2013a; Ding et al., 2013b) (Figure 1): the human embryonic stem cell line HUES 9; three HUES 9 clones exposed to TALENs targeting the SORT1 gene, with one clone remaining wild-type in both alleles (clone A) and two clones bearing indels in both SORT1 alleles (clones B and C); three HUES 9 clones exposed to CRISPR-Cas9 targeting the same site in the SORT1 gene, with one wild-type clone (clone D) and two clones bearing indels in both SORT1 alleles (clones E and F); and three HUES 9 clones exposed to CRISPR-Cas9 targeting the LINC00116 gene, with one wild-type clone (clone G) and two clones bearing indels in both LINC00116 alleles (clones H and I). All of the HUES 9 clones were derived from the same stock of parental HUES 9 cells. Of note, we had found the targeting efficiency of the SORT1 TALENs to be 11%, in contrast to CRISPR-Cas9 for SORT1, which was 76%; the targeting efficiency of CRISPR-Cas9 for LINC00116 was 57% (Ding et al., 2013b).

Figure 1. On-target and Off-target Mutations.

Figure 1

(A) HUES 9 clones targeted in the SORT1 gene with TALENs or CRISPR-Cas9. (B) HUES 9 clones targeted in the LINC00116 gene with CRISPR-Cas9. For TALEN targeted clones, the boxes indicate the TALEN on-target and off-target binding sequences. For CRISPR-Cas9 targeted clones, the boxes indicate the 20-bp sequence matching the protospacer and the 3-bp PAM. For the on-target sites, deletions and insertions in the two alleles of each clone are indicated. For the off-target site, the mismatches with the TALEN on-target binding sequences are indicated in bold, and the deletion in one allele of the clone is indicated.

Upon obtaining the whole-genome sequencing data, we assessed the clones for small indels, single nucleotide variants (SNVs), and structural variants (SVs), which include chromosomal inversions, rearrangements, duplications, and deletions (Supplemental Experimental Procedures). We largely focused on the identification of small indels and SVs because they comprise virtually all of the mutations introduced by NHEJ. After filtering for the small indels most likely to be true positives and to be potential off-target mutations (rather than mutations that arose in the parental cell pool) and confirmation with Sanger sequencing, we identified a total of 28 such indels across the nine experimental clones, compared against the parental HUES 9 cells as the reference. Of note, all of the previously known on-target indels (seven in total) were correctly identified by the whole-genome sequencing and filtering (Table 1 and Table S1). One of the 28 off-target indels was a frameshift in the coding sequence of ZDHHC11 (in clone I). None of the other indels lay in either the coding sequence of a gene or the expressed sequence of an annotated non-coding RNA.

Table 1.

Numbers of Unique On-target and Candidate Off-target Indels and Structural Variants (SVs), As Well As Unique Single Nucleotide Variants (SNVs), in TALEN and CRISPR-Cas9 Targeted Clones

SORT1 TALENs SORT1 CRISPR-Cas9 LINC00116
CRISPR-Cas9
clones A B C D E F G H I
on-target indels 2 2 1a 1a 1 1a
on-target SVs 1b

likely off-target indel 1
other candidate off-
target indels
2 1 2 4 4 2 3 5 4
candidate off-target SVs 1

SNVs 64 115 142 55 94 74 111 127 112

See also Figure S1 and Tables S1S3.

a

Homozygous for indels.

b

261 bp duplicated insertion.

None of the indels in CRISPR-Cas9 clones were within 100 nucleotides of a potential off-target site as predicted by sequence similarity—up to six mismatches—with the on-target site, and none lay near sequences that matched the on-target sites better than would be expected by chance (Figure S1). Moreover, none of the indels lay within 100 nucleotides of a sequence perfectly matching the last ten nucleotides of the protospacer with an adjacent PAM site [NGG as well as NAG, which has also been shown to be tolerated (Hsu et al., 2013; Pattanayak et al., 2013)]. Furthermore, we paid special attention to the indels that lay within five bases upstream of a potential PAM site (Table S1), where CRISPR-Cas9-mediated DSBs would be expected to occur. Although the majority of clones had a potential PAM site, none of the adjacent sequences matched the on-target site better than would be expected by chance (Figure S1).

One of the indels in a TALEN clone was located between two potential off-target binding sites as predicted by sequence similarity with the on-target sites—one with three mismatches, and the other with four mismatches—with the binding sites being 17 bp apart, within the optimal range for generating a DSB with TALENs of this type (Ding et al., 2013a) (Figure 1A). None of the other TALEN clone indels were optimally positioned near a pair of degenerate TALEN binding sites (up to five mismatches with the on-target site), and none lay near sequences that matched the on-target sites better than would be expected by chance (Figure S1).

None of the SVs and SNVs that passed our filtering criteria in CRISPR-Cas9 clones was within 100 nucleotides of a predicted off-target site. None of the variants in TALEN clones were optimally positioned near a pair of degenerate TALEN binding sites. We detected 894 unique SNVs across the nine clones (average of 100 per clone) compared to the parental HUES 9 cell line (Table 1). The SV analysis revealed two structural variants unique to an individual clone: a 5.5-kb deletion on chromosome 6 in clone F and a 261-bp segment of chromosome 4 inserted within the LINC00116 CRISPR-Cas9 on-target site on chromosome 2 in clone H (Table 1 and Table S2). Sanger sequencing confirmed that both alleles of the chromosome 4 region were intact in clone H, signifying a duplicated insertion into the chromosome 2 on-target site rather than a balanced translocation. We speculate that due to microhomology, the chromosome 4 region was used as a repair template for a DSB at the on-target site.

Just one of the detected variants—a TALEN clone indel—seems certain to be a nuclease-mediated off-target effect. It is probable that some if not all of the other indels/SVs reflect clonal heterogeneity within the original stock of HUES 9 cells. Previous studies have documented mutagenesis occurring during the derivation and expansion of hPSCs (Hussein et al., 2011; Gore et al., 2011; Howden et al., 2011; Yusa et al., 2011). Furthermore, each clone harbored a sizable number of unique SNVs, which would not be predicted to result from NHEJ. Nonetheless, with a maximum of just two to five confirmed events in each individual clone, our results suggest that nuclease-mediated off-target effects of CRISPR-Cas9 and TALENs do not intrinsically cause a large degree of indiscriminate, nonspecific mutagenesis across the genome.

We note the limitations of this study. Even with whole-genome sequencing at high coverage, it is likely that some variants in the clones were not detected given the limitations of short-read sequencing. The small number of sequenced clones targeted at just two loci prevents generalization to all hPSC clones targeted with any CRISPR-Cas9 or TALENs of any configuration by any methodology. Furthermore, our results are not relevant to therapeutic applications targeting up to millions of cells at a time, where rare events may have deleterious consequences.

We do note that clonal heterogeneity may represent a more serious obstacle to the generation of truly isogenic cell lines than nuclease-mediated off-target effects, since each of our clones harbored a very small number of unique indels and SVs (two to five) compared to a relatively larger number of unique SNVs (average of 100) that likely arose spontaneously in culture. This suggests that even if one had in hand a genome-editing tool with perfect specificity, targeted clones would still be likely to harbor some differences elsewhere in the genome. Rigorous studies will require whole-genome sequencing of the clones used for experiments to fully characterize their mutational profiles, or they will need to include multiple clones for each experimental condition to ensure that potential confounding by any single off-target mutation in a clone is minimized.

Supplementary Material

01
02

HIGHLIGHTS.

  • CRISPR-Cas9 and TALENs were used to genome-edit human pluripotent stem cells

  • Whole-genome sequencing of individual clones found very few off-target mutations

ACKNOWLEDGMENTS

This work was supported in part by the Harvard Presidential Scholars Fund of the Harvard Medical School MD/PhD Program (A.V.); grants R00-HL098364 (K.M.), R01-HL118744 (K.M.), R01-DK097768 (K.M.), and R00-MH095867 (M.E.T.) from the United States National Institutes of Health (NIH); the Harvard Stem Cell Institute (K.M.); and Harvard University (Q.D., K.M.). We thank Vamsee Pillalamarri, Carrie Hanscom, and the staffs of the MGH Genomics and Technology Core and NextGen Sequencing Core for technical assistance.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

REFERENCES

  1. Cho S.W, Kim, S., Kim Y, Kweon J, Kim HS, Bae S, Kim JS. Analysis of off-target effects of CRISPR/Cas-derived RNA-guided endonucleases and nickases. Genome Res. 2014;24:132–141. doi: 10.1101/gr.162339.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Cho SW, Kim S, Kim JM, Kim JS. Targeted genome engineering in human cells with the Cas9 RNA-guided endonuclease. Nat. Biotechnol. 2013;31:230–232. doi: 10.1038/nbt.2507. [DOI] [PubMed] [Google Scholar]
  3. Cong L, Ran FA, Cox D, Lin S, Barretto R, Habib N, Hsu PD, Wu X, Jiang W, Marraffini LA, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Cradick TJ, Fine EJ, Antico CJ, Bao G. CRISPR/Cas9 systems targeting β-globin and CCR5 genes have substantial off-target activity. Nucleic Acids Res. 41:9584–9592. doi: 10.1093/nar/gkt714. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Ding Q, Lee YK, Schaefer EA, Peters DT, Veres A, Kim K, Kuperwasser N, Motola DL, Meissner TB, Hendriks WT, et al. A TALEN genome-editing system for generating human stem cell-based disease models. Cell Stem Cell. 2013a;12:238–251. doi: 10.1016/j.stem.2012.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Ding Q, Regan SN, Xia Y, Oostrom LA, Cowan CA, Musunuru K. Enhanced efficiency of human pluripotent stem cell genome editing through replacing TALENs with CRISPRs. Cell Stem Cell. 2013b;12:393–394. doi: 10.1016/j.stem.2013.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Fu Y, Foden JA, Khayter C, Maeder ML, Reyon D, Joung JK, Sander JD. High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nat. Biotechnol. 2013;31:822–826. doi: 10.1038/nbt.2623. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Gore A, Li Z, Fung HL, Young JE, Agarwal S, Antosiewicz-Bourget J, Canto I, Giorgetti A, Israel MA, Kiskinis E, et al. Somatic coding mutations in human induced pluripotent stem cells. Nature. 2011;471:63–67. doi: 10.1038/nature09805. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Hockemeyer D, Wang H, Kiani S, Lai CS, Gao Q, Cassady JP, Cost GJ, Zhang L, Santiago Y, Miller JC, et al. Genetic engineering of human pluripotent cells using TALE nucleases. Nat. Biotechnol. 2011;29:731–734. doi: 10.1038/nbt.1927. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Howden SE, Gore A, Li Z, Fung HL, Nisler BS, Nie J, Chen G, McIntosh BE, Gulbranson DR, Diol NR, et al. Genetic correction and analysis of induced pluripotent stem cells from a patient with gyrate atrophy. Proc. Natl. Acad. Sci. U. S. A. 2011;108:6537–6542. doi: 10.1073/pnas.1103388108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Hsu PD, Scott DA, Weinstein JA, Ran FA, Konermann S, Agarwala V, Li Y, Fine EJ, Wu X, Shalem O, et al. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat. Biotechnol. 2013;31:827–832. doi: 10.1038/nbt.2647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Hussein SM, Batada NN, Vuoristo S, Ching RW, Autio R, Närvä E, Ng S, Sourour M, Hämäläinen R, Olsson C, et al. Copy number variation and selection during reprogramming to pluripotency. Nature. 2011;471:58–62. doi: 10.1038/nature09871. [DOI] [PubMed] [Google Scholar]
  13. Jinek M, Chylinski K, Fonfara I, Hauer M, Doudna JA, Charpentier E. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Jinek M, East A, Cheng A, Lin S, Ma E, Doudna J. RNA-programmed genome editing in human cells. Elife. 2013;2:e00471. doi: 10.7554/eLife.00471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Mali P, Aach J, Stranges PB, Esvelt KM, Moosburner M, Kosuri S, Yang L, Church GM. CAS9 transcriptional activators for target specificity screening and paired nickases for cooperative genome engineering. Nat. Biotechnol. 2013a;31:833–838. doi: 10.1038/nbt.2675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Mali P, Yang L, Esvelt KM, Aach J, Guell M, DiCarlo JE, Norville JE, Church GM. RNA-guided human genome engineering via Cas9. Science. 2013b;339:823–826. doi: 10.1126/science.1232033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Miller JC, Tan S, Qiao G, Barlow KA, Wang J, Xia DF, Meng X, Paschon DE, Leung E, Hinkley SJ, et al. A TALE nuclease architecture for efficient genome editing. Nat. Biotechnol. 2011;29:143–148. doi: 10.1038/nbt.1755. [DOI] [PubMed] [Google Scholar]
  18. Mussolino C, Morbitzer R, Lütge F, Dannemann N, Lahaye T, Cathomen T. A novel TALE nuclease scaffold enables high genome editing activity in combination with low toxicity. Nucleic Acids Res. 2011;39:9283–9293. doi: 10.1093/nar/gkr597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Musunuru K. Genome editing of human pluripotent stem cells to generate human cellular disease models. Dis. Model. Mech. 2013;6:896–904. doi: 10.1242/dmm.012054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Pattanayak V, Lin S, Guilinger JP, Ma E, Doudna JA, Liu DR. High-throughput profiling of off-target DNA cleavage reveals RNA-programmed Cas9 nuclease specificity. Nat. Biotechnol. 2013;31:839–843. doi: 10.1038/nbt.2673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Yusa K, Rashid ST, Strick-Marchand H, Varela I, Liu PQ, Paschon DE, Miranda E, Ordóñez A, Hannan NR, Rouhani FJ, et al. Targeted gene correction of α1-antitrypsin deficiency in induced pluripotent stem cells. Nature. 2011;478:391–394. doi: 10.1038/nature10424. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

01
02

RESOURCES