Dear Editor,
The CRISPR-Cas9 system has been widely applied to animal genome editing1–4. Recently, Schaefer et al. identified the presence of >1300 single-nucleotide variants (SNVs) and >100 small insertions and deletions (indels) in CRISPR-edited mice using whole-genome sequencing (WGS)5. Nevertheless, no substantial homology between the SNV and indel regions with single guide RNA (sgRNA) sequences was found, and a large number of unique SNVs and indels were also present in the FVB/NJ control mouse6, 7, raising the question as to whether or not the majority of the observed mutations should be ascribed to the naturally occurring genetic variation of the inbred mice instead.
We happened to be in the process of generating a Zkscan1 partial gene knockout (Zkscan1+/-) mouse strain using the CRISPR-Cas9 system (Materials and methods in Supplementary Information). To identify potential off-target mutations, we performed WGS and comparative analysis of the CRISPR-edited and control mice using standard DNA extraction, sequencing, and bioinformatic pipelines (Materials and methods in Supplementary Information). In light of the recently raised questions on the extent of genetic variation in the inbred animals and its potential confounding effect, we purposely sequenced three mice from the same production colony as controls.
We detected a total of 7416 SNVs and 1996 indels in all four mice (Fig. 1a), among which the CRISPR-edited mouse carried 725 unique SNVs and 57 unique indels. In comparison, the three control mice also harbored comparable amounts of unique SNVs and indels (Fig. 1a), and no significantly more variants were found in the CRISPR-edited mouse than in the controls (in fact, the highest number of unique SNVs was detected in one of the control mice). To further examine whether the CRISPR-edited animal genetically deviated from the control mice, we calculated the pair-wise discordance among the four mice by analyzing the number of SNV and indel sites at which the experimental mice have different genotypes compared with each other. We found that the pair-wise genetic discordance between the CRISPR-edited animal and each control was not higher than that between the control mice themselves (Fig. 1b; Supplementary Fig. S1 and S2). Because mutagenic processes most commonly result in heterozygous mutations, we analyzed the heterozygosity of the detected SNV and indel sites in four mice and found roughly equal levels of heterozygosity in the CRISPR-edited animal and the control mice (Supplementary Table S1).
Prior studies on the CRISPR-Cas9 system indicated that Streptococcus pyogenes Cas9 (SpCas9)-mediated DNA cleavage requires the presence of an NGG protospacer adjacent motif sequence approximately 3-bp downstream from the cleavage site8, 9. We analyzed the flanking sequences of all the detected SNVs and indels in each experimental mouse to calculate the number of SNV or indel sites located at 1–5-bp upstream from an NGG sequence (Fig. 1c; Supplementary Fig. S3), and observed roughly equal amounts of such sites in the CRISPR-edited animal and the control mice. We also analyzed the potential off-target sites bioinformatically by the CRISPR Design tool10 for the two sgRNA targets (Supplementary Table S2), with 71 potential off-target sites for sgRNA-1 and 15 for sgRNA-2 predicted (Supplementary Table S3); however, no overlaps between the potential off-target sites and SNV or indel regions were found.
The extent of off-target mutations in a CRISPR-edited animal are undoubtedly influenced by various factors such as sgRNA design and the specific genome-editing protocol used, and more studies are needed to comprehensively understand CRISPR-Cas9 off-targeting. Based on our results, we did not find enough evidence to ascribe the majority of the SNVs and indels detected in this study to the off-target effects of CRISPR-Cas9 editing per se. Instead, we found that the extensive genetic variation in inbred experimental animals could become a significant confounding factor in the analysis of CRISPR-Cas9 off-target mutations, which needs to be properly assessed by well-designed control experiments in future studies.
While this work was being peer reviewed, another article was uploaded to preprint server, reporting the trio sequencing of CRISPR-edited mice and pedigree-matched controls11, the main findings of which are consistent with those in our work.
Data availability
Sequencing data are available at SRA: BioProject PRJNA419684 (accession SRP126009). A UCSC Genome Browser Track Hub (https://genome.ucsc.edu/cgi-bin/hgHubConnect) is available for import from URL: https://de.cyverse.org/dl/d/27589272-8544-466E-8242-418BB8B9BED3/hub.txt.
Electronic supplementary material
Acknowledgements
This work was supported in part by funding from the Ministry of Science and Technology of China (2016YFC0206300, 2015CB553402, and 2016YFC0900103), the National Natural Science Foundation of China (31470532, 91543102, 31711530153, 21750005, and 21675098), the Beijing Nova Program (Z171100001117011), the Tsinghua University Initiative Scientific Research Program (20161080152), the Tsinghua University-Peking University Center for Life Sciences (CLS), and the Beijing Advanced Innovation Center for Structural Biology.
Authors' contributions
J.W. and T.F.Z. designed the study. G.W. and M.D. performed the experiments and analyzed the data. G.W., J.W., and T.F.Z. wrote the paper.
Conflict of interest
The authors declare that they have no conflict of interest.
Footnotes
These authors contributed equally: Guanqun Wang, Meijie Du.
Electronic supplementary material
Supplementary Information accompanies the paper at 10.1038/s41421-018-0025-2.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Jianbin Wang, Phone: +86-10-62785730, Email: jianbinwang@tsinghua.edu.cn.
Ting F. Zhu, Phone: +86-10-62797325, Email: tzhu@tsinghua.edu.cn
References
- 1.Mali P, et al. RNA-guided human genome engineering via Cas9. Science. 2013;339:823–826. doi: 10.1126/science.1232033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Cong L, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Jinek M, et al. RNA-programmed genome editing in human cells. eLife. 2013;2:e00471. doi: 10.7554/eLife.00471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hsu PD, Lander ES, Zhang F. Development and applications of CRISPR-Cas9 for genome engineering. Cell. 2014;157:1262–1278. doi: 10.1016/j.cell.2014.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Schaefer KA, et al. Unexpected mutations after CRISPR–Cas9 editing in vivo. Nat. Methods. 2017;14:547–548. doi: 10.1038/nmeth.4293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Kim, S.-T. et al. Questioning unexpected CRISPR off-target mutations in vivo. Preprint at https://www.biorxiv.org/content/early/2017/06/30/157925 (2017).
- 7.Wilson, C. J. et al. The experimental design and data interpretation in “Unexpected mutations after CRISPR–Cas9 editing in vivo” by Schaefer et al. are insufficient to support the conclusions drawn by the authors. Preprint at https://www.biorxiv.org/content/early/2017/07/10/153338 (2017).
- 8.Jiang F, Doudna JA. CRISPR–Cas9 structures and mechanisms. Annu. Rev. Biophys. 2017;46:505–529. doi: 10.1146/annurev-biophys-062215-010822. [DOI] [PubMed] [Google Scholar]
- 9.Ran FA, et al. Genome engineering using the CRISPR-Cas9 system. Nat. Protoc. 2013;8:2281–2308. doi: 10.1038/nprot.2013.143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hsu PD, et al. DNA targeting specificity of RNA-guided Cas9 nucleases. Nat. Biotechnol. 2013;31:827–832. doi: 10.1038/nbt.2647. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Iyer, V. et al. No unexpected CRISPR-Cas9 off-target activity revealed by trio sequencing of gene-edited mice. Preprint at https://www.biorxiv.org/content/early/2018/02/09/263129 (2018). [DOI] [PMC free article] [PubMed]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Sequencing data are available at SRA: BioProject PRJNA419684 (accession SRP126009). A UCSC Genome Browser Track Hub (https://genome.ucsc.edu/cgi-bin/hgHubConnect) is available for import from URL: https://de.cyverse.org/dl/d/27589272-8544-466E-8242-418BB8B9BED3/hub.txt.