Skip to main content
Cell Discovery logoLink to Cell Discovery
letter
. 2018 Apr 17;4:18. doi: 10.1038/s41421-018-0025-2

Genetic variation may confound analysis of CRISPR-Cas9 off-target mutations

Guanqun Wang 1,#, Meijie Du 1,#, Jianbin Wang 1,, Ting F Zhu 1,
PMCID: PMC5902475  PMID: 29675267

Dear Editor,

The CRISPR-Cas9 system has been widely applied to animal genome editing14. Recently, Schaefer et al. identified the presence of >1300 single-nucleotide variants (SNVs) and >100 small insertions and deletions (indels) in CRISPR-edited mice using whole-genome sequencing (WGS)5. Nevertheless, no substantial homology between the SNV and indel regions with single guide RNA (sgRNA) sequences was found, and a large number of unique SNVs and indels were also present in the FVB/NJ control mouse6, 7, raising the question as to whether or not the majority of the observed mutations should be ascribed to the naturally occurring genetic variation of the inbred mice instead.

We happened to be in the process of generating a Zkscan1 partial gene knockout (Zkscan1+/-) mouse strain using the CRISPR-Cas9 system (Materials and methods in Supplementary Information). To identify potential off-target mutations, we performed WGS and comparative analysis of the CRISPR-edited and control mice using standard DNA extraction, sequencing, and bioinformatic pipelines (Materials and methods in Supplementary Information). In light of the recently raised questions on the extent of genetic variation in the inbred animals and its potential confounding effect, we purposely sequenced three mice from the same production colony as controls.

We detected a total of 7416 SNVs and 1996 indels in all four mice (Fig. 1a), among which the CRISPR-edited mouse carried 725 unique SNVs and 57 unique indels. In comparison, the three control mice also harbored comparable amounts of unique SNVs and indels (Fig. 1a), and no significantly more variants were found in the CRISPR-edited mouse than in the controls (in fact, the highest number of unique SNVs was detected in one of the control mice). To further examine whether the CRISPR-edited animal genetically deviated from the control mice, we calculated the pair-wise discordance among the four mice by analyzing the number of SNV and indel sites at which the experimental mice have different genotypes compared with each other. We found that the pair-wise genetic discordance between the CRISPR-edited animal and each control was not higher than that between the control mice themselves (Fig. 1b; Supplementary Fig. S1 and S2). Because mutagenic processes most commonly result in heterozygous mutations, we analyzed the heterozygosity of the detected SNV and indel sites in four mice and found roughly equal levels of heterozygosity in the CRISPR-edited animal and the control mice (Supplementary Table S1).

Fig. 1. Genetic variants identified in the CRISPR-edited and control mice.

Fig. 1

a Venn diagrams displaying the number of SNVs (left) or indels (right) identified in the CRISPR-edited animal and control mice. B6-1, B6-2, and B6-3: C57BL/6J control mice. CRISPR-edited: Zkscan1+/− mouse. b Heat maps showing the number of SNV (green) and indel (purple) sites at which the experimental mice have different genotypes compared with each other. c Number of SNV or indel sites located at 3-bp upstream from an NGG sequence (on both strands)

Prior studies on the CRISPR-Cas9 system indicated that Streptococcus pyogenes Cas9 (SpCas9)-mediated DNA cleavage requires the presence of an NGG protospacer adjacent motif sequence approximately 3-bp downstream from the cleavage site8, 9. We analyzed the flanking sequences of all the detected SNVs and indels in each experimental mouse to calculate the number of SNV or indel sites located at 1–5-bp upstream from an NGG sequence (Fig. 1c; Supplementary Fig. S3), and observed roughly equal amounts of such sites in the CRISPR-edited animal and the control mice. We also analyzed the potential off-target sites bioinformatically by the CRISPR Design tool10 for the two sgRNA targets (Supplementary Table S2), with 71 potential off-target sites for sgRNA-1 and 15 for sgRNA-2 predicted (Supplementary Table S3); however, no overlaps between the potential off-target sites and SNV or indel regions were found.

The extent of off-target mutations in a CRISPR-edited animal are undoubtedly influenced by various factors such as sgRNA design and the specific genome-editing protocol used, and more studies are needed to comprehensively understand CRISPR-Cas9 off-targeting. Based on our results, we did not find enough evidence to ascribe the majority of the SNVs and indels detected in this study to the off-target effects of CRISPR-Cas9 editing per se. Instead, we found that the extensive genetic variation in inbred experimental animals could become a significant confounding factor in the analysis of CRISPR-Cas9 off-target mutations, which needs to be properly assessed by well-designed control experiments in future studies.

While this work was being peer reviewed, another article was uploaded to preprint server, reporting the trio sequencing of CRISPR-edited mice and pedigree-matched controls11, the main findings of which are consistent with those in our work.

Data availability

Sequencing data are available at SRA: BioProject PRJNA419684 (accession SRP126009). A UCSC Genome Browser Track Hub (https://genome.ucsc.edu/cgi-bin/hgHubConnect) is available for import from URL: https://de.cyverse.org/dl/d/27589272-8544-466E-8242-418BB8B9BED3/hub.txt.

Electronic supplementary material

Supplementary Information (489.1KB, pdf)
Dataset 1 (933.3KB, txt)
Dataset 2 (586.1KB, txt)

Acknowledgements

This work was supported in part by funding from the Ministry of Science and Technology of China (2016YFC0206300, 2015CB553402, and 2016YFC0900103), the National Natural Science Foundation of China (31470532, 91543102, 31711530153, 21750005, and 21675098), the Beijing Nova Program (Z171100001117011), the Tsinghua University Initiative Scientific Research Program (20161080152), the Tsinghua University-Peking University Center for Life Sciences (CLS), and the Beijing Advanced Innovation Center for Structural Biology.

Authors' contributions

J.W. and T.F.Z. designed the study. G.W. and M.D. performed the experiments and analyzed the data. G.W., J.W., and T.F.Z. wrote the paper.

Conflict of interest

The authors declare that they have no conflict of interest.

Footnotes

These authors contributed equally: Guanqun Wang, Meijie Du.

Electronic supplementary material

Supplementary Information accompanies the paper at 10.1038/s41421-018-0025-2.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Jianbin Wang, Phone: +86-10-62785730, Email: jianbinwang@tsinghua.edu.cn.

Ting F. Zhu, Phone: +86-10-62797325, Email: tzhu@tsinghua.edu.cn

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information (489.1KB, pdf)
Dataset 1 (933.3KB, txt)
Dataset 2 (586.1KB, txt)

Data Availability Statement

Sequencing data are available at SRA: BioProject PRJNA419684 (accession SRP126009). A UCSC Genome Browser Track Hub (https://genome.ucsc.edu/cgi-bin/hgHubConnect) is available for import from URL: https://de.cyverse.org/dl/d/27589272-8544-466E-8242-418BB8B9BED3/hub.txt.


Articles from Cell Discovery are provided here courtesy of Nature Publishing Group

RESOURCES