Skip to main content
Nature Communications logoLink to Nature Communications
. 2020 May 11;11:2325. doi: 10.1038/s41467-020-16173-0

Multiplex precise base editing in cynomolgus monkeys

Wenhui Zhang 1,2,#, Tomomi Aida 3,✉,#, Ricardo C H del Rosario 4,#, Jonathan J Wilde 3,#, Chenhui Ding 5, Xiaohui Zhang 6, Zulqurain Baloch 1, Yan Huang 1, Yu Tang 1, Duanduan Li 1, Hongyu Lu 1, Yang Zhou 3, Minqing Jiang 3, Dongdong Xu 1, Zhihao Fang 1, Zhanhong Zheng 1,2, Qunshan Huang 1, Guoping Feng 3,4,, Shihua Yang 1,2,
PMCID: PMC7214463  PMID: 32393762

Abstract

Common polygenic diseases result from compounded risk contributed by multiple genetic variants, meaning that simultaneous correction or introduction of single nucleotide variants is required for disease modeling and gene therapy. Here, we show precise, efficient, and simultaneous multiplex base editing of up to three target sites across 11 genes/loci in cynomolgus monkey embryos using CRISPR-based cytidine- and adenine-base editors. Unbiased whole genome sequencing demonstrates high specificity of base editing in monkey embryos. Our data demonstrate feasibility of multiplex base editing for polygenic disease modeling in primate zygotes.

Subject terms: Genetic engineering, CRISPR-Cas9 genome editing


Due to the polygenic nature of most diseases, simultaneous correction or introduction of single nucleotide variants is needed. Here, the authors demonstrated the feasibility of multiplex base editing for polygenes disease modeling in cynomolgus monkey embryos with high specificity.

Introduction

Genome-wide association studies have identified thousands of single nucleotide variations (SNVs) associated with increased risk for various common human diseases. However, because each SNV only contributes a small amount to disease risk, common diseases are thought to be caused by accumulated risk from multiple SNVs1. To model complex common diseases in animals, thus, it is necessary to develop a method for simultaneous and precise installation of multiple SNVs by genome editing. The CRISPR (clustered regularly interspaced short palindromic repeats)/Cas9 (CRISPR-associated protein 9) system has enabled single nucleotide corrections in both somatic cells and zygotes in various species including humans2,3. However, it is still difficult to control the output of the editing because DNA double-strand breaks (DSBs) induced by CRISPR are primarily repaired through non-homologous end-joining (NHEJ), which can lead to the formation of small insertions or deletions (indels), rather than precise base correction by homologous recombination (HR) in the presence of a donor DNA template. As such, low HR frequency makes multiplexed editing very difficult. As an alternative to DSB-mediated HR for gene editing, targeted base editing by cytidine deaminase or adenosine deaminase Cas9 fusion proteins (using Cas9 nickase or catalytically-dead Cas9), which convert cytosine (C) to thymine (T) or adenine (A) to guanine (G) without DSB induction, has recently been developed46. Thus, base editing has a great potential for precise therapeutic gene correction and disease modeling710.

Cynomolgus monkey (Macaca fascicularis) is the animal model best suited for biomedical research that requires a model organism with a close relationship to humans in both genetic and physiological aspects11,12. Thus, targeted precise genome editing in cynomolgus monkey is critically important not only for human disease modeling, but also for the development of gene therapies. However, the long cynomolgus reproduction cycle (5–6 years)13 makes it unfeasible to produce polygenic monkey models carrying multiple SNVs by crossing different single SNV mutant monkeys (>10 years would be required). To overcome this limitation, efficient multiplex genome editing is required to generate disease models in F0 animals. Although CRISPR-Cas9 has been shown to efficiently work in cynomolgus monkey zygotes for knockout monkey production12,14,15, it is still unknown whether base editing works in cynomolgus monkey zygotes for targeted base conversions. Here, using cynomolgus monkey zygotes, we demonstrate the feasibility of precise, efficient, and multiplexed base editing and identify editing strategies best suited for the production of F0 animals with polygenic traits.

Results

Single C-to-T base editing in monkey embryos

First, we tested whether targeted base editing works in cynomolgus monkey embryos using BE3, a cytosine base editor (CBE) consisting of the Cas9 nickase fused to rAPOBEC1 and a uracil glycosylase inhibitor5. We chose the gene fumarylacetoacetate hydrolase (FAH), in which mutations cause tyrosinemia16, as an initial target for our studies. Currently, ~100 mutations have been found in hereditary tyrosinemia type 1 (HT1) patients16. We designed a single guide RNA (sgRNA) to introduce FAH W78X, which results in a stop codon in exon 4 of FAH (Fig. 1a) and was reported as a causative mutation in HT1 patients17. We prepared sgRNA and BE3 mRNA and injected them into the cytoplasm of cynomolgus monkey zygotes (Fig. 1b). Injected embryos were then cultured for 3–4 days and collected for genotyping. To genotype each embryo, whole-genome amplification (WGA) was performed and the target site was amplified by PCR, cloned, and analyzed by Sanger sequencing. These genotyping procedures did not introduce any mutations at the target sites in uninjected wild-type embryos (Supplementary Table 1). In exon 4 of FAH, two Cs (C5 and C6) are located within the BE3 activity window of the FAH sgRNA (core positions 4–8) and the conversion of the two Cs to Ts introduces the W78X mutation (Fig. 1a). We found that 11 out of 16 injected embryos had genomic modifications at the target site in the FAH exon 4 locus (Table 1). Two embryos (#7 and #8) had targeted C-to-T (G-to-A) conversions with 100% allele frequency at both C5 and C6 within sequenced clones (Fig. 1c, h, Supplementary Fig. 1a). An additional eight embryos had targeted C-to-T (G-to-A) conversions with allele frequencies that ranged between 12.1% and 100% for at least one C (Fig. 1c, Supplementary Fig. 1a). These embryos were mosaic with wild-type and/or non-C-to-T conversions at each C (Fig. 1c, e, Supplementary Fig. 1a). Indels were also found in 3 out of 11 edited embryos (Fig. 1c, g, Supplementary Fig. 1a). One embryo (#10) had C-to-A (G-to-T) conversion with 100% allele frequency only at C5 (Fig. 1c, e, Supplementary Fig. 1a). As a result, 10 out of 11 edited embryos had W78X stop mutations with allele frequencies ranging between 12.1% and 100% by targeted C-to-T conversion (Supplementary Fig. 1a). Because BE3 often induces non-C-to-T conversions and indels (Fig. 1c, e, g, Table 1, Supplementary Fig. 1a), a BE variant with increased accuracy, BE4-Gam, was tested using the same FAH sgRNA as used with BE3 (ref. 18). Although BE4-Gam only induced C-to-T conversions, its conversion frequency was much lower than that of BE3 (Supplementary Fig. 2a–c). Furthermore, BE4-Gam induced indels at similar rates as BE3 (Supplementary Fig. 2a, d and Supplementary Table 2). Thus, we continued to use BE3 for the rest of the experiments in this study due to the requirement of high editing efficiency in cynomolgus.

Fig. 1. Single C-to-T or A-to-G base editing in monkey embryos.

Fig. 1

a Target sites of FAH exon 4 for C-to-T base editing by BE3 and APP for A-to-G base editing by ABE. Top black bars: sgRNA spacer sequences, top magenta bars: PAM sequences, bottom black bars: amino acid codons, bottom magenta bars: target amino acids to be base edited. Wt: wild-type. b Schematic representation of base editing in monkey zygotes. c, d Base editing outcomes by (c) BE3 and (d) ABE. Numbers shown on the left of the tables indicate embryo ID. In each embryo, top and bottom rows indicate intended (C-to-T or A-to-G) and unintended (non-C-to-T or non-A-to-G) conversions, respectively. e, f Efficiencies and accuracies of base editing outcomes by (e) BE3 and (f) ABE. g Indel frequencies. In eg data are represented as boxplots where the middle line is the mean (also shown as value on top of the bar) and the top and bottom lines correspond to the maximum and minimum mutant allele frequencies. Each dot represents one embryo. FAH-E4: injections were performed thrice (total n = 16); APP: injections were performed twice (total n = 9). h, i representative sequence chromatograms of cloned PCR amplicons. Numbers of base conversion-positive clones and the total clones sequenced are shown in parentheses.

Table 1.

Single C-to-T or A-to-G base editing in monkey embryos.

Editor Target sites Analyzed Edited Indel C > T or A > G 100% C > T or A > G
BE3 FAH 16 11 (68.8%) 3 (27.3%) 10 (91%) 2 (20%)
ABE APP 9 6 (66.7%) 0 (0%) 6 (100%) 0 (0%)

FAH-E4 injections were performed thrice, APP injections were performed twice.

Single A-to-G base editing in monkey embryos

Next, we tested A-to-G base editing in cynomolgus monkey embryos using adenine base editor (ABE), which consists of the Cas9 nickase fused with TadA4. As an initial target gene for A-to-G base editing, we chose amyloid precursor protein (APP), which is mutated in familial early-onset Alzheimer’s disease19. We designed an sgRNA to introduce two missense mutations into exon 2 of APP (Fig. 1a) and injected the sgRNA together with ABE7.10 mRNA (Fig. 1b). Proper A-to-G editing of the two As (A5 and A7) within the ABE7.10 activity window introduces the Q33R and I34V missense mutations into APP (Fig. 1a). We found that 6 out of 9 injected embryos had genomic modifications at the target site in the APP exon 2 locus (Table 1). These were all A-to-G conversions with allele frequencies that ranged between 20% and 88.6% for at least at one A (Fig. 1d, i, Supplementary Fig. 1b). A7 was converted to G more than A5 (six out of six embryos for A7 and three out of six embryos for A5). All of the modified embryos were incompletely edited and mosaic with the wild-type allele (Fig. 1d, Supplementary Fig. 1b). Importantly, neither non-A-to-G conversions nor indels were detected (Fig. 1d, f, g, Supplementary Fig. 1b). As a result, six out of six edited embryos had the I34V mutation and three out of six edited embryos had both Q33R and I34V mutations with allele frequencies that ranged between 20% and 75% by targeted A-to-G conversion (Supplementary Fig. 1b). These results suggest that both BE3- and ABE-mediated targeted base editing are effective and precise in cynomolgus monkey embryos.

Multiplex C-to-T or A-to-G base editing in monkey embryos

Encouraged by these results, we next tried to simultaneously edit multiple loci by co-injection of sgRNAs targeting multiple sites with BE3 or ABE in cynomolgus monkey embryos. To this end, we first designed multiple sgRNAs for double and triple editing of FAH targets with BE3 (Fig. 2a). Then, the BE3 mRNA and two or three sgRNAs were co-injected. For double base editing at FAH exons 7 and 9, seven Cs (C4-5 and C7 in exon 7, and C4 and C6-8 in exon 9 sgRNA1) locate within the BE3 activity window of FAH exon 7 and 9 sgRNAs (core positions 4–8) (Fig. 2a). We found that 8 out of 9 injected embryos had genomic modifications at target sites in the FAH exon 7 or 9 locus (Table 2). Importantly, 5 embryos had targeted C-to-T (G-to-A) conversions at both sites with allele frequencies that ranged between 1.7% and 100% for at least one C. C7 in exon 7 was efficiently converted to T (100% in 3 embryos), but the conversions of other Cs were incomplete (Fig. 2b, Supplementary Fig. 3a). All embryos were mosaic with wild-type and/or non C-to-T conversions at each C (Fig. 2b, e, Supplementary Fig. 3a) and indels were also observed (Fig. 2b, f, Supplementary Fig. 3a). Interestingly, only C7 in exon 7 and C4 and C6 in exon 9 were efficiently converted, whereas the other flanking Cs were not converted or had fewer conversions (Fig. 2b, Supplementary Fig. 3a).

Fig. 2. Double/triple multiplex C-to-T or A-to-G base editing in monkey embryos.

Fig. 2

a Target sites of FAH exons 4, 7, 9 and 14 for double/triple C-to-T base editing by BE3, and HBB and TP53 for double A-to-G base editing by ABE. Top black bars: sgRNA spacer sequences, top magenta bars: PAM sequences, bottom black bars: amino acid codons, bottom magenta bars: target amino acids to be base edited. Wt: wild-type. bd Base editing outcomes. b Double or c triple multiplex C-to-T base editing by BE3 and d double multiplex A-to-G base editing by ABE are shown. Numbers shown in the left of the tables indicate embryo ID. Magenta indicates b, d double and c triple base-edited embryos. In each embryo, top and bottom rows indicate intended (C-to-T or A-to-G) and unintended (non-C-to-T or non-A-to-G) conversions, respectively. e Efficiencies and accuracies of base editing outcomes. f Indel frequencies. In ef data are represented as boxplots where the middle line is the mean (also shown as value on top of the bar) and the top and bottom lines correspond to the maximum and minimum mutant allele frequencies. Each dot represents one embryo. BE3-Double: injections were performed twice (total n = 9); BE3-Triple: injections were performed twice (total n = 10); ABE-Double: injections were performed twice (total n = 5). g, h Single blastomere genotyping. g Single blastomere genotyping results in each blastomere are shown by color codes. B blastomere. h Single blastomere genotyping results for each embryo (bar graphs) or blastomeres from all embryos (pie charts).

Table 2.

Double/triple multiplex C-to-T or A-to-G base editing in monkey embryos.

Editor # of target Target sites Analyzed Edited Single Double Triple 100% Single 100% Double 100% Triple
BE3 2 FAH-E7/E9-1 9 8 (88.9%) 3 (37.5%) 5 (62.5%) 2 (66.7%) 0 (0%)
BE3 3 FAH-E4/E9-2/E14 10 8 (80%) 3 (37.5%) 3 (37.5%) 2 (25%) 0 (0%) 0 (0%) 1 (50%)
ABE 2 HBB/TP53 5 5 (100%) 1 (20%) 4 (80%) 0 (0%) 0 (0%)

Single, double, or triple represents single, double, or triple C > T or A > G conversions. 100% Single, 100% Double, or 100% Triple represents 100% single, 100% double, or 100% triple C > T or A > G conversions. Injections were performed twice.

For triple base editing at FAH exons 4, 9, and 14, seven Cs (C5-6 in exon 4, C4, C6, and C8 in exon 9, and C6-7 in exon 14) locate within the BE3 activity window (core positions 4–8) (Fig. 2a). We found that 8 out of 10 injected embryos had base modifications at target sites in FAH exon 4, 9, or 14 locus (Table 2). Two embryos (#6 and #9) had targeted C-to-T (G-to-A) conversions at all three sites with allele frequencies that ranged between 14.3% and 100% for at least one C in each site (Fig. 2c, Table 2, Supplementary Fig. 3b, e). Remarkably, one embryo (#9) had 5 C-to-T conversions across 3 exons with 100% frequencies within sequenced clones, although the other flanking Cs were not converted or had fewer conversions (Fig. 2c, Table 2, Supplementary Fig. 3b, e). The other 6 embryos had targeted C-to-T (G-to-A) conversions at 1 or 2 sites with allele frequencies that ranged between 8.3% and 100% for at least one C (Fig. 2c, Table 2, Supplementary Fig. 3b). These embryos were also incompletely edited and mosaic with wild-type and/or non-C-to-T conversions at each C (Fig. 2c, e, Table 2, Supplementary Fig. 3b). C4 in exon 9 sgRNA2 locates within the BE3 core activity window, but was not efficiently edited (only 1 in 10 embryos) (Fig. 2c, Supplementary Fig. 3b). Also, Cs located outside of the BE3 core activity window were edited in a several embryos (Fig. 2c, Supplementary Fig. 3b) and indels were also observed (Fig. 2c, f, Supplementary Fig. 3b).

Next, using ABE we designed sgRNAs to simultaneously target two genes, Hemoglobin Beta (HBB) and Tumor Protein p53 (TP53), in which mutations cause beta-thalassemia and cancer, respectively20,21 (Fig. 2a). Three As (A4 and A5 in HBB, and A7 in TP53) locate within the ABE activity window of HBB and TP53 sgRNAs (core positions 4–7) (Fig. 2a). We found that all of the 5 injected embryos only had A-to-G conversions at the target site in both the HBB and/or TP53 loci (Fig. 2d, Table 2, Supplementary Fig. 3c, f). Importantly, neither non-A-to-G conversion nor indels were detected (Fig. 2d–f, Supplementary Fig. 3c, f). Four embryos had targeted A-to-G conversions at both sites with allele frequencies that ranged between 5.9% and 40.7% for at least at one A (Fig. 2d, Table 2, Supplementary Fig. 3c, f). These target sites were incompletely converted and the embryos were mosaic with wild-type at each A (Supplementary Fig. 3c).

To investigate whether multiplex base editing occurs in individual cells, we first performed long-range PCR across FAH exons 7-9 in BE3-double injected embryos, cloned 4 kb PCR amplicons, and sequenced the clones. C-to-T conversions in both exons 7 and 9 were detected in single clones (Supplementary Fig. 4). As a second method of examining multiplex editing in single cells, we also performed single blastomere genotyping (Supplementary Fig. 5a). In BE3 double base editing at FAH exons 7 and 9, 3 out of 6 embryos had double C-to-T conversions in single blastomeres (Fig. 2g–h, Supplementary Fig. 5b–d). Embryo 1 had 2 double-edited homozygous blastomeres among 4 blastomeres; embryo 3 had 3 double-edited homozygous or heterozygous blastomeres among 6 blastomeres; and embryo 5 had 1 double-edited homozygous blastomere among 4 blastomeres (Fig. 2g, Supplementary Fig. 5b, c). Overall double-editing efficiency in a single blastomere was 19.4% (6 out of 31 blastomeres in 6 embryos) (Fig. 2h). Other blastomeres were single-edited with C-to-T conversion (19.4% at FAH exons 7 and 6.5% at FAH exon 9), indels (6.5%), and non-edited wild-type alleles (Fig. 2h). In BE3 triple base editing at FAH exons 4, 9, and 14, 2 out of 6 embryos had triple C-to-T conversions in single blastomeres (Fig. 2g–h, Supplementary Fig. 5b–d). Embryo 2 had 4 triple-edited homozygous or heterozygous blastomeres among 5 blastomeres; and embryo 5 had 2 triple-edited heterozygous blastomere among 4 blastomeres (Fig. 2g, Supplementary Fig. 5b, c). Overall, triple editing efficiency in a single blastomere was 22.2% (6 out of 27 blastomeres in 6 embryos) (Fig. 2h). Other blastomeres were single (18.5% at FAH exons 4) or double (7.4% at FAH exons 9 and 14) edited with C-to-T conversion, or edited with indel or non-C-to-T conversions (14.8%), and non-edited wild-type sequence (Fig. 2h).

In ABE double base editing at HBB and TP53, 2 out of 3 embryos had double A-to-G conversions in single blastomeres (Fig. 2g–h). Embryo 1 had 2 double-edited homozygous or heterozygous blastomeres among 4 blastomeres; and embryo 2 had 3 double-edited homozygous or heterozygous blastomeres among 5 blastomeres (Fig. 2g, Supplementary Fig. 5b, c). Overall double editing efficiency in a single blastomere was 38.5% (5 out of 13 blastomeres in 3 embryos) (Fig. 2h). Other blastomeres were single edited with A-to-G conversion (7.7% at HBB and 23.1% at TP53), and non-edited wild-type allele (Fig. 2h). These results indicate the feasibility of BE3- or ABE-mediated multiple targeted base editing at up to three target genes/sites in a single blastomere of cynomolgus monkey embryos at the same time, but also highlight the necessity of further improvements of efficiency and accuracy of base editing.

Multiplex C-to-T and A-to-G base editing in monkey embryos

Finally, we further tried to improve the accuracy and efficiency of multiplex base editing, especially CBE, by combining Streptococcus pyogenes Cas9 (SpCas9)-based ABE and Staphylococcus aureus Cas9 (SaCas9)-based SaKKH-BE3 (ref. 22) to avoid sgRNA crosstalk between ABE and BE3. For triple C-to-T and G-to-A base editing, we designed two SaKKH-BE3-sgRNAs for Empty Spiracles Homeobox 1 (EMX1) and FA complementation group F (FANCF) and an SpABE-sgRNA for BRCA1. The three genes are associated with Kallmann Syndrome23, Fanconi anemia24, and cancers25, respectively (Fig. 3a). Six Cs (C7-8 in EMX1 and C4 and C6-8 and C12 in FANCF) locate within SaKKH-BE3 activity window of EMX1 and FANCF sgRNAs (core positions 3-12) for double C-to-T base editing and an A5 in BRCA1 locates within the ABE activity window of the BRCA1 sgRNA (core positions 4–7) (Fig. 3a). SaKKH-BE3 and SpABE mRNAs, along with the three sgRNAs were co-injected and embryos were genotyped following the previously described methods. Surprisingly, we found that all 8 of the injected embryos had genomic modifications at target sites in EMX1 and FANCF and 5 out of 8 embryos at a target site in BRCA1 (Fig. 3b, Table 3, Supplementary Fig. 6). All embryos had SaKKH-BE3-mediated C-to-T conversions at target sites in EMX1 and FANCF (Fig. 3b, Table 3, Supplementary Fig. 6) and, remarkably, the C-to-T conversion frequencies by ranged from 47% to 100%, with 6 out of 8 embryos showing greater than 60% conversion across all 6 Cs. Additionally, all 6 Cs within the target window were converted in all embryos (Fig. 3b, Supplementary Fig. 6). Importantly, no indels were detected and we only observed non-C-to-T conversion at a low level (18%) at a single base in one embryo (Fig. 3c–e, Supplementary Fig. 6). At the BRCA1 SpABE target site, five out of eight embryos showed some degree of A-to-G conversion (Fig. 3b, Table 3, Supplementary Fig. 6). The A-to-G conversion frequency at BRCA1 in individual embryos ranged from 10% to 86% (Fig. 3b, Supplementary Fig. 6) and neither non-A-to-G conversion nor indels were detected (Fig. 3b, d, e, Supplementary Fig. 6). Importantly, the 5 embryos with A-to-G conversions at the target site of BRCA1 also had high levels of C-to-T conversion at target sites in EMX1 and FANCF, demonstrating feasible and efficient multiplex C-to-T and A-to-G base editing (Fig. 3b, Table 3, Supplementary Fig. 6).

Fig. 3. Triple multiplex C-to-T and A-to-G base editing in monkey embryos.

Fig. 3

a Target sites of EMX1, FANCF and BRCA1 for triple multiplex C-to-T and A-to-G base editing by SaKKH-BE3 and ABE. Top black bars: sgRNA spacer sequences, top magenta bars: PAM sequences, bottom black bars: amino acid codons, bottom magenta bars: target amino acids to be base edited. Wt: wild-type. b Triple C-to-T and A-to-G base editing outcomes by SaKKH-BE3 and ABE. Numbers shown to the right of the tables indicate embryo ID. Magenta indicates triple base-edited embryos. In each embryo, top and bottom rows indicate intended (C-to-T or A-to-G) and unintended (non-C-to-T or non-A-to-G) conversions, respectively. c, d Efficiencies and accuracies of base editing by outcomes by (c) SaKKH-BE3 and (d) ABE. Each dot represents one embryo. e Indel frequencies. Each dot represents one embryo. In c–e data are represented as boxplots where the middle line is the mean (also shown as value on top of the bar) and the top and bottom lines correspond to the maximum and minimum mutant allele frequencies. Each dot represents one embryo. Injections were performed twice (total n = 8). f, g Single blastomere genotyping. f Single blastomere genotyping results in each blastomere are shown by color codes. B blastomere. g Single blastomere genotyping results for each embryo (bar graphs) or blastomeres of all embryos (pie charts).

Table 3.

Triple multiplex C-to-T and A-to-G base editing in monkey embryos.

Editor # of target Target sites Analyzed Edited Single Double Triple 100% Single 100% Double 100% Triple
SaKKH-BE3 ABE 3 EMX1/FANCF BRCA1 8 8 (100%) 0 (0%) 3 (37.5%) 5 (62.5%) 0 (0%) 2 (66.7%) 0 (0%)

Single, double, or triple represents single, double, or triple C > T and/or A > G conversions. 100% Single, 100% Double, or 100% Triple represents 100% single, 100% double, and/or 100% triple C > T or A > G conversions. Injections were performed twice.

Next, we performed single blastomere genotyping to investigate whether multiplex base editing occurred in single cells. With respect to SaKKH-BE3 double base editing at EMX1 and FANCF, all embryos had double C-to-T conversions in single blastomeres (Fig. 3f–g, Supplementary Fig. 7). Furthermore, all blastomeres except for 2 blastomeres in embryo 6 had double homozygous (29/37) or heterozygous (8/37) C-to-T conversions (Fig. 3f, Supplementary Fig. 7). Remarkably, embryo 7 was homozygous non-mosaic embryo (Fig. 3f, Supplementary Fig. 7). In line with our results from bulk genotyped embryos, no blastomeres were edited with non-C-to-T conversions or indels (Supplementary Fig. 7). Overall, double C-to-T editing efficiency in a single blastomere reached 94.9% (37 out of 39 blastomeres in 7 embryos) (Fig. 3g) with all other blastomeres genotyping as unedited wild-type (Fig. 3f–g, Supplementary Fig. 7). At the BRCA1 ABE locus, three out of seven embryos had A-to-G conversions in single blastomeres (Fig. 3f–g, Supplementary Fig. 7). Embryo 1 had 3 homozygous or heterozygous edited blastomeres among 6 blastomeres; embryo 3 had 2 heterozygous edited blastomeres among four blastomeres; and embryo 5 had 4 homozygous or heterozygous edited blastomeres among six blastomeres (Fig. 3f, Supplementary Fig. 7). Total A-to-G editing efficiency in a single blastomere was 23.1% (9 out of 39 blastomeres in 7 embryos) (Fig. 3g) with all other blastomeres genotyping as wild-type (Fig. 3f–g, Supplementary Fig. 7). Importantly, 3 out of 7 embryos had triple C-to-G and A-to-G conversions in single blastomeres (Fig. 3f–g, Supplementary Fig. 7). Embryo 1 had 3 homozygous or heterozygous triple edited blastomeres among 6 blastomeres; embryo 3 had 2 heterozygous triple edited blastomeres among 4 blastomeres; and embryo 5 had 4 homozygous or heterozygous edited blastomeres among 6 blastomeres (Fig. 3f, Supplementary Fig. 7). Overall, triple C-to-T and A-to-G editing efficiency in a single blastomere was 23.1% (9 out of 39 blastomeres in 7 embryos) (Fig. 3g). No blastomeres were edited with non-C-to-T or non-A-to-G conversions or indels (Fig. 3f–g, Supplementary Fig. 7). Again, we observed mosaicism mainly due to less efficient editing of BRCA1 by SpABE, whereas SaKKH-BE3 edited its targets with efficiency approaching 100% (Fig. 3f–g, Supplementary Fig. 7). These results indicate that SaKKH-BE3- and SpABE-mediated multiplexed base editing works efficiently and precisely at up to three target genes in a single blastomere of cynomolgus monkey embryos.

Off-target analyses

To characterize the genome-wide consequences of targeted base editing in cynomolgus monkey embryos, we performed both targeted and unbiased analyses to search for off-target mutations. First, we picked the top 5 off-target candidate sites for each sgRNA as predicted by Benchling and BLAST search (Supplementary Data 1), PCR amplified them, and analyzed the loci with the T7E1 assay (Supplementary Data 2). At all the analyzed candidate sites, we found no off-target modification (Supplementary Fig. 8 and Supplementary Table 3). Next, we generated an FAH mutant fetus by BE3 (Fig. 4a, Supplementary Table 4 and 5). We injected sgRNA and BE3 mRNA into the cytoplasm of cynomolgus monkey zygotes as described in Fig. 1, then cultured the embryos before transfer into surrogate mothers (Supplementary Tables 4 and 5). Among 67 injected embryos with pronuclei, 56 embryos developed to between the 8-cell and blastocyst stages and were transferred into 5 surrogate mothers, resulting in 3 pregnant monkeys with 3 fetuses. We collected amniotic fluid for fetal genotyping and identified one fetus as an FAH W78Y mutant caused by both C-to-T and non-C-to-T conversions (Supplementary Table 5). Unfortunately, all pregnancies were miscarried after amniotic fluid collection, so we collected fetal tissue and analyzed eight organs by deep sequencing. This confirmed the FAH W78Y mutation in one fetus with ~50% allele frequency across organs (Fig. 4a, b). Whole-genome sequencing of the mutant fetus and its parents (30-40x Illumina paired-end sequencing) was performed, followed by analysis with the GATK SNP calling pipeline and the GATK Genotype Refinement pipeline to detect de novo variants in the mutant genome (see Methods) (Fig. 4c). From this analysis, we identified 66 de novo SNVs and 5 de novo indels (Fig. 4d, Supplementary Fig. 9, Supplementary Data 3), but none of these 71 de novo variants overlapped with any of the predicted potential off-target sites of the FAH sgRNA (Fig. 4e, Supplementary Data 46). Two recent papers showed that the cytidine deaminase domain of BE3 can induce random C-to-T (or G-to-A) conversion throughout the genome in mice and rice in a guide RNA-independent manner26,27. To analyze this possibility in our results, we used the cynomolgus macaque SNPs from dbSNP/EVA to estimate the expected fraction of SNPs for each of the 9 possible base substitutions (Supplementary Table 6). We estimated the number of off-target C-to-T (G-to-A) as the observed C-to-T (G-to-A) de novo counts minus expected (total de novo SNVs times fraction of C-to-T (G-to-A) in dbSNP/EVA), and obtained 2.3 and 1.3, for C-to-T and G-to-A, respectively. Thus, the low number of de novo SNVs and indels in our mutant, together with our low estimate of de novo BE3 off-targets (3.6 SNVs) suggest that we could not detect clear guide RNA-independent C-to-T (or G-to-A) conversion by BE3 beyond the range of germline de novo SNVs per generation in cynomolgus macaque. Taken together, these results suggest that base editing is highly specific in cynomolgus monkey embryos, although due to the limitation of the experimental design, we could not distinguish BE3-induced off-target mutation from naturally occurring de novo mutation.

Fig. 4. Off-target analysis.

Fig. 4

a FAH on-target genotyping in liver of FAH mutant fetus. Bulk Sanger sequence chromatogram is shown. Target site is underlined. Numbers of base conversion-positive clones and the total clones sequenced are shown in parentheses. Wt wild-type, Fetus: FAH mutant fetus. b Deep sequencing analysis. Deep sequencing data from one father (blood), one mother (blood), and one fetus (heart, liver, lung, brain, intestine, tail, placenta, and muscle) are shown. Sequencing read numbers are as follows: Father: n = 154,718; mother: n = 122,083; heart: n = 67,112; liver n = 52,976; lung: n = 51,608; brain: n = 48,480; intestine: n = 46,350; tail: n = 48,008; placenta: n = 69,593 and muscle: n = 68,145. c Analytical pipeline of trio-based whole-genome sequencing. d Summary of trio-based whole-genome sequencing. e Venn diagram of in silico predicted potential FAH-sgRNA off-target sites and de novo mutations.

Discussion

Because it does not require the induction of double-strand breaks, base editing has great potential for precise therapeutic gene correction and disease modeling. Base editors are continuously being improved and numerous variants with enhanced efficiency, narrower target windows, and broader PAM recognition have already been produced22,2831. Nonetheless, base editors are now well-tested for use in non-human primate embryos, which are extremely valuable for disease modeling and therapeutic development. In this study, we directly tested 4 base editors at 11 genes/loci in cynomolgus monkey embryos. We obtained the best results from SaKKH-BE3 for C-to-T conversions and ABE7.10 for A-to-G conversions. Although we only tested SaKKH-BE3 in two genes, this BE3 variant worked extremely well compared to SpCas9-BE3 or -BE4: all the embryos had C-to-T conversions with very high frequency (near 100% C-to-T conversion in 13 out of 16 edited cases), almost 100% C-to-T conversion accuracy (only 1 out of 48 edited Cs was converted incorrectly), and no indels. Although results may vary at other genes/loci and should be tested further, we suggest SaKKH-BE3 as the first choice for C-to-T base editing in monkey embryos. Of all editors that we tested, ABE showed the highest conversion accuracy (27/27 As converted to Gs with no indels), an important feature for gene therapy since incorrect editing can result in additional, potentially worse mutations. However, in contrast to its high accuracy, the conversion efficiency of ABE7.10 was relatively low and variable between genes and embryos, suggesting a need for further improvement of adenine base editors. Nonetheless, the work presented here demonstrates the advantages of ABEs, as editing at the TP53 locus by ABE outperformed a traditional HDR approach that we employed in a previous study in both efficiency and accuracy15.

In BE3 and ABE, mammalian codon optimization of Cas9 and removal of potential poly A signal sites within Cas9 were recently reported to drastically increase conversion efficiency28,29. Because we used mRNA for injection, the same strategy may also improve A-to-G conversion efficiency by ABE in monkey embryos. Our success of highly efficient and precise C-to-T and A-to-G base editing by using BE3 variant and ABE enables introduction and potential restoration of a significant number of human pathogenic SNVs in cynomolgus monkey embryos and can accelerate monkey model production for human diseases.

To overcome the limitations imposed by the long reproductive cycle of cynomolgus monkeys, we successfully showed the feasibility of multiplexed base editing with BEs and/or ABEs in cynomolgus monkey embryos. We observed multiplex base editing in up to three sites not only in whole embryos and, most importantly, in single embryo cells (blastomeres). By blastomere genotyping, we also found significant mosaicism in all the multi-edited embryos, which could be due to the use of mRNA instead of ribonucleoprotein complexes (RNPs) and the late timing of injection. It is reported that the direct injection of CRISPR as an RNP3234, which acts fast and efficiently on the target site, can eliminate mosaicism when paired with injection into MII oocytes35. Alternatively, injection of base editor mRNA and sgRNA into two- or four-cell stage human embryos much improved conversion efficiency, and thus reduced mosaicism, making one third of injected human embryos (5 out of 15) non-mosaic 100% base edited36. Taken together, production and direct delivery of recombinant base editor protein7 would be a major step forward toward non-mosaic embryos.

Obviously, obtaining non-mosaic founder monkeys is the ultimate goal of genome editing in non-human primates. However, in many cases mosaic founder monkeys are still useful for initial phenotypic analysis before production of second-generation non-mosaic monkeys, as demonstrated by recent studies of founder mutant monkeys with MECP2 (ref. 37) and SHANK3 (ref. 12) mutations. We recently generated SHANK3 knockout monkeys including two non-mosaic and three mosaic founders12. Importantly, we observed strong autism-related phenotypes in all founder monkeys, including heterozygous mosaics. These studies strongly suggest that mosaic founder monkeys generated by base editors could be useful for preliminary phenotypic analyses, given that in our current study, SaKKH-BE3 and ABE in a multiplexed condition yielded averages of 80–95% and 50–80% edited allele frequencies (within edited embryos), respectively. Nonetheless, it is critical to improve base editing with a goal of efficiently generating non-mosaic, cleanly edited founder monkeys as this will significantly reduce the time and cost of non-human primate studies of disease and facilitate the analysis of polygenic phenotypes.

Trio-based whole-genome sequencing analysis identified only 71 fetus-specific de novo variants, comprised of 66 SNVs and 5 indels. The number of de novo variants observed is in the range of naturally occurring germline de novo SNVs per generation in cynomolgus macaque embryos. The lack of overlap between these de novo variants and the predicted FAH sgRNA off-target sites suggests that base editors do not induce guide RNA-dependent off-target mutations, consistent with previous whole-genome sequencing studies in mouse38 and human36 embryos edited with CBEs. The use of SaKKH-Cas9, which recognizes a longer PAM and thus reduces guide RNA-dependent off-target mutations, will further increase the utility of base editing not only by its accurate base editing capacity, but also its lower capacity for RNA-dependent off-target mutation. However, previous studies have shown that guide RNA-independent deaminase-induced random genomic mutation is a significant concern with CBE-, but not ABE-mediated base editing. These reports showed a 20-fold increase in genome-wide mutation rates (average 283 SNVs in BE3-treated cells per generation compared to average 14 SNVs in control cells derived from the same mouse embryo) by GOTI (Genome-wide Off-target analysis by Two-cell embryo Injection)26, and a 2-fold increase (average 221 SNVs in BE3-treated mice compared to average 132 SNVs in control mice) by large scale trio-based whole-genome sequencing38, mainly due to increased C-to-T and G-to-A conversions mediated by cytidine deaminase. We only detected 66 fetus-specific de novo SNVs, which is in the range of the naturally occurring spontaneous mutation rate in primates (22-78 per generation)3941. Furthermore, our estimate of off target C-to-T and G-to-A is only 3.6. Thus, in our trio-based whole-genome sequencing experimental design, we did not detect obvious guide RNA-independent random C-to-T (or G-to-A) conversion by BE3. Taken together, these results suggest that base editing is highly specific in cynomolgus monkey embryos. Since we only analyzed a base edited fetus together with its parents, further whole-genome sequencing studies with more controls, such as untreated control monkeys or endogenous controls made by 2-cell injection strategies are very important to evaluate both Cas9-dependent and deaminase-dependent off-target mutations by base editing. Additionally, although guide RNA-independent deaminase-induced random RNA off-target mutation has been reported in both CBE- and ABE-mediated base editing4244, we delivered CBE and ABE as mRNA that acts transiently, reducing the risk of severe effects from unwanted RNA editing. Future studies using improved engineered deaminases with reduced random DNA4547 and RNA4244 editing activities will only serve to further reduce these risks and improve base editing efforts.

In summary, we not only show efficient and precise targeted base editing in cynomolgus monkey zygotes by BEs and/or ABE mRNA microinjection, but also demonstrate its utility for multiplexed simultaneous targeting of up to three target genes/sites as a means of overcoming long reproduction times required for developing primate models of polygenic diseases. Further, we show high on-target specificities of BE3/ABE by genome-wide analysis. Our results demonstrate the feasibility of BE3/ABE-based strategies for the development of polygenic primate disease models and identify areas for future improvement of base editing technologies.

Methods

Animals

The animal experiments in this study were conducted at Guangdong Landao Biotechnology Co. Ltd (Landao). The Landao is an Association for Assessment and Accreditation of Laboratory Animal Care accredited facility. All animal protocols were approved in advance by the Institutional Animal Care and Use Committee of Guangdong Landao Biotechnology Co. Ltd and South China Agricultural University.

RNA preparation

The pCMV-BE3 (#73021), pCMV-ABE7.10 (#102919), saKKH-BE3 (#85170), and BE4-Gam (#100806) plasmids were obtained from Addgene. These plasmids were digested with NotI (NEB, Cat. No. R0189S) and BE3, BE4-Gam, and ABE7.10 mRNAs were synthesized using mMESSAGE mMACHINE T7 Ultra kit (Life Technologies, Cat. No. AM1345). T7-sgRNA cassettes were synthesized by PCR using pX459 plasmid (Addgene, #62988) and primers containing spacer and T7 sequences (Supplementary Table 7) with Premix Taq (Takara, Cat. No. R003Q). The sgRNAs were synthesized using the MEGAshortscript T7 Transcription Kit (Life Technologies, Cat. No. AM1354). The mRNAs and sgRNAs were purified using the MEGAclear Kit (Life Technologies, Cat, No. AM1908).

Microinjection

Cynomolgus monkey zygotes were prepared as follows48. Monkey semen was collected from a male monkey (aged 11 years and body weight 9.8 kg) by penile probe electro-ejaculation. Four female monkeys (aged 5–9 years and body weight 3.6–5.1 kg) at menses were intramuscularly administered 1.4 μg rhFSH (recombinant human follitropin alfa; GONAL-F, Merck Serono) twice per day for eight days, followed by 80 μg rhCG (recombinant human chorionic gonadotropin alfa; OVIDREL, Merck Serono) on day 9. Cumulus-oocyte complexes were collected by laparoscopic follicular aspiration 33–36 hours after rhCG administration and oocytes were stripped of cumulus cells by pipetting after brief exposure (<1 min) to hyaluronidase (0.5 mg/ml) at 37 °C. Metaphase II (MII, first polar body is present) oocytes were cultured in CMRL-1066 medium containing 0.1% Na-lactate and 10% fetal bovine serum (FBS; Hyclone Laboratories, SH30088.02) and fertilized by intracytoplasmic sperm injection (ICSI). At 10–12 hours after ICSI, a mixture of base editor mRNA (100 ng/μl) and sgRNA (50 ng/μl) were injected into the cytoplasm of using an Eppendorf’s microinjection system and a microinjector (FemtoJet 4i, Eppendorf). Injections were performed thrice for FAH-E4 and twice for the others. The zygotes were then cultured under mineral oil in hamster embryo culture medium-9 containing 10% fetal bovine serum at 37 °C in 5% CO2, 5% O2, and 90% N2. The zygotes were cultured more than 3 days to collect for genotyping or embryo transfer. For transfer, surrogate mothers with two consecutive normal menstrual cycles (26–32 days), a freshly ovulated follicle, and uterus with normal echo in uterus cavity immediately before embryo transfer were selected as surrogate mothers. Four to seven embryos at the eight-cell to blastocyst stage were transferred into one oviduct of each recipient via laparoscopy with a fixed polythene catheter. Pregnancy in surrogate mothers was diagnosed after imaging an apparent conceptus with beating heart at 25 days post-transfer.

Genotyping

Genomic DNA was amplified from embryos using the REPLI-g Single Cell Kit (QIAGEN, Cat. No. 150345). PCR amplification was performed using Ex Taq (Takara, Cat. No. RR01AM) or PrimeSTAR GXL DNA Polymerase (Takara, Cat. No. R050A). The list of genotyping primers is shown in Supplementary Table 8. The PCR amplicons were gel purified and sequenced. Sanger sequence results were analyzed using SnapGene statistics were performed using Graphpad Prism 7. To analyze mutation frequencies, PCR products were subcloned using the pMD18-T Vector Cloning Kit (Takara, Cat. No. 6011) or Mighty TA-cloning Reagent Set for PrimeSTAR (Takara, Cat. No. 6019), and transformed into E. coli. Plasmids were purified and analyzed by Sanger sequencing. For blastomere genotyping, blastomeres were isolated from 4 to 8 cell stage embryos and genomic DNA was amplified and analyzed using the same methods employed for embryo genotyping. For fetus genotyping, genomic DNA was extracted from various tissues of the mutant fetus and blood of the father and mother using TIANamp Genomic DNA Kit (Tiangen, DP304), then PCR amplification of FAH target site was performed and subjected for deep sequencing at Genewiz.

T7E1 cleavage assay

Targeted DNA fragments were amplified (Primers are shown in Supplementary Data 2) with PrimerSTAR HS DNA polymerase (Takara, Cat. No. R010A) and purified via DNA Fragment Purification Kit (Takara, Cat. No. 9761). Purified PCR amplicons were heat denatured, re-annealed, and then digested with T7E1 (NEB, M0302L) for 30 min at 37 °C and separated on 2% agarose gel.

Whole-genome sequencing

Genomic DNA was extracted from the liver of the mutant fetus and blood of the father and mother using TIANamp Genomic DNA Kit (Tiangen, DP304). Two-hundred nanograms of genomic DNA was subjected to whole-genome sequencing at a sequencing depth of 30–40× using an Illumina HiSeq 2000 platform at Novogene Bioinformatics Institute, Beijing, China. Reads were aligned to the reference genome using bwa-mem (bwa version 0.7.15-r1140; Macaca fascicularis genome assembly GCF_000364345.1) and duplicates were marked using PicardTools (version 1.111(1909)). To detect variant sites, the bam files from the mother, father and fetus were provided as input to GATK Haplotype Caller (GATK version gatk-4.1.3.0). Only autosomal chromosomes were included in the analysis. The raw SNV calls were filtered using the following thresholds: QD < 2.0 or FS > 60.0 or MQ < 40 or MQRankSum < −12.5 or ReadPosRankSum < −8.0 or QUAL < 100. For indels, the following filters were used: QD < 2.0 or FS > 200 or ReadPosRankSum < −20 or QUAL < 100. Except for the quality filter, the filter values were recommended by GATK Best Practices for non-model genome assemblies. We used a more stringent quality filter (QUAL < 100) since not all steps in the GATK Best Practices could be followed due to incomplete resources (e.g.-known high-quality SNP and indel sites). Regions with a high density of variant sites could be due to mapping issues, hence we filtered out sites that have at least 5 variants within a 100 bp window. We obtained 19,131,754 SNVs and 3,131,601 indels. To identify de novo variants, we followed the GATK Genotype Refinement workflow. First, we ran GATK’s CalculateGenotypePosteriors where a PED file of the trio was provided and the option “–skip-population-priors” was used. Second, GATK’s VariantFiltration was run to mark individuals with GQ < 30, and finally, GATK’s VariantAnnotator was run with the option “-A PossibleDenovo”. The Genotype Refinement workflow yielded 1031 High Confidence de novo SNVs and 165 High Confidence de novo indels. We performed another set of filtering to obtain the final set of de novo variants. First, we ran samtools mpileup to count the raw number of reads at each candidate de novo site. Each site must have at least 20 reads for the mother, father, and fetus (DP < 20), if parents are homozygous ref then the alternate allele must not be found in the parents, if parents are homozygous alt then the reference allele must not be found in the parents, and the allele balance of the fetus must be between 0.3 and 0.7. This yielded our final number of 66 de novo SNVs and 5 de novo indels. These were visualized by IGV (version 2.6.1). We computationally analyzed the overlap between these de novo mutations and in silico predicted potential off-target sites predicted by Benchling (https://benchling.com), CRISPOR49, and Cas-OFFinder50, and then manually confirmed. To calculate the expected number of de novo SNVs, we downloaded the Cynomolgus macaque variant sites from dbSNP which are now released via the European Variation Archive (EVA, https://www.ebi.ac.uk/eva). For each of the possible 9 base substitutions, we counted the number of dbSNP/EVA sites, divided the count by the total number of dbSNP/EVA sites, and multiplied the fraction with 66 to get the expected number of de novo SNVs (Supplementary Data 3).

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Supplementary information

Dataset 1 (14.7KB, xlsx)
Dataset 2 (11.6KB, xlsx)
Dataset 3 (11.8KB, xlsx)
Dataset 4 (12.6KB, xlsx)
Dataset 5 (51.4KB, xlsx)
Dataset 6 (84.8KB, xlsx)
Peer Review File (199.6KB, pdf)
Reporting Summary (199.6KB, pdf)
41467_2020_16173_MOESM10_ESM.pdf (5.7KB, pdf)

Description of Additional Supplementary Files

Acknowledgements

This work was supported by grants from Guangdong Special Support Program (2019BT02Y276), the National Key R&D Program of China (2018YFA0801404), the Frontier and Innovation of Key Technology Project in Science and Technology Department of Guangdong Province (2014B020225007), Program for New Century Excellent Talents in University (NCET-12-1078), National Natural Science Foundation of China (81941006 and 81771579), South China Agricultural University Ph.D. students abroad (outside) joint training project (2018LHPY009), Rett Syndrome Research Trust, and the Hock E. Tan and K. Lisa Yang Center for Autism Research. We thank Baron Koylass (European Variation Archive, EMBL-EBI) for providing the Macaca fascicularis dbSNP data, and Qian Chen (MIT), Menglong Zeng (MIT) and Dheeraj Roy (Broad Institute) for helpful discussions.

Source data

Source Data (56.7MB, xlsx)

Author contributions

S.Y., G.F., T.A., and W.Z. designed and supervised the study. W.Z., C.D., Y.H., Y.T., D.X., Z.F., D.L., and Z.Z. performed experiments. R.D.R., T.A., W.Z., and M.J. performed whole-genome sequencing analysis. S.Y., G.F., T.A., W.Z., Y.Z., J.J.W., Z.B., Q.H. and X.Z. collected and analysed data. T.A., R.D.R., J.J.W., and W.Z. wrote the paper with the input from all the authors. All of the authors read and approved the final paper.

Data availability

All deep sequencing and whole-genome sequencing data from this study has been deposited at NCBI Sequence Read Archive (SRA) under accession numbers PRJNA561611 and PRJNA505503, respectively. The source data 603 underlying Supplementary Figs. 4a and 8a–l are provided as a Source data file. Data supporting the findings of this work are available within the paper and its Supplementary Information files. The datasets generated and analyzed during the current study are available from the corresponding author upon request.

Competing interests

The authors declare no competing interests.

Footnotes

Peer review information Nature Communications thanks Afonso Silva and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Wenhui Zhang, Tomomi Aida, Ricardo C.H. del Rosario, Jonathan J. Wilde.

These authors jointly supervised this work: Tomomi Aida, Guoping Feng, Shihua Yang.

Contributor Information

Tomomi Aida, Email: aidat@mit.edu.

Guoping Feng, Email: fengg@mit.edu.

Shihua Yang, Email: yangsh@scau.edu.cn.

Supplementary information

Supplementary information is available for this paper at 10.1038/s41467-020-16173-0.

References

  • 1.Kaiser T, Feng G. Modeling psychiatric disorders for developing effective treatments. Nat. Med. 2015;21:979–988. doi: 10.1038/nm.3935. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Schwank G, et al. Functional repair of CFTR by CRISPR/Cas9 in intestinal stem cell organoids of cystic fibrosis patients. Cell Stem Cell. 2013;13:653–658. doi: 10.1016/j.stem.2013.11.002. [DOI] [PubMed] [Google Scholar]
  • 3.Wu Y, et al. Correction of a genetic disease in mouse via use of CRISPR-Cas9. Cell Stem Cell. 2013;13:659–662. doi: 10.1016/j.stem.2013.10.016. [DOI] [PubMed] [Google Scholar]
  • 4.Gaudelli NM, et al. Programmable base editing of A*T to G*C in genomic DNA without DNA cleavage. Nature. 2017;551:464–471. doi: 10.1038/nature24644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Komor AC, Kim YB, Packer MS, Zuris JA, Liu DR. Programmable editing of a target base in genomic DNA without double-stranded DNA cleavage. Nature. 2016;533:420–424. doi: 10.1038/nature17946. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Nishida K, et al. Targeted nucleotide editing using hybrid prokaryotic and vertebrate adaptive immune systems. Science. 2016;353:aaf8729. doi: 10.1126/science.aaf8729. [DOI] [PubMed] [Google Scholar]
  • 7.Kim K, et al. Highly efficient RNA-guided base editing in mouse embryos. Nat. Biotechnol. 2017;35:435–437. doi: 10.1038/nbt.3816. [DOI] [PubMed] [Google Scholar]
  • 8.Li GL, et al. Highly efficient and precise base editing in discarded human tripronuclear embryos. Protein Cell. 2017;8:776–779. doi: 10.1007/s13238-017-0458-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Liu Z, et al. Efficient generation of mouse models of human diseases via ABE- and BE-mediated base editing. Nat. Commun. 2018;9:2338. doi: 10.1038/s41467-018-04768-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ryu SM, et al. Adenine base editing in mouse embryos and an adult mouse model of Duchenne muscular dystrophy. Nat. Biotechnol. 2018;36:536–539. doi: 10.1038/nbt.4148. [DOI] [PubMed] [Google Scholar]
  • 11.Chan AW. Progress and prospects for genetic modification of nonhuman primate models in biomedical research. ILAR J. 2013;54:211–223. doi: 10.1093/ilar/ilt035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Zhou Y, et al. Atypical behaviour and connectivity in SHANK3-mutant macaques. Nature. 2019;570:326–331. doi: 10.1038/s41586-019-1278-0. [DOI] [PubMed] [Google Scholar]
  • 13.MacDonald GJ. Reproductive patterns of three species of macaques. Fertil. Steril. 1971;22:373–377. doi: 10.1016/S0015-0282(16)38285-1. [DOI] [PubMed] [Google Scholar]
  • 14.Niu Y, et al. Generation of gene-modified cynomolgus monkey via Cas9/RNA-mediated gene targeting in one-cell embryos. Cell. 2014;156:836–843. doi: 10.1016/j.cell.2014.01.027. [DOI] [PubMed] [Google Scholar]
  • 15.Wan H, et al. One-step generation of p53 gene biallelic mutant Cynomolgus monkey via the CRISPR/Cas system. Cell Res. 2015;25:258–261. doi: 10.1038/cr.2014.158. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Morrow G, Angileri F, Tanguay RM. Molecular Aspects of the FAH Mutations Involved in HT1 Disease. Adv. Exp. Med. Biol. 2017;959:25–48. doi: 10.1007/978-3-319-55780-9_3. [DOI] [PubMed] [Google Scholar]
  • 17.Arranz JA, et al. Splicing mutations, mainly IVS6-1(G>T), account for 70% of fumarylacetoacetate hydrolase (FAH) gene alterations, including 7 novel mutations, in a survey of 29 tyrosinemia type I patients. Hum. Mutat. 2002;20:180–188. doi: 10.1002/humu.10084. [DOI] [PubMed] [Google Scholar]
  • 18.Komor AC, et al. Improved base excision repair inhibition and bacteriophage Mu Gam protein yields C:G-to-T:A base editors with higher efficiency and product purity. Sci. Adv. 2017;3:eaao4774. doi: 10.1126/sciadv.aao4774. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Goate A, et al. Segregation of a missense mutation in the amyloid precursor protein gene with familial Alzheimer’s disease. Nature. 1991;349:704–706. doi: 10.1038/349704a0. [DOI] [PubMed] [Google Scholar]
  • 20.Cao A, Galanello R. Beta-thalassemia. Genet. Med. 2010;12:61–76. doi: 10.1097/GIM.0b013e3181cd68ed. [DOI] [PubMed] [Google Scholar]
  • 21.Olivier M, Hollstein M, Hainaut P. TP53 mutations in human cancers: origins, consequences, and clinical use. Cold Spring Harb. Perspect. Biol. 2010;2:a001008. doi: 10.1101/cshperspect.a001008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Kim YB, et al. Increasing the genome-targeting scope and precision of base editing with engineered Cas9-cytidine deaminase fusions. Nat. Biotechnol. 2017;35:371–376. doi: 10.1038/nbt.3803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Kim HG, et al. WDR11, a WD protein that interacts with transcription factor EMX1, is mutated in idiopathic hypogonadotropic hypogonadism and kallmann syndrome. Am. J. Hum. Genet. 2010;87:465–479. doi: 10.1016/j.ajhg.2010.08.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.de Winter JP, et al. The Fanconi anaemia gene FANCF encodes a novel protein with homology to ROM. Nat. Genet. 2000;24:15–16. doi: 10.1038/71626. [DOI] [PubMed] [Google Scholar]
  • 25.Miki Y, et al. A strong candidate for the breast and ovarian cancer susceptibility gene BRCA1. Science. 1994;266:66–71. doi: 10.1126/science.7545954. [DOI] [PubMed] [Google Scholar]
  • 26.Zuo E, et al. Cytosine base editor generates substantial off-target single-nucleotide variants in mouse embryos. Science. 2019;364:289–292. doi: 10.1126/science.aav9973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Jin S, et al. Cytosine, but not adenine, base editors induce genome-wide off-target mutations in rice. Science. 2019;364:292–295. doi: 10.1126/science.aaw7166. [DOI] [PubMed] [Google Scholar]
  • 28.Koblan LW, et al. Improving cytidine and adenine base editors by expression optimization and ancestral reconstruction. Nat. Biotechnol. 2018;36:843–846. doi: 10.1038/nbt.4172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Zafra MP, et al. Optimized base editors enable efficient editing in cells, organoids and mice. Nat. Biotechnol. 2018;36:888–893. doi: 10.1038/nbt.4194. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Hu JH, et al. Evolved Cas9 variants with broad PAM compatibility and high DNA specificity. Nature. 2018;556:57–63. doi: 10.1038/nature26155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Nishimasu H, et al. Engineered CRISPR-Cas9 nuclease with expanded targeting space. Science. 2018;361:1259–1262. doi: 10.1126/science.aas9129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kim S, Kim D, Cho SW, Kim J, Kim JS. Highly efficient RNA-guided genome editing in human cells via delivery of purified Cas9 ribonucleoproteins. Genome Res. 2014;24:1012–1019. doi: 10.1101/gr.171322.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Aida T, et al. Cloning-free CRISPR/Cas system facilitates functional cassette knock-in in mice. Genome Biol. 2015;16:87. doi: 10.1186/s13059-015-0653-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Hashimoto M, Yamashita Y, Takemoto T. Electroporation of Cas9 protein/sgRNA into early pronuclear zygotes generates non-mosaic mutants in the mouse. Dev. Biol. 2016;418:1–9. doi: 10.1016/j.ydbio.2016.07.017. [DOI] [PubMed] [Google Scholar]
  • 35.Ma H, et al. Correction of a pathogenic gene mutation in human embryos. Nature. 2017;548:413–419. doi: 10.1038/nature23305. [DOI] [PubMed] [Google Scholar]
  • 36.Zhang ML, et al. Human cleaving embryos enable robust homozygotic nucleotide substitutions by base editors. Genome Biol. 2019;20:101. doi: 10.1186/s13059-019-1703-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Chen Y, et al. Modeling Rett syndrome using TALEN-Edited MECP2 mutant cynomolgus monkeys. Cell. 2017;169:945–955.e10. doi: 10.1016/j.cell.2017.04.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Lee HK, et al. Cytosine base editor 4 but not adenine base editor generates off-target mutations in mouse embryos. Commun. Biol. 2020;3:19. doi: 10.1038/s42003-019-0745-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kong A, et al. Rate of de novo mutations and the importance of father’s age to disease risk. Nature. 2012;488:471–475. doi: 10.1038/nature11396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Besenbacher S, et al. Direct estimation of mutations in great apes reconciles phylogenetic dating. Nat. Ecol. Evol. 2019;3:286–292. doi: 10.1038/s41559-018-0778-x. [DOI] [PubMed] [Google Scholar]
  • 41.Luo X, et al. Trio deep-sequencing does not reveal unexpected off-target and on-target mutations in Cas9-edited Rhesus Monkeys. Nat. Commun. 2019;10:5525. doi: 10.1038/s41467-019-13481-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Grunewald J, et al. Transcriptome-wide off-target RNA editing induced by CRISPR-guided DNA base editors. Nature. 2019;569:433–437. doi: 10.1038/s41586-019-1161-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Zhou CY, et al. Off-target RNA mutation induced by DNA base editing and its elimination by mutagenesis. Nature. 2019;571:275–278. doi: 10.1038/s41586-019-1314-0. [DOI] [PubMed] [Google Scholar]
  • 44.Grunewald J, et al. CRISPR DNA base editors with reduced RNA off-target and self-editing activities. Nat. Biotechnol. 2019;37:1041–1048. doi: 10.1038/s41587-019-0236-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Zuo, E. et al. High-fidelity base editor with no detectable genome-wide off-target effects. Preprint at bioRxiv10.1101/2020.02.07.939074 (2020).
  • 46.Doman, J. L., Raguram, A., Newby, G. A., Liu D. R. Evaluation and minimization of Cas9-independent off-target DNA editing by cytosine base editors. Nat. Biotechnol. 10.1038/s41587-020-0414-6 (2020). [DOI] [PMC free article] [PubMed]
  • 47.Yu, Y. et al. Next-generation cytosine base editors with minimized unguided DNA and RNA off-target events and high on-target activity. Preprint at bioRxiv 10.1101/2020.02.11.944165 (2020). [DOI] [PMC free article] [PubMed]
  • 48.Ma Y, et al. Efficient production of cynomolgus monkeys with a toolbox of enhanced assisted reproductive technologies. Sci. Rep. 2016;6:25888. doi: 10.1038/srep25888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Haeussler M, et al. Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR. Genome Biol. 2016;17:148. doi: 10.1186/s13059-016-1012-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Bae S, Park J, Kim JS. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics. 2014;30:1473–1475. doi: 10.1093/bioinformatics/btu048. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Dataset 1 (14.7KB, xlsx)
Dataset 2 (11.6KB, xlsx)
Dataset 3 (11.8KB, xlsx)
Dataset 4 (12.6KB, xlsx)
Dataset 5 (51.4KB, xlsx)
Dataset 6 (84.8KB, xlsx)
Peer Review File (199.6KB, pdf)
Reporting Summary (199.6KB, pdf)
41467_2020_16173_MOESM10_ESM.pdf (5.7KB, pdf)

Description of Additional Supplementary Files

Data Availability Statement

All deep sequencing and whole-genome sequencing data from this study has been deposited at NCBI Sequence Read Archive (SRA) under accession numbers PRJNA561611 and PRJNA505503, respectively. The source data 603 underlying Supplementary Figs. 4a and 8a–l are provided as a Source data file. Data supporting the findings of this work are available within the paper and its Supplementary Information files. The datasets generated and analyzed during the current study are available from the corresponding author upon request.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group

RESOURCES