Significance
Precise genomic correction of pathogenic mutations is an attractive therapeutic strategy. Here, we show that a unique class of stem cell-derived nonpathogenic virus, hematopoietic stem cell-derived adeno-associated virus vector (AAVHSC), mediates precise genome editing at unprecedented efficiencies in primary human cells, including postmitotic cells, and in vivo. Unlike the majority of current editing platforms, genome editing by AAVHSC is uniquely based on homologous recombination (HR), requiring no exogenous nucleases. AAVHSC-mediated editing is seamless, with no evidence of on-target insertion/deletion mutations common to nuclease-based platforms. Efficient in vivo editing was achieved by i.v. injection of AAVHSC editing vectors. The combination of efficient and precise HR-based genome editing coupled with the superior in vivo transduction properties of AAV facilitates progress toward in vivo therapeutic gene editing.
Keywords: adeno-associated virus, genome editing, homologous recombination, hematopoietic stem cells, in vivo genome editing
Abstract
The precise correction of genetic mutations at the nucleotide level is an attractive permanent therapeutic strategy for human disease. However, despite significant progress, challenges to efficient and accurate genome editing persist. Here, we report a genome editing platform based upon a class of hematopoietic stem cell (HSC)-derived clade F adeno-associated virus (AAV), which does not require prior nuclease-mediated DNA breaks and functions exclusively through BRCA2-dependent homologous recombination. Genome editing is guided by complementary homology arms and is highly accurate and seamless, with no evidence of on-target mutations, including insertion/deletions or inclusion of AAV inverted terminal repeats. Efficient genome editing was demonstrated at different loci within the human genome, including a safe harbor locus, AAVS1, and the therapeutically relevant IL2RG gene, and at the murine Rosa26 locus. HSC-derived AAV vector (AAVHSC)-mediated genome editing was robust in primary human cells, including CD34+ cells, adult liver, hepatic endothelial cells, and myocytes. Importantly, high-efficiency gene editing was achieved in vivo upon a single i.v. injection of AAVHSC editing vectors in mice. Thus, clade F AAV-mediated genome editing represents a promising, highly efficient, precise, single-component approach that enables the development of therapeutic in vivo genome editing for the treatment of a multitude of human gene-based diseases.
The ultimate goal of genetic medicine is to precisely and efficiently edit the human genome to correct somatic mutations or insert therapeutic sequences at targeted chromosomal locations. Despite significant recent progress, current genome editing technologies have yet to attain therapeutically relevant levels of efficiency and precision, particularly in vivo. The majority of current gene editing platforms utilize nuclease-mediated DNA cleavage as the first step in editing events. The efficiency and accuracy of nuclease-mediated cleavage are often variable (1), and repair of double-stranded DNA breaks by homologous recombination (HR) is low (2–11). These nuclease-induced DNA breaks are then repaired by the error-prone nonhomologous end-joining (NHEJ) pathway, which results in random nucleotide insertions/deletions (indels) at the cleavage site, potentially creating undesirable on-target mutations (12) or insertion of the entire vector genome, including the inverted terminal repeats (ITRs) (13). In contrast, DNA break repair by HR is more precise and can be engineered to accurately insert specific sequences at precise chromosomal locations, but it occurs at a lower frequency than NHEJ-mediated repair (2–11, 14–16). While nuclease-based genome editing is effective for gene disruption, the predictable correction of mutations and targeted insertion of therapeutic sequences are harder to achieve due to the combination of the low frequency of HR repair coupled with the requirement to deliver multiple components to target cells. Another significant challenge to the therapeutic translation of genome editing is that classic HR is restricted to dividing cells, thus excluding most cells in vivo, which exist in a postmitotic state (17). Nuclease-based editing platforms also carry the burden of promiscuous off-target cleavage by which double-stranded DNA breaks are created at unintended sites, resulting in the potential for genome-wide mutagenesis (18–23). Targeting of nucleases to specific genomic sites is accomplished by the use of DNA-binding domains or guide RNA, which can place constraints on the exact location of cleavage sites (23). Finally, efficient in vivo delivery of genome-editing components to target tissues remains challenging. Most platforms consist of multiple components, which are often too large to fit into a single delivery vector. To achieve genome editing in vivo, each target cell would require the delivery of multiple editing components, further reducing efficiency. Thus, while much effort has been focused upon translating genome editing to therapeutic applications, significant challenges persist. An accurate, efficient, and predictable genome editing technology based solely on HR pathways would thus represent a significant advance in the field and would allow precise targeted insertion of therapeutic sequences and gene correction without the risk of on-target mutagenesis or promiscuous off-target effects. If such an editing platform functioned efficiently in postmitotic cells in vivo and consisted of a single component amenable to in vivo use, it would further resolve the significant challenges currently facing therapeutic genome editing.
Adeno-associated virus (AAV)–based vectors have previously been shown to mediate genome editing without the requirement for exogenous nucleases (24–29). However, the frequencies of genome editing reported were very low, generally well below 0.1%. In this study, we interrogated a class of CD34+ hematopoietic stem cell-derived AAV vectors (AAVHSCs) (30), for their genome editing capacity. The AAVHSCs represent a family of natural, human, nonpathogenic, single-stranded, replication-defective AAVs that infect both dividing and nondividing cells and display broad systemic tropism in vivo (30–36). Here, we show that AAVHSCs mediate precise, efficient on-target genome editing in primary human cells and at multiple genomic locations, including therapeutically relevant genes in vitro and in vivo, without the requirement for prior nucleolytic DNA cleavage. Importantly, AAVHSC-based genome editing occurs exclusively via the HR pathway and is highly efficient in primary human cells. We further show that the precision of on-target editing is highly accurate, with no evidence of genomic scarring, indels, or incorporation of a residual viral footprint. We demonstrate a highly precise, exclusively HR-based, nuclease-free genome editing platform that functions at unparalleled efficiencies in primary cells and in vivo and is composed of a single component with built-in in vivo delivery properties.
Results
AAVHSCs Mediate Efficient Genome Editing in Human Cells in the Absence of Exogenous Nucleases.
We designed a reporter-based editing assay to measure the targeted genomic insertion of a promoterless green fluorescent protein (GFP) into the safe harbor locus, AAVS1, located on human chromosome 19 (37), such that expression would be dependent upon accurate insertion downstream of a chromosomal promoter (Fig. 1A). The editing vector genome was bounded at both ends by AAV ITRs, consisted of a promoterless GFP ORF immediately downstream of a splice acceptor (SA) and T2A (SA/T2A) sequence followed by a polyadenylation (pA) signal, and was flanked bilaterally by 800-bp homology arms (HAs) complementary to the PPP1R12C gene sequence on human chromosome 19 within the safe harbor locus AAVS1 (37). This vector, termed PPP1R12C-GFP, was designed to insert the GFP ORF into intron 1 of the PPP1R12C gene (Fig. 1A and SI Appendix, Table S1). We reasoned that accurate editing of the promoterless GFP cassette into the intron downstream of the PPP1R12C promoter would result in GFP expression driven by the endogenous PPP1R12C promoter (Fig. 1A).
Fig. 1.
Genome editing of a promoterless GFP into intron 1 of the human PPP1R12C gene. (A) Schema for genome editing assay. (Upper) Map of the PPP1R12C-GFP editing vector genome. In line 1, the insert cassette consists of a GFP ORF (green) preceded by SA/T2A (SA/2A) sequences and followed by a pA (PA) signal (cyan), and it is flanked on either side by 800-bp HAs (blue) complementary to chromosomal sequences in PPP1R12C. HAL, left HA; HAR, right HA. The entire editing construct is bounded by AAV2 ITRs (purple). In line 2, the PPP1R12C gene is depicted. Promoter (P) is shown in yellow, exons (Ex) are shown in dark green, and introns are shown in pink. Line 3 depicts a hypothetical HR schema. Line 4 depicts the edited PPP1R12C gene showing the insertion of the GFP cassette into intron 1. (B) Dose–response curves correlating genome editing with the MOI of the PPP1R12C-GFP editing vector. The editing vector genome was packaged in the capsids noted within each box. (Left) Editing of primary human CD34+ cells. (Right) Editing of the human hepatocellular carcinoma cell line HepG2. (C) Heat map showing genome editing efficiency of targeted insertion of GFP into PPP1R12C by AAV serotype (columns) in different human cell types (rows) as follows: (1) transformed cell lines, (2) primary human cells, and (3) immortalized LCLs carrying genetic mutations. Values represent specific GFP expression as assessed by flow cytometry. GFP expression was assessed only in viable cells, as determined by DAPI exclusion. Data shown are aggregates of >906 individual measurements. Cell type details are shown in SI Appendix, Fig. S1. (D) Comparison of median genome editing efficiency across AAV clade A (AAV6), clade B (AAV2), clade E (AAV8), and clade F (AAV9) (AAVHSCs). Editing efficiencies were measured by GFP expression by flow cytometry. Cells were transduced at a MOI of 1.5E5 and evaluated by flow cytometry at 48 h. Sample sizes for the experimental groups are as follows: untreated (Untd), n = 25; AAV6, n = 33; AAV2, n = 17; AAV8, n = 19; AAV9, n = 30; and AAVHSCs combined, n = 390. AAVHSC represents data compiled from AAVHSC1, AAVHSC4, AAVHSC5, AAVHSC7, AAVHSC9, AAVHSV12, AAVHSC13, AAVHSV15, AAVHSV16, and AAVHSC17. Outliers are represented by individual circles. Significance was determined by a paired two-tailed t test using AAVHSCs as the comparison reference. The vector genomes (VG) were quantitated in nuclei purified from AAV-treated CD34+ cells 48 h posttreatment. The number of VG per nucleus was determined by real-time PCR for GFP and the housekeeping gene hApoB. Values shown are averages of three replicates per transduction and three transductions with each AAV vector.
The promoterless GFP editing vector was packaged in AAVHSC5, AAVHSC7, AAVHSC17, and AAV6 capsids. A titration of the multiplicities of infection (MOIs) revealed a linear relationship between GFP expression and vector concentration for each AAV serotype tested in both primary CD34+ cells and the hepatocellular carcinoma cell line HepG2 (Fig. 1B), indicating that GFP expression correlated directly with the cell/vector ratio in a dose-dependent manner. The percentage of cells expressing GFP and the slopes of the titration curves revealed that the AAVHSCs mediated targeted insertion of GFP more efficiently than AAV6 in both cell types (Fig. 1B). All subsequent in vitro experiments described here were performed at a MOI of 1.5E5.
We next compared levels of GFP expression after insertion into intron 1 of PPP1R12C following transduction by AAVHSCs, AAV2, AAV6, AAV8, and AAV9 vectors, in over 906 individual measurements across an array of human cells (Fig. 1C and SI Appendix, Fig. S1), including primary human CD34+ HSCs, liver sections, hepatic sinusoidal endothelial cells, myoblasts, and EBV-immortalized human B lymphoblastoid cell lines (LCLs) (SI Appendix, Fig. S1). Cumulative GFP expression was higher in cells treated with clade F HSC-derived AAVs, including AAVHSCs and AAV9, up to >50% across a spectrum of human cell types (Fig. 1C and SI Appendix, Fig. S1). In contrast, cells treated with AAV2 (clade B), AAV6 (clade A), and AAV8 (clade E) displayed lower GFP expression (Fig. 1C and SI Appendix, Fig. S1), in concordance with previous reports (24–29). Importantly, evaluation of cell viability by staining with 4′,6-diamidino-2-phenylindole (DAPI) followed by flow cytometric analysis showed no difference between AAVHSC-treated cells and controls, indicating that genome editing with AAV vectors was not toxic (SI Appendix, Table S2). This was further confirmed by morphological analysis.
A comparison of the median cumulative GFP expression revealed that clade F AAVs edited at a significantly higher efficiency, with a median efficiency of 24% across all human cell types studied, compared with clade B (AAV2, 2.12%), clade A (AAV6, 0.43%), and clade E (AAV8, 1.7%) (Fig. 1D and SI Appendix, Fig. S2A). While the AAVHSCs edited at a higher efficiency than AAV2, AAV6, and AAV8, we observed variability in GFP expression between different AAVHSCs (SI Appendix, Fig. S2A). Interestingly, AAVHSCs and AAV9 were found in human CD34+ cells, share significant sequence homology, map to clade F (30), and displayed similar enhanced genome editing properties. Thus, we conclude that stem cell-derived clade F AAVs represent a powerful class of vectors that mediate efficient genome editing and targeted insertion at specified chromosomal locations, at unparalleled efficiencies in primary human cells in the absence of exogenous nucleases.
Both genome editing and gene transfer by AAV vectors must undergo common early cell interaction processes, including receptor-mediated entry, translocation to the nucleus, nuclear entry, and uncoating before vector genomes are released and available for either transgene expression, in the case of gene transfer, or genome editing. Thus, we compared the efficiency of genome editing, as measured by promoterless GFP expression, with that of gene transfer for the different AAV serotypes (SI Appendix, Fig. S2B). Results revealed a direct correlation between transduction and editing (R2 = 0.8337), suggesting that specific virus–cell interactions may represent rate-limiting steps that are common to both processes and indicating that the overall transduction efficiency, as measured by transgene expression, may be a contributing factor in editing efficiency in vitro. Since serotype-dependent differences have been shown to play a role in nuclear entry (38), we quantitated intranuclear vector genomes in purified nuclei following transduction of CD34+ cells with different AAV clades (Fig. 1D). Results revealed that the number of intranuclear vector genomes was higher for clade F AAVs [average = 149.29 vector genomes (vg) per nucleus] at 48 h posttransduction, compared with AAV vectors belonging to clade A (5.49 vg per nucleus), clade B (0.18 vg per nucleus), and clade E (4.26 vg per nucleus). Thus, clade F AAVs are over 27-fold more efficient at delivering genomes to the nucleus compared with AAV belonging to clades A, B, and E. Additional factors, including second-strand synthesis and genome concatamerization, may also play roles in guiding editing efficiencies but remain as yet undefined.
Molecular Characterization of AAVHSC-Mediated Genome Editing in CD34+ Cells.
We next evaluated the on-target precision and efficiency of HR at the allelic level in primary CD34+ cells treated with the promoterless PPP1R12C-GFP editing vectors designed to insert the GFP ORF into intron 1 of the PPP1R12C gene in AAVS1. Flow cytometric analysis of AAVHSC-treated CD34+ cells revealed distinct GFP-expressing populations. In a representative experiment, CD34+ cells treated with AAVHSC5, AAVHSC7, AAVHSC12, and AAVHSC17 expressed GFP in 47.4%, 41.8%, 58.1%, and 48.9% of cells within 48 h, respectively (Fig. 2A). In contrast, no specific GFP expression was observed following treatment with the promoterless AAV6 PPP1R12C-GFP editing vector (Fig. 2A). Similarly, AAV2 and AAV8 showed low levels of GFP expression in primary CD34+ cells (SI Appendix, Fig. S1). Donor variability in transduction efficiencies of primary cells, including CD34+ cells, has previously been reported (32, 39) and may account for differences observed between experiments with cells from different donors.
Fig. 2.
AAVHSCs mediate precise and efficient genome editing in primary human cells. (A) Flow cytometric analysis of GFP expression in cord blood CD34+ cells following editing of the promoterless GFP ORF into intron 1 of PPP1R12C. Cells were transduced at a MOI of 1.5E5 and assayed for GFP expression after 48 h. The fraction of specific GFP expression is noted in each flow profile. Untd, untreated. (B) Schema depicting the edited PPP1R12C locus showing targeted insertion of GFP, the expected 1.7-kb TI amplicon, and the location of primers for the TI assay. Gels show TI amplicons (red arrow) specific for GFP insertion into intron 1 of PPP1R12C in HepG2 and CD34+ cells at 24 h and 39 d posttransduction in vitro. FWD, forward; HAL, left HA; MW, molecular weight marker; P, promoter; REV, reverse. (C) Correlation of GFP expression in edited cells with molecular ddPCR-based quantitation of edited alleles. CD34+ cells were treated with AAVHSC17 PPP1R12C-GFP editing vector at a MOI of 1.5E5. Flow cytometry-based GFP expression and ddPCR-based allele quantitation were measured from the same samples after 48 h. (D) Enrichment of edited alleles in flow-sorted K562 cells. K562 cells were treated with the AAVHSC9 PPP1R12C-GFP editing vector at a MOI of 150,000. Cells were assessed for the frequency of edited alleles before and 3 wk after enrichment of GFP-expressing cells. (E) NGS analysis of the edited PPP1R12C genomic region from edited CD34+ cells. Shown is the NGS read depth (y axis) and the covered chromosomal region (x axis), including HAs and the GFP insert. The fidelity of editing and errors per allele is noted, showing seamless editing with no inserted viral sequences being detected. The locations of errors are denoted by red arrows under the map. Each arrow signifies a single error. (F) Targeted editing of the promoterless GFP cassette into two locations within the human IL2RG gene in primary human CD34+ cells. The map depicts the IL2RG gene. Insertion locations are at either the ATG or in intron 7 (HA; depicted in blue). Flow analyses of GFP expression following insertion of the GFP ORF at either the initiation codon or in intron 7 are shown. Expression is driven by the chromosomal IL2RG promoter. Percent-specific GFP is noted within each flow profile.
To confirm the accuracy of on-target editing, we developed a targeted integration (TI) assay (Fig. 2B and SI Appendix, Fig. S3). A GFP-specific primer and a chromosome-specific primer complementary to the genomic sequences outside the 5′ HA were used to amplify a 1.7-kb fragment, representing accurate insertion of the GFP ORF into intron 1 of PPP1R12C (Fig. 2B and SI Appendix, Fig. S3A). The correctly sized 1.7-kb band was amplified only from AAVHSC-treated, but not untreated, HepG2 and CD34+ cells (Fig. 2B). Densitometric analyses of the gels revealed an average 20-fold increase of band intensity of the TI-specific amplicon in lanes containing amplified DNA from cells treated with the editing vector compared with untransduced and no-template controls (SI Appendix, Fig. S4A), and an eightfold increase over untransduced controls from CD34+ day 39 gel (Fig. 2B and SI Appendix, Fig. S4B). Specific TI amplicons were observed within 24 h of treatment and were readily detected as late as 39 d posttreatment in CD34+ cells maintained in culture in the absence of selective pressure, suggesting that the genome editing event occurred rapidly and was stable over time (Fig. 2B). GFP expression was assayed in these long-term cultures in parallel on days 20 and 39 (SI Appendix, Table S3). Notably, due to the asymmetrical differentiation of primary CD34+ cells in culture into different hematopoietic lineages with varying lifespans, long-term cultures do not accurately reflect the initial cultures. TI assays with primers spanning the 3′ edited chromosomal junction region also revealed amplification of the correctly sized bands (SI Appendix, Fig. S3A). Importantly, control amplifications of untransduced cell DNA spiked with an excess of purified PPP1R12C-GFP editing vector genomes did not show any amplicons (SI Appendix, Fig. S3B, Lower Left, lane 5), excluding the possibility of cross-over amplification with episomal vector DNA.
To confirm that the TI amplicons represented the correct targeted insertion product, the 1.7-kb amplicon spanning the 5′ chromosomal junction, including the left HA and into the SA/2A-GFP insert, was subjected to Sanger sequencing (SI Appendix, Fig. S5 A and B). Since the high GC content of the palindromic AAV ITRs is known to be difficult to sequence through, we developed a method that keeps the ITRs sufficiently denatured so as to reproducibly obtain complete sequence reads (40). Sequence analysis from edited CD34+, K562, and HepG2 cells revealed that targeted insertion was precise, with the chromosomal sequences being contiguous with HA sequences, followed by the SA/T2A and the GFP ORF. There was no evidence of sequence alterations in either the chromosome or junction regions or in the insert (SI Appendix, Fig. S5 A and B). To further confirm the accuracy of editing, we sequenced both junction regions, including the target chromosome-HA as well as the junction between each HA and GFP insert using chromosome-specific primers complementary to regions external to HAs (SI Appendix, Fig. S5C). In concordance with the previous results, editing was found to be seamless at all junctions with no indel mutations or sequence disruptions. Importantly, no elements of the viral genomic backbone, including the ITRs, were found inserted into the genome, suggesting that HR could be involved. These results provide sequence confirmation of successful and precise editing of the promoterless GFP cassette into the intended chromosomal location. Importantly, this was observed in primary human cells and cell lines, including human CD34+ cells, which predominantly existed in a resting state at the time of treatment (36).
To quantitate the fraction of alleles that were successfully edited, we developed a droplet digital PCR (ddPCR)-based allele quantitation assay (Fig. 2C and SI Appendix, Fig. S6). In this assay, cellular genomes are diluted to ∼30 cells per 20-μL reaction, a concentration at which both the cellular and vector genomes are within the linear range of detection in these experiments. To detect editing, we employed multiplexed Taqman assays specific for GFP and the chromosomal location, respectively. A duplex Taqman assay was performed with each genomic DNA sample within an oil emulsion that partitions each reaction such that each droplet is either positive or negative for GFP and the genome. Each droplet is read out by distinguishing PCR-positive droplets from PCR-negative droplets within each channel. Free vector partitions to the upper left quadrant (SI Appendix, Fig. S6A, blue dots), and unedited chromosomes partition to the lower right quadrant (SI Appendix, Fig. S6A, green dots). Edited genomes containing both the GFP-specific signal and the chromosome-specific signal in cis on a single molecule of DNA are represented in the partition to the upper right quadrant (SI Appendix, Fig. S6A, orange dots), as well as the possible coincidence of genome and free vector. To control for the coincidence of GFP and genome probes, the number of expected double-positive droplets generated by chance was subtracted from the observed number of double-positive droplets. A standard curve generated for this assay (SI Appendix, Fig. S5B) shows a correlation coefficient of 0.9717. SI Appendix, Table S4 provides the complete statistical analysis of the allele quantitation assay. To confirm the quantitation of editing and to control for error due to sampling, each DNA sample was also analyzed following restriction digestion with an enzyme that cleaves between the GFP and chromosome-specific probe-binding sites. This artificial separation of the editing signal from the genome signal results in the independent partitioning of each probe target (SI Appendix, Fig. S7A). SI Appendix, Fig. S7B and Table S5 show that following restriction digestion, the signal in the upper right quadrant is fully resolved into the free vector and free locus signals, indicating that the edited allele signal (SI Appendix, Fig. S7B, orange dots) accurately represented the binding of both vector and locus probes to the same piece of DNA (SI Appendix, Fig. S7B and Table S5). Thus, we conclude that this ddPCR-based allele quantitation assay accurately measures edited chromosomes.
To determine if GFP expression in edited cells correlated with the frequency of edited alleles detected by ddPCR, we treated CD34+ cells with the AAVHSC17 PPP1R12C-GFP editing vector. Results revealed that GFP expression, as measured by flow cytometry, was highly correlated with edited alleles (R2 = 0.76) (Fig. 2C). A detailed statistical analysis is shown in SI Appendix, Table S6. These results validated that GFP detection by flow cytometry was an accurate measure of genome editing. To further confirm this, we tested whether flow-sorted, GFP-positive, AAVHSC-treated cells would display an enrichment of edited alleles. K562 erythroleukemia cells were treated with the AAVHSC9 PPP1R12C-GFP editing vector. Allele quantitation by ddPCR before and after GFP-based flow sorting revealed an enrichment of edited alleles, with 37% of alleles being edited 6 wk after transduction compared with 11% before sorting, indicating the stability of the AAVHSC-mediated genome editing over time (Fig. 2D). Given that K562 represents an aneuploid cell line, and the possibility of monoallelic editing, these results confirmed that GFP expression correlated well with the frequency of edited alleles and was stable over time in culture without selective pressure.
Editing Is Seamless with No Evidence of Genomic Scarring.
Although HR can be achieved at a low range of efficiencies with nuclease-based genome editing platforms in the presence of donor molecules (16, 41), the vast majority of edited alleles undergo repair via the NHEJ pathway, resulting in a spectrum of on-target genomic indel mutations (42). Thus, the presence of genomic scars such as indels or insertion of AAV ITRs in edited genomes would then be indicative of on-target mutations and utilization of non–homology-directed repair pathways. To evaluate the presence of on-target genomic alterations across millions of alleles, we performed next-generation sequencing (NGS) of TI amplicons spanning >2 kb of the PPP1R12C target site in human CD34+ cells edited with the PPP1R12C-GFP editing vector. To measure indel frequency and eliminate primer-dimer reads, we first identified our primers in each read. We then checked that each read contained at least 50 bases beyond the 3′ end of each of the genomic primers to ensure proper alignment to the reference genome using Bowtie 2 alignment software. A total of 1,901,904 demultiplexed reads satisfied this criterion. These reads were used to identify indels by comparing each sequence with those in the reference sequence, excluding natural variants in the human genome (dbSNP 147). Multiple alignment analysis for specific reads was done using the Geneious iterative k-mer multiple alignment tool (Geneious 6.1.7) (43). Of >1.9E6 reads aligned with the genomic PPP1R12C target site, only four sequences displayed alterations or differences, compared with a reference target sequence, resulting in an on-target error rate of <4.21E−6 indels per read (Fig. 2E), which is notably lower than on-target error rates generated by other editing platforms. The four indels observed were all within the GFP ORF and are denoted by red arrows (Fig. 2E). No recombination scars were observed at either the ends of the HAs or the insertion junctions.
Importantly, NGS analysis further revealed the complete absence of reads showing incorporation of AAV ITRs at the edited site, demonstrating the absence of insertion of viral backbone sequences into the genome. To confirm that AAV ITRs could actually be identified by NGS analysis, we spiked K562 cell DNA with a 1,000-fold excess of ITR-containing plasmids before NGS analysis. Results showed that ITRs were indeed detected in these libraries, with >34,000 and >8,000 reads covering at least 50 bp of the 5′ and 3′ ITRs, respectively (SI Appendix, Fig. S8). A small minority of reads spanned the entire ITR, with the majority of sequences dropping off after ∼60 bp. Thus, if ITRs were indeed incorporated at editing sites, at least 50 bp of each ITR sequence would have been detectable in the NGS analysis. NGS analysis also confirmed the findings of accurate and seamless editing as determined by Sanger sequencing (SI Appendix, Fig. S5). These results established that AAVHSC-mediated genome editing is highly precise and occurs without any concurrent on-target mutations commonly associated with DNA strand breaks and is consistent with the utilization of an HR editing pathway.
AAVHSCs Mediate Genome Editing at the Human IL2RG Locus.
To confirm that AAVHSC-mediated genome editing was not limited to the single PPP1R12C locus, we tested editing of other loci within the human genome. We evaluated targeted insertion of the promoterless GFP ORF within two locations in the clinically relevant human IL2RG locus, mutations of which cause X-linked SCID disease. We evaluated editing at the ATG start codon and also within intron 7, such that GFP expression would be driven by the chromosomal IL2RG promoter (44).
In both editing vectors, the promoterless GFP ORF cassette was bilaterally flanked by 800-bp HAs that specified the chromosomal insertion location (SI Appendix, Table S1). For insertion at the start codon, no SA/T2A sequences were utilized. The editing vector targeting the intron included a SA/T2A sequence immediately upstream of the GFP ORF. Flow cytometric analysis of primary human CD34+ cells treated with AAVHSC17-IL2RG revealed GFP expression in 7.93% of cells with insertion at the ATG start codon and 8.3% with insertion in intron 7 (Fig. 2F). Locus-dependent differences in gene editing efficiencies have been described and likely account for the differences in efficiency observed between the PPP1R12C and IL2RG loci (45). Thus, we conclude that AAVHSC-mediated editing of the IL2RG gene is potentially therapeutic. Similarly, we have also observed successful editing at two locations within the human factor VIII gene.
AAVHSCs Mediate Nucleotide Replacement.
With current genome editing technologies, correction of point mutations is challenging. Instead of correcting mutations, DNA breaks repaired via the NHEJ pathway result in unpredictable on-target indels and the possible introduction of new mutations. Hence, we tested the potential of AAVHSC vectors to accurately correct point mutations at the nucleotide level. A dinucleotide editing vector was used to alter a wild-type “TA” sequence to an “AT” in intron 1 of PPP1R12C (nucleotide replacement vector; SI Appendix, Table S1). This dinucleotide change results in the conversion of a naturally occurring NheI restriction site in PPP1R12C intron 1 to a SphI restriction site (Fig. 3A). In addition to the dinucleotide replacement, the nucleotide replacement editing vector was designed to insert a 12-bp linker 245 bp downstream of the NheI site (Fig. 3A) to create a primer-binding site for the insert-specific primer for the TI PCR assay (Fig. 3A). A control vector was designed to only insert the downstream 12-bp primer-binding site, but not to alter the NheI site. Both the nucleotide replacement and control vectors had a 583-bp left HA 5′ to the NheI site and an 800-bp HA downstream of the 12-bp linker. The total AAV vector genome size was 2,011 bp (SI Appendix, Table S1). CD34+, K562, and HepG2 cells were treated with either the nucleotide replacement (SphI) or control (NheI) vector packaged in AAVHSC7 and AAVHSC17. A 1,002-bp PCR amplicon was generated using a primer specific for the 12-bp linker and a chromosome-specific primer located upstream of the left HA (Fig. 3A). Sanger sequencing of amplicons from transduced HepG2 cells was compared with the reference control genomic sequence (Fig. 3B). The dinucleotide sequence change was found to be introduced precisely at the correct location, with no additional sequence alterations observed either locally (500-bp level) or at the nucleotide level (Fig. 3B). These data demonstrate that AAVHSCs can precisely replace specified genomic nucleotides within the genome without any other concurrent sequence alterations. These data also demonstrate that AAV editing vector genomes as small as 2011b effectively mediate genome editing. Self-complementary AAV (scAAV) editing vectors were not tested for nucleotide replacement since previous reports showed a complete loss of detectable editing with scAAV (26, 46). We conclude that AAVHSC editing vectors may be used for high-fidelity correction of point mutations or for the precise editing of nucleotide variants without the introduction of inadvertent base changes or genomic scarring.
Fig. 3.
Replacement of a specific nucleotide to model correction of point mutations. (A) Schema of dinucleotide replacement editing assay. (Right) In line 1, the nucleotide (NT) replacement vector was designed to replace the wild-type genomic “TA” in intron 1 of PPP1R12C with “AT”, replacing the wild-type NheI site with a SphI site. The nucleotide replacement vector was also designed to insert a 12-bp linker 245 bp downstream (depicted in orange). (Left) In line 1, control vector was designed to only insert the 12-bp linker, without replacing the TA dinucleotide. HAL, left HA; HAR, right HA. Line 2 shows the wild-type chromosome, highlighting the region to be edited. Line 3 shows the expected configuration of the edited chromosome. Green boxes represent exons, and introns are shown as pink lines. FWD, forward; REV, reverse. (B) Sequence alignment showing dinucleotide editing in intron 1 of PPP1R12C in HepG2 cells. Both 500-bp and 80-bp windows are shown. Alignment across sequenced regions is displayed as a stack plot by position (green). Precise dinucleotide editing is displayed as a cumulative position-weighted matrix (PWM) of each edited sequence aligned to a reference trace.
Genome Editing by AAVHSC Requires BRCA2.
To identify the mechanism of AAVHSC editing, we tested the hypothesis that mutations in specific genes known to mediate DNA damage response and maintain genome integrity would result in altered editing outcomes (47–59). Targeted insertion of the promoterless GFP ORF into PPP1R12C was quantitated in each of 12 patient-derived B LCLs that harbor loss-of-function mutations in selected DNA repair genes and compared with that in CD34+ cells (Fig. 4A and SI Appendix, Table S7). Cumulative averages of editing for each AAV serotype across mutant cell lines revealed a spectrum of impacts (49–59) (Fig. 4B and SI Appendix, Fig. S9A). Cell lines with certain mutations displayed potentiated HR, while others displayed diminished HR (Fig. 4 A and B and SI Appendix, Fig. S9). However, since the LCLs were derived from different patients and therefore not congenic, this precluded precise conclusions on the role of each repair gene. Interestingly, however, the overall pattern of each AAV serotype was similar for each mutation, suggesting that the impact of specific repair genes on genome editing was similar for each AAV serotype (Fig. 4B).
Fig. 4.
Role of DNA repair genes in AAVHSC- and AAV9-mediated genome editing. (A) Heat map of genome editing efficiencies in LCLs harboring mutations in DNA repair genes. LCLs were treated with AAVHSC PPP1R12C-GFP vectors at a MOI of 1.5E5. Cells were analyzed by flow cytometry after 48 h. Data represent percent GFP expression in live cells following editing of the GFP ORF into intron 1 of PPP1R12C (Fig. 1A). Row 1 depicts primary human CD34+ cells for comparison. Subsequent rows depict immortalized cell lines bearing DNA repair mutations as follows: row 2, BLM; row 3, ERCC4/XPF; row 4, NBS1; row 5, RAG1; row 6, ATM; row 7, FANCF; row 8, FANCB; row 9, FANCC; row 10, FANCA; row 11, FANCD2; row 12, BRCA2+/−; and row 13, BRCA2−/−. Columns depict different AAV serotypes. (B) Stack plot of average cumulative editing efficiencies in LCLs harboring loss-of-function mutations in selected DNA damage response genes. Genes in the x axis are as follows: 1, GM04408 (BLM); 2, GM08437 (ERCC4); 3, GM15818 (NBS1); 4, ID00078 (RAG1); 5, GM03332 (ATM); 6, GM13023 (BRCA2−/−); 7, GM13071 (FANCB); 8, GM14622 (BRCA2+/−); 9, GM12794 (FANCC); 10, GM16749 (FANCA); 11, GM16756 (FANCD2); and 12, GM16757 (FANCF). Each AAV serotype is denoted by a specific color. (C) AAVHSCs successfully mediate gene transfer into BRCA2−/− fibroblasts but do not edit. Flow cytometric analysis of the EUFA423 BRCA2−/− or BRCA2+/+ fibroblast line transduced with AAVHSC17 PPP1R12C-GFP editing vector or a CBA-GFP gene transfer vector is shown, where GFP expression is driven by the CBA promoter. Robust GFP expression is observed after gene transfer, but no GFP is observed following treatment with the editing vector in BRCA2−/− fibroblasts. The absence of editing in BRCA2−/− cells is denoted by lack of GFP expression. Untd, untreated.
AAVHSC vectors successfully mediated editing of every DNA repair mutant line tested with the single exception of the cell line GM13023 (59), a compound heterozygous mutant of BRCA2 (SI Appendix, Table S7), an essential mediator of HR (60, 61) (Fig. 4 A and B), indicating an absolute requirement for functional BRCA2 for AAVHSC editing. Cells that were heterozygous for functional BRCA2 (GM14622; SI Appendix, Table S7), however, underwent efficient editing, suggesting that a single functional BRCA2 allele was sufficient (Fig. 4 A and B). The absolute requirement for BRCA2 was further confirmed using the unrelated BRCA2-null fibroblast line EUFA423 (62) (Fig. 4C). BRCA2-null fibroblasts were transduced with either a control promoter-driven AAVHSC17 GFP gene transfer vector or a promoterless AAVHSC17 PPP1R12C-GFP editing vector. Flow analysis revealed the complete lack of GFP expression in BRCA2-null cells treated with the editing vector despite robust GFP expression following transduction with the control gene transfer vector (Fig. 4C). The difference in the level of GFP expression in the BRCA2-null versus wild-type fibroblasts following transduction with the gene transfer vector is likely due to differences in background genes, since the two fibroblast cell lines were derived from unrelated individuals. These results clearly demonstrate that AAVHSC17-mediated genome editing did not occur in the absence of BRCA2, despite the ability of the vector to efficiently transduce EUFA423 cells. The same AAVHSC17 PPP1R12C-GFP editing vector readily mediated genome editing in other BRCA2+/+ cell lines and primary cells (Figs. 1C, 2A, and 4A). Thus, we conclude that BRCA2 is essential for AAVHSC-mediated genome editing. We additionally confirmed the complete absence of editing at other loci in the human genome, including the FVIII and IL2RG genes in BRCA2−/− cells. These results, together with the complete absence of editing by any AAVHSC vector in BRCA2−/− cell lines (Fig. 4 A and B and SI Appendix, Fig. S9), strongly support the requirement for BRCA2 in the AAVHSC-mediated genome editing pathway and demonstrate that HR is the operative mechanism of genome editing by AAVHSC.
AAVHSC Editing Vectors Mediate in Vivo Editing.
To evaluate the potential of AAVHSC for in vivo genome editing, we tested whether i.v. injection of an AAVHSC editing vector could efficiently edit a murine genomic locus. The editing vector, Rosa26-Luc, was designed to insert a promoterless luciferase ORF into intron 1 of the ubiquitously expressed murine Rosa26 locus (63, 64) (Fig. 5A and SI Appendix, Table S1), leading to expression from the endogenous Rosa26 promoter. The promoterless luciferase ORF was cloned downstream of a SA/T2A sequence and was followed by a pA signal (Fig. 5A). The entire cassette was flanked by 800-bp HAs (Fig. 5A and SI Appendix, Table S1). The editing vector, Rosa26-Luc, and a control vector containing the luciferase cassette without HAs (noHA) were packaged in AAVHSC15. Nonobese diabetic/SCID mice were injected i.v. with 5e11 vg with the AAVHSC15 Rosa26-Luc editing vector, the AAVHSC15 noHA control vector, or an AAV8 Rosa26-Luc editing vector. AAVHSC15 and AAV8 were chosen for these experiments since both serotypes have previously been shown to mediate efficient in vivo gene transfer in mice (33, 65, 66) (SI Appendix, Fig. S10A). Serial whole-body in vivo bioluminescent imaging revealed luciferase expression only in mice injected with the promoterless AAVHSC15 Rosa26-Luc editing vector, but not in mice injected with the control noHA vector lacking HAs, and only nominal expression in mice injected with the AAV8 Rosa26-Luc editing vector (Fig. 5B and SI Appendix, Figs. S10 and S11). The complete absence of luciferase expression in mice treated with noHA vectors confirmed the requirement for HAs for targeting.
Fig. 5.
AAVHSCs mediate efficient in vivo genome editing after i.v. injection. (A) Schema showing the vector and genome maps for the Rosa26-Luc editing vector that inserts the promoterless luciferase ORF into intron 1 of the murine Rosa26 locus. The luciferase ORF is preceded by SA/T2A (SA/2A) and followed by pA (PA). The luciferase cassette is flanked by 800-bp HAs targeting intron 1 of Rosa26. HAL, left HA; HAR, right HA. Also depicted is the map of the edited Rosa26 locus. (B) Serial in vivo imaging of luciferase expression in mice injected with AAVHSC15 Rosa26-Luc vector (5e11 vg per mouse). Luciferase expression is under the control of the chromosomal Rosa26 promoter. Also shown are AAVHSC15 noHA, a negative control containing the luciferase cassette but noHA, and an AAV8 Rosa26-Luc editing vector. Days postinjection (D) are depicted below each image. The ventral (Upper) and dorsal (Lower) images of each mouse are shown. Sample sizes for the experimental groups are as follows: AAVHSC15 Rosa26-Luc group, n = 5; AAVHSC15 noHA group, n = 3; and AAV8 Rosa26-Luc group, n = 3. (C) Sanger sequence analysis of the junction sequences between the HA and genomic DNA (gDNA) confirmed precise editing of the luciferase ORF into the Rosa26 locus. (D) Southern blot analysis of the edited murine Rosa26 locus. Genomic DNA from the liver was digested with SpeI, gel-electrophoresed, transferred to a nylon membrane, and hybridized with a luciferase-specific probe. The expected 7-kb band is observed with the luciferase probe in the edited liver DNA (lane 5), but not from untreated liver DNA (lane 4). Also shown is a titration of the vector plasmid as a positive control for the probe (lanes 1–3). MW, molecular weight marker. Maps show the location of the 7-kb band in the edited Rosa26 locus and the 3,049-bp band in the Rosa26-Luc vector plasmid (pRosa26-Luc) cut with DrdI and SpeI. Potential sizes are also depicted on a map of the vector genome. Red double-headed arrows depict the luciferase-binding site.
We compared the ability of AAVHSC15 and AAV8 to mediate gene transfer and genome editing in vivo (SI Appendix, Fig. S10). Mice were injected with either a gene transfer vector encoding the luciferase gene under control of the chicken beta actin (CBA) promoter or the promoterless Rosa26-Luc editing vector packaged in either AAV8 or AAVHSC15. Bioluminescent imaging and flux measurements 50–60 d postinjection revealed that AAVHSC15 mediated in vivo gene transfer 3.7-fold more efficiently than AAV8 (SI Appendix, Fig. S10A). However, for mice injected with the promoterless Rosa26-Luc editing vector, the AAVHSC15-treated group showed 43.3-fold greater luciferase expression compared with the AAV8-treated group (SI Appendix, Fig. S10B). The low level of editing-specific luciferase expression observed in the AAV8 Rosa26-Luc–treated group is comparable to previously described results (27, 67, 68). The markedly increased luciferase expression observed with AAVHSC15 relative to AAV8 was consistent with the enhanced editing capacity of AAVHSCs relative to other AAV vectors in vitro (Figs. 1C and 5B and SI Appendix, Figs. S1 and S2).
Luciferase expression was detectable as early as day 3 after injection with the AAVHSC15 Rosa26-Luc editing vector, and was stable to day 112 postinjection, the last time point assayed. Expression gradually increased after injection and plateaued within ∼4–6 wk. Luciferase expression was observed systemically, consistent with the expected ubiquitous expression of Rosa26 (69). In vivo imaging indicated strong widespread systemic luciferase expression (Fig. 5B). Organ-specific expression was assessed in isolated organs at the end of the experiment. Quantitation of flux in isolated organs revealed the highest luciferase expression, as measured by flux, in the liver, followed by muscle (SI Appendix, Table S8). Luciferase expression was also detected in the heart, lungs, kidney, and brain. Quantitation of vector was performed for isolated organs by ddPCR specific for the luciferase gene relative to a single-copy endogenous gene, apoB. The liver showed the most copies of the luciferase gene at 0.737 copies per cell, followed by muscle (0.398 copies per cell) and heart (0.317 copies per cell) (SI Appendix, Table S8). Notably, no toxicity due to AAVHSC editing was noted for up to 6 mo postinjection, the end of the study.
To confirm editing at the molecular level, we employed linear amplification PCR (LAM-PCR) followed by sequence analyses. LAM-PCR was initiated in the chromosomal sequences external to the HAs, spanning both HAs and reading into the luciferase ORF. Sequence analyses confirmed accurate insertion into the intended location, intron 1 of the Rosa26 gene (Fig. 5C). No indel mutations or AAV ITRs were detected in the edited genomes. We further confirmed TI of the luciferase cassette into the Rosa26 locus by Southern blot analysis of mouse liver DNA, harvested 70 d after injection of AAVHSC15 Rosa26-Luc. The expected 7-kb band was visualized following hybridization of SpeI-digested DNA with a luciferase-specific probe (Fig. 5D). No luciferase-specific bands were detectable in untreated liver DNA from control mice (Fig. 5D), showing specificity. Interestingly, no 3,966-bp monomer-length free vector genomes were detected in the Southern blot analysis, possibly suggesting that the editing genome may either not have initiated second-strand synthesis or not formed double-stranded concatamers commonly observed following AAV transduction (70). Digestion of double-stranded concatamers with a single cutter enzyme would be expected to yield monomer-length AAV vector genomes, which were not observed, suggesting that such concatamers were absent. If any single-stranded AAV genomes were present, they would likely have formed “snapback” structures due to the complementarity of the ITRs and may have migrated faster in nondenaturing gels. If free vector genomes persisted as double-stranded monomers, SpeI digestion would yield fragments of 2,783 bp and 1,183 bp (Fig. 5D), which were not observed. It is also possible that the number of free vector genomes was below the limit of detection for the Southern blot assay (71). However, the presence of a 7-kb band on the Southern blot analysis, together with in vivo expression and sequence analysis, confirmed the accurate targeted insertion of the luciferase ORF into intron 1 of Rosa26 following a single i.v. injection of the AAVHSC15 editing vector. Southern blot analysis also revealed no discernible off-target bands, suggesting the absence of off-target integrations within the limits of detection of the assay.
Discussion
Here, we present evidence for a unique, precise, exclusively HR-based platform for genome editing that operates at unprecedented efficiencies in primary cells in vitro and in vivo, without the need for prior nuclease-mediated DNA breaks. The genomic site to be edited is specified solely by the HAs. Remarkably, this editing platform functions efficiently in postmitotic cells, such as the adult liver in vivo, freshly isolated human liver sections, myocytes, and unstimulated primary CD34+ cells. On-target editing events were highly precise and seamless, with no evidence of indels. This editing pathway functions exclusively through HR, as evidenced by the absolute requirement for BRCA2, an essential mediator of HR, and the complete absence of any evidence of NHEJ-based repair, including indels and incorporation of AAV ITRs. This genome editing modality may be used not only for efficient editing of therapeutic sequences at specified chromosomal locations but also for accurate correction of disease-causing point mutations. While AAV-mediated genome editing in the absence of nucleases has been previously reported (24–29, 67, 68), efficiencies were too low for most therapeutic applications (29). Here, we present data supporting genome editing efficiencies >50% in primary human cells as measured by expression of a reporter gene following targeted chromosomal insertion. This property of enhanced genome editing efficiency is uniquely shared by members of stem cell-derived AAV clade F, including AAVHSCs and AAV9, and provides a distinctive alternative to current genome editing platforms.
Recently, AAV6 has been used in conjunction with nuclease-based genome editing platforms for the delivery of donor DNA for homology-dependent repair of double-stranded DNA breaks (72–76). While AAV6-mediated donor delivery was efficient and editing was observed following the creation of DNA breaks, no AAV6-mediated editing was observed in the absence of nucleases (72–75), in concordance with our findings.
Our results indicate that vector genomes ranging from ∼2 to 4 kb were found to mediate enhanced genome editing, suggesting that genome size may not be critical. The editing vectors used here were designed with 800b symmetrical HAs, with the exception of the nucleotide substitution vector, which had asymmetrical HAs of 583b and 1045b, indicating that flexibility in HA design is feasible.
This distinctive editing platform is based upon the naturally occurring AAVs present in human HSCs, including AAVHSCs and AAV9 (30). The unique single-stranded AAV genomes bounded by the palindromic G/C-rich ITRs likely activate a cellular HR pathway, which leads to genome editing. This is supported by previous observations that scAAV genomes, which are fully double-stranded, do not mediate recombination (26, 46). Results presented here suggest that in addition to the unique AAV genome structure, the capsid sequences of clade F AAV play an essential role in potentiating HR and genome editing. Identical editing vector genomes packaged in non-clade F capsids, including AAV2, AAV6, or AAV8, mediate editing at significantly lower efficiencies both in vitro and in vivo.
Our results indicate that clade F AAVs were over 27-fold more efficient at nuclear entry than AAVs belonging to clades A, B, and E. Since nuclear entry is essential for both genome editing and transduction, it is possible that more efficient nuclear entry might be partially responsible for the higher level of genome editing observed with clade F viruses, potentially accounting for the observed correlation between transduction and editing efficiencies in vitro. However, in vivo comparisons of gene transfer with genome editing via AAVHSC15 and AAV8 suggested that editing efficiency did not directly correlate with transduction efficiency in vivo. Although both AAV8 and AAVHSC15 transduced mice efficiently, AAVHSC15 was more efficient at editing than AAV8. Whether specific HR pathways are additionally initiated upon receptor engagement by AAVHSC or other cellular mechanisms are activated upon interaction of target cells with the capsids is under investigation.
Upon nuclear entry and uncoating, single-stranded AAV genomes undergo second-strand synthesis and often exist as circular concatamers. Interestingly, under certain conditions, such as in the absence of ATM, AAV genomes were observed to remain in a linear configuration, inaccessible to the NHEJ pathway (70). Notably, no free vector genomes were detected in our Southern blot analysis, consistent with the possibility that circular concatamers were not generated. It is conceivable that a prolonged single-stranded phase of editing vector genomes may additionally lead to enhanced editing. While single-stranded vector genomes could potentially be unstable in the nuclear environment, association with a cascade of repair proteins may serve to protect them from degradation and promote HR.
Importantly, the ability of AAVHSC and AAV9 to mediate HR efficiently in resting and postmitotic cells was evidenced by successful editing of freshly isolated liver sections, myocytes, and unstimulated CD34+ cells, as well as in vivo, including the liver and muscle in adult mice. The ability of AAVHSC and AAV9 to support genome editing in postmitotic adult tissue is unique (17) and has both mechanistic as well as therapeutic implications.
Thus, AAV clade F, including AAVHSC and AAV9, represents a uniquely HR-based, highly precise, and efficient seamless genome editing modality in a single-component, nuclease-free platform, with a built-in delivery component that overcomes many of the challenges currently facing therapeutic translation of genome editing. Thus, together with its potent in vivo editing ability, clade F AAVs represent a distinct platform for the development of therapeutic in vivo genome editing for the treatment and potential cure of human diseases.
Materials and Methods
All animal care and experiments were performed under protocols approved by the City of Hope Institutional Animal Care and Use Committee. Deidentified, cytokine-primed, peripheral blood CD34+ cells were obtained with informed consent from healthy donors under a City of Hope Institutional Review Board (IRB)-approved protocol as previously described (29). Deidentified liver samples were obtained from the City of Hope operating room under an IRB-approved protocol for the use of discard material. Recombinant DNA work was performed according to NIH guidelines. Tissue culture, transductions, flow cytometry, Southern blot analysis, and PCR reactions were performed using standard procedures unless otherwise specified. A more detailed description of treatments and analyses is provided in SI Appendix, SI Materials and Methods.
Supplementary Material
Acknowledgments
We thank Dr. Indra M. Newman for valuable advice and assistance with the manuscript; Drs. Stephen Elledge, Kush Parmar, Albert Seymour, Arthur Tzianabos, and Sam Rasty for critical scientific feedback; and Drs. James McSwiggen and Mike O’Callaghan for stimulating discussions, scientific feedback, and helpful interactions. City of Hope core facilities utilized for this study include the Analytical Cytometry, DNA Sequencing, and Small Animal Imaging Cores. This research was supported by the National Heart, Lung, and Blood Institute and National Cancer Institute of the NIH through Grant HL087285 (to S.C.) and by Cancer Center Support Grant P30CA033572.
Footnotes
Conflict of interest statement: S.C. and L.J.S. are cofounders of Homology Medicines, Inc., and S.C., L.J.S., J.W., T.S.M., H.R., J.L.E., and D.K. hold equity in Homology Medicines, Inc.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1802343115/-/DCSupplemental.
References
- 1.Jasin M. Genetic manipulation of genomes with rare-cutting endonucleases. Trends Genet. 1996;12:224–228. doi: 10.1016/0168-9525(96)10019-6. [DOI] [PubMed] [Google Scholar]
- 2.Szostak JW, Orr-Weaver TL, Rothstein RJ, Stahl FW. The double-strand-break repair model for recombination. Cell. 1983;33:25–35. doi: 10.1016/0092-8674(83)90331-8. [DOI] [PubMed] [Google Scholar]
- 3.Pâques F, Haber JE. Multiple pathways of recombination induced by double-strand breaks in Saccharomyces cerevisiae. Microbiol Mol Biol Rev. 1999;63:349–404. doi: 10.1128/mmbr.63.2.349-404.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Bibikova M, Golic M, Golic KG, Carroll D. Targeted chromosomal cleavage and mutagenesis in Drosophila using zinc-finger nucleases. Genetics. 2002;161:1169–1175. doi: 10.1093/genetics/161.3.1169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhang F, et al. Efficient construction of sequence-specific TAL effectors for modulating mammalian transcription. Nat Biotechnol. 2011;29:149–153. doi: 10.1038/nbt.1775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Miller JC, et al. A TALE nuclease architecture for efficient genome editing. Nat Biotechnol. 2011;29:143–148. doi: 10.1038/nbt.1755. [DOI] [PubMed] [Google Scholar]
- 7.Mali P, et al. RNA-guided human genome engineering via Cas9. Science. 2013;339:823–826. doi: 10.1126/science.1232033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Cong L, et al. Multiplex genome engineering using CRISPR/Cas systems. Science. 2013;339:819–823. doi: 10.1126/science.1231143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Jinek M, et al. A programmable dual-RNA-guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zetsche B, et al. Cpf1 is a single RNA-guided endonuclease of a class 2 CRISPR-Cas system. Cell. 2015;163:759–771. doi: 10.1016/j.cell.2015.09.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Lieber MR. The mechanism of double-strand DNA break repair by the nonhomologous DNA end-joining pathway. Annu Rev Biochem. 2010;79:181–211. doi: 10.1146/annurev.biochem.052308.093131. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Jeggo PA. DNA breakage and repair. Adv Genet. 1998;38:185–218. doi: 10.1016/s0065-2660(08)60144-3. [DOI] [PubMed] [Google Scholar]
- 13.Daya S, Cortez N, Berns KI. Adeno-associated virus site-specific integration is mediated by proteins of the nonhomologous end-joining pathway. J Virol. 2009;83:11655–11664. doi: 10.1128/JVI.01040-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Rudin N, Haber JE. Efficient repair of HO-induced chromosomal breaks in Saccharomyces cerevisiae by recombination between flanking homologous sequences. Mol Cell Biol. 1988;8:3918–3928. doi: 10.1128/mcb.8.9.3918. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Capecchi MR. Altering the genome by homologous recombination. Science. 1989;244:1288–1292. doi: 10.1126/science.2660260. [DOI] [PubMed] [Google Scholar]
- 16.Rouet P, Smih F, Jasin M. Expression of a site-specific endonuclease stimulates homologous recombination in mammalian cells. Proc Natl Acad Sci USA. 1994;91:6064–6068. doi: 10.1073/pnas.91.13.6064. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Hustedt N, Durocher D. The control of DNA repair by the cell cycle. Nat Cell Biol. 2016;19:1–9. doi: 10.1038/ncb3452. [DOI] [PubMed] [Google Scholar]
- 18.Cox DB, Platt RJ, Zhang F. Therapeutic genome editing: Prospects and challenges. Nat Med. 2015;21:121–131. doi: 10.1038/nm.3793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Fu Y, et al. High-frequency off-target mutagenesis induced by CRISPR-Cas nucleases in human cells. Nat Biotechnol. 2013;31:822–826. doi: 10.1038/nbt.2623. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Kuscu C, Arslan S, Singh R, Thorpe J, Adli M. Genome-wide analysis reveals characteristics of off-target sites bound by the Cas9 endonuclease. Nat Biotechnol. 2014;32:677–683. doi: 10.1038/nbt.2916. [DOI] [PubMed] [Google Scholar]
- 21.Tsai SQ, et al. GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat Biotechnol. 2015;33:187–197. doi: 10.1038/nbt.3117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ran FA, et al. In vivo genome editing using Staphylococcus aureus Cas9. Nature. 2015;520:186–191. doi: 10.1038/nature14299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Komor AC, Badran AH, Liu DR. CRISPR-based technologies for the manipulation of eukaryotic genomes. Cell. 2017;169:559, erratum (2017) 168:20–36. doi: 10.1016/j.cell.2016.10.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Russell DW, Hirata RK. Human gene targeting by viral vectors. Nat Genet. 1998;18:325–330. doi: 10.1038/ng0498-325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Porteus MH, Cathomen T, Weitzman MD, Baltimore D. Efficient gene targeting mediated by adeno-associated virus and DNA double-strand breaks. Mol Cell Biol. 2003;23:3558–3565. doi: 10.1128/MCB.23.10.3558-3565.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Vasileva A, Linden RM, Jessberger R. Homologous recombination is required for AAV-mediated gene targeting. Nucleic Acids Res. 2006;34:3345–3360. doi: 10.1093/nar/gkl455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Barzel A, et al. Promoterless gene targeting without nucleases ameliorates haemophilia B in mice. Nature. 2015;517:360–364. doi: 10.1038/nature13864. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Miller DG, et al. Gene targeting in vivo by adeno-associated virus vectors. Nat Biotechnol. 2006;24:1022–1026. doi: 10.1038/nbt1231. [DOI] [PubMed] [Google Scholar]
- 29.Khan IF, Hirata RK, Russell DW. AAV-mediated gene targeting methods for human cells. Nat Protoc. 2011;6:482–501. doi: 10.1038/nprot.2011.301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Smith LJ, et al. Gene transfer properties and structural modeling of human stem cell-derived AAV. Mol Ther. 2014;22:1625–1634. doi: 10.1038/mt.2014.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Podsakoff G, Wong KK, Jr, Chatterjee S. Efficient gene transfer into nondividing cells by adeno-associated virus-based vectors. J Virol. 1994;68:5656–5666. doi: 10.1128/jvi.68.9.5656-5666.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Santat L, et al. Recombinant AAV2 transduction of primitive human hematopoietic stem cells capable of serial engraftment in immune-deficient mice. Proc Natl Acad Sci USA. 2005;102:11053–11058. doi: 10.1073/pnas.0502902102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zincarelli C, Soltys S, Rengo G, Rabinowitz JE. Analysis of AAV serotypes 1-9 mediated gene expression and tropism in mice after systemic injection. Mol Ther. 2008;16:1073–1080. doi: 10.1038/mt.2008.76. [DOI] [PubMed] [Google Scholar]
- 34.Chatterjee S, Johnson PR, Wong KK., Jr Dual-target inhibition of HIV-1 in vitro by means of an adeno-associated virus antisense vector. Science. 1992;258:1485–1488. doi: 10.1126/science.1359646. [DOI] [PubMed] [Google Scholar]
- 35.Fisher-Adams G, Wong KK, Jr, Podsakoff G, Forman SJ, Chatterjee S. Integration of adeno-associated virus vectors in CD34+ human hematopoietic progenitor cells after transduction. Blood. 1996;88:492–504. [PubMed] [Google Scholar]
- 36.Paz H, et al. Quiescent subpopulations of human CD34-positive hematopoietic stem cells are preferred targets for stable recombinant adeno-associated virus type 2 transduction. Hum Gene Ther. 2007;18:614–626. doi: 10.1089/hum.2006.188. [DOI] [PubMed] [Google Scholar]
- 37.Kotin RM, et al. Site-specific integration by adeno-associated virus. Proc Natl Acad Sci USA. 1990;87:2211–2215. doi: 10.1073/pnas.87.6.2211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Zhong L, et al. Impaired nuclear transport and uncoating limit recombinant adeno-associated virus 2 vector-mediated transduction of primary murine hematopoietic cells. Hum Gene Ther. 2004;15:1207–1218. doi: 10.1089/hum.2004.15.1207. [DOI] [PubMed] [Google Scholar]
- 39.Ponnazhagan S, et al. Adeno-associated virus type 2-mediated transduction in primary human bone marrow-derived CD34+ hematopoietic progenitor cells: Donor variation and correlation of transgene expression with cellular differentiation. J Virol. 1997;71:8262–8267. doi: 10.1128/jvi.71.11.8262-8267.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Mroske C, Rivera H, Ul-Hasan T, Chatterjee S, Wong KK. A capillary electrophoresis sequencing method for the identification of mutations in the inverted terminal repeats of adeno-associated virus. Hum Gene Ther Methods. 2012;23:128–136. doi: 10.1089/hgtb.2011.231. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Sargent RG, Brenneman MA, Wilson JH. Repair of site-specific double-strand breaks in a mammalian chromosome by homologous and illegitimate recombination. Mol Cell Biol. 1997;17:267–277. doi: 10.1128/mcb.17.1.267. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Pipiras E, Coquelle A, Bieth A, Debatisse M. Interstitial deletions and intrachromosomal amplification initiated from a double-strand break targeted to a mammalian chromosome. EMBO J. 1998;17:325–333. doi: 10.1093/emboj/17.1.325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Sherry ST, et al. dbSNP: The NCBI database of genetic variation. Nucleic Acids Res. 2001;29:308–311. doi: 10.1093/nar/29.1.308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Genovese P, et al. Targeted genome editing in human repopulating haematopoietic stem cells. Nature. 2014;510:235–240. doi: 10.1038/nature13420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Miyaoka Y, et al. Systematic quantification of HDR and NHEJ reveals effects of locus, nuclease, and cell type on genome-editing. Sci Rep. 2016;6:23549. doi: 10.1038/srep23549. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hirata RK, Russell DW. Design and packaging of adeno-associated virus gene targeting vectors. J Virol. 2000;74:4612–4620. doi: 10.1128/jvi.74.10.4612-4620.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Powell SN, Kachnic LA. Roles of BRCA1 and BRCA2 in homologous recombination, DNA replication fidelity and the cellular response to ionizing radiation. Oncogene. 2003;22:5784–5791. doi: 10.1038/sj.onc.1206678. [DOI] [PubMed] [Google Scholar]
- 48.Oettinger MA, Schatz DG, Gorka C, Baltimore D. RAG-1 and RAG-2, adjacent genes that synergistically activate V(D)J recombination. Science. 1990;248:1517–1523. doi: 10.1126/science.2360047. [DOI] [PubMed] [Google Scholar]
- 49.Liu P, et al. Regional mapping of human DNA excision repair gene ERCC4 to chromosome 16p13.13-p13.2. Mutagenesis. 1993;8:199–205. doi: 10.1093/mutage/8.3.199. [DOI] [PubMed] [Google Scholar]
- 50.Meyn MS. Ataxia-telangiectasia and cellular responses to DNA damage. Cancer Res. 1995;55:5991–6001. [PubMed] [Google Scholar]
- 51.Tauchi H, et al. Nbs1 is essential for DNA repair by homologous recombination in higher vertebrate cells. Nature. 2002;420:93–98. doi: 10.1038/nature01125. [DOI] [PubMed] [Google Scholar]
- 52.Ellis NA, et al. The Bloom’s syndrome gene product is homologous to RecQ helicases. Cell. 1995;83:655–666. doi: 10.1016/0092-8674(95)90105-1. [DOI] [PubMed] [Google Scholar]
- 53.Joenje H, et al. Evidence for at least eight Fanconi anemia genes. Am J Hum Genet. 1997;61:940–944. doi: 10.1086/514881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Joenje H, et al. Complementation analysis in Fanconi anemia: Assignment of the reference FA-H patient to group A. Am J Hum Genet. 2000;67:759–762. doi: 10.1086/303067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Meetei AR, et al. X-linked inheritance of Fanconi anemia complementation group B. Nat Genet. 2004;36:1219–1224. doi: 10.1038/ng1458. [DOI] [PubMed] [Google Scholar]
- 56.Ikeda H, et al. Genetic reversion in an acute myelogenous leukemia cell line from a Fanconi anemia patient with biallelic mutations in BRCA2. Cancer Res. 2003;63:2688–2694. [PubMed] [Google Scholar]
- 57.Takata M, et al. The Fanconi anemia pathway promotes homologous recombination repair in DT40 cell line. Subcell Biochem. 2006;40:295–311. doi: 10.1007/978-1-4020-4896-8_17. [DOI] [PubMed] [Google Scholar]
- 58.Michl J, Zimmer J, Tarsounas M. Interplay between Fanconi anemia and homologous recombination pathways in genome integrity. EMBO J. 2016;35:909–923. doi: 10.15252/embj.201693860. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Strathdee CA, Gavish H, Shannon WR, Buchwald M. Cloning of cDNAs for Fanconi’s anaemia by functional complementation. Nature. 1992;356:763–767. doi: 10.1038/356763a0. [DOI] [PubMed] [Google Scholar]
- 60.Moynahan ME, Pierce AJ, Jasin M. BRCA2 is required for homology-directed repair of chromosomal breaks. Mol Cell. 2001;7:263–272. doi: 10.1016/s1097-2765(01)00174-5. [DOI] [PubMed] [Google Scholar]
- 61.Davies AA, et al. Role of BRCA2 in control of the RAD51 recombination and DNA repair protein. Mol Cell. 2001;7:273–282. doi: 10.1016/s1097-2765(01)00175-7. [DOI] [PubMed] [Google Scholar]
- 62.Feng Z, et al. Rad52 inactivation is synthetically lethal with BRCA2 deficiency. Proc Natl Acad Sci USA. 2011;108:686–691. doi: 10.1073/pnas.1010959107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Friedrich G, Soriano P. Promoter traps in embryonic stem cells: A genetic screen to identify and mutate developmental genes in mice. Genes Dev. 1991;5:1513–1523. doi: 10.1101/gad.5.9.1513. [DOI] [PubMed] [Google Scholar]
- 64.Kisseberth WC, Brettingen NT, Lohse JK, Sandgren EP. Ubiquitous expression of marker transgenes in mice and rats. Dev Biol. 1999;214:128–138. doi: 10.1006/dbio.1999.9417. [DOI] [PubMed] [Google Scholar]
- 65.Wang L, et al. Comparative study of liver gene transfer with AAV vectors based on natural and engineered AAV capsids. Mol Ther. 2015;23:1877–1887. doi: 10.1038/mt.2015.179. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Smith L, et al. Functional mapping of tissue tropism of naturally occurring adeno-associated virus isolates from human hematopoietic stem cells. Mol Ther. 2011;19(Suppl 1):S127–S128. [Google Scholar]
- 67.Paulk NK, et al. Adeno-associated virus gene repair corrects a mouse model of hereditary tyrosinemia in vivo. Hepatology. 2010;51:1200–1208. doi: 10.1002/hep.23481. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Borel F, et al. Survival advantage of both human hepatocyte xenografts and genome-edited hepatocytes for treatment of alpha-1 antitrypsin deficiency. Mol Ther. 2017;25:2477–2489. doi: 10.1016/j.ymthe.2017.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Giel-Moloney M, Krause DS, Chen G, Van Etten RA, Leiter AB. Ubiquitous and uniform in vivo fluorescence in ROSA26-EGFP BAC transgenic mice. Genesis. 2007;45:83–89. doi: 10.1002/dvg.20269. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Cataldi MP, McCarty DM. Differential effects of DNA double-strand break repair pathways on single-strand and self-complementary adeno-associated virus vector genomes. J Virol. 2010;84:8673–8682. doi: 10.1128/JVI.00641-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Brown T. Southern blotting. Curr Protoc Mol Biol. 1993;21:2.9.1–2.9.20. doi: 10.1002/0471142727.mb0209as21. [DOI] [PubMed] [Google Scholar]
- 72.Sather BD, et al. Efficient modification of CCR5 in primary human hematopoietic cells using a megaTAL nuclease and AAV donor template. Sci Transl Med. 2015;7:307ra156. doi: 10.1126/scitranslmed.aac5530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Wang J, et al. Homology-driven genome editing in hematopoietic stem and progenitor cells using ZFN mRNA and AAV6 donors. Nat Biotechnol. 2015;33:1256–1263. doi: 10.1038/nbt.3408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Wang J, et al. Highly efficient homology-driven genome editing in human T cells by combining zinc-finger nuclease mRNA and AAV6 donor delivery. Nucleic Acids Res. 2016;44:e30. doi: 10.1093/nar/gkv1121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.De Ravin SS, et al. Targeted gene addition in human CD34(+) hematopoietic cells for correction of X-linked chronic granulomatous disease. Nat Biotechnol. 2016;34:424–429. doi: 10.1038/nbt.3513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Ling C, et al. High-efficiency transduction of primary human hematopoietic stem/progenitor cells by AAV6 vectors: Strategies for overcoming donor-variation and implications in genome editing. Sci Rep. 2016;6:35495. doi: 10.1038/srep35495. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





