Abstract
DNA base editors use deaminases fused to a programmable DNA-binding protein for targeted nucleotide conversion. However, the most widely used TadA deaminases lack post-translational control in living cells. Here, we present a split adenine base editor (sABE) that utilizes chemically induced dimerization (CID) to control the catalytic activity of the deoxyadenosine deaminase TadA-8e. sABE shows high on-target editing activity comparable to the original ABE with TadA-8e (ABE8e) upon rapamycin induction while maintaining low background activity without induction. Importantly, sABE exhibits a narrower activity window on DNA and higher precision than ABE8e, with an improved single-to-double ratio of adenine editing and reduced genomic and transcriptomic off-target effects. sABE can achieve gene knockout through multiplex splice donor disruption in human cells. Furthermore, when delivered via dual adeno-associated virus vectors, sABE can efficiently convert a single A•T base pair to a G•C base pair on the PCSK9 gene in mouse liver, demonstrating in vivo CID-controlled DNA base editing. Thus, sABE enables precise control of base editing, which will have broad implications for basic research and in vivo therapeutic applications.
Subject terms: Molecular engineering, CRISPR-Cas9 genome editing, CRISPR-Cas systems
TadA deaminases widely used in many base editors lack post-translational control in cells. Here the authors report a split adenine base editor (sABE) using chemically induced dimerisation (CID) to control the catalytic activity of TadA8e and show this can be used for PCSK9 gene editing in the mouse liver.
Introduction
As an emerging class of precision genome-editing tools, DNA base editors consist of a deaminase fused to a programmable DNA-binding protein, enabling targeted nucleotide conversions without introducing double-stranded DNA breaks1,2. Adenine base editors (ABEs) utilize an evolved Escherichia coli tRNA adenosine deaminase (TadA) and act on a single-stranded DNA substrate for A•T to G•C base conversions2, which have been tested in animal models3–9 and primary human cells4,8–11 with various applications, including site-directed mutagenesis12, gene silencing13, gene knockout10, gene isoform discovery14, functional screens of epigenetic markers15 or pathogenic mutations16, and molecular recording17. ABEs are particularly useful for investigating or therapeutically correcting human pathogenic alleles because nearly half of the disease-causing point mutations could be corrected by reversing the pathogenic A•T base pair to a G•C base pair18,19. Recently, TadA-ABEs have been re-engineered to achieve other types of base editing, including C-to-T20–22, C-to-G21, C/A-to-T/G20,22, or A-to-Y23 conversions.
However, the lack of precise control over the deamination activity of ABE limits its application in research and therapy. The current TadAs in ABEs are constitutively active, and the uncontrolled deaminase can cause undesirable genomic and transcriptomic off-target effects24–27, raising concerns for ABEs’ application for the production of genetically modified organisms and gene therapy. For instance, BEs lead to both genomic and transcriptomic off-target due to long-term expression in vivo in transgenic mice, and mice zygotes injected with ABE7.10 encoded AAV exhibit low birth rates28. Although inducible promoters can be used to regulate the expression of ABEs29,30, the leaky expressions and the delayed response from transcription to translation are highly undesirable. Post-translational inducible control of Cas proteins31–34 can potentially regulate ABE recruitment to the genome but still cannot directly control the deaminase activity of ABE, which does not curtail its off-target effects25–27,35. Thus, precision control of the deaminase activity of ABEs would greatly expand its applications.
Here, we present a split ABE (sABE) design with inducible deaminase activity by integrating the chemically induced dimerization (CID) system36. We demonstrate that ABE8e24 can be split into two inactive parts in the TadA-8e deaminase domain: one fused to FK506-binding protein 3 (FKBP3) and the other to FKBP-rapamycin binding (FRB) protein37. These two ABE components can reassemble into an active form upon rapamycin-induced FRB-FKBP3 heterodimerization. Through extensive engineering and optimization, we engineer sABE v3.22, which shows efficient and precise on-target single adenine editing upon rapamycin induction and significantly reduced genomic and transcriptomic off-target effects. Using dual adeno-associated viral (AAV) vectors to deliver sABE v3.22 in mice, we perform inducible editing of the PCSK9 gene and demonstrate high precision on the targeted adenine, showcasing in vivo CID-controlled DNA base editing.
Results
Chemically inducible split ABE (sABE) with tightly regulated deaminase activity
To monitor the DNA deaminase activity, we created a fluorescence reporter by introducing a premature stop codon into the EYFP gene via a C•G to T•A base pair conversion, rendering it dysfunctional (EYFP*) (Fig. 1a). ABE guided by a single guide RNA (sgRNA) can edit the adenine on the antisense strand of the EYFP* gene and convert the A•T base pair back to the G•C base pair, thereby restoring the original glutamine codon and resulting in the expression of full-length, functional EYFP (Fig. 1a). We validated the response of EYFP fluorescence to ABE8e in HEK293T cells, without detectable background fluorescence in the absence of ABE8e (Fig. 1a).
To achieve inducible control over ABE deaminase, we split TadA-8e into two inactive parts (TadA-8eN and TadA-8eC) and fused each part to FRB and FKBP3, respectively. In the presence of rapamycin, FKBP3 and FRB will heterodimerize, bringing the two parts of TadA-8e into proximity and enabling their assembly into a functional unit (Fig. 1b). We constructed sABE v1 and sABE v2 by splitting the TadA-8e38 deaminase into two fragments. The split sites occurred at loop-25 for sABE v1 and loop-74 for sABE v2 (Fig. 1d). An FKBP3-FRB dimer insertion into these peripheral flexible loop regions is unlikely to alter the TadA-8e core catalytic domain or the reassembly of TadA-8eN and TadA-8eC. In sABE v1, TadA-8eN contains the first 24 amino acids of the TadA-8e, which is linked to an FRB via a flexible linker to its C-terminus (Supplementary Fig. 2a). We also fuse a bipartite SV40 nuclear localization signal (NLS) at the N-terminus of TadA-8eN. TadA-8eC contains the remaining 142 amino acids of the TadA-8e and is fused to an FKBP3 at its N terminus and a Streptococcus pyogenes Cas9 nickase (nSpCas9, D10A) at its C terminus. Each terminus has a monopartite SV40 NLS. These two components, sABE(N) and sABE(C), are expressed separately from two plasmids under a cytomegalovirus promoter (pCMV). The constructs in sABE v2 are similar, except that the split site occurs after Arginine 74 of TadA-8e. We co-transfected HEK293T cells with plasmids encoding sABE(N), sABE(C), EYFP*, and sgRNA, using EBFP as a negative control, followed by induction of the sABE activity with 100 nM rapamycin 12 h after transfection (Fig. 1c). Forty-eight hours after induction, we quantified the normalized fluorescence intensity and the percentage of EYFP-positive cells by flow cytometry (Fig. 1c, Supplementary Fig. 1). sABE v1 and v2 successfully activated the EYFP reporter upon rapamycin induction, with sABE v2 showing higher EYFP activation but also higher background (Fig. 1h, Supplementary Fig. 2b). To further investigate split sites adjacent to Arginine 74, we created sABEs v2.1 to v2.4 by shifting the split site one amino acid at a time (Supplementary Fig. 2c). We found that sABE v2.3, with the split occurring after Isoleucine 76 of TadA-8e, had higher EYFP activation upon rapamycin induction and lower background compared to sABE v2 (Supplementary Fig. 2d).
To further improve the rapamycin-induced deaminase activity and reduce the background activity under the non-induced condition, we optimize components in the sABE construct (Fig. 1e). First, we developed sABE v2.7 by adding a nucleoplasmin NLS to the C terminus of sABE v2.3(N) while keeping the same sABE v2.3(C). Although this modification enhanced editing efficiency, we also observed a higher background activity, probably due to the auto-reassembly of the TadA-8e fragments in the nucleus when they are abundant (Fig. 1h, Supplementary Fig. 3b). Next, we characterized the effect of the dimerization domain copy number on sABE activity. We constructed sABE v2.8, v2.9, and v3.11 by introducing an additional copy of the dimerization domains based on sABE v2.3 (Supplementary Fig. 2e). We found that sABE v3.11, harboring two copies of FRB domain at the C terminus of sABE(N) and two copies of FKBP3 domain at the N terminus of sABE(C), led to a comparable level of EYFP reporter activation with a reduced background (Supplementary Fig. 2f). We then tested different types of linkers with varying lengths between TadA-8eN and 2×FRB domain in sABEv3.11(N) and between 2×FKBP3 domain and TadA-8eC in sABE v3.11(C), creating four versions of sABE(N) and four versions of sABE(C). We transfected different combinations of resulting sABE(N) and sABE(C) constructs and screened a total of 16 sABEs (v3.11 to v3.44) using our fluorescence reporter assay (Supplementary Fig. 3a). We chose sABE v3.22 as the final version since it showed comparable EYFP activation with sABE v3.11 while exhibiting significantly reduced background activity under the non-induced condition (Fig. 1f).
Further, after evaluating a range of rapamycin concentrations, we found that 100 nM rapamycin effectively activated sABE v3.22 (Fig. 1g, Supplementary Fig. 3d). We decided to use this concentration for subsequent experiments. In addition, we selected five sABEs (v1, v2, v2.3, v2.7, and v3.11) to compare reporter assay responses and endogenous gene editing efficiencies at three genomic sites in HEK293T cells. The results showed a strong correlation between the sABE activities in these two assays (Fig. 1h, i, Supplementary Fig. 3b, c). We also examined whether the sABE system could be deactivated. Cells transfected with sABE v3.22 were treated with 10, 25, 50, or 100 nM rapamycin for 2 h, after which the culture medium was changed to remove the rapamycin. Both the reporter assay and genomic editing data showed sABE v3.22 activation in rapamycin-treated groups. The group from which rapamycin was removed showed decreased deaminase activity compared to the rapamycin-sustained group (Supplementary Fig. 4). This effect was less significant when the initial concentration of rapamycin was increased beyond 50 nM, likely due to the residual intracellular rapamycin and the inefficient excretion and degradation of rapamycin in HEK293T cells in vitro39. In sum, we successfully split the ABE8e into two inactive parts at the TadA-8e deaminase domain and rendered its deaminase activity chemically inducible using the FKBP3-FRB CID. Through engineering approaches and fluorescence reporter screening, we developed sABE v3.22, which has a high level of induced base editing activity and a low level of non-induced background activity.
sABE v3.22 achieves high DNA on-target editing efficiencies and enhanced precision
We compared the performance of sABE v3.22 to the intact ABE8e by targeting 19 human genomic loci that span different sequence contexts (Fig. 2a, Supplementary Fig. 5a). ABE8e achieved A-to-G conversions ranging from 7.2% to 72% in the conventional A4-A8 activity window, with a mean of 56% at the A4-A5 positions. In the absence of rapamycin, sABE v3.22 showed very low background A-to-G conversions in the A4-A8 window ranging from 0.1% to 3.1%, with a mean of 0.7%. The deaminase activity of sABE v3.22 was induced by an average of 89-fold (ranging from 15-fold to 389-fold), reaching a mean of 80% (ranging from 53% to 97%) of the activities of intact ABE8e at A4-A5 positions (Fig. 2b). Additionally, sABE v3.22 exhibited a narrower activity window of A4 and A5, with reduced activity on A6 and A7 and minimum activity elsewhere in the protospacer (Fig. 2c, Supplementary Fig. 5b).
As a result of the narrower activity window of sABE v3.22, we observed a significant change in the distribution of reads with A-to-G conversions. For example, sABE v3.22 and ABE8e achieved comparable (62% and 65%) A-to-G conversion on A5 at Site 9. However, of all reads with A-to-G conversions from the intact ABE8e-transfected samples, only 0.53% had single A5 editing. Over 99% had A5 and A7 double edits or more than two A-to-G conversions across the seven adenines between positions 2 and 12 (Fig. 2d). In contrast, in the sABE v3.22-transfected samples, 74% of all reads with A-to-G conversions had single A5 editing, 26% showed A5 and A7 editing, and there was no editing at more than two adenines. At nine out of the 19 sites tested, the ratio of single adenine edits increased from <25% to >73% when using sABE v3.22 (Fig. 2e). At the other ten sites, the ratios of single and double edits increased, while the ratio of multiple edits decreased significantly (Supplementary Fig. 5c). Taken together, the sABE v3.22 system demonstrates higher precision and reduced bystander editing compared to ABE8e, which would allow for more precise single adenine editing.
We subsequently constructed and compared the performance of sABEs with different TadA variants, including sABE(V106W) v3.22 and sABE(F148A) v3.22, as V106W25 and F148A27 are beneficial mutations that decrease TadA transcriptomic off-target effects. Among the eight genomic sites tested, the sABE v3.22 demonstrated a mean A-to-G conversion rate of 62% (ranging from 30% to 82%), achieving 94% (ranging from 57% to 113%) of the intact ABE8e activity (ranging from 52% to 76%) with an average induction of 18-fold (ranging from 4-fold to 48-fold) (Supplementary Fig. 6a). sABE(V106W) v3.22 showed an average A-to-G conversion rate of 50% (ranging from 22% to 69%), achieving 73% (ranging from 38% to 97%) of the intact ABE8e(V106W) activity (ranging from 56% to 76%) with an average induction of 28-fold (ranging from 4-fold to 83-fold) (Supplementary Fig. 6b). sABE(F148A) v3.22 exhibited an average of 57% A-to-G conversion rate (ranging from 22% to 69%) among these sites, achieving 82% (ranging from 56% to 97%) of the ABE8e(F148A) activity (ranging from 60% to 77%), with an average induction fold of 24-fold (ranging from 4-fold to 77-fold) (Supplementary Fig. 6c). Consistent with sABE v3.22, both V106W and F148A variants demonstrated a narrower editing window, with peak activity at the A4 and A5 positions.
We further explore the compatibility of sABE v3.22 by replacing the nSpCas9 with the more compact Staphylococcus aureus Cas9 nickase (nSaCas9)24,40. At the two genomic sites tested, sSaABE8e demonstrated editing efficiencies of 44% and 13%, respectively. We also exhibited an average of 30-fold induced activity compared to the non-induced group (Supplementary Fig. 7a). Under optimized assay conditions, TadA8e, when coupled with the engineered dead Cas12f variants from an uncultured archaeon (Un1Cas12f1), specifically CasMINI v3.1 and CasMINI v441–43, along with engineered sgRNA scaffold ge4.144, resulted in 2-4% A-to-G conversion rates across the three sites examined (Supplementary Fig. 7b, c). The corresponding split systems, sCasMINI v3.1 and sCasMINI v4 showed 1-2% A-to-G conversion rates with undetectable background (Supplementary Fig. 7b, c). Together, these data demonstrate that the sABE v3.22 architecture paired with SpCas9 showed superior editing efficiency and is compatible with other TadA variants bearing beneficial mutations, such as V106W and F148A. sABE v3.22 architecture is also compatible with smaller Cas domains, including SaCas9 and engineered Un1Cas12f1.
Genomic and transcriptomic off-target effects in mammalian cells
We analyzed genomic off-target effects of sABE v3.22 and ABE8e in HEK293T cells at Cas9-dependent DNA off-targets that have been reported11 or predicted using Cass-OFFinder45 (Fig. 3b). We detected A-to-G conversions at 13 out of the 15 analyzed off-target sites. In the absence of rapamycin, sABE v3.22 exhibited low non-induced A-to-G conversions at on-target sites (mean 1.2%) and at off-target sites (mean 0.52%) within the A4-A8 window (Supplementary Fig. 8a). With 100 nM rapamycin induction, sABE v3.22 showed a mean of 8.6% off-target A-to-G conversions within the A4-A8 activity window, decreasing the Cas9-dependent off-target effects by >75% compared to the intact ABE8e (mean 35%), and resulting in 1.8 ~ 130-fold increases in the on-to-off-target ratio (Fig. 3a, c). Furthermore, due to the narrower activity window of sABE v3.22, we observed no A-to-G conversion outside the A4-A8 window on Cas9-dependent off-targets (Supplementary Fig. 8a).
Next, we characterized Cas9-independent off-target effects of sABE v3.22 and ABE8e using the previously established orthogonal dSaCas9 R-loop assay46 (Fig. 3d). In this assay, HEK293T cells are cotransfected with plasmids containing sABE v3.22 constructs and a SpCas9 sgRNA specific for the desired on-target site, along with additional plasmids encoding a dead SaCas9 (dSaCas9) and a SaCas9 sgRNA which is orthogonal to the SpCas9 sgRNA and targets an unrelated genomic locus46.In this setup, dSaCas9 unwinds the DNA double helix to reveal single-stranded DNA, which can serve as a substrate for the deaminase fused to nSpCas9, independently of SpCas9 binding. Thus, Cas9-independent off-target effects could be determined by measuring A-to-G conversion rates at this genomic locus unrelated to the SpCas9 target sequences. We observed Cas9-independent off-target A-to-G conversions by intact ABE8e ranging from 0.87% to 9.2% across the five tested orthogonal R-loops (Fig. 3f). sABE v3.22 reduced these off-target activities to undetectable levels in three orthogonal R-loops and <0.36% in the other two sites. Meanwhile, DNA on-target editing efficiencies remain comparable between sABE v3.22 and ABE8e (Fig. 3e, Supplementary Fig. 8b). The non-induced group showed no difference from the mock-transfected control (Fig. 3e, Supplementary Fig. 8b). Additionally, we repurposed our EYFP* reporter assay to detect Cas9-independent off-target effects by cotransfecting ABEs with dSaCas9 and a SaCas9 sgRNA that aims to form an R-loop at the premature stop codon site. Consistent with the genomic R-loop assay, we detected activated EYFP fluorescence in the intact ABE8e-transfected HEK293T cells but not in sABE v3.22-transfected cells (Supplementary Fig. 8c, d), indicating lower Cas9-independent off-target editing using sABE v3.22.
To compare the extent of transcriptomic off-target effects of sABE v3.22 and ABE8e (Fig. 3g), we transfected HEK293T cells with plasmids encoding ABE8e-P2A-EGFP, sABE v3.22(C)-P2A-EGFP-P2A-sABE v3.22(N), or nCas9(D10A)-P2A-EGFP (Supplementary Fig. 9a). Each plasmid also encodes a sgRNA targeting Site 11. We sorted the transfected cells with the top 5% mean fluorescence intensities and extracted their RNA and DNA for high-throughput sequencing or Sanger sequencing (Supplementary Fig. 10). In sorted cells, rapamycin-induced sABE v3.22 achieved comparable on-target DNA editing (mean A5 73%) to ABE8e (mean A5 79%). Non-induced sABE v3.22 showed a higher background activity (mean A5 20%) when compared to those observed in previous experiments since the sorted cells had the highest expression of ABEs, indicated by their EGFP fluorescence intensities (Fig. 3h, Supplementary Fig. 9b). Using the Genome Analysis Toolkit47 (GATK) best practices for variant calling and further downstream filtering, we identified mRNA nucleotide positions that were altered in cells expressing ABE8e, sABE v3.22, or nCas9 but not in the mock-transfected controls (details in Methods). We found a significant increase in transcriptome-wide A-to-I single nucleotide variations in ABE8e-transfected HEK293T cells (mean 24,670) compared to the nCas9(D10A)-transfected cells (mean 125) (Fig. 3i, Supplementary Fig. 9c). Meanwhile, sABE v3.22 reduced 70% of transcriptome A-to-I mutations, with a mean of 7,279 A-to-I conversions called. Without rapamycin induction, the number of transcriptional A-to-I mutations in the sABE v3.22-transfected cells (mean 149) was similar to that in nCas9(D10A)-transfected cells (mean 125). In summary, these data suggest that the small-molecule-controlled sABE v3.22 maintains a comparable level of on-target activity with reduced genomic and transcriptomic off-target effects.
Inducible multiplex gene knockouts in mammalian cells
ABEs can achieve gene knockout by targeting gene splice donor regions, leading to disrupted pre-mRNA splicing processes, such as exon skipping, intron inclusion, and cryptic splice-site utilization10,48. To assess the performance of sABEv3.22 for inducible gene knockout, we targeted two genes expressing Beta-2 microglobulin (B2M) or CD46 regulatory proteins that have been widely studied in the context of allogeneic cell therapies and cancer research49–52. We utilized our recently reported drive-and-process multiplex base editing (DAP-MBE) array53 to express multiple sgRNAs that guide ABEs to disrupt B2M or CD46 splice donors (Fig. 4a). To compare sABE v3.22 and intact ABE8e, we co-transfected HEK293T cells with the DAP-MBE array expressing two sgRNAs targeting B2M splice donors and either sABE v3.22 or ABE8e plasmids, followed by 100 nM rapamycin treatment 12 h later. Cells were cultured for another 8 days for B2M protein degradation and cell division before antibody-based FACS analysis (Supplementary Fig. 11, 12). With rapamycin induction, sABE v3.22 achieved over 60% knockout efficiency for B2M, which was similar to intact ABE8e (Fig. 4b)53. There was no difference between the mock-transfected group and the non-induced group transfected with sABE v3.22 and DAP-MBE array (Fig. 4b). We also compared the DAP-MBE array with individual sgRNAs or pooled sgRNAs delivered from two plasmids. Targeting both splice donors resulted in a higher B2M knockout rate compared to targeting only the B2M intron 1 splice donor (mean 48% by sABE v3.22 and 53% by ABE8e) or only the B2M intron 2 splice donor (mean 9% by sABE v3.22 and 17% by ABE8e) (Fig. 4b, Supplementary Fig. 13a). The DAP-MBE strategy led to more efficient B2M knockout compared to pooled sgRNAs strategy.
Among the four CD46 splice donors with nearby NGG sequences, only two can be targeted with the target adenine in the ABE activity window. Therefore, we constructed ABE8e-SpG and sABE v3.22-SpG by integrating the recently reported SpCas9 variant SpG54 with a relaxed NGN PAM requirement instead of NGG. Through this approach, we were able to target five additional splice donors. We found that multiplex disruption of seven CD46 splice donors using the DAP-MBE array led to the highest CD46 knockout efficiency, with a mean of 71% by sABE v3.22 and 69% by ABE8e-SpG (Fig. 4c). The group with non-induced sABE v3.22-SpG showed no statistical difference from the mock-transfected group. Similar to the B2M knockout results, disrupting fewer CD46 splice donors was less efficient than disrupting all seven targetable splice donors (Fig. 4c, Supplementary Fig. 13b). The DAP-MBE strategy also led to more efficient CD46 knockout compared to the pooled sgRNAs strategy (38% by sABE v3.22-SpG and 48% by ABE8e).
Inducible in vivo editing of mouse PCSK9 gene
To explore the in vivo application of sABE v3.22, we packaged it into three AAV vectors. We split the sABE(C) into two parts before lysine 468 in the nCas9 domain, fused each part to the corresponding split moiety of gp41-1 intervening protein (intein)55, and packaged them into separate AAV vectors. The sABE(N) and a sgRNA targeting Site 9 were packaged into a third AAV vector (Fig. 5a). When delivered into HEK293T cells via triple AAV vectors, the sABE v3.22 achieved 40% on-target A5 editing with 100 nM rapamycin induction and showed 4.2% A5 background activity without induction (Fig. 5b, Supplementary Fig. 14a). Notably, of the sequencing reads with A-to-G conversions, 92% showed single A5 editing (Supplementary Fig. 14b). Similarly, when the sABE v3.22 system was delivered into HEK293T cells via three lentiviral vectors (Supplementary Fig. 14c), we observed 43% on-target A5 editing with 100 nM rapamycin induction and 3.0% A5 background activity without induction (Supplementary Fig. 14d). Consistently, 86% of sequencing reads with A-to-G conversions showed single A5 editing (Supplementary Fig. 14e). These results demonstrate the compatibility of our sABE system with both viral platforms for gene delivery.
The proprotein convertase subtilisin-like kexin type 9 (PCSK9) gene is an attractive target for treating atherosclerotic cardiovascular diseases4,5,56. Adenine base editing of the PCSK9 gene has been shown to lower cholesterol in vivo durably4,5. To test inducible gene editing on the PCSK9 gene in mice, we rearranged and incorporated the sABE v3.22 components into two AAV vectors, targeting the mouse PCSK9 (mPCSK9) intron 1 splice donor (Fig. 5c). The first vector carried the first half of sABE v3.22(C) driven by a pP3 liver-specific promoter57 and the sgRNA driven by a pU6 promoter. The second vector carried the other half of sABE v3.22(C), linked to the sABE v3.22(N) through a P2A self-cleaving peptide and promoted by pP3 (Fig. 5d). We packaged the sABE v3.22 into AAV serotype 1 and transduced it into a HEK293T stable cell line integrated with a 200-bp mPCSK9 gene fragment containing the mPCSK9 intron 1 splice donor. We observed a mean of 40% targeted A6 edit with 13% bystander A4 edit 14 days after transduction, while the non-induced activity was <2.5% (Fig. 5e).
To ensure tissue-specific delivery of the sABE v3.22 system in vivo, we packaged the dual-AAV sABEv3.22 into AAV8, a serotype with high tropism for hepatocytes58. We delivered 1 × 1011 genome copies (gc) of each AAV to 15-week-old C57BL/6 mice via tail-vein injection (Fig. 5f). Three days later, we treated the mice with 3 mg/kg rapamycin every other day for 8 days via intraperitoneal (i.p.) injection. We euthanized the mice 6 days after the last rapamycin dose, isolated genomic DNA from their liver tissue, and analyzed the on-target efficiency. We observed a mean of 2.4% non-induced background targeted A6 A-to-G conversion. With just four doses of rapamycin, the dual-AAV-delivered sABE v3.22 exhibited up to 40% targeted A6 editing with a mean of 1.6% low bystander A4 editing (Fig. 5g, Supplementary Fig. 15a). We also observed no A-to-G conversion above the background at reported Cas9-dependent off-target sites5 (Supplementary Fig. 15b, c). Thus, the inducible sABE v3.22 is suitable for in vivo applications.
Discussion
We present a chemically inducible sABE architecture that utilizes rapamycin-induced dimerization of FKBP3 and FRB to control the activity of TadA-8e deaminase. We engineered an sABE construct (v3.22) that exhibits low background activity and, upon induction, achieves high base-editing activity comparable to ABE8e. The sABE v3.22 system demonstrates higher precision than ABE8e, allowing for more precise single adenine editing with significantly reduced off-target effects. In addition, sABE v3.22 architecture is compatible with TadA variants bearing beneficial mutations, including V106W and F148A, that have the potential to further mitigate transcriptomic off-target effects, as well as other Cas effectors with alternative PAMs, such as SaCas9 and engineered Un1Cas12f1. sABE v3.22 enables highly efficient knockout of human endogenous genes, including B2M and CD46, via multiplex disruptions of splice donors. This multiplex strategy could potentially be applied to other genes. Upon packaging into dual-AAV vectors, the sABE v3.22 achieves efficient inducible base-editing of a therapeutically relevant PCSK9 gene in the mouse liver. We envision that our control of the sABE system could potentially mitigate the risks associated with the prolonged expression of active ABE8e59. Thus, sABE greatly expands the capability of inducible gene editing, with broad implications for basic research and in vivo therapeutic applications. The simplicity of sABE v3.22 applications makes it a valuable tool for inducibly introducing precision A-to-G conversions compared to Cas9 nuclease editing60 and prime editing technologies61,62.
Given that the split site (isoleucine 76) can accommodate the insertion of 2×FRB and 2×FKBP, we expect the sABE architecture can be adapted to other post-translational control mechanisms that rely on protein dimerization, including FKBP-FRB with rapamycin analogs (rapalogs)63, chemically inducible split proteins64, proteolysis targeting chimeric (PROTAC)-chemically induced dimerization systems65, and light-induced dimerization (LID) systems66. Adapting the sABE v3.22 to work with rapamycin-independent dimerization systems could potentially mitigate side effects of rapamycin, such as immunosuppression63 and PCSK9 upregulation67. Moreover, we envision that our sABE architecture can be extended to other TadA-containing base editors, such as the recently reported TadCBEs20–22(cytosine base editors), TadDEs20,22 (dual base editors), AYBE23 (an adenine transversion base editor), Td-CGBE21 (a C-to-G base editor), TALE-ABEs68 (a mitochondrial base editor), and inlaid ABEs69,70 (for simultaneous control of Cas and deaminase activity), which will greatly facilitate the high-precision genome editing applications.
In summary, we demonstrate an inducible split ABE system that utilizes small-molecule induced dimerization to regulate the TadA-8e deaminase for controllable, efficient, high-precision, and in vivo applicable A-to-G base editing. Our work expands the scope of inducible genome editing with the potential for broad research and therapeutic applications.
Methods
Ethical Statement
All research conducted complies with relevant regulations. Animal experiment protocols were approved by the Institutional Animal Care and Use Committee (IACUC) of Baylor College of Medicine (BCM).
Molecular cloning
DNA amplifications were performed by PCR using the 2 × Phanta Max master Mix (Dye Plus, Vazyme, P525). Vectors were linearized by PCR or by restriction digestion. A PCR program with 60 °C annealing temperature and 25 cycles was programmed for 20 µl PCR reaction systems with 100 pmol of each primer and 10-50 ng template to amplify DNA fragments > 2 kb; the cycle number was changed to 35 to amplify DNA fragments <2 kb. Gel electrophoresis of the amplified DNA was conducted in 1.5% DNA agarose gel with 0.5 µg/ml UltraPure Ethidium Bromide (Thermo Fisher Scientific, BP1302-10). The small gel region containing the target DNA fragment was excised, and DNA was extracted using QIAquick Gel Extraction Kit (Qiagen, 28704). Golden Gate assembly was used to assemble the DNA fragments in a 10 µL system containing purified DNA fragments, 1 µl 10 × T4 DNA ligase buffer (New England BioLabs, B0202S), 0.5 µl T4 DNA ligase (200U, New England BioLabs, M0202S), and 0.5 µl BsaI-HFv2 (10U, New England BioLabs, R3733S) or Esp3I (5U, Thermo Fisher Scientific, ER0452). The Golden Gate assembly mixture was cycled between 37 °C and 16 °C for 5 min at each temperature for 15 cycles and incubated at 60 °C for 5 min as the last step. Transformations were performed using Stbl3 competent cells prepared by the Mix & Go! E. coli Transformation Kit (Zymo, T3001).
DNA oligonucleotides were obtained from Integrated DNA Technologies (IDT). Plasmids containing short insertions, such as sgRNA protospacers, were constructed by ligating annealed and phosphorylated oligonucleotides with other amplified DNA fragments through the Golden Gate assembly. A 20 µl annealing system containing 0.2 nmol of each oligonucleotide and 2 µl 10 × T4 DNA ligase buffer (New England BioLabs) was heated up to 95 °C for 5 min, followed by −1 °C/min ramp down to 37 °C. Next, 1 µl annealed oligonucleotides were added to a 10 µL system containing 0.5 µL T4 Polynucleotide Kinase (5U, New England Biolabs, M0201S) and 1 µL T4 DNA ligase buffer (New England BioLabs) and was incubated at 37 °C for 30 min. Finally, 1 µl annealed and phosphorylated oligonucleotides were used for the Golden Gate assembly. ABE8e (#138489) and lentiGuide-Puro (#52963) were obtained from Addgene and used directly or as PCR templates. FKBP3, FRB, and gp41-1 intein were synthesized by gBlocks (IDT).
Plasmids were isolated using buffers from QIAprep Spin Miniprep Kit (Qiagen, 27104) and were filtered and collected from DNA spin columns (Epoch Life Science, 1920-250). Constructs were verified by Sanger sequencing across all assembly junctions and key regions, including the sequences of the deaminase, FKBP, and FRB. The annotated sequences of each key plasmid developed in this work are available in the shared Benchling links (Supplementary Data. 1).
Mammalian cell culture
HEK293T cells (American Type Culture Collection, CRL-3216) were cultured in Dulbecco’s Modified Eagle’s Medium (DMEM) plus GlutaMAX (Gibco, 10569044) supplemented with 10% (v/v) fetal bovine serum (Gibco, 10437028) and 1% (v/v) penicillin-streptomycin (Gibco, 15140122), hereafter referred as the complete media, in 10 cm TC treated cell culture dish with vents (Greiner Bio-One, 664160). Cells were grown at 37 °C in 5% CO2 incubators and passaged upon reaching 80% confluency.
Transfection
Cells of low passage number (1–10, passage number of freshly thawed cells is counted as 0) were counted by Countess II FL (Thermo Fisher Scientific) and plated at 2 × 104 cells per 100 µl complete media per well for reporter experiments or 0.75 × 104 cells per 100 µl complete media per well for genomic editing experiments in 96-well plates (Corning, 3598) 16~20 h before transfections. The seeded plate was incubated at room temperature for 15 min before being placed into the incubator. For each well on the plate, transfection plasmids and 0.5 µL Lipofectamine 2000 (Thermo Fisher Scientific, 11668019) were separately diluted in 5 µl Opti-MEM I Reduced Serum Medium (Thermo Fisher Scientific, 31985062). They were then combined into 10 µl and incubated at room temperature for 5 min before being pipetted onto the supernatant in each well. In EYFP reporter experiments, 60 ng EBFP plasmid, 60 ng EYFP* plasmid, 60 ng sgRNA plasmid, and 60 ng ABE8e or 60 ng of each sABE plasmid were transfected. Cells were collected for FACS 48 h after rapamycin addition. In the repurposed EYFP reporter R-loop experiment, 60 ng EBFP plasmid, 60 ng EYFP* plasmid, 60 ng dSaCas9 plasmid, and 60 ng SaCas9 sgRNA plasmid were transfected. In genomic editing experiments, 75 ng sgRNA plasmid and 225 ng ABE8e or 225 ng of each sABE plasmid were transfected. In the R-loop assay, 100 ng SpCas9 sgRNA plasmid, 150 ng ABE8e or 150 ng of each sABE plasmid, 100 ng SaCas9 sgRNA plasmid, and 200 ng dSaCas9 plasmid were transfected. 1 µl 10 µM rapamycin was added to each well in the induction group 10~16 h after transfection. Genomic DNA was extracted 72 h after induction.
Genomic DNA extraction
The media of each well was gently aspirated. Next, 100 µl freshly prepared lysis buffer [10 mM Tris-HCl, pH 7.5, 0.05% SDS, 25 µg/ml proteinase K (Thermo Fisher Scientific, EO0491)] was added to each well and was incubated at 37 °C for 15 min. The cell lysate was then heat-inactivated at 80 °C for 30 min and used immediately or stored at 4 °C.
Fluorescence-activated cell sorting
Fourty-eight hours post induction in the EYFP reporter experiments, the media of each well was gently aspirated. 100 µl TrypLE Express (Thermo Fisher Scientific, 12608-028) was added to each well and was incubated at room temperature for 5 min to detach cells. 200 µl complete media was then added to each well, and the mixture was pipetted 30 ~ 40 times for cell suspension. Flow cytometry was carried out on SA3800 Spectral Cell Analyzer (Sony Biotechnology), and the data was analyzed using FlowJo 10.8.1 (FlowJo, LLC). Live cells were gated by side scatter area versus forward scatter area (SSC-A vs. FSC-A). Singlets were selected by forward scatter height versus forward scatter area (FSC-H vs. FSC-A). The fluorescence-Positive population was gated against the mock-transfected control.
Targeted amplicon sequencing and data analysis
The genomic region flanking each targeted locus was amplified, purified, quantified, and sent for Sanger sequencing (Epoch Life Science) or high-throughput sequencing (HTS) (Amplicon-EZ, Genewiz). Amplicon and primer sequences are available in the shared Benchling links in Supplementary Data. 2, 3, and 4. Partial Illumina adapters provided by Amplicon-EZ were added to the 5’ end of each forward and reverse primer. A unique 6–8 bp barcode was added between the Illumina adapter and the genome-binding sequence to distinguish amplicons from different repeats or conditions pooled in the same sample. PCR was performed in a 10 µl system with 50 pmol of each primer, 10 ~ 50 ng template, and 5 µl 2 × Phanta Max master mix (Dye Plus, Vazyme). The annealing temperature was set to 60 °C, and 35 cycles were used for amplification. The desired DNA fragment was primarily extracted using buffer PB (Qiagen, 19066) when a single and clear band was expected or extracted using QIAquick PCR & Gel Cleanup Kit (Qiagen, 28506) after gel electrophoresis. For Sanger sequencing, each amplicon was eluted in 10 µl ultrapure water (Millipore) and quantified by NanoDrop One (Thermo Fisher Scientific). The Sanger sequencing premix was prepared by adding 1.5 µl eluted DNA ( ~ 40 ng) and 2.5 µl 10 µM sequencing primer into 9 µl ultrapure water. For amplicon HTS, multiple different amplicons were pooled together, then purified and eluted in 50 µl ultrapure water. 25 µl eluted DNA was sent in for Amplicon-EZ. Sanger sequencing results were analyzed using EditR (version 1.0.0, https://github.com/MoriarityLab/EditR). Amplicon HTS results were analyzed using CRISPResso2 (version 2.2.9, https://github.com/pinellolab/CRISPResso2).
RNA-seq experiment
Low-passage HEK293T cells were seeded at 5 × 106 cells per 10 ml complete media per 10-cm cell culture dish 16 h before transfection. 12 µg construct plasmid was added to 260 µl serum-free DMEM in a 50-ml tube, followed by the addition of 78 µl PEI Max (1 mg/ml pH = 7.1, Polysciences, 24765-100). The mixture was vortexed and incubated at room temperature for 10 min and then was diluted into a 10 ml complete media. This media replaced the old one in the 10 cm culture dish. 12 h after transfection, 1 µl 1 mM rapamycin was added to each 10-cm plate in the induction group. 48 h after transfection when cells were transfected with ABE8e-P2A-EGFP or nCas9-P2A-EGFP plasmid, or 48 h after induction when cells were transfected with the All-In-One sABE v3.22 plasmid, cells from each 10-cm plate were dissociated with 2 ml of TrypLE Express, centrifuged at 400 × g for 3 min at room temperature, and resuspended in 5 ml complete media. 0.5 to 0.7 × 106 cells with the top 5% GFP signal were sorted using the MA900 multi-application cell-sorter (Sony). Live cells were gated by the back scatter area versus the forward scatter area (BSC-A vs. FSC-A). Singlets were selected by forward scatter height versus forward scatter area (FSC-H vs. FSC-A). The fluorescence-positive population was gated against the mock-transfected control. RNA was extracted from the sorted cells using the E.Z.N.A Total RNA Kit (Omega Bio-Tek, M6399-00). A quarter of the sorted cells were collected in a separate tube for genomic DNA extraction and on-target DNA base editing analysis. RNA samples were submitted to the Cancer Genomics Center at the University of Texas Health Science Center at Houston (CPRIT RP180734). Total RNA was quality-checked using Agilent RNA 6000 Pico kit (#5067-1513) by Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, USA). RNA with an integrity number >7 was used for library preparation. Libraries were prepared with NEBNext Poly(A) mRNA Magnetic Isolation Module (E7490L, New England Biolabs), NEBNext Ultra II Directional RNA Library Prep Kit for Illumina (E7760L, New England Biolabs), and NEBNext Multiplex Oligos for Illumina (E6609S, New England Biolabs) following the manufacturer’s instructions. The qualities of the final libraries were examined using Agilent High Sensitive DNA Kit (#5067-4626) by Agilent Bioanalyzer 2100 (Agilent Technologies, Santa Clara, USA), and the library concentrations were determined by qPCR using Collibri Library Quantification kit (#A38524500, Thermo Fisher Scientific). The libraries were pooled evenly and went for the paired-end 75-cycle sequencing on an Illumina NextSeq 550 System (Illumina, Inc., USA) using High Output Kit v2.5 (#20024907, Illumina, Inc., USA).
RNA-seq data analysis
RNA-Seq data analysis was conducted according to the GATK Best Practices for RNA-seq short variant discovery (https://github.com/broadinstitute/gatk). Briefly, the RNA-Seq reads were first mapped to GRCh38 using STAR aligner (version 2.7.10a https://github.com/alexdobin/STAR) in two-pass mode with default parameters. Next, the Picard tool MarkDuplicates (version 2.27.4) was applied to mark duplicates in the sorted and mapped BAM files. The refined BAM files were subject to the GATK SplitNCigarReads tool (version 4.2.6.1), which splits reads that contain Ns in their cigar string because they span splicing junctions. Next, GATK BaseRecalibator (version 4.2.6.1) was used to generate a recalibration table for Base Quality Score Recalibration (BQSR). Known variants in dbSNP version 151 were used during BQSR. Finally, BQSR was applied, and “Analysis-Ready” BAM files were generated. Variant calling was done by GATK HaplotypeCaller (version 4.2.6.1) using default settings with an additional setting to not use the soft-clipped base. SNP variants were filtered out using the GATK selectVariant (version 4.2.6.1). Filters recommended by the GATK for variant calling on RNA-Seq data were used to hard-filtrate qualified variants. Clusters of more than three SNVs identified within a 35-bp window were filtered to maintain high-confidence variants. Hard fitering was applied to select qualified variants with QualByDepth >2.0, FisherStrand <30.0, StrandOddsRatio <3.0, RMSMappingQuality >40.0, MQRankSum >-12.5, ReadPosRankSum >-8.0, and QUAL > 30. The downstream analyses focused only on variants on canonical (1 ~ 22, X, Y, and M) chromosomes. A-to-G variants were selected, and bam-readcount (version 1.0.1 https://github.com/genome/bam-readcount) was used to quantify the per-base nucleotide abundances per A-to-G variant.
Inducible knockout experiments
Transfection was performed according to the dosage and method for genomic editing experiments (Described in the transfection method section). 72 h after the media change, the media was gently aspirated. Cells were detached with 100 µl TrypLE Express (Thermo Fisher Scientific) and were incubated at room temperature for 5 min. Next, 500 µl complete media was added to each well. The cell suspension was pipetted firmly 5 ~ 10 times before being transported to and cultured in 24-well treated tissue culture plates (Genesee Scientific, 25-107). Four days after the transfer, the media was gently aspirated. Cells were detached with 500ul TrypLE Express and were incubated at room temperature for 5 min. The suspended cells were pipetted firmly 5 ~ 10 times, transferred to 1.5 ml microcentrifuge tubes, and centrifuged at 500 × g for 3 min. The supernatant was discarded, and 500 µl cell staining buffer (BioLegend, 420201) was used to resuspend the cells. Cells were counted by Countess II FL (Thermo Fisher Scientific), and 2~3 × 105 cells were diluted in 100 µl cell staining buffer. 3 µl of 200 µg/ml FITC anti-human CD46 antibody (BioLegend, Catalog 315304, Lot B339203, Clone MEM-258) or 3 µl of 150 µg/ml PE/Cy7 anti-human β2-microglobulin antibody (BioLegend, Catalog 316318, Lot B371988, Clone 2M2) was mixed with the 100 µl cell suspension and was incubated in the dark on ice for 20 min. The supernatant was discarded, and the cells were washed with 500 µl cell staining buffer by centrifugation at 500 × g for 3 min. The final cell pellet was suspended in 500 µl cell staining buffer. FACS was performed using the SA3800 Spectral Cell Analyzer (Sony Biotechnology). Data were analyzed as described in the fluorescence-activated cell sorting section.
AAV production and transduction for dual-AAV in vitro editing
Low-passage HEK293T cells were seeded at 5 × 106 cells per 10 ml complete media per 10-cm cell culture dish (Greiner Bio-One) 16 h before transfection. 3 µg of transfer vector, 5 µg of pHelper plasmid (Cell Biolabs), and 4 µg of AAV-Rep-Cap plasmid (Addgene #112862) were added to 260 µl of serum-free DMEM in a 50-ml tube, followed by addition of 78 µl PEI Max (1 mg/ml PH = 7.1, Polysciences). The mixture was vortexed and incubated at room temperature for 10 min and then was diluted in a 10 ml complete media. This media replaced the old one in the 10-cm culture dish. 48 h after transfection, all supernatant was collected in a 15-ml tube and centrifuged at 3200 × g for 5 min at room temperature to remove the cell debris. The supernatant was then concentrated using a PEG virus precipitation kit (Biovision, K904-50) with an optimized protocol. Briefly, 2.5 ml PEG solution was added to the supernatant. The mixture was inverted evenly, refrigerated at 4 °C for 24 h, and then centrifuged at 3200 × g and 4 °C for 30 min. Several aspiration and centrifugation rounds were applied to remove the supernatant from the pellet entirely. The freshly prepared AAVs were used immediately for transduction. For AAV transduction, HEK293T cells were seeded at 1,500 cells per 100 µl complete media per well in the 96-well Poly-D-lysine plate (Corning, 356690) and the cells were incubated at room temperature for 15 min. Then, 10 µl of each AAV vector was added to each well, and transduced cells were cultured at 37 °C with 5% CO2. Genomic DNA extractions were performed on day 7 after transduction.
Lentivirus production and transduction for mPCSK9 HEK293T model
A 200-bp DNA fragment containing the mPCSK9 genomic locus of interest was amplified from the C57BL/6 mouse genome and was ligated to the lentiviral transfer plasmid (Addgene #52963) through Golden Gate assembly. Low-passage HEK293T cells were seeded at 2 × 104 cells per 100 µl complete media per well on a 96-well Poly-D-lysine plate (Corning) 16 h before transfection. They were incubated at room temperature for 15 min before transferring into the 37 °C, 5% CO2 incubator. For each well, 111 ng of transfer plasmid, 83 ng of packing plasmid psPAX2 (Addgene, #12260), and 56 ng of envelope plasmid pMD2.G (Addgene, #12259) were added to 5 µl Opti-MEM I Reduced Serum Medium (Thermo Fisher Scientific). The mixture was combined with another 5 µl Opti-MEM solution containing 0.5 µl Lipofectamine 2000 (Thermo Fisher Scientific) and incubated at room temperature for 5 min before being added to the well. 48 h after transfection, the supernatant was collected and centrifuged at 3000 × g for 5 min at room temperature. Low passage cells were seeded at 100,000 cells per 500 µl complete media per well in a 24-well plate (Genesee Scientific, 25-107) and incubated at room temperature for 15 min. 5 µl lentivirus-containing supernatant was added to the well on the 24-well plate, and the cells were cultured in 37 °C, 5% CO2 incubator. 24 h after transduction, the old culture media was replaced by 500 µl fresh complete media with 1 µg/ml puromycin (Gibco, A1113802) to initiate puromycin selection. When the surviving cells reached 80% confluency, they were dissociated with 200 µl/well TrypLE Express (Thermo Fisher Scientific) and added to 10 ml complete media containing 1 µg/ml puromycin in a 10-cm plate (Greiner Bio-One) for further proliferation. The stable cell line was verified by targeted genomic DNA PCR amplification followed by Sanger sequencing. The verified stable cell line was cryo-stored until use.
AAV and lentivirus production and transduction for triple-AAV or triple-lentiviral vector in vitro editing
2 × 106 HEK293T cells were seeded into 10-cm dishes (Greiner Bio-One) in 15 ml of complete media. When cells reached 30% confluency, for AAV production, 3 µg of pAAV2/2 plasmids (Addgene #104963) and 3 µg of pAdDeltaF6 plasmids (Addgene #112867) were mixed with 3 µg of transgene plasmids, 4 ml DMEM, and 60 µl PEI (1 mg/ml pH = 7.1, Polysciences). For lentivirus production, 3 µg of PspAX2 (Addgene #12260), 3 µg of pMD2.G (Addgene #12259), and 3 µg of transgene plasmids were mixed with 4 ml DMEM and 60 µL PEI (1 mg/ml PH = 7.1, Polysciences). The mixture was incubated for 20 min at room temperature before being added to the cell culture. Twenty-four hours after transfection, the old media was replaced by 15 ml fresh complete media. 72 h (or 48 h for lentivirus production) after media change, the cell culture medium was transferred to a 50 ml conical tube. Cells were dissociated with trypsin-EDTA (0.25%) (Gibco, 25200056) and transferred to the same tube. DMEM was added to achieve a final volume of 30 ml. 3 ml chloroform was added, and the mixture was vortexed for 5 min. Next, 7.6 ml of 5 M NaCl was added, and the mixture was vortexed for 10 s before centrifugation at 3000 ×g for 5 min at 4 °C. The aqueous phase was collected, and 9.4 ml 50% (v/v) PEG 8000 (Fisher BioReagents, BP2331) was added. This mixture was vortexed for 10 s and incubated at 4 °C overnight. The next day, it was centrifuged at 3000 × g for 30 min at 4 °C. The cell pellet was resuspended with 700 µl PBS buffer (Gibco, 10010-023). 1 µl Cryonase Cold-active nuclease (TaKaRa #2670 A) and 1.75 µl 1 M MgCl2 were added to each tube, and the mixture was incubated at room temperature for 30 mins. 700 µl chloroform was added, then the mixture was vortexed for 10 s and centrifuged at 3000 × g for 5 min at 4 °C. The virus-containing aqueous phase was either used immediately or stored at 4 °C.
HEK293T cells were seeded in 96-well plates (Corning) at 10% confluency for transduction. 12 h post seeding, 10 µL of each AAV virus or lentivirus was mixed and added into the culture medium. For lentivirus transduction, 5 µg/mL polybrene (Sigma #TR1003G) was supplied into the cell media. The media was replaced with fresh complete media after 24 h. Three days later, cells were detached and transferred to a 24-well plate (Genesee Scientific). After another 4 days, genomic DNA was extracted.
AAV production for animal studies
High-purity AAV viruses with AAV2 inverted terminal repeat pseudo-typed with AAV8 capsid were produced by the Gene Vector Core at the Baylor College of Medicine. The titers of AAV viruses were measured by real-time qPCR.
Animal studies
A total of twelve C57BL/6 male mice used in animal experiments were purchased from the Jackson Laboratory. They were maintained and handled following laboratory animal treatments approved by the Institutional Animal Care and Use Committee (IACUC) of Baylor College of Medicine (BCM). All mice were housed in an animal facility with standard conditions such as pathogen-free, light-dark cycle (12 h:12 h), 22-–25 °C air temperature, and 40-70% air humidity on 2920X Teklad Global Extruded Rodent Diet (Soy Protein-Free; Harlan Laboratories). At 15 weeks of age, mice were randomly grouped into three groups of four each and subjected to the experimental treatments. Specifically, mice in the experimental groups received 1 × 1011 genome copy (gc) per AAV vector buffered in 200 µl sterile saline via tail-vein injection. Three days after the AAV injection, mice in the induction group were injected with 3 mg/kg rapamycin buffered in the vehicle [a mixture of equal volume 10% PEG400 (MiliporeSigma, 8.07485.1000) and 10% Tween 80 (MiliporeSigma, 1754-25 ML)] every other day for 8 days through intraperitoneal injection (i.p.). Mice in the other groups were injected with the same volume of vehicle. Six days after the final injection of rapamycin, mice were euthanized, and 50 mg of mouse liver tissue was homogenized in 600 µl DPBS (Corning, 21-031-CV). The homogenate was then centrifuged at 2000 × g for 5 min at 4 °C, and the pellet was lysed using 600 µl lysis buffer [10 mM Tris-HCl, pH 7.5, 0.05% SDS, 25 µg/ml proteinase K (Thermo Fisher Scientific)] and incubated at 65 °C for 15 min, 68 °C for 15 min, and 98 °C for 10 min.
Statistics and reproducibility
All bar plots were created with dots indicating individual biological replicates. When there were more than two replicates, error bars were used to represent standard deviation. Groups were compared using either multiple unpaired two-tailed t-tests or unpaired two-tailed t-tests, with significance notations as ns (not significant), *P < 0.05, **P < 0.01, ***P < 0.001, and ****P < 0.0001. Relevant statistical details can be found in figure legends or descriptions. No statistical method was used to predetermine the sample size. Sample sizes were determined by observed variability across independent experiments, and no data were excluded from the analyses. These sizes align with standard practices in related research. For animal experiments, at 15 weeks of age, the twelve C57BL/6 male mice were randomly divided into three groups, with four mice in each group. The investigators were not blinded to allocation during experiments and outcome assessment, as the data were processed and analyzed in exactly the same way, and there were no subjective decisions or interpretations made during the data analysis phase.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Supplementary information
Source data
Acknowledgements
This work was supported by the National Science Foundation CAREER AWARD (CBET-2143626 to X.G.); Robert A. Welch Foundation grant (C-1952 to X.G.); National Institutes of Health grants (HL157714 to X.G.; DK111436 and AG069966 to Z.S.); the DLDCCC (P30CA125123 to Z.S.), the Specialized Programs of Research Excellence (SPORE) program (P50CA126752 to Z.S.), the Gulf Coast Center for Precision Environmental Health (P30ES030285 to Z.S.), and the Texas Medical Center Digestive Diseases Center (P30DK056338 to Z.S.). We thank the Cancer Genomics Center at The University of Texas Health Science Center at Houston for performing the RNA-sequencing service. We thank Harshavardhan Deshmukh at Rice University Shared Equipment Authority for support in FACS analysis. Figure 1d is created with PyMOL71. Figure 5f is created with BioRender.com.
Author contributions
H.Z., Q.Y., D.M., F.P., Z.S., and X.G. designed the research. H.Z., Q.Y., D.M., F.P., A.L., K.C., P.G., and E. C. O. performed the experiments. H.Z., Q.Y., and D.M. analyzed the data. H.Z. performed computational analysis of RNA-Seq data. H.Z. wrote the initial draft. H.Z., Q.Y., Z.S., and X.G. revised the manuscript with help from all authors. X.G. and Z.S. supervised the project.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Data availability
High-throughput DNA- and RNA-Seq data generated in this study have been deposited at the Sequence Read Archive PRJNA923001. Data presented in each figure are provided in Source Data. Nucleic acid sequences of all constructs are provided in the the Supplementary Data. 1. Nucleic acid sequence of genomic loci tested and primers used in this study are provided in Supplementary Data. 2, 3, and 4. The structure of TadA-8e can be found in Protein Data Bank PDB: 6VPC38 [https://www.rcsb.org/structure/6vpc]. Source data are provided with this paper.
Competing interests
X.G. and H.Z. are in the process of filing a provisional patent application on this work. The remaining authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Qichen Yuan, Fei Peng, Dacheng Ma.
Contributor Information
Zheng Sun, Email: zheng.sun@bcm.edu.
Xue Gao, Email: xue.gao@rice.edu.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-023-41331-5.
References
- 1.Daniel TC, Zeng H, Osikpa EC, Gao X. Revolutionizing genetic disease treatment: recent technological advances in base editing. Curr. Opin. Biomed. Eng. 2023;28:100472. doi: 10.1016/j.cobme.2023.100472. [DOI] [Google Scholar]
- 2.Anzalone AV, Koblan LW, Liu DR. Genome editing with CRISPR–Cas nucleases, base editors, transposases and prime editors. Nat. Biotechnol. 2020;38:824–844. doi: 10.1038/s41587-020-0561-9. [DOI] [PubMed] [Google Scholar]
- 3.Koblan LW, et al. In vivo base editing rescues Hutchinson–Gilford progeria syndrome in mice. Nature. 2021;589:608–614. doi: 10.1038/s41586-020-03086-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Musunuru K, et al. In vivo CRISPR base editing of PCSK9 durably lowers cholesterol in primates. Nature. 2021;593:429–434. doi: 10.1038/s41586-021-03534-y. [DOI] [PubMed] [Google Scholar]
- 5.Rothgangl T, et al. In vivo adenine base editing of PCSK9 in macaques reduces LDL cholesterol levels. Nat. Biotechnol. 2021;39:949–957. doi: 10.1038/s41587-021-00933-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Davis JR, et al. Efficient in vivo base editing via single adeno-associated viruses with size-optimized genomes encoding compact adenine base editors. Nat. Biomed. Eng. 2022;6:1272–1283. doi: 10.1038/s41551-022-00911-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Levy JM, et al. Cytosine and adenine base editing of the brain, liver, retina, heart and skeletal muscle of mice via adeno-associated viruses. Nat. Biomed. Eng. 2020;4:97–110. doi: 10.1038/s41551-019-0501-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Mayuranathan T, et al. Adenosine base editing of γ-Globin promoters Induces fetal hemoglobin and Inhibit erythroid sickling. Blood. 2020;136:21–22. doi: 10.1182/blood-2020-141498. [DOI] [Google Scholar]
- 9.Yen JS, et al. Base editing eliminates the sickle cell mutation and pathology in hematopoietic stem cells derived erythroid cells. Blood. 2020;136:13–14. doi: 10.1182/blood-2020-139016. [DOI] [Google Scholar]
- 10.Kluesner MG, et al. CRISPR-Cas9 cytidine and adenosine base editing of splice-sites mediates highly-efficient disruption of proteins in primary and immortalized cells. Nat. Commun. 2021;12:2437. doi: 10.1038/s41467-021-22009-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Gaudelli NM, et al. Directed evolution of adenine base editors with increased activity and therapeutic application. Nat. Biotechnol. 2020;38:892–900. doi: 10.1038/s41587-020-0491-6. [DOI] [PubMed] [Google Scholar]
- 12.Li C, et al. Targeted, random mutagenesis of plant genes with dual cytosine and adenine base editors. Nat. Biotechnol. 2020;38:875–882. doi: 10.1038/s41587-019-0393-7. [DOI] [PubMed] [Google Scholar]
- 13.Wang X, et al. Efficient gene silencing by adenine base editor-mediated start codon mutation. Mol. Ther. 2020;28:431–440. doi: 10.1016/j.ymthe.2019.11.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Winter J, et al. Targeted exon skipping with AAV-mediated split adenine base editors. Cell Discov. 2019;5:41. doi: 10.1038/s41421-019-0109-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cheng W, et al. Parallel functional assessment of m6A sites in human endodermal differentiation with base editor screens. Nat. Commun. 2022;13:478. doi: 10.1038/s41467-022-28106-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Huang C, Li G, Wu J, Liang J, Wang X. Identification of pathogenic variants in cancer genes using base editing screens with editing efficiency correction. Genome Biol. 2021;22:80. doi: 10.1186/s13059-021-02305-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kempton HR, Love KS, Guo LY, Qi LS. Scalable biological signal recording in mammalian cells using Cas12a base editors. Nat. Chem. Biol. 2022;18:742–750. doi: 10.1038/s41589-022-01034-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gaudelli NM, et al. Programmable base editing of A•T to G•C in genomic DNA without DNA cleavage. Nature. 2017;551:464–471. doi: 10.1038/nature24644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Rees HA, Liu DR. Base editing: precision chemistry on the genome and transcriptome of living cells. Nat. Rev. Genet. 2018;19:770–788. doi: 10.1038/s41576-018-0059-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Neugebauer ME, et al. Evolution of an adenine base editor into a small, efficient cytosine base editor with low off-target activity. Nat. Biotechnol. 2022;41:673–685. doi: 10.1038/s41587-022-01533-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Chen L, et al. Re-engineering the adenine deaminase TadA-8e for efficient and specific CRISPR-based cytosine base editing. Nat. Biotechnol. 2022;41:663–672. doi: 10.1038/s41587-022-01532-7. [DOI] [PubMed] [Google Scholar]
- 22.Lam DK, et al. Improved cytosine base editors generated from TadA variants. Nat. Biotechnol. 2023;41:686–697. doi: 10.1038/s41587-022-01611-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Tong, et al. Programmable A-to-Y base editing by fusing an adenine base editor with an N-methylpurine DNA glycosylase. Nat. Biotechnol. 2023;41:1080–1084. doi: 10.1038/s41587-022-01595-6. [DOI] [PubMed] [Google Scholar]
- 24.Richter MF, et al. Phage-assisted evolution of an adenine base editor with improved Cas domain compatibility and activity. Nat. Biotechnol. 2020;38:883–891. doi: 10.1038/s41587-020-0453-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Rees HA, Wilson C, Doman JL, Liu DR. Analysis and minimization of cellular RNA editing by DNA adenine base editors. Sci. Adv. 2019;5:eaax5717. doi: 10.1126/sciadv.aax5717. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Grünewald J, et al. Transcriptome-wide off-target RNA editing induced by CRISPR-guided D. N. A. base editors. Nature. 2019;569:433–437. doi: 10.1038/s41586-019-1161-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Zhou C, et al. Off-target RNA mutation induced by DNA base editing and its elimination by mutagenesis. Nature. 2019;571:275–278. doi: 10.1038/s41586-019-1314-0. [DOI] [PubMed] [Google Scholar]
- 28.Yan N, et al. Cytosine base editors induce off-target mutations and adverse phenotypic effects in transgenic mice. Nat. Commun. 2023;14:1784. doi: 10.1038/s41467-023-37508-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Dow LE, et al. Inducible in vivo genome editing with CRISPR-Cas9. Nat. Biotechnol. 2015;33:390–394. doi: 10.1038/nbt.3155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Zafra MP, et al. Optimized base editors enable efficient editing in cells, organoids and mice. Nat. Biotechnol. 2018;36:888–893. doi: 10.1038/nbt.4194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Zetsche B, Volz SE, Zhang F. A split-Cas9 architecture for inducible genome editing and transcription modulation. Nat. Biotechnol. 2015;33:139–142. doi: 10.1038/nbt.3149. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Nihongaki Y, Kawano F, Nakajima T, Sato M. Photoactivatable CRISPR-Cas9 for optogenetic genome editing. Nat. Biotechnol. 2015;33:755–760. doi: 10.1038/nbt.3245. [DOI] [PubMed] [Google Scholar]
- 33.Hemphill J, Borchardt EK, Brown K, Asokan A, Deiters A. Optical control of CRISPR/Cas9 gene editing. J. Am. Chem. Soc. 2015;137:5642–5645. doi: 10.1021/ja512664v. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Davis KM, Pattanayak V, Thompson DB, Zuris JA, Liu DR. Small molecule–triggered Cas9 protein with improved genome-editing specificity. Nat. Chem. Biol. 2015;11:316–318. doi: 10.1038/nchembio.1793. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Grünewald J, et al. base editors with reduced RNA off-target and self-editing activities. Nat. Biotechnol. 2019;37:1041–1048. doi: 10.1038/s41587-019-0236-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Fegan A, White B, Carlson JCT, Wagner CR. Chemically controlled protein assembly: techniques and applications. Chem. Rev. 2010;110:3315–3336. doi: 10.1021/cr8002888. [DOI] [PubMed] [Google Scholar]
- 37.Lee S-Y, et al. Proximity-directed labeling reveals a new rapamycin-Induced heterodimer of FKBP25 and FRB in Live Cells. ACS Cent. Sci. 2016;2:506–516. doi: 10.1021/acscentsci.6b00137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lapinaite A, et al. DNA capture by a CRISPR-Cas9-guided adenine base editor. Science. 2020;369:566–571. doi: 10.1126/science.abb1390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.PubChem. PubChem Compound Summary for CID 5284616, Sirolimus (National Center for Biotechnology Information, 2023).
- 40.Ran FA, et al. In vivo genome editing using staphylococcus aureus Cas9. Nature. 2015;520:186–191. doi: 10.1038/nature14299. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Xu X, et al. Engineered miniature CRISPR-Cas system for mammalian genome regulation and editing. Mol. Cell. 2021;81:4333–4345.e4334. doi: 10.1016/j.molcel.2021.08.008. [DOI] [PubMed] [Google Scholar]
- 42.Harrington LB, et al. Programmed DNA destruction by miniature CRISPR-Cas14 enzymes. Science. 2018;362:839–842. doi: 10.1126/science.aav4294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Zhang S, et al. TadA reprogramming to generate potent miniature base editors with high precision. Nat. Commun. 2023;14:413. doi: 10.1038/s41467-023-36004-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Kim DY, et al. Efficient CRISPR editing with a hypercompact Cas12f1 and engineered guide RNAs delivered by adeno-associated virus. Nat. Biotechnol. 2022;40:94–102. doi: 10.1038/s41587-021-01009-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bae S, Park J, Kim J-S. Cas-OFFinder: a fast and versatile algorithm that searches for potential off-target sites of Cas9 RNA-guided endonucleases. Bioinformatics. 2014;30:1473–1475. doi: 10.1093/bioinformatics/btu048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Doman JL, Raguram A, Newby GA, Liu DR. Evaluation and minimization of Cas9-independent off-target DNA editing by cytosine base editors. Nat. Biotechnol. 2020;38:620–628. doi: 10.1038/s41587-020-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Van der Auwera G. A. & O’Connor B. D. Genomics in the Cloud: Using Docker, GATK, and WDL in Terra (O’Reilly Media, 2020).
- 48.Anna A, Monika G. Splicing mutations in human genetic disorders: examples, detection, and confirmation. J. Appl. Genet. 2018;59:253–268. doi: 10.1007/s13353-018-0444-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Liu X, et al. CRISPR-Cas9-mediated multiplex gene editing in CAR-T cells. Cell Res. 2017;27:154–157. doi: 10.1038/cr.2016.142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Riolobos L, et al. HLA engineering of human pluripotent stem cells. Mol. Ther. 2013;21:1232–1241. doi: 10.1038/mt.2013.59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Morgan MA, Büning H, Sauer M, Schambach A. Use of cell and genome modification technologies to enerate Improved “Off-the-Shelf” CAR T and CAR NK cells. Front. Immunol. 2020;11:1965. doi: 10.3389/fimmu.2020.01965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Yamamoto H, Fara AF, Dasgupta P, Kemper C. CD46: The ‘multitasker’ of complement proteins. Int. J. Biochem. Cell Biol. 2013;45:2808–2820. doi: 10.1016/j.biocel.2013.09.016. [DOI] [PubMed] [Google Scholar]
- 53.Yuan Q, Gao X. Multiplex base- and prime-editing with drive-and-process CRISPR arrays. Nat. Commun. 2022;13:2771. doi: 10.1038/s41467-022-30514-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Walton RT, Christie KA, Whittaker MN, Kleinstiver BP. Unconstrained genome targeting with near-PAMless engineered CRISPR-Cas9 variants. Science. 2020;368:290–296. doi: 10.1126/science.aba8853. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Carvajal-Vallejos P, Pallissé R, Mootz HD, Schmidt SR. Unprecedented rates and efficiencies revealed for new natural split inteins from metagenomic sources*. J. Biol. Chem. 2012;287:28686–28696. doi: 10.1074/jbc.M112.372680. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Cohen JC, Boerwinkle E, Mosley TH, Hobbs HH. Sequence variations in PCSK9, Low LDL, and protection against coronary heart disease. N. Engl. J. Med. 2006;354:1264–1272. doi: 10.1056/NEJMoa054013. [DOI] [PubMed] [Google Scholar]
- 57.Viecelli HM, et al. Treatment of phenylketonuria using minicircle-based naked-DNA gene transfer to murine liver. Hepatology. 2014;60:1035–1043. doi: 10.1002/hep.27104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Villiger L, et al. Treatment of a metabolic liver disease by in vivo genome base editing in adult mice. Nat. Med. 2018;24:1519–1525. doi: 10.1038/s41591-018-0209-1. [DOI] [PubMed] [Google Scholar]
- 59.Maguire AM, et al. Efficacy, safety, and durability of voretigene neparvovec-rzyl in RPE65 mutation–associated inherited retinal dystrophy: results of phase 1 and 3 trials. Ophthalmology. 2019;126:1273–1285. doi: 10.1016/j.ophtha.2019.06.017. [DOI] [PubMed] [Google Scholar]
- 60.Jinek M, et al. A programmable dual-RNA– guided DNA endonuclease in adaptive bacterial immunity. Science. 2012;337:816–821. doi: 10.1126/science.1225829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Anzalone AV, et al. Search-and-replace genome editing without double-strand breaks or donor DNA. Nature. 2019;576:149–157. doi: 10.1038/s41586-019-1711-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.She K, et al. Dual-AAV split prime editor corrects the mutation and phenotype in mice with inherited retinal degeneration. Signal Transduct. Target. Ther. 2023;8:57. doi: 10.1038/s41392-022-01234-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Li J, Kim Sang G, Blenis J. Rapamycin: one drug, many effects. Cell Metab. 2014;19:373–379. doi: 10.1016/j.cmet.2014.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Rihtar E, et al. Chemically inducible split protein regulators for mammalian cells. Nat. Chem. Biol. 2022;19:64–71. doi: 10.1038/s41589-022-01136-x. [DOI] [PubMed] [Google Scholar]
- 65.Ma D, et al. Engineered PROTAC-CID systems for mammalian inducible gene regulation. J. Am. Chem. Soc. 2023;145:1593–1606. doi: 10.1021/jacs.2c09129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Gautier A, et al. How to control proteins with light in living systems. Nat. Chem. Biol. 2014;10:533–541. doi: 10.1038/nchembio.1534. [DOI] [PubMed] [Google Scholar]
- 67.Ai D, et al. Regulation of hepatic LDL receptors by mTORC1 and PCSK9 in mice. J. Clin. Investig. 2012;122:1262–1270. doi: 10.1172/JCI61919. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Cho S-I, et al. Targeted A-to-G base editing in human mitochondrial DNA with programmable deaminases. Cell. 2022;185:1764–1776.e1712. doi: 10.1016/j.cell.2022.03.039. [DOI] [PubMed] [Google Scholar]
- 69.Chu SH, et al. Rationally designed base editors for precise editing of the sickle cell disease mutation. CRISPR J. 2021;4:169–177. doi: 10.1089/crispr.2020.0144. [DOI] [PubMed] [Google Scholar]
- 70.Nguyen Tran MT, et al. Engineering domain-inlaid SaCas9 adenine base editors with reduced RNA off-targets and increased on-target DNA editing. Nat. Commun. 2020;11:4871. doi: 10.1038/s41467-020-18715-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.The PyMOL Molecular Graphics System, Version 2.1. (Schrödinger L., 2015).
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
High-throughput DNA- and RNA-Seq data generated in this study have been deposited at the Sequence Read Archive PRJNA923001. Data presented in each figure are provided in Source Data. Nucleic acid sequences of all constructs are provided in the the Supplementary Data. 1. Nucleic acid sequence of genomic loci tested and primers used in this study are provided in Supplementary Data. 2, 3, and 4. The structure of TadA-8e can be found in Protein Data Bank PDB: 6VPC38 [https://www.rcsb.org/structure/6vpc]. Source data are provided with this paper.