AAV- and CRISPR-Cas9–mediated gene targeting in neonatal mice led to clinical benefits in a mouse model of OTC deficiency.
Abstract
Ornithine transcarbamylase (OTC) deficiency is an X-linked urea cycle disorder associated with high mortality. Although a promising treatment for late-onset OTC deficiency, adeno-associated virus (AAV) neonatal gene therapy would only provide short-term therapeutic effects as the non-integrated genome gets lost during hepatocyte proliferation. CRISPR-Cas9-mediated homology-directed repair can correct a G-to-A mutation in 10% of OTC alleles in the livers of newborn OTC spfash mice. However, an editing vector able to correct one mutation would not be applicable for patients carrying different OTC mutations, plus expression would not be fast enough to treat a hyperammonemia crisis. Here, we describe a dual-AAV vector system that accomplishes rapid short-term expression from a non-integrated minigene and long-term expression from the site-specific integration of this minigene without any selective growth advantage for OTC-positive cells in newborns. This CRISPR-Cas9 gene-targeting approach may be applicable to all patients with OTC deficiency, irrespective of mutation and/or clinical state.
INTRODUCTION
Ornithine transcarbamylase (OTC) deficiency (OTCD) is an X-linked recessive disorder that accounts for nearly half of all inborn errors of the urea cycle (1). Severe OTCD in the neonatal period can result in hyperammonemic coma, which can rapidly become fatal without treatment (2). Current therapies include dialysis, the use of alternate nitrogen clearance pathways, and liver transplantation for severely affected patients; however, the mortality rate is still high (3).
Adeno-associated virus (AAV) vector–based gene therapy could provide an alternative to current treatment options. Over the past few years, AAV gene therapy has shown promising results in clinical trials for several diseases (4–7). Recently, the Food and Drug Administration approved the first AAV gene augmentation therapy for an inherited disease (8). An AAV8 vector that we have developed is currently being evaluated in a clinical trial for adult patients with OTCD (9). For patients with an early-onset form of OTCD, treatment in the early stage would be desirable. However, AAV-mediated, liver-directed neonatal gene therapy would only achieve short-term effects (10–13). Because of the nonintegrating nature of the AAV vector, most of the vector genome would be lost during hepatocyte proliferation. We hypothesize that directed integration of the OTC transgene into the host genome by genome editing could solve this problem.
Targeted genome editing is the holy grail of gene therapy (14). Pioneered by zinc finger nucleases (ZFNs), which rely on protein-DNA binding, genome editing initially entered the clinic with an ex vivo approach (15) and more recently in vivo using an AAV vector to target the liver of patients with hemophilia B or mucopolysaccharidosis I (16, 17). The discovery and development of CRISPR-Cas9 as a genome editing technology have provided a relatively simple method for site-specific genome modifications, owing to its unique mechanism of RNA-mediated DNA binding (18). Upon generation of site-specific double-stranded breaks (DSBs), nonhomologous end joining (NHEJ) creates insertions and deletions (indels); when a donor DNA template is present, homology-directed repair (HDR) incorporates the DNA sequence on the donor template into the endogenous locus. Given its high transduction efficiency in many tissues, AAV vector has been used as an efficient vehicle to deliver the nucleases and/or donor template for in vivo genome editing applications (19–24).
We recently developed a dual AAV vector approach for in vivo delivery of three key components of the CRISPR-Cas9 system: Cas9 enzyme from Staphylococcus aureus (SaCas9), a single-guide RNA (sgRNA) to a sequence in the murine OTC gene, and a donor template to drive HDR (25). We demonstrated HDR-based correction of a G-to-A mutation in 10% of OTC alleles in the livers of newborn spfash mice, which provide a model of chronic hyperammonemia, and clinical benefits following in vivo genome editing. However, this vector, which was developed for a specific mutation, would not be applicable for patients with mutations elsewhere in the OTC gene. Like most monogenic diseases, OTCD is caused by >300 different mutations scattered throughout the gene rather than a single predominant mutation (27). Moreover, our previous mutation correction approach is not only ineffective in adult spfash mice but also causes lethality in spfash mice treated as adults due to the loss of residual OTC expression caused by large deletions extending to exon 4 of the mouse OTC (mOTC) gene (25).
In this study, we aimed to develop a mutation-independent CRISPR-Cas9–mediated gene targeting approach, in which the vector system could be applied to most patients with a specific disease, in this instance, OTCD. We report that a single injection of a dual AAV8 vector system in neonatal mice achieves transient, high-level expression from the unintegrated transgene that could be useful in treating the acute neonatal crisis. Genome editing–directed integration of the transgene results in efficient, sustained, and clinically beneficial gene targeting in liver in the absence of any selective growth advantage for OTC-positive cells.
RESULTS
Development of a dual AAV vector system for CRISPR-Cas9–mediated gene targeting
To develop a broadly applicable genome editing vector for OTCD, we constructed a new AAV8 donor vector that contains (i) an sgRNA driven by the U6 promoter to target intron 4 of the mOTC locus (25) and (ii) a fully functional minigene expressing codon-optimized human OTC (hOTCco) driven by a liver-specific thyroxine binding globulin (TBG) promoter (TBG.hOTCco.pA) flanked by 0.9-kb homology arms on each side (referred to as AAV8.targeted donor; Fig. 1). The untargeted control donor vector contains all components except for the 20-nucleotide target sequence (referred to as AAV8.untargeted donor). Following CRISPR-Cas9–mediated HDR, the transgene cassette (referred to as the hOTCco minigene) should be inserted into intron 4 of the mOTC locus (Fig. 1). We selected intron 4 as the site of integration because it was used to correct the mutation in the spfash mice, providing a direct comparison of the two approaches independent of the efficiency and site of the on-target DSBs (25).
In vivo gene targeting of the OTC locus in the OTC spfash mouse liver by AAV.SaCas9
To determine the in vivo gene targeting efficiency, we coinjected AAV8.SaCas9 [5 × 1010 genome copies (GC) per pup] and AAV8.targeted donor or AAV8.untargeted donor (5 × 1011 GC per pup) vectors into postnatal day 2 (p2) spfash male pups via the temporal vein. We harvested liver samples at 3 and 8 weeks after vector injection for immunohistochemistry of OTC and histochemical staining of OTC enzyme activity (Fig. 2, A to C). Mice treated with the targeted vector (referred to as targeted mice) showed 25 and 35% of OTC-expressing hepatocytes at 3 and 8 weeks, respectively; this was four- and threefold higher than the mice treated with the untargeted vector (referred to as untargeted mice) at the same time points (Fig. 2D). Treated animals showed clusters of OTC-expressing cells scattered in the liver, including regions around the central vein where urea cycle enzymes are not normally expressed (Fig. 2C). Histochemical staining of OTC enzyme activity in the targeted mice at both 3 and 8 weeks showed expression of functional OTC in 26% of hepatocytes, which was threefold higher than that in the untargeted mice (Fig. 2E). At both time points, most OTC-positive hepatocytes were located in clusters scattered throughout all portions of the liver in the targeted mice, consistent with integration followed by clonal expansion in the context of a growing liver. Direct measurements of OTC enzyme activity from liver homogenates obtained from the targeted mice at 3 and 8 weeks showed 70 and 79% of wild-type (WT) levels, respectively, which were two- and threefold higher than the untargeted mice at the same time points (Fig. 2F). The OTC activity levels measured in the liver homogenates were about threefold higher than the percentage of OTC-positive hepatocytes by histochemical staining of OTC enzyme activity (Fig. 2E), suggesting that edited hepatocytes express levels of OTC higher than the endogenous gene. This may be due to the codon optimization of the hOTCco complementary DNA and the use of a strong liver-specific TBG promoter in the minigene. The OTC expression levels in individual hepatocytes from the hOTCco minigene were higher than those in hepatocytes in WT mice, as demonstrated by immunohistochemistry (Fig. 2, A and B).
We also measured hOTCco copies in the liver by quantitative polymerase chain reaction (PCR). At 3 weeks after vector treatment, we detected seven to eight copies of hOTCco per diploid genome in both untargeted and targeted mice (Fig. 2G). At 8 weeks after vector treatment, hOTCco copies decreased to two copies per diploid genome in untargeted mice and five copies in targeted mice, yet the difference between the targeted and untargeted groups was not statistically significant (Fig. 2G). Some hOTCco donor vector DNA (untargeted or targeted) may exist in the cells as episomal concatemers or have been randomly integrated into the host genome.
Clinical benefits evaluated by a high-protein diet challenge
To further assess the impact of gene targeting on the clinical manifestations of OTCD, we evaluated the tolerance of spfash mice to a 7-day course of a high-protein diet at 7 weeks postneonatal vector administration. We included WT littermates, untreated spfash mice, and untargeted spfash mice as controls. At the end of the course of the high-protein diet, we found that plasma ammonia was elevated from 37 ± 5 μM (n = 10) in WT controls to 314 ± 55 μM (n = 15) in the spfash controls (P < 0.001) (Fig. 3A), consistent with our previous finding. We observed substantial variations in plasma ammonia levels in the untreated spfash mice, consistent with findings in patients with OTCD who show considerable fluctuations in ammonia over relatively short periods of time (26). We did not observe a significant difference between untreated spfash mice and untargeted spfash mice (n = 13; Fig. 3A), which indicates that the residual hOTC expression in the untargeted mice was not sufficient to achieve clinical benefits. In contrast, we observed a statistically significant 60% reduction in ammonia levels in targeted mice (n = 12) as compared to untreated spfash mice (P < 0.05; Fig. 3A), slightly more efficient than what we achieved by our previous gene correction approach (40% reduction) (25). The plasma ammonia levels in the targeted group were reduced with respect to untreated spfash animals and did not statistically differ from those in WT mice showing complete correction. All WT mice (n = 10) and targeted mice (n = 15) survived the 7-day course of a high-protein challenge, whereas 27% of untreated spfash mice (n = 18) and 35% of untargeted spfash mice (n = 17) developed clinical signs of hyperammonemia, became lethargic, and had to be euthanized before the end of the 7-day course (Fig. 3B).
On-target indel frequency and HDR-mediated gene targeting efficiency
To assess the on-target editing efficiency at the mOTC intron 4, we analyzed on-target indel frequency in DNA isolated from spfash mice 8 weeks after vector treatment by deep sequencing of PCR amplicons of the targeted mOTC locus. The targeted mice had a mean indel frequency of 28% (22 to 38%, n = 8) (Fig. 4A). This was similar to the indel frequency that we observed with our previous gene correction approach using the same guide RNA (25). The untargeted mice had background levels at 0.3%, similar to the untreated mice (Fig. 4A).
To estimate the HDR-mediated gene targeting efficiency, we first digested the genomic DNA with Hae III or TaqI, which has recognition sites in the homology arms and in the donor vector (fig. S1, A and B), followed by ligation-mediated PCR coupled with unique molecular indices (LMU-PCR), which generate relatively similar sizes of PCR amplicons from untargeted (1.21 kb) and targeted OTC loci (1.09 and 0.99 kb for TaqI- and Hae III–digested samples, respectively) (fig. S1, C and D). Following deep sequencing of the nested PCR amplicons, we analyzed the mapped reads to calculate the ratio of reads containing the expected hOTCco minigene sequence and total mapped reads and defined it as HDR-mediated gene targeting efficiency. The targeted mice showed a mean targeting efficiency of 6% by both Hae III and TaqI assays (2 to 10% for Hae III, 4 to 8% for TaqI; n = 8; Fig. 4B). The untargeted mice (n = 8) showed undetectable (TaqI assay) or background levels (0.07%; Hae III assay) (Fig. 4B).
A more detailed analysis of the sequences obtained by LMU-PCR revealed a more complex pattern in the nature of the target region after genome editing. In addition to the sequences corresponding to parental genomic DNA and HDR-mediated insertion of the transgene, we found sequences mapping to different elements of the AAV vector such as the promoter, polyA, transgene, and inverted terminal repeats (ITRs) (fig. S2).
Comparison of neonatal OTC gene therapy and CRISPR-Cas9–mediated gene correction and gene targeting
To compare the gene targeting approach with our previously developed CRISPR-Cas9–mediated gene correction strategy and neonatal gene therapy, we treated p2 spfash mice with AAV8.SaCas9 in combination with vectors for gene targeting (AAV8.sgRNA1.hOTCco donor), gene correction of the spfash mutation (AAV8.sRNA1.donor vector), or gene therapy (AAV8.control.hOTCco donor). We harvested liver samples at 1, 3, 7, 21, and 56 days after vector injection to evaluate the kinetics of OTC expression by immunostaining. For the CRISPR-Cas9–mediated gene correction approach, isolated OTC-expressing cells started to appear at day 3 after vector treatment at a low frequency and increased over time (Fig. 5A). Spfash mice treated with the gene therapy vector (Fig. 5B) or gene-targeting vector (Fig. 5C) showed OTC-expressing cells as early as day 1 with a rapid increase to virtually all cells expressing OTC by day 7. These two groups, however, diverged between days 21 and 56, with the mice treated with the gene therapy vector showing decreased OTC-expressing cells (Fig. 5B) and the mice treated with the gene-targeting vector showing clusters of OTC-positive cells (Fig. 5C). Therefore, because of the slow kinetics of OTC expression, the gene correction approach would not be suitable to treat patients with OTC during the acute phase of hyperammonemia, although it would have long-lasting effects after the initial phase after treatment. Conversely, neonatal gene therapy with AAV alone functions quickly during the acute phase of hyperammonemia, but its therapeutic effects attenuate following hepatocyte proliferation. The gene targeting approach, however, would have both therapeutic benefits at an early phase and sustained efficacy through hepatocyte proliferation in the absence of any selective growth advantage for OTC-positive cells.
DISCUSSION
Using a highly efficient AAV delivery platform together with potent and specific guide RNAs for CRISPR-Cas9, we and others have demonstrated efficient in vivo genome editing in mouse models (25). Following cleavage by endonuclease, HDR is generally a less efficient pathway compared to NHEJ, which creates gene-disabling indels. AAV vector has exhibited advantages as an efficient vehicle to deliver donor DNA both in vitro and in vivo. We previously demonstrated successful correction of a G-to-A mutation in 10% of OTC alleles in the liver of newborn OTC spfash mice by a CRISPR-Cas9–mediated HDR approach (25). However, this approach cannot benefit all OTC-deficient patients because disease-causing mutations and large deletions are found scattered at approximately 320 different positions throughout the OTC gene (27). The HDR-mediated gene-targeting approach described in the current study could be broadly applied to all patients carrying mutations in the same causal gene, similar to gene replacement therapy, as long as they do not carry polymorphisms at the guide RNA target site. In the current proof-of-concept study, we used the same guide RNA as our previous study to target intron 4 of the murine OTC gene, 47 base pairs (bp) downstream of the spfash mutation. As a result, we could compare the efficiency of the two approaches without the complication of having different efficiencies by different guide RNAs. One of the advantages of gene targeting of a minigene is the flexibility of the targeting site. The gene targeting site can be explored in safe harbors containing protospacer sequences for highly efficient and specific guide RNAs. In contrast, for the gene correction approach, the selection of guide RNA is limited by the availability of protospacer adjacent motif sequences adjacent to the mutation. Among the limited choices of guide RNAs, a guide RNA with both sufficient efficiency and specificity may not exist.
For many genetic diseases that present clinically in the neonatal period with lethal effects, such as urea cycle disorders, early treatment and sustained therapeutic efficacy are essential. We compared three approaches: neonatal gene therapy with AAV expressing an OTC minigene alone, CRISPR-Cas9–mediated gene correction, and CRISPR-Cas9–mediated targeted integration of a therapeutic transgene cassette. The latter showed the advantages of neonatal gene therapy during the early phase combined with long-term benefits from the integrated transgene cassette due to genome editing (Fig. 5). Unlike the gene correction approach, not all OTC-expressing hepatocytes were derived from HDR-mediated genome editing. The source of OTC expression in this system could be multifactorial, being derived from the episomal donor vector genome that persists despite dilution from proliferating cells or a randomly integrated vector genome as seen in mice treated with the control donor vector (Fig. 5B) in addition to site-specific integration of the minigene. In mice treated with the gene-targeting vector, most of the OTC expression at early time points was derived from episomal vector DNA, similar to neonatal AAV gene therapy. With the high dose of donor vector used in this study, the OTC expression levels in the early phase (i.e., the first week) are likely to be over the normal levels, which do not have untoward effects. At later time points, most of the OTC expression came from targeted integration of the human OTC minigene. Also similar to the gene therapy approach and as predicted for endogenous OTC, OTC-expressing cells with the gene-targeting approach were scattered in the liver, including around central veins (Fig. 2C). This was in contrast to the gene correction approach, in which OTC-expressing cells were localized within all portions of the portal axis except around the central veins (25). OTC-expressing cells in the pericentral areas would have no impact on ureagenesis, as the urea cycle is least active in these areas. Although our dual vector gene-targeting approach achieved clinical benefits in OTC spfash mice treated as neonates, it should be noted that the vector dose used in the gene-target study is much higher than those used in AAV gene therapy studies, which, in most cases, only involve a single vector.
Because of the large homology arms (~900 bp at each end) of the donor vector, there is no straightforward way to measure HDR-mediated gene targeting efficiency. Therefore, we developed a method, called LMU-PCR, to quantify the levels of insertion of the hOTCco minigene. Assays with two different restriction enzymes showed a similar targeting efficiency of 6% (Fig. 4B). Detailed analysis of the sequences obtained by LMU-PCR revealed a complex pattern of genome structure around the target region (fig. S2). Besides the sequences corresponding to parental genomic DNA and HDR-mediated insertion of the hOTCco minigene, different parts of the AAV vector sequences were found inserted into the DSB, including partial portions of the promoter, polyA, transgene, and AAV ITRs. Some of these insertions may be too large to be captured by PCR amplicon next-generation sequencing (NGS); therefore, PCR amplicon NGS likely underestimates the true indel frequencies. The subcategories of insertions detected by this method can be influenced by the location of the restriction enzyme sites. We detected more ITR sequences in Hae III–digested versus TaqI-digested samples, likely because Hae III cuts three times in the ITR and TaqI does not cut within this region. The hairpin structure of the AAV ITR is known to impede PCR amplification (28). When delivered by AAV vectors, insertion of AAV sequences into the Cas9 cleavage site is expected, as AAV vector sequences can be integrated into the genome after induction of DSBs in the cell genome (29). Furthermore, AAV ITR sequences were detected after treatment of cells with a ZFN targeting the AAVS1 locus (30). These studies have shown that insertion of AAV sequences by NHEJ is an expected secondary effect of any AAV-delivered genome editing approach.
We cannot determine the extent of the AAV sequences integrated into the target region because of the limitations of the NGS amplicons. However, it is possible that some hOTC expression is derived from the donor vector inserted by NHEJ rather than HDR, as the donor vector contains promoter and polyA signals. NHEJ-mediated integration of the gene-targeting vector has been reported in previous studies using ZFNs (31, 32). Last, some hOTC expression is likely due to the episomal donor vector DNA or randomly integrated donor vector, as seen in the untargeted mice.
Before CRISPR-Cas9–mediated gene targeting, AAV/ZFN-mediated in vivo genome targeting in the liver demonstrated sustained and therapeutic levels of coagulation factor IX in both neonatal and adult mice (31–34). To take advantage of the strong albumin promoter, researchers developed ZFN to target the albumin locus and achieved high levels of gene expression for multiple transgenes (32). Encouraged by the preclinical data, clinical trials in patients with hemophilia B or mucopolysaccharidosis I (16, 17) have started. Although this approach has worked well for secreted proteins, the targeting efficiency at ~0.5% is likely too low to be effective for nonsecreted proteins such as enzymes of the urea cycle that require broad expression in a large number of hepatocytes.
In conclusion, we have demonstrated the therapeutic effect of AAV-delivered, CRISPR-Cas9–mediated gene targeting in a mouse model of OTCD. In the absence of any selective growth advantage for OTC-positive cells, a single injection of dual AAV gene-targeting vectors in neonatal mice achieved robust and sustained expression of OTC that was clinically beneficial. This genome editing strategy is not mutation position specific and can be broadly applied to all patients with the same disease. This strategy can also be adapted to other hereditary disorders of the liver.
MATERIALS AND METHODS
Study design
Sample size
Test and control vectors were evaluated in at least five mice per group at each time point to ensure reproducibility. Sample sizes are noted in figure legends.
Outliers
All data are presented.
Selection of endpoints
We chose 3 and 8 weeks after dosing in neonatal mice as relative short- and long-term time points, respectively, to evaluate OTC expression and gene targeting efficiency.
Replicates
Each vector group was evaluated in multiple litters of male spfash pups. Transduction efficiency in each mouse was analyzed on at least five images. Plasma NH3 was assayed in duplicate. Indel and HDR analysis on each sample was performed once.
Research objectives
This study aimed to develop a mutation-independent CRISPR-Cas9–mediated gene targeting approach in which the vector system could be applied to most patients with OTCD.
Research subjects
Newborn (p2) male spfash mouse pups were the subjects of this study. Untreated WT and spfash hemizygous mice served as controls.
Experimental design
Pups received a temporal vein injection of a mixture of two vectors at the intended doses for each with a volume of 50 μl. Mice were euthanized at various time points after vector treatment, and liver samples were harvested for analyses. Mice were genotyped at weaning or at the time of necropsy to confirm genotype. For testing the efficacy of OTC gene targeting, a high-protein diet (40% protein; Animal Specialties and Provisions, Quakertown, PA) was given to 7-week-old mice for 7 days. After this time, plasma was collected for measurement of plasma NH3 using an Ammonia Assay Kit (Sigma-Aldrich, St. Louis, MO).
Randomization
Note that the entire litter of newborn male pups was injected with either the targeting or control vectors, and no specific randomization method was used.
Blinding
The following assays were performed in a blinded fashion, in which the investigator was unaware of the nature of the vectors or vector dose: vector injection, OTC immunostaining, OTC enzyme activity staining and quantification, OTC enzyme activity assay, and vector GC analysis.
Plasmid construction
The AAV OTC gene-targeting vector contains the U6-OTC sgRNA1 cassette and the TBG.hOTCco.pA cassette flanked by 0.9-kb homology arms on each side (Fig. 1). This vector was generated by cloning of the TBG.hOTCco.pA cassette from pAAVss.TBG.hOTCco (35) into the Mfe I site of the pAAV.sgRNA1.donor, which was previously constructed for gene correction of the OTC spfash mutation (25). The “untargeted” AAV.control.targeting donor differs from the “targeted” AAV.sgRNA1.TBG.hOTCco.pA.donor by the lack of the protospacer sequence from the U6-OTC sgRNA1 cassette. All plasmid constructs were verified by sequencing. The SaCas9 expression vector AAV.TBG.SaCas9 has been previously described (25).
AAV vector production
All AAV8 vectors were produced by the Penn Vector Core at the University of Pennsylvania as previously described (36). The genome titer (GC ml−1) of AAV vectors was determined by quantitative PCR. All vectors used in this study passed an endotoxin assay using the QCL-1000 Chromogenic LAL test kit (Cambrex Bio Science).
Animal studies
spfash mice were maintained in an Association for Assessment and Accreditation of Laboratory Animal Care–accredited and Public Health Service–assured facility at the University of Pennsylvania, as described previously (37). All animal procedures were performed in accordance with protocols approved by the Institutional Animal Care and Use Committee of the University of Pennsylvania. Mating cages were monitored daily for births.
OTC enzyme activity staining and OTC immunostaining
Sliced liver tissue (2 mm) was fixed, embedded, sectioned (9 μm), and mounted onto slides for histochemical staining of OTC enzyme activity, as previously described (37). Immunofluorescence for OTC and glutamine synthetase expression was performed on frozen liver sections, as previously described (37). Quantification of percentages of OTC-expressing hepatocytes was performed as previously described (25).
OTC enzyme activity assay
OTC enzyme activity was assayed on liver homogenates as previously described (25).
Vector GC analysis
Genomic DNA was isolated from liver and extracted using the QIAamp DNA mini kit (Qiagen). Vector genomes were quantified by real-time PCR using primers/probe set corresponding to hOTCco.
In vivo on-target indel frequency analysis
On-target indel frequencies were evaluated on liver samples collected at 8 weeks following vector administration by deep sequencing on PCR amplicons using primers and methods described previously (24, 25). Libraries were made from 250 ng of the 433-bp PCR products using the NEBNext Ultra II DNA Library Prep Kit for Illumina (New England BioLabs) following the manufacturer’s instructions. Individual libraries were analyzed for quality and size using a high-resolution cartridge in the QIAxcel advanced system (Qiagen); library concentrations were measured by the PicoGreen assay (Thermo Fisher Scientific, Waltham, MA) before library pooling at equal molarity. The pooled library was subsequently size selected from 300 to 800 bp using an E-Gel Electrophoresis system (Thermo Fisher Scientific) and a QIAquick Gel Extraction kit (Qiagen). The final pooled and purified NGS library was quantified by a Qubit 3.0 Fluorometer, denatured, and diluted to 8 pM according to Illumina’s instructions. To increase library diversity for on-target indel analysis, 10 to 15% PhiX and the same percentage of an irrelevant, barcoded NGS library made from plasmid DNA were supplemented into the final amplicon-library pool before being loaded onto a MiSeq reagent cartridge. Sequencing was then performed using an Illumina MiSeq Reagent V2 kit (250-bp pair end; Illumina) to enable successful read merging for downstream data analysis.
Paired-end read pairs were assembled using PEAR46. The merged reads were aligned to the reference sequence with Burrows-Wheeler aligner (BWA) maximal exact match (MEM) (http://bio-bwa.sourceforge.net/), and the mapped reads were filtered to have mapping quality scores of at least 20. To identify indels, the target window was defined as a region that is within 20 bp upstream or downstream of the predicted cleavage site. We counted the total number of reads that mapped to the target window and the number of reads that contained indels. The indel frequency was calculated as the number of indel-containing reads divided by the total number of reads mapped to the target window.
Gene targeting efficiency analysis
HDR-mediated integration of the hOTCco cassette into the mouse OTC locus was quantified by LMU-PCR, as previously described (39). Briefly, 1 μg of genomic DNA was digested with 20 U of Hae III or TaqI for 4 hours at 37°C (for Hae III) or 65°C (for TaqI). Digested DNA was purified with Agencourt AMPure XP beads (Beckman Coulter, Brea, CA) at a ratio of 2× and eluted in 20 μl of elution buffer (Qiagen). Purified DNA was quantified using the Quant-It PicoGreen dsDNA assay (Thermo Fisher Scientific, Waltham, MA). A total of 180 ng of purified DNA was end-repaired and ligated to Y-adapters (see table S1), containing unique molecular indexes to reduce PCR bias, as previously described (40). Ligated DNA was purified with AMPure XP beads at a ratio of 0.9× and eluted in 15 μl of elution buffer (Qiagen). DNA was amplified by touchdown PCR using Platinum Taq DNA polymerase (Thermo Fisher Scientific) and the primers P5_1 plus either left-OTC-F1 (for Hae III–digested DNA) or right-OTC-F1 (for TaqI-digested DNA). The PCR program consisted of the following: 1 cycle of 95°C for 5 min; 15 cycles of 95°C for 30 s, 70°C for 2 min (−1°C per cycle), and 72°C for 1.5 min; and 15 cycles of 95°C for 30 s, 55°C for 1 min, and 72°C for 1.5 min. PCR product was purified with AMPure XP beads at a ratio of 0.9× and eluted in 15 μl of elution buffer (Qiagen). A total of 0.015 μl of PCR product was amplified with a second round of touchdown PCR using the primers P5_2 plus left- or right-OTC-F2 and the same PCR program as PCR1. PCR product was again purified with AMPure XP beads at a ratio of 0.9× and eluted in 15 μl of elution buffer (Qiagen). DNA libraries were prepared for NGS using unique P7 primers (P701 to P734) for each sample along with P5_2 plus left- or right-OTC-F3 primers and the product of the second PCR as a template. The following PCR program was used for the third and final PCR: 1 cycle of 95°C for 5 min; 15 cycles of 95°C for 30 s, 70°C for 2 min (−1°C per cycle), and 72°C for 30 s; 10 cycles of 95°C for 30 s, 55°C for 1 min, and 72°C for 30 s; and 1 cycle of 72°C for 5 min, 4°C hold. PCR product was purified with AMPure XP beads at a ratio of 2× and eluted in 25 μl of elution buffer (Qiagen). NGS library concentrations were measured by the Quant-It PicoGreen dsDNA assay (Thermo Fisher Scientific, Waltham, MA) before library pooling at equal molarity (40). The final pooled and purified NGS library was quantified by a Qubit 3.0 Fluorometer, denatured, and diluted to 8 pM according to Illumina’s instructions with a 15% PhiX spike-in. Sequencing was then performed using custom sequencing primers as described in the GUIDE-seq protocol (40) and the Illumina MiSeq Reagent V2 kit 500 cycle to enable successful read merging for downstream data analysis.
Adapter sequences were trimmed using BBduk (https://jgi.doe.gov/data-and-tools/bbtools/). Paired-end read pairs were assembled using PEAR (41), and the merged reads were aligned to the reference genomic sequence using BWA-MEM (http://bio-bwa.sourceforge.net/). Unique molecules were identified using UMI-tools (42). The molecules were further filtered to ensure they had a mapping quality score ≥20, had a length for the mapped portion ≥60 bp for Hae III–digested samples and ≥55 bp for TaqI-digested samples, and started with the GSP3 primer sequence plus a portion of the homology arm. Unique molecules containing more than 60 bp of aligned portion for Hae III (or 55 bp for TaqI) and less than 10 bp of unaligned portion were classified as “mOTC locus.” For unique molecules containing unaligned portion ≥10 bp, the sequences of unaligned portion were extracted and further aligned to HDR, ITR, SaCas9, polyA, hOTCco, TBG, or other regions on the donor vector and SaCas9 vector sequences using BWA-backtrack (43). The HDR integration percentage was calculated as the number of unique molecules mapped to HDR divided by the total number of unique molecules after filtering.
Statistical analysis
Statistical analyses were performed with GraphPad Prism 6.03 for Windows. The log-rank test was used to test the survival distributions for differences. A one-way analysis of variance (ANOVA) and Dunnett’s multiple comparisons test were used to compare a number of variables with a single control. To compare untargeted and targeted groups, the Mann-Whitney test was used. Because of the relatively small sample size, normality testing was not feasible. Group averages are presented as means ± SEM.
Supplementary Material
Acknowledgments
We thank the Penn Vector Core for supplying AAV vectors and the Nucleic Acid Technologies Core at the Gene Therapy Program for assistance with deep sequencing. Funding: This work was supported by the National Institute of Child Health and Human Development P01-HD057247 (J.M.W.) and the Kettering Family Foundation (M.L.B.). The State Key Laboratory of Biotherapy and Cancer Center, West China Hospital, Sichuan University and the Collaborative Innovation Center for Biotherapy, Chengdu, Sichuan, China provided the salary for Y.Y. Author contributions: L.W., Y.Y., and J.M.W. conceived this study and designed the experiments. Y.Y. constructed the donor vectors. L.W., Y.Y., J.W., and D.M. performed mouse studies. Y.Y., C.B., J.Z., A.S., and C.L. performed indel and targeting efficiency analyses. Y.C. and M.L. conducted the bioinformatics analysis of the deep sequencing data and statistical analyses. P.B. and H.Y. performed histology analyses. Z.H. performed vector DNA analysis. C.X. and H.M. performed the OTC enzyme activity assay. L.W. and J.M.W. wrote the manuscript. M.L.B., H.M., J.Z., and Y.Y. edited the manuscript. All authors read and approved the final manuscript. Competing interests: J.M.W. is a paid advisor to and holds equity in Scout Bio and Passage Bio; holds equity in Surmount Bio; and also has a sponsored research agreement with Ultragenyx, Biogen, Janssen, Precision Biosciences, Moderna Inc., Scout Bio, Passage Bio, Amicus Therapeutics, and Surmount Bio, which are licensees of Penn technology. L.W., Y.Y., and J.M.W. are inventors on patents/patent applications related to this work, as well as other patents/applications filed by the University of Pennsylvania, some of which are licensed to various biopharmaceutical companies for which they may receive payments. Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. Additional data related to this paper may be requested from the authors. The original deep-sequencing data are available at NCBI BioProject under accession code PRJNA454822. The AAV plasmid constructs used in this manuscript can be provided by Penn Vector Core pending scientific review and a completed material transfer agreement. Requests for the plasmid should be submitted to: vector@mail.med.upenn.edu
SUPPLEMENTARY MATERIALS
REFERENCES AND NOTES
- 1.Tuchman M., Lee B., Lichter-Konecki U., Summar M. L., Yudkoff M., Cederbaum S. D., Kerr D. S., Diaz G. A., Seashore M. R., Lee H. S., McCarter R., Krischer J. P., Batshaw M. L.; Urea Cycle Disorders Consortium of the Rare Diseases Clinical Research Network , Cross-sectional multicenter study of patients with urea cycle disorders in the United States. Mol. Genet. Metab. 94, 397–402 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Tuchman M., Morizono H., Rajagopal B. S., Plante R. J., Allewell N. M., The biochemical and molecular spectrum of ornithine transcarbamylase deficiency. J. Inherit. Metab. Dis. 21 ( Suppl. 1), 40–58 (1998). [DOI] [PubMed] [Google Scholar]
- 3.Batshaw M. L., Tuchman M., Summar M., Seminara J.; Members of the Urea Cycle Disorders Consortium , A longitudinal study of urea cycle disorders. Mol. Genet. Metab. 113, 127–130 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Nathwani A. C., Reiss U. M., Tuddenham E. G. D., Rosales C., Chowdary P., McIntosh J., Della Peruta M., Lheriteau E., Patel N., Raj D., Riddell A., Pie J., Rangarajan S., Bevan D., Recht M., Shen Y. M., Halka K. G., Basner-Tschakarjan E., Mingozzi F., High K. A., Allay J., Kay M. A., Ng C. Y. C., Zhou J., Cancio M., Morton C. L., Gray J. T., Srivastava D., Nienhuis A. W., Davidoff A. M., Long-term safety and efficacy of factor IX gene therapy in hemophilia B. N. Engl. J. Med. 371, 1994–2004 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.George L. A., Sullivan S. K., Giermasz A., Rasko J. E. J., Samelson-Jones B. J., Ducore J., Cuker A., Sullivan L. M., Majumdar S., Teitel J., McGuinn C. E., Ragni M. V., Luk A. Y., Hui D., Wright J. F., Chen Y., Liu Y., Wachtel K., Winters A., Tiefenbacher S., Arruda V. R., van der Loo J. C. M., Zelenaia O., Takefman D., Carr M. E., Couto L. B., Anguela X. M., High K. A., Hemophilia B gene therapy with a high-specific-activity factor IX variant. N. Engl. J. Med. 377, 2215–2227 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Rangarajan S., Walsh L., Lester W., Perry D., Madan B., Laffan M., Yu H., Vettermann C., Pierce G. F., Wong W. Y., Pasi K. J., AAV5-factor VIII gene transfer in severe hemophilia A. N. Engl. J. Med. 377, 2519–2530 (2017). [DOI] [PubMed] [Google Scholar]
- 7.Mendell J. R., al-Zaidy S., Shell R., Arnold W. D., Rodino-Klapac L. R., Prior T. W., Lowes L., Alfano L., Berry K., Church K., Kissel J. T., Nagendran S., L’Italien J., Sproule D. M., Wells C., Cardenas J. A., Heitzer M. D., Kaspar A., Corcoran S., Braun L., Likhite S., Miranda C., Meyer K., Foust K. D., Burghes A. H. M., Kaspar B. K., Single-dose gene-replacement therapy for spinal muscular atrophy. N. Engl. J. Med. 377, 1713–1722 (2017). [DOI] [PubMed] [Google Scholar]
- 8.Smalley E., First AAV gene therapy poised for landmark approval. Nat. Biotechnol. 35, 998–999 (2017). [DOI] [PubMed] [Google Scholar]
- 9.ClinicalTrials.gov, Safety and Dose-Finding Study of DTX301 (scAAV8OTC) in Adults With Late-Onset OTC Deficiency (CAPtivate), vol. NCT02991144.
- 10.Cunningham S. C., Spinoulas A., Carpenter K. H., Wilcken B., Kuchel P. W., Alexander I. E., AAV2/8-mediated correction of OTC deficiency is robust in adult but not neonatal Spfash mice. Mol. Ther. 17, 1340–1346 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wang L., Bell P., Lin J., Calcedo R., Tarantal A. F., Wilson J. M., AAV8-mediated hepatic gene transfer in infant rhesus monkeys (Macaca mulatta). Mol. Ther. 19, 2012–2020 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wang L., Wang H., Bell P., McMenamin D., Wilson J. M., Hepatic gene transfer in neonatal mice by adeno-associated virus serotype 8 vector. Hum. Gene Ther. 23, 533–539 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Cunningham S. C., Dane A. P., Spinoulas A., Logan G. J., Alexander I. E., Gene delivery to the juvenile mouse liver using AAV2/8 vectors. Mol. Ther. 16, 1081–1088 (2008). [DOI] [PubMed] [Google Scholar]
- 14.Fehse B., Abramowski-Mock U., The Time Is Ripe for Somatic Genome Editing: NIH Program to Strengthen Translation. Mol. Ther. 26, 671–674 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Tebas P., Stein D., Tang W. W., Frank I., Wang S. Q., Lee G., Spratt S. K., Surosky R. T., Giedlin M. A., Nichol G., Holmes M. C., Gregory P. D., Ando D. G., Kalos M., Collman R. G., Binder-Scholl G., Plesa G., Hwang W. T., Levine B. L., June C. H., Gene editing of CCR5 in autologous CD4 T cells of persons infected with HIV. N. Engl. J. Med. 370, 901–910 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.ClinicalTrials.gov, Ascending Dose Study of Genome Editing by Zinc Finger Nuclease Therapeutic SB-FIX in Subjects With Severe Hemophilia B, vol. NCT02695160.
- 17.ClinicalTrials.gov, Ascending Dose Study of Genome Editing by the Zinc Finger Nuclease (ZFN) Therapeutic SB-318 in Subjects With MPS I, vol. NCT02702115.
- 18.Jinek M., Chylinski K., Fonfara I., Hauer M., Doudna J. A., Charpentier E., A programmable dual-RNA–guided DNA endonuclease in adaptive bacterial immunity. Science 337, 816–821 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ran F. A., Cong L., Yan W. X., Scott D. A., Gootenberg J. S., Kriz A. J., Zetsche B., Shalem O., Wu X., Makarova K. S., Koonin E. V., Sharp P. A., Zhang F., In vivo genome editing using Staphylococcus aureus Cas9. Nature 520, 186–191 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Tabebordbar M., Zhu K., Cheng J. K. W., Chew W. L., Widrick J. J., Yan W. X., Maesner C., Wu E. Y., Xiao R., Ran F. A., Cong L., Zhang F., Vandenberghe L. H., Church G. M., Wagers A. J., In vivo gene editing in dystrophic mouse muscle and muscle stem cells. Science 351, 407–411 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Nelson C. E., Hakim C. H., Ousterout D. G., Thakore P. I., Moreb E. A., Rivera R. M. C., Madhavan S., Pan X., Ran F. A., Yan W. X., Asokan A., Zhang F., Duan D., Gersbach C. A., In vivo genome editing improves muscle function in a mouse model of Duchenne muscular dystrophy. Science 351, 403–407 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Long C., Amoasii L., Mireault A. A., McAnally J. R., Li H., Sanchez-Ortiz E., Bhattacharyya S., Shelton J. M., Bassel-Duby R., Olson E. N., Postnatal genome editing partially restores dystrophin expression in a mouse model of muscular dystrophy. Science 351, 400–403 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Suzuki K., Tsunekawa Y., Hernandez-Benitez R., Wu J., Zhu J., Kim E. J., Hatanaka F., Yamamoto M., Araoka T., Li Z., Kurita M., Hishida T., Li M., Aizawa E., Guo S., Chen S., Goebl A., Soligalla R. D., Qu J., Jiang T., Fu X., Jafari M., Esteban C. R., Berggren W. T., Lajara J., Nuñez-Delicado E., Guillen P., Campistol J. M., Matsuzaki F., Liu G. H., Magistretti P., Zhang K., Callaway E. M., Zhang K., Belmonte J. C. I., In vivo genome editing via CRISPR/Cas9 mediated homology-independent targeted integration. Nature 540, 144–149 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Wang L., Smith J., Breton C., Clark P., Zhang J., Ying L., Che Y., Lape J., Bell P., Calcedo R., Buza E. L., Saveliev A., Bartsevich V. V., He Z., White J., Li M., Jantz D., Wilson J. M., Meganuclease targeting of PCSK9 in macaque liver leads to stable reduction in serum cholesterol. Nat. Biotechnol. 36, 717–725 (2018). [DOI] [PubMed] [Google Scholar]
- 25.Yang Y., Wang L., Bell P., McMenamin D., He Z., White J., Yu H., Xu C., Morizono H., Musunuru K., Batshaw M. L., Wilson J. M., A dual AAV system enables the Cas9-mediated correction of a metabolic liver disease in newborn mice. Nat. Biotechnol. 34, 334–338 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Lee B., Diaz G. A., Rhead W., Lichter-Konecki U., Feigenbaum A., Berry S. A., Le Mons C., Bartley J. A., Longo N., Nagamani S. C., Berquist W., Gallagher R., Bartholomew D., Harding C. O., Korson M. S., McCandless S. E., Smith W., Cederbaum S., Wong D., Merritt II J. L., Schulze A., Vockley J., Kronn D., Zori R., Summar M., Milikien D. A., Marino M., Coakley D. F., Mokhtarani M., Scharschmidt B. F., Blood ammonia and glutamine as predictors of hyperammonemic crises in patients with urea cycle disorder. Genet. Med. 17, 561–568 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Caldovic L., Abdikarim I., Narain S., Tuchman M., Morizono H., Genotype–phenotype correlations in ornithine transcarbamylase deficiency: A mutation update. J. Genet. Genomics 42, 181–194 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Petri K., Fronza R., Gabriel R., Käppel C., Nowrouzi A., Linden R. M., Henckaerts E., Schmidt M., Comparative next-generation sequencing of adeno-associated virus inverted terminal repeats. Biotechniques 56, 269–273 (2014). [DOI] [PubMed] [Google Scholar]
- 29.Miller D. G., Petek L. M., Russell D. W., Adeno-associated virus vectors integrate at chromosome breakage sites. Nat. Genet. 36, 767–773 (2004). [DOI] [PubMed] [Google Scholar]
- 30.De Ravin S. S., Reik A., Liu P.-Q., Li L., Wu X., Su L., Raley C., Theobald N., Choi U., Song A. H., Chan A., Pearl J. R., Paschon D. E., Lee J., Newcombe H., Koontz S., Sweeney C., Shivak D. A., Zarember K. A., Peshwa M. V., Gregory P. D., Urnov F. D., Malech H. L., Targeted gene addition in human CD34+ hematopoietic cells for correction of X-linked chronic granulomatous disease. Nat. Biotechnol. 34, 424–429 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Anguela X. M., Sharma R., Doyon Y., Miller J. C., Li H., Haurigot V., Rohde M. E., Wong S. Y., Davidson R. J., Zhou S., Gregory P. D., Holmes M. C., High K. A., Robust ZFN-mediated genome editing in adult hemophilic mice. Blood 122, 3283–3287 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Sharma R., Anguela X. M., Doyon Y., Wechsler T., DeKelver R. C., Sproul S., Paschon D. E., Miller J. C., Davidson R. J., Shivak D., Zhou S., Rieders J., Gregory P. D., Holmes M. C., Rebar E. J., High K. A., In vivo genome editing of the albumin locus as a platform for protein replacement therapy. Blood 126, 1777–1784 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Li H., Haurigot V., Doyon Y., Li T., Wong S. Y., Bhagwat A. S., Malani N., Anguela X. M., Sharma R., Ivanciu L., Murphy S. L., Finn J. D., Khazi F. R., Zhou S., Paschon D. E., Rebar E. J., Bushman F. D., Gregory P. D., Holmes M. C., High K. A., In vivo genome editing restores haemostasis in a mouse model of haemophilia. Nature 475, 217–221 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Barzel A., Paulk N. K., Shi Y., Huang Y., Chu K., Zhang F., Valdmanis P. N., Spector L. P., Porteus M. H., Gaensler K. M., Kay M. A., Promoterless gene targeting without nucleases ameliorates haemophilia B in mice. Nature 517, 360–364 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Bell P., Wang L., Chen S. J., Yu H., Zhu Y., Nayal M., He Z., White J., Lebel-Hagan D., Wilson J. M., Effects of self-complementarity, codon optimization, transgene, and dose on liver transduction with AAV8. Hum. Gene Ther. Methods 27, 228–237 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lock M., Alvira M., Vandenberghe L. H., Samanta A., Toelen J., Debyser Z., Wilson J. M., Rapid, simple, and versatile manufacturing of recombinant adeno-associated viral vectors at scale. Hum. Gene Ther. 21, 1259–1271 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Moscioni D., Morizono H., McCarter R. J., Stern A., Cabrera-Luque J., Hoang A., Sanmiguel J., Wu D., Bell P., Gao G. P., Raper S. E., Wilson J. M., Batshaw M. L., Long-term correction of ammonia metabolism and prolonged survival in ornithine transcarbamylase-deficient mice following liver-directed treatment with adeno-associated viral vectors. Mol. Ther. 14, 25–33 (2006). [DOI] [PubMed] [Google Scholar]
- 38.Wang L., Wang H., Morizono H., Bell P., Jones D., Lin J., McMenamin D., Yu H., Batshaw M. L., Wilson J. M., Sustained correction of OTC deficiency in spfash mice using optimized self-complementary AAV2/8 vectors. Gene Ther. 19, 404–410 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wang L., Yang Y., Breton C. A., White J., Zhang J., Che Y., Saveliev A., McMenamin D., He Z., Latshaw C., Li M., Wilson J. M., CRISPR/Cas9-mediated in vivo gene targeting corrects hemostasis in newborn and adult factor IX–knockout mice. Blood 133, 2745–2752 (2019). [DOI] [PubMed] [Google Scholar]
- 40.Tsai S. Q., Zheng Z., Nguyen N. T., Liebers M., Topkar V. V., Thapar V., Wyvekens N., Khayter C., Iafrate A. J., le L. P., Aryee M. J., Joung J. K., GUIDE-seq enables genome-wide profiling of off-target cleavage by CRISPR-Cas nucleases. Nat. Biotechnol. 33, 187–197 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Zhang J., Kobert K., Flouri T., Stamatakis A., PEAR: A fast and accurate Illumina Paired-End reAd mergeR. Bioinformatics 30, 614–620 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Smith T., Heger A., Sudbery I., UMI-tools: Modeling sequencing errors in Unique Molecular Identifiers to improve quantification accuracy. Genome Res. 27, 491–499 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Li H., Durbin R., Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.