Abstract
Different approaches are used in the production of recombinant adeno-associated virus (rAAV). The two leading approaches are transiently transfected human HEK293 cells and live baculovirus infection of Spodoptera frugiperda (Sf9) insect cells. Unexplained differences in vector performance have been seen clinically and preclinically. Thus, we performed a controlled comparative production analysis varying only the host cell species but maintaining all other parameters. We characterized differences with multiple analytical approaches: proteomic profiling by mass spectrometry, isoelectric focusing, cryo-EM (transmission electron cryomicroscopy), denaturation assays, genomic and epigenomic sequencing of packaged genomes, human cytokine profiling, and functional transduction assessments in vitro and in vivo, including in humanized liver mice. Using these approaches, we have made two major discoveries: (1) rAAV capsids have post-translational modifications (PTMs), including glycosylation, acetylation, phosphorylation, and methylation, and these differ between platforms; and (2) rAAV genomes are methylated during production, and these are also differentially deposited between platforms. Our data show that host cell protein impurities differ between platforms and can have their own PTMs, including potentially immunogenic N-linked glycans. Human-produced rAAVs are more potent than baculovirus-Sf9 vectors in various cell types in vitro (p < 0.05–0.0001), in various mouse tissues in vivo (p < 0.03–0.0001), and in human liver in vivo (p < 0.005). These differences may have clinical implications for rAAV receptor binding, trafficking, expression kinetics, expression durability, vector immunogenicity, as well as cost considerations.
Keywords: AAV, adeno-associated virus, PTM, post-translational modification, potency, methylation, human, baculovirus-Sf9, capsid, mass spectrometry
Graphical Abstract
Paulk and colleagues performed an unbiased comparative characterization of rAAV manufactured by transient transfection of HEK293 or baculoviral infection of insect cells. They discovered that rAAV capsids have post-translational modifications, including glycosylation, acetylation, phosphorylation, and methylation, and these differ between platforms. Additionally, rAAV genomes are differentially methylated during production between platforms.
Introduction
Adeno-associated virus (AAV) is a single-stranded DNA virus that is non-pathogenic to humans, exhibits low immunogenicity but high transduction efficiency, and is unable to replicate itself.1 Recombinant AAV (rAAV) can stably express gene products from either unintegrated episomes2 in quiescent tissues, or via integration in actively dividing tissues3 when designed with appropriate homology arms. Gene therapies and passive vaccines with rAAV are rapidly gaining attention and investment following the first rAAV therapy approvals in the US market. Nearly 200 rAAV clinical trials for various indications have been performed, and numerous investigational new drug applications are in various stages of review at the US Food and Drug Administration (FDA) and European Medicines Agency (EMA). The predominant rAAV production platform used for preclinical and clinical studies to date has been transient transfection of adherent human HEK293 cells.4 However, growth limitations using adherent HEK293 cells have spurred manufacturing innovations to increase yields. A newer platform utilizing live baculovirus infection of Spodoptera frugiperda (Sf9) insect cells produces an average of 7e4 vector genomes (vg)/cell.5, 6, 7 Several recent clinical trials have used vector from this new platform. One difference between their preclinical validation and clinical trials was a transition in manufacturing platform from human HEK293 to baculovirus-Sf9 when scale-up was needed. This often corresponded to an increased vector dose being administered to achieve relevant expression, and in some cases patients developed severe adverse events directly related to treatment.8,9 Thus, while poised to revolutionize treatment of rare and common diseases alike, a thorough characterization of differences in rAAV vector safety and potency produced by the two manufacturing platforms was needed.
We characterized differences in vector lot composition at both the genomic and proteomic levels. This was motivated by the fact that human and insect cells have different capacities to produce protein post-translational modifications (PTMs).10 Protein folding and PTMs can influence therapeutically administered proteins, including altering stability, targeting/trafficking, functional activity, and immunogenicity, all of which could affect rAAV potency. Additionally, a concern in producing recombinant proteins for humans within insect cells is potential immunotoxicity. Humans can have acute allergenic reactions to non-mammalian N-glycans, as well as any N-glycan with an α1,3-fucose or β1,2-xylose linkage on the innermost N-acetylglucosamine (GlcNAc),11 both of which are modifications found on insect glycoproteins. Thus, certain insect glycoproteins in baculovirus-Sf9-produced vector lots could pose potential risks.
Impurities of any kind are a known concern for all production methods. For example, the EMA assessment report for the rAAV Glybera (alipogene tiparvovec) found the final vector lot impurities to be “unacceptably high.”12 Collectively, we therefore hypothesized that rAAV vector, both capsids and packaged genomes, produced in different manufacturing platforms may be compositionally diverse. Using an array of chemical, molecular, structural, bioinformatic, and functional assessment approaches, we characterized and report herein the differences and similarities in rAAV vector produced by these two leading manufacturing platforms.
Results
The Vector PTM Landscape Differs in rAAV Vectors Made Using the Human and Baculovirus-Sf9 Production Platforms
We sought to determine whether rAAV capsids were post-translationally modified, and whether capsid PTMs or any host cell protein (HCP) process impurities differed between production methods and lots. We used deep proteomic profiling and liquid chromatography-tandem mass spectrometry (LC-MS/MS). To eliminate variables between human and baculovirus-Sf9 platforms for comparative high-resolution MS/MS analysis, the following safeguards were implemented: (1) all vector productions were carried out at the same time in one facility using identical equipment by the same individuals; (2) vector lots were harvested, purified, and underwent quality control together using the same assays; (3) vector aliquots were frozen down identically and simultaneously; and (4) aliquots were thawed together and prepared for analysis from lots that had been frozen for the same amount of time to eliminate the effects of time spent frozen on potential PTM retention. One additional parameter that was important to control was the input DNA and backbone vector sequence itself. rAAV transfer vectors are encoded on plasmids for transfection in the human manufacturing platform and within a live baculovirus in the insect platform. To overcome this difference, we designed a custom transfer vector plasmid that has the necessary backbone components to be used in both systems (Figure S1). Thus, the only differences remaining were those standard to each method: human HEK293 cells were grown in adherent cultures and transiently transfected with three production plasmids, while insect Sf9 cells were grown in suspension cultures infected with two second-generation5 recombinant baculoviruses needed to produce rAAV.
A set of rAAV8 productions was carried out and subdivided into four preparations for simultaneous purification: lots purified from cell lysates and media supernatant from each platform. LC-MS/MS analysis resulted in rAAV8 capsid protein coverage of 98.8% (human) and 97.3% (baculovirus-Sf9). Capsid modifications observed included O-linked glycans, N-terminal acetylation to start methionine, lysine/arginine acetylation, tyrosine/serine/threonine phosphorylation, lysine/arginine methylation, and aspartate deamidation (Figures S2A–S2F). PTMs were distributed along the entire polypeptide, and more PTMs were observed on baculovirus-rAAV8 capsids compared to those produced in human cells (Figures 1A–1E). In addition, externally facing PTMs were more common than those in the capsid lumen in both platforms. However, the first 220 aa in the N terminus of rAAV8 are disordered; thus, we lack structural data to classify the orientation of PTMs in that region. Limited data suggest that these amino acids are located within the capsid.13,14 Interestingly, several PTMs within this disordered and presumed luminal region were shared between the human and baculovirus-rAAV8 lots: N-terminal acetylation on the initial start methionine, and phosphorylation on serines 149 and 153. These shared PTMs may play a conserved role in capsid-genome interactions or in intracellular trafficking following phospholipase A2 domain extrusion once in the endosome.15 We confirmed the presence of all 16 recently reported deamidations that were observed on human cell lysate-purified rAAV816 on both human and baculovirus-rAAV8 cell lysate and media-purified preparations.
For the remaining residues 220–738 on rAAV8 for which robust structural data exist,17 those residues that are externally facing may play roles in mediating blood clearance, antibody binding, cellular tropism, receptor binding, and so forth. We detected numerous PTMs within known external functional regions (Figures 1A–1E). For example, within the laminin receptor (LamR) binding region (residues 491–547 and 593–623),18 we observed one shared and numerous differential PTMs between human and baculovirus-produced vectors. The one conserved modification was an O-GlcNAc on T494 within the LamR binding domain. However, baculovirus-rAAV8 had five different external modifications, compared to the two in human-rAAV8 in this same domain. This included K530, which we detected to be capable of both acetylation and methylation on different capsids within the same vector lot. Interestingly, the C-terminal end of the human-produced vectors lacked observable PTMs, both internally and externally, while the baculovirus-produced rAAV8 vectors were often modified in this region, including an additional external O-GlyNAc on T663. No PTMs were detected in the known ADK8 neutralizing epitope region (586–591).19 Additionally, there are eight lysines on the mapped structure of AAV8 VP3 predicted20 to be capable of ubiquitination: K259, K333, K510, K530, K547, K569, K668, and K709. In human rAAV8 productions we observed no PTMs on or near any of these residues, while in baculovirus-Sf9 productions we observed PTMs on two ubiquitinable lysines (K530 was methylated and acetylated; K709 was methylated), thus potentially blocking them from future ubiquitination.
Negative staining and transmission electron microscopy (TEM) confirmed that, despite identical purification, human-rAAV had more full capsids, regardless of whether the rAAVs were purified from cell lysates or media supernatant (Figures 1F–1I). We observed normal 1:1:1021 ratios of VP1/VP2/VP3 by silver stain (Figure 1J) for each vector lot; however, the baculovirus-Sf9 vectors displayed additional truncated VPs, as seen by others previously.22 HCP impurities were present in all vectors, regardless of manufacturing platform, but they were different between platforms (Table S1). Of concern, we observed Sf9 insect HCP impurities with N-linked glycans (Table 1; Figures S2G and S2H). Given that humans can have acute allergenic reactions to non-mammalian N-glycans,11 the insect N-glycans found in baculovirus-Sf9-produced vector could pose potential risks. In addition, we performed cytokine profiling in primary human fibroblasts following transduction with each of the four vector lots by Luminex assay and demonstrated that responses were more similar for human-produced rAAV8 vectors than for baculovirus-Sf9 vectors (Figure S3). Specifications on the viral lots are outlined in Table S2.
Table 1.
Source | Sf9 HCP Impurity Peptides with N-linked Glycans | Unmodified Peptide Mass (Da) | Modified Peptide Mass (Da) | HexNAc | Hex | Me-HexA | Fuc |
---|---|---|---|---|---|---|---|
Cell lysate | NYTVELHELEALAK | 1,629.8483 | 2,668.2303 | 2 | 3 | 1 | |
2,871.3117 | 3 | 3 | 1 | ||||
2,506.1707 | 2 | 2 | 1 | ||||
2,696.2244 | 2 | 2 | 1 | 1 | |||
2,522.1711 | 2 | 3 | 1 | ||||
1,978.9856 | 1 | 1 | |||||
1,832.9276 | 1 | ||||||
NYTVELHELEALAAK | 1,700.8854 | 1,903.9647 | 1 | ||||
ANYTVELHELEALAK | 1,700.8854 | 1,903.9647 | 1 | ||||
N(+28)YTVELHELEALAK | 1,657.8796 | 1,860.9589 | 1 | ||||
2,696.2200 | 2 | 3 | 1 | ||||
Media | NYTVELHELEALAK | 1,,629.8483 | 2,668.2303 | 2 | 3 | 1 | |
2,871.3117 | 3 | 3 | 1 | ||||
2,506.1757 | 2 | 2 | 1 | ||||
1,978.9881 | 1 | 1 | |||||
2,522.1714 | 2 | 3 | |||||
2,696.2214 | 2 | 2 | 1 | 1 | |||
1,832.9298 | 1 | ||||||
KNYTVELHELEALAK | 1,757.9436 | 2,650.2657 | 2 | 3 | |||
2,796.3262 | 2 | 3 | 1 | ||||
2,999.3988 | 3 | 3 | 1 | ||||
AKNYTVELHELEALAK | 1,828.9803 | 2,721.3024 | 2 | 3 | |||
2,867.3606 | 2 | 3 | 1 | ||||
NYTVELHELEALANR | 1,771.8973 | 2,810.304 | 2 | 3 | 1 | ||
NYTVELHELEALNR | 1,700.8602 | 2,739.2646 | 2 | 3 | 1 | ||
2,593.2048 | 2 | 3 | |||||
NYTVELHELEALAAK | 1,700.8854 | 1,903.9647 | 1 | ||||
ANYTVELHELEALAK | 1,700.8854 | 1,903.9647 | 1 |
N-linked glycans (modified residue is shown in bold) are identified on the common Sf9 HCP impurity ferritin identified by LC-MS/MS for rAAV8 from both cell lysate and media-purified vectors. Excluded from the list are common process contaminants that occur in routine sample preparation (e.g., human keratin, trypsin), impurities with mutations/aberrations such that they did not map to known proteins by BLASTp search, and any modification that could not be site localized. Unmodified and modified peptide masses are shown, as well as the number of glycan moieties each modified mass represents. HexNAc, N-acetylhexoseamine; Hex, hexose; Me-HexA, methylated hexuronic acid; Fuc, fucose. No HCP impurity peptides in the human rAAV8 preparations were observed with N-linked glycans.
To determine whether the baculovirus-Sf9 rAAV results would be different with first-generation5 baculoviral constructs—as were used in the rAAV1-LPL trial23 for Glybera—we prepared another four lots of rAAV8, only now using first-generation baculoviral support constructs for the baculovirus-Sf9 productions. Here again, results with first-generation constructs were consistent with those reported for the former replicates using second-generation constructs (Figure S4; Table S3). Host cell ferritin contamination of suspension-grown rAAV preparations is known,24 and was seen in these baculovirus-Sf9 lots.
2D Gel Electrophoresis Independently Confirmed Vector Differences
To validate differences observed at the proteome level from LC-MS/MS, we performed 2D gel electrophoresis (in-solution isoelectric focusing followed by SDS-PAGE) on four paired rAAV8 vector samples from Figure 1. Briefly, samples were loaded into wells above a pre-cast pH gradient gel strip and run in the presence of current such that proteins migrated through the gel to the well where the isoelectric point (pI) matched the well pH. After migrating, samples in each well were removed and run in separate lanes on a standard SDS-PAGE gel to separate by size in the second dimension (Figure S5). Chemical differences affecting pI in capsid proteins manifest as lateral banding, and we observed VP1/VP2/VP3 proteins that migrated beyond the well corresponding to the natural pI of rAAV8 (pI 6.3)25 (Figures 1K and 1L). These bands reflect differences in PTMs or other unidentified modifications. In different conditions (pH/temperature), rAAV can undergo conformational changes due to phospholipase A2 catalytic domain flipping within VP1.25 However, this would not result in the observed lateral banding, as conformational changes do not significantly alter net charge, and thus pI, of a protein.26 Additionally, lateral banding was observed predominantly on VP3, which lacks the phospholipase domain. Of note, VP3 bands had substantially greater lateral banding than did VP1 or VP2, possibly due to its higher abundance. To rule out the potential influence of host cell proteases being responsible for the banding patterns seen by 2D gel electrophoresis, an independent control was run wherein rAAV8 samples were treated with and without protease inhibitors and run as before to ensure the banding patterns remained unchanged. No difference was seen between treatments (Figure S6A), confirming the previous findings that banding patterns were not due to contaminating proteases. To determine whether lateral banding resulted from rAAV cleavage products or substantial loss of PTMs, immunoblot analysis of the 2D gels was performed with an anti-AAV VP1/VP2/VP3 antibody.27 Immunoblots confirmed that lateral banding was rAAV capsid proteins and that vertical banding was likely attributable to HCPs (Figures S6B and S6C). To determine whether differences imparted by capsid PTMs could influence capsid stability, thermal melt curves were performed and no differences were found between baculovirus-Sf9 and human-produced rAAV8 (Figures 1M and S7A). All displayed standard rAAV8 melt curves initiating at 70°C and completing at 75°C, similar to rAAV8 vector lots from other academic and commercial providers (Figures S7B–S7D) for each platform. Of note, every rAAV serotype, regardless of method of production, has a unique capsid melting temperature (Tm) that can vary from as low as ∼72°C for rAAV2 to as high as ∼91°C for rAAV5, with other common serotypes falling in between (Figures S7B–S7D; Table S4).
Capsid Alterations and HCP Impurities Are Consistent across Serotypes
To understand whether the former results with rAAV8 were serotype-specific or inherent to each platform, an identical set of rAAV1 preparations was produced and subdivided into four separate groups for simultaneous purification as was done for rAAV8. LC-MS/MS results, with an average of 87.9% (human) and 83.4% (baculovirus-Sf9) capsid coverage, showed that rAAV1 capsids are also post-translationally modified, and that the types and frequencies of each again differed between the two platforms (Figures S8A–S8E). Four rAAV1 PTMs within the disordered N-terminal region of the capsid were shared between the human and baculovirus-rAAV1 vectors: N-terminal acetylation on the initial start methionine, methylation of lysine 61, and phosphorylation on serines 149 and 153. For the remaining residues 217–736 on rAAV1, we detected numerous PTMs within known antigenic motifs that react with known neutralizing antibodies against rAAV1 (456-AQNK-459, 492-TKTDNNNS-499, and 588-STDPATGDVH-597).28 One PTM was detected within the 4E4 neutralizing epitope residue (456–459 and 492–498),29,30 and several additional PTMs were detected near the 5H7 neutralizing epitopes (494, 496–499, 582, 583, 588–591, 593–595, and 597), as well as the ADK1a neutralizing epitope (500). Additionally, there are 11 lysines on the mapped structure of rAAV1 VP3 predicted to be capable of ubiquitination: K258, K459, K491, K493, K508, K528, K533, K545, K567, K666, and K707. In both human baculovirus-Sf9 rAAV1 productions we observed PTMs on two ubiquitinable lysines where K459 and K528 were methylated. Here again, HCP impurities were present and different between platforms (Table S5), and Sf9 insect HCP impurities were detected with N-linked glycans (Table S6). Specifications on the viral lots are outlined in Table S7. Negative staining and TEM imaging confirmed that, despite identical purification procedures, human-produced rAAV1 had fewer HCP impurities and reproducibly full capsids compared to baculovirus-Sf9 vector preparations (Figures S8F–S8I). We observed normal VP1/VP2/VP3 ratios by silver stain (Figure S8J), and again we did not detect significant differences in thermal capsid stability (Figures S7E and S8K).
rAAV Capsids Are Structurally Similar Regardless of Production Method
Having demonstrated differences in composition at both the capsid PTM and vector lot impurity levels when produced by different production platforms, we next wanted to assess whether global capsid structures differed by transmission electron cryomicroscopy (cryo-EM). To fairly assess potential differences, we first needed comparable structures from identically purified vector lots subjected to cryo-EM using the same methodology. Thus, we generated four new cryo-EM structures of full and empty rAAV8 capsids from each manufacturing platform (Table S8). Despite differential PTM deposition, no significant differences were observed between the averaged structures at the resolutions achieved (3.3–3.6 Å) (Figures 2 and S9). The four new capsid structures have been deposited in both the Protein Data Bank (PDB) and Electron Microscopy Data Bank (EMDB): human full AAV8 (PDB: 6PWA, EMD: 20502), human empty AAV8 (PDB: 6U20, EMD: 20615), baculovirus-Sf9 full AAV8 (PDB: 6U2V, EMD: 20626), and baculovirus-Sf9 empty AAV8 (PDB: 6UBM, EMD: 20710).
Common rAAV Serotypes Have Capsid PTMs and HCP Impurities
After having shown that standard rAAV production methods produce capsids decorated with PTMs, we wanted to determine whether this was specific to the two serotypes tested or a generalizable phenomenon. To assess the presence and/or absence of capsid PTMs across serotypes, we sourced a panel of common rAAV serotypes 1–8, produced in either the human or baculovirus-Sf9 platforms (Table S9) from a single manufacturer to minimize potential production variation, and performed LC-MS/MS. We observed that all rAAV capsids, regardless of serotype or manufacturing platform, possessed capsid PTMs (Table S10). Additionally, all vector lots had HCP impurities (Table S11), and some HCPs were detected with N-glycosylation (Table S12).
Capsid PTM Deposition and HCP Impurities Are Universally Seen across Manufacturing Platforms, Purification Methods, Facilities, and Scale
Up to this point, vector lots from a single manufacturing facility in highly controlled production and purification settings were assessed. We next wanted to determine whether these findings were generalizable across different production batches and manufacturing sites. To examine this, we sourced rAAV vector lots from an array of production facilities spanning private industry, leading academic core facilities, government consortia, and individual academic laboratories (Table S13). Here the intent was not to rigorously control the production and purification parameters, but rather to determine the extent of modifications and impurities with samples obtained from vendors supplying vectors to scientists working at every stage in the R&D pipeline. LC-MS/MS demonstrated that capsid PTMs and HCP impurities were universal (Table S15), regardless of the scale of manufacturing, purification method used, manufacturing facility, or production platform. Cumulative comparative HCP impurity analyses on all vector lots tested to date using gene ontology analyses highlighted trends in HCP impurity types (Figure S10; Tables S16, S17, and S18). Interestingly, the most common human impurities were related to nucleic acid and protein binding for RNA processing, the most common insect impurities were related to endopeptidase activity and proteolysis, while baculoviral impurities were either structural in nature or pertained to viral escape.
Lastly, despite the presence of putative endoplasmic reticulum (ER) signal peptides31 at the N terminus of VP1 on all common rAAV serotypes (Table S19), our LC-MS/MS data show that rAAVs are often N-terminally acetylated, a PTM known to block ER translocation of polypeptides in eukaryotes.32 The potential ER exclusion is consistent with the glycosylation patterns seen on rAAV capsids that have been observed to have simple O-GlcNAc modifications (deposited in the nucleus/cytoplasm), but lack N-glycans or complex O-glycans (deposited in the ER/Golgi).
rAAV Genomes Are Differentially Methylated during Production
Given the numerous unexpected differences in rAAV vectors at the proteomic level, we hypothesized whether similar differences may also be occurring at the epigenomic level. We were particularly interested in quantifying genome methylation, given its known role in gene regulation, which could impact vector potency. However, similar to our discovery of capsid PTMs presented earlier, no literature existed on whether rAAV genomes could be methylated during packaging. In this study, we sought to determine whether packaged rAAV genomes were capable of being epigenetically methylated, and, if so, whether this differed between production platforms. We used whole-genome bisulfite sequencing (WGBS) to generate genome-wide methylation maps of the rAAV genome at single-base resolution. Single-stranded DNA (ssDNA) for WGBS was isolated from rAAV vectors from each production platform, and unmethylated bacteriophage lambda DNA was spiked into each rAAV sample prior to bisulfite conversion. Sequencing generated 17.3 million raw reads that were subsequently mapped to the rAAV transfer vector and reference genomes. We discovered that rAAV genomes are indeed methylated and that the cytosine methylation was distributed along the entire genome (Figure 3). Overall methylation levels were low (∼1%) and similar between the two platforms. However, three sites in particular showed statistically significant differential methylation between vectors from the two platforms: repressive promoter methylation at C:1208 and poly(A) methylation at C:3779 was significantly higher in baculovirus-rAAV, while activating intragenic methylation at C:2970 was significantly higher in human-rAAV.
Human-Produced rAAV Exhibits Significantly Greater Potency In Vitro
Having established that capsid PTMs, HCP impurities, and epigenomic methylation were key differences in rAAV vector lots, we next sought to quantify vector potency differences between manufacturing platforms. We set up comparative expression experiments in a panel of cell lines spanning different species, cell types, differentiation states, and immortalization statuses. Immortalized human HEK293T and Huh7 cells, primary human fetal fibroblasts, primary human induced pluripotent stem cell (iPSCs), and immortalized mouse C2C12 myoblasts were transduced with rAAV1 firefly luciferase (FLuc) vectors to measure potential expression differences. FLuc assays confirmed significant potency differences in all cases, regardless of species or cell type, with human-produced rAAV vectors outperforming insect-produced rAAV vectors in all cases (p < 0.05–0.0001) (Figure 4A).
To validate that expression differences were not due to differential packaging efficiency, we developed a next-generation sequencing (NGS) protocol called Fast-Seq33 to sequence packaged rAAV genomes, including ITRs. Fast-Seq validated that all vectors used in the potency assessments had no difference in the packaged genomic sequence between any vector lot by either production method (Figure S11; Tables S20). These findings substantiate that observed expression differences were not due to differences in genome sequence or integrity.
To assess whether HCP impurities and/or empty capsids were poisoning the potency of the baculovirus-Sf9 preparations, we set up an in vitro experiment with rAAV1-FLuc wherein the total capsid MOI was kept constant and standard genome-containing capsids were mixed with purified lots containing only empty capsids and their associated HCPs at different ratios (100:0, 90:10, 50:50, 10:90, 0:100) to determine whether the signal drop-off would correlate with the percent of empty vector and/or HCP impurities. Human-rAAV1 expression dropped less quickly (Figure 4B) than did baculovirus-produced rAAV1 (Figure 4C) when >10% of the total vector material came from empty vector/HCP impurities. To further probe the influence of insect/baculoviral impurities, we set up a similar experiment wherein the capsid MOI was not fixed but we spiked empty capsids and HCPs on top of the existing 30,000 full baculoviral capsids and again similarly assessed transduction. If insect/baculoviral HCP impurities and/or empty capsids were not influencing potency, then the spike-ins should have no effect. However, as is seen with 10% or 100% baculoviral empty spike-ins, we observed a significant loss of expression from the full capsids (Figure 4D). However, similar spike-ins with human empties had no effect, and this also demonstrated that empty capsids where not merely outcompeting full capsids for available receptors. Taken together, these experiments support the hypothesis that the presence of insect/baculoviral impurities and/or empty capsids in baculovirus-Sf9 preparations is influencing the potency of these vector preparations.
Human-rAAV Has Significantly Greater Muscle Transduction In Vivo
To assess potency differences in vivo, we set up a transduction comparison with the following parameters: (1) age-matched mice to prevent confounding effects of age-related transduction; (2) mice of each sex were treated; (3) the same ssAAV-cytomegalovirus (CMV)-FLuc-SV40pA transfer vector was used to facilitate comparisons between production methods and manufacturers; (4) vectors were sourced from two popular manufacturers for each production platform (viral lot specifications in Table S22); (5) all mice were injected intramuscularly (i.m.) by the same person on the same day and housed in their own cage per treatment group to prevent cross-contamination by vector shedding; (6) we used the same serotype (rAAV1) and method of injection (i.m.) as was used clinically in the trials for Glybera;12 and (7) a low dose of 5e10 vg/mouse was chosen, as this is sufficient to provide adequate FLuc signal but not so high that the maximal expression would already be reached, thereby masking potential potency differences. Twelve mice (six male, six female) were injected i.m. and live imaged weekly for FLuc expression (Figures 4E and S12A). Time course results revealed that human-rAAV1 achieved significantly higher skeletal muscle expression than did baculovirus-rAAV1 in both sexes (p < 0.0001 at week 8) (Figures 4G, S12C, and S12D).
To determine whether potency differences were restricted to i.m. delivery of rAAV1, we repeated the experiment with rAAV8, a serotype currently being tested as an i.m. HIV passive vaccine vector,34 despite reports of poor transduction in human skeletal muscle35 and other tissues,36,37 as well as high neutralizing antibody levels in HIV+ individuals.38 Here again, 12 mice (six male, six female) were injected i.m. with 5e10 vg of ssAAV8-CMV-FLuc vector and live imaged weekly for FLuc expression (Figures 4F and S12B). Human-rAAV8 achieved significantly higher expression than did baculovirus-rAAV8 in both sexes (p < 0.0001 at week 8) (Figures 4H, S12C, and S12D).
Significant Differential Potency and Sexually Dimorphic Functional Transduction Were Observed with rAAV in Mouse Liver In Vivo
Administration of rAAV i.m. is typically only done when treating muscle disorders or when using muscle as a secretion depot for passive vaccines.35 Many gene therapy trials administer vector intravenously (i.v.) to facilitate distribution to internal organs not easily accessible by direct injection. Thus, we next assessed whether the potency differences seen with i.m. administration would be recapitulated with i.v. administration of human- and baculovirus-Sf9-produced rAAV. For this deeper set of comparisons, we utilized vector lots produced from a single facility and our custom dual-use transfer vector (Figure S1) to even more stringently control for variations between the two production platforms. Twenty-four mice (12 male, 12 female) were injected i.v. with 5e10 vg/mouse of ssAAV1-EF1α-FLuc produced in either the human or baculovirus-Sf9 platform and live imaged weekly for FLuc expression (Figure 5A). Results highlighted several key findings. First, human-rAAV1 achieved significantly higher FLuc expression than did baculovirus-rAAV1 (p < 0.005–0.0001 at week 4) (Figure 5C). Second, significant sexually dimorphic differences were seen where males experienced greater functional liver transduction than did females (p < 0.04–0.0001). Although sexually dimorphic liver transduction has been observed previously with human-produced rAAV,39 we think that this is the first observation of this phenomenon with baculovirus-Sf9 rAAV.
To determine whether the significant functional and sexually dimorphic hepatic expression differences were restricted to rAAV1, we repeated the same in vivo experiment with rAAV8. Here again, 24 mice (12 male, 12 female) were injected i.v. with 5e10 vg/mouse of ssAAV8-EF1α-FLuc produced in either platform and live imaged weekly for FLuc expression (Figure 5B). The results demonstrated that our previous differential potency findings with i.v. rAAV1 were not serotype-specific, as they replicated with rAAV8 (p < 0.03–0.0001) (Figure 5D). Significant sexual dimorphism was again observed, with males having significantly greater functional liver transduction than females (p < 0.009–0.0001).
Human-rAAV Has Significantly Greater Potency in Human Liver In Vivo
To determine whether the significant potency differences seen in mouse liver following i.v. administration would also be seen in humans, we repeated the same in vivo experiment but now using humanized liver mice. Sixteen humanized liver mice (eight male, eight female) were produced where male mice were transplanted with human male hepatocytes and female mice were transplanted with human female hepatocytes to >80% human liver repopulation (Figure 6A; Table S24). All humanized liver mice were injected i.v. with 5e10 vg/mouse of ssAAV1-EF1α-FLuc produced in either platform and live imaged for FLuc expression (Figure 6B). Results demonstrated that human-produced rAAV1 achieved significantly higher functional human liver transduction than did baculovirus-Sf9-produced rAAV1 (p < 0.005 at week 8) in male humanized mice. The same trend was observed in the female cohort, albeit with a higher p value (p < 0.08 at week 8) due to decreased power from the loss of two female subjects during the experiment (one female died post-transplant before AAV administration and one female had a failed tail vein injection of AAV). These results suggest that our previous findings with i.v. administered rAAV in mouse liver were not species-specific, as they were replicated in human liver in vivo as well. Note that these experiments were not repeated with rAAV8 given previous studies demonstrating that rAAV8 does not functionally transduce human liver in vivo.36,37,40, 41, 42
Discussion
Of the nearly 400 FDA-approved protein-containing therapies in use in the United States—e.g., enzymes, growth factors, interferons, hormones, blood products, vaccines, antibodies, anti-venoms, anti-toxins, immunoglobulins, gene therapies, and cell therapies—only two (both vaccines) are produced in any insect manufacturing system (Table S25).43,44 Within the rAAV gene therapy space, the human and insect manufacturing systems are predominantly used for production. Two prior studies comparing vectors produced in human and insect cells have been published by groups who were involved in developing the insect system,45,46 and we think that our study is the first independent comparative analysis of vectors from the two systems.
Clinicians, patients, and regulators need to be confident in the safety and efficacy of rAAV for gene delivery. One safety concern is the potential for immunotoxicity against insect impurities.47 Indeed, several recent severe adverse events with fever8,9 occurred following high-dose rAAV administration with baculovirus-Sf9-produced vector. Given the recent explosion of rAAV use—in trials for gene therapy, passive vaccines, and as a critical delivery agent in the rapidly expanding gene editing space—a thorough characterization of rAAV produced using different manufacturing methods was warranted.
Here, we summarize our discoveries and those findings with critical implications for the field of gene therapy as follows: when given at the same vector genome dose, human-produced rAAVs are more potent than baculovirus-Sf9 rAAV vectors both in vitro and in numerous mouse and human tissues in vivo; rAAV capsids can be post-translationally modified (acetylation, methylation, phosphorylation, O-GlcNAcylation), and these capsid PTMs differ when produced in the baculovirus-Sf9 and human production platforms; deamidation degradation marks are universal; HCP impurities differ in vector lots between platforms and can have their own PTMs, including potentially immunogenic N-linked glycans; capsid PTMs and HCP impurities were seen across all rAAV serotypes, manufacturers, purification types, and different generations of the manufacturing technologies; packaged rAAV genomes are epigenetically methylated in both production platforms, and baculovirus-Sf9 rAAV has significantly more methylation in key repressive regions; lastly, regardless of the manufacturing platform, functional rAAV transduction is sexually dimorphic in the liver when administered i.v.
It is known that human and insect cells have different capacities to post-translationally modify and fold proteins.10 However, we were initially hesitant to speculate on rAAV capsid PTM presence given a publication from 2006 stating that there were none.48 We noted several key differences between this study and ours. First, they used wild-type AAV2 produced in HeLa cells with wild-type replication-competent adenovirus 2 (Ad-2). Modern rAAV production now produces vector in HEK293 cells and provides the adenoviral helper function from adenovirus 5 (Ad-5) in trans from plasmids.4 Collectively, the differences in cell type (HeLa versus HEK293), method of helper viral administration (live Ad-2 versus Ad-5 plasmid), and type of AAV (wild-type AAV versus rAAV) made us question whether the AAV particles that each of us assessed were comparable. Indeed, with these differences and using more sensitive LC-MS/MS instruments, we observed numerous rAAV capsid PTMs. Additionally, there have been several recent attempts16,45,46,49 to characterize rAAV vector lots; however, each study had limitations in technology or scope, and called for the deeper research performed herein. Our study used orthogonal methods to deeply characterize and validate differences in a sensitive and unbiased manner.
Human HEK293 and insect Sf9 cells are both capable of methylating recombinant DNAs produced within them, given that each expresses the known methyltransferase machinery DNMT1 and DNMT3. However, no gene therapy literature existed on whether rAAV genomes are methylated when produced in either cell line. In eukaryotes, cytosine methylation is functionally used to regulate genomic expression. Promoter methylation, similar to what we observed on baculovirus-Sf9-produced rAAV, is typically used to transcriptionally repress the associated gene.50,51 In contrast, intragenic methylation, similar to what we observed on human-produced rAAV, is typically associated with active transcription.50,52,53 Thus, our data both support that rAAV genomes can be methylated during production, and that they are differentially methylated in key regions depending on the platform used to produce them, with human-produced rAAVs having a more favorable methylation pattern for vector potency.
As we designed our experiments, we had to consider limitations as to which experimental parameters we could control. First, while clinical trials use good manufacturing practice (GMP)-grade vector, it is not possible to acquire aliquots of existing GMP lots as an external investigator, and the cost to produce even a single GMP lot renders cross-comparative paired studies, as performed here, prohibitive if not impossible. Thus, although we used preclinical grade vector, we ensured that it was of the highest possible purity and was identically purified for each production platform. However, it is important to recognize that GMP vectors may or may not exhibit the differences we observed in the present study.
Several technical challenges prevented us from performing desired enzymatic experiments to add/remove PTMs to intact capsids to assess their contributions to potency. For example, there are no pan-demethylase or pan-deacetylase enzymes that work on intact proteins. Similarly, deglycosidases such as PNGase-F perform optimally on denatured proteins, making it difficult to assess whether capsid glycans influence vector potency without altering capsid integrity. Furthermore, many of these modifications are on the lumenal face and are not accessible. One could set up a number of mutational analyses knocking out each host gene encoding each respective transferase for each PTM; however, given the number of different PTMs, there are too many genes to knock out simultaneously without negatively affecting host cell viability.
Importantly, although we observed differences in PTMs on baculovirus-Sf9 and human-produced vector lots by LC-MS/MS, note that PTM absence does not indicate that one was not present. Additionally, observed PTMs were substoichiometric, with most identified peptides being unmodified. Furthermore, labile modifications may be lost during sample preparation or ionization prior to detection. Quantifying PTM frequency is challenging. In many cases, PTMs are not site-localizable, which can dramatically sway accuracy of quantitative evaluations of site occupancy. Additionally, different peptides from an individual protein can differ in cleavage efficiency. Thus, the list we have reported herein should not be considered exhaustive, but, rather, partially complete. A requirement for inclusion in our list of observed PTMs was that the modification must be site-localizable. To ensure that our spectra were correctly assigned via automated database searching, every spectrum matching to a post-translationally modified peptide was also manually validated.
The field would benefit from a continued examination of the influence of production variables on vector lot composition, stability, potency, and safety. These include factors such as suspension versus adherent cell culture, different purification methods, vector age, and more. For example, what effect does the use of live viruses such as baculovirus (or herpes simplex virus) have during production? The initial motivation to adopt the baculovirus-Sf9 production system stemmed from poor vector yields in adherent HEK293 cells. However, yields from new suspension-adapted HEK293 cells boost vector yields up to 1e5 vg/cell,24 higher than that achieved with the baculovirus-Sf9 system at 7e4 vg/cell.5, 6, 7 Additional parameters that could be enhanced include improving continuous rAAV collection from suspension media,54 boosting host expression of key production factors,55 engineering other mammalian lines to safely and efficaciously produce at high yield, developing tunable cell-free methods to produce rAAV, and beyond. Given the catalog of observed differences between the two rAAV vector production platforms characterized herein, a continued investigation into the implications of these differences for the clinical and research communities is warranted.
Materials and Methods
Generation of pFB.AAV-EF1ɑ-FLuc-WPRE-hGHpA_BAC-293 Dual-Use Transfer Vector Plasmid
pAAV-EF1ɑ-FLuc-WPRE-hGHpA transfer vector plasmid37 (Addgene, catalog no. 87951) was cut with SbfI dual-cutter just beyond each ITR to create a 4.3-kb fragment. To prepare the universal shuttle backbone, pFB.AAV-msc_bGHpA (no. G0202 from the University of Iowa) was also cut with SbfI, and the resulting purified backbone was ligated to the 4.3-kb transfer vector fragment to create the 8.1-kb final construct. All components for insect replication (gentamicin-R and Tn7 transposition sites) and bacterial replication (ampicillin-R) are present on the dual-use backbone. Academic material transfer agreements (MTAs) to N.K.P. at the University of California San Francisco (UCSF) were acquired from The Salk Institute for WPRE use, and the University of Iowa for pFB.AAV-msc_bGHpA plasmid backbone. The final pFB.AAV-EF1ɑ-FLuc-WPRE-hGHpA_BAC-293 plasmid has been deposited at Addgene (catalog no. 118412). Figure S1A was made with SnapGene v4.2.11 and Figure S1B was made with GraphPad Prism v8.0.1.
Plasmid Transfection and FLuc Expression Assessment to Validate pFB.AAV-EF1ɑ-FLuc-WPRE-hGHpA_BAC-293
Each well of a six-well plate was seeded with 500,000 HEK293T cells (ATCC, catalog no. CRL-3216). Once cells reached 80% confluency, equimolar concentrations of plasmid (custom plasmid and control plasmids) were transfected via Ca3(PO4)2 transient transfection, and media were changed after 8 h. Cells were lysed 48 h later for FLuc quantitation using a Luciferase 1000 assay system kit (Promega, catalog no. E4550) according to the manufacturer’s instructions and read on a Veritas luminometer. Experiments were performed in technical triplicate.
Generation of rAAV Bacmids for Baculovirus Production
A custom pFB.AAV-EF1ɑ-FLuc-WPRE-hGHpA bacmid was produced using the Bac-to-Bac production system where MAX Efficiency DH10Bac E. coli (Thermo Fisher Scientific, catalog no. 10361012) contained a baculovirus shuttle vector that can recombine with a donor plasmid, pFastBac (pFB), to create an expression bacmid containing the cloned construct for eventual rAAV transfer vector production. The donor plasmid was the custom pFB.AAV-EF1ɑ-FLuc-WPRE-hGHpA_BAC-293 construct described above. 10 ng of pFB.AAV-EF1ɑ-FLuc-WPRE-hGHpA_BAC-293 donor plasmid was transposed into 100 mL of DH10Bac-competent cells and plated on DHBac10 Luria-Bertani (LB) agar plates for 48–72 h at 37°C in a non-CO2 incubator. Plates with colonies were kept at 4°C thereafter until ready for use. A single large isolated transposed white colony was picked and re-streaked for isolation on another DHBac10 LB agar plate. The confirmation colony was inoculated into DHBac10 LB broth, and the bacmid was grown for 20 h in a 200-rpm shaking incubator at 37°C. Bacmid DNA was isolated with a NucleoBond Xtra Midi EF kit (Macherey-Nagel, catalog. no. 740.420.50), resuspended in TE (Tris with EDTA) buffer, and stored in aliquots at −20°C. The packaging bacmids for generating baculoviruses for producing different serotypes of rAAV were produced as described above, except using pFB donor plasmids encoding AAV2 Rep and the capsid of interest (pFB-AAV-Rep2/Cap8 or pFB-AAV-Rep2/Cap1) from the University of Iowa.
Generation of Baculoviral Lots for rAAV Production
For BAC-AAV-EF1ɑ-FLuc-WPRE-hGHpA and BAC-AAV-Rep2/Cap1 or Cap8, 2 × 106 Sf9 cells (Expression Systems, catalog. no. 94-001S) were seeded per well in a six-well plate in protein-free ESF 921 media (Expression Systems, catalog. no. 96-001) with no additives or antibiotics, using cells from a 3-day-old suspension culture in mid-log phase (95%–100% visual viability). Cells were allowed to attach at 27°C for 1 h (∼80% confluency). The transfection solution was prepared with 45 mg of purified bacmid in 600 mL of media, and mixed with 36 mL of Cellfectin (Invitrogen, catalog no. 10362-010) in 600 mL of media, mixed, and incubated at room temperature (RT) for 30 min. Adherent cells were washed and then transfected with 700 mL per well of transfection solution diluted in 3 mL of media. Cells were incubated for 5 h in a 27°C incubator in a humidity chamber. Following incubation, transfection mix was removed and 2 mL of fresh media was added per well before incubation for 5 days at 27°C. Resultant baculovirus was harvested from the media supernatant and mixed with serum to (final) 5% for storage. Media were spun at 500 × g for 5 min to pellet any cellular debris. Solution was filtered through a Millex 0.22-μm sterile syringe filter with polyethersulfone (PES) membrane (EMD Millipore, catalog. no. SLGP033RB) and aliquoted. Baculoviral titers were determined using a BackPAK baculovirus rapid titer kit (Clontech, catalog. no. 631406).
Production of rAAV from Baculovirus Infection of Sf9 Cells for Paired Productions
For rAAV production, 2 × 106 suspension Sf9 cells/mL were seeded in 400 mL of ESF 921 media in roller bottles. 24 h later, cells were counted for infection calculations and to ensure high viability. 100 mL of fresh media was added to each flask prior to infection. Baculoviral aliquots were thawed, resuspended, and an MOI of 0.5 of each baculovirus (transfer vector virus and pseudotyping virus) was added directly to the media and bottles were returned to the incubator for 72 h at 27°C. For production of the special empty capsid lots, the transfer vector baculovirus was omitted and only the pseudotyping baculovirus was used to ensure that no transfer vector was present to package.
Production of rAAV in HEK293FT Cells for Paired Productions
rAAV vector productions were produced in 40 plates (15 cm) using a Ca3(PO4)2 transient triple transfection protocol in adherent HEK293FT cells (Thermo Fisher Scientific, catalog. no. R70007) in DMEM + 10% fetal bovine serum (FBS) + 1% penicillin/streptomycin. Plasmids included the following: pHelper (1,000 mg/40 plates), pFB.AAV-EF1ɑ-FLuc-WPRE-hGHpA_BAC-293 transfer vector plasmid (500 mg/40 plates), and a pseudotyping plasmid (pRep2/Cap8 or pRep2/Cap1; 1,500 mg/40 plates). Transfection solution was incubated with cells for 16 h at 37°C, and then the media were changed to serum-free Expi293 media (Thermo Fisher Scientific, catalog no. A14351) for the remaining 60 h of incubation at 37°C in 5% CO2 for a total incubation of 76 h. For production of the special empty capsid lots, the transfer vector plasmid was omitted and only the pHelper and pseudotyping plasmids were used to ensure no transfer vector was present to package.
Universal Harvest Protocol for Isolating rAAV from Cell Lysates and Media Supernatant from Both Human and Baculovirus-Sf9 Production Platforms for Paired Productions
To isolate rAAV particles from cell lysates and media supernatant, the following protocols were followed. For cell-purified rAAV, cell pellets were lysed with a 12-min glass bead vortex, treated with 100 U/mL Benzonase (Sigma, catalog. no. E8263-25KU) for 30 min at 37°C, followed by addition of 10% sodium deoxycholate and another 30-min 37°C incubation to dissociate particles from membranes. Then, they were iodixanol gradient purified in OptiSeal polyallomer tubes (Beckman Coulter, catalog. no. 362183) in a VTi-50 rotor spun at 40,000 rpm for 1 h 50 min at 18°C in an Optima LE-80K Beckman Coulter ultracentrifuge, further purified by MustangQ ion-exchange chromatography on a peristaltic pump, with buffer exchange through an Amicon Ultra-15 centrifugal filter concentrator (Millipore, catalog no. UFC910096) into 1× Dulbecco’s PBS (DPBS) (pH 7.4) + 180 mM NaCl + 0.001% Lutrol (v/v). For media-purified rAAV, media were first put through a 100-kDa molecular weight cutoff (MWCO) filter (Spectrum, catalog no. S02-E100-10N) and then tangential flow filtered at 400 mL/min with transmembrane pressure (TMP) of 4.0. Triton X-100 was then added to 0.5% (final) and incubated in a shaking water bath at 37°C for 1 h, after which solution was treated with 200 U/L TurboNuclease (Sigma, catalog no. T4330-50KU) for 1 h at 37°C and then iodixanol gradient purified as described above, followed by MustangQ ion exchange chromatography as described above, and buffer exchanged as described above. rAAV vectors were simultaneously titered by TaqMan qPCR (on the same plate) within the hGHpA element with the following primer/probe set: forward, 5′-TGTCTGACTAGGTGTCCTTCTA-3′, reverse, 5′-CTCCAGCTTGGTTCCCAATA-3′, probe, 5′-6-FAM-AAGTTGGGAAGACAACCTGTAGGGC-IBFQ-3′.
Production of rAAV in HEK293T Cells for Non-paired Productions
rAAV vector productions were produced using a Ca3(PO4)2 transient triple transfection protocol in adherent HEK293T cells (ATCC, catalog no. CRL-3216) followed by double cesium chloride density gradient purification and dialysis as previously described,35,38 and resuspended in DPBS with 5% sorbitol (w/v) and 0.001% Pluronic F-68 (v/v). Plasmids included the following: pAd5 helper, rAAV transfer vector (ssAAV-CAG-TdTomato, Addgene #59462), and pseudotyping plasmids for each capsid of interest. rAAV vectors were titered by TaqMan qPCR within the CAG promoter with the following primer/probe set: forward, 5′-GTTACTCCCACAGGTGAGC-3′, reverse, 5′-AGAAACAAGCCGTCATTAAACC-3′, probe, 5′-FAM-CTTCTCCTCCGGGCTGTAATTAGC-BHQ-1-3′.
2D Gel Electrophoresis
Protein samples for each production platform were fractionated by 2D gel electrophoresis. To determine whether differences between gels resulted from differences in capsid PTMs, half of the samples were treated with a broad-spectrum lambda protein phosphatase (NEB, catalog no. P0753L) and PNGase-F (Promega, catalog no. V4831) prior to 2D gel electrophoresis. 50 mg of total protein from each production platform was used per preparation. Total sample volumes were adjusted to 200 μL after addition of 20 μL each of the 10× MgCl2 and 10× phosphatase buffer (supplied with enzyme) per tube (regardless of enzyme treatment). 1.25 μL of enzyme was added to the phosphatase-treated samples, with the remainder being water. Samples were gently mixed and incubated for 2 h at 30°C at 1,000 rpm. 100 mM dithiothreitol (DTT) in water (Bio-World, catalog no. 404200001) was then added to each tube to a final concentration of 5 mM, and samples were heated at 95°C for 5 min. Samples were cooled to 25°C before adding 5 μL of PNGase-F and incubated at 37°C overnight at 1,000 rpm. To separate proteins by their pIs, samples were then loaded in equal volumes into 24 wells of an Agilent 3100 OFFGEL fractionator using a high-resolution fractionation kit (Agilent Technologies, catalog no. 5067-0201). Each sample, with and without enzyme treatment, was run on a separate lane of a 24-well gel strip with a linear gradient from pH 3 to 10 following the manufacturer’s instructions. Runs were completed upon reaching a total of 64 kW-h. Samples from each well were concentrated to ∼30 μL with Amicon Ultra 10-kDa molecular weight cutoff spin filters (EMD Millipore, catalog no. UFC501008) for 2 h before loading into a 4%–12% Bis-Tris SDS-PAGE gel (Invitrogen, catalog no. NP0322BOX). Gels were washed, fixed, and stained overnight with SYPRO Ruby protein gel stain (Fisher Scientific, catalog no. S12000) following the manufacturer’s instructions. Gels were then washed and imaged with a UV transilluminator. To test whether proteases were present (in either the rAAV lots, phosphatase, or PNGase-F reagents) and responsible for the banding patterns seen in Figure S6, samples were prepared as above, but with or without pre-treatment with protease inhibitor (Roche, catalog no. 04693159001) in combination with phosphatase treatment, PNGase-F treatment, or both. Lactoferrin was used as a positive glycoprotein control.
Subsequent Western Blots on 2D Gel Electrophoresis SDS-PAGE Gels
Gels were transferred onto nitrocellulose membranes (Bio-Rad, catalog no. 1620112) with a Trans-Blot Turbo transfer system (Bio-Rad), blocked with Odyssey blocking buffer (TBS) (LI-COR Biosciences, catalog no. 927-50000), blotted with rabbit anti-AAV-VP1/VP2/VP3 polyclonal antibody (ARP, catalog no. 03-61084) at 1:200 overnight at 4°C, and then washed with PBS-T (PBS with Tween 20) and incubated with secondary goat anti-rabbit immunoglobulin G (IgG) IRDye 800CW (LI-COR Biosciences, catalog no. 926-32211) at 1:10,000 for 1 h at 25°C. The membranes were then washed with PBS-T and visualized on a LI-COR Odyssey CLx infrared imaging system. Imaging analysis was performed using the LI-COR Image Studio Lite software.
Proteolytic Digests of rAAV Samples for LC-MS/MS
Protein preparations for paired rAAV productions were done using equal amounts of total protein (10–50 μg depending on the run). Each sample was precipitated using 4× vol of LC-grade acetone at −80°C overnight. Precipitated protein was centrifuged at 12,500 × g for 15 min at 4°C. Supernatant was decanted and discarded and the protein pellet dried within a biosafety hood for 30 min. The pellet was resuspended in 100 μL of 0.2% Protease Max surfactant trypsin enhancer (Promega, catalog. no. V2071), 50 mM ABC buffer, and reduced using DTT to a final concentration of 10 mM at 55°C for 30 min. Following reduction, proteins were alkylated with 20 mM propionamide at 25°C for 30 min. For those samples undergoing endoglycosidase H (Endo-H) treatment, samples were heated to 95°C for 5 min, followed by the addition of 5 μL of Endo-H reaction buffer and 5 μL of Endo-H (Promega, catalog no. PRV4875, 2,500 U). Deglycosylation proceeded for 4 h at 37°C. Endo-H is a recombinant glycosidase that hydrolyses the bond connecting the two GlcNAc groups comprised in the chitobiose core, leaving a single GlcNAc covalently bound for mass spectrometry detection. 500 ng of trypsin/Lys-C mix (Promega, catalog no. V5072) was added and allowed to digest overnight for ∼18 h at 37°C. The digest was acidified to 1% formic acid to quench protease activity. Peptides were purified on C18 MonoSpin SPE columns (GL Sciences, catalog no. 5010-21701), and peptide pools were dried to completion in a speed vac.
LC-MS/MS for Non-glycosylated Peptides
Peptide pools were reconstituted in 0.2% formic acid, 2% acetonitrile, and water and then injected onto a Waters M-class ultra-performance liquid chromatography (UPLC). The analytical column was pulled and packed in-house using a ReproSil-Pur C18-AQ 2.4 μM resin matrix (Dr. Maisch GmbH HPLC), at approximately 25 cm in length. The flow rate was 300 or 450 nL/min where mobile phase A was 0.2% formic acid in water and mobile phase B was 0.2% formic acid in acetonitrile. The mass spectrometer used was an Orbitrap Fusion Tribrid (Thermo Fisher Scientific) set to acquire data in a dependent fashion using multiple fragmentation types, including collision-induced dissociation (CID), higher-energy collisional dissociation (HCD), and electron-transfer dissociation (ETD). The instrument parameters used in this study were the following: CID data were generated when the precursor mass resolution was 120,000 with an m/z window of 400–1,500, and charge states of 2+, 3+, and 4+ were sampled. The precursor automated gain control (AGC) settings were 200,000 ions, and the “fastest” mode was used for MS/MS in the ion trap. The ion trap sampled the most intense precursors where the isolation window was set at 1.6 Da and the collision energy at 35. In an HCD experiment, the settings were the same as for CID except a normalized collision energy of 42 was used and fragment ions were sent to the Orbitrap for detection. Finally, in ETD/HCD-triggered experiments, the precursor mass resolution was 120,000, with an m/z window of 400–1600, and charge states of 2–6+ were sequenced and the precursor AGC target was 400,000. The instrument alternated sequencing the same precursor masses first by ETD, where the settings were a 50-ms reaction time and 200,000 reagent target, and a maximum ETD time of 200 ms. Fragment ions were read out in the ion trap in a centroid fashion. HCD had a collision energy of 32 and a maximum inject time of 54 ms where fragment ions were read out of the Orbitrap in a centroid fashion. The isolation window for both the fragmentation types was 1.2 Da, and the data-dependent acquisition exclusion settings allowed for sampling of the same precursor three times before it was placed on the exclusion list. The exclusion list was set to 10 ppm and in the ETD/HCD trigger experiment timed out after 10 s.
Triggered LC-MS/MS for Glycosylated Peptides
Samples were analyzed by online nanoflow LC-MS/MS using an Orbitrap Fusion Tribrid mass spectrometer (Thermo Fisher Scientific) coupled to a Dionex Ultimate 3000 HPLC (Thermo Fisher Scientific). Samples were loaded via an autosampler onto a 20-μL sample loop and injected at 0.3 μL/min onto a 75-μm × 150-mm EASY-Spray column (Thermo Fisher Scientific) containing 2-μm C18 beads. Columns were held at 40°C using a column heater in the EASY-Spray ionization source (Thermo Fisher Scientific). Samples were eluted at 0.3 μL/min using a 90-min gradient and 185-min instrument method. Solvent A was comprised of 0.1% formic acid in water, whereas solvent B was 0.1% formic acid in acetonitrile. The gradient profile was as follows (min:%B): 0:3, 3:3, 93:35, 103:42, 104:98, 109:98, 110:3, 185:3. The instrument method used a primary mass spectrum (MS1) resolution of 60,000 at full width at half maximum (FWHM) at 400 m/z, an AGC target of 3e5, and a mass range from 300 to 1,500 m/z. Dynamic exclusion was enabled with a repeat count of 3, repeat duration of 10 s, and an exclusion duration of 10 s. Only charge states 2–6 were selected for fragmentation. Secondary ion fragments (MS2s) were generated at top speed for 3 s. HCD was performed on all selected precursor masses with the following parameters: isolation window of 2 m/z, 28%–30% collision energy, Orbitrap (resolution 30,000) detection, and an AGC target of 1e4 ions. ETD was performed when (1) the precursor mass was between 300 and 1,000 m/z, and (2) three of seven glyco-fingerprint ions (126.055, 138.055, 144.07, 168.065, 186.076, 204.086, 274.092, 292.103) were present at ±0.5 m/z and greater than 5% relative intensity. ETD parameters were as follows: calibrated charge-dependent ETD times, 2e5 reagent target, and precursor AGC target 1e4.
Mass Spectra Data Analysis
MS/MS data were analyzed using both Preview and Byonic v2.10.5 software (Protein Metrics). Data were first analyzed in Preview to validate calibration and trypsin/chymotrypsin digestion efficiency. Initial Byonic analyses used a concatenated FASTA file containing the AAV sequences, host proteomes (Spodoptera frugiperda, Homo sapiens, and Autographa californica multiple nucleopolyhedrovirus [baculovirus]), and other likely contaminants and impurities to verify sample purity. Once sample complexity was determined, a second round of Byonic analyses was completed using a targeted FASTA file containing the AAV sequence of interest and likely impurities to identify peptides and potential PTMs. Data were searched at 10–12 ppm mass tolerances for precursors (depending on the run), with 10–12 ppm or 0.1–0.4 Da fragment mass tolerances for HCD and ETD fragmentation, respectively. The algorithm allowed up to two missed cleavages per peptide and semi-specific, N-ragged tryptic digestion. These data were validated at a 1% false discovery rate using standard reverse-decoy techniques.56 The resulting identified peptide spectral matches and assigned proteins were then exported for further analysis and validated using custom tools to provide visualization and statistical characterization. All PTM mass spectra were also manually validated.
De Novo Glycan Identification
Glycopeptide sequences were determined by de novo manual interpretation of HCD and ETD mass spectra. Extracted chromatograms were generated for all MS2s containing 204.0867 ions (±10 ppm), and all spectra containing this ion were analyzed. HCD spectra were used to approximate glycan structures, and ETD spectra were used to define peptide sequence and site-localize modifications. GalNAc and GlcNAc were distinguished by their defined N-acetylhexoseamine (HexNAc) fingerprints, as described previously.57
In Vitro Transduction Analysis by FLuc Quantitation
Five cell types were assessed: immortalized human HEK293T cells (ATCC, catalog. no. CRL-3216), immortalized mouse C2C12 myoblasts (ATCC, catalog no. CRL-1772), immortalized human Huh7 hepatocarcinoma cells (gift of Linda Couto), and primary Hs27 human fetal foreskin fibroblasts (ATCC, catalog no. CRL-1634), all maintained in DMEM (Gibco, catalog. no. 11995065) supplemented with 10% FBS and 1% antibiotic/antimycotic; and primary human iPSCs (Coriell, catalog. no. GM25256) derived from the skin biopsy of a healthy 30-year-old Japanese man, which were maintained with the following modifications from a recent protocol:58 used Matrigel growth factor reduced basement membrane matrix (Corning Life Sciences, catalog no. 354230) in feeder-free media conditions (Gibco Essential 8 Flex media, catalog. no. A2858501) with a seeding density of 35,000/cm2. Accutase (STEMCELL Technologies, catalog. no. 07920) and 10-M RHO/ROCK pathway inhibitor Y-276932 (STEMCELL Technologies, catalog. no. 72304) were used to dissociate iPSCs to single cells during passaging. In all cell types, 1 h prior to transduction at 80% confluency, media were changed, and then 80,000 cells/condition were transduced with ssAAV1-EF1α-FLuc vectors diluted in DPBS at an MOI of 80,000. The following day, media were replaced to remove vectors that did not transduce, and media were not changed again. FLuc levels were measured 3 days post-AAV administration using a Luciferase 1000 assay system (Promega, catalog no. E4550) per the manufacturer’s instructions and read on a Veritas luminometer at 100-μL luciferin injection, 2-s delay, and 10-s measure. Experiments were performed in technical triplicate and corrected for background by subtracting signal from PBS-diluent negative control wells. Figure 4A was generated with GraphPad Prism v8.0.1.
Human Tissue
We obtained anonymized human iPSCs derived from normal controls enrolled in ongoing, Committee on Human Research-approved studies for basic research purposes by the Human Gamete, Embryo, and Stem Cell Research Committee to Dr. Bruce Conklin under the UCSF Institutional Review Board (IRB) no. 10-02521 (“Induced Pluripotent Stem Cells for Genetic Research”). The iPSCs were originally derived from a 30-year-old Japanese man who signed a consent form approving the donation of iPSCs to the public stem cell bank Coriell (catalog no. GM25256).
Mixed Titration/Empty Spike-In Transduction Analyses In Vitro
Human HEK293T cells were maintained as described above. One hour prior to transduction at 80% confluency, media were changed, and 60,000 cells were transduced with varying ratios of full/empty ssAAV1-EF1α-FLuc-WPRE-hGHpA vectors produced by either platform and diluted in DPBS (only cell-purified vector was used). The total capsid content was kept constant at an MOI of 30,000, while the ratio of full/empty varied: 30,000/0 (0% empty), 27,000/3,000 (10% empty), 15,000/15,000 (50% empty), 3,000/27,000 (90% empty), and 0/30,000 (100% empty). The following day, media were replaced to remove non-transducing vectors, and the media were not changed again. FLuc levels were measured 3 days post-AAV administration using a Luciferase 1000 assay system kit (Promega, catalog no. E4550) per the manufacturer’s instructions and read on a Veritas luminometer with the following settings: 100-μL luciferin injection, 2-s delay, and 10-s measure. Experiments were performed in six biological replicates and corrected for background by subtracting the signal from the PBS-diluent negative control wells. For the spike-in experiment, experimental conditions were the same except that the baculoviral full MOI was kept constant at 30,000 and additional empty vector was spiked-in from either platform at 10% (3,000) or 100% (30,000) and assessed as previously described. This experiment was a single biological replicate performed in technical triplicate and corrected for background by subtracting the signal from PBS-diluent negative control wells. Figures 4B–4D were generated with GraphPad Prism v8.0.1.
Live In Vivo Transduction Analysis by FLuc Imaging and Quantitation in Mice
All mice having received either i.v. normodynamic lateral tail vein injections or i.m. (tibialis anterior) injections of 5e10 vg/mouse of ssAAV-EF1α-FLuc (both rAAV8 and rAAV1) were imaged non-invasively every 7 days on a Xenogen IVIS (in vivo imaging system) Spectrum imaging system (Caliper Life Sciences). D-luciferin substrate (Biosynth, catalog no. L-8220) was administered at 120 mg/kg in saline by intraperitoneal injection with a 1-cc insulin syringe. Images were acquired 10 min after luciferin administration under inhalation isoflurane anesthesia. Living Image v4.5 software was used for image analysis, and average radiance was quantified in photons/s/cm2/sr. All mice in each experiment are shown on the same non-individualized radiance scale to enable accurate comparisons of bioluminescent intensity.
Silver Staining
Viral lots were TaqMan qPCR titered (described above), and 1e10 vg worth of vector was mixed with 5 μL of 4× NuPage LDS (lithium dodecyl sulfate) buffer (Invitrogen, catalog no. NP0007) and 2 μL of 10× NuPage reducing agent (Invitrogen, catalog no. NP0009) and boiled for 8 min. Boiled samples were loaded into 4%–12% 1-mm, 12-well Bis-Tris NuPage gels (Invitrogen, catalog no. NP0322PK2) along with 0.5 μL of Benchmark unstained protein ladder (Invitrogen, catalog no. 10747-012) and run in 1× NuPAGE MES (2-(N-morpholino)ethanesulfonic acid) buffer (Invitrogen, catalog no. NP0002) supplemented with 500 μL of NuPage antioxidant (Invitrogen, catalog no. NP0005) at 100 V for 2.8 h. Gels were rinsed in 18 MΩ water, fixed, and stained using a SilverQuest staining kit (Invitrogen, catalog no. LC6070) according to the manufacturer’s instructions.
Luminex Cytokine Assay
Primary human fetal foreskin fibroblasts (ATCC, catalog no. CRL-1634) were maintained in DMEM (Gibco, catalog no. 11995065) supplemented with 10% FBS and 1% antibiotic/antimycotic and seeded at 12,000 per well in a 96-well plate. 24 h following seeding, at 24,000 confirmed cells/well (control wells for each condition were lifted, counted, and verified for viability), media were changed and allowed to sit for 1 h prior to transduction with four rAAV lots of ssAAV8-EF1α-Fluc-WPRE produced by either platform and purified from either cell lysate or media supernatant (as described above) at an MOI of 100,000 in a total final media volume of 200 μL. No further media changes or washes occurred until media harvest at 24 h following rAAV addition. Controls included “media only” wells (no cells or rAAV) and “media + cells” wells (no rAAV) for background normalization calculations. Media were harvested from each well, spun at 350 relative centrifugal force (rcf) for 5 min at 4°C to pellet any cells, and media supernatants were then aliquoted in four protein LoBind polypropylene tubes (Eppendorf, catalog no. 022431081) per well for storage at −80°C until use in the assay. Conditions were set up in biological triplicate and run in technical duplicate. The assay was performed at the Human Immune Monitoring Center at Stanford University. A custom human 63-plex kit was purchased from eBioscience and used according to the manufacturer’s recommendations with modifications described below. Briefly, beads were added to a 96-well plate and washed in a BioTek ELx405 washer. Thawed samples were added to the plate containing the mixed antibody-linked beads and incubated at 25°C for 1 h followed by overnight incubation at 4°C with shaking. Cold and 25°C incubation steps were performed on an orbital shaker at 500–600 rpm. Following the overnight incubation, plates were washed in a BioTek ELx405 washer and then biotinylated detection antibody added for 75 min at 25°C with shaking. The plate was washed as above and streptavidin-phycoerythrin (PE) was added. After incubation for 30 min at 25°C, a wash was performed as above and reading buffer was added to each well. Each sample was measured in duplicate. Plates were read using a Luminex 200 instrument with a lower bound of 50 beads per sample per cytokine. Custom assay control beads by Radix Biosolutions were added to all wells. Data were normalized to controls with no AAV added during production (plain host cells cultured for either 24 or 48 h) and averaged across all biological replicates. Data were visualized in RStudio.
NGS of Packaged rAAV Genomes Using Fast-Seq
Total packaged genomic DNA (gDNA) was extracted from 1e11 full rAAV particles for each lot of rAAV8 and rAAV1 sequenced. AAV libraries were prepared following a recent Tn5 tagmentation-based protocol called Fast-Seq.33 The following indexed adapters compatible with Illumina were obtained from Integrated DNA Technologies: index 1 (i7), CAAGCAGAAGACGGCATACGAGATNNNNNNNNNNNNGTCTCGTGGGCTCGG; index 2 (i5),
AATGATACGGCGACCACCGAGATCTACACNNNNNNNNNNNNTCGTCGGCAGCGTC (where bold type indicates the P5/P7 adaptor sequence, italic type indicates a unique 12-bp barcode index, and underlining indicates the primer for mosaic ends added during tagmentation). Each adaptor contained a 12-nt unique barcode for identifying samples after multiplexing. Indexes used for each sample are listed in Table S20. The resulting library was diluted to 10 pM in 600 μL of HT1 hybridization buffer (Illumina Nextera XT kit, catalog no. FC-131-1024), and 10 μL was loaded onto a 300-cycle MiSeq Nano v2 flow cell (Illumina, catalog no. MS-102-2002) for paired-end 2 × 75-bp sequencing. Resultant reads were de-multiplexed using Illumina’s bcl2fastq v2.19.0.316. Data were returned in a fastq format and filtered using Trimmomatic59 to remove adaptor sequences, low-quality reads (Phred score <30, or length <50 bp), and unmapped and unpaired reads. Trimmed reads were then aligned to the rAAV transfer vector plasmid reference sequence with BWA v0.7.17,60 using the mem algorithm. Alignments were saved as BAM files, which were then used to generate VCF files using the GATK HaplotypeCaller algorithm.61 SNPs and indels identified in VCF files were filtered using the BCFtools filter algorithm, with a 15× depth threshold and a 90% allele fraction requirement. A consensus sequence was generated using the BCFtools consensus algorithm.62 Alignment and fragment distribution statistics were obtained with Picard tools.63 Coverage spanned 100% of the rAAV transfer vector genome reference sequence. Figure S11B was generated with ggplot. Figure S11C was generated with the ViennaRNA web services structure prediction software for DNA available at http://rna.tbi.univie.ac.at.
Capsid Protein Thermal Stability
Purified rAAV capsids of various serotypes produced by each manufacturing platform were compared for their denaturing thermal stability. 10 μL of undiluted rAAV of each lot was loaded into a high-sensitivity capillary (NanoTemper, catalog no. PR-C006) and run on a NanoTemper Prometheus NT.48 nanoDSF instrument. Thermal denaturation programs were run with a 1°C/min thermal ramp from 30°C to 95°C with fluorescent detection at 330 and 350 nm. Data were analyzed with PR.ThermControl software v2.1.5 to determine the temperatures for Tonset and Tm. Figures 1M, S7, and S8K were made with GraphPad Prism v8.0.1.
Negative Staining and TEM Imaging
Grids were glow discharged in the presence of argon gas for 20 s in a Denton Desktop Turbo III vacuum system and then covered with 5–12 μL of rAAV and allowed to adsorb to either a 300-mesh copper or 400-mesh carbon grid with formvar and carbon coatings (catalog no. FCF300-CU) for 3 min. The AAV was washed off by touching the grid, sample side down, to two drops of double-distilled H2O (ddH2O) (18 MΩ). 1–2% uranyl acetate in ddH2O (18-MΩ) was then dripped through a 0.2-μm mesh syringe filter over the grid allowing the third drop to remain on the grid for 1 min before most of the stain was wicked away with filter paper and allowed to dry. Grids were observed on either a JEOL JEM-1400 or JEOL JEM-1230 transmission electron microscope at 120 kV and images were taken using Gatan Orius digital cameras (either 2K-×-2K or 4K-×-4K). Packaging percentage determinations from TEM images were measured in ImageJ v2.0.0-rc-43/1.52b.64
False-Colored Structural Capsid PTM Mapping
The rAAV8 PTMs shown in Figure 1 were mapped onto the AAV8 capsid structure PDB: 3RA817 using Pymol v1.7.6.0. The rAAV1 PTMs shown in Figure S8 were mapped onto the AAV1 capsid structure PDB: 3NG9.65 Exterior capsid views have all chains represented, while internal cross-section views have chains surrounding a cylinder at the 5-fold symmetry axis removed, exposing the interior capsid lumen with fogging cued to enable depth perception of the interior. Online UbPred software (http://www.ubpred.org) was used to predict the possible ubiquitination sites on AAV1 and AAV8 capsid proteins.20 The output of the analysis is the prediction of the lysine residues important for ubiquitination within the indicated AAV serotype capsid sequence. There were 11 lysines on rAAV1 VP3: K258, K459, K491, K493, K508, K528, K533, K545, K567, K666, and K707. There were eight lysines on rAAV8 VP3: K259, K333, K510, K530, K547, K569, K668, and K709.
Cryo-EM Data Acquisition and Image Processing
Purified rAAV8 samples (human-produced empty and full, baculovirus-Sf9-produced empty and full) were applied to lacey carbon Quantifoil grids (EMS, catalog. no. Q225CR-06) and flash-frozen. Images were collected using an FEI Titan Krios G3i cryogenic transmission electron microscope operating at 300 kV with a Gatan K2 Summit detector and Gatan imaging filter (GIF) using a 20-eV slit width. Movies were collected in counting mode with the automated imaging software EPU and a pixel size of 1.07 Å. 30 frames were collected in a 6-s exposure with a per frame dose rate of 2.2 e−/Å2 for a total dose of ∼66 e−/Å2. A total of 1,647 movies were collected for human-produced rAAV8, and 396 movies were collected for baculovirus-Sf9-produced rAAV8. Movies were drift corrected using MotionCor2.66 All image processing was performed using cisTEM.67 No symmetry was applied for initial 2D classification, and icosahedral symmetry was applied for 3D classification and the final reconstruction. The previous crystal structure determined for rAAV8 (PDB: 3RA8) was used as an initial starting point for all models. Residues were manually fit in Coot68 and refined against the cryo-EM maps using Phenix.real_space_refine.69 The four new structures have been deposited in both PDB and EMDB as follows: human HEK293-produced full AAV8 capsid (PDB: 6PWA, EMD: 20502). Human HEK293-produced empty AAV8 capsid (PDB: 6U20, EMD: 20615), baculovirus-Sf9-produced full AAV8 capsid (PDB: 6U2V, EMD: 20626), and baculovirus-Sf9-produced empty AAV8 capsid (PDB: 6UBM, EMD: 20710).
Signal Peptide Determination
Putative ER signal peptides of the AAV VP1 polypeptide were found using LocSigDB software (http://genome.unmc.edu/LocSigDB).31 Only signals within the first 100 aa residues of VP1 from the N-terminal end were assessed.
Gene Ontology Analysis on HCP Impurities
Cumulative HCP impurities from all vector lots analyzed by LC-MS/MS (human impurities observed in human vector preparations; human impurities observed in baculovirus-Sf9 vector preparations; Spodoptera frugiperda impurities observed in baculovirus-Sf9 vector preparations; baculoviral impurities observed in baculovirus-Sf9 vector preparations) were ranked by peptide-spectrum match (PSM). For human impurities, those with a PSM >3 were analyzed by gene ontology functional enrichment analysis using g:Profiler70 version e94_eg41_p11_9f195a1, with g:SCS multiple testing correction method applying a significance threshold of 0.05 and an ordered query to generate Figure S10A. Only impurities from Homo sapiens could be assessed with these bioinformatics tools, as both Spodoptera frugiperda and Autographa californica multiple nucleopolyhedrovirus (NPVAC, baculovirus) were not supported organisms. Gene ontologies for Sf9 and NPVAC were assessed manually using UniProtKB.
WGBS of Packaged rAAV Genomes
Total packaged gDNA was extracted from 1e11 full rAAV particles from two productions of ssAAV8-EF1ɑ-FLuc-WPRE-hGHpA, produced from each manufacturing platform and purified from cell lysates. To first remove unincorporated DNA, ssAAV vector samples were incubated with 40 U of exonuclease (Lucigen, catalog no. E3101K) and 10 U of endonuclease (Lucigen, catalog no. DB0715K) for 30 min at 37°C and then incubated at 65°C for 10 min to stop the digestion. The entire 200-μL reaction was used as input for ssAAV gDNA extraction using a MinElute virus spin kit (QIAGEN, catalog no. 57704). 50 ng of ssDNA from each sample was bisulfite converted using the EZ Methylation-Gold kit (Zymo Research, catalog no. D5005) according to the manufacturer’s protocol. WGBS libraries were generated from the converted samples with the TruSeq DNA methylation kit (Illumina, catalog no. EGMK81312). Consistent fragment size distribution of the resulting libraries was assessed via micro-capillary gel electrophoresis on a 2100 Bioanalyzer (Agilent Technologies, catalog no. G2939BA). Libraries were quantified by Qubit fluorometry (Life Technologies, catalog no. Q33240) and pooled in equimolar ratios. The library pool was quantified via qPCR with a KAPA Library-Quant kit (Kapa Biosystems, catalog no. KK4824). Each sample was split in half and half were sequenced on an Illumina HiSeq 4000 system for single-end 100-bp reads, and half were sequenced on an Illumina NovaSeq 6000 system for paired-end 2 × 150-bp reads. Raw reads were subjected to quality control. Adapters were removed using scythe version c128b19 (https://github.com/vsbuffalo/scythe). Bases with quality <30 were trimmed from the 3′ end of the reads using sickle version 85e5117 (https://github.com/najoshi/sickle). Reads <30 bp in length after trimming were discarded. BS-Seeker271 (version 2.1.5) was used in Bowtie 2 mode to map all reads that passed quality control onto the AAV genome. All cytosine sites that were covered by reads were extracted using the BS-Seeker2 bs_seeker2-call_methylation.py program. Cytosine sites that were not covered by all samples were excluded from further analysis. Differential methylation at each cytosine site was tested using Fisher’s exact test. Multiple test correction was carried out using the Benjamini-Hochberg false discovery rate (FDR)72 approach. Coverage spanned 100% of the rAAV transfer vector genome reference sequence. Figure 3 was generated with Circos73 (version 0.69-9).
Mice
Adult severe combined immunodeficiency (SCID)-BALB/cJ mice (CBySmn.CB17-Prkdcscid/J) of each sex were purchased from The Jackson Laboratory (catalog no. 1803). Fah/Rag2/Il2rgc (FRG) deficient mice of each sex on a C57BL/6J background transplanted with human hepatocytes74 of each sex were a generous gift from Yecuris (http://www.yecuris.com). All mice were housed and maintained in specific pathogen-free barrier facilities. FRG mice were maintained on irradiated high-fat, low-protein mouse chow (Envigo Teklad global 16% protein, catalog no. 2916) ad libitum to decrease flux through the tyrosine pathway. Following human hepatocyte transplantation and prior to AAV administration, FRG mice were maintained on a rotating water schedule as follows: 1 week of 3.25% dextrose (Sigma, catalog no. G6152) water, 1 week of 3.25% dextrose + 100 mg/mL cotrimoxazole (Sigma, catalog no. A2487) water, and 1 week of 3.25% dextrose + 100 mg/mL cotrimoxazole + 8 mg/L nitisinone (Yecuris, catalog no. 20-0026) water. Following AAV administration and for the duration of the imaging studies, humanized FRG mice were maintained on 3.25% dextrose + 100 mg/mL cotrimoxazole + 8 mg/L nitisinone water to ensure no liver turnover during the AAV transduction period. The Institutional Animal Care and Use Committees of UCSF and Stanford University approved all mouse procedures.
Statistical Analysis
Statistical analyses were conducted with Prism v8.0.1 software. Experimental differences for Figures 4A–4D and S1B were evaluated using a one-way ANOVA with Tukey’s correction for multiple comparisons assuming equal variance. Experimental differences for Figures 4E–4H, 5, 6, and S12 were evaluated using a two-way ANOVA with Fisher’s least significant difference (LSD) assuming equal variance. p values <0.05 were considered statistically significant (∗p ≤ 0.05, ∗∗p ≤ 0.01, ∗∗∗p ≤ 0.001, ∗∗∗∗ p ≤ 0.0001).
Author Contributions
N.K.P. conceived the study. N.K.P., N.G.R., S.A.M., and A.P.M. designed experiments. N.K.P., N.G.R., S.A.M., N.P., L.H.M., G.C., C.M.A., R.D.L., D.T., S.S., K.H., P.S., and A.P.M. generated reagents and protocols, performed experiments, and analyzed data. N.K.P. wrote the manuscript. N.G.R., S.A.M., N.P., L.H.M., and N.K.P. generated the figures. All authors reviewed, edited, and approved the manuscript.
Conflicts of Interest
The authors declare no competing interests.
Acknowledgments
The authors wish to acknowledge the following: the generous gift of 16 humanized liver FRG mice from Yecuris Corporation; Lutz Froenicke and Emily Kumimoto of the UC Davis Genome Center for help with methylation sequencing; Jie Lie and Matthew Settles of the UC Davis DNA Technologies Core for help with WGBS data analysis; John Perrino of the Stanford University Cell Sciences Imaging Facility and Tom Moninger of the University of Iowa Central Microscopy Research Facility for help with TEM images; Yael Rosenberg-Hasson of the Stanford Human Immune Monitoring Core for help with the Luminex assays; Parastoo Azadi of the Complex Carbohydrate Research Center at the University of Georgia for glycoproteomic help; the University of Massachusetts for the baculovirus-Sf9 AAV2-hAADC samples; Linda Couto of The Children’s Hospital of Philadelphia for Huh7 cells; Donghui Wang of UCSF for mouse tail vein injections; the University of Iowa Viral Vector Core Facility for collaborative viral productions; and Bruce Conklin of the Gladstone Institute and UCSF for the human iPSCs. The authors also wish to thank the following investigators for initial discussions: Carolyn Bertozzi and Mark Kay of Stanford University and Joseph DeRisi of UCSF. This research was supported by grants to N.K.P. from the NIH (K01-DK107607, U01-HL145795), an American Society of Gene & Cell Therapy Career Development grant, the Sandler Family Foundation, a Stanford University Immunity, Transplantation and Infection Young Investigator Award, and a Vincent Coates Foundation Mass Spectrometry Laboratory seed grant; grants to the University of Iowa Viral Vector Core from the NIH (P01-HL51670), the Center for Gene Therapy of Cystic Fibrosis (P30-DK54759), and the Holden Comprehensive Cancer Center (P30-CA086862); computing resources from Wynton, a UCSF Shared Research Computing Cluster; methylation sequencing was carried out at the DNA Technologies and Expression Analysis Core at the UC Davis Genome Center and supported by a NIH Shared Instrumentation Grant 1S10OD010786-01; and funding from the Chan Zuckerberg Biohub and Cold Spring Harbor Laboratory; N.G.R. and S.A.M. were supported by the NIH (U01-CA207702) and HHMI, and S.A.M. was also supported by an NIH fellowship F32-GM126663; and N.P. was supported by an NIH Microbial Pathogenesis and Host Defense training grant T32-AI060537. The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the funding bodies, or respective universities, institutes, and organizations.
Footnotes
Supplemental Information can be found online at https://doi.org/10.1016/j.omtm.2020.05.018.
Supplemental Information
References
- 1.High K.A., Roncarolo M.G. Gene therapy. N. Engl. J. Med. 2019;381:455–464. doi: 10.1056/NEJMra1706910. [DOI] [PubMed] [Google Scholar]
- 2.Nakai H., Yant S.R., Storm T.A., Fuess S., Meuse L., Kay M.A. Extrachromosomal recombinant adeno-associated virus vector genomes are primarily responsible for stable liver transduction in vivo. J. Virol. 2001;75:6969–6976. doi: 10.1128/JVI.75.15.6969-6976.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Alexander I.E., Russell D.W., Spence A.M., Miller A.D. Effects of gamma irradiation on the transduction of dividing and nondividing cells in brain and muscle of rats by adeno-associated virus vectors. Hum. Gene Ther. 1996;7:841–850. doi: 10.1089/hum.1996.7.7-841. [DOI] [PubMed] [Google Scholar]
- 4.Grimm D., Kern A., Rittner K., Kleinschmidt J.A. Novel tools for production and purification of recombinant adenoassociated virus vectors. Hum. Gene Ther. 1998;9:2745–2760. doi: 10.1089/hum.1998.9.18-2745. [DOI] [PubMed] [Google Scholar]
- 5.Urabe M., Ding C., Kotin R.M. Insect cells as a factory to produce adeno-associated virus type 2 vectors. Hum. Gene Ther. 2002;13:1935–1943. doi: 10.1089/10430340260355347. [DOI] [PubMed] [Google Scholar]
- 6.Smith R.H., Levy J.R., Kotin R.M. A simplified baculovirus-AAV expression vector system coupled with one-step affinity purification yields high-titer rAAV stocks from insect cells. Mol. Ther. 2009;17:1888–1896. doi: 10.1038/mt.2009.128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Aslanidi G., Lamb K., Zolotukhin S. An inducible system for highly efficient production of recombinant adeno-associated virus (rAAV) vectors in insect Sf9 cells. Proc. Natl. Acad. Sci. USA. 2009;106:5059–5064. doi: 10.1073/pnas.0810614106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.BioMarin . 2009. BioMarin highlights key milestones for 2019 at 37th Annual J.P. Morgan Healthcare Conference in San Francisco.https://investors.biomarin.com/2019-01-07-BioMarin-Highlights-Key-Milestones-for-2019-at-37th-Annual-J-P-Morgan-Healthcare-Conference-in-San-Francisco [Google Scholar]
- 9.Pfizer . 2019. Sangamo and Pfizer announce phase 1/2 interim data for investigational hemophilia A gene therapy.https://www.pfizer.com/news/press-release/press-release-detail/sangamo_and_pfizer_announce_phase_1_2_interim_data_for_investigational_hemophilia_a_gene_therapy [Google Scholar]
- 10.Varki A. Cold Spring Harbor Laboratory Press; 2017. Essentials of Glycobiology. [PubMed] [Google Scholar]
- 11.van Ree R. Carbohydrate epitopes and their relevance for the diagnosis and treatment of allergic diseases. Int. Arch. Allergy Immunol. 2002;129:189–197. doi: 10.1159/000066770. [DOI] [PubMed] [Google Scholar]
- 12.European Medicines Agency . 2012. Assessment report: Glybera.https://www.ema.europa.eu/en/documents/assessment-report/glybera-epar-public-assessment-report_en.pdf [Google Scholar]
- 13.Padron E., Bowman V., Kaludov N., Govindasamy L., Levy H., Nick P., McKenna R., Muzyczka N., Chiorini J.A., Baker T.S., Agbandje-McKenna M. Structure of adeno-associated virus type 4. J. Virol. 2005;79:5047–5058. doi: 10.1128/JVI.79.8.5047-5058.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kronenberg S., Böttcher B., von der Lieth C.W., Bleker S., Kleinschmidt J.A. A conformational change in the adeno-associated virus type 2 capsid leads to the exposure of hidden VP1 N termini. J. Virol. 2005;79:5296–5303. doi: 10.1128/JVI.79.9.5296-5303.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sonntag F., Bleker S., Leuchs B., Fischer R., Kleinschmidt J.A. Adeno-associated virus type 2 capsids with externalized VP1/VP2 trafficking domains are generated prior to passage through the cytoplasm and are maintained until uncoating occurs in the nucleus. J. Virol. 2006;80:11040–11054. doi: 10.1128/JVI.01056-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Giles A.R., Sims J.J., Turner K.B., Govindasamy L., Alvira M.R., Lock M., Wilson J.M. Deamidation of amino acids on the surface of adeno-associated virus capsids leads to charge heterogeneity and altered vector function. Mol. Ther. 2018;26:2848–2862. doi: 10.1016/j.ymthe.2018.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Nam H.J., Lane M.D., Padron E., Gurda B., McKenna R., Kohlbrenner E., Aslanidi G., Byrne B., Muzyczka N., Zolotukhin S., Agbandje-McKenna M. Structure of adeno-associated virus serotype 8, a gene therapy vector. J. Virol. 2007;81:12260–12271. doi: 10.1128/JVI.01304-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Akache B., Grimm D., Pandey K., Yant S.R., Xu H., Kay M.A. The 37/67-kilodalton laminin receptor is a receptor for adeno-associated virus serotypes 8, 2, 3, and 9. J. Virol. 2006;80:9831–9836. doi: 10.1128/JVI.00878-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Gurda B.L., Raupp C., Popa-Wagner R., Naumer M., Olson N.H., Ng R., McKenna R., Baker T.S., Kleinschmidt J.A., Agbandje-McKenna M. Mapping a neutralizing epitope onto the capsid of adeno-associated virus serotype 8. J. Virol. 2012;86:7739–7751. doi: 10.1128/JVI.00218-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Radivojac P., Vacic V., Haynes C., Cocklin R.R., Mohan A., Heyen J.W., Goebl M.G., Iakoucheva L.M. Identification, analysis, and prediction of protein ubiquitination sites. Proteins. 2010;78:365–380. doi: 10.1002/prot.22555. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Becerra S.P., Koczot F., Fabisch P., Rose J.A. Synthesis of adeno-associated virus structural proteins requires both alternative mRNA splicing and alternative initiations from a single transcript. J. Virol. 1988;62:2745–2754. doi: 10.1128/jvi.62.8.2745-2754.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Galibert L., Savy A., Dickx Y., Bonnin D., Bertin B., Mushimiyimana I., van Oers M.M., Merten O.W. Origins of truncated supplementary capsid proteins in rAAV8 vectors produced with the baculovirus system. PLoS ONE. 2018;13:e0207414. doi: 10.1371/journal.pone.0207414. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Gaudet D., Méthot J., Déry S., Brisson D., Essiembre C., Tremblay G., Tremblay K., de Wal J., Twisk J., van den Bulk N. Efficacy and long-term safety of alipogene tiparvovec (AAV1-LPLS447X) gene therapy for lipoprotein lipase deficiency: an open-label trial. Gene Ther. 2013;20:361–369. doi: 10.1038/gt.2012.43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Grieger J.C., Soltys S.M., Samulski R.J. Production of recombinant adeno-associated virus vectors using suspension HEK293 cells and continuous harvest of vector from the culture media for GMP FIX and FLT1 clinical vector. Mol. Ther. 2016;24:287–297. doi: 10.1038/mt.2015.187. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Venkatakrishnan B., Yarbrough J., Domsic J., Bennett A., Bothner B., Kozyreva O.G., Samulski R.J., Muzyczka N., McKenna R., Agbandje-McKenna M. Structure and dynamics of adeno-associated virus serotype 1 VP1-unique N-terminal domain and its role in capsid trafficking. J. Virol. 2013;87:4974–4984. doi: 10.1128/JVI.02524-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ui N. Conformational studies on proteins by isoelectric focusing. Ann. N Y Acad. Sci. 1973;209:198–209. doi: 10.1111/j.1749-6632.1973.tb47529.x. [DOI] [PubMed] [Google Scholar]
- 27.Wistuba A., Kern A., Weger S., Grimm D., Kleinschmidt J.A. Subcellular compartmentalization of adeno-associated virus type 2 assembly. J. Virol. 1997;71:1341–1352. doi: 10.1128/jvi.71.2.1341-1352.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tse L.V., Klinc K.A., Madigan V.J., Castellanos Rivera R.M., Wells L.F., Havlik L.P., Smith J.K., Agbandje-McKenna M., Asokan A. Structure-guided evolution of antigenically distinct adeno-associated virus variants for immune evasion. Proc. Natl. Acad. Sci. USA. 2017;114:E4812–E4821. doi: 10.1073/pnas.1704766114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Gurda B.L., DiMattia M.A., Miller E.B., Bennett A., McKenna R., Weichert W.S., Nelson C.D., Chen W.J., Muzyczka N., Olson N.H. Capsid antibodies to different adeno-associated virus serotypes bind common regions. J. Virol. 2013;87:9111–9124. doi: 10.1128/JVI.00622-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Tseng Y.S., Gurda B.L., Chipman P., McKenna R., Afione S., Chiorini J.A., Muzyczka N., Olson N.H., Baker T.S., Kleinschmidt J., Agbandje-McKenna M. Adeno-associated virus serotype 1 (AAV1)- and AAV5-antibody complex structures reveal evolutionary commonalities in parvovirus antigenic reactivity. J. Virol. 2015;89:1794–1808. doi: 10.1128/JVI.02710-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.King B.R., Guda C. ngLOC: an n-gram-based Bayesian method for estimating the subcellular proteomes of eukaryotes. Genome Biol. 2007;8:R68. doi: 10.1186/gb-2007-8-5-r68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Forte G.M., Pool M.R., Stirling C.J. N-terminal acetylation inhibits protein targeting to the endoplasmic reticulum. PLoS Biol. 2011;9:e1001073. doi: 10.1371/journal.pbio.1001073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Maynard L.H., Smith O., Tilmans N.P., Tham E., Hosseinzadeh S., Tan W., Leenay R., May A.P., Paulk N.K. Fast-Seq: a simple method for rapid and inexpensive validation of packaged single-stranded adeno-associated viral genomes in academic settings. Hum. Gene Ther. Methods. 2019;30:195–205. doi: 10.1089/hgtb.2019.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Deal C.E., Balazs A.B. Vectored antibody gene delivery for the prevention or treatment of HIV infection. Curr. Opin. HIV AIDS. 2015;10:190–197. doi: 10.1097/COH.0000000000000145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Paulk N.K., Pekrun K., Charville G.W., Maguire-Nguyen K., Wosczyna M.N., Xu J., Zhang Y., Lisowski L., Yoo B., Vilches-Moure J.G. Bioengineered viral platform for intramuscular passive vaccine delivery to human skeletal muscle. Mol. Ther. Methods Clin. Dev. 2018;10:144–155. doi: 10.1016/j.omtm.2018.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Vercauteren K., Hoffman B.E., Zolotukhin I., Keeler G.D., Xiao J.W., Basner-Tschakarjan E., High K.A., Ertl H.C., Rice C.M., Srivastava A. Superior in vivo transduction of human hepatocytes using engineered AAV3 capsid. Mol. Ther. 2016;24:1042–1049. doi: 10.1038/mt.2016.61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Paulk N.K., Pekrun K., Zhu E., Nygaard S., Li B., Xu J., Chu K., Leborgne C., Dane A.P., Haft A. Bioengineered AAV capsids with combined high human liver transduction in vivo and unique humoral seroreactivity. Mol. Ther. 2018;26:289–303. doi: 10.1016/j.ymthe.2017.09.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Liu Q., Huang W., Zhang H., Wang Y., Zhao J., Song A., Xie H., Zhao C., Gao D., Wang Y. Neutralizing antibodies against AAV2, AAV5 and AAV8 in healthy and HIV-1-infected subjects in China: implications for gene therapy using AAV vectors. Gene Ther. 2014;21:732–738. doi: 10.1038/gt.2014.47. [DOI] [PubMed] [Google Scholar]
- 39.Davidoff A.M., Ng C.Y., Zhou J., Spence Y., Nathwani A.C. Sex significantly influences transduction of murine liver by recombinant adeno-associated viral vectors through an androgen-dependent pathway. Blood. 2003;102:480–488. doi: 10.1182/blood-2002-09-2889. [DOI] [PubMed] [Google Scholar]
- 40.Nathwani A.C., Tuddenham E.G., Rangarajan S., Rosales C., McIntosh J., Linch D.C., Chowdary P., Riddell A., Pie A.J., Harrington C. Adenovirus-associated virus vector-mediated gene transfer in hemophilia B. N. Engl. J. Med. 2011;365:2357–2365. doi: 10.1056/NEJMoa1108046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Nathwani A.C., Reiss U.M., Tuddenham E.G., Rosales C., Chowdary P., McIntosh J., Della Peruta M., Lheriteau E., Patel N., Raj D. Long-term safety and efficacy of factor IX gene therapy in hemophilia B. N. Engl. J. Med. 2014;371:1994–2004. doi: 10.1056/NEJMoa1407309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lisowski L., Dane A.P., Chu K., Zhang Y., Cunningham S.C., Wilson E.M., Nygaard S., Grompe M., Alexander I.E., Kay M.A. Selection and evaluation of clinically relevant AAV variants in a xenograft liver model. Nature. 2014;506:382–386. doi: 10.1038/nature12875. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.U.S. Food and Drug Administration. Vaccines, Blood & Biologics. https://www.fda.gov/BiologicsBloodVaccines/default.htm.
- 44.Walsh G. Biopharmaceutical benchmarks 2014. Nat. Biotechnol. 2014;32:992–1000. doi: 10.1038/nbt.3040. [DOI] [PubMed] [Google Scholar]
- 45.Kondratov O., Marsic D., Crosson S.M., Mendez-Gomez H.R., Moskalenko O., Mietzsch M., Heilbronn R., Allison J.R., Green K.B., Agbandje-McKenna M., Zolotukhin S. Direct head-to-head evaluation of recombinant adeno-associated viral vectors manufactured in human versus insect cells. Mol. Ther. 2017;25:2661–2675. doi: 10.1016/j.ymthe.2017.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Mietzsch M., Grasse S., Zurawski C., Weger S., Bennett A., Agbandje-McKenna M., Muzyczka N., Zolotukhin S., Heilbronn R. OneBac: platform for scalable and high-titer production of adeno-associated virus serotype 1–12 vectors for gene therapy. Hum. Gene Ther. 2014;25:212–222. doi: 10.1089/hum.2013.184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Ghaderi D., Zhang M., Hurtado-Ziola N., Varki A. Production platforms for biotherapeutic glycoproteins. Occurrence, impact, and challenges of non-human sialylation. Biotechnol. Genet. Eng. Rev. 2012;28:147–175. doi: 10.5661/bger-28-147. [DOI] [PubMed] [Google Scholar]
- 48.Murray S., Nilsson C.L., Hare J.T., Emmett M.R., Korostelev A., Ongley H., Marshall A.G., Chapman M.S. Characterization of the capsid protein glycosylation of adeno-associated virus type 2 by high-resolution mass spectrometry. J. Virol. 2006;80:6171–6176. doi: 10.1128/JVI.02417-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Dong B., Duan X., Chow H.Y., Chen L., Lu H., Wu W., Hauck B., Wright F., Kapranov P., Xiao W. Proteomics analysis of co-purifying cellular proteins associated with rAAV vectors. PLoS ONE. 2014;9:e86453. doi: 10.1371/journal.pone.0086453. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Jones P.A. Functions of DNA methylation: islands, start sites, gene bodies and beyond. Nat. Rev. Genet. 2012;13:484–492. doi: 10.1038/nrg3230. [DOI] [PubMed] [Google Scholar]
- 51.Schübeler D. Function and information content of DNA methylation. Nature. 2015;517:321–326. doi: 10.1038/nature14192. [DOI] [PubMed] [Google Scholar]
- 52.Feng S., Cokus S.J., Zhang X., Chen P.Y., Bostick M., Goll M.G., Hetzel J., Jain J., Strauss S.H., Halpern M.E. Conservation and divergence of methylation patterning in plants and animals. Proc. Natl. Acad. Sci. USA. 2010;107:8689–8694. doi: 10.1073/pnas.1002720107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Zemach A., McDaniel I.E., Silva P., Zilberman D. Genome-wide evolutionary analysis of eukaryotic DNA methylation. Science. 2010;328:916–919. doi: 10.1126/science.1186366. [DOI] [PubMed] [Google Scholar]
- 54.Benskey M.J., Sandoval I.M., Manfredsson F.P. Continuous collection of adeno-associated virus from producer cell medium significantly increases total viral yield. Hum. Gene Ther. Methods. 2016;27:32–45. doi: 10.1089/hgtb.2015.117. [DOI] [PubMed] [Google Scholar]
- 55.Rodrigues A.F., Carrondo M.J., Alves P.M., Coroadinha A.S. Cellular targets for improved manufacturing of virus-based biopharmaceuticals in animal cells. Trends Biotechnol. 2014;32:602–607. doi: 10.1016/j.tibtech.2014.09.010. [DOI] [PubMed] [Google Scholar]
- 56.Elias J.E., Gygi S.P. Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods. 2007;4:207–214. doi: 10.1038/nmeth1019. [DOI] [PubMed] [Google Scholar]
- 57.Malaker S.A., Penny S.A., Steadman L.G., Myers P.T., Loke J.C., Raghavan M., Bai D.L., Shabanowitz J., Hunt D.F., Cobbold M. Identification of glycopeptides as posttranslationally modified neoantigens in leukemia. Cancer Immunol. Res. 2017;5:376–384. doi: 10.1158/2326-6066.CIR-16-0280. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Libby A.R., Joy D.A., So P.L., Mandegar M.A., Muncie J.M., Mendoza-Camacho F.N., Weaver V.M., Conklin B.R., McDevitt T.C. Spatiotemporal mosaic self-patterning of pluripotent stem cells using CRISPR interference. eLife. 2018;7:e36045. doi: 10.7554/eLife.36045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Bolger A.M., Lohse M., Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Li H., Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.McKenna A., Hanna M., Banks E., Sivachenko A., Cibulskis K., Kernytsky A., Garimella K., Altshuler D., Gabriel S., Daly M., DePristo M.A. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–1303. doi: 10.1101/gr.107524.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Danecek P., McCarthy S.A. BCFtools/csq: haplotype-aware variant consequences. Bioinformatics. 2017;33:2037–2039. doi: 10.1093/bioinformatics/btx100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Broad Institute. Picard tools. https://broadinstitute.github.io/picard/.
- 64.Rueden C.T., Schindelin J., Hiner M.C., DeZonia B.E., Walter A.E., Arena E.T., Eliceiri K.W. ImageJ2: ImageJ for the next generation of scientific image data. BMC Bioinformatics. 2017;18:529. doi: 10.1186/s12859-017-1934-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Miller E.B., Gurda-Whitaker B., Govindasamy L., McKenna R., Zolotukhin S., Muzyczka N., Agbandje-McKenna M. Production, purification and preliminary X-ray crystallographic studies of adeno-associated virus serotype 1. Acta Crystallogr. Sect. F Struct. Biol. Cryst. Commun. 2006;62:1271–1274. doi: 10.1107/S1744309106048184. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Zheng S., Palovcak E., Armache J.-P., Cheng Y., Agard D. Anisotropic correction of beam-induced motion for improved single-particle electron cryo-microscopy. bioRxiv. 2016 doi: 10.1101/061960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Grant T., Rohou A., Grigorieff N. cisTEM, user-friendly software for single-particle image processing. eLife. 2018;7:e35383. doi: 10.7554/eLife.35383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Emsley P., Lohkamp B., Scott W.G., Cowtan K. Features and development of Coot. Acta Crystallogr. D Biol. Crystallogr. 2010;66:486–501. doi: 10.1107/S0907444910007493. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Afonine P.V., Headd J.J., Terwilliger T.C., Adams P.D. Phenix.real_space_refine. Computat. Crystallogr. Newsl. 2013;4:43–44. https://www.phenix-online.org/newsletter/CCN_2013_07.pdf [Google Scholar]
- 70.Reimand J., Arak T., Adler P., Kolberg L., Reisberg S., Peterson H., Vilo J. g:Profiler-a web server for functional interpretation of gene lists (2016 update) Nucleic Acids Res. 2016;44(W1):W83–W89. doi: 10.1093/nar/gkw199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Guo W., Fiziev P., Yan W., Cokus S., Sun X., Zhang M.Q., Chen P.Y., Pellegrini M. BS-Seeker2: a versatile aligning pipeline for bisulfite sequencing data. BMC Genomics. 2013;14:774. doi: 10.1186/1471-2164-14-774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Benjamini Y., Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B Methodol. 1995;57:289–300. [Google Scholar]
- 73.Krzywinski M., Schein J., Birol I., Connors J., Gascoyne R., Horsman D., Jones S.J., Marra M.A. Circos: an information aesthetic for comparative genomics. Genome Res. 2009;19:1639–1645. doi: 10.1101/gr.092759.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Azuma H., Paulk N., Ranade A., Dorrell C., Al-Dhalimy M., Ellis E., Strom S., Kay M.A., Finegold M., Grompe M. Robust expansion of human hepatocytes in Fah−/−/Rag2−/−/Il2rg−/− mice. Nat. Biotechnol. 2007;25:903–910. doi: 10.1038/nbt1326. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.