Skip to main content
PLOS One logoLink to PLOS One
. 2023 Dec 14;18(12):e0293277. doi: 10.1371/journal.pone.0293277

A novel method for quantitation of AAV genome integrity using duplex digital PCR

Lauren Tereshko 1,*,#, Xiaohui Zhao 1,¤,#, Jake Gagnon 2, Tinchi Lin 3, Trevor Ewald 1, Yu Wang 1, Marina Feschenko 1,¤, Cullen Mason 1
Editor: Simone Agostini4
PMCID: PMC10721069  PMID: 38096204

Abstract

Recombinant adeno-associated virus (rAAV) vectors have become a reliable strategy for delivering gene therapies. As rAAV capsid content is known to be heterogeneous, methods for rAAV characterization are critical for assessing the efficacy and safety of drug products. Multiplex digital PCR (dPCR) has emerged as a popular molecular approach for characterizing capsid content due to its high level of throughput, accuracy, and replicability. Despite growing popularity, tools to accurately analyze multiplexed data are scarce. Here, we introduce a novel statistical model to estimate genome integrity from duplex dPCR assays. This work demonstrates that use of a Poisson-multinomial mixture distribution significantly improves the accuracy and quantifiable range of duplex dPCR assays over currently available models.

Introduction

rAAV production processes result in heterogeneous populations, where in addition to the expected encapsidated full genomes, capsids may be empty, partial, or over-packaged [1, 2]. Unwanted residual DNA from plasmids or host cells involved in the production process may also be encapsidated [1, 3, 4]. Current purification processes can efficiently separate empty from full capsids, however analytical methods that rely on separating capsid populations based on density (AUC, Cryo-EM, TEM), mass-to-charge ratios (CDMS), absorbance characteristics (A260/A280), or mass (SEC-MALS), vary in their abilities to accurately detect partially filled capsids and cannot differentiate between aberrant versus intended genomes [16]. It is therefore critical to develop assays to thoroughly characterize rAAV encapsidated content and genome integrity. Several next generation sequencing (NGS) platforms have been effectively used to characterize rAAV capsid content; however, use of these methods for process development is limited due to high material requirements, high cost, and the need for complex data analysis. Furthermore, NGS methods have limitations for use with non-purified samples [4, 711]. Single and multiplexed digital PCR (dPCR) assays have emerged as effective quantitative tools for characterizing encapsidated DNA given their high level of accuracy, low sample requirements and high-throughput capabilities [1214]. dPCR assays are therefore advantageous for testing drug product and intermediate samples, and for supporting process development of early-stage gene therapy products.

dPCR technologies employ microfluidics to partition template DNA into thousands of independent amplification reactions that are expected to contain zero or one template molecule [15]. End-point fluorescence reactions result in a binary output of partitions that are either negative or positive for template. In singleplex reactions, Poisson statistics can be used to correct for the possibility of multiple templates being partitioned together and can accurately estimate the absolute concentration of template DNA [16, 17]. Reactions can be easily duplexed to assess the integrity of individual genes, or rAAV genomes by disrupting capsids prior to template partitioning, and concurrently amplifying targets at the 5’ and 3’ ends of the gene(s) with fluorescent probes of different wavelengths [18, 19]. With this strategy, DNA templates containing both targets are considered intact, while templates containing only one target are considered partial. Unlike singleplex reactions, Poisson statistics are not suitable to model the three-category positive data resulting from duplex reactions (5’ target, 3’ target, or double-positive partitions) [16].

Currently there is a divide between the current technological capability to multiplex assays and the ability to accurately analyze data from such experiments. To bridge the gap, there is a pressing unmet need to develop statistical models that can accurately quantitate end-point fluorescence data from duplex and higher-order multiplexed reactions. Here, we propose a novel Poisson-multinomial model for accurate quantitation of gene and rAAV genome integrity from duplex droplet dPCR (ddPCR) reactions using primers and probes targeting regions near the inverted terminal repeats (ITR) of the viral genomes. We compare the accuracy and precision of the model to contemporary statistical models for both plasmid and AAV samples by analyzing duplexed ddPCR data from samples across a range of genome integrities and concentrations. We demonstrate the model expands the dynamic range and improves accuracy and precision of genome integrity estimates compared to simpler models. Finally, we show that integrity values calculated with the Poisson-multinomial model have higher accuracy for both simulated and heat-fragmented rAAV material. These findings establish use of the multinomial Poisson model as a robust approach for analyzing duplex ddPCR data in support of characterizing critical quality attributes of rAAV therapies.

Results

Comparison of statistical models for simulated genome integrity samples using plasmid DNA

The accuracy of four analytical models was compared by using mixtures of digested viral vector plasmid (pAAV) to simulate variable degrees of genome integrity. Duplexed primer/probe sets differentially labeled with FAM and VIC dyes were designed to target the CMV enhancer (CMV) and polyadenylation (polyA) regions of the viral genome, which border the 5’- and 3’-ITRs respectively. Intact genomes were simulated by restriction enzyme digestion with MfeI, which cuts pAAV outside of the viral genome sequence. As the samples retained both target sites on a single intact template, they were assigned an expected integrity of 100%. Fragmented genomes were simulated by double-digestion of pAAV with MfeI and NheI, which bisects the viral genome sequence. As the fragmented samples contain only one target (either CMV or polyA), and zero intact templates, they were assigned an expected integrity of 0% (Fig 1 and S1 Fig). By titrating varying ratios of intact and fragmented genomes, samples were produced with either 0, 8, 18, 29, 43, 60, 82 or 100% expected integrities. The samples were then diluted over 12 points to cover the dynamic range of the Bio-Rad QX200 ddPCR system (~1–5000 copies/μL) and duplex ddPCR was performed to assess genome integrity [17, 20]. For all experiments, pre-defined acceptance criteria were used to evaluate the suitability of the models. Relative standard deviation (RSD) of all concentrations tested for a particular sample must be <20%, and recoveries must be between 80–120% of the theoretical value.

Fig 1. Diagram of pAAV sequence.

Fig 1

Restriction sites are indicated by enzyme name. CMV and polyA primer/probe sets are indicated by fluorophore illustrations (blue and green respectively).

Calculation of pAAV genome integrity by simple percentage formula

Bio-Rad’s proprietary ddPCR system outputs the number of droplets in each of four categories (double-positive, single-positive target 1, single-positive target 2, and empty droplets) and uses a Poisson distribution-based model to calculate the concentration of DNA template molecules in copies per microliter (λ), where p = the fraction of positive droplets in total droplets, and V is the average droplet volume (0.85 nL) (Formula 1, Bio-Rad, personal communication):

λ=ln(1p)/V (1)

Variations of Formula 1 have been proposed by previous studies to calculate genome integrity as the percentage of double-positive droplets out of total positive droplets in terms of either droplet number or concentration (Formula 2) [18, 21].

%Integrity=NumberofdoublepositivedropletsNumberoftotalpositivedropletsx100 (2)

Using a Poisson distribution, it can be determined that when samples are highly dilute (≤150 copies/μL), most positive droplets contain a single DNA template (probability = 0.94), and thereby most double-positive droplets will contain true intact templates as compared to the chance co-localization of 5’ and 3’ targets. As theorized, use of Formula 2 to calculate the genome integrity of the simulated plasmid samples was accurate over only a small portion of the theoretical dynamic range of the QX200 system. Over the tested concentration range (8–5000 copies/μL), for samples with less than 100% expected integrity, the accuracy of calculated genome integrity declined as sample concentration increased, and as expected integrity decreased (Fig 2 and Table 1).

Fig 2. Genome integrity of simulated pAAV samples calculated by Formula 2.

Fig 2

Genome integrity values calculated by Formula 2 are plotted as single data points (n = 1), connecting lines depict average of experimental replicates (N = 2). Dashed lines depict the expected integrity values for the samples.

Table 1. Genome integrity and recovery of simulated pAAV samples calculated by Formula 2.

Expected genome integrity
Copies/μL 0% 8% 18% 29% 43% 60% 82% 100%
8 NC 110.2 95.5 83.6 91.5 98.5 93.6 93.0
16 NC 104.3 106.7 98.0 90.0 91.9 95.8 97.2
31 NC 120.8 107.6 100.7 101.1 98.3 96.4 97.0
63 NC 128.6 108.3 101.3 97.3 96.4 96.7 97.2
125 NC 154.3 115.9 106.9 103.9 101.8 98.5 97.4
250 NC 224.5 145.2 122.5 109.5 104.7 101.0 97.9
500 NC 336.6 189.3 149.3 123.8 110.8 101.7 97.9
1000 NC 564.1 277.0 194.2 149.1 123.8 106.2 98.3
2000 NC 898.7 407.4 263.9 188.0 142.7 113.0 99.0
3000 NC 1096.6 486.0 306.7 210.8 155.0 117.1 99.3
4000 NC 1188.6 524.7 329.2 222.4 160.8 119.2 99.6
5000 NC 1224.9 544.3 337.9 228.1 164.0 120.7 99.7

Shaded cells indicate recoveries outside of the acceptable range (80–120%). Recoveries of 0% samples were not calculated (NC).

Calculation of pAAV genome integrity by physical linkage models

Recently, a formula for genome integrity utilizing the calculation of percent “linkage” has been suggested, where genes contained on the same template are considered linked, and genes that are physically separated are considered unlinked [22]. Linkage is defined as the number of double-positive droplets in excess of what is expected due to chance co-localization of two unlinked targets [23]. The calculation of linked target concentration is included in Bio-Rad’s QuantaSoft raw data file output in terms of copies/μL.

In instances where the concentration of each target is similar, Regan et al suggest that percent linkage (i.e., genome integrity) can be calculated by dividing the linkage concentration by the average concentrations of the 5’ and 3’ target (CMV and polyA respectively, Formula 3) [23]. The authors note that if the concentrations of the two targets are unequal due to amplification bias resulting from experimental conditions (method-induced genome fragmentation, differences in genome accessibility, or differences in amplicon size) that a compensated version of the equation can be used, which involves adjustment of the linkage value by addition of the absolute difference in concentrations of the 5’ and 3’ target, followed by division by the maximum value of either the concentration the 5’ or the 3’ target (Formula 4) [22].

%Linkageaverage=[linkage]([CMV]+[polyA])/2x100 (3)
%Linkagecompensated=([linkage]+abs([CMV][polyA]))max([CMV]or[polyA])x100 (4)

When either Formula 3 or Formula 4 was used to calculate the percent genome integrity of the plasmid samples, the results were relatively stable across the tested concentration range (8–5000 copies/μL), as demonstrated by the assessment of the RSD (≤17.1%, ≤27.4% respectively, Table 2). Of note, for both models, the calculated integrity values were significantly more variable as the expected integrity of the samples decreased. Additionally, the integrity values for all fragmented genome samples were overestimated as demonstrated by the calculated recoveries relative to the expected values, and the overestimations became more pronounced as expected integrity decreased (Fig 3A and 3B and Table 2). In this data set, the amplification of the 5’ and 3’ targets are even, and the calculated values from Formula 3 and Formula 4 are therefore expected to be similar. In contrast, Formula 3 was more accurate and precise than Formula 4 across the full range of integrity (Formula 3 accuracy 136.7%, intermediate precision 21.6%, vs Formula 4 accuracy 146.4%, intermediate precision 28.4%) (Table 2). The lack of accuracy and precision of Formula 3 for samples with lower integrity would require shortening the linear range of the method for accurate quantitation (Fig 3A, 3B and 3D).

Table 2. Genome integrity and recovery of simulated pAAV samples calculated by linkage models (Formulas 3 and 4).

Percent Linkageavg (Formula 3) Percent Linkagecomp (Formula 4)
Sample # Expected integrity (%) Average calculated integrity (%) RSD (%) Recovery (%) Average calculated integrity (%) RSD (%) Recovery (%)
1 100 98.3 0.8 98.3 99.1 0.5 99.1
2 82 88.3 1.0 107.7 89.3 1.1 108.8
3 60 73.4 2.7 122.3 74.9 2.6 124.9
4 43 58.5 3.3 136.1 61.4 5.4 142.8
5 29 43.8 6.2 151.1 46.1 6.1 159
6 18 29.0 14.2 161.1 30.6 13.5 169.9
7 8 14.4 17.1 180.1 17.6 27.4 220.6
8 0 0.8 NC NC 3.9 NC NC
Overall accuracy 136.7 146.4
Intermediate precision 21.6 28.4

Each sample was tested at twelve dilutions and the results were then averaged to produce a calculated integrity value and percent RSD. Shaded cells indicate recoveries outside of the acceptable range (80–120%) or RSD >20%. RSD and recovery not calculated for 0% expected integrity (NC).

Fig 3. Genome integrity of simulated pAAV samples calculated by linkage and Poisson-multinomial models.

Fig 3

A), average percent linkage (Formula 3), B), compensated percent linkage (Formula 4). C), Poisson-multinomial (Formula 7). Calculated percent integrity values are plotted as single data points (n = 1), connecting lines depict average of experimental replicates (N = 2). Dashed lines depict the expected values for the samples. D), Linearity of models. Average calculated percent integrity values from sample dilutions are plotted as points (n = 12, N = 2).

Calculation of pAAV genome integrity by Poisson-multinomial model

As an alternative method for genome integrity calculation, we developed a more robust statistical model that utilizes a Poisson-multinomial mixture distribution. In the model below, three categories of positive droplets are considered, with k molecules in a droplet, as a multinomial model with three species containing 1.) 5’ target only, 2.) 3’ target only, and 3.) both 5’ and 3’ targets (double-positive). To model the probability of k molecules within a droplet, we can utilize the binomial distribution, p(k) with k successes and m trials, where m is the total number of molecules [24]. However, when the number of droplets is large, the binomial distribution can be approximated by the Poisson distribution with parameter, lambda.

This Poisson-multinomial model has two unknown parameters, pCMV and ppolyA. To estimate these parameters, the measured numbers of CMV single-positive droplets, polyA single-positive droplets, and the total number of droplets are needed. The upper limit of the dynamic range of the QX200 ddPCR plate reader is 5000 copies/μL. Given that the average droplet volume is 0.85 nL, this upper limit is equal to 4.25 copies/droplet. It can be further determined by the Poisson distribution that the probability of more than 20 copies of template molecules being present in a droplet is negligible (probability = 5.4E−9). It is therefore numerically sufficient to model the situation where the average number of template molecules within a droplet is ≤ 20 copies.

The calculated number of CMV single positive-droplets across all droplets can be derived from the Poisson-multinomial model where k = the number of template molecules within a droplet, and D is the total number of droplets (Formula 5).

(k=120p(k)×pCMVk)×D=CMVsinglepositivedroplets (5)

Similarly, the calculated number of polyA single-positive droplets across all droplets can be derived as Formula 6.

(k=120p(k)×ppolyAk)×D=polyAsinglepositivedroplets (6)

Solving the 20th degree polynomial gives us the unknown quantities pCMV and ppolyA. Genome integrity can then be calculated as the expected (E) number of full genomes across all droplets divided by the expected number of total DNA template molecules across total droplets:

E[Fullgenome]/E[TotalDNAtemplate]

where the numerator quantity is derived from the double expectation rule:

E[E[Fullgenomegivenkmoleculesinadroplet]]=(1pCMVppolyA)×λ×D

and the denominator is given by:

λ×D

With these expressions, Formula 7 can be used to calculate the percentage of templates that are fully intact:

%Integrity=(1pCMVppolyA)×100% (7)

When the Poisson-multinomial model (Formula 7) was used to calculate the percent genome integrity of the plasmid mock samples, all values were consistent (RSD ≤18%) and aligned with the expected values (recoveries between 94.3–96.9%) across the tested concentration range (8–5000 copies/μL) (Fig 3C and Table 3). The overall accuracy of the Poisson-multinomial distribution model was 96.3% and the accuracy was consistent across the full range of intact genomes (0–100%). Although the calculated integrity values were significantly more variable as the integrity of the simulated plasmid samples decreased, the Poisson-multinomial model was more precise than linkage models (Poisson-multinomial intermediate precision <1%, vs >21% and 28% for linkage models), as calculated by the RSD of all the recoveries (Tables 2 and 3). Plotting the sample integrity versus the calculated integrity values for each model shows that the Poisson-multinomial model is more linear than linkage models in the range of 8–100% when the data were fit using generalized least squares (GLS) to account for correlations among experimental replicates (Fig 3D, S2 Fig and S1 Table).

Table 3. Genome integrity and recovery of simulated pAAV samples calculated by the Poisson-multinomial distribution model.

Sample # Expected Integrity (%) Average Calculated integrity (%) RSD (%) Recovery (%)
1 100 96.7 1.4 96.7
2 82 79.1 1.7 96.4
3 60 58.0 4.1 96.6
4 43 41.4 4.8 96.2
5 29 28.0 8.2 96.7
6 18 17.0 15.8 94.3
7 8 7.8 18.0 96.9
8 0 -0.1 NC NC 
Overall accuracy 96.3
Intermediate precision 0.9%

Each sample was tested at twelve dilutions and the results were then averaged to produce a calculated integrity value and percent RSD. There were no recoveries outside of the acceptable range (80–120%) or RSD >20%. RSD. Percent recovery and RSD not calculated for 0% expected integrity (NC).

Comparison of linkage vs Poisson-multinomial models for simulated integrity samples using rAAV material

To evaluate the Poisson-multinomial model for genome integrity empirically with rAAV material, fragmented rAAV genomes were simulated by combining two samples (AAV-GFP and AAV-BIIB) containing common promoter sequences (CMV) but differing polyA sequences. Primer/probe sets were designed to target the shared enhancer regions of both samples, and the polyA sequence of AAV-GFP (polyA-1, Fig 4). Since AAV-GFP is a heterogenous mixture containing some partial genomes, it was first assayed with both primer/probe sets in a duplex reaction to estimate genome integrity from dilutions <250 copies/μL using Formula 2. The calculated integrity of AAV-GFP (84%) was then set as the maximal expected integrity value. When assayed with the same primer/probe sets, AAV-BIIB is expected to have no amplification with the polyA-1 targeting primer/probe and is considered 0% intact. In this manner, six mock integrity samples were prepared by mixing AAV-GFP and AAV-BIIB in varying concentration ratios to produce samples with either 0, 5, 10, 21, 42, 63 or 84% of genomes expected to contain both ddPCR targets.

Fig 4. Diagram of rAAV sequences.

Fig 4

AAV-GFP and AAV-BIIB genomes share a common promoter (CMV) but different polyA sequences. 5’ (green) and 3’ (blue) primer/probe sets are indicated by fluorophore illustrations.

It is possible that rAAV samples may contain both targets within a single capsid without the targets being physically linked. To truly assess genome integrity as opposed to double-positive capsids, the samples must be decapsidated prior to droplet generation. Decapsidation was performed by alkaline lysis and a short incubation at low heat (10 minutes, 60°C) to minimize hydrolysis of the phosphate DNA backbone [22, 2531]. Duplexed ddPCR assays were performed on the rAAV samples as described for pAAV samples, but tested over four dilutions that spanned a narrower range of concentrations (62–498 copies/μL).

The genome integrity of each sample was calculated using either average linkage percentage (Formula 3) or the Poisson-multinomial model (Formula 7). Comparison with the compensated linkage percentage (Formula 4) was not included as the concentrations of CMV and polyA-1 are expected to be uneven due to the experimental design. Both Formula 3 and Formula 7 yielded consistent results across the range of concentrations tested, as evidenced by the flat lines for each simulated integrity sample (Fig 5A and 5B). The integrity values calculated by Formula 3, however, were consistently overestimated compared to those produced by the Poisson-multinomial model. (Fig 5A and 5B).

Fig 5. Comparison of calculated genome integrities of simulated rAAV samples.

Fig 5

Percent integrity was calculated with A.) average percent linkage model (Formula 3) or B.) Poisson-multinomial model (Formula 7). Calculated percent integrity values are plotted as single data points (n = 1), connecting lines depict average of experimental replicates (N = 6). Dashed lines depict the expected values for the samples.

To compare the accuracy and precision of the models, the percent recoveries of calculated genome integrities were evaluated in six independent experiments. When integrity was calculated using the average percent linkage model, all sample replicates showed acceptable precision (RSD ≤8.5%), however the majority of samples below 84% expected integrity over-recovered (>120%) (Table 4). Consistent with what was observed with the simulated plasmid samples, over-estimation of integrity worsened as the expected integrity of the simulated rAAV samples decreased (Tables 2 and 4). In comparison, when the Poisson-multinomial model was used, all sample replicates demonstrated acceptable recovery (86.6–117.8%) and precision (RSD<9%), as summarized in Table 4., The Poisson-multinomial model showed better overall accuracy and intermediate precision when compared with the average percent linkage model (accuracy: 98.1% vs 149.6%, intermediate precision: 6.5% vs 21.8% respectively).

Table 4. Comparison of average genome integrities and percent recoveries calculated via Formula 3 and the Poisson-multinomial model for simulated rAAV samples.

Percent Linkageavg Poisson-multinomial
Sample # Expected Integrity (%) Average replicate integrity (%) RSD (%) Recovery (%) Average replicate integrity (%) RSD (%) Recovery (%)
1 84 93.4 0.6 111.1 87.6 1.1 104.3
92.3 0.8 109.9 85.5 1.3 101.8
92.2 0.6 109.7 85.6 1.1 101.9
92.5 0.5 110.1 86.2 0.9 102.7
92.7 0.4 110.3 86.3 0.8 102.7
92.4 0.4 110.0 85.8 0.8 102.1
2 63 78.9 0.7 125.2 65.1 1.2 103.4
76.3 0.3 121.1 61.6 0.7 97.8
75.0 1.4 119.0 60.0 2.2 95.3
75.0 2.2 119.0 60.0 3.4 95.2
75.2 1.1 119.4 60.2 1.8 95.6
75.5 1.1 119.9 60.7 1.9 96.4
3 42 60.7 1.1 144.6 43.6 1.4 103.9
56.0 4.2 133.4 38.9 5.8 92.5
55.6 1.7 132.4 38.5 2.2 91.6
55.5 2.0 132.2 38.4 2.5 91.4
56.2 1.5 133.8 39.0 2.1 92.9
55.8 1.9 132.9 38.8 2.8 92.3
4 21 34.8 0.7 165.9 21.1 0.8 100.5
31.6 5.1 150.4 18.7 6.0 89.2
30.8 1.7 146.7 18.2 1.8 86.6
32.0 1.4 152.6 19.1 2.0 90.9
32.7 1.5 155.6 19.5 1.6 92.9
33.0 2.6 157.1 19.7 2.8 94.0
5 10 19.7 7.3 197.3 10.9 8.1 109.4
18.1 6.8 180.7 9.9 7.4 99.4
17.2 6.4 172.0 9.4 7.1 94.1
17.0 5.3 170.1 9.3 5.8 92.8
17.5 5.4 174.6 9.6 5.6 95.7
16.9 6.4 168.6 9.2 7.2 91.9
6 5 11.2 8.5 223.6 5.9 9.0 117.8
10.0 3.8 200.5 5.3 4.0 105.5
9.7 3.9 194.9 5.1 3.5 102.0
9.6 6.7 192.7 5.1 7.4 101.1
9.8 6.5 196.8 5.2 6.7 103.4
9.5 1.4 190.3 5.0 1.3 99.5
7 0 0.0 NC NC NC NC NC
0.0 NC NC NC NC NC
0.0 NC NC NC NC NC
0.0 NC NC NC NC NC
0.0 NC NC NC NC NC
0.0 NC NC NC NC NC
Overall accuracy 149.6 98.1
Intermediate precision 21.8 6.5

Highlighted cells are outside the acceptable recovery range (80–120%). RSD and recovery not calculated for 0% expected integrity (NC).

To compare the linearity of the models, the average calculated percent genome integrity of each sample replicate (summarized in Table 4) was plotted against the expected integrity value, and the data were fit using GLS to account for correlations among experimental replicates. Although both models showed linear sample recovery in the range of 5–84%, the Poisson-multinomial data had a better linear fit with a slope of 1.01 and pseudo R2 of 0.996 (Fig 6, S3 Fig and S2 Table).

Fig 6. Linearity of the average percent linkage model (Formula 3) and Poisson-multinomial model for rAAV simulated genome integrity samples.

Fig 6

Average calculated percent integrity values from experimental replicates are plotted as points (n = 4, N = 6).

Comparison of linkage vs Poisson-multinomial models for genome integrity of heat-fragmented rAAV material

To evaluate the Poisson-multinomial model in a more realistic context, we wanted to generate genomes of varying integrity from a single source of rAAV material. Thermal stress has been shown to degrade DNA through spontaneous hydrolysis [22, 2531]. Replicate AAV-BIIB samples were therefore decapsidated by alkaline lysis and followed by either a 0, 1, 5, 10, 20, or 30-minute incubation at 95°C to generate variably intact, heat-fragmentated genomes. Primer/probe sets targeting the CMV enhancer and polyA-2 sequences were used to assay the samples in duplex reactions. Sample material was previously determined to have an estimated integrity of 48% via single-molecule long-read NGS.

Compared to the NGS data, both linkage-based models (Formula 3 or Formula 4) had greater overestimated integrity of the unheated sample material, than the Poisson-multinomial model (Fig 7). Additionally, although genome integrity decreased as incubation time at 95°C increased for all models, the values were much higher when calculated by either linkage-based model compared to the Poisson-multinomial model. Of note, the raw concentration of each single ddPCR target remained consistent across incubation time, demonstrating that heat treatment of up to 30 minutes leads to increased fragmentation, and not complete degradation of the template or reannealing of the ssDNA.

Fig 7. Genome integrities of heat-fragmented rAAV calculated by linkage or Poisson-multinomial models.

Fig 7

Average calculated genome integrity values are plotted as single bars (n = 12, N = 2). Dashed line depicts expected integrity as determined by long-read NGS. Average percent linkage (Formula 3), compensated percent linkage (Formula 4).

Discussion

This work summarizes the development of a novel analytical model for the calculation of gene and genome integrity from duplexed dPCR assays. The new model has been coded in R as a Shiny application, and the code is publicly available. Use of the Poisson-multinomial mixture distribution offers more rigorous modeling compared to simplistic percentage-based calculations (described by Formula 2), which are accurate only at highly dilute concentrations. Comparatively, our model is highly accurate across a wide range of template concentrations, expanding the dynamic concentration range of duplex integrity assays to at least four orders.

Unlike Poisson models which are limited to the prediction of single categories of droplet species (positive or negative), the Poisson-multinomial model considers the probability of multiple species of positive droplets, making it more suitable for the interpretation of duplexed data. Although linkage models also consider multiple species of positive droplets by subtracting the number of droplets with linked targets expected by chance from the number of double-positive droplets, we found our model to have improved intermediate precision and accuracy (compared to Formulas 3 and 4). Critically, use of linkage models consistently resulted in inflated integrity values for both simulated integrity samples (plasmid and rAAV) and heat-degraded rAAV samples.

We speculate linkage models may over-estimate genome integrity by calculating the fraction of full genomes (concentration of linked targets) out of the concentration of genome fragments (as either the average or maximum of 5’ and 3’ target concentrations) when the total concentration of both targets (sum of 5’ and 3’ target concentrations, minus linked target concentration) should be considered instead. The linkage models may have been developed with the assumption that even if genomes are fragmented that both 5’ and 3’ regions will be present, when in fact, truncated genomes often have fragmentation bias. Capsids containing truncated rAAV genomes can result from a variety of reasons such as defective replication or errors in viral packaging [2224]. Encapsidation errors have been shown to have bias towards 5’ truncation [24, 25]. Furthermore, the type and rate of truncation event is also dependent on the production method, vector sequence, and whether the virus is single-stranded or self-complementary [2628]. For these reasons, we believe it is more appropriate to consider the total concentration of template DNA as opposed to the average (or maximum) of 5’ and 3’ targets.

Accurately characterizing the encapsidated DNA of gene therapy material is a critical aspect of drug product safety and efficacy. Duplex dPCR assays have the potential to increase throughput and lower the cost of molecular characterization relative to NGS-based methods. By targeting regions near each ITR, duplex dPCR assays can provide accurate estimates of genome integrity and offer the advantage of absolute quantitation without the sample preparation bias that is inherent to NGS [3234]. Despite advancements in single-molecule long-read technologies such as PacBio and Nanopore, these platforms also exhibit some degree of sequencing biases that can impact quantitative results [4, 10, 34, 35]. Additionally, duplex dPCR reactions require minimal material for testing, can accommodate less purified samples and are relatively high throughput [1214]. Together, high levels of accuracy, precision, and sensitivity, combined with high throughput, makes duplex dPCR assays useful methods for characterizing rAAV and as an orthogonal approach to NGS. By expanding both the concentration dynamic range, and the linear range of the duplex dPCR integrity values to cover >0 to 100%, the Poisson-multinomial model is well-suited to serve as a screening method to support DOE studies and guide process decisions. Despite these strengths, it is important to note that duplex dPCR assays detect only the presence or absence of the primer and probe targets, and do not provide information about empty capsids, or encapsidated DNA sequences. This limitation underscores the need for orthogonal methods such as EM, CDMS, AUC, SEC-MALS, and NGS to fully characterize capsid populations, rAAV genomes, and contaminants.

The work summarized here for quantitating genome integrity can be applied to other duplex dPCR assays, including those aimed at monitoring residual DNA size and identity. FDA guidance recommends that residual DNAs be limited to under 10 ng/dose and less than 200 base pairs in length in final drug product [36]. As contaminants are expected to be present in very low concentrations in final drug products, utilizing the Poisson-multinomial model for accurate analysis of residual DNA integrity may be a useful tool in assessing the integrity and therefore risk level of contaminant DNAs. Multiplexed dPCR methods could potentially be expanded beyond duplex reactions to simultaneously measure both the quantity and integrity of specific residual DNAs from plasmids or host cell genes. By controlling whether decapsidation occurs prior to, or after droplet generation, one could potentially determine the levels of encapsidated residuals relative to the rAAV genome and provide characterization data around DNA content in different capsid populations: those that contain the rAAV genome and those that do not. Such analyses would require both the viral genome and residual plasmids to be detectable at similar dilutions, for which the expanded dynamic range of the Poisson-multinomial model may be advantageous. Currently, limited data is available comparing quantification of contaminant DNA using dPCR vs NGS. We look forward to future progress showing the correlation of orthogonal methods and the continued advancement of molecular technologies.

Materials and methods

Plasmid and rAAV material

pAAV material was produced by Biogen. AAV-GFP was purchased from Charles River Laboratories (CV10006). AAV-BIIB material was produced by Biogen using a transient transfection process.

Primers and probes

Custom primer and TaqMan probe sets targeting cytomegalovirus enhancer (CMV), and polyadenylation signals were purchased from Thermo Fisher Scientific. Sequences targeting CMV were designed as Forward Primer: 5′-ATGGAGTTCCGCGTTACAT-3′; Reverse Primer: 5′-AGTCCCTATTGGCGTTACTATG-3′; Probe: 5′-FAM- AACTTACGGTAAATGGCCCGCCT-MGB/NFQ-3′ (pAAV and heat fragmentation experiments) and 5′-VIC-AACTTACGGTAAATGGCCCGCCT-MGB/NFQ-3′ (simulated rAAV experiment). Sequences targeting pAAV polyA were designed as Forward Primer: 5′-GTTGGGAAGACAACCTGTAGGG-3′; Reverse Primer: 5′-GATTGCAGTGAGCCAAGATTG-3′; Probe: 5′-VIC-TCCAGCTTGGTTCCCAATAGACCC-MGB/NFQ-3′. Sequences targeting the AAV-GFP polyA (polyA-1 in Fig 4) were designed as Forward Primer: 5′-AGCAATAGCATCACAAATTTCACAA-3′; Reverse Primer: 5′-CCGCGTTAAGATACATTGATGAGTT-3′; Probe: 5′-FAM-AGCATTTTTTTCACTGCATTCTAGTTGTGGTTTGTC-MGB/NFQ-3′. Sequences targeting AAV-BIIB polyA (polyA-2 in Fig 4) were designed as Forward Primer: 5′-GCCAGCCATCTGTTGTTTGC-3′; Reverse Primer: 5′-GCGATGCAATTTCCTCATTT-3′; Probe: 5′-VIC- TTAGGAAAGGACAGTGGGAGTGGC-MGB/NFQ-3′.

Sample preparation

Plasmid simulated genomes

pAAV was digested with MfeI alone or with MfeI and NheI together (New England Biolabs). DNA concentrations of digests were measured using a Nanodrop spectrophotometer (Thermo Fisher Scientific) and converted to copies/mL. Digested plasmids (MfeI and MfeI/NheI) were diluted in TE and mixed at varying ratios to simulate varying degrees of genome integrity.

rAAV simulated genomes

Simulated integrity samples were prepared by mixing varying ratios of two rAAVs (AAV-GFP and AAV-BIIB) containing the same promoter (CMV) and different polyA tails (polyA-1 and polyA-2 respectively). Samples were pre-diluted to a target concentration range (1.125e5-5e6 vg/μL) and digested with DNase I for 30 minutes at 37°C. Viral capsids were disrupted with SDS solution and incubation at low heat (10 minutes, 60°C).

Heat fragmented rAAV genomes

AAV-BIIB samples were pre-diluted, DNase treated and decapsidated following the procedure described for rAAV simulated genomes. Following decapsidation, samples were incubated at 95°C for either 0, 1, 5, 10, 20 or 30 minutes.

ddPCR

Following sample preparation, samples were serially diluted and combined with ddPCR master mix with the addition of SmaI (New England Biolabs). Samples were partitioned into approximately 20,000 droplets using a Bio-Rad Automated Droplet Generator (1864101). Droplets were subjected to ITR restriction digestion and endpoint PCR thermal cycling in a BioRad C1000 Touch thermocycler programmed to follow: 1 cycle of 37°C × 15’; 1 cycle of 95°C × 10’; 40 cycles of 94°C × 30”, 60°C × 1’, and 1 cycle of: 98°C × 10’; 4°C hold). Samples were read on a Bio-Rad QX200 droplet reader.

Data analysis

Droplet analysis was performed using BioRad QuantaSoft software (version 1.7.4). Inter-assay replicates and experimental replicates are indicated in the figure legends. Percent recovery was calculated by comparing the calculated genome integrity to the expected genome integrity using the following equation: %Recovery=calculatedintegrityexpectedintegrityx100

Percent relative standard deviation (RSD) was calculated as the standard deviation of the calculated integrity replicates divided by the average of the calculated integrity replicates, multiplied by 100. Assay accuracy was calculated as the grand average of the replicate average percent recovery values. Intermediate precision was calculated as the RSD of the replicate average percent recovery values. Pre-defined acceptance criteria of RSD <20% and recoveries between 80–120% were used for data analysis of all experiments.

Long-read NGS

1-2E+12 vg of AAV-BIIB material was DNase treated and lysed with SDS solution. Extracted ssDNA was annealed by rapid heating and cooling using a C1000 Touch thermocycler (BioRad) to form dsDNA and purified using a QIAquick PCR purification kit (Qiagen). Independent libraries of rAAV were prepared as 1 μg samples diluted to 50 ng/μL with 20 ng of λDNA-BstEII digest (New England Biolabs) added for fragment size controls [4] and sequenced on two instruments: MinION Mk1B (Oxford Nanopore Technologies) and Sequel IIe (PacBio) following manufacturers guidelines for Native Barcoding and Ligation sequencing kits and SMRTbell prep kits respectively. A custom python-based analytical pipeline was used to pre-process Nanopore reads as FASTQ files. All sequencing reads were aligned with BWA-MEM [37], low-quality alignments were removed with SAMtools [38]. Percent genome integrity was estimated using the analytical pipeline that defined full-length reads as >90% ITR-ITR coverage, and calculated integrity as: %Integrity=numberfulllengthreadsnumberofpositivereadsx100. Genome integrity results from both sequencers (46% Nanopore, 49% PacBio) were averaged and used as the expected integrity (48%) for AAV-BIIB genome in heat fragmentation experiments.

Shiny app for Poisson multinomial model

A Web-based Shiny application was developed and deployed on the cloud-based Posit platform [39, 40]. The code for the app and raw data can be accessed publicly at: https://github.com/tlin-biogen/genome-integrity-public

Shiny app description

Data files generated by the QX200 droplet reader (BioRad) are imported into the app which extracts relevant raw data to calculate genome integrity with the Poisson multinomial model. Each well in the assay plate has two rows which correspond to two respective targets. Columns B through L are raw data extracted from the raw data file, while genome integrity is appended in column M in duplicate rows for each well. It is worth noting that “NA” will be displayed in column M when any droplet number in columns I-K is 0, as it fails to meet the prerequisites for Poisson-multinomial model. To ensure data integrity, the application verifies that the two percent full genome values in duplicate rows are identical, which is displayed as Boolean values in column N.

Supporting information

S1 Fig. Confirmation of pAAV restriction digest.

1.5 μg of pAAV was digested with either MfeI alone, or with MfeI and NheI together for 1 hour at 37°C. Samples were run on a 1% agarose gel and visualized with SYBR Safe (Invitrogen) on an Odessey M imaging system (Licor). From left to right: Lane 1: Thermo 1KB ladder (10787018); Lane 2: undigested pAAV; Lane 3: pAAV MfeI digest; Lane 4: pAAV Mfe, NheI double-digest. Digested bands were of expected sizes (Lane 3: 7284, 4266; Lane 4: 7284, 2464, 1802).

(TIF)

S2 Fig. Linearity of linkage and Poisson-multinomial models for simulated plasmid genome integrity.

Fitted values versus residuals plotted for each model.

(EPS)

S3 Fig. Linearity of linkage and Poisson-multinomial models for simulated rAAV genome integrity.

Fitted values versus residuals plotted for each model.

(EPS)

S1 Table. Linear fit information for linear models plotted in Fig 3D.

(PDF)

S2 Table. Linear fit information for linear models plotted in Fig 6.

(PDF)

S1 Raw images. Raw image of S1 Fig.

(PDF)

Acknowledgments

We thank Biogen’s NGS & Genetic Technologies Lab for assistance with sequencing rAAV material, and Chao-Jung (Julie) Wu for sequence analysis. We thank Romi Admanit and Svetlana Bergelson for analytical review and resource management.

Data Availability

All data files and R code are available for download from GitHub at https://github.com/tlin-biogen/genome-integrity-public.

Funding Statement

This work was sponsored by Biogen. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. All experimental design, data collection, analyses, and preparation of the manuscript were performed by Biogen employees.

References

  • 1.Wright JF. Product-related impurities in clinical-grade recombinant AAV vectors: Characterization and risk assessment. Biomedicines. 2014;2: 80–97. doi: 10.3390/biomedicines2010080 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gimpel AL, Katsikis G, Sha S, Maloney AJ, Hong MS, Nguyen TNT, et al. Analytical methods for process and product characterization of recombinant adeno-associated virus-based gene therapies. Mol Ther—Methods Clin Dev. 2021;20: 740–754. doi: 10.1016/j.omtm.2021.02.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Schnödt M, Büning H. Improving the Quality of Adeno-Associated Viral Vector Preparations: The Challenge of Product-Related Impurities. Hum Gene Ther Methods. 2017;28: 101–108. doi: 10.1089/hgtb.2016.188 [DOI] [PubMed] [Google Scholar]
  • 4.Tai PWL, Xie J, Fong K, Seetin M, Heiner C, Su Q, et al. Adeno-associated Virus Genome Population Sequencing Achieves Full Vector Genome Resolution and Reveals Human-Vector Chimeras. Mol Ther—Methods Clin Dev. 2018;9: 130–141. doi: 10.1016/j.omtm.2018.02.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Werle AK, Powers TW, Zobel JF, Wappelhorst CN, Jarrold MF, Lyktey NA, et al. Comparison of analytical techniques to quantitate the capsid content of adeno-associated viral vectors. Mol Ther—Methods Clin Dev. 2021;23: 254–262. doi: 10.1016/j.omtm.2021.08.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Frenkel R, Tribby D, Boumajny B, Larson N, Sampson M, Barney C, et al. ACUVRA: Anion-Exchange Chromatography UV-Ratio Analysis—A QC-Friendly Method for Monitoring Adeno-Associated Virus Empty Capsid Content To Support Process Development and GMP Release Testing. AAPS J. 2023;25: 1–10. doi: 10.1208/s12248-022-00768-0 [DOI] [PubMed] [Google Scholar]
  • 7.Lecomte E, Tournaire B, Cogné B, Dupont JB, Lindenbaum P, Martin-Fontaine M, et al. Advanced characterization of DNA molecules in rAAV vector preparations by single-stranded virus next-generation sequencing. Mol Ther—Nucleic Acids. 2015;4: e260. doi: 10.1038/mtna.2015.32 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Guerin K, Rego M, Bourges D, Ersing I, Haery L, Harten Demaio K, et al. A Novel Next-Generation Sequencing and Analysis Platform to Assess the Identity of Recombinant Adeno-Associated Viral Preparations from Viral DNA Extracts. Hum Gene Ther. 2020;31: 664–678. doi: 10.1089/hum.2019.277 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Radukic MT, Brandt D, Haak M, Muller KM, Kalinowski J. Nanopore sequencing of native adeno-Associated virus (AAV) single-stranded DNA using a transposase-based rapid protocol. NAR Genomics Bioinforma. 2020;2: 1–16. doi: 10.1093/nargab/lqaa074 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Namkung S, Tran NT, Manokaran S, He R, Su Q, Xie J, et al. Direct ITR-to-ITR Nanopore Sequencing of AAV Vector Genomes. Hum Gene Ther. 2022;33: 1187–1196. doi: 10.1089/hum.2022.143 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Tran NT, Lecomte E, Saleun S, Namkung S, Robin C, Weber K, et al. Human and insect cell-produced rAAVs show differences in genome heterogeneity. Hum Gene Ther. 2022;33: 371–388. doi: 10.1089/hum.2022.050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Kojabad AA, Farzanehpour M, Galeh HEG, Dorostkar R, Jafarpour A, Bolandian M, et al. Droplet digital PCR of viral ‎DNA/RNA, current progress, challenges, and future perspectives. J Med Virol. 2021;93: 4182–4197. doi: 10.1002/jmv.26846 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Shmidt AA, Egorova T V. PCR-based analytical methods for quantification and quality control of recombinant adeno-associated viral vector preparations. Pharmaceuticals. 2022;15. doi: 10.3390/ph15010023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Dobnik D, Kogovšek P, Jakomin T, Košir N, Žnidarič MT, Leskovec M, et al. Accurate quantification and characterization of adeno-associated viral vectors. Front Microbiol. 2019;10. doi: 10.3389/fmicb.2019.01570 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Morley AA. Digital PCR: A brief history. Biomol Detect Quantif. 2014;1: 1–2. doi: 10.1016/j.bdq.2014.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Dube S, Qin J, Ramakrishnan R. Mathematical analysis of copy number variation in a DNA sample using digital PCR on a nanofluidic device. PLoS One. 2008;3: 1–9. doi: 10.1371/journal.pone.0002876 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hindson BJ, Ness KD, Masquelier DA, Belgrader P, Heredia NJ, Makarewicz AJ, et al. High-throughput droplet digital PCR system for absolute quantitation of DNA copy number. Anal Chem. 2011;83: 8604–8610. doi: 10.1021/ac202028g [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Furuta-Hanawa B, Yamaguchi T, Uchida E. Two-Dimensional Droplet Digital PCR as a Tool for Titration and Integrity Evaluation of Recombinant Adeno-Associated Viral Vectors. Hum Gene Ther Methods. 2019;30: 127–136. doi: 10.1089/hgtb.2019.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Whale AS, Huggett JF, Tzonev S. Fundamentals of multiplexing with digital PCR. Biomol Detect Quantif. 2016;10: 15–23. doi: 10.1016/j.bdq.2016.05.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pinheiro LB, Coleman VA, Hindson CM, Herrmann J, Hindson BJ, Bhat S, et al. Evaluation of a droplet digital polymerase chain reaction format for DNA copy number quantification. Anal Chem. 2012;84: 1003–1011. doi: 10.1021/ac202578x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zanker J, Lázaro-Petri S, Hüser D, Heilbronn R, Savy A. Insight and Development of Advanced Recombinant Adeno-Associated Virus Analysis Tools Exploiting Single-Particle Quantification by Multidimensional Droplet Digital PCR. Hum Gene Ther. 2022;00. doi: 10.1089/hum.2021.182 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Prantner A, Maar D. Genome concentration, characterization, and integrity analysis of recombinant adeno-associated viral vectors using droplet digital PCR. PLoS ONE. 2023. doi: 10.1371/journal.pone.0280242 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Regan JF, Kamitaki N, Legler T, Cooper S, Klitgord N, Karlin-Neumann G, et al. A rapid molecular approach for chromosomal phasing. PLoS One. 2015;10: 1–15. doi: 10.1371/journal.pone.0118270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Basu AS. Digital Assays Part I: Partitioning Statistics and Digital PCR. SLAS Technol. 2017;22: 369–386. doi: 10.1177/2472630317705680 [DOI] [PubMed] [Google Scholar]
  • 25.Eigner J, Boedtker H, Michaels G. The thermal degradation of nucleic acids. Biochim Biophys Acta. 1961;51: 165–168. doi: 10.1016/0006-3002(61)91028-9 [DOI] [PubMed] [Google Scholar]
  • 26.Greer S, Zamenhof S. Studies on depurination of DNA by heat. J Mol Biol. 1962;4: 123–141. doi: 10.1016/s0022-2836(62)80046-1 [DOI] [PubMed] [Google Scholar]
  • 27.Marguet E, Forterre P. DNA stability at temperatures typical for hyperthermophiles. Nucleic Acids Res. 1994;22: 1681–1686. doi: 10.1093/nar/22.9.1681 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Marguet E, Forterre P. Protection of DNA by salts against thermodegradation at temperatures typical for hyperthermophiles. Extremophiles. 1998;2: 115–122. doi: 10.1007/s007920050050 [DOI] [PubMed] [Google Scholar]
  • 29.Peak MJ, Robb FT, Peak JG. Extreme resistance to thermally induced DNA backbone breaks in the hyperthermophilic archaeon Pyrococcus furiosus. J Bacteriol. 1995;177: 6316–6318. doi: 10.1128/jb.177.21.6316-6318.1995 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Lindahl T. Instability and decay of the primary structure of DNA. Nature. 1993;362: 709–715. doi: 10.1038/362709a0 [DOI] [PubMed] [Google Scholar]
  • 31.Lindahl T, Nyberg B. Rate of Depurination of Native Deoxyribonucleic Acid. Biochemistry. 1972;11: 3610–3618. doi: 10.1021/bi00769a018 [DOI] [PubMed] [Google Scholar]
  • 32.Van Dijk EL, Jaszczyszyn Y, Thermes C. Library preparation methods for next-generation sequencing: Tone down the bias. Exp Cell Res. 2014;322: 12–20. doi: 10.1016/j.yexcr.2014.01.008 [DOI] [PubMed] [Google Scholar]
  • 33.Meyer CA, Liu XS. Identifying and mitigating bias in next-generation sequencing methods for chromatin biology. Nat Rev Genet. 2014;15: 709–721. doi: 10.1038/nrg3788 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Amarasinghe SL, Su S, Dong X, Zappia L, Ritchie ME, Gouil Q. Opportunities and challenges in long-read sequencing data analysis—Genome Biology—Full Text. Genome Biol. 2020;21: 1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Zhang X, Davenport KW, Gu W, Daligault HE, Christine Munk A, Tashima H, et al. Improving genome assemblies by sequencing PCR products with PacBio. Biotechniques. 2012;53: 61–62. doi: 10.2144/0000113891 [DOI] [PubMed] [Google Scholar]
  • 36.Yang H. Establishing acceptable limits of residual DNA. PDA J Pharm Sci Technol. 2013;67: 155–163. doi: 10.5731/pdajpst.2013.00910 [DOI] [PubMed] [Google Scholar]
  • 37.Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. 2013;00: 1–3. Available: http://arxiv.org/abs/1303.3997 [Google Scholar]
  • 38.Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25: 2078–2079. doi: 10.1093/bioinformatics/btp352 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.R Core Team. A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria; 2022. Available: https://www.r-project.org/ [Google Scholar]
  • 40.Chang W, Cheng J, Allaire J, Sievert C, Schloerke B, Xie Y, et al. shiny: Web Application Framework for R. 2022. Available: https://cran.r-project.org/package=shiny [Google Scholar]

Decision Letter 0

Ruslan Kalendar

9 Aug 2023

PONE-D-23-22796A novel method for quantitation of AAV genome integrity and residual DNAs using duplex digital PCRPLOS ONE

Dear Dr. Tereshko,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Sep 23 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Ruslan Kalendar

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following financial disclosure: 

"This work was sponsored by Biogen.

All experimental design, data collection, analyses, and preparation of the manuscript were performed by Biogen employees."

  

Please state what role the funders took in the study.  If the funders had no role, please state: "The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript." 

If this statement is not correct you must amend it as needed. 

Please include this amended Role of Funder statement in your cover letter; we will change the online submission form on your behalf.

3. We note that you have stated that you will provide repository information for your data at acceptance. Should your manuscript be accepted for publication, we will hold it until you provide the relevant accession numbers or DOIs necessary to access your data. If you wish to make changes to your Data Availability statement, please describe these changes in your cover letter and we will update your Data Availability statement to reflect the information you provide.

4. We are unable to open your Supporting Information file [File Name]. Please kindly revise as necessary and re-upload.

5. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information. 

6. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Reviewer #1: 

Everybody should use this analysis method in the conditions described or be inspired to likewise improve upon the poor mathematics that somehow slipped into usage.

Line 35. A colleague commented to me that long read methods including NGS methods are inferior in precision to dPCR when quantitating amounts of species. Better precision is a technical advantage which might recommend dPCR even if the practical reasons given relating to cost, effort, data analysis and amount of material. My second hand opinion is that even if NGS were improved to be faster, data analysis automated by better software, and miniaturized to use less sample, that primer based methods will still be preferred for higher precision. I suggest that you add a sentence or two based on your experience of NGS methods. First, a statement that NGS supports the method development of dPCR primers. And then a statement of the technical reasons why primer based dPCR can achieve higher precision than NGS methods.

Line 320 Early discussions about electron microscopy use in the field gave me the opinion that Cryo EM worked much better than TEM but the industry tended to use TEM only because of cost despite its technical inferiority. The study in a paper by Werle et al that compared different methods on an unusual AAV sample with very low or missing partially empty capsids showed that even with no partially empty capsids present, TEM has precision issues. (Werle AK, Powers TW, Zobel JF, Wappelhorst CN, Jarrold MF, Lyktey NA, Sloan CDK, Wolf AJ, Adams-Hall S, Baldus P, Runnels HA. Comparison of analytical techniques to quantitate the capsid content of adeno-associated viral vectors. Mol Ther Methods Clin Dev. 2021 Sep 1;23:254-262. doi: 10.1016/j.omtm.2021.08.009. PMID: 34703846; PMCID: PMC8505359.) I suggest adding the Werle reference here and emphasizing that even in the simplest systems, TEM and other orthogonal methods have deficiencies in precision and dynamic range when quantitating percent intact genomes of AAV samples. (For instance, the EM they used always sees some empty or some full capsids even when the tested sample was monodisperse. Image recognition software is a bit imprecise.) The Werle paper also re-enforces the idea that all methods have technical and data analysis limitations that often require orthogonal information to correct. Too often instrument manufacturers distort the science by pressing the opinion that the “customer” will implement only one assay and using pecuniary pseudo-truths try to sell only one instrument.

I commend the authors on using theoretical considerations, on very well characterized plasmid derived samples, and on AAV samples controlling the temperature stress as a parameter. This was an excellent experimental plan. The scheme to present diagrams and figures to visualize a mainly mathematical point, particularly the use of the %recovery parameter, was excellent.

Reviewer #2: 

Recombinant AAV are highly efficient viral vectors for in vivo gene transfer. After decades of research and development, AAV-derived drugs are currently on the market for the treatment of genetic diseases. However, vector production processes do not guarantee homogeneous products. Whereas the holy grail being one AAV capsid containing one whole therapeutic cassette, most of AAV batches contain empty, partial, full and overloaded capsids. Thus, careful control of AAV genome integrity with accurate methods is of utmost importance.

In this manuscript, Lauren Tereshko and collaborators have used multiplex droplet digital PCR to assess rAAV genome integrity. The originality of the manuscript resides in the comparison of different calculation formulas to analyze genome integrity ddPCR data. Interestingly, they have proposed a new and Poisson-multinomial model that improves the accuracy and quantifiable range of duplex ddPCR assays to determine rAAV genome intergrity.

Only minor comments should be addressed:

- Introduction:

(lines 30-34) Add citations for rAAV heterogeneity, residual DNA and purification process statements.

(lines 58-59) Clarify the sentence “We compare the accuracy … genomes of 0-100% across a range of concentrations”. I assume 0-100% in length?

- Material and Methods must be fleshed out.

Please provide primers and probe sequences and PCR cycles conditions.

How was ITR restriction digestion performed?

How genome fragments were generated, verified and quantified? Restriction enzyme sites can be illustrated on Fig1. Add an agarose gel with plasmid fragments as supplemental figure.

Give more details about NGS (library preparation, injected quantity, bioinformatics analysis)

- Results:

Double stranded DNA is not degraded by 95°C heat (doi: 10.1089/dna.2013.2056). Is there another interpretation of your data? Heat effect on AAV single-stranded DNA? Fragments reannealing?

A verification of AAV DNA degradation by heat using an orthogonal method (microfluidic capillary electrophoresis…) may be interesting.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: David B Hayes

Reviewer #2: Yes: Magalie Penaud-Budloo

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Dec 14;18(12):e0293277. doi: 10.1371/journal.pone.0293277.r002

Author response to Decision Letter 0


22 Sep 2023

Please note the responses below are included in our “cover letter” and "response to reviewers" documents.

Cover letter:

Dear PLOS ONE Editors,

Thank you for your thoughtful review of our research article titled “A novel method for quantitation of AAV genome integrity using duplex digital PCR”. We appreciate the opportunity to incorporate your suggestions into the manuscript. We believe the changes detailed in our Response to Reviewers have improved the overall clarity and quality of the article.

We acknowledge that there were issues with our Supporting Information files. We have moved the captions for these files to the end of the revised manuscript. We have revised and re-uploaded all figure files. We have also reviewed our References list and verified that no retracted publications are cited. Changes to the reference list are noted in the Response to Reviewers.

After reviewing the Journal Requirements, we have updated the formatting of our manuscript to align with PLOS ONE’s guidelines. In addition to our Response to Reviewers, we would like to amend the following submission sections:

1. Authorship credits: In our first submission, we accidentally omitted an author. We would like to amend the authorship credits to include Yu Wang for Conceptualization of the project.

2. Role of Funder statement: We would like to amend our statement to, "This work was sponsored by Biogen. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. All experimental design, data collection, analyses, and preparation of the manuscript were performed by Biogen employees."

3. Data Availability statement: We have moved the datasets, R scripts underlying our figures, and the code for our Shiny app to a public GitHub repository. We would like to amend our statement to, “All data files and R code are available for download from GitHub at https://github.com/tlin-biogen/genome-integrity-public”

Response to Reviewers:

We thank the reviewers for their insights and suggestions. We feel that they greatly improved the clarity of the manuscript and added context to how the new methodology relates to orthogonal methods for AAV capsid and genome characterization. Please find our responses to each point interspersed with the reviewers’ comments (italicized) below:

Response to reviewer #1:

Line 35. A colleague commented to me that long read methods including NGS methods are inferior in precision to dPCR when quantitating amounts of species. Better precision is a technical advantage which might recommend dPCR even if the practical reasons given relating to cost, effort, data analysis and amount of material. My second hand opinion is that even if NGS were improved to be faster, data analysis automated by better software, and miniaturized to use less sample, that primer based methods will still be preferred for higher precision. I suggest that you add a sentence or two based on your experience of NGS methods. First, a statement that NGS supports the method development of dPCR primers. And then a statement of the technical reasons why primer based dPCR can achieve higher precision than NGS methods.

We appreciate Reviewer 1’s point about the limitations of the precision of NGS. We clarified our statement on the limitations of NGS in the introduction (lines 44-47). We also provided statements around the benefits of the accuracy/precision/sensitivity of duplex dPCR relative to NGS to the discussion section (lines 366-383).

Line 320 Early discussions about electron microscopy use in the field gave me the opinion that Cryo EM worked much better than TEM but the industry tended to use TEM only because of cost despite its technical inferiority. The study in a paper by Werle et al that compared different methods on an unusual AAV sample with very low or missing partially empty capsids showed that even with no partially empty capsids present, TEM has precision issues. (Werle AK, Powers TW, Zobel JF, Wappelhorst CN, Jarrold MF, Lyktey NA, Sloan CDK, Wolf AJ, Adams-Hall S, Baldus P, Runnels HA. Comparison of analytical techniques to quantitate the capsid content of adeno-associated viral vectors. Mol Ther Methods Clin Dev. 2021 Sep 1;23:254-262. doi: 10.1016/j.omtm.2021.08.009. PMID: 34703846; PMCID: PMC8505359.) I suggest adding the Werle reference here and emphasizing that even in the simplest systems, TEM and other orthogonal methods have deficiencies in precision and dynamic range when quantitating percent intact genomes of AAV samples. (For instance, the EM they used always sees some empty or some full capsids even when the tested sample was monodisperse. Image recognition software is a bit imprecise.) The Werle paper also re-enforces the idea that all methods have technical and data analysis limitations that often require orthogonal information to correct. Too often instrument manufacturers distort the science by pressing the opinion that the “customer” will implement only one assay and using pecuniary pseudo-truths try to sell only one instrument.

We agree with the reviewer’s suggestion to further emphasize the distinction between capsid characterization methods and genome characterization methods. We added a statement to the introduction to highlight the gap in capsid methodologies for characterizing genomes (lines 38-47). We have also added to the discussion as suggested, to now emphasize the value and necessity of employing orthogonal methods for characterizing capsid content and added several references, including the suggested reference Werle et al 2021 (lines 366-383).

I commend the authors on using theoretical considerations, on very well characterized plasmid derived samples, and on AAV samples controlling the temperature stress as a parameter. This was an excellent experimental plan. The scheme to present diagrams and figures to visualize a mainly mathematical point, particularly the use of the %recovery parameter, was excellent.

We thank the reviewer for these kind words regarding the experimental plan and presentation of the data.

Response to Reviewer #2:

In this manuscript, Lauren Tereshko and collaborators have used multiplex droplet digital PCR to assess rAAV genome integrity. The originality of the manuscript resides in the comparison of different calculation formulas to analyze genome integrity ddPCR data. Interestingly, they have proposed a new and Poisson-multinomial model that improves the accuracy and quantifiable range of duplex ddPCR assays to determine rAAV genome integrity.

Only minor comments should be addressed:

(lines 30-34) Add citations for rAAV heterogeneity, residual DNA and purification process statements.

We added several citations for rAAV heterogeneity and process impurities to the introduction (lines 36-38).

(lines 58-59) Clarify the sentence “We compare the accuracy … genomes of 0-100% across a range of concentrations”. I assume 0-100% in length?

We removed the 0-100% statement as it added confusion. The range of 0-100% simulated integrities corresponds to ranges in the percentage of the presence of both genetic termini from the population of template DNA molecules present. We added details regarding the preparation of mock integrity samples throughout the manuscript (see in particular lines 85-94). We hope these edits clarify that duplex dPCR reactions serve as a proxy for genome integrity by measuring the binary presence or absence of each genetic terminus and therefore do not directly measure genome length.

Material and Methods must be fleshed out. Please provide primers and probe sequences and PCR cycles conditions. How was ITR restriction digestion performed? Give more details about NGS (library preparation, injected quantity, bioinformatics analysis)

We have added significantly to the Materials and Methods section and now provide details for: DNA material identity, primer/probe sequences, thermocycling program, ITR digestion, NGS library preparation, sequencing, and analytical pipeline. We have added references to this section as needed.

How genome fragments were generated, verified and quantified? Restriction enzyme sites can be illustrated on Fig1. Add an agarose gel with plasmid fragments as supplemental figure.

Figure 1 was updated for clarity and includes the restriction sites. We have added a supplemental figure (S1 Fig) confirming the digestion of pAAV with MfeI alone and MfeI/NheI together. The image of the gel shows that the digested plasmid has the expected banding pattern for “intact” and “fragmented” genomes.

Double stranded DNA is not degraded by 95°C heat (doi: 10.1089/dna.2013.2056). Is there another interpretation of your data? Heat effect on AAV single-stranded DNA? Fragments reannealing? A verification of AAV DNA degradation by heat using an orthogonal method (microfluidic capillary electrophoresis…) may be interesting.

We appreciate Reviewer 2’s suggestion to verify the effects of heat on AAV DNA. In contrast to Karni et al 2013 (doi: 10.1089/dna.2013.2056), which tested the effects of heat on plasmid DNA, we have used linear dsDNA and ssDNA which have been shown to be less stable than circular DNA (Marguet and Forterre 1994 (https://doi.org/10.1093/nar/22.9.1681), Lindahl and Nyberg 1972 (https://doi.org/10.1021/bi00769a018)). Although Karni et al 2013 did not see significant degradation of plasmid DNA at 95°C, this was tested only with dry DNA for 5 minutes. In their discussion section, the authors state “In an aqueous solution, it is not possible to determine the degradation temperature using our method, since applying pressure on the DNA solution causes the DNA to be more sensitive to heat, and therefore the DNA degrades already above 90°C.” In addition to the two mentioned above, multiple other peer-reviewed publications have previously reported thermal degradation of dsDNA and ssDNA in aqueous solution. Our reference section has been updated to expand on these citations (see references 22, 25-31).

To address Reviewer 2’s suggestion, we have performed an alternative experiment. Unfortunately, we do not have enough of the ssDNA AAV material that was used in Figure 7 to visualize the degradation by electrophoresis. We instead linearized AAV vector plasmid DNA by restriction digest with BamHI, performed heat stress, and resolved the samples by agarose gel electrophoresis. Below are the results of incubating 500 ng samples of linearized plasmid DNA for 0, 5, 15 or 30 minutes at 95°C. The control lane (0 minutes at 95°C) shows the majority of DNA is of the expected length (~12 KB). Upon heat treatment, the band gradually becomes smaller. At 15 and 30 minutes at 95°C, smearing below the band can be seen at increasingly small sizes.

While the data here demonstrate the thermal instability of linearized plasmid DNA, this does not directly correlate to the manuscript as Fig 7 uses AAV material; we therefore did not include this as a supplemental figure. However, we believe these data combined with our results from Fig 7, and the additional cited publications support our conclusion that rAAV DNA degrades into smaller fragments upon heat treatment thereby reducing the number of intact templates. Although re-annealing of ssDNA fragments could potentially occur, this alone would not affect measured genome integrities, as it would not change the ratio of the number of 5’ and 3’ targets that are present during PCR amplification cycles. Furthermore, we do not see changes in the concentration of ddPCR targets in the raw data across incubation times, which would be expected to decrease if annealing occurred.

Attachment

Submitted filename: Response to Reviewers.docx

Decision Letter 1

Simone Agostini

10 Oct 2023

A novel method for quantitation of AAV genome integrity using duplex digital PCR

PONE-D-23-22796R1

Dear Dr. Tereshko,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Simone Agostini, Ph.D.

Academic Editor

PLOS ONE

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The comments about the original manuscript asked for some more information and clarification. The authors did a good job adding the requested information and looking up extra references. I support adding some information as supplemental. Also, it seems to me that the initial temperature degradation studies were sufficient for the purpose of the paper. I believe that the extra experiments performed were helpful in supporting the initial results and that the authors also properly discussed why heat degradation studies are perhaps more complicated than one would expect.

Reviewer #2: (line 38) "Current purification processes can efficiently separate empty from full capsids". Please moderate "efficiently". Empty capsids (and even more partial capsids) are hard to completely remove.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: David B Hayes

Reviewer #2: Yes: Dr PENAUD-BUDLOO Magalie

**********

Acceptance letter

Simone Agostini

5 Dec 2023

PONE-D-23-22796R1

A novel method for quantitation of AAV genome integrity using duplex digital PCR

Dear Dr. Tereshko:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at customercare@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Simone Agostini

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Confirmation of pAAV restriction digest.

    1.5 μg of pAAV was digested with either MfeI alone, or with MfeI and NheI together for 1 hour at 37°C. Samples were run on a 1% agarose gel and visualized with SYBR Safe (Invitrogen) on an Odessey M imaging system (Licor). From left to right: Lane 1: Thermo 1KB ladder (10787018); Lane 2: undigested pAAV; Lane 3: pAAV MfeI digest; Lane 4: pAAV Mfe, NheI double-digest. Digested bands were of expected sizes (Lane 3: 7284, 4266; Lane 4: 7284, 2464, 1802).

    (TIF)

    S2 Fig. Linearity of linkage and Poisson-multinomial models for simulated plasmid genome integrity.

    Fitted values versus residuals plotted for each model.

    (EPS)

    S3 Fig. Linearity of linkage and Poisson-multinomial models for simulated rAAV genome integrity.

    Fitted values versus residuals plotted for each model.

    (EPS)

    S1 Table. Linear fit information for linear models plotted in Fig 3D.

    (PDF)

    S2 Table. Linear fit information for linear models plotted in Fig 6.

    (PDF)

    S1 Raw images. Raw image of S1 Fig.

    (PDF)

    Attachment

    Submitted filename: Response to Reviewers.docx

    Data Availability Statement

    All data files and R code are available for download from GitHub at https://github.com/tlin-biogen/genome-integrity-public.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES