Abstract
Many biological assays are employed in virology to quantify parameters of interest. Two such classes of assays, virus quantification assays (VQAs) and infectivity assays (IAs), aim to estimate the number of viruses present in a solution and the ability of a viral strain to successfully infect a host cell, respectively. VQAs operate at extremely dilute concentrations, and results can be subject to stochastic variability in virus-cell interactions. At the other extreme, high viral-particle concentrations are used in IAs, resulting in large numbers of viruses infecting each cell, enough for measurable change in total transcription activity. Furthermore, host cells can be infected at any concentration regime by multiple particles, resulting in a statistical multiplicity of infection and yielding potentially significant variability in the assay signal and parameter estimates. We develop probabilistic models for statistical multiplicity of infection at low and high viral-particle-concentration limits and apply them to the plaque (VQA), endpoint dilution (VQA), and luciferase reporter (IA) assays. A web-based tool implementing our models and analysis is also developed and presented. We test our proposed new methods for inferring experimental parameters from data using numerical simulations and show improvement on existing procedures in all limits.
Introduction
Understanding viral dynamics is an important task in medicine, epidemiology, public health, and, in particular, for the development of antiviral therapies and vaccines. Drugs that hinder viral infection include blockers of viral entry into the host cell (1, 2, 3, 4, 5, 6) and inhibitors of genetic activity and protein assembly inside the cytoplasm and nucleus (7, 8, 9). Mechanistic models of drug action have recently emerged as useful tools in helping design ad hoc experiments to study drug efficacy and in interpreting results (10, 11, 12, 13). Mathematical models typically assume prior knowledge of given physical quantities pertaining to the virus, host cell, or the biological assay being studied. Once these parameters are assigned, viral and cell population dynamics and their statistical properties can be predicted. Among the different experimental assays, one often seeks to evaluate the number of virus particles in a stock solution or the number of viruses that have successfully infected host cells (6, 14, 15, 16, 17, 18, 19).
In the case of virus quantification assays (VQAs), performing repeated controlled experiments on viral dynamics or comparing results across multiple studies requires knowing how many viruses are present in the initial stock solution of each experiment (4, 5). Furthermore, antigens that induce immune responses against viral infections may be engineered from viral components such as capsid proteins, viral enzymes, and genetic vectors (20) and may be used in the development of vaccines. Being able to determine the exact number of virus-derived antigens helps control the efficacy of vaccines and optimize yield (21, 22, 23).
Given the central role of VQAs, several assays have been designed to estimate viral particle counts. These include plaque (24) and endpoint dilution (23, 25) assays, which will be discussed in more detail in the remainder of this work. For now, we note that these assays involve repeatedly diluting an initial solution of virus particles in the presence of a layer of plated cells until viral concentrations are low enough that the dynamics of an individual virus can be extrapolated. At these low particle counts, however, the discrete nature of the infection process cannot be neglected and can cause substantial discrepancies when replicating experiments. Average quantities are not necessarily representative, and a more in-depth approach in quantifying virus-cell interactions is necessary.
Infectivity assays (IAs), on the other hand, aim to quantify the number of viruses that have successfully infected host cells under varying antiviral drug environments (14, 15, 16). IAs may measure the total transcription activity across all cells, such as the luciferase reporter assay (15, 26), or may count the number of host cells that were successfully infected, such as the enzyme-linked immunosorbent assay and the immunofluorescence assay with fluorescence-activated cell sorting (4, 14, 15, 18). These assays are performed using undiluted solutions with large numbers of viral particles, reducing stochastic variability. The average number of viruses that infect a cell is estimated as the ratio of the number of viruses in solution to the number of plated cells, a quantity known as the multiplicity of infection (MOI) (19). However, each cell may be infected by different numbers of viruses distributed around the average given by the MOI. In these cases, one may be interested in the complete probability distribution for the number of virus infections in each plated cell and in the related statistical variance.
In this work, we derive a probability model for the distribution of viral infections per host cell, which we call the statistical multiplicity of infection (SMOI). The SMOI can be used as a starting point to help estimate the number of viral particles in solution in VQAs and to determine a viral strain’s ability to successfully infect host cells in IAs. In Probabilistic Models of Statistical Multiplicity of Infection, we present the mathematical foundations for the SMOI in the two experimentally relevant parameter regimes of small and large viral particle counts and derive a probability model for the total number of infected cells under any dilution level. In Plaque Assay, we apply our models to the plaque assay and, to our knowledge, formulate a new method of analyzing plaque count data. In Endpoint Dilution Assay, we employ a special case of the derived probability distribution to the endpoint dilution assays and compare our results to those arising from traditional titration techniques such as the Reed and Muench (RM) (27) and Spearman-Karber (SK) methods (28). In Luciferase Reporter Assay, we use the large particle limit of our model to describe the luciferase reporter assay. Lastly, a discussion of our results, a side-by-side comparison with existing methods, and a link to web-based data analysis tools are provided in Conclusions. Mathematical appendices and further discussion of experimental attributes such as cell-size variability, coinfection, viral interference, and optimal experimental design using parameter-sensitivity analysis are presented in the Supporting Materials and Methods.
Methods
Probabilistic models of statistical multiplicity of infection
A typical viral assay is initiated by laying a monolayer of M cells on the bottom of a microtiter well, as illustrated in Fig. 1 (17, 24, 25). Although variability exists among experiments, M is often set within the range of 104–105 (14, 26) and is assumed to be a known experimental parameter. A supernatant containing N0 virus particles in the range of 105–107 (24, 25, 26) is then added to the microtiter well. Although, theoretically, all N0 particles are capable of infection, not all will successfully infect a cell. Because infection of a host cell requires a complex sequence of biochemical processes that may include receptor binding, membrane fusion, reverse transcription, nuclear pore transport, and DNA integration (10, 19), virus particles that fail at one or several of these sequential steps lead to abortive infections. To differentiate, the particles that do succeed are called infectious units (IU) or plaque-forming units (PFU). We will denote the number of IUs as N ≤ N0. Depending on the strain of virus, the particular experimental protocol used, and specific conditions of the assay, the random quantity N is distributed according to N0 and the overall effective probability that an arbitrary viral particle successfully infects a host cell. A proxy that is typically used in place of this effective probability is the “particle to PFU ratio” Q, an experimentally determined parameter that quantifies, on average, the minimal number of particles required to ensure at least one infected cell (29, 30). Q is often treated as an a priori measured quantity, primarily associated with the particular strain of virus being studied. Low values of Q, such as with poliovirus (Q = 30) (30), have a high likelihood of successful infection compared to viruses with large Q, such as human immunodeficiency virus type 1 (HIV-1) (Q = 107) (31). Thus, the reciprocal Q−1 can be interpreted as the probability for a single virus to infect a host cell. Assuming an initial stock of N0 particles, the discrete probability density function of N is
(1) |
which defines a binomial distribution with parameters N0 and Q−1. Although we assume Q to be a priori known, in actuality, the probability of a virus successfully infecting a host is highly dependent on the methods used to harvest the virus stock, the experimental parameters of the assay, the host receptor concentrations and binding rates, and the dynamics of the physiological processes leading to infection (29, 32). A thorough investigation into these processes would be necessary to mechanistically model Q and is outside of the scope of this work. However, we will discuss in Conclusions how, with direct measurements of certain other parameters, especially N0, our derived methods may also be used to infer Q.
We assume each viral particle in solution acts independently of others and that host cell infection attempts are random events. At high ratios N0/M of particles to cells, a quantity referred to as the MOI, it becomes increasingly probable for more than one IU to infect the same host cell. We define M0 as the count of cells not infected by any IU, M1 as the count of cells infected by exactly one IU, and so on, up to MN, the number of cells infected by all N IUs. The SMOI is defined as the ensemble of cell counts . Note that two constraints must hold: to account for all infected and uninfected cells and for conservation of the total number of IUs. If we assume all M cells are of identical size and volume, they carry equal probability of being infected by a particular virus. Thus, evaluating the probability distribution that Mr takes on the value mr reduces to the well-known occupancy problem of randomly placing balls into identical urns (33), and we derive
(2) |
where the r term is repeated j times in the lower argument of the multinomial coefficient. The derivation of Eq. 2 is detailed in Appendix A in the Supporting Materials and Methods, and an investigation into the effects of inhomogeneous cell sizes is presented in Appendix B. Furthermore, in Appendix A, we derive the expected value and variance of Mr as
(3) |
and
(4) |
Note that the variance is equal to the expected value with two additional correction terms that cancel each other as N and M increase, indicating the probability distribution of Mr is Poisson-like for large N and M. A plot of a representative probability distribution and a test of agreement between our analytical result and numerical simulation is provided in Fig. 2.
We also derive the joint probability that the SMOI takes on the set of values as
(5) |
The first and second multinomial expressions enumerate the degeneracy of how the M identical cells are distributed across the configuration and how the N identical IUs are chosen for those cells, respectively. Although the second expression in Eq. 5 is more succinct, it must be explicitly conditioned on the constraints and .
The expressions in Eqs. 2 and 5 provide an exact discrete description of the stochasticity of the MOI but are computationally expensive to evaluate for large values of N and M. In a typical virology experiment, the number of viral particles N0 and host cells M are large enough for certain asymptotic methods to be applicable. Furthermore, for intermediate values of Q and based on Eq. 1, the expected number of IUs N would be similarly large. We can thus take the mathematical limit while keeping the ratio fixed and approximate Eq. 2 as
(6) |
Equation 6 implies that Mr is Poisson-distributed, with mean and variance
(7) |
A mathematical justification of Eq. 6 is given in Appendix A, and comparisons of Eq. 6 and the analytical result in Eq. 2 to simulations are shown in Fig. 3.
Under the same large M,N limit and using Eq. 6, we show in Appendix A
(8) |
which implies that as , the random variables ,, are independently distributed. In the next section, we will apply results of our probability model of SMOI to the case of a repeatedly diluted solution of virus particles, a procedure used in many VQAs.
Serial dilution
Low viral-particle concentrations in assays are typically obtained via serial dilution processes to increase the sensitivity to individual viral infections (4, 24, 25). The initial viral stock containing N0 particles is diluted by a fixed factor of D, and the process is repeated dmax times. At each dilution number d, an assay can be performed to determine if the concentration of virus particles in the diluted solution is sufficient to generate a qualitative signal of infection, known as a “cytopathic effect” (CE). For example, the diluted stock can be administered in vivo to a model organism such as a mouse. The mouse’s death would indicate that at least one lethal unit of the virus was present at that dilution level. Alternatively, an in vitro assay can be carried out to measure a signal that, for example, quantifies the exact number of plated cells that were successfully infected. To model these assays, we first define M∗ as the number of host cells infected by at least one IU and that are capable of producing new viruses. In Appendix A we derive the discrete probability density function for finding M∗ = m infected cells at a given dilution number d and find
(9) |
Equation 9 shows that the number of infected cells M∗ is binomially distributed with expected value
(10) |
and variance
(11) |
We can define the probability of observing a CE at dilution number d as the probability of finding one or more infected cells:
(12) |
The definition we use in Eq. 12 assumes an in vitro assay that can exhibit a cytopathic signal after a single cell infection or more. For in vivo assays, the probability that m infected cells are sufficient for a CE will depend on many complex physiological factors such as immune pressure, in-host viral evolution, and virion burst size (34). A plot of how the initial particle count N0 and dilution factor D effect the characteristic functional form of Eq. 12 is shown in Fig. 4. Although both Eqs. 9 and 12 assume each IU contains all viral genes required for in-host replication, an extended probability model that factors in genetic mutation and degradation is provided in Appendix C. Furthermore, for the case of retroviruses, infectious processes inside the host cytoplasm may be suppressed by previous infections, known as viral interference, and is explored in Appendix D. In Plaque Assay, we will use Eq. 9 to analyze the plaque assay. Equation 12 will be used for “binary” assays that are only concerned with the presence or absence of a CE, such as the endpoint dilution assay, which we will explore in Endpoint Dilution Assay.
Results and Discussion
Plaque assay
The plaque assay is an example of a VQA in which the objective is to infer the total number of viruses N0 present in a solution, assuming the PFU to particle ratio Q has been independently measured and estimated (24, 25, 35). After d serial dilutions, the viral stock is added to a monolayer of M cells, and a layer of agar gel is added to the well to inhibit the diffusion of virus particles in the plate. If a virus successfully infects a host cell, the agar will limit the range of new infections to the most adjacent cells. Viral infection thus spreads out radially from the initial nucleation infection and forms a visible discoloration in the plate called a “plaque.” For high particle concentrations, the number of plaques formed may be large enough to cover the entire plate surface. After a sufficient critical dilution number dc, however, the number of plaques formed are low enough to be visibly distinct and countable. For each dilution number d, the assay can be performed for T number of trials. The “signal” data arising from the plaque assay Pd,t is defined as the number of visible plaques counted, where is the trial number. The standard method of obtaining an estimate of the true particle count N0 is to apply the sample mean of the data at the critical dilution level dc to the formula
(13) |
which posits that the average number of plaques is directly proportional to the particle count N0. Equation 13 assumes that each infected cell corresponds to one IU, which is not necessarily true in the context of SMOI. Furthermore, although data corresponding to dilution numbers d < dc are unusable, data for d > dc corresponding to countable plaques are not used at all in Eq. 13.
To improve on Eq. 13 by using the entire set of plaque counts Pd,t for our estimate of N0, we propose a maximal likelihood estimation (MLE) scheme. Using the mathematical models derived above, we can construct an expression of the probability that the observed data Pd,t can be generated assuming a particular value for N0, known as a likelihood function. A value for N0 that maximizes corresponds to the most probable estimate that could have generated the data. As each nucleation of a plaque corresponds to a distinct infected cell (and assuming that overlapping lesions of necrotic cells are still discernible as distinct plaques), we can equate Pd,t to the total number of successfully infected cells M∗. We will ignore the dynamics of coinfection and viral interference. Using Eq. 9, we propose the following likelihood function of the data given N0:
(14) |
To obtain the MLE , we take the derivative of the natural log of Eq. 14 with respect to N0 and set the result to zero to obtain
(15) |
We can solve Eq. 15 for using numerical methods such as Newton-Raphson (36), an iterative scheme that approaches the solution of an equation asymptotically starting from an initial guess . To increase the stability of convergence to the solution, we choose by equating the sample average of plaque counts with the expected number of infected cells E[M∗] in Eq. 10 at the critical dilution dc to derive
(16) |
An example of raw plaque-count data and the resulting estimates for N0 are given in Fig. 5. To quantify the relative improvement of the MLE of N0 over the standard method in Eq. 13, we simulate plaque assay data assuming a fixed, known N0 value. In our simulation, we use the models established in Probabilistic Models of Statistical Multiplicity of Infection to sample the N0 particles according to Eq. S9 in Appendix A to account for serial dilution and sample again the resulting particles according to Eq. 1 to obtain the number of IUs N. The IUs are distributed randomly to the M cells with equal probability, and the resulting number of infected cells M∗ is recorded. Because plates of cells with too many infections render the number of plaques uncountable, a “countable plaque threshold” renders the data unusable when the number of infected cells exceed the threshold. Thus, the resulting plaque data Pd,t for a given dilution d and trial t is assigned the number of simulated infected cells if the latter is less than the given threshold. A scatter plot of the data Pd,t of one such simulation is shown in Fig. 6 a, and the corresponding likelihood function from Eq. 14 is plotted in Fig. 6 b. Because the MLE method utilizes a full probabilistic model of the plaque count distribution instead of relying only on the expected value at the single critical dilution dc, it produces an estimate consistently closer to the original N0 that generated the data. To better quantify this property, in Appendix E, we derive an asymptotic approximation of the variance of as
(17) |
The variance is an explicit function of Q, which is assumed to be a priori known. If there is uncertainty in the value of Q, Eq. 17 can quantify how sensitive the distribution of is to variation in Q, as shown in Fig. 7 a. We can see that for small assumed Q, such as in poliovirus (30), error in this measurement can cause a large relative change in the accuracy of . This type of sensitivity analysis on estimation variance can be done with any experimental parameter included in the likelihood function in Eq. 14. Furthermore, for directly controllable parameters, such as the serial dilution factor D, Eq. 17 can provide insight into optimizing the assay protocol, as shown in Fig. 7 b. Although it is evident that small D would increase the accuracy of the estimate, doing so requires more serial dilutions, which increases the time and expense of the assay. Thus, our sensitivity analysis provides a quantitative method for making experimental design choices between minimizing uncertainty versus the cost of an assay protocol. Lastly, if we compute the variance of the standard method in Eq. 13 due to the known variance in the data Pd,t and compare with Eq. 17, we find, when using realistic parameter values from Fig. 5, the standard method results in a 10.7% higher variance than that of our method. Although the significance of the relative increase in precision of estimating N0 found using our method is highly dependent on the context of the experimental study for which the assay was performed, similar sensitivity analysis can be used to determine such tolerances.
Endpoint dilution assay
Another widely used assay for quantifying the initial viral particle count N0 is the endpoint dilution or endpoint titration assay (23, 25, 37). It is often used in place of the plaque assay, as it can be more rapidly performed and is useful for viral strains that are unable to form plaques. Here, serial dilutions at a factor of D are employed, and at every dilution number d, an assay is performed T times to test for a successful CE. The number Ed of observed CEs among the T trials at a given dilution number d is recorded as the signal. For low dilution, we expect many cells to be infected, and the probability of observing a CE, as shown in Eq. 12, is close to 1. If every trial of the assay is likely to display a CE, then Ed is expected to be close to T. However, at high dilution, the probability in Eq. 12 rapidly decreases to 0, as shown in Fig. 4, and Ed will be similarly small. For a large initial stock of viral particles N0, a larger dilution number d is needed to ensure the dramatic change in probability in Eq. 12. Thus, the critical dilution at which Ed most rapidly decreases from T can be used to estimate the particle count N0. This occurs at the point of inflection when d = logD(N0Q−1) and corresponds to when the expected number of successful trials E[Ed] = T (1 − e−1), as shown in Fig. 8.
One commonly used way to estimate N0 is the RM method, which utilizes the two dilution numbers that capture the greatest change in the data Ed (27). We first define a critical dilution number to be the largest dilution such that at least 50% of the trials exhibit a CE. The estimate for the particle count N0 is given by
(18) |
The RM method effectively attempts to approximate the steepest descent of the CE probability given in Eq. 12 with a line connecting the assay data at dilutions and , as displayed in Fig. 8 a. Unfortunately, this line always rests above the actual expectation curve of Ed, so any estimate obtained from this method will overestimate the true N0. Another commonly used estimation scheme is the SK method, which uses the critical dilution number , the largest dilution such that 100% of trials exhibit a CE (28, 37). The SK estimate is given by
(19) |
In this method, the downward slope for the expectation of Ed is assumed to follow a decaying exponential starting at dilution , as shown in Fig. 8 b. The intention is to find the dilution at which Te−1 CEs are expected by calculating the area under the exponential curve, given by the summation term in Eq. 19. However, the actual values of Ed will follow the expected curve from our model, leading to an overestimate of the area and, by extension, a larger value for . Both standard methods were derived from the heuristic observation that Ed exhibits sigmoidal behavior as a function of the dilution number d, but an underlying probabilistic model was missing, resulting in consistent overestimation of the true N0. Furthermore, neither method uses the “particle to PFU ratio” Q, accounts for the stochasticity of serial diluting viral samples, considers the dynamics of SMOI, or employs the entire set of data Ed.
We present an alternative way to infer N0, using Eq. 12 to establish an MLE scheme. We restrict ourselves to in vitro assays in which a single infected cell is sufficient to display a CE. Then each cytopathic count is binomially distributed with parameters T and the probability given in Eq. 12. Thus, for a set of data , we propose the likelihood function
(20) |
Equation 20 is an expression of the probability of the data given the current assumed value of N0. To obtain the best estimate of N0, we maximize the likelihood function by taking the log and derivative of with respect to N0 and set it equal to zero to obtain
(21) |
As with Eq. 15, solving Eq. 21 for requires a numerical method such as Newton-Raphson. As an appropriate initial estimate for , the formula
(22) |
can be used, where dc is the largest dilution number such that at least half of the trials exhibit a CE. Equation 22 is the average of the N0 estimates at dilutions dc and dc+1 when setting the CE probability in Eq. 12 to 1/2. For a comparison of our MLE method with the RM and SK methods, we simulate data similar to that described in Plaque Assay. Here, we take the number of trials such that the simulated count of infected cells is greater than zero as the values of Ed for a given dilution number d. We plot the likelihood from Eq. 20 and compare the MLE of N0 with those derived by the RM and SK methods in Fig. 9 a. Although both RM and SK estimate very similar values of , they both consistently overestimate the a priori set N0 relative to the MLE method. This demonstrates the advantage of a probabilistic model for parameter inference over heuristically determined formulas.
The expressions we derived in Eqs. 14 and 20, applied to simulated data, can also help quantify tradeoffs in experimental design. As discussed above, there exist viruses that cannot form plaques, restricting the options of VQAs to endpoint dilution. However, for many cases, the choice between using one assay over the other can be one of convenience. More specifically, endpoint dilution assays can often be performed more rapidly than plaque assays. Using the same simulated data for both assays, we plot Eqs. 14 and 20 together in Fig. 9 b. The plots clearly show the superiority of the plaque assay for estimating the viral stock number N0 in respect to both how close the MLE infers the true N0 value and the amount of variance in that estimate. Although the amount of variability and error that is tolerable for an experiment may be context dependent, the plots in Fig. 9 b provide a quantitative way to differentiate between the two methods.
Luciferase reporter assay
The luciferase reporter assay is commonly used to measure the infectivity of a viral strain. Here, the ratio μ = N/M of total infections over the number of plated cells is estimated by measuring the transcription activity of viral proteins (14, 15, 16). The reporter employs an oxidative enzyme luciferase that facilitates a reaction when introduced to the substrate luciferin, resulting in bioluminescence. The protocol begins with attaching the luciferase gene to the viral genome. The altered viral strain is cloned to a total particle count N0, which, in this case, is assumed to be fixed and known. The solution of viruses is added to a plated monolayer of M host cells. An incubation time is allowed for transcription of viral proteins and, incidentally, the luciferase enzyme. Subsequently, all cells are lysed to release all cytoplasmic contents into the solution, upon which luciferin is added. The oxidation of luciferin is facilitated by the luciferase enzyme, and the resulting bioluminescence yields a measurable signal (38). The light intensity is thus a measure of total transcription activity of the viral genome in all infected cells and can be used as a proxy for the total number of viruses N that successfully infected host cells.
Although there is stochasticity in transcription-factor binding and, in the case of retroviruses, the number of integration sites on the host DNA, we will assume that each successful virus infection contributes one viral genome to be transcribed and each transcription occurs at a constant rate proportional to the total number of integrated viral genomes. Note that the limited number of transcription factors, ribosomes, and other cell machinery necessary to produce viral proteins and the luciferase reporter causes the production rate to saturate as the number of infecting viruses r per cell increases. Thus, transcription activity saturates with increasing number of infections r. We can model this effect by defining a monotonically increasing function f(r) representing the number of transcribed viral proteins when a cell is infected by r viruses over the course of the assay. Thus, for a given SMOI , we will model the intensity signal L of the total luciferase reporter luminescence with
(23) |
where L0 is the fluorescence intensity arising from a single luciferase reporter present in the solution. Although f(r) may take on many functional forms, a commonly used model for transcription-factor kinetics is the Hill function (39) given by
(24) |
where is the maximal transcription activity of luciferase, h is the Hill coefficient that effectively describes cooperative binding of multiple transcription factors at a promoter region, and K is an effective dissociation constant relating the binding and unbinding rates of transcription factor. The functional form of Eq. 24 accounts for the limited transcription machinery available for the multiple copies of viral genome present in the cell. In Fig. 10 a, we calculate the discrete probability distribution by considering the cumulative weight of every allowable configuration of N viruses infecting M cells through Eq. 23.
Because luciferase reporter assays typically involve large values of initial virus count N0 and cell count M, we can use the asymptotic approximations in Eqs. 6 and 7 along with the central limit theorem (40) to assume L is normally distributed with expected value
(25) |
and variance
(26) |
A visualization of the normal approximation of the probability distribution of L is shown in Fig. 10 b. Furthermore, with Eqs. 25 and 26, we can derive the likelihood function of the data , given μ:
(27) |
where 1 ≤ t ≤ T is the trial number. Because of the complicated functional form of the mean and variance of L, creating a maximal likelihood scheme to estimate μ from experimental data is intractable, so we use Eq. 25 by replacing the expected value with the experimental average of measurements . If we assume no cooperative transcription binding (h = 1), we solve for the estimate by applying the Newton-Raphson iterative method to the equation
(28) |
The typical method, under the assumption that luminescent intensity is proportional to the number of IUs N, is to use the sample mean via the formula . This standard approach fails to account for the effects of SMOI, but can be used to generate an initial guess for solving Eq. 28 iteratively. To compare the two estimates, we simulate data similar the descriptions in the previous two sections. Here, we do not dilute the initial particle count and, after distributing the N IUs to the M cells with equal probability, we compile the SMOI configuration and calculate using Eq. 23. The results are shown in Fig. 11. The iterative method produces an estimate far closer to the true value of μ than the former method. A similar approach can be used to compare methods for alternative functional forms of the viral protein transcription dynamics described in Eq. 24.
Conclusions
In this work, we derived probability models that quantify the viral infectivity of host cells in an in vitro environment. By factoring in the stochastic nature of virus-host engagement, defective and/or abortive events, and the possibility of multiple infections of a single host, we defined the SMOI and determined related probabilistic models. We analyzed two limiting regimes: small numbers of infecting viruses N and large N. For the low N regime, Eqs. 2 and 5 model how the limited number of IUs are distributed among the M host cells. Alternatively, for large N, we showed that the cell counts of the SMOI become statistically independent, as displayed in Eq. 8, and that they display a Poisson distribution (Eq. 6). Lastly, we explored the effects of serial dilution on the total number of infected cells and the probability of observing an infectious signal in Eq. 9.
Using our probability models along with reasonable assumptions of applied combinatorics and nonlinear inference, we analytically derived expressions for several virus assays to improve on existing methods of experimental data analysis. For virus quantification assays, serial dilution results in low numbers of viral particles. Using the appropriate probability model, we created to our knowledge new methods of estimating the particle count N0 in the initial viral stock for the plaque assay and the endpoint dilution assay. For measuring infectivity of a viral strain, the objective is to determine the effective multiplicity of infection μ = N/M as the ratio of successfully infecting viruses N and the total number of cells M included in the assay. As these assays operate under no dilution, we employed the large N limit probability model to analytically derive expressions for the luciferase reporter assay to estimate μ. A summary of each estimation method along with the most commonly used counterpart is displayed in Table 1.
Table 1.
Comparison of Virological Assay Analyses | ||
---|---|---|
Assay (parameter) | Standard Method | New Method |
Plaque (N0) | ||
Initial guess: | ||
Endpoint dilution (N0) | RM: | |
SK: | Initial guess: | |
Luciferase reporter | ||
Initial guess: |
For virus quantification assays, such as the plaque and endpoint dilution assays, one typically wishes to estimate the number of initial viral particles N0. For luciferase reporter infectivity assay, the ratio μ = N/M is desired. Our improved parameter estimation methods are listed next to standard methods currently used.
VQAs are primarily concerned with inferring N0 and assume a priori knowledge of M and the particle to PFU ratio Q. In actuality, there can be variability in the number of cells present in the microtiter well, and, as discussed in Probabilistic Models of Statistical Multiplicity of Infection, the true value of Q is dependent on the particular protocol and particular conditions under which an assay was performed. If an alternative assay (RNA tagging, spectroscopy, super-resolution imaging, etc.) not using cell infection can accurately measure N0, then, in theory, a subsequent infection assay can be used to infer a more reliable measure of Q. In fact, in our analysis of the plaque assay presented in Appendix E of the Supporting Materials and Methods, we show that one can determine a significantly higher amount of information about Q with the same assay protocol if N0 is a priori known rather than the reverse case. Thus, one may argue that assays that employ serial dilution, such as plaque and endpoint dilution assays, may be better utilized to infer Q. Because the underlying likelihood of the data in all assays would be the same, the same derivation techniques would follow with respect to Q to formulate its MLE. This analysis shows the robust utility of a full probabilistic model and data likelihood function.
Although the derived assay models provide explicit equations for inference, many of the expressions are analytically unsolvable and require numerical solutions. To improve the accessibility of some of our results, we have created a web-based tool (available at https://bamistry.github.io/SMOI/) that can accept data from plaque, endpoint dilution, or luciferase reporter assays and automatically estimate the parameter of interest. Ultimately, these tools can be used for analysis of future virological studies but may also be useful when revisiting the results of studies that stress quantifying viral infectivity (15, 41). For studies that use serial dilution assays, our approach stresses the advantages of using information in the data associated with all dilution numbers rather than just that of the critical dilution.
Our probabilistic models of viral infection can be further generalized to include, for example, the effects of cell size inhomogeneity, coinfection, and viral interference. In the Supporting Materials and Methods, we provide a framework that would allow one to explore how these confounding factors can further alter the signal of a virus assay. Future refinement of these extensions can help to ultimately derive a mechanistic model for the probability of a single virus successfully infecting a host cell, which we defined as Q−1. Understanding this probability of infection can help aid further experimental design and allow better quantification and resolution of the infection dynamics of particular viral strains.
Author Contributions
B.A.M. derived mathematical formulae, developed statistical inference framework, performed simulations, generated plots, and wrote the initial draft. B.A.M. (bamistry@ucla.edu) also developed the web-based analysis tool at https://bamistry.github.io/SMOI/. M.R.D. and T.C. verified the mathematical results, contributed to their analyses, and edited the manuscript. T.C. conceptualized, designed, and supervised the research.
Acknowledgments
We are especially grateful to Dr. Nicholas Webb, Prof. Benhur Lee, and Prof. Jerome Zack for insightful discussions.
This work was supported in part by grants from the National Science Foundation (DMS-1516675) and the Army Research Office (W911NF-14-1-0472).
Editor: Ruth Baker.
Footnotes
Supporting Materials and Methods are available at http://www.biophysj.org/biophysj/supplemental/S0006-3495(18)30575-7.
Supporting Material
References
- 1.Pegu A., Yang Z.Y., Nabel G.J. Neutralizing antibodies to HIV-1 envelope protect more effectively in vivo than those to the CD4 receptor. Sci. Transl. Med. 2014;6:243ra88. doi: 10.1126/scitranslmed.3008992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Osbourn J.K., Earnshaw J.C., McCafferty J. Directed selection of MIP-1 α neutralizing CCR5 antibodies from a phage display human antibody library. Nat. Biotechnol. 1998;16:778–781. doi: 10.1038/nbt0898-778. [DOI] [PubMed] [Google Scholar]
- 3.Qiu S., Yi H., Li W. The binding mode of fusion inhibitor T20 onto HIV-1 gp41 and relevant T20-resistant mechanisms explored by computational study. Curr. HIV Res. 2012;10:182–194. doi: 10.2174/157016212799937191. [DOI] [PubMed] [Google Scholar]
- 4.Platt E.J., Gomes M.M., Kabat D. Reversible and efficient activation of HIV-1 cell entry by a tyrosine-sulfated peptide dissects endocytic entry and inhibitor mechanisms. J. Virol. 2014;88:4304–4318. doi: 10.1128/JVI.03447-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Platt E.J., Durnin J.P., Kabat D. Kinetic factors control efficiencies of cell entry, efficacies of entry inhibitors, and mechanisms of adaptation of human immunodeficiency virus. J. Virol. 2005;79:4347–4356. doi: 10.1128/JVI.79.7.4347-4356.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Fätkenheuer G., Pozniak A.L., van der Ryst E. Efficacy of short-term monotherapy with maraviroc, a new CCR5 antagonist, in patients infected with HIV-1. Nat. Med. 2005;11:1170–1172. doi: 10.1038/nm1319. [DOI] [PubMed] [Google Scholar]
- 7.Jonckheere H., Anné J., De Clercq E. The HIV-1 reverse transcription (RT) process as target for RT inhibitors. Med. Res. Rev. 2000;20:129–154. doi: 10.1002/(sici)1098-1128(200003)20:2<129::aid-med2>3.0.co;2-a. [DOI] [PubMed] [Google Scholar]
- 8.Thierry S., Munir S., Delelis O. Integrase inhibitor reversal dynamics indicate unintegrated HIV-1 dna initiate de novo integration. Retrovirology. 2015;12:24. doi: 10.1186/s12977-015-0153-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Paterson D.L., Swindells S., Singh N. Adherence to protease inhibitor therapy and outcomes in patients with HIV infection. Ann. Intern. Med. 2000;133:21–30. doi: 10.7326/0003-4819-133-1-200007040-00004. [DOI] [PubMed] [Google Scholar]
- 10.Wilen C.B., Tilton J.C., Doms R.W. HIV: cell binding and entry. Cold Spring Harb. Perspect. Med. 2012;2(8):a006866. doi: 10.1101/cshperspect.a006866. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Qian K., Morris-Natschke S.L., Lee K.H. HIV entry inhibitors and their potential in HIV therapy. Med. Res. Rev. 2009;29:369–393. doi: 10.1002/med.20138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Boulant S., Stanifer M., Lozach P.Y. Dynamics of virus-receptor interactions in virus binding, signaling, and endocytosis. Viruses. 2015;7:2794–2815. doi: 10.3390/v7062747. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chou T. Stochastic entry of enveloped viruses: fusion versus endocytosis. Biophys. J. 2007;93:1116–1123. doi: 10.1529/biophysj.107.106708. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Chikere K., Chou T., Lee B. Affinofile profiling: how efficiency of CD4/CCR5 usage impacts the biological and pathogenic phenotype of HIV. Virology. 2013;435:81–91. doi: 10.1016/j.virol.2012.09.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Johnston S.H., Lobritz M.A., Lee B. A quantitative affinity-profiling system that reveals distinct CD4/CCR5 usage patterns among human immunodeficiency virus type 1 and simian immunodeficiency virus strains. J. Virol. 2009;83:11016–11026. doi: 10.1128/JVI.01242-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Webb N.E., Lee B. Quantifying CD4/CCR5 usage efficiency of HIV-1 Env using the affinofile system. Methods Mol. Biol. 2016;1354:3–20. doi: 10.1007/978-1-4939-3046-3_1. [DOI] [PubMed] [Google Scholar]
- 17.Killian M.L. Hemagglutination assay for the avian influenza virus. Methods Mol. Biol. 2008;436:47–52. doi: 10.1007/978-1-59745-279-3_7. [DOI] [PubMed] [Google Scholar]
- 18.Mascola J.R., Louder M.K., Roederer M. Human immunodeficiency virus type 1 neutralization measured by flow cytometric quantitation of single-round infection of primary human T cells. J. Virol. 2002;76:4810–4821. doi: 10.1128/JVI.76.10.4810-4821.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Brown C.M., Bidle K.D. Attenuation of virus production at high multiplicities of infection in Aureococcus anophagefferens. Virology. 2014;466-467:71–81. doi: 10.1016/j.virol.2014.07.023. [DOI] [PubMed] [Google Scholar]
- 20.Sette A., Fikes J. Epitope-based vaccines: an update on epitope identification, vaccine design and delivery. Curr. Opin. Immunol. 2003;15:461–470. doi: 10.1016/s0952-7915(03)00083-9. [DOI] [PubMed] [Google Scholar]
- 21.Gerdil C. The annual production cycle for influenza vaccine. Vaccine. 2003;21:1776–1779. doi: 10.1016/s0264-410x(03)00071-9. [DOI] [PubMed] [Google Scholar]
- 22.Tree J.A., Richardson C., Looby D. Comparison of large-scale mammalian cell culture systems with egg culture for the production of influenza virus A vaccine strains. Vaccine. 2001;19:3444–3450. doi: 10.1016/s0264-410x(01)00053-6. [DOI] [PubMed] [Google Scholar]
- 23.Neumann G., Fujii K., Kawaoka Y. An improved reverse genetics system for influenza A virus generation and its implications for vaccine production. Proc. Natl. Acad. Sci. USA. 2005;102:16825–16829. doi: 10.1073/pnas.0505587102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kropinski A.M., Mazzocco A., Johnson R.P. Enumeration of bacteriophages by double agar overlay plaque assay. Methods Mol. Biol. 2009;501:69–76. doi: 10.1007/978-1-60327-164-6_7. [DOI] [PubMed] [Google Scholar]
- 25.Johnson V.A., Byington R.E., Nara P.L. Palgrave Macmillan; Basingstoke, UK: 1990. Quantitative Assays for Virus Infectivity; pp. 71–86. [Google Scholar]
- 26.Agrawal-Gamse C., Lee F.H., Laakso M.M. Adaptive mutations in a human immunodeficiency virus type 1 envelope protein with a truncated V3 loop restore function by improving interactions with CD4. J. Virol. 2009;83:11005–11015. doi: 10.1128/JVI.01238-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Reed L.J., Muench H. A simple method of estimating fifty percent endpoints. Am. J. Hyg. 1938;27:493–497. [Google Scholar]
- 28.Hamilton M.A., Russo R.C., Thurston R.V. Trimmed Spearman-Karber method for estimating median lethal concentrations in toxicity bioassays. Environ. Sci. Technol. 1977;11:714–719. [Google Scholar]
- 29.Schwerdt C.E., Fogh J. The ratio of physical particles per infectious unit observed for poliomyelitis viruses. Virology. 1957;4:41–52. doi: 10.1016/0042-6822(57)90042-9. [DOI] [PubMed] [Google Scholar]
- 30.Klasse P.J. Molecular determinants of the ratio of inert to infectious virus particles. Prog. Mol. Biol. Transl. Sci. 2015;129:285–326. doi: 10.1016/bs.pmbts.2014.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Layne S.P., Merges M.J., Nara P.L. Factors underlying spontaneous inactivation and susceptibility to neutralization of human immunodeficiency virus. Virology. 1992;189:695–714. doi: 10.1016/0042-6822(92)90593-e. [DOI] [PubMed] [Google Scholar]
- 32.Turner T.E., Schnell S., Burrage K. Stochastic approaches for modelling in vivo reactions. Comput. Biol. Chem. 2004;28:165–178. doi: 10.1016/j.compbiolchem.2004.05.001. [DOI] [PubMed] [Google Scholar]
- 33.Roberts F.S., Tesman B. Pearson Prentice Hall; Upper Saddle River, NJ: 2005. Applied Combinatorics. [Google Scholar]
- 34.Gilchrist M.A., Coombs D., Perelson A.S. Optimizing within-host viral fitness: infected cell lifespan and virion production rate. J. Theor. Biol. 2004;229:281–288. doi: 10.1016/j.jtbi.2004.04.015. [DOI] [PubMed] [Google Scholar]
- 35.Sloutskin A., Goldsten R.S. Infectious focus assays and multiplicity of infection (MOI) calculations for alpha-herpesviruses. Bio Protoc. 2014;4:e1295. [Google Scholar]
- 36.Lange K. Springer Science+Business Media; New York, NY: 2013. Optimization. [Google Scholar]
- 37.Ramakrishnan M.A. Determination of 50% endpoint titer using a simple formula. World J. Virol. 2016;5:85–86. doi: 10.5501/wjv.v5.i2.85. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Montefiori D.C. Measuring HIV Neutralization in a luciferase reporter gene assay. Methods Mol. Biol. 2009;485:395–405. doi: 10.1007/978-1-59745-170-3_26. [DOI] [PubMed] [Google Scholar]
- 39.Weiss J.N. The Hill equation revisited: uses and misuses. FASEB J. 1997;11:835–841. [PubMed] [Google Scholar]
- 40.Lange K. Springer Science+Business Media; New York, NY: 2003. Applied Probability. [Google Scholar]
- 41.Mistry B., D’Orsogna M.R., Chou T. Quantifying the sensitivity of HIV-1 viral entry to receptor and coreceptor expression. J. Phys. Chem. B. 2016;120:6189–6199. doi: 10.1021/acs.jpcb.6b02102. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.