Time to revisit the endpoint dilution assay and to replace the TCID50 as a measure of a virus sample’s infection concentration

Daniel Cresta; Donald C Warren; Christian Quirouette; Amanda P Smith; Lindey C Lane; Amber M Smith; Catherine A A Beauchemin

doi:10.1371/journal.pcbi.1009480

. 2021 Oct 18;17(10):e1009480. doi: 10.1371/journal.pcbi.1009480

Time to revisit the endpoint dilution assay and to replace the TCID₅₀ as a measure of a virus sample’s infection concentration

Daniel Cresta ^1,^¤,^#, Donald C Warren ^2,^#, Christian Quirouette ¹, Amanda P Smith ³, Lindey C Lane ³, Amber M Smith ³, Catherine A A Beauchemin ^1,^2,^*

Editor: Roland R Regoes⁴

PMCID: PMC8553042 PMID: 34662338

Abstract

The endpoint dilution assay’s output, the 50% infectious dose (ID₅₀), is calculated using the Reed-Muench or Spearman-Kärber mathematical approximations, which are biased and often miscalculated. We introduce a replacement for the ID₅₀ that we call Specific INfection (SIN) along with a free and open-source web-application, midSIN (https://midsin.physics.ryerson.ca) to calculate it. midSIN computes a virus sample’s SIN concentration using Bayesian inference based on the results of a standard endpoint dilution assay, and requires no changes to current experimental protocols. We analyzed influenza and respiratory syncytial virus samples using midSIN and demonstrated that the SIN/mL reliably corresponds to the number of infections a sample will cause per mL. It can therefore be used directly to achieve a desired multiplicity of infection, similarly to how plaque or focus forming units (PFU, FFU) are used. midSIN’s estimates are shown to be more accurate and robust than the Reed-Muench and Spearman-Kärber approximations. The impact of endpoint dilution plate design choices (dilution factor, replicates per dilution) on measurement accuracy is also explored. The simplicity of SIN as a measure and the greater accuracy provided by midSIN make them an easy and superior replacement for the TCID₅₀ and other in vitro culture ID₅₀ measures. We hope to see their universal adoption to measure the infectivity of virus samples.

Author summary

The infectivity of a virus sample is measured by the infections it causes. One approach, the endpoint dilution assay, aims to estimate the number of TCID₅₀ contained in a sample, where one TCID₅₀ is the dose at which a virus sample is expected to infect a tissue or cell culture 50% of the time, on average. Unfortunately, the commonly used methods to estimate the TCID₅₀ from the assay’s outcome yield biased approximations that relate poorly to the number of infections the sample will cause. We propose replacing the TCID₅₀ with a more accurate, robust, and biologically meaningful measurement unit we call Specific INfection (SIN). It corresponds to the number of infections the virus sample will cause, which can be used directly to achieve the desired multiplicity of infection. Computing the SIN from one’s endpoint dilution assay outcome requires no change in experimental procedure, and can be done conveniently via a web-application we developed, called midSIN. midSIN can be accessed for free on any device (laptop, cellular phone, tablet) from any web browser, without the need to download and install software.

Introduction

The progression of a virus infection in vivo or in vitro, or the effectiveness of therapeutic interventions in reducing viral loads, are monitored over time through sample collections to measure changes (increases or decreases) in virus concentrations. As such, accurate measurement of the concentration in a sample is critical to study and manage virus infections.

Methods to count infectious virus are based on counting the infections they cause, rather than the particles themselves. In practice, however, not all infection-competent virions contained in a sample will go on to successfully cause infection. Experimental conditions, cell type used or temperature or acidity of the medium, can alter the rate at which virions, that were infection-competent in the sample, will lose infectivity before they can cause infection and thus be counted. This is why, hereafter, we will refer to the quantity measured by infectivity assays as the infection concentration or the number of infections the sample will cause per unit volume, rather than its concentration of infectious virions, which is not a measurable quantity. Two main types of assays are used to quantify the infection concentration within a virus sample: (1) the plaque forming (PFU) or focus forming (FFU) assays; and (2) assays we will collectively refer to as endpoint dilution (ED) assays, which include the 50% tissue culture infectious dose (TCID₅₀), or cell culture infectious dose (CCID₅₀) or egg infectious dose (EID₅₀) assays, etc. Herein, we focus on ED assays. Technically, the plaque and focus forming assays are also endpoint dilution assays because they rely on the counting of plaques or foci (the endpoint) as a function of dilutions. However, herein, we will refer to them as plaque or foci forming assays rather than endpoint dilution assays.

The ED assay has one major, remediable weakness: its output quantity, the TCID₅₀ (or CCID₅₀ or EID₅₀), does not directly correspond, or trivially relate, to causing one infected cell. The simplistic calculations, introduced by Spearman-Kärber (SK) [1, 2] and Reed-Muench (RM) [3] nearly a century ago, remain the most commonly used methods to quantify a virus sample’s infectivity in units of TCID₅₀ (or CCID₅₀ or EID₅₀) using the ED assay. Many research groups rely on spreadsheet calculators that are passed down through generations of trainees or found on the internet, and can contain errors (e.g., versions 2 and 3 of the spreadsheet calculator provided by the Lindenbach Lab at Yale University (http://lindenbachlab.org/resources.html), which have since been removed). Theoretically, a dose of 1 TCID₅₀ is expected to cause −1/ln(50%) = 1.44 infections [4]. However, the approximation used by the RM and SK methods introduces an often overlooked bias where 1 TCID₅₀ ≈ 1.781 infections where 1.781 = e^γ and γ = 0.5772 is the Euler-Mascheroni constant [5, 6]. This makes it problematic to experimentally achieve the desired multiplicity of infection when inoculating from a sample quantified via the RM or SK methods. Many have proposed replacements for the RM and SK calculations based on logit or probit transforms of the data [4, 6, 7] or on statistical analysis of the ED assay output [7, 8] with some implemented as website applications [9, 10]. Sadly, none of these improvements were widely adopted to improve estimates of the TCID₅₀, possibly due to a lack of visibility of these publications, or the lack of widespread awareness of the limitations of the RM and SK methods. None proposed replacing the TCID₅₀ measurement unit, with a more meaningful measure.

The work herein proposes to:

Encourage the use of the ED assay (e.g., TCID₅₀ assay), but replace its output, the TCID₅₀/mL (or CCID₅₀/mL, EID₅₀/mL, etc.), with a new quantity in units of Specific INfections or SIN/mL which corresponds to the number of infections the sample will cause per mL. The word specific highlights the fact that the infectivity of a sample is specific to the particulars of the experimental conditions (temperature, medium, cell type, incubation time, etc.).
Replace the Reed-Muench and Spearman-Kärber approximations with a computer software, midSIN (measure of infectious dose in SIN), that relies on Bayesian inference to measure the SIN/mL of a virus sample. To avoid calculation errors and make the new method widely accessible, midSIN is maintained and distributed as free, open-source software on GitHub (https://github.com/cbeauc/midSIN) for user installation, but also via a free-to-use website application (https://midsin.physics.ryerson.ca) with an intuitive user interface.

Here, we present examples of midSIN being used to analyze influenza and respiratory syncytial virus samples. We demonstrate that midSIN’s output, SIN/mL, is an accurate estimate of the number of infections the sample will cause per unit volume. We show how the accuracy of the SIN concentration estimate can be controlled by experimental choice of plate layout, including the dilution factor, and the number of replicates per dilution. We compare midSIN’s performance to that of the RM and SK methods, and demonstrate how the latter estimators are inaccurate under various circumstances, underlining the need to adopt midSIN to quantify virus samples via the ED assay.

Results

Key features of midSIN’s output

Let us consider a fictitious ED experiment, with 11 dilutions and 8 replicate wells per dilutions, in which the minimum sample dilution, $D_{1} = 1 / 100 = 10^{- 2}$ , is serially diluted by a factor of 10^−0.5 ≈ 0.32 ( $D_{2} = 10^{- 2.5}$ , $D_{3} = 10^{- 3}$ , …, $D_{11} = 10^{- 7}$ ), and the total volume of inoculum (diluted virus sample + dilutant) placed in each well is V_inoc = 0.1 mL. Now, consider that a virus sample is measured using this ED experiment and one observes (8,8,8,8,8,7,7,5,2,0,0) infected wells out of 8 replicates at each of the 11 dilutions, as illustrated in Fig 1A.

midSIN provides a graphical output of its results, shown in Fig 1B and 1C for this example. Note how the posterior distribution for log₁₀(SIN/mL) (Fig 1B) is approximately a normal distribution. This is why log₁₀ of the infection concentration should be used and reported, rather than the concentration itself. midSIN also graphically compares the number of infected wells observed experimentally (Fig 1C, black dots) against the theoretically expected values (blue curve and grey CI bands). This graphical representation makes it easy to identify issues with the data entered or with the experiment itself.

Importantly, midSIN provides a more useful quantity to the user than the TCID₅₀: an estimate of the concentration of infections the sample will cause, SIN/mL. For this example, the concentration is 10^6.2±0.1 SIN/mL, where 6.2 is the mode (most likely value) of log₁₀(SIN/mL), and ±0.1 is its 68% credible interval (CI). The SIN/mL corresponds to the number of infections that will be caused per mL of the sample, which can be directly used to determine the sample dilution required to obtain a desired multiplicity of infection (MOI).

In a laboratory setting, ED experiments can be performed in batches, such as to quantify the infectious concentration in samples collected at several time points over the course of a cell culture infection. For such applications, midSIN provides a comma separated value (csv) template file readily editable in a spreadsheet program, to collect and submit the results for batch processing. Details on the format of the template file are available on midSIN’s website (https://midsin.physics.ryerson.ca). Fig 2 illustrates the output for a subset of measurements for in vitro infection with the respiratory syncytial virus (RSV). Each sample was measured twice, and midSIN’s estimates are in good agreement with one another (within 95% CI).

Fig 2 — Each row corresponds to a different experiment (mock-yield [my] or single-cycle [sc]) and sampling time point (e.g., 8 h, 36 h), and each sample was measured in duplicate (rep1, rep2). These data were collected from *in vitro* infections with the RSV A Long strain, and were previously reported in [11]. The ED measurement experiment were conducted using a plate layout of 11 dilutions, with 8 replicates per dilution, an inoculum volume of V_inoc = 0.1 mL, serial dilutions from $D_{1} = 10^{- 1}$ to $D_{11} = 10^{- 6}$ , separated by a dilution factor of 10^−0.5.

The y-axis in the left graph panels of midSIN’s graphical output is the non-normalized scale of the posterior distribution for log₁₀(SIN/mL), which ranges between 10⁻⁷ and 10⁻². The scale loosely relates to the likelihood of observing a particular ED experimental outcome (see Methods). Unlikely ED outcomes appear as large departures of the observed number of infected wells (right panels, black dots) from what is theoretically expected (right panels, curve). It is interesting that the uncertainty (CI) of midSIN’s estimated log₁₀(SIN/mL) appears to be independent of how much the ED outcome deviates from theoretical expectations. That is, the accuracy of midSIN is not strongly affected even when it is provided more unlikely, noisy experimental data. This robustness is explored further below.

Comparing SIN to TCID₅₀ and PFU virus sample concentrations

The midSIN calculator provides an estimate of the number of infections that will be caused per mL of a virus sample (SIN/mL). In principle, a plaque assay also measures the number of infections a sample will cause, with each infection expected to develop into a plaque. If a plaque assay is performed under experimental conditions and protocols as similar as possible to those of the ED assay (i.e., using the same cells, medium, period of incubation, rinsing method, etc.), midSIN’s SIN/mL estimate is expected to be comparable, in theory, to the number of PFU/mL observed in the plaque assay. In practice, however, the plaque assay likely provides a biased estimate of the true concentration of infections in a sample due to various experimental limitations (e.g., distinguishing between two merged plaque and a larger one, or between small plaques and staining artifacts). To evaluate midSIN’s performance compared to existing methods, the infection concentration in two influenza A (H1N1) virus strain samples were measured via both plaque and ED assays, and their concentration in units of PFU, TCID₅₀, and SIN were compared (Fig 3). Details regarding the samples, and how the plaque and ED assays were performed are provided in Methods.

The TCID₅₀ concentrations estimated via the RM and SK methods are ∼1.5–1.7 times larger (Fig 3C and 3D) than the SIN concentration, and the set of ratios are statistically inconsistent with the assumption of equality (p-value: 0.01–0.03). Theoretically, 1 TCID₅₀ is expected to cause 1.44 infections (= 1/ln(2)) [4]. However, the RM or SK approximations are known to introduce a bias such that 1 TCID₅₀ estimated by these methods is expected to cause 1.781 infections (= e^γ where γ = 0.5772 is the Euler-Mascheroni constant) [5, 6]. Using the RM, SK, and SIN measurements presented in Fig 3A and 3B, we confirmed (the mean log₁₀(ratio) was re-computed for ratio = (RM/1.781)/SIN and (SK/1.781)/SIN, and found to be 0.85–0.93, which is statistically consistent (p-value: 0.1–0.3) with the assumption of equality, i.e., ratio = 10⁰ = 1.) that 1.781 SIN ≈ 1 TCID₅₀ when the latter is estimated via the RM or SK approximations, as expected theoretically if SIN is indeed measuring the infection concentration in a sample.

Similarly, the ratio of the PFU concentration determined via the plaque assay and the SIN concentrations estimated by midSIN is ∼0.89–0.93, which is statistically consistent with the assumption of equality (p-value: 0.2–0.5). These results confirm the theoretical expectation that 1 PFU ≈ 1 SIN when the plaque and ED assays are performed in the same manner, as was the case here. This provides further support, via two independent assays, that the SIN concentration estimated by midSIN from the ED assay is a robust measure of the infection concentration of a virus sample.

Comparing midSIN’s performance to that of the RM and SK methods

The RM and SK methods rely on the number of infected wells decreasing as dilution increases. Their estimates are affected when the number of infected wells remains unchanged or even increases as dilution increases, which statistics and experimental data herein (Fig 2) tell us can reasonably occur experimentally. The RM and SK methods also mostly require that at the lowest and highest sample dilutions, all wells be infected and uninfected, respectively. Fig 4 provides a graphical representation of how the RM and SK methods estimate the TCID₅₀ concentration from an ED assay. Simply stated, the RM and SK methods use geometric arguments to estimate the sample dilution at which 50% of wells would be infected. While they are sometimes accurate (Fig 4A and 4B), their simplicity often leads to biased estimates (Fig 4C and 4D).

Fig 4 — A,C: The RM method first smooths the data by taking the cumulative sum of the number of infected wells from the highest to the lowest dilution, and that of uninfected wells from the lowest to the highest dilution (grey dashed curve). It then identifies the dilution (vertical solid orange line) corresponding to the smooth curve’s 50% crossing point (4/8 wells, horizontal grey line) based on the highest dilution with > 50% wells infected, and the lowest dilution with < 50% wells infected. B,D: The SK method identifies the dilution (vertical dashed green line) such that the area under the curve to its right (pale red) would exactly fill the area over the curve to its left (pale blue). The agreement between the true TCID₅₀ (blue plus) and the RM and SK estimates is good for the symmetric ED plate outcome in (A,B), but poor for the more irregular outcome in (C,D).

In contrast, midSIN is robust to these issues. Fig 5 demonstrates how midSIN can provide an estimate for the log₁₀(SIN/mL) in a sample using the number of infected wells at a single dilution, as long as at least one well is uninfected if all others are infected or vice-versa. This is because midSIN relies on Bayesian inference, i.e., when more than one column is available, it uses information from each column successively to revise and improve its estimate. This allows midSIN to correct for even large deviations from theoretical expectations, and thus improves its accuracy.

Fig 6 illustrates how well the midSIN, RM, and SK methods recover a known input sample concentration in simulated ED experiments, based on a plate layout consisting of 11 dilutions ( $D_{1} = 10^{- 2}$ to $D_{11} = 10^{- 8}$ ), a dilution factor of 1/4, and 8 replicates per dilutions. The infection concentration estimated by midSIN is in excellent agreement with the input concentration. For the RM and SK methods, which estimate the log₁₀(TCID₅₀/mL) rather than the log₁₀(SIN/mL), the agreement is generally poor due to the bias they introduce. Furthermore, the RM and SK predictions are more variable (wavy pattern), and lose accuracy dramatically as the sample concentration approaches the limits of detection (the 2 ends) which, for the example plate layout simulated here, is around 10³ SIN/mL and 10⁹ SIN/mL. Interestingly, the basic calculations behind the RM and SK methods constrain the set of values they can return (sparsely populated grey histograms), compared to the more continuous range returned by midSIN, which contributes to its increased accuracy.

Estimate accuracy as a function of plate layout

In Fig 2, we observed that even for large discrepancies between the expected (right panels, blue curve) and observed (right panels, black dots) ED assay outcome, the uncertainty (CI) of midSIN’s estimate remains relatively unchanged. This apparent robustness is because the uncertainty is primarily determined by the experimental design, namely the change in dilution between columns (dilution factor) and the number of replicate wells per dilution. Fig 7 explores the impact of varying either only the dilution factor, or only the number of replicates at each dilution, or varying one at the expense of the other by using a fixed number of wells (96 wells). When using midSIN, smaller changes in dilution (e.g., going from a dilution factor of 2.2/100 to 61/100) or more replicates per dilution (4 to 24) each improves the measure’s accuracy (narrower CIs) by comparable amounts, but only when the total number of wells is allowed to increase to accommodate the change. When the total number of wells used is fixed, changing one at the expense of the other leaves the accuracy (CI) unchanged. This is somewhat also true for the log₁₀(TCID₅₀) output concentration estimated by the RM and SK methods. However, at the smallest dilution factors (10/100 and 2.2/100), the bias introduced by the RM and SK methods becomes even larger and more unpredictable. For the input concentration considered in Fig 7 (10⁵ SIN/mL), the dilution at which 50% of wells are infected is near the middle dilution. For sample concentrations such that 50% infected wells occur near or at the lowest or highest dilution chosen, the effect is even more significant.

Fig 7 also demonstrates that varying the dilution by smaller increments (e.g., a dilution factor of 61/100 rather than 10/100) provides greater granularity (uniqueness) of ED plate outcomes, and thus, greater accuracy of the log₁₀ infection concentration estimates. Here, a distinct plate outcome means a distinct number of infected wells at each dilution, with no distinction as to exactly which of the replicate wells (e.g., the second versus the fourth) is infected at each dilution. An ED plate with serial dilutions ranging over 6 orders of magnitude (e.g., 10⁻² to 10⁻⁷), with 4 different dilutions and 24 replicates/dilution (i.e., dilution factor of 2.2/100) provides ∼10⁶ ([24 + 1]⁴) possible, distinct ED plate outcomes (Fig 7C, 7F and 7I, leftmost histogram). In contrast, a plate with the same serial dilution range, but with 24 different dilutions and 4 replicates/dilution (i.e., dilution factor of 61/100) yields ∼10¹⁷ ([4 + 1]²⁴) distinct outcomes (Fig 7C, 7F and 7I, rightmost histogram). More generally, [reps + 1]^dils is the number of distinct plate outcomes for a chosen number of dilutions (dils) and replicates (reps). Having fewer possible plate outcomes means that a larger range of concentrations would share the same most-likely ED plate outcome, yet each plate outcome only maps to one (the most likely) concentration estimate. This means that with fewer dilutions, the concentration estimate is forced to take on the nearest possible value it can take (Fig 7, the next closest grey band in the stack), and the accuracy of the concentration estimate is therefore reduced. So although having a greater number of dilutions is more labour intensive, it should be preferred over having a greater number of replicates per dilution.

Discussion

We have introduced a new calculator tool called midSIN to replace the Reed-Muench (RM) and Spearman-Kärber (SK) calculations to quantify the infectivity of a virus sample based on an endpoint dilution (ED) assay. Rather than estimating the TCID₅₀ of a virus sample, midSIN calculates the number of infections the sample will cause, reported in units of specific infections (SIN). It does so without requiring any changes to current ED assay protocols, and can be accessed for free via an open-source web-application (https://midsin.physics.ryerson.ca). Importantly, because the SIN of a virus sample corresponds to the number of infections it will cause, it can be used directly to determine what dilution of the sample will achieve the desired multiplicity of infection (MOI).

Using a combination of in vitro and simulated experimental data, we demonstrated that midSIN provides more accurate and robust estimates than the biased RM and SK approximations. We confirmed that the RM and SK approximations overestimate the TCID₅₀ by 23.5%, such that 1 TCID₅₀ estimated by these methods will cause 1.781 rather than 1.44 infections [5, 6]. While, in theory, the intended MOI can be obtained by multiplying the TCID₅₀ by 0.7 (or rather ln(2) = 0.693), one should instead multiply by 0.561 to account for the overestimation by RM and SK. Even when accounting for the overestimation, we showed that these methods perform particularly poorly when too few replicate wells per dilutions are used or when the change in dilution is large between successive serial dilutions. The two methods perform especially poorly when quantifying samples whose infection concentration approaches, but is still well within, the detection limit of the ED assay. In such cases, the bias introduced by these methods becomes even larger and more significant. For example, if the minimum and maximum dilutions of an ED plate are 10⁻² and 10⁻⁸, virus samples with a concentration less than 10^2.2 SIN or greater than 10^7.6 SIN per inoculated well volume (typically 0.1 mL), will see their concentration estimated with an even larger bias by the RM and SK methods.

Using midSIN to measure the infectivity of a virus sample based on an ED assay does not require any change to ED experimental protocols and methods currently in use in one’s laboratory (e.g., dilution factor, replicate per dilution, minimum dilution). Indeed, we demonstrated that midSIN can estimate a virus sample’s SIN concentration based on even just a single dilution, as long as replicate wells at that dilution are not all infected or all uninfected. For a given number of ED wells used to titrate the sample and fixed minimum and maximum dilutions (ED detection range), we showed that having smaller changes between dilutions should be favoured over more replicates at each dilution. For example, using 11 dilutions, with a 4-fold dilution factor between dilutions and 8 replicate wells per dilution uses up 88 wells, leaving 8 wells of a 96-well plate for controls. This ED plate design, analyzed using midSIN, accurately measures virus sample concentrations ranging over ∼6 orders of magnitude (e.g., [10¹–10⁷] SIN/mL, or [10⁶–10¹²] SIN/mL, etc.) with an accuracy of ∼1.6-fold (×10^±0.2, 95% CI). In comparison, using 7 dilutions, with a 10-fold dilution factor, and 4 replicates (which uses 28 rather than 88 wells) would also span 6 orders of magnitude, but with an accuracy of ∼3.2-fold (×10^±0.5, 95% CI). To put these 2 accuracies in perspective: 1 mL of a sample measured to contain 10 SIN/mL, is expected to yield either 6–16 or 3–31 infections 95% of the time, given an accuracy of either ×10^±0.2 or ×10^±0.5 SIN/mL, respectively. Such an important decrease in accuracy means a reduced ability to detect experimental changes as statistically significant, with the ×10^±0.5 accuracy requiring a >10-fold change for statistical significance. Failing to identify a change as statistically significant as part of a study is far more costly than using more wells for each sample to increase measurement accuracy, and thus the statistical power of the study.

The midSIN-estimated SIN obtained from an ED assay was also compared to the PFU from a plaque assay for a set of influenza A virus samples. When the plaque and ED assays are performed as identically as possible (cell type, incubation time, etc.), as was the case here, 1 SIN ≈ 1 PFU. This demonstrates that indeed midSIN’s SIN is a measure of the number of infections a virus sample will cause. However, the plaque and focus forming assays have experimental limitations (time required, sensitivity of target cells to overlay, limited to viruses that cause CPE, subjectivity in counting plaques/foci, etc.) that cause many researchers to titrate virus using ED assays. Indeed midSIN’s SIN is a measure of the number of infections a virus sample will cause, and estimating the SIN concentration of a virus sample using data from ED assays is accessible, accurate, and predictive.

The work herein focused on the virus sample infectivity estimated from an unmodified ED assay. In principle, further improvements in accuracy could be achieved through the use of machine-automated scoring of infected wells using fluorescence intensity or colorimetry. Plate readers can be quite expensive, as are the consumable compounds they require, such as fluorescent antibodies, or antibodies loaded with compounds that can precipitate in the presence of another (colorimeter). In contrast, staining with crystal violet, trypan blue, etc. is an inexpensive and efficient way to identify the widespread cellular pathogenic effect of infection by a lytic virus, as are red blood cells to identify the presence of notable virus concentration in the supernatant of a well infected with a hemagglutination-capable virus. Since the aim of the ED assay is merely to establish whether or not infection occurred, the scoring of a well as having been infected or not, even when done visually, is likely less ambiguous. Therefore, in future work, it would be interesting to compare human- vs machine-scoring of wells to evaluate this step’s contribution to the accuracy of the measure obtained.

Beyond the work presented herein, the development of midSIN will continue online as we implement new features and inputs for integration with various colorimetric and fluorescence instruments. The ease of use of midSIN and the greater usefulness and relevance of SIN as a measure of a virus sample’s infectivity make them far superior to the TCID₅₀, and other ID₅₀ measures. We hope to see them adopted widely.

Methods

The mathematics of the dose-response assay

Considering a single well

Consider a virus sample of volume V_sample which contains an unknown concentration of infectious virions, C_inf, which we aim to determine. Drawing a small volume, V_inoc < V_sample, from the sample of volume V_sample, is analogous to drawing balls out of a bag containing green and yellow balls, and considering green balls a success, and yellow ones a failure. It is a series of Bernoulli trials where

n = V_inoc/V_vir is the number of draws, i.e., the number of virion-size volumes (V_vir) drawn from the sample to form the inoculum volume (V_inoc), analogous to the number of balls drawn.
k is the number of successes, i.e., the number of infectious virions drawn from the sample to form the inoculum, analogous to the number of green balls drawn.
p is the probability of success, i.e., the fraction of virion-size volumes in the sample that are occupied by infectious virions, analogous to the probability of drawing a green ball.

The probability of success, p, is related to the concentration of infectious virus in the sample, C_inf, as

p = \frac{Number of virions in sample}{Number of virion-size volumes in the sample} = \frac{C_{inf} V_{sample}}{V_{sample} / V_{vir}} = C_{inf} V_{vir},

where C_inf is the quantity we aim to estimate. Unlike the ball analogy where it is easy to count how many green balls k were drawn, after having drawn n virion-size volumes from the sample into our inoculum, we cannot count how many infectious virions were drawn into the inoculum. However, if this inoculum is deposited onto a susceptible cell culture, we can observe whether or not infection occurs, and this would indicate that the inoculum contained at least one or more infectious virions. Note that, as explained in the Introduction, even a productively infectious virion, i.e., one capable of completing the full virus replication from attachment to progeny release, might not result in a productive infection. As such, from hereon, C_inf is used to designate the concentration of specific infections in the sample, which is smaller or equal to the concentration of infectious virions, i.e., measures the subset of the infectious virions.

Having deposited the inoculum into one well of the 96-well plate of our ED experiment, the likelihood that the well will not become infected, q_noinf, corresponds to the likelihood of having drawn k = 0 infectious virions (or rather, specific infections) out of the n virion volumes that make up our inoculum, namely

\begin{matrix} q_{noinf} & = Binomial (k = 0 | n = V_{inoc} / V_{vir}, p = C_{inf} V_{vir}) \\ = \frac{n!}{0! (n - 0)!} p^{0} {(1 - p)}^{n - 0} = {(1 - p)}^{n} \\ q_{noinf} & = {(1 - C_{inf} V_{vir})}^{V_{inoc} / V_{vir}} \end{matrix}

(1)

where q_noinf can be simplified by realizing that

\begin{matrix} ln (1 - x) & \overset{| x | < 1}{=} - x - \frac{x^{2}}{2} - \frac{x^{3}}{3} - \dots \overset{| x | ≪ 1}{\approx} - x \\ ln (q_{noinf}) & = \frac{V_{inoc}}{V_{vir}} ln (1 - C_{inf} V_{vir}) \approx \frac{V_{inoc}}{V_{vir}} (- C_{inf} V_{vir}) = - C_{inf} V_{inoc} . \end{matrix}

As such,

\begin{matrix} q_{noinf} = {(1 - C_{inf} V_{vir})}^{V_{inoc} / V_{vir}} \approx exp [- C_{inf} V_{inoc}] \end{matrix}

(2)

where q_noinf and (C_inf V_vir) ∈ [0, 1] because C_inf = N_vir/V_sample and the number of specific infections in the sample, N_vir, is at a minimum zero, and at most the maximum number of virion-size volumes that can physically fit in the sample volume, namely V_sample/V_vir. As such, the maximum possible infection concentration, given a sample of volume V_sample, is C_inf = (V_sample/V_vir)/V_sample = 1/V_vir, and C_inf ∈ [0,1/V_vir].

Considering replicate wells at a given dilution

The ED assay is based on serial dilutions of the sample, with each dilution separated by a fixed dilution factor. We define the dilution factor ∈ (0,1) as the fraction of the inoculum volume drawn from the previous dilution. For example, if the inoculum for a well, V_inoc = 100 μL, comprises 10 μL drawn from the previous dilution and 90 μL of dilution media, the dilution factor is 10/100 = 0.1. If the serial dilution begins with a dilution of $D_{1} = 0.2$ , then the following dilution will be $D_{2} = 0.02$ . In Eq (1), the dilution under consideration, $D_{i}$ , will affect n, the number of virion-sized volumes drawn from the sample and deposited into the wells of the i^th dilution, such that now $n = D_{i} V_{inoc} / V_{vir}$ . Therefore, the probability that a well at the i^th dilution will not become infected is given by

\begin{matrix} q_{i} \equiv q_{noinf}^{D_{i}} = {(1 - C_{inf} V_{vir})}^{D_{i} V_{inoc} / V_{vir}} \approx exp [- C_{inf} V_{inoc} D_{i}] \end{matrix}

(3)

where 1 − q_i is the probability of infection for a well at the i^th dilution, where $D_{i} \in [0, 1]$ .

When conducting an ED assay, each dilution in the assay contains a number of independent infection wells (replicates), all inoculated with the same dilution, $D_{i}$ . This is analogous again to drawing balls out of a bag, but this time there are n_i draws (replicate wells), and the probability of success (i.e., that a well becomes infected) is simply one minus the probability of failure (i.e., that a well does not become infected, q_i). The probability that k_i out of the n_i wells become infected at dilution $D_{i}$ , is described by the Binomial distribution

Binomial (k = k_{i} | n = n_{i}, p = 1 - q_{i}) = \frac{n_{i}!}{k_{i}! (n_{i} - k_{i})!} {(1 - q_{i})}^{k_{i}} q_{i}^{n_{i} - k_{i}} \propto {(1 - q_{noinf}^{D_{i}})}^{k_{i}} q_{noinf}^{D_{i} (n_{i} - k_{i})}

where n_i is the number of replicate wells at each dilution, but could be less if any well at dilution $D_{i}$ are spoiled or contaminated.

However, our interest is not in determining k₁ given q_noinf, but rather in determining q_noinf given that we observed k₁ infected wells out of n₁ wells in the first column. To this aim, we can make use of Bayes’ theorem which, in our context, can be expressed as

P (p | data) = \frac{P (data | p) P (p)}{\int_{0}^{1} P (data | p) P (p) d p}

or rather

\begin{matrix} P_{post,1} (q_{noinf} | k_{1}) & = \frac{P (k_{1} | q_{noinf}) P_{prior} (q_{noinf})}{\int_{0}^{1} P (k_{1} | q_{noinf}) P_{prior} (q_{noinf}) d q_{noinf}} \\ = \frac{[{(1 - q_{noinf}^{D_{1}})}^{k_{1}} q_{noinf}^{D_{1} (n_{1} - k_{1})}] P_{prior} (q_{noinf})}{\int_{0}^{1} P (k_{1} | q_{noinf}) P (q_{noinf}) d q_{noinf}} \\ P_{post,1} (q_{noinf} | k_{1}) & \propto [{(1 - q_{noinf}^{D_{1}})}^{k_{1}} q_{noinf}^{D_{1} (n_{1} - k_{1})}] P_{prior} (q_{noinf}) \end{matrix}

where $P_{post,1} (q_{noinf} | k_{1})$ is our updated, posterior belief about q_noinf after having observed k₁ successes out of n₁ trials in the first column (i = 1), and given our prior belief, $P_{prior} (q_{noinf})$ , about q_noinf before making this observation.

Considering all dilutions of the ED assay

As mentioned above, in the 96-well ED assay, each dilution contains a number of independent infection wells (replicates) inoculated with the same sample concentration. This process is then repeated over a series of dilutions, each separated from the previous by a fixed dilution factor. Having observed the fraction of wells infected at the first dilution considered, $D_{1}$ , we have updated our posterior belief about q_noinf. We will now use this updated belief as our new prior as we observe our second dilution ( $D_{2}$ ), such that

\begin{matrix} P_{post,2} (q_{noinf} | {\vec{k}}_{2}) & \propto P (k_{2} | q_{noinf}) P_{post,1} (q_{noinf} | k_{1}) \\ P_{post,2} (q_{noinf} | {\vec{k}}_{2}) & \propto [{(1 - q_{noinf}^{D_{2}})}^{k_{2}} q_{noinf}^{D_{2} (n_{2} - k_{2})}] [{(1 - q_{noinf}^{D_{1}})}^{k_{1}} q_{noinf}^{D_{1} (n_{1} - k_{1})}] P_{prior} (q_{noinf}) \\ P_{post,2} (q_{noinf} | {\vec{k}}_{2}) & \propto Q ({\vec{k}}_{2} | q_{noinf}) P_{prior} (q_{noinf}), \end{matrix}

where we introduce ${\vec{k}}_{2} = {k_{1}, k_{2}}$ and

Q ({\vec{k}}_{2} | q_{noinf}) = [{(1 - q_{noinf}^{D_{2}})}^{k_{2}} q_{noinf}^{D_{2} (n_{2} - k_{2})}] [{(1 - q_{noinf}^{D_{1}})}^{k_{1}} q_{noinf}^{D_{1} (n_{1} - k_{1})}]

as short-hands for convenience. From this, it is easy to extrapolate the posterior distribution after having observed all J dilutions ( $D_{1}, D_{2}, \dots, D_{J}$ ) of the ED assay, namely

\begin{matrix} P_{post,J} (q_{noinf} | {\vec{k}}_{J}) \propto Q ({\vec{k}}_{J} | q_{noinf}) P_{prior} (q_{noinf}) \end{matrix}

(4)

where

\begin{matrix} Q ({\vec{k}}_{J} | q_{noinf}) = [\prod_{j = 1}^{J} {(1 - q_{noinf}^{D_{j}})}^{k_{j}}] q_{noinf}^{\sum_{j = 1}^{J} D_{j} (n_{j} - k_{j})} . \end{matrix}

(5)

Note that this expression is largely equivalent to that obtained by Mistry et al. [8] in the context of estimating the TCID₅₀ of a virus sample, and by many others in the broader context of infection dose quantification [12, 13].

Considering the choice of prior

In Eq (4), we obtained a posterior for q_noinf. Our objective, however, is to estimate the posterior distribution for C_inf, the specific infection concentration in our sample, rather than q_noinf. In fact, because both the plaque and ED assays provide an accuracy that is normally distributed in log₁₀(C_inf) rather than C_inf, it follows that log₁₀(C_inf) (hereafter ℓ_Cinf) rather than C_inf is the quantity of interest. We note that $Q ({\vec{k}}_{J} | q_{noinf})$ in Eq (4) is a probability density function in ${\vec{k}}_{J} = {k_{1}, k_{2}, \dots, k_{J}}$ , rather than in q_noinf. As such, a change of variables from q_noinf to ℓ_Cinf would affect only the prior, because $Q ({\vec{k}}_{J} | q_{noinf}) = Q ({\vec{k}}_{J} | q_{noinf} (ℓ_{Cinf})) = Q ({\vec{k}}_{J} | ℓ_{Cinf})$ . Thus, the posterior distribution for ℓ_Cinf is given by

\begin{matrix} P_{post, J} (ℓ_{Cinf} | {\vec{k}}_{J}) \propto Q ({\vec{k}}_{J} | q_{noinf} (ℓ_{Cinf})) P_{prior} (ℓ_{Cinf}) . \end{matrix}

(6)

To complete this expression, we need to choose a physically and biologically appropriate prior belief regarding ℓ_Cinf. Prior to conducting the ED assay, we know at least that C_inf ∈ [1/V_Earth,1/V_vir], where 1/V_vir is the maximum possible concentration, namely that if the entire volume of the sample is constituted solely of infectious virions, and 1/V_Earth is the minimum possible concentration, namely that if there was only one infectious virion left on Earth. As we explain below, these limits are not important; only the fact that they are convincingly physically bounded both from above and below, i.e., ∈ (0, ∞), is relevant.

If we choose our prior to be uniform in C_inf ∈ [1/V_Earth,1/V_vir], namely $P_{prior} (C_{inf}) = 1 / (1 / V_{vir} - 1 / V_{Earth}) \approx V_{vir}$ , and using the fact that $P_{prior} (C_{inf}) d C_{inf} = P_{prior} (ℓ_{Cinf}) d ℓ_{Cinf}$ , we can write

P_{prior} (ℓ_{Cinf}) = P_{prior} (C_{inf}) \frac{d C_{inf}}{d ℓ_{Cinf}} = V_{vir} \frac{d [10^{ℓ_{Cinf}}]}{d ℓ_{Cinf}} = V_{vir} ln (10) 10^{ℓ_{Cinf}} \propto 10^{ℓ_{Cinf}}

which yields

\begin{matrix} P_{post, J} (ℓ_{Cinf} | {\vec{k}}_{J}) \propto Q ({\vec{k}}_{J} | q_{noinf} (ℓ_{Cinf})) 10^{ℓ_{Cinf}} . \end{matrix}

(7)

We see here that the range chosen for the uniform prior in C_inf is not important because it only contributes a constant to our proportionality Eq (6).

Alternatively, because the ED assay estimates ℓ_Cinf rather than C_inf, our prior belief about the virus concentration is more appropriately expressed in ℓ_Cinf rather than C_inf. Again, the bounds of the uniform distribution in ℓ_Cinf is unimportant, provided that it is finite in extent such that $ℓ_{Cinf} \in [ℓ_{Cinf}_{_{min}}, {log}_{10} (1 / V_{vir})]$ where $ℓ_{Cinf}_{_{min}} > - \infty$ , such that we can write

\begin{matrix} P_{post, J} (ℓ_{Cinf} | {\vec{k}}_{J}) \propto Q ({\vec{k}}_{J} | q_{noinf} (ℓ_{Cinf})) . \end{matrix}

(8)

Fig 8 illustrates the two distinct priors assumed to arrive at Eqs (7) and (8) and their impact on the posterior $P_{post, J} (ℓ_{Cinf} | {\vec{k}}_{J})$ for the example ED experiment described in Fig 1. Fig 8A illustrates the consequence of choosing a prior uniform in C_inf, i.e., a bias towards higher virus concentrations. This is because a uniform prior in C_inf corresponds to a belief that one is as likely to measure a set of virus concentrations in the range [0.001, 0.002] as in the range [1,000, 000.001, 1, 000, 000.002]. When plotted on a log-scale, there are 100× more intervals of width 0.001 in [10⁴, 10⁵] than in [10², 10³]. Thus, this prior corresponds to a belief that the likelihood of measuring a certain virus concentration increases exponentially as ℓ_Cinf increases linearly. In contrast, a prior uniform in ℓ_Cinf corresponds to a belief that one is as likely to measure a set of virus concentrations in the range [0.001, 0.002] as in the range [1, 000, 000, 2, 000, 000], or rather in the range [1, 2] × 10⁻³ as in the range [1, 2] × 10⁶. As such, a uniform distribution in ℓ_Cinf is more physically and biologically sensible and therefore was chosen for our estimation method.

Calculation of midSIN’s outputs

One of the graphical outputs of midSIN is the non-normalized posterior distribution of ℓ_Cinf given the number of wells that were infected at each dilution, ${\vec{k}}_{J}$ , like that shown in Fig 1(left panel), computed as

\begin{matrix} U_{post} (ℓ_{Cinf} | {\vec{k}}_{J}) = \prod_{j = 1}^{J} \frac{n_{j}!}{k_{j}! (n_{j} - k_{j})!} \cdot p_{j}^{k_{j}} \cdot {(1 - p_{j})}^{n_{j} - k_{j}} \end{matrix}

(9)

where

\begin{matrix} p_{j} = 1 - exp [- 10^{ℓ_{Cinf}} \cdot V_{inoc} \cdot D_{j}] . \end{matrix}

(10)

While $U_{post}$ is not the normalized posterior distribution for ℓ_Cinf, its maximum value at its mode ( $ℓ_{Cinf}_{_{, mode}}$ ) is the normalized probability of observing this particular ED plate outcome ( ${\vec{k}}_{J}$ ) out of all other possible plate outcomes, assuming the true, specific infection concentration in the sample is $ℓ_{Cinf}_{_{, mode}}$ .

Another visual output of midSIN is a graphical representation of the theoretical number of wells that would be infected given the most likely ℓ_Cinf, like that shown in Fig 1(right panel). It is computed following

\begin{matrix} N_{wells infected} (x) = N_{wells total} [1 - exp (- 10^{ℓ_{Cinf,_{mode}}} V_{inoc} 10^{- x})], \end{matrix}

(11)

where x is the log₁₀ of the dilution such that $D = 10^{- x}$ is the dilution. It corresponds to the continuous equivalent of this quantity which is discrete in the ED assay, namely $D_{i} = 10^{- x_{i}}$ which is the i^th dilution of the sample. As such, $D_{i} = (minimum dilution) \cdot (dilution factor between columns)^{i - 1}$ where i ∈ [1, J]. For example, if the dilution of the least diluted column is 0.1 = 10⁻¹ and the dilution factor between dilutions in the ED assay is such that it halves the concentration between each dilution, i.e., $1 / 2 = 2^{- 1} = 10^{-}^{{log}_{10} (2)} \approx 10^{- 0.301}$ , then $D_{i} = 10^{- 1} \cdot 10^{- 0.301 \cdot (i - 1)}$ such that $D_{1} = 10^{- 1}$ , $D_{2} = 10^{- 1.301}$ , $D_{3} = 10^{- 1.602}$ , and so on, such that x₁ = 1, x₂ = 1.301, x₃ = 1.602, and so on.

In the graphical representation of the ED assay, the edges of the grey bands flanking the theoretical blue curve correspond to Eq (11) wherein $ℓ_{Cinf}_{_{, mode}}$ has been replaced by the 68% and 95% CI values for ℓ_Cinf. These CI bands do not correspond to the 68% and 95% CI of the expected number of infected wells at each dilution given $ℓ_{Cinf}_{_{, mode}}$ .

The sample dilution corresponding to 1 TCID₅₀ estimated based on the biased RM and SK approximations (right panels) are converted to SIN (left panels) based on 1 TCID₅₀ = e^γ=0.5772 SIN = 1.781 SIN [5, 6]. In contrast, the log₁₀(SIN/mL) computed by midSIN can be converted to a true (unbiased) estimate of log₁₀(TCID₅₀) using 1 TCID₅₀ = 1/ln(2) SIN = 1.44 SIN [4].

Infection concentration measures of influenza A virus samples

Cell culture

Madin-Darby canine kidney cells (MDCKs) were cultured in growth media (complete MEM media with 5% heat-inactivated FBS), in tissue culture treated T75 flasks, at 37°C with 5% CO₂ and 95% relative humidity. Cells were split 1/10 every 3–4 days or upon reaching approximately 95% confluency. One passage of cells was expanded for use by both researchers in one experiment to quantify the 50% tissue culture infectious dose (TCID₅₀) and plaque forming units (PFU) of one viral strain.

Viral stocks

Stocks of influenza A/Puerto Rico/8/34 (H1N1) (PR8) and influenza A/California/4/09 (Cali/09) were stored at -80°C and thawed on ice immediately before use. The TCID₅₀ and PFU of stock viruses was known to both researchers prior to this study. Serial dilutions were made in MDCK infection media (complete MEM media with 4.25% BSA) and dilutions were made by each researcher independently for titering. ‘Researcher A’ and ‘Researcher B’ independently performed the TCID₅₀ and PFU assays of one viral strain for one experiment on the same day using the same viral stock, reagents, and passage of cells. Each experiment was performed on a separate day (Fig 3).

Plaque assay

MDCKs were seeded in six-well plates (5.5 × 10⁵ cells/mL, 2 mL/well) and grown to 90% confluency overnight (37°C, 5% CO₂, 95% relative humidity). Each six-well plate contained 10-fold serial dilutions plated in singlet as well as a negative control and five 6-well plates were carried out per experiment. Cells were washed twice with PBS containing Ca²⁺Mg²⁺ (PBS w/ Ca²⁺Mg²⁺) (Gibco), before the addition of 500 μL of viral dilutions per well. After 1 h at room temperature on a rocker, the inoculum was aspirated, cells were washed with PBS w/ Ca²⁺Mg²⁺, and gently covered with 2 mL of agarose overlay (complete media, 4.25% BSA, 0.9% agarose, 1 μg/mL TPCK-Trypsin). After drying the overlay at room temperature, plates were inverted and incubated (37°C, 5% CO₂, 95% relative humidity) for 3 d (PR8) or 4 d (Cali/09). Plaques were visualized by staining cells with 0.1% crystal violet solution in 37% formaldehyde for 30 min and counted by ‘Researcher A’ or ‘Researcher B’ on their respective experiments (Fig 3).

TCID₅₀ assay

MDCKs were seeded in 96-well flat bottom plates (5 × 10⁴ cells/100 μL, 100 μL/well) and grown to 80% confluency overnight (37°C, 5% CO₂, 95% relative humidity). For each experiment, 4 replicate wells, at each of 7 different dilutions separated by a 10-fold dilution, were infected, and the dilution series was performed 5 times. Cells were washed with PBS w/ Ca²⁺Mg²⁺ before the addition of 100 μL of viral dilutions per well. After 1 h at room temperature on a rocker, the inoculum was aspirated and replaced with 100 μL of infection media containing 1 μg/mL TPCK-Trypsin. Cells were incubated (37°C, 5% CO₂, 95% relative humidity) for 3 d (PR8) or 4 d (Cali/09). Supernatants from each of the MDCK-containing wells were transferred to a matching well in a 96-well U-bottom plate in the same configuration, and mixed with chicken red blood cells (30 min, room temperature). This enabled us to score each of the original MDCK-containing wells as either positive or negative for infection, based on whether their supernatant caused hemagglutination. This was performed and read by ‘Researcher A’ or ‘Researcher B’ on their respective experiments.

Statistical analysis

The data points reported in Fig 3C and 3D were computed by taking each of the 5 replicates measured with either the PFU, RM, or SK and the 5 replicates measured via SIN (5 replicates × 5 replicates = 25 pairs) for each of the 2 experiments by each of the 2 researchers, yielding 100 pairs. For each pair, the log₁₀ of ratio of either PFU, RM or SK over SIN was computed. The mean and standard deviation of the resulting 100 log₁₀(ratio) were computed and are reported in Fig 3C and 3D. The statistical significance (p-value) of the differences between (PFU,RM,SK) and (SIN) was computed using the Mann-Whitney U test (scipy.stats.mannwhitneyu).

Data Availability

The authors confirm that all data underlying the findings are fully available without restriction. The code is freely available on GitHub (https://github.com/cbeauc/midSIN) and the midSIN tool is available as a web application (https://midsin.physics.ryerson.ca).

Funding Statement

This work was supported in part by Discovery Grant 355837-2013 (CAAB) from the Natural Sciences and Engineering Research Council of Canada (www.nserc-crsng.gc.ca), Early Researcher Award ER13-09-040 (CAAB) from the Ministry of Research and Innovation of the Government of Ontario (www.ontario.ca/page/early-researcher-awards), by the Interdisciplinary Theoretical and Mathematical Sciences programme (iTHEMS, ithems.riken.jp) at RIKEN (CAAB), and by R01 AI139088 (AMS, APS, LCL) from the NIH NIAID (www.niaid.nih.gov). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Spearman C. The method of “right and wrong cases” (constant stimuli) without Gauss’s formula. Br J Psychol. 1908;II(Part 3):227–242. doi: 10.1111/j.2044-8295.1908.tb00176.xref [DOI] [Google Scholar]
2. Kärber G. Beitrag zur kollecktiven behandlung pharmakologischer reihenversuche. Archiv f Experiment Pathol u Pharmakol. 1931;162(4):480–483. doi: 10.1007/BF01863914 [DOI] [Google Scholar]
3. Reed LJ, Muench H. A simple method of estimating fifty per cent endpoints. Am J Hygiene. 1938;27(3):493–497. doi: 10.1093/oxfordjournals.aje.a118408 [DOI] [Google Scholar]
4. Bryan WR. Interpretation of host response in quantitative studies on animal viruses. Ann N Y Acad Sci. 1957;69(4):698–728. doi: 10.1111/j.1749-6632.1957.tb49710.x [DOI] [PubMed] [Google Scholar]
5. Wulff NH, Tzatzaris M, Young PJ. Monte Carlo simulation of the Spearman-Kaerber TCID50. J Clin Bioinforma. 2012;2(1):5. doi: 10.1186/2043-9113-2-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Govindarajulu Z. 4. The Logit Approach. In: Statistical techniques in bioassay. 2nd ed. Basel; New York: Karger; 2001. p. 35–90. [Google Scholar]
7. LaBarre DD, Lowy RJ. Improvements in methods for calculating virus titer estimates from TCID₅₀ and plaque assays. J Virol Methods. 2001;96(2):107–126. doi: 10.1016/S0166-0934(01)00316-0 [DOI] [PubMed] [Google Scholar]
8. Mistry BA, D’Orsogna MR, Chou T. The effects of statistical multiplicity of infection on virus quantification and infectivity assays. Biophys J. 2018;114(12):2974–2985. doi: 10.1016/j.bpj.2018.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Mistry BA. Website application associated with [8];. Available from: http://www.bhavenmistry.com/SMOI/.
10.Spouge JL. Website application associated with [12];. Available from: https://www.ncbi.nlm.nih.gov/CBBresearch/Spouge/html_ncbi/html/id50/id50.cgi.
11. Beauchemin CAA, Kim YI, Yu Q, Ciaramella G, DeVincenzo JP. Uncovering critical properties of the human respiratory syncytial virus by combining in vitro assays and in silico analyses. PLOS ONE. 2019;14(4):e0214708. doi: 10.1371/journal.pone.0214708 [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Spouge JL. Statistical analysis of sparse infection data and its implications for retroviral treatment trials in primates. Proc Natl Acad Sci USA. 1992;89(16):7581–7585. doi: 10.1073/pnas.89.16.7581 [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Weir MH, Mitchell J, Flynn W, Pope JM. Development of a microbial dose response visualization and modelling application for QMRA modelers and educators. Environ Model Softw. 2017;88:74–83. doi: 10.1016/j.envsoft.2016.11.011 [DOI] [PMC free article] [PubMed] [Google Scholar]

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009480.r001

Decision Letter 0

Rob J De Boer, Roland R Regoes

2 May 2021

Dear Dr. Beauchemin,

Thank you very much for submitting your manuscript "Time to revisit the endpoint dilution assay and to replace TCID50 and PFU as measures of a virus sample's infection concentration" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

Your paper was reviewed by an expert on the statistics of estimating ID50s, and two experimental virologists with a strong quantitative research agenda. The two experimental reviewers saw great value in the approach you are proposing. All reviewers agree that the paper could be improved by shortening it, focusing on describing your method. The more general parts on the various benefits or drawbacks of plaque-forming and dilutions assays could be skipped, especially because the experimentalists among the reviewers did not fully agree with the material presented. We also believe that your paper would improve by comparing your method to more recent advances in the estimation of TCID50 as Reed and Munch is outdated (although still used in some circles). Reviewer 1 gives a few starting references for a more comprehensive comparison with existing methods.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Roland R Regoes

Associate Editor

PLOS Computational Biology

Rob De Boer

Deputy Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The article uses Bayesian probabilities to calculate the posterior probability distribution of the effective concentration of infectious particles in a viral stock. It also implements the calculation on the web. The article is unsuitable for publication in its present form. The authors should announce the software (e.g., Google “journal bioinformatics software”), confining their explanation to a few pages at most. The article omits an adequate survey of even rudimentary references. The end of the review lists some references relevant to similar problems in animal trials, not necessarily for citation, but as a start for searching for appropriate citations.

Most of the authors’ explanation is unnecessary. The discussion of the relative merits of plaque-forming and dilutions assays, e.g., is irrelevant. Each type of assay has its merits and drawbacks, but the decision to use one or the other is subordinate to experimental means and ends. The article can therefore take the use of a dilution assay as dependent on ends, as a given. The article motivates itself with the Spearman-Karber and Reed-Muench methods. Although the methods still appear in the literature, they have been discredited for at least 30 years. The article’s notation also obscures the simplicity of its ideas. Psychological experiments have shown that mathematical subscripts should be single letters, preferably with mnemonic value, because lengthy subscripts slow readers’ comprehension. To appreciate the point, replace q[noinf] by q (without subscript) in all equations.

The Bayesian probability model motivating the article is routine. Physically, infection is modeled by a Poisson likelihood. The article then gives a lengthy physical justification of the model prior. A routine non-informative prior may be preferable, but in any case, a Bayesian posterior should not be sensitive to the prior but depend mostly on the data. Any lengthy physical justification of the prior is therefore irrelevant.

Here are the promised citations.

Calculation of ID-50 https://pubmed.ncbi.nlm.nih.gov/26285041/

Infectious Particle Concentration https://pubmed.ncbi.nlm.nih.gov/1323844/

Harold Jeffreys "Theory of Probability" discusses non-informative priors, placing in context the unimportance of a "physical" Bayesian prior in statistical calculations.

Reviewer #2: This paper describes a new and better method for computing viral titers from endpoint dilution assays. The method is accompanied by an online and downloadable calculator. As a virologist whose lab routinely performs TCID50s, I can say that the method used here is a substantial improvement, and is something I am going to start recommending to my group. I definitely think that based on its content, this paper deserves to be published in PLoS Computational Biology.

That said, I have some pretty substantive suggested revisions. One of these is something that I think needs to be removed, and the others are my impression of things that need to be changed if the paper is going to be understood and well received by virologists.

MAJOR COMMENTS:

The paper makes two arguments: (1) endpoint dilutions assays such as TCID50 are better than plaque assays, and (2) the midSIN method is better than things like Reed-Muench for computing titers from endpoint dilution assays. The second of these points is definitely true, and forms the strong basis for the content of this paper. However, I don’t think the first point (superiority of endpoint dilution over plaque assay) is clearly established, nor do I think it’s at all necessary for this paper. I say this as someone who personally prefers endpoint dilution assays (TCID50) to plaque assay. But some virologists prefer plaque assays for a variety of reasons, including liking to see the plaques, the additional information they get from examining plaque sizes, etc. If I hadn’t read all the way through because I was a reviewer, I would have dismissed this paper after the first few paragraphs as an opinion piece arguing for TCID50 over plaque assays, and not paid attention to any of the rest. I strongly recommend the authors focus on what they clearly objectively demonstrate (that the midSIN method is better than alternatives for computing endpoint titers), and dispense with the more subjective arguments based on experimental factors that make them personally prefer endpoint assays to plaque assays.

I think the paper would benefit from a clearer “intuitive layman’s explanation” of what exactly is wrong with the Reed-Muench formula compared to midSIN. Right now there is little explanation in main text, and then highly technical details in Methods but not good bridging of these.

Although this is more a stylistic comment and one that is ultimately at the authors’ discretion, I’d suggest that the paper will have more impact if it’s more succinct, has less vague discussion of experiments and philosophical issues of titers, and really cuts more quickly to the heart of the issue which is that they have an improved way to calculate endpoint titers, they have implemented a calculator, and that their method allows calculation of how experimental choices (like dilution factor, number of dilution series, etc) affect accuracy.

MINOR COMMENTS:

The number of acronyms introduced just in the abstract (RM, SK, etc) becomes overwhelming and decreases readability. Maybe some of the less commonly used acronyms could be eliminated in favor of just writing out the full phrase?

Lines 6-17: another limitation of counting virions under a microscope is that it does not distinguish physical from infectious particles. The same is true for qPCR. This is a really serious limitation, moreso even than cost, etc. In fact, I sort of wonder if this entire first paragraph is a little bit irrelevant to the question at hand, which is titrating infectious particles.

Lines 19-22: Again, this isn’t quite true. They are certainly not easy to separate, but for instance with influenza there is some evidence that defective virions lacking genes sometimes have slightly different morphologies, etc—and can at least be partially separated by certain types of centrifugation. Again, like for lines 6-17, I sort of feel like the authors are spending a lot of time on not 100% accurate text that isn’t even really relevant to their main point and finding, which is titrating infectious particles.

Lines 68-71: The same limitation can apply to endpoint dilution (e.g., TCID50) assays, as the actual cell being used for the experiment doesn’t always work for the endpoint dilution assay. For instance, people performing flu infections of human primary airway cells still titer the virus by TCID50 on MDCK cells as you can’t do a TCID50 in human primary airway cells.

Reviewer #3: The manuscript by Cresta et al suggest a new platform for analyzing endpoint dilution assays of virus infection, which are currently based on mathematical approximations that introduce a bias into the outcome. While these errors are known and can be corrected, the calculations still show inconsistencies is specific cases.

I think its in general a good idea to revisit such traditional assays, identify potential limitations and work on their improvement. The authors also provide their analysis as an open online tool, which is a nice example for the open science principles. The manuscript is well written but the real value of the tool needs to be worked out a bit more. While the presented tool overcomes some limitations, its use seems overstated as it does not solve all of the mentioned problems.

1) There are many ways to quantify a virus sample and its important to consider what the assays measure. The authors aim at improving the estimation of an infection concentration meaning how many infections a virus sample could cause per unit volume. They compare plaque/focus forming assays with endpoint dilution methods. While the introduction gives a nice overview of these assays and highlights their limitations, in particular those of the plaque assay, it’s not a fair comparison. The plaque or focus assays use an overlay medium to restrict infection spread to only the neighboring cells. Infected cells and their infected neighbors will then after some time and following some form of coloring be visible as a plaque/focus. The ED assay (TCID50) typically does not use overlay medium and relies on immunostaining or CPE to score infection. Plaque and ED assay thus have usually different readouts, replication or infection. In the paper, the authors use hemagglutinating units to quantify the amount of released virus in their TCID50 assays. That is yet a different measure and might in include non-replicating particles.

In my opinion is the proposed midSIN platform best suited to analyze traditional ED assays where infection is labelled with antibodies and can be analyzed with an automated reader. The manuscript (title, intro and discussion) is thus misleading in several places by saying that midSIN overcomes the limitations of a plaque assay.

2) Another aspect that I think could be improved is the requirement of a threshold to score a well as positive. Can the analysis be performed using raw plate analyzer readings (fluorescence units per well)? That would be ideal and remove all personal bias from the analysis.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

PLoS Comput Biol. 2021 Oct 18;17(10):e1009480. doi: 10.1371/journal.pcbi.1009480.r002

Author response to Decision Letter 0

18 Aug 2021

Attachment

Submitted filename: response.pdf

Click here for additional data file.^{(203.7KB, pdf)}

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009480.r003

Decision Letter 1

Rob J De Boer, Roland R Regoes

26 Sep 2021

Dear Dr. Beauchemin,

We are pleased to inform you that your manuscript 'Time to revisit the endpoint dilution assay and to replace TCID50 as a measure of a virus sample's infection concentration' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology.

Best regards,

Roland R Regoes

Associate Editor

PLOS Computational Biology

Rob De Boer

Deputy Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have made a partial effort to comply with requests to shorten the manuscript. The manuscript has benefited greatly, but it is still excessively long. The length still partially obscures the justification for replacing the ID50 with the midSIN. The manuscript also retains the Spearman-Karber and Reed-Muench methods as a standard of comparison for the midSIN, and the bias in these methods is already known.

I remain supportive of the development of web calculators to replace the Spearman-Karber and Reed-Muench methods, but the manuscript's presentation of the midSIN is unlikely to promote the replacement much, if at all.

Reviewer #2: The paper is much improved, and I support is publication.

Reviewer #3: The authors have addressed my comments and, together with the other revisions, well improved the readability of the manuscript. I realised that my comments were not always as precise as I wished, but the authors well grasped their point and responded adequately. midSin is an easy to use tool that I will see to include in my groups activities.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009480.r004

Acceptance letter

Rob J De Boer, Roland R Regoes

13 Oct 2021

PCOMPBIOL-D-21-00213R1

Time to revisit the endpoint dilution assay and to replace TCID₅₀ as a measure of a virus sample's infection concentration

Dear Dr Beauchemin,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Andrea Szabo

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Attachment

Submitted filename: response.pdf

Click here for additional data file.^{(203.7KB, pdf)}

Data Availability Statement

[pcbi.1009480.ref001] 1. Spearman C. The method of “right and wrong cases” (constant stimuli) without Gauss’s formula. Br J Psychol. 1908;II(Part 3):227–242. doi: 10.1111/j.2044-8295.1908.tb00176.xref [DOI] [Google Scholar]

[pcbi.1009480.ref002] 2. Kärber G. Beitrag zur kollecktiven behandlung pharmakologischer reihenversuche. Archiv f Experiment Pathol u Pharmakol. 1931;162(4):480–483. doi: 10.1007/BF01863914 [DOI] [Google Scholar]

[pcbi.1009480.ref003] 3. Reed LJ, Muench H. A simple method of estimating fifty per cent endpoints. Am J Hygiene. 1938;27(3):493–497. doi: 10.1093/oxfordjournals.aje.a118408 [DOI] [Google Scholar]

[pcbi.1009480.ref004] 4. Bryan WR. Interpretation of host response in quantitative studies on animal viruses. Ann N Y Acad Sci. 1957;69(4):698–728. doi: 10.1111/j.1749-6632.1957.tb49710.x [DOI] [PubMed] [Google Scholar]

[pcbi.1009480.ref005] 5. Wulff NH, Tzatzaris M, Young PJ. Monte Carlo simulation of the Spearman-Kaerber TCID50. J Clin Bioinforma. 2012;2(1):5. doi: 10.1186/2043-9113-2-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009480.ref006] 6. Govindarajulu Z. 4. The Logit Approach. In: Statistical techniques in bioassay. 2nd ed. Basel; New York: Karger; 2001. p. 35–90. [Google Scholar]

[pcbi.1009480.ref007] 7. LaBarre DD, Lowy RJ. Improvements in methods for calculating virus titer estimates from TCID₅₀ and plaque assays. J Virol Methods. 2001;96(2):107–126. doi: 10.1016/S0166-0934(01)00316-0 [DOI] [PubMed] [Google Scholar]

[pcbi.1009480.ref008] 8. Mistry BA, D’Orsogna MR, Chou T. The effects of statistical multiplicity of infection on virus quantification and infectivity assays. Biophys J. 2018;114(12):2974–2985. doi: 10.1016/j.bpj.2018.05.005 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009480.ref009] 9.Mistry BA. Website application associated with [8];. Available from: http://www.bhavenmistry.com/SMOI/.

[pcbi.1009480.ref010] 10.Spouge JL. Website application associated with [12];. Available from: https://www.ncbi.nlm.nih.gov/CBBresearch/Spouge/html_ncbi/html/id50/id50.cgi.

[pcbi.1009480.ref011] 11. Beauchemin CAA, Kim YI, Yu Q, Ciaramella G, DeVincenzo JP. Uncovering critical properties of the human respiratory syncytial virus by combining in vitro assays and in silico analyses. PLOS ONE. 2019;14(4):e0214708. doi: 10.1371/journal.pone.0214708 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009480.ref012] 12. Spouge JL. Statistical analysis of sparse infection data and its implications for retroviral treatment trials in primates. Proc Natl Acad Sci USA. 1992;89(16):7581–7585. doi: 10.1073/pnas.89.16.7581 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1009480.ref013] 13. Weir MH, Mitchell J, Flynn W, Pope JM. Development of a microbial dose response visualization and modelling application for QMRA modelers and educators. Environ Model Softw. 2017;88:74–83. doi: 10.1016/j.envsoft.2016.11.011 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Time to revisit the endpoint dilution assay and to replace the TCID50 as a measure of a virus sample’s infection concentration

Daniel Cresta

Donald C Warren

Christian Quirouette

Amanda P Smith

Lindey C Lane

Amber M Smith

Catherine A A Beauchemin

Roles

Abstract

Author summary

Introduction

Results

Key features of midSIN’s output

Fig 1. Visual representation of midSIN’s output for the example ED plate.

Fig 2. Quantification of RSV sampled from in vitro infections.

Comparing SIN to TCID50 and PFU virus sample concentrations

Fig 3. Comparing SIN to TCID50 and PFU for influenza A virus samples.

Comparing midSIN’s performance to that of the RM and SK methods

Fig 4. Visualizing TCID50 estimation by the RM and SK methods.

Fig 5. midSIN’s estimate of a sample’s infection concentration based on a single dilution.

Fig 6. Comparing known input to estimated output concentrations.

Estimate accuracy as a function of plate layout

Fig 7. Comparing the effect of the dilution factor and number of replicates per dilution.

Discussion

Methods

The mathematics of the dose-response assay

Considering a single well

Considering replicate wells at a given dilution

Considering all dilutions of the ED assay

Considering the choice of prior

Fig 8. Impact of the choice of prior on the posterior distribution for ℓCinf.

Calculation of midSIN’s outputs

Infection concentration measures of influenza A virus samples

Cell culture

Viral stocks

Plaque assay

TCID50 assay

Statistical analysis

Data Availability

Funding Statement

References

Decision Letter 0

Rob J De Boer

Roland R Regoes

Roles

Author response to Decision Letter 0

Decision Letter 1

Rob J De Boer

Roland R Regoes

Roles

Acceptance letter

Rob J De Boer

Roland R Regoes

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Time to revisit the endpoint dilution assay and to replace the TCID₅₀ as a measure of a virus sample’s infection concentration

Comparing SIN to TCID₅₀ and PFU virus sample concentrations

Fig 3. Comparing SIN to TCID₅₀ and PFU for influenza A virus samples.

Fig 4. Visualizing TCID₅₀ estimation by the RM and SK methods.

Fig 8. Impact of the choice of prior on the posterior distribution for ℓ_Cinf.

TCID₅₀ assay