Skip to main content
PLOS One logoLink to PLOS One
. 2023 Feb 13;18(2):e0281710. doi: 10.1371/journal.pone.0281710

Operational analysis for COVID-19 testing: Determining the risk from asymptomatic infections

Marc Mangel 1,2,3,*
Editor: Rehana Naz4
PMCID: PMC9925232  PMID: 36780871

Abstract

Testing remains a key tool for managing health care and making health policy during the coronavirus pandemic, and it will probably be important in future pandemics. Because of false negative and false positive tests, the observed fraction of positive tests—the surface positivity—is generally different from the fraction of infected individuals (the incidence rate of the disease). In this paper a previous method for translating surface positivity to a point estimate for incidence rate, then to an appropriate range of values for the incidence rate consistent with the model and data (the test range), and finally to the risk (the probability of including one infected individual) associated with groups of different sizes is illustrated. The method is then extended to include asymptomatic infections. To do so, the process of testing is modeled using both analysis and Monte Carlo simulation. Doing so shows that it is possible to determine point estimates for the fraction of infected and symptomatic individuals, the fraction of uninfected and symptomatic individuals, and the ratio of infected asymptomatic individuals to infected symptomatic individuals. Inclusion of symptom status generalizes the test range from an interval to a region in the plane determined by the incidence rate and the ratio of asymptomatic to symptomatic infections; likelihood methods can be used to determine the contour of the rest region. Points on this contour can be used to compute the risk (defined as the probability of including one asymptomatic infected individual) in groups of different sizes. These results have operational implications that include: positivity rate is not incidence rate; symptom status at testing can provide valuable information about asymptomatic infections; collecting information on time since putative virus exposure at testing is valuable for determining point estimates and test ranges; risk is a graded (rather than binary) function of group size; and because the information provided by testing becomes more accurate with more tests but at a decreasing rate, it is possible to over-test fixed spatial regions. The paper concludes with limitations of the method and directions for future work.

Introduction

Entering the third year of the 2019 coronavirus disease (henceforth COVID-19) pandemic, it is clear that the world was woefully underprepared, in many different ways, for dealing with it. ItThe current pandemic is an illustration of the natural evolutionary play [1] so that one should expect future pandemics and lose no time in preparing for them while dealing with the present one.

Some authors have suggested that it is appropriate to prepare for the next pandemic as one prepares for war [25]. Operational analysis grew out of the scientific approach to operational questions in World War II [69]. One of the key tenets of operational analysis is to model the process as well as the data [7, 10]. Among the advantages, when the process is modeled, one knows the true state of world, which allows assessment of the quality of the analyses by comparison of analytical outcomes with a known situation. This gives confidence that the methods will work when the true state of the world is unknown.

Models for dynamics and control of the disease, prioritizing hospital care, and setting policy [1118] require information about the health status of the population. This is determined by testing for infection, which thus emerged as a crucial component of managing health policy during the current pandemic and will probably be key in future pandemics [19]. For example, the Global Influenza Surveillance and Response Network (the “flu network” [20, 21]) established in 1952 played a key role in the early responses to the COVID-19 pandemic. The time is now to prepare for future testing.

Testing is complicated an individual is in an early stage of the infection may give a false negative test, infected individuals may be asymptomatic and thus not tested, and symptomatic but uninfected individuals may give false positive test results. Thus, test errors involve both false negative tests, and false positive tests, in which an uninfected individual tests positive [2227]. These are called the false negative probability [28], denoted here by pFN, and false positive probability, denoted here pFP. It may one day be possible to drive the false positive probability to zero with improved specificity of tests, but the ontogeny of the disease within an individual means that there will always be false negative tests [25, 26].

A starting point for the interpretation of testing results is to envision that a population is divided into infected (antigen positive) and uninfected (antigen negative) individuals, with the goal of estimating the fraction of infected individuals (the incidence rate) from the number of positive results P when T tests are given. Because of both kinds of test errors, the surface positivity rate P/T (which is observed; henceforth simply called positivity rate) will generally differ from the incidence rate (which is not observed). It is natural and intuitive to ask for the unobserved incidence rate that is most likely given the test results; this is called the Maximum Likelihood Estimate (MLE) of the incidence rate.

Brown and Mangel [29] and Mangel and Brown [30] (also see [31, 32] where similar methods are used) show that the Maximum Likelihood Estimtate for the incidence rate, denoted by f^ is

f^=P/T-pFP1-pFN-pFP, (1)

which is to be interpreted as f^=0 if the right side of Eq 1 is negative. As will be explained below application of Bayesian methods allows determination of a probability distribution for the incidence rate when the right side of Eq 1 is negative.

In addition to a point estimate for the incidence rate, it is valuable to have a range of incidence rates that are consistent with the model and the data since then one can bound the incidence rate and its associated risk of further infection (Eqs 3 and 4 below). That is, forecasting for a pandemic can be improved by using a predictive distribution, rather than the point estimates in Eq 1, (cf. [33]).

In [29, 30], we show that an appropriate test range, denoted by Range(f^) is

Range(f^)=3.92p+(f^)(1-p+(f^))T(1-pFN-pFP)2, (2)

where p+(f^)=f^(1-pFN)+(1-f^)pFP.

McElreath [34, p. 54] describes and equation such as Eq 2 as the 95% compatibility interval, avoiding the undesired implications of words such as “confidence” or “credible” [35]. Range(f^) is symmetrically distributed around the true range with very small mean error between the two [30], so that lower and upper limits for the estimated incidence rate are f^lower=f^-0.5·Range(f^) and f^upper=f^+0.5·Range(f^).

Mangel and Brown [30] also show how likelihood methods can be used to obtain a test range when positivity is 0 (so that f^=0). In this paper, we will show how to determine the test range when 0 < P/T < pFP However, for the remainder of this section, we assume that P/T > pFP so that the estimate of incidence rate is strictly positive.

Eqs 1 and 2 lead to the operational recommendation that one should stratify testing data according to test errors. When this is not possible, one should stratify tests according to the estimated time since exposure, assign best estimates to the test errors, and conduct sensitivity analyses of the results.

Test results can play an important role in policymaking because they can be used to determine the risk of spreading infection associated with groups of different sizes. Doing so requires a definition of risk. We define the risk to be including at least one infected individual in a group of specified size. The risk R(h,f^) associated with a group of size h when the estimate for incidence rate is f^ is [29, 30]

R(h,f^)=1-(1-f^)h. (3)

Eq 3 allows us to explore the risk ramifications of groups of different sizes. Were the true incidence rate known, we replace f^ by ft (Fig 1). Fig 1 can be used to determine the risk associated with groups of different sizes by choosing a group size on the x-axis, drawing a vertical line to intersect the curve, drawing a horizontal line that intersects the y-axis, and reading off the level of risk. When the true value of the incidence rate is unknown, we create upper and lower bounds for risk by generating curves similar to Fig 1 using the lower and upper bounds f^lower=f^-0.5·Range(f^) and f^upper=f^+0.5·Range(f^), as in Brown and Mangel [[29, Fig 2].

Fig 1. The risk of groups of different sizes (Eq 3) when the true fraction of infected individuals is ft = 0.05 (i.e., we set f^=0.05 in Eq 3).

Fig 1

This figure can be used to determine the risk associated with groups of different sizes (ranging from 2 to 100) by choosing a group size on the x-axis, drawing a vertical line to intersect the curve, drawing a horizontal line that intersects the y-axis, and reading off the level of risk.

We can invert Eq 3 by specifying a level of acceptable risk Racc and then solve fof the group size hacc(Racc,f^) consistent with the specified acceptable risk and the estimate of incidence rate is f^:

hacc(Racc,f^)=log(1-Racc)log(1-f^). (4)

Replacing the estimate of incidence rate by its maximum and minimum values fmax=f^+0.5·Range(f^) and fmin=f^-0.5·Range(f^) in Eq 4 allows us to bound the acceptable group size consistent with the level of acceptable risk.

In Fig 2, I show 16 realizations of acceptable group size using the simulation methods described in [30]. The dotted line shows the group size consistent with the level of acceptable risk when the incidence rate is ft, the solid black line is the group size using the estimate in Eq 4, and the red and blue lines are the group sizes using the maximum and minimum estimates for incidence rate, fupper=f^+0.5·Range(f^) and flower=f^-0.5·Range(f^), respectively. One key observation is that the group size determined if the incidence rate were known (dotted line) falls between those determined from the upper and lower limits of incidence rate determined by the test range.

Fig 2. Sixteen realizations of the group size as a function of acceptable risk using the simulation methods described in [30].

Fig 2

In all panels, the number of tests is T = 2500, the true incidence rate is ft = 0.05, and the probabilities of false negative and false positives tests are 0.25 and 0.05. The dotted line shows the group size consistent with the level of acceptable risk when the incidence rate is ft, the solid black line is the group size using the estimate in Eq 1 determined using the positivity rate from the individual realization of the simulation, and the red and blue lines are the group sizes using the maximum and minimum estimates for incidence rate, fupper=f^+0.5·Range(f^) and flower=f^-0.5·Range(f^), respectively. One key observation is that the group size determined if the incidence rate were known (dotted line) falls between those determined from the upper and lower limits of incidence rate determined by the test range.

It is also now well established that asymptomatic infected individuals can readily transmit infection [3647]. Birx [2] emphasizes the role of untested asymptomatic individuals in the spread of the disease. Because of asymptomatic cases, policies that exclude symptomatic individuals from groups may still have considerable risk of including infected individuals who can transmit the disease.

The first purpose of this paper is to show how to obtain the test range when there is no information on symptoms and positivity is less than the probability of a false positive test. The second purpose of this paper is to generalize Eqs 14 and develop the analogue of Fig 1 when asymptomatic and symptomatic individuals are identified at the time of testing.

When there is information on symptoms (Fig 3), a fraction ft of the population is infected and symptomatic; a fraction ρtft is infected and asymptomatic; a fraction gt is uninfected but symptomatic; and the remaining fraction, 1 − ft(1 + ρt) − gt, is neither infected nor symptomatic. Infected individuals have probabilities of a false negative test, denoted by pSFN and pAFN where the subscript S and A correspond to symptomatic and asymptomatic individuals, respectively. Uninfected individuals have probabilities of a false positive test denoted by pSFP and pAFP, respectively. Using testing information, we seek point estimates and the analogue of test ranges for the unknown incidence rates and ratio of asymptomatic to symptomatic cases.

Fig 3. The population divided into four classes according to infection and symptom status.

Fig 3

A fraction ft of the population is symptomatic and infected (antigen positive); such individuals have a probability of a false negative test pSFN. A fraction gt of the population is symptomatic but not infected; such individuals have a probability of a false positive test pSFP. A fraction ρtft of the population is infected but not symptomatic; such individuals have a probability of a false negative test pAFN. Finally, fraction 1 − ftgtρtft = 1 − ft(1 + ρt) − gt of the population is neither infected nor symptomatic; such individuals have a probability of a false positive test pAFP. The subscript t indicates that these three parameters characterize the true state of the world, however none of them are observable.

Materials and methods

Determining test range with no information on symptoms and positivity less than the probability of a false positive test

In this case, as with Eqs 14, there is a single unknown incidence rate, which we continue to denote by ft. The methods used are generalized when there is information on symptoms, so this section is a warm-up to the harder problem.

The probability of obtaining a positive test when the incidence rate is f is

p+(f)=f(1-pFN)+(1-f)pFP). (5)

The first term on the right hand side of Eq 5 corresponds to individuals who are infected and have a true positive test; the second term corresponds to individuals who are not infected and have a false positive test.

When T tests are given, the number of positive tests P is binomially distributed with parameters T and p+(f) [3032], which we write as P=B(·,T,p+(ft)), where B(P,T,p+(f))=(TP)p+(f)P(1-p+(f))T-P. The likelihood of an incidence rate f given the test data P and T has the same form [3032], but is a function of the incidence rate f conditioned on the values of the test data

L(p+(f)|P,T)=(TP)p+(f)P(1-p+(f))T-P. (6)

In S1 Section in S1 File, we show that the maximum likelihood estimate for the fraction of the population infected f^ satisfies

p+(f^)p+(f^)(1-p+(f^))[P-Tp+(f^)]=0 (7)

where p+(f^)=1-pFN-pFP.

Since p+(f^)=f^(1-pFN)+(1-f^)pFP, we conclude that if there is an internal maximum of the likelihood (i.e. f^>0), it must occur when P=Tp+(f^)=T[f^(1-pFN)+(1-f^)pFP]. Solving this equation for f^ gives Eq 1. When PTp+(f), we set f^ = 0 and arrive at the nettlesome case of this subsection.

In Fig 4, I show the logarithm of the likelihood (the log-likelihood function) as a function of incidence rate f for four values of positivity. In Fig 4(A), P/T = 0.075 and the peak of the likelihood is clearly away from the boundary f = 0. As positivity declines but stays larger than pFP, as in Fig 4(B) and Fig 4(C), there is still an internal peak of the likelihood function. However, when positivity falls below pFP, as in Fig 4(D), the maximum of the log-likelihood function occurs on the boundary.

Fig 4. Behavior of the log-likelihood function.

Fig 4

Shown is the log-likelihood function (the logarithm of the right side of Eq 6) as the positivity rate declines when pFN = 0.25, pFP = 0.05 and T = 100 tests are administered for positivity (A) 0.075, (B) 0.06, (C) 0.0525, and (D) 0.04.

We convert from a likelihood to a probability distribution by assuming a uniform prior on f and use Bayes’s theorem to write the probability density for f given the test data (also see S2 Section in S1 File):

φ(f|P,T)=L(f|P,T)f=01L(f|P,T)df. (8)

Although the denominator in Eq 8 can be written in terms of the classical beta function [48], it is most simply viewed as a constant obtained by using a very fine discretization of the interval [0, 1].

When the maximum of the likelihood occurs at the boundary f = 0, the probability φ(f) will also have its maximum at the boundary. In this case, the test range is no longer symmetrical but is an interval [0, f0.95], where f0.95 is the value of incidence rate such that f=0f0.95ϕ(f|P,T)df=0.95 (or the equivalent when a summation instead of an integral is used in Eq 8).

Analysis when there is information on symptoms

The operational situation with information on symptoms

We assume that T tests are administered to a population in which some individuals are symptomatic and others are not (recorded at the time of testing) and each individual tested has either a positive or negative test for coronavirus. As described in Fig 3, there are now four classes of individuals:

  • A fraction ft of the population is symptomatic and infected (antigen positive); these individuals have a probability of a false negative test pSFN.

  • A fraction gt of the population is symptomatic but not infected; these individuals have a probability of a false positive test pSFP.

  • A fraction ρtft of the population is infected but not symptomatic; these individuals have a probability of a false negative test pAFN.

  • The remaining fraction of the population, 1 − ftgtρtft = 1 − ft(1 + ρt) − gt, is neither infected nor symptomatic; these individuals have a probability of a false positive test pAFP.

When this situation holds, three kinds of test data are generated:

  • The number P of positive tests.

  • The number TS of symptomatic individuals.

  • The number PS of symptomatic individuals who tested positive.

Point estimates for the fractions of infected and uninfected symptomatic individuals and the ratio of asymptomatic to symptomatic infected individuals

The following causal chain characterizes the operation of testing:

  • The total number of tests, T, leads to number of symptomatic individuals in the sample, TS.

  • TS leads to the number of positive tests of symptomatic individuals, PS.

  • T, TS, and PS combined lead to the remaining positive results, PPS of TTS tests from asymptomatic individuals.

As above, we let B(·|N,p) denote a binomial distribution with number of samples N and probability of a positive event p, where the dot can run from 0 (no positive event) to N (only positive events). If pS denotes the probability of sampling a symptomatic individual, pS+ the probability of obtaining a positive test from a symptomatic individual, and pA+ the probability of obtaining a positive test from an asymptomatic individual, the test results have distributions

TSB(·|T,pS), (9)
PSB(·|T,TS,pS+),and (10)
P-PSB(·|T,TS,PS,pA+). (11)

The probabilities on the right sides of Eqs 911 are constructed from the assumptions summarized in Fig 3. The fraction of individuals who are symptomatic is ft + gt so that

pS=ft+gt. (12)

Since pS+ is the probability that an individual tests positive given that the individual is symptomatic, from the definition of conditional probability

pS+=Pr[symptomaticandtestpositive]Pr[symptomatic]=ft(1-pSFN)+gtpSFPft+gt. (13)

The probability that an individual tests positive given that the individual is asymptomatic is computed similarly:

pA+=ftρt(1-pAFN)+(1-gt-ft(1+ρt))pAFP1-ft-gt. (14)

We let f^,g^ and ρ^ denote the maximum likelihood estimates (MLEs) for the fraction of the population that is infected and symptomatic or not infected and symptomatic respectively, and for the ratio of the fraction that is infected and asymptomatic to that which is infected and symptomatic.

When a random variable has the binomial distribution B(·|N,p), given K positive events, the MLE for p is p^=K/N (see S1 Section in S1 File) so that the MLEs for the probabilities in Eqs 911 are TS/T, PS/TS, and (PPS)/(TTS) from which we conclude

f^+g^=TST, (15)
f^(1-pSFN)+g^pSFPf^+g^=PSTS,and (16)
f^ρ^(1-pAFN)+(1-f^(1+ρ^)-g^)pAFP=(1-f^-g^)P-PST-TS. (17)

Eqs 15 and 16 are independent of ρ^ and can be rewritten as

f^=g^(PS/TS)-pSFP1-pSFN-(PS/TS), (18)

which we write as f^=c1(PS,TS)g^, where c1(PS, TS) is the combination of terms multiplying g^ on the right side of Eq 18 and the test errors are suppressed. With this notation, we substitute into Eq 15 and solve to obtain

g^=TS(1+c1(PS,TS))T. (19)

Thus, both f^ and g^ are known; they are random variables because PS and TS are random variables.

We now rewrite Eq 17 as

f^ρ^(1-pAFN-pAFP)=(1-f^-g^)P-PST-TS-pAFP(1-f^-g^), (20)

let c2 = 1 − pAFNpAFP, and solve for ρ^ to obtain

ρ^=(1-f^-g^)c2f^[P-PST-TS-pAFP]. (21)

Since the right side of Eq 21 depends on the test data TS, PS, and P, ρ^ is also a random variable.

Eqs 18, 19 and 21 generalize Eq 2 to the case in which symptomatic and asymptomatic individuals are identified at the time of testing. We have already thus generalized the method in [29, 30] to obtain point estimates of the fractions of the population of infected and symptomatic, uninfected and symptomatic, and infected and asymptomatic individuals. We next explore the properties of these point estimates and then generalize the notion of test range and compute and the risk of groups of different sizes including asymptomatic infected individuals.

The means of the estimates

We compute the means of the estimates, continuing to use ft, gt, and ρt to denote their true values, with two goals. We explore 1) whether the estimates in Eqs 18, 19 and 21 are unbiased, in the sense that their expectations (over the stochastic sampling process) are the underlying true values generating the data, and 2) if there is a bias how to characterize it.

The mean of g^

We begin by rewriting Eq 19 as

g^=TST·11+c(PS,TS). (22)

In S2 Section in S1 File, I show that 1+c(PS,TS)=1-pSFN-pSFP1-pSFN-(PS/TS) so that we can rewrite Eq 22 as

g^=TST·[1-pSFN-(PS/TS)1-pSFN-pSFP]=1T·[TS(1-pSFN)-PS1-pSFN-pSFP]. (23)

Since the denominator in Eq 23 is a constant, the expectation of g^ is

E(g^)=1T·11-pSFN-pSFP·[E(TS)(1-pSFN)-E(PS)]. (24)

In the S2 Section in S1 File, I show that E(TS)=T(ft+gt) and E(PS)=T[ft(1-pSFN)+gtpSFP], from which it follows that E(g^)=gt; the expected value of g^ is the true value that underlies the testing process.

The mean of f^

We begin by multiplying the top and bottom of the right side of Eq 18 by TS to obtain

f^=g^PS-pSFPTSTS(1-pSFN)-PS (25)

and now use the version of g^ on the far right side of Eq 23 to obtain

f^=1T·[TS(1-pSFN)-PS1-pSFN-pSFP]·[PS-pSFPTSTS(1-pSFN)-PS]=1T[PS-pSFPTS1-pSFN-pSFP]. (26)

Taking expectations on the far right side of Eq 26, we obtain

E(f^)=1T·11-pSFN-pSFP·[T(ft(1-pSFN)+gtpSFP)-T(ft+gt)pSFP]=1T·11-pSFN-pSFP·Tft(1-pSFN-pSFP)=ft, (27)

so that the expected value of f^ is the true value that underlies the testing process.

The mean of ρ^

We begin with Eq 21, rewritten as

ρ^=(1-f^-g^)(1-pAFN-pAFP)f^[P-PS-pAFP(T-TS)T-TS]. (28)

Eq 28 is a nonlinear function of f^ and g^ and involves the quotients of the random variables. We can approximate the expectation of ρ^ using the delta-method [30, 49], which involves Taylor expansion of the right hand side of Eq 28 to second order and then taking expectations. (Details are in the S2 Section in S1 File). The result is

E(ρ^)=ρt+12[2ρtft2Var(f^)-2Tc2ft2Cov(f^,P-PS)+2pAFPTc2ft2Cov(f^,T-TS)]. (29)

where Var(X) and Cov(X, Y) denote the variance and covariance of random variables X and Y, which arise from the second order Taylor expansion.

The right side of Eq 29 shows that the leading term in the expected value of ρ^ is the true value generating the data and that this is corrected by variances and covariances that account for the nonlinearity in Eq 24.

Joint properties of f^ and ρ^ via likelihood analysis

Eq 2 for the test range can be obtained by direct manipulation of the relevant random variables [30]. When we separate symptomatic and asymptomatic infections, the compatibility interval for the incidence rate is replaced by a compatibility region (CR) for the incidence rate of symptomatic infected individuals and the ratio of asymptomatic to symptomatic individuals. Because Eqs 18 and 19 are nonlinear in the test results (which are random variables) and Eq 21 is also nonlinear in f^, the analytical approach used in Mangel and Brown [30] is less feasible now.

We develop the analogue of Eq 2 for test range by using likelihood analysis [5052], exploiting the general property that for a smooth and well-behaved likelihood (which those that follow are), a 95% CR can be approximated by finding the range of variables for which the log-likelihood is below the peak log-likelihood by 1.96 times the number of free parameters. This is essentially a generalization of the Gaussian approximation to the binomial distribution [53] that leads to Eq 2 [30].

We denote the test results by T˜S,P˜S, and P˜. For any values of f, g, and ρ, the rules of conditional probability imply (suppressing the dependence on T which is known)

P(TS,PS,P|f,g,ρ)=Pr[T˜S=TS,P˜S=PS,P˜=P|f,g,ρ]=Pr[T˜S=TS|f,g]Pr[P˜S=PS|TS,f,g]Pr[P˜=P|PS,TS,f,g,ρ]. (30)

Each term on the right side of Eq 30 is a binomial distribution. In particular, for any values of f, g, and ρ,

Pr[T˜S=TS|f,g,ρ]=B(TS,T,f+g), (31)
Pr[P˜S=PS|TS,f,g,ρ]=B(PS,TS,f(1-pSFN)+gpSFPf+g),and (32)
Pr[P˜=P|PS,TS,f,g,ρ]=B(P-PS,T-TS,fρ(1-pAFN)+(1-g-f(1+ρ))pAFP1-f-g, (33)

the probabilities on the right side of Eqs 3133 are, respectively, pS, pS+, and pA+ in Eqs 1214 for any values of f, g and ρ, rather than their true but unknown values.

When data TS, PS, and P are obtained, the likelihoods, given the data, that the state of the environment is f, g, and ρ are

LTS(f,g|TS,T)=B(TS,T,pS(f,g)), (34)
LPS(f,g|PS,TS,T)=B(PS,TS,pS+(f,g)),and (35)
LP(f,g,ρ|P,PS,TS,T)=B(P-PS,T-TS,pA+(f,g,ρ)). (36)

The likelihood of the data {TS, PS} from symptomatic individuals only depends on the values of f and g and is

LS(f,g|PS,TS,T)=LPS(f,g|PS,TS,T)·LTS(f,g|TS,T). (37)

and the total likelihood of all the data {TS, PS, P} is

L(f,g,ρ|P,PS,TS,T)=LP(f,g,ρ|P,PS,TS,T)·LS(f,g|PS,TS,T). (38)

The likelihoods in Eqs 37 and 38 are products of binomial distributions that are well approximated, for sufficient numbers of tests, by the appropriate Gaussian distribution [30, 53]. In the results, we will explore log-likelihoods for both the binomial distributions and their Gaussian approximations.

Simplifying the likelihoods

Keeping our eyes on the prize of computing the risk of including infected but asymptomatic individuals in groups of different sizes, we focus on f and ρ when constructing the CR. Exploring the likelihood is more convenient if one can eliminate having to deal with g explicitly. Two methods are the profile likelihood and the marginal likelihood [49]; both reduce the number of parameters from 3 to 2.

For the profile likelihood, we replace g in Eqs 37 and 38 by the MLE g^, so that

LS,profile(f|PS,TS,T)=LPS(f,g^|PS,TS,T)·LTS(f,g^|TS,T)and (39)
Lprofile(f,ρ|P,PS,TS,T)=LP(f,g^,ρ|P,PS,TS,T)·LS(f,g^|PS,TS,T). (40)

For the marginal likelihood, we integrate Eqs 37 and 38 over g, so that

LS,marginal(f|PS,TS,T)=01LPS(f,g|PS,TS,T)·LTS(f,g|TS,T)dgand (41)
Lmarginal(f,ρ|P,PS,TS,T)=01LP(f,g,ρ|P,PS,TS,T)·LS(f,g|PS,TS,T)dg. (42)

By numerical exploration, I found that for the operational questions modeled here, the two methods give virtually the same results for the answers. Were we interested in the tails of the likelihood, this might not be the case. Since the profile likelihood is computationally much speedier, I report results using it. The third Rscript in S4 Section in S1 File allows one to explore the differences between marginal and profile likelihoods for the symptomatic data.

The compatibility regions from the profile likelihood

I computed the approximate 95% CR from the total profile likelihood using a generalization of the method of Hudson [51] by first finding the maximum value of the profile log-likehood and then determining the region in f, g, or f, ρ-space in which the log-likelihood was 2 ⋅ 1.96 = 3.92 below its maximum value.

I did computations using R Studio 1.0.143 with underlying R 3.6.1 GUI 1.70 El Capitan build (7684) on an iMac running Mac OS 12.1.

Results

Test range with no information on symptoms and positivity less than the probability of a false positive test

When positivity is less than the probability of false positive test, φ(f|P, T) has, similar to the likelihood, its maximum at f = 0 and monotonically declines. The peak value of φ(f|P, T) and the rate of decline depend on the positivity and the number of tests (Fig 5).

Fig 5. The normalized likelihoods for incidence rate.

Fig 5

Shown are normalized likelihoods (i.e., posteriors with a uniform prior) when pFN = 0.25, pFP = 0.05, and positivity is (A) 0.025, (B) 0.0125, or (C) 0.00625. The colored curves correspond to different numbers of tests shown in the legend inset; since positivity is specified, higher numbers of tests are associated with lower levels of positivity.

These normalized likelihoods In Fig 5 have a test range that depends on the number of tests (Fig 6). As with the situation in which positivity exceeds the probability of a false positive test, the test range declines with test numbers but at a decreasing rate.

Fig 6. The test range for incidence rate.

Fig 6

Shown are the test ranges for posteriors with a uniform prior when pFN = 0.25, pFP = 0.05, and positivity is (A) 0.025, (B) 0.0125, or (C) 0.00625.

Point estimates, compatibility regions, and risk when there is information on symptoms

In the base case for Monte Carlo simulations, I set N = 1000 replicates of T = 1500 tests. Since simulation and test errors scale as the reciprocal of their values, these choices have inherent errors of the order of 3%, which are sufficient to understand the qualitative patterns and most of the quantitative patterns. I chose the parameters for the true state of the world and the test errors from those reported in [2227]: ft = 0.05, gt = 0.04, and ρt = 1.5 and the test errors are pSFN = 0.25, pSFP = 0.03, pAFN = 0.5, and pAFP = 0.003. S3 Section in S1 File contains results for other choices of the true but unknown state of the world.

For the likelihood calculations and the associated risk computations, I first assume that the test results are the expected values TS, PS, and P, which is a reasonable assumption when T is large enough, after which I allow the test results to vary more widely. For the base case parameters, the mean values are T¯S=135,P¯S=58.05, and P¯=118.0575. Since actual test data can only produce integer values, I rounded the P¯S and P¯ to 58 and 118, respectively. Doing so gives the point estimates f^=0.04995,g^=0.04005, and ρ^=1.50119 (significant digits included to illustrate how little accuracy is lost by the rounding process; since the true values are ft = 0.05, gt = 0.04, and ρt = 1.5).

Illustrative simulated data

The nth replicate of the simulation of the testing process yields estimates f^n,g^n, and ρ^n. In Fig 7, I show the first 100 values of the simulation replicates. Each circle represents the value of f^n,g^n, or ρ^n on the nth replicate of the simulation. The thick red lines represent the averages over the entire 1000 simulations. There are also black lines at the true values of the three parameters.

Fig 7. Results of simulating the process of testing.

Fig 7

Shown (for ease of presentation) are the first 100 values of the point estimates for ft (upper left panel), gt (upper right panel), and ρt (lower left panel). The lower right panel is an expanded version of the point estimates for ρt. Each circle represents the value of f^n,g^n, or ρ^n on the nth replicate of the simulation. The thick red lines represent the averages over the entire 1000 simulations. There are also black lines at the true values of the three parameters. In the lower right panel, the y-axis is expanded to show that the mean of the ρ^n exceeds ρt; see the text for an explanation. The means of f^n and g^n essentially sit on top of the true values; again see the text for an explanation. The thin dotted lines show the means of the estimates ±1.96 times their standard deviations.

The means of f^n and g^n essentially sit on top of the true values, as we would expect from the analysis in Eqs 2227 showing that E(f^)=ft and E(g^)=gt. To quantify this agreement, I computed the mean relative error (ME) for the three estimates. For example, for f^, it is

ME(f^n)=1Nn=1Nf^n-ftft. (43)

For the simulation illustrated in Fig 7, ME(f^n)=0.0039 and ME(g^n)=0.0036 (i.e., both a fraction of a percent).

The lower right panel of Fig 7 has an expanded y-axis to show that the mean of the ρ^n exceeds ρt. For this run of the simulation, ME(ρ^n)=0.0141. While this is less than 2.0%, it is almost four times larger than the mean errors of f^n and g^n.

In Fig 7, the thin dotted lines show the means of the estimates ±1.96 times their standard deviations. These are a naive 95% compatibility interval under a Gaussian approximation because they ignore the other two parameters. Even so, for the full set of 1000 replicates of the simulation, the fractions of points outside this naive interval are 0.045, 0.055, and 0.05, respectively, for f^n, g^n, and ρ^n.

We conclude that the formulas for the MLEs accurately capture their true values. The specific results, of course, depend on the simulation results and the number of tests given (both addressed in the next section). For example, in a different run of 1000 simulations of the testing process, the mean relative errors were -0.0073, -0.0006, and -0.033 for f^n, g^n, and ρ^n respectively, and the fractions of points outside of the naive 95% CR were 0.054, 0.035, and 0.045 for f^n, g^n, and ρ^n, respectively.

Likelihood, compatibility regions, and the risk of groups of different sizes

In order to focus on a single value of “test data” we continue using the expected values of TS, PS, and P. After exploring the situation when test data are the mean value, we will vary the test data.

The likelihood of the symptomatic data

On the way to the goal of estimating the fraction of asymptomatic infections, it is worthwhile to briefly stop and explore the likelihood of the symptomatic data, which are independent of ρ (Eq 37). In Fig 8, I show the likelihood when the means of TS and PS are the test results. In this figure, the white dot denotes the true values of parameters and sits at the peak of the heat map.

Fig 8. The likelihood (Eq 37) of the fractions of individuals who are infected and symptomatic, f, and uninfected and symptomatic g, when the test data are the means of TS and PS.

Fig 8

The white dot denotes the true values of the parameters.

When the incidence rate is f and the fraction of symptomatic individuals who are uninfected is g, the mean value of TS = T(f + g) so that we expect a negative correlation between values of f and g, which is evidenced in the figure by the orientation of the contours of likelihood.

For the purposes of the risk calculation, the most important role of the likelihood function of the symptomatic data is to provide the MLE value of g for construction of the profile likelihood in Eq 40, to which we now turn.

The likelihood of all the data

In Fig 9, I show the profile likelihood (Eq 40) for f and ρ when the test data are the mean values of of TS, PS, and P. The banana shape of the contours of the 95% CR computed is a result of the nonlinearity in Eq 28. In this case, the likelihood is centered at the true values of the parameters (shown by the white dot), and when Eq 28 is converted to a function ρ(f) by using the MLE g^ and replacing f^ by f and ρ^ by ρ, the true values of the parameters sit on the resulting curve, which runs through the middle of the 95% CR.

Fig 9. The profile log-likelihood (Eq 40) for the fraction of individuals who are infected and symptomatic f and the ratio of of the fraction of individuals who are infected and asymptomatic to those who are infected and symptomatic ρ when the test data are the means of TS, PS, and P.

Fig 9

The white dot denotes the true values of the parameters. The 95% CR contour is shown in white for the exact binomial likelihoods and in gray for the Gaussian approximation to those likelihoods. The white dotted line is obtained by replacing g^ in Eq 28 by its MLE value, replacing f^ by an arbitrary value f, and then viewing the right side as an equation for the ratio ρ(f) of asymptomatic to symptomatic infected individuals.

One property of the range formula in Eq 2 is that test range declines as 1/T. That is, although the range declines as the number of tests increases, it does so at a decreasing rate [30, Figures 6-8 and p. 16ff]. This observation is more than an academic point, because it has the operational implication that it is possible to over-sample by providing too many tests in a single spatial region (also [30]).

In Fig 10, I explore the consequences of simultaneously increasing the number of tests and relaxing the assumption that the test data are the mean values of TS, PS, and P so that the true values (the white dots) no longer sit in the middle of the 95% CR or on the curve ρ(f) and the contours move in space, as determined by the test results. As with Eq 2, contours shrink as the number of tests increases, but at a decreasing rate.

Fig 10. The consequences of varying test numbers and letting test data vary from the mean values of TS, PS and P.

Fig 10

In each panel the white dot represents ft and ρt, and the dotted curve is the function ρ(f) described in the caption to the previous figure and which now depends on the test results. The upper left panel reproduces Fig 9, in which T = 1500 and the test data are the mean values of TS, PS, and P. In the other panels, the test data are a random realization of the simulation of the testing process and going clockwise from the upper left panel, T = 2000, 3000, 3500, 4000, and 4500.

The prize: The risk of including asymptomatic infected individuals in groups of different sizes

We are now able to compute the risk of including asymptomatic but infected individuals in groups of different sizes, to generate a curve analogous to Fig 1. For any values of f and ρ, the fraction of asymptomatic infected individuals in the population is ρf. Hence, the analogue of Eq 3 is

R(h,f,ρ)=1-(1-ρf)h. (44)

In Fig 11, I show the risk computed using the mean test data, profile likelihood and ρf=ρ^f^, the minimum of ρf on the 95% CR contour, or the maximum of ρf on the 95% CR contour in Eq 44. Clearly, one can invert Eq 44 in analogy to Eq 4 and compute analogues of the results shown in Fig 2.

Fig 11. The risk of including asymptomatic individuals in groups of different sizes.

Fig 11

The solid line corresponds to using the MLEs for f^,g^, and ρ^ in the risk formula (Eq 44), and the two dotted lines correspond to using the minimum and maximum values of ρf on the 95% CR contour.

Discussion

It is important to recognize that the analysis presented in this paper is a procedure that allows one to go from test information to the risk of including infected individuals in groups of various sizes when there is no information on symptoms at the time of testing or to the risk of including asymptomatic individuals in groups of different sizes when there is information on testing. Rather than being binary (risky or not), this risk is graded and the specific details of the relationship between group size and risk depends upon the operational details of testing such as test numbers and errors. Once these are specified, the procedures can be employed.

Let us now consider three limitations of the methods developed here. First note that Eq 26 has the same problem as Eq 1 when the positivity is very small. To see this, we factor out TS on the far right side of Eq 26 to obtain

f^=TST[PS/TS-pSFP1-pSFN-pSFP],

so that if the positivity rate among symptomatic individuals falls below the probability of a false positive test among symptomatic individuals, f^ is less than zero. As in the situation with no information on symptoms, the operational interpretation is that we then set f^ to 0. Alternatively, we may generalize the analysis for the simpler case by putting a prior on the parameters and determining the CR in that manner.

Second, an objection may be made that the binomial distribution underlying the analysis relies on the strong assumption that tests are independent events but that often groups of people will test together so that modeling the testing process requires an aggregated distribution. This is a fair objection, however: 1) the binomial distribution is the appropriate starting point, and if the sample is large and diverse enough (e.g., from many different testing sites), the independence assumption should be at least approximately valid; and 2) a negative binomial distribution of the form used in ecology to model aggregated counts [48, pp. 103–111] is a natural starting point for extending the work here.

Third, an objection may be made that we have assumed the values for test errors rather than estimating them. DiCiccio et al. [54] show that estimating test errors at the same time as incidence rate is a much more complex problem, and is likely one whose solution is not easily transferred to recommendations for practice. An alternative is to stratify test results by both symptomatic or not and time since putative exposure, have approximate values for the test errors for each time since exposure, and conduct sensitivity analysis by varying the values of the test errors. Furthermore, Mangel and Brown [30, pp. 25–27] show how to generalize Eq 1 for the case of a distribution of test errors using the delta-method. A similar extension of Eqs 18, 19 and 21 is a potential next step in this work.

In some locations, individuals are already asked whether they are symptomatic or not at the time of testing. For example, Nomi Health in Utah requires a self-reporting form for obtaining a coronavirus test, and the form includes yes or no questions such as: “Do you have a fever, a cough, new or increased shortness of breath, decreased smell or taste, a sore throat, muscle aches or pains, a headache, congestion or a runny nose, nauseas or vomiting, diarrhea, fatigue?”

During the 2020–2022 academic years, natural experiments in testing were occurring on college campuses [55]. The results of those tests will provide a trove of information to explore with the methods developed here.

Conclusions

In conclusion, the approach of modeling and simulating the process of testing before analyzing testing data leads to a range of insights and at least the following operational recommendations:

  • At the time of testing, collect information on whether and individual is symptomatic or not.

  • At the time of testing, collect information on putative time since exposure to infection.

  • Conduct experiments to obtain information on means and variances of test errors.

There is much to be done and no time to lose before the next pandemic.

Supporting information

S1 File. Including a brief review of the binomial distribution and likelihood, a mathematical appendix with details of calculations in the main text, sensitivity analysis when there is information on symptoms, and codes that generate the results in the main text and sensitivity analysis.

(PDF)

Acknowledgments

I thank Alan Brown for inviting me, early in the pandemic, to think about operational analysis of testing for coronavirus with him and Matt Shaffer for his steady encouragement during the work. For comments on presentations and this manuscript, I thank three anonymous referees, Tiffany Bogich, Rebecca Borchering, Alan Brown, Emily Howerton, John Ivancovich, Elyse Johnson, Chaya Pflugeisen, Matt Shaffer, Katriona Shea, and Joseph Travis.

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

MM was supported by a consulting contract with the Johns Hopkins University Applied Physics Laboratory. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Lederberg J. Pandemic as a natural evolutionary phenomenon. Soc Res. 1988. Oct 1:343–59. [Google Scholar]
  • 2. Birx D. Silent Invasion. New York: Harper Collins. 2022. [Google Scholar]
  • 3.Doughton S. Bill Gates: We must prepare for the net pandemic like we prepare for war. Seattle Times. 27 Jan 2021 [Cited 2021 Jan 27].
  • 4. Gates B. How To Prevent the Next Pandemic. New York: A.A. Knopf. 2022. [Google Scholar]
  • 5.Gilbert S. Vaccine vs Virus: This race and the next one. The 44th Dimbleby Lecture. 2021. [Cited 2022 Feb 21]. Available from https://www.ox.ac.uk/news/2021-12-07-professor-dame-sarah-gilbert-delivers-44th-dimbleby-lecture
  • 6. Budiansky S. Blackett’s War. The men who defeated the Nazi U-boats and brought science to the art of warfare. New York: A.A. Knopf. 2013. [Google Scholar]
  • 7. Mangel M. Applied mathematicians and naval operators. SIAM Review. 1982. Jul; 24(3):289–300. doi: 10.1137/1024064 [DOI] [Google Scholar]
  • 8.Morse PM. In at the beginnings: A physicist’s life. MIT Press; 1977.
  • 9. Tidman KR. The Operations Evaluation Group: A History of Naval Operations Analysis. Annapolis, MD: Naval Institute Press; 1984. [Google Scholar]
  • 10. Shelton AO, Mangel M. Reply to Sugihara et al: The biology of variability in fish populations. Proceedings of the National Academy of Sciences. 2011. Nov 29;108(48):E1226. Available from www.pnas.org/cgi/doi/10.1073/pnas.1115765108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Atkins BD, Jewell CP, Runge MC, Ferrari MJ, Shea K, Probert WJ, et al. Anticipating future learning affects current control decisions: A comparison between passive and active adaptive management in an epidemiological setting. J Theor Biol. 2020. Dec 7;506:110380. 10.1016/j.jtbi.2020.110380 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Howerton E, Ferrari MJ, Bjørnstad ON, Bogich TL, Borchering RK, Jewell CP, et al. Synergistic interventions to control COVID-19: Mass testing and isolation mitigates reliance on distancing. PLoS Comput Biol. 2021. Oct 28;17(10):e1009518. doi: 10.1371/journal.pcbi.1009518 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Huang NE, Qiao F, Wang Q, Qian H, Tung KK. A model for the spread of infectious diseases compatible with case data. Proceedings of the Royal Society A. 2021. Oct 27;477(2254):20210551. 10.1098/rspa.2021.0551 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Nichols JD, Bogich TL, Howerton E, Bjørnstad ON, Borchering RK, Ferrari M, et al. Strategic testing approaches for targeted disease monitoring can be used to inform pandemic decision-making. PLoS Biol. 2021. Jun 17;19(6):e3001307. doi: 10.1371/journal.pbio.3001307 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Reimer JR, Ahmed SM, Brintz BJ, Shah RU, Keegan LT, Ferrari MJ, et al. The effects of using a clinical prediction rule to prioritize diagnostic testing on transmission and hospital burden: A modeling example of early severe acute respiratory syndrome Coronavirus 2. Clin Infect Dis. 2021. Nov 15;73(10):1822–30. doi: 10.1093/cid/ciab177 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Shea K, Runge MC, Pannell D, Probert WJ, Li SL, Tildesley M, et al. Harnessing multiple models for outbreak management. Science. 2020. May 8;368(6491):577–9. doi: 10.1126/science.abb9934 [DOI] [PubMed] [Google Scholar]
  • 17. Struben J. The coronavirus disease (COVID-19) pandemic: simulation-based assessment of outbreak responses and postpeak strategies. Syst Dyn Rev. 2020. Jul;36(3):247–93. doi: 10.1002/sdr.1660 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Wan R, Zhang X, Song R. Multi-Objective Model-based Reinforcement Learning for Infectious Disease Control. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. Singapore [Virtual Event]. 2021 Aug 14–18. 1634–1644.
  • 19. Nuzzo J, Mullen L, Snyder M, Cicero A, Inglesby TV. Preparedness for a high-impact respiratory pathogen pandemic. Baltimore, MD: Johns Hopkins Center for Health Security. 2019. [Cited 2022 Mar 11]. Available from https://www.centerforhealthsecurity.org/our-work/publications/preparedness-for-a-high-impact-respiratory-pathogen-pandemic [Google Scholar]
  • 20. Kapczynski A. Order without intellectual property law: Open science in influenza. Cornell L. Rev. 2016;102:1539–1615. [PubMed] [Google Scholar]
  • 21. Stein JG. Take It off-site: World order and international institutions after COVID-19. In: Brands H, Gavin FJ. COVID-19 and World Order. Baltimore, MD: Johns Hopkins University Press; 2020. 259–76. [Google Scholar]
  • 22. Arevalo-Rodriguez I, Buitrago-Garcia D, Simancas-Racines D, Zambrano-Achig P, Del Campo R, Ciapponi A, et al. False-negative results of initial RT-PCR assays for COVID-19: a systematic review. PloS One. 2020. Dec 10;15(12):e0242958. 10.1371/journal.pone.0242958 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Green DA, Zucker J, Westblade LF, Whittier S, Rennert H, Velu P, et al. Clinical performance of SARS-CoV-2 molecular tests. J Clin Microbiol. 2020. Jul 23;58(8):e00995–20. 10.1128/JCM.00995-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. He JL, Luo L, Luo ZD, Lyu JX, Ng MY, Shen XP, et al. Diagnostic performance between CT and initial real-time RT-PCR for clinically suspected 2019 coronavirus disease (COVID-19) patients outside Wuhan, China. Respir Med. 2020. Jul 1;168:105980. doi: 10.1016/j.rmed.2020.105980 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25. Kucirka LM, Lauer SA, Laeyendecker O, Boon D, Lessler J. Variation in false-negative rate of reverse transcriptase polymerase chain reaction–based SARS-CoV-2 tests by time since exposure. Ann Intern Med. 2020. Aug 18;173(4):262–7. doi: 10.7326/M20-1495 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Sethuraman N, Jeremiah SS, Ryo A. Interpreting diagnostic tests for SARS-CoV-2. JAMA. 2020. Jun 9;323(22):2249–51. doi: 10.1001/jama.2020.8259 [DOI] [PubMed] [Google Scholar]
  • 27. Watson J, Whiting PF, Brush JE. Interpreting a COVID-19 test result. BMJ. 2020. May 12;369. 10.1136/bmj.m1808 [DOI] [PubMed] [Google Scholar]
  • 28.Waller L, Levi T. Building Intuition Regarding the Statistical Behavior of Mass Medical Testing Programs. Harvard Data Science Review. Special Issue 1—Covid-19: Unprecendented Challenges and Chances. 2021.
  • 29.Brown A, Mangel M. Operational Analysis for Coronavirus Testing: Recommendations for Practice. Laurel, MD: Johns Hopkins University Applied Physics Laboratory. 2021. [Cited 2022 Jan 7]. NSAD-R-21-014. Available from https://www.jhuapl.edu/Content/documents/OperationalAnalysisCoronavirusTesting.pdf
  • 30.Mangel M, Brown A. Operational Analysis for Coronavirus Testing. Laurel, MD: Johns Hopkins University Applied Physics Laboratory Report. 2021. [Cited 2021 Jan 7]. NSAD-R-21-041. Available from https://www.jhuapl.edu/Content/documents/MangelBrown.pdf
  • 31. Böttcher L, D’Orsogna MR, Chou T. A statistical model of COVID-19 testing in populations: effects of sampling bias and testing errors. Philos Trans A Math Phys Eng Sci. 2022. Jan 10;380(2214):20210121. 10.1098/rsta.20210121 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Vasiliauskaite V, Antulov-Fantulin N, Helbing D. On some fundamental challenges in monitoring epidemics. Philos Trans A Math Phys Eng Sci. 2022. Jan 10;380(2214):20210117. doi: 10.1098/rsta.2021.0117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Ioannidis JP, Cripps S, Tanner MA. Forecasting for COVID-19 has failed. Int. J. Forecast. 2020. Aug 25. 38:423–38. 10.1016/j.ijforecast.2020.08.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. McElreath R. Statistical rethinking: A Bayesian course with examples in R and Stan. Chapman and Hall/CRC; 2020. Mar 13. [Google Scholar]
  • 35. Morey RD, Hoekstra R, Rouder JN, Lee MD, Wagenmakers EJ. The fallacy of placing confidence in confidence intervals. Psychonomic bulletin & review. 2016. Feb;23(1):103–23. doi: 10.3758/s13423-015-0947-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Beale S, Hayward A, Shallcross L, Aldridge RW, Fragaszy E. A rapid review and meta-analysis of the asymptomatic proportion of PCR-confirmed SARS-CoV-2 infections in community settings. Wellcome Open Research. 2020;5:266 [Cited 2021 Jun 21]. doi: 10.12688/wellcomeopenres.16387.1 [DOI] [Google Scholar]
  • 37. Buitrago-Garcia D, Egli-Gany D, Counotte MJ, Hossmann S, Imeri H, Ipekci AM, et al. Occurrence and transmission potential of asymptomatic and presymptomatic SARS-CoV-2 infections: A living systematic review and meta-analysis. PLoS Med. 2020. Sep 22;17(9):e1003346. 10.1371/journal.pmed.1003346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Day M. Covid-19: four-fifth of cases are asymptomatic, China figures indicate. BMJ 2020:369:m1375. 2020. doi: 10.1136/bmj.m1375 [DOI] [PubMed] [Google Scholar]
  • 39. Ferguson J, Dunn S, Best A, Mirza J, Percival B, Mayhew M, et al. Validation testing to determine the sensitivity of lateral flow testing for asymptomatic SARS-CoV-2 detection in low prevalence settings: Testing frequency and public health messaging is key. PLoS Biol 2021. Apr 29;19(4):e3001216. 10.1371/journal.pbio.3001216 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. He X, Lau EH, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat Med. 2020. May;26(5):672–5. doi: 10.1038/s41591-020-0869-5 [DOI] [PubMed] [Google Scholar]
  • 41. He J, Guo Y, Mao R, Zhang J. Proportion of asymptomatic coronavirus disease 2019: A systematic review and meta-analysis. J Med Virol. 2021. Feb;93(2):820–30. doi: 10.1002/jmv.26326 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Kalish H, Klumpp-Thomas C, Hunsberger S, Baus HA, Fay MP, Siripong N, et al. Undiagnosed SARS-CoV-2 seropositivity during the first 6 months of the COVID-19 pandemic in the United States. Sci Transl Med. 2021. Jul 7;13(601):eabh3826. doi: 10.1126/scitranslmed.abh3826 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Li R, Pei S, Chen B, Song Y, Zhang T, Yang W, et al. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-CoV-2). Science. 2020. May 1;368(6490):489–93. doi: 10.1126/science.abb3221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Long QX, Tang XJ, Shi QL, Li Q, Deng HJ, Yuan J, et al. Clinical and immunological assessment of asymptomatic SARS-CoV-2 infections. Nat Med. 2020. Aug;26(8):1200–4. doi: 10.1038/s41591-020-0965-6 [DOI] [PubMed] [Google Scholar]
  • 45. Oran DP, Topol EJ. Prevalence of asymptomatic SARS-CoV-2 infection: A narrative review. Ann Intern Med. 2020. Sep 1;173(5):362–7. doi: 10.7326/M20-3012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Rasmussen AL, Popescu SV. SARS-CoV-2 transmission without symptoms. Science. 2021. Mar 19;371(6535):1206–7. doi: 10.1126/science.abf9569 [DOI] [PubMed] [Google Scholar]
  • 47. Shental N, Levy S, Wuvshet V, Skorniakov S, Shalem B, Ottolenghi A, et al. Efficient high-throughput SARS-CoV-2 testing to detect asymptomatic carriers. Sci Adv. 2020. Sep 11;6(37):eabc5961. doi: 10.1126/sciadv.abc5961 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Mangel M. The theoretical biologist’s toolbox: quantitative methods for ecology and evolutionary biology. Cambridge University Press; 2006. Jul 27. [Google Scholar]
  • 49. Hilborn R, Mangel M. The Ecological Detective. Confronting Models with Data. Princeton University Press. 1997. [Google Scholar]
  • 50. Edwards AWF. Likelihood. Expanded Edition, Baltimore and London: The Johns Hopkins University Press. 1992. [Google Scholar]
  • 51. Hudson DJ. Interval estimation from the likelihood function. Philos Trans R Soc Lond B Biol Sci. 1971. Jul;33(2):256–62. [Google Scholar]
  • 52. Severini TA. Likelihood methods in statistics. Oxford University Press; 2000. [Google Scholar]
  • 53. Feller W. An Introduction to Probability Theory and Its Applications, Volume 1. New York: John Wiley & Sons. 1968. [Google Scholar]
  • 54.DiCiccio TJ, Ritzwoller DM, Romano JP, Shaikh AM. Confidence intervals for seroprevalance. arXiv:2103.15018v2. 2021 Aug 9. Statistical Science in press
  • 55.Booeshaghi AS, Tan F, Renton B, Berger Z, Pachter L. Markedly heterogeneous COVID-19 testing plans among US colleges and universities. MedRxiv. Preprint. [Cited 2020 Oct 5]. 10.1101/2020.08.09.20171223 [DOI]

Decision Letter 0

Rehana Naz

26 Oct 2022

PONE-D-22-20932Operational analysis for coronavirus testing: Positivity is not incidence, risk is not binary, and the fraction of asymptomatic infections can be determinedPLOS ONE

Dear Dr. Mangel,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Dec 10 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Rehana Naz

Academic Editor

PLOS ONE

Journal Requirements:

. When submitting your revision, we need you to address these additional requirements.

Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following in the Acknowledgments Section of your manuscript: 

"Financial support was provided by a consulting agreement with the Johns Hopkins University Applied Physics Laboratory (APL). I thank Alan Brown for inviting me, early in the pandemic, to think about operations analysis of testing for coronavirus and Matt Shaffer for his steady support during the work."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. 

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: 

"MM was supported by a consulting contract with the Johns Hopkins University Applied Physics Laboratory. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

3. Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

Reviewer #3: Yes

********** 2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

********** 3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

********** 4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

********** 5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The author has shown new applications of a method to translate surface positivity to estimate incidence rates. The author has also discussed limitations of the study. The manuscript is well written and is a significant contribution to the field. I recommend it for publication.

Reviewer #2: The following points should be considered in the manuscript:

1-In the equation-6, the relationship is proportional to and how the author took the logarithm and the derivative of the two side themselves in the equation-7 and even for the far right side, the details must be explained in the manuscript.

2-In the figures-8, 9, and 10, the results give high possibilities of testing errors, of course based on the author results, however, this is not the actual case and the author must add lots of details about those results and their relation to the actual cases.

3-The title of the manuscript is a vague and only can be used in blogs rather than in scientific article and this title must be reworded.

4-There are some equations are approximated equations, especially, in thé appendices. So, the author must mention this point in the manuscript for each one.

5-There are some symbols or abbreviations used in the manuscript without defining them, so, the author must define each symbol even if the symbol is a well known.

6-It is not preferable at all using the first person pronoun 'I' as the author did in the manuscript, especially, in the abstract.

7-The reasons of choosing of the value of the (MLE) fˆ in the results of the figures must be explained in details.

8-The author in the title of the manuscript and other places of the manuscript has used the term 'coronavirus', however, the manuscript focuses on a specific type of the coronaviruses and this point must be reconsidered.

9-The computational errors of the calculations must be discussed with lots of details because the results of the fractions are not consistent with the actual case.

Reviewer #3: The author summarizes some results from a prior study, where an estimator is derived for the incidence rate if the surface positivity and probabilities of false positive and false negative tests are known and the range for the value of the incidence rate (‘test rate’) is obtained. A formula for the test range is derived in the case when the surface positivity is less than the probability of a false positive. Further, the results of the earlier study are generalized when information regarding symptomatic and asymptomatic individuals is known at the time of testing, including point estimate and range for the ratio of asymptomatic to symptomatic cases.

Comments:

The work deals with a very relevant problem, and is well communicated. Possible concerns regarding the approach have been spelt out and satisfactorily answered in the ‘Discussion’ section.

I would suggest that since the work draws from the previous reports ([29] and [30]), it may be a good idea to include the calculations from those studies in the mathematical appendix, as this will make this paper more self-contained.

There may be a typo on Line 281

********** 6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Feb 13;18(2):e0281710. doi: 10.1371/journal.pone.0281710.r002

Author response to Decision Letter 0


16 Dec 2022

Response to comments from the reviewers

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

Reviewer #3: Yes

Response: In response to the specific comments of Reviewer #2, detailed below, I addressed all the points that lead to the “Partly” answer.

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: No

Reviewer #3: Yes

Response: The statistical analysis, both Bayesian and likelihood methods, is indeed rigorously performed. I believe that the answer from Reviewer #2 is a result of the Reviewer expecting more details of some of the analytical steps. As described below, these are now included, both in the main text ant the supplementary information (depending upon the level of detail involved).

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Response: No response needed.

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

Response: No response needed.

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The author has shown new applications of a method to translate surface positivity to estimate incidence rates. The author has also discussed limitations of the study. The manuscript is well written and is a significant contribution to the field. I recommend it for publication.

Response: Thank you for these comments. Although the manuscript organization has not changed, I worked to improve and tighten the writing and, as detailed below in the response to Reviewer #2, I details of analysis at various points in the manuscript. These are indicated by the line numbers in the new version of the manuscript.

Reviewer #2: The following points should be considered in the manuscript:

Response: Thank you for these thoughtful comments, which helped me make the approach clearer.

1-In the equation-6, the relationship is proportional to and how the author took the logarithm and the derivative of the two side themselves in the equation-7 and even for the far right side, the details must be explained in the manuscript.

Response: I removed the proportional to in Eqn 6, using equality instead and then added a sentence below it explaining that although the mathematical equation is the same, the variables play different roles. I also expanded Supplementary information S1 Brief review of the binomial distribution and binomial likelihood to include a new sub-section that contains a derivation of Eqn 7.

I believe that the detailed derivation is better left in the supplementary information because it is possible to then include all of the mathematical details, taking nothing for granted on the part of the reader. That is, the mathematical tools are relative simple but used in mature ways and putting the derivation in the supplementary information allows me to be very explicit and clear about how the derivation works.

2-In the figures-8, 9, and 10, the results give high possibilities of testing errors, of course based on the author results, however, this is not the actual case and the author must add lots of details about those results and their relation to the actual cases.

Response: I now cite references 22-27 to justify the choice of the true state of nature and the test errors (lines 316-319) and make reference to supplementary information S3 Sensitivity analysis when there is information on symptoms for other assumptions about the true state of nature.

In addition, I added a point in the discussion (lines 403-410) to emphasize that what is developed here is a procedure to go from test information to the risk of including asymptomatic individuals in groups of different sizes, so that one can apply the procedure to their own choice of true states of nature and test errors.

3-The title of the manuscript is a vague and only can be used in blogs rather than in scientific article and this title must be reworded.

Response: As the reviewer surmised, I struggled with finding the right title. The title is now

“ Operational analysis for COVID-19 testing: Determining the risk from asymptomatic infections”

4-There are some equations are approximated equations, especially, in thé appendices. So, the author must mention this point in the manuscript for each one.

Response: The approximated equations are explained in more detail (lines 240-246 and below Eqn S19 in the Supporting information), along with additional citation to reference 30 in which the delta method is explained in detail.

5-There are some symbols or abbreviations used in the manuscript without defining them, so, the author must define each symbol even if the symbol is a well known.

Response: All symbols are now explained at the very first time they are introduced.

6-It is not preferable at all using the first person pronoun 'I' as the author did in the manuscript, especially, in the abstract.

Response: I removed the first person from the abstract, but after checking with the editor handling the manuscript left first person in some places in the main text, because otherwise I would have had to start writing in passive rather than active tense.

7-The reasons of choosing of the value of the (MLE) fˆ in the results of the figures must be explained in details.

Response: Additional explanations for the choice of MLE (lines 37-39) and the compatibility intervals and regions (lines 46-49, 262-264) are given.

8-The author in the title of the manuscript and other places of the manuscript has used the term 'coronavirus', however, the manuscript focuses on a specific type of the coronaviruses and this point must be reconsidered.

Response: I now refer to COVID-19 in the title and in the first sentence of the manuscript write “Entering the third year of the 2019 coronavirus disease (henceforth COVID-19)

9-The computational errors of the calculations must be discussed with lots of details because the results of the fractions are not consistent with the actual case.

Response: I have added discussion, particularly regarding Figure 10, concerning how we interpret the results of the calculations when they differ from the true state of nature. It is possible, however, that I misunderstand what is meant in this comment by “the actual case”

Reviewer #3: The author summarizes some results from a prior study, where an estimator is derived for the incidence rate if the surface positivity and probabilities of false positive and false negative tests are known and the range for the value of the incidence rate (‘test rate’) is obtained. A formula for the test range is derived in the case when the surface positivity is less than the probability of a false positive. Further, the results of the earlier study are generalized when information regarding symptomatic and asymptomatic individuals is known at the time of testing, including point estimate and range for the ratio of asymptomatic to symptomatic cases.

Comments:

The work deals with a very relevant problem, and is well communicated. Possible concerns regarding the approach have been spelt out and satisfactorily answered in the ‘Discussion’ section.

Response: An excellent summary.

I would suggest that since the work draws from the previous reports ([29] and [30]), it may be a good idea to include the calculations from those studies in the mathematical appendix, as this will make this paper more self-contained.

Response: The two previous reports are easily accessed through the links given in the list of references, so I have not followed this suggestion because it would add considerable length to the paper.

There may be a typo on Line 281

Response: I have rewritten the line, because the explanation of the method is given earlier in the paper. In essence, I am generalizing Hudson’s (1971) method in which the confidence interval (I use compatibility interval) is found by moving 2 units down from the peak of the log-likelihood (I am using 1.96) instead of 2 (lines 295-298) This is the case when there is no information on symptoms so that there is 1 unknown parameter. When there is information on symptoms, there are two unknown parameters so the distance from the peak of the likelihood is twice that of the univariate case.

Decision Letter 1

Rehana Naz

31 Jan 2023

Operational analysis for COVID-19 testing: Determining the risk from asymptomatic infections

PONE-D-22-20932R1

Dear Dr. Mangel,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Rehana Naz

Academic Editor

PLOS ONE

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

Reviewer #3: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Partly

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: (No Response)

Reviewer #2: The authors have improved lots of points in the revised version of the manuscript, especially the first and the third critical point. The manuscript is more preferable in the revised form.

Reviewer #3: All concerns have been adequately addressed and the updated manuscript may be accepted for publication.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: Yes: Adnan Ahmed Khan

**********

Acceptance letter

Rehana Naz

3 Feb 2023

PONE-D-22-20932R1

Operational analysis for COVID-19 testing: Determining the risk from asymptomatic infections

Dear Dr. Mangel:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof Rehana Naz

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 File. Including a brief review of the binomial distribution and likelihood, a mathematical appendix with details of calculations in the main text, sensitivity analysis when there is information on symptoms, and codes that generate the results in the main text and sensitivity analysis.

    (PDF)

    Data Availability Statement

    All relevant data are within the paper and its Supporting information files.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES