Skip to main content
Wiley - PMC COVID-19 Collection logoLink to Wiley - PMC COVID-19 Collection
. 2022 May 18;41(16):3131–3148. doi: 10.1002/sim.9408

Bayesian inference for asymptomatic COVID‐19 infection rates

Dexter Cahoy 1,, Joseph Sedransk 2
PMCID: PMC9347963  PMID: 35582808

Abstract

To strengthen inferences meta‐analyses are commonly used to summarize information from a set of independent studies. In some cases, though, the data may not satisfy the assumptions underlying the meta‐analysis. Using three Bayesian methods that have a more general structure than the common meta‐analytic ones, we can show the extent and nature of the pooling that is justified statistically. In this article, we reanalyze data from several reviews whose objective is to make inference about the COVID‐19 asymptomatic infection rate. When it is unlikely that all of the true effect sizes come from a single source researchers should be cautious about pooling the data from all of the studies. Our findings and methodology are applicable to other COVID‐19 outcome variables, and more generally.

Keywords: dirichlet process mixture, exchangeable random variables, meta‐analysis, pooling results, reversible jump Markov Chain Monte Carlo, SARS‐CoV‐2

1. INTRODUCTION

Meta‐analyses are commonly used to summarize information from a set of independent experiments, observational studies or sample surveys. Doing this may strengthen inferences when there are deficiencies in the individual studies such as small sample sizes. Methodology for combining findings from repeated research studies has a long history and, in particular, meta‐analyses have become very popular over the past 30 years. From an online search for “books meta‐analysis” we found 49 books. Thus, it was natural that early in 2020 several meta‐analyses were conducted (and subsequently published) about infection rates from the novel coronavirus. Looking at several early review papers we were concerned whether the meta‐analyses were carried out in an appropriate manner. Even after careful evaluation to include only studies thought to be comparable, there may be subsets of the collection of studies where the true (subset) effects are very different. If this is so, pooling the data from all of the studies may result in misleading conclusions. Borenstein et al. 1 add: “If the variation is substantial, then we might want to shift our focus…Rather it should be on the fact that the … effect differs from study to study. Hopefully, it would be possible to identify reasons … that might explain the dispersion.”

In this article, we consider three Bayesian methods that have a more general structure. One can use these methods to check the validity of the more standard approaches by investigating whether the set of true effect sizes come from a common source. If the assumptions underlying the standard approaches are not met, our proposed methodology will lead to more appropriate inferences.

From five review papers we selected for further analysis several studies that have different features. In each of these cases the objective is to make inference about the asymptomatic infection rate.

Please note that we are not evaluating specific meta‐analytic methods. Our concern is about appropriate aggregation of possibly disparate data.

Buitrago‐Garcia et al. 2 explain the importance of a review: “Accurate estimates of the proportions of true asymptomatic and presymptomatic infections are needed urgently because their contribution to overall SARS‐CoV‐2 transmission at the population level will determine the appropriate balance of control measures. If the predominant route of transmission is from people who have symptoms, then strategies should focus on testing, followed by isolation of infected individuals and quarantine of their contacts. If, however, most transmission is from people without symptoms, social distancing measures that reduce contact with people who might be infectious should be prioritized, enhanced by active case‐finding through testing of asymptomatic people.” Referring to a narrative review report (Oran and Topol 3 ) that presents a range (over studies) of 6% to 96% for the proportion of individuals positive for SARS‐CoV‐2 but asymptomatic, the authors point out the need for a careful review.

Standard meta‐analyses typically assume that the true effect sizes, μ=(μ1,,μL)t, come from a common source. Even after including only those studies thought to be comparable, μ may be composed of distinct subsets, each with a different underlying distribution. This seems likely for some of the reviews, for example, the 79 rates in Buitrago‐Garcia et al. 2 that range from 0.01 to 0.92. To make appropriate inferences the three Bayesian methods have a more general structure than that assumed in a standard meta‐analysis. The principal method, termed uncertain pooling, is flexible in that it can identify distinct subsets of μ: For example, for r subsets there would be r true effect sizes, ν1,,νr. Then, pooling the data from all of the studies may lead to misleading inferences. This methodology will also indicate when true effect sizes have a common source, thus leading to an appropriate inference. The more general structure should ensure greater concordance of the data with our model than with a more restricted model. A better fitting model should lead to better inference. Specifically, only similar studies will be combined. It is not surprising that there is strong statistical evidence that in three of the four data sets that we analyze (Section 4) the true effect sizes do not come from a single source. Then the analyst should be cautious about combining the data from all of the studies.

For a general discussion of Bayesian methods for meta‐analysis see Schmid, Carlin and Welton. 4 Borenstein et al. 1 is a basic treatment of fixed and random effects models for a meta‐analysis while Rice, Higgins and Lumley 5 re‐evaluate fixed effect(s) meta‐analysis.

Section 2 has brief descriptions of the data sets that we analyze, together with background information. The methodology is introduced in Section 3 while the results are summarized in Section 4. Section 5 has a brief summary and an extension that accommodates study level covariates with notes about the availability of covariates in the reviews we investigate.

2. STUDY DESCRIPTIONS

This section has brief descriptions of the meta‐analyses that we have analyzed together with some background information. The definitions of asymptomatic infection rate and the conditions required for including individual studies in the meta‐analysis differ considerably, and are too detailed to present all of them in this article.

The first paper that we considered was by He et al. 6 Using data from six studies they obtain estimates of the asymptomatic infection rate, noting that these measures differ considerably over the six studies, and explaining that this may be due to “differences in data collection, sample size, and the conditions.” Since the information from one of the six studies is inconsistent with that in the other five studies we include only the latter in our analyses. As seen in Table 1 the sample proportions range from 0.22 to 0.78 with little or no clustering.

TABLE 1.

Sample effect sizes, SEs, and posterior summaries from the basic uncertain pooling and DPmeta methods and binomial‐beta model for five COVID‐19 studies 6 of the asymptomatic infection rate

InvBeta prior for δ2 DPmeta
Study Effect PostMean SE 95% Cred Int
M=1/5
M=5
1 0.310 0.337 0.128 (0.139, 0.644) 0.292 0.348
2 0.565 0.582 0.103 (0.354, 0.766) 0.655 0.573
3 0.217 0.224 0.045 (0.144, 0.322) 0.246 0.238
4 0.667 0.663 0.061 (0.535, 0.779) 0.701 0.662
5 0.783 0.776 0.032 (0.706, 0.837) 0.743 0.769
Reversible jump MCMC
Study PostMean 95% Cred Int
1 0.273 (0.145, 0.566)
2 0.650 (0.445, 0.780)
3 0.233 (0.152, 0.325)
4 0.682 (0.547, 0.780)
5 0.759 (0.689, 0.827)

The first meta‐analysis concerning only the asymptomatic coronavirus disease rate is He et al. 7 In their Section 1 the authors note the importance of studying the asymptomatic rate and that this rate is not “well characterized.” They conduct meta‐analyses for all 41 studies and five subsets. The rates are markedly heterogeneous (proportions from 0.02 to 0.75 and total numbers of cases from 4 to 44 627), suggesting concern about the aggregation. Our analysis is for the children subgroup, with eleven studies. As seen in Table 2, the sample proportions range from 0.11 to 0.57 while the standard errors (SEs) range from 0.01 to 0.16. Unlike the first meta‐analysis, there is some apparent clustering.

TABLE 2.

Sample effect sizes, SEs, and posterior summaries from the basic uncertain pooling and DPmeta methods and binomial‐beta model for eleven COVID‐19 children studies 7 of the asymptomatic infection rate

InvBeta prior for δ2 DPmeta
Study Effect PostMean SE 95% Cred Int
M=1/6
M=6
1 0.129 0.132 0.012 (0.109, 0.157) 0.139 0.133
2 0.158 0.157 0.027 (0.114, 0.220) 0.142 0.155
3 0.530 0.526 0.046 (0.436, 0.613) 0.507 0.516
4 0.278 0.278 0.074 (0.134, 0.489) 0.231 0.271
5 0.129 0.156 0.060 (0.066, 0.308) 0.144 0.159
6 0.500 0.481 0.125 (0.247, 0.682) 0.497 0.455
7 0.571 0.516 0.132 (0.282, 0.736) 0.501 0.482
8 0.154 0.203 0.100 (0.065, 0.519) 0.163 0.192
9 0.200 0.243 0.126 (0.082, 0.557) 0.188 0.221
10 0.111 0.210 0.104 (0.037, 0.554) 0.173 0.197
11 0.556 0.487 0.165 (0.191, 0.753) 0.488 0.449
Reversible jump MCMC
Study PostMean 95% Cred Int
1 0.133 (0.109, 0.159)
2 0.157 (0.112, 0.215)
3 0.523 (0.430, 0.612)
4 0.272 (0.131, 0.489)
5 0.153 (0.084, 0.257)
6 0.490 (0.274, 0.658)
7 0.524 (0.276, 0.726)
8 0.174 (0.084, 0.378)
9 0.223 (0.088, 0.526)
10 0.171 (0.071, 0.411)
11 0.518 (0.266, 0.724)

The third data set is a subset of six of the eleven studies in He et al. 7 These six studies were chosen to illustrate properties when there is considerable separation. From Table 3, there are two apparent clusters with cluster proportions of about 0.13 and 0.55; overall, the SEs range from 0.01 to 0.16.

TABLE 3.

Sample effect sizes, SEs, and posterior summaries from the basic uncertain pooling and DPmeta methods and binomial‐beta model for six COVID‐19 children studies 7 of the asymptomatic infection rate

InvBeta prior for δ2 DPmeta
Study Effect PostMean SE 95% Cred Int
M=1/6
M=6
1 0.129 0.132 0.012 (0.109, 0.157) 0.135 0.132
2 0.158 0.150 0.027 (0.113, 0.211) 0.137 0.152
5 0.129 0.143 0.060 (0.066, 0.260) 0.138 0.152
6 0.500 0.521 0.125 (0.307, 0.708) 0.501 0.461
7 0.571 0.548 0.132 (0.343, 0.747) 0.504 0.492
11 0.556 0.537 0.165 (0.265, 0.769) 0.495 0.463
Reversible jump MCMC
Study PostMean 95% Cred Int
1 0.132 (0.109, 0.157)
2 0.150 (0.109, 0.200)
5 0.142 (0.082, 0.220)
6 0.528 (0.345, 0.705)
7 0.536 (0.342, 0.716)
11 0.546 (0.359, 0.741)

Buitrago‐Garcia et al. 2 has several important features, that is, the authors consider rates associated with both asymptomatic and presymptomatic cases and only include studies that document follow‐up and symptom status at the beginning and end of follow‐up or modeling. Their meta‐analyses are based on 79 studies, summarized for the entire set and seven subsets. The 79 rates are markedly heterogeneous (proportions from 0.01 to 0.92 and total numbers of cases from 2 to 1012). Our analysis is for the screening subgroup with seven studies, noted by Buitrago‐Garcia et al. 2 as being of special interest. Here, the proportions range from 0.17 to 0.50 while the SEs range from 0.04 to 0.35.

For additional background we give the criteria that He et al. 7 used to select the studies that they used in their meta‐analyses. Then we summarize features of the eleven children study, from He et al., 7 that we have analyzed. He et al. 7 searched two databases, PubMed and Embase, following the Preferred Reporting Items for Systematic Reviews and Meta‐Analyses (PRISMA) guideline. They included the following items: “COVID‐19” and analogous phrases and “Asymptomatic.” They included “articles reporting a specific number of asymptomatic infection cases in confirmed COVID‐19 patients, information describing the epidemiological and clinical features of COVID‐19.” There is no evidence of a risk‐of‐bias assessment or consideration of a sufficient follow‐up period. Byambasuren et al. 8 identify these as characteristics essential when making decisions about which studies to include in a meta‐analysis. Since He et al. 7 is the first meta‐analysis to make inferences about the asymptomatic infection rate, one may conjecture that they were motivated to publish their results quickly.

Of the eleven studies of children five papers are published in Chinese, so there are only summaries in English. Only three of the eleven papers give the age distribution of the children, although most give the mean age and some give the range. Seven of the papers give the sex distribution. In five studies all cases were associated with a single hospital while four studies summarized the results from many hospitals. There was no information for two studies. Several papers noted that most of the patients had a history of close contact with adults with COVID‐19.

3. METHODOLOGY

A common assumption in situations where combining data is plausible is: For i=1,,L,j=1,,ni, the Yij are independent

YiN(μi,σi2/ni) (1)

where Yi=j=1niYij/ni, the σi2 are known, L is the number of studies and ni is the number of replicates. Note that all of the analyses of COVID‐19 data that we consider make assumptions like (1).

3.1. Uncertain pooling

The first method, uncertain pooling, is based on Malec and Sedransk 9 and Evans and Sedransk. 10 Since this method may be unfamiliar, we describe it in some detail. They showed that a prior for μ=(μ1,μ2,,μL)t can be selected to reflect the beliefs that there are subsets of μ such that the μi in each subset are “similar”, and that there is uncertainty about the composition of such subsets of μ. Let G be the total number of partitions of the set ={1,,L}. Denote a particular partition by g=1,,G, let d(g) denote the number of subsets of in the gth partition (1d(g)L), and let Sk(g) denote the set of study labels in subset k for k=1,,d(g). For our analyses L=5,6,7, and 11 with G=37,G=203,G=877, and G=678,570, respectively. For other values of L the total number of partitions of an L‐element set is given by the Bell number, BL. Recent work (eg, Dahl et al. 11 ) proposed using prior information to place a prior on the set of partitions g=1,,G. This will increase the complexity of the computations but avoid the need to consider the G partitions explicitly.

To specify a prior for μ, first condition on g. Malec and Sedransk 9 and Evans and Sedransk 10 assume that there is independence between subsets, and within Sk(g) the μi are independent with

μi|νk(g)N(νk(g),δk2(g))iSk(g). (2)

Also, the νk(g) are mutually independent with

νk(g)|θk(g)N(θk(g),γk2(g)).

Conditioning on the variances above (but suppressing them in our notation), and letting γk2(g) leads to the following expected results. Defining y=(Y1,,YL)t and letting Δ={δk2(g):k=1,,d(g);g=1,,G}

E(μi|y,g,Δ)={λi(g)}μ^i+{1λi(g)}μ^k(g)iSk(g) (3)

and

cov(μi,μt|y,g,Δ)=δk2(g){1λi(g)}+{1λi(g)}2δk2(g)iSk(g)λi(g),i=t;i,tSk(g){1λi(g)}{1λt(g)}δk2(g)iSk(g)λi(g),it;i,tSk(g)0,iSk1(g),tSk2(g),k1k2, (4)

where

λi(g)=δk2(g)δk2(g)+σi2/ni,μ^k(g)=tSk(g)λt(g)μ^ttSk(g)λt(g),andμ^i=Yi. (5)

Inference about μ includes uncertainty about the value of g, that is,

f(μ|y)=f(μ|y,g,Δ)f(g,Δ|y)dgdΔ, (6)

where the notation is simplified by using integration rather than summation for g.

To evaluate (6) we need f(g,Δ|y). One must be careful about specifying how the γk2(g) because the models corresponding to the partitions have different numbers of parameters. We use a method described in Section 3 of Evans and Sedransk 10 that postulates little prior, relative to sample, information about the νk(g) and is invariant to changes in the scale of Y. Let ν(g)=(ν1(g),,νd(g)(g))t and K(f1(ν(g)),f2(ν(g)|y)) be the Kullback‐Leibler information about ν(g). With prior f(g,Δ)=f(g)f(Δ) and letting the γk2(g) subject to K(f1(ν(g)),f2(ν(g)|y)) = constant

f(g,Δ|y)f(Δ)f(g)expd(g)2k=1d(g)iSk(g){1λi(g)}1/2×exp12k=1d(g)iSk(g)λi(g)δk2(g){μ^iμ^k(g)}2, (7)

The factor in the exponent,

Q{d(g)}=k=1d(g)iSk(g)λi(g)δk2(g){μ^iμ^k(g)}2,

a consequence of the limit process just described, is the usual within sum of squares from a conventional, weighted, analysis of variance. Now, Q{d(g)} is likely to decrease as d(g) increases, for example for a new partition of k=1d(g)Sk(g) obtained by creating subsets of the existing {Sk(g)}. Since f(g,Δ|y) increases as Q{d(g)} decreases, it is helpful to have the second term, exp{d(g)/2}, that penalizes partitions with larger values of d(g).

For our analysis we take δk2(g)=δ2 and write λi(g)=δ2/(δ2+σi2/ni). Inference for μ is made using  (6) and (7) with

μ|y,g,δ2N(E(μ|y,g,δ2),V(μ|y,g,δ2)) (8)

where the conditional posterior moments of μ are given in (3) and (4).

Our analyses will indicate whether the true effect sizes come from a common source. If so, then using a standard meta‐analysis will provide appropriate inference. If not, several alternatives should be considered, as discussed below.

If a prior evaluation indicates that one of the studies, i, can be regarded as a gold standard we can consider the posterior distribution corresponding to study i to be the object of inference. Then, using the posterior expected value for illustration,

E(μi|y)=Eδ2,g|yE(μi|y,g,δ2),

where E(μi|y,g,δ2) is defined in (3). Thus, inference for μi is a function of μi^ together with data from the other L1 studies as determined by the form of (3), and, critically, by the likelihood associated with the set of subsets, Sk(g), containing study i. See Evans and Sedransk 10 for additional details and an application to a notable study of the effect of using aspirin by patients following a myocardial infarction.

Otherwise, one must rely on substantive evaluation to decide whether any distinct subsets identified in the analysis should be analyzed separately, for example, by separate standard meta‐analyses. If there are no covariates that can distinguish the subsets, then the distribution of the true effect for study i is a mixture distribution with unknown probabilities associated with the components.

If there are distinct subsets and a single analysis is presented it is important to include a credible interval for the overall true effect. Taken together with the presence of distinct subsets, a very wide interval for the overall true effect would be a strong indication that the single summary rate is not informative.

3.2. Dirichlet process mixture

An alternative to the uncertain pooling method is to use a Dirichlet process mixture (DPM), one of the most popular nonparametric Bayesian methods. This methodology, presented in detail in sections 2.1 and 2.2 of Müller et al., 12 is outlined below, paraphrasing the text in Müller et al. 12 Suppose that there is an observed i.i.d. sample

yi|HiidH,i=1,,n.

For Bayesian inference a prior probability model on H is needed. Starting with the basic Dirichlet process (DP), let M>0 and H0, the base distribution, be a probability measure defined on the sample space 𝒮. A DP with parameters (M,H0), is a random probability measure H defined on 𝒮 which assigns probability H(B) to every measurable set B. The DP is an infinite‐dimensional analog of the finite‐dimensional Dirichlet prior. In particular, E(H(B))=H0(B). If M, the precision parameter, is large H is highly concentrated around H0; as M the process is essentially H0.

With DP H is a discrete measure, so it is typical to extend the DP to DPM, by using a mixture over a simple parametric form such as a N(μ,σ2) pdf. Let Θ be a finite dimensional parameter space. For each θΘ, let fθ be a continuous pdf. Given a probability distribution H defined on Θ, a mixture of fθ with respect to H has the pdf

fH(y)=fθ(y)dH(θ).

This mixture model can be expressed as an equivalent hierarchical model, especially relevant for our application, that is,

yi|θiiidfθi (9)
θi|HiidH (10)

with HDP(M,H0).

For our analysis we have used the R function DPmeta from the package DPpackage: see Jara et al. 13 for additional details.

The model in DPmeta, for σi2 fixed, is (9) and (10) with yi=Yi,θi=μi,fθi the pdf of a N(μi,σi2/ni) random variable, and H0=N(η,τ2).

The (independent) hyperparameters are

M|a0,b0Gamma(a0,b0),
η|ηb,SbN(ηb,Sb),
τ2|φ1,φ2Gamma(φ1/2,φ2/2).

The uncertain pooling method requires only that one specify a prior distribution for g and δ2. By contrast DPmeta requires substantial prior input, that is, values for a0,b0,ηb,Sb,φ1 and φ2. We have concerns about the sensitivity of inferences to some of the choices of distributions in DPmeta and possible over‐fitting since there are more quantities to be specified than in the uncertain pooling method. Moreover, our analyses include data from only five, six, seven, and eleven studies. So, we have omitted the specification M|a0,b0Gamma(a0,b0) and made inferences for a selected set of values of M as suggested by Escobar. 14 Also, we omitted the step, η|ηb,Sb, and replaced η and τ2 with their maximum likelihood estimates.

3.3. Binomial‐beta model and reversible jump Markov chain Monte Carlo

As noted by a reviewer, a limitation of our uncertain pooling method is that the sample SEs are assumed to be known, as is typically done. There is contemporary research (Yao et al. 15 ) that models both the sample mean and the log of the sample SE. However, they assume a bivariate normal distribution for these two statistics, a questionable assumption for our (binomial) case. An alternative that we have investigated assumes a binomial likelihood together with beta and uniform prior distributions in a hierarchical model. The limitation, here, is that one must use Reversible Jump Markov Chain Monte Carlo (RJMCMC) for the computations, and implementation is substantially more difficult than the method presented in Section 3.1.

Recall that the methodology used in Section 3.1 is based on Evans and Sedransk 10 who use a constraint in the limit process to overcome the problem that the partitions, g, have different sizes, that is, that d(g) varies with g Without this adjustment p(g|y) would not be invariant to changes in the scale of the outcome variable, Y. RJMCMC addresses the problem of varying d(g) by introducing additional random variables that enable the matching of parameter space dimensions across the partitions. For a concise outline of the RJMCMC method see Gelman et al. 16 The pioneering paper (Green 17 ) provides the theoretical background for RJMCMC, but also includes, as an example, an application to the uncertain pooling methodology. We have used this model and RJMCMC to analyze the data in the four tables in Section 4.

Assume L independent responses, y1,,yL, with yibin(ni,θi). Within Sk(g)

θiiidbeta(qαk,q(1αk))iSk(g);k=1,,d(g). (11)

The group mean parameters, {αk}, are drawn independently from U(0,1) while log q U(log a, log b). Finally, p(g)d(g)1/#{g:d(g)=d(g)}.

Then the joint distribution of all the variables is p=p(g,α,q,θ,y) with

p=p(g)p(α,q|g)p(θ|g,α,q)p(y|g,α,q,θ)=p(g)p(α|g)p(q)p(θ|g,α,q)p(y|θ)=p(g)[k=1d(g)1]p(q)k=1d(g)iSk(g)θiqαk1(1θi)q(1αk)1)B(qαk,q(1αk))i=1Lniyiθiyi(1θi)niyi,

where the 1 in the last expression is the pdf of the (assumed) uniform distribution for αk.

Section 6.2 of Green 17 gives the full conditionals for θi,q and αj. The step involving a possible move from partition g to a new partition g is much more complicated. Green 17 uses a process that jumps between partitions making only the changes of splitting a group (a birth) and combining two groups (a death). There is an algorithm to select the groups to split and merge. Then births are attempted with probability bg and deaths with probability dg. Jumping to a new partition requires a change in the vector α since its length must increase or decrease by one unit. Several steps are required to develop the associated proposal, finally leading to a complicated acceptance probability.

For the pool‐all partition Hamza, van Houwelingen, and Stijnen 18 suggest

yiindbin(ni,θi)

with ηi=logit(θi) and

ηiiidN(ξ,κ2).

As an alternative it may be feasible to adapt this to the specification in (11), and implement it using RJMCMC.

4. RESULTS AND DISCUSSION

Our inferences are for the asymptomatic rates, that is, population proportions, and Tables 1, 2, 3, 4 use this representation. That is, in Tables 1, 2, 3, 4 we present the sample proportions and the SEs. The posterior means and intervals from basic uncertain pooling (Section 3.1) and summaries from DPmeta (Section 3.2) and RJMCMC (Section 3.3) are for the population proportion. However, for the computations for basic uncertain pooling and DPmeta we have used two different representations of Yi in (1), that is, Yi is the sample proportion, p^i, and Yi is the sample log odds, log[p^i/(1p^i)]. While we only present results for the latter representation, our findings are similar for both choices. From (1), (Yi,μi,σi2/ni) is replaced by

(log[p^i/(1p^i)],log[pi/(1pi)],1/[nipi(1pi)]). (12)

As is typically done in applications such as this, for example, DerSimonian and Laird, 19 we have replaced σi2 with an estimate from the sample. For each of the meta‐analyses we give in Table A1 the number of asymptomatic cases and observations for each of the component studies together with references where these data can be found.

TABLE 4.

Sample effect sizes, SEs, and posterior summaries from the basic uncertain pooling and DPmeta methods and binomial‐beta model for seven COVID‐19 screening studies 2 of the asymptomatic infection rate

InvBeta prior for δ2 DPmeta
Study Effect PostMean SE 95% Cred Int
M=1/6
M=6
1 0.500 0.377 0.353 (0.119, 0.838) 0.310 0.300
2 0.500 0.389 0.250 (0.148, 0.782) 0.310 0.307
3 0.333 0.332 0.136 (0.156, 0.568) 0.310 0.300
4 0.167 0.226 0.068 (0.088, 0.385) 0.302 0.253
5 0.273 0.288 0.067 (0.173, 0.416) 0.307 0.285
6 0.397 0.382 0.057 (0.283, 0.498) 0.318 0.351
7 0.297 0.300 0.038 (0.229, 0.380) 0.308 0.296
Reversible jump MCMC
Study PostMean 95% Cred Int
1 0.363 (0.178, 0.779)
2 0.376 (0.192, 0.761)
3 0.323 (0.172, 0.511)
4 0.260 (0.118, 0.378)
5 0.297 (0.190, 0.406)
6 0.360 (0.266, 0.481)
7 0.302 (0.232, 0.377)

For uncertain pooling inference for μ is made using (6). To start, evaluate the right side of (7) for

{g:g=1,,G;Dgrid points forδ2},

then standardize by dividing the individual terms in the grid by their sum. This provides an approximation for f(g,δ2|y). Then select a random sample of size B from the DG normalized values of f(g,δ2|y). For each selection, (g,δ2), sample μ from f(μ|y,g,δ2). For the logit case, transform μ to p=(p1,,pL)t at each step. Starting with a grid for δ2 and g with a very large range for δ2, we reduced the (g,δ2) space to make the 2D‐grid sampler faster. Specifically, we retained 99.2% of the probability associated with the extensive grid. We generated B=30000 values of μ for the eleven study case and B=10000 for the other cases. Finally, note that approximations for the marginal posterior distributions, that is, f(g|y) and f(δ2|y), can be obtained directly from the grid approximation of f(g,δ2|y).

As noted in Section 1, our study was motivated by a meta‐analysis by He et al. 6 that included early studies of the COVID‐19 asymptomatic infection rate. He et al. 6 carried out a standard meta‐analysis using a normal‐based random effects model with the effect sizes as the outcome random variable and assuming fixed SEs. Since there is considerable variation in the effect sizes it is prudent to be cautious and investigate whether the true effect sizes are from a single source.

Our analysis starts with the basic uncertain pooling method (Section 3.1) and DPmeta (Section 3.2). We summarize the results using the binomial‐beta model and RJMCMC (Section 3.3) at the end of this section. For the basic uncertain pooling method we assumed, a priori, that all 37 partitions have equal probability, that is, p(g)=1/37. For the prior on δ2, independent of g, we used two distributions:

a) InvBeta:p(δ2)1/(1+δ2)δ2,0<δ2<.b) InvGamma:p(δ2)(δ2)(α+1)exp(β/δ2),0<δ2<.

For a standard random effects model, Gelman 20 recommends using a half‐Cauchy prior distribution for δ, p(δ)1/(1+δ2). While our situation is very different, that is, many partitions and weighting depending on δ2 (not δ), we have adopted this suggestion in (a) by transforming the half‐Cauchy pdf to obtain the Inverse Beta (InvBeta) pdf of δ2. Previous research has shown that there are benefits to having the prior distribution for δ2 concentrated near 0, leading to the choice in (b) with α=11.01,β=0.001. In the following we present results only for (a) as those for (b) are similar.

As described in Section 2, a complete specification of DPmeta requires estimates of many hyperparameters. This seems inappropriate for meta‐analyses such as these. So, we have replaced η and τ2 with their maximum likelihood estimates, thus eliminating the need to specify the other hyperparameters. We have adopted the suggestion of Escobar 14 to use a small set of values for M, that is, {L1,L0,L1,L2}, typically augmented by a value for M much smaller than L1 and one larger than L2.

We start by discussing the meta‐analysis that motivated our investigation, that is, He et al. 6 We analyze the data (sample proportions, numbers of observations) from five of their studies, omitting one (Mizumoto et al. 21 ) whose estimates are based on extensive modeling. As such, Buitrago‐Garcia et al. 2 “considered …[this] study …separately, because of the different method of analysis,” and He et al. 7 excluded it entirely. Note that the numbers of asymptomatic cases and observations are included in Table A1.

In Table 1 we present for each of the five studies the sample effect size (sample proportion, p^), posterior expected value of the true effect size, standard error (SE), and 95% credible interval for the true effect size. Here, SE = p^q^/n where n is the total number of observations. From DPmeta, there are the posterior expected values of the true effect size corresponding to M=1/5 and 5 (a good representation of the six values we used in our analysis). The remaining columns give the posterior means and credible intervals obtained by using the Reversible Jump MCMC method. We summarize our results from the basic uncertain pooling method and DPmeta first, adding brief comments about the results obtained by using the RJMCMC method at the end of this section. In general, the results are consistent.

The posterior probability, p(g0|y), associated with the pool‐all partition, g0={1,2,3,4,5}, is minuscule, that is, 4×106. Now suppose that we consider the probability that there is any single large cluster. That is, we include all partitions where g={1,2,3,4,5} and g={(i1,i2,i3,i4),(i5)} for ij{1,2,3,4,5} where ijik for jk. The sum of these probabilities is 1.1×104, a very small quantity.

A standard way (Gelman 20 ) to assess the likelihood that the true effect sizes come from a common source is to assume the pool‐all model, g=g0, and evaluate the posterior predictive P‐value using a standard (Gelman, 20 (6.4)) discrepancy measure. With overall effect ν, define y=(Y1,,YL)t where Yi is now the sample log odds, log[p^i/(1p^i)], σi2/ni is 1/[nipi(1pi)] as in (12), and σ^i2/ni is 1/(nip^i(1p^i)). The discrepancy measure,

T(y,ν,δ2)=i=1L(Yiν)2((σ^i2/ni)+δ2), (13)

is based on the pool‐all model, as defined in (1) and (2). Then the posterior predictive P‐value is

Pr{T(yrep,ν,δ2)T(yobs,ν,δ2)|yobs}

with yrep denoting a replication from the pool‐all model. For these data the P‐value is 3.3×105, showing that the observed data are not concordant with the pool‐all model. These results show that it is highly unlikely that the five true effect sizes come from a single source.

Another way to analyze these data is to construct a similarity matrix. For each pair of studies, i and j, the similarity matrix gives the posterior probability that i and j are in the same cluster. We present in Figure 1 (top) a visual representation of the similarity matrix using the basic uncertain pooling method (Section 3.1) while Figure 1 (bottom) is the corresponding similarity plot using the RJMCMC methodology in Section 3.3. The study numbers are given on both the x and y axes and there is a legend showing, for each cell, the value of the pairwise probability. Figure 1 (top) shows limited strong clustering, {2,4}, while Figure 1 (bottom) shows some additional clustering, {1,3}, {2,4} and {4,5}. Thus, limited aggregation seems appropriate. This may not be surprising since the He et al. 6 review only included observations from the very early part of the COVID‐19 pandemic.

FIGURE 1.

SIM-9408-FIG-0001-c

Cell (i,j) gives the categorized posterior probability that studies i and j are in the same cluster. (Top) Basic uncertain pooling; (bottom) RJMCMC

From Table 1 the 95% credible intervals are very wide, a further indication of considerable uncertainty. Moreover, a 95% credible interval for the overall true effect size, ν, is 0.09ν0.91. This wide interval is a strong indication that a single summary value such as the posterior mean of ν would not be informative.

The results from applying DPmeta provide additional insight. When M is small, for example, M = 1/5, the posterior expected values show considerable clustering, that is, {2,4,5} and {1,3}, similar to those seen in Figure 1(bottom). This clustering is not surprising since choosing a very small value of M means that there is very limited sampling from the parametric centering measure H0. Using a moderate choice, M = 5, the five posterior expected values are close to those obtained using the basic uncertain pooling methodology as are the 95% credible intervals (intervals are not shown in Table 1).

The results in Table 1 and Figure 1 suggest that one should not pool the data from these five studies. The next review that we consider, He et al., 7 also shows the same issue, that is, questionable pooling of the data from all of the eleven studies. However, in this case, our analysis provides evidence of considerable clustering. Looking at the characteristics of the eleven studies could reveal the reasons for this clustering, and the direction to take to make appropriate inferences.

The analysis presented below uses a subset of the data in He et al., 7 namely the asymptomatic infection rate in eleven studies of children. The results are summarized in Table 2 and Figure 2, analogous to Table 1 and Figure 1.

FIGURE 2.

SIM-9408-FIG-0002-c

Cell (i,j) gives the categorized posterior probability that studies i and j are in the same cluster. (Top) Basic uncertain pooling; (bottom) RJMCMC

From Table 2 and Figure 2 it is apparent that there are several distinct subsets. Further investigation could reveal features that separate these subsets, leading to advances in understanding the differences in the asymptomatic rates.

The posterior probability, p(g0|y), associated with the pool‐all partition, is minuscule, that is, 1.5×1011. As described for the He et al. 6 study, the sum of the posterior probabilities associated with partitions having only a single cluster and 0, 1, 2, 3, 4, or 5 singleton subsets is 1.63×104. This result suggests that it is unlikely that the eleven true effect sizes come from a single source. Moreover, using the discrepancy measure in (13), the posterior predictive P‐value is 3.3×105, showing that the observed data are not concordant with the pool‐all model. Finally, a 95% credible interval for the overall true effect size, ν, is 0.15ν0.45. This interval, together with the clustering, is a strong indication that a single summary value such as the posterior mean of ν would not be informative.

From Figure 2 the most likely cluster is {3,6,7,11}, while the next most likely one is {1, 2, 5, 8, 9, 10}. In this case, there is considerable clustering but it does not extend to all eleven studies. Without additional evidence this analysis suggests conducting separate standard meta‐analyses for the two large subsets.

In Table 2 the posterior expected values for the individual studies in cluster {3, 6, 7, 11} are, essentially, an average of the corresponding four sample effect sizes, but reduced in magnitude. This reduction reflects the contributions from the data from the other seven studies.

The clustering seen in Figure 2 is also evident in the posterior expected values from DPmeta with small ; see Table 2 with M=1/6. With M=6 there is good agreement between the posterior expected values for the two methods. For most of the eleven studies there is good agreement in the credible intervals for the two methods with M=6 for DPmeta (not shown in Table 2).

The next analysis uses the data from a subset of six of the eleven studies in He et al. 7 These six studies were chosen to illustrate properties of the methodology when there is considerable separation. The results are summarized in Table 3 and Figure 3.

FIGURE 3.

SIM-9408-FIG-0003-c

Cell (i,j) gives the categorized posterior probability that studies i and j are in the same cluster. (Top) Basic uncertain pooling; (bottom) RJMCMC

From Figure 3 and Table 3 it is apparent that there are two distinct subsets, that is, {1,2,5} and {6,7,11}. Presumably this separation reflects different characteristics of the two populations and/or different ways that the studies were carried out. As expected, p(g0|y) is minuscule, that is, 3.1×106. Proceeding as described for the He et al. 7 study, the sum of the posterior probabilities associated with partitions having only a single large cluster (ie, with at least four members) is 1.1×103. Thus, there is no evidence that the true effect sizes from these six studies come from a single source. Moreover, using the discrepancy measure in (13), the posterior predictive P‐value is 1.3×104, showing that the observed data are not concordant with the pool‐all model. Finally, a 95% credible interval for the overall true effect size, ν, is 0.07ν0.72. This wide interval, together with the clustering, is a strong indication that a single summary value such as the posterior mean of ν would not be informative. The results from DPmeta are consistent with those given above. For a wide range of values of M the posterior expected values for {1, 2, 5} are about 0.14 while those for {6, 7, 11} are about 0.50. With M=6 the posterior expected values and intervals for {1, 2, 5} are quite close to those obtained by using the basic uncertain pooling methodology. For {6, 7, 11} almost all of the expected values and interval endpoints are within 0.06 of those from the uncertain pooling methodology.

The final analysis uses the data from the seven study screening subset in Buitrago‐Garcia et al., 2 identified as being of special interest. The results are summarized in Table 4 and Figure 4. In Table 4 the sample SEs play a large role. That is, for the studies {1,2} with the large SEs the posterior expected values are much smaller than the sample effect sizes. For the studies {5,6,7} with the small SEs the posterior expected values and sample effect sizes are approximately equal. Thus, apart from study 4, the posterior expected values suggest greater commonality than was apparent from the corresponding set of sample effect sizes. However, the individual posterior credible intervals are quite different. Using the discrepancy measure in (13), the posterior predictive P‐value is 0.40, indicating that there may be a common source for the true effects. This result is supported by Figure 4 which suggests relatively uniform clustering (except for studies 4 and 6).

FIGURE 4.

SIM-9408-FIG-0004-c

Cell (i,j) gives the categorized posterior probability that studies i and j are in the same cluster. (Top) Basic uncertain pooling; (bottom) RJMCMC

Since the data in each of Tables 2, 3, 4 are for a subset of a much larger set of studies they are likely to be substantially more homogeneous than the data in the full set of studies. For example, the seven screening studies (Table 4) are a subset of 79 studies with sample proportions ranging from 0.01 to 0.92.

Results from using DPmeta are, for the most part, consistent with these observations. For M[.001,1] all of the posterior expected values are approximately 0.31. For M=6 the posterior expected values range from 0.25 to 0.35, similar to the posterior means in Table 4. For M=6 the intervals corresponding to {5,6,7}, the studies with the smallest SEs, are similar to those from the basic uncertain pooling methodology.

With a few modifications we have implemented the RJMCMC procedure outlined in sections 6.2 and 6.3 of Green, 17 but expand the range for the prior for q by taking log qU(log 100, log 1000) throughout. We summarize by using the posterior means and 95% credible intervals (bottom of Tables 1, 2, 3, 4), and similarity plots (bottom of Figures 1, 2, 3, 4).

Since the sample likelihoods and prior distributions differ between the two approaches, that is, those based on the likelihoods and priors in Sections 3.1 and 3.3, comparisons of the results may not be especially meaningful. However, examining Tables 1, 2, 3, 4 it is notable that the summaries (posterior means and intervals) from the two approaches are consistent, generally close when the SEs are small, less so when the SEs are very large. There are no major differences between the comparable similarity plots corresponding to basic uncertain pooling and RJMCMC.

5. CONCLUDING REMARKS

The importance of good inferences for the COVID‐19 asymptomatic infection rates is clear, as noted in the quotation from Buitrago‐Garcia et al. 2 in Section 1. Conducting meta‐analyses is a common, often useful, way to summarize information from a collection of studies. However, inference will be misleading if there is pooling of data from studies that are not concordant. For example, Byambasuren et al. 8 note: “A recent review by the center for Evidence Based medicine in Oxford found a range of estimates of asymptomatic COVID‐19 cases which ranged from 5% to 80%. However, many of the identified studies were either poorly executed or poorly documented, making the validity of these estimates questionable.”

In this article, we reanalyze data from three review papers, using three Bayesian methods that have a more general structure than the common meta‐analytic ones. This methodology shows, in a principled manner, the extent and nature of the pooling that can be justified statistically. The more general structure should ensure greater concordance of the data with our model than with a more restricted model.

If the authors of a review have screened the studies so that the ones remaining for analysis have no markedly aberrant characteristics then an analysis showing distinct clusters should prompt a further review, and careful consideration of the inferences to make and the method to use.

In some situations there may be covariates associated with the studies that may help to explain differences in the outcomes. To illustrate, use the basic notation and a linear regression offset. That is, replace (1) with

YiN(μi+xitβ,σi2).

Then inference for μ can be made using the extension of (6)

f(μ,β|y)=f(β|μ,y,g,δ2)f(μ|y,g,δ2)f(g,δ2|y)dδ2dg,

where it is easily shown that

β|μ,y,δ2,gMVN(d,A1)

with zμ=(Y1μ1,,YLμL)t,X an L×p matrix of covariates, β a p×1 vector of regression coefficients, V an L×L diagonal matrix with (i,i)th element σi2, A=XtVX, c=zμtVX and d=A1ct.

In the reviews we have considered, only He et al. 7 gives more than one covariate for each study, that is, the number of confirmed cases and percent male. Using the covariates, an exploratory analysis of the residuals showed that such an augmented analysis will not improve inferences.

The three methods can be implemented. For DPmeta there is an R package (DPpackage) and the code for DPmeta is included as Supplementary Material. R packages are being developed for the two uncertain pooling methods, that is, basic uncertain pooling and RJMCMC.

Finally, for basic uncertain pooling and DPmeta both the sample proportion and logit of the sample proportion are used in applications. While we have presented results only for the latter, our findings are similar for both choices.

CONFLICT OF INTEREST

The authors declare no potential conflict of interests.

Supporting information

Data S1 Supplementary Material

ACKNOWLEDGEMENTS

The authors are grateful to the reviewers for their comments which have improved the focus of the article and motivated further methodological development. They are also grateful to Professor Peter Green for his assistance in applying Reversible Jump MCMC to our data. They also appreciate research allocation grants from XSEDE's Pittsburgh Supercomputing Center.

Observed numbers of asymptomatic cases (Y) and observations (n)

1.

 

TABLE A1.

Observed numbers of asymptomatic cases and observations (Y/n)

Study Table 1, a Table 2, b Table 3, b Table 4, c
1 4/13 94/728 94/728 1/2
2 13/23 27/171 27/ 171 2/4
3 18/83 61/115 4/12
4 40/60 10/ 36 5/30
5 130/166 4/31 4/31 12/44
6 8/16 8/16 29/73
7 8/14 8/14 41/138
8 2/13
9 2/10
10 1/9
11 5/9 5/9
a

Table 1: Effect sizes and SEs are given in figure 4 of He et al. 6 (mislabelled as case fatality rate), but the n must be obtained from the background papers: 1. Nishiura et al. 22 ; 2. Kimball et al. 23 ; 3. Song et al. 24 ; 4. Serra 25 ; 5. Day. 26

b

Tables 2 and 3: (Y,n) are given in figure 3 of He et al. 7

c

Table 4: (Y,n) are given in figure 1 of Buitrago‐Garcia et al. 2

Cahoy D, Sedransk J. Bayesian inference for asymptomatic COVID‐19 infection rates. Statistics in Medicine. 2022;41(16):3131–3148. doi: 10.1002/sim.9408

References

  • 1. Borenstein M, Hedges L, Higgins P, Rothstein H. A basic introduction to fixed‐effect and random‐effects models for meta‐analysis. Res Synth Methods. 2010;1:97‐111. doi: 10.1002/jsrm.12 [DOI] [PubMed] [Google Scholar]
  • 2. Buitrago‐Garcia D, Egli‐Gany D, Counotte M, et al. Occurrence and transmission potential of asymptomatic and presymptomatic SARSCoV‐2 infections: a living systematic review and meta‐analysis. PLoS Med. 2020. doi: 10.1371/journal.pmed.1003346 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Oran D, Topol E. Prevalence of asymptomatic Sars‐Cov‐2 infection: a narrative review. Annals of Internal Medicine. 2020. doi: 10.7326/M20-3012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Schmid C, Carlin B, Welton N. Bayesian methods for meta‐analysis. In: Schmid H, Stijnen T, White IR, eds. Handbook of Meta‐Analysis. Boca Raton FL: Chapman & Hall/CRC Press; 2021:91‐127. [Google Scholar]
  • 5. Rice K, Higgins J, Lumley T. A re‐evaluation of fixed effect(s) meta‐analysis. J Royal Stat Soc A. 2018;81(1):205‐227. [Google Scholar]
  • 6. He W, Yi G, Zhu Y. Estimation of the basic reproduction number, average incubation time, asymptomatic infection rate, and case fatality rate for COVID‐19: meta‐analysis and sensitivity analysis. J Med Virol. 2020;92(11):2543‐2550. doi: 10.1002/jmv.26041 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. He J, Guo Y, Mao R, Zhang J. Proportion of asymptomatic coronavirus disease 2019: a systematic review and meta‐analysis. J Med Virol. 2021;93(2):820‐830. doi: 10.1002/jmv.26326 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Byambasuren O, Cardona M, Bell K, Clark J, McLaws M, Glasziou P. Estimating the extent of true asymptomatic COVID‐19 and its potential for community transmission: systematic review and meta‐analysis. Offic J Assoc Med Microbiol Infect Dis Canada. 2020. doi: 10.3138/jammi-2020-0030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Malec D, Sedransk J. Bayesian methodology for combining the results from different experiments when the specifications for pooling are uncertain. Biometrika. 1992;79(3):593‐601. [Google Scholar]
  • 10. Evans R, Sedransk J. Combining data from experiments that may be similar. Biometrika. 2001;88(3):643‐656. [Google Scholar]
  • 11. Dahl D, Day R, Tsai J. Random partition distribution indexed by pairwise information. J Am Stat Assoc. 2017;112:721‐732. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Müller P, Quintana FA, Jara A, Hanson T. Bayesian Nonparametric Data Analysis. Springer Series in Statistics. Cham: Springer; 2015. [Google Scholar]
  • 13. Jara A, Hanson T, Quintana FA, Müller P, Rosner G. DPpackage: Bayesian semi‐ and nonparametric modeling in R. J Stat Softw. 2011;40(5):1‐30. doi: 10.18637/jss.v040.i05 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Escobar M. Estimating normal means with a Dirichlet process prior. J Am Stat Assoc. 1994;89(425):268‐277. [Google Scholar]
  • 15. Yao Y, Ogden T, Zeng C, Chen Q. Bivariate hierarchical Bayesian model for combining summary measures and their uncertainties from multiple sources; 2021. arXiv:2109.07560[stat.ME].
  • 16. Gelman A, Carlin B, Stern H, Rubin D. Bayesian Data Analysis. 2nd ed. Boca Raton FL: Chapman & Hall/CRC Press; 2004. [Google Scholar]
  • 17. Green P. Reversible jump Markov chain Monte Carlo computation and Bayesian model determination. Biometrika. 1995;82(4):711‐732. [Google Scholar]
  • 18. Hamza T, van Houwelingen H, Stijnen T. The binomial distribution of meta‐analysis was preferred to model within‐study variability. J Clin Epidemiol. 2008;61:41‐51. [DOI] [PubMed] [Google Scholar]
  • 19. DerSimonian R, Laird N. Meta‐analysis in clinical trials. Control Clin Trials. 1986;7(3):177‐188. doi: 10.1016/0197-2456(86)90046-2 [DOI] [PubMed] [Google Scholar]
  • 20. Gelman A. Prior distributions for variance parameters in hierarchical models. Bayesian Anal. 2006;1(3):515‐533. [Google Scholar]
  • 21. Mizumoto K, Kagaya K, Zarebski A, Chowell G. Estimating the asymptomatic proportion of coronavirus disease 2019 (COVID‐19) cases on board the Diamond Princess cruise ship, Yokahama, Japan, 2020. Euro Surveill. 2020;25(10). doi:10.2807/1560‐7917 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Nishiura H, Kobayashi T, Miyama T, et al. Estimation of the asymptomatic ratio of novel coronavirus infections (COVID‐19). Int J Infect Dis. 2020;94:154‐155. doi: 10.1016/j.ijid.2020.03.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Kimball A, Hatfield K, Arons M, et al. Asymptomatic and pre‐symptomatic SARS‐CoV‐2 infections in residents of a long‐term care skilled nursing facility ‐ King County, Washington, March 2020. MMWR Morbidity Mortality Weekly Rep. 2020;69:377‐381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Song H, Xiao J, Qiu J, et al. A considerable proportion of individuals with asymptomatic SARS‐CoV‐2 infection in Tibetan population; 2020. MedRxiv. doi: 10.1101/2020.03.27.20043836 [DOI]
  • 25. Serra M. Coronavirus, Castiglione d'Adda e un caso di studio: 'll 70% dei donatori di sangue 'e positivo'. Top News. Lastampa, Italy; 2020. https://it.notizie.yahoo.com/coronavirus‐castiglione‐d‐adda‐il‐065252105.html?guccounter=1%26guce_referrer=aHR0cHM6Ly93d3cuZ29vZ2xlLmNvbS8%26guce_referrer_sig=AQAAAI4TytcLgIOu9fy3U2v5N8NhGzb0_B0BAu‐9y4CITl4pJ9JfDLdo0UQ0dBAUKbQoLOr94nKkh6l5fFEUlhjTYiu4YtSQ‐UgXuQle0TpbgPhCk3UbhlnqtHcRmXZKxtJFUcsawQ1hcXL94MnTucZEc1RaUpuct9OURAMuirKvhWFz
  • 26. Day M. COVID‐19: four fifths of cases are asymptomatic. China figures indicate. BMJ. 2020;369:m1375. doi: 10.1136/bmj.m1375 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data S1 Supplementary Material


Articles from Statistics in Medicine are provided here courtesy of Wiley

RESOURCES