Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Sep 8;372:113410. doi: 10.1016/j.cma.2020.113410

Visualizing the invisible: The effect of asymptomatic transmission on the outbreak dynamics of COVID-19

Mathias Peirlinck a, Kevin Linka a, Francisco Sahli Costabal b, Jay Bhattacharya c, Eran Bendavid c, John PA Ioannidis c,d, Ellen Kuhl a,
PMCID: PMC7831913  PMID: 33518823

Abstract

Understanding the outbreak dynamics of the COVID-19 pandemic has important implications for successful containment and mitigation strategies. Recent studies suggest that the population prevalence of SARS-CoV-2 antibodies, a proxy for the number of asymptomatic cases, could be an order of magnitude larger than expected from the number of reported symptomatic cases. Knowing the precise prevalence and contagiousness of asymptomatic transmission is critical to estimate the overall dimension and pandemic potential of COVID-19. However, at this stage, the effect of the asymptomatic population, its size, and its outbreak dynamics remain largely unknown. Here we use reported symptomatic case data in conjunction with antibody seroprevalence studies, a mathematical epidemiology model, and a Bayesian framework to infer the epidemiological characteristics of COVID-19. Our model computes, in real time, the time-varying contact rate of the outbreak, and projects the temporal evolution and credible intervals of the effective reproduction number and the symptomatic, asymptomatic, and recovered populations. Our study quantifies the sensitivity of the outbreak dynamics of COVID-19 to three parameters: the effective reproduction number, the ratio between the symptomatic and asymptomatic populations, and the infectious periods of both groups. For nine distinct locations, our model estimates the fraction of the population that has been infected and recovered by Jun 15, 2020 to 24.15% (95% CI: 20.48%-28.14%) for Heinsberg (NRW, Germany), 2.40% (95% CI: 2.09%-2.76%) for Ada County (ID, USA), 46.19% (95% CI: 45.81%-46.60%) for New York City (NY, USA), 11.26% (95% CI: 7.21%-16.03%) for Santa Clara County (CA, USA), 3.09% (95% CI: 2.27%-4.03%) for Denmark, 12.35% (95% CI: 10.03%-15.18%) for Geneva Canton (Switzerland), 5.24% (95% CI: 4.84%-5.70%) for the Netherlands, 1.53% (95% CI: 0.76%-2.62%) for Rio Grande do Sul (Brazil), and 5.32% (95% CI: 4.77%-5.93%) for Belgium. Our method traces the initial outbreak date in Santa Clara County back to January 20, 2020 (95% CI: December 29, 2019–February 13, 2020). Our results could significantly change our understanding and management of the COVID-19 pandemic: A large asymptomatic population will make isolation, containment, and tracing of individual cases challenging. Instead, managing community transmission through increasing population awareness, promoting physical distancing, and encouraging behavioral changes could become more relevant.

Keywords: COVID-19, Epidemiology, Asymptomatic transmission, Machine learning, Uncertainty quantification, Bayesian inference

Highlights

  • Recent studies suggest that COVID-19 remains asymptomatic in most cases.

  • The asymptomatic transmission of COVID-19 is invisible and remains poorly understood.

  • We visualize asymptomatic transmission using an SEIIR model within a Bayesian framework.

  • We illustrate the interplay of symptomatic and asymptomatic populations for nine locations.

  • Understanding asymptomatic transmission could change the way we manage of COVID-19.

1. Motivation

Since its outbreak in December 2019, the COVID-19 pandemic has rapidly swept across the globe and is now affecting 188 countries with more than 25 million cases reported worldwide [1]. In the early stages of a pandemic, doctors, researchers, and political decision makers mainly focus on symptomatic individuals that come for testing and address those who require the most urgent medical attention [2]. In the more advanced stages, the interest shifts towards mildly symptomatic and asymptomatic individuals who – by definition – are difficult to trace and likely to retain normal social and travel patterns [3]. In this manuscript, we collectively use the term “asymptomatic” for undetected individuals who either have mild symptoms that are not directly associated with COVID-19 or display no symptoms at all. Recent antibody seroprevalence studies suggest that the number of asymptomatic COVID-19 cases outnumbers the symptomatic cases by an order of magnitude or more [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26]. Estimating the prevalence and contagiousness of these asymptomatic cases is critical since it will change our understanding of the overall dimension and the pandemic potential of COVID-19 [27]. Yet, at this stage, the effect of the asymptomatic population, its size, and its outbreak dynamics remain largely unknown [28].

The first evidence of asymptomatic individuals in a family cluster of three was reported in late January, where one individual was mildly symptomatic and two remained asymptomatic, with normal lymphocyte counts and chest computer tomography images, but positive quantitative reverse transcription polymerase chain reaction tests [29]. As of mid June, more than 50 studies have reported an asymptomatic population, twenty-three of them with a sample size of at least 500 [30], with a median undercount of 20 across all studies, suggesting that only one in twenty COVID-19 cases is noticed and reported. These studies are based on polymerase chain reaction or antibody seroprevalence tests in different subgroups of the population, at different locations, at different points in time [4], [21], [31]. To no surprise, the reported undercount varies hugely, ranging from 3.5 and 5.0 in Luxembourg [18] and Germany [21] to 543 and 627 in Iran [17] and Japan [6] respectively. Most of these studies are currently only available on preprint servers, but an increasing number is now passing peer review, including a study of 1402 individuals in Wuhan City with an undercount of 22.1 [26], a study of 400 health care workers in London with an undercount of 35.0 [32], a community spreading study of 131 patients with influenza-like symptoms in Los Angeles with an undercount of 100.0 [33], and a seroprevalence study in Los Angeles county with an undercount of 43.5 [34]. The reported trend across all studies is strikingly consistent: A much larger number of individuals displays antibody prevalence than we would expect from the reported symptomatic case numbers. Knowing the exact dimension of the asymptomatic population is critical for two reasons: first, to truly estimate the severity of the outbreak, e.g., hospitalization or mortality rates [27], and second, to reliably predict the success of surveillance and control strategies, e.g., contact tracing or vaccination [35].

While there is a pressing need to better understand the prevalence of asymptomatic transmission, it is also becoming increasingly clear that it will likely take a long time until we can, with full confidence, deliver reliable measurements of this asymptomatic group [36]. In the meantime, mathematical modeling can provide valuable insight into the tentative outbreak dynamics and outbreak control of COVID-19 for varying asymptomatic scenarios [3]. Many classical epidemiology models base their predictions on compartment models in which individuals pass through different stages as they experience the disease [37]. A popular model to simulate the outbreak dynamics of COVID-19 is the SEIR model [2], which is made up of four compartments for the susceptible, exposed, infectious, and recovered populations [38]. Here, to explicitly account for the asymptomatic population, we introduce an SEIIR model, which further divides the infectious population into symptomatic and asymptomatic groups [39]. Similar models have recently been used to study the general role of asymptomatic carriers in disease transmission [40] and to illustrate how asymptomatic individuals have facilitated the rapid spread of COVID-19 throughout China [3], South Korea [41], and Italy [42]. While it is tempting – and easily possible – to introduce many more sub-populations into the model, for example a pre-symptomatic, hospitalized, or mortality group [43], here, we focus on the simplest possible model that allows us to explore the role of the asymptomatic population throughout the COVID-19 pandemic. To systematically probe different scenarios, we combine this deterministic SEIIR model with a dynamic effective reproduction number, machine learning, and uncertainty quantification to learn the reproduction number – in real time – and quantify and propagate uncertainties in the symptomatic-to-asymptomatic ratio and in the initial exposed and infectious populations. We show that this not only allows us to visualize the dynamics and uncertainties of the dynamic contact rate, the effective reproduction number, and the symptomatic, asymptomatic, and recovered populations, but also to estimate the initial date of the outbreak.

2. Methods

2.1. Epidemiology modeling

SEIIR epidemiology model. We model the epidemiology of COVID-19 using an SEIIR model with five compartments, the susceptible, exposed, symptomatic infectious, asymptomatic infectious, and recovered populations. Fig. 1 illustrates our SEIIR model, which is governed by a set of five ordinary differential equations,

S˙=S[βsIs+βaIa]E˙=+S[βsIs+βaIa]αEI˙s=+νsαEγsIsI˙a=+νaαEγaIaR˙=+γsIs+γaIa, (1)

where the fractions of all five populations add up to one, S+E+Is+Ia+R=1.

Fig. 1.

Fig. 1

SEIIR epidemiology model. The SEIIR model contains five compartments for the susceptible, exposed, symptomatic infectious, asymptomatic infectious, and recovered populations. The transition rates between the compartments, β, α, and γ are inverses of the contact period B=1β, the latent period A=1α, and the infectious period C=1γ. The symptomatic and asymptomatic groups have the same latent period A, but they can have individual contact periods Bs=1βs and Ba=1βa and individual infectious periods Cs=1γs and Ca=1γa. The fractions of the symptomatic and asymptomatic subgroups of the infectious population are νs and νa. We assume that the infection either goes through the symptomatic or the asymptomatic path, but not both for one individual.

We assume that both the symptomatic group Is and the asymptomatic group Ia can generate new infections. We introduce these two groups by fractions νs and νa of the subjects transferring from the exposed group E. We postulate that the two infectious groups Is and Ia have the same latent period A=1α, but can have individual contact periods Bs=1βs and Ba=1βa to mimic their different community spreading, and individual infectious periods Cs=1γs and Ca=1γa to mimic their different likelihood of isolation. From the infectious fractions, we can derive the overall contact and infectious rates β and γ from their individual symptomatic and asymptomatic counterparts, βs, βa, γs, and γa,

β=νsβs+νaβaandγ=νsγs+νaγa. (2)

Similarly, we can express the overall contact and infectious periods B and C in terms of their symptomatic and asymptomatic counterparts, Bs, Ba, Cs, and Ca,

B=BaBsνsBa+νaBsandC=CaCsνsCa+νaCs. (3)

Naturally, the different dynamics for the symptomatic and asymptomatic groups also affect the basic reproduction number R0, the number of new infections caused by a single one individual in an otherwise uninfected, susceptible population,

R0=CB=CaCsBaBsνsBa+νaBsνsCa+νaCs=νsβs+νaβaνsγs+νaγa=βγ. (4)

For a large asymptomatic group νa1, the basic reproduction number approaches the ratio between the infectious and contact periods of the asymptomatic population, R0CaBa, which could be significantly larger than the basic reproduction number for the symptomatic group, R0=CsBs, that we generally see reported in the literature. To characterize the effect of changes in social behavior and other interventions that may affect contact, we assume that the contact rate β(t) can vary as a function of time [44], but is the same for the symptomatic and asymptomatic groups,

β=βs=βa=β(t)such thatR(t)=CB(t)=CaCs[νsCa+νaCs]B(t)=β(t)νsγs+νaγa=β(t)γ. (5)

This introduces a time-varying effective reproduction number R(t), which is an important real time characteristic of the current outbreak dynamics.

SEIR epidemiology model. For the special case when the dynamics of the symptomatic and asymptomatic groups are similar, i.e., βs=βa=β and γs=γa=γ, we can translate the SEIIR model (1) into the classical SEIR model (6) with four compartments, the susceptible, exposed, infectious, and recovered populations [45], [46], [47], [48].

For this special case, we can back-calculate the symptomatic and asymptomatic groups from equation (6).3 as Is=νsI and Ia=νaI. Fig. 2 illustrates the SEIR model, which is governed by a set of four ordinary differential equations [49],

S˙=βSIE˙=+βSIαEI˙=+αEγIR˙=+γI. (6)

Fig. 2.

Fig. 2

SEIR epidemiology model. The SEIR model contains four compartments for the susceptible, exposed, infectious, and recovered populations. The transition rates between the compartments, β, α, and γ are inverses of the contact period B=1β, the latent period A=1α, and the infectious period C=1γ. If the transition rates are similar for the symptomatic and asymptomatic groups, the SEIIR model simplifies to the SEIR model with Is=νsI and Ia=νaI.

Discrete SEIIR epidemiology model. We focus on the SEIIR model with five ordinary differential equations (1), which we discretize in time using a finite difference approximation (˙)=[()n+1()n]Δt of the evolution of all five populations ()=S,E,Is,Ia,R. Here Δt=tn+1tn denotes the discrete time increment, usually Δt=1 day, and ()n+1 and ()n denote the populations of the new and previous time steps. We apply an explicit time integration scheme to obtain the following discrete system of equations,

Sn+1=[1[βsIs,n+βaIa,n]Δt]SnEn+1=+[βsIs,n+βaIa,n]ΔtSn+[1αΔt]EnIs,n+1=+νsαΔtEn+[1γsΔt]Is,nIa,n+1=+νaαΔtEn+[1γaΔt]Ia,nRn+1=+[γsIs,n+γaIa,n]Δt+Rn. (7)

At each location, we begin our simulation on day t0, the day of the beginning of the local lockdown, with initial conditions, S0, E0, Is0, Ia0, R0. We further assume that S0=1E0Is0Ia0, E0=[ACs][Is0+Ia0] [50], and νsIa0=[1νs]Is0 to express all five initial conditions in terms of the initial symptomatic infectious population Is0, such that S0=1[1+ACs]νsIs0, E0=[ACs]νsIs0, Ia0=[1νs]νsIs0, and R0=0. From the solution of the time discrete SEIIR system (7), for each point in time, we calculate the detected population,

Dn+1=Is,n+1+Rs,n+1=Dn+νsαEnΔt, (8)

as the discrete sum of the symptomatic infectious and recovered populations Is,n+1 and Rs,n+1 with R˙s=γsIs, or, equivalently, as the explicit update of the previous detected population Dn with the new symptomatic infectious population, νsαEn, that transitions from the exposed to the symptomatic infectious group. We compare the daily simulated detected population D(t) against the daily reported detected population Dˆ(t), which we obtain by scaling the daily number of detected cases by the total population N for each location [51], [52], [53], [54], [55], [56].

2.2. Seroprevalence studies

We consider seroprevalence studies up to mid June 2020 where the representation ratio, the ratio between the amount of seroprevalence samples and the representative population, is larger than 0.02%. Subsequently, we draw the daily number of confirmed reported cases for Heinsberg, Ada County, New York City, Santa Clara County, Denmark, Geneva Canton, Netherlands, Rio Grande do Sul, and Belgium from online data repositories [51], [52], [53], [54], [55], [56]. The symptomatic fraction νs follows from the ratio between the reported confirmed cases for the studied region and the amount of seroprevalence-estimated cases on the last day of the seroprevalence study. Table 1 summarizes the study date, number of samples, representation ratio, population, and symptomatic fraction of each location.

Table 1.

Seroprevalence studies with a representative population larger than 0.02%, location, study date, number of samples, representation ratio, population, and symptomatic fraction νs.

Location Study date Samples Population Repr. ratio Symptomatic fraction νs Ref.
Heinsberg, NRW, Germany March 30–April 6 919 41946 2.191% 20.00% (95%CI: 16.32%–25.20%) [21]
Ada County, ID, USA Late April 4856 481587 1.008% 7.90% (95%CI: 6.40%–9.82%) [5]
New York City, NY, USA May 5–Jun 5 28523 8398748 0.340% 5.76% (95%CI: 5.68%–5.84%) [14]
Santa Clara County, CA, USA April 2–3 3330 1781642 0.187% 1.77% (95%CI: 1.05%–3.82%) [4]
Denmark April 6–17 9496 5824857 0.163% 6.95% (95%CI: 5.13%–13.12%) [8]
Geneva Canton, Switzerland April 20–27 576 504031 0.114% 10.34% (95%CI: 7.66%–16.44%) [22]
Netherlands April 1–15 7361 17282163 0.043% 17.31% (95%CI: 15.14%–19.95%) [20]
Rio Grande do Sul, Brazil April 25–27 4188 11377239 0.037% 8.10% (95%CI: 3.72%–22.07%) [19]
Belgium April 14–May 13 2700 11492641 0.023% 10.21% (95%CI: 8.61%–12.18%) [16]

2.3. Bayesian learning

To calibrate our SEIIR model (1), we compare the model output, the simulated detected population D(t) for a given parameter set ϑ, against the data, the reported detected population Dˆ(t) for each location [51], [52], [53], [54], [55], [56]. Our time-discrete SEIIR model (7) uses the following set of parameters,

ϑ={A,Cs,Ca,νs,β(t),E0,Is0,Ia0,σ}, (9)

the latency period A, the symptomatic and asymptomatic infectious periods Cs and Ca, the symptomatic fraction νs, the dynamic contact rate β(t), the initial exposed and infectious populations E0, Is0, and Ia0, and the likelihood width σ. To reduce the set of unknowns, we fix the latency period to A=2.5 days and the symptomatic infectious period to Cs=6.5 days [57], [58], [59], [60]. Since the asymptomatic infectious period Ca is unreported, we study three cases with Ca=0.5,1.0,2.0Cs resulting in infectious periods of Ca=3.25, 6.5, and 13.0 days. We further express the initial exposed and asymptomatic infectious populations in terms of the symptomatic infectious population as E0=[ACs]νsIs0 [50] and Ia0=[1νs]νsIs0. This results in the following reduced set of parameters,

ϑ={νs,β(t),Is0,σ}, (10)

including the symptomatic fraction νs, the dynamic contact rate β(t), the initial symptomatic infectious population Is0, and the likelihood width σ.

Bayesian inference. We use Bayesian inference to estimate the set of parameters ϑ such that the statistics of the output of the model D(t) agree with those of the data Dˆ(t) [61], [62], [63], [64]. We relate the prior, likelihood, and posterior using Bayes’ theorem [65],

π(ϑDˆ(t))=π(Dˆ(t)ϑ)π(ϑ)π(Dˆ(t)), (11)

where π(ϑ) is the prior, i.e., the probability distribution of the model parameters, π(Dˆ(t)|ϑ) is the likelihood, i.e., the conditional probability of the data when the parameter set is fixed, π(ϑ|Dˆ(t)) is the posterior, i.e., the conditional probability of the parameters for given data Dˆ(t), and π(Dˆ(t)) is the evidence.

Priors. For the prior probability distributions π(ϑ), we assume weakly informative priors for our model parameters, the symptomatic fraction νs, the dynamic contact rate β(t), the initial symptomatic infectious population Is0, and the likelihood width σ. For the symptomatic fraction νs, we adopt the normal distributions from the individual local antibody seroprevalence studies of all nine locations [4], [5], [8], [14], [16], [19], [20], [21], [22]. Specifically, we set νsN(μs,σνs), where μs and σs are the mean and standard deviation of the symptomatic fraction based on the ratio of the reported and seroprevalence-estimated numbers of confirmed cases for each location in Table 1. For the dynamic contact rate β(t), we postulate that its logarithm follows a Gaussian random walk, log(β(t))GRW(t,μβ,σβ). Specifically, this Gaussian random walk assumes a sequence of inter-dependent Gaussian distributions where the contact rate β(tn+1) on day tn+1 depends on the contact rate β(tn) of the previous day tn with the initial condition, log(β0)N(μβ0,σβ), and the daily update, log(β(tn+1))β(tn)N(log(β(tn)),σβ), where μβ0 is the mean initial contact rate, σβ is the daily step width, and μβ is the overall drift between the initial value log(β(t0)) and the final value log(β(tn+1)). For the initial symptomatic infectious population Is0, we postulate a log-normal distribution, log(Is0)N(μIs0,σIs0), where we select a weakly informative prior for the mean μs=Dˆ(t0), as the reported detected population on day t0 and a standard deviation of σs=1. Table 2 summarizes our prior distributions and the SEIIR model parameters.

Table 2.

Prior distributions and SEIIR model parameters.

Parameter Interpretation Distribution
νs Symptomatic fraction See Table 1[4], [5], [8], [14], [16], [19], [20], [21], [22]

A Latent period Fixed (A= 2.5 days) [43], [58], [59]
Cs Symptomatic infectious period Fixed (Cs= 6.5 days) [57], [59], [60]
Ca Asymptomatic infectious period Fixed (Ca= 3.25, 6.5, 13.0 days) [57], [59], [60]

log(β(t)) Dynamic contact rate Gaussian random walk (μ=μβ,σ=σβ)
μβ0 Initial contact rate Normal (μ=0.0,σ=1.0)
μβ Overall drift Normal (μ=0.0,σ=1.0)
σβ Daily step width Half-normal (σ=0.02)

E0 Initial exposed Deterministic ([ACs][Is0+Ia0])[50]
log(Is0) Initial symptomatic Normal (μ=Dˆ(t0),σ=1.0)
Ia0 Initial asymptomatic Deterministic ([1νs]νsIs0)

σ Likelihood width HalfCauchy (β=1.0)

Likelihood. For the likelihood π(Dˆ(t)|ϑ), we introduce a likelihood function L that evaluates the proximity between the model output D(t), the simulated detected population for a given parameter set ϑ, and the data Dˆ(t), the relative reported detected population in probabilistic terms,

L(ϑ,t)=studentTν=4(Dˆ(t)D(ϑ,t),σ). (12)

We choose a Student’s t-distribution because it resembles a Gaussian distribution around the mean with heavy tails, which make the method more robust with respect to outliers and reporting noise [66], [67]. Here σ represents the likelihood width for which we postulate a halfCauchy distribution, see Table 2. The likelihood,

π(Dˆ(t)ϑ)=i=0nL(ϑ,ti)=i=0nstudentTν=4(Dˆ(ti)D(ϑ,ti),σ), (13)

is simply the product of all likelihood functions L evaluated at each time ti for i=0,1,,n.

Posteriors. With the prior π(ϑ) and the likelihood π(Dˆ(t)ϑ), we estimate the posterior π(ϑDˆ(t)) using Bayes’ theorem (11) [68]. Since we cannot describe the posterior distribution over the model parameters ϑ analytically, we adopt approximate-inference techniques to calibrate our model on the available data. We use the NO-U-Turn sampler (NUTS) [69], which is a type of Hamiltonian Monte Carlo algorithm as implemented in PyMC3 [70]. We use four chains. The first 4 times 500 samples serve to tune the sampler and are later discarded. The subsequent 4 times 1000 samples define the posterior distribution of the parameters ϑ. From the converged posterior distribution π(ϑDˆ(t)), we sample multiple combinations of parameters ϑ that describe the time evolution of detected population Dˆ(t). Using these posterior samples, we quantify the uncertainty of each parameter. As such, each parameter set ϑ provides a set of values for the symptomatic fraction νs, the initial symptomatic populations Is0, and the dynamic contact rate β(t). From these values, we quantify the effective reproductive number R(t) using Eq. (5) and the time evolution of the susceptible, exposed, symptomatic infectious, asymptomatic infectious and recovered populations, S(t), E(t), Is(t), Ia(t), and R(t) using Eqs. (7) and report their values with the associated 95% credible interval [71].

Hierarchical modeling. To estimate the asymptomatic infectious period Ca, we create a hierarchical model [72] and analyze the confirmed case data from all nine locations in Table 1. To prevent overfitting, we postulate that the initial reproduction number R0 is the same for all nine locations and remains constant during a two-week time window prior to lockdown. This implies that the contact rate β is static and constant. We assume a normal distribution for the basic reproduction number R0N(μR0,σR0) with mean μR0=3.87 and standard deviation σR0=0.44 [73]. This defines the initial contact rate β0i at each location, i=1,,9, as β0i=R0(νsiCs+(1νsi)Cai). We create a hyper-distribution for the overall asymptomatic infectious period, CaN(μCa,σCa), with mean μCa=6.5 and standard deviation σCahalfN(2.0), and assume that each local asymptomatic infectious period CaiN(Ca,σCa) is drawn from this hyperdistribution. All other parameters are similar to the previous section. Table 3 summarizes the prior distributions and the SEIIR model parameters of our hierarchical model to infer the asymptomatic infectious period Ca.

Table 3.

Prior distributions and SEIIR model parameters to hierarchically infer the asymptomatic infectious period Ca.

Parameter Interpretation Distribution
νs Symptomatic fraction See Table 1[4], [5], [8], [14], [16], [19], [20], [21], [22]

A Latent period Fixed (A= 2.5 days) [43], [58], [59]
Cs Symptomatic infectious period Fixed (Cs= 6.5 days) [57], [59], [60]

Ca Asymptomatic infectious period Normal (μ=6.5,σ=σCa)
σCa Standard deviation Half-normal (σ=2.0)
Cai Local asymptomatic infectious period Normal (μ=Ca,σ=σCa)

R0 Initial reproduction number Normal (μ=3.87,σ=0.44)[73]

E0 Initial exposed Deterministic ([ACs][Is0+Ia0])[50]
log(Is0) Initial symptomatic Normal (μ=Dˆ(t0),σ=1.0)
Ia0 Initial asymptomatic Deterministic ([1νs]νsIs0)

σ Likelihood width HalfCauchy (β=1.0)

To estimate the hierarchical asymptomatic infectious period Ca, we use Bayesian inference. Following Eq. (11), we relate the prior, likelihood, and posterior, with the only difference that the simulated and reported detected populations Di(t) and Dˆi(t) are now vectors that contain the detected populations of all i=1,,9 locations. The likelihood π(Dˆ(t)ϑ) becomes the product of the all likelihood functions L evaluated for a period of n=14 days prior to lockdown,

π(Dˆ(t)ϑ)=i=0nL(ϑ,ti)=i=0nj=19studentTν=4(Dˆj(ti)Dj(ϑ,ti),σ). (14)

For each location, we estimate the posterior π(ϑ|Dˆi(t)) using the NO-U-Turn sampler (NUTS) [69] in PyMC3 [70]. From the converged posterior distribution, we sample multiple ϑ combinations to quantify the evolution of the reproduction number R0 and contact rate β at each location, and the associated susceptible, exposed, symptomatic and asymptomatic infectious, and recovered populations with their associated 95% credible intervals.

Outbreak date. For each sample from the posterior distribution, we use the inferred initial exposed and asymptomatic infectious populations E0 and Ia0 to estimate the date of the very first COVID-19 case in Santa Clara County [74]. Specifically, for each parameter set, we create an SEIIR model and assume that the outbreak begins with one single asymptomatic infectious individual. We fix the latency and symptomatic infectious periods to A=2.5 days and Cs=6.5 days and impose the hierarchically estimated infectious period Ca from the previous section. For each posterior sample for the exposed, symptomatic infectious, and asymptomatic infectious population size on the lockdown date, March 16, 2020, we use the Nelder–Mead optimization method [75] to find the most probable outbreak origin date. Specifically, we solve the SEIIR model forward in time using an explicit time integration, starting from various start dates before March 16, 2020, and iteratively minimize the difference between the computed exposed, symptomatic, and asymptomatic infectious populations and the sample’s actual exposed, symptomatic, and asymptomatic infectious populations. We concomitantly fit a static contact rate β, which is bounded between zero and the posterior sample’s estimated contact rate β0 at the beginning of the simulation t0. Repeating this process for each sample of the Bayesian inference generates a distribution of possible origin dates. From this distribution, we compute the most probable origin date and its uncertainty.

3. Results

3.1. Outbreak dynamics of COVID-19 in Santa Clara County

Fig. 3 illustrates the outbreak dynamics of COVID-19 in Santa Clara County. To explore the effects of the asymptomatic infectious period Ca on the initial effective reproduction number, we start the simulations on March 6, 2020, 10 days before the local lockdown date. The black dots highlight the reported detected population Dˆ(t) from this day forward. Based on these data points, we learn the posterior distributions of our SEIIR model parameters for fixed latent and symptomatic infectious periods A=2.5 days and Cs=6.5 days, and for three asymptomatic infectious periods, Ca=3.25, 6.5, and 13.0 days, from left to right. The gray and green–purple regions highlight the 95% credible intervals on the confirmed cases D(t), top row, and the contact rate β(t), bottom row, based on the reported cases Dˆ(t), while taking into account uncertainties on the symptomatic infectious fraction νs, and the initial exposed and infectious populations E0, Is0, and Ia0. The red, orange, and green histograms on the middle row display the inferred initial exposed and infectious populations, E0, Is0, and Ia0, for the three different asymptomatic infectious periods. The graphs confirm that our dynamic SEIIR epidemiology model is capable of correctly capturing the gradual flattening and deflattening of the curve of confirmed cases in agreement with the decrease in new cases reported in Santa Clara County, top row. The consistent downward trend of the contact rate β(t) after the lockdown date (March 16, 2020) quantifies the efficiency of public health interventions. The different magnitudes in the contact rate highlight the effect of the three different asymptomatic infectious periods Ca: For larger asymptomatic infectious periods Ca, from left to right, to explain the same number of confirmed cases D(t)=Is(t)+Rs(t), the contact rate β(t) has to decrease. On March 6, 2020, the mean contact rate β(t) was 0.688 (95% CI: 0.612–0.764) for an infectious period of Ca = 3.25 days, 0.529 (95% CI: 0.464–0.593) for Ca = 6.5 days, and 0.463 (95% CI: 0.403–0.523) for Ca = 13.0 days. By March 16, 2020, the day Santa Clara County announced the first county-wide shelter-in-place order in the entire United States, these contact rates β(t) were 0.491 (95% CI: 0.462–0.523) for an infectious period of Ca = 3.25 days, 0.328 (95% CI: 0.305–0.352) for Ca = 6.5 days, and 0.252 (95% CI: 0.234–0.271) for Ca = 13.0 days.

Fig. 3.

Fig. 3

Outbreak dynamics of COVID-19 in Santa Clara County. The simulation learns the time-varying contact rate β(t) for fixed latent and symptomatic infectious periods A=2.5 days and Cs=6.5 days, and for three asymptomatic infectious periods Ca=3.25 days, 6.5 days, and 13.0 days (from left to right). Computed and reported confirmed cases in Santa Clara County, D(t)=Is(t)+Rs(t) and Dˆ(t) (top), initial exposed and infectious populations, E0, Is0, and Ia0 (middle), and dynamic contact rate, β(t) (bottom). The gray and green–blue regions highlight the 95% credible intervals on the confirmed cases D(t) (top) and the contact rate β(t) (bottom) based on the reported cases Dˆ(t), while taking into account uncertainties on the fraction of the symptomatic infectious population νs=IsI, and the initial exposed and infectious populations E0, Is0, and Ia0.

3.2. Effect of asymptomatic transmission of COVID-19 in Santa Clara County

Fig. 4 visualizes the effect of asymptomatic transmission in Santa Clara County. The simulation learns the dynamic contact rate β(t), and with it the time-varying effective reproduction number R(t), top row, for three asymptomatic infectious periods Ca=3.25 days, 6.5 days, and 13.0 days, from left to right. The effective reproduction number R(t) follows a similar downward trend as the contact rate β(t). For larger asymptomatic infectious periods Ca, from left to right, since R(t)=Csβ(t)[νs+νaCsCa], as Ca increases, CsCa decreases, and R(t) increases. Since R(t) represents the number of new infections from a single case, a decrease below R(t)<1 implies that a single infectious individual infects less than one new individual, which indicates that the outbreak decays. The dashed vertical lines indicate the date R(t)=1 during which one infectious individual, either symptomatic or asymptomatic, infects on average one other individual. For an asymptomatic infectious period of Ca = 3.25 days, it took until March 28 before Santa Clara County managed to get R(t) below 1 for the first time after the outbreak. For Ca = 6.5 days, this only occurred by April 1 and for Ca = 13.0 days, this occurred on April 8, 2020. This confirms our intuition that, the larger the asymptomatic infectious period Ca, for example because asymptomatic individuals will not isolate as strictly as symptomatic individuals, the higher the effective reproduction number R(t), and the more difficult it will be to control R(t) by public health interventions. For each of the three cases, the symptomatic infectious, asymptomatic infectious, and recovered population, are shown in the bottom row. For larger asymptomatic infectious periods Ca, from left to right, the total infectious population I increases and its maximum occurs later in time. Specifically, the maximum infectious population since March 6, 2020 amounts to 0.70% (95% CI: 0.43%-0.97%) on March 28, 2020 for Ca=3.25 days, 1.23% (95% CI: 0.72%-1.75%) on April 2, 2020 for Ca=6.5 days, and 2.10% (95% CI: 1.25%-2.94%) on April 7, 2020 for Ca=13.0 days. For larger asymptomatic infectious periods Ca, from left to right, the recovered population R decreases. Specifically, on June 15, 2020, the recovered population R amounts to 10.85% (95% CI: 7.06%-16.09%) for an infectious period of Ca=3.25 days, 10.20% (95% CI: 6.07%-14.90%) for Ca=6.5 days, and 9.90% (95% CI: 6.07%-13.93%) for Ca=13.0 days. Similarly, and important when considering different exit strategies, the total infectious population, I=Is+Ia, on June 15, 2020 is estimated to 0.39% (95% CI: 0.24%-0.58%) for Ca=3.25 days, 0.68% (95% CI: 0.40%-1.01%) for Ca=6.5 days, and 1.25% (95% CI: 0.73%-1.77%) for Ca=13.0 days.

Fig. 4.

Fig. 4

Effect of asymptomatic transmission of COVID-19 in Santa Clara County. The simulation learns the time-varying contact rate β(t), and with it the time-varying effective reproduction number R(t), for fixed latent and symptomatic infectious periods A=2.5 days and Cs=6.5 days, and for three asymptomatic infectious periods Ca=3.25 days, 6.5 days, and 13.0 days (from left to right). The downward trend of the effective reproduction number R(t) reflects the efficiency of public health interventions (top row). The dashed vertical lines mark the critical time period during which the effective reproductive reproduction number fluctuates around R(t)=1. The simulation predicts the symptomatic infectious, asymptomatic infectious, and recovered populations Is, Ia, and R (bottom row). The colored regions highlight the 95% credible interval for uncertainties in the number of confirmed cases D, the fraction of the symptomatic infectious population νs=IsI, the initial exposed population E0 and the initial infectious populations Is0 and Ia0.

3.3. Outbreak dynamics of COVID-19 worldwide

Fig. 5 illustrates the hierarchical asymptomatic infectious period Ca estimation for nine locations that reported COVID-19 antibody prevalence in a representative sample of the population: Heinsberg (NRW, Germany), Ada County (ID, USA), New York City (NY, USA), Santa Clara County (CA, USA), Denmark, Geneva Canton (Switzerland), Netherlands, Rio Grande do Sul (Brazil) and Belgium. This estimate assumes an initial fixed reproduction number R0 = 3.87 (95%CI: 3.01–4.66) during a two-week window before lockdown which results in an asymptomatic infectious period of Ca = 5.76 (95%CI: 3.59–8.09) days. With this value, we simulate the outbreak dynamics of COVID-19 in all nine locations. Fig. 6 illustrates the learnt effective reproduction number R(t), and the symptomatic and asymptomatic infectious populations Is and Ia, and the recovered population R for all nine locations. Here, we assume fixed latent and infectious periods of A=2.5 days, Cs=6.5 days, and the hierarchical asymptomatic infectious period Ca = 5.76 (95%CI: 3.59–8.09) days from Fig. 5.

Fig. 5.

Fig. 5

Hierarchical asymptomatic infectious periodCaestimation. Assuming an initial fixed reproduction number R0= 3.87 (95%CI: 3.01–4.66) [73], the simulation generates histograms of the asymptomatic infectious period Ca for each location based on the location-specific symptomatic fraction νs. The black dots and gray regions represent the reported and simulated detected cases Dˆ(t) and D(t) respectively. The hierarchical hyperdistribution for the asymptomatic infectious period results in Ca= 5.76 (95%CI: 3.59–8.09) days, right histogram.

Fig. 6.

Fig. 6

Outbreak dynamics of COVID-19 worldwide. Dynamic effective reproduction number R(t) and symptomatic, asymptomatic, and recovered populations at all nine locations. The simulation learns the time-varying contact rate β(t), and with it the time-varying effective reproduction number R(t), to predict the symptomatic infectious, asymptomatic infectious, and recovered populations Is, Ia, and R, for fixed latent and infectious periods A=2.5 days, Cs=6.5 days, the hierarchical asymptomatic infectious period Ca= 5.76 (95%CI: 3.59–8.09) days from Fig. 5. The dashed vertical lines mark the first time each location managed to lower the effective reproduction below R(t)=1 after lockdown. The colored regions highlight the 95% credible interval for the effective reproductive number R(t) (top), the symptomatic and asymptomatic populations Is and Ia, and the recovered population R (bottom plots), for uncertainties in the number of confirmed cases D, the fraction of the symptomatic infectious population νs, the initial exposed population E0, and the initial infectious populations Is0 and Ia0.

For all nine locations, the calculated metrics display the fine balance between the dynamics of the effective reproduction number and the control of the epidemic outbreak. The downward trend of the effective reproductive number R(t) quantifies how fast each location managed to control the spreading of COVID-19. The dashed vertical line indicates the first time each location managed to lower the effective reproduction below R(t)=1 after lockdown. This critical transition occurred on March 20 for Heinsberg, April 4 for Ada County, April 28 for New York City, March 30 for Santa Clara County, April 8 for Denmark, April 2 for Geneva, April 14 for the Netherlands, June 8 for Rio Grande do Sul and April 11 for Belgium. Based on our simulations, the maximum infectious population size amounted to 3.54% (95% CI: 2.96%-4.11%) on March 19 in Heinsberg, 0.47% (95% CI: 0.41%-0.55%) on May 5 in Ada County, 6.11% (95% CI: 5.99%-6.21%) on April 13 in New York City, 1.11% (95% CI: 0.69%-1.65%) on March 29 in Santa Clara County, 0.38% (95% CI: 0.27%-0.50%) on April 9 in Denmark, 2.10% (95% CI: 1.65%-2.69%) on April 2 in Geneva, 0.62% (95% CI: 0.56%-0.68%) on April 15 in the Netherlands, 0.28% (95% CI: 0.14%-0.48%) on June 8 in Rio Grande do Sul, and 0.70% (95% CI: 0.62%-0.78%) on April 11 in Belgium. On Jun 15, 2020, the estimated recovered population reached 24.15% (95% CI: 20.48%-28.14%) in Heinsberg, NRW, Germany 2.40% (95% CI: 2.09%-2.76%) in Ada County, ID, USA 46.19% (95% CI: 45.81%-46.60%) in New York City, NY, USA 11.26% (95% CI: 7.21%-16.03%) in Santa Clara County, CA, USA 3.09% (95% CI: 2.27%-4.03%) in Denmark 12.35% (95% CI: 10.03%-15.18%) in Geneva Canton, Switzerland 5.24% (95% CI: 4.84%-5.70%) in Netherlands 1.53% (95% CI: 0.76%-2.62%) in Rio Grande do Sul, Brazil 5.32% (95% CI: 4.77%-5.93%) in Belgium.

3.4. Estimating the outbreak date

Fig. 7 shows the estimated outbreak date of COVID-19 in Santa Clara County. For fixed latent and symptomatic infectious periods A=2.5 days and Cs=6.5 days, and for the hierarchically estimated asymptomatic infectious periods Ca=5.76 (95%CI: 3.59–8.09) days, the graph highlights the estimated date of the first COVID-19 case in the county. Based on the reported case data from March 16, 2020 onward, and taking into account uncertainty on the fraction of the symptomatic infectious population νs, on the initial exposed population E0, and on the initial symptomatic and asymptomatic infectious populations Is0 and Ia0, we systematically backtracked the date of the first undetected infectious individual. Our results suggest that the first case of COVID-19 in Santa Clara County dates back to January 20, 2020 (95% CI: December 29, 2019–February 13, 2020).

Fig. 7.

Fig. 7

Estimating the outbreak date of COVID-19 in Santa Clara County varying asymptomatic infectious periodsCa. Estimated date of the first COVID-19 case in Santa Clara County for fixed latent and symptomatic infectious periods A=2.5 days and Cs=6.5 days, and for the hierarchical asymptomatic infectious period Ca= 5.76 (95%CI: 3.59–8.09) days from Fig. 5. The colored regions in the main plot highlight the 95% credible interval for the time evolution of the exposed and asymptomatic infectious populations E and Ia estimated based on the reported cases Dˆ(t) from March 16, 2020 onward and taking into account uncertainties on the fraction of the symptomatic infectious population νs=IsI, and the exposed and asymptomatic infectious populations E0 and Ia0 on March 16, 2020 (right plot). The bottom plot histogram shows the distribution of the most probable origin dates to January 20, 2020 (95% CI: December 29, 2019–February 13, 2020).

4. Discussion

A key question in understanding the outbreak dynamics of COVID-19 is the dimension of the asymptomatic population and its role in disease transmission. Throughout the past months, dozens of studies have been initiated to quantify the fraction of the general population that displays antibody prevalence but did not report symptoms of COVID-19. Here we assume that this subgroup of the population has been infected with the novel coronavirus, but has remained asymptomatic, or only displayed mild symptoms that were not directly reported in the context of COVID-19. We collectively map this subgroup into an asymptomatic population and additively decompose the total infectious population, I=Is+Ia, into a symptomatic group Is and an asymptomatic group Ia. We parameterize this decomposition in terms of a single scalar valued parameter, the symptomatic fraction νs. Within this paradigm, we can conceptually distinguish two scenarios: the special case for which both subgroups display identical contact rates β, latent periods A, and infectious periods C, and the general case for which these transition dynamics are different.

For comparable dynamics, the size of the asymptomatic population does not affect overall outbreak dynamics. For the special case in which both subgroups display identical contact rates β, latent rates α, and infectious rates γ [76], our study shows that the overall outbreak dynamics can be represented by the classical SEIR model [49] using Eqs. (6). Importantly, however, since the reported case data only reflect the symptomatic infectious and recovered groups Is and Rs, the true infectious and recovered populations I=Isνs and R=Rsνs could be about an order of magnitude larger than the SEIR model predictions. From an individual’s perspective, a smaller symptomatic group νs, or equivalently, a larger asymptomatic group νa=[1νs], could have a personal effect on the likelihood of being unknowingly exposed to the virus, especially for high-risk populations: A larger asymptomatic fraction νa would translate into an increased risk of community transmission and would complicate outbreak control [27]. From a health care perspective, however, the special case with comparable transition dynamics would not pose a threat to the health care system since the overall outbreak dynamics would remain unchanged, independent of the fraction νa of the asymptomatic population: A larger asymptomatic fraction would simply imply that a larger fraction of the population has already been exposed to the virus – without experiencing significant symptoms – and that the true hospitalization and mortality rates would be much lower than the reported rates [30].

For different dynamics, the overall outbreak dynamics depend on both size and infectiousness of the asymptomatic group. For the general case in which the transition rates for the symptomatic and asymptomatic groups are different, the overall outbreak dynamics of COVID-19 become more unpredictable, since little is known about the dynamics of the asymptomatic population [40]. To study the effects of different dynamics between the symptomatic and asymptomatic groups, we decided to collectively represent a lower infectivity of the asymptomatic population through a smaller infectious period Ca<Cs and a lack of early isolation of the asymptomatic population through a larger infectious period Ca>Cs, while, for simplicity, keeping the latent period A and contact rate β similar across both groups [59]. Our study shows that the overall reproduction number, R(t)=[CaCs][νsCa+νaCs]β(t), and with it the outbreak dynamics, depend critically on the fractions of the symptomatic and asymptomatic populations νs and νa and on the ratio of the two infectious periods Cs and Ca. To illustrate these effects, we report the results for three different scenarios where the asymptomatic group is half as infectious, Ca=0.5Cs, equally infectious, Ca=Cs, and twice as infectious Ca=2.0Cs as the symptomatic group for Santa Clara County. The second case, the middle column in Fig. 3, Fig. 4, corresponds to the special case with comparable dynamics and similar parameters. Our learnt asymptomatic infectious periods of Ca=5.76 days in Fig. 5 suggest that Ca is smaller than the symptomatic infectious period of Cs=6.5 days and that the asymptomatic population is slightly less infectious as the symptomatic population. This can be a combined effect of being less contagious but, at the same time, having more contacts, since asymptomatic individuals, by definition, do not realize that they are spreading the disease [77].

Dynamic contact rates are a metric for the efficiency of public health interventions. Classical SEIR epidemiology models with static parameters are well suited to model outbreak dynamics under unconstrained conditions and predict how the susceptible, exposed, infectious, and recovered populations converge freely towards the endemic equilibrium [49]. However, they cannot capture changes in disease dynamics and fail to converge towards a temporary equilibrium before the entire population has become sufficiently immune to prevent further spreading [43]. To address this limitation, we introduce a time-dependent contact rate β(t), which we learn dynamically from the reported case data. Fig. 3, Fig. 6 demonstrate that our approach can successfully identify a dynamic rate that not only decreases monotonically, but is also capable of reproducing local contact fluctuations [44]. With this dynamic contact rate, our model can capture the characteristic S-shaped COVID-19 case curve that plateaus before a large fraction of the population has been affected by the disease, resembling a Gompertz function. It also naturally captures a potential regrowth of the contact rate β(t), and with it the effects of a second wave [78]. Previous studies have inferred discrete date points at which the contact rates vary [66] or used sliding windows over the amount of novel reported infections [79] to motivate dynamic contact rates. As such, our framework provides a model-based method for statistical inference of virus transmissibility: It naturally learns the most probable contact rate from the changing time evolution of new confirmed cases and concomitantly quantifies the uncertainty on that estimation.

The dynamics of the asymptomatic population affect the effective reproduction number. Our analysis in Eqs. (4), (5) and our simulations in Fig. 4 illustrate how asymptomatic transmission affects the effective reproduction number, and with it the outbreak dynamics of COVID-19. Our results show that, the larger the infectious period Ca of the asymptomatic group, the larger the initial effective reproduction number, R(t)=Csβ(t)[νs+νaCsCa], and the later the drop of R(t) below the critical value of one. A recent study analyzed the dynamics of the asymptomatic population in three consecutive windows of two weeks during the early outbreak in China [3]. The study found relatively constant latency and infectious periods A and C, similar to our assumption, and a decrease in the contact rate β=1.12,0.52,0.35 days−1 and in the effective reproduction number R(t)=2.38,1.34,0.98, which is consistent with our results. However, rather than assuming constant outbreak parameters within pre-defined time windows, our study learns the effective reproduction number dynamically, in real time, from the available data. Fig. 4, Fig. 6 demonstrate that we can successfully learn the critical time window until R(t) drops below 1, which, in Santa Clara County, took till March 28 for Ca =3.25 days, till April 1 for Ca=6.5 days and till April 8 for Ca = 13.0 days. Our findings are consistent with the observation that the basic reproduction number will be over-estimated if the asymptomatic group has a shorter generation interval, and underestimated if it has a longer generation interval than the symptomatic group [40]. Naturally, these differences are less pronounced under current conditions where the effective reproduction number is low and the entire population has been sheltering in place. It will be interesting to see if the effects of asymptomatic transmission become more visible as we gradually relax the current constraints and allow all individuals to move around and interact with others more freely. Seasonality, effects of different temperature and humidity, and other unknown factors may also influence the extent of transmission.

Estimates of the infectious asymptomatic population may vary, but general trends are similar. Throughout the past months, an increasing number of researchers around the globe have started to characterize the size of the asymptomatic population to better understand the outbreak dynamics of COVID-19 [30]. The reported undercount varies largely with values on the order of one estimated by the World Health Organization [80] and values on the order of ten reported by the Center for Disease Control and Prevention [36]. Two major challenges drive the interest in these studies: estimating the severity of the outbreak, e.g., hospitalization and mortality rates [27], and predicting the success of surveillance and control efforts, e.g., contact tracing or vaccination [35]. This is especially challenging now – in almost complete lockdown – when the differences in transmission dynamics between the symptomatic and asymptomatic populations are small and difficult to quantify. However, as Fig. 4 suggests, these transmission dynamics can have a significant effect on the size of the asymptomatic population: For infectious periods of Ca=0.5, 1.0, and 2.0 Cs, the maximum infectious population in Santa Clara County varies from 0.70% and 1.23% to 2.10%. Interestingly, not only the sum of the infectious and recovered populations, but also the uncertainty of their prediction, remain relatively insensitive to variations in the infectious period. To explore whether this is a universal trend, we perform the same analysis for nine different locations at which COVID-19 antibody prevalence was measured in a representative sample of the population, Heinsberg (NRW, Germany) [21], Ada County (ID, USA) [5], New York City (NY, USA) [14], Santa Clara County (CA, USA) [4], Denmark [8], Geneva Canton (Switzerland) [22], Netherlands [20], Rio Grande do Sul (Brasil) [19], and Belgium [16]. The fraction of the symptomatic population in these nine locations is νs = 20.00%, 7.90%, 5.76%, 1.77%, 6.95%, 10.34%, 17.31%, 8.10%, and 10.21% respectively, broadly representing the range of reported symptomatic versus estimated total cases worldwide [4], [5], [6], [7], [8], [9], [10], [11], [12], [13], [14], [15], [16], [17], [18], [19], [20], [21], [22], [23], [24], [25], [26]. Of the nine locations we analyzed here, Heinsberg tested IgG and IgA, Ada County tested IgG, New York City tested IgG, Santa Clara County tested IgG and IgM, Denmark tested IgG and IgM, Geneva tested IgG, the Netherlands tested IgG, IgM and IgA, Rio Grande do Sul tested IgG and IgM, and Belgium tested IgG. While we did include reported uncertainty on the seroprevalence data, seroprevalence would likely have been higher if all locations had tested for all three antibodies. Despite these differences, the effective reproduction numbers R(t) and the infectious and recovered populations Is, Ia, and R in Fig. 6 display remarkably similar trends: In most locations, the effective reproduction number R(t) drops rapidly to values below one within a window of about three weeks after the lockdown date. However, the maximum infectious population, a value that is closely monitored by hospitals and health care systems, varies significantly ranging from 0.28% and 0.38% in Rio Grande do Sul and Denmark respectively to 3.54% and 6.11% in Heinsberg and New York City respectively. This is consistent with the reported ‘superspreader’ events in these last two locations. An effect that we do not explicitly address is that immune response not only results COVID-19 antibodies (humoral response), but also from innate and cellular immunity [81]. While it is difficult to measure the effects of the unreported asymptomatic group directly, and discriminate it precisely from innate and cellular immunity, mathematical models can provide valuable insight into how this population modulates the outbreak dynamics and the potential of successful outbreak control [3].

Simulations provide a window into the outbreak date. Santa Clara County was home to the first individual who died with COVID-19 in the United States. Although this happened as early as February 6, the case remained unnoticed until April 22 [82]. The unexpected new finding suggests that the new coronavirus was circulating in the Bay Area as early as January. The estimated uncertainty on the exposed, symptomatic infectious, and asymptomatic infectious populations of our model allows us to estimate the initial outbreak date dates back to January 20 (Fig. 7). This back-calculated early outbreak date is in line with our intuition that COVID-19 is often present in a population long before the first official case is reported. Interestingly, our analysis comes to this conclusion purely based on a local serology antibody study [4] and the number of reported cases after lockdown [74].

Limitations. Our approach naturally builds in and learns several levels of uncertainties. By design, this allows us to estimate sensitivities and credible intervals for a number of important model parameters and discover important features and trends. Nevertheless, it has a few limitations, some of them by design, some simply limited by the current availability of data: First, our current SEIIR model assumes a similar contact rate β(t) for symptomatic and asymptomatic individuals. While we can easily adjust this in the model by defining individual symptomatic and asymptomatic rates βs(t) and βa(t), we currently do not have data on the temporal evolution of the hidden asymptomatic infectious population Ia(t) and longitudinal large population antibody follow-up studies would be needed to appropriately calibrate βa(t). Second, the ratio between the symptomatic and asymptomatic populations νs:νa can vary over time, especially, as we have shown, if both groups display notably different dynamics, in our model represented through Cs and Ca. Since this can have serious effects on the overall reproduction number R(t), and with it on required outbreak control strategies, it seems critical to perform more tests and learn the dynamics of the fractions νs(t) and νa(t) of both groups. Third, and this is not only true for our specific model, but for COVID-19 forecasts in general, all predictions can be sensitive to the amount of testing in time. As such, they crucially rely on testing policies and testing capacities. We expect to see a significant increase in the symptomatic-to-asymptomatic, or rather detected-to-undetected, ratio as we move towards systematically testing larger fractions of the population and more and more people who have no symptoms at all. The intensity of testing increases in most locations during our simulation period. For example, in Santa Clara County, testing was extremely limited until early April, increased substantially in the first three weeks of April, and even more after. Including limited testing and more undocumented cases during the early outbreak would shift the case distribution towards earlier days, and predict an even earlier outbreak date. When longitudinal antibody studies become available, additional methodologies can be developed to correct for this limitation which include a dynamic evolution for the symptomatic/asymptomatic fractions. Fourth, while we have included uncertainty in the seroprevalence data, the nine locations we analyzed here tested different types of antibodies and had different sampling procedures. Seroprevalence could have been higher if all locations had tested for the same three antibodies and data may differ depending on biases introduced by the sampling procedure. Finally, our current model does not explicitly account for innate and cellular immunity. If the fraction of the population with innate and cellular immunity is substantially high, we would anticipate a smaller susceptible population and a larger and earlier protective immunity overall. These, and other limitations related to the availability of information, can be easily addressed and embedded in our model and will naturally receive more clarification as studies and data become available in the coming months.

5. Conclusions

The rapid and devastating development of the COVID-19 pandemic has raised many open questions about its outbreak dynamics and unsuccessful outbreak control. From an outbreak management standpoint – in the absence of effective vaccination and treatment – the two most successful strategies in controlling an infectious disease are isolating infectious individuals and tracing and quarantining their contacts. Both critically rely on a rapid identification of infections, typically through clinical symptoms. Recent antibody prevalence studies could explain why these strategies have largely failed in containing the COVID-19 pandemic: Increasing evidence suggests that the number of unreported asymptomatic cases could outnumber the reported symptomatic cases by an order of magnitude or more. Mathematical modeling, in conjunction with reported symptomatic case data, antibody seroprevalence studies, and machine learning allows us to infer, in real time, the epidemiology characteristics of COVID-19. We can now visualize the invisible asymptomatic population, estimate its role in disease transmission, and quantify the confidence in these predictions. A better understanding of asymptomatic transmission will help us evaluate strategies to manage the impact of COVID-19 on both our economy and our health care system. A large asymptomatic population is associated with a high risk of community spread and could require a conscious shift from containment to mitigation induced by behavior changes. Our study suggests that, until vaccination and treatment become available, increasing population awareness, encouraging increased hygiene, mandating the use of face masks, restricting travel, and promoting physical distancing could be the most successful strategies to manage the impact of COVID-19 on both our economy and our health care system.

Declaration of Competing Interest

The authors declare no competing interst.

Acknowledgments

This work was supported by a Stanford Bio-X IIP Seed Grant (M.P. and E.K.), by the NIH, United States Grant U01 HL119578 (E.K.), by a DAAD Fellowship, United States (K.L.), and by the Stanford COVID-19 Seroprevalence Study Fund, United States (J.B. and E.B.).

Appendix.

To visualize the effect of asymptomatic transmission on the outbreak dynamics of COVID-19, we perform a sensitivity analysis with respect to the two most relevant parameters associated with asymptomatic transmission, the symptomatic fraction νs and the asymptomatic infectious period Ca. To highlight the effect of both parameters, we visualize the exposed, symptomatic and asymptomatic infectious, and recovered populations of our SEIIR model,

S˙=Sβ(t)[Is+Ia]E˙=+Sβ(t)[Is+Ia]αEI˙s=+νsαEγsIsI˙a=+νaαEγaIaR˙=+γsIs+γaIa. (15)

We assume that the symptomatic and asymptomatic contact rates are equal, β=βs=βa, and that the differences between symptomatic and asymptomatic transmission manifest themselves exclusively in the infectious periods Cs=1γs and Ca=1γa. For illustrative purposes, rather than using a dynamic contact rate of Gaussian random walk type as we have proposed in the main body of this manuscript, we adopt a smooth, monotonically decreasing dynamic contact rate of hyperbolic tangent type [44],

β(t)=β012[1+tanh([tt]T)][β0βt], (16)

where β0 is the basic contact rate, βt is the reduced contact rate, t is the adaptation time, and T is the adaptation speed. To mimic realistic outbreak dynamics, we adopt the basic contact rate β0=0.65 /days, the reduced contact rate βt=0.10 /days, the adaptation time t=18.61 days, and the adaptation speed T=10.82 /days from the mean values across the 27 countries of the European union [44]. Similar to the main body of this manuscript, we select a latent period of A=1α = 2.5 days and a symptomatic infectious period of Cs=1γa = 6.5 days. For the sensitivity analysis study three cases: (i) varying symptomatic and asymptomatic fractions νs=[0.0,,1.0] and νa=[0.0,,1.0] at Ca=6.5 days; (ii) a varying symptomatic fraction νs=[0.0,,1.0] at fixed Ca=3.25 days and Ca=13 days; and (iii) a varying asymptomatic infectious period Ca=[3.25,,13.00] days at fixed νs=0.2 and νs=0.8. For a symptomatic fraction of νs=0.2, the undercount is four, i.e., for every reported case, there are four unreported cases. For a symptomatic fraction of νs=0.8, the undercount is one fourth, i.e., for every four reported cases, there is one unreported case.

Fig. 8 highlights the effect of varying symptomatic and asymptomatic fractions νs and νa for similar infectious periods Ca=Cs. The darkest curves correspond to the special case of only symptomatic transmission, νs=1.0 and only asymptomatic transmission, νa=1.0; the lightest curves vice versa. For the special case with similar symptomatic and asymptomatic disease dynamics, the SEIIR model (1) reduces to the classical SEIR model (6), and we can simply post-process the symptomatic and asymptomatic infectious populations as Is=νsI and Ia=νaI. The set of blue curves illustrates the symptomatic part of the recovered population, Rs=νsR for varying νs and νa, which is the fraction of the recovered population that is reported in the public data bases. The steepest blue curve is the total recovered population, R=Rs+Ra=[νs+νa]R, which, for this special case, is independent of the symptomatic and asymptomatic fractions νs and νa. For example, for a symptomatic fraction of νs=0.2, the ratio between both curves is RRs=5, which implies that five times more individuals have actually already gone through the disease than we would expect from the reported case data in public databases.

Fig. 8.

Fig. 8

SEIIR epidemiology model. Sensitivity with respect to the symptomatic and asymptomatic fractionsνsandνa. Darkest curves correspond to the special case of only symptomatic transmission, νs=1.0 (left) and only asymptomatic transmission, νa=1.0 (right); lightest curves vice versa. For the special case with similar symptomatic and asymptomatic infectious periods, Ca=Cs, the SEIIR model reduces to the classical SEIR model with Is=νsI and Ia=νaI. The set of blue curves illustrates the symptomatic part of the recovered population Rs=νsR; the steepest blue curve is the total recovered population R, which is independent of νs and νa. Latent period A=2.5 days, symptomatic and asymptomatic infectious periods Cs=6.5 days and Ca=6.5 days, and contact rate β(t)=β012[1+tanh([tt]T)][β0βt] with β0=0.65 /days, βt=0.10 /days, t=18.61 days, and T=10.82 days.

Fig. 9 highlights the effect of the symptomatic fraction νs. The dark red, orange, green, and blue curves correspond to the special case of the regular SEIR model (6) without asymptomatic transmission and thus νs=1.0 and νa=0.0. The lighter curves show the effect of decreasing the symptomatic fraction towards νs=0.0 and νa=1.0. Interestingly, the effect of asymptomatic transmission on the overall outbreak dynamics varies not only with the symptomatic fraction, but is also highly sensitive to the ratio of the symptomatic and asymptomatic infectious periods. For the smaller asymptomatic infectious period of Ca=0.5Cs, the outbreak dynamics are governed by the orange symptomatic population Is and the blue recovered population R decreases with increasing asymptomatic transmission νa. For the larger asymptomatic infectious period of Ca=2Cs, the outbreak dynamics are governed by the green asymptomatic population Ia and the blue recovered population R increases with increasing asymptomatic transmission.

Fig. 9.

Fig. 9

SEIIR epidemiology model. Sensitivity with respect to the symptomatic fractionνs. Darkest curves correspond to the special case of no asymptomatic transmission, νs=1.0, lightest curves to only asymptomatic transmission, νs=0.0. For a smaller asymptomatic infectious period of Ca=0.5Cs, the outbreak dynamics are governed by the symptomatic population Is and the recovered population R decreases with increasing asymptomatic transmission νa (left). For a larger asymptomatic infectious period of Ca=2Cs, the outbreak dynamics are governed by the asymptomatic population Ia and the recovered population increases with increasing asymptomatic transmission (right). Latent period A=2.5 days, symptomatic infectious period Cs=6.5 days, asymptomatic infectious period Ca=3.25 days (left) and Ca=13 days (right), and contact rate β(t)=β012[1+tanh([tt]T)][β0βt] with β0=0.65 /days, βt=0.10 /days, t=18.61 days, and T=10.82 days.

Fig. 10 shows the effect of the asymptomatic infectious period Ca. The darkest red, orange, green, and blue curves correspond to a larger asymptomatic infectious period of Ca=2Cs, the lightest curves to a smaller period Ca=0.5Cs. The middle curves correspond to the special case for which the asymptomatic and symptomatic populations display the same dynamics, Ca=Cs, such that the overall recovered population becomes independent of the symptomatic fraction. This implies that middle blue recovered populations of both graphs are identical and also identical to the dark blue curves of Fig. 8. For the larger symptomatic fraction of νs=0.8, the outbreak dynamics are governed by the orange symptomatic population Is and the blue recovered population R is relatively insensitive to the asymptomatic infectious period Ca. For the smaller symptomatic fraction of νs=0.2, the outbreak dynamics are governed by the green asymptomatic population Ia and the blue recovered population R increases markedly with an increasing asymptomatic infectious period Ca.

Fig. 10.

Fig. 10

SEIIR epidemiology model. Sensitivity with respect to the asymptomatic infectious periodCa. Darkest curves correspond to a larger asymptomatic infectious period of Ca=2Cs, lightest curves to a smaller period Ca=0.5Cs. For a larger symptomatic fraction of νs=0.8, the outbreak dynamics are governed by the symptomatic population Is and the recovered population R is relatively insensitive to the asymptomatic infectious period Ca (left). For a smaller symptomatic fraction of νs=0.2, the outbreak dynamics are governed by the asymptomatic population Ia and the recovered population R increases markedly with an increasing asymptomatic infectious period Ca (right). Latent period A=2.5 days, symptomatic infectious period Cs=6.5 days, asymptomatic infectious period Ca=3.25 days (left) and Ca=13 days (right), and contact rate β(t)=β012[1+tanh([tt]T)][β0βt] with β0=0.65 /days, βt=0.10 /days, t=18.61 days, and T=10.82 days.

Taken together, this sensitivity analysis shows that the outbreak dynamics of COVID-19 are highly sensitive to asymptomatic transmission. This sensitivity becomes more pronounced the more the asymptomatic infectious period differs from its symptomatic counterpart. This difference could be a natural result of asymptomatic individuals isolating themselves less since they are unaware that they can infect others. Knowing the precise dynamics of asymptomatic transmission is therefore critical for a successful management of COVID-19.

References

  • 1.Dong E., Gardner L. An interactive web-based dashboard to track COVID-19 in real time. Lancet Infect. Dis. 2020 doi: 10.1016/S1473-3099(20)30120-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Fang Y., Nie Y., Penny M. Transmission dynamics of the COVID-19 outbreak and effectiveness of government interventions. J. Med. Virol. 2020:1–15. doi: 10.1002/jmv.25750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Li R., Pei S., Chen B., Song Y., Zhang T., Yang W., Shaman J. Substantial undocumented infection facilitates the rapid dissemination of novel coronavirus (SARS-cov2) Science. 2020;368:489–493. doi: 10.1126/science.abb3221. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.E. Bendavic, B. Mulaney, N. Sood, S. Shah, E. Ling, R. Bromley-Dulfano, C. Lai, Z. Weissberg, R. Saavedra-Walker, J. Tedrow, D. Tversky, A. Bogan, T. Kupiec, D. Eichner, R. Gupta, J.P.A. Ioannidis, J. Bhattacharya, COVID-19 Antibody seroprevalence in Santa Clara County, California, medRxiv (2020) 10.1101/2020.04.14.20062463. [DOI] [PMC free article] [PubMed]
  • 5.Bryan A., Pepper G., Wener M.H., Fink S.L., Morishima C., Chaudhary A., Jerome K.R., Mathias P.C., Greninger A.L. Performance characteristics of the abbott architect SARS-CoV-2 IgG assay and seroprevalence in boise. Idaho J. Clin. Microbiol. 2020 doi: 10.1128/JCM.00941-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.A. Doi, K. Iwata, H. Kuroda, T. Hasuike, A. Kanda, T. Nagao, H. Nishioka, K. Tomii, T. Morimoto, Y. Kihara, Seroprevalence of novel coronavirus disease (COVID-19) in Kobe, Japan. medRxiv (2020) 10.1101/2020.04.26.20079822. [DOI] [PMC free article] [PubMed]
  • 7.Emmenegger M., De Cecco E., Lamparter D., Jacquat R.P.B., Ebner D., Schneider M.M., Condado Morales I., Schneider D., Dogancay B., Guo J., Wiedmer A., Domange J., Imeri M., Moos R., Zografou C., Trevisan C., Gonzalez-Guerra A., Carrella A., Dubach I.L., Xu C.K., Meisl G., Kosmoliaptsis V., Malinauskas T., Burgess-Brown N., Owens R., Mongkolsapaya J., Hatch S., Screaton G.R., Schubert K., Huck J.D., Liu F., Pojer F., Lau K., Hacker D., Probst-Mueller E., Cervia C., Nilsson J., Boyman O., Saleh L., Spanaus K., von Eckardstein A., Schaer D.J., Ban N., Tsai C., Marino J., Schertler G.F.X., Gottschalk J., Frey B.M., Reimann R., Hornemann S., Ring A.M., Knowles T.P.J., Xenarios I., Stuart D.I., Aguzzi A. 2020. Population-wide evolution of SARS-CoV-2 immunity tracked by a ternary immunoassay. medRxiv. [DOI] [Google Scholar]
  • 8.C. Erikstrup, C.E. Hother, O.B.V. Pedersen, K. Molbak, R.L. Skov, D.K. Holm, S. Saekmose, A.C. Nilsson, P.T. Brooks, J.K. Boldsen, C. Mikkelsen, M. Gybel-Brask, E. Sorensen, K.M. Dinh, S. Mikkelsen, B.K. Moller, T. Haunstrup, L. Harritshoj, B.A. Jensen, H. Hjalgrim, S.T. Lillevang, H. Ullum, Estimation of SARS-CoV-2 infection fatality rate by real-time antibody screening of blood donors medRxiv (2020) 10.1101/2020.04.24.20075291. [DOI]
  • 9.A. Fontanet, L. Tondeur, Y. Madec, R. Grant, C. Besombes, N. Jolly, S. Fernandes Pellerin M.N. Ungeheuer, I. Cailleau, L. Kuhmel, S. Temmam, C. Huon, K.Y. Chen, B. Crescenzo, S. Munier, C. Demeret, L. Grzelak, I. Staropoli, T. Bruel, P. Gallian, S. Cauchemez, S. van der Werf, O. Schwartz, M. Eloit, B. Hoen, Cluster of COVID-19 in northern France: A retrospective closed cohort study. medRxiv (2020) 10.1101/2020.04.20071134. [DOI]
  • 10.Hallal H., Hartwig F., Horta B., Victora G.D., Silveira M., Struchiner C., Vidaletti L.P., Neumann N., Pellanda L.S., Dellagostin O.A., Burattini M.N., Menezes A.M., Barros F.C., Barros A.J., C.G C. 2020. Victora remarkable variability in SARS-cov-2 antibodies across Brazilian 1 regions: nationwide serological household survey in 27 states. medRxiv. [DOI] [Google Scholar]
  • 11.Jerkovic I., Ljubic T., Basic Z., Kruzic I., Kunac N., Bezic J., Vuko A., Markotic A., S L.S. Andjelinovic SARS-CoV-2 antibody seroprevalence in industry workers in split-dalmatia and sibenik-knin county. Croatia. 2020 doi: 10.1101/2020.05.11.20095158. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Kraehling V., Kern M., Halwe S., Mueller H., Rohde C.P.C., Savini M., Schmidt M., Wilhelm J., Becker S., Ciesek S., Gottschalk R. 2020. Epidemiological study to detect active SARS-cov-2 infections and seropositive persons in a selected cohort of employees in the frankfurt am main metropolitan area. medRxiv. [DOI] [Google Scholar]
  • 13.Ng D., Goldgof G., Shy B., Levine A., Balcerek J., Bapat S.P., Prostko J., Rodgers M., Coller K., Pearce S., Franz S., Du L., Stone M., Pillai S., Sotomayor-Gonzalez A., Servellita V., Sanchez-San Martin C., Granados A., Glasner D.R., Han L.M., Truong K., Akagi N., Nguyen D.N., Neumann N., Qazi D., Hsu E., Gu W., Santos Y.A., Custer B., Green V., Williamson P., Hills N.K., Lu C.M., Whitman J.D., Stramer S., Wang C., Reyes K., Hakim J., Sujishi K., Alazzeh F., Pharm L., Oon C., Miller S., Kurtz T., Hackett J., Simmons G., Busch M.P., C.Y B.M. 2020. Chiu SARS-CoV-2 seroprevalence and neutralizing activity in donor and patient blood from the San Francisco Bay Area. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.J. Reifer, N. Hayum, B. Heszkel, I. Klagsbald, V.A. Streva, SARS-CoV-2 IgG antibody responses in New York City medRxiv (2020) 10.1101/2020.2005.2023.20111427. [DOI] [PMC free article] [PubMed]
  • 15.Rosenberg E.S., Tesoriero J.M., Rosenthal E.M., Chung R., Barranco M.A., Styer L.M., Parker M.M., Leung S.J., Morne J.M., Greene D., Holtgrave D.R., Hoefer D., Kumar J., Udo T., Hutton B., Zucker H.A. 2020. Cumulative incidence and diagnosis of SARS-CoV-2 infection in new york. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.2020. Sciensano: National public health research institute of Belgium weekly epidemiological bulleting from may 29, 2020. https://covid-19.sciensano.be (assessed: Jun 15, 2020) [Google Scholar]
  • 17.M. Shakiba, S. Nazari, F. Mehrabian, S.M. Rezvani, Z. Ghasempour, A. Heidarzadeh, Seroprevalence of COVID-19 virus infection in Guilan province, Iran. medRxiv (2020) 10.1101/2020.04.26.20079244. [DOI] [PMC free article] [PubMed]
  • 18.C.J. Snoeck, M. Vaillant, T. Abdelrahman, V.P. Satagopam, J.D. Turner, K. Beaumont, C.P.C. Gomes, J.V. Fritz, V.E. Schröder, A. Kaysen, L. Pavelka, L. Stute, G. Ramos Meyers, L. Pauly, M. Hansen, C. Pauly, G.A. Aguayo, M. Perquin, A. Hanff, S. Ghosh, M. Gantenbein, L. Huiart, M. Ollert, R. Krüger, Prevalence of SARS-CoV-2 infection in the Luxembourgish population: the CON-VINCE study. medRxiv (2020) 10.1101/2020.05.11.20092916. [DOI]
  • 19.M. Silveira, A. Barros, B. Horta, L. Pellanda, G. Victora, O. Dellagostin, C. Struchiner, M. Burattini, A. Valim, E. Berlezi, J. Mesa, M.L. Ikeda, M. Mesenburg, M. Mantesso, M. Dall’Agnol, R. Bittencourt, F. Hartwig, A.M. Menezes, F.C. Barros, P. Hallal, C.G. Victora, Repeated population-based surveys of antibodies against SARS-CoV-2 in Southern Brazil medRxiv (2020) 10.1101/2020.05.01.20087205. [DOI] [PubMed]
  • 20.Slot E., Hogema B.M., Reusken C.B.E.M., Reimerink J.H., Molier M., Karregat H.M., Ijist J., Novotny V.M.J., van Lier R.A.W., Zaaijer H.L. Herd immunity is not a realistic exit strategy during a COVID-19 outbreak. Res. Square. 2020 doi: 10.21203/rs.3.rs-25862/v1. [DOI] [Google Scholar]
  • 21.H. Streeck, B. Schulte, B.M. Kümmerer, E. Richter, T. Höller, C. Fuhrmann, E. Bartok, R. Dolscheid, M. Berger, L. Wessendorf, M. Eschbach-Bludau, A. Kellings, A. Schwaiger, M. Coenen, P. Hoffmann, B. Stoffel-Wagner, M.M. Nöthen, A.M. Eis-Hübinger, M. Exner2, R.M. Schmithausen, M. Schmid, G. Hartmann, Infection fatality rate of SARS-CoV-2 infection in a German community with a super-spreading event. medRxiv (2020) 10.1101/2020.05.04.20090076. [DOI]
  • 22.S. Stringhini, A. Wisniak, G. Piumatti, A.S. Azman, S.A. Lauer, H. Baysson, D. De Ridder, D. Petrovic, S. Schrempft, K. Marcus, I. Arm-Vernez, S. Yerly, O. Keiser, S. Hurst, K. Posfay-Barbe, D. Trono, D. Pittet, L. Getaz, F. Chappuis, I. Eckerle, N. Vuilleumier, B. Meyer, A. Flahault, L. Kaiser, I. Guessous, Repeated seroprevalence of anti-SARS-CoV-2 IgG antibodies in a population-based sample from Geneva, Switzerland medRxiv (2020) 10.1101/2020.05.02.20088898. [DOI]
  • 23.Takita M., Matsumura T., Yamamoto K., Yamashita E., Hosoda K., Hamaki T., Kusumi E. 2020. Regional difference in seroprevalence of SARS-CoV-2 in tokyo: Results from the community point-of-care antibody testing. medRxiv. [DOI] [Google Scholar]
  • 24.C. Thompson, N. Grayson, R.S. Paton, J. Lourenco, B.S. Penman, L. Lee, Neutralising antibodies to SARS coronavirus 2 in Scottish blood donors – a pilot study of the value of serology to determine population exposure. medRxiv (2020) 10.1101/2020.04.13.20060467. [DOI] [PMC free article] [PubMed]
  • 25.Valenti L., Bergna A., Pelusi S., Facciotti F., Lai A., Tarkowski M., Berzuini A., Caprioli F., Santoro L., Baselli G., Della Ventura C., Erba E., Bosari S., Galli M., Zehender G., Prati D. 2020. SARS-CoV-2 seroprevalence trends in healthy blood donors during the COVID-19 milan outbreak. medRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.X. Wu, B. Fu, L. Chen, Y. Feng, Serological tests facilitate identification of asymptomatic SARS-CoV-2 infection in Wuhan, China. J. Med. Virol., 10.1002/jmv.25904. [DOI] [PMC free article] [PubMed]
  • 27.Fauci A.S., Lane H.C., Redfield R.R. Covid-19–navigating the uncharted. New Engl. J. Med. 2020;382:1268–1269. doi: 10.1056/NEJMe2002387. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Havers F.P., Reed C., Lim T., Montgomery J.M., Klena J.D., Hall A.J., Fry A.M., Cannon D.L., Chiang C.-F., Gibbons A., Krapiunaya I., Morales-Betoulle M., Roguski K., Rasheed M.A.U., Freeman B., Lester S., Mills L., Carroll D.S., Owen S.M., Johnson J.A., Semenova V., Blackmore C., Blog D., Chai S.J., Dunn A., Hand J., Jain S., Lindquist S., Lynfield R., Pritchard S., Sokol T., Sosa L., Turabelidze G., Watkins S.M., Wiesman J., Williams R.W., Yendell S., Schiffer J., Thornburg N.J. Seroprevalence of antibodies to SARS-cov-2 in 10 wites in the United States. JAMA Intern. Med. 2020 doi: 10.1001/jamainternmed.2020.4130. [DOI] [PubMed] [Google Scholar]
  • 29.Pan X., Chen D., Xia Y., X J.M., Wu J.D., Li T., Ou X., Zhou L., Liu J. Asymptomatic cases in a family cluster with SARS-CoV-2 infection. Lancet Infect. Dis. 2020;20:410–411. doi: 10.1016/S1473-3099(20)30114-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.J.P.A. Ioannidis, The infection fatality rate of COVID-19 inferred from seroprevalence data. medRxiv. (2020) 10.1101/2020.05.13.20101253. [DOI] [PMC free article] [PubMed]
  • 31.Cuomo A.M. 2020. Amid ongoing COVID-19 pandemic, governor cuomo announces results of completed antibody testing study. https://www.governor.ny.gov/news/amid-ongoing-covid-19-pandemic-governor-cuomo-announces-results-completed-antibody-testing (assessed: Jun 15, 2020) [Google Scholar]
  • 32.T.A. Treibel, C. Manisty, M. Burton, A. McKnight, J. Lambourne, J.B. Augusto, X. Couto-Parada, T. Cutino-MOguel, M. Noursadeghi, J.C. Moon, COVID-19: PCR screening of asympotmatic healthcare workers at London hospital. Lancet. 10.1016/S0140-6736(20)31100-4. [DOI] [PMC free article] [PubMed]
  • 33.B. Spellberg, M. Haddix, R. Lee, S. Butler-Wu, P. Holtom, H. Yee, P. Gounder, Community prevalence of SARS-CoV-2 among patients with influenzalike illnesses presenting to a Los Angeles medical center in March 2020. J. Amer. Med. Assoc., 10.1001/jama.2020.4958. [DOI] [PMC free article] [PubMed]
  • 34.Sood N., Simon P., Ebner P., Eichner D., Reynolds J., Bendavid E., Bhattacharya J. Seroprevalence of SARS-CoV-2-specific antibodies among adults in los angeles county, california. JAMA. 2020 doi: 10.1001/jama.2020.8279. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Fraser C., Riley S., Anderson R.M., Ferguson N.M. Factors that make an infectious disease outbreak controllable. Proc. Natl. Acad. Sci. 2004;101:6146–6151. doi: 10.1073/pnas.0307506101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Center for Disease Control and Prevention . 2020. Large-scale geographic seroprevalence surveys. https://www.cdc.gov/coronavirus/2019-ncov/cases-updates/geographic-seroprevalence-surveys.html (assessed: Aug 24, 2020) [Google Scholar]
  • 37.Kermack W.O., McKendrick G. Contributions to the mathematical theory of epidemics, part i. Proc. R. Soc. Lond. Ser. A. 1927;115:700–721. [Google Scholar]
  • 38.Aron J.L., Schwartz I.B. Seasonality and period-doubling bifurcation in an epidemic model. J. Theoret. Biol. 1984;110:665–679. doi: 10.1016/s0022-5193(84)80150-2. [DOI] [PubMed] [Google Scholar]
  • 39.Kuhl E. Data-driven modeling of COVID-19 - lessons learned. Extreme Mech. Lett. 2020 doi: 10.1016/j.eml.2020.100921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.S.W. Park, D.M. Cornforth, J. Dushoff, J.W. Weitz, The time scale of asymptomatic transmission affects estimates of epidemic potential in the COVID-19 outbreak. medRxiv 2020.03.09.20033514, 10.1101/2020.03.09.20033514. [DOI] [PMC free article] [PubMed]
  • 41.B. Tang, F. Xia, N.L. Bragazzi, Z. McCarthy, X. Wang, S. He, X. Sun, S. Tang, Y. Xiao, J. Wu, Lessons drawn from China and South Korea for managing COVID-19 epidemic: insights from a comparative modeling study, Bull. World Health Organ., 10.2471/BLT.20.257238. [DOI] [PMC free article] [PubMed]
  • 42.Gatto M., Bertuzzo E., Mari L., Miccoli S., Carraro L., Casagrandi R., Rinaldo A. Spread and dynamics of the COVID-19 epidemic in Italy: Effects of emergency containment measures. Proc. Natl. Acad. Sci. 2020 doi: 10.1073/pnas.2004978117. (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Peirlinck M., Linka K., Sahli Costabal F., Kuhl E. Outbreak dynamics of COVID-19 in China and the United States. Biomech. Model. Mechanobiol. 2020 doi: 10.1007/s10237-020-01332-5. online first: [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Linka K., Peirlinck M., Kuhl E. The reproduction number of COVID-19 and its correlation with public health interventions. Comput. Mech. 2020 doi: 10.1007/s00466-020-01880-8. online first: [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Linka K., Peirlinck M., Sahli Costabal F., Kuhl E. Outbreak dynamics of COVID-19 in europe and the effect of travel restrictions. Comput. Methods Biomech. Biomed. Eng. 2020;23:710–717. doi: 10.1080/10255842.2020.1759560. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Linka K., Rahman P., Goriely A., Kuhl E. Is it safe to lift COVID-19 travel restrictions? The newfoundland story. Comput. Mech. 2020 doi: 10.1007/s00466-020-01899-x. online first: [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Zohdi T.I. An agent-based computational framework for simulation of competing hostile planet-wide populations. Comput. Methods Appl. Mech. Engrg. 2017;314:513–526. [Google Scholar]
  • 48.Zohdi T.I. An agent-based computational framework for simulation of global pandemic and social response on planet x. Comput. Mech. 2020 doi: 10.1007/s00466-020-01886-2. (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Hethcote H.W. The mathematics of infectious diseases. SIAM Rev. 2000;42:599–653. [Google Scholar]
  • 50.Engbert R., Drepper F.R. Chance and chaos in population biology–models of recurrent epidemics and food chain dynamics. Chaos Solutions Fractals. 1994;4:1147–1169. [Google Scholar]
  • 51.Painel de casos de doença pelo coronavírus . 2019. (COVID-19) no brasil pelo ministerio da saude. www.covid.saude.gov.br/ (assessed: Jun 15, 2020) [Google Scholar]
  • 52.European Centre for Disease Prevention and Control . 2020. Situation update worldwide. www.ecdc.europa.eu/en/geographical-dis-tribution-2019-ncov-cases(assessed: Jun 15, 2020) [Google Scholar]
  • 53.Heinsberg R. Coronavirus im Kreis Heinsberg; 2020. Aktuelles Aus Dem Kreishaus. www.kreis-heinsberg.de/aktuelles (assessed: Jun 15, 2020) [Google Scholar]
  • 54.New York City Health Department . 2020. NYC Coronavirus disease 2019 (COVID-19) data. www.github.com/nychealth/coronavirus-data (assessed: Jun 15, 2020) [Google Scholar]
  • 55.New York Times . 2020. An ongoing repository of data on coronavirus cases and deaths in the U.S. www.github.com/nytimes/covid-19-data (assessed: Jun 15, 2020) [Google Scholar]
  • 56.2020. Swiss federal office of public health FOPH covid19 case numbers cantons of switzerland and principality of Liechtenstein. www.github.com/openZH/covid19/ (assessed: Jun 15, 2020) [Google Scholar]
  • 57.He X., Lau E.H., Wu P., Deng X., Wang J., Hao X., Lau Y.C., Wong J.Y., Guan Y., Tan X., Mo X. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nat. Med. 2020;26(5):672–675. doi: 10.1038/s41591-020-0869-5. [DOI] [PubMed] [Google Scholar]
  • 58.Lauer S.A., Grantz K.H., Bi Q., Jones F.K., Zheng Q., Meredith H.R., Azman A.S., Reich N.G., Lessler J. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Ann. Intern. Med. 2019;(2020) doi: 10.7326/M20-0504. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Li Q., Guan X., Wu P., Wang X., Zhou L., Tong Y., Ren R., Leung K.S.M., Lau E.H.Y., Wong J.Y., Xing X., Xiang N., Wu Y., Li C., Chen Q., Li D., Liu T., Zhao J., Liu W. Tu M., Chen C., Jin L., Yang R., Wang Q., Zhou S., Wang R., Liu H., Luo Y., Liu Y., Shao G., Li H., Tao Z., Yang Y., Deng Z., Liu B., Ma Z., Zhang Y., Shi G., Lam T.T.Y., Wu J.T., Gao G.F., Cowling B.J., Yang B., Leung G.M., Feng Z. Early transmission dynamics in wuhan, China, of novel coronavirus-infected pneumonia. New Engl. J. Med. 2020;382:1199–1207. doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Sanche S., Lin Y.T., Xu C., Romero-Severson R., Hengartner N., Ke R. High contagiousness and rapid spread of severe acute respiratory syndrome coronavirus 2. Emerg. Infect. Dis. 2020 doi: 10.3201/eid2607.200282. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Lima E.A.B.F., Oden J.T., Wohlmuth B., Shahmoradi A., Hormuth II D.A., Yankeelov T.E., Scaraboso L., Horger T. Selection and validation of predictive models of radiation effects on tumor growth based on noninvasive imaging data. Comput. Methods Appl. Mech. Engrg. 2017;327:277–305. doi: 10.1016/j.cma.2017.08.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Oden J.T., Babuska I., Faghihi D. Predictive computational science: Computer predictions in the presence of uncertainty. Encyclopedia Comput. Mech. 2017 doi: 10.1002/9781119176817.ecm2101. [DOI] [Google Scholar]
  • 63.Peirlinck M., Sahli Costabal F., Sack K.L., Choy J.S., Kassab G.S., Guccione J.M., De Beule M., Segers P., Kuhl E. Using machine learning to characterize heart failure across the scales. Biomech. Model. Mechanobiol. 2019;18:1987–2001. doi: 10.1007/s10237-019-01190-w. [DOI] [PubMed] [Google Scholar]
  • 64.Sahli Costabal F., Matsuno K., Yao J., Perdikaris P., Kuhl E. Machine learning in drug development: Characterizing the effect of 30 drugs on the QT interval using Gaussian process regression, sensitivity analysis, and uncertainty quantification. Comput. Methods Appl. Mech. Engrg. 2019;348:313–333. doi: 10.1016/j.cma.2019.01.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Oden J.T. Adaptive multiscale predictive modelling. Acta Numer. 2018:353–450. [Google Scholar]
  • 66.Dehning J., Zierenberg J., Spitzner F.P., Wibral M., Neto J.P., Wilczek M., Priesemann V. Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions. Science. 2020 doi: 10.1126/science.abb9789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Lange K.L., Little R.J.A., Taylor M.G. Robust statistical modeling using the t distribution. J. Amer. Statist. Assoc. 1989;84:881–896. [Google Scholar]
  • 68.Jha P.K., Cao L., Oden J.T. Bayesian-based predictions of COVID-19 evolution in texas using multispecies mixture-theoretic continuum models. Comput. Mech. 2020 doi: 10.1007/s00466-020-01889-z. (in press) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Hoffman M.D., Gelman A. The No-U-Turn sampler: adaptively setting path lengths in Hamiltonian Monte Carlo. J. Mach. Learn. Res. 2014;15(1):1593–1623. [Google Scholar]
  • 70.Salvatier J., Wiecki T.V., Fonnesbeck C. Probabilistic programming in python using PyMC3. Peer J. Comput. Sci. 2016;2 [Google Scholar]
  • 71.Oden J.T., Moser R., Ghattas O. Computer predictions with quantified uncertainty I. SIAM News. 2010;43:9. [Google Scholar]
  • 72.Gelman A., Hill J. Cambridge University Press; 2006. Data Analysis using Regression and Multilevel/Hierarchical Models. [Google Scholar]
  • 73.Flaxman S., Mishra S., Gandy A., Unwin H., Coupland H., Mellan T., Zhu H., Berah T., Eaton J., Perez Guzman P., Schmit N., Cilloni L., Ainslie K., Baguelin M., Blake I., Boonyasiri A., Boyd O., Cattarino L., Ciavarella C., Cooper L., Cucunuba Perez Z., Cuomo-Dannenburg G., Dighe A., Djaafara A., Dorigatti I., Van Elsland S., Fitzjohn R., Fu H., Gaythorpe K., Geidelberg L., Grassly N., Green W., Hallett T., Hamlet A., Hinsley W., Jeffrey B., Jorgensen D., Knock E., Laydon D., Nedjati Gilani G., Nouvellet P., Parag K., Siveroni I., Thompson H., Verity R., Volz E., Walters C., Wang H., Wang Y., Watson O., Winskill P., Xi X., Whittaker C., Walker P., Ghani A., Donnelly C., Riley S., Okell L., Vollmer M., Ferguson N., S . Spiral Imperial College London; 2020. Bhatt Report 13: Estimating the Number of Infections and the Impact of Non-Pharmaceutical Interventions on COVID-19 in 11 European Countries. [DOI] [Google Scholar]
  • 74.2020. Santa clara county COVID-19 cases and hospitalizations dashboard. www.sccgov.org (assessed: Jun 15, 2020) [Google Scholar]
  • 75.Gao F., Han L. Implementing the nelder–mead simplex algorithm with adaptive parameters. Comput. Optim. Appl. 2012;51:259–277. [Google Scholar]
  • 76.A. Viguerie, G. Lorenzo, F. Auricchio, D. Baroli, T.J.R. Hughes, A. Patton, A. Reali, T.E. Yankeelov, A. Veneziani, Simulating the spread of COVID-19 via a spatially-resolved SEIRD model with heterogeneous diffusion. Oden Institute Report 20-09, Austin. [DOI] [PMC free article] [PubMed]
  • 77.Lee S., Kim T., Lee E., Lee C., Kim H., Rhee H., Park S.Y., Son H., Yu S., Park J.W., Choo E.J., Park S., Loeb M., Kim T.H. Clinical course and molecular viral shedding among asymptomatic and symptomatic patients with SARS-cov-2 infection in a community treatment center in the republic of Korea. JAMA Intern. Med. 2020 doi: 10.1001/jamainternmed.2020.3862. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.K. Linka, A. Goriely, E. Kuhl, Global and local mobility as a barometer for COVID-19 dynamics, medRxiv (2020) 10.1101/2020.06.13.20130658. [DOI] [PMC free article] [PubMed]
  • 79.S.W. Park, K. Sun, C. Viboud, B.T. Grenfell, J. Dushoff, Potential roles of social distancing in mitigating the spread of coronavirus disease 2019 (COVID-19) in South Korea. medRxiv 2020.03.27.20045815, 10.1101/2020.03.27.20045815. [DOI] [PMC free article] [PubMed]
  • 80.World Health Organization . 2020. Transmission of SARS-cov-2: implications for infection prevention precautions. https://www.who.int/news-room/commentaries/detail/transmission-of-sars-cov-2-implications-for-infection-prevention-precau-tions (assessed: July 9, 2020) [Google Scholar]
  • 81.A. Grifoni, D. Weiskopf, S.I. Ramirez, J. Mateus, J.M. Dan, C. Rydyznski Moderbacher, S.A. Rawlings, A. Sutherland, R.S. Premkumar, L. Jadi, D. Marrama, A.M. de Silva, A. Frazier, A. Carlin, J.A. Greenbaum, B. Peters. F. Krammer. D.M. Smith, S. Crotty, A. Sette, Targets of T cell responses to SARS-CoV-2 coronavirus in humans with COVID-19 disease and unexposed individuals. Cell (2020) 10.1016/j.cell.2020.05.015. [DOI] [PMC free article] [PubMed]
  • 82.Allday E., Kawahara M. San Francisco Chronicle; 2020. First Known U.S. Coronavirus Death Occurred on Feb. 6 in Santa Clara County. https://www.sfchronicle.com/health/article/First-known-U-S-coronavirus-death-occurred-on-15217316.php (assessed: Jun 15, 2020) [Google Scholar]

Articles from Computer Methods in Applied Mechanics and Engineering are provided here courtesy of Elsevier

RESOURCES