Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

Research Square logoLink to Research Square
[Preprint]. 2020 Jul 7:rs.3.rs-40364. [Version 1] doi: 10.21203/rs.3.rs-40364/v1

State-level impact of social distancing and testing on COVID-19 in the United States

Weihsueh A Chiu 1,, Rebecca Fischer 2, Martial L Ndeffo-Mbah 1,2,
PMCID: PMC7362894  PMID: 32702727

Abstract

Social distancing measures have been implemented in the United States (US) since March 2020, to mitigate the spread of SARS-CoV-2, the causative agent of COVID-19. However, by mid-May most states began relaxing these measures to support the resumption of economic activity, even as disease incidence continued to increase in many states. To evaluate the impact of relaxing social distancing restrictions on COVID-19 dynamics and control in the US, we developed a transmission dynamic model and calibrated it to US state-level COVID-19 cases and deaths from March to June 20th, 2020, using Bayesian methods. We used this model to evaluate the impact of reopening, social distancing, testing, contact tracing, and case isolation on the COVID-19 epidemic in each state. We found that using stay-at-home orders, most states were able to curtail their COVID-19 epidemic curve by reducing and achieving an effective reproductive number below 1. But by June 20th, 2020, only 19 states and the District of Columbia were on track to curtail their epidemic curve with a 75% confidence, at current levels of reopening. Of the remaining 31 states, 24 may have to double their current testing and/or contact tracing rate to curtail their epidemic curve, and seven need to further restrict social contact by 25% in addition to doubling their testing and contact tracing rates. When social distancing restrictions are being eased, greater state-level testing and contact tracing capacity remains paramount for mitigating the risk of large-scale increases in cases and deaths.

Keywords: COVID-19, social distancing, testing, contact tracing, mathematical modeling, Bayesian analysis


The novel coronavirus pandemic (COVID-19) emerged in Wuhan, China in December 2019 and has now reached pandemic status, with spread to more than 210 countries and territories, including the United States (US) 1. The US reported its first imported case of COVID-19 on January 20, 2020, arriving via an international flight from China 2. Since then, the disease has spread rapidly within the US, with every state reporting confirmed cases within three weeks of the first reported community transmission. As of June 15th, the US has exceeded 2.1 million cases and 115,000 deaths, heterogeneously distributed across all states 1. So far, states such as New York and New Jersey have borne the highest burden with more than 379,000 cases and 30,000 deaths and 166,000 cases and 12,000 deaths, respectively, while Montana and Alaska have each reported less than 700 cases and 20 deaths each 1.

COVID-19 is caused by a newly described and highly transmissible SARS-like coronavirus (SARS-CoV-2). Severe clinical outcomes have been observed with approximately 20% of symptomatic cases 3,4. There is no vaccine and no cure or approved pharmaceutical intervention for this disease, making the fight against the pandemic reliant on non-pharmaceutical interventions (NPIs). These NPIs include: case-driven measures such as testing, contact tracing, and isolation 5; personal preventive measures such as hand hygiene, cough etiquette, face mask use, eye protection, physical distancing, and surface cleaning, which aim to reduce the risk of transmission during contact with potentially-infectious individuals 6; and social distancing measures to reduce interpersonal contact in the population. In the US, social distancing measures have included policies and guidelines to close schools and workplaces, cancel and restrict mass gatherings and group events, restrict travel, maintain physical separation from others (e.g. keeping six feet distance), and stay-at-home orders 7.

NPIs and other responses to COVID-19, especially stay-at-home orders, have varied widely across states, leading to spatial and temporal variation in the timing and implementation of mitigation strategies. This variation in policies and response efforts may have contributed to the observed heterogeneity in COVID-19 morbidity and mortality across states 8. Recent studies suggest that statewide social distancing measures have likely contributed to reducing the spread COVID-19 epidemic in the US 9,10. Understanding the extent to which NPIs, such as social distance, testing, contact tracing, and self-quarantine, influence COVID-19 transmission in a local context is pivotal for predicting the future course of the epidemic on a state-by-state basis. This in turn will inform how these NPIs should be optimized to mitigate the spread and burden of COVID-19 while awaiting development of pharmaceutical interventions (e.g. therapeutics and vaccines).

After several weeks of statewide stay-at-home orders, most US states have begun to ease their social distancing requirements 11, while attempting to increase their testing and contact tracing capacities 12. Mathematical modeling is a unique tool to help answer these important and timely questions. Models can contribute valuable insight for public health decision-makers by providing an evaluation of the effectiveness of ongoing control strategies along with predictions of the potential impact of various policy scenarios 13.

To address these needs, we developed and validated a data-driven transmission dynamic model to evaluate the impact of social distancing, state-reopening, testing, and contact tracing on the state-level dynamics of COVID-19 infections and mortality in the US. We evaluated the transmissibility of COVID-19 in each state from March, 2020 to early June, 2020, to estimate the state-level impact of shelter-in-place and reopening on COVID-19 transmission. Finally, we evaluated the degree to which increasing testing efforts (rate of identification of infected cases) and/or contact tracing could curtail the spread of the diseases and enable greater relaxation of social distancing restrictions while preventing a resurgence of infections and deaths.

Results

Model performance and validation

We fit our model to state-level daily cases and deaths data using a Bayesian inference approach (see Online Methods). Model performance assessment for several representative states are shown in Figure 1, with full results in Figures S3 and S4. With respect to validation, the posterior 95% credible interval of our model projections, estimated using data through April 30th, 2020, covered 78% of the data points from May 1st through June 20th, 2020. Model performance for fitting all data through June 20th is shown in Figures S5-S7.

Figure 1.

Figure 1.

SEIR model structure, parameter, data sources, and fitting/validation methods. We fitted the model to daily reported cases and confirmed deaths from March 19th to April 30th and validated its projections against data from May 1st to June 20th. On the model projections, the black solid line is the median, the pink band is the 95% credible interval (CrI) and the orange is the inter-quartile range (IQR). We show model fitting and validation for four states: New York (NY), Ohio (OH), Texas (TX), and Washington (WA).

Estimations of effective reproduction number

Using the posterior distribution of our model parameters we estimated the effective reproduction number Reff from March 19th to June 20th, 2020 and identified the minimum level of transmission achieved in each state (Figure 2A). We found that for all, except two states (Arkansas and Utah), the minimum Reff value was less than 1 and these values were mainly achieved during the state shelter-in-place (Figure 2A). On June 20th, 2020, 27 states had at least a 0.5 probability that Reff>1. Thus, the model predicts that as states are reopening, a majority of states are at risk of continued increases in the scale of the outbreak and require additional mitigation to contain the spread of the disease.

Figure 2.

Figure 2.

Estimated effective reproduction number Reff and the level of reopening/rebound in transmission as of June 20th, 2020 for all states. (A) shows estimated Reff (median, IQR, and 95% CrI) across States. The figure shows that “now” (value on June 20th, 2020) and the “minimum” (between March 19th, 2020 and June 20th, 2020) in lighter shades of each color. It also includes the date of the minimum Reff. (B) shows the level of reopening/rebound in disease transmission in each state relative to its minimum value during state shelter-in-place (median, IQR, and 95% CrI).

We conducted an analysis of variance to evaluate the contribution of each parameter to the variation in Reff value (Table S1). Across states, we found that the largest drivers of variation in Reff are the power parameter for social distancing, η, the maximum relative increase in contact after shelter-in-place orders, rmax, and the fraction of contact traced, fC, which together contribute over 60% of variance (Figure S8). This observation is consistent with mobility data alone being insufficient to account for the combined effect of multiple control measures, and suggest that the degree of adoption of non-mobility-related measures, such as enhanced hygiene practices and contact tracing, play a large role in the extent to which a state may reduce disease transmission.

For each state, we also estimated the current level of reopening/rebound in disease transmission relative to its lowest transmission rate observed during shelter-in-place (Figure 2B). We found that only nine states had a 50% or more rebound in COVID-19 transmission by June 20th, 2020 while eight states had a 25% or less rebound in transmission (Figure 2B).

Impact of testing and contact tracing on easing of social distancing

Bringing and keeping the effective reproduction number, Reff, below 1 is necessary and sufficient to curtail the spread of an outbreak. We evaluated the probability of keeping Reff<1 for different levels of testing and contact tracing under the June 20th, 2020 level of state reopening. We found that for 12 states and the District of Columbia have at least 0.975 probability of keeping Reff<1, and 22 states have less than 0.025 probability of bringing and keeping Reff<1, under their current level of testing and contact tracing (Figure S9). We found that for most states bringing and keeping Reff<1 may not be possible without increase contact tracing efforts as increasing testing and isolation alone would be sufficient or require extremely high coverage to curtail the epidemic curve with a 0.975 probability (Figure S9).

To evaluate the impact of scaling up testing and contact tracing on the epidemic dynamics in each state, we assumed a linear “ramp-up” of either testing and/or contact tracing from July 1th – 15th, 2020, after which both parameters remain constant. We then predicted the daily number of cases and deaths (Figures 3 and S10). We found that under current levels of reopening and control, at least 26 states would see a continuous increase in cases and deaths (Figure S10). Even with increased testing and contact tracing, some of these states will still experience a short-term increase in cases and deaths (Figures 3 and S10). For example, Ohio, Texas, and Washington may experience a substantial short-term increase of cases and deaths even if their current testing and contact tracing rate were doubled within the next two weeks (Figure 3B-D). Moreover, reported cases may slightly increase during the “ramp-up” period (Figure 3). We also found that in most states additional relaxation of restrictions without simultaneously increasing contact tracing may exacerbate disease dynamics and result in large-scale outbreaks (Figure S10).

Figure 3.

Figure 3.

Predicted time-course (median, IQR, and 95% CrI) of daily reported cases and deaths under different testing and contact tracing rates (1X and 2X) in New York (A), Ohio (B), Texas (C), and Washington State (D).

We next evaluated the maximal degree of rebound in transmission (i.e., level of reopening) permitted while keeping Reff<1 under different testing and contact tracing scenarios (Figure 4). We found that under the current level of testing and contact tracing rate, 36 states cannot keep their Reff<1 even with only 25% reopening/rebound in transmission (Figure 4A). By doubling the current testing rate, four states (Connecticut, North Dakota, Nebraska, Rhode Island) could keep their Reff<1 even with a 75% level of reopening (Figure 4B). By doubling contact tracing, five states (Connecticut, Delaware, Maryland, Pennsylvania, Rhode Island) could remove all mobility restrictions while keeping Reff<1 (Figure 4C). By doubling both testing rate and contact tracing, 12 states could remove all mobility restrictions while keeping Reff<1 (Figure 4D).

Figure 4.

Figure 4.

Reopening/rebound in transmission permitted (0 = minimum shelter-in-place value, 1 = return to no restrictions) to keep Reff < 1 if (A) testing and contact rates are unchanged, (B) testing rate is doubled, (C) contact tracing is doubled, or (D) both testing and contact tracing are doubled. ∆(t) the level of reopening/rebound in transmission on June 20th, 2020 is shown by the circle. All boxplots show median, IQR and 95% CrI.

We categorized states by the additional amount of mitigation efforts needed to keep Reff < 1 with at least 75% confidence (Figures 5 and S10). We found that under current control efforts, three states (Connecticut, North Dakota, Rhode Island) could continue to curtail their epidemic curve even with an additional 25% reopening (“None” category), and that an additional 16 states and the District of Columbia could curtail their epidemic curve without additional reopening (“Low” category). 13 states could curtail their epidemic curve by doubling their current testing or contact tracing rate (“Moderate” category), while for 11 states by doubling both testing and contact tracing are need (“High” category). The remaining seven states (Alaska, Hawaii, Georgia, Florida, Oklahoma, Oregon, Montana) require not only doubling both testing and contact tracing, but also additional social distancing restrictions, in order to curtail their epidemic curve (“Very High” Category).

Figure 5.

Figure 5.

State-specific level of mitigation needed as a June 20, 2020 to curtail the spread of COVID-19 (keeping Reff < 1 with at least 75% confidence, equivalent to the upper bound of the Interquartile range (IQR)).

Discussion

There is a delicate and continuous balance to strike between the use of social distancing measures to mitigate the spread of an emerging and deadly disease such as COVID-19 and the need for re/opening various sectors of activities for the social, economic, mental, and physical well-being of a community. To address this issue, it is imperative to design measurable, data-driven, and flexible milestones for identifying when to make specific transitions with regard to easing or retightening specific social distancing measures. We developed a data-driven SARS-CoV-2 transmission dynamic model not only to make short-term predictions on COVID-19 incidence and mortality in the US, but more importantly to evaluate the impact that relaxing social distancing measures and increasing testing and contact tracing would have on the epidemic in each state.

We showed that in most states, control strategies implemented during their “shelter-in-place” period were sufficient to contain the outbreak, defined as reducing and ultimately maintaining the effective reproductive number below 1 (Reff<1). However, for the majority of states, our modelling suggests that “reopening” has proceeded too rapidly and/or without adequate testing and contact tracing to prevent a resurgence of the epidemic. Even in states with currently decreasing incidence and mortality, such as New York and New Jersey, additional relaxation of restrictions is likely to “bend the epidemic curve upwards.” However, our model predicts that a combination of increased testing, increased contact tracing, and/or scaling back reopening will be sufficient for curtailing the spread of COVID-19. Specifically, doubling of current testing and contact tracing rates would enable the vast majority of states to either maintain or increase the easing of social distancing restrictions in a “safe” manner. Increasing testing and contact tracing rates entails both increasing the number of tests performed per day as well as requiring early identification and isolation of COVID-19. This can be accomplished through active case detection via efficient contact tracing strategies. However, it should also be noted that increased testing and contact tracing will lead to a short-term increase in reported cases because a larger fraction of the infected population is being observed, and that several weeks may pass before these rates begin to show a decline. It is therefore imperative that policymakers and the public recognize that such a surge is actually a sign that testing and tracing efforts are succeeding, and to have the patience to wait several weeks before these successes are reflected as declining rates of reported cases.

Like all modeling studies, our study has several limitations due to modelling assumptions and the quality of available data. The initiation of social distancing measures, such as stay-at-home orders in the US, for mitigating the spread of COVID-19 has occurred concurrently with increased promotion and application of other NPIs such as hygiene practices (e.g. hand hygiene, surface cleaning, cough etiquette, and wearing of face mask). These hygiene practices coupled with the avoidance of physical contact whenever possible (keeping six feet apart) could impact the spread of COVID-19 by reducing both the risk of exposure and the risk of transmission of SARS-CoV-2 from infected patients 14,15. Though our model explicitly accounts for the differential contribution of social distancing (mobility reduction) versus hygiene practices and physical distancing to reducing COVID-19 transmission, we assume that the impact of hygiene practices and physical distancing was a function of social distancing (mobility reduction). While cell phone mobility data may continue to accurately reflect the contact rates, the impact of enhanced hygiene practices is more difficult to measure independently. As several states are easing their social distancing requirements, especially their stay-at-home orders, compliance with hygiene practices would become even more important for reducing individuals’ risk of getting or transmitting the pathogen. However, keeping a high population-level adherence to these measures is required to mitigate the spread of the COVID-19 epidemic in a city, state, or nation 16. As states are reopening various aspects of their economy, data on compliance with enhance hygiene practices and physical distancing are needed to improve the estimation of these measures’ population-level impact on reducing disease transmission.

Additionally, consistent with previous COVID-19 modeling studies 17-19, our model uses a simple functional form to model increases in testing rate from early March to June, 2020. This testing rate was estimated through model fitting to daily reported case and mortality data. Particularly in states that have seen a substantial increase in testing capability and efforts during the month of May, our simple time varying assumption may underestimate the current level of testing and contact tracing. However, it should be noted that increased testing capacity does not necessarily lead to increased rate of testing if individuals are unaware, unwilling, or unable to be tested 20. Having contact tracing and date of symptoms onset data would enable us to compute a better estimate the current testing and contact tracing rate in each state. Our also model assumes that all individuals who test positive to COVID-19 are effectively isolated for the rest for their infectious period and no longer contribute to disease transmission. Though voluntary compliance to COVID-19 self-quarantine recommendations may be high across the US, it is likely not 100%. Therefore, the assumption of effective isolation of all identified cases may cause our model to slightly overestimate the impact of increase testing rate on disease dynamics. However, we anticipate that this assumption would only have a marginal impact on the qualitative nature of our results. Finally, our model does not explicitly account for age-stratified risk of disease transmission and mortality. This age-stratification is important for designing and evaluating social distancing and testing strategies that are targeted towards the elderly population which are at higher risk of COVID-19-induced hospitalization and death 21. As reopening the economy becomes an imperative for states across the US, age- or risk-targeted interventions may be a valuable tool to mitigate the burden of the pandemic. Future modeling studies could investigate the effectiveness of age- or risk-targeted non-pharmaceutical and potential pharmaceutical (vaccine or therapeutic) interventions for controlling the spread and burden of COVID-19.

In sum, we use a data-driven mathematical modeling approach to study the impacts of social distancing, testing, and contact tracing on the transmission dynamics of SARS-CoV-2. Our findings emphasize the importance for public health authorities not only to monitor the case and mortality dynamics of SARS-CoV-2 in their state, but also to understand the impact of their existing social distancing measures on SARS-CoV-2 transmission and evaluate the effectiveness of their testing and contact tracing programs for promptly identifying and isolating new cases of COVID-19. As reported case rates are increasing widely across US states because social distancing restrictions have been eased to allow more economic activity to resume, we find that most states need to either significantly scale back reopening or enhance their capacity and scale of testing, case isolation, and contact tracing programs in order to prevent large-scale increases in COVID-19 cases and deaths.

Online Methods

Our overall approach is as follows: 1) develop a mathematical model (an SEIR-type compartmental model) that incorporates social distancing data, case identification via testing, isolation of detected cases, and contact tracing; 2) assess the model’s predictive performance by training (calibrating) it to reported cases and mortality data from March 19th to April 30th, 2020 and validating its predictions against data from May 1st to June 20th, 2020; and 3) use the model, trained on data through June 20th, 2020, to predict future incidence and mortality. The final stage of our approach predicts future events under a set of scenarios that include increased case detection though expanded testing rate, contact tracing, and relaxation or increase of measures to promote social distancing. All model fitting is performed in a Bayesian framework in order to incorporate available prior information and address multivariate uncertainty in model parameters.

Model formulation

Our model is illustrated in Figure 1, with parameters and prior distributions listed in Table 1. We modified the standard SEIR model to address testing and contact tracing. In our model formulation I class also includes infectious pre-symptomatic individuals. With respect to testing, separate compartments were added for untested, “freely roaming” infected individuals (IU), tested/isolated cases IT, fatalities FT. In balancing considerations of model fidelity and parameter identifiability, we made the reasonably conservative assumptions that all tested cases are effectively isolated (through self-quarantine or hospitalization) and thus unavailable for transmission, and that all COVID-related deaths are identified/tested.

Table 1.

Model inputs, parameters and prior distributions for Bayesian analysis.

Symbol Definition (units) Sampled parameter(s) Prior [Truncation] Notes
Pop Population size Input (not sampled) Constant 31
Ninit Initial IU on 2020-02-29 Ninit LogN(1000, 10) [1, 10000]
1/α Self-isolation time after contact tracing Tisolation = 1/α LogN(14, 2) [7, 21]
1/κ Latent period (d) Tlatent = 1/κ N(4,1) [2,7] 32,33
c0 Baseline contact rate (contacts d−1) c0 N(13, 5) [7, 20] 34
ρ Recovery rate (d−1) Trecover = 1/ρ LogN(10, 1.5) [5, 30] 33,35
β0 Transmission rate (d−1) R0 = c0β0 N(2.9, 0.78) [1.46, 4.5] 36-38
fC Fraction of contacts traced (unitless) fC LogN(0.25, 2) [0.15, 1] 39
TT Date of startup of testing (d) TT N(70, 10) [60, 90]
λ General positive diagnosis rate (d−1) λ = Ftest Senstest ktest Derived 36,40,41
Ftest General test coverage (unitless) Ftest N(0.5, 0.2) [0.2, 0.8] 36,40,41
Senstest Test sensitivity (unitless) Senstest N(0.7, 0.1) [0.6, 0.95] 42
ktest General testing rate (d−1) τtest = 1/ktest N(7, 3) [2, 12] 43,44
λC Contacts positive diagnosis rate (d−1) λC = Senstest ktest,C Derived
kC,test Contacts testing rate (d−1 τC,test = 1/kC,test N(2, 1) [1, 3]
ρC Rate of infected contacts testing negative (d−1) ρC = (1 – Senstest) ktest,C Derived
δ Fatal illness rate (d−1) IFR (infected fatality rate)* LogN(0.01, 2) [0.001, 0.1] 35,45
θmin Minimum of θ(t) θmin Validation: Beta(2,2) Calibration: State-specific
ƣ
τθ Weibull scale parameter τθ Validation: N(21, 7) [7, 35] Calibration: State-specific
ƣ
nθ Weibull shape parameter nθ Validation: LogN(6, 2) [1,11] Calibration: State-specific
ƣ
η Hygiene effectiveness relative to social distancing (unitless) η Beta(2,2)
τs Duration of shelter in place (d) τs Validation: N(30, 30) [0, 90] Calibration: State-specific 46
τr Duration of linear increase after shelter-in-place (d) τr Validation: N(45, 30) [0, 105] Calibration: State-specific
ƣ
rmax Maximum relative increase in contacts from shelter-in-place (unitless) rmax Validation: Beta(2,2) Calibration: State-specific
ƣ
τcase Lag time for observing confirmed case τcase LogN(7, 2) [1, 14]
τdeath Lag time for observing confirmed death τdeath LogN(7, 2) [1, 14]
αpos Negative Binomial shape parameter for cases likelihood function αpos LogU(4, 40)
αdeath Negative Binomial shape parameter for deaths likelihood function αdeath LogU(8, 40)

LogN(GM, GSD) = lognormal distribution with geometric mean GM and geometric standard deviation GSD

N(M,SD) = normal distribution with mean M and standard deviation SD

U(MIN,MAX) = uniform distribution with minimum MIN and maximum MAX

LogU(MIN, MAX) = log-uniform distribution with minimum MIN and maximum MAX

Time (t) is measured from t=1 corresponds to 2020-01-01.

Assumed, non-informative prior.

Standard contact tracing guidance is to self-isolate for 2 weeks.

Ƣ

For calibration to 6/20/20, state-specific priors were derived by fitting to different social distancing data sets, with each parameter’s mean, standard deviation, and range used to define a normal distribution prior.

*

See Methods for relationship between IFR and δ.

With respect to contact tracing, the additional compartment SC represents unexposed contacts, who undergo a period of isolation during which they are not susceptible before returning to S; while EC and IC represent contacts who were exposed. Again, the reasonably conservative assumption was made that all exposed contacts undergo testing, with an accelerated testing rate compared to the general population. We assume a closed population of constant size N for each state.

The ordinary differential equations governing our model are as follows:

dSdt=Sc[β+(1β)fc]IUN+ScαdScdt=Scα+Sc(1β)fcIUNdEdt=Eκ+Scβ(1fc)IUNdEcdt=Ecκ+ScβfcIUNdIUdt=IU(λ+ρ)+EκdIcdt=Ic(λc+ρc)+EcκdRUdt=IUρ+IcρcdITdt=IT(ρ+δ)+IUλ+IcλcdRTdt=ITρdFTdt=ITδ

The testing rates λ and λC, the fatality rate δ, and the recovery rate of traced contacts ρC are each composites of several underlying parameters. The testing rate defined as

λ(t)=Ftest,0[111+e(tTT)τT]Senstestktest,

where Ftest,0 is the current testing coverage (fraction of infected individuals tested), Senstest is the test sensitivity (true positive rate), and ktest is rate of testing for those tested, with a typical time-to-test equal to 1/ktest. The time-dependence term models the “ramp-up” of testing using a logistic function with a growth rate of 1/τT days−1, where TT is the time where 50% of the current testing rate is achieved. Similarly, for testing of traced contacts, the same definition is used with the assumption that all identified contacts are tested, Ftest,0 = 1 and at a faster assumed testing rate kC,test:

λc(t)=[111+e(tTT)τT]Senstestkc,test,

Because all contacts are assumed to be tested, the rate ρC at which they enter the “recovered” compartment RU is simply the rate of false negative test results:

ρc(t)=[111+e(tTT)τT](1Senstest)ktest

The fatality rate is adjusted to maintain consistency with the assumption that all COVID-19 deaths are identified, assuming a constant infected fatality rate (IFR). Specifically, we first calculated the fraction of infected that are tested and positive

fpos(t)=fcλc(t)λc(t)+ρc(t)+(1fc)λ(t)λ(t)+ρ.

Where fC is the fraction of contact identified through contact tracing.

Then the case fatality rate CFR(t) = IFR/fpos(t). Because the CFR = δ/(δ + ρ), this implies

δ(t)=ρCFR(t)1CFR(t)=ρIFRfpos(t)IFR.

The model is “seeded” Ninitial cases on February 29, 2020. Because in the early stages of the outbreak, there may be multiple “imported” cases, we only fit to data from March 19, 2020 onwards, one week after the U.S. travel ban was put in place 22.

Our model is fit to daily case yc and death yd data (cumulative data are not used for fitting because of autocorrelation). To adequately fit the case and mortality data, we accounted for two lag times. First, a lag is assumed between leaving the IU compartment and public reporting of a positive test result, accounting for the time it takes to seek a test, obtaining testing, and have the result reported. No lag is assumed for tests from contact tracing. Second, a lag time is assumed between entering the fatally ill compartment FT and publically reported deaths. Additionally, we use a negative binomial likelihood in order to account for the substantial day-to-day variation in reporting results. The corresponding equations are as follows:

yobs,[c,d](t)NegBin[α[c,d],p[c,d](t)]p[c,d](t)=ypred,[c,d](t)α[c,d]+ypred,[c,d](t)ypred,c(t)=IU(tτcase)λ(t)+Ic(t)λc(t)ypred,d(t)=IT(tτdeath)δ(t)

In this parameterization, as the shape parameter α → ∞, the likelihood becomes a Poisson distribution with expected value ypred,[c,d], whereas for small values of α there is substatial inter-individual variability. Case and death data were sourced from The COVID Tracking Project23.

Finally, we derived time-dependent the time-dependent and effective reproduction numbers in this model, given by

R(t)=cβ(1fc)λ+ρ

and

Reff(t)=R(t)S(t)N.

Incorporating social distancing, enhanced hygiene practices, and reopening

The impact of social distancing, hygiene practices, and reopening were modeled through a time-dependence in the contact rate c and the transmission probability per infected contact β:

c(t)=c0[θ(t)+(1θmin)r(t)]β(t)=β0θ(t)η

The θ(t) function parameterizes social distancing during the progression to shelter-in-place, and is modeled as a Weibull function

θ(t)=θmin+(1θmin)e(tτθ)nθ,

which starts a unity and decreases to θmin, with Tθ being Weibull scale parameter and nθ the Weibull shape parameter (Figure S1).

The r(t) function parameterizes relative increase in contacts after shelter-in-place, with r = 1 corresponding to a return to baseline c = c0.

r(t)=rmaxtτθτsτr[u(ttr)u(ttrmax)]+u(ttrmax)u(t)=Heaviside(t)111+e4ttr=τθ+τstrmax=τθ+τs+τr

The term r(t) is 0 before tr, linear between tr and trmax, and constant at a value of rmax after that, and made continuous by approximating the Heaviside function by a logistic function. The reopening time is defined as τs days after τθ, and the maximum relative increase in contacts rmax happens τr days after that.

We selected the functional form above for c(t) because it was found to be able to represent a wide variety of social distancing data, including cell phone mobility data from Unacast24 and Google 25, as well as restaurant booking data from OpenTable 26 . We used these different mobility sources to derive state-specific prior distributions because different social distancing datasets had different values for θmin, τθ, nθ, τS, rmax, and τR (Figure S2). With respect to the reduction in transmission probability β, we assumed that during the “shelter-in-place” phase, hygiene-based mitigation paralleled this decline with an effectiveness power η, and that this mitigation continued through re-opening.

Finally, we define an overall “reopening” parameter that measures the “rebound” in disease transmission c · β relative to its minimum, defined to be 0 during shelter-in-place (i.e., R(t) is at a minimum), and 1 when all restrictions are removed (when R(t) = R0), which can be derived as:

Δ(t)=cβ(c0β0)θmin1+η1θmin1+η.

Scenario evaluation

We used the model to make several inferences about the current and future course of the pandemic in each state. First, we consider the effective reproduction number. Two time points of particular interest are the time of minimum Reff, reflecting the degree to which shelter-in-place and other interventions were effective in reducing transmission, and the final time of the simulation, June 20, 2020, reflecting the extent to which reopening has increased Reff. Additional parameters of interest are the current levels of reopening (t), testing λ, and contact tracing fC.

We then conducted scenario-based prospective predictions using our model’s parameters as estimated through June 20, 2020. We asked the following questions:

  1. Assuming current levels of reopening, what increases in general testing λ and/or contact tracing fC would be necessary to bring Reff < 1?

  2. What amount reopening can maintain Reff < 1 under four different scenarios: current values of testing and contact tracing, doubling testing, double tracing, and doubling both testing and tracing?

  3. What will the rates of new cases and deaths be under different scenarios? Specifically, we evaluate the impact of increases in testing and contact tracing under current levels of reopening, as well as increases or decreases of 25%.

For (a), we evaluated the posterior probability that Reff < 1 under scaling transformations λλ · μλ and fCfC · μC with scaling factors μλ and μC:

Reff(t)=S(t)cβ(1μcfc)μλλ+ρ=S(t)c0β0(1μcfc)μλλ+ρ[Δ(1θmin1+η)+θmin1+η]

For (b), we fixed the scaling factors at 1 or 2, and solved the above equation for crit such that Reff < 1:

Δcrit(μλ,μc)=[μλλ+ρS(t)c0β0(1μcfc)θmin1+η]11θmin1+η

Values of crit(t) indicate the additional degree of reopening possible while maintaining Reff < 1, while values of crit < (t) indicate a reduction of reopening is needed.

Finally, for (c), we additionally evaluated changes in reopening + for values of +25% or −25%, for a total of 12 scenarios (4 different levels of testing and tracing, and 3 different levels of reopening). We then ran the SEIR model forward in time until August 31, 2020. For all three intervention parameters μC, μΛ, and , we assumed a “ramp-up” period of 2 weeks from July 1-July 14, 2020. To summarize the relative urgency of mitigation in each state, we categorized states based on which scenarios resulted in the IQR of Reff(t) being < 1 on July 15, 2020.

Software and code:

Posterior distributions were sampled using Markov chain Monte Carlo simulation performed using MCSim version 6.1.0 using Metropolis within Gibbs sampling 27. For each US state, four chains of 200,000 iterations each were run, with the first 20% of runs discarded, and 500 posterior samples saved for analysis. For each parameter, comparison of interchain and intrachain variability was assessed to determine convergence, with the potential scale reduction factor R ≤ 1.2 considered converged 28. Additional analysis of model outputs was performed in RStudio version 1.2.1335 29 with R version 3.6.130. The codes used to generate our results will be available on Github prior to publication at https://github.com/wachiuphd/COVID-19-Bayesian-SEIR-US.

Data availability statement:

The following publicly available datasets are used:

Mobility data are shown in Supplemental Figure S2. Case and death data are shown in Figures 1 and 3, and Supplemental Figures S3-S6, S10. All data used are also available in the software and code repository.

Supplementary Material

Supplement
Supplement
FigS2validationdaily.pdf (448.7KB, pdf)
Supplement
FigS4MobilityFitsbyState.pdf (1,000.2KB, pdf)
Supplement
Supplement

Acknowledgements:

We thank F.Y. Bois, J.K. Cetina, M. Giannoni, I. Rusyn, and W. Więcek for useful input and advice on scenario development, model formulation, and MCMC simulation. We also thank Unacast for making their mobility data available for researchers, and The COVID Tracking Project for compiling case and mortality data and providing it to the public. Portions of this research were conducted with the advanced computing resources provided by Texas A&M High Performance Research Computing.

Funding: National Science Foundation (NSF RAPID DEB 2028632) and National Institutes of Health, National Institute of Environmental Health Sciences (P30 ES029067).

Footnotes

Ethical approval: Ethical approval was not required for this work.

Data sharing: No additional data available.

Competing interests: The authors declare no competing interests.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplement
Supplement
FigS2validationdaily.pdf (448.7KB, pdf)
Supplement
FigS4MobilityFitsbyState.pdf (1,000.2KB, pdf)
Supplement
Supplement

Data Availability Statement

The following publicly available datasets are used:

Mobility data are shown in Supplemental Figure S2. Case and death data are shown in Figures 1 and 3, and Supplemental Figures S3-S6, S10. All data used are also available in the software and code repository.


Articles from Research Square are provided here courtesy of American Journal Experts

RESOURCES