Assessing the potential impact of transmission during prolonged viral shedding on the effect of lockdown relaxation on COVID-19

Burcu Tepekule; Anthony Hauser; Viacheslav N Kachalov; Sara Andresen; Thomas Scheier; Peter W Schreiber; Huldrych F Günthard; Roger D Kouyos

doi:10.1371/journal.pcbi.1008609

PLoS Comput Biol. 2021 Jan 29;17(1):e1008609. doi: 10.1371/journal.pcbi.1008609


Editor: Alex Perkins⁵

PMCID: PMC7875355 PMID: 33513139

Abstract

A key parameter in epidemiological modeling which characterizes the spread of an infectious disease is the generation time, or more generally the distribution of infectiousness as a function of time since infection. There is increasing evidence supporting a prolonged viral shedding window for COVID-19, but the transmissibility in this phase is unclear. Based on this, we develop a generalized Susceptible-Exposed-Infected-Resistant (SEIR) model including an additional compartment of chronically infected individuals who can stay infectious for a longer duration than the reported generation time, but with infectivity reduced to varying degrees. Using the incidence and fatality data from different countries, we first show that such an assumption also yields a plausible model in explaining the data observed prior to the easing of the lockdown measures (relaxation). We then test the predictive power of this model for different durations and levels of prolonged infectiousness using the incidence data after the introduction of relaxation in Switzerland, and compare it with a model without the chronically infected population to represent the models conventionally used. We show that in case of a gradual easing on the lockdown measures, the predictions of the model including the chronically infected population vary considerably from those obtained under a model in which prolonged infectiousness is not taken into account. Although the existence of a chronically infected population still remains largely hypothetical, we believe that our results provide tentative evidence to consider a chronically infected population as an alternative modeling approach to better interpret the transmission dynamics of COVID-19.

Author summary

A key epidemiological variable characterizing the spread of an infectious disease is the generation time, defining the time between successive cases in a chain of transmission. Although there is increasing evidence supporting a prolonged viral shedding window for COVID-19, it is currently unclear to what extent prolonged transmission also occurs. Here we investigate the plausibility of a population of chronically infected individuals who can stay infectious for a longer duration than the reported generation time, but with infectivity reduced to varying degrees. By using the daily case and fatality data from various countries, we show that the existence of a chronically infected population is not a possibility that can be easily rejected from an epidemiological perspective. Moreover, in case of a gradual easing on the lockdown measures, the predictions of the model including the chronically infected population vary considerably from the predictions of the conventional epidemiological models. Although it is not possible to either prove or disprove the existence of a hypothetical population purely by modeling, our results provide tentative evidence to consider a chronically infected population as an alternative modeling approach in assessing the transmission dynamics of COVID-19.

Introduction

Mathematical models have been extensively used to understand the epidemic characteristics of outbreaks, to predict future outcomes, and to shape national responses regarding control measures [1, 2]. Despite the time pressure, a considerable amount of work has been dedicated to modeling the pandemic of novel coronavirus (SARS-CoV-2) infections that began in China in late 2019 [3–6]. Although most of these studies are based on existing epidemic models such as SIR and SEIR models, several features of the COVID-19 pandemic have been independently explored, leading to different generalizations of similar dynamical models. On the one hand, having a variety of models is central to getting a notion of model sensitivity; on the other hand, it shows that different assumptions can explain the observed data equally well given the right set of parameter choices, whereas they might lead to different projections of how the epidemic will unfold [7, 8]. This variability in future projections becomes especially important when a perturbation, such as the imposition or release of control measures, is introduced to the dynamical system.

A key epidemiologic variable that characterizes the spread of an infectious disease is the generation time [9], i.e., the time between successive cases in a chain of transmission. Li et al. [10] estimated the generation time distribution to have a mean of 7.5 (95% CI 5.5–19) days based on 6 observations. Ganyani et al. estimated a mean of 5.20 (95% CI 3.78–6.78) days for Singapore and 3.95 (95% CI 3.01–4.91) days for Tianjin [11], Bi et al. a mean of 6.3 (95% CI 5.2–7.6) days [12], He et al. a mean of 5.8 (95% CI 4.8–6.8) days [13], and Hiroshi et al. a mean of 4.7 (95% CI 3.70–6.00) days. Considering all these studies, infectiousness is estimated to decline quickly within 4 to 8 days on average.

Additionally, certain cases of transmission arouse concern about prolonged shedding of SARS-CoV-2 after recovery [14]. Moreover, several studies show proof of active virus replication in upper respiratory tract tissues and prolonged viral shedding even after seroconversion for COVID-19, implying that the contagious period of COVID-19 might last more than one week after clinical recovery in a fraction of patients [15, 16]. De Chang et al. reported patients to be virus positive for up to 8 days after the resolution of symptoms [17]. Similarly, Young et al. reported a median duration of 12 days for viral shedding [18], and Zhou et al. observed a median duration of 20 days [19]. Tan et al. reported a special case where viral shedding persisted for 49 days from illness onset [20]. Such examples indicate an uncertainty regarding the skewness of the generation time distribution. In addition to this uncertainty, several studies estimating the generation time suffer from short follow-up times, selection bias, and recall bias, and might therefore miss individual cases with prolonged shedding durations. Considering that the duration of infectiousness is a critical parameter in dynamical models used for predictive purposes, it is important to consider the epidemiological plausibility of a more heavy-tailed generation time distribution than those reported in the literature and to investigate its impact on model outcomes.

To do so, we first develop a generalized SEIR model by splitting the infectious compartment into a “primarily infected” and a “chronically infected” population. We assume that primarily infected individuals have a higher infectiousness within the time window conventionally considered as the generation time, during which they have the potential to develop symptoms and therefore be hospitalized. Afterwards, we assume that the non-hospitalized infected individuals transition to the chronically infected phase before recovery and become less infectious, but may stay infectious for a longer duration. By doing so, we include the possibility of a prolonged viral shedding window in our model. Individuals in the chronically infected phase are relevant both for diagnosis (a positive test result) and disease transmission, and we will explore the role of both aspects in explaining the observed data.

Using the incidence and fatality data from different regions of Italy and different states of the U.S., we first show that our model is also a plausible candidate for explaining the data observed prior to the easing of the lockdown measures (relaxation) for a variety of combinations of prolonged duration and level of infectiousness assumed for the chronically infected population. Based on this conclusion, we test the predictive power of different models using the daily confirmed cases data after the introduction of relaxation in Switzerland, including a model without the chronically infected population to represent the models conventionally used. Only Swiss data is used to test the predictive power of different models due to the public availability of different data types (estimates of the effective reproductive number, the number of daily confirmed cases, daily deaths, hospitalized and ICU patients) in high temporal resolution. Our results show that, in the case of a gradual easing of the lockdown measures, the predictions of the model including the chronically infected population vary considerably from those obtained under a model in which prolonged infectiousness is not taken into account. This variability is especially important when national policies on control measures are being formed, and also for healthcare systems if projections such as the occupancy of the hospital ward or the ICU are calculated using similar dynamical models.

Materials and methods

Mathematical model

To describe the dynamics of the COVID-19 pandemic, we generalize the susceptible-exposed-infected-removed (SEIR) compartmental model by including eight different states denoted by S(t), E(t), I_p(t), I_c(t), H(t), ICU(t), R(t), and X(t), representing the number of susceptible individuals, exposed (infected but not yet infectious) individuals, primarily infected individuals, chronically infected individuals, hospitalized patients, patients in ICUs, recovered (immune) individuals, and deceased individuals at time t, respectively. To model the prolonged viral shedding in case of COVID-19, we segregate the infectious compartment into two by introducing two different compartments, namely the primarily infected (I_p) and the chronically infected (I_c) individuals. After the incubation period is complete, exposed individuals become primarily infected where they stay infectious within the reported duration of the infectious period of COVID-19. Conventionally, these individuals are assumed to stop being infectious and therefore stop contributing to the disease spread when the generation time is complete. Our purpose by including another step before recovery, i.e., the chronically infected compartment, is to model a scenario such that the primarily infected individuals transition to a state where they are less infectious but they may stay infectious and be diagnosed for a longer duration than the generation time, i.e. continue spreading the infection with reduced transmissibility.

Transitions between different compartments are illustrated in Fig 1, which can be translated into a system of ordinary differential equations, where each arrow, i.e., each process, is associated with a rate. This system is given by the Eq set 1, including the rates of processes as model parameters, and describes the rate of change of compartments over time. Model parameters are given in Table 1 with their corresponding descriptions and prior distributions. An additional compartment C(t) is included in the Eq set 1 to calculate the cumulative number of the positively diagnosed cases in the community, and does not play any role in the disease dynamics.

\begin{aligned}
\frac{dS(t)}{dt} &= -\frac{S}{N}\left(β_{p} I_{p} + β_{c} I_{c}\right), \\
\frac{dE(t)}{dt} &= +\frac{S}{N}\left(β_{p} I_{p} + β_{c} I_{c}\right) - τ E, \\
\frac{dI_{p}(t)}{dt} &= +τ E - γ_{p} I_{p}, \\
\frac{dI_{c}(t)}{dt} &= +(1 - ϵ_{H}) γ_{p} I_{p} - γ_{c} I_{c}, \\
\frac{dH(t)}{dt} &= +ϵ_{H} γ_{p} I_{p} - γ_{H} H, \\
\frac{dICU(t)}{dt} &= +γ_{H} ϵ_{H2I} H - γ_{ICU} ICU, \\
\frac{dR(t)}{dt} &= +γ_{H} (1 - ϵ_{H2I}) H + γ_{ICU} (1 - ϵ_{x}) ICU + γ_{c} I_{c}, \\
\frac{dX(t)}{dt} &= +γ_{ICU} ϵ_{x} ICU, \\
\frac{dC(t)}{dt} &= +r_{d}^{p} γ_{p} I_{p} + \left(\frac{1 - r_{d}^{p}}{1 - ϵ_{H}}\right) r_{d}^{c} γ_{c} I_{c}.
\end{aligned}

(1)

Fig 1 — a) Notation of the compartments and their corresponding descriptions. b) Schematic of the dynamical model given by Eq set 1.
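For illustration, the right-hand side of Eq set 1 can be written as a short Python function and integrated numerically. This is a minimal sketch and not the authors' Stan implementation. The names `seir_rhs`, `rk4_step`, and `integrate`, the fixed-step Runge-Kutta scheme, and all numerical values are assumptions made for the example. Transmission rates are passed in as constants, so no lockdown effect is included here.

```python
def seir_rhs(y, p):
    """Right-hand side of Eq set 1; y = (S, E, Ip, Ic, H, ICU, R, X, C)."""
    S, E, Ip, Ic, H, ICU, R, X, C = y
    # New infections per day caused by both infectious compartments.
    force = S / p["N"] * (p["beta_p"] * Ip + p["beta_c"] * Ic)
    dS = -force
    dE = force - p["tau"] * E
    dIp = p["tau"] * E - p["gamma_p"] * Ip
    dIc = (1 - p["eps_H"]) * p["gamma_p"] * Ip - p["gamma_c"] * Ic
    dH = p["eps_H"] * p["gamma_p"] * Ip - p["gamma_H"] * H
    dICU = p["gamma_H"] * p["eps_H2I"] * H - p["gamma_ICU"] * ICU
    dR = (p["gamma_H"] * (1 - p["eps_H2I"]) * H
          + p["gamma_ICU"] * (1 - p["eps_x"]) * ICU
          + p["gamma_c"] * Ic)
    dX = p["gamma_ICU"] * p["eps_x"] * ICU
    # Cumulative diagnosed cases; C does not feed back into the dynamics.
    dC = (p["rd_p"] * p["gamma_p"] * Ip
          + (1 - p["rd_p"]) / (1 - p["eps_H"]) * p["rd_c"] * p["gamma_c"] * Ic)
    return [dS, dE, dIp, dIc, dH, dICU, dR, dX, dC]


def rk4_step(y, p, dt):
    """One classical fourth-order Runge-Kutta step (assumed integrator)."""
    k1 = seir_rhs(y, p)
    k2 = seir_rhs([yi + 0.5 * dt * ki for yi, ki in zip(y, k1)], p)
    k3 = seir_rhs([yi + 0.5 * dt * ki for yi, ki in zip(y, k2)], p)
    k4 = seir_rhs([yi + dt * ki for yi, ki in zip(y, k3)], p)
    return [yi + dt / 6 * (a + 2 * b + 2 * c + d)
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]


def integrate(y0, p, days, dt=0.1):
    """Integrate the system for `days` days with step `dt`."""
    y = list(y0)
    for _ in range(int(round(days / dt))):
        y = rk4_step(y, p, dt)
    return y


# Illustrative values only (hypothetical population, rates near prior means).
params = {
    "N": 1e6, "tau": 0.4, "gamma_p": 0.4, "gamma_c": 1 / 14,
    "beta_p": 1.0, "beta_c": 0.25 / 14,
    "gamma_H": 1 / 12, "gamma_ICU": 1 / 12,
    "eps_H": 0.08, "eps_H2I": 0.4, "eps_x": 0.4,
    "rd_p": 0.2, "rd_c": 0.075,
}
E0 = 1e-3 * params["N"]
y0 = [params["N"] - E0, E0, 0, 0, 0, 0, 0, 0, 0]
final = integrate(y0, params, days=30)
```

Because every outflow in Eq set 1 appears as an inflow elsewhere, the eight epidemiological compartments should sum to N at all times, which gives a simple sanity check on any implementation.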

Table 1. Model parameters given with their descriptions, constrained ranges, and prior distributions.

Notation	Description	Constrained range or definition	Prior distribution ^‡
$R_{0}^{p}$	Basic reproduction number of the primarily infected population	0 − ∞	$R_{0}^{p} \sim N (2.5, 0.5)$
r_c	Reduction in infectiousness due to being chronic	0%–100%	Fixed to a different value for each simulation.
$R_{0}^{c}$	Basic reproduction number of the chronically infected population	$R_{0}^{c} = R_{0}^{p} (1 - r_{c})$	Conditioned on $R_{0}^{p}$ and r_c.
r_L	Effect of lockdown in reducing infectiousness	0%–100%	r_L ∼ β(1, 1)
m_L	Slope of reduction in infectiousness due to lockdown	0.5–1.5	m_L ∼ 0.5 + β(1, 1)
s_L	Time lag of reduction in infectiousness due to lockdown	0 − ∞	s_L ∼ exp(1/5)
r_lock(t)	Time dependent effect of the lockdown on the transmission rate	Given by Eq 2	Conditioned on r_L, r_c, m_L, and s_L.
1/τ	Duration of the latent period	0 − ∞	τ ∼ exp(1/2.5)
1/γ_p	Duration of infection of I_p	0 − ∞	γ_p ∼ exp(1/2.5)
1/γ_c	Duration of infection of I_c	0.01-20 days	Fixed to a different value for each simulation.
β_p	Transmission rate of I_p	Given by Eq 3	Conditioned on r_lock(t), R₀, and γ_p.
β_c	Transmission rate of I_c	Given by Eq 4	Conditioned on r_lock(t), R₀, γ_c, and r_c.
1/γ_H	Duration of hospital ward stay	0 − ∞	γ_H ∼ exp(1/12)
1/γ_ICU	Duration of ICU stay	0 − ∞	γ_ICU ∼ exp(1/12)
ϵ_H	Rate of direct H admission	0 − ∞	$ϵ_{H} \sim N (0.08, 0.02)$
ϵ_H2I	Transfer rate from H to ICU	0 − ∞	$ϵ_{H 2 I} \sim N (0.4, 0.08)$
ϵ_x	Death rate from ICU	0 − ∞	$ϵ_{x} \sim N (0.4, 0.08)$
$r_{d}^{p}$	Diagnosis rate of I_p	0 − ∞	$r_{d}^{p} \sim N (0.2, 0.03)$
$r_{d}^{c}$	Diagnosis rate of I_c	0 − ∞	$r_{d}^{c} \sim N (0.075, 0.015)$
R₀	Total basic reproduction number	$R_{0} = R_{0}^{p} + (1 - ϵ_{H}) R_{0}^{c}$	Conditioned on $R_{0}^{p}$ , $R_{0}^{c}$ and ϵ_H.
E(0)	Initial frequency of the exposed compartment	0%–100%	$E(0) \sim β (1, 10^{3})$
S(0)	Initial frequency of the susceptible compartment	1 − E(0)^*	Conditioned on E(0)
N	Population size	−	Fixed specific to the country used for fitting.


* All other compartments (I_p, I_c, H, ICU, R, and X) are assumed to be zero at t = 0, and the first case is assumed to be observed at t = 1.

^‡ $N$, β, and exp denote the Normal, Beta, and Exponential distributions, respectively.

Time-dependent decrease in the transmission of SARS-CoV-2 due to lockdown measures is modeled by a sigmoid function [21], and denoted by r_lock(t), such that

\begin{matrix} r_{lock}(t) & = r_{L} + (1 - r_{L}) / [1 + \exp (m_{L} \times (t - t_{L} - s_{L}))], \end{matrix}

(2)

where r_L, t_L, m_L, and s_L denote the final effect of the lockdown, start date of the lockdown, slope of the decrease in transmissibility, and the time delay between implementation and effect of the lockdown, respectively. r_lock(t) is used as a multiplicative factor in modeling the transmission rate in a time-dependent manner.
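Eq 2 can be evaluated directly, as in the short sketch below. The function name `r_lock` mirrors the symbol in the text, but the argument order and the numerical values are assumptions chosen for illustration.

```python
import math


def r_lock(t, r_L, m_L, s_L, t_L):
    """Eq 2: multiplicative lockdown factor applied to the transmission rate."""
    return r_L + (1 - r_L) / (1 + math.exp(m_L * (t - t_L - s_L)))


# Hypothetical values: lockdown starts on day 20 and acts with a 5 day lag.
example = dict(r_L=0.3, m_L=1.0, s_L=5.0, t_L=20.0)
before = r_lock(0.0, **example)     # well before the lockdown takes effect
halfway = r_lock(25.0, **example)   # at t = t_L + s_L
after = r_lock(60.0, **example)     # long after the lockdown
```

Under Eq 2 as written, the factor is close to 1 well before the lockdown, equals (1 + r_L)/2 at t = t_L + s_L, and approaches r_L long afterwards, so r_L acts as the remaining fraction of transmission.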

The reduced transmissibility of I_c is modeled via including a reduction coefficient r_c as a multiplicative factor to its transmission rate, representing the reduction in the infectiousness level of the primarily infected population when they move to the chronically infected phase. Introduction of r_c results in two different transmission rates β_p and β_c for I_p and I_c compartments, such that,

\begin{matrix} β_{p} & = r_{lock}(t) \times R_{0}^{p} \times γ_{p}, \end{matrix}

(3)

\begin{matrix} β_{c} & = r_{lock}(t) \times R_{0}^{c} \times γ_{c}, \end{matrix}

(4)

\begin{matrix} & = r_{lock}(t) \times R_{0}^{p} (1 - r_{c}) \times γ_{c}, \end{matrix}

(5)

where $R_{0}^{p}$, $R_{0}^{c}$, 1/γ_p, and 1/γ_c denote the basic reproduction number of the primarily infected population, the basic reproduction number of the chronically infected population, the duration of the primarily infected phase, and the duration of the chronically infected phase, respectively. We assume that individuals who develop symptoms do so only during the primarily infected phase, and therefore hospitalization is only possible before they transition to the chronically infected phase. We do not assume any a priori information regarding the testing policy, therefore a positive diagnosis is possible for both primarily and chronically infected individuals, and they contribute to the cumulative number of the positively diagnosed cases with the rates $r_{d}^{p}$ and $r_{d}^{c}$, respectively.
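As a worked illustration of Eqs 3 to 5 and of the total basic reproduction number defined in Table 1, the sketch below computes both transmission rates for one hypothetical parameter combination. The function names and the chosen values (R_0^p = 2.5, r_c = 90%, 1/γ_p = 2.5 days, 1/γ_c = 14 days, ϵ_H = 0.08) are assumptions for the example; the lockdown factor is passed in as a plain number.

```python
def transmission_rates(R0_p, r_c, gamma_p, gamma_c, lock_factor=1.0):
    """Return (beta_p, beta_c) following Eqs 3-5."""
    R0_c = R0_p * (1 - r_c)                  # definition of R_0^c in Table 1
    beta_p = lock_factor * R0_p * gamma_p    # Eq 3
    beta_c = lock_factor * R0_c * gamma_c    # Eqs 4 and 5
    return beta_p, beta_c


def total_R0(R0_p, r_c, eps_H):
    """Total basic reproduction number: R_0 = R_0^p + (1 - eps_H) * R_0^c."""
    return R0_p + (1 - eps_H) * R0_p * (1 - r_c)


beta_p, beta_c = transmission_rates(R0_p=2.5, r_c=0.9,
                                    gamma_p=1 / 2.5, gamma_c=1 / 14)
R0 = total_R0(R0_p=2.5, r_c=0.9, eps_H=0.08)
```

With these values β_p = 1 per day, β_c = 0.25/14 ≈ 0.018 per day, and R_0 = 2.5 + 0.92 × 0.25 = 2.73. In this example, a chronic phase with 10% residual infectiousness therefore adds about 0.23 to the reproduction number.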

Model fitting and parameter estimation

Model selection via goodness of fit until relaxation

We implemented two stages of model fitting. The first stage aims to compare the goodness of fits of three different classes of models, which are,

1. The model without prolonged viral shedding (model without the chronically infected (I_c) compartment),
2. The model with prolonged viral shedding without prolonged infectiousness, where individuals in the I_c compartment are not infectious (model given by Eq set 1 for r_c = 100%, where r_c denotes the level of reduced infectiousness),
3. The model with prolonged viral shedding and prolonged infectiousness, where individuals in the I_c compartment are infectious with different levels of infectiousness (given by Eq set 1 for 0% ≤ r_c < 100%).

The second model with r_c = 100% represents the scenario where the primarily infected individuals do not have prolonged infectiousness, but they still can be diagnosed during the chronic phase, meaning that their test results can still be positive although they are not infectious. Note that all models with prolonged viral shedding at different levels of infectiousness (0% ≤ r_c ≤ 100%) including the model without prolonged infectiousness at all (r_c = 100%) assume that the infected individuals are tested and positively diagnosed with a certain rate during this prolonged viral shedding window. This is not a common assumption in other modeling studies regarding COVID-19. Therefore, the first model without the chronically infected population is included in the comparison to represent the models which are conventionally used.

We then fit each class of models simultaneously to the data on the number of daily confirmed cases and the number of daily deaths. Due to the high spatial variation in transmission dynamics in countries such as Italy and the U.S., we used regional data within these two countries, which have consistent spreading patterns. These regions include Lombardy, Piedmont, and Emilia-Romagna for Italy (data reported by the Civil Protection Department of the Ministry of Italy [22]), and the State of New York, the State of New Jersey, and the State of Louisiana for the U.S. (data reported by the COVID Tracking Project [23]). Model fitting is done in a Bayesian framework using Stan [24]. The deviations between the model output and the data are assumed to follow a Negative Binomial distribution. Dispersion parameters of the Negative Binomial distributions are estimated separately for the number of daily confirmed cases and the number of daily deaths during model fitting.
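The observation model described above can be sketched as a joint log-likelihood with one dispersion parameter per data type. The text does not state which Negative Binomial parameterization is used, so the mean and dispersion form below (variance μ + μ²/φ) is an assumption, as are the function names and the toy numbers.

```python
import math


def nb2_logpmf(y, mu, phi):
    """Log-pmf of a Negative Binomial with mean mu and dispersion phi."""
    return (math.lgamma(y + phi) - math.lgamma(phi) - math.lgamma(y + 1)
            + y * math.log(mu / (mu + phi))
            + phi * math.log(phi / (mu + phi)))


def joint_loglik(cases, deaths, mu_cases, mu_deaths, phi_cases, phi_deaths):
    """Sum of log-likelihoods for daily cases and daily deaths."""
    ll = sum(nb2_logpmf(y, m, phi_cases) for y, m in zip(cases, mu_cases))
    ll += sum(nb2_logpmf(y, m, phi_deaths) for y, m in zip(deaths, mu_deaths))
    return ll


# Toy data and model outputs (hypothetical numbers).
ll = joint_loglik(cases=[12, 30, 55], deaths=[0, 1, 3],
                  mu_cases=[10.0, 28.0, 60.0], mu_deaths=[0.5, 1.2, 2.5],
                  phi_cases=5.0, phi_deaths=2.0)
```

In a Bayesian fit, this quantity would be added to the log-prior of the parameters in Table 1; the sampler itself is not shown here.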

When fitting the models with prolonged viral shedding, we fixed the reduction in infectiousness parameter r_c to different values varying between 0% to 100%. Duration of infectiousness of the I_c compartment (1/γ_c) is also fixed to different values varying from 0.01 to 20 days for all simulations. Other parameters are allowed to vary within their respective ranges, given in Table 1.

During model fitting, we use all the data points until the introduction of the easing on the lockdown measures (relaxation). We then calculate the Root Mean Squared Error (RMSE) between the median of the model estimates and the data points that are used for fitting to evaluate the goodness of the fit for the daily confirmed cases and the daily deaths for each class of model, where lower values of RMSE indicate a better fit.
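A minimal sketch of this goodness-of-fit measure is given below: the pointwise median over simulated trajectories is compared with the data through the RMSE. The helper names and the three toy trajectories are assumptions made for the example.

```python
import math
import statistics


def median_trajectory(draws):
    """Pointwise median over simulated trajectories (one list per draw)."""
    return [statistics.median(column) for column in zip(*draws)]


def rmse(estimates, data):
    """Root mean squared error between model estimates and data points."""
    squared = [(e - d) ** 2 for e, d in zip(estimates, data)]
    return math.sqrt(sum(squared) / len(squared))


# Three hypothetical posterior draws of daily confirmed cases over three days.
draws = [[10, 20, 30], [12, 22, 28], [14, 18, 32]]
data = [11, 21, 33]
fit_error = rmse(median_trajectory(draws), data)
```

Here the median trajectory is [12, 20, 30], the residuals are 1, -1, and -3, and the RMSE is sqrt(11/3) ≈ 1.91 cases per day.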

RMSE values provide a good measure of fit by quantifying how much the median of the model estimate deviates from the data, and are useful for comparing the goodness of fit of two different models. On the other hand, they do not incorporate the variance of the model estimates that emerges from the probabilistic nature of the fitting procedure. To investigate how often one model performs better than another, we bootstrap estimates from both models within their 95% confidence intervals, and calculate the probability of one model having a greater error value than another model. Bootstrapping is performed by randomly subsampling simulated time series outputs with replacement for a given model. For each sample of a given model, we first normalize the RMSE values for the number of daily confirmed cases and the number of daily deaths by dividing them by the difference between the maximum and the minimum value of their respective data points. We then sum these normalized values to obtain a combined measure of the goodness of fit, which we refer to as the combined RMSE (CRMSE) value. We calculate the probability of one model having a greater CRMSE value than another model by comparing the CRMSE values of each bootstrapped sample for a given pair of models. This analysis is used to address two different questions. First, we investigate whether there is any advantage in including the chronically infected population in the model structure to achieve a better fit by calculating the probability of the model without the I_c compartment (model w/o I_c) having a greater CRMSE value than the model with the I_c compartment for all levels of reduced infectiousness (0% ≤ r_c ≤ 100%), denoted by P(CRMSE_{w/o I_c} > CRMSE_{r_c ≤ 100%}).
Second, we investigate whether there is a substantial difference between having prolonged infectiousness (r_c < 100%) versus being diagnosed without being infectious (r_c = 100%) during the prolonged viral shedding phase by calculating the probability of the model with r_c = 100% having a greater CRMSE value than the models with 0% ≤ r_c < 100%, denoted by P(CRMSE_{r_c = 100%} > CRMSE_{r_c < 100%}). Both quantities are calculated for each duration of infectiousness (1/γ_c value) separately.
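The comparison procedure can be sketched as follows. This is a simplified illustration under stated assumptions: each model is represented by a list of simulated (cases, deaths) trajectories, one trajectory is drawn with replacement from each model per bootstrap iteration, and the restriction to the 95% interval is omitted. The function names, the pairing of samples, and the toy data are hypothetical.

```python
import math
import random


def rmse(estimates, data):
    """Root mean squared error between model estimates and data points."""
    squared = [(e - d) ** 2 for e, d in zip(estimates, data)]
    return math.sqrt(sum(squared) / len(squared))


def crmse(sim_cases, sim_deaths, cases, deaths):
    """Sum of the range-normalized RMSE values for cases and deaths."""
    norm_cases = rmse(sim_cases, cases) / (max(cases) - min(cases))
    norm_deaths = rmse(sim_deaths, deaths) / (max(deaths) - min(deaths))
    return norm_cases + norm_deaths


def prob_greater(draws_a, draws_b, cases, deaths, n_boot=1000, seed=0):
    """Estimate P(CRMSE_a > CRMSE_b) by resampling trajectories."""
    rng = random.Random(seed)
    wins = 0
    for _ in range(n_boot):
        a = rng.choice(draws_a)   # sampling with replacement
        b = rng.choice(draws_b)
        if crmse(*a, cases, deaths) > crmse(*b, cases, deaths):
            wins += 1
    return wins / n_boot


# Toy data: model B reproduces the data exactly, model A is offset.
cases, deaths = [10, 20, 30], [1, 2, 3]
draws_a = [([15, 25, 35], [2, 3, 4])]
draws_b = [(cases, deaths)]
p_a_worse = prob_greater(draws_a, draws_b, cases, deaths)
```

In the toy example, model A has a CRMSE of 5/20 + 1/2 = 0.75 and model B has a CRMSE of 0, so the estimated probability that A has the greater error is 1.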

Due to the uncertainty of the quantitative effects of the easing on the lockdown measures (relaxation), data points after the relaxation are excluded from the goodness of fit calculations.

Model selection via predictive power after relaxation

The second stage of the model fitting aims to compare the predictive power of different models by incorporating the data after the introduction of relaxation. First, we use the data until relaxation provided by [25] for Switzerland, and fit the model simultaneously to four datasets: the number of daily confirmed cases, the number of daily deaths, the number of patients at the hospital ward at a given day, and the number of patients at the ICU at a given day. Using the parameters we obtained via fitting, we predict the number of daily confirmed cases in case of a gradual relaxation scenario for all models, using a range of r_c and γ_c values.

Relaxation is modeled as an increase in transmissibility, and characterized as a sigmoid function. It is similar to the time-dependent effect of the lockdown (r_lock(t)) given by Eq 2, such that

\begin{matrix} r_{relax}(t) & = r_{L} + 1 / [1 / (r_{end} - r_{L}) + \exp (- m_{R} \times (t - t_{R} - s_{R}))], \end{matrix}

(6)

where t_R, m_R, s_R, and r_end denote the start date of the relaxation (18th of May for Switzerland, the middle time point between the start of the first phase of relaxation on the 27th of April and the start of the third and final phase of relaxation on the 8th of June), the slope of the increase in transmissibility, the time delay until the effect of the relaxation takes place, and the final effect of the control measures still in place (wearing masks in public transit, practicing hand hygiene, etc.), respectively. r_relax(t) is used as a multiplicative factor in a similar fashion to r_lock(t).
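The behavior of Eq 6 can be checked numerically with the sketch below. The function name follows the symbol in the text, while the argument order and parameter values are assumptions chosen for illustration; the expression requires r_end > r_L.

```python
import math


def r_relax(t, r_L, r_end, m_R, s_R, t_R):
    """Eq 6: multiplicative relaxation factor (requires r_end > r_L)."""
    return r_L + 1 / (1 / (r_end - r_L) + math.exp(-m_R * (t - t_R - s_R)))


# Hypothetical values: relaxation starts on day 80 with a 5 day lag.
example = dict(r_L=0.3, r_end=0.7, m_R=0.2, s_R=5.0, t_R=80.0)
early = r_relax(-50.0, **example)   # long before the relaxation
late = r_relax(300.0, **example)    # long after the relaxation
curve = [r_relax(float(t), **example) for t in range(60, 141)]
```

Long before the relaxation the exponential term dominates and the factor stays near r_L. Long afterwards the exponential vanishes and the factor approaches r_L + (r_end - r_L) = r_end, so the curve rises monotonically between the lockdown level and the level set by the measures still in place.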

Since we aim to compare the predictive power of different models using the data after relaxation, the parametrization of r_relax(t) had to represent the quantitative impact of relaxation in Switzerland accurately, but also had to be independent of our model fitting procedure. Therefore, we parameterize r_relax(t) using the effective reproductive number (R_e(t)) estimates provided by the Swiss National COVID-19 Science Task Force [26], assuming that the normalized values of the effective reproductive number over time (R_e(t)/R_e(0)) provide a quantitative proxy for the change in behavior after the introduction of relaxation. We parameterize r_relax(t) such that the numerical values for m_R, s_R, and r_end minimize the RMSE between r_relax(t) and R_e(t)/R_e(0) for the time points (t values) after the introduction of relaxation. This parametrization is done separately for each model with different r_c and γ_c values, since r_relax(t) (Eq 6) includes r_L, whose estimate differs between models.
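One way to carry out such a parametrization is a search over candidate values of m_R, s_R, and r_end, keeping the combination with the smallest RMSE. The text does not specify the optimizer, so the grid search below is an assumed stand-in, and the target series is synthetic (generated from known parameters) rather than the Task Force estimates.

```python
import itertools
import math


def r_relax(t, r_L, r_end, m_R, s_R, t_R):
    """Eq 6: multiplicative relaxation factor (requires r_end > r_L)."""
    return r_L + 1 / (1 / (r_end - r_L) + math.exp(-m_R * (t - t_R - s_R)))


def rmse(estimates, data):
    """Root mean squared error between two equally long sequences."""
    squared = [(e - d) ** 2 for e, d in zip(estimates, data)]
    return math.sqrt(sum(squared) / len(squared))


def fit_relaxation(times, target, r_L, t_R, grid_m, grid_s, grid_end):
    """Grid search for (m_R, s_R, r_end) minimizing the RMSE to `target`."""
    def error(candidate):
        m_R, s_R, r_end = candidate
        curve = [r_relax(t, r_L, r_end, m_R, s_R, t_R) for t in times]
        return rmse(curve, target)
    return min(itertools.product(grid_m, grid_s, grid_end), key=error)


# Synthetic stand-in for the normalized R_e(t)/R_e(0) series after relaxation.
r_L, t_R = 0.3, 80.0
times = [float(t) for t in range(80, 141)]
target = [r_relax(t, r_L, 0.7, 0.2, 5.0, t_R) for t in times]

best = fit_relaxation(times, target, r_L, t_R,
                      grid_m=[0.05, 0.1, 0.2, 0.4],
                      grid_s=[0.0, 5.0, 10.0],
                      grid_end=[0.5, 0.6, 0.7, 0.8])
```

Because r_L enters Eq 6 and is estimated separately for each model, `fit_relaxation` would be called once per {r_c, γ_c} combination with that model's r_L estimate. A continuous optimizer could replace the grid without changing the structure.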

We quantify the predictive power of each model by calculating the RMSE values between the median of the model predictions and the daily confirmed cases data only for the time points after the introduction of relaxation. Similar to the first stage of model fitting, we calculate P(RMSE_{w/o I_c} > RMSE_{r_c ≤ 100%}) and P(RMSE_{r_c = 100%} > RMSE_{r_c < 100%}) values using the normalized RMSE results, but for the number of daily confirmed cases only. Because the uncertainty in the parameter estimates propagates to the future predictions, prediction results will have wider confidence intervals than the fitting results. Therefore, when calculating P(RMSE_{w/o I_c} > RMSE_{r_c ≤ 100%}) and P(RMSE_{r_c = 100%} > RMSE_{r_c < 100%}), we bootstrap prediction estimates within their 50% confidence intervals instead of 95% to identify the differences in model predictions in a more informative way (results with estimates bootstrapped within 95% confidence intervals are also provided in the Supporting Information). Model predictions for the other three data types (the number of daily deaths, the number of patients at the hospital ward on a given day, and the number of patients at the ICU on a given day) are excluded from the predictive power calculations, since the impact of relaxation manifests itself most directly in the number of daily confirmed cases, whereas the other datasets are influenced by many other factors such as treatment success, demography of the patients, hospital capacity, etc. Such factors are likely to change over time, and estimating the related model parameters properly would require re-fitting with the data points subsequent to the introduction of relaxation.

Only Swiss data is used to test the predictive power of different models because Switzerland is, to our knowledge, the only country where both the estimates of the effective reproductive number and the data on the hospitalized and the ICU patients are publicly available in high temporal resolution in addition to the number of daily confirmed cases and the number of daily deaths.

We implemented both stages of model fitting in a Bayesian framework using Stan [24]. Prior distributions of the parameters used during fitting are given in Table 1.

Results

Possibility of a chronically infected population

We find that the model that best describes the data depends on the combination of the level and duration of the prolonged infectiousness, and the optimal choice of the {γ_c, r_c} combination varies among different regions and different countries. For Lombardy, as the duration of infectiousness becomes longer, models including the chronically infected population outperform the model without the I_c compartment (model w/o I_c) more often (Fig 2e), whereas the models with prolonged infectiousness (r_c < 100%) perform similarly to the model where the individuals can be diagnosed during the prolonged viral shedding window without being infectious (r_c = 100%) (Fig 2f). When all r_c and γ_c values are considered, the RMSE values of the median of the model estimates differ by at most 30.7 daily confirmed cases (3.1% of the mean number of daily confirmed cases, S1 Table) and 6.17 daily deaths (3.4% of the mean number of daily deaths, S1 Table) (Fig 2c and 2d). The region of Emilia-Romagna behaves very similarly to Lombardy (S1 Fig). In the case of Piedmont and the state of New York, models with prolonged viral shedding (0% ≤ r_c ≤ 100%) outperform the model without the I_c compartment (model w/o I_c) more often, and models with lower levels of infectiousness (higher r_c values) clearly provide a better fit than the models with higher levels of infectiousness (S2 and S5 Figs). For the state of Louisiana, all models perform similarly (S3 Fig). For the state of New Jersey, RMSE values for the number of daily confirmed cases are sensitive to the particular combinations of γ_c and r_c values (S4 Fig). The model without the I_c compartment (model w/o I_c) provides a better fit for very short and very long durations of prolonged infectiousness, and similar fits to the model with r_c = 100% for medium durations of prolonged infectiousness (S4 Fig).
Maximum differences in the median RMSE values for the number of daily confirmed cases and the number of daily deaths for all combinations of r_c and γ_c values are provided in S1 Table, both in absolute values and relative to the mean of their corresponding data type. Parameter estimates with their corresponding means, standard deviations, and confidence intervals for all combinations of r_c and γ_c values are provided in S2 Table.

Fig 2 — Fitting and RMSE results for Lombardy, calculated using different levels and durations of infectiousness for the chronically infected population. Model outcomes (presented for 1/γ_c = 14 days) for the number of a) daily confirmed cases and b) daily deaths, using the data until the introduction of relaxation for model fitting. Darker shades of blue represent the fitting results with increased infectiousness of the chronically infected population, i.e., lower r_c values within the range 0% ≤ r_c < 100%. Fitting results for r_c = 100% are drawn in red, and the fitting results for the model without the I_c compartment (model w/o I_c) are drawn in pink. Data points that are used for fitting are drawn in black. Gray areas around the model outcomes represent the union of the 95% confidence intervals calculated for all models. RMSE values c) for the number of daily confirmed cases and d) the number of daily deaths for a given r_c and γ_c value used for fitting, where model w/o I_c represents the results for the model without the I_c compartment. e) Probability of the model without the I_c compartment (model w/o I_c) having a greater combined RMSE (CRMSE) value than the model with the I_c compartment for all levels of reduced infectiousness (r_c ≤ 100%) for different r_c and γ_c values. f) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater combined RMSE (CRMSE) value than the model with individuals with a prolonged infectiousness (r_c < 100%) for different r_c and γ_c values. Points in the gray areas represent the models that provide a better fit more frequently than e) the model without the I_c compartment (model w/o I_c) and f) the model with r_c = 100%.

Impact of relaxation

Data for Switzerland after the introduction of relaxation are used to test the predictive power of different models. As a demonstrative example, the effects of the lockdown and the relaxation on infectiousness (r_lock(t) and r_relax(t)) are shown in Fig 3c and 3d for 1/γ_c = 14 days.

Fig 3. a) Fitting results and relaxation predictions for Switzerland for the number of daily confirmed cases, calculated using different levels of infectiousness for the chronically infected population, assuming a duration of prolonged infectiousness of 1/γ_c = 14 days. Time dependent effects of the lockdown (r_lock(t)) and the relaxation (r_relax(t)) are illustrated in c) and d), respectively. Predictions drawn in darker shades of blue represent the fitting results with increased infectiousness of the chronically infected population, i.e., lower r_c values within the range 90% ≤ r_c < 100%. Fitting results for r_c = 100% are drawn in red, and the fitting results for the model without the I_c compartment (model w/o I_c) are drawn in pink. Data points that are used for fitting are drawn in black, and the data points used for comparing the predictive power of different models are drawn in green. e) RMSE values calculated using the prediction results for the number of daily confirmed cases for a given r_c and γ_c value, where model w/o I_c represents the results for the model without the I_c compartment. Models with the best predictive power (smallest RMSE value) are indicated by the bold black boxes. f) Probability of the model without the I_c compartment (model w/o I_c) having a greater RMSE value than the model with the I_c compartment for different levels of reduced infectiousness (90% ≤ r_c ≤ 100%) and γ_c values, calculated over the predicted data points. g) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater RMSE value than the model with individuals with prolonged infectiousness (r_c < 100%) for different r_c and γ_c values, calculated over the predicted data points. Points in the gray areas represent the models that are providing a better fit more frequently than f) the model without the I_c compartment (model w/o I_c) and g) the model with r_c = 100%.
b) Prediction results of the five models (r_c = {93%, 94%, 95%, 96%, 97%}) with the lowest RMSE values (best predictive power), the model with r_c = 100%, and the model without the I_c compartment (model w/o I_c) for +T days into the future from the last data point observed, where 1/γ_c = 14 days. Predictions for the model with the best predictive power (r_c = 95%), for the model with r_c = 100%, and the model without the I_c compartment (model w/o I_c) are highlighted in blue, red, and pink, respectively.

We observe that all models provide almost identical fits to the data prior to the introduction of relaxation (S6 Fig), but their predictions for the period after relaxation differ substantially, even for small differences in the infectiousness level (r_c value) of the chronically infected population (Fig 3a and 3b, demonstrative example for 1/γ_c = 14 days). The further the predicted point lies in the future, the more the predictions of the different models diverge. For 60 days after the last data point observed, the model with the best predictive power (model with r_c = 95%) predicts a median of 1439 daily confirmed cases, whereas the model with r_c = 100% predicts a median of 126, and the model without the I_c compartment (model w/o I_c) predicts a median of 190 daily confirmed cases (Fig 3b, T = 60), a discrepancy of roughly one order of magnitude. Similar to the results provided for the first stage of fitting, we find that it is more advantageous to use a model including the chronically infected population with low levels of infectiousness (Fig 3f and 3g for estimates bootstrapped within 50%, and S7 Fig for estimates bootstrapped within 95% confidence intervals). For a wide range of γ_c values, the model providing the lowest RMSE value between the predictions and the data is always a model with prolonged infectiousness (Fig 3e). Note, however, that these results hold under our particular choice of parametrization of the relaxation dynamics and the resulting relative change in transmissibility during the relaxation period.

The fact that observed data can be explained equally well by various combinations of r_c and γ_c values is partially due to the flexibility of the fitting procedure, which allows other parameters to be adjusted for a given {r_c, γ_c} pair. Most parameters are free to vary, but their prior distributions are informed such that the hyperparameters (parameters of the prior distributions) align with the reported values in the literature (Table 1). As an example, both the incubation period (1/τ) and the duration of infectiousness of the primarily infected population (1/γ_p) have a prior mean of 2.5 days, resulting in a generation time distribution with a mean of 5 days, in agreement with the reported values in the literature for COVID-19 (see Introduction). Similarly, the basic reproduction number of the primarily infected population $R_{0}^{p}$ is normally distributed with a mean of 2.5, which is the average value reported for the basic reproduction number of COVID-19 in many countries [10, 27]. Mean values of the prior distributions of the parameters related to hospitalization (γ_H, γ_ICU, ϵ_H, ϵ_H2I, and ϵ_x) are adopted from Ferguson et al. [28] and Verity et al. [29], and given a variance such that they can be adjusted specifically for each country during the fitting procedure.
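As a rough check of the arithmetic behind the 5-day mean generation time, the following Python sketch assumes that the generation time can be approximated by the sum of an exponentially distributed incubation stage and an exponentially distributed infectious stage, each with a mean of 2.5 days. The constant names, the function name, and the sample size are illustrative and are not taken from the study's code.

```python
import random
import statistics

MEAN_INCUBATION_DAYS = 2.5  # 1/tau, prior mean quoted in the text
MEAN_INFECTIOUS_DAYS = 2.5  # 1/gamma_p, prior mean quoted in the text


def sample_generation_times(n, seed=0):
    """Draw n generation times as the sum of two exponential stages (assumption)."""
    rng = random.Random(seed)
    return [
        rng.expovariate(1.0 / MEAN_INCUBATION_DAYS)
        + rng.expovariate(1.0 / MEAN_INFECTIOUS_DAYS)
        for _ in range(n)
    ]


# The mean of a sum is the sum of the means, independent of the distributions.
analytic_mean = MEAN_INCUBATION_DAYS + MEAN_INFECTIOUS_DAYS

samples = sample_generation_times(100_000)
print("analytic mean (days):", analytic_mean)
print("simulated mean (days):", statistics.fmean(samples))
```

The simulated mean should lie close to the analytic value; only the mean is checked here, and the shape of the distribution depends on the assumption stated above.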

The medians of the posterior distributions for $R_{0}^{p}$, $R_{0}$, and r_L provide a good example to demonstrate the flexibility of the fitting procedure (Fig 4). As expected, for a given duration of infectiousness (1/γ_c), the basic reproduction number of the primarily infected population ($R_{0}^{p}$) (Fig 4a), the total basic reproduction number ($R_{0} = R_{0}^{p} + (1 - \epsilon_{H}) R_{0}^{c}$) (Fig 4b), and the final reduction in infectiousness due to lockdown (1 − r_L) (Fig 4c) are estimated to be lower as the infectiousness of the chronically infected population decreases (as r_c increases), in order to explain the observed data.

Parameter estimates for the Swiss data with their corresponding means, standard deviations, and confidence intervals for all combinations of r_c and γ_c values are provided in S2 Table.

Discussion

The model presented in this work explores the plausibility of an epidemiological model with a prolonged viral shedding window for the COVID-19 pandemic, and investigates both its impact on and its predictive capabilities for the outcomes of a gradual easing of the lockdown measures (relaxation), given different assumptions on the infectiousness level and duration of a chronically infected population.

Our results show that including a chronically infected population, i.e., individuals that are less infectious but infectious for a longer duration, is not a possibility that can be easily rejected from an epidemiological perspective. This conclusion is based on two main results. First, neither the presence nor the absence of chronic transmission is identifiable from population-level data. The data that has been observed until relaxation can be explained equally well by the model with prolonged viral shedding for a variety of different levels and durations of prolonged infectiousness as by the model without prolonged viral shedding. Although this is partially due to the flexibility of the fitting procedure, the choice of hyperparameters (parameters of the prior distributions) indicates that all fits for a given infectiousness value are possible for a set of reasonable model parameters, and therefore as favorable as the conventional models from a modeling perspective.

Second, even if the presence of a chronically infected population cannot be proven, its introduction to the model structure has a considerable impact on the relaxation outcomes. In case of a gradual easing of the lockdown measures, the predictions of the model including the chronically infected population vary considerably from those obtained under a model in which prolonged infectiousness is not taken into account. Although the level of infectiousness might be low, its impact during the prolonged viral shedding window is significant in terms of predicting the outcomes of a gradual relaxation, indicating that even small differences in prolonged infectiousness levels might change the course of an epidemic when they are present for a certain duration. This is especially important for healthcare systems if projections such as the occupancy of the hospital ward or the ICU are calculated using similar dynamical models.

The fact that observed data can also be explained with a model including prolonged viral shedding raises certain questions about the interpretation of the epidemic curve, acquired immunity, and the current testing policies. Assuming a relatively short generation time for a model that does not consider a prolonged viral shedding window results in more optimistic projections about epidemic control, as clearly demonstrated in Fig 3. Based on this, countries that were very successful in their initial control measures and therefore experienced a very steep decline in the number of daily confirmed cases might choose to ease the control measures too soon. We still lack a full understanding of the viral shedding window of COVID-19, and therefore might have a biased opinion on the number of infectious individuals in the community. This once again emphasizes the infectiousness of COVID-19 and the significance of frequent testing even when the number of cases is in decline.

Using simplified compartmental models such as the one in this study has certain limitations. First, the model does not consider the stochastic effects that the system is subject to, which become more important as the number of infected individuals in the community decreases. Second, it assumes a well-mixed population, and does not consider the contact structure and the demographic information, both of which are relevant to the spread of the disease. Nevertheless, we believe that these two drawbacks of our modeling approach influence the models with and without prolonged viral shedding to a similar degree; if anything, they penalize the models with prolonged viral shedding for producing more pessimistic projections, since the number of infected individuals will be higher relative to the model without the chronically infected compartment. Additionally, the standard SEIR model assumes constant rates of transition between the exposed, infectious, and recovered classes, so that the waiting times individuals spend in these states are exponentially distributed [30], resulting in an exponential distribution on the generation time as well. Although mathematically convenient, this assumption has been shown to be epidemiologically unrealistic, and less dispersed distributions such as the gamma distribution should be used instead [31, 32].
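The difference in dispersion between exponential and gamma waiting times can be illustrated with a short Python sketch. It draws waiting times with the same mean from an exponential distribution (a gamma distribution with shape 1) and from a gamma distribution with shape 4, and compares their coefficients of variation. The shape value of 4, the mean of 2.5 days, and the function names are arbitrary choices for illustration and are not parameters of the model in this study.

```python
import random
import statistics


def waiting_times(mean_days, shape, n, seed=0):
    """Draw n gamma-distributed waiting times with the given mean and shape."""
    rng = random.Random(seed)
    scale = mean_days / shape  # gamma mean = shape * scale
    return [rng.gammavariate(shape, scale) for _ in range(n)]


def coefficient_of_variation(values):
    """Standard deviation divided by the mean."""
    return statistics.pstdev(values) / statistics.fmean(values)


# Shape 1 is the exponential case implied by constant transition rates.
exponential_cv = coefficient_of_variation(waiting_times(2.5, 1, 50_000))
# Shape 4 is an illustrative, less dispersed alternative with the same mean.
gamma_cv = coefficient_of_variation(waiting_times(2.5, 4, 50_000))

print("CV, exponential waiting time:", exponential_cv)
print("CV, gamma (shape 4) waiting time:", gamma_cv)
```

For a gamma distribution the coefficient of variation equals one over the square root of the shape, so the two printed values should be close to 1 and 0.5. In a compartmental model, a gamma waiting time with integer shape k can be obtained by splitting a compartment into k sequential sub-compartments, each left at k times the original rate.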

Compartmental model structures are based on the underlying epidemiological and demographic interactions of a particular disease. Given that there are many choices for these interactions, the number of possible combinations is enormous [33, 34]. Our choice of including a chronically infected compartment in the model structure was inspired by the evidence indicating an uncertainty regarding the generation time distributions, but this approach is only one way to extend the basic SEIR model for the COVID-19 pandemic. There are still several open questions regarding the transmission dynamics of COVID-19, meaning that there are many other modifications a modeler could consider depending on the research question at hand. These alternatives are also potential candidates that could describe the data equally well and offer reasonable predictions.

One methodological limitation of pure model fitting is the parameter identification problem, especially in the early stages of an epidemic [35]. As clearly demonstrated in Fig 4, models with different assumptions on the duration and the level of prolonged infectiousness lead to equally good descriptions of the observed data by adjusting the parameter values accordingly. Therefore, even if a prolonged viral shedding window exists for COVID-19, it would not be possible to quantify the precise level or the duration of prolonged infectiousness by model fitting alone. Ultimately, these quantities should be measured or estimated from the relevant type of data.

Another potential limitation is the dependency of the goodness of fit of a given model on the quantification of the impact of relaxation, which inevitably affects the model selection procedure. Although we do not perform any fitting on the data belonging to the relaxation phase, we indirectly inform our predictions by shaping the change in transmissibility (β) via using the normalized values of the R_e estimates provided by the Swiss National COVID-19 Science Task Force, which in turn are calculated using the data on the number of daily cases. Different assumptions on the change of transmissibility during relaxation might alter the infectiousness level that is optimal in predicting the relaxation outcomes. With that being said, our results suggest that the model without the chronically infected compartment heavily underpredicts the case numbers, as clearly seen in Fig 3. Although it is still debated whether the patients who recover from COVID-19 and test positive for the virus after their recovery are still infectious or not, it is clear that these positive test results contribute to the data on the number of daily confirmed cases. However, current modeling studies regarding COVID-19 neglect this fact and assume that the probability of detecting an infection decreases strongly after the mean generation time. Our results show that this assumption might lead to an underestimation of both the reproduction number and the effect of the lockdown (Fig 4), leading to a potential underprediction for the relaxation outcomes.

In conclusion, it is not possible to either prove or disprove the existence of a compartment of chronically infected individuals purely by modeling based on epidemiological data. Our results only provide tentative evidence to consider a chronically infected population as an alternative modeling approach in addressing the knowledge gap on the transmission dynamics of COVID-19. Such a hypothesis must be tested by incorporating data regarding the timing of transmission events, contact histories, and corresponding test results. Furthermore, more clinical and virological diagnostic studies are necessary to establish the biological links between viral load, active viral replication at different sites of the body, severity of symptoms, and a positive test result to infer the infectiousness of an individual over time. Our inclusion of a chronically infected population in the model was motivated by the evidence reported for prolonged viral shedding in the literature [14–20], and we attempted to test whether this is also a plausible descriptive and predictive modeling approach. Given that different assumptions on the infectiousness duration and level during a prolonged viral shedding window can result in similar descriptions of the observed data prior to the introduction of relaxation, and large differences of epidemic projections after relaxation, it is important to consider a chronically infected population from a modeling perspective when national policies are being imposed.

Supporting information

S1 Fig. Fitting and Root Mean Squared Error (RMSE) results for Emilia-Romagna.

Fitting and RMSE results for Emilia-Romagna, calculated using different levels and durations of infectiousness for the chronically infected population. Model outcomes (presented only for 1/γ_c = 14 days) for the number of a) daily confirmed cases and b) daily deaths using the data until the introduction of relaxation for model fitting, respectively. Darker shades of blue represent the fitting results with increased infectiousness of the chronically infected population, i.e., lower r_c values within the range 0 ≤ r_c < 100%. Fitting results for r_c = 100% are drawn in red, and the fitting results for the model without the I_c compartment (model w/o I_c) are drawn in pink. Data points that are used for fitting are drawn in black. Gray areas around the model outcomes represent the union of the 95% confidence intervals calculated for all models. RMSE values c) for the number of daily confirmed cases and d) the number of daily deaths for a given r_c and γ_c value used for fitting, where model w/o I_c represents the results for the model without the I_c compartment. e) Probability of the model without the I_c compartment (model w/o I_c) having a greater combined RMSE (CRMSE) value than the model with the I_c compartment for all levels of reduced infectiousness (r_c ≤ 100%) for different r_c and γ_c values. f) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater combined RMSE (CRMSE) value than the model with individuals with prolonged infectiousness (r_c < 100%) for different r_c and γ_c values. Points in the gray areas represent the models that are providing a better fit more frequently than e) the model without the I_c compartment (model w/o I_c) and f) the model with r_c = 100%.

(TIFF)


S2 Fig. Fitting and Root Mean Squared Error (RMSE) results for Piedmont.

Fitting and RMSE results for Piedmont, calculated using different levels and durations of infectiousness for the chronically infected population. Model outcomes (presented only for 1/γ_c = 14 days) for the number of a) daily confirmed cases and b) daily deaths using the data until the introduction of relaxation for model fitting, respectively. Darker shades of blue represent the fitting results with increased infectiousness of the chronically infected population, i.e., lower r_c values within the range 0 ≤ r_c < 100%. Fitting results for r_c = 100% are drawn in red, and the fitting results for the model without the I_c compartment (model w/o I_c) are drawn in pink. Data points that are used for fitting are drawn in black. Gray areas around the model outcomes represent the union of the 95% confidence intervals calculated for all models. RMSE values c) for the number of daily confirmed cases and d) the number of daily deaths for a given r_c and γ_c value used for fitting, where model w/o I_c represents the results for the model without the I_c compartment. e) Probability of the model without the I_c compartment (model w/o I_c) having a greater combined RMSE (CRMSE) value than the model with the I_c compartment for all levels of reduced infectiousness (r_c ≤ 100%) for different r_c and γ_c values. f) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater combined RMSE (CRMSE) value than the model with individuals with prolonged infectiousness (r_c < 100%) for different r_c and γ_c values. Points in the gray areas represent the models that are providing a better fit more frequently than e) the model without the I_c compartment (model w/o I_c) and f) the model with r_c = 100%.

(TIFF)


S3 Fig. Fitting and Root Mean Squared Error (RMSE) results for the State of Louisiana.

Fitting and RMSE results for the State of Louisiana, calculated using different levels and durations of infectiousness for the chronically infected population. Model outcomes (presented only for 1/γ_c = 14 days) for the number of a) daily confirmed cases and b) daily deaths using the data until the introduction of relaxation for model fitting, respectively. Darker shades of blue represent the fitting results with increased infectiousness of the chronically infected population, i.e., lower r_c values within the range 0 ≤ r_c < 100%. Fitting results for r_c = 100% are drawn in red, and the fitting results for the model without the I_c compartment (model w/o I_c) are drawn in pink. Data points that are used for fitting are drawn in black. Gray areas around the model outcomes represent the union of the 95% confidence intervals calculated for all models. RMSE values c) for the number of daily confirmed cases and d) the number of daily deaths for a given r_c and γ_c value used for fitting, where model w/o I_c represents the results for the model without the I_c compartment. e) Probability of the model without the I_c compartment (model w/o I_c) having a greater combined RMSE (CRMSE) value than the model with the I_c compartment for all levels of reduced infectiousness (r_c ≤ 100%) for different r_c and γ_c values. f) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater combined RMSE (CRMSE) value than the model with individuals with prolonged infectiousness (r_c < 100%) for different r_c and γ_c values. Points in the gray areas represent the models that are providing a better fit more frequently than e) the model without the I_c compartment (model w/o I_c) and f) the model with r_c = 100%.

(TIFF)


S4 Fig. Fitting and Root Mean Squared Error (RMSE) results for the State of New Jersey.

Fitting and RMSE results for the State of New Jersey, calculated using different levels and durations of infectiousness for the chronically infected population. Model outcomes (presented only for 1/γ_c = 14 days) for the number of a) daily confirmed cases and b) daily deaths using the data until the introduction of relaxation for model fitting, respectively. Darker shades of blue represent the fitting results with increased infectiousness of the chronically infected population, i.e., lower r_c values within the range 0 ≤ r_c < 100%. Fitting results for r_c = 100% are drawn in red, and the fitting results for the model without the I_c compartment (model w/o I_c) are drawn in pink. Data points that are used for fitting are drawn in black. Gray areas around the model outcomes represent the union of the 95% confidence intervals calculated for all models. RMSE values c) for the number of daily confirmed cases and d) the number of daily deaths for a given r_c and γ_c value used for fitting, where model w/o I_c represents the results for the model without the I_c compartment. e) Probability of the model without the I_c compartment (model w/o I_c) having a greater combined RMSE (CRMSE) value than the model with the I_c compartment for all levels of reduced infectiousness (r_c ≤ 100%) for different r_c and γ_c values. f) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater combined RMSE (CRMSE) value than the model with individuals with prolonged infectiousness (r_c < 100%) for different r_c and γ_c values. Points in the gray areas represent the models that are providing a better fit more frequently than e) the model without the I_c compartment (model w/o I_c) and f) the model with r_c = 100%.

(TIFF)


S5 Fig. Fitting and Root Mean Squared Error (RMSE) results for the State of New York.

Fitting and RMSE results for the State of New York, calculated using different levels and durations of infectiousness for the chronically infected population. Model outcomes (presented only for 1/γ_c = 14 days) for the number of a) daily confirmed cases and b) daily deaths using the data until the introduction of relaxation for model fitting, respectively. Darker shades of blue represent the fitting results with increased infectiousness of the chronically infected population, i.e., lower r_c values within the range 0 ≤ r_c < 100%. Fitting results for r_c = 100% are drawn in red, and the fitting results for the model without the I_c compartment (model w/o I_c) are drawn in pink. Data points that are used for fitting are drawn in black. Gray areas around the model outcomes represent the union of the 95% confidence intervals calculated for all models. RMSE values c) for the number of daily confirmed cases and d) the number of daily deaths for a given r_c and γ_c value used for fitting, where model w/o I_c represents the results for the model without the I_c compartment. e) Probability of the model without the I_c compartment (model w/o I_c) having a greater combined RMSE (CRMSE) value than the model with the I_c compartment for all levels of reduced infectiousness (r_c ≤ 100%) for different r_c and γ_c values. f) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater combined RMSE (CRMSE) value than the model with individuals with prolonged infectiousness (r_c < 100%) for different r_c and γ_c values. Points in the gray areas represent the models that are providing a better fit more frequently than e) the model without the I_c compartment (model w/o I_c) and f) the model with r_c = 100%.

(TIFF)


S6 Fig. Root Mean Squared Error (RMSE) results of model fitting prior to the introduction of relaxation for Switzerland.

RMSE results of model fitting prior to the introduction of relaxation for Switzerland, calculated using different levels and durations of infectiousness for the chronically infected population.

(TIFF)


S7 Fig. Comparison of RMSE values for Switzerland for 95% confidence interval Bootstrapping.

a) Probability of the model without the I_c compartment (model w/o I_c) having a greater RMSE value than the model with the I_c compartment for different levels of reduced infectiousness (90% ≤ r_c ≤ 100%) and γ_c values, calculated over the predicted data points. b) Probability of the model where individuals are being diagnosed without being infectious (r_c = 100%) having a greater RMSE value than the model with individuals with prolonged infectiousness (r_c < 100%) for different r_c and γ_c values, calculated over the predicted data points. Points in the gray areas represent the models that are providing a better fit more frequently than a) the model without the I_c compartment (model w/o I_c) and b) the model with r_c = 100%.

(TIFF)


S1 Table. Maximum differences in the median RMSE values.

Maximum differences in the median RMSE values for the number of daily confirmed cases and the number of daily deaths for all combinations of r_c and γ_c values for Lombardy, Emilia-Romagna, Piedmont, State of Louisiana, State of New Jersey, and the State of New York. RMSE values are provided both in absolute values and relative to the mean of their corresponding data type.

(XLSX)


S2 Table. Parameter estimates with their corresponding means, standard deviations, and confidence intervals.

Parameter estimates with their corresponding means, standard deviations, and confidence intervals for all combinations of r_c and γ_c values for Lombardy, Emilia-Romagna, Piedmont, State of Louisiana, State of New Jersey, the State of New York, and Switzerland.

(XLSX)


Acknowledgments

We gratefully acknowledge Dr. Julien Riou for his valuable comments and discussions.

Data Availability

All relevant data are within the manuscript, its Supporting Information files and at https://github.com/burcutepekule/covid_prolonged_shedding.

Funding Statement

HFG has received unrestricted research grants from Gilead Sciences and ViiV (paid to institution); personal fees for data and safety monitoring board or consulting/advisory board membership from Merck, Gilead Sciences, ViiV, Sandoz and Mepha. The institution of HFG received unrestricted educational grants from ViiV, Gilead, MSD, abbvie, Sandoz and Pfizer paid to the institution. RDK and BT were supported by the Swiss National Science Foundation (grants no. BSSGI0_155851). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Lutz CS, Huynh MP, Schroeder M, Anyatonwu S, Dahlgren FS, Danyluk G, et al. Applying infectious disease forecasting to public health: a path forward using influenza forecasting examples. BMC Public Health. 2019;19(1):1659. doi:10.1186/s12889-019-7966-8
2. Basu S, Andrews J. Complexity in mathematical models of public health policies: a guide for consumers of models. PLoS medicine. 2013;10(10). doi:10.1371/journal.pmed.1001540
3. Kucharski AJ, Russell TW, Diamond C, Liu Y, Edmunds J, Funk S, et al. Early dynamics of transmission and control of COVID-19: a mathematical modelling study. The lancet infectious diseases. 2020. doi:10.1016/S1473-3099(20)30144-4
4. Zhao S, Chen H. Modeling the epidemic dynamics and control of COVID-19 outbreak in China. Quantitative Biology. 2020; p. 1–9. doi:10.1007/s40484-020-0199-0
5. Peng L, Yang W, Zhang D, Zhuge C, Hong L. Epidemic analysis of COVID-19 in China by dynamical modeling. arXiv preprint arXiv:200206563. 2020.
6. Mangoni L, Pistilli M. Epidemic analysis of Covid-19 in Italy by dynamical modelling. Available at SSRN 3567770. 2020.
7. Yang W, Zhang D, Peng L, Zhuge C, Hong L. Rational evaluation of various epidemic models based on the COVID-19 data of China. arXiv preprint arXiv:200305666. 2020.
8. Wynants L, Van Calster B, Bonten MM, Collins GS, Debray TP, De Vos M, et al. Prediction models for diagnosis and prognosis of covid-19 infection: systematic review and critical appraisal. bmj. 2020;369. doi:10.1136/bmj.m1328
9. Vink MA, Bootsma MCJ, Wallinga J. Serial intervals of respiratory infectious diseases: a systematic review and analysis. American journal of epidemiology. 2014;180(9):865–875. doi:10.1093/aje/kwu209
10. Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, et al. Early transmission dynamics in Wuhan, China, of novel coronavirus–infected pneumonia. New England Journal of Medicine. 2020. doi:10.1056/NEJMoa2001316
11. Ganyani T, Kremer C, Chen D, Torneri A, Faes C, Wallinga J, et al. Estimating the generation interval for coronavirus disease (COVID-19) based on symptom onset data, March 2020. Eurosurveillance. 2020;25(17):2000257. doi:10.2807/1560-7917.ES.2020.25.17.2000257
12. Bi Q, Wu Y, Mei S, Ye C, Zou X, Zhang Z, et al. Epidemiology and Transmission of COVID-19 in Shenzhen China: Analysis of 391 cases and 1,286 of their close contacts. MedRxiv. 2020.
13. He X, Lau EH, Wu P, Deng X, Wang J, Hao X, et al. Temporal dynamics in viral shedding and transmissibility of COVID-19. Nature medicine. 2020; p. 1–4.
14. Rothe C, Schunk M, Sothmann P, Bretzel G, Froeschl G, Wallrauch C, et al. Transmission of 2019-nCoV infection from an asymptomatic contact in Germany. New England Journal of Medicine. 2020;382(10):970–971. doi:10.1056/NEJMc2001468
15. Liu WD, Chang SY, Wang JT, Tsai MJ, Hung CC, Hsu CL, et al. Prolonged virus shedding even after seroconversion in a patient with COVID-19. Journal of Infection. 2020.
16. Wölfel R, Corman VM, Guggemos W, Seilmaier M, Zange S, Müller MA, et al. Virological assessment of hospitalized patients with COVID-2019. Nature. 2020; p. 1–5.
17. Chang D, Mo G, Yuan X, Tao Y, Peng X, Wang F, et al. Time kinetics of viral clearance and resolution of symptoms in Novel coronavirus infection. American journal of respiratory and critical care medicine. 2020;(ja). doi:10.1164/rccm.202003-0524LE
18. Young BE, Ong SWX, Kalimuddin S, Low JG, Tan SY, Loh J, et al. Epidemiologic features and clinical course of patients infected with SARS-CoV-2 in Singapore. Jama. 2020;323(15):1488–1494. doi:10.1001/jama.2020.3204
19. Zhou F, Yu T, Du R, Fan G, Liu Y, Liu Z, et al. Clinical course and risk factors for mortality of adult inpatients with COVID-19 in Wuhan, China: a retrospective cohort study. The lancet. 2020. doi:10.1016/S0140-6736(20)30566-3
20. Tan L, Kang X, Zhang B, Zheng S, Liu B, Yu T, et al. A special case of COVID-19 with long duration of viral shedding for 49 days. medRxiv. 2020.
21. Hauser A, Counotte MJ, Margossian CC, Konstantinoudis G, Low N, Althaus CL, et al. Estimation of SARS-CoV-2 mortality during the early stages of an epidemic: A modeling study in Hubei, China, and six regions in Europe. PLOS Medicine. 2020;17(7):1–17. doi:10.1371/journal.pmed.1003189
22. Presidenza del Consiglio dei Ministri - Dipartimento della Protezione Civile. GitHub repository on COVID-19. Available from: https://github.com/pcm-dpc.
23. The Atlantic Monthly Group. The COVID Tracking Project. Available from: https://github.com/COVID19Tracking.
24. Carpenter B, Gelman A, Hoffman MD, Lee D, Goodrich B, Betancourt M, et al. Stan: A probabilistic programming language. Journal of statistical software. 2017;76(1). doi:10.18637/jss.v076.i01
25. Probst D. COVID-19 Information for Switzerland. Available from: https://www.corona-data.ch/.
26. Swiss National COVID-19 Science Task Force. Effective reproductive number estimates for COVID-19. Available from: https://ncs-tf.ch/en/situation-report/.
27.Centre for the Mathematical Modelling of Infectious Diseases, London School of Hygiene and Tropical Medicine. Time-varying estimate of the effective reproduction number. Available from: https://epiforecasts.io/covid/posts/global/.
28. Ferguson N, Laydon D, Nedjati Gilani G, Imai N, Ainslie K, Baguelin M, et al. Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand. 2020. [Google Scholar]
29. Verity R, Okell LC, Dorigatti I, Winskill P, Whittaker C, Imai N, et al. Estimates of the severity of COVID-19 disease. MedRxiv. 2020. [Google Scholar]
30. Conlan AJ, Rohani P, Lloyd AL, Keeling M, Grenfell BT. Resolving the impact of waiting time distributions on the persistence of measles. Journal of the Royal Society Interface. 2010;7(45):623–640. 10.1098/rsif.2009.0284 [DOI] [PMC free article] [PubMed] [Google Scholar]
31. Lloyd AL. Realistic distributions of infectious periods in epidemic models: changing patterns of persistence and dynamics. Theoretical population biology. 2001;60(1):59–71. 10.1006/tpbi.2001.1525 [DOI] [PubMed] [Google Scholar]
32. Feng Z, Xu D, Zhao H. Epidemiological models with non-exponentially distributed disease stages and applications to disease control. Bulletin of mathematical biology. 2007;69(5):1511–1536. 10.1007/s11538-006-9174-9 [DOI] [PubMed] [Google Scholar]
33. Hethcote HW. A thousand and one epidemic models In: Frontiers in mathematical biology. Springer; 1994. p. 504–515. [Google Scholar]
34. Silal SP, Little F, Barnes KI, White LJ. Sensitivity to model structure: a comparison of compartmental models in epidemiology. Health Systems. 2016;5(3):178–191. 10.1057/hs.2015.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
35. Sauer T, Berry T, Ebeigbe D, Norton MM, Whalen A, Schiff SJ. Identifiability of infection model parameters early in an epidemic. medRxiv. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

Supplementary Materials

S1 Fig. Fitting and Root Mean Squared Error (RMSE) results for Emilia-Romagna.

(TIFF)

S2 Fig. Fitting and Root Mean Squared Error (RMSE) results for Piedmont.

(TIFF)

S3 Fig. Fitting and Root Mean Squared Error (RMSE) results for the State of Louisiana.

(TIFF)

S4 Fig. Fitting and Root Mean Squared Error (RMSE) results for the State of New Jersey.

(TIFF)

S5 Fig. Fitting and Root Mean Squared Error (RMSE) results for the State of New York.

(TIFF)

S6 Fig. Root Mean Squared Error (RMSE) results of model fitting prior to the introduction of relaxation for Switzerland.

RMSE results of model fitting prior to the introduction of relaxation for Switzerland, calculated using different levels and durations of infectiousness for the chronically infected population.

(TIFF)

S7 Fig. Comparison of RMSE values for Switzerland for 95% confidence interval Bootstrapping.

(TIFF)

S1 Table. Maximum differences in the median RMSE values.

(XLSX)

S2 Table. Parameter estimates with their corresponding means, standard deviations, and confidence intervals.

(XLSX)

Data Availability Statement

All relevant data are within the manuscript, its Supporting Information files and at https://github.com/burcutepekule/covid_prolonged_shedding.
