Was R < 1 before the English lockdowns? On modelling mechanistic detail, causality and inference about Covid-19

Simon N Wood; Ernst C Wit

doi:10.1371/journal.pone.0257455

. 2021 Sep 22;16(9):e0257455. doi: 10.1371/journal.pone.0257455

Was R < 1 before the English lockdowns? On modelling mechanistic detail, causality and inference about Covid-19

Simon N Wood ^1,^*, Ernst C Wit ²

Editor: Alessandro Rizzo³

PMCID: PMC8457481 PMID: 34550990

Abstract

Detail is a double edged sword in epidemiological modelling. The inclusion of mechanistic detail in models of highly complex systems has the potential to increase realism, but it also increases the number of modelling assumptions, which become harder to check as their possible interactions multiply. In a major study of the Covid-19 epidemic in England, Knock et al. (2020) fit an age structured SEIR model with added health service compartments to data on deaths, hospitalization and test results from Covid-19 in seven English regions for the period March to December 2020. The simplest version of the model has 684 states per region. One main conclusion is that only full lockdowns brought the pathogen reproduction number, R, below one, with R ≫ 1 in all regions on the eve of March 2020 lockdown. We critically evaluate the Knock et al. epidemiological model, and the semi-causal conclusions made using it, based on an independent reimplementation of the model designed to allow relaxation of some of its strong assumptions. In particular, Knock et al. model the effect on transmission of both non-pharmaceutical interventions and other effects, such as weather, using a piecewise linear function, b(t), with 12 breakpoints at selected government announcement or intervention dates. We replace this representation by a smoothing spline with time varying smoothness, thereby allowing the form of b(t) to be substantially more data driven, and we check that the corresponding smoothness assumption is not driving our results. We also reset the mean incubation time and time from first symptoms to hospitalisation, used in the model, to values implied by the papers cited by Knock et al. as the source of these quantities. We conclude that there is no sound basis for using the Knock et al. model and their analysis to make counterfactual statements about the number of deaths that would have occurred with different lockdown timings. However, if fits of this epidemiological model structure are viewed as a reasonable basis for inference about the time course of incidence and R, then without very strong modelling assumptions, the pathogen reproduction number was probably below one, and incidence in substantial decline, some days before either of the first two English national lockdowns. This result coincides with that obtained by more direct attempts to reconstruct incidence. Of course it does not imply that lockdowns had no effect, but it does suggest that other non-pharmaceutical interventions (NPIs) may have been much more effective than Knock et al. imply, and that full lockdowns were probably not the cause of R dropping below one.

Introduction

In principle the inclusion of known mechanisms into models used for statistical inference should improve inference by reducing the bias caused by model misspecification. But there is a catch. What happens if the mechanisms are themselves described only in an approximate manner by ad hoc sub-models? It is then possible for the assumptions built into the sub-models to introduce substantial misspecification bias. The real world consequences of such bias could be substantial if the model is used to determine major public policies. This paper examines and re-implements the model of [1] to investigate the robustness of the inferences about Covid-19 lockdowns made using it. We show that key results are entirely dependent on strong but incidental assumptions introduced in the model formulation, and that relaxation of those assumptions effectively reverses the conclusions.

This may matter in assessing the effectiveness of lockdowns and other stringent blanket measures, which have consequences in addition to reducing viral spread. For example, they modify the evolutionary landscape for the pathogen in ways that seem unlikely to offer a selective advantage for milder strains (see S1 Code). Among mitigation measures full stay-at-home lockdowns are also particularly severe in terms of creating the economic shocks that may cause economic hardship and exacerbate inequality in the long term. In England economic hardship and inequality are associated with very substantial loss of life, as reviewed at length in [2]. We can not predict the actual future life loss that lock down effects will cause, but figures are available that at least indicate the scale of the risk. [2] includes a detailed assessment of the health effects that followed on from the economic shock of 2008, which at minimum constitute a health burden of some 9 million lost life years for the current UK population (based on the increase in the deprivation related life expectancy gap, although Marmot argues for a rather higher figure). For comparison, the extra life loss burden that a minimally mitigated Covid epidemic would have caused is estimated at around 3 million years [3]. The Bank of England characterises the economic shock from UK lockdown and other Covid suppression measures as the largest in 300 years, much larger than 2008. This suggests that lockdowns (and indeed other measures) carry a risk of substantial life loss, and that it is therefore important neither to overstate their clear benefits, nor neglect their downsides, if policy choices are to result in the imposition of measures that broadly minimise risk of life loss in the round (it is obviously facile to reduce the question to a binary choice between lockdown and do nothing). Recognising this, the UK government has made some attempt to assess possible negative health effects of the measures imposed [3], but, although acknowledging that the long term economic impacts on health are likely to be large, has not quantified them. Looking beyond the UK, to India, UNICEF has identified particularly large effects of containment measures, in particular associated with the period of the Indian lockdown from March 24th 2020 [4]: they estimate about 150,000 extra childhood deaths and 60,000 extra still births for India. Given the age profile of Covid deaths, this corresponds to a life year loss more than double that implied by the official Indian Covid death toll to date, and obviously far above the life years saved by the lockdown according to the Government of India/public health foundation of India estimates of 80,000 deaths avoided [5].

[1] is the 41st report of the COVID-19 response team from Imperial College London, whose reports have played a profound role in the shaping of UK government policy on Covid-19. Report 9 in the series provided a major component of the official justification for the first UK lockdown from March 24th 2020, and [1] was covered prominently in the UK Sunday Times, for example. A major message from [1] is that the pathogen reproductive number R was only reduced below one by full lockdowns in England in March and November (see Fig 1), with incidence apparently increasing until the eve of the March lockdown. We show that this result does not survive relaxation of some strong modelling assumptions. [1] also present ‘counterfactual’ simulations from the calibrated model from which they draw conclusions about the deaths that could have been avoided by an earlier first lockdown. We show that these simulations can not be viewed as ‘counterfactuals’ in the usual inferential sense (see e.g. [6]). The avoidable death figures are simple model extrapolations.

The plot is based on data digitized from Fig 1 of [1]. Uncertainties were not reported. The vertical lines mark model breakpoints at: 16th March movement restrictions (work from home advice), 23rd March lockdown announcement, 25th March ‘Lockdown in full effect’, May 11th initial easing, June 15th shops re-open, July 4th restaurants re-open, August 3rd eat-out-to-help-out scheme, September 1st schools open, September 14th rule of 6, October 14th Tier system, November 5th Lockdown. The kinks preceding November 5th are at a further model breakpoint. Prior to the first lockdown the following interventions occurred for which no breakpoints have been imposed: public information campaign, March 4th; symptomatic self isolation 13th; school and hospitality closures 20th. Full lockdown (stay at home orders and shutting down of much ‘non-essential’ activity) came into effect on 24th March, having been announced at 20:30 on 23rd March.

The model in [1] is an age-structured SEIR model with age-structured hospital compartments. The population is divided into 5-year age classes with a final 80+ class and two unstructured classes for care home residents and staff. There are 36 states in each of 19 classes (see Fig 2). The model was specified as a set of ODEs and converted to a discrete time stochastic model for fitting by the τ-leap method [7]. The model was fitted to daily data on hospital deaths, care home deaths, hospital admissions, general ward occupancy, ICU occupancy, antibody test results and PCR test results from surveys, supplied as supplementary material for [1]. [1] also attempts to fit data on test results from the health system. However the model does not attempt to deal with the non-random, opportunistic nature of the sampling in this data stream, despite the continual changes in test capacity, criteria for testing, and operation of the contact tracing system over the course of the data. We therefore believe that there is substantial danger of these data simply undermining the analysis and they should not be included in data to be fitted (we made this decision at the outset, having concluded that we would strongly advice against use of these data if acting as statistical consultants, and have never attempted to fit these data). Data were available for seven English regions, which were fitted separately. The model has 26 free parameters.

Fig 2 — To obtain the rate of flow from one compartment to another, follow the path joining them in the direction of the arrow, multiplying the source state variable by the rate parameters labelling the segments of the path. Rates with a superscript i vary with age class. The relative rates in different classes was obtained from a separate analysis reported in [1], with only a common multiplier of the class specific rates left as a free parameter. For example $p_{H}^{i} = p_{H}^{m a x} ψ_{H}^{i}$ , where $ψ_{H}^{i}$ is fixed, but $p_{H}^{m a x}$ is free. Evaluation of original Knock et al. age-structured SEIR model and S1 Appendix A have full definitions.

[1] bases model inference (fitting) on particle filtering methods, with full fit to all regions reported to take over 100 CPU days, despite using only 96 particles per fit. This computational cost makes model checking difficult, particularly if a more usual number of particles is used and the stronger model assumptions are relaxed: the latter involves allowing substantially more free parameters plus hyper-parameters. Additionally [1] specifies massive overdispersion in all but the test data streams. Decreasing this over-dispersion to levels consistent with the data would likely increase particle depletion problems in filtering, leading to yet longer computing times. Given these issues, we will work directly with the ODE based model. The neglect of stochasticity in the state equations seems likely to be a minor issue here, relative to the other approximations made in the model. In particular, the only non-linearity in the model dynamics is in the transmission between infectious and susceptible sub-populations, which contain large numbers except right at the epidemic start. Other model components are controlled by simple linear flows and are also aggregated over multiple age classes for fitting. Additionally the data sampling interval and total data duration are fairly short relative to the model’s dynamic timescales. In any case, any results dependent on stochasticity would then require a much stronger justification for the stochastic formulation than that it was produced by discretisation of an underlying ODE model.

Furthermore, a generic strength and weakness of the particle filtering methods used in [1] is that they necessarily filter the state variables as well as model parameters. This is advantageous for state forecasting, but can be more problematic for inferential tasks. For an ill-specified dynamic model the filter is often forced to repeatedly select state transitions that are improbable under the model, in order to be sufficiently close to the data. This can result in the filtered states being in an extreme tail of the posterior predictive distribution of the model: that is, of the distribution implied by simulating unfiltered states from the model given the posterior distribution of parameters. Hence model adequacy needs to be checked by comparison of the data with simulations from the posterior predictive distribution. [1] does not report such checks, instead showing the filtered outputs. This is problematic when reality is then contrasted to ‘counterfactual’ simulations, necessarily from the posterior predictive distribution. The simple ODE approach used here does not filter. Instead the states are determined entirely by the model equations and the parameter values. This approach is unforgiving of model misspecification: adequacy is directly assessable from the model fit. It also reduces fit time by four orders of magnitude.

Evaluation of original Knock et al. age-structured SEIR model

In this section we review the model of [1], before presenting some corrections and assumption relaxations in section Modification of the Knock et al. model. Fig 2 is a schematic showing the compartments in each 5-year age or care home class. The exposed, but pre-symptomatic, E stage is modelled by two sequential compartments. It is assumed that no infections are caused by this class. Symptomatic and asymptomatic stages I_C and I_A follow and cause infections, both are single compartment. The duration of the I_C stage is set from data on time from onset of symptoms to hospital admission. The absence of pre-symptomatic infection will lead to longer generation times than are reported in the literature (e.g. [8, 9] p. 26), elevating the R estimates required to achieve observed epidemic growth rates. Care home residents are not hospitalised, and the $G_{D}^{i}$ class shown actually only receives patients for the care home resident class.

Model compartments for PCR and antibody test positivity are fed by the infection rate and the progression rate from the E state, respectively. The infection rate is driven by an age-structured mixing model with contact matrix, C, based on the POLYMOD survey data for the UK [10]. Most elements of C are multiplied by a function b(t) modelling the impact of NPIs, and effects such as weather, on contact rates. In [1] b(t) is piecewise linear with 12 breakpoints (and 12 free parameters) at policy change points. A major aim here is to relax the very strong assumptions built in to such a restrictive model. Care home contact rates are separately parameterized.

Hospitalized patients follow an ICU or general ward route. There are separate compartments for those eventually recovering or dying on the general ward. The ICU route has a pre-ICU compartment, from which patients enter compartments for those dying in ICU, entering ICU but dying later on the general ward, or entering ICU and recovering on the general ward. All compartments are duplicated for confirmed Covid (starred) and not yet confirmed (not starred), with a parameter, γ_U, controlling the rate of testing based transfer from unconfirmed to confirmed. It is assumed that, from the start, 25% of patients arrive at hospital with confirmed Covid. This is improbable given initial testing capacity.

The model captures many features in impressive detail, but several aspects are not modelled:

Separation into locked down and key worker sub-populations at lockdown is not modelled, despite the very different values of R that must apply in these sub-populations, if lockdown is effective.
The assumed linearity of b(t) during lockdown precludes compensation for point 1 in fitting.
Seasonality or other non-NPI temporal effects on transmission are not modelled explicitly and are therefore confounded with the NPI effects, invalidating counterfactual manipulations of the latter.
Region-to-region transmission at the epidemic start is not represented, compromising early model fit and R estimates, as imported cases are modelled as local.
The assumption of no pre-symptomatic infectivity is inconsistent with empirical estimates of the serial interval and generation time, reviewed in [9], for example.
Within hospital transmission is not modelled, although hospital-acquired infections have been reported to account for a quarter of hospitalized cases at times in both waves [11], reports which are corroborated by public NHS data [12], and there is good evidence that the actual figure was higher [13]. This will compromise the hospital data fit.
No interaction between NPIs and age is allowed, which is unlikely given the risk-by-age profiles.
Differential transmission rates between symptomatics and asymptomatics are not modelled.
The reported differences in disease progression between men and women (see [14], for example) are not modelled.
Changes in testing rates with capacity changes are not modelled.

Any biological model for a complex system necessarily makes many simplifying assumptions, often without substantial detriment to statistical inference within the range of the data being modelled. However causal inference based on statistical methods puts much heavier requirements on the model, since it is then required to extrapolate. Counterfactual statements made using a model are of this causal character, and in the current case require the model to behave essentially as a mechanistic representation of reality (since we know of no causal inference strategy that could alleviate the effects of mis-specification in this sort of model, and [1] does not report any). Given this requirement for high mechanistic accuracy, any of the preceding omissions may be problematic. We note also that although we do not seek to extrapolate in this paper, most of these points will have some impact on our results. The hospital acquired infection issue makes it particularly difficult to exactly match hospital data with the model, for example.

The basic SEI(R) model

For concreteness we describe the core of the SEIR model, giving the equations for other compartments in S1 Appendix A. Denoting the time derivative of a variable x by $\dot{x}$ , then for the ith class,

\begin{matrix} {\dot{S}}^{i} = - λ_{i} (t) S^{i} \end{matrix}

(1)

\begin{matrix} {\dot{E}}^{i, 1} = λ_{i} (t) S^{i} - γ_{E} E^{i, 1} \end{matrix}

(2)

\begin{matrix} {\dot{E}}^{i, 2} = γ_{E} E^{i, 1} - γ_{E} E^{i, 2} \end{matrix}

(3)

\begin{matrix} {\dot{I}}_{A}^{i} = (1 - p_{c}) γ_{E} E^{i, 2} - γ_{A} I_{A}^{i} + I (2 < i < 13) ϕ_{t_{0}, σ_{t}} (t) \end{matrix}

(4)

\begin{matrix} {\dot{I}}_{C}^{i} = p_{c} γ_{E} E^{i, 2} - γ_{c} I_{C}^{i} . \end{matrix}

(5)

λ_i(t) is the force of infection defined below, and is the only interesting interaction between age classes. p_c is the proportion of the infected showing symptoms, and the γ parameters determine between compartment flow rates, given in [1]. $I (\cdot)$ is an indicator function and $ϕ_{t_{0}, σ_{t}}$ is an $N (t_{0}, σ_{t}^{2})$ p.d.f. where t₀ is a free parameter. This initialization differs slightly from [1] who put 10 individuals in the age 15–20 asymptomatics at t₀. It is unclear why this is sensible, although it may slightly delay the first wave model care home epidemic. Susceptibles, Sⁱ, are initialized from regional demography supplied in the [1] supplementary material. Care home sizes are supplied in the sircovid package by the carehomes_parameters() function [15].

The effective reproductive number of the pathogen, R, attempts to measure the number of new infections that each infected individual produces on average. Since this number obviously depends on the time course of the epidemic, there are various ways of defining it as an instantaneous quantity (see [9] for a review). For the current model structure the well established definition of [16] is appropriate, and ensures that R = 1 forms a sharp boundary between long term increase and decrease of the epidemic (that is, once R falls below 1, long term decline is guaranteed until it exceeds 1 again). [1] uses this approach for each region, and we follow this. See S1 Appendix A.3 for details. Our fitting also requires the derivatives of the model states with respect to the parameters: the sensitivities. These follow directly from the model specification. For example if $S_{θ_{j}}^{i}$ is the differential of Sⁱ w.r.t. θ_j,

\begin{matrix} {\dot{S}}_{θ_{j}}^{i} = - \frac{\partial λ_{i}}{\partial θ_{j}} S^{i} - λ_{i} S_{θ_{j}}^{i} . \end{matrix}

Generically each term in the model equation involving a state gets replaced by that state’s derivative w.r.t the parameter of interest, and to this are added any terms relating to direct dependence on the parameter of interest. For example, if γ_C was a free parameter then ${\dot{I}}_{C γ_{C}}^{i} = p_{c} γ_{E} E_{γ_{C}}^{i, 2} - γ_{c} I_{C γ_{C}}^{i} - I_{C}^{i}$ . (Note that the same principle applies to the coefficients of the model component function b(t) introduced below. b(t) is represented using a basis expansion, and while the basis functions are time varying, the corresponding coefficients are not).

Force of infection

Writing I for the vector of infectious individuals in each class, then the model for the force of infection in each class is λ = MI where

\begin{matrix} M = (\begin{matrix} b (t) C & b (t) c^{chw} & ϵ b (t) C_{\cdot, 16} \\ b (t) c^{chw} & m_{chw} & m_{chw} \\ ϵ b (t) C_{16, \cdot} & m_{chw} & m_{chr} \end{matrix}) . \end{matrix}

ϵ, m_chw and m_chw are free parameters. b(t) is a parameterized function of time controlling the variation of infection causing contact over time. C is a symmetric matrix of contact rates and c^chw a vector (derived from it for carehome workers). I_j is the sum of asymptomatic ( $I_{A}^{j}$ ) and symptomatic infectious ( $I_{C}^{j}$ ) in class j. S1 Appendix A.1 has the force of infection expressed so that sensitivities follow by inspection.

C is based on the POLYMOD survey [10] accessed through the socialmixr R package [17]. This had 1011 UK participants, who each recorded their contacts on one day. There were 7 participants in the 75–80 age group and none over 80. S1 Appendix A.2 gives details.

The likelihood

The likelihood is constructed from binomial components for the PCR and antibody test data (see S1 Appendix B.2), and negative binomial components for the hospital death, care home death, hospital admissions, general ward occupancy and ICU occupancy data. For the negative binomial components [1] sets κ = μ²/(σ² − μ) equal to 2 in all cases without justification offered. This is a huge level of overdispersion, heavily down-weighting the data relative to the priors. For example, hospital deaths show no evidence of over-dispersion relative to Poisson. But for an expected death rate of 200 the choice of κ raises the standard deviation from 14, for a Poisson deviate, to 140. Although such a choice will reduce particle depletion problems in filtering, it is not easy to justify as a statistical model. Still more problematic is the assumption that observed daily bed occupancy is given by a negative binomial deviate with expectation given by the model, with these deviates independent between days. We are at a loss to understand what mechanism could give rise to such a model. A reasonable model might have daily arrivals and discharges as independent random variables with means given by the model, but occupancy obviously integrates these arrival and discharge rates over days, leading to strong dependence between days. The stochastic version of the model might model some of this dependence, but leaves even less justification for additional independent negative binomial variability.

Modification of the Knock et al. model

In this section we present modifications of the Knock et al. model in order to deal with some of the deficiencies identified above. They consist of a number of corrections and minor modifications and, more fundamentally, relaxing some of the stronger modelling assumptions made in [1].

Corrections and minor modifications

Rates

The γ parameters controlling rates of progression between model compartments are either taken from the literature, or are estimated from CHESS (COVID-19 Hospitalisations in England Surveillance System) data that are not available for checking. There are at least two identifiable problems with the durations used in [1]. Firstly they set the mean duration of the E stage to 4.6 days citing [18]. That paper actually reports a mean of 5.5 days, with 4.6 days lying just above the lower 95% confidence limit for the median. Here we used the mean of 5.8 days from the meta-analysis of [19], which includes [18] as one of the studies. In fact the most statistically careful analysis we found [20] gives an estimated mean incubation period of 9.1 days (n = 1211), and generation time of 5–6 days. Secondly [1] assumes that the mean time from symptoms to hospitalization is 4 days based on [21], but that paper gives 4 days as the median. An exponential distribution is used for time from symptoms to hospitalization (a model which the figures reported in [21] do seem to support), so the median is log2 of the mean. Based on the male and female medians of 5 and 4 days reported in [21], we therefore used a mean time to hospitalization of 6.5 days. In fact [21] is based on early data (up until April 19th 2020) from the ISARIC study. From the much larger ISARIC sample available by October 2020, the mean time from first symptoms to hospitalization is reported as 7.7 days [22], but we will nevertheless follow [1] in using Docherty et al., simply correcting the incorrect use of the median in place of the mean.

Another issue is the assumption that 25% of patients were arriving at hospital with a test confirming their status from the start of the epidemic. In fact, as documented in [23], there was no testing of patients outside of hospitals between 12 March 2020 and 28th April 2020, with very little capacity before this time and close to full testing capacity not reached until mid June 2020 (see Fig 1 of [23]). To crudely capture this we allowed p* to increase linearly from 0 to 0.25 between days 120 and 170, staying at 0.25 thereafter.

Priors

The priors used were not exactly those in [1], rather priors were set to be vague on a working parameter scale. Any limits on parameter were set by the prior intervals reported in [1]. Parameters were optimized on a working scale—either untransformed, log transformed or scaled logit transformed. Gaussian priors on the working scale were also applied, but except for t₀ these were vague, and their only purpose was to allow ready detection of any parameters that were not identifiable. See S1 Appendix B.1 for details.

The negative binomial likelihood

While our basic conclusions are in fact unchanged if we use the likelihood given in [1] for the hospital occupancy data, we can see no valid justification for this part of the model formulation, and therefore replaced it with a likelihood based on the daily change in occupancy. In particular we model the ward (or ICU) arrivals and departures as independent overdispersed Poisson deviates, the difference in which gives the daily change in occupancy. A difficulty with applying this model directly is that hospital arrivals and discharges tend to have weekly pattern. This pattern shows up strongly in the ACFs and PACFs of occupancy first differences for some regions, especially east of England, but is absent from the model. We therefore base the likelihood on weekly changes. Since the changes in occupancy carry no information on the level of occupancy, we also add the sum of daily bed occupancies as a final datum to be fitted, treating this as close to Poisson (by setting κ to a very high constant). See S1 Appendix B.2 for details.

For the total daily hospital admissions data and the care home deaths data we retain the negative binomial model, with the respective κ parameters free to be estimated. Some overdispersion here is a pragmatic way to deal with likely model mismatches in these components. For example, in addition to the mismatches expected from not modelling hospital acquired infections (e.g. [13]), it seems likely that there was some on the ground variability in the severity of disease sufficient for hospitalization, and in rates of discharge, particularly early in the epidemic and when loads were high. For the hospital deaths we set κ = 2000, which gives a likelihood very close to Poisson. There is no legitimate reason to expect overdispersion here, if the model is at all fit for purpose.

Relaxing the model assumptions

The largest change made here is to relax the strong assumption that b(t)—which represents the effects of NPIs, the weather and other factors—is a piecewise linear function with slope changes only at 12 selected NPI change points. Here, b(t) is instead represented semi-parametrically by a logistic transform (see S1 Appendix B.1) of an adaptive smoothing spline, with 80 coefficients and 5 smoothing parameters, in which the degree of smoothness is allowed to vary smoothly with t. See section 5.3.5 of [24] for details. The point of this change is to use a representation of b(t) that allows for a much wider range of possible function shapes and a well founded data driven means for choosing between them, thereby greatly increasing the role of the data in the estimation of b(t), while reducing the role of prior assumptions. Of course it does nothing to remove the confounding of NPIs with weather and other effects, such as spontaneous behavioural changes, but it does avoid the implication that the weather and people’s behaviour change their course only in response to government announcements.

We also relaxed the assumption that all the γ parameters are fixed and known. Firstly, the reference used to justify the choice of γ_G, controlling the rate of progression of fatal disease in care homes [25], appears to contain no information on this parameter, so we allowed it to be a free parameter, which slightly reduces care home death mistiming. Secondly, the model also has difficulty matching the general ward and ICU occupancy data, tending to over-estimate both in the Midlands and two northern regions. To reduce this problem it seemed reasonable to relax the assumption that all the rate parameters controlling progression through the health system were fixed and known. In particular we relaxed the parameters for which there seemed likely to be most scope for some latitude in clinical judgement, perhaps driven by local circumstances, to make substantial differences. So we relaxed the assumptions on the rates related to movement of recovering patients through the system. That is $γ_{I C_{W_{r}}}$ , $γ_{W_{r}}$ and $γ_{H_{r}}$ were treated as free parameters.

A final rigidity in the model structure is that there is assumed to be no infection before individuals could at least potentially become symptomatic on leaving the E stage. At the same time the mean duration of the symptomatic infective stage is set equal to the mean time from symptom onset to hospitalisation. This makes for a very long generation time, much longer than the 5–7 days reported in the literature for the serial interval or generation time (see p. 26 of [9] for a review). One consequence of this is that R estimates need to be higher than those usually quoted to meet the initial rate of increase in the disease ([1] actually limits R in a way that avoids estimates being too high). To relax this link between clinical disease progression rates and the generation interval, we introduced an extra compartment between I_c and hospitalization (see the grey ‘P’ on Fig 2).

\begin{matrix} \dot{P} = γ_{C} I_{c} - γ_{p h} P \end{matrix}

where P replaces I_c in all flows into hospital compartments and the R state. By appropriate choice of γ_ph, this state allows us to shorten the E state and I_c state, hence reducing the generation time, without changing the literature based mean time from infection to hospitalisation. Specifically, we shortened the E state to have an average of 3 days to infectivity, and the I_C state to be 4 days, yielding a generation time of 6.2 days (accounting for the duration of I_A, which was unchanged). The P state then has an average duration of 5.3 days so that the total time from infection to hospitalization still matches the literature based 5.8 + 6.5 days discussed previously.

Estimation and inference

The sensitivities of the model states with respect to the parameters were obtained for all 703 model state variables, yielding a system of 65379 sensitivity ODEs. Model and sensitivities were solved by fourth order Runge-Kutta integration (see e.g. [26]) with a one day time step (having confirmed that halving the step made negligible difference to the evaluated likelihood). Hence the log likelihood and its derivatives w.r.t. the free parameters could be readily evaluated. Due to sparsity and cache efficiency, the sensitivity system less than doubles computing time for the model. Computing the likelihood, likelihood derivatives and R series for the full model takes less than a second on a single core of a low specification laptop—it is considerably faster for the original [1] model with fewer free parameters.

Given the log likelihood and derivatives, the penalized log likelihood and derivatives are also readily evaluated, so the posterior modes of the free parameters can be obtained by quasi-Newton optimization. The smoothness of b(t) was controlled by a Gaussian smoothing prior, with 5 free smoothing parameters, which were estimated by the approximate marginal likelihood optimization method of [27]. Uncertainty was assessed using the large sample approximate posterior covariance matrix of the parameters, and the delta method. See S1 Appendix B.3.

Results

Fig 3 shows the fit of the model with the various assumption relaxations applied. The model fits imperfectly, with some systematic errors in the fit to hospital occupancy and arrival data as expected: without modelling the hospital acquired infections (which are included in the data and, as discussed previously, often made up a substantial portion of the total hospitalized), as well as possible time variability in on-the-ground admission criteria, it is unlikely that better fits could be achieved. Given the ambitious nature of the fitting task, it seems reasonable to view the results as useful in the statistician George Box’s ‘all models are wrong, but some are useful’ sense.

Figs 4 and 5 show the corresponding inferences about incidence and R. All regions have peak incidence prior to the first lockdown with total incidence for England in decline well before lockdown. The regional incidence picture is more mixed at the second lockdown, although the total is again falling well before lockdown. Furthermore all regions have R ≲ 1 by either lockdown, with average R < 1 some days before either lockdown. Several regions relatively distant from London have the inferred R initially increasing. This is probably an artefact caused by the independent initialisation of each region, which cannot capture the initial region-to-region spread. As in [1] the plotted uncertainties would be over-optimistic, even if we assumed a correct model structure, as they do not account for the uncertainty in most of the rate constants.

Fig 4 — Notional 95% credible bands are shown. These do not reflect all the uncertainty in rate parameters and assume a correct model structure: hence they provide a lower bound on uncertainty. Vertical dashed lines show some policy changes. The 4 preceding lockdown I are information campaign, symptomatic self isolation, work from home advice, school and hospitality closures. ‘Eat out to help out’ was a scheme encouraging people to use the restaurants and pubs. The re-opening of schools after the first lockdown is also shown. Subsequent policies introduce increasing levels of restriction.

Fig 5 — Notional 95% credible bands are shown. These do not reflect all the uncertainty in rate parameters and assume a correct model structure: hence they provide a lower bound on uncertainty. Vertical dashed lines as Fig 4.

Although it could also be partially weather driven, the systematic pattern of R continuing to fall after the first lockdown is introduced, and then increasing again well before the lockdown restrictions were lifted, is to be expected. R is the average number of new infections per existing infection. Immediately after lockdown most infections are in the locked down population, with a low R, and only a minority are in the key worker population with higher R (assuming lockdown has an effect), so the average is low. After the locked down population runs out of household members to infect, the proportion of infections among key workers must increase, due to their higher R. So the average R must increase too as most of the infections to average over are now in the higher R population. Although the simple arithmetic mechanism underlying this effect results from having locked down and key worker strata, we only observe aggregate data, reflecting the change in R, but not what causes it. The model also deals only with populations aggregated over the two strata, but can still capture the change in R apparent in aggregate data, if b(t) is flexible enough. However, the piecewise linear b(t) of [1] is not flexible enough in this regard.

Fig 6 shows how the lockdown 1 timing result depends on the various changes made to the [1] model, when they are applied sequentially. All panels use the corrected likelihood. The top left panel then uses the incubation period and time to hospitalization used by [1], and the same serial interval, but has the piecewise linear b(t) replaced by an adaptive spline. Rather than R being much larger than 1 on the eve of lockdown it is around 1. The top right panel modifies the model further, by reducing the serial interval to about 6.2, making it closer to the literature range—if anything this moves the R = 1 point slightly later. The bottom left panel is then the model with the incubation period and time to hospitalization set to the literature values consistent with the papers cited in [1] as the sources of these durations. This panel is simply an enlargement of the relevant portion of Fig 5. Finally the lower right panel shows the results when the smoothing penalty is downweighted by a factor of 4. This checks whether the timing results could be driven by smoothness assumptions, by substantially reducing the amount of smoothing relative to the estimated level. The results do not appear to be a smoothing driven artefact.

If fits of this model to data are viewed as a reasonable basis for inference about the timing of incidence and R levels, then the implication is that R < 1 probably occurred some time before both the first two English lockdowns, and that incidence was already in sharp decline before either. The contrary result of [1] relies on a very restrictive model for b(t) and on setting incubation and hospitalization times to values less than those given in the papers cited as their source.

Discussion

Three major claims are made in [1]. Whereas the first is of a descriptive nature, namely that the two English Covid-19 lockdowns in March and November 2020 coincide with a major drop in the reproduction rate of Covid-19 in the UK, the other two are of a so-called “counterfactual” nature: (i) if England had not gone into lockdown, then there would have not been an associated drop in reproduction rate and (ii) if England had gone into lockdown earlier (or later) then a lot of lives would have been saved (or lost, respectively).

The key challenge is that a counterfactual cannot be directly observed and must be approximated with reference to a comparison group. There are various accepted approaches to determining an appropriate comparison group for counterfactual analysis, ideally using a prospective design. When this is not available, such as in this case, a retrospective approach is necessary. But there are stringent conditions on a retrospective design in order for it to have counterfactual validity, such as avoiding confounding, contamination, and impact heterogeneity (see [6] for an introductory treatment). Confounding occurs where certain factors, for example the various social distancing measures in place prior to the lockdowns, are correlated with exposure to the intervention and, independent of exposure, are causally related to the outcome of interest. Confounding factors are therefore alternate explanations for an observed, but possibly spurious, relationship between intervention and the outcome; in this case between lockdown and the reduction in R. The pre-lockdown social distancing measures are also an example of contamination, which may also invalidate any counter-factual statements. Contamination occurs when members of treatment group (i.e. the actual population) and/or comparison groups (i.e. the counterfactual populations) have access to another intervention which also affects the outcome of interest. Additionally, there is the issue of impact heterogeneity: the impact of the lockdown will be very different in the locked down subset of the population, compared to key workers, who are less restricted. Finally, [1] explicitly states that b(t) is modelling both the effects of NPIs and the weather. There is therefore no basis on which the model can identify the effect of lockdown independent of the weather, enabling the counterfactual manipulation of one while appropriately controlling the other. But such control is absolutely fundamental to causal reasoning with counterfactuals. We conclude that the model and inference of [1] do not form a reasonable basis for making counterfactual statements about how many people would have died if lockdown had occurred at a different time. Even without the preceding general problems, there is the specific problem that lockdown can not have caused R to drop below one if this event preceded lockdown, but the counterfactual statements rely on such a causal link.

While this paper was in review, more direct evidence emerged which aligns with our conclusions, but not with [1]. [28] used a direct statistical deconvolution approach to infer incidence from hospital death data and three published infection to death distributions. The study gives similar results for incidence and R to the whole England results obtained here, and its conclusions are strengthened by the close match between the disease duration distributions used and more recent disease duration data reported by [22] based on more than 24,000 fatal cases. The results here and in [28] also correspond to the reconstructions of the number of newly symptomatic infections each day, reported by [29]. This latter study is based on symptom onset dates reported by antibody positive subjects in a properly randomized surveillance sample. Lagged by the average latent period this gives a direct estimate of incidence, and the results are shown in the left panel of Fig 7. The incidence reconstruction can also be used to infer R by the method given in section 5.1 of [28], and this reconstruction is also shown. Finally, the UK Office for National Statistics has published incidence estimates based on its properly randomized Covid-19 surveillance survey. The survey was not yet active at the time of the first peak, but its results (see Fig 7, right) are in agreement with [28, 29] and the results reported here for the second half of 2020. Hence our model fitting based results are consistent with the relatively direct estimates based on the three least biased data sources available.

Fig 7 — a. continuous curves are onset of new symptoms per day from the REACT-2 study digitized from [29], and lagged by the average 5.8 days from infection to first symptoms to give incidence: blue is raw and black is spline smoothed. Jan 1 2020 is day 1 and vertical dashed red lines show the lockdown dates. The grey band shows a 95% credible interval for R reconstructed from the smoothed incidence curve by the method given in section 5.1 of [28]. The horizontal dashed line shows R = 1. Incidence peaks about 9 days prior to lockdown 1 (day 84, March 24th 2020), and R < 1 four days before lockdown 1. b. [30] published estimated incidence with 95% confidence limits. Red lines show the dates of the second and third UK lockdowns—the survey was not running at the first.

After we had received referees reports for this paper (on 18th June 2021), and revised accordingly, [1] was published in Science Translational Medicine [31], having been submitted there on 14th April 2021. The published paper does not refer to our work, but made some changes relative to [1], of which the most significant appear to be: (i) introducing a pre-hospital non-infectious stage, equivalent to our ‘P’ stage, to shorten the generation time/serial interval to be consistent with the literature and (ii) estimating two common negative binomial κ parameters, thereby avoiding simply setting them to 2 (the number of particles used in filtering has been increased accordingly). An extra ‘community deaths outside hospital’ data stream (comparatively small numbers) was also fitted. The main results of [31] are essentially the same as [1], although the new equivalent of Fig 1 now shows London as having R < 1 before the first lockdown, and R for other regions is slightly reduced on the eve of lockdown. Significantly, given our results, the b(t) model was unchanged and the time to hospitalization, incubation time and hospital occupancy likelihoods remain uncorrected in [31]. No modification appears to have been made that might enhance the statistical validity of the ‘counterfactuals’ presented. Hence we do not believe that the changes made between [1] and [31] address the most substantial issues raised here or undermine our results.

Our results on the timing of R < 1 and peak incidence obviously do not imply that the lockdowns had no effect. Indeed the dip and recovery seen in R after the first lockdown is only expected if lockdown reduces spread in the locked down population, relative to those not locked down. The point is rather that the additional effect, on top of the cumulative effects of other behavioural changes pre-dating lockdown, seems likely to have been greatly overstated. In our view, determining definitively what caused R to drop below one is not possible. In March especially, policy and behavioural changes were so rapid (public information campaign, March 4th; symptomatic self isolation 13th; work from home advice, 16th; school and hospitality closures 20th; full lockdown, 24th) that there would simply have been insufficient time to determine what had worked, even if adequate data had been gathered to answer this question. In fact, there was no surveillance testing at that point. However, it seems difficult to make the case that full lockdowns were necessary to bring R below one, whether region-by-region or in aggregate for England. In densely populated London, by far the UK’s largest city where the control problem should be most difficult, the evidence is particularly strong that R < 1 well before full lockdown. While not impossible, it would be quite counter-intuitive if stronger measures were in fact necessary for control in the less densely populated regions.

Supporting information

S1 Appendix. Supplementary appendices.

(PDF)

Click here for additional data file.^{(396.2KB, pdf)}

S1 Code. Replication code and data.

(ZIP)

Click here for additional data file.^{(109.2KB, zip)}

Acknowledgments

We thank the 2 referees and the editor for some helpful suggestions for improving the paper, including the suggestion of Fig 6. Thanks also to Nicole Augustin, Fraser Nelson, Jason Matthiopoulos and Jonathan Rougier for useful comments and discussions. We supplied the preprint version of this paper to the authors of [1] on 4th February 2021, when we posted a copy on medArxiv. They acknowledged receipt, but have not responded further.

Data Availability

All relevant data are within the manuscript and its Supporting information files.

Funding Statement

The author(s) received no specific funding for this work.

References

1.Knock ES, Whittles LK, Lees JA, Perez Guzman PN, Verity R, Fitzjohn RG, et al. Report 41: The 2020 SARS-CoV-2 epidemic in England: key epidemiological drivers and impact of interventions. Imperial College; London. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Marmot M, Allen J, Boyce T, Goldblatt P, Morrison J. Health Equity in England: The Marmot Review 10 Years On. The Health Foundation; 2020. [Google Scholar]
3.DHSC. Direct and Indirect Impacts of COVID-19 on Excess Deaths and Morbidity; 2020. Department of Health and Social Care, Office for National Statistics, Government Actuary’s Department and Home Office. Available from: https://www.gov.uk/government/publications/dhsconsgadho-direct-and-indirect-impacts-of-covid-19-on-excess-deaths-and-morbidity-15-july-2020.
4.Bhutta ZA, Owais A, Horton S, Rizvi A, Nisar I, Das J, et al. Direct and indirect effects of the COVID-19 pandemic and response in South Asia. UNICEF; 2021. [Google Scholar]
5.PIB India (2020, May). Government of India Press Briefing on the actions taken, preparedness and updates on COVID-19, 22nd May 2020, Press Information Bureau. https://pib.gov.in/WebcastMore.aspx?webcast_tempID=434.
6.Pearl J, Glymour M, Jewell NP. Causal inference in statistics: A primer. John Wiley & Sons; 2016. [Google Scholar]
7.Gillespie DT. Approximate accelerated stochastic simulation of chemically reacting systems. The Journal of chemical physics. 2001;115(4):1716–1733. doi: 10.1063/1.1378322 [DOI] [Google Scholar]
8.Flaxman S, Mishra S, Gandy A, Unwin HJT, Mellan TA, Coupland H, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature. 2020;584(7820):257–261. doi: 10.1038/s41586-020-2405-7 [DOI] [PubMed] [Google Scholar]
9.Anderson R, Donnelly C, Hollingsworth D, Keeling M, Vegvari C, Baggaley R, et al. Reproduction number (R) and growth rate (r) of the COVID-19 epidemic in the UK: methods of estimation, data sources, causes of heterogeneity, and use as a guide in policy formulation. Royal Society SET-C; 2020. [Google Scholar]
10.Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, et al. POLYMOD social contact data; 2017. Available from: 10.5281/zenodo.1157934. [DOI]
11.Discombe M. Covid infections caught in hospital rise by a third in one week. Health Service Journal. 2020. [Google Scholar]
12.NHS. Covid-19 Hospital Activity; 2021. Available from: https://www.england.nhs.uk/statistics/statistical-work-areas/covid-19-hospital-activity/.
13.McKeigue PM, McAllister D, Caldwell D, Gribben C, Bishop J, McGurnaghan SJ, et al. Relation of severe COVID-19 in Scotland to transmission-related factors and risk conditions eligible for shielding support: REACT-SCOT case-control study. BMC Medicine. 2021;. doi: 10.1186/s12916-021-02021-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Williamson EJ, Walker AJ, Bhaskaran K, Bacon S, Bates C, Morton CE, et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature. 2020;584(7821):430–436. doi: 10.1038/s41586-020-2521-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Baguelin M, Knock E, Whittles L, FitzJohn R, Lees J, Cori A. sircovid: SIR Model for COVID-19; 2021. [Google Scholar]
16.Diekmann O, Heesterbeek JAP, Metz JA. On the definition and the computation of the basic reproduction ratio R0 in models for infectious diseases in heterogeneous populations. Journal of mathematical biology. 1990;28(4):365–382. doi: 10.1007/BF00178324 [DOI] [PubMed] [Google Scholar]
17.Funk S. socialmixr: Social Mixing Matrices for Infectious Disease Modelling; 2020. Available from: https://CRAN.R-project.org/package=socialmixr.
18.Lauer SA, Grantz KH, Bi Q, Jones FK, Zheng Q, Meredith HR, et al. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Annals of internal medicine. 2020;172(9):577–582. doi: 10.7326/M20-0504 [DOI] [PMC free article] [PubMed] [Google Scholar]
19.McAloon C, Collins Á, Hunt K, Barber A, Byrne AW, Butler F, et al. Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. BMJ open. 2020;10(8):e039652. doi: 10.1136/bmjopen-2020-039652 [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Deng Y, You C, Liu Y, Qin J, Zhou XH. Estimation of incubation period and generation time based on observed length-biased epidemic cohort with censoring for COVID-19 outbreak in China. Biometrics. 2020;. doi: 10.1111/biom.13325 [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Docherty AB, Harrison EM, Green CA, Hardwick HE, Pius R, Norman L, et al. Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: prospective observational cohort study. BMJ. 2020;369. doi: 10.1136/bmj.m1985 [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Pritchard M, Dankwa EA, Hall M, Baillie JK, Carson G, Docherty A, et al. ISARIC clinical data report 4 October 2020. medRxiv. 2020.
23.Briggs A, Jenkins D, Fraser C. NHS Test and Trace: the journey so far. Health Foundation; 2020. [Google Scholar]
24.Wood SN. Generalized Additive Models: An Introduction with R. 2nd ed. Boca Raton, FL: CRC press; 2017. [Google Scholar]
25.Bernabeu-Wittel M, Ternero-Vega J, Díaz-Jiménez P, Conde-Guzmán C, Nieto-Martín M, Moreno-Gaviño L, et al. Death risk stratification in elderly patients with covid-19. A comparative cohort study in nursing homes outbreaks. Archives of gerontology and geriatrics. 2020;91:104240. doi: 10.1016/j.archger.2020.104240 [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical Recipes. 3rd ed. Cambridge: Cambridge University Press; 2007. [Google Scholar]
27.Wood SN, Fasiolo M. A generalized Fellner-Schall method for smoothing parameter optimization with application to Tweedie location, scale and shape models. Biometrics. 2017;73(4):1071–1081. doi: 10.1111/biom.12666 [DOI] [PubMed] [Google Scholar]
28.Wood SN. Inferring UK COVID-19 fatal infection trajectories from daily mortality data: were infections already in decline before the UK lockdowns? Biometrics. 2021. doi: 10.1111/biom.13462 [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Ward H, Cooke G, Whitaker M, Redd R, Eales O, Brown JC, et al. REACT-2 Round 5: increasing prevalence of SARS-CoV-2 antibodies demonstrate impact of the second wave and of vaccine roll-out in England. medRxiv. 2021.
30.ONS. Coronavirus (COVID-19) Infection Survey, UK; 2021. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/bulletins/coronaviruscovid19infectionsurveypilot/7may2021#number-of-new-covid-19-infections-in-england-wales-northern-ireland-and-scotland.
31.Knock ES, Whittles LK, Lees JA, Perez Guzman PN, Verity R, Fitzjohn RG, et al. Key epidemiological drivers and impact of interventions in the 2020 SARS-CoV-2 epidemic in England. Science Translational Medicine. 2021. doi: 10.1126/scitranslmed.abg4262 [DOI] [PMC free article] [PubMed] [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0257455.r001

Decision Letter 0

Alessandro Rizzo

18 Jun 2021

PONE-D-21-08472

Was R < 1 before the English lockdowns? On modelling mechanistic detail, causality and inference about Covid-19

PLOS ONE

Dear Dr. Wood,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Both reviewers provided constructive and detailed suggestions to improve the manuscript quality and presentation, especially considering that the manuscript provides a criticism of results obtained by another research group [Knock et al., 2020]

I echo one of the reviewers in the suggestion of putting the authors claim in a more objective fashion, in order to provide the readers with a balanced counterpoint to [Knock et al., 2020], rather than a mere criticism of their work, especially in light of the fact that both studies make simplifying assumptions that might bias their claims.

In this view, another reviewer requires more discussion about the authors' assumptions and the criticism of [Knock et al., 2020] ones. I agree with the reviewer that a more detailed and balanced discussion is required. In my opinion, as well as according to many other scholars, it is very hard to disentangle the single effects of different measures on the spreading of COVID-19, and such a study is strongly dependent on the simplifying assumptions made to carry out the study. Hence, I think that the proposed manuscript would greatly benefit by the analysis of a more balanced viewpoint, highlighting strengths and weaknesses of both set of assumptions, and, possibly, to the consequences that such assumptions may entail in both studies.

Please submit your revised manuscript by Aug 02 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Alessandro Rizzo

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found athttps://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please include a copy of Table 1 which you refer to in your text.

Additional Editor Comments (if provided):

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The manuscript contains a sophisticated compartmental model which allows for capturing the course of COVID-19 pandemic in England. In particular, by fitting the model to different epidemiological indicators such as the number of fatalities, hospitalizations or the evolution of test positivity, the authors are able to infer the actual epidemic incidence, i.e. the number of daily new cases, and study the evolution of the effective reproduction number, which is a pivotal indicator to quantify the stage of an ongoing outbreak.

The analysis presented by the authors represents a critical assessment of a report published by an independent research group in December 2020 addressing the same problem [Knock et al. 2020]. There, it is claimed that lockdown measures during the first epidemic waves in England were essential to reduce the effective reproduction number below one and, as a consequence, observe a decrease in the daily incidence.

The findings here contradict the claims made in [Knock et al. 2020] and show that the epidemic incidence started decreasing before full lockdown measures were implemented in England. The discrepancies between both results mainly arise from three factors: i) the choice of a shorter generation time interval, ii) the correction of the typical time elapsed from the onset of symptoms to hospitalization and iii) the continuity of the function reflecting the impact of non-pharmaceutical interventions. The mathematical framework proposed here is solid and described in depth and the methodology followed by the authors is clearly explained, which make the manuscript scientifically sound.

My main criticism with the current manuscript concerns its writing style. In this sense,I think that the authors should tone down some parts of the manuscript to make their critical assessment more objective. I would also encourage the authors to keep their analysis at the scientific fundamental level rather than discussing further political implications. Some examples of parts which, in my opinion, should be rewritten are:

Page 2: The comments about the media attention of the reports published by the team from Imperial College London or their relevance for policy-making lie out of the scope of a scientific assessment of the model.

Page 3: “Knock et al. do not report such checks, showing only the outputs of filtering” I think that this does not provide any valuable information for the discussion made in the paragraph.

Pages 4-5: “The model captures many features in impressive detail, but several aspects are not modeled”. The enumeration following this sentence describes very exhaustively many limitations of the model introduced in [Knock et al. 2020], most of them are not introduced in the model proposed here by the authors either. As in any theoretical model, some assumptions should be made to reduce the number of equations and not to end up with an unmanageable huge parameters’ space. As the submitted manuscript does not incorporate most of these logical limitations of theoretical frameworks, I do not see the point of including such exhaustive description.

Page 6: “It is hard to understand this choice, unless it was made to avoid particle depletion problems in filtering”. This statement seems very subjective and can be expressed in other words or left to the interpretation of the reader.

Page 7: “While not ideal, this is a less wrong assumption” Given the fact that the assumption is not supported by any reference either, although I agree it can be more logical, I would avoid making any comparison.

Page 9: “Given the ambitious nature of the fitting task, it seems reasonable to view the results as useful in the ‘all models are wrong, but some are useful’ sense.”

Page 12: “We conclude that the model and inference of Knock et al. do not form any sort of reasonable basis for making counterfactual statements about how many people would have died if lockdown had occurred at a different time”

Concerning the contents of the manuscript, I have some suggestions which, in my opinion, would make the result presented by the authors more robust:

Page 2: For the sake of readability, it would be very useful to include some references supporting how lockdowns are detrimental to the evolution of the pathogen or explaining the economic side effects observed in many countries.

The expression for the time derivative of the sensitivities of each variable with respect to the different parameters is valid when the parameters $\\theta$ are constant and do not depend on time. Nonetheless, the function $b(t)$ is time-dependent and, if I am correct, this expression should not be valid in this case. Please clarify it.

I think the manuscript would clearly benefit from a figure showing explicitly how the evolution of the effective reproduction number depends on the generation time interval chosen or the time to hospitalization as reflected by the authors in Page 9.

I would avoid mentioning seasonality as a crucial confounding factor invalidating the counterfactual scenarios proposed in [Knock et al. 2020] because, in the short time windows analyzed in the latter report, one cannot expect substantial underlying changes in the weather conditions.

The section describing the computation of the effective reproduction number should be extended to better explain the significance of this indicator. In particular, within the different alternatives explored in the literature (see [Gostic et al. PLoS Comput Biol 16(12): e1008409 (2020)] for a further review on the topic), the authors compute the reproduction number as the expected number of contagions that an agent becoming infectious at time $t$ will make throughout his/her infectious period if the conditions of the system remain immutable.

In this sense, the explanation of the dependence of the effective reproduction number after lifting lockdown measures in page 9 is not valid if one resorts to this definition. Providing the model does not divide the population into key workers and locked-down individuals either, I think that the authors could get rid of this discussion.

Finally, to obtain the reproduction number at the national level, the authors should weight the individual reproduction numbers by the newly infected individuals in each of the regions rather than with the infected population.

Reviewer #2: The manuscripts provides a revision of the study by Knock et al. (2020). The authors highlight various caveats and adapt the model accordingly. The main finding is that, in contrast to Knock et al., the reproduction number is found below one before the implementation of the two lockdowns in England. In this sense, the authors then question whether the lockdowns were the main driver to control SARS-CoV-2 in England.

The approach the authors take is well motivated and scientifically sound. The authors highlight various shortcomings of the study by Knock et al. and adapt different aspects of the model. Unfortunately, due to the various changes made, it is not possible to isolate the aspect that lead to the different result. It may be due to the shorter generation time, the adapted time from symptom onset to hospitalisation or the different functional form of R(t). I guess that the fit is very costly to perform, but I strongly recommend to incrementally adapt the model by Knock et al., which would allow to isolate the impact of the various changes made.

The inferred reproduction number R(t) starts to decrease almost immediately (Figure 5). How do the authors explain this early decrease? I guess no NPIs were in place back then and mobility still showed no reduction. In this sense, please explicit in a table when the reproduction number actually was below one. It seems that during the first wave the reproduction number in London was below one surprisingly early. Additionally, I strongly recommend the authors to work with dates instead of integer numbers for the day of the year. It would improve the readability of the manuscript a lot.

In the manuscript here, the model consists of regular ODEs, whereas Knock et al. worked with a stochastic model. The authors write: “The neglect of stochasticity in the state equations seems likely to be a minor issue here, relative to the other approximations made in the model.” In principle I agree with this statement. However, looking at the results (Figure 3) the peak in care home deaths is anticipated with respect to the data in all regions. One explanation could be the lack of stochasticity that may delay infections in care homes due to the modular contact matrix. Another factor could be the different initialisation the authors choose as it is pointed out in the manuscript.

In Figure 3, one can see that occupancy and admissions generally peak earlier than the data and decrease faster. How do the authors explain this mismatch? Furthermore, why is there no uncertainty range indicated in Figure 3? Is it too small to be visible? However, this would not be consistent with the uncertainty ranges presented for the incidence (Figure 4) and the reproduction number (Figure 5).

In the discussion, the authors mention a sensitivity analysis they performed. In particular, they considered different time to hospitalisation, generation time and likelihood for occupancy. Please show these results in a supplementary. Because comments like “… R = 1 a little later, but still before lockdown” are not very helpful for the reader. Indicate how much later and when if you already performed the fit.

The authors criticise many times that Knock et al. did not isolate the impact of seasonality and NPIs. Furthermore, you claim that this invalidates the counterfactual scenarios. If the counterfactual scenarios consisted of shifting the time series by months this would certainly be true. However, Knock et al. consider a maximal shift of two weeks. Do the authors truly believe that the meteorological change in two weeks invalidates the conclusions of Knock et al.?

The authors criticise that Knock et al. did not model within hospital transmission. Furthermore, they claim within hospital transmission account for a quarter fo cases in both waves and cite a health service journal. Is this with respect to the reported cases or do the authors believe this also holds for infections in general? Be more specific whether you refer to reported cases, which will be strongly biased towards within hospital transmission. Furthermore, making such claims based on a reference to a journal does not seem appropriate for a scientific article.

The authors point out that including data regarding the number of tests undermines the analysis and should therefore not be included. However, I do not understand whether you actually did fit testing data or not. Later in the manuscript you explain the different assumption made regarding the pre-tested rate. This led me believe that you actually fit to the testing data. Please clarify this point. Also you should be consistent, if you truly believe that it undermines the analysis then don’t include it.

A more general remark is regarding the writing of the manuscript. Various parts are not written with the objectivity a scientific article requires. For example, criticising the approach of Knock et al. for not modelling the differences between men and women seems really far fetched. Similarly, on page 6, you refer to a reasonable model. Rewrite this such that it is less judgemental. An example would be “a more accurate/better approach”. The same comment also holds for other parts of the manuscript:

Page 7: “Less wrong assumption … “. Change it to better/ more realistic assumption.

Page 2: “… was accompanied by substantial press release material”. Why is this relevant?

Page 1: You insinuate that the lockdown lead to a substantial loss of life due to the caused economic hardship. A scientific article is not the place to make insinuations. Referencing a pre-COVID study to back up a claim about the impact of lockdown is very vague and implicit. If you want to make such comment find a study that actually treats the impact of lockdowns and aims at quantifying the non COVID related loss of life.

Page 12: You write “We conclude that the model and inference of Knock et al. do not form any sort of reasonable basis …”. I understand your criticism of their approach, but saying that it does not support any reasonable basis is an exaggeration.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Benjamin Steinegger

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Sep 22;16(9):e0257455. doi: 10.1371/journal.pone.0257455.r002

Author response to Decision Letter 0

1 Jul 2021

Please see the attached Response to reviewers file

Attachment

Submitted filename: response.pdf

Click here for additional data file.^{(682.3KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0257455.r003

Decision Letter 1

Alessandro Rizzo

5 Aug 2021

PONE-D-21-08472R1

Was R < 1 before the English lockdowns? On modelling mechanistic detail, causality and inference about Covid-19

PLOS ONE

Dear Dr. Wood,

The two Reviewers are appreciative of the work done on the manuscript, which has now largely improved. Both reviewers raise some minor points toward making the paper even clearer and impactful. In particular, one of the Reviewers requires some more discussion about the definition and implications of lockdowns and/or other restrictions, whereas the second one requires some clarifications about the use of the reproductive number and suggests to remove the section devoted to the correspondence between the authors of this manuscript and the team of Knock's et al.

Please submit your revised manuscript by Sep 19 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

We look forward to receiving your revised manuscript.

Kind regards,

Alessandro Rizzo

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Reviewer #1: The authors have done an exhaustive work to address the comments raised by both referees and the revised version of the manuscript is much more solid and scientifically rigorous. Consequently, I think that the manuscript is now suitable for publication.

Nonetheless, I have a few minor comments regarding the modifications introduced by the authors:

- In my first revision, I thought that the authors were computing the effective reproduction number at time $t$ as the number of contagions made by a newly infected individual during his/her infectious period. In case that the authors define the effective reproductive number at time $t$ as the number of contagions made by an existing infectious individual at time $t$, which corresponds to the instantaneous reproductive number, its computation involves the past rather than the future of the dynamics. Therefore, the claim made in Page 4 on the relevance of the future dynamics for this quantity should be modified; nonetheless, both definitions are equivalent as long as the contact and recovery rates remain unchanged (see Nishiura, H., & Chowell, G. (2009). The effective reproduction number as a prelude to statistical estimation of time-dependent epidemic trends. In Mathematical and statistical estimation approaches in epidemiology (pp. 103-121).

- I like the new part of the discussion at the end of Page 15 on the modifications introduced by Knock et al in the publication following their preprint. Having said this, the description of the interaction between the authors of that paper and the authors of this manuscript are not matter of a scientific work and, therefore, should be removed, leaving possible further subjective interpretations to the reader.

- Regarding new Figure 6, I would use letters to label each panel to avoid describing them as a function of their position in the figure in the main text.

Reviewer #2: The authors addressed most of my previous comments. In particular, the inclusion of Figure 6 substantially improves the manuscript. Nevertheless, I still have some comments and technical details that are not clear to me.

The intention behind my comments regarding the impact of economic hardship was exactly on what you comment on in the new version of the manuscript. Temporal economic hardship is not the same as endemic/systemic poverty. Therefore, one should not expect the same impact on the health of individuals. I appreciate the inclusion of the two references on the indirect impact of COVID-19 and the restrictions that were put in place. However, I would like to highlight that neither of the references studies the impact of lockdowns in particular. Both article treat the indirect impacts of COVID-19 in general. The decrease in economic activity cannot be simply attributed to lockdown. The mere existence of a pandemic will affect economic activity, independently on whether restrictions are put in place or not. Furthermore, in the case of England, the closure of restaurants and leisure centres preceded lockdown. I recommend you to make the definition of lockdown more explicit. In particular, point out that the closure of restaurants and nightclubs is not included. As I understand, by lockdown you refer to the closure of non essential retail and the stay-at-home order that was announced on 23 March and took effect on 26 March.

In the reference on South Asia I did not find a part that would justify the conclusion that restrictions led to more deaths than the ones that have been prevented. The only estimation I can find are COVID-19 deaths in 2020 "if no additional mitigation strategies are instituted in the region this year”. However, in this case the mitigation measures, including for example the lockdown in India, are already included.

Nevertheless, I agree with the authors that the impact of lockdown on health besides COVID-19 should be taken seriously. I just ask the authors to be careful how they communicate this and how studies are interpreted.

I have a question regarding the incubation period. In section "3.1 - Corrections and minor modifications" the authors state that they use a mean duration of 5.8 days for the incubation period that is equivalent here with the time spent in compartment E. In contrast, in section "3.3 - Relaxing the model assumptions" the authors state that they shorten the E state to have an average of 3 days to infectivity. So which of both the authors actually consider? Or it this the difference between the plot in the top right and bottom left in Figure 6? Also, when the authors shorten the time in E to 3 days do they consider an additional compartment that represents pre-symptomatic infectiousness?

The authors comment on the published version of the manuscript by Knock et al. In particular, they mention that in the new version the reproduction number is below one in many regions before lockdown. Looking at their results, it seems that the reproduction number drops below one in many regions on the day the lockdown was announced. I think this is very consistent with the results you have here, even though the day of the announcement is not indicated in the graphics. Obviously, not only the implementation but also the announcement has an impact. This is definitely something you should comment on. I also recommend the authors to include a table where you explicit write when the reproduction number crosses one. It is somehow difficult to see this in the plot but is a crucial result of your analysis.

Additionally, I recommend to omit the comments regarding the exchange you had with the authors of Knock et al.. While it may be entertaining academic gossip, I do not think that it adds anything from a scientific perspective. At the end, the community needs to evaluate the content of manuscripts and not their creation process.

The figure 7 shows the reconstructed incidence. However, I would like to point out that if you assume a non exponential infection model, as it is the case for SARS-CoV-2 (approx. gamma distributed generation time), the reproduction number is not necessarily below one when the peak in infections is reached. As a matter of fact, depending on the decrease, the reproduction number will only drop below one some days afterwards. In this sense, if the peak is reached only a few days before lockdown, the figure you present is not very conclusive. You should comment on the limitation of using exponentially distributed waiting times.

My last comment is regarding the potential change in the evolutionary landscape of the virus that the lockdown may induce. As I understand, your argument applies to almost any measures that try to prevent the spread of the disease. I am thinking about social distancing or contact tracing for example. Accordingly, the conclusion would be to do nothing due to the fear of mutations? Or are there any other possible interventions that may contain the spread where this argument does not apply? I am not experienced in the biology of viruses so I cannot really judge the validity of your argument. However, I do not think it is necessary to motivate your study. I recommend omitting this comment as well as the appendix.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Benjamin Steinegger

PLoS One. 2021 Sep 22;16(9):e0257455. doi: 10.1371/journal.pone.0257455.r004

Author response to Decision Letter 1

10 Aug 2021

Please see the attached response to reviews file.

Attachment

Submitted filename: response-2.pdf

Click here for additional data file.^{(90KB, pdf)}

PLoS One. doi: 10.1371/journal.pone.0257455.r005

Decision Letter 2

Alessandro Rizzo

2 Sep 2021

Was R < 1 before the English lockdowns? On modelling mechanistic detail, causality and inference about Covid-19

PONE-D-21-08472R2

Dear Dr. Wood,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Alessandro Rizzo

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

The paper is deemed as being in good shape for publication. I recommend the authors to apply the last suggestions by Reviewer 2 when submitting their final version.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

Reviewer #1: All comments have been addressed

Reviewer #2: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #1: Yes

Reviewer #2: Yes

**********

6. Review Comments to the Author

Reviewer #1: (No Response)

Reviewer #2: The authors addressed the points of my previous review. In this sense, I have only some minor comments:

- The reference from India (PIB India, 2020) is a press conference from the local authorities that was uploaded to YouTube. During the press conference, a power point presentation is shown with the estimations you reference here. From what I see, it seems not possible to look into the methodology that leads to their estimations. In this sense, I would be careful with the inclusion of these numbers. However, as you point in the text, I guess this corresponds to the estimations by the authorities in India.

- I appreciate the inclusion of the numbers as the reproduction number first was below one. However, I recommend the authors to include one example of the exact conversion between numbers and dates in the caption of Figure 6. Otherwise, the reader needs to scroll to Figure 1 for finding the conversion.

- The comments on the possible impact of lockdowns regarding the evolution of the virus seem reasonable to me. However, I would like to stress that I have no knowledge in virus evolution and can thus not judge plausibility.

-Please remove the undertone in the acknowledgements. It is not relevant on which basis the paper was rejected in PNAS. And, it is also not relevant whether you send Knock et al. an e-mail or not.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: Yes: Benjamin Steinegger

PLoS One. doi: 10.1371/journal.pone.0257455.r006

Acceptance letter

Alessandro Rizzo

10 Sep 2021

PONE-D-21-08472R2

Was R < 1 before the English lockdowns? On modelling mechanistic detail, causality and inference about Covid-19

Dear Dr. Wood:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof. Alessandro Rizzo

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Appendix. Supplementary appendices.

(PDF)

Click here for additional data file.^{(396.2KB, pdf)}

S1 Code. Replication code and data.

(ZIP)

Click here for additional data file.^{(109.2KB, zip)}

Attachment

Submitted filename: response.pdf

Click here for additional data file.^{(682.3KB, pdf)}

Attachment

Submitted filename: response-2.pdf

Click here for additional data file.^{(90KB, pdf)}

Data Availability Statement

All relevant data are within the manuscript and its Supporting information files.

[pone.0257455.ref001] 1.Knock ES, Whittles LK, Lees JA, Perez Guzman PN, Verity R, Fitzjohn RG, et al. Report 41: The 2020 SARS-CoV-2 epidemic in England: key epidemiological drivers and impact of interventions. Imperial College; London. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref002] 2.Marmot M, Allen J, Boyce T, Goldblatt P, Morrison J. Health Equity in England: The Marmot Review 10 Years On. The Health Foundation; 2020. [Google Scholar]

[pone.0257455.ref003] 3.DHSC. Direct and Indirect Impacts of COVID-19 on Excess Deaths and Morbidity; 2020. Department of Health and Social Care, Office for National Statistics, Government Actuary’s Department and Home Office. Available from: https://www.gov.uk/government/publications/dhsconsgadho-direct-and-indirect-impacts-of-covid-19-on-excess-deaths-and-morbidity-15-july-2020.

[pone.0257455.ref004] 4.Bhutta ZA, Owais A, Horton S, Rizvi A, Nisar I, Das J, et al. Direct and indirect effects of the COVID-19 pandemic and response in South Asia. UNICEF; 2021. [Google Scholar]

[pone.0257455.ref005] 5.PIB India (2020, May). Government of India Press Briefing on the actions taken, preparedness and updates on COVID-19, 22nd May 2020, Press Information Bureau. https://pib.gov.in/WebcastMore.aspx?webcast_tempID=434.

[pone.0257455.ref006] 6.Pearl J, Glymour M, Jewell NP. Causal inference in statistics: A primer. John Wiley & Sons; 2016. [Google Scholar]

[pone.0257455.ref007] 7.Gillespie DT. Approximate accelerated stochastic simulation of chemically reacting systems. The Journal of chemical physics. 2001;115(4):1716–1733. doi: 10.1063/1.1378322 [DOI] [Google Scholar]

[pone.0257455.ref008] 8.Flaxman S, Mishra S, Gandy A, Unwin HJT, Mellan TA, Coupland H, et al. Estimating the effects of non-pharmaceutical interventions on COVID-19 in Europe. Nature. 2020;584(7820):257–261. doi: 10.1038/s41586-020-2405-7 [DOI] [PubMed] [Google Scholar]

[pone.0257455.ref009] 9.Anderson R, Donnelly C, Hollingsworth D, Keeling M, Vegvari C, Baggaley R, et al. Reproduction number (R) and growth rate (r) of the COVID-19 epidemic in the UK: methods of estimation, data sources, causes of heterogeneity, and use as a guide in policy formulation. Royal Society SET-C; 2020. [Google Scholar]

[pone.0257455.ref010] 10.Mossong J, Hens N, Jit M, Beutels P, Auranen K, Mikolajczyk R, et al. POLYMOD social contact data; 2017. Available from: 10.5281/zenodo.1157934. [DOI]

[pone.0257455.ref011] 11.Discombe M. Covid infections caught in hospital rise by a third in one week. Health Service Journal. 2020. [Google Scholar]

[pone.0257455.ref012] 12.NHS. Covid-19 Hospital Activity; 2021. Available from: https://www.england.nhs.uk/statistics/statistical-work-areas/covid-19-hospital-activity/.

[pone.0257455.ref013] 13.McKeigue PM, McAllister D, Caldwell D, Gribben C, Bishop J, McGurnaghan SJ, et al. Relation of severe COVID-19 in Scotland to transmission-related factors and risk conditions eligible for shielding support: REACT-SCOT case-control study. BMC Medicine. 2021;. doi: 10.1186/s12916-021-02021-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref014] 14.Williamson EJ, Walker AJ, Bhaskaran K, Bacon S, Bates C, Morton CE, et al. Factors associated with COVID-19-related death using OpenSAFELY. Nature. 2020;584(7821):430–436. doi: 10.1038/s41586-020-2521-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref015] 15.Baguelin M, Knock E, Whittles L, FitzJohn R, Lees J, Cori A. sircovid: SIR Model for COVID-19; 2021. [Google Scholar]

[pone.0257455.ref016] 16.Diekmann O, Heesterbeek JAP, Metz JA. On the definition and the computation of the basic reproduction ratio R0 in models for infectious diseases in heterogeneous populations. Journal of mathematical biology. 1990;28(4):365–382. doi: 10.1007/BF00178324 [DOI] [PubMed] [Google Scholar]

[pone.0257455.ref017] 17.Funk S. socialmixr: Social Mixing Matrices for Infectious Disease Modelling; 2020. Available from: https://CRAN.R-project.org/package=socialmixr.

[pone.0257455.ref018] 18.Lauer SA, Grantz KH, Bi Q, Jones FK, Zheng Q, Meredith HR, et al. The incubation period of coronavirus disease 2019 (COVID-19) from publicly reported confirmed cases: estimation and application. Annals of internal medicine. 2020;172(9):577–582. doi: 10.7326/M20-0504 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref019] 19.McAloon C, Collins Á, Hunt K, Barber A, Byrne AW, Butler F, et al. Incubation period of COVID-19: a rapid systematic review and meta-analysis of observational research. BMJ open. 2020;10(8):e039652. doi: 10.1136/bmjopen-2020-039652 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref020] 20.Deng Y, You C, Liu Y, Qin J, Zhou XH. Estimation of incubation period and generation time based on observed length-biased epidemic cohort with censoring for COVID-19 outbreak in China. Biometrics. 2020;. doi: 10.1111/biom.13325 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref021] 21.Docherty AB, Harrison EM, Green CA, Hardwick HE, Pius R, Norman L, et al. Features of 20 133 UK patients in hospital with covid-19 using the ISARIC WHO Clinical Characterisation Protocol: prospective observational cohort study. BMJ. 2020;369. doi: 10.1136/bmj.m1985 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref022] 22.Pritchard M, Dankwa EA, Hall M, Baillie JK, Carson G, Docherty A, et al. ISARIC clinical data report 4 October 2020. medRxiv. 2020.

[pone.0257455.ref023] 23.Briggs A, Jenkins D, Fraser C. NHS Test and Trace: the journey so far. Health Foundation; 2020. [Google Scholar]

[pone.0257455.ref024] 24.Wood SN. Generalized Additive Models: An Introduction with R. 2nd ed. Boca Raton, FL: CRC press; 2017. [Google Scholar]

[pone.0257455.ref025] 25.Bernabeu-Wittel M, Ternero-Vega J, Díaz-Jiménez P, Conde-Guzmán C, Nieto-Martín M, Moreno-Gaviño L, et al. Death risk stratification in elderly patients with covid-19. A comparative cohort study in nursing homes outbreaks. Archives of gerontology and geriatrics. 2020;91:104240. doi: 10.1016/j.archger.2020.104240 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref026] 26.Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical Recipes. 3rd ed. Cambridge: Cambridge University Press; 2007. [Google Scholar]

[pone.0257455.ref027] 27.Wood SN, Fasiolo M. A generalized Fellner-Schall method for smoothing parameter optimization with application to Tweedie location, scale and shape models. Biometrics. 2017;73(4):1071–1081. doi: 10.1111/biom.12666 [DOI] [PubMed] [Google Scholar]

[pone.0257455.ref028] 28.Wood SN. Inferring UK COVID-19 fatal infection trajectories from daily mortality data: were infections already in decline before the UK lockdowns? Biometrics. 2021. doi: 10.1111/biom.13462 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0257455.ref029] 29.Ward H, Cooke G, Whitaker M, Redd R, Eales O, Brown JC, et al. REACT-2 Round 5: increasing prevalence of SARS-CoV-2 antibodies demonstrate impact of the second wave and of vaccine roll-out in England. medRxiv. 2021.

[pone.0257455.ref030] 30.ONS. Coronavirus (COVID-19) Infection Survey, UK; 2021. https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/conditionsanddiseases/bulletins/coronaviruscovid19infectionsurveypilot/7may2021#number-of-new-covid-19-infections-in-england-wales-northern-ireland-and-scotland.

[pone.0257455.ref031] 31.Knock ES, Whittles LK, Lees JA, Perez Guzman PN, Verity R, Fitzjohn RG, et al. Key epidemiological drivers and impact of interventions in the 2020 SARS-CoV-2 epidemic in England. Science Translational Medicine. 2021. doi: 10.1126/scitranslmed.abg4262 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Was R < 1 before the English lockdowns? On modelling mechanistic detail, causality and inference about Covid-19

Simon N Wood

Ernst C Wit

Roles

Abstract

Introduction

Fig 1. Estimates of R by English region against day of year, as reported in [1].

Evaluation of original Knock et al. age-structured SEIR model

The basic SEI(R) model

Force of infection

The likelihood

Modification of the Knock et al. model

Corrections and minor modifications

Rates

Priors

The negative binomial likelihood

Relaxing the model assumptions

Estimation and inference

Results

Fig 3. Model fits (posterior 95% credible bands for expectations) to the death, hospital and testing data, with one region per row, against day of year.

Fig 4. Inferred incidence, for all regions (coloured) and whole of England (black).

Fig 5. Inferred R for all regions (colour) and the infectives-weighted average for the whole of England (black).

Fig 6. Comparison of inference around Lockdown I (March 24th 2020, day 84), for different modifications of the model of [1].

Discussion

Fig 7.

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Alessandro Rizzo

Roles

Author response to Decision Letter 0

Decision Letter 1

Alessandro Rizzo

Roles

Author response to Decision Letter 1

Decision Letter 2

Alessandro Rizzo

Roles

Acceptance letter

Alessandro Rizzo

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases