A note on variable susceptibility, the herd-immunity threshold and modeling of infectious diseases

Marcus Carlsson; Jens Wittsten; Cecilia Söderberg-Nauclér

doi:10.1371/journal.pone.0279454

. 2023 Feb 15;18(2):e0279454. doi: 10.1371/journal.pone.0279454

A note on variable susceptibility, the herd-immunity threshold and modeling of infectious diseases

Marcus Carlsson ^1,^*, Jens Wittsten ², Cecilia Söderberg-Nauclér ^3,^4,⁵

Editor: Claudine Irles⁶

PMCID: PMC9931097 PMID: 36791079

Abstract

The unfolding of the COVID-19 pandemic has been very difficult to predict using mathematical models for infectious diseases. While it has been demonstrated that variations in susceptibility have a damping effect on key quantities such as the incidence peak, the herd-immunity threshold and the final size of the pandemic, this complex phenomenon is almost impossible to measure or quantify, and it remains unclear how to incorporate it for modeling and prediction. In this work we show that, from a modeling perspective, variability in susceptibility on an individual level is equivalent with a fraction θ of the population having an “artificial” sterilizing immunity. We also derive novel formulas for the herd-immunity threshold and the final size of the pandemic, and show that these values are substantially lower than predicted by the classical formulas, in the presence of variable susceptibility. In the particular case of SARS-CoV-2, there is by now undoubtedly variable susceptibility due to waning immunity from both vaccines and previous infections, and our findings may be used to greatly simplify models. If such variations were also present prior to the first wave, as indicated by a number of studies, these findings can help explain why the magnitude of the initial waves of SARS-CoV-2 was relatively low, compared to what one may have expected based on standard models.

1 Introduction

Since the fundamental works of Kermack and McKendrick [1–3] compartmental mathematical models (such as SIR, SEIR, etc.) are used to model the spread of infectious diseases. Among other things, these papers introduced the by now famous R₀-value and showed that, in contrast with human intuition, an infectious disease will never infect the whole population, no matter how infectious. Instead, the incidence will start to decay when the fraction of recovered reaches the so called the “Herd-Immunity Threshold”, for which they deduced the famous formula

\begin{matrix} 1 - 1 / R_{0} . \end{matrix}

(1)

However, prior to the SARS-CoV-2 pandemic, there was no reliable data from a novel virus (affecting humans) on which this prediction could be tested. Unfortunately, this remains largely the case, since e.g. lockdowns and voluntary isolation (which the models can not predict) had a major effect on the spread. Despite this, data from places like Sweden, that did relatively little to stop community transmission, indicate that the mathematical models have a tendency to overestimate the magnitude of the wave during a major outbreak [4].

Several factors are known to have a damping effect on model curves. One such example is variable susceptibility, see e.g. Ch. 1 and 3 in [5], and the articles [6–9]. By variable susceptibility we here refer to (time-invariant) differences between individuals in the probability of becoming infected, given a certain exposure to the virus, as opposed to individual variations over time. Similar results have also been established numerically for other heterogeneities, such as age and activity [10]. Curiously, variable infectivity (super-spreaders) do not have any damping effect on the spread during a major outbreak [11]. In any case, such conclusions are derived using heuristic arguments or by simply testing relevant models, and the mechanisms behind these phenomena remain poorly understood. In particular, since variability in susceptibility is virtually impossible to quantify, it is unclear how to efficiently incorporate it into the models, wherefore predictions of future COVID-19 waves, or the next pandemic, continues to be a major challenge.

Concretely, suppose a novel infectious disease, whose transmission dynamics involves high variability in infectivity and/or susceptibility, is introduced in a well connected network like a large city, and suppose a major outbreak is about to take place. One may then estimate R₀, i.e. a rough estimate of the average number of new infections that one infective gives rise to, from the data series of early cases, using e.g. EpiEstim [12] or [13]. By a contact tracing study one may also estimate the generation time T_generation, which is the other parameter needed to run a SIR-model. In such a scenario, one can ask the question if the output of a simple SIR-simulation is a good first order approximation of what is about to come, in the absence of Non-Pharmaceutical Interventions? Is the formula (1) a good indicator of when we may expect the outbreak to start to recede?

Based on data from Sweden during the COVID-19 pandemic, the answer seems to be no, see [4] where it is shown that the incidence dropped, unexpectedly, at levels of sero-prevalence much lower than predicted by (1). Of the prior theoretical studies on this topic, the article that comes closest to answering the above questions is Britton et. al. [10], where the authors prove that variations in activity patterns can significantly lower the herd-immunity threshold, in comparison with the classical estimate based on (1). An older publication with a similar message is [14]. However, these conclusions are empirical observations based on models which have been built to incorporate population heterogeneity. This damping effect has not been established mathematically and it remains unclear how, and to what extent, different heterogeneities are manifested. In particular it remains unclear how to more accurately predict the herd-immunity threshold. We remark that, in the case of SARS-CoV-2, a number of factors such as genetic, cross-reactive immunity and innate immunity, have been shown to provide variation in susceptibility [15–18].

1.1 Novel contributions

In this work we prove mathematically that variations in susceptibility have a damping effect on the model curves, whereas variations in infectivity do not (as long as it is uncorrelated with the former, see [7]). More importantly, we also find that the (usually unknown) distribution describing how susceptibility varies is not needed for accurate modeling. More precisely we show that a susceptibility heterogeneous model will behave almost identically to a standard (homogeneous) SIR-model where a portion of the population have sterilizing immunity, and that the precise shape of the susceptibility distribution only influences the level of sterilizing immunity. It is important to underline that this immunity only exists within the mathematical model simplification, and should not be confused with real sterilizing immunity of some individuals. In other words, even if everyone is susceptible to the virus (to some degree), on a population level it will seem as if a portion of the population have sterilizing immunity. We will refer to such an immunity, needed for accurate mathematical modeling, as “Artificial Sterilizing Immunity” (ASI), and the fraction of the population having it as θ. As long as θ can be estimated from available data, we show that the actual Herd-Immunity Threshold is indeed lower than (1) predicts. The correct formula, in the presence of variable susceptibility, is given by

\begin{matrix} (1 - θ) (1 - 1 / R_{0}), \end{matrix}

(2)

and the final size of the pandemic is also shrunk by the same factor (1 − θ). We shall also demonstrate numerically that other population heterogeneities, such as those considered by Britton et. al. [10], have an analogous effect, and hence the findings in this paper can be used to significantly reduce the amount of unknowns in a more realistic heterogeneous model for disease spread.

2 The mathematics of infectious disease spread dynamics

In order to explain the mathematical findings, we first give an overview of how the basic SIR-model works. SIR stands for Susceptibles, Infectives and Recovered, and is the simplest form of a “compartmental model” used in mathematical epidemiology (see e.g. [19] for an introduction to this field). In the model, S, I and R are functions of time t, and to illustrate how these are related we shall also introduce the (redundant) function ν describing the incidence, i.e. the amount of newly infected each day (not to be confused with I, which describes the prevalence). The formula for ν(t) is at the heart of the algorithm, and in the beginning we simply have ν(t) = αI(t), where α is a constant that determines how many new cases an average infective gives rise to during a day. If a is the average number of daily potentially infectious contacts by an average person, and p is the probability that such a contact actually leads to transmission, then α = ap.

As the amount of susceptibles gradually decreases, we have to modify this by multiplying with the fraction of the population that is still susceptible. If the total population is N this fraction is S(t)/N and the formula becomes

\begin{matrix} ν = \frac{α}{N} S I = \frac{a p}{N} S I . \end{matrix}

(3)

To set up the remaining equations we also need the generation time T_generation, i.e. the average time it takes from infection to recovery. The remaining equations are then

\begin{matrix} {\begin{matrix} S^{'} & = - ν \\ I^{'} & = ν - σ I \\ R^{'} & = σ I \end{matrix} \end{matrix}

(4)

where σ = 1/T_generation and ′ indicates differentiation. The equations are intuitively easy to understand, the incidence continuously gets withdrawn from S and added to I, and at the same time there is a current of recovering individuals that leave I at a rate σI and appear in R instead.

The SIR-model, and our extensions thereof, are deterministic in the sense that if we run it twice, the output is the same. Such models are believed to work well during major outbreaks, where the law of large numbers applies [5, 11]. All our findings pertain to this situation; for modeling of e.g. the initial phase or household transmission, other types of models are used.

The most natural initial condition for a new disease is to set I(0) = n where n < < N represents a small number of import cases arriving at time t = 0, and then set S(0) = N − n and R(0) = 0 (so everybody else is initially susceptible and no-one has yet recovered). The value of n is completely irrelevant for the shape of the curves that follow, a low value of n only gives the equation system a slower start so it takes a while longer for the outbreak to reach a certain value. Once this happens, the curves look exactly the same independent of the value n. See the blue graphs in Fig 1 for some typical examples of R-curves and I-curves. In this model, R is always increasing and levels out on a number which is called “the final size of the pandemic” (see Fig 1a). S approximatively looks like N − R, since the prevalence I at any given time is small in comparison with the total population. The incidence ν typically looks just like I, albeit with a lower magnitude.

Fig 1 — (a) Graphs of recovered (as a fraction of the total population) for various SIR-models and a fixed value of R₀ = 1.66. First we display standard SIR, then S-SIR and finally SIR with Artificial Sterilizing Immunity (ASI) with parameters from (8). Note that they start out almost identically but that the latter two bend downwards much earlier than the first, which over-shoots the classical Herd-Immunity Threshold (HIT), whereas the second two stay closely together and level out below the classical HIT. (b) Corresponding curves for prevalence I (the S-graphs are shown independently in Fig 2).

2.1 Contemporary models for COVID-19

Contemporary models used by professional modeling teams usually contain many more compartments than SIR, for instance relating to age stratification, variable activity levels, geographical regions, compartments for people who need ICU and compartments for people that die. For example, the model published by members of the Imperial College COVID-19 response team [20] has at its root a basic SIR (see p. 9 as well as S2 Fig in the supplementary material of [20]), and the same goes for the model [21] used by a renowned Swedish modeling team, which managed to fit the ICU occupancy and deaths with high accuracy during the first wave in Sweden. The latter model also takes into account various regions and interaction patterns between these, but the in-region dynamics is basically a simple SEIR. It is also common to add a compartment E for “Exposed”, incorporating the incubation time, (as indeed is done in the above two examples). However, as we shall show in Section 4, this has a limited effect on the overall behavior. By this we mean that, for every set of parameter values (R₀, incubation time etc.) for SEIR, it is possible to get an almost identical curve with SIR (and vice versa), if we are allowed to alter the parameter values slightly. Since the exact value of these parameters is never known, this means that for practical purposes one may just as well rely on SIR as on SEIR, at least for understanding overall trends. For example, in Fig 3 we show an example of SEIR and SIR with R₀-values that differ by 1%, and the graphs are almost identical. For example the final size of the pandemic differs by less that 1.5%. Moreover, compartments relating to severely ill and deaths also have a marginal effect on the overall behavior, simply because only a small fraction of the infected will end up in these compartments. Based on this, we argue that, for the purposes of understanding the general overall behavior, as we are interested in here, it suffices to study the simpler SIR-model. For other attempts to predict/model SARS-CoV-2 using SIR/SEIR-type models see e.g. [22, 23].

In contrast, other types of heterogeneities such as variable activity levels and different interaction patterns between age groups, does have a notable damping effect on the model curves. For example, the age-activity stratified SEIR by Britton et. al. [10] has an incidence peak of about 35% lower than standard SIR, given analogous input parameters. This is consistent with the findings in [10], where a drop in the Herd-Immunity Threshold of around 30% is observed for the age-activity model, comparing with the prediction (1) based on SIR. This will be further discussed in Section 4.2. Also variable susceptibility has a major effect, but this has already been discussed in the introduction and is further analyzed in Section 3.

2.2 Model versus reality mismatch?

Whether or not the more advanced models accurately describe the spread of COVID-19 is hard to determine, since one may always argue that Non-Pharmaceutical Interventions (NPI’s) as well as voluntary behavioral changes have had a major impact. Without claiming to have a definite answer, the case of Sweden is interesting due to its relaxed strategy, which moreover was kept almost constant during 2020–2021. In particular, schools were kept open, people who could not work from home were encouraged to go to work, family members of infected households were obliged to work or go to school, and widespread face-mask use was never implemented, making the country ideal for comparing models with actual data. Due to insufficient testing, the time series of cases is of limited value, but measurements of sero-prevalence from blood samples give valuable information, since it has been established that most people who get COVID-19 also go on to develop anti-bodies [24], and that these antibodies remain for at least 9 months [25, 26]. Results published by the Swedish Public Health Agency [27] indicate that roughly 11% had had COVID-19 in the Stockholm region after the first wave 2020, which rose to around 22% in February 2021, following the second wave. Also among hospital staff in Sweden (not using face-mask), the prevalence was around 20% [26] after the first wave, in line with observations from infected households elsewhere [28].

However, the model by Sjödin et. al., referred to earlier, predicts a cumulative number of infected people of around 30% after the first wave, despite assuming a 56% decrease in contacts among people of age 0–59 and a 98% reduction among those aged 60–79 (this is for scenario d which accurately fitted ICU-occupancy and death, see Fig 2b, bearing in mind that the Stockholm region has 2.4 million inhabitants). Along the same line, Britton et. al. [10] estimated that the disease could level out at around 43% total infected, in a matter of months. While the authors stress that this is not an actual prediction, it is based on realistic parameters for COVID-19. The famous Report 9 by the Imperial College [29] predicted a total number of 81% infected in a “do-nothing” scenario, based on a more advanced so called “agent based model” that also treats household-contacts separately. According to Table 3 in the report, the number of deaths and peak ICU capacity can be reduced by 50% and 81%, respectively, in the most effective NPI-scenario, which certainly goes beyond what was implemented in Sweden. However, as of February 2021, when the original Wuhan-strain was declining [30], these reduced predictions overestimate the actual figure by a factor of roughly 4 (deaths) and 10 (ICU) (when directly translated to Stockholm County).

Fig 2 — S–curves corresponding to the 3 graphs in Fig 1. As in Fig 1, the blue black and pink have been normalized by division by N. The black curve thus shows the proportion of the total population susceptible to the virus. Note that when the pandemic is over, around 68% are still susceptible, in stark contrast to classical SIR which levels out at around 34%. The pink curve starts out assuming 57% have artificial sterilizing immunity, and hence its initial value is 43% (this number was chosen using the formula (8)). Note that the pink curve looks exactly like the black except for a vertical translation, illustrating the key findings of this article. The S-SIR model has three subgroups S₁, S₂, S₃ corresponding to p₁ = 1 (labeled “super-susceptibles”), p₂ = 0.1 (labeled “normal”) and p₃ = 0.02 (labeled “well-protected”). Here we have normalized with the amount of people in each respective subgroup, wherefore all curves start at 1. Note how the spread in the latter two sub-groups level out as soon as it levels out in the super-susceptible group.

The point here is not to criticize any particular model, and clearly the case of Sweden alone can not prove that models are right or wrong, as mentioned initially. However, based on the massive discrepancy between the actual Swedish data and the various model outcomes described above, it is a legitimate question whether “contemporary models” have a tendency to significantly overestimate the society spread and final size of the pandemic. We find it likely that the answer is yes, and further support for this hypothesis is given in [4]. In this paper we demonstrate that variable susceptibility is one factor that contributes to this phenomenon.

2.3 Pre-immunity, super-spreaders and other inhomogeneities

How can we alter the equation system (3) and (4) in order to dampen the curves? The simplest option is to assume that a certain fraction θ of the population have some form of sterilizing immunity so that they can not get infected by the virus. Mathematically, this is easily achieved by updating the initial conditions to

\begin{matrix} {\begin{matrix} S (0) & = ω N \\ I (0) & = n \\ R (0) & = 0 \end{matrix} \end{matrix}

(5)

where ω = 1 − θ is the fraction of initially susceptible. However, this is not very realistic since immunity is usually not binary, i.e. either 0% or 100% (so called sterilizing immunity). The hypothesis that some people are more susceptible than others is then far more plausible than a binary immunity. In the particular case of SARS-CoV-2, the hypothesis that certain individuals had some form of pre-immunity was suggested in various publications as an explanation for the, at least according to some, unexpectedly mild initial infection waves, see for instance [31]. This paper also lists a number of studies showing that some people had some a priori T-cell immunity. Since then, different articles have demonstrated various mechanisms that make certain individuals more or less susceptible to SARS-CoV-2, e.g. [15–18]. It is also well established that infectivity levels vary dramatically, as mentioned earlier (see e.g. [32]). In addition, this seems uncorrelated to how sick they become; many individuals with very high viral loads are even asymptomatic. In light of this, the most probable assumption is that also the way the virus enters the human is subject to large individual variations.

To make a more realistic model for the spread of COVID-19, or any infectious disease for that matter, it is reasonable to divide the compartments S and I into a number of subcompartments S₁, …, S_J and I₁, …, I_K where people in each compartment have a different level of susceptibility/infectivity. To see how to set up a corresponding equation system for disease spread, recall that a was the amount of daily contacts by one individual. We now let p_jk be the probability that such a contact leads to transmission when an individual in S_j meets one in I_k. The incidence ν_j coming from the group S_j then becomes

\begin{matrix} ν_{j} = \frac{S_{j}}{N} (a p_{j 1} I_{1} + \dots + a p_{j K} I_{K}) \end{matrix}

(6)

(c.f. (3)). Since we assume no correlation between infectivity and susceptibility, the total amount of new infectives ν₁ + … + ν_J is then distributed among the groups I₁, …, I_K according to their relative size. The remaining equations in (4) are easily modified to this new vector setting, we refer to Sec. 1 in S1 File for the details. In the coming section we analyze the behavior of this system of equations, and in Section 4 we also discuss other extensions such as SEIR and variable activity levels.

3 Main results

The main point of this research is that extensions to both SIR and SEIR of the type mentioned above yield overall curves that are only marginally different from basic SIR, given that a level of Artificial Sterilizing Immunity (ASI) is included. First of all, after setting up the details in Section 1 of S1 File, we prove in Proposition 1.1 that the division of I into various sub-compartments have no effect whatsoever, further supporting the conclusions in [8, 9, 11]. In other terms, the existence of “super-spreaders” do not in any notable way affect the dynamics of disease spread. Removing this layer of complexity, the Eq (6) simplify to

\begin{matrix} ν_{j} = \frac{a p_{j}}{N} S_{j} I \end{matrix}

(7)

where p_j is the probability of transmission when a susceptible in group S_j encounters an “average” infectious individual. We refer to Eq (14)-(16) in S1 File for the full system of equations, which we label S-SIR for “Susceptibility-Stratified SIR”. It is a very curious fact that the division of S into subcompartments can not, in contrast to I, mathematically be further reduced to a simpler equation system. However, and this is the key result of this paper, we can prove mathematically that the overall behavior of S-SIR (in terms of prevalence I and recovered R) differs only marginally from the basic SIR (3) and (4) upon including ASI to the initial conditions, as we did in (5). This is the essence of Theorem 2.1, which is found in Section 2 of S1 File. Given probabilities p₁, …, p_J, the theorem also provides formulas for suitable values of the transmission coefficient α (used to compute the incidence ν in (3)) and artificial sterilizing immunity θ (used in the initial conditions (5)), as follows:

\begin{matrix} α = a \frac{\sum_{j = 1}^{J} w_{j} p_{j}^{2}}{\sum_{j = 1}^{J} w_{j} p_{j}}, ω = \frac{{(\sum_{j = 1}^{J} w_{j} p_{j})}^{2}}{\sum_{j = 1}^{J} w_{j} p_{j}^{2}}, \end{matrix}

(8)

where ω = 1 − θ and w_j is the fraction of the population initially belonging to S_j; w_j = S_j(0)/N. A simple illustration of these results is found in Section 1.3 in S1 File. It is important to be careful with the interpretation of θ = 1 − ω as a fraction of people who actually have sterilizing immunity, since there is, in reality, not a division of θN immune and ωN susceptible, which is why we have chosen the acronym ASI; artificial sterilizing immunity. These results are illustrated in Figs 1 and 2. Note in particular that, rather surprisingly, as soon as the most vulnerable susceptibility group (labeled super-susceptibles in Fig 2) runs out of new individuals to infect, transmission in all other groups cease as well. This behavior is typical, see S1 Fig in S1 File for a similar example with different values.

We have observed the same phenomenon also when modeling with SEIR and also when including e.g. different age groups and variable activity levels, following [10]; models with many such layers produce output which seem practically indistinguishable from the output of SIR with ASI, i.e. (3)–(5). We leave as a numerical observation which we discuss further in Section 4. In particular, given an estimated level of ASI θ in a society, it is mathematically impossible to draw any conclusions about how much of θ is caused by inhomogeneities in age and behavior, and how much comes from variations in susceptibility.

Incidentally, at the end of each paper [1–3], Kermack and McKendrick stress that a weakness in their model is that they assume uniform susceptibility, which they consider unrealistic in many cases. However, it seems that they never got around to address this issue, and we have not found a rigorous mathematical analysis of how to deal with this situation elsewhere in the literature either. In particular, the formula 1 − 1/R₀ for the Herd-Immunity Threshold (HIT), which stems from their seminal papers, may very well be inaccurate, as suggested also in [10]. In the coming section we derive a refined version of this formula taking ASI into account.

3.1 Formulas for R₀ and the herd-immunity threshold

It is easy to see that the generation time T_generation (introduced below (3)) coincides with the average time an infected individual remains infective. Since α is the infection rate, we conclude that R₀ = αT_generation for the standard SIR (3) and (4), assuming a fully susceptible population. However, in the presence of ASI θ, the actual infection rate is only (1 − θ)α and hence the correct formula for the R₀-value becomes

\begin{matrix} R_{0} = (1 - θ) α T_{g e n e r a t i o n} = ω α T_{g e n e r a t i o n} . \end{matrix}

(9)

The above value for R₀ is the value that would be estimated by e.g. EpiEstim [12] or [13] from a real time series generated by the model (3) and (4) with initial data (5). Mathematically, R₀ is defined as the number of new infections that one infected individual gives rise to, before disease induced immunity starts to build up. (To compute this, first solve I′(t) = −σI(t), given I(0) = 1, recalling that σ = 1/T_generation, and then integrate the resulting incidence ν, as given by (3), while keeping S(t) fixed at S(0) = ωN.) Similarly, one sees that the effective R-value, denoted R_e(t), in the above model is

R_{e} (t) = \frac{S (t)}{N} α T_{g e n e r a t i o n} = \frac{S (t)}{S (0)} R_{0} .

The term “herd-immunity” carry a variety of meanings [33]. In mathematical epidemiology, given a certain model and a novel virus, the Herd-Immunity Threshold is defined as the total number of infective and recovered needed to achieve R_e(t₀) = 1. Since

I^{'} (t) = \frac{α}{N} (S (t) - σ) I (t) = (R_{e} (t) - 1) σ I (t),

(recall (4)), we see that this coincides with the point at which the wave of infectious naturally starts to recede. Beyond this point, any import cases will not spark new outbreaks. We denote this value by H_IT.

In the SIR-model, it is assumed that individuals mix homogeneously and that recovered individuals have protective antibodies (i.e. sterilizing immunity). While it is known that anti-bodies wane over time, at least for SARS-CoV-2, this waning happens much more slowly than the duration of an outbreak [25], and hence the latter assumption is reasonable for the discussion of the herd-immunity threshold in a shorter time frame. However, we wish to stress that the waning means that herd-immunity is never a stable condition, but will fade with time, and hence the fact that herd immunity is reached during a particular wave does not prevent future waves, which may occur either due to waning antibodies or the emergence of new variants.

Assume now that a SIR-model with a certain level of ASI accurately describes a given outbreak. The Herd-Immunity Threshold H_IT then equals S(0)/N − S(t₀)/N where t₀ is the time point when the herd-immunity threshold is reached, which can be found by solving R_e(t₀) = 1. In other words H_IT is the difference between the fraction S(t₀)/N of susceptibles at the time t₀ when herd-immunity is reached, and the fraction of susceptibles initially. In the SIR-model with ASI, solving R_e(t₀) = 1 yields the equation S(t₀)/N = 1/αT_generation, and so we deduce

\begin{matrix} H_{I T} = ω - \frac{1}{α T_{g e n e r a t i o n}} = ω (1 - 1 / R_{0}), \end{matrix}

(10)

where we used the earlier formula (9) as the definition of R₀. This is the formula for the herd-immunity threshold presented in Eq (2) in the introduction. It implies that the classical formula (1), given an estimate of R₀ from e.g. EpiEstim, is over-estimating the herd-immunity threshold. More importantly, it allows to predict H_IT, given that the ASI parameter θ = 1 − ω can be estimated from available data.

That the classical formula may be misleading has been pointed out before [14], and a more recent contribution indicating that the H_IT could be significantly lower than the value (1) is [10]. These works illustrate this by simply testing models that involve heterogeneities (primarily social mixing patterns, not variable susceptibility), and therefore it offers little guidance for actual estimation of H_IT. Formula (2) is, to our knowledge, the first time this effect has been given a mathematical formula.

To sum up, we have deduced a new formula for the herd-immunity threshold in the model SIR with ASI. Since the results in Section 3 imply that this is a good approximation to Susceptibility-stratified SIR, it follows that the above formula applies to this model as well, with ω given by (8). In Section 4 we demonstrate numerically that the same conclusion seems to be true also for other heterogeneities, and hence the formula may be a better alternative for estimating the herd-immunity threshold more generally (assuming that the value of θ can be inferred from available data).

It is crucial to note that (10) applies under the assumption that the immunity is achieved by natural spread. The herd-immunity threshold for vaccinating is still given by the classical formula (1) (assuming the vaccine gives sterilizing immunity), which is shown in Section 1.2 of S1 File. This indicates that it is harder to achieve herd-immunity by vaccination, but more work is needed to establish these results in practice.

3.2 Damping and the final size of the pandemic

As mentioned earlier, several works have established that variable susceptibility have a damping effect on the prevalence. By the above results, this can now can be quantified. Suppose $(\tilde{S}, \tilde{I}, \tilde{R})$ is a solution to SIR in a homogenous and fully susceptible population (so $\tilde{S} (0) = N$ ), and let $\tilde{α}$ be the corresponding transmission rate. Given a fixed value of ASI θ, it is then easy to see that $(S, I, R) = (ω \tilde{S}, ω \tilde{I}, ω \tilde{R})$ is a solution to (3)–(5), where ω = 1 − θ and $α = \tilde{α} / ω$ . Hence the effect of ASI is really nothing but a rescaling of standard SIR curves. Note that rescaling does not change the value of R₀, which due to formula (9) is given by $ω α T_{g e n e r a t i o n} = \tilde{α} T_{g e n e r a t i o n}$ in both cases.

It is well known that the final size of the pandemic $\tilde{π} = \tilde{R} (\infty) / N$ in the usual SIR (as well as SEIR) is given by solving $1 - \tilde{π} = e^{- R_{0} \tilde{π}}$ (see [9] and Chapter 3 of [5]). Combining this with the above we see that the final size of the pandemic π in SIR with ASI is given by solving

1 - π / ω = e^{- R_{0} π / ω} .

Hence, in combination with our main result about reduction of Susceptibility-Stratified SIR to SIR with ASI, we deduce that the above solution π is a good approximation to the final size of the pandemic for S-SIR with ω given by (8).

4 Extension to more general models

For a disease like COVID-19, with a short incubation period followed by an even shorter infectious period, there is only a marginal difference between modeling using SIR and using SEIR, and hence we believe that the key conclusions of this paper extend to this model as well. Similarly, we have found numerically that more advanced SEIR-models taking variable age and activity levels into account, behave just like SIR if we incorporate ASI. We leave the formal verification of these observations as an open conjecture, and content ourselves with showing some examples.

4.1 SEIR

SEIR has two key parameters apart from R₀, namely T_infectious and T_incubation, where the former is the average time that a person is infectious and the latter is the time from when a person becomes infected until he or she becomes infectious. Estimates for these vary, we here follow Britton et. al. [10] and set T_incubation = 4 and T_infectious = 3. It then follows that the generation time equals

T_{g e n e r a t i o n} = T_{i n f e c t i o u s} + T_{i n f e c t i v e} = 7,

where the generation time is the average time it takes from a person getting infected until that person infects others (see Eq (5) in the supplementary material to [30] for a formal derivation). Note that this is consistent with the choice of T_generation in previous sections.

The reason why SEIR and SIR give almost identical output for COVID-19 is that both are primarily determined by the values of T_generation and R₀. To wit, during a major outbreak, it does not matter if a person is sick for 7 days and infect R₀ people during those 7 days, or if he undergoes incubation for 4 days and then infect R₀ people during the remaining 3 days. As an example, consider Fig 3(a); we see a very similar behavior by choosing parameters for SIR and SEIR in accordance with the above formulas (with R₀ fixed). Moreover, by allowing free parameters, SIR can be made to behave almost identically as SEIR (even without involving ASI). To support this claim, not the almost perfect overlap between the blue and black curves in Fig 3, obtained by keeping T_generation fixed and modifying R₀ by one percent. Since the exact value for the input parameters are unknown in reality, we argue that it is irrelevant whether one uses SIR or SEIR, at least for modeling of SARS-CoV-2 and viruses with similar characteristics. Therefore, the observations of this paper should extend to SEIR as well, even if we have not been able to establish this mathematically.

Fig 3 — (a) SEIR with R₀ = 1.66 and T_infectious + T_infective = 7 (blue), SIR with the same R₀ and T_generation = 7 (red) and finally SIR with a 1% lower R₀, same T_generation (black). (b) Age-activity stratified SEIR with R₀ = 1.66 and T_infectious + T_infective = 7 (blue); SIR using the same T_generation but an ASI of 25% and slightly different R₀ (black).

4.2 Heterogeneous models

Variable susceptibility is not the only type of population heterogeneity which could manifest itself as ASI on a macro level. In [10] the authors develop a heterogeneous SEIR model taking variable interaction pattern between different age-groups into account, as well as the fact that people in each age-group have varying amount of contacts. We implemented their model and then sought parameters for SIR with ASI that would yield a similar output. The result is seen in Fig 3(b). Again, the difference is so fine that it would be impossible to spot in practice. Henceforth, what may appear as a certain level of population (pre-)immunity in mathematical models may in fact be a mix of various population heterogeneities, in which variable susceptibility is only one ingredient.

5 Discussion

There could be many reasons for why certain people are more susceptible than others to infection by a novel virus, ranging from innate and adaptive immunity to cross-reactive immunity from other known viruses as well as genetic differences. For a novel disease, sterilizing pre-immunity, i.e. individuals which are completely immune without ever having had the virus, most likely does not exist. The key point of this study is that sterilizing individual immunity is not needed in order to observe what looks like sterilizing immunity on a population level, which we have coined ASI; artificial sterilizing immunity. We show mathematically that, in order to have ASI, we only need moderate variation in susceptibility. Moreover, we demonstrate numerically that other types of population heterogeneities, such as variable social mixing patterns, also manifest themselves as ASI. The findings in this paper do not limit themselves to SARS-CoV-2, but basically shows that classical formulas for the herd-immunity threshold and the models for spread of infectious diseases with roots in the famous paper by Kermack and McKendrick [1] are inapt to model any infectious disease subject to large variability in susceptibility, and need to be modified as described in Section 3.1.

The estimation of the herd-immunity threshold H_IT is crucial for efficient disease control management and planning. For example, if a society decides to make a lock-down before H_IT is reached, it is almost certain that the disease will re-emerge unless NPI’s are maintained indefinitely. The classical formula (1) is still very much in use, despite the fact that it is known to rely on a number of oversimplifying assumptions which may lead to an erroneous indication. We have established a new formula which we prove applies when variable susceptibility is present. Since we show that our simplified model, SIR with ASI, also seems to be a good substitute for models that involve variable social mixing patterns, it is possible that (2) applies more generally than what we are able to prove mathematically.

Supporting information

S1 Data

(ZIP)

Click here for additional data file.^{(6.9KB, zip)}

S1 File. Supplementary material.

(PDF)

Click here for additional data file.^{(1.1MB, pdf)}

Acknowledgments

We thank Erik Wahlén for fruitful discussions.

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

The research of J. W. was supported by the Swedish Research Council (2019-04878). The research of C. S-N. was supported by the Swedish Medical Research Council (2019-01736) and Flagship InFLAMES, Finland.

References

1. Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proceedings of the royal society of London Series A, Containing papers of a mathematical and physical character. 1927;115(772):700–721. [Google Scholar]
2. Kermack WO, McKendrick AG. Contributions to the mathematical theory of epidemics II. The problem of endemicity. Proceedings of the Royal Society of London Series A, containing papers of a mathematical and physical character. 1932;138(834):55–83. [Google Scholar]
3. Kermack WO, McKendrick AG. Contributions to the mathematical theory of epidemics III. Further studies of the problem of endemicity. Proceedings of the Royal Society of London Series A, Containing Papers of a Mathematical and Physical Character. 1933;141(843):94–122. [Google Scholar]
4. Carlsson M, Söderberg-Nauclér C. COVID-19 modeling outcome versus reality in Sweden Viruses 2022, 14(8), MDPI doi: 10.3390/v14081840 [DOI] [PMC free article] [PubMed] [Google Scholar]
5. Diekmann O, Heesterbeek H, Britton T. Mathematical tools for understanding infectious disease dynamics. In: Mathematical Tools for Understanding Infectious Disease Dynamics. Princeton University Press; 2012. [Google Scholar]
6. Gerasimov A, Lebedev G, Lebedev M, Semenycheva I. COVID-19 dynamics: a heterogeneous model. Frontiers in Public Health. 2021;8:911. doi: 10.3389/fpubh.2020.558368 [DOI] [PMC free article] [PubMed] [Google Scholar]
7. Hickson R, Roberts M. How population heterogeneity in susceptibility and infectivity influences epidemic dynamics. Journal of Theoretical Biology. 2014;350:70–80. doi: 10.1016/j.jtbi.2014.01.014 [DOI] [PubMed] [Google Scholar]
8. Miller JC. Epidemic size and probability in populations with heterogeneous infectivity and susceptibility. Physical Review E. 2007;76(1):010101. doi: 10.1103/PhysRevE.76.010101 [DOI] [PubMed] [Google Scholar]
9. Miller JC. A note on the derivation of epidemic final sizes. Bulletin of mathematical biology. 2012;74(9):2125–2141. doi: 10.1007/s11538-012-9749-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Britton T, Ball F, Trapman P. A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. Science. 2020;369(6505):846–849. doi: 10.1126/science.abc6810 [DOI] [PMC free article] [PubMed] [Google Scholar]
11. Rousse F, et al. The role of super-spreaders in modeling of SARS-CoV-2. Infectious Disease Modelling (2022). doi: 10.1016/j.idm.2022.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Thompson R, Stockwin J, van Gaalen RD, Polonsky J, Kamvar Z, Demarsh P, et al. Improved inference of time-varying reproduction numbers during infectious disease outbreaks. Epidemics. 2019;29:100356. doi: 10.1016/j.epidem.2019.100356 [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. American journal of epidemiology. 2013;178(9):1505–1512. doi: 10.1093/aje/kwt133 [DOI] [PMC free article] [PubMed] [Google Scholar]
14. Fox JP, Elveback L, Scott W, Gatewood L, Ackerman E. Herd immunity: basic concept and relevance to public health immunization practices. American journal of epidemiology. 1971;94(3):179–189. doi: 10.1093/oxfordjournals.aje.a121310 [DOI] [PubMed] [Google Scholar]
15. Dee K, Goldfarb DM, Haney J, Amat JA, Herder V, Stewart M, et al. Human rhinovirus infection blocks SARS-CoV-2 replication within the respiratory epithelium: implications for COVID-19 epidemiology. Journal of Infectious Diseases. 2021. doi: 10.1093/infdis/jiab147 [DOI] [PMC free article] [PubMed] [Google Scholar]
16. Ng KW, Faulkner N, Cornish GH, Rosa A, Harvey R, Hussain S, et al. Preexisting and de novo humoral immunity to SARS-CoV-2 in humans. Science. 2020;370(6522):1339–1343. doi: 10.1126/science.abe1107 [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Zeberg H, Pääbo S. A genomic region associated with protection against severe COVID-19 is inherited from Neandertals. Proceedings of the National Academy of Sciences. 2021;118(9). doi: 10.1073/pnas.2026309118 [DOI] [PMC free article] [PubMed] [Google Scholar]
18. Kundu, Rhia, et al. Cross-reactive memory T cells associate with protection against SARS-CoV-2 infection in COVID-19 contacts. Nature communications 13.1 (2022): 1–8. doi: 10.1038/s41467-021-27674-x [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Brauer F, Castillo-Chavez C, Feng Z. Mathematical models in Epidemiology. Springer; 2019. [Google Scholar]
20. Walker PG, Whittaker C, Watson OJ, Baguelin M, Winskill P, Hamlet A, et al. The impact of COVID-19 and strategies for mitigation and suppression in low-and middle-income countries. Science. 2020. doi: 10.1126/science.abc0035 [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Sjödin H, Johansson AF, Brännström Å, Farooq Z, Kriit HK, Wilder-Smith A, et al. COVID-19 healthcare demand and mortality in Sweden in response to non-pharmaceutical mitigation and suppression scenarios. International journal of epidemiology. 2020. doi: 10.1093/ije/dyaa121 [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Hassan Md Nazmul, et al. Mathematical Modeling and Covid-19 Forecast in Texas, USA: a prediction model analysis and the probability of disease outbreak. Disaster medicine and public health preparedness (2021): 1–12. doi: 10.1017/dmp.2021.151 [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Mahmud Md Shahriar, et al. Vaccine efficacy and sars-cov-2 control in california and us during the session 2020-2026: A modeling study. Infectious Disease Modelling 7.1 (2022): 62–81. doi: 10.1016/j.idm.2021.11.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Gudbjartsson DF, Norddahl GL, Melsted P, Gunnarsdottir K, Holm H, Eythorsson E, et al. Humoral immune response to SARS-CoV-2 in Iceland. New England Journal of Medicine. 2020;383(18):1724–1734. doi: 10.1056/NEJMoa2026116 [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Dan JM, Mateus J, Kato Y, Hastie KM, Yu ED, Faliti CE, et al. Immunological memory to SARS-CoV-2 assessed for up to 8 months after infection. Science. 2021. doi: 10.1126/science.abf4063 [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Bred immunitet efter 9 månader. Danderyds hospital press-release. Webbadress: www.ds.se/jobba-hos-oss/mot-oss/bred-immunitet-efter-nio-manader/
27.Folkhälsmyndigheten. Påvisning av antikroppar efter genomgången COVID-19 i blodprov från öppenvården.
28. Madewell ZJ, Yang Y, Longini IM, Halloran ME, Dean NE. Household transmission of SARS-CoV-2: a systematic review and meta-analysis. JAMA network open. 2020;3(12):e2031756–e2031756. doi: 10.1001/jamanetworkopen.2020.31756 [DOI] [PMC free article] [PubMed] [Google Scholar]
29. Ferguson N, Laydon D, Nedjati-Gilani G, Imai N, Ainslie K, Baguelin M, et al. Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand. Imperial College London. 2020;10(77482):491–497. [Google Scholar]
30. Carlsson M, Hatem G, Söderberg-Nauclér C. Mathematical modeling suggests pre-existing immunity to SARS-CoV-2. medRxiv. 2021. [Google Scholar]
31. Doshi P. Covid-19: Do many people have pre-existing immunity? Bmj. 2020;370. [DOI] [PubMed] [Google Scholar]
32. Jones TC, Biele G, Mühlemann B, Veith T, Schneider J, Beheim-Schwarzbach J, et al. Estimating infectiousness throughout SARS-CoV-2 infection course. Science. 2021. doi: 10.1126/science.abi5273 [DOI] [PMC free article] [PubMed] [Google Scholar]
33. Fine P, Eames K, Heymann DL. “Herd immunity”: a rough guide. Clinical infectious diseases. 2011;52(7):911–916. doi: 10.1093/cid/cir007 [DOI] [PubMed] [Google Scholar]

PLoS One. doi: 10.1371/journal.pone.0279454.r001

Decision Letter 0

Jean-Luc EPH Darlix

8 Apr 2022

PONE-D-22-06269The role of variable susceptibility and infectivity in the spread of SARS-CoV-2PLOS ONE

Dear Dr. Carlsson

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

As pointed out by reviewer 2, it is far from clear how the authors monitored the seroconversions , how they occured and what was the proportion pertaining to innate immunity. This is a critical point which needs to be clari ,authors should explain what they mean by herd immunity, a highly debated subject with little solid data.

.in addition the mathematical model to account for the epidemy and its evolution appears to be highly simplified

Please submit your revised manuscript by May 22 2022 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Jean-Luc EPH Darlix, MG, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: This paper investigates the role of heterogeneous susceptibility and infectiousness in a modified SIR compartmental model of disease spread.

I have many concerns about this paper. Most fundamentally, I feel the authors do not demonstrate a deep understanding of the state of the art in mathematical modeling. So it makes me very uncomfortable that there is a thread running through this paper and the other papers of their series in which they criticize modelers, and imply that they have solved everything.

Major concerns:

1) "it became clear that the models were overly pessimistic." This statement in the abstract makes me worry that this is actually highly motivated reasoning masquerading as science. It is good that the authors recognize this claim is not justified - lines 6-7 and 12 of the text contradict the claim in the abstract that this was clear, and acknowledge that no such claim can be made. But now I'm wondering why did they make a claim that they knew was false in the abstract? As a result, at this stage I have significant doubts that I'm about to read a scientific paper. I've read plenty of opinion pieces dressed up in academic language during the pandemic, and I don't want to read one in a legitimate scientific journal.

Related to this, the authors need to get rid of the judgmental statements about existing models and modelers throughout this and the other papers in this "series of articles aimed at better understanding the discrepancy between model prediction and observed data." It is not clear if they actually know the structure of the model whose implied failure is used to motivate the study - someone reading this paper's description would assume that Imperial used a differential-equations based SEIR model that assumed a well-mixed population. That is not what they did.

2) Far too much of this paper is saying "we show in [5] that this cannot be explained without pre-immunity". [5] is a preprint, and I am uncomfortable relying so strongly on results that have not undergone peer review.

3) "The NPI's have remained virtually constant throughout the cold period of 2020-2021... in theory means that the mathematical models should work, just with a new R_0-value adopted to the new situation. However in [5] we make a major effort to produce refinements of state of the art compartmental models, taking various sorts of population heterogeneities into account [and can only fit data by assuming 60% are immune]".

I'm not in a position to comment on the NPIs in Sweden, but when I look at mobility data for Sweden, I see that mobility was highly variable, and that there was a very significant drop in average movement during the second wave, and the drop is likely not uniform in time. Although I have not read [5] nearly as closely as this draft, the claim is made that we can treat behavior as constant because the NPIs are constant. But the measured behavior was not constant, so I think the claim that this can be ignored is false (this is part of the reason I am uncomfortable with such high dependence on a preprint). Additionally, the fact that the behavior changeis likely bimodal (with some individuals significantly reducing movement while others continue as before) is very likely to result in a subpopulation with significantly reduced probability of infection - which will be indistinguishable from a subpopulation having pre-existing immunity.

4) lines 38-40 "In this article we show mathematically that, rather surprisingly, variatio0ns in infectivity has no bearing on the model curves, whereas variations in susceptiblity manifests itself as pre-immunity on the macro level. "

And lines 167-173:

"Incidentally, at the end of each paper [10-12], Kermack and McKendrick stress that a weakness in their model is that they assume uniform susceptibility, which they consider unrealistic in many cases. However, it seems that they never got around to correct this issue, and we have not found a rigorous analysis of how to model this mathematically elsewhere in the literature either. In particular, the formula 1 -1/R0 for the Herd-Immunity Threshold (HIT) (see SM Sec. 4), which stems from their seminal papers, may very well be inaccurate. In the coming section we derive a refined version of this formula taking variable susceptibility into account."

The result of variations in infectivity is not a surprise to me. The variation in infectivity is unimportant under specific circumstances, which do apply in a well-mixed compartmental model settings (and in disease spread in some random networks as well). The reason for this is straightforward: the model assumes that the number of infections is large enough for a law of large numbers to apply (otherwise the model would not be deterministic). As such we can safely average the number of infections caused in a well-mixed population over all individuals. Assuming that infectiousness and susceptibility are uncorrelated, the average infectiousness is the same early or late in the epidemic. (this breaks down if they are biologically correlated or if the higher infectiousness is due to behaviors that also increase susceptibility, in which case the more susceptible are infected sooner on average). [note, if a population is made up of many small groups such as households, then superspreading does have an impact, but this model does not contain households. If the model is intended to predict probability of establishment, then variation is infectiousness (but not susceptibility) matters]

As for variation in susceptibility, the disease preferentially removes the highly susceptible, leaving a less susceptible residual population. There is a large body of work that analyzes this effect. I am surprised that the authors were unable to find a rigorous analysis of this. I would start with:

- Britton, T., Ball, F. & Trapman, P. A mathematical model reveals the influence of population heterogeneity on herd immunity to SARSCoV-2.

- Gou, W. & Jin, Z. How heterogeneous susceptibility and recovery rates affect the spread of epidemics on networks.

-Gerasimov, A., Lebedev, G., Lebedev, M. & Semenycheva, I. COVID-19 dynamics: A heterogeneous model.

-Hickson, R. & Roberts, M. How population heterogeneity in susceptibility and infectivity influences epidemic dynamics.

-Dolbeault, J. & Turinici, G. Social heterogeneity and the COVID19 lockdown in a multi-group SEIR model.

-Miller, J. C. Epidemic size and probability in populations with heterogeneous infectivity and susceptibility

-Miller, J. C. Bounding the size and probability of epidemics on networks

-Miller, J. A note on the derivation of epidemic final sizes

This last one does a lot to explain the issues that the authors raise (heterogeneity in susceptibility mattering but not infectiousness). It also does age-structured models and many other cases. I think it only touches on the final size relations, though likely the arguments used can apply for intermediate times.

There is some subtlety regarding the herd immunity threshold that has not been studied extensively previously. It depends on whether the immunity is achieved through infection or through random vaccination - for the result on infection acquired immunity I would read Gabriela Gomes & colleagues recent papers/preprints. For random vaccination it remains 1-1/R_0.

5) I am concerned by the discussion starting at line 55:

"For example, a SIR-model has no memory, i.e. it does not keep track of how long a person is sick, but the original equation systems in [10] did. However, it has been shown that this, as well as other factors such as randomness has a very limited bearing on the model curves, see e.g. [13]"

Given that the paper starts by referring to the Imperial modeling work and focuses significant attention on implied failure of their predictions, this statement should not be made because it seems to imply that Imperial's (and other models) use a model as in the SIR model described. Most state of the art models (including Imperial's) do have memory.

I don't know that the results from [13] are consistent with this claim. It's not clear what is meant by 'randomness', but definitely some forms of randomness matter and the details of the generation interval plays an important role in the early dynamics.

6) It is not clear to me how system (3) was handled in terms of R_0. Was R_0 defined initially and then a subset treated as immune (so the effective reproduction number drops)? This is what the statement seems to imply. Or was R_0 defined based on the population after the subset was made immune. In either case, this is just the usual SIR model but confined to a smaller population.

Minor:

1) A source is needed for this claim: "most research teams use extensions of SEIR for modeling COVID-19" Additionally the comment needs to distinguish between an SEIR model in terms of progression of infection and the differential equations SEIR model. E.g., the Imperial Model that the authors seem to focus on does not use an SEIR compartmental model, though they assume SEIR progression of infection. Their model is an individual based simulation. It has households, workplaces, ...

2) I don't believe I have encountered the word "fictive" before (google tells me what it is, but I think it is not standard English).

3) It's not appropriate to refer to the epidemic curves as "bell-shaped". That usually implies e^{-x^2} type behavior. These have exponential growth early and exponential decay late.

4) To my eye, most predicted epidemic curves are not symmetric.

5) caption for fig 1: the "second two" do not level out below the HIT. They level out below the HIT predicted by the "standard" model. But that's because they use they are created using a different model. They overshoot the HIT (the HIT is not defined based on a specific canonical model, it's defined based on when R_e=1 for the actual system). Please introduce a notation for the HIT based on the classical model, and discuss it. The discussion of HIT in the context of a different model is confusing because I keep assuming we are discussing the HIT of that model.

6) It is not clear to me why hard-hit towns in Italy which underwent some of the strictest NPIs anywhere would be expected to have high seroprevalence.

7) I am not aware of any serious academic study of "immunological dark matter". I

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Feb 15;18(2):e0279454. doi: 10.1371/journal.pone.0279454.r002

Author response to Decision Letter 0

26 May 2022

Our response to the comments by Reviewer 1 and the editor are found in the attached word document "response to the reviewers". Note that we were unable to get access to the comments of Reviewer 2. We contacted the journal several times about this issue and finally we decided to submit on time. We would of course be happy to see and respond to the comments of Reviewer 2 as well.

Attachment

Submitted filename: Response to the reviewers.docx

Click here for additional data file.^{(46.7KB, docx)}

PLoS One. doi: 10.1371/journal.pone.0279454.r003

Decision Letter 1

Claudine Irles

23 Nov 2022

PONE-D-22-06269R1A note on variable susceptibility, the herd-immunity threshold and modeling of infectious diseasesPLOS ONE

Dear Dr. Carlsson,

Please submit your revised manuscript by Jan 07 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.
A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.
An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

We look forward to receiving your revised manuscript.

Kind regards,

Claudine Irles, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

Reviewer #2: Partly

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

Reviewer #2: Yes

**********

6. Review Comments to the Author

Reviewer #2: There are some corrections and important to solve few questions. It is also necessary for readers and to improve quality of the manuscript.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #2: Yes: Md Kamrujjaman

**********

Attachment

Submitted filename: Report_PONE-D-22-06269.pdf

Click here for additional data file.^{(68.5KB, pdf)}

PLoS One. 2023 Feb 15;18(2):e0279454. doi: 10.1371/journal.pone.0279454.r004

Author response to Decision Letter 1

4 Dec 2022

We thank the reviewer for insightful comments that helped improve the manuscript. We have revisited the entire text, rewriting bits and pieces in order to make it more readable and understandable. We have taken particular care to figure axes labels, figure titles and figure texts.

All major changes are marked in red in the manuscript titled “with track changes”. Note that text that was slightly rewritten was also marked in red, so there are no essentially new parts.

To address the specific questions by the reviewer, we have clarified that “variable susceptibility” refers to differences between individuals in the probability to get infected, given a meeting with an infective person, and not, as one could get the impression, individual variations in susceptibility over time. We have clarified this in the introduction. We added a headline “novel contributions” to the introduction in order to make this more clear to the reader. The above adresses comments 1-4 by the reviewer. Concerning 5, we have included a number of relevant new citations, among them three of those that the reviewer suggested. We also added newer research indicating that variable susceptibility indeed was present at the onset of the pandemic, see reference [18] Kundu et al, Nature Communications. Also reference [4] by Carlsson and Söderberg-Naucler has in the meanwhile gotten published, so we put more emphasis on this in the discussion on the discrepancy between model output and reality.

Concerning 5, yes we claim that E plays a minor role, as long as parameter values are free, in the sense that for each parameter configuration for SEIR we can get an almost identical curve using SIR. This is demonstrated in Fig. 3, but the conclusion is well tested on a number of different parameter settings, and this is discussed in depth in Section 4.1. For clarity, these findings have been highlighted also in 2.1, see the new text in red.

Finally, we refrain from adding new graps related to R_0<1, since we feel this would make the manuscript too long and deviate attention from the key findings. Concerning 6, we have carefully read the entire text with a focus on English and punctuation.

Attachment

Submitted filename: Response to the reviewers 2.docx

Click here for additional data file.^{(13.5KB, docx)}

PLoS One. doi: 10.1371/journal.pone.0279454.r005

Decision Letter 2

Claudine Irles

7 Dec 2022

A note on variable susceptibility, the herd-immunity threshold and modeling of infectious diseases

PONE-D-22-06269R2

Dear Dr. Carlsson,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Claudine Irles, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

PLoS One. doi: 10.1371/journal.pone.0279454.r006

Acceptance letter

Claudine Irles

22 Dec 2022

PONE-D-22-06269R2

A note on variable susceptibility, the herd-immunity threshold and modeling of infectious diseases

Dear Dr. Carlsson:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Claudine Irles

Academic Editor

PLOS ONE

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Data

(ZIP)

Click here for additional data file.^{(6.9KB, zip)}

S1 File. Supplementary material.

(PDF)

Click here for additional data file.^{(1.1MB, pdf)}

Attachment

Submitted filename: Response to the reviewers.docx

Click here for additional data file.^{(46.7KB, docx)}

Attachment

Submitted filename: Report_PONE-D-22-06269.pdf

Click here for additional data file.^{(68.5KB, pdf)}

Attachment

Submitted filename: Response to the reviewers 2.docx

Click here for additional data file.^{(13.5KB, docx)}

Data Availability Statement

All relevant data are within the paper and its Supporting information files.

[pone.0279454.ref001] 1. Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proceedings of the royal society of London Series A, Containing papers of a mathematical and physical character. 1927;115(772):700–721. [Google Scholar]

[pone.0279454.ref002] 2. Kermack WO, McKendrick AG. Contributions to the mathematical theory of epidemics II. The problem of endemicity. Proceedings of the Royal Society of London Series A, containing papers of a mathematical and physical character. 1932;138(834):55–83. [Google Scholar]

[pone.0279454.ref003] 3. Kermack WO, McKendrick AG. Contributions to the mathematical theory of epidemics III. Further studies of the problem of endemicity. Proceedings of the Royal Society of London Series A, Containing Papers of a Mathematical and Physical Character. 1933;141(843):94–122. [Google Scholar]

[pone.0279454.ref004] 4. Carlsson M, Söderberg-Nauclér C. COVID-19 modeling outcome versus reality in Sweden Viruses 2022, 14(8), MDPI doi: 10.3390/v14081840 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref005] 5. Diekmann O, Heesterbeek H, Britton T. Mathematical tools for understanding infectious disease dynamics. In: Mathematical Tools for Understanding Infectious Disease Dynamics. Princeton University Press; 2012. [Google Scholar]

[pone.0279454.ref006] 6. Gerasimov A, Lebedev G, Lebedev M, Semenycheva I. COVID-19 dynamics: a heterogeneous model. Frontiers in Public Health. 2021;8:911. doi: 10.3389/fpubh.2020.558368 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref007] 7. Hickson R, Roberts M. How population heterogeneity in susceptibility and infectivity influences epidemic dynamics. Journal of Theoretical Biology. 2014;350:70–80. doi: 10.1016/j.jtbi.2014.01.014 [DOI] [PubMed] [Google Scholar]

[pone.0279454.ref008] 8. Miller JC. Epidemic size and probability in populations with heterogeneous infectivity and susceptibility. Physical Review E. 2007;76(1):010101. doi: 10.1103/PhysRevE.76.010101 [DOI] [PubMed] [Google Scholar]

[pone.0279454.ref009] 9. Miller JC. A note on the derivation of epidemic final sizes. Bulletin of mathematical biology. 2012;74(9):2125–2141. doi: 10.1007/s11538-012-9749-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref010] 10. Britton T, Ball F, Trapman P. A mathematical model reveals the influence of population heterogeneity on herd immunity to SARS-CoV-2. Science. 2020;369(6505):846–849. doi: 10.1126/science.abc6810 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref011] 11. Rousse F, et al. The role of super-spreaders in modeling of SARS-CoV-2. Infectious Disease Modelling (2022). doi: 10.1016/j.idm.2022.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref012] 12. Thompson R, Stockwin J, van Gaalen RD, Polonsky J, Kamvar Z, Demarsh P, et al. Improved inference of time-varying reproduction numbers during infectious disease outbreaks. Epidemics. 2019;29:100356. doi: 10.1016/j.epidem.2019.100356 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref013] 13. Cori A, Ferguson NM, Fraser C, Cauchemez S. A new framework and software to estimate time-varying reproduction numbers during epidemics. American journal of epidemiology. 2013;178(9):1505–1512. doi: 10.1093/aje/kwt133 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref014] 14. Fox JP, Elveback L, Scott W, Gatewood L, Ackerman E. Herd immunity: basic concept and relevance to public health immunization practices. American journal of epidemiology. 1971;94(3):179–189. doi: 10.1093/oxfordjournals.aje.a121310 [DOI] [PubMed] [Google Scholar]

[pone.0279454.ref015] 15. Dee K, Goldfarb DM, Haney J, Amat JA, Herder V, Stewart M, et al. Human rhinovirus infection blocks SARS-CoV-2 replication within the respiratory epithelium: implications for COVID-19 epidemiology. Journal of Infectious Diseases. 2021. doi: 10.1093/infdis/jiab147 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref016] 16. Ng KW, Faulkner N, Cornish GH, Rosa A, Harvey R, Hussain S, et al. Preexisting and de novo humoral immunity to SARS-CoV-2 in humans. Science. 2020;370(6522):1339–1343. doi: 10.1126/science.abe1107 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref017] 17. Zeberg H, Pääbo S. A genomic region associated with protection against severe COVID-19 is inherited from Neandertals. Proceedings of the National Academy of Sciences. 2021;118(9). doi: 10.1073/pnas.2026309118 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref018] 18. Kundu, Rhia, et al. Cross-reactive memory T cells associate with protection against SARS-CoV-2 infection in COVID-19 contacts. Nature communications 13.1 (2022): 1–8. doi: 10.1038/s41467-021-27674-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref019] 19. Brauer F, Castillo-Chavez C, Feng Z. Mathematical models in Epidemiology. Springer; 2019. [Google Scholar]

[pone.0279454.ref020] 20. Walker PG, Whittaker C, Watson OJ, Baguelin M, Winskill P, Hamlet A, et al. The impact of COVID-19 and strategies for mitigation and suppression in low-and middle-income countries. Science. 2020. doi: 10.1126/science.abc0035 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref021] 21. Sjödin H, Johansson AF, Brännström Å, Farooq Z, Kriit HK, Wilder-Smith A, et al. COVID-19 healthcare demand and mortality in Sweden in response to non-pharmaceutical mitigation and suppression scenarios. International journal of epidemiology. 2020. doi: 10.1093/ije/dyaa121 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref022] 22. Hassan Md Nazmul, et al. Mathematical Modeling and Covid-19 Forecast in Texas, USA: a prediction model analysis and the probability of disease outbreak. Disaster medicine and public health preparedness (2021): 1–12. doi: 10.1017/dmp.2021.151 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref023] 23. Mahmud Md Shahriar, et al. Vaccine efficacy and sars-cov-2 control in california and us during the session 2020-2026: A modeling study. Infectious Disease Modelling 7.1 (2022): 62–81. doi: 10.1016/j.idm.2021.11.002 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref024] 24. Gudbjartsson DF, Norddahl GL, Melsted P, Gunnarsdottir K, Holm H, Eythorsson E, et al. Humoral immune response to SARS-CoV-2 in Iceland. New England Journal of Medicine. 2020;383(18):1724–1734. doi: 10.1056/NEJMoa2026116 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref025] 25. Dan JM, Mateus J, Kato Y, Hastie KM, Yu ED, Faliti CE, et al. Immunological memory to SARS-CoV-2 assessed for up to 8 months after infection. Science. 2021. doi: 10.1126/science.abf4063 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref026] 26.Bred immunitet efter 9 månader. Danderyds hospital press-release. Webbadress: www.ds.se/jobba-hos-oss/mot-oss/bred-immunitet-efter-nio-manader/

[pone.0279454.ref027] 27.Folkhälsmyndigheten. Påvisning av antikroppar efter genomgången COVID-19 i blodprov från öppenvården.

[pone.0279454.ref028] 28. Madewell ZJ, Yang Y, Longini IM, Halloran ME, Dean NE. Household transmission of SARS-CoV-2: a systematic review and meta-analysis. JAMA network open. 2020;3(12):e2031756–e2031756. doi: 10.1001/jamanetworkopen.2020.31756 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref029] 29. Ferguson N, Laydon D, Nedjati-Gilani G, Imai N, Ainslie K, Baguelin M, et al. Report 9: Impact of non-pharmaceutical interventions (NPIs) to reduce COVID19 mortality and healthcare demand. Imperial College London. 2020;10(77482):491–497. [Google Scholar]

[pone.0279454.ref030] 30. Carlsson M, Hatem G, Söderberg-Nauclér C. Mathematical modeling suggests pre-existing immunity to SARS-CoV-2. medRxiv. 2021. [Google Scholar]

[pone.0279454.ref031] 31. Doshi P. Covid-19: Do many people have pre-existing immunity? Bmj. 2020;370. [DOI] [PubMed] [Google Scholar]

[pone.0279454.ref032] 32. Jones TC, Biele G, Mühlemann B, Veith T, Schneider J, Beheim-Schwarzbach J, et al. Estimating infectiousness throughout SARS-CoV-2 infection course. Science. 2021. doi: 10.1126/science.abi5273 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0279454.ref033] 33. Fine P, Eames K, Heymann DL. “Herd immunity”: a rough guide. Clinical infectious diseases. 2011;52(7):911–916. doi: 10.1093/cid/cir007 [DOI] [PubMed] [Google Scholar]

PERMALINK

A note on variable susceptibility, the herd-immunity threshold and modeling of infectious diseases

Marcus Carlsson

Jens Wittsten

Cecilia Söderberg-Nauclér

Roles

Abstract

1 Introduction

1.1 Novel contributions

2 The mathematics of infectious disease spread dynamics

Fig 1. Graphs of recovered R and prevalence I.

2.1 Contemporary models for COVID-19

2.2 Model versus reality mismatch?

Fig 2. Graphs of susceptibles S.

2.3 Pre-immunity, super-spreaders and other inhomogeneities

3 Main results

3.1 Formulas for R0 and the herd-immunity threshold

3.2 Damping and the final size of the pandemic

4 Extension to more general models

4.1 SEIR

Fig 3. Approximations using SIR with ASI.

4.2 Heterogeneous models

5 Discussion

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Jean-Luc EPH Darlix

Roles

Author response to Decision Letter 0

Decision Letter 1

Claudine Irles

Roles

Author response to Decision Letter 1

Decision Letter 2

Claudine Irles

Roles

Acceptance letter

Claudine Irles

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

3.1 Formulas for R₀ and the herd-immunity threshold