Skip to main content
Philosophical transactions. Series A, Mathematical, physical, and engineering sciences logoLink to Philosophical transactions. Series A, Mathematical, physical, and engineering sciences
. 2022 Aug 15;380(2233):20210306. doi: 10.1098/rsta.2021.0306

Reformulating the susceptible–infectious–removed model in terms of the number of detected cases: well-posedness of the observational model

Eduard Campillo-Funollet 1, Hayley Wragg 2,3, James Van Yperen 3, Duc-Lam Duong 3,4, Anotida Madzvamuse 3,5,6,
PMCID: PMC9376718  PMID: 35965462

Abstract

Compartmental models are popular in the mathematics of epidemiology for their simplicity and wide range of applications. Although they are typically solved as initial value problems for a system of ordinary differential equations, the observed data are typically akin to a boundary value-type problem: we observe some of the dependent variables at given times, but we do not know the initial conditions. In this paper, we reformulate the classical susceptible–infectious–recovered system in terms of the number of detected positive infected cases at different times to yield what we term the observational model. We then prove the existence and uniqueness of a solution to the boundary value problem associated with the observational model and present a numerical algorithm to approximate the solution.

This article is part of the theme issue ‘Technical challenges of modelling real-life epidemics and examples of overcoming these’.

Keywords: susceptible–infectious–recovered, epidemiology, existence, uniqueness, observational model

1. Introduction

At the beginning of any pandemic, mathematical modelling of infectious diseases is thrust into the limelight for governments across the globe to gain imperative forecasting pictures of the impending spread of the infectious disease, as demonstrated by the 2009–2010 swine flu pandemic [14], the 2015–2016 Zika virus epidemic [58] and the most recent COVID-19 pandemic [912]. Non-pharmaceutical interventions (NPIs) are interpreted by adjusting model parameters to allow for quick decision-making on the appropriate NPI measure(s) to be implemented. Understanding and predicting levels of infections in the community is the crux of mathematical modelling in epidemiology since it is these infections that drive the epidemic and also the mathematical models. However, getting accurate, reliable data to describe the current number of infections within the community is arguably the largest challenge for data-driven modelling. Not only is lack of data problematic when deriving and using mathematical models, the context and interpretation of the data and the parameter fitting process often comes with difficulties. In particular, there is often not a one-to-one correspondence between data and parameters or compartments in a model, especially when considering daily snapshots of an epidemic, for example.

In this work, we study the susceptible–infectious–removed (SIR) equations often accredited to the work of Kermack & McKendrick in 1927 [13]. In particular, we look to demonstrate how to use the number of detected cases directly in the model formulation and prove the existence and uniqueness of a solution to the resulting formulation. The SIR equations take the following form of a simple temporal dynamic system of ordinary differential equations, supported by non-negative initial conditions

S˙=βINS,S(0)=S0, 1.1
I˙=βINSγI,I(0)=I0, 1.2
andR˙=γI,R(0)=R0, 1.3

where the dot above the notation denotes the time derivative, see figure 1 for a schematic. As is standard for the SIR equations, we let N denote the total population being considered, β denote the average transmission rate and γ denote the average removal rate. The initial conditions are chosen so that S0+I0+R0=N, which gives rise to S(t)+I(t)+R(t)=N for any t0. Moreover, it is standard to make the assumption that I01, i.e. there is at least one infected individual at the onset of the infectious disease. We direct readers to the following literature to gain insight into the analysis and uses of the SIR equations and its family of compartmental models [1417].

Figure 1.

Figure 1.

Schematic of the SIR compartmental model. We let S denote the proportion of the total population N who are susceptible to the infectious disease being studied. Susceptible individuals become infectious with the disease to form the I subpopulation at a rate λ(t), which represents the current average infection rate. The rate λ(t) is the product of β, the average transmission rate, and the probability of meeting an infectious person I(t)N1. Individuals in the I(t) subpopulation then are removed at a rate γ to form the removed subpopulation R. The description of the removed compartment depends on the nature of the infectious disease and the interpretation of the data available to be used. Typically, the term removed is interchanged with the term recovered. As is standard for epidemiological models of this nature, β1 denotes the average time between transmissions and γ1 denotes the average length of time before removal.

The ‘observational model’, as coined in [18], is a representation of a compartmental model, such as the SIR equations, described in terms of the parameters and compartments that are captured, or to be inferred, by the mathematical interpretation of the data. This observational model gives an intuitive understanding of what parameters can be identified and how the data affects different compartments in the model. Typically, data for infectious diseases come as a daily, or in general, periodic, snapshot of some of the states that individuals experience during the infectious disease. In this study, we are looking to use the detected cases of an infectious disease, which we denote by Xm. In practical terms, we are assuming that our observation is a fraction r(0,1) of the total number of individuals leaving the infectious state I over the time of the observation. This situation arises when the individual ceases to be infectious once they have been diagnosed, for example because the individual can avoid contact with susceptible individuals. We allow for the situation that we only observe a fraction r of cases since there are situations where not all infectious individuals can be removed, such as false-negatives in testing regimes, bad individual compliance in behaviour or financial/resource restrictions on the amount of testing that can be done. In reality, r is likely to be time-dependent since it can characterize aspects of public policy, however, we keep it constant in this study. In order to describe these observations in terms of the model parameters and compartments, we formally describe the removed compartment R as those who are no longer mixing in the general population. Hence, for each observation of the number of detected cases Xm, m=0,,M, we see that

Xm:=rγtmtm+1I(s)ds, 1.4

where [tm,tm+1] represents the time interval of the measurement of the data. This definition of the observed data for the number of detected cases of an infectious disease has been used in the past; see for instance [19].

Using (1.4) as the interpretation of the available data leads to the observational model

I¨=I˙(I˙IβIN)βγI2N, 1.5

a second-order nonlinear differential equation equipped with non-local boundary conditions (1.4), see §3 for the detailed derivation. A significant part of this work is devoted to studying the boundary value problem (1.4) and (1.5). In contrast to initial value problems that typically arise when studying mathematical epidemiology, boundary value problems are much more difficult to deal with. For instance, it is well known that a small change in the boundary conditions may lead to significant changes in the behaviour of the solutions [20]. In fact, there is no general theory to the existence and uniqueness of a solution for boundary value problems. In most instances, we have to study the boundary value problem on a case-by-case basis, especially when the problem at hand is nonlinear, and/or the boundary conditions are atypical, which is the case with our problem as the boundary condition (1.4) is given in a non-local integral form. Non-local boundary conditions have been studied in the past in various forms, recently appearing in the study and application of fractional differential equations [21]. In our case, the non-local boundary conditions correspond to fixing the values of certain integrals of the solution over given time intervals, which is similar to how the average value is imposed on certain boundary value problems with Neumann boundary conditions to ensure uniqueness of the solution. Although this form is convenient for the analysis, we note that we can convert the integral conditions to nonlinear, multi-point Robin-type conditions. This, however, is only possible due to the set-up of the observational model and thus the problem at hand, since the differential equation directly involves the data, and thus the boundary condition in some capacity. As far as we are aware, this is the first work to consider a boundary value problem with these specific integral-type boundary conditions. The key message of our study is on the transformation of an epidemiological initial value problem to a boundary value problem due to the appropriate form of the observed data, hence we refer readers to [2026], and the references therein, for further information on methods and techniques that may be used to analyse certain classes of boundary value problems with different types of boundary conditions.

The main results of this publication can be summarized in the following two theorems:

Theorem 1.1 (Existence). —

Let βe:=r1βN1. Given X0,X1>0 corresponding to (1.4), and given parameters βe, γ>0, there exists a solution IC2(t0,t2) to (1.5) subject to (1.4).

Theorem 1.2 (Uniqueness). —

Let βe:=r1βN1. Given X0,X1>0 corresponding to (1.4), and given parameters βe, γ>0, the solution to (1.5) subject to (1.4) is unique in C2(t0,t2). Furthermore, the solution is smooth.

The structure of the paper is as follows: in §2, we motivate how the approach outlined in this work came about and why it is necessary, in §3, we derive the observational model and pose an equivalent boundary value problem to (1.5), in §4, we prove Theorem 1.1, in §5, we prove theorem 1.2 and in §6, we outline an algorithm for the numerical solution to the observational model with two examples.

2. Motivation

Since the onset of COVID-19, the disease caused by SARS-CoV-2, in December 2019 and the declaration of a pandemic by the World Health Organization in March 2020, governments and organizations across the globe have implemented unprecedented regional, national and international interventions to contain the spread of COVID-19 and the subsequent damage caused by it. Initially, NPIs were the only tool in the fight to contain the spread of the disease, forcing governments to introduce strict control measures on public mixing, the use of personal protective equipment, improvement of personal hygiene and the use of contact tracing to reduce the possibility of transmission throughout their countries, protecting those most vulnerable as well as maintaining control of the healthcare demand and capacity. The social-economical effects of these necessary decisions are still unknown, and it is speculated that these effects will still be felt for many more years to come [2730]. The global picture is starting to look less bleak with the introduction of effective vaccines and vaccination policies allowing the relaxation of such strict NPIs, however with the spawn of new variants of COVID-19, such as the Alpha variant (lineage B.1.1.7) and more recently the Delta variant (lineage B.1.617.2), it is clear we are not out of the pandemic yet [3134]. Indeed, now that ‘normality’ has returned, country borders have opened and tourism is on the increase, we in the UK are amidst an increase in daily positive cases, hospitalizations and deaths, while other European countries are tightening their COVID-19 restrictions or even going back into lockdowns.

In response to the pandemic in the Sussex region of the UK, local authority organizations in Sussex created a consortium that posed modelling questions of strategic operational significance to the Local Health Resilience Partnership covering East Sussex, West Sussex and Brighton & Hove. As a result, leaders from Public Health Intelligence in the local authority organizations approached the University of Sussex to undertake COVID-19 epidemiological modelling that is specific to Sussex as the national level modelling from the Scientific Advisory Group for Emergencies was not applicable at the regional, integrated care level. In response to their request, we developed a data-driven SIR-type mathematical model, which in practice has been used to answer public health questions based around healthcare demand and capacity, and mortuary capacity, such as the impact of second waves, future lockdowns and vaccination supply [18]. In our early parameter estimation attempts, it became clear to us that the data provided were not compatible with the standard SIR-type formulation, in particular, the data did not provide an accurate estimation of all the initial conditions of the compartments of the model. Healthcare data typically come in one of three types: new daily data (such as admissions), current daily data (such as beds occupied) or cumulative daily data (such as total number of deaths). Due to the methods and constraints on data collection, data typically come as new daily data, i.e. a change in the state of an individual. For example, being admitted into hospital is a change of state from being infectious in the community to being in a hospital bed, whereby the infectious state and hospital bed state are compartments of a mathematical model, while the admitted component is the change between compartments. In this set-up, the admissions data are proportional to the infectious compartment, but not directly as it is an integral, so identifying the initial conditions of the compartments from the data becomes a real challenge, see [18] for more details.

Although the remit of this paper does not include parameter estimation results, we nevertheless describe these in order to demonstrate the issue of the estimation of initial conditions described above. Given that we have a system of nonlinear differential equations, the parameter estimation problem can be stated as a nonlinear least-squares (NLS) optimization problem [35]. Let d describe the observed data, θ:=[β,γ] describe the set of parameters we are estimating, x(θ):=[S(θ),I(θ)] describe the set of solutions to (1.1) and (1.2) with f(x(θ),θ) describing the differential equations, and y(x,θ) denote the function that maps the solutions x to the mathematical interpretation of the data. Let L(d|θ;y(x,θ)) describe the likelihood function associated with the probability distribution assumption of the noise between d and y. Then, the NLS optimization problem can be stated as

maximizeθL(d|θ;y(x,θ))subject toθD,x˙(θ)=f(x(θ),θ),x(0)=x0,

where D denotes the admissible parameter space. The NLS problem easily demonstrates the issue of the initial conditions faced: that even although parameters θ can be estimated regardless of the initial conditions, they may not be representative of real life. This impacts the use of epidemiological models, that they may not have any accurate realistic forecasting capabilities. Moreover, it can easily be reasoned that L is highly non-convex with respect to the parameters and in practice will therefore likely converge to solutions close to the initial guess, which may not be the global solution [36,37].

The standard approach in the literature to navigate the issue of estimating initial conditions is to restrict the parameters, for example, by fixing the basic reproductive number, and estimate when the epidemic began using the data provided, see for instance [38]. Fortunately for modellers in the UK, with regards to COVID-19, the disease was brought to us from outside our borders and so identifying ‘patient zero’ and tracing their connections becomes easier. However, with epidemiological data in particular, what is often not considered is that there is stochastic perturbation in the realization of the actual epidemic. The spread of an infectious disease is a stochastic process, and so the collection of data from an epidemic is just one realization of that process. Noise in the data presents itself mathematically as uncertainty in the parameters, while stochastic perturbation in the epidemic presents itself mathematically as uncertainty in both the initial conditions and the parameters. Variability due to stochastic perturbation is high at the onset of an epidemic, especially in a spatially complex and heterogeneous population not typically accounted for in modelling, which makes estimating the first infection and calibrating to data difficult. In [18], we took an alternative approach and estimated the initial conditions from the data at the onset of the first national lockdown, since this was the first available bit of data. In this instance, the initial conditions became part of the parameters we looked to estimate, however this proved challenging due to the nonlinearity of the NLS problem and the differences in parameter sensitivity between initial conditions and system parameters (such as β and γ). This led us onto the problem we are trying to solve currently: Can we incorporate the data into the model formulation in a more natural way that overcomes the issue of accurate estimation of initial conditions? We believe that by rewriting the SIR equations as the observational model with boundary conditions, we can gain accurate estimates of parameters while bypassing issues with initial conditions. However, before approaching the parameter identifiability and estimation problems, we first need to demonstrate the non-trivial well-posedness of the observational model.

3. Derivation of the observational model

Before we derive the observational model, it is important we provide some standard analytical results of (1.1) and (1.2). It is a standard result in dynamical systems and phase space analysis that for positive initial conditions, we have 0<S(t)<N and 0<I(t)<N for t0. In particular, steady-state analysis tells us that

limt(S(t),I(t),R(t))=(S,0,NS),

where S>0. Due to (1.1), S(t) is monotonically decreasing, which implies that

limI0I˙(t)I(t)=limI0βS(t)Nγ=βSNγ. 3.1

In addition, since the right-hand sides of (1.1) and (1.2) are smooth functions, their solutions are infinitely differentiable (see, for instance, [39]), which allows us to make the necessary steps to derive the observational model. Turning our attention to the data (1.4), for the analysis, we assume that, for any m=0,,M,

0<Xm<.

That is to say that we always observe a strictly positive, finite number of cases since we are not considering any observation error or model error in this study, i.e. our observations are exact. Indeed, in this situation, the case Xm=0 implies that there no more infectious cases in the population, and in consequence, there will be no more cases in future observations.

As described in [18], since the data (1.4) are described by the infectious compartment I, we need to rewrite equations (1.1) and (1.2) to be purely in terms of I in order to derive the observational model. To this end, we note three properties of equations (1.1) and (1.2), namely

S˙+I˙=γI, 3.2

which, by differentiating, immediately gives us

S¨+I¨=γI˙. 3.3

Moreover, by differentiating (1.1), we see that

S¨=S˙(I˙IβIN) 3.4

and hence, inserting (3.2) and (3.3) into (3.4), the observational model takes the form

I¨=I˙(I˙IβIN)βγI2N,

as seen in (1.5). One notes that (3.4) and thus (1.5) are well defined for t0 due to (3.1). For the purposes of this paper, we continue to transform (1.5) to facilitate the analysis. By denoting βe:=r1βN1 and setting z(t)=ln(rγI(t)), (1.5) is equivalent to

z¨=βeez(z˙γ+1), 3.5

while the data (1.4) are equivalent to

Xm=tmtm+1ez(s)ds. 3.6

In the subsequent sections, we will prove the following two theorems, which are respectively equivalent to Theorems 1.1 and 1.2 due to the continuous bijective transformation between z and I.

Theorem 3.1 (Existence). —

Given X0,X1>0, corresponding to (3.6), and given parameters βe,γ>0, there exists a solution zC2(t0,t2) to (3.5) subject to (3.6).

Theorem 3.2 (Uniqueness). —

Given X0,X1>0 corresponding to (3.6), and given parameters βe,γ>0, the solution to (3.5) subject to (3.6) is unique in C2(t0,t2). Furthermore, the solution is smooth.

Remark. —

One can observe that as I tends to 0, z becomes negatively unbounded. This is not impactful on the analysis due to the boundary condition (3.6), equivalently (1.4), always being positive. As we noted above, in the situation that (3.6) is zero, and upon assuming no noise in the data, this implies that the epidemic is over and thus this model is not relevant anymore.

Remark. —

One can go further to simplify (3.5) by taking w(ξ)=z(t)+ln(βeγ2), where ξ=γt, to get

w=ew(w+1),

where the prime denotes the derivative with respect to ξ, which leads to (3.6) satisfying

Xm=γβeγtmγtm+1ew(η)dη.

Remark. —

To continue to demonstrate the issues with initial conditions, we can express S(0) and I(0) in terms of z and the parameters. By definition of z, we obtain

S(0)=Nz˙(0)+γβ 3.7

and

I(0)=ez(0)rγ. 3.8

From the formulation of the observational model (3.5), we can see that a solution z is defined only in terms of βe and γ. Assuming that N is known, due to the definition of βe, the pair (β,r) cannot be uniquely identified from the data; in other words, for a given βe, any pair (β,r) satisfying βe=r1βN1 will produce exactly the same solution. This implies that in order to simulate future predictions with the original SIR formulation, one would need estimates for r, as well as the initial conditions. Indeed, from (3.7), we conclude that it is not possible to identify S(0) unless both β and r are known; in particular, since it is not possible to identify both β and r from the data, it is not possible to recover the initial conditions without an independent estimate of the under-reporting coefficient r. By contrast, an advantage of the formulation of the observational model is that it will allow us to compute the solution z, and predict future observations Xm, as long as we can fit the model to identify the value of βe. Note also that, without an independent estimate on r, we can only infer the ratio between the initial number of infectious individuals to the initial number of susceptible individuals since

I(0)S(0)=βeγez(0)z˙(0)+γ.

4. Existence

The proof of theorem 3.1 is based on the Leray–Schauder fixed point theorem [40]. We shall show that the solutions to (3.5) and (3.6) satisfy a priori bounds, and we will use them to show compactness of a suitable operator.

Lemma 4.1. —

Let X0>0 and X1>0. Suppose there exists a solution z to (3.5) and (3.6) with data X0 and X1. Then, there exists some constant C such that |z(t)|C, for t[t0,t2]. Moreover, C is only in terms of the parameters and the data.

Proof. —

Let βe¯:=βeγ1 and let τ[t0,t2]. Integrating (3.5) over (t0,τ), we have

z˙(τ)z˙(t0)=βe¯(ez(t0)ez(τ))βet0τez(s)ds.

Let t[t0,t2] and let δ0(t):=tt0. Then, by integrating with respect to τ(t0,t), we have

z(t)z(t0)δ0(t)z˙(t0)=βe¯(δ0(t)ez(t0)t0tez(τ)dτ)βet0tt0τez(s)dsdτ.

Since the exponential function is positive everywhere, we see that

0t0tez(τ)dτt0t2ez(τ)dτ=X0+X1,

and consequently

0t0tt0τez(s)dsdτδ0(t)(X0+X1).

Hence, we see that there exist bounds

z0,l(t)z(t)z0,u(t), 4.1

for t[t0,t2], where

z0,l(t):=z(t0)+δ0(t)(z˙(t0)+βe¯ez(t0))βe¯(1+γδ0(t))(X0+X1) 4.2

and

z0,u(t):=z(t0)+δ0(t)(z˙(t0)+βe¯ez(t0)). 4.3

These bounds depend on z and z˙ through their values at t0. Since we are interested in uniform bounds for any solution z, we need to show that z(t0) and z˙(t0) are bounded in terms of the data X0 and X1. By exponentiation and integration (4.1) over (t0,t2), we have

t0t2ez0,l(t)dt(X0+X1)t0t2ez0,u(t)dt, 4.4

and hence, by taking (4.2) into account,

t0t2ez0,l(t)dt=ez(t0)βe¯(X0+X1)t0t2eδ0(t)(z˙(t0)+βe¯ez(t0)βe(X0+X1))dt=ez(t0)βe¯(X0+X1)z˙(t0)+βe¯ez(t0)βe(X0+X1)(eδ0(t2)(z˙(t0)+βe¯ez(t0)βe(X0+X1))1).

To show that z(t0) and z˙(t0) are bounded, we proceed by contradiction. Let α0:=ez(t0) and σ0:=z˙(t0), and let

f0,l(α0,σ0):=α0eβe¯(X0+X1)σ0+βe¯α0βe(X0+X1)(eδ0(t2)(σ0+βe¯α0βe(X0+X1))1).

A standard application of L’Hôpital’s rule yields

limα0f0,l(α0,σ0)=andlimσ0f0,l(α0,σ0)=,

where we have fixed σ0 in the first limit and fixed α0 in the second limit, and similarly

limα0,σ0f0,l=.

Since (4.4) implies boundedness of f0,l, we conclude α0 and σ0 cannot be arbitrarily large and in consequence, z(t0) and z˙(t0) are bounded from above. Using a similar argument with the right-hand side bound of (4.4) and taking the limits α00 and σ0, we can conclude that z(t0) and z˙(t0) are also bounded from below.

Therefore, the bounds (4.2) and (4.3) are independent of the solution z, and depend only on the parameters and the data. ▪

We can now proceed to show that z˙ is also uniformly bounded in [t0,t2].

Lemma 4.2. —

Let X0>0 and X1>0. Suppose there exists a solution z to (3.5) and (3.6) with data X0 and X1. Then, there exists some constant C such that |z˙(t)|C, for t[t0,t2]. Moreover, C is only in terms of the parameters and the data.

Proof. —

Using the notation introduced in the proof of lemma 4.1, we integrate (3.5) over (t0,t), where t[t0,t2], to obtain

z˙(t)=z˙(t0)βe¯(ez(t)ez(t0))βet0tez(s)ds. 4.5

We can now use the uniform bounds on z, from lemma 4.1, together with the bounds on z(t0) and z˙(t0) that we obtained in the proof of lemma 4.1, to bound the right-hand side of (4.5). ▪

We can proceed using equations (3.5) and (3.6) and the bounds on z and z˙ to obtain uniform bounds on the second derivative, which in turns allows us to obtain uniform bounds for the third derivative by using them in (3.5). Indeed, we can bootstrap this argument to obtain uniform bounds up to the nth derivative, for finite n. We summarize this in the following corollary.

Corollary 4.3. —

Let n=0,1,,N¯, with N¯<, and let X0>0 and X1>0. Suppose there exists a solution z to (3.5) and (3.6) with data X0 and X1. Then, there exists some constant C(n), depending only on n, the data and the parameters, such that |z(n)(t)|<C(n) for all t[t0,t2].

We can now use the uniform bounds on the solutions to study a suitable operator that will allow us to define the solutions of (3.5) and (3.6) as a fixed point. Let

F[z](t):=az+bztt0tt0τβeez(s)(z˙(s)γ+1)dsdτ, 4.6

where az,bz depend on z and are uniquely determined under the constraints

t0t1eF[z](t)dt=X0andt1t2eF[z](t)dt=X1. 4.7

We note importantly that a fixed point of F is a solution to (3.5)–(3.6). Therefore, our goal is to apply the Leray–Schauder fixed point theorem to F to show the existence of a solution in C2(t0,t2). We will first show that F is a compact operator.

Lemma 4.4. —

Let F be defined as in (4.6). Then F is a compact operator in C2(t0,t2).

Proof. —

Let (zn)nN0 be a sequence of functions bounded in C2(t0,t2). We see that since

(F[zn])(3)=βez˙nezn(z˙nγ+1)βeeznz¨nγ,

we have (F[zn])(3) is also uniformly bounded, since (zn)n is uniformly bounded in C2(t0,t2). Therefore, the Arzela–Ascoli theorem implies that F is a compact operator in C2(t0,t2). We note that to apply the Arzela–Ascoli theorem in C2(t0,t2) suffices to show uniform boundedness of the third derivative, since then we also have equicontinuity of the second derivative; this is analogous to using uniform boundedness of the first derivative to show equicontinuity of the sequence of functions. ▪

With this set-up, in particular that F is compact in C2(t0,t2), we can now finish the proof of theorem 3.1.

Proof of theorem 3.1. —

In order to apply the Leray–Schauder fixed point theory, we now need to show that the set

{zC2(t0,t2):z=κF[z]} 4.8

is bounded for all κ[0,1]. It is easy to see, by linearity of the derivative and the integral, that a fixed point of κF corresponds to a fixed point of F with a change of parameters, namely, by replacing βe by κβe. Therefore, any function z in (4.8) is a solution to (3.5) and (3.6) with a suitable parameter, and as shown in lemma 4.1, is bounded in C2(t0,t2). Thus the Leray–Schauder fixed point theorem implies the existence of a fixed point for F, which, as already mentioned, corresponds to a solution of (3.5)–(3.6). ▪

5. Uniqueness

Now that we have proven that there exists a solution to (3.5)–(3.6), we now want to demonstrate the uniqueness of a solution to (3.5)–(3.6). We employ a standard contradiction argument for boundary value problems by showing that two different solutions, which satisfy the same boundary conditions, must be the same.

Proof of theorem 3.2. —

Let x(t) and y(t) be two solutions to (3.5) and (3.6) satisfying the same data conditions, that is, for m=0,1 we have

Xm=tmtm+1ex(s)ds=tmtm+1ey(s)ds.

Consequently, this means that

tmtm+1ex(s)ey(s)ds=0. 5.1

Let D(t):=x(t)y(t) and E(t):=ex(t)ey(t). By taking the difference of (3.5) for each solution, we have

D¨=βe(E+E˙γ). 5.2

To simplify the presentation, we set βe¯:=βeγ1 and denote

F(t):=D˙(t)+βe¯E(t)D˙(t0)βe¯E(t0).

Using this notation, (5.2) is equivalent to

F˙+βeE=0. 5.3

Let t[t0,t2]. Integrating (5.3) with respect to s(t0,t) gives

F(t)+βet0tE(s)ds=0. 5.4

Moreover, by (5.1) and (5.4), one can see that

F(t1)=F(t2)=0. 5.5

We now proceed by contradiction to show that D(t0)=0. Without loss of generality, assume that D(t0)>0. This gives us that E(t0)>0 by definition and F˙(t0)<0 by (5.3). Considering this, (5.5) and the fact that x, yC2(t0,t2), we have that there exists τ(t0,t1] such that F(τ)=0 and F(t)<0 for all t(t0,τ). In particular, this means that, by integrating F with respect to s(t0,τ), we have

t0τF(s)ds<0 5.6

and, setting t=τ in (5.4), we have

t0τE(s)ds=0. 5.7

Then, by integrating (5.4) with respect to t(t0,τ) and using (5.6), we obtain

0=t0τF(t)dt+βet0τt0tE(s)dsdt<βet0τt0tE(s)dsdt=0,

where the last equality is a consequence of Fubini’s theorem and (5.7). Therefore, we have a contradiction. One notices that, using the same approach, we can also conclude that D(t1)=0 due to (5.5). By integrating (5.4) with respect to t(t0,t1), we have that

D(t1)+βe¯t0t1E(t)dt+βet0t1t0tE(s)dsdt=D(t0)+(t1t0)(D˙(t0)+βe¯E(t0)),

which, by noting that D(t0)=0 implies that E(t0)=0 and using (5.1), gives us

D˙(t0)=0.

Standard ODE theory implies that, since x(t0)=y(t0), x˙(t0)=y˙(t0) and x, yC2(t0,t2), we have x(t)=y(t) for all t[t0,t2]. ▪

6. Numerical examples

Now that we have demonstrated existence and uniqueness, we can build a numerical algorithm to approximate the solution to (3.5)–(3.6). We set up the numerical problem as follows: given X0, X1, βe and γ, we look to find the function z that satisfies (3.5)–(3.6). We use the shooting method to numerically approximate the solution to (3.5), see for example [41] for details. We use a superscript 0 to denote the initial approximation to z and a superscript ε to denote the converged solution from the shooting method.

For our first example, we look at the case where the number of detected cases is reducing. We set βe=8×104, γ=1 and use the data X0=100 and X1=50 with t0=0, t1=1 and t2=2. For the shooting method, we use an initial guess of z0(0)=6, z˙0(0)=1, depicted by the dashed blue line in figure 2a, which results in

01ez0(s)ds255.00079468571883and12ez0(s)ds93.80954898172891.

The resulting initial conditions from the shooting method are zε(0)=4.92579565 and z˙ε(0)=0.66986594, with the resulting function zε depicted by the continuous orange line in figure 2a. Even although we can typically gain knowledge of the total population N, through for example a population census, as previously mentioned we cannot extract β from βe due to the under-reporting parameter r. However, for the purpose of demonstration, we fix r=0.75 and N=1000 to allow us to demonstrate I0 and Iε, the number of infectious people transformed from z0 and zε, respectively. Figure 2b depicts the transformed initial guess I0 as the dashed blue line and depicts the transformed converged solution Iε as the continuous orange line.

Figure 2.

Figure 2.

Demonstration of the numerical approach to approximate the solutions to (3.5) and (3.6) in the case of the number of detected cases is reducing. (a) Comparison of the initial guess for z and the converged numerical approximation for zε. (b) Comparison of the transformed initial guess for I and the transformed converged numerical approximation for Iε. (Online version in colour.)

For our second example, we look at the case where the number of detected cases is increasing. We set βe=2×103, γ=1 and use the data X0=50 and X1=200 with t0=0, t1=1 and t2=2. For the shooting method, we use an initial guess of z0(0)=6 and z˙0(0)=1, depicted by the dashed blue line in figure 3a, which results in

01ez0(s)ds517.9831215391448and12ez0(s)ds423.89217327827055.

The resulting initial conditions from the shooting method are zε(0)=2.9481878 and z˙ε(0)=1.75958188, with the resulting function zε depicted as the continuous orange line in figure 3a. Again, we fix r=0.75 and N=1000 to allow us to demonstrate I0 and Iε in figure 3b, whereby the transformed initial guess I0 is depicted by the dashed blue line and the transformed converged solution Iε is depicted by the continuous orange line.

Figure 3.

Figure 3.

Demonstration of the numerical approach to approximate the solutions to (3.5) and (3.6) in the case of the number of detected cases is increasing. (a) Comparison of the initial guess for z and the converged numerical approximation for zε. (b) Comparison of the transformed initial guess for I and the transformed converged numerical approximation for Iε. (Online version in colour.)

7. Conclusion

In this publication, we have demonstrated how to derive the observational model given data that describes the infectious compartment I, shown that a solution to this formulation exists and that the solution is unique. We have then gone on to demonstrate an algorithm to approximate the solution to the nonlinear observational model. While it is standard these days for more complex and descriptive compartmental models to be analysed and used in practice, this study provides a first illustration of the analysis for the novel boundary value approach, which we intend to build upon to combat the issues outlined in §§2 and 3. It is not hard to reason that using two data points is not enough to provide accurate forecasting capabilities for the short and medium term, but this result combined with parameter estimation with more data points and a model selection approach will shed light on how one can use this approach.

Now that we have guaranteed that the solution to the observational model is unique, given some parameters, we next look to answer the questions of the identifiability of the parameters from the data and how to appropriately deal with noise in the data. That is to say, how much data do we need to be able to uniquely identify the parameters βe and γ as well as z? These are questions of particular importance as understanding what we can identify, and the accuracy of the estimations, helps us to understand the scenario-based forecasting capabilities of the mathematical model. Understanding this and therefore the underlying assumptions needed to produce forecasting results, given the limelight of compartmental models in recent history, will allow for a confident and trustworthy exchange of knowledge between mathematical modellers and those with research questions on infectious diseases, such as local government or healthcare providers.

Acknowledgements

All authors acknowledge the continued support and collaboration of Brighton and Hove City Council, East and West Sussex County Council and the NHS Sussex Commissioners and thank them for the opportunity for past collaborations that led to the conceiving of this manuscript.

Data accessibility

This article has no additional data.

Authors' contributions

E.C.-F.: conceptualization, formal analysis, funding acquisition, investigation, methodology, validation, visualization, writing—original draft, writing—review and editing; H.W.: formal analysis, investigation, validation, visualization, writing—review and editing; J.V.Y.: conceptualization, formal analysis, funding acquisition, investigation, methodology, supervision, validation, visualization, writing—original draft, writing - review and editing; D.-L.D.: formal analysis, investigation, methodology, validation, visualization, writing—review and editing; A.M.: conceptualization, funding acquisition, supervision, writing—review and editing.

All authors gave final approval for publication and agreed to be held accountable for the work performed therein.

Conflict of interest declaration

We declare that we have no competing interests.

Funding

H.W. was supported by the Higher Education Innovation Fund through the University of Sussex, J.V.Y. and D.-L.D. were supported by Brighton and Hove City Council, East and West Sussex County Council and the NHS Sussex Commissioners, and E.C.-.F. was supported by the Wellcome Trust grant no. 204833/Z/16/Z. This work was partly supported by the Global Challenges Research Fund through the Engineering and Physical Sciences Research Council grant no. EP/T00410X/1: UK-Africa Postgraduate Advanced Study Institute in Mathematical Sciences (A.M. and E.C.-F.). A.M.’s work was partially funded by grants from the Health Foundation (1902431) and the NIHR (NIHR133761) and by an individual grant from Dr Perry James (Jim) Browne Research Centre on Mathematics and its Applications (University of Sussex). A.M. is a Royal Society Wolfson Research Merit Award Holder funded generously by the Wolfson Foundation.

References

  • 1.Cauchemez S, Ferguson NM, Wachtel C, Tegnell A, Saour G, Duncan B, Nicoll A. 2009. Closure of schools during an influenza pandemic. Lancet Infect. Dis. 9, 473-481. ( 10.1016/S1473-3099(09)70176-8) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Fraser C. 2009. et al. Pandemic potential of a strain of influenza A (H1N1): early findings. Science 325, 1557-1561. ( 10.1126/science.1176062) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Fineberg HV. 2014. Pandemic preparedness and response—lessons from the H1N1 influeza of 2009. N. Engl. J. Med. 370, 1335-1342. ( 10.1056/NEJMra1208802) [DOI] [PubMed] [Google Scholar]
  • 4.Yang Y, Sugimoto JD, Halloran ME, Basta NE, Chao DL, Matrajt L, Potter G, Kenah E, Longini IM. 2009. The transmissibility and control of pandemic influenza A (H1N1) virus. Science 326, 729-733. ( 10.1126/science.1177373) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Asher J et al. 2017. Preliminary results of models to predict areas in the Americas with increased likelihood of Zika virus transmission in 2017. BioRxiv 187591 [Preprint]. ( 10.1101/187591) [DOI]
  • 6.Ferguson NM, Cucunubá ZM, Dorigatti I, Nedjati–Gilani GL, Donnelly CA, Basáñez MG, Nouvellet P, Lessler J. 2016. Countering the Zika epidemic in Latin America. Science 353, 353-354. ( 10.1126/science.aag0219) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Lowe R., Barcellos C, Brasil Pícia, Cruz O, Honório N, Kuper H, Carvalho M.. 2018. The Zika virus epidemic in Brazil: from discovery to future implications. Int. J. Environ. Res. Public Health 15, 96. ( 10.3390/ijerph15010096) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Zhang Q et al. 2017. Spread of Zika virus in the Americas. Proc. Natl Acad. Sci. USA 114, E4334-E4343. ( 10.1073/pnas.1620161114) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Ferguson NM et al. 2020. Report 9: impact of non-pharmaceutical interventions (NPIs) to reduce COVID-19 mortality and healthcare demand. See https://spiral.imperial.ac.uk:8443/handle/10044/1/77482. ( 10.25561/77482) [DOI]
  • 10.Kissler SM, Tedijanto C, Goldstein E, Grad YH, Lipsitch M. 2020. Projecting the transmission dynamics of SARS-CoV-2 through the postpandemic period. Science 368, 860-868. ( 10.1126/science.abb5793) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Adam D. 2020. Special report: the simulations driving the world’s response to COVID-19. Nature 580, 316-318. (doi: 0.1038/d41586-020-01003-6) [DOI] [PubMed] [Google Scholar]
  • 12.Wu JT, Leung K, Leung GM. 2020. Nowcasting and forecasting the potential domestic and international spread of the 2019-nCoV outbreak originating in Wuhan, China: a modelling study. Lancet 395, 689-697. ( 10.1016/S0140-6736(20)30260-9) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Kermack WO, McKendrick AG. 1927. A contribution to the mathematical theory of epidemics. Proc. R. Soc. Lond. A 115, 700-721. ( 10.1098/rspa.1927.0118) [DOI] [Google Scholar]
  • 14.Bailey NT. 1975. The mathematical theory of infectious diseases and its applications. High Wycombe, UK: Charles Griffin & Company LTD. [Google Scholar]
  • 15.Brauer F, Castillo–Chavez C, Castillo–Chavez C. 2012. Mathematical models in population biology and epidemiology. New York, NY: Springer. [Google Scholar]
  • 16.Diekmann O, Heesterbeek H, Britton T. 2012. Mathematical tools for understanding infectious disease dynamics. Princeton, NJ: Princeton University Press. [Google Scholar]
  • 17.Hethcote HW. 2000. The mathematics of infectious diseases. SIAM Rev. 42, 599-653. ( 10.1137/S0036144500371907) [DOI] [Google Scholar]
  • 18.Campillo-Funollet E et al. 2021. Predicting and forecasting the impact of local outbreaks of COVID-19: use of SEIR-D quantitative epidemiological modelling for healthcare demand and capacity. Int. J. Epidemiol. 50, 1103-1113. ( 10.1093/ije/dyab106) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Magal P, Webb G. 2018. The parameter identification problem for SIR epidemic models: identifying unreported cases. J. Math. Biol. 77, 1629-1648. ( 10.1007/s00285-017-1203-9) [DOI] [PubMed] [Google Scholar]
  • 20.Bernfeld SR, Lakshmikantham V. 1974. An introduction to nonlinear boundary value problems. New York, NY: Academic Press. [Google Scholar]
  • 21.Macado JT, Kiryakova V, Mainardi F. 2011. Recent history of fractional calculus. Commun. Nonlinear Sci. Numer. Simul. 16, 1140-1153. ( 10.1016/j.cnsns.2010.05.027) [DOI] [Google Scholar]
  • 22.Brown RF. 1993. A topological introduction to nonlinear analysis. Boston, MA: Birkhäuser. [Google Scholar]
  • 23.Hastings SP, McLeod JB. 2011. Classical methods in ordinary differential equations: with applications to boundary value problems. Providence, RI: American Mathematical Society. [Google Scholar]
  • 24.Mawhin J. 1979. Topological degree methods in nonlinear boundary value problems. Providence, RI: American Mathematical Society. [Google Scholar]
  • 25.Meyer GH. 1973. Initial-value methods for boundary value problems: theory and application of invariant imbeddings. New York, NY: Academic Press. [Google Scholar]
  • 26.Ntouyas SK. 2006. Nonlocal initial and boundary value problems: a survey. In Handbook of differential equations: ordinary differential equations, vol. 2, pp. 461–557. North-Holland, Netherlands: Elsevier. ( 10.1016/S1874-5725(05)80008-2) [DOI]
  • 27.Douglas M, Katikireddi SV, Taulbut M, McKee M, McCartney G. 2020. Mitigating the wider health effects of COVID-19 pandemic response. BMJ 369, m1557. ( 10.1136/bmj.m1557) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Cullen W, Gulati G, Kelly BD. 2020. Mental health in the COVID-19 pandemic. QJM 113, 311-312. ( 10.1093/qjmed/hcaa110) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Pfefferbaum B, North CS. 2020. Mental health and the COVID-19 pandemic. N. Engl. J. Med. 383, 510-512. ( 10.1056/NEJMp2008017) [DOI] [PubMed] [Google Scholar]
  • 30.Nicola M, Alsafi Z, Sohrabi C, Kerwan A, Al-Jabir A, Iosifidis C, Agha M, Agha R. 2020. The socio-economic implications of the coronavirus pandemic (COVID-19): a review. Int. J. Surg. 78, 185-193. ( 10.1016/j.ijsu.2020.04.018) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Hodgson D, Flasche S, Jit M, Kucharski AJ. 2021. The potential for vaccinated-induced herd immunity against SARS-CoV-2 B.1.1.7 variant. Euro Surveill. 26, 2100428. ( 10.2807/1560-7917.ES.2021.26.20.2100428) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Harris RJ, Hall JA, Zaidi A, Andrews NJ, Dunbar JK, Dabrera G. 2021. Impact of vaccination on household transmission of SARS-CoV-2 in England. See https://static.poder360.com.br/2021/04/public-health-1stdose-abr2021.pdf.
  • 33.Davies NG et al. 2021. Estimated transmissibility and impact of SARS-CoV-2 lineage B.1.1.7 in England. Science 372, eabg3055. ( 10.1126/science.abg3055) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Moore S, Hill EM, Tildesley MJ, Dyson L, Keeling MJ. 2021. Vaccination and non-pharmaceutical interventions for COVID-19: a mathematical modelling study. Lancet Infect. Dis. 21, 793-802. ( 10.1016/S1473-3099(21)00143-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Stortelder WJ. 1996. Parameter estimation in dynamic systems. Math. Comput. Simul. 42, 135-142. ( 10.1016/0378-4754(95)00117-4) [DOI] [Google Scholar]
  • 36.Capistrán MA, Moreles MA, Lara B. 2009. Parameter estimation of some epidemic models. The cases of recurrent epidemics caused by respiratory syncytial virus. Bull. Math. Biol. 71, 1890-1901. ( 10.1007/s11538-009-9429-3) [DOI] [PubMed] [Google Scholar]
  • 37.Gábor A, Banga JR. 2015. Robust and efficient parameter estimation in dynamic models of biological systems. BMC Syst. Biol. 9, 1-25. ( 10.1186/s12918-015-0219-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Cheetham N, Waites W, Ebyarimpa I, Leber W, Brennan K, Panovska-Griffiths J. 2021. Determining the level of social distancing necessary to avoid future COVID-19 epidemic waves: a modelling study for North East London. Sci. Rep. 11, 5806. ( 10.1038/s41598-021-84907-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Teschl G. 2012. Ordinary differential equations and dynamical systems. Providence, RI: American Mathematical Society. [Google Scholar]
  • 40.Zou H. 2008. Fixed point theory and elliptic boundary value problems. In Handbook of differential equations: stationary partial differential equations, vol. 6, pp. 503–583. North-Holland, Netherlands: Elsevier. ( 10.1016/S1874-5733(08)80024-1) [DOI]
  • 41.Press WH, Vetterling WT, Teukolsky SA, Flannery BP. 1986. Numerical recipes. Cambridge, UK: Cambridge University Press. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

This article has no additional data.


Articles from Philosophical transactions. Series A, Mathematical, physical, and engineering sciences are provided here courtesy of The Royal Society

RESOURCES