Abstract
The aim of this article is to understand how to apply partial or total containment to SIR epidemic model during a given finite time interval in order to minimize the epidemic final size, that is the cumulative number of cases infected during the complete course of an epidemic. The existence and uniqueness of an optimal strategy are proved for this infinite-horizon problem, and a full characterization of the solution is provided. The best policy consists in applying the maximal allowed social distancing effort until the end of the interval, starting at a date that is not always the closest date and may be found by a simple algorithm. Both theoretical results and numerical simulations demonstrate that it leads to a significant decrease in the epidemic final size. We show that in any case the optimal intervention has to begin before the number of susceptible cases has crossed the herd immunity level, and that its limit is always smaller than this threshold. This problem is also shown to be equivalent to the minimum containment time necessary to stop at a given distance after this threshold value.
Keywords: Optimal control, SIR epidemic model, Herd immunity, Lockdown policy, Epidemic final size
Introduction
The current outbreak of Covid-19 and the entailed implementation of social distancing on an unprecedented scale led to renewed interest in modeling and analysis of this method to control infectious diseases. In contrast with a recent trend of papers that aim at giving a large account of the complexity of the pandemic, in its epidemiological dimension, but also from the point of view of the functioning of the hospital and public health systems, and even possibly the behavioral aspects, our perspective here is quite different. Our purpose is to study, at a theoretical level, how to use social distancing in an optimal way, in order to minimize the cumulative number of cases infected during the course of an epidemic. This issue is addressed in the framework of the classical SIR model, in line with other papers that studied optimal epidemic control, through treatment, vaccination, quarantine, isolation or testing. This simplified setting deliberately ignores many features important in the effective handling of a human epidemic: population heterogeneity, limited hospital capacity, imprecision of the epidemiological data (including the question of the asymptomatic cases), partial respect of the confinement, etc. On the other hand, thanks to its simplicity complete, computable, solutions are achievable to serve as landmark for real situations. In a nutshell, our aim is to determine what best result may be obtained in terms of reduction of the total cumulative number of infected individuals by applying lockdown of given maximal intensity and duration, in the worst conditions where no medical solution is discovered to stop earlier the epidemic spread.
The SIR model is described by the following system, in which all parameters are positive:
1a |
1b |
1c |
The state variables S, I and R correspond, respectively, to the proportions of susceptible, infected and removed individuals in the population. The sum of the derivatives of the three state variables is zero, so the sum of the variables remains equal to 1 if initially , and one may describe the system solely with Eqs. (1a)–(1b). The total population is constant and no demographic effect (births, deaths) is modeled, as they are not relevant to the time scale to be taken into account in reacting to an outbreak. The infection rate accounts globally for the rate of encounters between the individuals and the probability of transmitting the infection during each of these encounters. The parameter is the recovery rate. Recovered individuals are assumed to have acquired permanent immunity.
Obviously, every solution of (1) is nonnegative, and thus, at any instant, so S and decrease, while R increases. We infer that every state variable admits a limit and that the integral converges. Therefore,
2 |
The basic reproduction number governs the behavior of the system departing from its initial value. When , is always negative: I decreases and no epidemic may occur. When epidemic occurs if . In such a case, I reaches a peak and then goes to zero. From now on, we assume
An important issue is herd immunity. The latter occurs naturally when a large proportion of the population has become immune to the infection. Mathematically, it is defined as the value of S below which the number of infected decreases. For the SIR model (1), one has , and
3 |
Notice that, after passing the collective immunity threshold, that is for large enough t, one has
4 |
While the number of infected decreases when , epidemics continue to consume susceptible and to generate new infections once the immunity threshold has been crossed. The number of infected cases appearing after this point may be quite large. In order to illustrate this fact, Table 1 displays, for several values of , the value of the herd immunity threshold and the number of susceptibles that remain after the fading out of the outbreak, in the case of an initially naive population (). The proportion of infections occurred after the overcome of the immunity is also shown.
Table 1.
1.5 | 2 | 2.5 | 2.9 | 3 | 3.5 | |
---|---|---|---|---|---|---|
0.67 | 0.50 | 0.40 | 0.34 | 0.33 | 0.29 | |
0.42 | 0.20 | 0.11 | 0.067 | 0.059 | 0.034 | |
43% | 37% | 33% | 30% | 29% | 27% |
The value of comes from (3), and may be deduced from the fact that , see Lemma 4.2 below. The ratio represents the proportion of susceptible that occur after passing the collective immunity threshold. The column in bold corresponds to found in [39] for the SARS-CoV-2 in France before the lockdown of March–May 2020
Apart from medical treatment, there are generally speaking three main methods to control human diseases. Each of them, alone or in conjunction with the others, gave rise to applications of optimal control. A first class of interventions consists in vaccination or immunization [2, 4, 8, 10, 11, 14, 16–18, 20, 30, 37, 40, 43]. It consists in transferring individuals from the S compartment to the R one. The members of the latter may not be stricto sensu recovered: they are removed from the infective process. A second class of measures corresponds to screening and quarantining of infected [1, 3, 8, 10, 11, 41, 42, 44]. It may be modeled by transfer of individuals from the I compartment to the R one (which consequently will change its meaning). Last, it is possible to reduce transmission through health promotion campaigns or lockdown policies [8, 10, 34, 36]. Of course these methods may be employed jointly [18].
Other modeling frameworks have also been considered. More involved models called SEIR and SIRS have been analyzed in [8, 16], and SIR model structured by the infection age in [3]. Constraints on the number of infected persons that the public health system can accommodate or on available resources, particularly in terms of vaccination, were studied [10, 11, 34, 43, 44]. Economical considerations may be aggregated to the epidemiological model [5, 29, 38]. Ad hoc models for tackling emergence of resistance to drugs issues have been introduced [22, 23], as well as framework allowing to study revaccination policies [27, 28]. Optimization of vaccination campaigns for vector-borne diseases has also been considered [40]. Optimal public health intervention as a complement to such campaigns has been studied in [12, 13] (see [33] for more material on behavioral epidemiology). The references cited above are limited to deterministic differential models, but discrete-time models and stochastic models have also been used, e.g., in [1, 2, 17].
The costs considered in the literature are usually integral costs combining an “outbreak size” (the integral of the number of infected, or of the newly infected term, or the largest number of infected) and the input variable on a given finite time horizon. Some minimal time control problems have also been considered [4, 10, 11, 21]. The integral of the deviation between the natural infection rate and its effective value due to confinement is used in [34], together with constraints on the maximal number of infected. In [7], the authors minimize the time needed to reach herd immunity, under the constraint of keeping the number of infected below a given value, in an attempt to preserve the public health system. Few results consider infinite horizon [8]. Qualitatively, optimal solutions attached to the vaccination or isolation protocols are in general bang-bang,1 with an intervention from the very beginning. Bounds on the number of switching times between the two modes (typically zero or one) are sometimes provided. By contrast, protocols that mitigate the transmission rate (health promotion campaigns or lockdown policies) usually provide bang-bang optimal solutions with transmission reduction beginning after a certain time. Generally speaking, there exists no ideal strategy, and the achievement of some policy objective may preclude success with others [19].
The present article is dedicated to the optimal control issue of obtaining, by enforcing social distancing, the largest value for , the limit number of susceptible individuals at infinity. Using classical vocabulary of epidemiology, this is equivalent to minimize the attack ratio or its unnormalized counterpart, the final size of the epidemic. Abundant literature exists concerning this quantity, since Kermack and Mc Kendrick’s paper from 1927 [25], see, e.g., [6, 24, 32, 35] for important contributions to its computation in various deterministic settings. Said otherwise, we seek here to determine how close to the herd immunity threshold it is possible to stop the spread of a disease, in the case where no vaccine or treatment is found to modify its evolution. A possible action is to let the proportion of susceptible reach the collective immunity level and then impose total lockdown, putting in (1) after a certain time. This situation is illustrated in Fig. 1. Merely as a reference point, we use the figures of the Covid-19 in France in Spring 2020, borrowed from [39] and given in Table 2. In this way one may stop exactly at the herd immunity threshold; however, this is achieved by applying total lockdown during infinite time duration (Fig. 2).
Table 2.
Parameter | Name | Value |
---|---|---|
Infection rate | 0.29 | |
Recovery rate | 0.1 | |
Lockdown level (France, March/May 2020) | 0.231 | |
Initial proportion of susceptible | ||
Initial proportion of infected | ||
Initial proportion of removed | 0 |
On the contrary, our aim here is to focus more realistically on interventions taking place on a given finite time interval, through possibly partial lockdown. A closely related issue is studied in [26], where only total lockdown is considered.2 Our perspective is slightly different here, as we are equally interested by enforcement measures inducing partial contact reduction. In [15] the authors explore by thorough numerical essays conducted on SIR model the impact of one-shot lockdowns of given intensity and duration, initiated when reaches a certain threshold, with regard to three quantities of interest: the final size, the peak prevalence and the average time of infection. Prolonging the work of the present article, [9] analyzes how to optimally choose the onset of such one-shot interventions, with the aim of maximizing the epidemic final size.
For better readability, all results are exposed in Sect. 2. It is first shown in Sect. 2.1 (Proposition 2.1) that, for strong enough lockdown measures and long enough intervention time T, one can stop the epidemics arbitrarily close after the herd immunity is attained. In Sect. 2.2, we provide and analyze the optimal control law that leads asymptotically, through an intervention of duration T, to the largest number of susceptible individuals. Theorem 2.2 establishes existence and uniqueness of the solution to the optimal control problem, which is bang-bang with a unique commutation from the nominal value to the minimal allowed value of the transmission rate. Theorem 2.3 characterizes in a constructive way the time of this commutation and situates the latter with respect to the peak of the epidemic, and the corresponding proportion of susceptible with respect to the herd immunity level. Last, we show in Sect. 2.3 (Theorem 2.4) that this optimal strategy coincides with a time minimal policy. The results are numerically illustrated in Sect. 3. For the sake of readability, all the proofs are postponed to Sect. 4. Concluding remarks are given in Sect. 5. Last, details on implementation issues are provided in Appendix A.
Notice that from a technical point of view, the problem under study is by nature an optimal control problem on an infinite horizon, as it aims to maximize the limit of the proportion of susceptible cases. However, by using a quantity invariant along the trajectories (see end of Sect. 2.1 and Lemma 4.2 in Sect. 4.1), one is able to formulate the problem over a finite time horizon, thereby allowing for easier handling.
Main Results
According to the introduction in the previous section, we consider in the sequel the following “SIR type” system
5 |
complemented with nonnegative initial data , such that . The time-varying input control u models public interventions on the transmission rate by measures like social distancing, restraining order, lockdown, and so on, imposed on finite time horizon. For given and , u is assumed to belong to the admissible set defined by
The constant T characterizes the duration of the intervention, and its maximal intensity (typically the strength of a lockdown procedure). Admittedly, u represents here a rather abstract quantity: it quantifies how much the nonpharmaceutical measures reduce the rate of contact between susceptible and infected individuals. In practice, though, it would be quite difficult to measure its value, or to assign it a prescribed value. On the other hand, as seen later in Theorem 2.2, the optimal control follows an “all or nothing” pattern: it takes on only the two extreme values, which correspond to stronger possible lockdown or no lockdown at all, so that a precise tuning is not requested in effect.
Toward an Optimal Control Problem: Reachable Asymptotic Immunity Levels
The following result assesses the question of stopping the evolution exactly at, or arbitrarily close to, the herd immunity defined by (3).
Theorem 2.1
Let and . Assume that and consider
6 |
-
(i)There is no and such that the solution S to (5) associated to u satisfies
-
(ii)If , then, for all , there exist and a control such that the solution S to (5) associated to u satisfies
- (iii)
The proof of Theorem 2.1 is given in Sect. 4.2. Theorem 2.1 states that no finite time intervention is able to stop the epidemics before or exactly at the herd immunity. However, one may stop arbitrarily close to the latter by allowing sufficiently long intervention, provided that the intensity is sufficiently strong. This is not true in the opposite case, and the constant is tight.
Remark 2.1
The values given in Table 2, which correspond to the sanitary measures put in place in France during the March to May 2020 containment period, satisfy and (see Sect. 3). According to Table 1, there thus exists a containment strategy that increases the limit proportion of susceptible by .
Orientation. When possible, stopping S arbitrarily close to the herd immunity threshold is only possible by sufficiently long intervention of a strong enough intensity. To determine the closest state to this threshold attainable by control of maximal intensity on the interval [0, T], one is led to consider the following optimal control problem: where
with (S, I) the solution to (5) associated to u.
We now make an observation, which will be crucial in the analysis (see details in Sect. 4.3). It turns out that the quantity is constant on any time interval on which (see Lemma 4.2 in Sect. 4.1). Therefore, using the fact that (see formula (2)) and the monotonicity of , the optimal control problem is indeed equivalent to
where (S, I) is the solution to (5) associated to u.
Optimal Immunity Control
This section is devoted to the analysis of the optimal control problem . The first result reduces the study of this problem to the solution of a one dimensional optimization problem whose unknown, denoted , stands for a switching time. For simplicity, given , and , we define the function by
7 |
Also, we denote the solution of (5) with (see Fig. 2).
Theorem 2.2
Let and . Problem admits a unique solution . Furthermore,
-
(i)
the maximal value is nonincreasing with respect to and nondecreasing with respect to T.
-
(ii)
there exists a unique such that (in particular, the optimal control is bang-bang).
The proof of Theorem 2.2 is given in Sect. 4.3.
At this step, the result above proves that the optimal control belongs to a family of bang-bang functions parameterized by the switching time . This reduces the study to a simpler optimization problem. The switching time associated to the optimal solution of Problem solves the 1D optimization problem where is defined by (7). The next result allows to characterize the optimal for .
In order to state it, let us introduce the function
8 |
where denotes the solution to (5) with .
This function is of particular interest since we will prove that it has the opposite sign to the derivative of the criterion , a property of great value to pinpoint its extremum. In the following result, we will provide a complete characterization of the optimal control , showing that the control structure only depends on the sign of .
Theorem 2.3
Let , and the unique optimal solution to Problem (so that defined as in (7) is the optimal solution to Problem ). The function is decreasing on (0, T) and one has the following characterization:
if , then ;
- if , then is the unique solution on (0, T) to the equation
9
Moreover, if , then , i.e., , where, in agreement with (7), denotes the solution to System (5) with .
In the particular case , one has if, and only if , and in that case, is the unique solution to the equation
Together, the reduction to the optimal control achieved in Theorem 2.2 and the characterization of the optimal in Theorem 2.3 show that solving Eq. (9) is exactly what is needed to solve Problem . Associated search algorithms are presented in Sect. 3. Theorem 2.3 also establishes that the commutation of the optimal control occurs before the reach of the herd immunity level, and fulfills a simple equation in the case .
Remark 2.2
(State-feedback form of the optimal control) It is worth emphasizing that the optimal control is obtained under state-feedback form. As a matter of fact, at each time instant , one may derive from the current state value (S(t), I(t)) the value of the optimal control input by an explicit algorithm. It is sufficient for this to compute the quantity (which amounts to solve the ODE on the interval [t, T] and estimate the integral in (8)): with that done, one must set if , and if . This situation is in sharp contrast with the usual one, where finding the optimal control necessitates the resolution of the two-point boundary-value problem derived from Pontryagin maximum principle.
Relations with the Minimal Time Problem
We have shown in Theorem 2.1 that, for sufficiently small, there exist for every a time of control T and a control for which . Next one may wonder what is, for a given , the minimal time T for which this property holds. This amounts to solve the following optimal control problem.
We recall that is defined in Theorem 2.2. The following result answers this question by noting that solving this problem is equivalent to solve Problem .
Theorem 2.4
Assume (defined in (6)) and let .
Let be the solution to the minimal time problem above and denote the corresponding control function. Then, is the unique solution of Problem determined in Theorems 2.2 and 2.3 associated to .
Conversely, let and the maximum of Problem . Then, T is the minimal time of intervention such that for some .
Numerical Illustrations
To fix ideas, we use the parameter values given in Table 2, coming from [39] and corresponding to the lockdown conditions in force in France between March 17th and May 11th 2020. We suppose that, on the total population of persons in France, there are no removed individuals and 1000 infected individuals at the initial time, i.e., and .
All ODE solutions have been computed with the help of a Runge–Kutta fourth-order method. Optimization algorithms and details on the computational aspects may be found in Appendix A. To check the correctness of the results, we systematically compute and compare the solutions of Problem , obtained directly by projected gradient descent (see Algo. 1), and the solutions of the simpler one-dimensional Problem , obtained by bisection method (see Algo. 2). Solutions do not depend upon which optimization algorithm is used, confirming the theoretical results.
We first show in Figs. 3 and 4 the optimal solutions corresponding to the parameter choices and , with . To capture the full behavior of the trajectories, the computation window is [0, 200]. As expected the solutions are bang-bang. One also observes that the optimal value is larger for smaller , as predicted in Theorem 2.2.
By using Lemma 4.2, it is easy to determine numerically the optimal value by solving the equation
for solution of . This allows to investigate numerically in Figs. 5 and 6 the dependency of and with respect to the parameters T and . As announced in Theorem 2.2, the optimal value is nonincreasing with respect to and nondecreasing with respect to T.
In Fig. 5, for , we observe numerically that the lower bound given in Theorem 2.1 and below which S can get as close as we want to over an infinite horizon, is optimal (). In particular, for lockdown conditions similar to the ones in effect in France between March and May 2020 (), it appears that it is possible to come as close as desired to the optimal bound of inequality (4). Interestingly, we observe in Fig. 6-left that when is too large, then for T large enough.
Another interesting feature can be observed in Fig. 6: the instant at which the optimal lockdown begins is almost independent of the level of the maximal lockdown for “small enough” duration T (say days). In other words, the optimal choice of this instant, and thus the optimal control itself, is robust with respect to the uncertainties on the lockdown duration and intensity, provided the former is not too long and the latter not too weak.
We also mention that Fig. 5-left gives the solution to the minimal time problem. Indeed, from Theorem 2.4, a control is optimal for iff it is optimal for the minimal time problem. Then, given and , the minimal time of action such that the final value of susceptible is at a distance of is obtained by computing the intersection of the curves in Fig. 5-left with the horizontal line . As expected, when is too large, i.e., when the lockdown is insufficient, the solution stays far away from (see Fig. 5-right).
We conclude this section by examining in Fig. 7 the influence of the initial data, and , on the optimal intervention time. Let us mention that, in agreement with the conclusions of [15], the numerical simulation in this figure seems to indicate that the larger the quantity , the earlier the optimal lockdown strategy should be applied.
Proofs of the Main Results
Preliminary Results
Before proving the main results, we provide some useful elementary properties of the state variables (S, I) solving (5) whose role in the sequel will be central. To this aim, it is convenient to introduce the function defined for any
10 |
Let us start with a preliminary result regarding the values of along the trajectories of System (5).
Lemma 4.1
For any and , one has
11 |
along any trajectory of system (5). In particular, if u is constant on a nonempty, possibly unbounded, interval, then the function is constant on this interval along any trajectory of System (5).
Proof
The proof of (11) follows from straightforward computations. Indeed along every trajectory of (5), one has
The second part of the statement is an obvious byproduct of this property, by setting .
Lemma 4.1 allows to characterize the value of the limit of S at infinity, as now stated.
Lemma 4.2
Let . For any trajectory of (5), the limit of S(t) at infinity exists and is the unique solution in the interval of the equation
12 |
where is given by (10).
Proof
Any input control u from is equal to 1 on . Hence, applying Lemma 4.1 with on this interval yields (12), by continuity of and because of (2). Moreover, Eq. (12) has exactly two roots. Indeed, this follows by observing that the mapping is first decreasing and then increasing on , with infinite limit at and , and minimal value at , equal to . We conclude by noting that the limit cannot be larger than : otherwise there would exist such that and for T large enough, so that S would tend to zero at infinity, yielding a contradiction. It follows that the value of is thus the smallest root of (12).
Remark 4.1
(On the control in infinite time) Notice that, in the quite unrealistic situation where one is able to act on System (5) up to an infinite horizon of time, then the optimal strategy to maximize is to consider the constant control function on . Indeed, according to Lemma 4.1,
for all . Hence, with equality if, and only if a.e. on . Since is maximal whenever is minimal, we get that the optimal strategy in that case corresponds to the choice a.e. on . Moreover, it is notable that this maximal value is computed by solving the nonlinear equation . Therefore, an easy application of the implicit functions theorem yields that the mapping is decreasing.
Proof of Theorem 2.1
Let us start with (i). According to (5), one has , thus for all . Then, and S is decreasing. Assume by contradiction that, for all , we have . Since I satisfies (5), it follows that on and then, on . We thus infer that for , and thus as , which is a contradiction.
Let us now show (ii). For all , we denote by the control equal to on . Thanks to the same argument by contradiction as above, one has . Let us denote by the solution to System (5) associated to . Lemma 4.1 shows that the function is conserved, and we infer that solves the equation
Using the expression of ,
Since is bijective on , we deduce that . It follows that for small enough, there exists such that for each , . By using a Gronwall lemma, one infers that as . Then, for all , there exists , such that and . Consider and let us denote by the solution to System (5) associated to . By continuity of ,
Thus .
Let us finally prove (iii). We show that is strictly decreasing. Let such that and and the solutions to System (5) associated to and , respectively. Using Lemma 4.1, is strictly increasing, hence
Thanks to the equations satisfied by and , one has , . Since in strictly decreasing on , we deduce that . This concludes the proof since .
Proof of Theorem 2.2
Solving involves the resolution of an ODE system on an infinite horizon, and it is quite convenient to consider an equivalent version of this problem involving an ODE system on a bounded horizon. A key point for this is that, according to Lemma 4.2, solves Eq. (12). Furthermore, since the mapping is decreasing, maximizing is equivalent to minimize . Combining all these observations yields that the optimal control problem is equivalent to the following problem, investigated hereafter: where
13 |
and (S, I) solves the controlled system (5) associated to the control .
Proof of Theorem 2.2
For better readability, the proof of Theorem 2.2 is decomposed into several steps.
Step 1: existence of an optimal control We will prove the existence of an optimal control for the equivalent problem . Let be a maximizing sequence for Problem . Since is uniformly bounded, we may extract a subsequence still denoted with a slight abuse of notation, converging toward for the weak-star topology of . It is moreover standard that is closed for this topology and therefore, belongs to . For , let us denote the solution to the SIR model (5) associated to . A straightforward application of the Cauchy–Lipschitz theorem yields that is uniformly bounded. By applying Ascoli’s theorem, we may extract a subsequence still denoted that converges toward in . As usually, we consider an equivalent formulation of System (5), where can be seen as the unique fixed point of an integral operator. We then pass to the limit and show that solves the same equation where u has been replaced by . By combining all these facts with the continuity of , we then infer that converges up to a subsequence to , which gives the existence.
Step 2: optimality conditions and bang-bang property We will again establish these properties for the equivalent problem . In what follows, for the sake of simplicity, we will consider and denote by u a solution to Problem and by (S, I), the associated pair solving System (5). Observe first that integrating (11) in Lemma 4.1, one has
It is standard to write the first-order optimality conditions for such kind of optimal control problem. To this aim, we use the so-called Pontryagin maximum principle (see, e.g., [31]) and introduce the Hamiltonian defined on by
There exists an absolutely continuous mapping called adjoint vector such that the extremal satisfies a.e. in [0, T]:
Adjoint equations and transversality conditions:
14a |
14b |
Maximization condition: for a.e. , u(t) solves the problem
and therefore, by using that I is nonnegative on [0, T], one has
15 |
where w denotes the Lipschitz-continuous switching function given by .
By using (5), one computes
We will now prove that the optimal control can be written as defined in (7), for some . From (14b), one has , and thus . According to (15) and by continuity of w, this implies on a certain maximal interval , for some , by continuity of w. By inserting the relation holding on in Eq. (14a) satisfied by , we deduce that on . Since , the Gronwall lemma yields
16 |
Then, either , in which case, (see (7)) or . Let us now address this latter case. Since the interval is maximal by assumption and w is continuous, one has necessarily . On the other hand, S and are continuous, with according to (16). Consequently, for any in the nonempty open interval , there exists a neighborhood of on which
Since , this implies that w is strictly increasing in . Therefore, there exists a maximal open interval with on which , and therefore on which . As a consequence, the left derivative of w at exists and reads
17 |
We will in fact show that . In other words, the control u can be written as (7).
To this aim, let us assume by contradiction that there exists such that and on . Observing that w is differentiable on and using Rolle’s theorem yields the existence of such that .
Note that, according to (14a), one has , a.e. in (0, T). Since , one has also . Using the fact that on , this means that the point is a steady-state of the system
According to the Cauchy–Lipschitz theorem, we infer that is constant on and therefore, , which is in contradiction with (17). As a conclusion, , and u can be written as (7).
Step 3: monotonicity of Let and . It is straightforward that , and then . It follows that the map is nonincreasing.
Let us show that the map is nondecreasing. Let , , and denote by the control realizing the maximum . Since in , one has . Thus
Step 4: uniqueness of the optimal control The demonstration of the uniqueness of the optimal control is achieved in the proof of Theorem 2.3 below, by demonstrating the uniqueness of the optimal switching point . Except this point, the demonstration of Theorem 2.2 is now complete.
Proof of Theorem 2.3
The proof is decomposed into several steps. We assume first that , the case is considered in the last step. In the whole proof, one deals with control functions as defined in formula (7).
Step 1: necessary first order optimality conditions on Let be an optimal control for problem , with be the associated optimal switching time. Let us introduce the criterion j given by
18 |
where is the solution corresponding to the control , as previously defined. For the sake of simplicity, we omit these subscripts in the sequel. By Theorem 2.2, it is equivalent to minimize J defined in (13) and to minimize j. By using Lemma 4.2, one has
19 |
where . According to (19), we infer that S solves the system
20a |
20b |
with the initial data . Using (19), one gets
so that the cost function reads
The next lemma allows to compute the derivative of j with respect to . For the sake of clarity, its proof is postponed to the end of this section.
Lemma 4.3
For all , the derivative3 and of the function and with respect to , in other words and , are given by
and
Thanks to this result, we may compute
By noting that
we have for the function defined in (8):
21 |
We deduce that is equivalent to
22 |
Step 2: Zeros of and uniqueness of the optimal switching time According to (5), one has for any , . Then, using the expression of given in (21), it follows that
Introducing defined by , the last expression writes simply
Differentiating this identity with respect to yields
As a consequence of Lemma 4.3, both terms in the previous formula are negative, and . The function is thus decreasing on (0, T). Moreover, . Therefore, if , then (22), or equivalently , has no solution, and thus . Conversely, if , then (22) admits a unique solution which is the unique critical point of j. In particular, in the case , one has .
We also deduce that the function j is nonincreasing on and increasing on .
Step 3: implies We consider now the particular case where , and show that in this case . As S is decreasing, one then has for any , whatever the input control. Thus, for any , one has , and
appears as a sum of three nonpositive terms. Therefore, , and we conclude that if .
Step 4: implies , that is To prove this estimate on , consider an optimal trajectory for which the switching point verifies and . By continuity, there exists a time instant such that . The point pertains to the optimal trajectory of the initial optimal control problem.
Consider now the optimal control problem defined by the same cost function, but with initial condition and on a time horizon of length . The cost that is considered is the supremum of the limits of S among every admissible control inputs, so the optimal value for the second problem (on horizon ) is equal to the optimal value for the initial problem (on horizon T); and the optimal control for the former problem is indeed the restriction to of the optimal control for the latter problem. As a matter of case, if this was not the case, then concatenating the restriction to of the optimal solution of the problem on horizon T, with the optimal solution of the problem on horizon , would lead to a larger limit of S at infinity.
Now observe that the optimal solution of the problem on takes on the value 1 on , and then on . Therefore, it presents a commutation at time , while the initial state value fulfills . But it was shown in Step 3 that such a situation is impossible. As a conclusion, if , then . The inequality on itself is deduced from the fact that S is decreasing.
Step 5: The case Let us finally deal with the case . Using the fact that S is constant on , we deduce that and for all . A commutation occurs at if, and only if,
23 |
The function S is nonincreasing, thus there exists satisfying this relation only if which is equivalent to . If this is the case, then, since is nonincreasing and is increasing, there exists a unique satisfying the relation (23). We also remark that (23) is equivalent to (22).
To achieve the proof of Theorem 2.3, it now remains to prove Lemma 4.3.
Proof of Lemma 4.3
Using the notation previously defined, one has (see (20b)) on
and, at , is defined thanks to (20a) by
24 |
By differentiating (24) with respect to , one infers
which is the second identity in Lemma 4.3.
Furthermore, using (20b), one has
Differentiating this relation with respect to yields for all
Let us simplify this latter identity. Observe first that, because of (19), one has for all
By using at the same time the change of variable and the identity holding true for any , we infer that
Combining all these facts leads to, for all ,
Therefore, we arrive at, for all ,
which is the first identity of the statement. This achieves the proof of Lemma 4.3, and consequently of Theorem 2.3.
Proof of Theorem 2.4
Let be the minimal time associated to the Minimal time problem. Let be the unique solution of Problem associated to . Assume by contradiction that does not solve the minimal time problem, i.e., . Then for each one has
which is in contradiction with the fact that solves the minimal time problem.
Conversely, let and realizing the maximum of , i.e., . Let be the minimal time of intervention such that for some . Since , we necessarily have . By definition of , there exists such that . Consider . One has and
Hence, by definition of , one has . But v does not have the form of the minimal solution in Theorem 2.2, unless .
Conclusion
Optimal reduction of epidemic final size by social distancing of given maximal duration and intensity has been considered in this paper. This issue amounts to stopping the disease as close as possible after crossing the herd immunity threshold. We first established that stopping arbitrarily close to this value through long enough intervention is possible only if the social distancing intensity is sufficiently intense. We also established the existence and uniqueness for the solution of the considered optimal control problem, which is bang-bang with a unique commutation from the nominal value to the minimal allowed value of the transmission rate. This property gives rise to an efficient numerical algorithm to solve the optimization problem, which is exemplified in the text. As a last result, it has been shown that this problem may be interpreted as equivalent to reaching a given distance to the herd immunity level by minimal intervention time. To the best of our knowledge, these contributions did not appear before.
We stress once again that the optimal control problem considered here is mainly a prototypal one, not aimed at describing exhaustively human epidemics. Ignoring many important features, it provides baseline to consider real situations, thanks to its reduced complexity. In particular, it is certainly not relevant to ignore the limitation of the hospital capacity. In first approximation, the occupancy of hospital beds is proportional to the number of infected individuals, so this effect may be introduced as a constraint. This will be the main topic of future work.
Acknowledgements
The authors express their sincere acknowledgment to working groups Maths4Covid19 and OptimCovid19 for fruitful discussions and in particular their colleagues Luis Almeida (CNRS UMR 7598, LJLL, France), Emmanuel Franck (INRIA Grand-Est and IRMA Strasbourg, France), Sidi-Mahmoud Kaber (Sorbonne Université, LJLL, France), Grégoire Nadin (CNRS UMR 7598, LJLL, France) and Benoît Perthame (Sorbonne Université, LJLL, France).
Appendix A-Implementation Issues
We provide here an insight of the numerical methods used in Sect. 3. The codes are available on:
https://github.com/michelduprez/optimal-immunity-control.git.
A.1. Solving Problem by a Direct Approach
In order to check the consistency of the results of Theorem 2.3, we solved directly the optimal control problem . Our approach rests upon the use of gradient like algorithms, which necessitates the computation of the differential of in an admissible direction4h. According to the proof of Theorem 2.2 (see Sect. 4.3), this differential reads
where denotes the adjoint state, solving the backward adjoint system (14a)–(14b). Thanks to this expression of , we deduce a simple projected gradient algorithm to solve numerically the optimal control problem , then . The algorithm is described in Algorithm 1. The projection operator is given by
A.2. Solving Problem Thanks to the Theoretical Results
Taking advantage of the theoretical results given in Theorem 2.3, we then considered a simpler algorithm, based on the solution of Problem by bisection method. This method is described in Algorithm 2.
All computations shown in Sect. 3 indicate, as expected, that the optimal trajectories computed by the two methods do coincide.
For illustrative purpose, we provide in Fig. 8 the graph of the cost defined in (18) that corresponds to the one-dimensional optimization Problem . As can be seen, the cost is not convex.
Footnotes
In other words, they only take a.e. two different values.
Other valuable contributions are to be found in [26], including the consideration of added integral term accounting for control cost, and hospital overflow minimization.
To avoid any misunderstanding about the differentiability of with respect to , let us make the use of precise. This function stands for the derivative of the function , where is defined as the unique solution to (20b) on , where is defined as the value at of the unique solution to (20a). Defined in this way, the differentiability of this mapping is standard.
More precisely, we call “admissible direction” any element of the tangent cone to the set at u. The cone is the set of functions such that, for any sequence of positive real numbers decreasing to 0, there exists a sequence of functions converging to h as , and for every .
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Pierre-Alexandre Bliman, Email: pierre-alexandre.bliman@inria.fr.
Michel Duprez, Email: michel.duprez@inria.fr.
Yannick Privat, Email: yannick.privat@unistra.fr.
Nicolas Vauchelet, Email: vauchelet@math.univ-paris13.fr.
References
- 1.Abakuks A. An optimal isolation policy for an epidemic. J. Appl. Probab. 1973;10(2):247–262. doi: 10.2307/3212343. [DOI] [Google Scholar]
- 2.Abakuks A. Optimal immunisation policies for epidemics. Adv. Appl. Probab. 1974;6(3):494–511. doi: 10.2307/1426230. [DOI] [Google Scholar]
- 3.Ainseba B, Iannelli M. Optimal screening in structured SIR epidemics. Math. Modell. Nat. Phenom. 2012;7(3):12–27. doi: 10.1051/mmnp/20127302. [DOI] [Google Scholar]
- 4.Alkama M, Elhia M, Rachik Z, Rachik M, Labriji EH. Free terminal time optimal control problem of an SIR epidemic model with vaccination. Int. J. Sci. Res. 2014;3:227. [Google Scholar]
- 5.Greenhalgh, D.: Some results on optimal control applied to epidemics. Math. Biosci. 88(2), 125–158 (1988)
- 6.Andreasen V. The final size of an epidemic and its relation to the basic reproduction number. Bull. Math. Biol. 2011;73(10):2305–2321. doi: 10.1007/s11538-010-9623-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gaff, H., Schaefer, E.: Optimal control applied to vaccination and treatment strategies for various epidemiological models. Math. Biosci. Eng. 6(3), 469 (2009) [DOI] [PubMed]
- 8.Behncke H. Optimal control of deterministic epidemics. Optim. Control Appl. Methods. 2000;21(6):269–285. doi: 10.1002/oca.678. [DOI] [Google Scholar]
- 9.Bliman PA, Duprez M. How best can finite-time social distancing reduce epidemic final size? J. Theor. Biol. 2020;511:110557. doi: 10.1016/j.jtbi.2020.110557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Bolzoni L, Bonacini E, Soresina C, Groppi M. Time-optimal control strategies in SIR epidemic models. Math. Biosci. 2017;292:86–96. doi: 10.1016/j.mbs.2017.07.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bolzoni L, Bonacini E, Marca RD, Groppi M. Optimal control of epidemic size and duration with limited resources. Math. Biosci. 2019;315:108232. doi: 10.1016/j.mbs.2019.108232. [DOI] [PubMed] [Google Scholar]
- 12.Buonomo B, Manfredi P, d’Onofrio A. Optimal time-profiles of public health intervention to shape voluntary vaccination for childhood diseases. J. Math. Biol. 2019;78(4):1089–1113. doi: 10.1007/s00285-018-1303-1. [DOI] [PubMed] [Google Scholar]
- 13.Buonomo B, Della Marca R, d’Onofrio A. Optimal public health intervention in a behavioural vaccination model: the interplay between seasonality, behaviour and latency period. Math. Med. Biol. A J. IMA. 2019;36(3):297–324. doi: 10.1093/imammb/dqy011. [DOI] [PubMed] [Google Scholar]
- 14.Di Blasio G. A synthesis problem for the optimal control of epidemics. Numer. Funct. Anal. Optim. 1980;2(5):347–359. doi: 10.1080/01630568008816063. [DOI] [Google Scholar]
- 15.Shim, E.: Optimal dengue vaccination strategies of seropositive individuals. Math. Biosci. Eng. 16(3), 1171–1189 (2019) [DOI] [PubMed]
- 16.Gaff H, Schaefer E. Optimal control applied to vaccination and treatment strategies for various epidemiological models. Math. Biosci. Eng. 2009;6(3):469. doi: 10.3934/mbe.2009.6.469. [DOI] [PubMed] [Google Scholar]
- 17.Greenhalgh D. Some results on optimal control applied to epidemics. Math. Biosci. 1988;88(2):125–158. doi: 10.1016/0025-5564(88)90040-5. [DOI] [Google Scholar]
- 18.Hansen E, Day T. Optimal control of epidemics with limited resources. J. Math. Biol. 2011;62(3):423–451. doi: 10.1007/s00285-010-0341-0. [DOI] [PubMed] [Google Scholar]
- 19.Hollingsworth TD, Klinkenberg D, Heesterbeek H, Anderson RM. Mitigation strategies for pandemic influenza A: balancing conflicting policy objectives. PLoS Comput. Biol. 2011;7(2):e1001076. doi: 10.1371/journal.pcbi.1001076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Hu Q, Zou X. Optimal vaccination strategies for an influenza epidemic model. J. Biol. Syst. 2013;21(04):1340006. doi: 10.1142/S0218339013400068. [DOI] [Google Scholar]
- 21.Morris, D.H., Rossine, F.W., Plotkin, J.B., Levin, S.A.: Optimal, near-optimal, and robust epidemic control. arXiv preprint arXiv:2004.02209 (2020)
- 22.Jaberi-Douraki M, Moghadas SM. Optimality of a time-dependent treatment profile during an epidemic. J. Biol. Dyn. 2013;7(1):133–147. doi: 10.1080/17513758.2013.816377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Jaberi-Douraki M, Heffernan JM, Wu J, Moghadas SM. Optimal treatment profile during an influenza epidemic. Differ. Equ. Dyn. Syst. 2013;21(3):237–252. doi: 10.1007/s12591-012-0149-z. [DOI] [Google Scholar]
- 24.Katriel G. The size of epidemics in populations with heterogeneous susceptibility. J. Math. Biol. 2012;65(2):237–262. doi: 10.1007/s00285-011-0460-2. [DOI] [PubMed] [Google Scholar]
- 25.Kermack WO, McKendrick AG. Contributions to the mathematical theory of epidemics-I. Proc. R. Soc. 1927;115A:700–721. doi: 10.1007/BF02464423. [DOI] [PubMed] [Google Scholar]
- 26.Jaberi-Douraki, M., Moghadas, S.M.: Optimality of a time-dependent treatment profile during an epidemic. J. Biol. Dyn. 7(1), 133–147 (2013) [DOI] [PMC free article] [PubMed]
- 27.Jaberi-Douraki, M., Heffernan, J.M., Wu, J., Moghadas, S.M.: Optimal treatment profile during an influenza epidemic. Differ. Equ. Dyn. Syst. 21(3), 237–252 (2013)
- 28.Kolesin ID, Zhitkova EM. Optimization of immunocorrection of collective immunity. Autom. Remote Control. 2016;77(6):1031–1040. doi: 10.1134/S0005117916060072. [DOI] [Google Scholar]
- 29.Kolesin, I.D., Zhitkova, E.M.: Optimization of immunocorrection of collective immunity. Autom. Remote Control 77(6), 1031–1040 (2016)
- 30.Laguzet L, Turinici G. Globally optimal vaccination policies in the SIR model: smoothness of the value function and uniqueness of the optimal strategies. Math. Biosci. 2015;263:180–197. doi: 10.1016/j.mbs.2015.03.002. [DOI] [PubMed] [Google Scholar]
- 31.Lee EB, Markus L. Foundations of Optimal Control Theory. New York: Wiley; 1967. [Google Scholar]
- 32.Ma J, Earn DJ. Generality of the final size formula for an epidemic of a newly invading infectious disease. Bull. Math. Biol. 2006;68(3):679–702. doi: 10.1007/s11538-005-9047-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Manfredi P, D’Onofrio A. Modeling the Interplay Between Human Behavior and the Spread of Infectious Diseases. Berlin: Springer; 2013. [Google Scholar]
- 34.Angulo, M.T., Castaños, F., Velasco-Hernandez, J.X., Moreno, J.A.: A simple criterion to design optimal nonpharmaceutical interventions for epidemic outbreaks. medRxiv (2020) [DOI] [PMC free article] [PubMed]
- 35.Miller JC. A note on the derivation of epidemic final sizes. Bull. Math. Biol. 2012;74(9):2125–2141. doi: 10.1007/s11538-012-9749-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kermack, W.O., McKendrick, A.G.: Contributions to the mathematical theory of epidemics-I. Proc. R. Soc. 115A, 700–721 (1927) [DOI] [PubMed]
- 37.Morton R, Wickwire KH. On the optimal control of a deterministic epidemic. Adv. Appl. Probab. 1974;6(4):622–635. doi: 10.2307/1426183. [DOI] [Google Scholar]
- 38.Piguillem F, Shi L. The optimal COVID-19 quarantine and testing policies. Einaudi Institute for Economics and Finance (EIEF): Tech. Rep; 2020. [Google Scholar]
- 39.Salje, H., Kiem, C.T., Lefrancq, N., Courtejoie, N., Bosetti, P., Paireau, J., Andronico, A., Hozé, N., Richet, J., Dubost, C.L., et al.: Estimating the burden of SARS-CoV-2 in France. Science (2020) [DOI] [PMC free article] [PubMed]
- 40.Shim E. Optimal dengue vaccination strategies of seropositive individuals. Math. Biosci. Eng. 2019;16(3):1171–1189. doi: 10.3934/mbe.2019056. [DOI] [PubMed] [Google Scholar]
- 41.Wickwire KH. Optimal isolation policies for deterministic and stochastic epidemics. Math. Biosci. 1975;26(3–4):325–346. doi: 10.1016/0025-5564(75)90020-6. [DOI] [Google Scholar]
- 42.Wickwire K. Optimal immunization rules for an epidemic with recovery. J. Optim. Theory Appl. 1979;27(4):549–570. doi: 10.1007/BF00933440. [DOI] [Google Scholar]
- 43.Yang, K., Wang, E., Zhou, Y., Zhou, K.: Optimal vaccination policy and cost analysis for epidemic control in resource-limited settings. Kybernetes (2015)
- 44.Zhou Y, Wu J, Wu M. Optimal isolation strategies of emerging infectious diseases with limited resources. Math. Biosci. Eng. MBE. 2013;10:1691–1701. doi: 10.3934/mbe.2013.10.1691. [DOI] [PubMed] [Google Scholar]