Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2020 Dec 8;93:102453. doi: 10.1016/j.jmateco.2020.102453

Taming the spread of an epidemic by lockdown policies

Salvatore Federico a,, Giorgio Ferrari b
PMCID: PMC7728404  PMID: 33324027

Abstract

We study the problem of a policymaker who aims at taming the spread of an epidemic while minimizing its associated social costs. The main feature of our model lies in the fact that the disease’s transmission rate is a diffusive stochastic process whose trend can be adjusted via costly confinement policies. We provide a complete theoretical analysis, as well as numerical experiments illustrating the structure of the optimal lockdown policy. In all our experiments the latter is characterized by three distinct periods: the epidemic is first let to freely evolve, then vigorously tamed, and finally a less stringent containment should be adopted. Moreover, the optimal containment policy is such that the product “reproduction number × percentage of susceptible” is kept after a certain date strictly below the critical level of one, although the reproduction number is let to oscillate above one in the last more relaxed phase of lockdown. Finally, an increase in the fluctuations of the transmission rate is shown to give rise to an earlier beginning of the optimal lockdown policy, which is also diluted over a longer period of time.

Keywords: SIR model, Optimal stochastic control, Viscosity solution, Epidemic, Lockdown

1. Introduction

During the current Covid-19 pandemic, policymakers are dealing with the trade-off between safeguarding public health and damming the negative economic impact of severe lockdowns. The fight against the virus is made especially hard by the absence of a vaccination and the consequent random horizon of any policy, as well as by the extraordinariness of the event. In particular, the lack of data from the past, the difficulty of rapidly and accurately tracking infected, and super-spreading events such as mass gatherings, give rise to a random behavior of the transmission rate/reproduction number of the virus (see, e.g., Hotz et al., 2020 1 ). In this paper we propose and study a model for the optimal containment of infections due to an epidemic in which both the time horizon and the transmission rate of the disease are stochastic.

In the last months, the scientific literature experienced an explosion in the number of works where the statistical analysis and the mathematical modeling of epidemic models is considered, as well as the economic and social impact of lockdown policies is investigated. A large bunch of papers provides numerical studies related to the Covid-19 epidemics in the setting of classical epidemic models or of generalization of them. Among many others, we refer to Alvarez et al. (2020), that studies numerically optimal containment policies in the context of a Susceptible–Infected–Recovered (SIR) model (cf. Kermack and McKendrick, 1927); Kantner (2020) which also allows for seasonal effects; Toda (2020), which estimates the transmission rate in various countries for a SIR model with given and fixed transmission rate; Aspri et al. (2020), which combines a careful numerical study with an elegant theoretical study of optimal lockdown policies in the SEAIRD (susceptible (S), exposed (E), asymptomatic (A), infected (I), recovered (R), deceased (D)) model; Bayraktar et al. (2020), where a detailed numerical analysis is developed for a SIR model of the Covid-19 pandemic in which herd immunity, behavior-dependent transmission rates, remote workers, and indirect externalities of lockdown are explicitly considered; Acemoglu et al. (2020), where – in the context of a multi-group SIR model – it is investigated the effect of lockdown policies which are targeted to different social groups (especially, the “young”, the “middle-aged” and the “old”); Gollier (2020), in which a multi-risk SIR model with heterogeneous citizens is calibrated on the Covid-19 pandemic in order to study the impact on incomes and mortality of age-specific confinements and Polymerase chain reaction (PCR) tests; Favero et al. (2020), which calibrates and tests a SEIRD model (susceptible (S), exposed (E), infected (I), recovered (R), deceased (D)) of the spread of Covid-19 in an heterogeneous economy where different age and sectors are related to distinct risks.

A theoretical study of the optimal confinement policies in epidemic models is usually challenging because of the nonlinear structure of the underlying dynamical system. The first results on a control-theoretic approach to confinement policies are perhaps those presented in Chapter 4 of Behncke (2000), where it is shown that the optimal policy depends only on the shadow price difference between infected and susceptible. In the context of an optimal timing problem, Huberts and Thijssen (2020) use a continuous-time Markov chain model to study the value and optimal exercise decision of two (sequential) options: the option to intervene on the epidemic and, after intervention has started, the option to end the containment policies. Control-theoretic analysis is also presented recently (Kruse and Strack, 2020, Miclo et al., 2020). In Miclo et al. (2020) the authors study a deterministic SIR model in which the social planner acts in order to keep the transmission rate below its natural level with the ultimate aim not to overwhelm the national health-care system. The minimization of a social cost functional is instead considered in Kruse and Strack (2020), in the context of a deterministic SIR model over a finite time-horizon. The resulting control problem is tackled via the Pontryagin maximum principle and then a thorough numerical illustration is also provided.

Inspired by the deterministic problems of Kruse and Strack (2020) and Miclo et al. (2020) (see also Acemoglu et al., 2020, Alvarez et al., 2020, among others), and motivated by the need of incorporating random fluctuations in the disease’s transmission rate, in this paper we consider a stochastic control-theoretic version of the classical SIR model of Kermack and McKendrick (1927). A population with finite size is divided into three different groups: healthy people who are susceptible to the disease, infected individuals, and people who have recovered (and are not anymore susceptible) or dead. However, differently to the classical SIR model, we suppose that disease’s transmission rate is time-dependent and stochastic. In particular, it evolves as a general diffusion process whose trend can be adjusted by a social planner through policies like social restrictions and lockdowns. The randomness in the transmission rate is modeled by a Wiener process representing all those factors affecting the transmission rate and that are not under the direct control of the regulator. The social planner faces the trade-off between the expected social and economic costs (e.g., drops in the gross domestic product) arising from severe restrictions and the expected costs induced by the number of infections that – if uncontrolled – might strongly impact the national health-care system and, more in general, on the social well-being. The social planner aims at minimizing those total expected costs up to the time at which a vaccination against the disease is discovered. In our model, such a time is also random and independent of the Wiener process.

We provide a complete theoretical study of our model by showing that the minimal cost function (value function) is a classical twice-continuously differentiable solution to its corresponding Hamilton–Jacobi–Bellman (HJB) equation, and by identifying an optimal control in feedback form.2 From a technical point of view, the main difference between the models in Acemoglu et al., 2020, Alvarez et al., 2020, Bayraktar et al., 2020, Kruse and Strack, 2020 and Miclo et al. (2020) and ours, is that we deal with a stochastic version of the SIR model, instead of a deterministic one. As a matter of fact, in the aforementioned works the transmission rate is a deterministic control variable, while it is a controlled stochastic state variable in our paper. Moreover, our formulation is also different from that of other stochastic SIR models where the random transmission rate is chosen in such a way that only the levels of infected and susceptible people become affected by noise, with the transmission rate itself not being a state variable (see, e.g., Jiang and Yu, 2011, Tornatore et al., 2005 and references therein). To the best of our knowledge, ours is the first work considering the transmission rate as a diffusive stochastic state variable and providing the complete theoretical analysis of the resulting control problem.

In addition to its theoretical value, the determination of an optimal control in feedback form allows us to perform numerical experiments aiming at showing some implications of our model. For the numerical analysis we specialize the dynamics of the transmission rate, that we take to be mean-reverting and bounded between 0 and some γ>0 (cf. (4.1)). In this case study, the containment policies employed by the social planner have the effect of modifying the long-run mean of the transmission rate, towards which the process converges at an exponential rate. Moreover, we take a separable social cost function (cf. (4.2)). This is quadratic both in the regulator’s effort and in the percentage of infected people.

An interesting effect which is in fact common to all our numerical experiments is that the optimal lockdown policy is characterized by three distinct periods. In the first phase it is optimal to let the epidemic freely evolve, then the social restrictions should be stringent, and finally should be gradually relaxed in a third period. We also investigate which is the effect of the maximal level L of allowed containment measures (i.e., the lockdown policy can take values in [0,L]) on the final percentage of recovered, which in fact turns out to be decreasing with respect to L. This then suggests that the case L=1 – which leads in a shorter period to the definitive containment of the disease with the smallest percentage of final recovered — might be thought of as optimal in the trade-off between social costs and final number of recovered.

We observe that if the epidemic spread is left uncontrolled, then its reproduction number (Rt)t fluctuates around 1.8 and the final percentage of recovered (i.e. the total percentage of infected during the disease) is approximately 72% of the society after circa 7 months (in all our simulations the initial infected were 1% of the population). On the other hand, when L=1, under the optimal policy we have a relative reduction of circa 30% of the total percentage of recovered individuals, and the reproduction number drops below 0.6 in the period of severe lockdown (circa 60 days). Moreover, the optimal containment is such that the so-called “herd immunity” is reached as the product RtSt (reproduction number × percentage of susceptible) becomes strictly smaller than the critical level of one, even if Rt oscillates at around 1.7 in the last more relaxed phase of lockdown. Finally, we observe that an increase of the fluctuations of the transmission rate β have the effect of anticipating the beginning of the lockdown policies, of diluting the actions over a longer period, and of keeping a larger level of containment in the long run. This can be explained by thinking that an higher uncertainty in the transmission rate induces the policymaker to act earlier and over a longer period in order to prevent positive larger shocks of β.

The rest of the paper is organized as follows: In Section 2.1 we set up the model and the social planner problem. In Section 3 we develop the control-theoretic analysis and provide the regularity of the minimal cost function and an optimal control in feedback form. In Section 4 we present our numerical examples, while concluding remarks are made in Section 5. Finally, the Appendix collects the proof of some technical results needed in Section 3.

2. Problem formulation

2.1. The stochastic controlled SIR model

We model the spread of the infection by relying on a generalization of the classical SIR model that dates back to the work by Kermack and McKendrick (1927). The society has population N and it consists of three different groups. The first group is formed by those people who are healthy, but susceptible to the disease; the second group contains those who are infected, while the last cohort consists of those who are recovered or dead. In line with the classical SIR model, we assume that, once recovered, an individual stays healthy for ever. We denote by St the percentage of individuals who are susceptible at time t0, by It the percentage of infected, and by Rt the fraction of recovered or dead. Clearly, St+It+Rt=1 for all t0.

The fraction of infected people grows at a rate which is proportional to the fraction of society that it is still susceptible to the disease. In particular, letting βt be the instantaneous transmission rate of the disease, during an infinitesimal interval of time dt, each infected individual generates βtSt new infected individuals. It thus follows that the percentage of healthy individuals that get infected within dt units of time is ItβtSt.

Notice that the instantaneous transmission rate βt measures the disease’s rate of infection, as well as the average number of contacts per person per time. In this regard, βt can be thus influenced by a social planner via policies that effectively cap the social interaction, like social distancing and lockdown.

During an infinitesimal interval of time dt, the fraction of infected is reduced by αIt, since infected either recover from the disease, or die because of it at a rate α>0.

According to the previous considerations, the dynamics of St and It can be thus written as

dSt=βtStItdt,t>0,S0=x, (2.1)

and

dIt=(βtStItαIt)dt,t>0,I0=y, (2.2)

where (x,y)(0,1)2 are given initial values such that3 x+y(0,1).

Notice that for any t0, and for any choice of (βt)t we can write

St=xe0tβuIuduandIt=yeαt+0tβuSudu, (2.3)

and therefore St>0 and It>0 for all t0. Moreover, summing up (2.1), (2.2) we have d(St+It)=αIt<0 for all t>0, which then implies that St+It<1 for all t0.

We depart from the classical SIR model by assuming that the transmission rate βt is time-varying, stochastic, and may be controlled. More precisely, we let (Ω,F,F(Ft)t,P) be a complete filtered probability space with filtration F satisfying the usual conditions, and we define on that a one-dimensional Brownian motion (Wt)t. For a given and fixed L0, and for any (ξt)t belonging to

A{ξ:Ω×[0,)[0,L],(ξt)t
Fprogressively measurable},

we assume that the transmission rate evolves according to the stochastic differential equation

dβt=b(βt,ξt)dt+σ(βt)dWt,t>0,β0=z>0. (2.4)

The process (ξt)t influences the trend of the transmission rate and it should be interpreted as any effort devoted by the social planner to the decrease of the transmission rate. In this sense, ξ=0 corresponds to the case of no effort done to decrease the disease, whereas the case ξ=L corresponds to the maximal effort. To fix ideas, ξt may represent a percentage of social/working lockdown at time t and L corresponds to the maximal implementable value of such lockdown (e.g. 60%, etc.). On the other hand, the Brownian motion (Wt)t models any shock affecting the transmission rate and which is not under the control of the social planner.

Regarding the dynamics of (βt)t we make the following standing assumption.

Assumption 2.1

  • (i)

    For every ξA, there exists a unique strong solution to (2.4) and it lies in an open interval I(0,).

  • (ii)
    b:I×[0,L]R is bounded, infinitely many times continuously differentiable with respect to its first argument, and has bounded derivatives of any order; that is, there exists Kb>0 such that
    supnNsup(z,ξ)I×[0,L]|nznb(z,ξ)|Kb.
  • (iii)
    σ:I(0,) is bounded, infinitely many times continuously differentiable with respect to its first argument, and has bounded derivatives of any order; that is, there exists Kσ>0 such that
    supnNsupzI|σ(n)(z)|Kσ.

A reasonable dynamics of the transmission rate (βt)t is the mean-reverting

dβt=ϑ(β^(Lξt)βt)dt+σβt(γβt)dWt,t>0,β0=z(0,γ), (2.5)

for some ϑ,γ,σ>0, β^(0,γ). In this case, in can be shown that 0 and γ are unattainable by the diffusion (βt)t, which then takes values in the interval I=(0,γ) for any t0. The level β^ can be seen as the natural transmission rate of the disease, towards which the transmission rate reverts at rate ϑ when ξ0. Finally, the level γ is the maximal possible transmission rate of the disease, and σ is a measure of the fluctuations of (βt)t around β^. We will employ this dynamics in our numerical illustrations (cf. Section 4). Notice that dynamics (2.5) fulfills all the requirements of Assumption 2.1; this is shown, for the sake of completeness, in Proposition A.3 in the Appendix. Moreover, if ξtL, then the transmission rate defined through (2.5) reaches 0 asymptotically, as its drift is negative and its diffusion coefficient stays bounded. Hence, under the maximal lockdown policy, the disease is asymptotically eradicated.

Remark 2.2

A modeling feature that needs some discussion regards the nature of the control rule in (2.5). In our formulation, the policymaker adjusts ξ continuously over time with the aim of decreasing the trend of the transmission rate. However, motivated by the real-world strategies employed during the Covid-19 crisis, one can very well imagine a model where regulatory constraints are introduced once the reproduction number Rt=βtα becomes larger than a certain value, say R. Within this setting, a natural question would be: which is the optimal R and the optimal size of interventions? A possible answer to this question could be found by proposing a model where the policymaker instantaneously reduces the level of β via lockdown policies and faces proportional and/or fixed costs for its actions. This would gives rise to a singular or impulsive stochastic control problem; see Alvarez (2001)Ferrari (2018) and Belak et al. (2017) and references therein. Given the underlying multi-dimensional setting, we expect that the optimal trigger level R would be a function of the current values of (St,It). However, the proof of such a conjecture would require the thorough study of acomplex (non convex) three-dimensional degenerate singular/impulse stochastic control problem that clearly requires techniques different from those employed in this work.

2.2. The social planner problem

The epidemic generates social costs, that we assume to be increasing with respect to the fraction of the population that is infected. These costs might arise because of lost gross domestic product (GDP) due to inability of working, because of an overstress of the national health-care system etc. The social planner thus employs policies (ξt)t in the form, e.g., of social distancing or lockdown in order to adjust the growth rate of the transmission rate β, with the aim of effectively flattening the curve of the infected percentage of the society. Such actions however come with a cost, which increases with the amplitude of the effort. Assuming that a vaccination against the disease is discovered at a random time τ exponentially distributed with parameter λo>0 and independent of (Wt)t 4 (see also Remark 2.4), the social planner aims at solving

infξAE[0τeδtC(It,ξt)dt]. (2.6)

Here, δ0 measures the social planner’s time preferences, and C:[0,1]×[0,L][0,) is a running cost function measuring the negative impact of the disease on the public health as well as the economic/social costs induced by lockdown policies. The following requirements are satisfied by C.

Assumption 2.3

  • (i)

    (y,ξ)C(y,ξ) is convex and continuous on [0,1]×[0,L].

  • (ii)

    For any y[0,1] we have that ξC(y,ξ) is nondecreasing.

  • (iii)

    For any ξ[0,L] we have that yC(y,ξ) is nondecreasing.

  • (iv)
    There exists K>0 such that for any ξ[0,L] we have that
    |C(y,ξ)C(y,ξ)|K|yy|,(y,y)[0,1]2.
  • (v)
    yC(y,ξ) is semiconcave5 on [0,1], uniformly with respect to ξ[0,L]; that is, there exists K>0 such that for any ξ[0,L] and any μ[0,1] one has
    μC(y,ξ)+(1μ)C(y,ξ)C(μy+(1μ)y,ξ)
    Kμ(1μ)|yy|2,(y,y)[0,1]2.

Without loss of generality, we also take C(0,0)=0. Convexity of yC(y,ξ) captures the fact that the social costs from the disease might be higher if a large share of the population is infected since, for example, the social health-care system is overwhelmed. The fact that ξC(y,ξ) is convex describes that marginal costs of actions are increasing because, e.g., an additional lockdown policy might have a larger impact on an already stressed society. Finally, the Lipschitz and semiconcavity property of C(,ξ) are technical requirements that will be important in the next section.

An application of Fubini’s theorem, employing the independence of τ and (Wt)t, allows to rewrite the problem defined in (2.6) as

infξAE[0τeδtC(It,ξt)dt]=infξAE[0eλtC(It,ξt)dt], (2.7)

where λλo+δ.

Remark 2.4

The assumption that a vaccination against the disease is discovered at an exponential random time τ, independent of (Wt)t, has the technical important effect of leading to a time-homogeneous social planner problem (cf. (2.7)). From a modeling point of view, such a requirement is clearly debatable, as it presupposes that the decision maker does not take into account the scientific progress in the epidemic’s treatment. In order to take care of this, we now propose an alternative more realistic formulation which, however, comes at the cost of substantially increasing the mathematical complexity of the social planner problem.

Suppose that the social planner has full information about the current technological level Qt achieved in the disease’s treatment and assume, for example, that this evolves according to the SDE:

dQt=μ(Qt)dt+η(Qt)dBt,Q0=qR+,

for suitable μ and η, and for a standard Brownian motion (Bt)t independent of (Wt)t. The process (Bt)t models all the exogenous shocks affecting the technological achievements (e.g., new scientific discoveries in related fields), while μ measures the instantaneous trend of the research. Define then a continuous-time Markov chain (Mt)t with two states, 0 and 1, where 0 means that the vaccination is not available and 1 that a treatment has been instead found. We assume that 1 is an absorbing state and that the Markov chain has transition rate from state 0 to state 1 given by (λ(t,Qt))t. Here, λ:R+×R+R+ is such that Λt0tλ(s,Qs)ds<, a.s. for any t0, and it is nondecreasing in its second argument. This latter condition clearly means that the larger the technological level is, the faster the disease is treated.

Within this setting, the problem can be then still be written as

infξAE[0τeδtC(It,ξt)dt],

where (It)t and (ξt)t are as defined above in this section, but

τinf{t0:Mt=1}.

The independence of Q with respect to W then leads to the equivalent formulation

V(x,y,z,q)infξAE[0eδtΛtC(It,ξt)dt],

which defines a four-dimensional stochastic control problem. Clearly, this problem is much more challenging than (2.7) and its analysis, requiring different techniques and results, is left for future research.

Another interesting future work might concern an extension of the previous model in which the social planner can also increase the technological level Q by supporting the research of a vaccination. Assuming that such an investment comes at proportional cost, this problem can be modeled in terms of an intricate stochastic control problem where the transition rate λ of the Markov chain is controlled through a singular control.

In order to tackle Problem (2.7) with techniques from dynamic programming, it is convenient to keep track of the initial values of (St,It,βt)t. We therefore set

O{(x,y,z)R3:(x,y)(0,1)2,x+y<1,zI},

and, when needed, we stress the dependency of (St,It,βt) with respect to (x,y,z)O and ξA by writing (Stx,y,z;ξ,Itx,y,z;ξ,βtz;ξ). Indeed, due to (2.3) and the autonomous nature of (2.4), we have that St and It depend on (x,y,z) and on ξ through βt, while βt depends only on z and directly on ξ. We shall also simply set (Stx,y,z,Itx,y,z,βtz)(Stx,y,z;0,Itx,y,z;0,βtz;0) to denote the solutions to (2.1), (2.2), and (2.4) when ξ0.

Then, for any (x,y,z)O, we introduce the problem’s value function

V(x,y,z)infξAE[0eλtC(Itx,y,z;ξ,ξt)dt]. (2.8)

The latter is well defined given that C is nonnegative. In the next section we will show that V solves the corresponding dynamic programming equation in the classical sense, and we also provide an optimal control in feedback form.

3. The solution to the social planner problem

We introduce the differential operator L acting on functions belonging to the class C1,1,2(R3):

(Lφ)(x,y,z)xyz(φyφx)(x,y,z)αyφy(x,y,z)+12σ2(z)φzz(x,y,z). (3.1)

Next, for any (y,z,p)(0,1)×I×R, define

C(y,z,p)infξ[0,L](C(y,ξ)+b(z,ξ)p), (3.2)

which is continuous on [0,1]×I×R. Indeed, byAssumptions 2.1-(ii) and 2.3-(iv), there exists a constant K¯>0 such that

|C(y,z,p)C(y,z,p)|supξ[0,L](|C(y,ξ)C(y,ξ)|+|b(z,ξ)b(z,ξ)||p|+|b(z,ξ)||pp|)K¯(|yy|+|zz||p|+|pp|).

By the dynamic programming principle, we expect that V should solve (in a suitable sense) the Hamilton–Jacobi–Bellman (HJB) equation

λv(x,y,z)=(Lv)(x,y,z)+C(y,z,vz(x,y,z)),(x,y,z)O. (3.3)

In order to show that V indeed solves (3.3) in the classical sense, we start with the following important preliminary results. Their proofs are standard in the literature of stochastic control (see, e.g., Pham, 2009, Yong and Zhou, 1999), upon employing Assumption 2.1, Assumption 2.3.

Proposition 3.1

There exists K>0 such that, for each q(x,y,z),q(x,y,z)O

  • (i)

    0V(q)Kand|V(q)V(q)|K|qq|; i.e., V is bounded and Lipschitz continuous on O ;

  • (ii)
    for anyμ[0,1]and for someK>0
    μV(q)+(1μ)V(q)V(μq+(1μ)q)Kμ(1μ)|qq|2;
    i.e., V is semiconcave on O .

Moreover, V is a viscosity solution to the HJB equation (3.3) .

Proof

The first claim of (i) above follows from the fact that C is nonnegative and bounded on [0,1]2; the second claim of (i) is due to Proposition 3.1 in Yong and Zhou (1999), whose proof can be easily adapted to our stationary setting. Analogously, the semiconcavity property of (ii) can be obtained by arguing as in Proposition 4.5 of Yong and Zhou (1999). Finally, Theorem 5.2 of Yong and Zhou (1999) (again, easily adapted to our stationary setting) or Proposition 4.3.2-(2) of Pham (2009) leads to the viscosity property. □

The semiconcavity of V, together with the fact that V solves the HJB equation (3.3) in the viscosity sense, yields the following directional regularity result.

Proposition 3.2

Vz exists continuous on O .

Proof

Let (x,y,z)O. By semiconcavity of V, there exists the left and right derivatives of V along the direction z at (x,y,z) that we denote, respectively, by Vz(x,y,z),Vz+(x,y,z). Moreover, again by semiconcavity, we have the inequality Vz(x,y,z)Vz+(x,y,z). Assuming, by contradiction, that V is not differentiable with respect to z at (x,y,z) means assuming that Vz(x,y,z)>Vz+(x,y,z). Hence, we can apply Lemma A.1 in the Appendix and find a sequence of functions (φ~n)nC2(O) such that

φ~n(x¯,y¯,z¯)=V(x¯,y¯,z¯),φ~nVin a neighborhood of(x¯,y¯,z¯),|Dφ~n(x¯,y¯,z¯)|L~<,φ~zzn(x¯,y¯,z¯)n. (3.4)

Then, the viscosity subsolution property of V (cf. Proposition 3.1) yields

λV(x¯,y¯,z¯)(Lφ~n)(x¯,y¯,z¯)+C(y¯,z¯,φ~zn(x¯,y¯,z¯)).

Taking the limit as n and using (3.4) we get a contradiction. We have thus proved that Vz exists at each arbitrary (x,y,z)O.

Now we show that Vz is continuous. Take a sequence (qn)nO such that qnqO, and let ηn=(ηxn,ηyn,ηzn)D+V(qn), the latter being nonempty due to the semiconcavity of V. Since Vz exists at each point of O, we have ηzn=Vz(qn). Since V is semiconcave, the supergradient D+V is locally bounded as a set-valued map, and therefore there exists a subsequence (qnk)k such that ηnkη=(ηx,ηy,ηz). By Cannarsa and Sinestrari (2014, Prop. 3.3.4-(a)), we have ηD+V(q), and again, since Vz exists, we have ηz=Vz(q). Hence, we have proved that from any sequence (qn)nO converging to q, we can extract a subsequence (qnk)kO such that Vz(qnk)Vz(q). By usual arguments on subsequences, the claim follows.  □

We can now prove the main theoretical result of our paper, which ensures that V is actually a classical solution to the HJB equation (3.3). In turn, this provides a way to construct an optimal control in feedback form.6

Theorem 3.3

The following hold:

  • (i)

    VC2(O) and solves the HJB equation (3.3) in the classical sense.

  • (ii)
    Let
    ξ^(x,y,z)argminξ[0,L](C(y,ξ)b(z,ξ)Vz(x,y,z)),(x,y,z)O. (3.5)
    If the system of equations 7
    dSt=βtStItdt,S0=x,dIt=(βtStItαIt)dt,I0=y,dβt=b(βt,ξ^(St,It,βt))dt+σ(βt)dWt,β0=z, (3.6)
    admits a unique strong solution (St,It,βt)t , then the control
    ξtξ^(St,It,βt), (3.7)
    is optimal for (2.8) and (βt)t is the optimally controlled transmission rate; that is,
    V(x,y,z)=E[0eλtC(It,ξt)dt].

Proof

Proof of (i) — Step 1. Recall (3.2) and define F(x,y,z)C(y,z,Vz(x,y,z)). Due to Proposition 3.2 and the continuity of C on (0,1)×I×R, we have that F is continuous on O. Moreover, since C is bounded on [0,1]×[0,L], Vz is bounded on O by Proposition 3.1-(i), and b(,ξ) is bounded (cf. Assumption 2.1-(ii)), there exists K>0 such that

|F(x,y,z)|K,(x,y,z)O. (3.8)

Set now

v(x,y,z)E[0eλtF(Stx,y,z,Itx,y,z,βtz)dt],(x,y,z)O. (3.9)

Although not uniformly elliptic, the differential operator L defined in (3.1) is hypoelliptic, meaning that the so-called Hörmander’s condition is satisfied (cf. the proof of Proposition A.2 in the Appendix and Eq. (A.4) therein). In fact, by Proposition A.2 in the Appendix, for any q(x,y,z)O the (uncontrolled) process (Qtq)t(Stx,y,z,Itx,y,z,βtz) admits a transition density p(t,q,), t>0, which is absolutely continuous with respect to the Lebesgue measure in R3, infinitely many times differentiable, and satisfying the Gaussian estimates (A.2), (A.3). As a consequence, by Fubini’s theorem we can write

v(x,y,z)=0eλt(OF(x,y,z)×p(t,x,y,z;x,y,z)dxdydz)dt,

and recalling (3.8), and applying the dominated convergence theorem, one shows that vC2(O).

For (x,y,z)O, let now τninf{t0:|(Stx,y,z,Itx,y,z,βtz)|n}, nN, and notice that the strong Markov property yields

eλ(tτn)v(Stτnx,y,z,Itτnx,y,z,βtτnz)+0tτnF(Sux,y,z,Iux,y,z,βuz)du
=E[0eλtF(Stx,y,z,Itx,y,z,βtz)dt|Ftτn].

Since vC2(O), we can apply Itô’s formula to the first addend on the left-hand side of the latter, take expectations, observe that the stochastic integral has zero mean (by definition of τn and the fact that vx is continuous), and finally find

E[0tτneλu(Lv+Fλv)(Sux,y,z,Iux,y,z,βuz)du]+v(x,y,z)=E[0eλtF(Stx,y,z,Itx,y,z,βtz)dt]; (3.10)

that is, by (3.9),

E[0tτneλu(Lv+Fλv)(Sux,y,z,Iux,y,z,βuz)du]=0.

Dividing now both left and right-hand sides of the latter by t, invoking the (integral) mean-value theorem, letting t0, and using that t(Stx,y,z,Itx,y,z,βtz) is continuous, we find that v is a classical solution to

λφ=Lφ+FonO. (3.11)

Proof of (i) — Step 2. Let (x,y,z)O, and (Kn)n be an increasing sequence of open bounded subsets of O such that nNKn=O. Defining the stopping time

ρninf{t0:(Stx,y,z,Itx,y,z,βtz)Kn},nN,

we set

v^n(x,y,z)E[0ρnF(Sux,y,z,Iux,y,z,βuz)du+eλρnV(Sρnx,y,z,Iρnx,y,z,βρnz)]. (3.12)

If (x,y,z)Kn, then v^n(x,y,z)=V(x,y,z) as ρn=0 a.s. Take then (x,y,z)Kn. By the same arguments as in Step 1 and considering that V is continuous on Kn, the function v^n is a solution to

λφ=Lφ+F,onKn,φ=VonKn. (3.13)

Since also V is a viscosity solution to the same equation and since uniqueness of viscosity solution holds for such a problem (cf., e.g., Crandall et al., 1992), we have v^n=V on K¯n. Because ρn for n (as the boundary of O is unattainable for (Stx,y,z,Itx,y,z,βtz)), by taking limits as n in (3.12) we find that

V(x,y,z)=limnv^n(x,y,z)=v(x,y,z),(x,y,z)O,

where the last equality follows by dominated convergence upon recalling that V is bounded. But then V=v on O, and therefore VC2(O) and solves (3.11) by Step 1. That is, V is a classical solution to the HJB equation (3.3).

Proof of (ii). The optimality of (3.7) follows by a standard verification theorem based on an application of Itô’s formula and the proved regularity of V (see, e.g., Chapter 3.5 in Pham, 2009). □

4. A case study with numerical illustrations

In this section we illustrate numerically the results of our model, with the aim of providing qualitative properties of the optimal containment policies in a case study.

We use the mean-reverting model for the dynamics of β, i.e. 

dβt=ϑ(β^(Lξt)βt)dt+σβt(γβt)dWt,t>0,β0=z(0,γ), (4.1)

for some L,ϑ,γ,σ>0, β^(0,γ). Notice that such a choice of the dynamics of β fulfills all the requirements of Assumption 2.1 (see Proposition A.3 in the Appendix). Moreover, we assume that the social planner has a quadratic cost function of the form

C(y,ξ)=yy¯2+12ξ2. (4.2)

The latter can be interpreted as a Taylor approximation of any smooth, convex, separable cost function with global minimum in (0,0). In (4.2), y¯(0,1) represents, e.g., the maximal percentage of infected people that the health-care system can handle.

Notice that in this case for any (x,y,z)O one has (cf. (3.5))

ξ^(x,y,z)=L,ifVz(x,y,z)>Lϑβ^,ϑβ^Vz(x,y,z),ifVz(x,y,z)[0,Lϑβ^],0,ifVz(x,y,z)<0. (4.3)

Our numerics is based on a recursion on the nonlinear equation

(λL)v(x,y,z)=C(y,z,vz(x,y,z)),(x,y,z)O,

which is solved by the value function in the classical sense (cf. Theorem 3.3). Namely, starting from v[0]0 we use the recursive algorithm:

(λL)v[n+1]=C(y,z,vz[n]),n1

and those equations are solved by Monte Carlo methods based on the Feynman–Kac formula

v[n+1](x,y,z)
=E[0eλtC(Itx,y,z,βtz,vz[n](Stx,y,z,Itx,y,z,βtz))dt],
(x,y,z)O.

Such an approach is needed because of the lack of appropriate boundary conditions on the HJB equation, as the boundary O is unattainable for the underlying controlled dynamical system.

We take a day as a unit of time. In our experiments we assume that the average length of an infection equals 18 days, so that α=118 (see also Alvarez et al., 2020, Bayraktar et al., 2020, and Kruse and Strack, 2020), the level of the maximal possible transmission rate of the disease is γ=0.16, the natural transmission rate of the disease is β^=0.1, towards which the transmission rate (βt)t reverts at rate ϑ=0.1 when ξ0, σ=1, so that the fluctuations of (βt)t are (at most) of order 10−2. Furthermore, we set λ=1365,8 and we fix y¯=0.1 in (4.2). Finally, in all simulations we assume that at day zero about 1% of the population is infected.

In all the subsequent pictures we show the mean paths of the considered quantities, with their 95% confidence interval. The Monte Carlo average has been performed by employing 6000 independent simulations.

In Section 4.1, we compare the optimal social planner policy with the case of no restrictions; in Section 4.2 we consider strategies in which the containment measures are limited to a fixed percentage L[0,1] and provide a comparison between them; in Section 4.3 we study the effect of the fluctuations of the transmission rate on the problem’s solution.

4.1. The optimal social planner policy

We compare the optimal social planner policy with the case of no restrictions (see Fig. 1). In the optimal social planner policy severe lockdown measures (larger than 40%) are imposed for a period of circa 63 days, starting on day 79; then, it follows a gradual reopening phase. The final percentage of recovered individuals is about 50%, in contrast to 72% which is the total percentage of recovered individuals in the case of no restrictions.

Fig. 1.

Fig. 1

Comparison between the optimal social planner policy (upper panel) and the case of no restrictions (lower panel). The figures in the first column show the (average) evolution of the containment policy through the value of the optimal control ξt; the ones in the second column show the (average) evolution of the instantaneous reproduction number Rt=βtα; the ones in the third column show (average) evolution of the percentage of susceptible (in blue), infected (in red) and recovered (in green) individuals; the ones in the fourth column show the (average) evolution of the product RtSt. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

Furthermore, the cases of optimal lockdown and no lockdown show a substantial difference in the evolution of the reproduction number Rtβtα: in the case of lockdown policies at work, in the most restrictive period, the latter is significantly decreased to values around 0.6. Another relevant quantity to analyze is RtSt. Indeed, recalling (2.2), it is easy to see that the percentage of infected naturally decreases at exponential rate α(RtSt1) if RtSt is maintained strictly below 1. We observe that, under the suboptimal action “no lockdown”, RtSt lies below one from day 85 on. On the other hand, the optimal containment policy is such that RtSt<1 from day 75 on. As a consequence, Rt can let oscillate strictly above one (actually, around 1.7) during the final phase of partial reopening so that the negative impact of lockdowns on the economic growth can be partially dammed.

4.2. The optimal social planner policy with limited containment

In many countries, a vigorous lockdown could not always be feasible, especially for long periods. Further, as pointed out by recent literature (for instance see Aspri et al., 2020), gradual policies of longer duration but more moderate containment exhibit large welfare benefits comparable to the ones obtained by a drastic lockdown. For this reason, we consider a strategy in which the containment measures are limited to a fixed percentage L[0,1]. Notice that L=0.7 in Alvarez et al. (2020), L={0.7,1} in Acemoglu et al. (2020) and Bayraktar et al. (2020). A comparison of the optimal social planner policy with limited containment L{0.2,0.4,0.6,0.8} is shown in Fig. 2 and a summary is contained in Table 1.

Fig. 2.

Fig. 2

Comparison between the optimal social planner policy with limited containment L. The figures in the first row show the evolution of the (average) containment policy through the value of the optimal control ξt; the figures in the second row show the (average) evolution of the instantaneous reproduction number Rt=βtα; the figures in the third row show the evolution of the (average) percentage of susceptible (in blue), infected (in red) and recovered (in green) individuals; the figures in the fourth row show the (average) evolution of the product RtSt. The limited level of containment varies with the columns: the first column treats the case L=0.8, the second column the case L=0.6, the third column the case L=0.4 and the last column the case L=0.2. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)

Table 1.

Optimal social planner policy with different values of limited containment L.

L=0.8 L=0.6 L=0.4 L=0.2
First day of containment 54 54 54 54
Recovered 52% 58% 61% 68%

Clearly, the larger L is, the smaller are the social costs (by definition of the value function). Our experiment shows that for L=0.4,0.6,0.8,1, the final percentage of recovered (hence of the total amount of infected) in average ranges from 52% (case L=0.8) up to 68% (case L=0.2). In all the cases, the optimal containment starts at the maximal rate and the first day of containment is substantially the same (around day 54).

Different ceilings L on the containment strategies also affect the values and the fluctuations’ size of the reproduction number βtα: smaller values of L correspond to milder variation of the reproduction number Rt of size 0.3, whereas larger values of L lead to rapid changes of Rt which reaches levels smaller than 1 (less than 0.8 for L=0.8 and less than 0.6 for L=1). In all the cases, RtSt lies strictly below 1 after a certain date, which is decreasing with respect to L (see the last column in Fig. 1, Fig. 2). Notice that without any containment policies, RtSt decreases on time due to a natural “herd-immunity” effect. On the other, when lockdowns are in place, we observe a faster decrease of RtSt which is forced by the initial vigorous policymaker’s actions. The final relaxation of the latter then allows for an increase of RtSt, which is however constrained below the critical level of 1. Such an effect is monotone decreasing with respect to L.

4.3. The role of uncertainty

The main new feature of our model is to consider a (controlled) stochastic transmission rate in the framework of the classical SIR model. In this section we study numerically how an increase of the fluctuations of the transmission rate affects the optimal solution. In particular, in Fig. 3 the volatility σ takes values 1, 5 and 10, thus leading to fluctuations of β of order 10−2, 5×102, and 10−1, respectively (indeed, recall that σ(β)=σβ(γβ) attains its maximum at γ2 and γ=0.16).

Fig. 3.

Fig. 3

Comparison between the optimal social planner policy with different σ, when L=1. The figures in the first row show the evolution of the (average) containment policy through the value of the optimal control ξt; the figures in the second row show the evolution of the (average) percentage of susceptible (in blue), infected (in red) and recovered (in green) individuals. The level of σ varies with the columns: the first column treats the case σ=1, the second column the case σ=5, the third column the case σ=10. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)

We observe from Fig. 3 that larger fluctuations of β have the effect of anticipating the beginning of the lockdown policies, and of diluting the actions over a longer period. Indeed, when σ=5 and σ=10, the optimal lockdown policy starts around day 46 and 42, respectively, in contrast to day 54 of the case σ=1. Moreover, when σ increases, the maximal employed lockdown intensity reduces and the level of containment stabilizes at a larger value in the long run. This can be explained by thinking that an increase in the fluctuations of the transmission rate induces the policymaker to act earlier and over a longer period in order to prevent positive large shocks of β. However, in order to dam the social costs resulting from a longer period of restrictions, the maximal intensity of the lockdown policy should be reduced.

Moreover, such a spread of the optimal lockdown policy gives rise to an increase of the final percentage of recovered (which is circa 58% and 60% when σ=5 and σ=10, respectively, and circa 50% when σ=1).

5. Conclusions

We have studied the problem of a policymaker which during an epidemic is challenged to optimally balance the safeguard of public health and the negative economic impact of severe lockdowns. The policymaker can implement containment policies in order to reduce the trend of the disease’s transmission rate, which evolves stochastically in continuous time. In the context of the SIR model, our theoretical analysis allows to identify the minimal social cost function as a classical solution to the corresponding dynamic programming equation, as well as to provide an optimal control in feedback form.

In a case study in which the transmission rate is a (controlled) mean-reverting diffusion process, numerical experiments show that the optimal lockdown policy is characterized by three distinct phases: the epidemic is first let freely evolve, then vigorously tamed, and finally a less stringent containment should be adopted. Interestingly, in the last period the epidemic’s reproduction number is to let oscillate strictly above one although the product “reproduction number × percentage of susceptible” is kept strictly below the critical level of one. Hence, under the optimal containment policy, the percentage of infected decreases naturally at an exponential rate and the social planner is then allowed to substantially relax the lockdown in order not to incur too heavy economic costs. Moreover, we show that an increase in the fluctuations of the transmission rate gives rise to an earlier beginning of the optimal lockdown policy, which is also diluted over a longer period of time.

We believe that our work is only a first step in enriching the SIR model of a stochastic controlled component and in understanding the policymaker’s problem of optimally balancing the safeguard of public health and social wealth. There is still much to be done in order to incorporate other features like the partial detectability of the transmission rate or the role of public investment on the discovery of a vaccination (see Remark 2.4 on this). We leave the analysis of the resulting challenging problems for future work.

Acknowledgments

The authors thank Frank Riedel, Mauro Rosestolato, three anonymous Referees and the Guest Editors for interesting comments and suggestions.

Manuscript handled by Editor Aditya Goenka

2

The aforementioned regularity of the value function is a remarkable result on its own. Indeed, although the state process is degenerate (as the Wiener process only affects the dynamics of the transmission rate), we can show that the so-called Hörmander’s condition (cf. Nualart, 2006) holds true for any choice of the model’s parameters. This then ensures the existence of a smooth probability transition density for the underlying (uncontrolled) stochastic process, which in turn enables to prove substantial regularity of the value function.

3

The choice of considering x+y<1 – i.e. of having an initial strictly positive percentage of recovered – is only done in order to deal with an open set in the subsequent mathematical formulation of the problem. As a matter of fact, such a condition is not restrictive from the technical point of view as our results still apply if x+y<, for some >1, thus covering the case x+y=1 as well.

4

We implicitly require that the underlying probability space (Ω,F,F(Ft)t,P) is rich enough to accommodate also such an exponential time τ.

6

Notice that, in order to define a candidate optimal control in feedback form, one actually only needs the existence of the derivative Vz. For instance, in the deterministic problem tackled in Federico and Tacconi (2014) only the regularity of the directional derivative is exploited to prove a verification theorem in the context of viscosity solutions. However, here we can improve the regularity of V due to the stochastic nature of our problem, and therefore prove a classical verification theorem.

8

Our choice of the value of λ=λo+δ can be justified by assuming that it takes at least a year to develop a vaccine (i.e. 1λo365) and that the intertemporal discount rate of the social planner δ is negligible with respect to vaccination discovery rate. Indeed, a typical value for the annual discount rate δ is 5% which is clearly such that 0.053651365.

5

A function f:RnR, n1, is called semiconvex if there exists a constant K0 such that f(x)+K2|x|2 is convex; it is semiconcave if f is semiconvex.

7

Since b is bounded, by the method of Girsanov’s transformation, the system has a weak solution, which is also unique in law (see Karatzas and Shreve, 1991, Ch. 5, Propositions 3.6 and 3.10 and also Karatzas and Shreve, 1991, Ch. 5, Remark 3.7). For the sake of brevity, we do not investigate further existence and uniqueness of strong solutions, even if this might be done by employing finer results (e.g., see the seminal paper Veretennikov, 1979).

Appendix. Technical results

Lemma A.1

Let O be an open neighborhood of 0=(0,0,0)R3 . Let W:OR be a semiconcave function such that Wz(0)>Wz+(0) . Then there exists a sequence of functions (φn)nC2(O) such that

φn(0)=W(0)=0,φnWin a neighborhood of0,|Dφn(0)|L<,φzzn(0)n. (A.1)

Proof

Since W is semiconcave, there exists C00 such that

W^:OR,W^(x,y,z)W(x,y,z)C0(x2+y2+z2),

is concave. Fix such a C0. Since Wz(0)>Wz+(0), also W^z(0)>W^z+(0) and it is clear that it is equivalent to show the claim for W^. By Rockafellar (1970, Theorem 23.4), it follows that there exist

η=(ηx,ηy,ηz),ζ=(ζx,ζy,ζz)D+W(0)such thatηz>ζz.

Set

g(q)η,qζ,q

and notice that W^(0)=0=g(0) and that, by concavity,

W^(q)g(q)qO.

Define

ASpan{ηζ},

and denote by Π:R3A the orthogonal projection on A. Given qR3 we then have the decomposition

q=Πq+ηζ|ηζ|s,s=q,ηζ|ηζ|.

Define, for qO,

φn(q)g(Πq)+ψn(s),

where

ψn:RR,ψn(s)=n2s2+12η+ζ,ηζ|ηζ|s.

This sequence realizes (A.1). Indeed, the first two properties hold by construction; in particular the second one is due to the fact that we have

g(q)=g(Πq)+ζ,ηζ|ηζ|sifs0,η,ηζ|ηζ|sifs<0.

As for the last two properties, we notice that

Dφn(q)=Πη(=Πζ)+ηζ|ηζ|dψnds(s),

so

φzn(q)=Πη,(0,0,1)+ηζ|ηζ|,(0,0,1)dψnds(s)=Πη,(0,0,1)+ηzζz|ηζ|dψnds(s),
φzzn(q)=ηzζz|ηζ|d2ψnds2(s),

which then imply them.  □

Denote by q=(q1,q2,q3)(x,y,z) an arbitrary point of O. For any multi-index α(α1,α2,α3)N3 we denote by |α|=i=13αi and Dqα=|α|q1α1q3α3, with the convention that 0 is the identity.

Proposition A.2

For any qO the (uncontrolled) process (Qtq)t(Stx,y,z,Itx,y,z,βtz) admits a transition density p which is absolutely continuous with respect to the Lebesgue measure in R3 , infinitely many times differentiable, and satisfies the Gaussian estimates

p(t,q;q)C0(t)(1+|q|)m0tn02eD0(t)|qq|2t,t>0,q=(x,y,z)O, (A.2)
|Dqαp(t,q;q)|Cα(t)(1+|q|)mαtnα2eDα(t)|qq|2t,t>0,q=(x,y,z)O. (A.3)

Here, C0 , D0 , Cα , and Dα are increasing functions of time.

Proof

Given f,gC1(R3;R3), define the Lie bracket

[f,g]j=13(gqjffqjg).

Then, for any given and fixed qO, we set

μ(q)xyzxyzαyb(z,0) and Σ(q)00000000σ(z)

and denoting by Σi, i=1,2,3, the columns of the matrix Σ, we construct recursively the set of functions L0{Σ1,Σ2,Σ3}, Lk+1{[μ,φ],[Σ1,φ],[Σ2,φ],[Σ3,φ]:φLk}, k0. We also define Lk0Lk. We say that the Hörmander condition holds true at qO if

Span{φ(q),φL}=R3. (A.4)

Direct calculations show that

L0(q)={Σ3}(q)=00σ(z)
L1(q)={[μ,Σ3]}(q)=xyσ(z)xyσ(z)σz(z)b(z,0)σ(z)bz(z,0)

and

L2(q)={[μ,[μ,Σ3]],[Σ3,[μ,Σ3]]}(q)=xy(2b(z,0)σz(z)ασ(z)σ(z)bz(z,0))xy(σ(z)bz(z,0)2b(z,0)σz(z))b(z,0)2σzz(z)σ(z)b(z,0)bzz(z,0)+σ(z)bz(z,0)2σz(z)b(z,0)bz(z,0),xyσ(z)σz(z)xyσ(z)σz(z)b(z,0)σ(z)σzz(z)σ(z)2bzz(z,0)b(z,0)σz(z)2+σ(z)bz(z,0)σz(z)

Hence, the matrix associated to (L0L1L2)(q) has the sub-matrix formed by all its rows and its first three columns with determinant αx2y2σ3(z)<0. Hence, (A.4) holds true on O given the arbitrariness of q.

Therefore, by Theorem 2.3.3 in Nualart (2006), for any t>0 the uncontrolled process (Stx,y,z,Itx,y,z,βtz)t admits a transition density p which is absolutely continuous with respect to the Lebesgue measure in R3, and infinitely many times differentiable. Moreover, Theorem 9 and Remark 11 in Bally (2003) (see also Kusuoka and Stroock, 1985) show that p satisfies the Gaussian estimates (A.2), (A.3). This completes the proof. □

Proposition A.3

The dynamics of β as in (4.1) satisfy Assumption 2.1 .

Proof

For any zR and ξ[0,L], recall that b(z,ξ)=ϑ(β^(Lξ)z) and define

σ˜(z)=0if z(,0],σz(γz)if z(0,γ),0if z[γ,). (A.5)

Then, for (ξt)tA, introduce the stochastic differential equation

dβ˜t=b(β˜t,ξt)dt+σ˜(β˜t)dWt,β˜0=zR. (A.6)

Because b and σ˜ are Lipschitz-continuous and have sublinear growth, uniformly with respect to ξ, for any (ξt)tA there exists a unique strong solution to (A.6) starting at zR (see, e.g., Theorem 7 in Chapter V of Protter, 2004). We denote such a solution by (β˜tξ,z)t. Since ξb(z,ξ) is decreasing, by Theorem 54 in Chapter V of Protter (2004), we have

β˜tL,zβ˜tξ,zβ˜t0,z,t0a.s., (A.7)

where (β˜tL,z)t and (β˜t0,z)t) are, respectively, the solution to (A.6) with ξtL and with ξt0. On the other hand, by Feller’s test for explosion (cf. Proposition 5.22 in Chapter 5.5 of Karatzas and Shreve, 1991), it can be checked β˜tL,z>0 for all t0 a.s. and β˜t0,z<γ for all t0 a.s. Hence, by (A.7), we get β˜tξ,z(0,γ) for all t0 a.s. This proves that the SDE (4.1) admits a unique strong solution which lies within the open interval I=(0,γ) (cf. Assumption 2.1-(i)).

Given the boundedness of the interval I=(0,γ), using the expression of b and (A.5) it is straightforward to verify that (ii) and (iii) of Assumption 2.1 are satisfied as well. □

References

  1. Acemoglu D., Chernozhukov V., Werning I., Whinston M.D. National Bureau of Economic Research; 2020. Optimal Targeted Lockdowns in a Multi-Group SIR Model: Working Paper 27102. Available online at http://www.nber.org/papers/w27102. [Google Scholar]
  2. Alvarez L.H.R. Singular stochastic control, linear diffusions, and optimal stopping: a class of solvable problems. SIAM J. Control Optim. 2001;39(6):1697–1710. [Google Scholar]
  3. Alvarez F.E., Argente D., Lippi F. A simple planning problem for COVID-19 lockdown, testing and tracing. Am. Econ. Rev.: Insights. 2020 Preprint vailable online at https://www.nber.org/papers/w26981 Forthcoming. [Google Scholar]
  4. Aspri A., Beretta E., Gandolfi A., Wasmer E. 2020. Mortality containment vs. Economic opening: Optimal policies in a SEIARD model. Preprint on arXiv:2006.00085. Available online at https://arxiv.org/abs/2006.00085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bally V. 2003. An Elementary Introduction to Malliavin Calculus: INRIA Rapport de Recerche n. 4718. Available online at http://www.crm.cat/en/Activities/Documents/RR-4718.pdf. [Google Scholar]
  6. Bayraktar E., Cohen A., Nellis A. 2020. A macroeconomic SIR model for Covid-19. Preprint on arXiv:2006.16389. Available online at https://arxiv.org/abs/2006.16389. [Google Scholar]
  7. Behncke H. Optimal control of deterministic epidemics. Optim. Control Appl. Methods. 2000;21(6):269–285. [Google Scholar]
  8. Belak C., Christensen C., Seifried F.T. A general verification result for stochastic impulse control problems. SIAM J. Control Optim. 2017;55(2):627–649. [Google Scholar]
  9. Cannarsa P., Sinestrari C. Semiconcave Functions, Hamilton–Jacobi Equations, and Optimal Control. vol. 58. Birkhäuser; 2014. (Progress in Nonlinear Differential Equations and their Applications). [Google Scholar]
  10. Crandall M., Ishii H., Lions P.L. User’s guide to viscosity solutions of second order partial differential equations. Bull. Amer. Math. Soc. 1992;27(1):1–67. [Google Scholar]
  11. Favero C., Ichino A., Rustichini A. 2020. Restating the economy while saving lives under covid-19. Preprint at SSRN: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3580626. [Google Scholar]
  12. Federico S., Tacconi E. Dynamic programming for optimal control problems with delays in the control variable. SIAM J. Control Optim. 2014;52(2):1203–1236. [Google Scholar]
  13. Ferrari G. On the optimal management of public debt: A singular stochastic control problem. SIAM J. Control Optim. 2018;56(3):2036–2073. [Google Scholar]
  14. Gollier C. Cost-benefit analysis of age-specific deconfinement strategies. Covid Econ. 2020;24:1–31. Available online at: https://www.tse-fr.eu/sites/default/files/TSE/documents/doc/by/gollier/covideconomics_dynamic.pdf. [Google Scholar]
  15. Hotz T., Glock M., Heyder S., Semper S., Böhle A., Krämer A. 2020. Monitoring the spread of Covid-19 by estimating reproduction numbers over time. Preprint available at: https://stochastik-tu-ilmenau.github.io/COVID-19/reports/repronum/repronum.pdf. [Google Scholar]
  16. Huberts N., Thijssen J. 2020. Optimal timing of interventions during an epidemic. Preprint available at SSRN: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3607048. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Jiang D., Yu J. Asymptotic behavior of global positive solution to a stochastic SIR model. Math. Comput. Modelling. 2011;54:221–232. [Google Scholar]
  18. Kantner M. 2020. Beyond just flattening the curve: Optimal control of epidemics with purely nonpharmaceutical interventions. Preprint on arXiv:2004.09471. Available online at https://arxiv.org/abs/2004.09471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Karatzas I., Shreve S. second ed. Springer-Verlag; New York: 1991. Brownian Motion and Stochastic Calculus. [Google Scholar]
  20. Kermack W.O., McKendrick A.G. Contributions to the mathematical theory of epidemics, part i. Proc. Roy. Soc. Edinburgh Sect. A. 1927;115:700–721. [Google Scholar]
  21. Kruse T., Strack P. 2020. Optimal control of an epidemic through social distancing. Preprint at SSRN: https://ssrn.com/abstract=3581295 or . [DOI] [Google Scholar]
  22. Kusuoka S., Stroock D. Applications of the malliavin calculus, part II. J. Fac. Sci. Univ. Tokyo. 1985;32:1–76. [Google Scholar]
  23. Miclo L., Spiroz D., Weibull J. 2020. Optimal epidemic suppression under an ICU constraint. Preprint on arXiv:2005.01327 Available online at https://arxiv.org/abs/2005.01327. [Google Scholar]
  24. Nualart D. second ed. Springer-Verlag; 2006. The Malliavin Calculus and Related Topics. [Google Scholar]
  25. Pham H. Springer-Verlag; 2009. Continuous-Time Stochastic Control and Optimization with Financial Applications. [Google Scholar]
  26. Protter P.E. Springer; 2004. Stochastic Integration and Differential Equations. [Google Scholar]
  27. Rockafellar T. Princeton University Press; 1970. Convex Analysis. [Google Scholar]
  28. Toda A.A. 2020. Susceptible-infected-recovered (SIR) dynamics of Covid-19 and economic impact. Preprint on arXiv:2003.11221. [Google Scholar]
  29. Tornatore E., Buccellato S.M., Vetro P. Stability of a stochastic SIR system. Physica A. 2005;354:111–126. [Google Scholar]
  30. Veretennikov A.Y. On the strong solutions of stochastic differential equations. Theory Probab. Appl. 1979;24(2):354–366. [Google Scholar]
  31. Yong J., Zhou X.Y. Springer; 1999. Stochastic Control - Hamiltonian Systems and HJB Equations. [Google Scholar]

Articles from Journal of Mathematical Economics are provided here courtesy of Elsevier

RESOURCES