Incentives, lockdown, and testing: from Thucydides’ analysis to the COVID-19 pandemic

Emma Hubert; Thibaut Mastrolia; Dylan Possamaï; Xavier Warin

doi:10.1007/s00285-022-01736-0

. 2022 Apr 10;84(5):37. doi: 10.1007/s00285-022-01736-0

Incentives, lockdown, and testing: from Thucydides’ analysis to the COVID-19 pandemic

Emma Hubert ¹, Thibaut Mastrolia ^2,^✉, Dylan Possamaï ³, Xavier Warin ⁴

PMCID: PMC8995008 PMID: 35397720

Abstract

In this work, we provide a general mathematical formalism to study the optimal control of an epidemic, such as the COVID-19 pandemic, via incentives to lockdown and testing. In particular, we model the interplay between the government and the population as a principal–agent problem with moral hazard, à la Cvitanić et al. (Finance Stoch 22(1):1–37, 2018), while an epidemic is spreading according to dynamics given by compartmental stochastic SIS or SIR models, as proposed respectively by Gray et al. (SIAM J Appl Math 71(3):876–902, 2011) and Tornatore et al. (Phys A Stat Mech Appl 354(15):111–126, 2005). More precisely, to limit the spread of a virus, the population can decrease the transmission rate of the disease by reducing interactions between individuals. However, this effort—which cannot be perfectly monitored by the government—comes at social and monetary cost for the population. To mitigate this cost, and thus encourage the lockdown of the population, the government can put in place an incentive policy, in the form of a tax or subsidy. In addition, the government may also implement a testing policy in order to know more precisely the spread of the epidemic within the country, and to isolate infected individuals. In terms of technical results, we demonstrate the optimal form of the tax, indexed on the proportion of infected individuals, as well as the optimal effort of the population, namely the transmission rate chosen in response to this tax. The government’s optimisation problems then boils down to solving an Hamilton–Jacobi–Bellman equation. Numerical results confirm that if a tax policy is implemented, the population is encouraged to significantly reduce its interactions. If the government also adjusts its testing policy, less effort is required on the population side, individuals can interact almost as usual, and the epidemic is largely contained by the targeted isolation of positively-tested individuals.

Keywords: COVID-19, Stochastic epidemic models, Epidemic control, Optimal incentives, Moral hazard

Introduction

Starting around 430 BC, and known as the first historically epidemic, the plague of Athens killed between a quarter and a third of Athenians, as reported by Thucydides. He analysed the consequences of this epidemic, and concluded that it had led a moral upheaval for the Athenians, faced with the complete lack of any useful cure. In the end, the disease was only stopped thanks to the development of a natural immunity within the population, during the first four years of the epidemic phase. Concerning the spread of the disease itself, Thucydides wrote the following

When they were afraid to visit one another, the sufferers died in their solitude, so that many houses were empty because there had been no one left to take care of the sick; or if they ventured they perished, especially those who aspired to heroism. For they went to see their friends without thought of themselves and were ashamed to leave them, at a time when the very relations of the dying were at last growing weary and ceased even to make lamentations, overwhelmed by the vastness of the calamity. (Jowett 1900, Volume I, Book II, pp. 138)

From this analysis, we can already isolate three fundamental questions that need to be addressed whenever an unknown epidemic occurs.

How can one model a disease with only parsimonious information on how it is spreading among the population?
How can one solve the Gordian knot associated to interactions within the population: enjoying on the one hand the presence of others and avoiding solitude, and on the other hand dramatically spreading the disease?
How can governments and decision-makers incentivise people in order to better control the spread of the epidemic?

Choosing a relevant epidemic model The first question is naturally linked to several strands of fundamental research, both for mathematicians and physicians, dealing with the problem of choosing a relevant epidemic model. The paternity of the first mathematical model designed to describe the evolution of an epidemic seems to be attributed to Bernoulli, who proposed one for smallpox as early as 1760 in Bernoulli (1760). However, other early mathematical approaches were used to study various types of epidemics and their consequences, for example by Farr in 1840, who applied mathematics to death records during a smallpox epidemic in England in Farr (1840), and whose work can be considered as a starting point of the field. Nevertheless, the real mathematical development of the theory had to wait for the 20th century, with fundamental contributions for the development of deterministic models by Hamer (1906), Ross (1910), and later Bartlett (1949) who proposed one of the first general investigations of the evolution of deterministic interacting systems, which was then applied to epidemiology in Kendall (1956). It was rapidly noticed that deterministic models were insufficient to account for the uncertainty associated with the disease spreading, and the technical difficulties usually encountered for its detection. This acknowledgement helped nurturing the development of stochastic models, whose first instance seems to be traced back to McKendrick (1925). For a precise comparison between deterministic and stochastic models in discrete-time settings as well as more historical details, we refer our readers to Bailey (1975), and to Allen (2008) for more up-to-date references and an overview of recent epidemiological models.

We will now describe some models, belonging to the general class of compartmental models, and which will be at the heart of our work. The first one considers a sort of worst-case scenario, in which an immunity is not developed after infection. Such models have been coined SIS (for Susceptible–Infected–Susceptible), and consider a population divided into two groups: susceptible individuals interact with infected ones, and therefore move from one class to the other repeatedly. This model was first discussed in Weiss and Dishon (1971), and then extended by Nåsell (1996), who found the quasi-stationary distribution of a continuous-time stochastic SIS model with no births nor deaths. In this work, the stochastic SIS model we will focus on is defined as a solution to a bi-dimensional SDE driven by a single Brownian motion, as proposed by Gray et al. (2011). Alternatively to this quite pessimistic scenario, one can assume that an immunity will appear after infection, thus adding a third state: the recovered individuals, who have been cured and developed antibodies. Introduced originally by Kermack and McKendrick (1927), this so-called SIR model was studied in depth by Anderson and May (1979) in a deterministic setting, while stochastic perturbations were introduced by Beretta et al. (1998). To be consistent with our choice for the SIS model, we will consider the stochastic SIR model proposed by Tornatore et al. (2005). It should be noted that there is a wide variety of formulations of stochastic SIS and SIR models, which makes it impossible to list them all here. We will simply mention the works by Britton and Pardoux (2019), Dieu et al. (2016), Du and Nhu (2020), Jiang et al. (2011) and Schreiber et al. (2021), on the study of the long-term behaviour of this type of stochastic models, thus answering the question whether or not the epidemic can be controlled.

On the control of an epidemic In the aforementioned classical compartmental models, the infection grows into the population through an incidence rate $β$ , and proportionally to the product of the number of susceptible and infected individuals, as already discussed in the work by Wilson and Worcester (1945), or in the Reed–Frost theory, revisited for instance by Abbey (1952). In the absence of a cure or a vaccine, this transmission rate appears as the only control variable for individuals or public institutions to reduce the spread of an epidemic. Our take on the second main question will therefore be from a control-theoretic perspective. At the heart of this approach is the simple idea that when faced with an epidemic, a perfectly rational population will try to find an equilibrium interaction rate, balancing the need to still connect with others, and the natural fear of spreading the infection itself. This is by no means a new point of view, and papers discussing the use of formal control theory in epidemiology can be dated back to the 70s, see among others Taylor (1968), Abakuks (1973), Morton and Wickwire (1974), Wickwire (1975), or Sethi and Staats (1978). More recently and closer to our purpose, we can refer to Behncke (2000), Riley et al. (2003), who studied the impact of the control of transmission rate on the 2002–2004 SARS outbreak in Hong Kong and on the ways to interfere with the disease’s spread, Hansen and Day (2011), and more broadly to the monograph by Lenhart and Workman (2007).

As should be expected, a significant part of the recent literature on the COVID-19 pandemic has also adopted this control point of view, and such lockdown measures as well as their medical, societal, and economical impacts are discussed by, among others, Anderson et al. (2020), Bayraktar et al. (2021), Charpentier et al. (2020), Ferguson et al. (2020), Fowler et al. (2020), Grigorieva et al. (2020), Hatchimonji et al. (2020), Kantner (2020), Piguillem and Shi (2020), or Wilder-Smith et al. (2020). The previous papers take the point of view of a government acting as a central planner, in the sense that it can impose on the population to control the epidemic in a way which is beneficial to the population as a whole. However, though it seems reasonable to assume that at least some individuals, by being afraid of getting sick, will naturally decrease their interaction rates, it would clearly be a stretch to consider that all individuals will follow the governmental’s recommendations. This individuals’ point of view have been considered by Reluga in Reluga (2010) and Reluga (2013), as well as by Li et al. (2017), thus introducing game theory in epidemiologic models, or more recently by Élie et al. (2020) for the case of COVID-19.

Introducing the notion of incentives In light of the issues we have raised, a natural conclusion was, at least for us, that even if a control-theoretic approach to mitigate the impact of an epidemic is clearly desirable, there is a priori no evidence that in face of clear public policies, a population will directly adopt a social distancing behaviour leading to an optimal transmission rate for the welfare of the society. Moreover, in the absence of a system allowing to actually keep track of the level of interaction within the population, governments are faced with a clear situation of moral hazard: it is impossible for large countries to ensure the application of such isolation measures, and therefore it is unfeasible to have an absolute control on the behaviour of all individuals and their interactions.1 Consequently, an incentive policy should also be calibrated by governments in order to get a better control on the spread of the disease. This, as expected, leads us to our third question, which is where our approach departs significantly from the extant literature. Indeed, to our knowledge, the literature on optimal incentives to counter moral hazard in the context of an epidemic is very sparse. Some authors, for instance Valeeva and Backus (2007) or Gramig et al. (2005, 2009), study disease spreading through the lens of asymmetry of information, but they are mostly interested in livestock related diseases, where producers have private information on preventive measures they may have adopted, prior to contamination (ex ante moral hazard), and may or may not declare whether their herd is infected after contamination (ex post adverse selection). A paper by Francis (2004) discusses the optimal taxes/subsidies to encourage vaccination during the flu season. More closely related to our principal–agent formulation, Carmona and Wang consider in (Carmona and Wang 2021, Sect. 5) an application to the containment of an epidemic of their moral hazard theory for agents interacting through a finite state mean-field game. Finally, an approach similar to ours, but which takes into account mean-field type interactions between individuals within the population, has been developed concurrently and independently of the present paper by Aurell et al. (2020).

Principal–agent approach and technical results We thus propose to fulfil this gap in the literature by studying how a lockdown policy can limit the number of infected people during an epidemic, with uncertainties on the actual number of affected individuals, and on their level of adherence to such a policy. More specially, we aim at solving this moral hazard problem by finding

(i)
the best reaction effort of the population to reduce the interaction given a specific government policy;
(ii)
the optimal policy composed by an aggregated tax paid by the population at some fixed maturity, and a testing policy to reduce the uncertainty on the estimated number of infected people.

This problem perfectly fits with a classical principal–agent problem with moral hazard, and boils down to finding a Stackelberg equilibrium between the principal (the leader, here the government) proposing a policy to an agent (the follower, here the population) to interact optimally in order to reduce the spread of the disease. Principal–agent problems have a long history in the economics literature, dating back from, at least, the 60s. It is not our goal here to review the whole literature on the subject, and we refer the interested reader to the seminal books by Laffont and Martimort (2002), Bolton and Dewatripont (2005), or Salanié (2005). We will content ourselves to mention that this literature regained a strong momentum in the past two decades with the development of continuous-time models. Main contributors in these regards are Holmström and Milgrom (1987), Schättler and Sung (1993), Sannikov (2008), see also the monograph by Cvitanić and Zhang (2012). More recently, Cvitanić et al. (2017, 2018) developed a general theory allowing to tackle a great number of contract theory problem, which has been then extended and applied in many different situations.2 However, the previous approach requires a fundamental assumption on the structure of the controlled process, that is not satisfied in our model, because roughly speaking, there is only one Brownian motion driving the two processes, and we therefore cannot directly rely on existing result to tackle our problem. In these so-called degenerate problems, the literature has so far relied on the Pontryagin stochastic maximum principle, see for instance Hu et al. (2019), but this requires extremely stringent assumptions, such as linear dynamics, which are automatically precluded for SIS/SIR models. We nevertheless prove that in our specific problem, it is possible to identity a whole family of contract representations, which is different from the (unique) one obtained in Cvitanić et al. (2018), but which still allows us to re-interpret the problem of the principal as a standard stochastic control problem. As far as we know, ours is the first paper in the literature which uses a dynamic programming approach to solve a degenerate principal–agent problem, and this constitutes our main mathematical contribution.

Numerical results and policy-related implications Unfortunately, there is no way to extract from our model explicit results, especially on the shape of optimal controls. It is therefore necessary to perform numerical simulations, by implementing semi-Lagrangian schemes. The numerical results for the SIR model are conclusive, in the sense that they confirm the relevance of a tax and testing policy to improve the control of an epidemic. First, in the benchmark case, i.e. when the government does not put into place a specific policy, the efforts of the population are not sufficient to contain the epidemic. In our opinion, this supports the need for incentives. Indeed, if a tax policy is put into place, even in the absence of a specific testing policy, the population is encouraged to significantly reduce its interactions, thus containing the epidemic until the end of the period under consideration. Moreover, if the government also adjusts the testing policy, less effort is required on the population side, so individuals can interact almost in a business-as-usual fashion, and the epidemic is largely contained by the targeted isolation of positively-tested individuals. However, in both cases, the population relaxes its effort at the very end of the fixed lockdown period, leading to a resumption of the epidemic at that point. We obtain similar results in the case of a SIS model (see Hubert et al. 2020, Appendix A).

Notations We let $N^{⋆}$ be the set of positive integers, $R_{+} : = [0, \infty)$ and $R_{+}^{⋆} : = (0, \infty)$ . We fix a time horizon $T > 0$ corresponding to the lockdown length chosen, a priori, by the government. For every $n \in N^{⋆}$ , $S^{n}$ represents the set of $n \times n$ symmetric positive matrices with real entries. We also denote by $C^{n}$ the space of continuous functions from [0, T] into $R^{n}$ , and simplify notations when $n = 1$ by setting $C : = C^{1}$ . The set $C^{n}$ will always be endowed with the topology associated to the uniform convergence on the compact [0, T]. For every finite dimensional Euclidean space E, and any $n \in N^{⋆}$ , we let $C_{b} (E, R)$ be the space of bounded, continuous functions from E to $R$ , as well as $C_{b}^{n} (E, R)$ the subset of $C_{b} (E, R)$ of all n-times continuously differentiable functions on E, with bounded derivatives. For every $φ \in C_{b}^{2} (E, R)$ , we denote by $\nabla φ$ its gradient vector, and by $D^{2} φ$ its Hessian matrix.

Informal pandemic models and main results

In this section, in order to highlight the results we obtain throughout this paper, we present our model in an informal way, and refer the reader to Sect. 4 for the rigorous mathematical study. In particular, we first detail the compartmental epidemic models we consider to represent the spreading of the virus, namely a stochastic version of the well-known SIS and SIR models, and how both the population and the government can impact these dynamics. We then describe their optimal control problems, together with the Stackelberg game in which they are involved. Finally, we summarise our theoretical findings, which will prove useful for the numerical resolution described in Sect. 3.

Controlled stochastic SIS/SIR dynamics

At the beginning of an epidemic, it is unlikely that decision-makers, let alone the population, will have sufficient information to conclude that infected individuals become immune to the virus in question once they have recovered. This is particularly true when the virus is new, as in the case of the COVID-19. For this reason, we choose to address in our study both SIS and SIR compartmental models. The SIS model considers that infected individuals do not develop an immunity to the disease, and thus assume that an infected individual can, after recovery, re-contract the disease. Conversely, the SIR compartment model involves a third class, namely the ‘Recovered’, i.e., individuals who have contracted the disease, are now cured, and especially immune to the virus under consideration. In order to make our study more comprehensive, we consider a meta-model, whose epidemic pattern is described by Fig. 1, and which allows us to deal with the two compartmental models mentioned above. We denote by $(S_{t}, I_{t}, R_{t})$ the proportion of individuals in each state ‘Susceptible’, ‘Infected’ and ‘Recovered’ at time $t \geq 0$ . We describe below the main parameters, and whether they are controlled or not, which allows to progressively construct the final specification of the epidemic in terms of stochastic dynamics satisfied by (S, I, R), given by the system (2.1).

Uncontrolled parameters Some parameters are common in the SIS and SIR models, as highlighted by Fig. 1. In particular, both models involve three non-negative parameters: $λ$ , $μ$ and $γ$ . While the parameters $λ$ and $μ$ represent respectively the birth and (natural) death rates among the population, and therefore reflect the demographic dynamics unrelated to the epidemic,3 while $γ$ represents the death rate associated to the disease. Conversely, the two non-negative constant rates $ν$ and $ρ$ are specific to the SIS and SIR models respectively. More precisely, $ν$ corresponds to the rate at which an infected individual returns, after recovery, to the class of susceptible individuals, while $ρ$ represents the recovery rate in the SIR model, i.e., the rate at which individuals who have contracted the disease are cured, and therefore immune to the virus under consideration. All the aforementioned parameters, i.e. $λ, μ, γ, ν$ and $ρ$ are homogeneous to the inverse of our unit of time, i.e. days, and are assumed to be constant and exogenous.

Control of the transmission rate The transmission rate $β$ of the disease is defined as the average number of contacts made by an average infective per unit of time that leads to an infection, and is therefore also homogeneous to the inverse of our unit of time, i.e. days. In contrast to the previous parameters, $β$ is assumed here to be endogenous and time-dependent, in order to model the influence that the population can have on this rate. Indeed, the transmission rate of an epidemic depends essentially on two factors: the disease characteristics and the contact rate within the population. Although the population cannot modify the disease characteristics, each individual can make a costly effort to reduce his/her contact rate with other individuals in the population. With this in mind, we first assume that the constant initial transmission rate of the disease, i.e., without any control measures or particular effort from the population, is given by some level $\bar{β} > 0$ . We then consider that the population can deviate from this initial transmission rate, namely by choosing, at some cost, a process $β \in B$ , assumed to be B-valued for $B : = [0, β^{\max}]$ , where the constant $β^{\max} \geq \bar{β}$ represents the maximum rate of interaction that can be considered.4

Compartmental model with uncertainty The use of a deterministic model is widespread and generally justified for most epidemics. However, when considering for example the COVID-19 pandemic, it appears that the number of infected individuals is not so simple to quantify and estimate. Indeed, without a large testing campaign, it seems complicated to know precisely the actual number of susceptible and infected, especially because of the absence of symptoms for a significant proportion of infected individuals. As a consequence, it seems more realistic for our purpose to represent the spread of the epidemic by a stochastic dynamic, which is inspired by the versions of stochastic SIS and SIR models respectively considered by Gray et al. (2011, Sect. 2) and Tornatore et al. (2005). More precisely, we consider the following dynamic for the epidemic, where the proportion of infected and susceptible are impacted at each time t by a Brownian motion $W_{t}$

\begin{matrix} \{\begin{matrix} S_{t} = s_{0} + \int_{0}^{t} (λ - μ S_{s} + ν I_{s} - β_{s} \sqrt{α_{s}} S_{s} I_{s}) d s + \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}, t \in [0, T], \\ I_{t} = i_{0} - \int_{0}^{t} ((μ + ν + γ + ρ) I_{s} - β_{s} \sqrt{α_{s}} S_{s} I_{s}) d s \\ - \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}, t \in [0, T], \\ R_{t} = r_{0} - \int_{0}^{t} (ρ I_{s} - μ R_{s}) d s, t \in [0, T], \end{matrix}) \end{matrix}

2.1

for a given initial distribution of individuals at time 0, denoted by $(s_{0}, i_{0}, r_{0}) \in R_{+}^{3}$ and assumed to be known. Note that to recover a stochastic SIS model, one has to set $ρ = 0$ , and conversely $ν = 0$ for a SIR.

Remark 2.1

There exist several different versions of stochastic SIS/SIR models, see among others the works by Allen (2008) and Greenwood and Gordillo (2009), in addition to those already mentioned in the introduction. In this paper, we assume that the uncertainty giving rise to the emergence of the Brownian motion is related to the interaction rate $β$ . More precisely, here, $β$ is no longer constant compared to deterministic model but subject to random shocks, i.e., $β d t ⟵ β d t + σ d B_{t}$ . We refer to the works by Gray et al. (2011) and Lesniewski (2020) for more details on the construction of such stochastic models. However, we would like to emphasise that, although we have chosen a specific dynamic, and a formulation in terms of rate and non-dimensionless groups, the general approach we develop in this paper can be adapted in a straightforward way to various stochastic models and other formulations.

Testing policy In addition to the parameters described above—the constant rates $λ$ , $μ$ , $γ$ , $ρ$ and $ν$ , and the population’s control $β$ —this stochastic version includes two new parameters: a fixed and deterministic parameter $σ > 0$ , homogeneous to the inverse of the square root of our time unit, and a dimensionless time-dependent process $α$ , representing the actions of the government in terms of testing policy. More precisely, we first assume that, without any specific effort of the government, $α$ is equal to 1. Then, the government can choose to increase, at some cost, the number of tests in the population, represented by a decrease of the parameter $α$ , thus reducing the volatility of the processes S and I. Hence, both the population and the government have a clearer view of the proportion of susceptible and infected, and thus on the epidemic. In particular, this control $α$ of the government is assumed to be A-valued, where $A : = [ε, 1]$ for a small parameter $ε \in (0, 1)$ ,5 and we denote by $A$ the corresponding set of processes.6

In addition, the testing policy allows the government to isolate positively-tested individuals. More precisely, without any testing policy, i.e. $α = 1$ , the government cannot isolate contaminated individuals efficiently. In this case, all infected people spread the disease, and the transmission rate of the virus is given by $β$ . Conversely, if a testing policy is implemented by the government, i.e. $α < 1$ , we consider that individuals with positive test results can be isolated, and as a consequence less infected people spread the disease. In this case, the effective transmission rate is lower. We however do not assume that the impact of the testing policy on the volatility of S and I, and on the transmission rate has the same magnitude: we expect a lower reduction of the effective transmission rate, compared to the volatility reduction for a given policy $α$ . Indeed, it is easier to reduce the uncertainty on the number of infected people, compared to actually isolate individuals who have been identified as infected. We thus assume a linear dependency with respect to $α$ for the volatility of both S and I, while the effective transmission rate is chosen equal to $β \sqrt{α}$ , so that the number of infected people spreading the disease at time t is actually given by $\sqrt{α} I_{t}$ .

Remark 2.2

To be more realistic, the implementation of the testing policy could be modelled through the addition of a supplementary state, to capture the individuals under quarantine. The theoretical approach developed in this paper can easily be adapted to this purpose, and even for more refined compartmental models. However, the complexity of the numerical resolution increases drastically by adding a state, as mentioned in Sect. 5. We therefore make the choice to limit the number of states, by considering that the testing policy has a direct impact on the effective transmission rate. Nevertheless, this shortcut should not alter the significance of our results in terms of appropriate policies, even if a more precise model would obviously give more relevant quantitative results.

The Stackelberg equilibrium

In addition to the choice of a testing policy, the government can also incentivise the population to limit their social interactions, in order to decrease the transmission rate of the disease, by introducing financial penalties. More precisely, at time 0, the government informs the population about its testing policy $α \in A$ , as well as its fine policy $χ \in C$ ,7 for the lockdown period [0, T]. Informally, while the testing policy directly impact the dynamic (2.1) of the epidemic, the fine policy will play an indirect role: by being indexed on the proportion of susceptible and infected individuals, this tax will incentivise the population to decrease the transmission rate $β$ , in order to limit the spread of the epidemic. Knowing these policies, the population will choose an interacting behaviour according to the following rules

(i)
an increase in the tax lowers its utility;
(ii)
an increase in the level of interaction (up to a specific threshold, namely $\bar{β}$ ) improves its well-being;
(iii)
the population is scared of having a large number of infected people.

Then, by anticipating the optimal response of the population to a given policy $(α, χ) \in A \times C$ , the government will optimise this policy in order to maximise its own expected utility.

Population optimisation problem

For a given policy $(α, χ) \in A \times C$ , we assume that the population solves the following optimal control problem:

\begin{matrix} V_{0}^{A} (α, χ) : = sup_{β \in B} E [\int_{0}^{T} u (t, β_{t}, I_{t}) d t + U (- χ)], \end{matrix}

2.2

where $u : [0, T] \times B \times R_{+} ⟶ R$ and $U : R ⟶ R$ are continuous functions in all their arguments, and U is a bijection from $R$ to $R$ . Given a pair $(α, χ)$ , the set of optimal contact rates $β$ will be denoted $B^{⋆} (α, χ)$ .8

The functions u and U should be interpreted as functions translating respectively the actual value of interaction from the point of view of the population, and the disutility associated to the fine. More precisely, the function U is assumed to be an increasing function, according to (i) above. Concerning the function u, it should be non-decreasing in the second variable up to $\bar{β}$ , and then non-increasing, modelling (ii) above. On the other hand, the function u is assumed to be non-increasing with respect to the proportion of infected individual in the population. In particular, this allows to take into account both the fear of the infection (as mentioned in (iii) above) and the cost that is incurred if an individual is infected.9 Moreover, we choose to normalise the utility of the population to zero when there is no epidemic. In other words, if $i_{0} = 0$ , then $I_{t} = 0$ for all $t \in [0, T]$ , and thus the utility of the population should be equal to 0. With this in mind, we assume first that $U (0) = 0$ , which means that without a fine, the population does not suffer any disutility. Second, when there is no epidemic, the population should not reduce its social interaction, meaning that for all $t \in [0, T]$ , $β_{t} = \bar{β}$ . This leads us to assume that $u (t, \bar{β}, 0) = 0$ , for all $t \in [0, T]$ .

Government optimisation problem

As already explained, the government can choose the tax $χ \in C$ paid by the population at the end of the lockdown period, together with the testing policy $α \in A$ , and we informally write its optimisation problem as

\begin{matrix} V_{0}^{P} : = sup_{(α, χ) \in Ξ} sup_{β \in B^{⋆} (α, χ)} E [χ - \int_{0}^{T} (c (I_{t}) + k (t, α_{t}, S_{t}, I_{t})) d t], \end{matrix}

2.3

where $c : R_{+} ⟶ R_{+}$ and $k : [0, T] \times A \times R_{+} \times R_{+} ⟶ R$ are continuous functions. The function c denotes the instantaneous cost implied by the proportion of infected people, and is thus assumed to be non-decreasing, while the function k represents the cost of the testing policy.

In addition, the set $Ξ$ takes into account the so-called participation constraint for the population. This means that the government is benevolent, which translates into the fact that it has committed to ensure that the living conditions of the population do not fall below a minimal level. Mathematically, the government can only implement policies $(α, χ) \in A \times C$ such that $V_{0}^{A} (α, χ) \geq \underline{v}$ , where the minimal utility $\underline{v} \in R$ is given. This is what is encoded in the set $Ξ$ .

Remark 2.3

Recall that, while the testing policy $α \in A$ directly impact the dynamic of the epidemic, the tax $χ \in C$ plays an indirect and incentive role. Indeed, in the moral hazard situation of interest, i.e. when the government cannot observe the population’s efforts to reduce the transmission rate of the virus, the government can only encourage the population to make efforts, by implementing an incentive scheme. In particular, by indexing the tax $χ \in C$ in an optimal way on the paths of the stochastic processes S and I, which are the only variables observable by the government in this moral hazard context, the population will be incentivised to decrease the transmission rate of the epidemic.

Utilities and cost specifications

We now provide the specification for the utility and cost functions of the population and the government, respectively, that will be used for the numerical simulations in Sect. 3. Nevertheless, we would like to emphasise that our general approach in Sect. 4 does not take these specifications into account, and therefore our theoretical results are valid for very general forms of cost and utility functions. This naturally implies that alternative parameterisations could be chosen for the numerical part, if one wants to capture some costs or effects that are neglected here, for example the individual cost of being infected or the possible scaling costs of testing.

For the population Concerning the population’s utility U with respect to the tax $χ$ , we choose a mixed CARA–risk-neutral utility function, so that $U (0) = 0$ , and U is an increasing and strictly concave bijection from $R$ to $R$

\begin{matrix} U (x) : = \frac{1 - e^{- θ_{p} x}}{θ_{p}} + ϕ_{p} x, x \in R, for some (θ_{p}, ϕ_{p}) \in {(0, + \infty)}^{2} . \end{matrix}

For later use, we record that the inverse of U, denoted by $U^{(- 1)}$ , can be expressed in terms of the LambertW function10

\begin{matrix} U^{(- 1)} (y) : = \frac{1}{θ_{p}} LambertW (ϕ_{p}^{- 1}, e^{\frac{1 - θ_{p} y}{ϕ_{p}}}) + \frac{θ_{p} y - 1}{θ_{p} ϕ_{p}}, y \in R . \end{matrix}

Note that the previous function U defines how the population values dollars (the unit of the tax) in terms of units of utility, called util. More precisely, $ 1 corresponds to U(1) utils, and conversely 1 util is worth $ $U^{(- 1)} (1)$ .

Next, concerning the running utility function u, we can consider the following separable form

\begin{matrix} u (t, b, i) & : = - u_{β} (t, b) - u_{I} (i), (t, b, i) \in [0, T] \times B \times R_{+}, \end{matrix}

2.4

where the functions $u_{β} : [0, T] \times R_{+} ⟶ R$ and $u_{I} : R_{+} ⟶ R$ should respectively capture the two rules (ii) and (iii). The function $u_{I}$ could model the fact that the population underestimates the epidemic when the proportion of infected is close to 0, while when it becomes large, the population is irrationally afraid. For instance, we can choose

\begin{matrix} u_{I} (i) = c_{p} i^{3}, i \in R_{+}, for some c_{p} \geq 0 (in util \cdot {day}^{- 1}) . \end{matrix}

2.5

Finally, the function $u_{β}$ must first acknowledge that it is costly for the population to deviate from its usual contact rate. Second, during the lockdown period, the social cost of distancing measures can become more and more important for the population, and we thus expect the cost $u_{β}$ to also reflect this sensitivity with respect to time. More precisely, we can consider the following form

\begin{matrix} u_{β} (t, b) & : = η_{p} ψ (t) {(\bar{β} - b)}^{2} / 2, (t, b) \in [0, T] \times B, for some η_{p} \\ > 0 (in util \cdot day) . \end{matrix}

2.6

Above, $ψ$ should be a non-decreasing and convex $R_{+}$ -valued function, to represent the increasing aversion to the lockdown for the population as time passes. In other words, deviating from its usual level of interaction entails a social cost to the population that is greater as the duration increases. More precisely, we can consider

\begin{matrix} ψ (t) : = e^{τ_{p} t}, t \in [0, T], for some τ_{p} > 0 (in {day}^{- 1}) . \end{matrix}

For the government Regarding the cost function c, one can choose for instance the following linear–quadratic form $c (i) : = c_{g} (i + i^{2})$ , $i \in R_{+}$ , for some $c_{g}$ in dollars per days, whose value is greater than $c_{p}$ to take into account that the marginal cost linked to the proportion of infected people in the population is higher for the government than for the population itself. More precisely, the linear part represents the cost per unit of infected people, while the quadratic part highlights the cost induced by the saturation of the healthcare system when the number of infected is too high. Compare to the cubic cost chosen for the population in Sect. 2.2.3, this choice emphasises that, on the one hand, even for a small number of infected, the marginal cost faced by the government is not close to 0 (hence the linear term). On the other hand, the population is more likely to incur very high and lasting costs in terms of QALY/DALY when the disease spreads uncontrollably, when compared to the government which mostly faces pecuniary costs.

Concerning the cost k associated with the testing policy, we may consider the following function

This function highlights the fact that it is very costly, if not impossible, to eliminate the uncertainty associated with the epidemic by choosing $α = 0$ , while the cost of a no-testing policy $(α = 1)$ is null. Indeed, on a country-wide scale, it seems impossible to develop a testing policy sufficient to know exactly the proportion of susceptible and infected.

Two alternative problems

As already mentioned, the framework of interest in this paper is that of moral hazard, i.e. when the government does not observe the efforts of the population, and must therefore find an optimal incentive scheme. However, in order to test the relevance of this incentive scheme, it is important to compare our results with those obtained in two more traditional settings: (i) a benchmark case, when the government does not interfere, and (ii) when there is no moral hazard (first-best case), and therefore the government can enforce the optimal transmission rate on the population.

(i)
When the government does not interfere, i.e. without tax and testing policy, it suffices to solve (2.2) for $α = 1$ and $χ = 0$ . Since we assumed that $U (0) = 0$ , the optimisation problem faced by the population boils down to the following standard control problem, whose associated PDE will be given by (2.10)
$\begin{matrix} V_{0}^{A} (1, 0) = sup_{β \in B} E [\int_{0}^{T} u (t, β_{t}, I_{t}) d t] . \end{matrix}$ 2.7
(ii)
The first-best case is the best possible scenario where the government can enforce whichever interaction rate $β \in B$ it desires, and simply has to satisfy the participation constraint of the population. From the practical point of view, this could correspond to a situation where the government is able to track every individual and force them to stop interacting. In this case, the problem faced by the government is
$\begin{matrix} V_{0}^{P, FB} : = & sup_{(α, χ, β) \in A \times C \times B} E [χ - \int_{0}^{T} (c (I_{t}) + k (t, α_{t}, S_{t}, I_{t})) d t], \\ s.t. E [\int_{0}^{T} u (t, β_{t}, I_{t}) d t + U (- χ)] \geq \underline{v} . \end{matrix}$ 2.8

Main results and comparison

In this section, we present the main theoretical results obtained when the dynamic of the epidemic is given by (2.1): we begin by outlining the results in the two alternative problems mentioned above—the benchmark and first-best cases—then explain the optimal form of the tax in the moral hazard case, and conclude with the resolution of the government’s problem in this general case. In short, the solution to any of the three problems is equivalent to solving the relevant Hamilton–Jacobi–Bellman (HJB for short) equation.

The benchmark case: without tax and testing policies

As mentioned in Sect. 2.2.4 (i), this benchmark problem is a standard Markovian stochastic control problem. In this case, the population’s Hamiltonian is defined, for $t \in [0, T]$ , $(s, i) \in {(R_{+}^{⋆})}^{2}$ , $p : = (p_{1}, p_{2}) \in R^{2}$ and $M \in S^{2}$ by

\begin{matrix} H^{A} (t, s, i, p, M) & : = sup_{b \in B} {u (t, b, i) - b s i (p_{1} - p_{2})} + (λ - μ s + ν i) p_{1} \\ - (μ + ν + γ + ρ) i p_{2} + \frac{σ^{2} {(s i)}^{2}}{2} {(\begin{matrix} 1 \\ - 1 \end{matrix})}^{⊤} M (\begin{matrix} 1 \\ - 1 \end{matrix}) . \end{matrix}

2.9

We then have the natural identification $V_{0}^{A} (1, 0) = v (0, s_{0}, i_{0})$ , where v solves the associated HJB equation

\begin{matrix} - \partial_{t} v (t, s, i) - H^{A} (t, s, i, \nabla v, D^{2} v) = 0, (t, s, i) \in D, \end{matrix}

2.10

with terminal condition $v (T, s, i) = 0, (s, i) \in D_{T}$ ; where, for a particular function F defined by (4.4) in Sect. 4.1,

\begin{matrix} D : = {(t, s, i) \in [0, T) \times {(R_{+}^{⋆})}^{2} : 0 < s + i \leq F (t, s_{0}, i_{0})}, \\ D_{T} : = {(s, i) \in R_{+}^{2} : 0 < s + i < F (T, s_{0}, i_{0})} . \end{matrix}

Remark 2.4

Note that if we consider a separable utility u, for example of the form in Sect. 2.2.3, the maximiser of the Hamiltonian is explicitly given by $b^{\circ} (s, i, p_{1} - p_{2})$ , where $b^{\circ}$ is defined for all $(s, i, z) \in {(R_{+}^{⋆})}^{2} \times R$ by

\begin{matrix} b^{\circ} (s, i, z) & : = β^{\max} 1_{{s i z < η_{p} ψ (t) (\bar{β} - β^{\max})}} + (\bar{β} - \frac{siz}{η_{p} ψ (t)}) 1_{{η_{p} ψ (t) (\bar{β} - β^{\max}) \leq s i z \leq \bar{β} η_{p} ψ (t)}} . \end{matrix}

2.11

In particular, the optimal interaction rate is given in this case by $β_{t}^{\circ} = b^{\circ} (S_{t}, I_{t}, (\partial_{s} v - \partial_{i} v) (t, S_{t}, I_{t}))$ , $t \in [0, T]$ .

The first-best case: without moral hazard

To find the optimal interaction rate $β \in B$ , as well as the optimal contract $(α, χ) \in A \times C$ , in the first-best case, one has to solve the government’s problem defined by (2.8). Mathematical details are postponed to Sect. 4.3.3, but we present here an overview of the main results. To take into account the participation constraint, one has to introduce the associated Lagrangian. Given a Lagrange multiplier $ϖ > 0$ , we first remark that the optimal tax is constant and given by $χ^{⋆} (ϖ) : = - (U^{'})^{(- 1)} (1 / ϖ) .$ Then, defining for any $ϖ > 0$

\begin{matrix} {\bar{V}}_{0} (ϖ) : = sup_{(α, β) \in A \times B} E [\int_{0}^{T} (ϖ u (t, β_{t}, I_{t}) - c (I_{t}) - k (t, α_{t}, S_{t}, I_{t})) d t], \end{matrix}

2.12

we have

\begin{matrix} V_{0}^{P, FB} = inf_{ϖ > 0} {χ^{⋆} (ϖ) + ϖ (U (- χ^{⋆} (ϖ)) - \underline{v}) + {\bar{V}}_{0} (ϖ)} . \end{matrix}

2.13

Note that ${\bar{V}}_{0} (ϖ)$ is the value function of a standard stochastic control problem, and therefore we expect to have ${\bar{V}}_{0} (ϖ) = v^{ϖ} (0, s_{0}, i_{0})$ , for a function $v^{ϖ} : [0, T] \times R_{+}^{2} ⟶ R$ solution to the following HJB PDE

\begin{matrix} - \partial_{t} v^{ϖ} (t, s, i) + c (i) - (λ - μ s + ν i) \partial_{s} v^{ϖ} + (μ + ν + γ + ρ) i \partial_{i} v^{ϖ} \\ - H^{ϖ} (t, s, i, \partial v^{ϖ}, D^{2} v^{ϖ}) = 0, (t, s, i) \in D, \end{matrix}

with terminal condition $v^{ϖ} (T, s, i) = 0, (s, i) \in D_{T}$ , where the Hamiltonian is defined, for $t \in [0, T]$ , $(s, i) \in {(R_{+}^{⋆})}^{2}$ , $p : = (p_{1}, p_{2}) \in R^{2}$ and $M \in S^{2}$ by

\begin{matrix} H^{ϖ} (t, s, i, p, M) : = sup_{a \in A} {sup_{b \in B} {ϖ u (t, b, i) - b s i \sqrt{a} (p_{1} - p_{2})} \\ - k (t, a, s, i) + \frac{1}{2} σ^{2} {(s i)}^{2} a^{2} (M_{11} - 2 M_{12} + M_{22})} . \end{matrix}

Remark 2.5

If we consider for instance the utilities given in Sect. 2.2.3 for the utility u, the optimal interaction rate is given for all $t \in [0, T]$ by $β_{t}^{ϖ} = b^{ϖ} (S_{t}, I_{t}, \partial v^{ϖ} (t, S_{t}, I_{t}), α_{t})$ , for $α \in A$ and a Lagrange multiplier $ϖ > 0$ , where $b^{ϖ} (s, i, p, a) : = b^{\circ} (s, i, \sqrt{a} (p_{1} - p_{2}) / ϖ), for all (s, i, p, a) \in {(R_{+}^{⋆})}^{2} \times R^{2} \times A,$ recalling that $b^{\circ}$ is defined by (2.11).

Relevant form of tax policy

Let us now return to the main problem, i.e. the case with moral hazard. One of the main theoretical result of our study is given by Theorem 4.7. Informally, this theorem states that given an admissible contract, namely a testing policy $α \in A$ and a tax $χ \in C$ , there exist a unique $Y_{0}$ and Z such that the following representation holds

\begin{matrix} U (- χ) = & Y_{0} - \int_{0}^{T} (Z_{t} (μ + ν + γ + ρ) I_{t} + u (t, β_{t}^{⋆}, I_{t}) - β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t \\ - \int_{0}^{T} Z_{t} d I_{t}, \end{matrix}

2.14

where $β^{⋆}$ is the unique optimal contact rate for the population. More precisely, we can state that for (Lebesgue–almost every) $t \in [0, T]$ , $β_{t}^{⋆} : = b^{⋆} (t, S_{t}, I_{t}, Z_{t})$ is the maximiser of the function $b \in B ⟼ u (t, b, I_{t}) - b S_{t} I_{t} Z_{t}$ . Under some assumptions for existence and smoothness of the inverse of the function U, the previous equation naturally gives a representation for the tax $χ$ . Based on (2.14), the tax $χ$ will be indexed on the variation of the proportion of infected I, through the stochastic integral $\int_{0}^{\cdot} Z_{s} d I_{s}$ , and not on the variation of susceptible S (though it is indexed on S through the $d t$ integral). Nevertheless, using the link between the dynamics of I and S, we can write

\begin{matrix} U (- χ) = & Y_{0} - \int_{0}^{T} (u (t, β_{t}^{⋆}, I_{t}) - β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t} - Z_{t} (λ - μ S_{s} + ν I_{s})) d t \\ + \int_{0}^{T} Z_{t} d S_{t} . \end{matrix}

2.15

Through this equation, we can state that the tax can alternatively be indexed on S instead of I. Therefore, given the strong link between the number of Susceptible and the number of Infected, it is sufficient to index the tax on only one of these two quantities, and one can therefore choose indifferently to index the tax $χ$ on the variations of I or S. The reader familiar with contract theory in continuous-time will have noticed that the previous representation for the tax $χ$ is not exactly the expected one. Indeed, referring for instance to Cvitanić et al. (2018) the contract is usually the sum of three components: a constant similar to $Y_{0}$ , chosen by the Principal in order to satisfy the participation constraint of the Agent; an integral with respect to time $t \in [0, T]$ of the agent’s Hamiltonian; a stochastic integral with respect to the controlled process, i.e., in our framework, (S, I). Neither the representation (2.14) nor (2.15) are, a priori of this form. This difference is due to the fact that the dynamics of (S, I) is degenerated. More precisely, there is a fundamental structure condition in Cvitanić et al. (2018) requiring that the drift of the output process belongs to the range of its volatility. In words, defining for $(s, i) \in R_{+}^{2}$ and $(a, b) \in A \times B$ ,

\begin{matrix} σ (i, s, a) : = σ a s i (\begin{matrix} 1 \\ - 1 \end{matrix}), and λ (s, i, b, a) : = (\begin{matrix} λ - μ s + ν i + b \sqrt{a} s i \\ - (μ + ν + γ + ρ) i + b \sqrt{a} s i \end{matrix}), \end{matrix}

the condition assumed in Cvitanić et al. (2018, Eq. (2.1)) implies that $λ (s, i, b, a) \propto σ (i, s, a)$ , for any $(s, i, a, b) \in R_{+}^{2} \times A \times B$ , which is obviously impossible here. Therefore, we cannot use directly any existing result in the literature, and we should not expect, a priori, to be able to obtain a contract representation similar to the one in Cvitanić et al. (2018), nor that the so-called dynamic programming approach will prove effective in our case. Indeed, as far as we know, such degenerate models have only been tackled using the stochastic maximum principle, see Hu et al. (2019). However, and somewhat surprisingly, the form we exhibit for the tax is actually strongly related to the usual representation. The reason for this is twofold. First, up to the sign, the volatilities in the dynamics of both S and I are exactly the same. Second, both the processes S and I are driven by the same Brownian motion W. Therefore, intuitively, in order to provide incentives to the population, the government can afford to index the tax on only one of the two processes. Mathematically, it is also straightforward to show that given an arbitrary decomposition of the process Z in Eq. (2.14) of the form $Z = : Z^{s} - Z^{i}$ , we have exactly the general form provided in Cvitanić et al. (2018). The main difference is that in Cvitanić et al. (2018), $Z^{s}$ and $Z^{i}$ are both uniquely given, while in our representation, only their difference actually matters. Hence, there is an infinite number of possible representations for the tax $χ$ in our degenerate model.

Government’s problem in the general case

Thanks to the reasoning developed in Sect. 4, we are able to determine the optimal design of the fine policy and the associated optimal effort of the population. In particular, as informally explained in the previous section, to implement a tax policy $χ \in C$ , the government only needs to choose a constant $Y_{0}$ and a process Z. Given these two parameters, we can state that the optimal contact rate for the population is defined by $β_{t}^{⋆} : = b^{⋆} (t, S_{t}, I_{t}, Z_{t}, α_{t})$ , such that the function $b \in B ⟼ u (t, b, I_{t}) - b \sqrt{α_{t}} S_{t} I_{t} Z_{t}$ is maximised for (Lebesgue–almost every) $t \in [0, T]$ .11 It thus remains to solve the government’s problem in order to determine the optimal choice of $Y_{0}$ and Z. The reader is referred to Sect. 4.3 for the rigorous government’s problem, but, to summarise the results, the optimal process Z as well as the optimal testing policy $α$ are determined so as to maximise the government’s Hamiltonian, given by

\begin{matrix} H^{P} (t, s, i, p, M) & = sup_{z \in R, a \in A} {b^{⋆} (t, s, i, z, a) \sqrt{a} s i (p_{2} - p_{1}) \\ + \frac{1}{2} σ^{2} a^{2} {(s i)}^{2} f (z, M) - k (t, a, s, i) - u^{⋆} (t, s, i, z, a) p_{3}} \\ + (λ - μ s + ν i) p_{1} - (μ + ν + γ + ρ) i p_{2} - c (i), \end{matrix}

for $(t, s, i, p, M) \in [0, T] \times R_{+}^{2} \times R^{3} \times S^{3}$ , and where, in addition for $z \in R$ ,

\begin{matrix} f (z, M) : = M_{11} - 2 M_{12} + M_{22} - 2 z (M_{23} - M_{13}) + z^{2} M_{33}, and \\ u^{⋆} (t, s, i, z, a) : = u (t, b^{⋆} (t, s, i, z, a), i) . \end{matrix}

Finally, it remains to solve numerically the following HJB equation, for all $t \in [0, T]$ and $x : = (s, i, y) \in R^{3}$

\begin{matrix} - \partial_{t} v (t, x) - H^{P} (t, x, \nabla_{x} v, D_{x}^{2} v) = 0, (t, x) \in O, v (T, x) = - U^{(- 1)} (y), x \in O_{T}, \end{matrix}

2.16

where the natural domain over which the above PDE must be solved is

\begin{matrix} O : = {(t, s, i, y) \in [0, T) \times R_{+}^{2} \times R : 0 < s + i < F (t, s_{0}, i_{0})}, \\ O_{T} : = {(s, i, y) \in R_{+}^{2} \times R : 0 < s + i < F (T, s_{0}, i_{0})} . \end{matrix}

Numerical experiments

The results presented in Sect. 2.3 are quite theoretical: except for the optimal transmission rate, it is complicated to obtain explicit formulae for the other variables sought, in particular for the optimal testing policy $α$ , even if we consider separable utility functions as in Sect. 2.2.3. It is therefore necessary to perform numerical simulations to evaluate the optimal efforts of the population and the government, as well as the optimal tax policy. Given the similarities in the results between the SIS and SIR models, only those related to the SIR model are presented in this section. The reader will find in Hubert et al. (2020) the results corresponding to the SIS model.

Choice of parameters

The set of parameters used for the simulations of the epidemic dynamics given by (2.1) are provided in Table 1 and are inspired by those chosen by Élie et al. (2020). Recall that the parameter $\bar{β}$ denotes the usual contact rate within the population, before the beginning of the lockdown. In other words, $\bar{β}$ represents the initial and effective transmission rate of the disease, without any specific effort of the population. The associated reproduction number $R_{0}$ , commonly defined by $R_{0} : = \bar{β} / (ν + ρ)$ in the literature on epidemic models, is equal to 2.0, and is thus in the confidence interval of available data, see for example Li et al. (2020). Then, the parameters $λ$ and $μ$ represent respectively the birth and (natural) death rates among the population, and therefore reflect the demographic dynamics unrelated to the epidemic, while $γ$ represents the death rate associated to the disease. To simplify, and since the duration of the COVID-19 epidemic should be relatively short in comparison to the life expectancy at birth, we choose to disregard the demographic dynamics by setting $λ = μ = 0$ . In contrast, we set $γ = 1 %$ , since the mortality associated with the disease appears to be significant. Finally, recall that the parameters $ν$ and $ρ$ correspond respectively to the recovery rates in the SIS and SIR models. Since we want to consider here a SIR dynamic, we let $ν = 0$ and $ρ = 0.1$ , to account for the average 10-day duration of COVID-19 disease.

Table 1.

Set of parameters for the simulations of SIR model

T (days)	$(s_{0}, i_{0}, r_{0})$	$(λ, μ)$	$γ$	$ν$	$ρ$	$σ$	$\bar{β}$
200	$(0.99984, 1.07 \times 10^{- 4}, 5.3 \times 10^{- 5})$	(0, 0)	0.01	0	0.1	0.1	0.2

Open in a new tab

In addition, the following numerical experiments are implemented using the utility and cost functions mentioned in Sect. 2.2.3. These functions require to specify several parameters, provided in Table 2.

Table 2.

Set of parameters for cost and utility functions

(a) Characteristics of the population
Parameters	$c_{p}$	$η_{p}$	$θ_{p}$	$τ_{p}$	$ϕ_{p}$	$β^{\max}$
Values	0.5	1	4	0	0.5	0.2

(b) Characteristics of the government
Parameters	$κ_{g}$	$c_{g}$	$η_{g}$	$ε$
Values	0.001	1	0.01	0.01

Open in a new tab

When not explicitly specified, the simulations presented in this section are performed with the sets of parameters described in Tables 1 and 2. However, the parameters used to describe in particular the utility and cost functions of the population and government are set in a relatively arbitrary way. To actually estimate these parameters would require an extensive sociological and economic study, that we do not presume to be able to perform at this stage, and linking, for example, the population’s costs to the DALY/QALY concepts already mentioned, and the government’s costs to those of the health care system and its possible congestion. Moreover, there is considerable uncertainty in the literature on the choice of all parameters used to describe the dynamics of the epidemic, in particular because the COVID-19 is a new type of virus. It will therefore be necessary to study the sensitivity of the results obtained with respect to the selected parameters.

Finally, it should be remembered that, in contrast to usual principal–agent problems, the government implements a mandatory tax, which the population cannot refuse. Nevertheless, we consider that the government is benevolent, in the sense that it still wishes to ensure that the utility of the population remains above a certain level, denoted by $\underline{v}$ . To fix this level, we assume that the government wants to ensure at the very least to the population the same living conditions it would have had in the event of an uncontrolled epidemic, i.e., without any effort on the part of neither the population nor the government, meaning $β = \bar{β}$ , $α = 1$ and $χ = 0$ . Mathematically, this is equivalent to the following, since u is separable of the form (2.4), such that for all $t \in [0, T]$ , $u_{β} (t, \bar{β}) = 0$ and $u_{I}$ satisfies (2.5)

\begin{matrix} \underline{v} : = E^{P^{1, \bar{β}}} [\int_{0}^{T} u (t, \bar{β}, I_{t}) d t + U (0)] = E^{P^{1, \bar{β}}} [- \int_{0}^{T} (u_{I} (I_{t}) + u_{β} (t, \bar{β})) d t] \\ = - c_{p} E^{P^{1, \bar{β}}} [\int_{0}^{T} I_{t}^{3} d t] . \end{matrix}

3.1

Notice that the reservation utility $\underline{v}$ is given by the worst case scenario, without any sanitary precaution neither from the population nor from the government. This level may be judged too severe, and one could consider a model where the government is more benevolent. Nevertheless, the value of $\underline{v}$ should not be of major importance, since it should only impact the initial value $Y_{0}$ .

Numerical approach

In order to solve Eq. (2.10) corresponding to the population’s problem in the benchmark case, as well as Eq. (2.16) for the government’s problem, we need a method permitting to deal with degenerate HJB equations. We choose to implement semi-Lagrangian schemes, first proposed in Camilli and Falcone (1995). These are explicit schemes using a given time-step $Δ t$ , and requiring interpolation on the grid of points where the equation is solved. This interpolation can be either linear, as proposed in Camilli and Falcone (1995), or using some truncated higher-order interpolators, as proposed by Warin (2016), leading to convergence of the numerical solution to the viscosity solution of the problem. A key point here, which makes the approach delicate, is that the domain over which the PDEs are solved is unbounded. It is therefore necessary to define a so-called resolution domain, over which the numerical solution will be actually computed, which on the one hand must be large enough, and which on the other hand creates additional difficulties in the treatment of newly introduced boundary conditions. In order to treat these issues, we use two special tricks:

(i)
picking randomly the control in (2.1) for the benchmark case, and in (4.15) for the general case, and using the forward SDE with an Euler scheme, a Monte Carlo method allows us to get an envelop of the reachable domain with a high probability. Then, given a discretisation step, the grid of points used by the semi-Lagrangian scheme is defined at each time-step with bounds set by the reachable domain estimated by Monte Carlo. Therefore, at time 0, the grid is represented by one mesh, while their number can reach millions near T;
(ii)
since the scheme is explicit, starting at t, it requires to use only some discretisation points at date $t + Δ t$ , and a modification of the scheme is implemented to use only points inside the grid at date $t + Δ t$ , as shown in Warin (2016).

Lastly, in dimension 3 or above, parallelisation techniques defined in Warin (2016) have to be used. The numerical results below are obtained using the StOpt library, see Gevret et al. (2018).

The benchmark case

We first focus on the benchmark case, when the government does not implement any particular policy to tackle the epidemic, i.e., $α = 1$ and $χ = 0$ . Recall that in this case, the population’s problem is given by (2.7), and is then equivalent to solving the HJB equation (2.10). For our simulations, we choose a number of time-steps equal to 200, and a discretisation step equal to 0.0025. The interpolator is chosen linear, and the optimal command $b^{\circ}$ used to maximise the Hamiltonian is discretised with 200 points given a step discretisation of 0.005. Once the PDE is solved, a forward Euler scheme is used to obtain trajectories of the optimally controlled S and I, meaning with the optimal transmission rate $b^{\circ}$ . In order to check the accuracy of the method described in Sect. 3.2, we implement two versions of the resolution: the first version is a direct resolution of (2.10) with the Hamiltonian (2.9); the second one relies on a change of variable. More precisely, we consider $(s, x : = (s + i))$ as state variables, instead of (s, i), and then solve the problem (2.10), but with a slightly modified Hamiltonian to take into account this change of variable. The advantage of the second representation is that the dispersion of $I_{t} + S_{t}$ is zero and thus smaller than the one of $I_{t}$ , leading to the use of grids with a smaller number of points. First, to give an overview of the overall trend, we plot, on Fig. 2, 100 trajectories of the optimal interaction rate $β^{⋆}$ , and the associated proportions $S_{t}$ and $I_{t}$ of susceptible and infected, using the resolution method (i) mentioned above, i.e., with state variables (S, I). For more accurate trajectories, we compare on Fig. 3 two different trajectories of the optimal interaction rate $β^{⋆}$ , together with the corresponding dynamic of the proportion I of infected. For these two simulations, we compare the results given by the two aforementioned methods. More precisely, while the blue curve is obtained through the direct resolution, the orange one results from the second method, i.e., with state variables $(S, S - I)$ . Finally, on Fig. 4, we test the influence of the parameter $τ_{p}$ by setting $τ_{p} = 0.01$ , instead of 0.

Fig. 2 — Dispersion of 1000 simulations with respect to time of the SIR model in the benchmark case

Fig. 3 — The optimal transmission rate $β$ and the resulting proportion I in the benchmark case Comparison between of the two methods aforementioned on two simulations

Fig. 4 — Dispersion of simulations of the SIR model in the benchmark case with $τ_{p} = 0.01$

Voluntary lockdown of the population As expected, the optimal behaviour $β^{⋆}$ is to start close to $\bar{β}$ , then to decreases as the disease spreads in the population. More specifically, two waves of effort can be observed: the first one delays the acceleration of the epidemic, and the second, generally more significant, takes place during the peak of the epidemic. Approaching the fixed maturity, individuals come back to their usual behaviour $\bar{β}$ .

Sensitivity with respect to the method As we can see in Fig. 3 (top), the optimal effort exhibits the same features as those previously described. Moreover, the blue curve and the orange curve, representing respectively the results of the two aforementioned methods, are very close, except at the beginning of the time interval, probably because of the very small initial value $i_{0}$ . Nevertheless, we can see that the two methods lead to the same dynamic for the proportion of infected, since the two curves are almost superposed. Therefore, a small error on the computation of the optimal effort at the beginning does not impact the optimally controlled trajectories of I. The resolution with respect to $(s, s + i)$ seems to be more regular, and may give a command closer to the analytical one.

The fear of the infection is not enough Without a proper government policy to encourage the lockdown, the natural reduction of the interaction rate among individuals is not sufficient to contain the disease, so that it spreads with a high infection peak, up to 0.175. As a result, even if at the end of the time interval under consideration, the epidemic appears to be over, between 60 and $80 %$ of the population has been contaminated by the virus, since the proportion S at time $T = 200$ lies between 0.2 and 0.4. In conclusion, without some governmental measures, the fear of the epidemic is not sufficient to encourage the population to make sufficient effort, in order to significantly reduce the rate of transmission of the disease. The introduction by the government of an effective lockdown policy together with an active testing policy should improve the results of the benchmark case, in particular by reducing the peak of infection and the total number of infected people over the considered period.

The lockdown fatigue By setting $τ_{p} = 0.01$ instead of 0, the cost of the lockdown from the population’s point of view is now increasing with time. This allows to take into account the possible fatigue the population may suffer if the lockdown continues for too long. As expected, by comparing Figs. 2 and 4, the impatience of the population gives higher values of optimal interaction rate $β$ . Moreover, we can see that the second wave of effort is more impacted (i.e., the contact rate is less reduced) by the impatience of the population than the first one.

Lockdown policy, without testing

We focus in this section on the tax policy, by assuming that $A = {1}$ . In such a situation, i.e., without a proper testing policy, the detection and hence the isolation of ill people becomes very intricate. This case is interesting, as it corresponds to the lockdown policy that most of western countries have implemented in 2020, when faced with the COVID-19 disease, while a very small number of tests was available. Indeed, most countries put in place systems of fines, or even prison sentences, to incentivise people to lockdown. Although the penalties for non-compliance are not as sophisticated as in our model, most governments did adapt the level of penalties according to the stage of the epidemic: higher fines during periods of strict lockdown (hence at the peak of the epidemic), or in case of recidivism, for example. This reflects the adjustment of sanctions in many countries according to the health situation, and therefore a notion of dynamic adaptation to circumstances, which is exactly what is suggested by our tax system. Though it is clear that our model is different from reality, since in most countries, the fine is paid by a particular individual who has not complied with the injunctions, we still believe it allows to highlight sensible guidelines.

The numerical approach is highly similar to the method used to solve the benchmark case. One difference is that we have to estimate the reservation utility of the population, namely $\underline{v}$ , given by (3.1). Using a Monte Carlo method and a Euler scheme with a time-discretisation of 200 time-steps and $10^{6}$ trajectories, we obtain an approximated value $\underline{v} = - 0.02937$ . Then, we can solve (2.16) through the aforementioned semi-Lagrangian scheme, with 200 time steps, as well as a step discretisation for the grid in (s, i, y) corresponding to (0.0025, 0.0025, 0.005), leading to a number of meshes at maturity equal to $250 \times 70 \times 800$ . A last technical point concerning the domain of the control Z. Although this control of the government, used to index the tax on the proportion of infected, can take high values, we have to bound its domain in order to perform the numerical simulations. We choose to restrict its domain to an interval $[- Z_{\max}, Z_{\max}]$ , and consider a discretisation step equal to 0.5. One would naturally expect that a larger choice would lead to somewhat better solutions. However, this neglects a fundamental numerical issue: large values of Z increase the numerical cost, as they enlarge the volatility of the process Y (given by $σ Z I S$ ). As such, since the volatility cone becomes larger, it is necessary to sample a much larger grid in order to be able to cover the region were Y will most likely take its values. Too large values of $Z_{\max}$ therefore become numerically intractable, unless one is willing to sacrifice accuracy. A balance need to be struck, which is why we capped $Z_{maz}$ at 30.

First, we present in Fig. 5 different trajectories of the proportion I of infected when the government implements the optimal tax policy, and compare it to the trajectories obtained in the benchmark case. As mentioned before, we also want to study the sensibility with respect to the arbitrary bound $Z_{\max}$ , and we thus represent the paths of I in three cases, in addition to the benchmark case: for $Z_{\max} = 10$ (orange curves), $Z_{\max} = 20$ (green), and $Z_{\max} = 30$ (red). Then, the corresponding simulations of the optimal control Z of the government, used to index the tax on the proportion of infected, are given in Fig. 6. We compare optimal controls $β$ and Z for the tax policy with different lockdown time period in Fig. 7. Finally, Fig. 8 regroups the simulations of the optimal transmission rate $β^{⋆}$ obtained with the tax policy, and compare it to $β^{\circ}$ obtained in the benchmark case.

Fig. 6 — Optimal trajectories of the control Z without testing. Comparison for different values of $Z_{\max}$ , with $A = {1}$

Fig. 7 — Maturity effect for the tax policy in the SIR model Comparison of the optimal trajectories of Z for $T = 200$ and $T = 250$ , with $Z_{\max} = 30$

Fig. 8 — Optimal transmission rate $β$ without testing Comparison for different $Z_{\max}$ and with the benchmark case, in the case $A = {1}$

The epidemic is at best contained, and at worst delayed Compared to the benchmark case, we observe in Fig. 5 that the optimal lockdown policy prevents the epidemic peak by maintaining low levels of infection during the lockdown period. Therefore, the government has more time to prepare for a possible infection peak after the lockdown, specifically to increase hospital capacity and provide safety equipment (surgical masks, hydro-alcoholic gel, respirators...). The government can also use this time to fund the development of tests to detect the virus, as well as the research on a vaccine or a remedy for the related disease. However, we can see that at the end of the lockdown period, in many cases the virus is not exterminated and the epidemic may even restart. This is also illustrated by Fig. 9, representing the dispersion of 500 trajectories of I, obtained with the optimal control. Such a phenomenon can be understood as follows: the lockdown slows down the epidemic, so that a very small proportion of the population has been infected and is therefore immune. We thus cannot rely on herd immunity, which is reached here if at least 50% of the population has been contaminated, to prevent a resurgence of the epidemic. Consequently, this lockdown policy is a powerful leverage to delay an epidemic, but this tool needs to be supplemented by alternative policies. If the time saved through lockdown is not exploited, it will have no impact on the final consequences of the epidemic.

Fig. 9 — Dispersion of simulations of the proportion I of infected in the SIR model Comparison between the case with tax policy (but without testing) on the left and the benchmark case on the right

Policy implications We first remark in Fig. 6 that the shape of the optimal indexation parameter rate Z remains the same, regardless of the simulation and the value of $Z_{\max}$ . More importantly, we will see that the paths of the optimal transmission rate associated to different $Z_{\max}$ , are almost superposed. As a consequence, and as previously exhibited in Fig. 5, the value of $Z_{\max}$ has a minor impact on the trajectories of I itself. On the shape of the control Z, we remark that it first takes the most negative value possible ( $- Z_{\max}$ ) for about 20 days, then increases almost instantaneously to reach the maximum value $Z_{\max}$ , before slowly decreasing to 0. Therefore, the optimal tax scheme set by the government is as follows. First, at the beginning of the epidemic, it seems optimal to give to the population a compensation as high as possible, by setting $Z = - Z_{\max}$ . Though this may be a numerical artefact, the fact that this appeared in all our simulations tends to show that it is actually significant. We interpret this as the government anticipating the negative consequences of the lockdown policy by immediately providing monetary relief to the population. This is exactly what happened in several countries, for instance in the USA with stimulus checks sent to every citizen, and our model endogenously reproduces this aspect. Policy-wise, it shows that maximum efficiency for such packages is attained when they are provided to the population as early as possible.

Approaching the maturity, the government eases the lockdown. However, this may be premature, since we have observed in the previous figures that the epidemic may restart at the end of the period. Indeed, considering a final time horizon is equivalent to assuming that ‘the world’ stops at that time: costs generated by the epidemic after T are not taken into account. Nevertheless, this boundary effect has no impact on the previous results and interpretations. Indeed, we remark that if we consider a more distant time T, the lockdown certainly lasts longer, but follows the exact same patterns (see Fig. 7 below). Moreover, the lockdown period should still end at some time, which is why a finite terminal time is assumed. This time may correspond to an estimate of the time needed to implement other more sustainable policies, such as the implementation of an active testing policy, or to wait for the discovery of a vaccine.

Optimal tax sensitivity with respect to the lockdown duration On Fig. 7, we give two trajectories of the optimal contact rate $β$ and the optimal Z for two different maturities. It is clear that both trajectories follow the same paths until some point. Regardless of the maturity, the contact rate $β$ and the parameter Z have the same characteristics as those shown respectively in Figs. 6 and 8. As one approaches the shortest maturity, i.e. $T = 200$ , the parameter Z decreases towards 0, while the other remains at the maximum, and decreases later. Therefore, the fact that Z decreases at maturity, as mentioned above, appears to be a boundary effect.

Optimal interaction rate and comparison with the benchmark case In the beginning, recall that Z is negative, meaning that the tax is negatively indexed on the variation of I. In other words, since I is globally (but very slightly) increasing at the beginning of the epidemic, the compensation increases with I, which means that the population is not incentivised at all to decrease their contact rate, and thus the transmission rate of the virus, which remains equal to the initial level $\bar{β}$ . Then, as the epidemic spreads, Z becomes very high, which now incentivises the population to reduce the transmission rate below $\bar{β}$ . Finally, near the end of the lockdown period, Z plunges to zero, which naturally implies that the optimal contact rate $β^{⋆}$ goes back to its usual level $\bar{β}$ .

Tax policy with testing

In this section, we now study the case where the government can implement an active testing policy, in addition to the incentive policy for lockdown, to contain the spread of the epidemic. This policy is similar to the one adopted by most European governments in June 2020, after relatively strict containment periods and at a time when the COVID-19 epidemic seemed to be under control. Indeed, the lockdown periods in Europe have generally made it possible to delay the epidemic, and thus to give public authorities time to prepare a meaningful testing policy. This has two major interests. First, it allows the identification of clusters, and therefore provides a more precise knowledge of the dynamics of the epidemic in real time. Second, by identifying infected people, we can force them to remain isolated. Thus, by developing a robust testing policy, public authorities can in fact relax the lockdown while keeping the rate of disease transmission at a sufficiently low level. Therefore, comparing with the no-testing policy case, we expect that

(i)
the government will be able to control the epidemic at least as well as with just the lockdown policy;
(ii)
it will allow the population to regain a contact rate closer to the desired and initial level $\bar{β}$ .

To study the optimal testing policy $α^{⋆}$ , taking values in $A : = [ε, 1]$ , we consider the cost of effort k given in Sect. 2.2.3. This cost function emphasises the fact that testing the entire population every day is inconceivable, and therefore results in an explosion of cost when $α$ takes values close to 0. Recall that the parameters for the function k, namely $κ_{g}$ and $η_{g}$ are given in Table 2b. Finally, A is discretised with a step equal to 0.05 and we consider $Z_{\max} = 30$ .

Relaxed lockdown but lower effective transmission rate First, comparing Figs. 6 and 10, the optimal control Z presents the same shape in both cases, except at the beginning, since now Z is not negative initially. In fact, we observe that the government is asking for less effort from the population, and therefore the initial stimulus mentioned in the paragraph ‘Policy implications’ still happens, but later and for a much shorter length. Figure 13 also shows that the optimal contact rate is closer to the initial level $\bar{β}$ , which should induce a more violent spread of the disease. Nevertheless, the control $α$ , representing the testing policy and given by Fig. 11, balances this effect. Indeed, the testing allows an isolation of targeted infected individual, and therefore contribute to the decrease of the effective transmission rate of the disease, represented in Fig. 12. Therefore, comparing Fig. 14 with Fig. 9, we notice that the control of the epidemic is more efficient than in the case $A = {1}$ , since the proportion of infected is globally decreased. Finally, Fig. 14 gives a global overview with the dispersion of 500 simulations for the optimal controls $α$ and Z as well as for the proportion I of infected, which confirms the intuition given by the three selected ones (Figs. 13, 14).

Fig. 10 — Optimal trajectories of Z with testing policy

Fig. 13 — Dispersion of simulations of the transmission rate with testing policy

Fig. 11 — Optimal trajectories of the testing policy $α$

Fig. 12 — Optimal effective transmission rate $β \sqrt{α}$ with testing policy Comparison between the three cases, the benchmark, with, and without testing

Fig. 14 — Dispersion of simulations of optimal government’s controls, with the resulting trajectories of I

The first-best case

First, remark that, with the particular choice of utility functions, we have

\begin{matrix} χ^{⋆} (ϖ) = \frac{1}{θ_{p}} ln (\frac{1}{ϖ} - ϕ_{p}), if 0 < ϖ < \frac{1}{ϕ_{P}} = 2 . \end{matrix}

Otherwise, if $ϖ \geq 2$ , the optimal tax policy is equal to $- \infty$ , which cannot be optimal from the government’s point of view, since it leads to an infimum on $ϖ$ equal to $+ \infty$ [see (2.13)]. For each value of the Lagrange parameter, a two dimensional PDE with a two-dimensional control $(α, β)$ is considered. A step discretisation for the grid in (s, i) is taken equal to (0.001, 0.001). $A = [ε, 1]$ is discretised with 20 values and the values of $β$ are discretised with 80 equally spaced values (to reduce the cost of optimisation). We then search for the optimal $ϖ$ parameter with a step of 0.01 within the interval (0, 2). We obtain in this case an optimal value equal to 0.64 and we give on Fig. 15 the results, which show in particular that the epidemic is controlled in a similar way as in the second-best case, with incentives and testing policy.

Fig. 15 — Dispersion of 500 trajectories obtained in the first-best case

The shape of the optimal controls $β$ and $α$ , as well as the trajectories for the proportion I of infected, are highly similar to those obtained in the previous case. The only clear difference is the principal’s value. Indeed, we can compare the optimal value $V_{0}^{P}$ for the government in the moral hazard case, to the first best value $V_{0}^{P, FB}$ . Using $10^{4}$ trajectories and the previously optimal control computed, we estimate $V_{0}^{P, FB} = - 0.249$ while $V_{0}^{P} = - 0.287$ . The difference between the two values, with a relative difference of $15 %$ only pleads in favour of our incentive model: even without being able to track all the population, governments can achieve containment strategies with very similar levels of efficiency, and costs which are not significantly higher. This is of course partly explained by the fact that the testing is profitable both for the government and for the population, as it allows for values of $β$ close to $\bar{β}$ , as shown on Fig. 15.

Incentive policy for epidemic stochastic models

The stochastic model

Initial canonical space

We fix a small parameter $ε \in (0, 1)$ to consider the subset $A : = [ε, 1]$ . $A$ is the set of all finite and positive Borel measures on $[0, T] \times A$ , whose projection on [0, T] is the Lebesgue measure. Every $q \in A$ can be disintegrated as $q (d s, d v) = q_{s} (d v) d s$ , for an appropriate Borel measurable kernel ${(q_{s})}_{s \in [0, T]}$ . We then define the following canonical space $Ø m e g a : = C^{2} \times A,$ whose canonical process is denoted by $(S, I, Λ)$ , in the sense that

\begin{matrix} S_{t} (s, ι, q) : = s (t), I_{t} (s, ι, q) : = ι (t), Λ (s, ι, q) : = q, \forall (t, s, ι, q) \in [0, T] \times Ø m e g a . \end{matrix}

We let $F$ be the Borel $σ$ -algebra on $Ø m e g a$ , and $F : = {(F_{t})}_{t \in [0, T]}$ be the natural filtration of the canonical process

\begin{matrix} F_{t} : = σ ((S_{s}, I_{s}, Δ_{s} (Υ)) : (s, Υ) \in [0, t] \times C_{b} ([0, T] \times A, R)), t \in [0, T], \end{matrix}

where for any $(s, Υ) \in [0, T] \times C_{b} ([0, T] \times A, R)$ , $Δ_{s} (Υ) : = \iint_{[0, s] \times A} Υ (r, a) Λ (d r, d a) .$ Recall that in this framework $F = F_{T}$ . Let $M$ be the set of probability measures on $(Ø m e g a, F_{T})$ . For any $P \in M$ , we let $N^{P}$ be the collection of all $P$ -null sets, that is to say $N^{P} : = {N \in 2^{Ø} m e g a : \exists N^{'} \in F_{T}, N \subset N^{'}, P [N^{'}] = 0},$ where we recall that $2^{Ø} m e g a$ represents the set of all subsets of $Ø m e g a$ , and we let $F^{P} : = {(F_{t}^{P})}_{t \in [0, T]}$ be the $P$ -augmentation of $F$ , where $F_{t}^{P} : = F_{t} \lor σ (N^{P})$ . We let $F^{P +} : = {(F_{t}^{P +})}_{t \in [0, T]}$ the corresponding right limit. Similarly, for any subset $Π \subset M$ , we let $F^{Π} : = {(F_{t}^{Π})}_{t \in [0, T]}$ be the $Π$ -universal completion of $F$ . Fix some initial values $(s_{0}, i_{0}) \in R_{+}^{2}$ ,12 and let us introduce the drift and volatility functions for our controlled model, namely $B : R^{2} ⟶ R^{2}$ and $Σ : R^{2} \times A ⟶ R^{2}$ , defined by

\begin{matrix} B (x, y) : = (\begin{matrix} λ - μ x + ν y \\ - (μ + ν + γ + ρ) y \end{matrix}), Σ (x, y, a) : = (\begin{matrix} σ a x y \\ - σ a x y \end{matrix}), (x, y, a) \in R^{2} \times A, \end{matrix}

where the parameters $(λ, μ, ν, γ, σ) \in {[0, \infty)}^{4} \times R_{+}^{⋆}$ are given. For any $(s, φ) \in [0, T] \times C_{b}^{2} (R^{2}, R)$ , we set

\begin{matrix} M_{s} (φ) & : = φ (S_{s}, I_{s}) - \iint_{[0, s] \times A} (B (S_{r}, I_{r}) \cdot \nabla φ (S_{r}, I_{r}) \\ + \frac{1}{2} Tr [D^{2} φ (S_{r}, I_{r}) (Σ Σ^{⊤}) (S_{r}, I_{r}, a)]) Λ (d r, d a) . \end{matrix}

Definition 4.1

We define the subset $P \subset M$ as the one composed of all $P \in M$ such that

(i)
$M (φ)$ is an $(F, P)$ –local martingale on [0, T] for all $φ \in C_{b}^{2} (R^{2}, R) ;$
(ii)
$P [(S_{0}, I_{0}) = (s_{0}, i_{0})] = 1 ;$
(iii)
with $P$ -probability 1, the canonical process $Λ$ is of the form $δ_{ϕ_{\cdot}} (d v)$ for some Borel function $ϕ : [0, T] ⟼ A$ , where as usual, for any $a \in A$ , $δ_{a}$ is the Dirac mass at a.

We can follow Bichteler (1981), or Neufeld and Nutz (2014, Proposition 6.6) to define a pathwise version of the density of the quadratic variation of S, denoted by $\hat{σ} : [0, T] \times Ø m e g a ⟶ R$ , by ${\hat{σ}}_{t}^{2} (ω) : = lim {sup}_{n \to \infty} n ({⟨ S ⟩}_{t} (ω) - {⟨ S ⟩}_{t - 1 / n} (ω)), (t, ω) \in [0, T] \times Ø m e g a .$ Lévy’s characterisation of Brownian motion ensures that the process13

\begin{matrix} W_{t} : = \int_{0}^{T} {\hat{σ}}_{s}^{- 1 / 2} 1_{{\hat{σ}}_{s} \neq 0} d S_{s}, t \in [0, T], \end{matrix}

4.1

is an $(F^{P}, P)$ –Brownian motion for any $P \in P$ . For any $P \in P$ , we denote by $A_{o} (P)$ the set of $F$ -predictable and A-valued process $α : = {(α_{s})}_{s \in [0, T]}$ such that, $P$ –a.s.

\begin{matrix} \{\begin{matrix} S_{t} = s_{0} + \int_{0}^{t} (λ - μ S_{s} + ν I_{s}) d s + \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}, t \in [0, T], \\ I_{t} = i_{0} - \int_{0}^{t} (μ + γ + ν + ρ) I_{s} d s - \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}, t \in [0, T] . \end{matrix}) \end{matrix}

4.2

Once again, it is a classical result (see for instance Stroock and Varadhan 1997, Theorem 4.5.2, or Élie et al. 2021, Lemma 2.3) that $A_{o} (P)$ is not empty. We recall that the term $λ \geq 0$ denotes the birth rate, the parameter $μ \geq 0$ is the natural death rate in the population (susceptible and infected), $γ \geq 0$ is the death rate inside the infected population. The parameters $ν$ and $ρ$ correspond to recovery rates, depending on whether we are considering a SIS or a SIR model, see the remark below.

Remark 4.2

It can be noted that our model, which results from a mixing of the SIS and SIR models, can be interpreted as an SIR model with partial immunisation, in the sense that only a part of the population develops antibodies for the disease after being infected. Thus, a proportion $ρ$ of the infected moves to the class $R$ , and can no longer be infected. Conversely, the proportion of the infected who do not develop antibodies reverts to the class $S$ , and can therefore contract the disease again. This resulting model is similar to the one developed by Zhang et al. (2018) and called SISRS. This type of model seems in fact well suited to model epidemics related to new viruses, such as the COVID-19, when the immunity of infected persons has not yet been proved.

Before pursuing, we need a bit more notations, and will consider the following sets $A_{o} : = ⋃_{P \in P} A_{o} (P),$ as well as, for any $α \in A_{o}$ , $P (α) : = {P \in P : α \in A_{o} (P)} .$ We will require that the controls chosen by the government lead to only one weak solution to Eq. (4.2), and are such that the processes S and I remain non-negative. We will therefore concentrate our attention to the set $A$ of admissible controls defined by

\begin{matrix} A : = {α \in A_{o} : P (α) is a singleton {P^{α}}, and (S, I) is R_{+}^{2} - valued, P^{α} --a.s.} . \end{matrix}

Notice that the set $A$ is not empty since any constant A-valued process automatically belongs to $A$ , as a direct consequence of Gray et al. (2011, Sect. 3) or Gao et al. (2019, Lemma 2.3). Remark then that, for any $α \in A$ , we have ${\hat{σ}}_{t} = σ S_{t} I_{t} α_{t}, d P^{α} \otimes d t$ –a.e., and

\begin{matrix} S_{t} + I_{t} = s_{0} + i_{0} + \int_{0}^{t} (λ - μ (S_{s} + I_{s}) - (γ + ρ) I_{s}) d s, t \in [0, T], P^{α} --a.s. \end{matrix}

We thus deduce, using the positivity of S and I, that

\begin{matrix} 0 & \leq S_{t} + I_{t} = e^{- μ t} (s_{0} + i_{0}) \\ + \int_{0}^{t} e^{- μ (t - s)} (λ - (γ + ρ) I_{s}) d s \leq F (t, s_{0}, i_{0}), t \in [0, T], P^{α} --a.s., \end{matrix}

4.3

where for all $(t, s, i) \in [0, T] \times R_{+}^{2}$

\begin{matrix} F (t, s, i) : = e^{- μ t} (s + i) + λ (\frac{1 - e^{- μ t}}{μ} 1_{{μ > 0}} + t 1_{{μ = 0}}) . \end{matrix}

4.4

This result proves in particular that S and I are actually $P^{α}$ –almost surely bounded, for any $α \in A$ . Moreover, if $(s_{0}, i_{0}) \in {(R_{+}^{⋆})}^{2}$ , then for all $t \in [0, T]$ , both $S_{t}$ and $I_{t}$ are (strictly) positive.

Remark 4.3

Note that in the SIR model, described by the system (2.1) with $ν = 0$ , we have, for all $t \in [0, T]$ , $R_{t} = r_{0} e^{- μ t} + ρ \int_{0}^{t} I_{s} e^{- μ (t - s)} d s,$ so that $R_{t}$ depends only on the observation of $I_{s}$ for $s \leq t$ . In addition to that $0 \leq S_{t} + I_{t} + R_{t} \leq e^{- μ t} (s_{0} + i_{0} + r_{0}) + \int_{0}^{t} e^{- μ (t - s)} (λ - γ I_{s}) d s \leq F (t, s_{0}, i_{0}) + r_{0} e^{- μ t} .$

Impact of the interaction

The basic model from (4.2) takes into account the testing policy put into place by the government, but ignores so far the interacting behaviour of the population. We model this through an additional control process chosen by the population. More precisely, we fix some constant $β^{\max} > 0$ representing the maximum rate of interaction that can be considered, and we define $B : = [0, β^{\max}]$ . Let $B$ be the set of all $F$ -predictable and B-valued processes. Given a testing policy $α \in A$ implemented by the government, notice that the following stochastic exponential

\begin{matrix} (exp (- \int_{0}^{t} \frac{β_{s}}{σ \sqrt{α_{s}}} d W_{s} - \frac{1}{2} \int_{0}^{t} \frac{β_{s}^{2}}{σ^{2} α_{s}} d s))_{t \in [0, T]}, \end{matrix}

is an $(F, P^{α})$ -martingale, given that the process $β / (σ \sqrt{α})$ takes values in $[0, β^{\max} / (σ \sqrt{ε})]$ , $P^{α}$ –a.s. Therefore, for any $(α, β) \in A \times B$ , we can define a probability measure $P^{α, β}$ on $(Ø m e g a, F)$ , equivalent to $P^{α}$ . Using Girsanov’s theorem, $W_{t}^{β} : = W_{t} + \int_{0}^{t} \frac{β_{s}}{σ \sqrt{α_{s}}} d s, t \in [0, T],$ is an $(F, P^{α, β})$ –Brownian motion, and

\begin{matrix} \{\begin{matrix} S_{t} = s_{0} + \int_{0}^{t} (λ - μ S_{s} + ν I_{s} - β_{s} \sqrt{α_{s}} S_{s} I_{s}) d s \\ + \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}^{β}, t \in [0, T], \\ I_{t} = i_{0} - \int_{0}^{t} ((μ + ν + γ + ρ) I_{s} - β_{s} \sqrt{α_{s}} S_{s} I_{s}) d s \\ - \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}^{β}, t \in [0, T] . \end{matrix}) \end{matrix}

4.5

Optimisation problems

At time 0, the government informs the population about its testing policy $α \in A$ , as well as its fine policy $χ$ , which for now will be an $F_{T}$ -measurable and $R$ -valued random variable (a set we denote by $C$ ). The population solves the following optimal control problem

\begin{matrix} V_{0}^{A} (α, χ) : = & sup_{β \in B} J_{0}^{A} (α, χ, β), with J_{0}^{A} (α, χ, β) \\ : = & E^{P^{α, β}} [\int_{0}^{T} u (t, β_{t}, I_{t}) d t + U (- χ)] . \end{matrix}

4.6

The interpretation of the functions u and U is detailed in Sect. 2.2.1, where the population’s problem was informally introduced. For any $(α, χ) \in A \times C$ , we recall that we denoted by $B^{⋆} (α, χ)$ the set of optimal controls for $V_{0}^{A} (α, χ)$ :

\begin{matrix} B^{⋆} (α, χ) : = {β \in B : V_{0}^{A} (α, χ) = J_{0}^{A} (α, χ, β)} . \end{matrix}

4.7

We require minimal integrability assumptions at this stage, and insist that there exists some $p > 1$ such that

\begin{matrix} E^{P^{α}} [{| U (- χ) |}^{p}] < \infty, for any α \in A . \end{matrix}

4.8

Remark 4.4

Notice that since for any $α \in A$ the Radon–Nykodým density $d P^{α, β} / d P^{α}$ has moments of any order under $P^{α}$ (since any $β \in B$ is bounded and any $α \in A$ is bounded and bounded away from 0), a simple application of Hölder’s inequality ensures that (4.8) implies that for any $p^{'} \in (1, p)$ and any $β \in B$ , $E^{P^{α, β}} [| U (- χ) |^{p^{'}}] < \infty .$

Recall that the government can only implement policies $(α, χ) \in A \times C$ such that $V_{0}^{A} (α, χ) \geq \underline{v}$ , where the minimal utility $\underline{v} \in R$ is given. We denote the subset of $A \times C$ satisfying this constraint and Eq. (4.8) by $Ξ$ .

In line with the informal reasoning developed in Sect. 2.2.2, the government aims at minimising the number of infected people until the end of the lockdown period, and we write rigorously its minimisation problem as

\begin{matrix} V_{0}^{P} : = sup_{(α, χ) \in Ξ} sup_{β \in B^{⋆} (α, χ)} E^{P^{α, β}} [χ - \int_{0}^{T} (c (I_{t}) + k (t, α_{t}, S_{t}, I_{t})) d t], \end{matrix}

4.9

where the functions $c : R_{+} ⟶ R_{+}$ and $k : [0, T] \times A \times R_{+} \times R_{+} ⟶ R$ were introduced in Sect. 2.2.2.

Optimal interaction of the population given tax and test policies

A relevant contract form

Since the fine policy $χ$ is an $F_{T}$ -measurable random variable, where $F$ is the filtration generated by the process (S, I), we should expect that in general $V_{0}^{A} (α, χ) = v (0, s_{0}, i_{0})$ , where the map $v : [0, T] \times C^{2} ⟶ R$ satisfies an informal Hamilton Jacobi Bellman (HJB for short) equation, and as such has the dynamic

\begin{matrix} d v (t, S_{t}, I_{t}) = - H (S_{t}, I_{t}, Z_{t}^{s}, Z_{t}^{i}, α_{t}) d t + Z_{t}^{s} d S_{t} + Z_{t}^{i} d I_{t}, \end{matrix}

where the population’s Hamiltonian $H : [0, T] \times {(R_{+}^{⋆})}^{2} \times R^{2} \times A ⟶ R$ is defined by

\begin{matrix} H (t, s, i, z, z^{'}, a) \\ : = sup_{b \in B} h (t, s, i, z, z^{'}, a, b), (t, s, i, z, z^{'}, a) \in [0, T] \times {(R_{+}^{⋆})}^{2} \times R^{2} \times A \end{matrix}

where $h (t, s, i, z, z^{'}, a, b) : = (λ - μ s + ν i - b \sqrt{a} s i) z - ((μ + ν + γ + ρ) i - b \sqrt{a} s i) z^{'} + u (t, b, i), for b \in B .$

In particular, defining $Z : = Z^{s} - Z^{i}$ , we should have

\begin{matrix} U (- χ) & = V_{0}^{A} (α, χ) - \int_{0}^{T} H (t, S_{t}, I_{t}, Z_{t}^{s}, Z_{t}^{i}, α_{t}) d t + \int_{0}^{T} Z_{t}^{s} d S_{t} + \int_{0}^{T} Z_{t}^{i} d I_{t} \\ = V_{0}^{A} (α, χ) - \int_{0}^{T} ((μ + ν + γ + ρ) I_{t} Z_{t} \\ + sup_{b \in B} {u (t, b, I_{t}) - b \sqrt{α_{t}} S_{t} I_{t} Z_{t}}) d t \\ - \int_{0}^{T} Z_{t} d I_{t} . \end{matrix}

4.10

Given the supremum appearing above, the following assumption will be useful for us.

Assumption 4.5

There exists a unique Borel-measurable map $b^{⋆} : [0, T] \times R_{+}^{⋆} \times R_{+}^{⋆} \times R \times A ⟶ B$ such that

\begin{matrix} b^{⋆} (t, s, i, z, a) \in \underset{b \in B}{argmax} {u (t, b, i) - b \sqrt{a} s i z}, \\ \forall (t, s, i, z, a) \in [0, T] \times {(R_{+}^{⋆})}^{2} \times R \times A . \end{matrix}

4.11

Remark 4.6

We would like to insist on the fact that for the SIR model and in view of Remark 4.3, it is not necessary to consider that the process R is a state variable. Indeed, its value at time t can be deduced from the paths of I until time t. More precisely, following the previous reasoning to find the relevant form of contracts, one could consider

\begin{matrix} d v (t, S_{t}, I_{t}) = - \tilde{H} (S_{t}, I_{t}, R_{t}, Z_{t}^{s}, Z_{t}^{i}, α_{t}) d t + Z_{t}^{s} d S_{t} + Z_{t}^{i} d I_{t} + Z_{t}^{r} d R_{t}, \end{matrix}

where, in this case, the population’s Hamiltonian $\tilde{H} : [0, T] \times {(R_{+}^{⋆})}^{2} \times R^{2} \times A$ is defined by

\begin{matrix} \tilde{H} (t, s, i, r, z, z^{'}, \tilde{z}, a) : = sup_{b \in B} {h (t, s, i, z, z^{'}, a, b)} \\ + (ρ i - μ r) \tilde{z}, for any (t, s, i, z, z^{'}, \tilde{z}, a) \in [0, T] \times {(R_{+}^{⋆})}^{2} \times R^{3} \times A . \end{matrix}

Since the dynamics of R is uncontrolled, a simplification occurs between the part of the Hamiltonian $(ρ i - μ r) \tilde{z}$ and the integral w.r.t. $d R$ , which leads to the same form for the utility function as mentioned in Eq. (4.10).

The general analysis

For any $(α, m) \in A \times N^{⋆}$ , we define $S^{m} (P^{α})$ and $H^{m} (P^{α})$ as respectively the sets of $R$ -valued, $F^{P^{α} +}$ -adapted continuous processes Y s.t. ${‖ Y ‖}_{S^{m} (P^{α})} < \infty$ , and the set of $F^{P^{α}}$ -predictable, $R$ -valued processes Z with ${‖ Z ‖}_{H^{m} (P^{α})} < \infty$ , where

\begin{matrix} {‖ Y ‖}_{S^{m} (P^{α})}^{m} : = & E^{P^{α}} [sup_{t \in [0, T]} | Y_{t} |^{m}], {‖ Z ‖}_{H^{m} (P^{α})}^{m} \\ : = & E^{P^{α}} [(\int_{0}^{T} | {\hat{σ}}_{s} Z_{s} |^{2} d s)^{m / 2}], (Y, Z) \in S^{m} (P^{α}) \times H^{m} (P^{α}) . \end{matrix}

Theorem 4.7

Let $(α, χ) \in Ξ$ . There exists a unique $F_{0}^{P^{α} +}$ -measurable $Y_{0}$ and a unique $Z \in H^{p} (P^{α})$ such that

\begin{matrix} U (- χ) & = Y_{0} - \int_{0}^{T} (Z_{t} (μ + ν + γ + ρ) I_{t} + u (t, β_{t}^{⋆}, I_{t}) - β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t \\ - \int_{0}^{T} Z_{t} d I_{t}, P^{α} --a.s., \end{matrix}

4.12

with $β_{t}^{⋆} : = b^{⋆} (t, S_{t}, I_{t}, Z_{t}, α_{t})$ for all $t \in [0, T]$ . Moreover, $B^{⋆} (α, χ) = {β^{⋆}}$ and $V_{0}^{A} (α, χ) = E^{P^{α}} [Y_{0}]$ .

Proof

Fix $(α, χ) \in Ξ$ as in the statement of the theorem. Let us consider the solution (Y, Z) of the following BSDE

\begin{matrix} Y_{t} = & U (- χ) + \int_{t}^{T} sup_{b \in B} {u (r, b, I_{r}) - Z_{r} b \sqrt{α_{r}} S_{r} I_{r}} d r \\ - \int_{t}^{T} Z_{r} σ α_{r} S_{r} I_{r} d W_{r}, t \in [0, T] . \end{matrix}

4.13

Since $χ \in C$ , u is continuous, I and S are bounded, and B is a compact set, it is immediate this BSDE is well-posed and admits a unique solution $(Y, Z) \in S^{p} (P^{α}) \times H^{p} (P^{α})$ (in a more general context, one may refer for instance to Bouchard et al. 2018, Theorem 4.1). Therefore, using the dynamic of I under $P^{α}$ , given by Eq. (4.2), as well as the definition of $β^{⋆}$ , and letting $t = 0$ , we obtain that (4.12) is satisfied. Next, using this representation for $U (χ)$ in the population’s criteria defined in Eq. (4.6), we notice that, for any $β \in B$ ,

\begin{matrix} J_{0}^{A} (α, χ, β) & = E^{P^{α, β}} [Y_{0} + \int_{0}^{T} (u (t, β_{t}, I_{t}) - Z_{t} (μ + ν + γ + ρ) I_{t} - u (t, β_{t}^{⋆}, I_{t}) \\ + β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t - \int_{0}^{T} Z_{t} d I_{t}] \\ = E^{P^{α}} [Y_{0}] + sup_{β \in B} E^{P^{α, β}} [\int_{0}^{T} (u (t, β_{t}, I_{t}) - β_{t} S_{t} I_{t} Z_{t} - u (t, β_{t}^{⋆}, I_{t}) \\ + β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t] \leq E^{P^{α}} [Y_{0}], \end{matrix}

where we used the fact that that $Z \in H^{p} (P^{α})$ , and that $E_{\cdot}^{β} : = exp (- \int_{0}^{\cdot} \frac{β_{s}}{σ \sqrt{α_{s}}} d W_{s} - \frac{1}{2} \int_{0}^{\cdot} \frac{β_{s}^{2}}{σ^{2} α_{s}} d s),$ is continuous, and both an $(F^{P^{α}}, P^{α})$ - and an $(F^{P^{α} +}, P^{α})$ -martingale (see Neufeld and Nutz 2014, Proposition 2.2), so that for any $β \in B$

\begin{matrix} E^{P^{α, β}} [Y_{0}] & = E^{P^{α}} [E_{T}^{β} Y_{0}] = E^{P^{α}} [E_{0}^{β} Y_{0}] = E^{P^{α}} [Y_{0}] . \end{matrix}

The previous inequality implies that $V_{0}^{A} (α, χ) \leq E^{P^{α}} [Y_{0}] .$ Moreover, thanks to Assumption 4.5, equality is achieved if and only if we choose the control $β^{⋆}$ . This shows that $V_{0}^{A} (α, χ) = E^{P^{α}} [Y_{0}], and B^{⋆} (α, χ) = {β^{⋆}} .$ $□$

In the previous result, the fact that Eq. (4.12) holds with an $F_{0}^{P^{α} +}$ -measurable random variable and not a constant is somewhat annoying. The next lemma shows that we can actually have the representation with a constant without loss of generality.

Lemma 4.8

Let $α \in A$ , and fix an $F_{0}^{P^{α} +}$ -measurable random variable $Y_{0}$ and some $Z \in H^{p} (P^{α})$ . Define the following contracts

\begin{matrix} χ & : = - U^{(- 1)} (Y_{0} - \int_{0}^{T} (Z_{t} (μ + ν + γ + ρ) I_{t} + u (t, β_{t}^{⋆}, I_{t}) \\ - β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t - \int_{0}^{T} Z_{t} d I_{t}), \\ χ^{'} & : = - U^{(- 1)} (E^{P^{α}} [Y_{0}] - \int_{0}^{T} (Z_{t} (μ + ν + γ + ρ) I_{t} + u (t, β_{t}^{⋆}, I_{t}) \\ - β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t - \int_{0}^{T} Z_{t} d I_{t}) . \end{matrix}

Then $V_{0}^{A} (α, χ) = V_{0}^{A} (α, χ^{'}) = E^{P^{α}} [Y_{0}], B^{⋆} (α, χ) = B^{⋆} (α, χ^{'}) = {β^{⋆}} .$

Proof

The equalities for $(α, χ)$ are immediate from Theorem 4.7. For $(α, χ^{'})$ , we have, using the fact that $Z \in H^{p} (P^{α})$ , and thus $Z \in H^{q} (P^{α, β})$ for any $β \in B$ and any $q \in (1, p)$

\begin{matrix} V_{0}^{A} (α, χ^{'}) & = sup_{β \in B} E^{P^{α, β}} [E^{P^{α}} [Y_{0}] + \int_{0}^{T} (u (t, β_{t}, I_{t}) \\ - Z_{t} (μ + ν + γ + ρ) I_{t} - u (t, β_{t}^{⋆}, I_{t}) + β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t - \int_{0}^{T} Z_{t} d I_{t}] \\ = E^{P^{α}} [Y_{0}] + sup_{β \in B} E^{P^{α, β}} [\int_{0}^{T} (u (t, β_{t}, I_{t}) \\ - β_{t} \sqrt{α_{t}} S_{t} I_{t} Z_{t} - u (t, β_{t}^{⋆}, I_{t}) + β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t] \leq E^{P^{α}} [Y_{0}] . \end{matrix}

Since the equality is attained if and only if we choose $β = β^{⋆}$ , this ends the proof. $□$

Characterisation of the class of admissible contracts

We introduce the class $\bar{Ξ}$ of contracts defined by all pairs $(α, χ^{y_{0}, Z})$ with $α \in A$ and $χ^{y_{0}, Z} : = - U^{(- 1)} (Y_{T}^{y_{0}, Z})$ , where $Y^{y_{0}, Z}$ is a process given, $P^{α} --a.s.$ , for all $t \in [0, T]$ by

\begin{matrix} Y_{t}^{y_{0}, Z} = & y_{0} - \int_{0}^{t} (Z_{r} (μ + ν + γ + ρ) I_{r} + u (t, b^{⋆} (r, S_{r}, I_{r}, Z_{r}, α_{r}), I_{r}) \\ - b^{⋆} (r, S_{r}, I_{r}, Z_{r}, α_{r}) \sqrt{α_{r}} S_{r} I_{r} Z_{r}) d r - \int_{0}^{t} Z_{r} d I_{r}, \end{matrix}

with $Z \in H^{p} (P^{α})$ and $y_{0} \in [\underline{v}, \infty)$ . We also denote for simplicity $P^{⋆, α, Z} : = P^{α, b^{⋆} (S_{\cdot}, I_{\cdot}, Z_{\cdot})}$ .

Lemma 4.9

The problem of the government given by (4.9) can be rewritten

\begin{matrix} V_{0}^{P} = sup_{(α, Z) \in A \times H^{p} (P^{α})} E^{P^{⋆, α, Z}} [- U^{(- 1)} (Y_{T}^{\underline{v}, Z}) - \int_{0}^{T} (c (I_{s}) + k (s, α_{s}, S_{s}, I_{s})) d s] . \end{matrix}

4.14

Proof

From Theorem 4.7 and Lemma 4.8, we know that $Ξ \subset \bar{Ξ}$ . To prove the reverse inclusion, let us now consider a pair $(α, χ^{y_{0}, Z}) \in \bar{Ξ}$ . In fact, to show that $\bar{Ξ} \subset Ξ$ (and thus that $Ξ = \bar{Ξ}$ ), we simply need to ensure that $χ^{y_{0}, Z}$ satisfies the integrability condition (4.8). Using the fact that u is continuous, B is compact, $α$ is bounded below by $ε$ , and S and I are bounded, we have that there exists a constant $C > 0$ , which may change value from line to line, such that

\begin{matrix} E^{P^{α}} [| U (- χ^{y_{0}, Z}) |^{p}] & \leq C (1 + E^{P^{α}} [(\int_{0}^{T} | S_{r} I_{r} Z_{r} | d r)^{p} + | \int_{0}^{T} {\hat{σ}}_{r} Z_{r} d W_{r} |^{p}]) \\ \leq C (1 + E^{P^{α}} [(\int_{0}^{T} σ α_{r} | S_{r} I_{r} Z_{r} | d r)^{p}] + {‖ Z ‖}_{H^{p} (P^{α})}^{p}) \\ \leq C (1 + {‖ Z ‖}_{H^{p} (P^{α})}^{p}) < \infty, \end{matrix}

where we used Burkholder–Davis–Gundy’s inequality and Cauchy–Schwarz’s inequality, implying that (4.8) holds.

Next, we use Lemma 4.8 to realise that $B^{⋆} (α, χ^{y_{0}, Z}) = {b^{⋆} (\cdot, S_{\cdot}, I_{\cdot}, Z_{\cdot}, α_{\cdot})}$ , and $V_{0}^{A} (α, χ^{y_{0}, Z}) = y_{0}$ , which implies

\begin{matrix} V_{0}^{P} = sup_{y_{0} \geq \underline{v}} sup_{(α, Z) \in A \times H^{p} (P^{α})} E^{P^{⋆, α, Z}} [- U^{(- 1)} (Y_{T}^{y_{0}, Z}) - \int_{0}^{T} (c (I_{s}) + k (s, α_{s}, S_{s}, I_{s})) d s] . \end{matrix}

To conclude, it is enough to notice that the following map is non-increasing

\begin{matrix} [\underline{v}, \infty) ∋ y_{0} ⟼ E^{P^{⋆, α, Z}} [- U^{(- 1)} (Y_{T}^{y_{0}, Z}) - \int_{0}^{T} (c (I_{s}) + k (s, α_{s}, S_{s}, I_{s})) d s] \in R . \end{matrix}

$□$

Optimal tax and test policies under moral hazard for epidemic models

Weak formulation for the government’s problem

Lemma 4.9 states that the problem of the government can be can be reduced to a more standard stochastic control problem. However, in the current formulation, one of the three state variables, namely Y, is considered in the strong formulation, while the other state variables S and I are considered in weak formulation. Indeed, the variable Y is indexed by the control Z, while the control $(α, Z)$ only impacts the distribution of S and I through $P^{⋆, α, Z}$ . As highlighted by Cvitanić and Zhang (2012, Remark 5.1.3), it makes little sense to consider a control problem of this form directly. Therefore, contrary to what is usually done in principal–agent problems (see, e.g., Cvitanić et al. 2018), we decided to adopt the weak formulation to rigorously write the problem of the principal, since this is the formulation which makes sense for the agent’s problem. We will thus formulate it below, for the sake of thoroughness.14

Let $V : = R \times A$ and consider the sets $V$ as we defined $A$ in Sect. 4.1.1. The intuition is that the principal’s problem depends only on time and on the state variable $X = (S, I, Y)$ . Following the same methodology used for the agent’s problem, to properly define the weak formulation of the principal’s problem, we are led to consider the canonical space $Ø m e g a^{P} : = C^{3} \times V$ , with canonical process $(S, I, Y, Λ^{P})$ , where for any $(t, s, ι, y, q) \in [0, T] \times Ø m e g a^{P}$ :

\begin{matrix} S_{t} (s, ι, y, q) : = s (t), I_{t} (s, ι, y, q) : = ι (t), Y_{t} (s, ι, y, q) : = y (t), Λ^{P} (s, ι, y, q) : = q . \end{matrix}

We let $G$ be the Borel $σ$ -algebra on $Ø m e g a^{P}$ , and $G : = {(G_{T})}_{t \in [0, T]}$ the natural filtration of $(S, I, Y, Λ^{P})$ , defined in the same way as $F$ in the previous canonical space $Ø m e g a$ (see Sect. 4.1). Let then $M^{P}$ be the set of probability measures on $(Ø m e g a^{P}, G_{T})$ . For any $P \in M^{P}$ , we can define $G^{P}$ the $P$ -augmentation of $G$ , its right limit $G^{P +}$ , as well as $F^{Π} : = {(F_{t}^{Π})}_{t \in [0, T]}$ the $Π$ -universal completion of $F$ for any subset $Π \subset M^{P}$ .

The drift and volatility functions for the process X are now defined for any $(t, s, i, z, a) \in [0, T] \times {(R_{+}^{⋆})}^{2} \times V$

\begin{matrix} B^{P} (t, s, i, z, a) : = (\begin{matrix} λ - μ s + ν i - b^{⋆} (t, s, i, z, a) \sqrt{a} s i \\ - (μ + ν + γ + ρ) i + b^{⋆} (t, s, i, z, a) \sqrt{a} s i \\ - u^{⋆} (t, s, i, z, a) \end{matrix}), \\ Σ^{P} (s, i, z, a) : = σ a s i (\begin{matrix} 1 \\ - 1 \\ z \end{matrix}), \end{matrix}

4.15

where $u^{⋆} (t, s, i, z, a) : = u (t, b^{⋆} (t, s, i, z, a), i)$ , for all $(t, s, i, z) \in [0, T] \times {(R_{+}^{⋆})}^{2} \times R$ . For any $(t, φ^{P}) \in [0, T] \times C_{b}^{2} (R^{3}, R)$ , we define

\begin{matrix} M_{t}^{P} (φ^{P}) & : = φ^{P} (X_{t}) - \iint_{[0, t] \times V} (B^{P} (r, S_{r}, I_{r}, v) \cdot \nabla φ^{P} (X_{r}) \\ + \frac{1}{2} Tr [D^{2} φ^{P} (X_{r}) (Σ^{P} {(Σ^{P})}^{⊤}) (r, S_{r}, I_{r}, v)]) Λ^{P} (d r, d v) . \end{matrix}

In the spirit of Definition 4.1 for $P \subset M$ , we define the subset $Q \subset M^{P}$ as the one consisting of all $P \in M^{P}$ such that

(i)
$M^{P} (φ^{P})$ is a $(G, P)$ –local martingale on [0, T] for all $φ^{P} \in C_{b}^{2} (R^{3}, R)$ ;
(ii)
$P [X_{0} = x_{0}] = 1$ , where $x_{0} : = (s_{0}, i_{0}, \underline{v})$ ;
(iii)
with $P$ -probability 1, the canonical process $Λ^{P}$ is of the form $δ_{ϕ_{\cdot}} (d v)$ for some Borel-measurable function $ϕ : [0, T] ⟼ V$ .

Still following the line of Sect. 4.1, we know that for any $P \in Q$ , we can define a $(G^{Q}, P)$ –Brownian motion $W^{P}$ . We then denote by $V_{o} (P)$ the set of $G$ -predictable and V-valued process $(Z, α)$ such that, $P$ –a.s. and for all $t \in [0, T]$ ,

\begin{matrix} \{\begin{matrix} S_{t} = s_{0} + \int_{0}^{t} (λ - μ S_{r} + ν I_{r} - b^{⋆} (r, S_{r}, I_{r}, Z_{r}, α_{r}) \sqrt{α_{r}} S_{r} I_{r}) d r \\ + \int_{0}^{t} σ α_{r} S_{r} I_{r} d W_{r}^{P}, \\ I_{t} = i_{0} - \int_{0}^{t} ((μ + ν + γ + ρ) I_{r} - b^{⋆} (r, S_{r}, I_{r}, Z_{r}, α_{r}) \sqrt{α_{r}} S_{r} I_{r}) d r \\ - \int_{0}^{t} σ α_{r} S_{r} I_{r} d W_{r}^{P}, \\ Y_{t} = \underline{v} - \int_{0}^{t} u^{⋆} (r, S_{r}, I_{r}, Z_{r}, α_{r}) d r + \int_{0}^{t} Z_{r} σ α_{r} S_{r} I_{r} d W_{r}^{P} . \end{matrix}) \end{matrix}

4.16

Solving the government’s problem

Thank to the analysis conducted in the previous subsection, the problem of the government given by (4.9) can now be written rigorously in weak formulation

\begin{matrix} V_{0}^{P} = sup_{P \in Q} E^{P} [- U^{(- 1)} (Y_{T}) - \int_{0}^{T} (c (I_{s}) + k (s, α_{s}, S_{s}, I_{s})) d s] . \end{matrix}

4.17

We then define the Hamiltonian of the government, for all $t \in [0, T]$ , $x : = (s, i, y) \in R^{3}$ and $(p, M) \in R^{3} \times S^{3}$ , by

\begin{matrix} H^{P} (t, x, p, M) & : = sup_{(z, a) \in V} {B^{P} (t, s, i, z, a) \cdot p + \frac{1}{2} Tr [M (Σ^{P} {(Σ^{P})}^{⊤}) (t, s, i, z, a)] \\ - k (t, a, s, i)} - c (i), \end{matrix}

4.18

where $S^{3}$ represents the set of $3 \times 3$ symmetric positive matrices with real entries. More explicitly, the Hamiltonian can be written as follows, with $f (z, M) : = M_{11} - 2 M_{12} + M_{22} - 2 z (M_{23} - M_{13}) + z^{2} M_{33}$ for all $(z, M) \in R \times S^{3}$ :

\begin{matrix} H^{P} (t, x, p, M) & = sup_{z \in R, a \in A} {b^{⋆} (t, s, i, z, a) \sqrt{a} s i (p_{2} - p_{1}) - u^{⋆} (t, s, i, z, a) p_{3} \\ + \frac{1}{2} σ^{2} a^{2} {(s i)}^{2} f (z, M) - k (t, a, s, i)} \\ + (λ - μ s + ν i) p_{1} - (μ + ν + γ + ρ) i p_{2} - c (i) . \end{matrix}

We are then led to consider the following HJB equation, for all $t \in [0, T]$ and $x = (s, i, y) \in R^{3}$ :

\begin{matrix} - \partial_{t} v (t, x) - H^{P} (t, x, \nabla_{x} v, D_{x}^{2} v) = 0, (t, x) \in O, \end{matrix}

4.19

with terminal condition $v (T, x) : = - U^{(- 1)} (y)$ , and where, recalling that F is defined by (4.4), the natural domain over which the above PDE must be solved is15 $O : = {(t, s, i, y) \in [0, T) \times R_{+}^{2} \times R : 0 < s + i < F (t, s_{0}, i_{0})} .$

Remark 4.10

Standard arguments from viscosity solution theory allow to prove that $V_{0}^{P} = v^{P} (0, x_{0})$ (recalling that $x_{0} = (s_{0}, i_{0}, \underline{v}))$ where $v^{P}$ should be understood as the unique viscosity solution, in an appropriate class of functions, of the PDE (4.19). Obtaining further regularity results is by far more challenging. Indeed, it is a second-order, fully non-linear, parabolic PDE, which is clearly not uniformly elliptic, the corresponding diffusion matrix being degenerate. This makes the question of proving the existence of an optimal contract a very complicated one, which is clearly outside the scope of our study. As a sanity check though, we recall that $ε$ -optimal contracts always exist, and can be indeed approximated numerically. See for instance Kharroubi et al. (2020) for an explicit construction of such $ε$ -optimal contracts in a particular case dealing with the stochastic logistic equation.

Comparison with the first-best case

As already mentioned, the first-best case corresponds to the case where the government can enforce whichever interaction rate $β \in B$ it desires (in addition to a contract $(α, χ) \in A \times C$ ), and simply has to satisfy the participation constraint of the population. In order to find the optimal interaction rate in this scenario, as well as the optimal contract, one has to solve the government’s problem defined by (2.8).

The simplest way to take into account the inequality constraint in the definition of $V_{0}^{P, FB}$ is to introduce the associated Lagrangian. By strong duality, we then have

\begin{matrix} V_{0}^{P, FB} = & inf_{ϖ > 0} sup_{(α, χ, β) \in A \times C \times B} {E^{P^{α, β}} [χ - \int_{0}^{T} (c (I_{t}) + k (t, α_{t}, S_{t}, I_{t})) d t] \\ + ϖ (E^{P^{α, β}} [\int_{0}^{T} u (t, β_{t}, I_{t}) d t + U (- χ)] - \underline{v})} . \end{matrix}

First, by concavity of U, it is immediate that for any given Lagrange multiplier $ϖ > 0$ , the optimal tax is constant. Then, using the definition of ${\bar{V}}_{0} (ϖ)$ for any $ϖ > 0$ in (2.12), we have:

\begin{matrix} V_{0}^{P, FB} = inf_{ϖ > 0} {χ^{⋆} (ϖ) + ϖ (U (- χ^{⋆} (ϖ)) - \underline{v}) + {\bar{V}}_{0} (ϖ)} . \end{matrix}

Note that ${\bar{V}}_{0} (ϖ)$ is the value function of a standard stochastic control problem. Therefore, we expect to have ${\bar{V}}_{0} (ϖ) = v^{ϖ} (0, s_{0}, i_{0})$ , where the function $v^{ϖ} : [0, T] \times R_{+}^{2} ⟶ R$ solves the following HJB PDE

\begin{matrix} \{\begin{matrix} - \partial_{t} v^{ϖ} (t, s, i) + c (i) - (λ - μ s + ν i) \partial_{s} v^{ϖ} + (μ + ν + γ + ρ) i \partial_{i} v^{ϖ} \\ - H^{ϖ} (t, s, i, \partial v^{ϖ}, D^{2} v^{ϖ}) = 0, (t, s, i) \in D, \\ v^{ϖ} (T, s, i) = 0, (s, i) \in D_{T}, \end{matrix}) \end{matrix}

where the Hamiltonian is defined, for $t \in [0, T]$ , $(s, i) \in {(R_{+}^{⋆})}^{2}$ , $p : = (p_{1}, p_{2}) \in R^{2}$ and $M \in S^{2}$ by

\begin{matrix} H^{ϖ} (t, s, i, p, M) & : = sup_{a \in A} {sup_{b \in B} {ϖ u (t, b, i) - b s i \sqrt{a} (p_{1} - p_{2})} - k (t, a, s, i) \\ + \frac{1}{2} σ^{2} {(s i)}^{2} a^{2} (M_{11} - 2 M_{12} + M_{22})} . \end{matrix}

Note that if we consider separable utilities with the forms in Sect. 2.2.3, the optimal interaction rate is given, for a testing policy $α \in A$ and a Lagrange multiplier $ϖ > 0$ , by $β_{t}^{ϖ} = b^{ϖ} (S_{t}, I_{t}, \partial v^{ϖ} (t, S_{t}, I_{t}), α_{t})$ for all $t \in [0, T]$ , where

\begin{matrix} b^{ϖ} (s, i, p, a) : = b^{\circ} (s, i, \sqrt{a} (p_{1} - p_{2}) / ϖ), for all (s, i, p, a) \in {(R_{+}^{⋆})}^{2} \times R^{2} \times A . \end{matrix}

Extensions and generalisations

Diseases with latency periods: SEIS, SEIR

The reasoning developed in this paper can be extended in a straightforward way to consider SEIR and SEIS compartment models. These models are used to describe epidemics in which individuals are not directly contagious after contracting the disease, as for the COVID-19 epidemic (see, e.g., Dolbeault and Turinici 2020), and thus involve a fourth class representing the ‘Exposed’, i.e., individuals who have contracted the disease but are not yet infectious. The constant rate at which an exposed person becomes infectious is denoted by $ι \in R_{+}$ . The difference between SEIS and SEIR models is embedded into the immunity toward the disease: for SEIR, it is assumed that the immunity is permanent (as in a SIR), whereas for SEIS, infected individual come back in the susceptible class at rate $ν \geq 0$ , similarly to SIS models. We can also take into account the demographic dynamics of the population, through the parameters $λ$ , $μ$ and $γ$ . Similarly to the previous models, we consider that the dynamic of the epidemic is subject to a noise in the estimation of the proportion of susceptible and infected individuals. Inspired by the stochastic model in Mummert and Otunuga (2019, Eq. (3)), we can consider that the dynamics of the epidemic is given by:

\begin{matrix} \{\begin{matrix} S_{t} = s_{0} + \int_{0}^{t} (λ - μ S_{s} - β_{s} \sqrt{α_{s}} S_{s} I_{s} + ν I_{s}) d s \\ + \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}, \\ E_{t} = e_{0} - \int_{0}^{t} ((μ + ι) E_{s} - β_{s} \sqrt{α_{s}} S_{s} I_{s}) d s \\ - \int_{0}^{t} σ α_{s} S_{s} I_{s} d W_{s}, \\ I_{t} = i_{0} - \int_{0}^{t} ((μ + ν + γ + ρ) I_{s} - ι E_{s}) d s, \\ R_{t} = r_{0} + \int_{0}^{t} (ρ I_{s} - μ R_{s}) d s, \end{matrix}) for t \in [0, T], \end{matrix}

5.1

Note that the proportion I of infected and infectious is uncertain, but only through its dependence on E and the proportion R of recovery is uncertain only through its dependence on I. More precisely, we assume that there is no uncertainty on both the recovery rate $ρ$ , the rate $ι$ at which infected people becomes infectious and the (potentially) rate $ν$ at which an individual loses immunity, implying that if the proportion of exposed individual is perfectly known, the proportion of infected is also known without uncertainty and consequently the proportion of recovery is also certainly known. Again this modelling choice is consistent with most stochastic SEIRS models, and emphasises that the major uncertainty in the current epidemic is related to the non-negligible proportion of (nearly) asymptomatic individuals. Indeed, an asymptomatic individual may be misclassified as susceptible or exposed.

We will now give, informally, the optimisation problems faced by both the population and the government. The most important change compared to SIS/SIR models is that the criteria should now depend on the sum $E + I$ , representing the proportion of the population having contracted the disease, rather than just the proportion I of infectious people. For example, we can consider the following form for the population’s problem:

\begin{matrix} V_{0}^{A} (α, χ) : = sup_{β \in B} E [\int_{0}^{T} u (t, β_{t}, E_{t} + I_{t}) d t + U (- χ)], \end{matrix}

while that of the government could become

\begin{matrix} V_{0}^{P} : = sup_{(α, χ) \in Ξ} sup_{β \in B^{⋆} (α, χ)} E [χ - \int_{0}^{T} (c (E_{t} + I_{t}) + k (t, α_{t}, S_{t}, I_{t})) d t] . \end{matrix}

A slight adaption of our earlier arguments will show that admissible taxes take the form $χ : = - U^{(- 1)} (Y_{T})$ with

\begin{matrix} Y_{t} : = Y_{0} - \int_{0}^{T} (Z_{t} (μ + ι) E_{t} + u (t, β_{t}^{⋆}, E_{t} + I_{t}) - β_{t}^{⋆} \sqrt{α_{t}} S_{t} I_{t} Z_{t}) d t - \int_{0}^{T} Z_{t} d E_{t}, \end{matrix}

where $β^{⋆}$ is the population’s optimal contact rate, under the assumption it exists. It thus remain to solve the government’s problem, but unlike in the previous SIS/SIR models, there are now four state variables, namely (S, E, I, Y). However, solving it numerically is really more challenging since it increases the dimension of the problem. A numerical investigation seems to be complicated as far as we now, and we left these numerical issues for future research.

Beyond SEIS/SEIR models: a theoretically tractable method

There are of course plethora of generalisations of the models we have considered so far. For instance, in SEIRS (or also SIRS) models, the immunity is temporary, i.e. people in the class $R$ may come back into the class $S$ at rate $ν$ . Using a similar stochastic extension of this model, it is straightforward that all our results extend, mutatis mutandis, to this case as well, albeit with one important difference: the control problem faced by the government now has 5 states variables, namely (S, E, I, R, Y). Even more generally, our approach can readily be adapted to compartmental models considering additional classes: for instance the SIDARTHE (‘Susceptible’ (S), ‘Infected’ (I), ‘Diagnosed’ (D), ‘Ailing’ (A), ‘Recognised’ (R), ‘Threatened’ (T), ‘Healed’ (H) and ‘Extinct’ (E)) model investigated in Giordano et al. (2020) for COVID-19. Of course the price to pay is that the number of state variables in the government’s problem will increase with the number of compartments, and numerical procedures to solve the HJB equation will become more delicate to implement, and could be based on neural networks.

Footnotes

Several countries worldwide have decided to use contact-tracing tools, such as mobile phone apps, designed to help tracking down subsequent exposures after an infected individual is identified, see for instance Cho et al. (2020), or Reichert et al. (2020). Using these would in principle erase any possibility or moral hazard, provided that all the population uses the app, and that testing is organised on a massive scale. Even admitting that this would be the case, it remains that these tools have raised complex issues of privacy, see Ienca and Vayena (2020) or Park et al. (2020). In any case, the incentive-based approach we propose can always be considered as a useful complement to any other adopted strategy.

See among others Aïd et al. (2018), El Euch et al. (2021), Cvitanić and Xing (2018), Élie et al. (2019), Élie et al. (2021), Kharroubi et al. (2020).

It should be noted that if the length of the epidemic is relatively short in relation to the life expectancy at birth, the demographic dynamics become less relevant and may be dismissed altogether, by setting $λ = μ = 0$ . Nevertheless, for the sake of generality, we choose to take these dynamics into account, in order to allow for a straightforward application of our study to other types of epidemics.

⁴

We refer to Sect. 4.1.2 for a more precise definition of the set $B$ , taking into account the information flow in the model.

⁵

The lower bound $ε$ is here to insist on the fact that it is not possible, or prohibitively expensive, to cancel completely the uncertainty linked to the disease’s dynamics, by taking $α$ to be 0.

⁶

We refer to Sect. 4.1 for the rigorous definition of the set $A$ .

⁷

See Sect. 4.1.3 for a rigorous definition of the set $C$ of admissible fine policies.

⁸

Once again, the reader is referred to Sect. 4.1.3, and more precisely to Eq. (4.7) for a rigorous definition of $B^{⋆}$ .

⁹

From the population’s point of view, this cost should not actually be expressed in terms of money, but mainly corresponds to medical side effects or general morbidity. We refer to Anand and Hanson (1997), Zeckhauser and Shepard (1976) and Sassi (2006), for an introduction to QALY/DALY (Quality- and Disability-Adjusted Life-Year), the generic measures of disease burden used in economic evaluation to assess the value of medical interventions.

¹⁰

See Corless et al. (1996) for more details on the LambertW function.

¹¹

If we consider separable utilities, as in Sect. 2.2.3, the maximiser $b^{⋆}$ is given for all $(t, s, i, z, a) \in [0, T] \times {(R_{+}^{⋆})}^{2} \times R \times A$ by $b^{⋆} (s, i, z, a) : = b^{\circ} (s, i, z \sqrt{a})$ , recalling that $b^{\circ}$ is defined by (2.11).

¹²

Notice that the initial value of $r_{0}$ of R, which appears in the SIR version of the model, is irrelevant at this stage.

¹³

More precisely, one should first use the result of Stroock and Varadhan (1997, Theorem 4.5.2) to obtain that on an enlargement of

(Ø m e g a, F_{T})

, there is for any

P \in P

, a Brownian motion

W^{P}

, and an

F

-predictable process, A-valued process

α^{P}

such that

\begin{matrix} S_{t} = s_{0} + \int_{0}^{t} (λ - μ S_{s} + ν I_{s}) d s + \int_{0}^{t} σ α_{s}^{P} S_{s} I_{s} d W_{s}^{P}, t \in [0, T], P --a.s. \end{matrix}

The result for W is then immediate. Notice in addition that since W is defined as a stochastic integral, it should also depend on explicitly on

P

. We can however use Nutz (2012, Theorem 2.2) to define W universally, as an

F^{P}

-adapted and continuous process. This requires some set-theoretic assumptions which we implicitly consider here, see Possamaï et al. (2018, Footnote 7) for details.

¹⁴

Notice that at the end of the day, this is not really an issue. Indeed, provided that the problem has enough regularity (typically some semi-continuity of the terminal and running reward with respect to state), one can expect the strong and weak formulations to coincide. See for instance El Karoui and Tan (2013, Theorem 4.5).

¹⁵

The boundary of the domain cannot be reached by the processes S and I, which is why it not necessary to specify a boundary condition. Notice though that the upper bound can formally only be attained when I is constantly 0, in which case S becomes deterministic, and the government best choice for $α$ is clearly 1, and its choice of Z becomes irrelevant. In such a situation, we would immediately have $V_{0}^{P} = \underline{v}$ .

The authors acknowledge the supports of the ANR projects PACMAN ANR-16-CE05-0027 and ReLISCoP ANR-21-CE40-0001.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Emma Hubert, Email: eh3988@princeton.edu.

Thibaut Mastrolia, Email: mastrolia@berkeley.edu.

Dylan Possamaï, Email: dylan.possamai@math.ethz.ch.

Xavier Warin, Email: xavier.warin@edf.fr, http://www.fime-lab.org.

References

Abakuks A. An optimal isolation policy for an epidemic. J Appl Probab. 1973;10(2):247–262. doi: 10.2307/3212343. [DOI] [Google Scholar]
Abbey H. An examination of the Reed–Frost theory of epidemics. Hum Biol. 1952;24(3):201–233. [PubMed] [Google Scholar]
Aïd R, Possamaï D, Touzi N (2018) Optimal electricity demand response contracting with responsiveness incentives. Math Oper Res. To appear
Allen LJS (2008) An introduction to stochastic epidemic models. In: Brauer F, van den Driessche P, Wu J (eds) Mathematical epidemiology, volume 1945 of Lecture notes in mathematics. Springer, Berlin, pp 81–130
Anand S, Hanson K. Disability-adjusted life years: a critical review. J Health Econ. 1997;16(6):685–702. doi: 10.1016/S0167-6296(97)00005-2. [DOI] [PubMed] [Google Scholar]
Anderson RM, May RM. Population biology of infectious diseases: part I. Nature. 1979;280(5721):361–367. doi: 10.1038/280361a0. [DOI] [PubMed] [Google Scholar]
Anderson RM, Heesterbeek H, Klinkenberg D, Hollingsworth TD. How will country-based mitigation measures influence the course of the COVID-19 epidemic? The Lancet. 2020;395(10228):931–934. doi: 10.1016/S0140-6736(20)30567-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
Aurell A, Carmona R, Dayanikli G, Laurière M (2020) Optimal incentives to mitigate epidemics: a Stackelberg mean field game approach. arXiv:2011.03105
Bailey NTJ. The mathematical theory of infectious diseases and its applications. 2. London: Charles Griffin & Company; 1975. [Google Scholar]
Bartlett MS. Some evolutionary stochastic processes. J R Stat Soc Ser B (Methodol) 1949;11(2):211–229. [Google Scholar]
Bayraktar E, Cohen A, Nellis A. A macroeconomic SIR model for COVID- $19$ . Mathematics. 2021;9(16):1901. doi: 10.3390/math9161901. [DOI] [Google Scholar]
Behncke H. Optimal control of deterministic epidemics. Optim Control Appl Methods. 2000;21(6):269–285. doi: 10.1002/oca.678. [DOI] [Google Scholar]
Beretta E, Kolmanovskii V, Shaikhet L. Stability of epidemic model with time delays influenced by stochastic perturbations. Math Comput Simul. 1998;45(3–4):269–277. doi: 10.1016/S0378-4754(97)00106-7. [DOI] [Google Scholar]
Bernoulli D (1760) Essai d’une nouvelle analyse de la mortalité causée par la petite vérole, et des avantages de l’inoculation pour la prévenir. In Histoire de l’Académie Royale des Sciences. Année $M . DCCLX$ . Avec les mémoires de mathématique & de physique, pour la même année, tirés des registres de cette académie.(Mémoires). Imprimerie Royale, Paris, pp 1–45
Bichteler K. Stochastic integration and $L^{p}$ -theory of semimartingales. Ann Probab. 1981;9(1):49–89. [Google Scholar]
Bolton P, Dewatripont M. Contract theory. Cambridge: MIT Press; 2005. [Google Scholar]
Bouchard B, Possamaï D, Tan X, Zhou C. A unified approach to a priori estimates for supersolutions of BSDEs in general filtrations. Ann l’inst Henri Poincaré, Prob Stat (B) 2018;54(1):154–172. [Google Scholar]
Britton T, Pardoux É (eds) (2019) Stochastic epidemic models with inference, volume 2255 of Lecture. Springer, Cham
Camilli F, Falcone M. An approximation scheme for the optimal control of diffusion processes. ESAIM Math Model Numer Anal. 1995;29(1):97–122. doi: 10.1051/m2an/1995290100971. [DOI] [Google Scholar]
Carmona R, Wang P. Finite-state contract theory with a principal and a field of agents. Manag Sci. 2021;67(8):4643–5300. doi: 10.1287/mnsc.2020.3760. [DOI] [Google Scholar]
Charpentier A, Élie R, Laurière M, Tran VC. COVID-19 pandemic control: balancing detection policy and lockdown intervention under ICU sustainability. Math Model Nat Phenom. 2020;15(57):1–52. [Google Scholar]
Cho H, Ippolito D, Yu YW (2020) Contact tracing mobile apps for COVID-19: privacy considerations and related trade–offs. arXiv:2003.11511
Corless RM, Gonnet GH, Hare DEG, Jeffrey DJ, Knuth DE. On the LambertW function. Adv Comput Math. 1996;5(1):329–359. doi: 10.1007/BF02124750. [DOI] [Google Scholar]
Cvitanić J, Xing H. Asset pricing under optimal contracts. J Econ Theory. 2018;173:142–180. doi: 10.1016/j.jet.2017.10.005. [DOI] [Google Scholar]
Cvitanić J, Zhang J. Contract theory in continuous-time models. Berlin: Springer; 2012. [Google Scholar]
Cvitanić J, Possamaï D, Touzi N. Moral hazard in dynamic risk management. Manag Sci. 2017;63(10):3328–3346. doi: 10.1287/mnsc.2016.2493. [DOI] [Google Scholar]
Cvitanić J, Possamaï D, Touzi N. Dynamic programming approach to principal-agent problems. Finance Stochast. 2018;22(1):1–37. doi: 10.1007/s00780-017-0344-4. [DOI] [Google Scholar]
Dieu NT, Nguyen DH, Du NH, Yin G. Classification of asymptotic behavior in a stochastic SIR model. SIAM J Appl Dyn Syst. 2016;15(2):1062–1084. doi: 10.1137/15M1043315. [DOI] [Google Scholar]
Dolbeault J, Turinici G. Heterogeneous social interactions and the COVID-19 lockdown outcome in a multi-group SEIR model. Math Model Nat Phenom. 2020;15(36):1–18. [Google Scholar]
Du NH, Nhu NN. Permanence and extinction for the stochastic SIR epidemic model. J Differ Equ. 2020;269(11):9619–9652. doi: 10.1016/j.jde.2020.06.049. [DOI] [Google Scholar]
El Euch O, Mastrolia T, Rosenbaum M, Touzi N. Optimal make-take fees for market making regulation. Math Financ. 2021;31(1):109–148. doi: 10.1111/mafi.12295. [DOI] [Google Scholar]
El Karoui N, Tan X (2013) Capacities, measurable selection and dynamic programming part II: application in stochastic control problems. arXiv:1310.3364
Élie R, Mastrolia T, Possamaï D. A tale of a principal and many many agents. Math Oper Res. 2019;44(2):440–467. doi: 10.1287/moor.2018.0931. [DOI] [Google Scholar]
Élie R, Hubert E, Turinici G. Contact rate epidemic control of COVID-19: an equilibrium view. Math Model Nat Phenom. 2020;15(35):1–25. [Google Scholar]
Élie R, Hubert E, Mastrolia T, Possamaï D. Mean-field moral hazard for optimal energy demand response management. Math Financ. 2021;31(1):399–473. doi: 10.1111/mafi.12291. [DOI] [Google Scholar]
Farr W (1840) Second annual report of the registrar-general of births, deaths and marriages in England, chapter Appendix. Longman, Orme, Brown, Green, & Longmans, London, pp 69–98
Ferguson N, Laydon D, Nedjati-Gilani G, Imai N, Ainslie K, Baguelin M, Bhatia S, Boonyasiri A, Cucunubá Z, Cuomo-Dannenburg G, Dighe A, Dorigatti I, Fu H, Gaythorpe K, Green W, Hamlet A, Hinsley W, Okell LC, van Elsland S, Thompson H, Verity R, Volz E, Wang H, Wang Y, Walker PGT, Walters C, Winskill P, Whittaker C, Donnely CA, Riley S, Ghani AC (2020) Report 9: impact of non–pharmaceutical interventions (NPIs) to reduce COVID–19 mortality and healthcare demand. Technical report, Imperial College London
Fowler JH, Hill SJ, Levin R, Obradovich N (2020) The effect of stay-at-home orders on COVID-19 infections in the United States. arXiv:2004.06098
Francis PJ. Optimal tax/subsidy combinations for the flu season. J Econ Dyn Control. 2004;28(10):2037–2054. doi: 10.1016/j.jedc.2003.08.001. [DOI] [Google Scholar]
Gao N, Song Y, Wang X, Liu J. Dynamics of a stochastic SIS epidemic model with nonlinear incidence rates. Adv Differ Equ. 2019;2019(1):41. doi: 10.1186/s13662-019-1980-0. [DOI] [Google Scholar]
Gevret H, Langrené N, Lelong J, Warin X, Maheshwari A (2018) STochastic OPTimization library in C++. HAL preprint arXiv:hal-01361291
Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, Colaneri M. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat Med. 2020;26:855–860. doi: 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gramig BM, Horan RD, Wolf CA (2005) A model of incentive compatibility under moral hazard in livestock disease outbreak response. Technical report, Michigan State University
Gramig BM, Horan RD, Wolf CA. Livestock disease indemnity design when moral hazard is followed by adverse selection. Am J Agric Econ. 2009;91(3):627–641. doi: 10.1111/j.1467-8276.2009.01256.x. [DOI] [Google Scholar]
Gray A, Greenhalgh D, Hu L, Mao X, Pan J. A stochastic differential equation SIS epidemic model. SIAM J Appl Math. 2011;71(3):876–902. doi: 10.1137/10081856X. [DOI] [Google Scholar]
Greenwood PE, Gordillo LF. Stochastic epidemic modeling. In: Chowell G, Hyman JM, Bettencourt LMA, Castillo-Chavez C, editors. Mathematical and statistical estimation approaches in epidemiology. Dordrecht: Springer; 2009. pp. 31–52. [Google Scholar]
Grigorieva E, Khailov E, Korobeinikov A (2020) Optimal quarantine strategies for COVID-19 control models. arXiv:2004.10614 [DOI] [PMC free article] [PubMed]
Hamer WH. The Milroy lectures on epidemic disease in England—the evidence of variability and of persistency of type. The Lancet. 1906;167(4306):655–662. doi: 10.1016/S0140-6736(01)80264-6. [DOI] [Google Scholar]
Hansen E, Day T. Optimal control of epidemics with limited resources. J Math Biol. 2011;62(3):423–451. doi: 10.1007/s00285-010-0341-0. [DOI] [PubMed] [Google Scholar]
Hatchimonji JS, Swendiman RA, Seamon MJ. Trauma does not quarantine: violence during the COVID-19 pandemic. Ann Surg. 2020;272(2):E53–E54. doi: 10.1097/SLA.0000000000003996. [DOI] [PMC free article] [PubMed] [Google Scholar]
Holmström B, Milgrom P. Aggregation and linearity in the provision of intertemporal incentives. Econometrica. 1987;55(2):303–328. doi: 10.2307/1913238. [DOI] [Google Scholar]
Hu K, Ren Z, Touzi N (2019) Continuous-time principal-agent problem in degenerate systems. arXiv:1910.10527
Hubert E, Mastrolia T, Possamaï D, Warin X (2020) Incentives, lockdown, and testing: from Thucydides’s analysis to the COVID-19 pandemic. arXiv:2009.00484 [DOI] [PMC free article] [PubMed]
Ienca M, Vayena E. On the responsible use of digital data to tackle the COVID-19 pandemic. Nat Med. 2020;26(4):463–464. doi: 10.1038/s41591-020-0832-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jiang D, Yu J, Ji C, Shi N. Asymptotic behavior of global positive solution to a stochastic SIR model. Math Comput Model. 2011;54(1–2):221–232. doi: 10.1016/j.mcm.2011.02.004. [DOI] [Google Scholar]
Jowett B (1900) Thucydes translated into English, to which is prefixed an essay on inscriptions and a note on the geography of Thucydides, volume I, 2nd revised edition. Oxford University Press, Oxford
Kantner M. Beyond just “flattening the curve”: optimal control of epidemics with purely non-pharmaceutical interventions. J Math Ind. 2020;10(23):1–23. doi: 10.1186/s13362-020-00091-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kendall DG (1956) Deterministic and stochastic epidemics in closed populations. In: Neyman J. (ed) Proceedings of the third Berkeley symposium on mathematical statistics and probability, volume 4: contributions to biology and problems of health, pp 149–165
Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proc R Soc Lond Ser A. 1927;CXV(772):700–721. [Google Scholar]
Kharroubi I, Lim T, Mastrolia T. Regulation of renewable resource exploitation. SIAM J Control Optim. 2020;58(1):551–579. doi: 10.1137/19M1265740. [DOI] [Google Scholar]
Laffont J-J, Martimort D. The theory of incentives: the principal-agent model. Princeton: Princeton University Press; 2002. [Google Scholar]
Lenhart S, Workman JT. Optimal control applied to biological models. Mathematical and computational biology series. Boca Raton: CRC; 2007. [Google Scholar]
Lesniewski A (2020) Epidemic control via stochastic optimal control. arXiv:2004.06680
Li J, Lindberg DV, Smith RA, Reluga TC. Provisioning of public health can be designed to anticipate public policy responses. Bull Math Biol. 2017;79(1):163–190. doi: 10.1007/s11538-016-0231-8. [DOI] [PubMed] [Google Scholar]
Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, Ren R, Leung KSM, Lau EHY, Wong JY, Xing X, Xiang N, Wu Y, Li C, Chen Q, Li D, Liu T, Zhao J, Liu M, Tu W, Chen C, Jin L, Yang R, Wang Q, Zhou S, Wang R, Liu H, Luo Y, Liu Y, Shao G, Li H, Tao Z, Yang Y, Deng Z, Liu B, Ma Z, Zhang Y, Shi G, Lam TTY, Wu JT, Gao GF, Cowling BJ, Yang B, Leung GM, Feng Z. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N Engl J Med. 2020;382:1199–1207. doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]
McKendrick AG. Applications of mathematics to medical problems. Proc Edinb Math Soc. 1925;44:98–130. doi: 10.1017/S0013091500034428. [DOI] [Google Scholar]
Morton R, Wickwire KH. On the optimal control of a deterministic epidemic. Adv Appl Probab. 1974;6(4):622–635. doi: 10.2307/1426183. [DOI] [Google Scholar]
Mummert A, Otunuga OM. Parameter identification for a stochastic SEIRS epidemic model: case study influenza. J Math Biol. 2019;79(2):705–729. doi: 10.1007/s00285-019-01374-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nåsell I. The quasi-stationary distribution of the closed endemic SIS model. Adv Appl Probab. 1996;28(3):895–932. doi: 10.2307/1428186. [DOI] [Google Scholar]
Neufeld A, Nutz M. Measurability of semimartingale characteristics with respect to the probability law. Stoch Process Appl. 2014;124(11):3819–3845. doi: 10.1016/j.spa.2014.07.006. [DOI] [Google Scholar]
Nutz M. Pathwise construction of stochastic integrals. Electron Commun Probab. 2012;17(24):1–7. [Google Scholar]
Park S, Choi GJ, Ko H. Information technology-based tracing strategy in response to COVID-19 in South Korea—privacy controversies. J Am Med Assoc. 2020;323(21):2129–2130. doi: 10.1001/jama.2020.6602. [DOI] [PubMed] [Google Scholar]
Piguillem F, Shi L (2020) The optimal COVID–19 quarantine and testing policies. Technical report, Einaudi Institute for Economics and Finance
Possamaï D, Tan X, Zhou C. Stochastic control for a class of nonlinear kernels and applications. Ann Probab. 2018;46(1):551–603. doi: 10.1214/17-AOP1191. [DOI] [Google Scholar]
Reichert L, Brack S, Scheuermann B (2020) Privacy-preserving contact tracing of COVID-19 patients. Technical Report 2020/375, Humboldt–Universität zu Berlin and Alexander von Humboldt Instiute for Internet and Society, Berlin
Reluga TC. Game theory of social distancing in response to an epidemic. PLoS Comput Biol. 2010;6(5):e1000793. doi: 10.1371/journal.pcbi.1000793. [DOI] [PMC free article] [PubMed] [Google Scholar]
Reluga TC. Equilibria of an epidemic game with piecewise linear social distancing cost. Bull Math Biol. 2013;75(10):1961–1984. doi: 10.1007/s11538-013-9879-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
Riley S, Fraser C, Donnelly CA, Ghani AC, Abu-Raddad LJ, Hedley AJ, Leung GM, Ho L-M, Lam T-H, Thach TQ. Transmission dynamics of the etiological agent of SARS in Hong Kong: impact of public health interventions. Science. 2003;300(5627):1961–1966. doi: 10.1126/science.1086478. [DOI] [PubMed] [Google Scholar]
Ross R. The prevention of malaria. New York: E.P. Dutton & Company; 1910. [Google Scholar]
Salanié B. The economics of contracts: a primer. Cambridge: MIT Press; 2005. [Google Scholar]
Sannikov Y. A continuous-time version of the principal-agent problem. Rev Econ Stud. 2008;75(3):957–984. doi: 10.1111/j.1467-937X.2008.00486.x. [DOI] [Google Scholar]
Sassi F. Calculating QALYs, comparing QALY and DALY calculations. Health Policy Plan. 2006;21(5):402–408. doi: 10.1093/heapol/czl018. [DOI] [PubMed] [Google Scholar]
Schättler H, Sung J. The first-order approach to the continuous-time principal-agent problem with exponential utility. J Econ Theory. 1993;61(2):331–371. doi: 10.1006/jeth.1993.1072. [DOI] [Google Scholar]
Schreiber SJ, Huang S, Jiang J, Wang H. Extinction and quasi-stationarity for discrete-time, endemic SIS and SIR models. SIAM J Appl Math. 2021;81(5):2195–2217. doi: 10.1137/20M1339015. [DOI] [Google Scholar]
Sethi SP, Staats PW. Optimal control of some simple deterministic epidemic models. J Oper Res Soc. 1978;29(2):129–136. doi: 10.1057/jors.1978.27. [DOI] [Google Scholar]
Stroock DW, Varadhan SRS (1997) Multidimensional diffusion processes, volume 233 of Grundlehren der mathematischen Wissenschaften. Springer, Berlin
Taylor HM. Some models in epidemic control. Math Biosci. 1968;3:383–398. doi: 10.1016/0025-5564(68)90093-X. [DOI] [Google Scholar]
Tornatore E, Buccellato SM, Vetro P. Stability of a stochastic SIR system. Physica A. 2005;354(15):111–126. doi: 10.1016/j.physa.2005.02.057. [DOI] [Google Scholar]
Valeeva NI, Backus GBC (2007) Incentive systems under ex post moral hazard to control outbreaks of classical swine fever in the Netherlands. Technical report, Agricultural Economics Research Institute and Wageningen University
Warin X. Some non-monotone schemes for time dependent Hamilton–Jacobi–Bellman equations in stochastic control. J Sci Comput. 2016;66(3):1122–1147. doi: 10.1007/s10915-015-0057-9. [DOI] [Google Scholar]
Weiss GH, Dishon M. On the asymptotic behavior of the stochastic and deterministic models of an epidemic. Math Biosci. 1971;11(3–4):261–265. doi: 10.1016/0025-5564(71)90087-3. [DOI] [Google Scholar]
Wickwire KH. Optimal isolation policies for deterministic and stochastic epidemics. Math Biosci. 1975;26(3–4):325–346. doi: 10.1016/0025-5564(75)90020-6. [DOI] [Google Scholar]
Wilder-Smith A, Chiew CJ, Lee VJ. Can we contain the COVID-19 outbreak with the same measures as for SARS? Lancet Infect Dis. 2020;20(5):E102–E107. doi: 10.1016/S1473-3099(20)30129-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wilson EB, Worcester J. The law of mass action in epidemiology. Proc Natl Acad Sci USA. 1945;31(1):24–34. doi: 10.1073/pnas.31.1.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
Zeckhauser R, Shepard D. Where now for saving lives? Law Contemp Probl. 1976;40(4):5–45. doi: 10.2307/1191310. [DOI] [Google Scholar]
Zhang X, Wu J, Zhao P, Su X, Choi D. Epidemic spreading on a complex network with partial immunization. Soft Comput. 2018;22(14):4525–4533. doi: 10.1007/s00500-017-2903-1. [DOI] [Google Scholar]

[CR1] Abakuks A. An optimal isolation policy for an epidemic. J Appl Probab. 1973;10(2):247–262. doi: 10.2307/3212343. [DOI] [Google Scholar]

[CR2] Abbey H. An examination of the Reed–Frost theory of epidemics. Hum Biol. 1952;24(3):201–233. [PubMed] [Google Scholar]

[CR3] Aïd R, Possamaï D, Touzi N (2018) Optimal electricity demand response contracting with responsiveness incentives. Math Oper Res. To appear

[CR4] Allen LJS (2008) An introduction to stochastic epidemic models. In: Brauer F, van den Driessche P, Wu J (eds) Mathematical epidemiology, volume 1945 of Lecture notes in mathematics. Springer, Berlin, pp 81–130

[CR5] Anand S, Hanson K. Disability-adjusted life years: a critical review. J Health Econ. 1997;16(6):685–702. doi: 10.1016/S0167-6296(97)00005-2. [DOI] [PubMed] [Google Scholar]

[CR6] Anderson RM, May RM. Population biology of infectious diseases: part I. Nature. 1979;280(5721):361–367. doi: 10.1038/280361a0. [DOI] [PubMed] [Google Scholar]

[CR7] Anderson RM, Heesterbeek H, Klinkenberg D, Hollingsworth TD. How will country-based mitigation measures influence the course of the COVID-19 epidemic? The Lancet. 2020;395(10228):931–934. doi: 10.1016/S0140-6736(20)30567-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] Aurell A, Carmona R, Dayanikli G, Laurière M (2020) Optimal incentives to mitigate epidemics: a Stackelberg mean field game approach. arXiv:2011.03105

[CR9] Bailey NTJ. The mathematical theory of infectious diseases and its applications. 2. London: Charles Griffin & Company; 1975. [Google Scholar]

[CR10] Bartlett MS. Some evolutionary stochastic processes. J R Stat Soc Ser B (Methodol) 1949;11(2):211–229. [Google Scholar]

[CR11] Bayraktar E, Cohen A, Nellis A. A macroeconomic SIR model for COVID- $19$ . Mathematics. 2021;9(16):1901. doi: 10.3390/math9161901. [DOI] [Google Scholar]

[CR12] Behncke H. Optimal control of deterministic epidemics. Optim Control Appl Methods. 2000;21(6):269–285. doi: 10.1002/oca.678. [DOI] [Google Scholar]

[CR13] Beretta E, Kolmanovskii V, Shaikhet L. Stability of epidemic model with time delays influenced by stochastic perturbations. Math Comput Simul. 1998;45(3–4):269–277. doi: 10.1016/S0378-4754(97)00106-7. [DOI] [Google Scholar]

[CR14] Bernoulli D (1760) Essai d’une nouvelle analyse de la mortalité causée par la petite vérole, et des avantages de l’inoculation pour la prévenir. In Histoire de l’Académie Royale des Sciences. Année $M . DCCLX$ . Avec les mémoires de mathématique & de physique, pour la même année, tirés des registres de cette académie.(Mémoires). Imprimerie Royale, Paris, pp 1–45

[CR15] Bichteler K. Stochastic integration and $L^{p}$ -theory of semimartingales. Ann Probab. 1981;9(1):49–89. [Google Scholar]

[CR16] Bolton P, Dewatripont M. Contract theory. Cambridge: MIT Press; 2005. [Google Scholar]

[CR17] Bouchard B, Possamaï D, Tan X, Zhou C. A unified approach to a priori estimates for supersolutions of BSDEs in general filtrations. Ann l’inst Henri Poincaré, Prob Stat (B) 2018;54(1):154–172. [Google Scholar]

[CR18] Britton T, Pardoux É (eds) (2019) Stochastic epidemic models with inference, volume 2255 of Lecture. Springer, Cham

[CR19] Camilli F, Falcone M. An approximation scheme for the optimal control of diffusion processes. ESAIM Math Model Numer Anal. 1995;29(1):97–122. doi: 10.1051/m2an/1995290100971. [DOI] [Google Scholar]

[CR20] Carmona R, Wang P. Finite-state contract theory with a principal and a field of agents. Manag Sci. 2021;67(8):4643–5300. doi: 10.1287/mnsc.2020.3760. [DOI] [Google Scholar]

[CR21] Charpentier A, Élie R, Laurière M, Tran VC. COVID-19 pandemic control: balancing detection policy and lockdown intervention under ICU sustainability. Math Model Nat Phenom. 2020;15(57):1–52. [Google Scholar]

[CR22] Cho H, Ippolito D, Yu YW (2020) Contact tracing mobile apps for COVID-19: privacy considerations and related trade–offs. arXiv:2003.11511

[CR23] Corless RM, Gonnet GH, Hare DEG, Jeffrey DJ, Knuth DE. On the LambertW function. Adv Comput Math. 1996;5(1):329–359. doi: 10.1007/BF02124750. [DOI] [Google Scholar]

[CR24] Cvitanić J, Xing H. Asset pricing under optimal contracts. J Econ Theory. 2018;173:142–180. doi: 10.1016/j.jet.2017.10.005. [DOI] [Google Scholar]

[CR25] Cvitanić J, Zhang J. Contract theory in continuous-time models. Berlin: Springer; 2012. [Google Scholar]

[CR26] Cvitanić J, Possamaï D, Touzi N. Moral hazard in dynamic risk management. Manag Sci. 2017;63(10):3328–3346. doi: 10.1287/mnsc.2016.2493. [DOI] [Google Scholar]

[CR27] Cvitanić J, Possamaï D, Touzi N. Dynamic programming approach to principal-agent problems. Finance Stochast. 2018;22(1):1–37. doi: 10.1007/s00780-017-0344-4. [DOI] [Google Scholar]

[CR28] Dieu NT, Nguyen DH, Du NH, Yin G. Classification of asymptotic behavior in a stochastic SIR model. SIAM J Appl Dyn Syst. 2016;15(2):1062–1084. doi: 10.1137/15M1043315. [DOI] [Google Scholar]

[CR29] Dolbeault J, Turinici G. Heterogeneous social interactions and the COVID-19 lockdown outcome in a multi-group SEIR model. Math Model Nat Phenom. 2020;15(36):1–18. [Google Scholar]

[CR30] Du NH, Nhu NN. Permanence and extinction for the stochastic SIR epidemic model. J Differ Equ. 2020;269(11):9619–9652. doi: 10.1016/j.jde.2020.06.049. [DOI] [Google Scholar]

[CR31] El Euch O, Mastrolia T, Rosenbaum M, Touzi N. Optimal make-take fees for market making regulation. Math Financ. 2021;31(1):109–148. doi: 10.1111/mafi.12295. [DOI] [Google Scholar]

[CR32] El Karoui N, Tan X (2013) Capacities, measurable selection and dynamic programming part II: application in stochastic control problems. arXiv:1310.3364

[CR33] Élie R, Mastrolia T, Possamaï D. A tale of a principal and many many agents. Math Oper Res. 2019;44(2):440–467. doi: 10.1287/moor.2018.0931. [DOI] [Google Scholar]

[CR34] Élie R, Hubert E, Turinici G. Contact rate epidemic control of COVID-19: an equilibrium view. Math Model Nat Phenom. 2020;15(35):1–25. [Google Scholar]

[CR35] Élie R, Hubert E, Mastrolia T, Possamaï D. Mean-field moral hazard for optimal energy demand response management. Math Financ. 2021;31(1):399–473. doi: 10.1111/mafi.12291. [DOI] [Google Scholar]

[CR36] Farr W (1840) Second annual report of the registrar-general of births, deaths and marriages in England, chapter Appendix. Longman, Orme, Brown, Green, & Longmans, London, pp 69–98

[CR37] Ferguson N, Laydon D, Nedjati-Gilani G, Imai N, Ainslie K, Baguelin M, Bhatia S, Boonyasiri A, Cucunubá Z, Cuomo-Dannenburg G, Dighe A, Dorigatti I, Fu H, Gaythorpe K, Green W, Hamlet A, Hinsley W, Okell LC, van Elsland S, Thompson H, Verity R, Volz E, Wang H, Wang Y, Walker PGT, Walters C, Winskill P, Whittaker C, Donnely CA, Riley S, Ghani AC (2020) Report 9: impact of non–pharmaceutical interventions (NPIs) to reduce COVID–19 mortality and healthcare demand. Technical report, Imperial College London

[CR38] Fowler JH, Hill SJ, Levin R, Obradovich N (2020) The effect of stay-at-home orders on COVID-19 infections in the United States. arXiv:2004.06098

[CR39] Francis PJ. Optimal tax/subsidy combinations for the flu season. J Econ Dyn Control. 2004;28(10):2037–2054. doi: 10.1016/j.jedc.2003.08.001. [DOI] [Google Scholar]

[CR40] Gao N, Song Y, Wang X, Liu J. Dynamics of a stochastic SIS epidemic model with nonlinear incidence rates. Adv Differ Equ. 2019;2019(1):41. doi: 10.1186/s13662-019-1980-0. [DOI] [Google Scholar]

[CR41] Gevret H, Langrené N, Lelong J, Warin X, Maheshwari A (2018) STochastic OPTimization library in C++. HAL preprint arXiv:hal-01361291

[CR42] Giordano G, Blanchini F, Bruno R, Colaneri P, Di Filippo A, Di Matteo A, Colaneri M. Modelling the COVID-19 epidemic and implementation of population-wide interventions in Italy. Nat Med. 2020;26:855–860. doi: 10.1038/s41591-020-0883-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] Gramig BM, Horan RD, Wolf CA (2005) A model of incentive compatibility under moral hazard in livestock disease outbreak response. Technical report, Michigan State University

[CR44] Gramig BM, Horan RD, Wolf CA. Livestock disease indemnity design when moral hazard is followed by adverse selection. Am J Agric Econ. 2009;91(3):627–641. doi: 10.1111/j.1467-8276.2009.01256.x. [DOI] [Google Scholar]

[CR45] Gray A, Greenhalgh D, Hu L, Mao X, Pan J. A stochastic differential equation SIS epidemic model. SIAM J Appl Math. 2011;71(3):876–902. doi: 10.1137/10081856X. [DOI] [Google Scholar]

[CR46] Greenwood PE, Gordillo LF. Stochastic epidemic modeling. In: Chowell G, Hyman JM, Bettencourt LMA, Castillo-Chavez C, editors. Mathematical and statistical estimation approaches in epidemiology. Dordrecht: Springer; 2009. pp. 31–52. [Google Scholar]

[CR47] Grigorieva E, Khailov E, Korobeinikov A (2020) Optimal quarantine strategies for COVID-19 control models. arXiv:2004.10614 [DOI] [PMC free article] [PubMed]

[CR48] Hamer WH. The Milroy lectures on epidemic disease in England—the evidence of variability and of persistency of type. The Lancet. 1906;167(4306):655–662. doi: 10.1016/S0140-6736(01)80264-6. [DOI] [Google Scholar]

[CR49] Hansen E, Day T. Optimal control of epidemics with limited resources. J Math Biol. 2011;62(3):423–451. doi: 10.1007/s00285-010-0341-0. [DOI] [PubMed] [Google Scholar]

[CR50] Hatchimonji JS, Swendiman RA, Seamon MJ. Trauma does not quarantine: violence during the COVID-19 pandemic. Ann Surg. 2020;272(2):E53–E54. doi: 10.1097/SLA.0000000000003996. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR51] Holmström B, Milgrom P. Aggregation and linearity in the provision of intertemporal incentives. Econometrica. 1987;55(2):303–328. doi: 10.2307/1913238. [DOI] [Google Scholar]

[CR52] Hu K, Ren Z, Touzi N (2019) Continuous-time principal-agent problem in degenerate systems. arXiv:1910.10527

[CR53] Hubert E, Mastrolia T, Possamaï D, Warin X (2020) Incentives, lockdown, and testing: from Thucydides’s analysis to the COVID-19 pandemic. arXiv:2009.00484 [DOI] [PMC free article] [PubMed]

[CR54] Ienca M, Vayena E. On the responsible use of digital data to tackle the COVID-19 pandemic. Nat Med. 2020;26(4):463–464. doi: 10.1038/s41591-020-0832-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR55] Jiang D, Yu J, Ji C, Shi N. Asymptotic behavior of global positive solution to a stochastic SIR model. Math Comput Model. 2011;54(1–2):221–232. doi: 10.1016/j.mcm.2011.02.004. [DOI] [Google Scholar]

[CR56] Jowett B (1900) Thucydes translated into English, to which is prefixed an essay on inscriptions and a note on the geography of Thucydides, volume I, 2nd revised edition. Oxford University Press, Oxford

[CR57] Kantner M. Beyond just “flattening the curve”: optimal control of epidemics with purely non-pharmaceutical interventions. J Math Ind. 2020;10(23):1–23. doi: 10.1186/s13362-020-00091-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR58] Kendall DG (1956) Deterministic and stochastic epidemics in closed populations. In: Neyman J. (ed) Proceedings of the third Berkeley symposium on mathematical statistics and probability, volume 4: contributions to biology and problems of health, pp 149–165

[CR59] Kermack WO, McKendrick AG. A contribution to the mathematical theory of epidemics. Proc R Soc Lond Ser A. 1927;CXV(772):700–721. [Google Scholar]

[CR60] Kharroubi I, Lim T, Mastrolia T. Regulation of renewable resource exploitation. SIAM J Control Optim. 2020;58(1):551–579. doi: 10.1137/19M1265740. [DOI] [Google Scholar]

[CR61] Laffont J-J, Martimort D. The theory of incentives: the principal-agent model. Princeton: Princeton University Press; 2002. [Google Scholar]

[CR62] Lenhart S, Workman JT. Optimal control applied to biological models. Mathematical and computational biology series. Boca Raton: CRC; 2007. [Google Scholar]

[CR63] Lesniewski A (2020) Epidemic control via stochastic optimal control. arXiv:2004.06680

[CR64] Li J, Lindberg DV, Smith RA, Reluga TC. Provisioning of public health can be designed to anticipate public policy responses. Bull Math Biol. 2017;79(1):163–190. doi: 10.1007/s11538-016-0231-8. [DOI] [PubMed] [Google Scholar]

[CR65] Li Q, Guan X, Wu P, Wang X, Zhou L, Tong Y, Ren R, Leung KSM, Lau EHY, Wong JY, Xing X, Xiang N, Wu Y, Li C, Chen Q, Li D, Liu T, Zhao J, Liu M, Tu W, Chen C, Jin L, Yang R, Wang Q, Zhou S, Wang R, Liu H, Luo Y, Liu Y, Shao G, Li H, Tao Z, Yang Y, Deng Z, Liu B, Ma Z, Zhang Y, Shi G, Lam TTY, Wu JT, Gao GF, Cowling BJ, Yang B, Leung GM, Feng Z. Early transmission dynamics in Wuhan, China, of novel coronavirus-infected pneumonia. N Engl J Med. 2020;382:1199–1207. doi: 10.1056/NEJMoa2001316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR66] McKendrick AG. Applications of mathematics to medical problems. Proc Edinb Math Soc. 1925;44:98–130. doi: 10.1017/S0013091500034428. [DOI] [Google Scholar]

[CR67] Morton R, Wickwire KH. On the optimal control of a deterministic epidemic. Adv Appl Probab. 1974;6(4):622–635. doi: 10.2307/1426183. [DOI] [Google Scholar]

[CR68] Mummert A, Otunuga OM. Parameter identification for a stochastic SEIRS epidemic model: case study influenza. J Math Biol. 2019;79(2):705–729. doi: 10.1007/s00285-019-01374-z. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR69] Nåsell I. The quasi-stationary distribution of the closed endemic SIS model. Adv Appl Probab. 1996;28(3):895–932. doi: 10.2307/1428186. [DOI] [Google Scholar]

[CR70] Neufeld A, Nutz M. Measurability of semimartingale characteristics with respect to the probability law. Stoch Process Appl. 2014;124(11):3819–3845. doi: 10.1016/j.spa.2014.07.006. [DOI] [Google Scholar]

[CR71] Nutz M. Pathwise construction of stochastic integrals. Electron Commun Probab. 2012;17(24):1–7. [Google Scholar]

[CR72] Park S, Choi GJ, Ko H. Information technology-based tracing strategy in response to COVID-19 in South Korea—privacy controversies. J Am Med Assoc. 2020;323(21):2129–2130. doi: 10.1001/jama.2020.6602. [DOI] [PubMed] [Google Scholar]

[CR73] Piguillem F, Shi L (2020) The optimal COVID–19 quarantine and testing policies. Technical report, Einaudi Institute for Economics and Finance

[CR74] Possamaï D, Tan X, Zhou C. Stochastic control for a class of nonlinear kernels and applications. Ann Probab. 2018;46(1):551–603. doi: 10.1214/17-AOP1191. [DOI] [Google Scholar]

[CR75] Reichert L, Brack S, Scheuermann B (2020) Privacy-preserving contact tracing of COVID-19 patients. Technical Report 2020/375, Humboldt–Universität zu Berlin and Alexander von Humboldt Instiute for Internet and Society, Berlin

[CR76] Reluga TC. Game theory of social distancing in response to an epidemic. PLoS Comput Biol. 2010;6(5):e1000793. doi: 10.1371/journal.pcbi.1000793. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR77] Reluga TC. Equilibria of an epidemic game with piecewise linear social distancing cost. Bull Math Biol. 2013;75(10):1961–1984. doi: 10.1007/s11538-013-9879-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR78] Riley S, Fraser C, Donnelly CA, Ghani AC, Abu-Raddad LJ, Hedley AJ, Leung GM, Ho L-M, Lam T-H, Thach TQ. Transmission dynamics of the etiological agent of SARS in Hong Kong: impact of public health interventions. Science. 2003;300(5627):1961–1966. doi: 10.1126/science.1086478. [DOI] [PubMed] [Google Scholar]

[CR79] Ross R. The prevention of malaria. New York: E.P. Dutton & Company; 1910. [Google Scholar]

[CR80] Salanié B. The economics of contracts: a primer. Cambridge: MIT Press; 2005. [Google Scholar]

[CR81] Sannikov Y. A continuous-time version of the principal-agent problem. Rev Econ Stud. 2008;75(3):957–984. doi: 10.1111/j.1467-937X.2008.00486.x. [DOI] [Google Scholar]

[CR82] Sassi F. Calculating QALYs, comparing QALY and DALY calculations. Health Policy Plan. 2006;21(5):402–408. doi: 10.1093/heapol/czl018. [DOI] [PubMed] [Google Scholar]

[CR83] Schättler H, Sung J. The first-order approach to the continuous-time principal-agent problem with exponential utility. J Econ Theory. 1993;61(2):331–371. doi: 10.1006/jeth.1993.1072. [DOI] [Google Scholar]

[CR84] Schreiber SJ, Huang S, Jiang J, Wang H. Extinction and quasi-stationarity for discrete-time, endemic SIS and SIR models. SIAM J Appl Math. 2021;81(5):2195–2217. doi: 10.1137/20M1339015. [DOI] [Google Scholar]

[CR85] Sethi SP, Staats PW. Optimal control of some simple deterministic epidemic models. J Oper Res Soc. 1978;29(2):129–136. doi: 10.1057/jors.1978.27. [DOI] [Google Scholar]

[CR86] Stroock DW, Varadhan SRS (1997) Multidimensional diffusion processes, volume 233 of Grundlehren der mathematischen Wissenschaften. Springer, Berlin

[CR87] Taylor HM. Some models in epidemic control. Math Biosci. 1968;3:383–398. doi: 10.1016/0025-5564(68)90093-X. [DOI] [Google Scholar]

[CR88] Tornatore E, Buccellato SM, Vetro P. Stability of a stochastic SIR system. Physica A. 2005;354(15):111–126. doi: 10.1016/j.physa.2005.02.057. [DOI] [Google Scholar]

[CR89] Valeeva NI, Backus GBC (2007) Incentive systems under ex post moral hazard to control outbreaks of classical swine fever in the Netherlands. Technical report, Agricultural Economics Research Institute and Wageningen University

[CR90] Warin X. Some non-monotone schemes for time dependent Hamilton–Jacobi–Bellman equations in stochastic control. J Sci Comput. 2016;66(3):1122–1147. doi: 10.1007/s10915-015-0057-9. [DOI] [Google Scholar]

[CR91] Weiss GH, Dishon M. On the asymptotic behavior of the stochastic and deterministic models of an epidemic. Math Biosci. 1971;11(3–4):261–265. doi: 10.1016/0025-5564(71)90087-3. [DOI] [Google Scholar]

[CR92] Wickwire KH. Optimal isolation policies for deterministic and stochastic epidemics. Math Biosci. 1975;26(3–4):325–346. doi: 10.1016/0025-5564(75)90020-6. [DOI] [Google Scholar]

[CR93] Wilder-Smith A, Chiew CJ, Lee VJ. Can we contain the COVID-19 outbreak with the same measures as for SARS? Lancet Infect Dis. 2020;20(5):E102–E107. doi: 10.1016/S1473-3099(20)30129-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR94] Wilson EB, Worcester J. The law of mass action in epidemiology. Proc Natl Acad Sci USA. 1945;31(1):24–34. doi: 10.1073/pnas.31.1.24. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR95] Zeckhauser R, Shepard D. Where now for saving lives? Law Contemp Probl. 1976;40(4):5–45. doi: 10.2307/1191310. [DOI] [Google Scholar]

[CR96] Zhang X, Wu J, Zhao P, Su X, Choi D. Epidemic spreading on a complex network with partial immunization. Soft Comput. 2018;22(14):4525–4533. doi: 10.1007/s00500-017-2903-1. [DOI] [Google Scholar]

PERMALINK

Incentives, lockdown, and testing: from Thucydides’ analysis to the COVID-19 pandemic

Emma Hubert

Thibaut Mastrolia

Dylan Possamaï

Xavier Warin

Abstract

Introduction

Informal pandemic models and main results

Controlled stochastic SIS/SIR dynamics

Fig. 1.

Remark 2.1

Remark 2.2

The Stackelberg equilibrium

Population optimisation problem

Government optimisation problem

Remark 2.3

Utilities and cost specifications

Two alternative problems

Main results and comparison

The benchmark case: without tax and testing policies

Remark 2.4

The first-best case: without moral hazard

Remark 2.5

Relevant form of tax policy

Government’s problem in the general case

Numerical experiments

Choice of parameters

Table 1.

Table 2.

Numerical approach

The benchmark case

Fig. 2.

Fig. 3.

Fig. 4.

Lockdown policy, without testing

Fig. 5.

Fig. 6.

Fig. 7.

Fig. 8.

Fig. 9.

Tax policy with testing

Fig. 10.

Fig. 13.

Fig. 11.

Fig. 12.

Fig. 14.

The first-best case

Fig. 15.

Incentive policy for epidemic stochastic models

The stochastic model

Initial canonical space

Definition 4.1

Remark 4.2

Remark 4.3

Impact of the interaction

Optimisation problems

Remark 4.4

Optimal interaction of the population given tax and test policies

A relevant contract form

Assumption 4.5

Remark 4.6

The general analysis

Theorem 4.7

Proof

Lemma 4.8

Proof

Characterisation of the class of admissible contracts

Lemma 4.9

Proof

Optimal tax and test policies under moral hazard for epidemic models

Weak formulation for the government’s problem

Solving the government’s problem

Remark 4.10

Comparison with the first-best case

Extensions and generalisations

Diseases with latency periods: SEIS, SEIR

Beyond SEIS/SEIR models: a theoretically tractable method

Footnotes

Contributor Information