Dynamic Games and Applications. 2021 Oct 11;12(1):214–236. doi: 10.1007/s13235-021-00403-1

Dynamic Games of Social Distancing During an Epidemic: Analysis of Asymmetric Solutions

Ioannis Kordonis 1, Athanasios-Rafail Lagos 1, George P. Papavassilopoulos 1,2

Abstract

Individual behaviors play an essential role in the dynamics of transmission of infectious diseases, including COVID-19. This paper studies a dynamic game model that describes the social distancing behaviors during an epidemic, assuming a continuum of players and individual infection dynamics. The evolution of the players’ infection states follows a variant of the well-known SIR dynamics. We assume that the players are not sure about their infection state, and thus, they choose their actions based on their individually perceived probabilities of being susceptible, infected, or removed. The cost of each player depends both on her infection state and on the contact with others. We prove the existence of a Nash equilibrium and characterize Nash equilibria using nonlinear complementarity problems. We then exploit some monotonicity properties of the optimal policies to obtain a reduced-order characterization for Nash equilibrium and reduce its computation to the solution of a low-dimensional optimization problem. It turns out that, even in the symmetric case, where all the players have the same parameters, players may have very different behaviors. We finally present some numerical studies that illustrate this interesting phenomenon and investigate the effects of several parameters, including the players’ vulnerability, the time horizon, and the maximum allowed actions, on the optimal policies and the players’ costs.

Keywords: COVID-19 pandemic, Games of social distancing, Epidemics modeling and control, Nash games, Nonlinear complementarity problems

Introduction

The COVID-19 pandemic is one of the most important events of this era. By early April 2021, it had caused more than 2.8 million deaths, an unprecedented economic depression, and affected most aspects of people's lives in the larger part of the world. During the first phases of the pandemic, non-pharmaceutical interventions (primarily social distancing) have been one of the most efficient tools to control its spread [13]. Due to the slow roll-out of the vaccines, their uneven distribution, the emergence of SARS-CoV-2 variants, age limitations, and people's resistance to vaccination, social distancing is likely to remain significant in a large part of the globe for the near future.

Mathematical modeling of epidemics dates back to the early twentieth century with the seminal works of Ross [33] and Kermack and McKendrick [24]. A widely used modeling approach separates people into several compartments according to their infection state (e.g., susceptible, exposed, infected, recovered, etc.) and derives differential equations describing the evolution of the population of each compartment (for a review, see [1]).

However, the description of individual behaviors (practice of social distancing, use of face masks, vaccination, etc.) is essential for the understanding of the spread of epidemics. Game theory is thus a particularly relevant tool. A dynamic game model describing voluntary implementation of non-pharmaceutical interventions (NPIs) was presented in [32]. Several extensions were published, including the case where infection severity depends on the epidemic burden [6], and different formulations of the cost functions (linear vs. nonlinear, and finite horizon vs. discounted infinite horizon or stochastic horizon) [11, 15, 37]. Aurell et al. [4] study the design of incentives to achieve optimal social distancing, in a Stackelberg game framework. The works [2, 10, 12, 17, 22, 30, 31, 39] study different aspects of the coupled dynamics between individual behaviors and the spread of an epidemic, in the context of evolutionary game theory. Network game models appear in [3, 18, 27, 29], and the effects of altruism on the spread of epidemics are studied in [8, 14, 23, 26]. Another closely related stream of research is the study of the adoption of decentralized protection strategies in engineered and social networks [19, 21, 36, 38]. A review of game theoretic models for epidemics (including also topics other than social distancing, e.g., vaccination) is presented in [9, 20].

This paper presents a dynamic game model to describe the social distancing choices of individuals during an epidemic, assuming that the players are not certain about their infection state (susceptible (S), infected (I), or removed (R)). The probability that a player is at each health state evolves dynamically depending on the player’s distancing behavior, the others’ behavior, and the prevalence of the epidemic. We assume that the players care both about their health state and about maintaining their social contacts. The players may have different characteristics, e.g., vulnerable versus less vulnerable, or care differently about maintaining their social contacts.

We assume that the players are not sure about their infection state, and thus, they choose their actions based on their individually perceived probabilities of being susceptible, infected, or removed. In contrast with most of the literature, in the current work, players—even players with the same characteristics—are allowed to behave differently. We first characterize the optimal action of a player, given the others’ behavior, and show some monotonicity properties of optimal actions. We then prove the existence of a Nash equilibrium and characterize it in terms of a nonlinear complementarity problem.

Using the monotonicity of the optimal solution, we provide a simple reduced-order characterization of the Nash equilibrium in terms of a nonlinear programming problem. This formulation simplifies the computation of the equilibria drastically. Based on that result, we performed numerical studies, which verify that players with the same parameters may follow different strategies. This phenomenon seems realistic since people facing the same risks or belonging to the same age group often have different social distancing behaviors.

The model presented in this paper differs from most of the dynamic game models presented in the literature in the following ways:

  1. The players act without knowing their infection state. However, they know the probability of being susceptible, infected, or recovered, which depends on their previous actions and the prevalence of the epidemic.

  2. The current model allows for asymmetric solutions, i.e., players with the same characteristics may behave differently.

The rest of the paper is organized as follows. Section 2 presents the game theoretic model. In Sect. 3, we analyze the optimization problem of each player and prove some monotonicity properties. In Sect. 4, we prove the existence of the equilibrium and provide Nash equilibrium characterizations. Section 5 presents some numerical results. Finally, the Appendix contains the proofs of the results of the main text.

The Model

This section presents the dynamic model for the epidemic spread and the social distancing game among the members of the society.

We assume that the infection state of each agent can be susceptible (S), infected (I), recovered (R), or dead (D). A susceptible person gets infected at a rate proportional to the number of infected people she meets. An infected person either recovers or dies at constant rates, which depend on her vulnerability. An individual who has recovered from the infection is immune, i.e., she cannot get infected again. The evolution of the infection state of an individual is shown in Fig. 1.

Fig. 1  The evolution of the infection state of each individual

We assume that there is a continuum of agents. This approximation is frequently used in game theoretic models dealing with a very large number of agents. The set of players is described by the measure space ([0,1), B, μ), where B is the Borel σ-algebra and μ the Lebesgue measure. That is, each player is indexed by an i ∈ [0,1).

Denote by S_i(t), I_i(t), R_i(t), D_i(t) the probabilities that player i ∈ [0,1) is susceptible, infected, removed, or dead at time t. The dynamics are given by:

$$\dot S_i = -r\,u_i S_i I^f,\qquad \dot I_i = r\,u_i S_i I^f - \alpha_i I_i,\qquad \dot R_i = \bar\alpha_i I_i,\qquad \dot D_i = (\alpha_i - \bar\alpha_i) I_i, \tag{1}$$

where r, α_i, ᾱ_i are positive constants (with ᾱ_i ≤ α_i), and u_i(t) is the action of player i at time t. The quantity u_i(t) describes player i's socialization, which is proportional to the time she spends in public places. The quantity I^f, which denotes the density of infected people in public places, is given by:

$$I^f(t) = \int_0^1 I_i(t)\, u_i(t)\, \mu(di). \tag{2}$$

For the actions of the players, we assume that there are positive constants u_m, u_M such that u_i(t) ∈ [u_m, u_M] ⊆ [0,1]. The constant u_m describes the minimum social contacts needed for an agent to survive, and u_M is an upper bound posed by the government.
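For concreteness, the following is a minimal simulation sketch of the dynamics (1)–(2) under a finite number of types (Assumption 1 below) and piecewise-constant actions (Assumption 2 below). It is not part of the paper: the function name, the forward-Euler integration, the step size, and the one-week decision intervals are illustrative assumptions.

```python
import numpy as np

def simulate_sir(S0, I0, alpha, u, masses=None, r=0.4, dt=0.01, interval=7.0):
    """Forward-Euler integration of (1)-(2) for M player types.

    S0, I0, alpha : arrays of length M (initial probabilities, removal rates)
    u             : array of shape (M, N); piecewise-constant action of each
                    type on the N decision intervals
    masses        : type masses m_j (default: equal masses summing to 1)
    """
    M, N = u.shape
    masses = np.ones(M) / M if masses is None else masses
    S, I = S0.astype(float).copy(), I0.astype(float).copy()
    steps = int(interval / dt)
    traj = [(S.copy(), I.copy())]
    for k in range(N):                              # decision interval [t_k, t_{k+1})
        for _ in range(steps):
            If = np.sum(masses * I * u[:, k])       # eq. (2): infected density in public places
            dS = -r * u[:, k] * S * If              # eq. (1)
            dI = r * u[:, k] * S * If - alpha * I
            S, I = S + dt * dS, I + dt * dI
        traj.append((S.copy(), I.copy()))
    return traj
```

For instance, simulate_sir(np.array([0.99]), np.array([0.01]), np.array([1/6]), 0.6 * np.ones((1, 13))) integrates a single-type population that keeps a constant action of 0.6 over 13 weekly intervals.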

The cost function for player i is given by:

$$J_i = G_i\,\big(1 - S_i(T)\big) - s_i \int_0^T u_i(t)\,\tilde u(t)\,dt - s_i \int_0^T \kappa\, u_i(t)\,dt, \tag{3}$$

where T is the time horizon. The parameter Gi>0 depends on the vulnerability of the player and indicates the disutility a player experiences if she gets infected. The quantity 1-Si(T) corresponds to the probability that player i gets infected before the end of the time horizon. (Note that in that case the infection state of the player at the end of the time horizon is I, R, or D.) The second term corresponds to the utility a player derives from the interaction with the other players, whose mean action is denoted by u~(t):

$$\tilde u(t) = \int_0^1 u_i(t)\,\mu(di). \tag{4}$$

The third term indicates the interest of a person to visit public places. The relative magnitude of this desire is modeled by a positive constant κ. Finally, the constant s_i indicates the importance player i gives to the last two terms, which correspond to going out and interacting with other people.

Considering the auxiliary variable u¯(t):

$$\bar u(t) = \kappa + \tilde u(t), \tag{5}$$

and computing S_i(T) by solving (1), the cost can be written equivalently as:

$$J_i = G_i\left(1 - S_i(0)\, e^{-r\int_0^T u_i(t)\, I^f(t)\,dt}\right) - s_i \int_0^T u_i(t)\,\bar u(t)\,dt. \tag{6}$$

Without loss of generality, assume that R_i(0) = D_i(0) = 0 for all i ∈ [0,1).

Assumption 1

(Finite number of types) There are M types of players. Particularly, there are M+1 values 0 = ī_0 < ī_1 < ⋯ < ī_M = 1 such that the functions S_i(0), I_i(0), G_i, s_i, α_i : [0,1) → ℝ are constant for i ∈ [ī_0, ī_1), i ∈ [ī_1, ī_2), …, i ∈ [ī_{M-1}, ī_M). Denote by m_j = μ([ī_{j-1}, ī_j)) the mass of the players of type j. Of course, m_1 + ⋯ + m_M = 1.

Remark 1

The finite number of types assumption is very common in many applications dealing with a large number of agents. For example, in the current COVID-19 pandemic, people are grouped based on their age and/or underlying diseases to be prioritized for vaccination. Assumption 1, combined with some results of the following section, is convenient to describe the evolution of the states of a continuum of players using a finite number of differential equations.

Assumption 2

(Piecewise constant actions) The interval [0, T) can be divided into subintervals [t_k, t_{k+1}), with 0 = t_0 < t_1 < ⋯ < t_N = T, such that the actions of the players are constant in these intervals.

Remark 2

Assumption 2 indicates that people decide only a finite number of times (tk) and follow their decisions for a time interval [tk,tk+1). A reasonable length for that time interval could be 1 week.

The action of player i in the interval [tk,tk+1) is denoted by uki.

Assumption 3

(Measurability of the actions) The function u_k^{(·)} : [0,1) → [u_m, u_M] is measurable.

Under Assumptions 1–3, there is a unique solution to the differential equations (1), with initial conditions S_·(0), I_·(0), and the integrals in (2), (4) are well defined (see Appendix A.1). We use the following notation:

$$\bar u_k = \int_{t_k}^{t_{k+1}} \bar u(t)\,dt, \qquad I^f_k = \int_{t_k}^{t_{k+1}} I^f(t)\,dt.$$

For each player, we define an auxiliary cost, by dropping the fixed terms of (6) and dividing by si:

$$\tilde J_i(u^i, \bar u, I^f) = -b_i \exp\left(-r \sum_{k=0}^{N-1} u^i_k I^f_k\right) - \sum_{k=0}^{N-1} u^i_k \bar u_k, \tag{7}$$

where b_i = S_i(0) G_i / s_i and u^i = [u^i_0, …, u^i_{N-1}]^T. Denote by U = [u_m, u_M]^N the set of possible actions for each player. Observe that u^i minimizes J_i over the feasible set U if and only if it minimizes the auxiliary cost J̃_i. Thus, the optimization problem for player i is equivalent to:

$$\underset{u^i \in U}{\text{minimize}}\ \ \tilde J_i(u^i, \bar u, I^f). \tag{8}$$

Note that the cost of player i depends on the actions of the other players through the terms u¯ and If. Furthermore, the current actions of all the players affect the future values of u¯ and If through the SIR dynamics.

Assumption 4

For a player i of type j, denote b_j = b_i. Assume that the different types of players have different b_j's. Without loss of generality, assume that b_1 < b_2 < ⋯ < b_M.

Assumption 5

Each player i has access only to the probabilities Si and Ii and the aggregate quantities u¯ and If, but not the actual infection states.

Remark 3

This assumption is reasonable in cases where test availability is very sparse, so the agents cannot obtain reliable feedback about their actual health states.

In the rest of the paper, we suppose that Assumptions 1–5 are satisfied.

Analysis of the Optimization Problem of Each Player

In this section, we analyze the optimization problem for a representative player i, given ū_k and I^f_k > 0, for k = 0, …, N-1.

Let us first define the following composite optimization problem:

$$\underset{A}{\text{minimize}}\ \ -b_i e^{-A} + f(A), \tag{9}$$

where:

$$f(A) = \inf_{u^i \in U}\left\{ -\sum_{k=0}^{N-1} u^i_k \bar u_k \ :\ \sum_{k=0}^{N-1} u^i_k I^f_k = A/r \right\}. \tag{10}$$

The following proposition proves that (8) and (9) are equivalent and expresses their solution in a simple threshold form.

Proposition 1

  • (i)

    If u^i is optimal for (8), then u^i ∈ Ũ = {u_m, u_M}^N.

  • (ii)

    Problems (8) and (9) are equivalent, in the sense that they have the same optimal values, and u^i minimizes (8) if and only if there is an optimal A for (9) such that u^i attains the minimum in (10).

  • (iii)

    Let A_m = r u_m ∑_{k=0}^{N-1} I^f_k and A_M = r u_M ∑_{k=0}^{N-1} I^f_k. For A ∈ [A_m, A_M], the function f is continuous, non-increasing, convex, and piecewise affine. Furthermore, it has at most N affine pieces, and f(A) = ∞ for A ∉ [A_m, A_M].

  • (iv)

    There are at most N+1 vectors u^i ∈ U that minimize (8).

  • (v)

    If u^i is optimal for (8), then there is a λ such that ū_k/I^f_k ≤ λ implies u^i_k = u_m, and ū_k/I^f_k > λ implies u^i_k = u_M.

Proof

The idea of the proof of Proposition 1 is to reduce the problem of minimizing (8) to the minimization of the sum of the concave function -b_i e^{-A} and a piecewise affine function f(A). Then, the candidates for the minimum are only the corner points of f(A) and the endpoints of the interval where f is defined. The form of the optimal action u^i comes from a Lagrange multiplier analysis of (10). For a detailed proof, see Appendix A.2.

Remark 4

Part (v) of the proposition shows that if the density of infected people in public places If is high, or the average socialization u¯ is low, then it is optimal for a player to choose a small degree of socialization. The optimal action for each player depends on the ratio u¯k/Ikf. Particularly, there is a threshold λ such that the action of player i is um for values of the ratio below the threshold and uM for ratios above the threshold. Note that the threshold is different for each player.

Remark 5

The fact that the optimal value of a linear program is a convex function of the constraint constants is well known (e.g., see [35], chapter 2). Thus, the convexity of the function f is already known from the literature.

Corollary 1

There is a simple way to solve the optimization problem (8) using the following steps:

  1. Compute Λ = {ū_k/I^f_k : k = 0, …, N-1} ∪ {0}.

  2. For each λ ∈ Λ compute u^λ with:
     $$u^\lambda_k = \begin{cases} u_M & \text{if } \bar u_k / I^f_k > \lambda \\ u_m & \text{if } \bar u_k / I^f_k \le \lambda, \end{cases}$$
     and evaluate J̃_i(u^λ).
  3. Compare the values of J̃_i(u^λ), for all λ ∈ Λ, and choose the minimum.
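A minimal numerical sketch of these three steps is given below (Python; not from the paper). The aggregates ū_k and I^f_k are taken as given, the cost evaluated is the auxiliary cost (7), and the default values of u_m, u_M, and r are the illustrative ones used later in Sect. 5.

```python
import numpy as np

def best_response(b_i, u_bar, I_f, u_m=0.4, u_M=0.75, r=0.4):
    """Solve (8) for one player by the enumeration of Corollary 1.

    b_i        : the player's parameter b_i = S_i(0) G_i / s_i
    u_bar, I_f : arrays of length N with the aggregates \\bar u_k and I^f_k
    """
    ratio = u_bar / I_f
    candidates = np.append(ratio, 0.0)          # the set Lambda of step 1
    best_u, best_cost = None, np.inf
    for lam in candidates:
        u = np.where(ratio > lam, u_M, u_m)     # threshold policy of step 2
        cost = -b_i * np.exp(-r * np.dot(u, I_f)) - np.dot(u, u_bar)  # auxiliary cost (7)
        if cost < best_cost:
            best_u, best_cost = u, cost
    return best_u, best_cost                    # step 3: keep the minimum
```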

We then prove some monotonicity properties for the optimal control.

Proposition 2

Assume that for two players i1 and i2, with parameters bi1 and bi2, the minimizers of (9) are A1 and A2, respectively, and ui1 and ui2 are the corresponding optimal actions. Then:

  • (i)

    If b_{i_1} < b_{i_2}, then A_1 ≥ A_2.

  • (ii)

    If b_{i_1} < b_{i_2}, then u^{i_2}_k ≤ u^{i_1}_k, for k = 0, …, N-1.

  • (iii)

    If b_{i_1} = b_{i_2}, then either u^{i_2}_k ≤ u^{i_1}_k for all k, or u^{i_1}_k ≤ u^{i_2}_k for all k.

Proof

See Appendix A.3.

Remark 6

Recall that b_i = S_i(0) G_i / s_i. Thus, Proposition 2(ii) expresses the fact that if (a) a person is more vulnerable, i.e., she has large G_i, or (b) she derives less utility from the interaction with the others, i.e., she has smaller s_i, or (c) it is more likely that she is not yet infected, i.e., she has larger S_i(0), then she interacts less with the others.

Remark 7

The optimal control law can be expressed in feedback form (see Appendix A.4.1).

Nash Equilibrium Existence and Characterization

Existence and NCP Characterization

In this section, we prove the existence of a Nash equilibrium and characterize it in terms of a nonlinear complementarity problem (NCP).

We consider the set Ũ = {u_m, u_M}^N, defined in Proposition 1. Let v^1, …, v^{2^N} be the members of the set Ũ and p^j_l be the mass of players of type j following action v^l ∈ Ũ. Let also p^j = [p^j_1, …, p^j_{2^N}] be the distribution of actions of the players of type j and π = [p^1, …, p^M] be the distribution of the actions of all the players.

Denote by:

$$\Delta_j = \left\{ p^j \in \mathbb{R}^{2^N} : p^j_l \ge 0,\ \sum_{l=1}^{2^N} p^j_l = m_j \right\}, \tag{11}$$

the set of possible distributions of actions of the players of type j, and by Π = Δ_1 × ⋯ × Δ_M the set of all possible distributions.

Finally, let F : Π → ℝ^{2^N · M} be the vector function of auxiliary costs; that is, the component F_{(j-1)2^N + l}(π) is the auxiliary cost of the players of type j playing strategy v^l, as introduced in (7), when the distribution of actions is π. We denote by F^j(π) = [F_{(j-1)2^N + 1}(π), …, F_{j·2^N}(π)] the vector of the auxiliary costs of the players of type j playing v^l, l = 1, …, 2^N.

Let us recall the notion of a Nash equilibrium for games with a continuum of players (e.g., [28]).

Definition 1

A distribution of actions π ∈ Π is a Nash equilibrium if for all j = 1, …, M and l = 1, …, 2^N:

$$\pi_{(j-1)2^N + l} > 0 \implies l \in \underset{l'}{\operatorname{arg\,min}}\ F_{(j-1)2^N + l'}(\pi). \tag{12}$$

Let δ_j(π) be the value of problem (8), i.e., the minimum value of the auxiliary cost of an agent of type j. This value depends on π through the terms I^f and ū. Define Φ^j(π) = F^j(π) - δ_j(π) (componentwise) and Φ(π) = [Φ^1(π), …, Φ^M(π)]. We then characterize a Nash equilibrium in terms of a nonlinear complementarity problem (NCP):

$$0 \le \pi \ \perp\ \Phi(\pi) \ge 0, \tag{13}$$

where π ⊥ Φ(π) means that π^T Φ(π) = 0.

Proposition 3

  • (i)

    A distribution π ∈ Π corresponds to a Nash equilibrium if and only if it satisfies the NCP (13).

  • (ii)
    A distribution π ∈ Π corresponds to a Nash equilibrium if and only if it satisfies the variational inequality:
    $$(\pi' - \pi)^T F(\pi) \ge 0, \quad \text{for all } \pi' \in \Pi. \tag{14}$$
  • (iii)

    There exists a Nash equilibrium.

Proof

See Appendix A.5.

Remark 8

In principle, we can use algorithms for NCPs to find Nash equilibria. The problem is that the number of decision variables grows exponentially with the number of decision steps. Thus, we expect that such methods would be applicable only for small values of N.
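As a small illustration (not from the paper), once the vector Φ(π) has been evaluated for a candidate distribution π, the complementarity conditions (13) amount to three elementary checks; the helper below is a hypothetical sketch.

```python
import numpy as np

def satisfies_ncp(pi, Phi, tol=1e-8):
    """Check the conditions (13) for a candidate distribution pi.

    pi  : array with the masses pi_{(j-1)2^N + l}
    Phi : array of the same length with Phi(pi) = F(pi) - delta(pi)
    """
    nonneg = np.all(pi >= -tol) and np.all(Phi >= -tol)  # 0 <= pi and Phi(pi) >= 0
    complementary = abs(np.dot(pi, Phi)) <= tol          # pi^T Phi(pi) = 0
    return nonneg and complementary
```

Evaluating Φ(π) itself requires simulating the dynamics for every action in Ũ, which is exactly the exponential bottleneck mentioned above; the reduced-order formulation of the next section avoids it.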

Structure and Reduced-Order Characterization

In this section, we use the monotonicity of the optimal strategies, shown in Proposition 2, to derive a reduced-order characterization of the Nash equilibrium.

The actions at a Nash equilibrium have an interesting structure. Assume that π is a Nash equilibrium and:

$$V = \left\{ v^l \in \tilde U : \exists j \text{ such that } \pi_{(j-1)2^N + l} > 0 \right\} \subseteq \tilde U, \tag{15}$$

is the set of actions used by a set of players with positive mass. Let us define a partial ordering on Ũ. For v^1, v^2 ∈ Ũ, we write v^1 ⪯ v^2 if v^1_k ≤ v^2_k for all k = 1, …, N. Proposition 2(iii) implies that V is a totally ordered subset of Ũ (a chain).

Lemma 1

There are at most N! maximal chains in U~, each of which has length N+1. Thus, at a Nash equilibrium, there are at most N+1 different actions in V.

Proof

See Appendix A.6.

For each time step k, denote by ρ_k the fraction of players who play u_M, that is, ρ_k = μ({i : u^i_k = u_M}). Given any vector ρ ∈ [0,1]^N, we will show that there is a unique π ∈ Π such that the corresponding actions satisfy the conclusion of Proposition 2(iii) and induce the fractions ρ. An example of the relationship between π and ρ is given in Fig. 2.

Fig. 2  In this example, N=5 and there are M=3 types of players, depicted with different colors. The players below each solid line play u_M, and the players above the line play u_m. Example 1 computes π from ρ

Let us define the following sets:

$$I_k = \{ i \in [0,1) : u^i_k = u_M \}, \qquad K_k = \{ k' : \rho_{k'} \ge \rho_k \}, \qquad k = 0, \dots, N-1.$$

Let k_1, …, k_N be a reordering of 0, …, N-1 such that ρ_{k_1} ≤ ρ_{k_2} ≤ ⋯ ≤ ρ_{k_N}. Consider also the set Ṽ = {v̄^1, …, v̄^{N+1}} of N+1 actions v̄^n with:

$$\bar v^n_k = \begin{cases} u_M & \text{if } k \in K_{k_n} \\ u_m & \text{otherwise,} \end{cases} \qquad n = 1, \dots, N, \tag{16}$$

and v̄^{N+1}_k = u_m for all k. Observe that v̄^{n+1} ⪯ v̄^n. The following proposition shows that the set V, defined in (15), is a subset of the set Ṽ.

Before stating the proposition, let us give an example.

Example 1

Consider the fractions described in Fig. 2. There are three types of players with total masses m_1, m_2, and m_3, with corresponding colors blue, pink, and yellow. In this example, we assume that the actions of each player i ∈ [0,1) are given by:

$$u^i_k = \begin{cases} u_M, & \text{if } i < \rho_k \\ u_m, & \text{otherwise.} \end{cases} \tag{17}$$

The sets Ik are given by:

Ik=[0,ρk).

The sets Kk are given by:

K0={2},K1={2,3},K2={2,3,1},K3={2,3,1,4},K4={2,3,1,4,0}.

The actions v¯n are given by:

$$\begin{aligned} \bar v^0 &= [u_M,u_M,u_M,u_M,u_M], & \bar v^1 &= [u_M,u_M,u_m,u_M,u_M], & \bar v^2 &= [u_M,u_M,u_m,u_m,u_M],\\ \bar v^3 &= [u_M,u_m,u_m,u_m,u_M], & \bar v^4 &= [u_M,u_m,u_m,u_m,u_m], & \bar v^5 &= [u_m,u_m,u_m,u_m,u_m]. \end{aligned} \tag{18}$$

The mass of the players of each type following each action is described in the following table:

Type:    1      1        2        2        2        2        3        3
Mass:    ρ_2    ī_1−ρ_2  ρ_3−m_1  ρ_1−ρ_3  ρ_4−ρ_1  ī_2−ρ_4  ρ_0−ī_2  1−ρ_0
Action:  v̄^0    v̄^1      v̄^1      v̄^2      v̄^3      v̄^4      v̄^4      v̄^5

The following proposition and its corollary present a method to compute π from ρ in the general case (i.e., without assuming a set of actions in a form similar to (17)).

Proposition 4

Assume that (u^i_k), i ∈ [0,1), k = 0, …, N-1, with u^i ∈ Ũ, is a set of actions satisfying the conclusions of Proposition 2. Then:

  • (i)

    For k ≠ k', either I_k ⊆ I_{k'} or I_{k'} ⊆ I_k.

  • (ii)

    If for some k, k' it holds that ρ_k = ρ_{k'}, then μ-almost all players have the same action at k and k', i.e., μ({i : u^i_k = u^i_{k'}}) = 1.

  • (iii)
    Up to subsets of measure zero, the following inclusions hold:
    $$I_{k_1} \subseteq I_{k_2} \subseteq \dots \subseteq I_{k_N} \quad\text{and}\quad K_{k_1} \supseteq K_{k_2} \supseteq \dots \supseteq K_{k_N},$$
    where the inclusion I_{k_n} ⊆ I_{k_{n+1}} is understood up to null sets, i.e., μ(I_{k_n} \ I_{k_{n+1}}) = 0. Furthermore, μ(I_k) = ρ_k.
  • (iv)

    For μ-almost all i ∈ I_{k_{n+1}} \ I_{k_n}, the action u^i is given by v̄^{n+1}; for μ-almost all i ∈ I_{k_1}, u^i = v̄^1; and for μ-almost all i ∈ [0,1) \ I_{k_N}, u^i = v̄^{N+1}.

Proof

See Appendix A.7.

Corollary 2

The mass of players of type j with action v¯n is given by:

$$\mu\left(\{ i : i \text{ is of type } j,\ u^i = \bar v^n \}\right) = \mu\left( [\bar i_{j-1}, \bar i_j) \cap [\rho_{k_{n-1}}, \rho_{k_n}) \right), \tag{19}$$

where we use the convention that ρ_{k_0} = 0 and ρ_{k_{N+1}} = 1. Thus:

$$\pi_{(j-1)2^N + l} = \begin{cases} \mu\left( [\bar i_{j-1}, \bar i_j) \cap [\rho_{k_{n-1}}, \rho_{k_n}) \right) & \text{if } v^l = \bar v^n \\ 0 & \text{otherwise.} \end{cases} \tag{20}$$

Proof

The proof follows directly from Propositions 4 and 2(ii).

Remark 9

There are at most M + N + 1 combinations of j, l such that π_{(j-1)2^N + l} > 0.

Let us denote by π~(ρ) the value of vector π computed by (20).
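As an illustration of Corollary 2, the masses in (20) are Lebesgue measures of interval intersections, so π̃(ρ) can be computed directly from ρ and the type boundaries ī_0, …, ī_M. The following sketch (Python; names and layout are not from the paper) returns the mass of each type assigned to each ordered action v̄^n.

```python
import numpy as np

def pi_from_rho(rho, type_edges):
    """Masses of Corollary 2, eq. (20).

    rho        : array of length N with the fractions rho_k of players playing u_M
    type_edges : array [i_0, ..., i_M] with the type boundaries of Assumption 1
    Returns an M x (N+1) array whose (j, n) entry is the mass of type j+1
    playing the ordered action \\bar v^{n+1}.
    """
    cuts = np.concatenate(([0.0], np.sort(rho), [1.0]))  # rho_{k_0}=0 <= rho_{k_1} <= ... <= 1
    M, N = len(type_edges) - 1, len(rho)
    mass = np.zeros((M, N + 1))
    for j in range(M):
        a, b = type_edges[j], type_edges[j + 1]
        for n in range(N + 1):
            lo, hi = cuts[n], cuts[n + 1]
            mass[j, n] = max(0.0, min(b, hi) - max(a, lo))  # measure of the overlap
    return mass
```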

Example 2

The situation is the same as in Example 1, but without assuming that the actions are given by (17). Then, Corollary 2 shows that π~(ρ) is given by the table in Example 1.

Proposition 5

The fractions ρ_0, …, ρ_{N-1} correspond to a Nash equilibrium if and only if:

$$H(\rho) = \sum_{j=1}^{M} \sum_{n=1}^{N+1} \mu\left( [\bar i_{j-1}, \bar i_j) \cap [\rho_{k_{n-1}}, \rho_{k_n}) \right) \left( \bar F_{j,\bar v^n}(\tilde\pi(\rho)) - \delta_j(\tilde\pi(\rho)) \right) = 0, \tag{21}$$

where F̄_{j,v̄^n}(π) is the auxiliary cost of action v̄^n for a player of type j. Furthermore, H(ρ) is continuous and nonnegative.

Proof

See Appendix A.8.

Remark 10

The computation of an equilibrium has been reduced to the calculation of the minimum of an N-dimensional function. We exploit this fact in the following section to proceed with the numerical studies.
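As an illustration of the reduction, the sketch below (Python; not from the paper) treats H in (21) as a black-box nonnegative function of ρ ∈ [0,1]^N — its evaluation would combine a π̃(ρ) computation such as pi_from_rho above with a simulation of the dynamics — and runs a multi-start local search of the kind used in Sect. 5. The function name, the number of restarts, and the choice of optimizer are assumptions.

```python
import numpy as np
from scipy.optimize import minimize

def find_equilibrium(H, N, restarts=20, seed=0):
    """Multi-start local search for a (numerical) root of the nonnegative function H in (21)."""
    rng = np.random.default_rng(seed)
    best = None
    for _ in range(restarts):
        rho0 = rng.uniform(0.0, 1.0, size=N)              # random initial fractions
        res = minimize(H, rho0, method="L-BFGS-B", bounds=[(0.0, 1.0)] * N)
        if best is None or res.fun < best.fun:
            best = res
    return best.x, best.fun   # an (approximate) equilibrium when the minimum is close to zero
```

Since H need not be smooth, derivative-free restarts (e.g., Nelder–Mead) are an equally reasonable choice here.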

Numerical Examples

In this section, we give some numerical examples of Nash equilibria computation. Section 5.1 presents an example with a single type of players and Sect. 5.2 an example with many types of players. Section 5.3 studies the effect of the maximum allowed action uM on the strategies and the costs of the players.1

Single Type of Players

In this subsection, we study the symmetric case, i.e., all the players have the same parameter b_i. The parameters for the dynamics are r = 0.4 and α = 1/6, which correspond to an epidemic with basic reproduction number R_0 = 2.4, where an infected person remains infectious for an average of 6 days. (These parameters are similar to [22], which analyzes the COVID-19 epidemic.) We assume that u_m = 0.4 and that there is a maximum action u_M = 0.75, set by the government. The discretization time intervals are 1 week, and the time horizon T is approximately 3 months (13 weeks). The initial mass of infected players is I_0 = 0.01. We chose this time horizon to model a wave of the epidemic, starting at a time point where 1% of the population is infected. We assume that κ = 3.
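As a quick consistency check (not stated explicitly in the paper): with full socialization (u ≡ 1) and S ≈ 1, the dynamics (1) reduce to a classical SIR model with transmission rate r and removal rate α, so R_0 = r/α = 0.4 × 6 = 2.4, and the mean infectious period is 1/α = 6 days, matching the values quoted above.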

We then compute the Nash equilibrium using a multi-start local search method for (21). Figure 3 shows the fraction ρ of players having action u_M at each time step and the evolution of the total mass of infected players for several values of b. We observe that, for small values of b, which correspond to less vulnerable or very sociable agents, the players do not engage in voluntary social distancing. For intermediate values of b, the players engage in voluntary social distancing, especially when there is a large epidemic prevalence. For large values of b, there is an initial reaction of the players which reduces the number of infected people. Then, the actions of the players return to intermediate levels and keep the number of infected people moderate. In all the cases, voluntary social distancing 'flattens the curve' of the infected mass, in the sense that it reduces the peak number of infected people and leaves more susceptible persons at the end of the time horizon.

Fig. 3  (a) The fractions ρ_k, for k = 1, …, 13, for different values of b. (b) The evolution of the number of infected people under the computed Nash equilibrium

We then present some results for the case where b_i = 200. Figure 4 illustrates the evolution of S_i(t) and I_i(t) for players following different strategies. We observe that the trajectories of the S_i's do not intersect. What is probably more interesting is that the trajectories of the I_i's may intersect. This indicates that, toward the end of the time horizon, it is possible for a person who was less cautious, i.e., used higher values of u_i, to have a lower probability of being infectious.

Fig. 4  The evolution of the probabilities S_i and I_i, for players following different strategies, for b = 200. Different colors are used to illustrate the evolution of the probabilities for players using different strategies

Many Types of Players

We then compute the Nash equilibrium for the case of multiple types of players. We assume that there are six types of players with vulnerability parameters G_1 = 100, G_2 = 200, G_3 = 400, G_4 = 800, G_5 = 1600, G_6 = 3200. The sociability parameter s_i is equal to 1 for all the players. The masses of these types are m_1 = 0.5 and m_2 = ⋯ = m_6 = 0.1. The initial condition is I_0 = 0.0001 for all the players, and the time horizon is 52 weeks (approximately a year). Here, we assume that the maximum action is u_M = 0.8. The rest of the parameters are as in the previous subsection.

Figure 5 presents the fractions ρ and the evolution of the probability of each category of players to be susceptible and infected. Let us note that the reduced-order characterization of Sect. 4.2 simplifies the computation considerably. Particularly, the set Π has (2^52 − 1)^6 ≈ 8.3·10^93 dimensions, while problem (21) has only 52.

Fig. 5  (a) The fractions ρ_k of players having action u_M. Note that the fractions correspond to players of all types. Since s_i = 1 for all the types, it holds that b_1 < ⋯ < b_6, and thus the more vulnerable players cannot have an action higher than the less vulnerable ones. (b) The probability of each class of players being susceptible or infected. The colored lines correspond to the probabilities of being susceptible S_i(t) and infected I_i(t), for the several strategies of the players. The bold black line represents the mass of susceptible and infected persons (Color figure online)

Effect of uM

We then analyze the case where the types of the players are as in Sect. 5.2, and the initial condition is I0=0.005 for all the players, for various values of uM. The time horizon is 13 weeks.

Figure 6a illustrates the equilibrium fractions ρ_k for the various values of u_M. We observe that as u_M increases, the fractions ρ_k decrease. Figure 6b shows the evolution of the mass of infected players for the different values of the maximum action u_M. We observe that as u_M increases, the mass of infected players decreases. Figure 6c presents the cost of the several types of players for the different values of the maximum action u_M. We observe that players with low vulnerability (G = 100) always prefer a larger value of u_M, which corresponds to less stringent restrictions. For vulnerable players (e.g., G = 3200), the cost is an increasing function of u_M; that is, they prefer more stringent restrictions. For intermediate values of G, the players prefer intermediate values of u_M. The mean cost in this example is minimized for u_M = 0.6.

Fig. 6  (a) The fractions ρ_k, for the several values of the maximum action u_M. (b) The mass of infected people as a function of time, for the different values of the maximum action u_M. (c) The cost for the several classes of players, for the different values of the maximum action u_M. The bold black line represents the mean cost of all the players

Conclusion

This paper studied a dynamic game of social distancing during an epidemic, with an emphasis on the analysis of asymmetric solutions. We proved the existence of a Nash equilibrium and derived some monotonicity properties of the agents' strategies. The monotonicity result was then used to derive a reduced-order characterization of the Nash equilibrium, simplifying its computation significantly. Through numerical experiments, we showed that both the agents' strategies and the evolution of the epidemic depend strongly on the agents' parameters (vulnerability, sociality) and the epidemic's initial spread. Furthermore, we observed that agents with the same parameters could have different behaviors, leading to rich, high-dimensional dynamics. We also observed that more stringent constraints on the maximum action (set by the government) benefit the more vulnerable players at the expense of the less vulnerable. Furthermore, there is a certain value of the maximum action constraint that minimizes the average cost of the players.

There are several directions for future work. First, we could study more general epidemic models than the SIR model. Second, we could investigate different information patterns, including the cases where the agents receive regular or random information about their health state. Finally, we could compare the behaviors computed analytically with real-world data.

A Appendix: Proofs of the Results of the Main Text

A.1 Existence of Solution to (1)

Note that the first two equations in (1) do not depend on Ri and Di. Thus, it suffices to show that the first two equations of (1) have a unique solution.

For any i ∈ [0,1), if (S_i(t), I_i(t)) solves the differential equations (1), with initial condition (S_i(0), I_i(0)) ∈ [0,1]^2, then (S_i(t), I_i(t)) remains in [0,1]^2. Thus, we consider the solution of the differential equations:

$$\dot S_i = \mathrm{sat}_B\big({-r u_i S_i I^f(t)}\big), \qquad \dot I_i = \mathrm{sat}_B\big(r u_i S_i I^f(t) - \alpha_i I_i\big), \tag{22}$$

where sat_B(z) = max(min(z, B), −B) and B = r u_M^2 + max_j α_j.

Consider the Banach space X = L^1([0,1), ℝ^2), and let x_0 = (S_·(0), I_·(0)) : [0,1) → ℝ^2. Then, under Assumptions 1 and 3, it holds that x_0 ∈ X. For each interval [t_k, t_{k+1}), the differential equations (22) with the corresponding initial conditions can be written as:

$$\dot x = f_k(x), \qquad x(t_k) = x_0^k, \tag{23}$$

where, for x : i ↦ [S_i, I_i]^T, the value of f_k(x) ∈ X is given by:

$$f_k(x) : i \mapsto \big[\mathrm{sat}_B(-r u^i_k S_i M_k x),\ \mathrm{sat}_B(r u^i_k S_i M_k x - \alpha_i I_i)\big]^T,$$

where M_k : X → ℝ is a bounded linear operator with M_k x = ∫ I_i u^i_k μ(di). For the initial condition, it holds that x_0^0 = x_0, and x_0^k = x(t_k) is computed from the solution of (23) on the interval [t_{k-1}, t_k), for k ≥ 1. For all k, f_k is Lipschitz, and thus there is a unique solution to (23) (e.g., Theorem 7.3 of [7]). Furthermore, both I_·(t) and u_·(t) are measurable and bounded. Thus, the integrals in (4), (2) are well defined.

Note that from Assumption 1 we only used the fact that S_·(0), I_·(0) : [0,1) → ℝ are measurable, and not the piecewise constant property.

A.2 Proof of Proposition 1

  • (i)

    Since rIkf>0 and bi>0, the cost (7) is strictly concave, with respect to uki. Thus, the minimum with respect to uki is either um or uM.

  • (ii)
    Since U is compact and J̃_i is continuous, there is an optimal solution for (8). Denote by u^{i,*} this solution. Further, denote by V_1 = J̃_i(u^{i,*}) and V_2 = inf_A {−b_i e^{−A} + f(A)} the values of problems (8) and (9), respectively. Then, for A = r ∑_{k=0}^{N-1} u^{i,*}_k I^f_k, we have
    $$V_2 \le -b_i e^{-A} + f(A) \le -b_i \exp\left(-r \sum_{k=0}^{N-1} u^{i,*}_k I^f_k\right) - \sum_{k=0}^{N-1} u^{i,*}_k \bar u_k = \tilde J_i(u^{i,*}) = V_1,$$
    where the first inequality is due to the definition of V_2 and the second is due to the definition of f(A). To derive a contradiction, assume that V_2 < V_1. Then, there is some A and an ε > 0 such that:
    $$-b_i e^{-A} + f(A) < V_1 - 2\varepsilon. \tag{24}$$
    Thus, there is a ũ^i such that A = r ∑_{k=0}^{N-1} ũ^i_k I^f_k and −∑_{k=0}^{N-1} ũ^i_k ū_k < f(A) + ε. Combining with (24), we get J̃_i(ũ^i) < V_1 − ε, which contradicts the definition of V_1. For u^{i,*} minimizing (8), problem (9) is minimized at A = r ∑_{k=0}^{N-1} u^{i,*}_k I^f_k, and u^{i,*} attains the minimum in (10); to see this, observe that otherwise we would have V_2 < V_1. On the other hand, assume that A minimizes (9) and u^i attains the minimum in (10). Then, V_2 = −b_i e^{−A} + f(A) = J̃_i(u^i). Furthermore, since V_2 = V_1, it holds that J̃_i(u^i) = V_1, and thus u^i minimizes J̃_i.
  • (iii)

    The set {u^i ∈ U : ∑_{k=0}^{N-1} u^i_k I^f_k = A/r} is non-empty if and only if A ∈ [A_m, A_M]. Thus, f(A) is finite if and only if A ∈ [A_m, A_M].

    For A ∈ [A_m, A_M], there exists an optimal solution u^i that attains the minimum in (10). Since (10) is a feasible linear programming problem, there is a Lagrange multiplier λ (e.g., Proposition 5.2.1 of [5]), and u^i minimizes the Lagrangian:
    $$L(u^i, \lambda) = -\sum_{k=0}^{N-1} \bar u_k u^i_k + \lambda \sum_{k=0}^{N-1} I^f_k u^i_k - \lambda A/r. \tag{25}$$
    Thus, u^i_k = u_m if ū_k/I^f_k < λ, and u^i_k = u_M if ū_k/I^f_k > λ. To compute f(A), we reorder the indices k such that ū_k/I^f_k is non-increasing. Let:
    $$\bar k_A = \max\left\{ \bar k : \sum_{k=0}^{\bar k - 1} u_M I^f_k + \sum_{k=\bar k}^{N-1} u_m I^f_k \le A/r \right\}.$$
    Then:
    $$\Sigma_{\bar k_A} + u^i_{\bar k_A} I^f_{\bar k_A} = A/r,$$
    where Σ_{\bar k_A} = ∑_{k=0}^{\bar k_A − 1} u_M I^f_k + ∑_{k=\bar k_A + 1}^{N-1} u_m I^f_k. Thus:
    $$f(A) = -\sum_{k=0}^{\bar k_A - 1} \bar u_k u_M - \sum_{k=\bar k_A + 1}^{N-1} \bar u_k u_m - \frac{\bar u_{\bar k_A}}{I^f_{\bar k_A}} \left( A/r - \Sigma_{\bar k_A} \right),$$
    for A/r ∈ [Σ_{\bar k_A} + u_m I^f_{\bar k_A}, Σ_{\bar k_A} + u_M I^f_{\bar k_A}]. It holds that:
    $$\Sigma_{\bar k_A} + u_M I^f_{\bar k_A} = \Sigma_{\bar k_A + 1} + u_m I^f_{\bar k_A + 1}.$$
    Therefore, f is continuous and piecewise affine. Furthermore, since ū_k/I^f_k is non-increasing with respect to k (in the new ordering), the slope of f is non-decreasing, i.e., f is convex. Thus, f is differentiable at all points of (A_m, A_M) except the points A with A/r = Σ_{\bar k} + u_M I^f_{\bar k} and ū_{\bar k}/I^f_{\bar k} > ū_{\bar k+1}/I^f_{\bar k+1}. The affine pieces of f are at most N.
  • (iv)
    Since −b_i e^{−A} is strictly concave in A, there are at most N+1 possible minima of −b_i e^{−A} + f(A), which correspond to the points of non-differentiability of f in (A_m, A_M) and the points A_m and A_M. Observe that for A = A_m or A = A_M, there is a unique u^i minimizing (10). We then show that for all the non-differentiability points A of f, there is a unique u^i minimizing (10). If A is a non-differentiability point, there is a k_0 such that A/r = ∑_{k=0}^{k_0} I^f_k u_M + ∑_{k=k_0+1}^{N-1} I^f_k u_m and ū_{k_0}/I^f_{k_0} > ū_{k_0+1}/I^f_{k_0+1}. We then show that the unique minimizer in (10) is given by u^i_k = u_M for k ≤ k_0 and u^i_k = u_m for k > k_0. Indeed, u^i is feasible, and if u ≠ u^i is another feasible point, it holds that:
    $$\sum_{k=0}^{k_0} (u_M - u_k) I^f_k + \sum_{k=k_0+1}^{N-1} (u_m - u_k) I^f_k = 0.$$
    Multiplying by ū_{k_0}/I^f_{k_0}, we get:
    $$\sum_{k=0}^{k_0} (u_M - u_k) I^f_k \frac{\bar u_{k_0}}{I^f_{k_0}} + \sum_{k=k_0+1}^{N-1} (u_m - u_k) I^f_k \frac{\bar u_{k_0}}{I^f_{k_0}} = 0.$$
    Then, using that u_M − u_k ≥ 0, u_m − u_k ≤ 0, and that for k ≤ k_0 it holds that ū_k/I^f_k ≥ ū_{k_0}/I^f_{k_0}, while for k > k_0 it holds that ū_k/I^f_k < ū_{k_0}/I^f_{k_0}, we have:
    $$-\sum_{k=0}^{N-1} u_k \bar u_k - \left( -\sum_{k=0}^{N-1} u^i_k \bar u_k \right) = \sum_{k=0}^{k_0} (u_M - u_k) I^f_k \frac{\bar u_k}{I^f_k} + \sum_{k=k_0+1}^{N-1} (u_m - u_k) I^f_k \frac{\bar u_k}{I^f_k} \ge 0,$$
    and the inequality is strict if for some k > k_0, u_k ≠ u_m. Therefore, u^i is optimal, and if u is also optimal, then it must satisfy u_k = u_m for all k > k_0. Combining this with the fact that ∑_{k=0}^{N-1} u_k I^f_k = A/r and I^f_k > 0, we get u = u^i.
  • (v)
    We have shown that if u^i is optimal, then there is a k_0 such that u^i_k = u_M for k ≤ k_0 and u^i_k = u_m for k > k_0 (in the reordered indexing). Then, using the original index k, the optimal control can be expressed as:
    $$u^i_k = \begin{cases} u_M & \text{if } \bar u_k / I^f_k \ge \lambda \\ u_m & \text{if } \bar u_k / I^f_k < \lambda, \end{cases}$$
    where λ = ū_{k_0}/I^f_{k_0}.

A.3 Proof of Proposition 2

  • (i)
    Since A_1 is optimal for b_{i_1} and A_2 is optimal for b_{i_2}, it holds that:
    $$-b_{i_1} e^{-A_1} + f(A_1) \le -b_{i_1} e^{-A_2} + f(A_2), \qquad -b_{i_2} e^{-A_2} + f(A_2) \le -b_{i_2} e^{-A_1} + f(A_1).$$
    Adding these inequalities and rearranging, we get:
    $$(b_{i_2} - b_{i_1}) e^{-A_2} \ge (b_{i_2} - b_{i_1}) e^{-A_1}.$$
    Since b_{i_2} > b_{i_1}, we get A_1 ≥ A_2.
  • (ii)
    Using (v) of Proposition 1 and A_1 ≥ A_2, we get:
    $$\frac{A_1}{r} = \sum_{k=0}^{N-1} u^{i_1}_k I^f_k = \sum_{k=0}^{N-1} \big[ u_m + (u_M - u_m)\, h_{\lambda_1}(\bar u_k / I^f_k) \big] I^f_k \ge \sum_{k=0}^{N-1} \big[ u_m + (u_M - u_m)\, h_{\lambda_2}(\bar u_k / I^f_k) \big] I^f_k = \sum_{k=0}^{N-1} u^{i_2}_k I^f_k = \frac{A_2}{r},$$
    where h_λ(·) is the step function with h_λ(x) = 1 if x > λ and h_λ(x) = 0 otherwise. Therefore, λ_1 ≤ λ_2.
  • (iii)
    Assume that for some k_1 ≠ k_2, u^{i_1}_{k_1} = u^{i_2}_{k_2} = u_m and u^{i_2}_{k_1} = u^{i_1}_{k_2} = u_M. Then, using (v) of Proposition 1, we have:
    $$\lambda_2 < \frac{\bar u_{k_1}}{I^f_{k_1}} \le \lambda_1, \qquad \lambda_1 < \frac{\bar u_{k_2}}{I^f_{k_2}} \le \lambda_2,$$
    which is a contradiction.

A.4 Optimal Control and Equilibria in Feedback Form

A.4.1 Optimal Control in Feedback Form

In this section, we express the optimal control law in feedback form using dynamic programming. Let Vk(Ski,si,Gi) be the optimal cost-to-go from time k of a player with parameters si,Gi:

$$V_k(S^i_k, s_i, G_i) = \min_{u^i} \left\{ G_i - G_i S^i_k \exp\left( -r \sum_{k'=k}^{N-1} u^i_{k'} I^f_{k'} \right) - s_i \sum_{k'=k}^{N-1} u^i_{k'} \bar u_{k'} \right\}, \tag{26}$$

where S^i_k = S_i(t_k) = S_i(0) exp(−r ∑_{k'=0}^{k-1} u^i_{k'} I^f_{k'}). The optimal cost-to-go can be expressed as:

$$V_k(S^i_k, s_i, G_i) = G_i + s_i \tilde V_k(S^i_k, G_i/s_i).$$

We call V~k the ‘auxiliary cost-to-go.’

Proposition 6
  • (i)

    The auxiliary cost-to-go V~k(Ski,Gi/si) is non-increasing and concave in Ski.

  • (ii)

    The optimal cost-to-go Vk(Ski,si,Gi) is non-decreasing in Gi, non-increasing in si, and non-increasing and concave in Ski.

  • (iii)

    The optimal control law can be expressed in threshold form. That is, there are constants l_0, …, l_{N-1} such that the optimal control law for each player satisfies: u^i_k = u_m if S^i_k G_i/s_i > l_k and u^i_k = u_M if S^i_k G_i/s_i < l_k. For S^i_k G_i/s_i = l_k, both u^i_k = u_m and u^i_k = u_M are optimal.

Proof
  • (i)
    The auxiliary cost-to-go can be written as:
    $$\tilde V_k(S^i_k, G_i/s_i) = \min_{u^i} \left\{ -\frac{G_i S^i_k}{s_i} \exp\left( -r \sum_{k'=k}^{N-1} u^i_{k'} I^f_{k'} \right) - \sum_{k'=k}^{N-1} u^i_{k'} \bar u_{k'} \right\}. \tag{27}$$
    Then, applying the principle of optimality, we get:
    $$\tilde V_k(S^i_k, G_i/s_i) = \min_{u^i_k} \left\{ -u^i_k \bar u_k + \tilde V_{k+1}\big( S^i_k \exp[-r u^i_k I^f_k],\ G_i/s_i \big) \right\}. \tag{28}$$
    We use induction. For k = N, the auxiliary cost-to-go is given by Ṽ_N(S^i_N, G_i/s_i) = −G_i S^i_N / s_i. That is, Ṽ_N is non-increasing and concave in S^i_N. Assume that the claim holds for k = k_0 + 1. Then, for each fixed u^i_{k_0}, the quantity:
    $$-u^i_{k_0} \bar u_{k_0} + \tilde V_{k_0+1}\big( S^i_{k_0} \exp[-r u^i_{k_0} I^f_{k_0}],\ G_i/s_i \big),$$
    is non-increasing and concave in S^i_{k_0}. Using (28), i.e., minimizing with respect to u^i_{k_0}, we conclude the desired result.
  • (ii)

    The monotonicity properties of Vk are a direct consequence of its definition (26). The concavity of the optimal value follows easily from (i).

  • (iii)

    From Proposition 1(i), we know that the optimal u^i_k is in the set {u_m, u_M}. Applying Proposition 2 to the subproblem starting at time step k and using the principle of optimality, we get the desired result. The fact that if S^i_k G_i/s_i = l_k, then both u^i_k = u_m and u^i_k = u_M are optimal is a consequence of the continuity of Ṽ_k.

Remark 11

Propositions 6(iii) and 1(v) both express the optimal actions for a player i. The primary difference is that Proposition 6(iii) uses dynamic programming, and thus the policies obtained can better handle uncertainties in the individual dynamics of player i.
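To make the recursion (28) concrete, the following sketch (Python; not from the paper) computes the auxiliary cost-to-go by backward induction on a grid of values of S^i_k, assuming the aggregates ū_k and I^f_k are given. The grid resolution and the linear interpolation between grid points are implementation choices; the resulting policy table implicitly contains the thresholds l_k of Proposition 6(iii).

```python
import numpy as np

def feedback_policy(g, u_bar, I_f, u_m=0.4, u_M=0.75, r=0.4, grid=2001):
    """Backward recursion (28) on a grid of S values.

    g          : the ratio G_i / s_i of the player
    u_bar, I_f : aggregates \\bar u_k and I^f_k, k = 0, ..., N-1 (taken as given)
    Returns the grid, the value tables V[k], and the action chosen at each grid point.
    """
    N = len(I_f)
    S = np.linspace(0.0, 1.0, grid)
    V = np.empty((N + 1, grid))
    V[N] = -g * S                                    # terminal value -(G_i/s_i) S_N
    policy = np.empty((N, grid))
    for k in range(N - 1, -1, -1):
        costs = []
        for u in (u_m, u_M):
            S_next = S * np.exp(-r * u * I_f[k])     # probability of remaining susceptible
            costs.append(-u * u_bar[k] + np.interp(S_next, S, V[k + 1]))
        costs = np.array(costs)
        choice = np.argmin(costs, axis=0)            # 0 -> u_m, 1 -> u_M
        V[k] = costs[choice, np.arange(grid)]
        policy[k] = np.where(choice == 1, u_M, u_m)
    return S, V, policy
```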

Equilibrium in Feedback Strategies

Consider a Nash equilibrium π that induces the fractions ρ_k, the mean actions ū_k, and the mass of infected people in public places I^f_k, for k = 0, …, N-1. Consider also the auxiliary cost-to-go function Ṽ_k defined in (27) and the corresponding thresholds l_k. Then, based on the strategies of Proposition 6(iii), we will describe a Nash equilibrium in feedback strategies. Consider the set of strategies:

$$u^i_k = \begin{cases} u_m & \text{if } S^i_k G_i/s_i > l_k \\ u_M & \text{if } S^i_k G_i/s_i < l_k \\ u_M & \text{if } S^i_k G_i/s_i = l_k \text{ and } i < \rho_k \\ u_m & \text{if } S^i_k G_i/s_i = l_k \text{ and } i \ge \rho_k. \end{cases} \tag{29}$$

Proposition 7

The set of strategies (29) is a Nash equilibrium.

Proof

Observe that the set of strategies (29) generates the same actions as the Nash equilibrium π.

A.5 Proof of Proposition 3

  • (i)

    Assume that a π ∈ Π satisfies (13) and fix a j ∈ {1, …, M}. For any l such that π_{(j-1)2^N+l} > 0, it holds that Φ_{(j-1)2^N+l}(π) = 0, that is, F_{(j-1)2^N+l}(π) = δ_j(π) = min_{l'} F_{(j-1)2^N+l'}(π). Thus, π ∈ Π is a Nash equilibrium.

    Conversely, assume that π ∈ Π is a Nash equilibrium and fix a j ∈ {1, …, M}. There is an l such that π_{(j-1)2^N+l} > 0. Since π is a Nash equilibrium, it holds that F_{(j-1)2^N+l}(π) = δ_j(π) and, for all other l', F_{(j-1)2^N+l'}(π) ≥ F_{(j-1)2^N+l}(π) = δ_j(π), which implies (13).

  • (ii)
    Assume that π is a Nash equilibrium and π' ∈ Π. Then, ∑_{l=1}^{2^N} π'_{(j-1)2^N+l} = ∑_{l=1}^{2^N} π_{(j-1)2^N+l} = m_j. Since π is a Nash equilibrium, it holds that:
    $$\sum_{l=1}^{2^N} \big( \pi'_{(j-1)2^N+l} - \pi_{(j-1)2^N+l} \big)\, F_{(j-1)2^N+l}(\pi) \ge 0.$$
    Thus, (14) holds.

    Conversely, assume that (14) holds for some π ∈ Π. If π is not a Nash equilibrium, then there are j, l such that π_{(j-1)2^N+l} > 0 and F_{(j-1)2^N+l}(π) > δ_j(π). Then, if l' is such that F_{(j-1)2^N+l'}(π) = δ_j(π), taking π' = π + π_{(j-1)2^N+l} e_{(j-1)2^N+l'} − π_{(j-1)2^N+l} e_{(j-1)2^N+l}, we get (π' − π)^T F(π) < 0, which is a contradiction.

  • (iii)

    With a slight abuse of notation, we write I^f_k(π), ū_k(π) to describe the quantities I^f_k, ū_k when the distribution of actions is π, and J̃_j(v^l, π) to describe the auxiliary cost of a player of type j who plays action v^l when the distribution of the actions is π.

Lemma 2

The quantities I^f_k(π), ū_k(π), J̃_j(v^l, π) are continuous in π.

Proof

The state of the system evolves according to the set of M·2^{N+1} + 1 differential equations:

$$\dot S_{j,v^l} = -r v^l_k S_{j,v^l} I^f, \qquad \dot I_{j,v^l} = r v^l_k S_{j,v^l} I^f - \alpha_j I_{j,v^l}, \qquad \dot z = I^f,$$

where j = 1, …, M, l = 1, …, 2^N, k is such that t ∈ [t_k, t_{k+1}), and:

$$I^f = \sum_{j=1}^{M} \sum_{l=1}^{2^N} \pi_{(j-1)2^N+l}\, I_{j,v^l}\, v^l_k.$$

The initial conditions are S_{j,v^l}(0) = S_j(0), I_{j,v^l}(0) = I_j(0) (Assumption 1), and z(0) = 0.

The right-hand side of the differential equations depends continuously on π through the term I^f. Furthermore, S_{j,v^l}(t), I_{j,v^l}(t) remain in [0, 1] for all j, v^l. Thus, the state of the system remains in [0,1]^{M·2^{N+1}} × ℝ, and the right-hand side of the differential equation is Lipschitz. Thus, Theorem 3.4 of [25] applies, and S_{j,v^l}(t), I_{j,v^l}(t), and z(t) depend continuously on π. Thus, I^f_k = z(t_{k+1}) − z(t_k) depends continuously on π. Furthermore, ū_k is continuous (linear) in π. Finally, the auxiliary cost J̃_j(v^l, π), due to its form (7), depends continuously on π.

To complete the proof, observe that F(π) is continuous and Π is compact and convex. Thus, the existence is a consequence of Corollary 2.2.5 of [16].

Remark 12

An alternative would be to use Theorem 1 of [34] or Theorem 1 of [28], combined with Lemma 2 to prove the existence of a mixed Nash equilibrium and then use Assumption 1, to construct a pure strategy equilibrium. However, the reduction to an NCP is useful computationally.

A.6 Proof of Lemma 1

Every maximal chain begins with the least element [u_m, …, u_m] and ends at the greatest element [u_M, …, u_M]. Every two consecutive elements of a maximal chain, v^l and v^{l+1}, differ at exactly one position; otherwise, there would exist a vector v with v^l ⪯ v ⪯ v^{l+1}, and thus the chain would not be maximal.

Thus, beginning from [u_m, …, u_m] and changing at each step one position from u_m to u_M, we get a sequence of N+1 ordered vectors. So, every maximal chain has length N+1.

Then, we prove that the number of such chains is N! using induction.

For N=2, it is easy to verify that we have two chains of 3 elements.

Assume that for N = n there are n! maximal chains of n+1 elements. Then, for N = n+1 we consider one of the previous chains v^1 ⪯ v^2 ⪯ ⋯ ⪯ v^{n+1} and at each of its elements we append an extra bit: ṽ^i = [v^i, β_i]. We observe that if β_i = u_M, then for all j > i it should hold that β_j = u_M, in order for the new vectors to remain ordered under ⪯.

Denote by i_c the position at which β_i changes from u_m to u_M. For each choice of i_c (β_j = u_m for j < i_c and β_j = u_M for j > i_c), we take the two ordered vectors ṽ^{i_c}_1 = [v^{i_c}, u_m] and ṽ^{i_c}_2 = [v^{i_c}, u_M] in the new chain, so the element v^{i_c} appears with both bits. Thus, there are n+1 possible choices for i_c. This way, we observe that from each maximal chain in ({u_m, u_M}^n, ⪯) we can construct n+1 maximal chains in ({u_m, u_M}^{n+1}, ⪯), giving (n+1)! chains in total.

Remark 13

The fact that V has at most N+1 elements is also a consequence of Corollary 1.

A.7 Proof of Proposition 4

  • (i)

    To derive a contradiction, assume that I_k ⊄ I_{k'} and I_{k'} ⊄ I_k. Then, there is a pair of players i_1, i_2 such that u^{i_1}_k = u^{i_2}_{k'} = u_M and u^{i_1}_{k'} = u^{i_2}_k = u_m, which contradicts Proposition 2(iii).

  • (ii)
    Without loss of generality, assume that I_{k'} ⊆ I_k. Then:
    $$\mu(I_{k'}) = \rho_{k'} = \rho_k = \mu(I_k) = \mu(I_{k'}) + \mu(I_k \setminus I_{k'}).$$
    Thus, μ(I_k \ I_{k'}) = 0. Combining this with I_{k'} ⊆ I_k and the definition of I_k, I_{k'}, we get μ({i : u^i_k = u^i_{k'}}) = 1.
  • (iii)

    The equality μ(I_k) = ρ_k is immediate from the definition of ρ_k. Consider a pair I_{k_n}, I_{k_{n+1}}. There are two cases: ρ_{k_n} < ρ_{k_{n+1}} and ρ_{k_n} = ρ_{k_{n+1}}. In the first case, we cannot have I_{k_{n+1}} ⊆ I_{k_n}. Thus, from (i) we have I_{k_n} ⊆ I_{k_{n+1}}. If ρ_{k_n} = ρ_{k_{n+1}}, then μ(I_{k_n} \ I_{k_{n+1}}) = 0 from part (ii).

    The inclusion K_{k_n} ⊇ K_{k_{n+1}} is immediate from the definition.

  • (iv)

    Let i ∈ I_{k_{n+1}} \ I_{k_n}. Then, since i ∉ I_{k_n}, u^i_{k_{n'}} = u_m for n' ≤ n. On the other hand, μ-almost all i ∈ I_{k_{n+1}} satisfy i ∈ I_{k_{n'}} for n' > n. Thus, for μ-almost all i ∈ I_{k_{n+1}} \ I_{k_n}, the action u^i is given by v̄^{n+1}, as in (16). The proof is similar for i ∈ I_{k_1} and i ∈ [0,1) \ I_{k_N}.

A.8 Proof of Proposition 5

If ρ corresponds to a Nash equilibrium, then combining (13) and (20) we conclude that H(ρ) = 0. Conversely, since all the terms of (21) are nonnegative, H(ρ) = 0 implies that if μ([ī_{j-1}, ī_j) ∩ [ρ_{k_{n-1}}, ρ_{k_n})) > 0, then F̄_{j,v̄^n}(π̃(ρ)) = δ_j(π̃(ρ)). Combining this with (20), we conclude that if for some j, l, π_{(j-1)2^N+l} > 0, then F_{(j-1)2^N+l}(π) = δ_j(π), where π = π̃(ρ). That is, π is a Nash equilibrium.

From (20), we observe that π̃(ρ) is continuous with respect to ρ, since μ(·) is the Lebesgue measure. Moreover, (21) can be expressed as:

$$H(\rho) = \tilde\pi(\rho)^T \Phi(\tilde\pi(\rho)).$$

The fact that H(ρ) is nonnegative is a result of (13). Furthermore, from Lemma 2, F_{(j-1)2^N+l}(π) = J̃_j(v^l, π) is continuous with respect to π. Additionally, δ_j(π), which is the minimum of F_{(j-1)2^N+l}(π) = J̃_j(v^l, π) over all v^l, is continuous with respect to π as the minimum of continuous functions. So, Φ(π) = F(π) − δ(π) is continuous with respect to π, and H(ρ) is continuous with respect to ρ as a composition of continuous functions.

Footnotes

1

Data availability: The datasets generated during and analyzed during the current study are available from the corresponding author on reasonable request.

This article is part of the topical collection “Modeling and Control of Epidemics” edited by Quanyan Zhu, Elena Gubar and Eitan Altman.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Ioannis Kordonis, Email: jkordonis1920@yahoo.com.

Athanasios-Rafail Lagos, Email: lagosth@mail.ntua.gr.

George P. Papavassilopoulos, Email: yorgos@netmode.ntua.gr

References

  • 1. Allen LJ, Brauer F, Van den Driessche P, Wu J (2008) Mathematical epidemiology, vol 1945. Springer
  • 2. Amaral MA, de Oliveira MM, Javarone MA (2020) An epidemiological model with voluntary quarantine strategies governed by evolutionary game dynamics. arXiv preprint arXiv:2008.05979
  • 3. Amini H, Minca A (2020) Epidemic spreading and equilibrium social distancing in heterogeneous networks
  • 4. Aurell A, Carmona R, Dayanikli G, Lauriere M (2020) Optimal incentives to mitigate epidemics: a Stackelberg mean field game approach. arXiv preprint arXiv:2011.03105
  • 5. Bertsekas DP (1997) Nonlinear programming. J Oper Res Soc 48(3):334–334
  • 6. Bhattacharyya S, Reluga T (2019) Game dynamic model of social distancing while cost of infection varies with epidemic burden. IMA J Appl Math 84(1):23–43. doi:10.1093/imamat/hxy047
  • 7. Brezis H (2010) Functional analysis, Sobolev spaces and partial differential equations. Springer
  • 8. Brown PNN, Collins B, Hill C, Barboza G, Hines L (2020) Individual altruism cannot overcome congestion effects in a global pandemic game. arXiv preprint arXiv:2103.14538
  • 9. Chang SL, Piraveenan M, Pattison P, Prokopenko M (2020) Game theoretic modelling of infectious disease dynamics and intervention methods: a review. J Biol Dyn 14(1):57–89
  • 10. Chen FH (2009) Modeling the effect of information quality on risk behavior change and the transmission of infectious diseases. Math Biosci 217(2):125–133
  • 11. Cho S (2020) Mean-field game analysis of SIR model with social distancing. arXiv preprint arXiv:2005.06758
  • 12. d'Onofrio A, Manfredi P (2009) Information-related changes in contact patterns may trigger oscillations in the endemic prevalence of infectious diseases. J Theor Biol 256(3):473–478
  • 13. ECDC (2020) Guidelines for the implementation of non-pharmaceutical interventions against COVID-19
  • 14. Eksin C, Shamma JS, Weitz JS (2017) Disease dynamics in a stochastic network game: a little empathy goes a long way in averting outbreaks. Sci Rep 7(1):1–13. doi:10.1038/srep44122
  • 15. Elie R, Hubert E, Turinici G (2020) Contact rate epidemic control of COVID-19: an equilibrium view. Math Model Nat Phenom 15:35. doi:10.1051/mmnp/2020022
  • 16. Facchinei F, Pang J-S (2007) Finite-dimensional variational inequalities and complementarity problems. Springer
  • 17. Funk S, Salathé M, Jansen VA (2010) Modelling the influence of human behaviour on the spread of infectious diseases: a review. J R Soc Interface 7(50):1247–1256
  • 18. Hota AR, Sneh T, Gupta K (2020) Impacts of game-theoretic activation on epidemic spread over dynamical networks. arXiv preprint arXiv:2011.00445
  • 19. Hota AR, Sundaram S (2019) Game-theoretic vaccination against networked SIS epidemics and impacts of human decision-making. IEEE Trans Control Netw Syst 6(4):1461–1472. doi:10.1109/TCNS.2019.2897904
  • 20. Huang Y, Zhu Q (2021) Game-theoretic frameworks for epidemic spreading and human decision making: a review. arXiv preprint arXiv:2106.00214
  • 21. Huang Y, Zhu Q (2019) A differential game approach to decentralized virus-resistant weight adaptation policy over complex networks. IEEE Trans Control Netw Syst 7(2):944–955. doi:10.1109/TCNS.2019.2931862
  • 22. Kabir KA, Tanimoto J (2021) Evolutionary game theory modelling to represent the behavioural dynamics of economic shutdowns and shield immunity in the COVID-19 pandemic. R Soc Open Sci 7(9):201095. doi:10.1098/rsos.201095
  • 23. Karlsson C-J, Rowlett J (2020) Decisions and disease: a mechanism for the evolution of cooperation. Sci Rep 10(1):1–9. doi:10.1038/s41598-020-69546-2
  • 24. Kermack WO, McKendrick AG (1927) A contribution to the mathematical theory of epidemics. Proc R Soc Lond Ser A 115(772):700–721
  • 25. Khalil HK, Grizzle JW (2002) Nonlinear systems, vol 3. Prentice Hall, Upper Saddle River
  • 26. Kordonis I, Lagos A-R, Papavassilopoulos GP (2020) Nash social distancing games with equity constraints: how inequality aversion affects the spread of epidemics. arXiv preprint arXiv:2009.00146
  • 27. Lagos A-R, Kordonis I, Papavassilopoulos G (2020) Games of social distancing during an epidemic: local vs statistical information. arXiv preprint arXiv:2007.05185
  • 28. Mas-Colell A (1984) On a theorem of Schmeidler. J Math Econ 13(3):201–206. doi:10.1016/0304-4068(84)90029-6
  • 29. Paarporn K, Eksin C, Weitz JS, Shamma JS (2017) Networked SIS epidemics with awareness. IEEE Trans Comput Soc Syst 4(3):93–103. doi:10.1109/TCSS.2017.2719585
  • 30. Poletti P, Ajelli M, Merler S (2012) Risk perception and effectiveness of uncoordinated behavioral responses in an emerging epidemic. Math Biosci 238(2):80–89
  • 31. Poletti P, Caprile B, Ajelli M, Pugliese A, Merler S (2009) Spontaneous behavioural changes in response to epidemics. J Theor Biol 260(1):31–40
  • 32. Reluga TC (2010) Game theory of social distancing in response to an epidemic. PLoS Comput Biol 6(5):e1000793. doi:10.1371/journal.pcbi.1000793
  • 33. Ross R (1916) An application of the theory of probabilities to the study of a priori pathometry. Proc R Soc Lond Ser A 92(638):204–230
  • 34. Schmeidler D (1973) Equilibrium points of nonatomic games. J Stat Phys 7(4):295–300
  • 35. Shapiro A, Dentcheva D, Ruszczyński A (2014) Lectures on stochastic programming: modeling and theory. SIAM, Philadelphia
  • 36. Theodorakopoulos G, Le Boudec J-Y, Baras JS (2012) Selfish response to epidemic propagation. IEEE Trans Autom Control 58(2):363–376
  • 37. Toxvaerd F (2020) Equilibrium social distancing. Cambridge working papers in economics
  • 38. Trajanovski S, Hayel Y, Altman E, Wang H, Van Mieghem P (2015) Decentralized protection strategies against SIS epidemics in networks. IEEE Trans Control Netw Syst 2(4):406–419. doi:10.1109/TCNS.2015.2426755
  • 39. Ye M, Zino L, Rizzo A, Cao M (2020) Modelling epidemic dynamics under collective decision making. arXiv preprint arXiv:2008.01971
