Abstract
The basic reproduction number () can be considerably higher in an SIR model with heterogeneous mixing compared to that from a corresponding model with homogeneous mixing. For example, in the case of measles, mumps and rubella in San Diego, CA, Glasser et al. (Lancet Infect Dis 16(5):599–605, 2016. https://doi.org/10.1016/S1473-3099(16)00004-9), reported an increase of 70% in when heterogeneity was accounted for. Meta-population models with simple heterogeneous mixing functions, e.g., proportionate mixing, have been employed to identify optimal vaccination strategies using an approach based on the gradient of the effective reproduction number (), which consists of partial derivatives of with respect to the proportions immune pi in sub-groups i (Feng et al. in J Theor Biol 386:177–187, 2015. https://doi.org/10.1016/j.jtbi.2015.09.006; Math Biosci 287:93–104, 2017. https://doi.org/10.1016/j.mbs.2016.09.013). These papers consider cases in which an optimal vaccination strategy exists. However, in general, the optimal solution identified using the gradient may not be feasible for some parameter values (i.e., vaccination coverages outside the unit interval). In this paper, we derive the analytic conditions under which the optimal solution is feasible. Explicit expressions for the optimal solutions in the case of n = 2 sub-populations are obtained, and the bounds for optimal solutions are derived for n > 2 sub-populations. This is done for general mixing functions and examples of proportionate and preferential mixing are presented. Of special significance is the result that for general mixing schemes, both and are bounded below and above by their corresponding expressions when mixing is proportionate and isolated, respectively.
Keywords: Meta-population model, Convexity of reproduction number, Optimization problem, Vaccination strategy, Epidemiology, 37N25, 49J15, 34H05, 92D30
1. Introduction
Mechanistic models of pathogen transmission are key public health tools for identifying optimal interventions that can mitigate outbreaks or perhaps even eliminate infectious diseases. However, the utility and credibility of such models hinge on incorporating realistic mixing between sub-populations (i.e., means by which infectious members of one sub-population infect susceptible members of others), which typically is not uniformly random due to preference among age groups, genders, or spatial locations. In fact, models that do not sufficiently account for differences among relevant sub-populations can generate biased or misleading results in situations where evaluations of intervention strategy require incorporation of such heterogeneity and realistic mixing. For example, in the case of measles, mumps and rubella in San Diego, CA, Glasser et al. (2016), reported an increase of 70% in when heterogeneity was accounted for.
Recently, progress has been made (Glasser et al. 2012; Feng et al. 2017) in extending realistic mixing functions based on earlier work (Nold 1980; Jacquez et al. 1988). The effective reproduction numbers derived from these meta-population models with non-homogeneous mixing functions are used to identify optimal vaccination strategies by using methods based on the gradients of (partial derivatives with respect to control parameters) (Feng et al. 2015, 2017). These are constrained optimization problems with the objectives of either minimizing given limited number of vaccine doses, or minimizing vaccine doses needed to reduce to a given level. However, the examples considered in these studies focus only on cases where an optimal solution exists and is feasible in the sense that the vaccine coverages lie between 0 and 1. Conditions have not yet been identified to determine parameter regions within which optimal mathematical solutions are indeed feasible. This is the objective of the current paper. Similar optimization problems using in the context of age-dependent vaccination strategies have been considered in Castillo-Chavez and Feng (1998) and Hadeler and Müller (1996a, b).
When heterogeneous mixing is considered in epidemiological models, preferred mixing is among the commonly used mixing structures (Nold 1980; Jacquez et al. 1988; Glasser et al. 2012), of which proportionate mixing is a special case. In this paper, results and proofs are presented for both preferred and more general mixing. Conditions are determined under which optimal solutions for vaccination strategies exist, some of which are given in explicit expressions depending on model parameters. The proofs for the existence and uniqueness of the optimal vaccination strategy when there are n = 2 sub-populations involve some fundamental properties that we establish for the reproduction number as a function of vaccination coverage p = (p1, p2); namely, homogeneity and convexity. For the case of n > 2 sub-populations, the proofs of results are based on a convexity result of Friedland (1980/81) for the spectral radius over a class of positive matrices.
For n = 2, explicit analytical expressions are derived for the optimal allocation of vaccine as well as the minimized reproduction number, . A formula for the critical vaccine doses η* to achieve ≤ 1 is also derived for n = 2. For the case of n > 2, lower and upper bounds for the minimum of as well as the minimum vaccine doses are derived.
The organization of this paper is as follows. The main problem is described in Sect. 2, which consists of two constrained optimization problems of Lagrange type. Sect. 3 presents the main results of the optimization problem for n = 2 sub-populations. Results for n > 2 sub-populations are provided in Sect. 4. In Sect. 5, we discuss the results. Some detailed proofs are included in the “Appendix”.
2. Description of the problem
The models considered by Glasser et al. (2016) and Feng et al. (2015, 2017) are of the SIR or SEIR type, i.e., the population is apportioned into disjoint states including susceptible (S), exposed (E), infectious (I), and removed or immune (R), and the models consist of systems of ordinary differential equations (ODEs). These models include one or more types of population heterogeneity (e.g., age, spatial location, activity level, vaccination coverage, preferential mixing, population density, etc.), so they are meta-population models with each sub-population model being an SIR or SEIR type linked by a mixing function. We use the simplest of these models as an example, but similar results apply to other models. The model considered in this paper is described by the ODE system
| (2.1) |
where pi are proportions immunized at entry into sub-population i, γ is the per capita recovery rate, θ is the per capita rate for entering and leaving sub-population i so that the population size Ni remains constant. The function λi is the force of infection, i.e., per capita hazard rate of infection of susceptible individuals in sub-population i, in which β is the probability of infection upon contacting an infectious person, ai is average contact rate (activity) in sub-population i, cij is the proportion of ith sub-population’s contacts that are with members of jth sub-population, and Ij/Nj is the probability that a randomly encountered member of sub-population j is infectious.
One of the most influential factors affecting the reproduction number is the mixing function cij. Denote the mixing matrix by C = (cij). Typically, the matrix C has to satisfy the following conditions of Busenberg and Castillo-Chavez (1991):
| (2.2) |
| (2.3) |
| (2.4) |
A commonly used non-homogeneous mixing function that satisfies conditions (2.2)–(2.4) is the preferred mixing function of Jacquez et al. (1988) given by:
| (2.5) |
where δij is the Kronecker delta function (taking value 1 when i = j, 0 otherwise) and ϵi ∈ [0, 1] is the fraction of contacts of group i that is reserved for itself (preferential mixing), whereas the complement (1 − ϵi) is distributed among all sub-populations in proportion to the unreserved contacts, including i (proportionate mixing). Special cases arise when: ϵi = 1 for all i whence C is the identity matrix (exclusively preferential mixing); ϵi = 0 for all i whence (exclusively proportionate mixing). We will refer to this mixing structure throughout the manuscript as Jacquez mixing. More complex examples of mixing matrices C = (cij), such as two-level preferential mixing can be found in Feng et al. (2017).
When Model (2.1) is used, the basic and effective sub-population reproduction numbers, denoted respectively by and , for sub-population i (i = 1, 2, … , n) are given by
| (2.6) |
where
see e.g. Brauer and Castillo-Chavez (2012, Chapter 10). Following Diekmann et al. (1990) and van den Driessche and Watmough (2002), the next generation matrix (NGM) corresponding to this meta-population model is
| (2.7) |
Then the effective reproduction number for the meta-population is given as
which is the spectral radius [and the dominant eigenvalue, by the Perron–Frobenius Theorem (Seneta 1973)] of the nonnegative matrix Kv. Let p = (p1, p2, … , pn). Naturally, is a function of p. The total number of vaccine doses, denoted by η, is . For demonstration purposes, we will assume that vaccine efficacy is 100%. In this paper, we focus on identifying the most efficient allocation of vaccine p = (p1, p2, … , pn) ∈ [0, 1]n for reducing with limited vaccine doses η or using fewest doses to achieve < 1 (to prevent outbreaks). More specifically, we consider the following two constrained optimization problems:
-
(I)
Minimize , subject to , for p ∈ [0, 1]n.
-
(II)
Minimize subject to , for p ∈ [0, 1]n.
In this study, we consider the optimization problems only for the case of , as there will be no outbreak if .
Because of the continuity of and the compactness of [0, 1]n, Problem (I) has a solution for any fixed η ∈ [0, N] , where N = N1 + N2 + … + Nn is the total population. If P* = P*(η) and denote the optimal vaccination allocation and the corresponding minimum reproduction number, respectively, then we have
| (2.8) |
where
An optimal solution P*(η) to Problem (I) that lies in the interior (0, 1)n of the unit hypercube must also satisfy the following equations:
| (2.9) |
where the constant λ is the Lagrange multiplier.
Similarly, noting that the constraint set is compact and nonempty (as (1, 1, … , 1) = 0 ≤ 1), Problem (II) always has a solution. The minimum value of η, which we denote by η*, signifies the smallest number of vaccine doses that can prevent outbreaks under an optimal vaccination policy. It is useful practically to have an explicit expression or estimate of the bounds for η*.
To find η*, notice that is a monotonically decreasing function of pi, and thus, a decreasing function of . Therefore, recalling also the assumption , the inequality constraint can be replaced by an equality constraint , and thus,
It follows that η* is the minimum of η ∈ [0, N] such that and can be found by solving the equation:
| (2.10) |
2.1. Notation
Below we introduce some mathematical concepts that will be used throughout the paper. For ease of reference, we also list the quantities that appear, along with their definitions, in Table 1.
Table 1.
Parameters and symbols with their definitions
| Symbol | Description |
|---|---|
| ai | Per capita contact rate of members of sub-population i |
| γ | Per capita rate of recovery |
| θ | Per capita rate of entering and exiting a sub-population |
| β | Probability of infection on contact |
| ρ | = β/(γ + θ) |
| Ni | Size of sub-population i |
| N | = N1 + N2 + … + Nn. Total population |
| cij | Proportion of contacts of individuals in group i that are with group j |
| C | = (cij). Mixing matrix |
| r(A) | Spectral radius of the matrix A |
| ϵ i | Fraction of contacts of group i reserved for itself |
| n | Number of sub-populations in the meta-population |
| pi | Proportion of sub-population i that is vaccinated |
| p | = (p1, p2,…, pn) |
| = ρai. Basic reproduction number of sub-population i | |
| = . Effective reproduction number of sub-population i | |
| Meta-population basic reproduction number | |
| Kv(p) | . Next generation matrix (NGM) |
| = r(Kv(p)). Meta-population effective reproduction number | |
| η | = . Total number of vaccine doses |
| = {(p1, p2, …, pn) : ℓ(p) = η} | |
| P*(η) | = (, , … , ). Optimal allocation of vaccine |
| Minimum of for a given number of vaccine doses η | |
| η * | Minimum doses for achieving |
| η o | Infimum of all η ∈ (0, N) such that P*(η) ∈ (0, 1)n |
| qi | = 1 − pi |
| q | = (q1, q2, …, qn) |
| = | |
| = {(q1, q2, … ,qn): } | |
| = (1,1, … ,1) − p*(η) | |
| = (1 − q1, 1 − q2, … , 1 − qn) | |
| = | |
| Γ, | Critical and ‘reflected’ critical rays (see Theorems 3.2 and 4.3) |
| π i | = Ni/N. Fraction of population belong to sub-population i |
| S | = . Fraction of the population that is unvaccinated |
| = . Harmonic mean of weighted by πi | |
| = . Population weighted reproduction number | |
| = . Analogous to a scaled reproduction number |
i, j = 1, 2, … , n.
-
–
A set E in a vector space is called convex if for any x0, x1 ∈ E the convex combination (1 − t)x0 + tx1 ∈ E for any t ∈ [0, 1].
-
–We say that a real-valued function f on a convex set E is convex if for any x0, x1 ∈ E
-
–
We say f is strictly convex if the above inequality is strict for all t ∈ (0, 1).
-
–
Recall that if f is twice continuously differentiable, then f is convex on an open convex set if and only if the Hessian Hess f = (∂2 f/∂xi ∂xj) is a nonnegative semi-definite matrix for any x ∈ E. Moreover, f will be strictly convex if Hess f is positive definite. (The converse of this statement is false.)
-
–
All matrices considered in this paper will be over the field of real numbers.
-
–
If C = (cij) is a matrix, then we write C > 0 (C ≥ 0) if cij > 0 (cij ≥ 0) for any pair i, j . We also say that such C is positive (nonnegative). This should not be confused with the notion of positive definite (nonnegative semi-definite) matrices, which are related to the positivity (nonnegativity) of the quadratic form x ↦ xTCx associated with C.
-
–
We say that a square matrix C = (cij) is essentially nonnegative1 if its off-diagonal elements are nonnegative; i.e., cij ≥ 0 if i ≠ j.
-
–
We say that a nonnegative squarematrix C is irreducible if for any pair (i, j), i, j = 1, … , n, there exists a natural number m = m(i, j) such that the entry in the ith row and jth column of Cm is positive.
3. Results for n = 2 sub-populations
In the case of n = 2, the NGM (2.7) is a 2 × 2 matrix. From this matrix we obtain the following explicit expression for the reproduction number as a function of p = (p1, p2):
One condition needed for proving the existence of an optimal solution to Problem (I) is that the mixing matrix satisfies
| (3.1) |
It is easy to verify that condition (3.1) holds for Jacquez mixing as given in (2.5):
provided that ϵi ∈ (0, 1), ai > 0, and Ni > 0, i = 1, 2.
3.1. Statements of the main results
Before we state the main results on the existence and uniqueness of the optimal solutions to Problems (I) and (II), we give the following critical properties of (p1, p2):
Theorem 3.1 (Key properties of (p1, p2)) Consider as a function of p1 and p2, and assume that condition (3.1) holds.
-
(i)
and grows linearly on the rays emanating from (1, 1) into the square [0, 1]2 (See Fig. 1 for illustration).
-
(ii)
is convex on [0, 1]2 and strictly convex on the constraint set .
Fig. 1.
Plot of contour curves of in the case of n = 2 sub-populations and depiction of the optimal point P*(η). Γ is the critical ray. is the constraint line. η0 is the greatest lower bound of 0 < η < N such that P*(η) ∈ (0, 1)2 only if η0 < η < N. This figure illustrates that P*(η) always lies on Γ for such η
The proof of Theorem 3.1 is provided in “Appendix A.1”.
The following theorem describes the solution to Problem (I). For ease of presentation, we introduce the following notation:
| (3.2) |
The set Γ describes the ray emanating from (1, 1) in the direction of −(κ1, κ2), to which we will refer as the critical ray (see Fig. 1).
Theorem 3.2 (Optimal solution to Problem (I) when n = 2) Consider as a function of p1 and p2, and let η0 and κi be given in (3.2). Assume that condition (3.1) holds.
-
(i)
For any given η ∈ [0, N], the optimal point P*(η) exists and is unique.
-
(ii)The point P*(η) lies in the interior of the unit square if and only if
and η ∈ (η0, N).(3.3) -
(iii)
For η ∈ (η0, N), all points P*(η) lie on the critical ray Γ, defined in (3.2) (see Fig. 1).
- (iv)
Remarks A few remarks are in order.
-
(a)
If either of the conditions in (3.3) is violated, or equivalently κ1 ≤ 0 or κ2 ≤ 0, then the ray Γ does not intersect the interior of the square (0, 1)2.
-
(b)
The minimum point P*(η) is the intersection of the critical ray Γ and the constraint set for each η ∈ (η0, N).
-
(c)
If η ∈ (0, η0), the intersection of Γ and lies outside the square [0, 1]2. When η = η0, the intersection lies on the boundary of the square.
-
(d)If η ∈ (0, η0), the minimum point P*(η) is one of the boundary points (η/N1, 0) or (0, η/N2) and hence
-
(e)
Overall, is a strictly decreasing convex function of η and linear on (η0, N) (see Fig. 2).
Fig. 2.
Depiction of the minimized reproduction number as a function of η. The illustration corresponds to the case when ≥ 1
An explicit expression for the optimal solution η* to Problem (II) can be obtained by using Eq. (2.10). We consider two cases depending on the value of
Theorem 3.3 (Critical number of vaccine doses) If condition (3.3) is satisfied, then the minimum value of η* in Problem (II) is given below.
-
(i)(Interior minimum) If ≥ 1, then η* ≥ η0 and
(3.6) -
(ii)(Boundary minimum) If ≤ 1, then 0 ≤ η* ≤ η0 and
The proof is given in “Appendix A”.(3.7)
4. Results for n ≥ 2 sub-populations
In this section, we extend the results for n = 2 sub-populations presented in Sect. 3 to the case of n > 2. For general mixing matrices C = (cij), due to the complexity of the optimization problems (I) and (II) when n > 2, we are unable to obtain explicit expressions for the optimal solutions. Nevertheless, we can derive lower and upper bounds for the minimum reproduction number and the minimum vaccine doses η*. We first present results for general mixing (cij) satisfying (2.2)–(2.4). We then illustrate that some of the key necessary properties of the mixing matrix can be verified for the Jacquez mixing given in (2.5).
4.1. Preliminaries
Rewrite the NGM given in (2.7) as
The effective reproduction number for the meta-population is the spectral radius (and the dominant eigenvalue, by Perron–Frobenius Theorem) of the nonnegative matrix Kv(p), i.e., (p) = r(Kv(p)).
Although the focus of this study is on optimal solutions to Problems (I) and (II), the results presented in the following theorem about the bounds of and are significant in more general applications.
Theorem 4.1 (Bounds for ) Let C be a nonnegative, invertible, irreducible matrix such that −C−1 is essentially nonnegative and the conditions (2.2)–(2.4) are satisfied. Then
-
(a)The lower and upper bounds of are:
(4.1) -
(b)
The lower and upper bounds of correspond to the cases of proportionate mixing and isolated mixing, respectively.
The proof of Theorem 4.1 is given after the proof of Theorem 4.9.
Remarks Theorem 4.1 is stated using the effective reproduction number for 0 ≤ p ≤ 1. The results holds in particular for the basic reproduction number that the the lower and upper bounds are and .
For the ease of presentation, we introduce the ‘reflected’ variables
Note that qi represents the unvaccinated portion of sub-population i = 1, … , n. We also introduce the ‘reflected’ function
| (4.2) |
Note that the formula (4.2) can be used to extend to [0,∞)n. The constraint hyperplanes in Problem (I) will transform to
where
and the optimal point P*(η) will become
For the minimum value of on we will have
The following result states key properties of , generalizing Theorem 3.1 in the case n = 2.
Theorem 4.2 (Convexity and homogeneity of ) The function is homogeneous of degree 1 on q ∈ [0, 1]n, i.e.,
Moreover, is convex if the matrix C = (cij) is invertible and −C−1 is essentially nonnegative. If additionally C is irreducible, then is strictly convex on the constraint set (see Fig. 3).
Fig. 3.
a is a contour plot of in [0, 1]3. b illustrates the restriction of level sets to
Theorem 4.2 immediately implies the following property.
Theorem 4.3 (Critical ray) Let C be a nonnegative square invertible irreducible matrix such that −C−1 is essentially nonnegative. Let ∈ (0, N) be such that has an interior relative minimum point on . Then is the unique point that satisfies the Lagrange multiplier condition
If s > 0 is such that ∈ (0, N), then the unique relative minimum point on is given by
provided that this point still lies in the interior of the unit hypercube.2 Thus, all interior relative minimum points lie on the critical ray emanating from the origin.
We start with the following facts about the spectral radius r (A) of a nonnegative matrix A. The first one is rather simple, if not obvious.
Lemma 4.4 If A is a square matrix and s > 0 then
The second one is also well-known, see e.g. Hill and Longini (2003); Nussbaum (1986).
Lemma 4.5 If A, B are nonnegative irreducible matrices such that A ≤ B, then
The next one is more subtle and is based on a theorem of Friedland (1980/81, Theorem 4.3); see also generalizations of this result in Nussbaum (1986, Sect. 1).
Lemma 4.6 Let C be a nonnegative invertible squarematrix such that −C−1 is essentially nonnegative. Then the mapping
is convex on the set of positive diagonal matrices D = diag(d1, … , dn), di > 0, i.e.,
for any positive matrices D1 and D2 and t ∈ [0, 1]. Moreover, if additionally C is irreducible,3 then the inequality above is strict for t ∈ (0, 1), unless D2 = sD1 for some s > 0.
Proof of Theorem 4.2 If we denote
then for homogeneity we just note that
and therefore by Lemma 4.4, we have
The convexity of follows from the fact that
i.e., is a composition of a linear mapping
and a convex function rC, and is therefore convex, as we impose the condition that −C−1 exists and is essentially nonnegative.
Finally, the strict convexity of on follows from the strict convexity property of rC in Lemma 4.6, because no two points on lie on the same ray emanating from the origin.
4.2. Upper and lower bounds of optimal solutions
In this section, we establish bounds on quantities relevant to Problems (I) and (II) for general mixing matrices C, satisfying (2.2)–(2.4) with an additional property that −C−1 is essentially nonnegative. As we saw in Theorem 4.2, the latter condition is needed to guarantee the convexity of . On the other hand, conditions (2.2)–(2.4) provide important information on positive eigenvectors of the mixing matrix C, which is instrumental in deriving our bounds.
More specifically, we prove upper and lower bounds for , , and (or equivalently, P*(η), , and η*). An interesting feature is that these bounds are independent of the functional form of mixing cij.
4.2.1. Equal per capita contact rates
We start with a special case when all per capita contact rates ai are the same. We show that the minimum of on will occur on the diagonal q1 = q2 = … = qn, under the conditions on C that guarantee the convexity of . In particular, this will hold for simple mixing matrices given by (2.5) for any choice of ϵi ∈ (0, 1) and Ni > 0.
Theorem 4.7 (Equal per capita contact rates) Let C be a nonnegative, invertible, irreducible matrix such that −C−1 is essentially nonnegative and for which the conditions (2.2)–(2.4) are satisfied. Assume additionally that ai = a > 0, i = 1, … , n. Then the minimum of on the intersection of for is achieved at the point
Thus, the critical ray is given by
We will need the following characterization of Friedland (1980/81, Theorem 3.4) for the spectral radius.
Lemma 4.8 If C is a nonnegative, invertible matrix, such that −C−1 is essentially nonnegative, then its spectral radius r (C) is given by
where
Moreover, if C is irreducible, and u = (u1, … , un)T > 0, v = (v1, … vn)T > 0 are right and left eigenvectors of C, i.e.,
then for ξ = (u1v1, – , unvn) one has
Proof of Theorem 4.7 When ai = a, i = 1, … , n, we have the following essential properties of the matrix C
This means that u = (1, … , 1)T and v = (N1/N, … , Nn/N)T are normalized right and left eigenvectors of C:
By Perron–Frobenius Theorem, we also have r (C) = 1. (Note that this holds for any nonnegative matrix satisfying (2.3)). Then by Lemma 4.8 we have
Suppose now that q = (q1, … , qn) ∈ (0, 1)n is such that . Then β = (β1, … , βn) ∈ Pn, where
Note, that we also have
where is the common value of , i = 1, … , n, which also is the meta-population basic reproduction number. Then, by Lemma 4.8,
Note that the use of Lemma 4.8 above is justified because, as with C, is positive, invertible, irreducible, and
is essentially nonnegative. Hence,
On the other hand,
and therefore
By continuity of , this completes the proof.
4.2.2. Arbitrary per capita contact rates
In this section, we first establish the upper and lower bounds for (see Theorem 4.9) and equivalently . The lower bound can be proved using the arguments similar to the proof of Theorem 4.7, and the upper bound follows from the monotonicity of the spectral radius as a function of nonnegative matrices. We then proceed to obtain bounds for the relative minima and equivalently , as well as a bound for the critical value that makes ≤ 1.
Let amin denote the minimum of the activities of the sub-populations, i.e.,
Note that amin > 0. The results beloware for general mixing matrices (not just Jacquez mixing given in (2.5)).
Theorem 4.9 (Bounds for ) Let C be a nonnegative, invertible, irreducible matrix such that −C−1 is essentially nonnegative and the conditions (2.2)–(2.4) are satisfied. Then the bounds of are:
| (4.3) |
Moreover, equalities hold if for s ∈ [0, amin].
Proof We start with the lower bound in (4.3). The key observation is that
which gives positive eigenvectors for C and CT. That is, if u = (1, … , 1)T and v = (a1N1/A, … , anNn/A)T with , then
Then, by Lemma 4.8,
Then, for any q ∈ (0, 1)n, let
and note that βi > 0 and ; i.e., β = (β1, … , βn) ∈ Pn. Hence, by Lemma 4.8,
or equivalently
and by continuity also for all q ∈ [0, 1]n. This proves the lower bound.
The upper bound of can be obtained by noticing that
and applying Lemma 4.5:
| (4.4) |
where we have used that r (C) = 1.
To establish the equalities in (4.3), it is easy to see that the upper and lower bounds are the same when q = s(1/a1, … , 1/an) for constant s ∈ (0, amin). The restriction on s guarantees that the point q is in (0, 1)n. Then, by continuity, equality holds also for the endpoint values of s.
The proof is completed.
We now use the results above to prove Theorem 4.1.
Proof of Theorem 4.1 To prove Theorem 4.1(a), i.e., the bounds for in (4.1), recall that qi = 1 – pi, , and for i = 1, 2, … , n. Substitution of these relationships into (4.3) yields (4.1).
For (b), note that when themixing C = (cij) is proportionate, cij = ωj. In this case, the NGM has rank 1 and its dominant eigenvalue is the sum of the diagonal elements. That is, . Note also that in this case of isolated mixing, i.e., ϵi = 1 for all i in (2.5), C = I and the NGM is diag(), for which .
It is clear that for the proportionate mixing and isolated mixing, the corresponding reproduction numbers coincide with the lower and upper bounds of given in (4.1). Thus, while Theorem 4.1, as stated, is not formally applicable to the proportionate and isolated mixing functions, the bounds of in (4.1) correspond to these two extreme cases.
This completes the proof.
It is easy to verify that, when a1 = a2 = ⋯ = an, Theorem 4.9 contains the conclusion of Theorem 4.7.
Theorems 4.1 and 4.9 can be used to derive the lower and upper bounds for the minimum reproduction number . Introduce the following notation:
| (4.5) |
The following results provide the lower and upper bounds for the minimum in Problem (I):
Theorem 4.10 Assume that the conditions of Theorem 4.9 hold. Let η < N (or > 0), and let S, , , and be defined in (4.5).
-
(a)The bounds of for are:
(4.6) -
(b)The bounds of for are:
(4.7)
Proof (a) To prove the lower bound, note first that ≤ for all q ∈ . Note also that from the inequality in Theorem 4.9 we have
This proves the lower bound in (4.6).
For the upper bound in (a), choose s > 0 so that Q = s(1/a1, … , 1/an) ∈ ; i.e., . Note that Q ∈ [0, 1]n if 0 ≤ s ≤ amin or 0 ≤ ≤ amin . Then, by the case of equality in (a) of Theorem 4.9,
This completes the proof of (a).
To prove (b), note that . Note also that the left-hand side (LHS) and right-hand side (RHS) of inequality (4.6) can be re-expressed in terms of sub-population reproduction numbers . For the LHS of (4.6),
It follows that
| (4.8) |
For the RHS of (4.6), note that
| (4.9) |
From (4.8) to (4.9) we obtain (4.7). This completes the proof of (b).
We can now deduce the results for the upper bound of the critical number of vaccine doses η* in Problem (II):
Theorem 4.11 (Critical number of vaccine doses) Let C satisfy the same conditions as in Theorem 4.9, and let be defined in (4.5).
-
(a)the upper bound for the minimum vaccine dose is given by:
(4.10) -
(b)If for all i , then the inequality (4.10) can be re-written as
(4.11)
Proof (a) We prove the inequality (4.10) by considering two cases.
Case 1: Assume first that 1/ρ ≤ amin. In that case, the upper bound in Theorem 4.10 is applicable for , which gives
Thus,
Case 2: Assume now that 1/ρ ≤ amin. Then the upper bound in Theorem 4.10 for gives
Thus,
Combining cases 1 and 2, we obtain
and recalling that = N − η*, we complete the proof of (a).
For (b), note that the RHS of (4.10) can be re-written as
Thus, the inequality (4.10) can be re-written as
This completes the proof.
Remarks
-
(i)
Note that and (see (4.5)) are weighted basic reproduction numbers, and the factor S is the fraction of the overall population that remains susceptible. In light of this, we see that the lower and upper bounds for in (4.7) take the familiar form of an effective reproduction number.
-
(ii)
The lower and upper bounds in (4.7) are equal if the activities ai for subpopulations i are all the same. Note that = ρai = βai /(θ + γ). Thus, are the same when ai are the same for all i. Then, from (4.5) we see that , which implies that the inequalities in (4.7) become equalities.
-
(iii)
For the upper bound of η*, if ai = a are all the same, we have , in which case the upper bound in (4.11) becomes 1 − 1/. This is similar to the usual formula for the critical vaccination fraction pc = 1 − 1/, for which the number of vaccinated is ηc = pcN = N(1 − 1/).
4.3 Example: the case of Jacquez mixing
As an example ofmixing functions that satisfy the conditions described in Lemma 4.6, we consider the cij for the meta-population model in Feng et al. (2015), which is the Jacquez mixing as given in (2.5).
Proposition 4.12 Let the matrix C = (ci j) be given by (2.5) with ϵi ∈ (0, 1), i = 1, … , n. Then C is invertible and B = C−1 = (bij) is given by
In particular, −C−1 is essentially nonnegative.
Remark 4.13 It is also clear that under conditions of Proposition 4.12, C is positive and thus irreducible.
Proof It will be sufficient to show that CB = I (the identity matrix), or
To simplify computations, let
Then
where we have used above that
This completes the proof.
5. Discussion
The main goal of this study was to solve Problems (I) and (II), which identify the most efficient allocation of limited vaccines using a meta-population model for vaccine-preventable infectious diseases. Although we demonstrated the results using Model (2.1), the approach can be applied to other meta-population models for vaccine-preventable diseases. Model (2.1) incorporates various heterogeneities such as in activity, contacts between sub-populations (mixing), vaccination coverage, and size of each sub-population. We considered general mixing functions that satisfy conditions (2.2)–(2.4), including the special case of Jacquez mixing in (2.5) and special cases of this namely, proportionate mixing (ϵi = 0 for all i) and preferential mixing ( ϵi = 1 for all i). However, it would be more challenging to consider a similar approach to meta-population models that are less-tractable analytically than Model (2.1), particularly when the NGM has a more complicated structure. For example, when a model includes additional factors such as aging from one age-group to the next, multi-level mixing (e.g., age and spatial), and heterogeneity in infectivity and susceptibility, the effective reproduction number as a function of vaccination coverage will be more difficult to study from an analytic point of view.
The optimization problem is based on reducing the effective reproduction number (if > 1) by determining the optimal combination of vaccine coverages p = (p1, p2, … , pn). Because the parameters pi must be between 0 and 1, the optimal solution P*(η) needs to be in the unit hypercube. Even in the case of n = 2 sub-populations, the solution of Problems (I) and (II) is not trivial. For n > 2, the most challenging task is to show the convexity of ; Theorem 4.2. This proves a conjecture of Hill and Longini (2003), although those authors did not consider the structure of the mixing matrix C specified by conditions (2.2)–(2.4). Our proofs are facilitated by using the ‘reflected’ quantities qi = 1 − pi and (q1, q2, … , qn) = (p1, p2, … , pn). For ease of presentation, we first illustrated results for the simpler case of n = 2 sub-populations, and then extended them to n > 2 sub-populations.
In the case of n = 2, explicit formulae are obtained for the optimal solutions when mixing is proportionate or preferential. For Problem (I), the optimal solution P*(η) for a given number of vaccine doses η and the minimized reproduction number are described as functions of model parameters (Theorem 3.2). For Problem (II), an analytical formula for the minimum vaccine doses η* that will reduce to below 1 is provided (Theorem 3.3).
Another interesting finding is that, for any number of vaccine doses η in the constraint, the optimal vaccine coverage P*(η) lies along the ‘critical’ ray Γ . In addition, when n = 2, for the optimal solution P*(η) to be in the unit square [0, 1]2, available vaccine doses must satisfy η0 < η < N, where the lower bound η0 is determined by model parameters. For η ≤ η0, the optimal strategy will be to vaccinate only one sub-population.
Extension of these results to the case n > 2 is complicated by the fact that no explicit formulae are available. Nevertheless, using results for the spectral radius of nonnegative matrices, in Sects. 4.1 and 4.2, we obtain bounds of for an arbitrary mixing C that satisfies the conditions in Theorem 4.1. An interesting finding is that for a large class of mixing matrix C (not just Jacquez), the proportionate mixing gives the smallest while the isolated mixing (no mixing between sub-populations) gives the largest given (see Theorem 4.1). It is clear that these conclusions hold particularly for . The facts that population heterogeneities tend to increase and that models assuming proportionate mixing generate lower values of have been suggested by other researchers (Adler 1992; Andersson and Britton 1998; Diekmann et al. 2012).
We also establish bounds on the relative minima and the critical vaccine dose η* (see Theorems 4.10 and 4.11). Interpretations of those bounds are provided in terms of biological quantities such as weighted reproduction numbers. In particular, we see that the lower and upper bounds are products of the weighted reproduction number and the harmonic mean , respectively, with the fraction unvaccinated S (see (4.7)). Thus, both bounds are in the familiar form of effective reproduction numbers. Moreover, the bounds are equal when all sub-populations have the same activity (ai). A similar interpretation holds for the upper bound of η*, in which case the usual basic reproduction number is replaced by the harmonic mean of sub-population reproduction numbers the weighted by sub-population fractions πi (see also (4.11)).
Acknowledgements
The findings and conclusions in this report are those of the author(s) and do not necessarily represent the official position of the Centers for Disease Control and Prevention or other institutions with which they are affiliated. We thank the anonymous reviewers for comments and suggestions, which helped improve the presentation of the manuscript.
The findings and conclusions in this paper are those of the authors and do not necessarily represent the views of the Centers for Disease Control and Prevention.
A Appendix
In this appendix, we provide detailed proofs for Theorems 3.1–3.3 in Sect. 3 and illustrate an example of these results in the case of Jacquez mixing.
A.1. Proofs of Theorems 3.1–3.3
To prove these theorems, we first prove several propositions. Instead of working with the function (p1, p2) along the rays emanating from the point (p1, p2) (1, 1), it is much easier to consider the ‘reflected’ variables:
and the corresponding rays emanating from the point (q1, q2) = (0, 0) into the unit square [0, 1]2. The ‘reflected’ function is then given by
The quantities corresponding to those mentioned directly after Problem (I) in terms of q and the reflected function are
In addition, at the optimal points P* or Q*, we have
and we know that satisfies the equation
or, equivalently,
provided is in the interior of the unit square.
A.1.1. Reflected function and its properties
The reflected function is given explicitly by
| (A.1) |
for q ∈ [0, 1]2. Note that the formula (A.1) can be used to naturally extend to the first quadrant q ≥ 0 (i.e., q1, q2 ≥ 0). It can be checked that the function is homogeneous of degree 1, i.e.,
| (A.2) |
Geometrically, this means that grows linearly on the rays emanating from the origin, as illustrated in Fig. 4.
Fig. 4.
The graph of . grows linearly on the rays emanating from the origin
We next compute the first derivatives of :
| (A.3) |
for i, j = 1, 2 and i ≠ j. Note that the function is homogeneous of degree 0:
| (A.4) |
i.e., is constant on rays emanating from the origin.
Further, we compute the second derivatives of . By direct computation, we have the following formula for the Hessian
where
Note that k > 0 by (3.1) and therefore Hess is a nonnegative semi-definite matrix, by Sylvester’s criterion. Consequently, is a convex function of (q1, q2). We explicitly note here that is not a strictly convex function, as it grows linearly on the rays emanating from the origin. However, as we show below, it is strictly convex in certain directions. To be more precise, let u = (u1, u2) be a unit vector, and consider the second derivative of in the direction u. We have
whereupon
(Here, u ∥ v indicates that vectors u and v are parallel.) In particular, is strictly convex in the direction (N2, − N1).
To proceed, recall that
and consider again the constraint set
This is a line segment parallel to the vector (N2, − N1) with endpoints on the boundary of [0, 1]2. We will denote the left and right endpoints (with respect to the direction (N2, − N1)) by and , respectively. It is easy to see that
Proposition A.1 The function is strictly convex on the intersections , for ∈ (0, N) if condition (3.1) holds. Consequently its minimum will occur at an interior point if and only if
| (A.5) |
| (A.6) |
where and are the left and right endpoints of (see Fig. 5). Moreover, is the unique point on such that
| (A.7) |
Fig. 5.
Level sets of a and b . The point in (b) is the relative minimum point of on . is the set of all such minima
From the homogeneity properties (A.2) and (A.4), we also have the following proposition.
Proposition A.2 Under the assumptions of Proposition A.1, let ∈ (0, N) be such that is an interior point. Then, for s > 0 such that ∈ (0, N), the minimum point of on is given by
provided that this point still lies in the interior of the unit square.4 In other words, all interior minimum points lie on a ray emanating from the origin. We will denote this ray by and call it the (reflected) critical ray. Moreover, by Proposition A.1
Proof For we have (see Fig. 5)
But then
and by (A.4)
implying that is the critical point on the constraint with constant s and thus that
The rest then follows from Proposition A.1.
A.1.2. Endpoint conditions
We now write the endpoint conditions (A.5)–(A.6) using the explicit formulae for the derivatives of . Because of Proposition A.2, to verify (A.5)–(A.6), it will be sufficient to verify them for the constraint value s with a small s > 0. Thus, without loss of generality, we may assume that itself is small. In that case, the intersection points and of with ∂([0, 1]2) will lie on the left and bottom sides of the square, i.e.,
From (A.3) we have
and, therefore, the conditions (A.5)–(A.6) will take the form
which is equivalent to (3.3).
A.1.3. The critical ray
We next characterize the critical ray . Namely, we identify the intersection point of with ∂([0, 1]2), which we denote by Q0.
For this, consider the function
Because ϕ = 0 on (see Proposition A.1), we must have
Note that conditions (A.5) and (A.6) are equivalent to
Thus, the location of Q0 will depend on the sign of
That is, ∈ {1} × (0, 1] if ϕ(1, 1) ≤ 0, and Q0 = (, 1) ∈ (0, 1] × {1} if ϕ(1, 1) ≥ 0. We will use this fact to find Q0 and characterize .
Proposition A.3 Assume that conditions (3.1) and (3.3) hold, where κ1, κ2 are as in (3.2). Then the intersection of the critical ray with ∂([0, 1]2) has the property
Consequently, the critical ray is given by
Proof Recall that and consider the case ϕ(1, 1) ≤ 0 first. In that case, , where is found from the equation ϕ(1, q2) = 0. This equation can be reduced to a quadratic equation for q2, which has two roots, given by the formulae
where
Then equals either or (the reduction to a quadratic equation may have introduced a false root). Plugging the formulae above into ϕ, it can be verified that if and only if
This inequality is indeed satisfied, because of the condition (3.1):
On the other hand, the verification of the equation results in the condition c22λ2 = c11ρ2, which implies that . Thus, in either case, we can conclude that . Hence,
which is the same quantity as κ2/κ1 as in the statement of the proposition. This finishes the proof in this case.
The case ϕ(1, 1) ≥ 0 is considered similarly and we obtain exactly the same value for the ratio .
We next identify the value of the constraint , which corresponds to the intersection point of and ∂([0, 1]2).
Proposition A.4 Let Q0 be as in Proposition A.3. Then
Proof If κ1 ≥ κ2 then and and thus
On the other hand, if κ1 ≤ κ2 then and and thus
Combining the two cases, we obtain the stated formula.
A.1.4. Explicit formulae for minima
Proposition A.5 For 0 < < , we have the following explicit formulae for and the minimum of :
Proof We know that , where s > 0 can be found from the constraint , which gives
This proves the first formula. To establish the second, we first use the homogeneity of :
Furthermore, by direct calculations, one can show that
which completes the proof.
A.1.5. Proofs of the Theorems
The proofs for Theorems 3.1 and 3.2 can be completed by combining Propositions A.1–A.5, and by writing their statements in terms of the original variables pi and the constraint value η.
To prove Theorem 3.3, we consider several cases. When , we simply use the formula for in Theorem 3.2 for η0 < η < N to find η*. When , the minimum points P*(η) will be on the boundary of the square for 0 < η < η0 and we solve the equations
or more precisely in variables (q1, q2):
from which we get
The corresponding values of η* are
A.2. Example: the case of Jacquez preferred mixing
The conditions in (3.3), which guarantee that the critical ray Γ passes through the interior of the unit square, hold for general mixing functions that satisfy (2.2)–(2.4). These conditions may simplify when specific functions are considered. In this section, we consider the Jacquez preferred mixing given in (2.5).
A.2.1. Interpretation of conditions for the interior critical ray
To verify the conditions (3.3), we consider two cases, one for homogeneous activity (a1 = a2) and other for heterogeneous activity (a1 ≠ a2).
Case 1: a1 = a2. In this case, the inequality κ1 > 0 can be rewritten as
or
which is always satisfied. Similarly, one can verify that κ2 > 0 as well and therefore conditions (3.3) will always hold if a1 = a2.
Case 2: a1 ≠ a2. In this case, the inequality κ1 > 0 can be rewritten as
and simplifying further to
Note that this inequality will readily hold if (1 − ϵ1)a1 − a2 ≤ 0 (which will happen, e.g., if a1 ≤ a2), as the left-hand side will be nonpositive and the right-hand side positive. If, however, (1 − ϵ1)a1 − a2 > 0, then the above inequality will transform to
By repeating this analysis with interchanged indices, we summarize the results above in the following proposition.
Proposition A.6 Suppose the mixing matrix C is given by (2.5) with n = 2.
A.2.2. Simplified expressions at the optimal point
Some of the explicit expressions for the optimal solution provided in previous subsections hold for more general mixing functions C = (cij). These expressions may be simplified when the preferred mixing given in (2.5) is used. These simplified expressions are described in the following propositions. Let
| (A.8) |
for i, j = 1, 2 and i ≠ j. Note that the αi differ from the κi in (3.2) by a constant positive factor. In particular, α2/α1 = κ2/κ1.
Proposition A.7 Consider the mixing function given in (2.5). Let condition (3.3) be satisfied as described in Proposition A.6. Then the relative minima will be interior points if and only if
and will lie on the critical ray
Moreover, the following explicit formulae hold for 0 < < :
| (A.9) |
We next note that, for αi defined in (A.8), we have
and therefore,
In particular, α1 = α2 if a1 = a2. This implies the following particular case of Proposition A.7, which is especially interesting as the minimum points and values do not depend on ϵi (i = 1, 2) although the function and its level sets do (see Fig. 6). We also remark that this is essentially a version of Theorem 4.7 for n = 2, but its proof is more elementary.
Fig. 6.
The critical ray coincides with the main diagonal when a1 = a2 as in Proposition A.8, for any values of ϵi ∈ (0, 1), Ni > 0, i = 1, 2. This figure illustrates two cases with values a ϵ1 = 0.1, ϵ2 = 0.5, N1 = 400, N2 = 1000 and b ϵ1 = 0.8, ϵ2 = 0.2, N1 = 1200, N2 = 300
Proposition A.8 Let C be the Jacquez mixing given in (2.5), and let
Then for all possible values of constants ϵi ∈ (0, 1) and Ni > 0 (i = 1, 2), the critical ray coincides with the diagonal
and the expressions in (A.9) simplify to
| (A.10) |
Footnotes
Note that terminology differs among authors. Here, we use the terminology of Nussbaum (1986); the negatives of such matrices are called M-matrices by Friedland (1980/81) and are said to have a Z sign pattern by van den Driessche and Watmough (2002).
This will automatically hold for 0 < s < 1.
See the definition in Sect. 2.1.
This will automatically hold for 0 < s < 1.
References
- Adler FR (1992) The effects of averaging on the basic reproduction ratio. Math Biosci 111(1):89–98 [DOI] [PubMed] [Google Scholar]
- Andersson H, Britton T (1998) Heterogeneity in epidemic models and its effect on the spread of infection. J Appl Probab 35(3):651–661 [Google Scholar]
- Brauer F, Castillo-Chavez C (2012) Mathematical models in population biology and epidemiology, texts in applied mathematics, 2nd edn. Springer, New York: https://doi.org/10.1007/978-1-4614-1686-9 [Google Scholar]
- Busenberg S, Castillo-Chavez C (1991) A general solution of the problem of mixing of subpopulations and its application to risk- and age-structured epidemic models for the spread of AIDS. IMA J Math Appl Med Biol 8(1):1–29 [DOI] [PubMed] [Google Scholar]
- Castillo-Chavez C, Feng Z (1998) Global stability of an age-structure model for TB and its applications to optimal vaccination strategies. Math Biosci 151(2):135–154 [DOI] [PubMed] [Google Scholar]
- Diekmann O, Heesterbeek JAP, Metz JAJ (1990) On the definition and the computation of the basic reproduction ratio R0 in models for infectious diseases in heterogeneous populations. J Math Biol 28(4):365–382. https://doi.org/10.1007/BF00178324 [DOI] [PubMed] [Google Scholar]
- Diekmann O, Heesterbeek H, Britton T (2012) Mathematical tools for understanding infectious disease dynamics. Princeton University Press, Princeton [Google Scholar]
- Feng Z, Hill AN, Smith PJ, Glasser JW (2015) An elaboration of theory about preventing outbreaks in homogeneous populations to include heterogeneity or preferential mixing. J Theor Biol 386:177–187. https://doi.org/10.1016/j.jtbi.2015.09.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feng Z, Hill AN, Curns AT, Glasser JW (2017) Evaluating targeted interventions via meta-population models with multi-level mixing. Math Biosci 287:93–104. https://doi.org/10.1016/j.mbs.2016.09.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friedland S (1980/81) Convex spectral functions. Linear Multilinear Algebra 9(4):299–316. https://doi.org/10.1080/03081088108817381 [Google Scholar]
- Glasser J, Feng Z, Moylan A, Del Valle S, Castillo-Chavez C (2012) Mixing in age-structured population models of infectious diseases. Math Biosci 235(1):1–7. https://doi.org/10.1016/j.mbs.2011.10.001 [DOI] [PubMed] [Google Scholar]
- Glasser JW, Feng Z, Omer SB, Smith PJ, Rodewald LE (2016) The effect of heterogeneity in uptake of the measles, mumps, and rubella vaccine on the potential for outbreaks of measles: a modelling study. Lancet Infect Dis 16(5):599–605. https://doi.org/10.1016/S1473-3099(16)00004-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hadeler K, Müller J (1996a) Vaccination in age structured populations I: the reproduction number. Models Infect Human Dis Struct Relat Data 6:90 [Google Scholar]
- Hadeler K, Müller J (1996b) Vaccination in age structured populations II: Optimal strategies. Models for infectious human diseases: their structure and relation to data pp 102–114
- Hill AN, Longini IM Jr (2003) The critical vaccination fraction for heterogeneous epidemic models. Math Biosci 181(1):85–106. https://doi.org/10.1016/S0025-5564(02)00129-3 [DOI] [PubMed] [Google Scholar]
- Jacquez JA, Simon CP, Koopman J, Sattenspiel L, Perry T (1988) Modeling and analyzing HIV transmission: the effect of contact patterns. Math Biosci 92(2):119–199. https://doi.org/10.1016/0025-5564(88)90031-4 [Google Scholar]
- Nold A (1980) Heterogeneity in disease-transmission modeling. Math Biosci 52(3–4):227–240. https://doi.org/10.1016/0025-5564(80)90069-3 [Google Scholar]
- Nussbaum RD (1986) Convexity and log convexity for the spectral radius. Linear Algebra Appl 73:59–122. https://doi.org/10.1016/0024-3795(86)90233-8 [Google Scholar]
- Seneta E (1973) Non-negative matrices. An introduction to theory and applications. Halsted Press, New York [Google Scholar]
- van den Driessche P, Watmough J (2002) Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Dedicated to the memory of John Jacquez. Math Biosci 180(1–2):29–48. https://doi.org/10.1016/S0025-5564(02)00108-6 [DOI] [PubMed] [Google Scholar]






