Abstract
The human papillomavirus (HPV) is sexually transmitted and can infect oral, genital, and anal sites in the human epithelium. Here, we develop a multisite transmission model that includes autoinoculation, to study HPV and other multisite diseases. Under a homogeneous-contacts assumption, we analyze the basic reproduction number R0, as well as type and target reproduction numbers, for a two-site model. In particular, we find that R0 occupies a space between taking the maximum of next generation matrix terms for same site transmission and taking the geometric average of cross-site transmission terms in such a way that heterogeneity in the same-site transmission rates increases R0 while heterogeneity in the cross-site transmission decreases it. Additionally, autoinoculation adds considerable complexity to the form of R0. We extend this analysis to a heterosexual population, which additionally yields dynamics analogous to those of vector–host models. We also examine how these issues of heterogeneity may affect disease control, using type and target reproduction numbers.
Keywords: Basic reproduction number, HPV, Multisite model, Autoinoculation, Heterogeneity, Disease control
1. Introduction
The basic reproduction number R0 is an important quantity in infectious disease epidemiology, defined as the average number of secondary cases arising from a typical primary case in an entirely susceptible population [1, 2, 3]. Equivalently, it is often defined as a threshold parameter that controls the stability of the disease-free equilibrium: if R0 < 1, an emergent disease will die off quickly, while if R0 > 1, the disease will become epidemic [3, 4]. The values of R0 vary greatly by disease [2], ranging from close to 1 for seasonal influenza to 5–7 for smallpox and polio to 12–18 for measles and pertussis. Mathematical modeling is used to estimate the basic reproduction number and other relevant quantities. In practice, R0 is calculated as a threshold parameter that may not precisely correspond to the number of secondary cases per infection, especially in the case of an environmentally transmitted disease [4]. For example, virus shed into the environment may contribute to additional infections not directly attributable to a specific infected person. In practice, R0 is typically calculated as the spectral radius of the next generation matrix [4].
The basic reproduction number also has implications for infection control. If a fraction of the population greater than is permanently protected from infection, such as through immunization at birth, the infection cannot become epidemic [2, 3, 5]. The concept of the basic reproduction number, at least in these infection control terms, can be extended to the type and target reproduction numbers [5, 6, 7]. If there are multiple host types, the type reproduction number for host type i is denoted Ti, and the infection can be controlled by protecting a greater fraction than of host type i, provided no other host acts as a reservoir for the infection. The type reproduction number is especially of interest for vector-borne and other multiple-species infections, and it can be extend to consider any subset of host types [5, 6]. The target reproduction number is a further generalization, in which specific pathways are targeted for control. This corresponds to considering only certain entries of the next generation matrix, instead of whole rows as is the case of the type reproduction number [7].
Here, we consider a class of diseases that may infect multiple sites in a host. Our motivating example is the human papillomavirus (HPV); it is well documented that oral and anogenital sites, although not completely independent, can become infected or clear the virus whether or not the other sites are infected and that autoinoculation may be an important pathway [8, 9, 10]. However, this model is also relevant to other sexually transmitted infections that affect multiple sites. Autoinoculation of the herpes simplex virus (HSV), which can infect genital, oral, anal and other epithelial sites, can occur, especially during the primary outbreak [11, 12, 13]. Chlamydia trachomatis is a bacterium that can infect the genital, gastrointestinal tract, pharynx, and the eyes. Transmission between sites can be direct (usually sexual) or through autoinoculation [14, 15, 16]. Infections by Candida species of yeast, Candida albicans in particular, can infect many sites, particularly oral and genital [17]. The model can also be used to consider multisite infections not typically spread by sexual interaction, such as Trichophyton (athlete’s foot) or conjunctivitis, and may possibly be extended to infections, such as molluscum and warts, that infect multiple, though not distinctly delineated, sites on the epithelium [18].
Deviation from the average, or heterogeneity, in various aspects of infectious disease models is important to the study of the basic reproduction number. Heterogeneity of populations, whether in terms of behavior, spatial distribution, or other characteristics, has been widely studied [19, 20, 3, 21] and has led to the development of such tools as mulitgroup modeling. Although the contexts are quite different, like Robertson et al. [22] we will consider heterogeneity among multiple transmission pathways. While Robertson et al. [22] considered heterogeneity in the direct and indirect pathways of waterborne cholera transmission, we consider differences within same-site and cross-site transmission of a multisite infectious agent.
In this paper we first develop and explore the dynamics of a multisite model with homogeneous contacts (homo- or pansexual population or a nonsexual infection), including analysis of the reproduction numbers and the effects of heterogeneity in the transmission pathways. We then extend this multisite model to one with heterogeneous contacts (heterosexual population) and see how the additional complexity translates into the reproduction numbers.
2. Two-site model with homogeneous contacts
We first consider a model of a two-site sexually transmitted infection assuming that contacts are homogeneous in order to explore the dynamics of a two-site system without the complication of heterosexual contact. We assume that clearance of the infection does not induce immunity (SIS framework). We denote, without loss of generality, the infection sites as oral and genital. Denote the fraction of the population that is uninfected by S, the fraction infected at site X by IX for X ∈{O, G}, and the fraction infected at both sites by IOG. Let μ be the birth/death rate, γX the recovery rate of infection at site X, νXY the rate of autoinoculation from site X to site Y, and βXY the transmission rate from site X on one individual to site Y on a second individual. As most of the diseases we consider here (including HPV) result in negligible mortality on transmission timescales (e.g. mortality due to HPV is typically on the longer timescale of cancer development later in life), we neglect death due to disease. All parameters are also assumed to be non-negative. The probability of two simultaneous events is assumed to be zero. A model schematic is presented in Figure 1, a summary of model parameters may be found in Table 1, and the following equations define our system:
(1) |
Table 1.
Symbol | Parameter |
---|---|
μ | Population birth/death rate |
βOG | Transmission from oral site to genital site on another person |
βGO | Transmission from genital site to oral site on another person |
βOO | Transmission from oral site to oral site on another person |
βGG | Transmission from genital site to genital site on another person |
γO | Clearance of an oral infection |
γG | Clearance of a genital infection |
νOG | Autoinoculation of a genital site from the oral site |
νGO | Autoinoculation of an oral site from the genital site |
As written, the region {0 ≤ S ≤ 1, 0 ≤ IO ≤ 1, 0 ≤ IG ≤ 1, 0 ≤ IOG ≤ 1} is positive invariant for the system, and all trajectories are subject to S + IO + IG + IOG = 1. Some terms, such as IOβOGIO, which reflects a sexual contact between two people with an oral infection that results in one person becoming infected genitally, are not typical in disease models but rather arise because of the multisite nature of the infection modeled.
A key feature of this class of models is autoinoculation. Here, we consider autoinoculation to be the infection of a new site on a host who currently has at least one infected site in the absence of contact with another infected individual. As noted in van den Driessche and Watmough [4], it is not always mathematically fixed whether particular terms—in this case, autoinoculation—should be considered new infections in the context of the next generation matrix. However, we argue that it is epidemiologically correct to consider autoinoculation to be a stage progression (incorporated in the V matrix) rather than a new infection (reflected in the F matrix). Consider an arbitrary individual, patient zero, with single-site infection who is introduced into a fully susceptible population. The basic reproduction number quantifies the number of secondary people infected by this initial infection over patient zero’s whole infective period. If we consider autoinoculation to be a new infection, autoinoculation might cause one secondary infection (at patient zero’s second site) but result in no new infected people. If we consider it to be a stage transfer, the infection at the new site may also contribute to secondary infections. The second interpretation gives a more epidemiologically meaningful R0.
It is useful to note that this model is in some ways similar to other models that include more than one type of infectious class, e.g. multi-strain or multi-stage infectious disease models, in that there are multiple infected classes between which individuals can move, and some structural similarities in the terms for transmission and autoinoculation. However, there are several important differences. In this model, an infected of one type can generate an infection of either type (e.g. an oral infection can generate an oral or genital infection, unlike in a typical multi-strain model), and new infections do not all begin in a particular stage (as would be the case for a typical stage-progression model). In particular, the cross-transmission and autoinnoculation features of the model yield that, generically, there are no single-site equilibria (e.g. in which we have genital but not oral prevalence of disease), as there often are for multi-strain models [4]. This can be seen by setting IO or IG to 0 and observing that the only solution is the disease-free equilibrium.
2.1. Basic reproduction number
We begin with a derivation and analysis of the reproduction number for this model (eq. 1). The disease-free equilibrium is trivially (1, 0, 0, 0). We construct the next generation matrix K = FV−1 as in van den Driessche and Watmough [4]. The rows of the F and V matrices correspond to the three infected compartments, namely IO, IG, and IOG. The F matrix of new infections is
(2) |
and the V matrix of compartment transfer is
(3) |
Then it is easy to show that V−1 has the form
(4) |
where the τ are average waiting times in the compartments, neglecting further infection as we are considering behavior near the disease-free equilibrium, i.e.
(5) |
and the p and q are probabilities that the next compartment transfer will be to IOG or from IOG, respectively, i.e.
(6) |
It should be noted that 1–pOqO –pGqG > 0. We can heuristically understand the form of V−1 in the following way (as in van den Driessche and Watmough [4]). In compartment IOG, the probability of returning to that compartment in two moves is pOqO + pGqG by going either to IO and back or IG and back. The average amount of time spent in IOG, assuming we start in, say, IO, is
(7) |
We can put this heuristic approach in more formal terms: consider the adjacency matrix A of the infected components of the directed graph depicted in Figure 2, where entry am,n is probability of entering component m from component n, for m, n ∈{IO, IG, IOG}. Thus
(8) |
Then
(9) |
This expression, with the inclusion of the τ parameters, leads directly to V−1.
The next generation matrix K = F V−1 thus has the form
(10) |
where
(11) |
The elements of the next generation matrix have a straightforward biological interpretation: KX|Y is the expected number of secondary cases of infection at site Y produced by one individual originally infected at site X, assuming an otherwise susceptible population. Because two events cannot happen simultaneously, trajectories near the disease-free equilibrium have IOG = 0, and so all terms of the form KX|OG are zero. It is important to note here that and , where and are the basic reproduction numbers of the single site SIS models, except in the case of no autoinoculation. In this model, without loss of generality, an individual with only a genital infection originally can autoinoculate their oral site and spread infection from either, which adds significant complexity to the terms of the next generation matrix.
It is also useful to note here that from the form of the elements of the next generation matrix, we see it is possible to have and , and yet have the overall R0 > 1, depending on the rates of autoinoculation and cross transmission. This can happen for biologically reasonable ranges of parameters. Consider βOO = βOG = βGO = βGG = 1, νOG = νGO = 1, γO = γG = 2, and μ = 0; then while .
We find that the spectral radius of the next generation matrix (eq. 10) is
(12) |
It is helpful here to think of KO|O + KG|G as the R0 of the system under the balance condition that KO|OKG|G = KO|GKG|O, that is when the product of the same-site transmission terms of the next generation matrix is equal to the product of the analogous cross-site terms. Hence, more generally, R0 is KO|O + KG|G modified by a term that accounts for how balanced same-site and cross-site infections are, which naturally leads us to the question of heterogeneity. The balance term is, in fact, a measure of balance in same-site and cross-site transmission rates, not just in the expectations of number of infected as a cursory inspection of eq. 12 would suggest:
(13) |
Although all parameters affect the value of R0, it is the relationship between the transmission parameters that affects the structure of R0, e.g. whether R0 is taking the maximum of the same-site terms, the geometric average of the cross-site terms, or some sort of intermediate behavior. Hence, understanding the effects of heterogeneity in the transmission parameters is necessary to understand the dynamics of the model.
In the next sections, we explore the how the two major unique features of this model—(i) the interplay of same- and cross-site transmission and (ii) autoinoculation—each affect R0, focusing first on how heterogeneity in same-site and cross-site transmission affects R0 and related threshold parameters. We then explore how the added complexity derived from autoinoculation may affect estimates of R0 in practice.
2.2. Limiting cases and transmission heterogeneity
The basic reproduction number displays some interesting limiting behavior. As either KG|O or KO|G, but not necessarily both, go to 0, R0 goes to max(KO|O, KG|G). As both KO|O and KG|G go to 0, R0 goes to . Hence, the basic reproduction number can be thought of as occupying a space between taking the maximum and taking a geometric average, displaying behavior more similar to one or the other depending on the type and strength of transmission pathways. Note that these limiting cases are already suggestive of the effect of heterogeneity on the same-site and cross-site terms, which we will explore further below. Considering a different limiting case, we note that as the autoinoculation parameters νGO and νOG go to zero, the basic reproduction number becomes
(14) |
which is a significant reduction in complexity.
Without making these significantly simplifying assumptions, we can still consider heterogeneity, as noted above, in the same-site and cross-site terms of the next generation matrix. Heterogeneity can be seen as a kind of weakly limiting case.
Proposition 2.1
For a fixed total KO|G + KG|O, R0 is largest when KO|G = KG|O. For a fixed total KO|O + KG|G, R0 is smallest when KO|O = KG|G.
This proposition is clear by basic calculus on eq. 12. For a fixed sum KO|G + KG|O, the product KO|GKG|O is largest when KO|G = KG|O, meaning that R0 is largest when the two sites have the same expected number of secondary cross-site infections. So, R0 is large when the cross-site terms are similar and is small when they are different. Similarly, in the formula for R0 (eq. 12), the product KO|OKG|G has the opposite sign from KO|GKG|O, meaning that, for a fixed sum KO|O + KG|G, heterogeneity in the terms increase R0. That is, R0 is large when the expected number of same-site infections is very different for the oral and genital sites and small when they are similar. Heterogeneity in the expected site-specific secondary infections, i.e. elements of the next generation matrix, thus, has a very different effect for same-site than for cross-site infection.
To better understand the effect of heterogeneity in the biological transmission parameters, we derive some analytic results for a special limiting case—identical sites—and then consider, through simulation, how the general model deviates from the limiting case.
2.2.1. Identical site model
In order to simplify the analysis, consider the scenario in which the two sites, although they may have different transmission parameters, have identical clearance and autoinoculation parameters. While a simplification for many diseases, this one is biologically realistic for infections such as athlete’s foot or conjunctivitis.
For this section, γO = γG and νOG = νGO. Hence, τO = τG, pO = pG, and qO = qG. We drop the superscripts for the rest of this section, except from τOG, which is distinct from τO = τG = τ. Here
(15) |
and
(16) |
Proposition 2.2
For a fixed total transmission βOO + βGO + βOG + βGG = k, the extrema of R0 of the identical site model are on the boundary of {βOO ≥ 0, βGG ≥ 0, βOG ≥ 0, βGO ≥ 0}. Moreover, R0 is constant on the planes described by { } and { }. These planes partition the constrained parameter space into two regions of higher and two regions of lower R0.
The proof uses the method of Lagrange multipliers and is left to the appendix. These partitioning planes contain three lines of interest: {βOO = βGG, βOG = βGO}, {βOO = βGO, βGG = βOG}, and {βOO = βOG, βGG = βGO}. Along these lines the following are true respectively: KO|O = KG|G and KO|G = KG|O; KO|O = KG|O and KG|G = KO|G; and KO|O = KO|G and KG|G = KG|O.
We wish to visualize the values of R0 on the surface βOO + βGO + βOG + βGG = k in the four dimensional transmission parameter space. In Figure 3, we plot slices of two three-dimensional projections of this surface, with and without the partitioning planes. In this case, the values of R0 are symmetric in the βOG and βGO coordinates as well as in the βOO and βGG coordinates. Simulations were coded in MATLAB [23]. For the purposes of illustration, clearance (γ) and autoinoculation (ν) parameters were set to 1 (matching realistic HPV clearance times of about 1 year [24, 25] and assuming relatively rare autoinoculation events), and transmission rates are as given in the axes of Figure 3.
We see that, in the (βOO, βGG)-plane (the lower boundary of the figure), R0 has a minimum, and, in the (βOG, βGO)-plane (the z-axis in the figure) R0 has a maximum. These results are consistent with our analysis above of the impact of heterogeneity of the same-site and cross-site terms of the next generation matrix, highlighting the fact that R0 is maximized by heterogeneity in same-site transmission and homogeneity in cross-site transmission.
In fact, the planes partition the space into four regions, two of which have larger R0 (toward primarily same-site transmission) and two of which have smaller R0 (toward primarily cross-site transmission). The global extrema of R0 are achieved when there is only one transmission pathway. Indeed, if all transmission is in one pathway with rate k, then eq 12 simplifies (via eqs 15 and 16) to
(17) |
if that pathway is same-site and
(18) |
if that pathway is cross-site. On the partitioning planes of Proposition 2.2, we have
(19) |
2.2.2. Deviations from the identical site model
The equations to solve for the partitioning planes in the general case are intractable. However, numerical simulation gives a reasonably clear picture as to how R0 behaves under deviations from the identical site model. We examine the effect that changing the autoinoculation and recovery parameters has on the values of R0 in the constrained transmission parameter space.
Increasing from 1 (increasing γG to 1.2 while keeping γO and νOG = νGO constant) moves the intersection of the three lines of partitioning planes so that βOO < βOG = βGO < βGG as seen in Figure 4, which has the same parameters as Figure 3 except for a larger γG. Here, the values of R0 are symmetric in the βOG and βGO coordinates, but, unlike the identical site model, the values are not symmetric in βOO and βGG. The planes move to balance the faster clearance in the compartment with a higher transmission rate. Note that by increasing one clearance parameter relative to Figure 3, R0 is reduced overall. Increasing both parameters but not changing their ratio would have rescaled the values of R0 in Figure 3 but not broken its symmetries.
Increasing from 1 (increasing νOG to 1.2) and keeping the recovery parameters constant, moves moves the intersection of the three lines of the partitioning planes so that βOG < βGG = βOO < βGO as in Figure 5. Now, the values of R0 are symmetric in βOO and βGG but no longer in βOG and βGO. Here, the surface moves to balance the movement in either direction along the infection cycles; that is, we compensate for an increased νOG by also increasing its complement in the cycle, βGO, and similarly for the other direction. In Figure 5, we see that, as we have only increased one autoinoculation parameter relative to Figure 3, autoinoculation acts like transmission to increase R0 overall, which is as expected.
In the identical site model, the three lines of the partitioning planes corresponded to equality of pairs of next generation matrix terms. These equalities do not hold along the corresponding lines of the partitioning planes when γO ≠ γG or νOG ≠ νGO.
Although the position and orientation of the partitioning planes change, the overall qualitative behavior does not. That is, regardless of the values of the recovery and autoinoculation parameters, the extrema of R0 lie on the boundaries of the positive parameter space, and moving off the partitioning plane increases or decreases R0 depending on which of the four regions the movement is into. Moving toward primarily same-site transmission increases R0 while moving toward primarily cross-site transmission decreases it.
2.3. Type and target reproduction numbers
Let us next consider the type reproduction number for, without loss of generality, the genital site in the full two-site model (eq. 1), calculated as in Shuai et al. [7]. The type reproduction number for genital infections is, provided KOO < 1,
(20) |
That is, if the oral site is not an infection reservoir, here meaning that the expected number of oral infections from an oral infection is less than one, then the infection can be controlled by permanently preventing genital infection in a fraction of the population greater than
(21) |
Note the reappearance of the balance term. Further, the possible deleterious effects of model misspecification can be seen here. If one is using a model with only genital infection, one will estimate the fraction of the population in which need to prevent genital infections as , which may differ significantly from the true fraction. We should be careful to realize, however, that the estimates of βGG here may be biased as another result of misspecification, which makes direct comparison of the fractions under the two models difficult.
Returning to the two-site model, if we seek to control only genital-to-genital infections, perhaps through condom use, the target reproduction number is
(22) |
provided KO|O + KO|GKG|O < 1. That is, the infection will be controlled if we prevent genital to genital infection by a fraction of more than
(23) |
That this latter endeavor is more challenging is seen in the reduction in the denominator between the type and target reproduction numbers described here. The two numbers are the same only if either KO|G or KG|O is zero. That these fractions are the same when KG|O is zero is sensible, since all genital transmission in that case will be same-site transmission. It is not so obviously true, heuristically, in the case that KO|G is zero.
We also include the target reproduction number for either of the cross-site transmissions solely. The target reproduction number, provided both KO|O and KG|G are less than 1, is
(24) |
so that the infection will be controlled if we prevent genital to oral transmission by more than
(25) |
Again, to control only genital to oral transmission is easier than controlling all genital infections, and we see that the denominators of eqs. 23 and 25 add up to the denominator of eq. 21. The latter fact is a nice way of expressing that, in this case,
(26) |
In each of TG, TGG, and TOG, KO|G and KG|O only appear as a product. That each depends upon the product instead of the sum or some other function of KO|G and KG|O cannot be easily understood without this analysis. This result has implications for disease control. For instance, a product can more effectively be reduced by reducing only one of the terms than a sum can be. Further, the trade-off structure, if one were able to reduce one term at the expense of another, is very different. That KO|G and KG|O only appear as a product also has implications for the impact of heterogeneity in the terms as it did with the basic reproduction number. In each case, heterogeneity in the cross-site terms decreases the reproduction number. The impact of heterogeneity in KO|O and KG|G is less clear for TG and TGG, though heterogeneity in these same-site terms, unlike for the basic reproduction number, decreases TGO.
2.4. Effects of autoinoculation
In addition to the effects of transmission pathway heterogeneity, we noted in eqs. 1 and 11 that autoinoculation is a key feature of the model, adding significant complexity to both the model equations and R0. Autoinoculation has been noted to be a potentially important pathway for cervical and oral infections [10], and, indeed, a history of warts at different sites (including a range of non-genital sites) has been shown to be associated with increased risk of genital HPV infections [26]. However, the role of autoinculation in oral and genital infections is not yet fully understood. The added complexity that autoinoculation brings to R0 also brings the potential for significant misestimation of R0 if the underlying disease process includes autoinoculation but a model neglecting the autoinoculation pathway is used for fitting the data. In eq. 11 in particular, we see that when the autoinoculation terms are nonzero, the actual total same-site cases generated has the potential to be significantly higher than would be expected based on the single-site R0s that do not consider autoinoculation, i.e. we are in general likely to have and .
To illustrate the potential for errors in R0 estimates due to model misspecification, we generated simulated data using the full model (including autoinoculation) and normally distrbuted error, and then fitted this data using least squares, assuming a reduced model with νOG = νGO = 0. For all model simulations we assumed background birth/deaths in the population were negligible (μ = 0) for simplicity. As shown in Figure 6, both models yield quite similar trajectories for both genital and oral prevalence, however the R0 of the model underlying the data was 4.0 while the estimate of the misspecified (no autoinoculation) model was only 1.25. In general, we found that simulations for a range of parameter values underestimated R0 when autoinoculation was not included in the model. The full model parameters in Figure 6, which were not based on data but rather were chosen to illustrate the potential for mispecification within biologically reasonable bounds, were: βOO = 3, βOG = 1, βGO = 0, βGG = 1, γO = 2, γG = 2, νOG = 5, νGO = 4). While values for the transmission and autoinoculation parameters are largely unknown, these reflect what information there is on clearance times [27, 28, 29]. The resulting estimates for the reduced model when fitted to the data from the full model were βOO = 11.5, βOG = 9.9, βGO = 2.0, βGG = 0, γO = 10.6, γG = 8.4. To be sure this effect was a result of misspecification rather than of fitting, we also fit the full model to the data, yielding nearly identical fits as for the misspecified model and the following parameter estimates: βOO = 2.5, βOG = 1.3, βGO = 0.3, βGG = 0.9, γO = 2.0, γG = 1.9, νOG = 4.3, νGO = 4.4. While we do not recover the original parameters exactly, the R0 estimate of 4.1 is a good approximation.
3. Two-site model with heterogeneous contacts
Here we extend the two-site model with homogeneous contacts (homo- or pansexual population or nonsexual infection) to one with heterogeneous contacts (in particular, a heterosexual population). This model has many analogous features to the model homogeneous contacts, but also has an added layer of complexity from the vector–host-like dynamics (since new infections can only occur from a member of the opposite sex). For sex i ∈ {M, F} with oral (O) and genital (G) sites and i ≠ j, we have the following equations:
(27) |
Here, the parameters are the same as in Table 1 except that we additional need to denote that transmission as originating with one of the sexes. In particular, are transmission rates from site X in sex i to site Y in sex j and, as before, are clearance rates for sex i and site X and are autoinoculation rates in sex i from site X to site Y. The disease free-equilibrium is . Then the matrix F has the form
(28) |
where
(29) |
The V matrix has the form
(30) |
where
(31) |
Then V −1 has the form
(32) |
where
(33) |
and the τ are average waiting times, neglecting further infection, in the compartments (as in eq. 5), and the pi and qi are probabilities for sex i that the next compartment transfer will be to go to or from , respectively (as in eq. 6).
The next generation matrix K = FV −1 thus has the form
(34) |
where
(35) |
and
(36) |
(37) |
(38) |
(39) |
(40) |
(41) |
Hence, we find the basic reproduction number is
(42) |
where
(43) |
We see that the quantity R is the R0 of the system under the either (or both) of the balance conditions and . The balance terms appearing in the form of R0 are analogous to the one we saw in the pansexual model above, but we now have one term for each sex instead of one for the whole population.
The influence of the two-site and the heterosexual aspects of the heterosexual two-site model can be clearly seen in the form of R0 here. The hallmark of the one-site heterosexual transmission model, similarly to vector–host models, is the geometric average of the cross-sex elements of the next generation matrix: for the the model
(44) |
we find
(45) |
where . In the two-site heterosexual model, although we do not have direct geometric averaging, some patterns are preserved: in eq. 42, products (cycles) of the cross-sex elements replace the terms of the homogeneous-contacts model (eq. 12) and a square root is taken at the end, as with geometric averaging. In fact, we recover geometric averaging under certain limiting cases, as shown below.
3.1. Limiting cases
Here we consider two limiting cases: i) independence of sites and ii) one site is a “dead-end” infection. In the first case, we assume that there is no cross-transmission (βOG = βGO = 0) and no autoinoculation (νOG = νGO = 0). Under these assumptions, the two sites are essentially independent as KO|G = KG|O = 0, and the basic reproduction number is
(46) |
That is, since there is no autoinoculation, R0 is the maximum of the of the basic reproduction number for the model with a heterosexual population and only oral sites and the basic reproduction number of the model with only genital sites .
In the second case, we assume, without loss of generality, that the oral sites are a “dead-end” infection, that is, oral sites do not transmit the infection (βOG = βOO = νOG = 0). Then KO|O = KO|G = 0, and, letting ,
(47) |
under these assumptions. Thus, despite νGO ≡ 0, R0 is the same (c.f. eq. 45) as the basic reproduction number of the model with only genital sites, . This result is intuitive, as additional time spent in compartments IO or IOG over the infective lifetime will not produce any additional infections.
3.2. Type reproduction number
The type reproduction number for either gender, assuming the other is not a reservoir, is the square of the basic reproduction number. This result is sensible because the system is similar to vector–host models for which similar results hold [5]. The type reproduction number for, without loss of generality, the genital site, as long as the oral sites are not a reservoir, that is, as long as , is
(48) |
where
(49) |
(50) |
Although the form is similar to that of the basic reproduction number (eq. 42), there is no final square root because the type reproduction number considers cycles in the transmission graph (which is bipartite for a heterosexual population) rather than geometrically averaging over both transmission directions, and there is a term, , which takes into account the strength of the infection at the other site.
4. Conclusions
In this work, we developed a model of a multisite infectious disease and derived expressions for the basic reproduction number under a number of different assumptions and limiting cases. To the best of our knowledge, it is the first analysis of such a model. Although developed with HPV in mind, it may also be relevant to other infections such as HSV and chlamydia, among others.
We find that autoinoculation adds considerable complexity to the analysis of two-site models, as shown in eqs. 1 and 11. Thus, there is a possibility that neglecting autoinoculation may potentially result in severe model misspecification, that is, making incorrect estimations and conclusions as a direct result of using a model that does not fully capture the dynamics. Fortunately, the analysis of the two-site model with homogeneous contacts gives a reasonably clear picture of how the relative magnitudes of the autoinoculation parameters (and, similarly, the clearance parameters) change the impact of heterogeneity in the same-site and in the cross-site transmission parameters (Figures 4 and 5), with heterogeneity in autoinoculation and clearance breaking the symmetry of the identical site model to alter the high- and low-risk regions defined by the partitioning planes. In general, increasing overall autoinoculation rates also increased R0 while increasing clearance decreased it (also shown in Figures 4 and 5). Figure 6 shows the potential for significant error in estimating R0 if the underlying disease process includes autoinoculation but this pathway is neglected in the model. While both the full model and the model lacking autoinoculation generate quite similar oral and genital prevalence trajectories, the true R0 for the simulated data using the full model was 4, while the estimated R0 with the reduced, misspecified model was 1.2. Indeed, the individual transmission and clearance parameters for the misspecified model were quite different from the true values, including altering the relative balance of the transmission pathways. This highlights the strong importance of further work uncovering the potential role of autoinoculation between oral and genital HPV infections, and more generally examining the relative balance of autoinoculation and person-to-person transmission in generating new infections at different sites, particularly as autoinoculation via warts at distal sites has been shown to play a role in increasing genital HPV infection risk [26]. This example also more generally highlights problems that are likely to arise when fitting an incomplete or mispecified model to data, particularly if not all transmission routes are accounted for in the model, as in this case.
Regardless of the magnitudes of the autoinoculation and clearance parameters, heterogeneity in the cross-site next generation matrix terms decreases R0 while heterogeneity in the same-site next generation matrix terms increases it. For a fixed total transmission rate, the extrema of R0 occur when transmission is predominantly through one transmission pathway: the extremum is a maximum when that transmission pathway is same-site and a minimum when it is cross-site. Moreover, the constrained transmission parameter space is partitioned by two planes into four regions in which R0 is either larger or smaller than its value on the partition.
That heterogeneity affects the pathways differently is a departure from the classical association between increasing heterogeneity and larger R0 [30, 3]. Robertson et al. [22] investigated the effect of heterogeneity in transmission pathways for a waterborne disease model, considering heterogeneity in the direct (person-to-person) and indirect (person-to-water-to-person) pathways among different communities as well as their relative contributions. They found that although it was possible to have high heterogeneity and a low R0 by minimizing the connectendess of communities (i.e. a low indirect transmission), their measure of heterogeneity, namely the variance of the direct transmission plus twice the covariance of the direct and indirect transmission, was predictive of and increased with R0. Our results, then, suggest that the effects of heterogeneity in transmission pathways can be very dependent on the structure of the transmission pathways and, thus, should be investigated more broadly.
We are able to comment on the effects of heterogeneity in the clearance and autoinoculation parameters as well. Heterogeneity in the clearance parameters breaks the symmetry between the two same-site transmission terms when considering R0 on the plane βOO + βGG + βOG + βGO = k, and heterogeneity in the autoinoculation parameters breaks the symmetry in the cross-site transmission terms. In particular, increasing γG relative to γO (genital sites clear faster) moves the intersection of the three lines of the partitioning planes so that βOO < βOG = βGO < βGG. That is, for a fixed sum βOO + βGG, R0 achieves its minimum at a point where βOO < βGG. Increasing νOG relative to νGO moves the intersection point so that βOG < βOO = βGG < βGO. Thus for a fixed total βOG + βGO, R0 achieves its maximum at a point where βGO is greater than βOG.
One interesting feature of the effects of heterogeneity on R0 is that we found partitioning planes both for the identical sites model (Figure 3) and general model (Figures 4 and 5), which break the parameter space into two high-R0 regions and two low-R0 regions. This suggests that in general we may not need precise estimates of the different transmission parameters in order to know the rough level of risk for a particular community. These high and low risk regions can give us approximate information about R0 based only on the overall balance and relative strength or dominance of the different transmission pathways. As exact estimates of the transmission parameters are often difficult to determine empirically (compared to estimates of clearance times for example), this sort of relative understanding of the transmission pathways may be easier to determine in practice. This could also potentially help in guiding interventions, by informing which alternative behaviors or intervention strategies are most likely to shift the balance of transmission towards a lower-risk region (for example by promoting oral sex over genital sex to shift from the right high-risk region to the top low-risk region in Figures 3 – 5).
The analysis of the type and target reproduction numbers for the model with homogeneous contacts gives a quantitative way to describe how much more difficult it is to control an infection by targeting a specific pathway rather than a whole site. This may be relevant in the context of HPV, when considering condom use and vaccination as controls. Further, we note the importance of the product of the cross-site infection terms to the basic, type, and target reproduction numbers. In the pansexual model, KO|G and KG|O never appear anywhere but as the product KO|GKG|O. This structure drives the results about cross-site heterogeneity. It may also have implications for the efficacy of different control methods, since a product and a sum are affected by their terms differently and have different trade-off structures.
Although extending the pansexual two-site model to a heterosexual one adds even more complexity, we can see the contributions of the two-site aspect, which balances between taking a maximum and taking a geometric average, and the heterosexual aspect, which incorporates vector–host-like dynamics by taking a square root of transmission cycles between the two sexes. Further, the behavior of the model near limiting cases is clear. However, further analysis is needed to fully flesh out the dynamics of the model more generally.
These results could be generalized to models with more than two sites, such as oral/genital/anal sites for HPV, oral/genital/ocular sites for chlamydia, or more broadly for warts/viral papillomas at a range of epithelial sites. Because of the shape of the F matrix, the R0 of an n-site model will be the largest eigenvalue of the n × n submatrix of K = FV−1 containing only those rows and columns corresponding to infection at a single site. Thus, combinatorial tools might be helpful to study R0 for multisite models in this generalized context. In general, we conjecture that the distinct effects of same-site vs. cross-site transmission heterogeneity will persist as we expand to multiple sites, as may analogs of our results regarding partitioning planes making general regions of high and low risk depending on the balance of the transmission terms. More generally, there are several useful directions for future work with this or similar multi-site models, both in understanding the dynamics and connecting the model to data. Once relevant parameter values have been reported clinically (or data sufficient for parameter estimation recorded), this model will be useful for understand the evolution of disease prevalence in time. Sensitivity of the model to the parameters, then, will become an important consideration in future work. In the meantime, a comprehensive global sensitivity analysis would be a useful first step. Additionally, while we focused on analysis of R0 here, future contributions may include a broader dynamical analysis of multi-site disease models.
Supplementary Material
Highlights.
We analyze the basic reproduction number R0 of a multisite infection model.
Heterogeneity between same site transmission rates increases R0.
Heterogeneity between cross site transmission rates decreases R0.
Neglecting autoinoculation may result in severe misestimation of R0.
Acknowledgments
This work was supported by NIH grant 1U01CA182915-01A1 and the University of Michigan MCubed program.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Diekmann O, Heesterbeek JAP, Metz JAJ. On the definition and the computation of the basic reproduction ratio R 0 in models for infectious diseases in heterogeneous populations. Journal of Mathematical Biology. 1990:365–382. doi: 10.1007/BF00178324. [DOI] [PubMed] [Google Scholar]
- 2.Anderson RM, May RM. Infectious diseases of humans: dynamics and control. Oxford University Press; New York: 1991. [Google Scholar]
- 3.Diekmann O, Heesterbeek J. Mathematical epidemiology of infectious diseases: model building, analysis, and interpretation. New York John Wiley; Chichester: 2000. [Google Scholar]
- 4.van den Driessche P, Watmough J. Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Mathematical Biosciences. 2002;180:29–48. doi: 10.1016/s0025-5564(02)00108-6. [DOI] [PubMed] [Google Scholar]
- 5.Roberts MG, Heesterbeek JAP. A new method for estimating the effort required to control an infectious disease. Proceedings of the Royal Society B Biological Sciences. 2003;270:1359–64. doi: 10.1098/rspb.2003.2339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Heesterbeek JAP, Roberts MG. The type-reproduction number T in models for infectious disease control. Mathematical Biosciences. 2007;206:3–10. doi: 10.1016/j.mbs.2004.10.013. [DOI] [PubMed] [Google Scholar]
- 7.Shuai Z, Heesterbeek JAP, van den Driessche P. Extending the type reproduction number to infectious disease control targeting contacts between types. Journal of Mathematical Biology. 2013;67:1067–82. doi: 10.1007/s00285-012-0579-9. [DOI] [PubMed] [Google Scholar]
- 8.Goodman MT, Shvetsov YB, McDuffie K, Wilkens LR, Zhu X, Thompson PJ, Ning L, Killeen J, Kamemoto L, Hernandez BY. Sequential acquisition of human papillomavirus (HPV) infection of the anus and cervix: the Hawaii HPV Cohort Study. The Journal of Infectious Diseases. 2010;201:1331–9. doi: 10.1086/651620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Edelstein ZR, Schwartz SM, Hawes S, Hughes JP, Feng Q, Stern ME, O’Reilly S, Lee S-K, Fu Xi L, Koutsky LA. Rates and determinants of oral human papillomavirus infection in young men. Sexually transmitted diseases. 2012;39:860–7. doi: 10.1097/OLQ.0b013e318269d098. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Steinau M, Hariri S, Gillison ML, Broutian TR, Dunne EF, Tong Z-y, Markowitz LE, Unger ER. Prevalence of cervical and oral human papillomavirus infections among US women. Journal of Infectious Diseases. 2014;209:1739–43. doi: 10.1093/infdis/jit799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ribes JA, Steele AD, Seabolt JP, Baker DJ. Six-year study of the incidence of herpes in genital and nongenital cultures in a Central Kentucky Medical Center patient population. Journal of Clinical Microbiology. 2001;39:3321–3325. doi: 10.1128/JCM.39.9.3321-3325.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Beauman JG. Genital herpes infection: a review. American Family Physician. 2005;72:1527–1534. [PubMed] [Google Scholar]
- 13.Fatahzadeh M, Schwartz RA. Human herpes simplex virus infections: Epidemiology, pathogenesis, symptomatology, diagnosis, and management. Journal of the American Academy of Dermatology. 2007;57:737–763. doi: 10.1016/j.jaad.2007.06.027. [DOI] [PubMed] [Google Scholar]
- 14.Jones RB, Rabinovitch Ra, Katz BP, Batteiger BE, Quinn TS, Terho P, Lapworth Ma. Chlamydia trachomatis in the pharynx and rectum of heterosexual patients at risk for genital infection. Annals of Internal Medicine. 1985;102:757–762. doi: 10.7326/0003-4819-102-6-757. [DOI] [PubMed] [Google Scholar]
- 15.Stenberg K, Mårdh PA. Genital infection with Chlamydia trachomatis in patients with chlamydial conjunctivitis: unexplained results. Sexually transmitted diseases. 1991;18:1–4. doi: 10.1097/00007435-199101000-00001. [DOI] [PubMed] [Google Scholar]
- 16.Rank RG, Yeruva L. Hidden in plain sight: Chlamydial gastrointestinal infection and its relevance to persistence in human genital infection. Infection and Immunity. 2014;82:1362–1371. doi: 10.1128/IAI.01244-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Calderone RA, Fonzi WA. Virulence factors of Candida albicans. Trends in microbiology. 2001;9:327–335. doi: 10.1016/s0966-842x(01)02094-7. [DOI] [PubMed] [Google Scholar]
- 18.Stulberg DL, Hutchinson AG. Molluscum contagiosum and warts. American Family Physician. 2003;67:1233–140. [PubMed] [Google Scholar]
- 19.May RM, Anderson RM. Transmission dynamics of HIV infection. Nature. 1987;326:137–142. doi: 10.1038/326137a0. [DOI] [PubMed] [Google Scholar]
- 20.Adler FR. The Effects of Averaging on the Basic Reproductive Ratio. Mathematical Biosciences. 1992;111:89–98. doi: 10.1016/0025-5564(92)90080-g. [DOI] [PubMed] [Google Scholar]
- 21.Neri FM, Bates A, Füchtbauer WS, Pérez-Reche FJ, Taraskin SN, Otten W, Bailey DJ, Gilligan CA. The effect of heterogeneity on invasion in spatial epidemics: from theory to experimental evidence in a model system. PLoS Computational Biology. 2011;7:e1002174. doi: 10.1371/journal.pcbi.1002174. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Robertson SL, Eisenberg MC, Tien JH. Heterogeneity in multiple transmission pathways: modelling the spread of cholera and other waterborne disease in networks with a common water source. Journal of Biological Dynamics. 2013;7:254–75. doi: 10.1080/17513758.2013.853844. [DOI] [PubMed] [Google Scholar]
- 23.MATLAB, version R2013a. The MathWorks Inc; Natick, Massachusetts: 2013. [Google Scholar]
- 24.Molano M, van den Brule A, Plummer M, Weiderpass E, Posso H, Arslan A, Meijer CJ, Muñoz N, Franceschi S, et al. Determinants of clearance of human papillomavirus infections in colombian women with normal cytology: a population-based, 5-year follow-up study. American journal of epidemiology. 2003;158:486–494. doi: 10.1093/aje/kwg171. [DOI] [PubMed] [Google Scholar]
- 25.Franco EL, Villa LL, Sobrinho JP, Prado JM, Rousseau MC, Désy M, Rohan TE. Epidemiology of acquisition and clearance of cervical human papillomavirus infection in women from a high-risk area for cervical cancer. Journal of Infectious Diseases. 1999;180:1415–1423. doi: 10.1086/315086. [DOI] [PubMed] [Google Scholar]
- 26.Hernandez BY, Shvetsov YB, Goodman MT, Wilkens LR, Thompson PJ, Zhu X, Tom J, Ning L. Genital and extra-genital warts increase the risk of asymptomatic genital human papillomavirus infection in men. Sexually transmitted infections. 2011;87:391–395. doi: 10.1136/sti.2010.048876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Ho G, Bierman R. Natural history of cervicovaginal papillomavirus infection in young women. New England Journal of Medicine. 1998;338:423–428. doi: 10.1056/NEJM199802123380703. [DOI] [PubMed] [Google Scholar]
- 28.Giuliano AR, Lee JH, Fulp W, Villa LL, Lazcano E, Papenfuss MR, Abrahamsen M, Salmeron J, Anic GM, Rollison DE, Smith D. Incidence and clearance of genital human papillomavirus infection in men (HIM): a cohort study. Lancet. 2011;377:932–40. doi: 10.1016/S0140-6736(10)62342-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kreimer AR, Campbell CMP, Lin HY, Fulp W, Papenfuss MR, Abrahamsen M, Hildesheim A, Villa LL, Salmerón JJ, Lazcano-Ponce E, Giuliano AR. Incidence and clearance of oral human papillomavirus infection in men: the HIM cohort study. Lancet. 2013;6736:1–11. doi: 10.1016/S0140-6736(13)60809-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Dushoff J, Levin S. The effects of population heterogeneity on disease invasion. Mathematical Biosciences. 1995;128:25–40. doi: 10.1016/0025-5564(94)00065-8. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.