Skip to main content
Springer logoLink to Springer
. 2025 Jul 26;91(2):21. doi: 10.1007/s00285-025-02241-w

Re-examining the drivers of variation in parasite loads across hosts in the Tallis-Leyton model

R McVinish 1,
PMCID: PMC12296923  PMID: 40715832

Abstract

The Tallis-Leyton model is a simple model of parasite acquisition where parasites accumulate in the host without affecting the host’s mortality, or eliciting any immune reaction from the host. Furthermore, the parasites do not reproduce in the host. We examine how the variability in parasite loads among hosts is affected by the rate of infectious contacts, the distribution of parasite entering the host during infectious contacts, the host’s age, and the distribution of parasite lifetimes. Motivated by empirical studies in parasitology, variability is examined in the sense of the Lorenz order and related metrics. Perhaps counterintuitively, increased variability in the distribution of parasite lifetimes is seen to decrease variability in the parasite loads among hosts.

Keywords: Aggregation, Convex order, Gini index, Lorenz order, Negative binomial distribution, Pietra index

Introduction

The distribution of parasites among their host population typically displays a high degree of variation; some hosts are infected with many parasites while many hosts have comparatively few. This phenomenon is almost universally observed in wild populations (Shaw and Dobson 1995; Poulin 2007). Following the usual practice in the parasitology literature, we call this phenomenon aggregation (Pielou 1977; Wilson et al. 2001; Poulin 2011).

Unfortunately, there is no universally accepted measure of aggregation. Instead, different authors use different metrics of aggregation to summarise the parasite’s distribution (Pielou 1977; McVinish and Lester 2020; Morrill et al. 2023). Despite claims that these measures have identical interpretations and more-or-less predict each other (Reiczigel et al. 2014), different methods can give opposing answers (McVinish and Lester 2020, Figure 1). The most commonly used measures of aggregation in theoretical models are the variance-to-mean ratio (Isham 1995; Barbour and Pugliese 2000; Herbert and Isham 2000; Peacock et al. 2018) and the k parameter of the negative binomial distribution (Anderson and May 1978a, b; Rosà and Pugliese 2002; Schreiber 2006; McPherson et al. 2012), where the negative binomial distribution has mean m and variance Inline graphic. Both of these measures can be interpreted as quantifying how over dispersed the distribution of parasite load is relative to a Poisson distribution.

An alternative view of aggregation was put forward by Poulin (1993), arguing that a measure of the discrepancy between the observed distribution of parasites in the hosts and the ideal distribution where all hosts are infected with the same number of parasites would be the best measure of aggregation. This view puts the Lorenz ordering of distributions (Lorenz 1905; Arnold and Sarabia 2018) central in the study of aggregation. While the coefficient of variation is perhaps the most widely known measure having a direct relationship to the Lorenz order (Arnold and Sarabia 2018, Sections 5.2.1 & 5.4), Poulin (1993) proposed using a different index, D, as a measure of this discrepancy. It can be shown that, up to a factor that goes to one as the sample size increases, Poulin’s D is Gini’s concentration ratio (Gini 1914, 2005), also known as the estimator of the Gini index (Arnold and Sarabia 2018, Equation 5.85). Poulin’s D has since become one of the standard measures of aggregation used in studies of wild parasite populations (Herrero-Cófreces et al. 2021; Rodríguez-Hernández et al. 2021; Schrock et al. 2025).

The aim of this paper is to characterize how different processes in the Tallis-Leyton model (Tallis and Leyton 1969) shape parasite aggregation in the sense of the Lorenz ordering and related indices. Only population values of these indices are considered, rather than their estimators. Despite the growing importance of the Lorenz order in empirical studies of parasite distributions following Poulin’s proposal, we are unaware of any other theoretical study of parasite acquisition using the Lorenz order. In Sect. 2 we review some background on the Lorenz ordering and the closely related convex ordering of distributions. The Tallis-Leyton model is analysed in Sect. 3. We first show show how the host’s parasite load can be represented as a compound Poisson distribution. This representation is then applied to determine how each of the model parameters affect the Lorenz ordering of the distribution of parasites in the host. The final part of the analysis shows that the host’s parasite load is asymptotically normally distributed in the limit as the rate of infectious contacts goes to infinity. This allows the indices to be approximated in terms of the mean and variance. The paper concludes with a discussion of future challenges in analysing models of parasite aggregation.

Background

Tallis-Leyton model

Tallis and Leyton (1969) proposed the following model for the parasite load M(a) of a definitive host at age a, conditional on survival of the host to age a. The host is parasite free at birth so Inline graphic. During its lifetime, the host makes infectious contacts following a Poisson process with constant rate Inline graphic. At each infectious contact, a random number of parasites N enter the host. Once a parasite enters the host, it survives for a random period of time T. The lifetimes of parasites, numbers of parasites entering the host at infectious contacts, and the Poisson process of infectious contacts are all independent. The parasites are assumed to have no effect on the host mortality so the host’s parasite load at age a is independent of the host surviving to age a. Henceforth, we will simply refer to M(a) as the parasite load of a host age a. Although we won’t make use of this fact, we note that this process also describes an infinite server queue with bulk arrivals and general independent service times (Holman et al. 1983).

Let Inline graphic, Inline graphic and Inline graphic denote the probability generating function (PGF), distribution function, and survival function of a random variable X. We write Inline graphic for the PGF of M(a). Tallis and Leyton (1969) showed

graphic file with name d33e398.gif 1

Using the well known relationship between the moments of a random variable and derivatives of its PGF at zero, we see that the mean and variance of M(a) are

graphic file with name d33e412.gif 2
graphic file with name d33e418.gif 3

Hence, the variance-to-mean ratio is

graphic file with name d33e425.gif 4

Assuming Inline graphic and Inline graphic, the limiting distribution of parasite load as Inline graphic exists and has PGF

graphic file with name d33e451.gif 5

An appropriate rescaling of the host age, rate of infectious contacts, and parasite lifetimes leaves the distribution of the host’s parasite load unchanged. Specifically, for any Inline graphic let Inline graphic represent the parasite load of host age a in the Tallis-Leyton model with parameters Inline graphic, Inline graphic and Inline graphic. Then Inline graphic. To see this, apply the change of variable Inline graphic in the integral in (1) for the PGF of M(ca):

graphic file with name d33e514.gif 6
graphic file with name d33e520.gif 7

Upon noting Inline graphic, it follows that Inline graphic.

Convex order and Lorenz order

Lorenz (1905) proposed the Lorenz curve as a graphical measure of inequality. The following general definition of the Lorenz curve was given by Gastwirth (1971).

Definition

The Lorenz curve Inline graphic for the distribution F with finite mean Inline graphic is given by

graphic file with name d33e570.gif 8

where Inline graphic is the quantile function

graphic file with name d33e584.gif 9

Adapting the description in Arnold and Sarabia (2018, Section 3.1) to a parasitology context, the Lorenz curve L(u) represents the proportion of the parasite population infecting the least infected u proportion of the host population. When all hosts are infected with the same number of parasites, the Lorenz curve is given by Inline graphic and is called the egalitarian line. The Lorenz curve never rises above the egalitarian line, that is Inline graphic for all Inline graphic.

The Lorenz curve defines a partial order on the class of all distributions on Inline graphic with finite mean (Arnold and Sarabia 2018, Definition 3.2.1).

Definition

Let X and Y be random variables with the respective Lorenz curves denoted Inline graphic and Inline graphic. We say X is smaller in the Lorenz order, denoted Inline graphic if Inline graphic for every Inline graphic.

The negative binomial distribution, which is extensively used in parasitology, can be compared in the Lorenz order (McVinish and Lester 2024). Specifically, let Inline graphic denote the negative binomial distribution with mean m and variance Inline graphic. Then

  • (i)

    for any Inline graphic and Inline graphic, Inline graphic, and

  • (ii)

    for any Inline graphic and Inline graphic, Inline graphic.

Closely related to the Lorenz order is the convex order of distributions.

Definition

Let X and Y be two random variables such that Inline graphic. We say X is smaller than Y in the convex order, denoted Inline graphic, if Inline graphic for all convex functions Inline graphic, provided the expectations exist.

These two orderings are related since Inline graphic if and only if

graphic file with name d33e801.gif 10

for every continuous convex function Inline graphic (Arnold and Sarabia 2018, Corollary 3.2.1). In other words,

graphic file with name d33e817.gif 11

Shaked and Shanthikumar (2007, Section 3.A) provide an extensive review of results on the convex order. We briefly mention some of the important results that are used in our analysis.

  • The convex order is closed under weak limits provided the expectations also converge (Shaked and Shanthikumar 2007, Theorem 3.A.12 (c)).

  • The convex order is closed under mixtures (Shaked and Shanthikumar 2007, Theorem 3.A.12 (b)). Let X, Y, and Inline graphic be random variables and write Inline graphic and Inline graphic for the conditional distributions of X and Y given Inline graphic. If Inline graphic for all Inline graphic in the support of Inline graphic, then Inline graphic. As an application of this property we can say that if Inline graphic and Z is an independent non-negative random variable, then Inline graphic.

  • The convex order is closed under convolutions (Shaked and Shanthikumar 2007, Theorem 3.A.12 (d)). Let Inline graphic and Inline graphic be two sets of independent random variables. If Inline graphic for Inline graphic, then
    graphic file with name d33e958.gif 12
  • Combining the properties of closure under mixtures and closure under convolutions, we see the convex order is closed under random sums so
    graphic file with name d33e970.gif 13
    for any non-negative integer random variable K. As an application of the closure under random sums property of the convex order, consider two random variables K and Inline graphic that related by binomial thinning. That is, Inline graphic for some Inline graphic. Then Inline graphic (McVinish and Lester 2020, Section 3)
  • The closure under random sums property can be adapted to the case where the Inline graphic and Inline graphic are two iid sequences with Inline graphic, and Inline graphic and Inline graphic are non-negative integer random variables such that Inline graphic. In this case, (Shaked and Shanthikumar 2007, Theorem 3.A.13) implies
    graphic file with name d33e1054.gif 14
  • The survival function can be used to establish if two random variables can be compared in the convex order. If X and Y are two random variables with the same mean and Inline graphic has a single sign change from positive to negative, then Inline graphic (Shaked and Shanthikumar 2007, Theorem 3.A.44(b)). This property can also be used to characterise the convex order (Shaked and Shanthikumar 2007, Theorem 3.A.45).

Measures of aggregation

In practice, levels of aggregation are compared with numerical summaries rather than using the entire Lorenz curve. If we accept the Lorenz order as the way to compare aggregation in parasite-host systems (Poulin 1993; McVinish and Lester 2020), then our measures of aggregation should respect the Lorenz order. That is, if Inline graphic, then the measure of aggregation Inline graphic should satisfy Inline graphic. Arnold and Sarabia (2018, Chapter 5) review several inequality measures and these can be applied as measures of aggregation. We restrict our attention in this paper to the following four measures respecting the Lorenz order; the coefficient of variation, the Gini index, the Pietra index (also known as the Hoover index, or the Robin-Hood index) and Inline graphic.

The coefficient of variation is given by

graphic file with name d33e1131.gif 15

This measure is rarely used in parasitology, though it is mentioned in some reviews on parasite aggregation such as Wilson et al. (2001) and McVinish and Lester (2020). As means and variances are commonly reported in empirical studies and are often easily calculated for theoretical models, it may be useful in some contexts. For example, from Eqs. (2) and (3), the squared coefficient of variation for the Tallis-Leyton model is

graphic file with name d33e1151.gif 16

The Gini index (Gini 1914, 2005) is given by twice the area between the egalitarian line and the Lorenz curve. For a random variable X, the Gini index can be expressed as

graphic file with name d33e1171.gif 17

where Inline graphic is an independent random variable with Inline graphic (Arnold and Sarabia 2018, Page 47). The Pietra index is given by the maximum vertical distance between the egalitarian line and the Lorenz curve (Pietra 1915, 2014). McVinish and Lester (2020) argue that this index could be useful due to its simple interpretation as the proportion of the parasite population that would need to be redistributed among the hosts in order for all hosts to have the same parasite load. The Pietra index can be expressed as

graphic file with name d33e1206.gif 18

(Arnold and Sarabia 2018, Lemma 5.3.1). In general, the dependence of the Pietra index on the mean is not smooth. For example, the Pietra index for the Poisson distribution with mean Inline graphic is

graphic file with name d33e1223.gif 19

where m is the smallest integer greater than or equal to Inline graphic (Ramasubban 1958). While the Pietra index in this instance is continuous in Inline graphic, it is not differentiable with respect to Inline graphic at integer values of Inline graphic. Similar behaviour will be observed in the numerical results reported in Sect. 3.

Prevalence, the probability that a host is infected by at least one parasite, is an important quantity in parasitology (Jovani and Tella 2006; Kura et al. 2022). Although prevalence is not usually thought of as a measure of aggregation, we may express Inline graphic in terms of the Lorenz curve. From the definition of the Lorenz curve, Inline graphic if Inline graphic. From the definition of the quantile function, Inline graphic for Inline graphic. As the Lorenz curve is continuous and Inline graphic, we see

graphic file with name d33e1311.gif 20

Prevalence for the Tallis-Leyton model can be evaluated directly from the PGF as

graphic file with name d33e1318.gif 21

There is a close connection between the Pietra index and prevalence. If Inline graphic, then

graphic file with name d33e1332.gif 22
graphic file with name d33e1338.gif 23

Hence,

graphic file with name d33e1345.gif 24

More generally, the four indices are constrained by the following inequality

graphic file with name d33e1352.gif 25

(Taguchi 1968; McVinish and Lester 2020).

The Gini index and Pietra index can be further related to the coefficient of variation when the distribution of parasites is approximately normal. Suppose Inline graphic is a sequence of random variables such that

graphic file with name d33e1373.gif 26

where Inline graphic. As Inline graphic with probability one, the above limit is only possible if Inline graphic. Nevertheless, the ratio of the Gini index to the coefficient of variation still has a well defined limit. The Gini index of Inline graphic can be expressed as

graphic file with name d33e1405.gif 27

where Inline graphic is an independent random variable with Inline graphic. Since Inline graphic, the collection of random variables Inline graphic is uniformly integrable and Inline graphic. Applying the asymptotic normality and uniform integrability of the Inline graphic,

graphic file with name d33e1449.gif 28

Similarly, the Pietra index of Inline graphic can be expressed as

graphic file with name d33e1462.gif 29

Applying the asymptotic normality and uniform integrability of the Inline graphic,

graphic file with name d33e1476.gif 30

Numerical evaluation of aggregation measures from the PGF

From Eq. (16), the coefficient of variation can be relatively easily evaluated for the Tallis-Leyton model. Numerical integration of Inline graphic and Inline graphic may be required, but the dependence on age and Inline graphic is explicit. Similarly, Inline graphic could be evaluated with a single numerical integration using (21). On the other hand, evaluation of the Gini and Pietra indices require evaluation of the probability mass function. In the examples of the next section, we numerically evaluate the probability mass function of M(a) by inverting Inline graphic using the Abate-Whitte algorithm (Abate and Whitt 1992). The algorithm was implemented in MATLAB (The MathWorks Inc. 2022a) using the vpa function in the Symbolic Math Toolbox (The MathWorks Inc. 2022b) for high precision arithmetic. The code used to evaluate the indices is available from McVinish (2025).

Analysis of the Tallis-Leyton model

In this section we characterize how the different processes in the Tallis-Leyton model shape parasite aggregation in the sense of the Lorenz ordering and the related indices discussed in Sect. 2.3. The analysis begins with a representation of the host’s parasite load, M(a), as a random variable having a compound Poisson distribution. This representation is used extensively to understand how the rate of infectious contacts (Inline graphic), the distribution of the number of parasites (N) that enter the host during an infectious contact, the age of the host (a), and lifetime distribution of the parasites (T) all affect the distribution of a host’s parasite load in terms of the Lorenz order. When comparing the host’s parasite load in two systems, the parameters of the second parasite-host system is distinguished by a tilde.

Compound Poisson representation

Let n be a non-negative integer, Inline graphic, and let X(nv) denote a random variable from a Inline graphic distribution, with Inline graphic with probability 1 when Inline graphic. Our first result will represent a host’s parasite load M(a) as a random sum of independent and identically distributed random variables.

Theorem 1

Assume N has a distribution on the non-negative integers and T has a continuous distribution on Inline graphic. For Inline graphic, define V to be a random variable on Inline graphic with distribution function

graphic file with name d33e1650.gif 31

Let Inline graphic be a sequence of independent random variables with the same distribution as X(NV), where N and V are independent. Let Inline graphic be a Poisson process with rate Inline graphic that is independent of the sequence Inline graphic Then

graphic file with name d33e1697.gif 32

Proof

We first determine the PGF of X(NV). The PGF of X(nv) is Inline graphic. By conditioning on N, the PGF of X(Nv) is seen to be

graphic file with name d33e1745.gif 33

By conditioning on V and then applying the distribution function of V (31), we can write the PGF of X(NV) as

graphic file with name d33e1771.gif 34
graphic file with name d33e1777.gif 35

We now determine the PGF of the right-hand side of Eq. (32). Conditioning on Inline graphic and noting that Inline graphic is a sequence of independent random variables with the same distribution as X(NV), the PGF of Inline graphic is seen to be

graphic file with name d33e1815.gif 36
graphic file with name d33e1821.gif 37
graphic file with name d33e1827.gif 38

Upon making the substitution Inline graphic so Inline graphic, the PGF of Inline graphic can be expressed as

graphic file with name d33e1853.gif 39

which by Eq. (1) is Inline graphic. Inline graphic

Rate of infectious contacts

In this section we examine the effect of the rate of infectious contacts (Inline graphic) on the parasite aggregation. The rate of infectious contacts has no effect on the variance-to-mean ratio (4), whereas the coefficient of variation is strictly decreasing as the rate of infectious contacts increases (16). The following result shows an increase in the rate of infectious contacts decreases parasite aggregation in the sense of the Lorenz order.

Theorem 2

If Inline graphic and all other model parameters are equal, then Inline graphic.

Proof

Set Inline graphic. Let Inline graphic be a sequence of independent random variables having the same distribution as X(NV) and let Inline graphic be a sequence of independent Inline graphic random variables that are also independent of the Inline graphic. As Inline graphic and the convex order is closed under mixtures, Inline graphic. The PGF of Inline graphic is Inline graphic. Let Inline graphic be a Poisson process with rate Inline graphic. As the convex order is closed under random sums,

graphic file with name d33e1988.gif 40

By Theorem 1, Inline graphic. To determine the distribution of Inline graphic, we evaluate its PGF

graphic file with name d33e2011.gif 41
graphic file with name d33e2017.gif 42

Hence, Inline graphic. Inline graphic

Figure 1 shows the four indices (coefficient of variation, Gini, Pietra, and Inline graphic) for a host aged 3 with rate of infectious contacts (Inline graphic) in [0.25, 128], the number of parasites (N) entering the host at infectious contacts having a Inline graphic distribution, and the parasite lifetimes (T) having a Inline graphic distribution. All four indices are strictly decreasing as the rate of infectious contacts increases. The coefficient of variation (16) is not displayed for small values of Inline graphic as it is proportional to Inline graphic. For Inline graphic, the expected parasite load is less than one so the Pietra index and Inline graphic are equal for Inline graphic following (24). As expected from the discussion in Sect. 2.3, the Pietra index appears to display some discontinuity in the first derivative at points where the expected parasite load is integer valued. This behaviour is less apparent at larger values of Inline graphic.

Fig. 1.

Fig. 1

Plot of the coefficient of variation (orange dotted line), Gini index (purple dashed line), Pietra index (yellow solid line), and Inline graphic (blue dot-dashed line) for a host aged 3 in the Tallis-Leyton model with Inline graphic, and Inline graphic. Since Inline graphic for Inline graphic, the Pietra index and Inline graphic coincide on that interval of Inline graphic as expected (24)

Distribution of N

We now consider the role of the distribution of the number of parasites (N) that enter the host during an infectious contact. As a concrete example, suppose Inline graphic, where m is the mean and the variance is Inline graphic. From (4), the variance-to-mean ratio of the parasite load M(a) is

graphic file with name d33e2206.gif 43

We see that the variance-to-mean ratio is increasing in m but decreasing in k. In contrast, the coefficient of variation of M(a) is decreasing in both m and k.

The next two results show increased variability in the number of parasites entering the host during an infectious contact leads to increased parasite aggregation in the sense of the Lorenz order. The first of these results uses the convex order, which requires the distributions being compared to have the same expectation.

Theorem 3

Suppose that N and Inline graphic are non-negative integer valued random variables such that Inline graphic and Inline graphic. Assume that all other model parameters are equal. Then Inline graphic.

Proof

Using an extension of the closure under random sums property of the convex order Shaked and Shanthikumar (2007, Theorem 3.A.13),

graphic file with name d33e2272.gif 44

As the convex order is closed under mixtures, Inline graphic. Let Inline graphic be a sequence of independent random variables having the same distribution as X(NV) and let Inline graphic be a sequence of independent random variables having the same distribution as Inline graphic. As the convex order is closed under random sums,

graphic file with name d33e2313.gif 45

Theorem 1 shows Inline graphic. Inline graphic

For distributions with different means, we consider only the case where N and Inline graphic are related by binomial thinning. Recall that if Inline graphic for some Inline graphic, then Inline graphic.

Theorem 4

Suppose that Inline graphic for some Inline graphic and all other model parameters are equal. Then Inline graphic.

Proof

Let Inline graphic and Inline graphic be independent standard uniform random variables. Then standard conditioning arguments show

graphic file with name d33e2402.gif 46

As the convex order is closed under mixtures,

graphic file with name d33e2409.gif 47

As the convex order is closed under random sums, Inline graphic. Following the same arguments as in the proof of Theorem 3, we see Inline graphic. Hence, Inline graphic. Inline graphic

When the distribution of the number of parasite has a Inline graphic distribution, Theorems 3 and 4 together show that an increase in m or k will decrease parasite aggregation in the sense of the Lorenz order.

Corollary 5

Suppose Inline graphic and Inline graphic with Inline graphic and Inline graphic. Assume that all other model parameters are equal. Then Inline graphic.

Proof

Let Inline graphic be the parasite load for a host of age a in the Tallis-Leyton model with Inline graphic and all other model parameters equal. The PGF of the Inline graphic distribution is

graphic file with name d33e2522.gif 48

and Inline graphic with Inline graphic. Theorem 4 implies Inline graphic. Since Inline graphic and Inline graphic, Inline graphic. Theorem 3 implies Inline graphic. As the Lorenz ordering is transitive, Inline graphic. Inline graphic

Figure 2 shows the Gini and Pietra indices for a parasite host system with host aged 10, rate of infectious contacts Inline graphic, the distribution of the number of parasites (N) that enter the host during an infectious contact following a Inline graphic distribution, and parasite lifetimes (T) having an Inline graphic distribution. Both indices are decreasing in both m and k as we expect from the above results. The contours of both the Gini and Pietra indices tend to become parallel to the respective axes as Inline graphic and Inline graphic. This is a consequence of the limiting behaviour of the negative binomial distribution (Adell and Cal 1994). The contours of the Pietra index display some discontinuity in the first derivative for Inline graphic, which corresponds to a host’s expected parasite load being 1.

Fig. 2.

Fig. 2

Contour plots showing Gini index (Left) and Pietra index (Right) for a host aged 10 in the Tallis-Leyton model with Inline graphic, Inline graphic and Inline graphic

It is natural to consider which distribution for N results in the least aggregated distribution for the host’s parasite load. This requires determining the smallest distribution in the convex ordering. The distributions being compared must have the same expected value. Let n be a non-negative real number. Define the random variable N such that

graphic file with name d33e2685.gif 49

In the supplementary material of McVinish and Lester (2020) it was shown for any random variable Inline graphic on the non-negative integers with Inline graphic is larger than N in the convex order. That is, Inline graphic and we can say N has the smallest distribution in convex order with expectation n. When Inline graphic, the smallest distribution in convex order for N leads to M(a) having a Poisson distribution. There is no largest distribution in the convex order.

Host age

We now examine the effect of the host’s age (a) on parasite aggregation. Differentiating (4) with respect to a shows the variance-to-mean ratio is a decreasing function of the host’s age. Since the expected parasite load is increasing in age, the coefficient of variation is also decreasing in the host’s age. The following result shows that parasite aggregation in the sense of Lorenz order decreases as the host age increases.

Theorem 6

If Inline graphic, then Inline graphic.

The proof is built from the following lemmas.

Lemma 7

Let V have the distribution (31) and let Inline graphic have the distribution (31) with a replaced by Inline graphic. Let Inline graphic independent of V, and let Inline graphic independent of Inline graphic. Then Inline graphic.    

Proof

Note that

graphic file with name d33e2830.gif 50

so Inline graphic. We show that Inline graphic by examining the sign changes of Inline graphic. The survival functions of BV and Inline graphic are

graphic file with name d33e2865.gif 51

and

graphic file with name d33e2872.gif 52

Since Inline graphic is increasing in a and Inline graphic is decreasing in a,

graphic file with name d33e2898.gif 53

Hence, Inline graphic for all Inline graphic. On Inline graphic, Inline graphic whereas Inline graphic decreases from Inline graphic to Inline graphic. For all Inline graphic, Inline graphic. Hence, Inline graphic has a single sign change from positive to negative. Hence, Inline graphic (Shaked and Shanthikumar 2007, Theorem 3.A.44). Inline graphic

Lemma 8

For any convex function Inline graphic and any non-negative integer valued random variable N that is independent of Inline graphic, Inline graphic is a convex function in v.

Proof

As the binomial distribution Inline graphic is a regular exponential family of distribution with expectation linear in v, Schweder (1982, Proposition 2) implies Inline graphic is convex in v for any positive integer n. As non-negative weighted sums of convex functions are also convex, it follows that Inline graphic is a convex function in v. Inline graphic

Proof of Theorem 6

As Inline graphic, if b takes values in Inline graphic, then Inline graphic. Applying Shaked and Shanthikumar (2007, Theorem 3.A.21) with Lemmas 7 and 8,

graphic file with name d33e3087.gif 54

Since the convex order is transitive and closed under mixtures,

graphic file with name d33e3094.gif 55

In the notation of Theorem 1, Inline graphic, where Inline graphic is a sequence of independent random variables with Inline graphic. From the thinning property of the Poisson process and Theorem 1, we can write Inline graphic, where Inline graphic is a sequence of independent random variables with Inline graphic and Inline graphic is a sequence of independent Inline graphic random variables that are also independent of the Inline graphic. As the convex order is closed under random sums,

graphic file with name d33e3163.gif 56

Inline graphic    

Figure 3 shows the four indices (coefficient of variation, Gini, Pietra, and Inline graphic) for the parasite load M(a) of a host aged a in the Tallis-Leyton model with rate of infectious contacts Inline graphic, the number of parasites (N) entering the host during an infectious contact following a Inline graphic distribution, and parasite lifetimes (T) following an Inline graphic distribution. All four indices are strictly decreasing in host age. The Pietra index appears to be crudely interpolated, however all indices were evaluated on the same grid with a step size of 0.01. The ages where the Pietra index appears non-differentiable are those ages where the expected parasite load of the host is integer valued. Specifically, the expected parasite load of the host is Inline graphic so the host has integer valued expected parasite load at ages 0.22, 0.51, 0.92 and 1.61. As in Fig. 1, the Pietra index coincides with Inline graphic for Inline graphic, that is for Inline graphic.

Fig. 3.

Fig. 3

Plot of the coefficient of variation (orange dotted line), Gini index (purple dashed line), Pietra index (yellow solid line), and Inline graphic (blue dot-dashed line) for a host in the Tallis-Leyton model with Inline graphic, Inline graphic, and Inline graphic. Since Inline graphic for Inline graphic, the Pietra index and Inline graphic coincide for Inline graphic as expected (24)

Parasite lifetime distribution

We now assess the effect of variability in the parasite lifetime distribution (T) on parasite aggregation. Rather than assuming Inline graphic, we will assume that Inline graphic and Inline graphic has a single sign change from positive to negative. As noted in the last bullet point of Sect. 2.2, these conditions imply Inline graphic. The below result shows that increased variability in the parasite lifetimes decreases parasite aggregation in the sense of the Lorenz order. In particular, the result implies that the host’s parasite load is most aggregated when parasites have constant lifetimes.

Theorem 9

Suppose Inline graphic and Inline graphic has a single sign change from positive to negative. Assume all other model parameters are equal. Then Inline graphic.

Proof

We first show that for all Inline graphic,

graphic file with name d33e3378.gif 57

Define the function Inline graphic as

graphic file with name d33e3391.gif 58

By definition Inline graphic. As Inline graphic has a single sign change from positive to negative, H first increases and then decreases on Inline graphic. Since Inline graphic, Inline graphic. Hence, Inline graphic for all Inline graphic and (57) is established. For any Inline graphic, set Inline graphic such that

graphic file with name d33e3460.gif 59

It follows from (57) that Inline graphic. Let Inline graphic. Let V have distribution (31) and let Inline graphic have the distribution (31) with a replaced by Inline graphic and T replaced by Inline graphic. The survival functions of Inline graphic and BV are

graphic file with name d33e3527.gif 60

and

graphic file with name d33e3534.gif 61

Since Inline graphic has a single sign change from positive to negative, it follows that Inline graphic also has a single sign change from positive to negative. Hence, Inline graphic (Shaked and Shanthikumar 2007, Theorem 3.A.44). Applying Lemma 8 and Shaked and Shanthikumar (2007, Theorem 3.A.21) together shows Inline graphic. From Theorem 1, Inline graphic and Inline graphic, where Inline graphic and Inline graphic. Let Inline graphic be a sequence of independent Inline graphic random variables that are also independent of Inline graphic By construction Inline graphic. From the thinning property of the Poisson process, Inline graphic. As the convex order is closed under random sums, we see Inline graphic. Letting Inline graphic and noting that the convex order is closed under weak limits, we see Inline graphic. Inline graphic

That increasing variability in the parasite lifetimes decreases parasite aggregation seems counter-intuitive. However, if we consider the extreme case where the parasite lifetimes are constant, then we see that at any given age the host will either have all or none of the hosts from a previous infectious contact. Therefore, it is natural to expect this to lead to the greatest parasite aggregation. On the other hand, greater variability in the parasite lifetimes effectively spreads out when parasites die, leading to less parasite aggregation.

Asymptotic normality

As noted previously, when the host’s parasite load to converges to a normal distribution, the Gini and Petra indices can each be approximate by a constant multiple of the coefficient of variation as indicated by the limits (28) and (30). The final result shows that when the rate of infectious contacts in the Tallis-Leyton model tends to infinity, the distribution of the host’s parasite load converges to a normal distribution.

Theorem 10

Suppose there exists positive constants Inline graphic and C such that

graphic file with name d33e3682.gif 62

for all Inline graphic such that Inline graphic. Then

graphic file with name d33e3702.gif 63

Proof

The characteristic function of M(a) is Inline graphic. We aim to show that

graphic file with name d33e3724.gif 64

The result then follows by Lévy’s convergence theorem. Define

graphic file with name d33e3731.gif 65

For non-negative integers n and real x define

graphic file with name d33e3745.gif 66

Then Inline graphic and

graphic file with name d33e3758.gif 67

(Williams 1991, pg 183). Note that

graphic file with name d33e3769.gif 68

From the expressions for Inline graphic and Inline graphic,

graphic file with name d33e3788.gif 69

From the expression for Inline graphic and the fact that

graphic file with name d33e3801.gif 70

we obtain

graphic file with name d33e3809.gif 71

Using the bound (67) and the fact that Inline graphic, we see

graphic file with name d33e3825.gif 72

and

graphic file with name d33e3832.gif 73

Finally, using Inline graphic together with the bound (67) and the fact that Inline graphic, we see

graphic file with name d33e3855.gif 74

Hence, the limit (64) holds. Inline graphic

Figure 4 compares the probability mass function of the host’s parasite load, M(a), in the Tallis-Leyton model with the probability density function of the approximating normal distribution. The Tallis-Leyton model used a host aged Inline graphic, number of parasites (N) entering the host during an infectious having a Inline graphic distribution, and parasite lifetimes (T) having an Inline graphic distribution. When the rate of infectious contact Inline graphic, the probability mass function still shows some right skewness. The normal approximation in this instance places a non-negligible probability on values less than zero. When Inline graphic, the probability mass function is very close to symmetric and the normal distribution provides a good approximation. Figure 5 shows the Gini and Pietra indices together with the approximations based on the limits (28) and (30). In this instance the approximations of the Gini and Pietra indices appear reasonably accurate even for Inline graphic as small as 2 where the normal approximation is poor.

Fig. 4.

Fig. 4

Probability mass function (blue bars) and approximating normal probability density function (red line) for a host aged 3 in the Tallis-Leyton model with Inline graphic, Inline graphic, and Inline graphic (left) and Inline graphic (right)

Fig. 5.

Fig. 5

Gini index (purple dashed line) and Pietra index (yellow line) together with the asymptotic normal approximations (dotted lines) for a host aged 3 in the Tallis-Leyton model with Inline graphic and Inline graphic

Discussion

This study examined how variation in M(a), the parasite load of age a hosts, in the Tallis-Leyton model is affected by the host age, the rate of infectious contacts (Inline graphic), the distribution of the number of parasites (N) entering the host during an infectious contact, and the distribution of parasite lifetimes (T). Variation in the parasite load was quantified by several aggregation metrics. While there are many aggregation measures used in the parasitology literature, this study focused on measures related to the Lorenz ordering of distributions, specifically the coefficient of variation, the Gini index, Pietra index, and Inline graphic. The Lorenz based measures of aggregation all decrease together if variation in the distribution decreases in the sense of the Lorenz order and are constrained by equality (24) and inequality (25). Furthermore, when the parasite load has approximately a normal distribution the Gini and Pietra indices can each be approximate by a constant multiple of the coefficient of variation.

The analysis showed that an increase in the rate of infectious contacts or an increase in the host age results in a decrease in the aggregation of parasite load using the Lorenz based measures. These results are perhaps not surprising in light of the behaviour of the Poisson distribution, which decreases in the Lorenz order as the mean increases. It might also be expected that increased variability in the the number of parasites entering the host during an infectious contact results in increased aggregation of parasite load using the Lorenz based measures. However, that increased variation in the parasite lifetimes decreases parasite aggregation in the limit as host age tends to infinity seems counter-intuitive. This result can be understood as variability in parasite lifetimes spreads out when parasites die and hence results in less variable parasite loads.

Although only four measure of aggregation based on the Lorenz order were explicitly mentioned in this study, these results extend to any other index respecting the Lorenz order. On the other hand, measures of aggregation not based on the Lorenz order may behave differently. For example, the variance-to-mean ratio is not affected by changes to the rate of infectious contacts. Also, if the number of parasites entering the host during an infectious contact has a Inline graphic distribution, then an increase in m results in an increase in the variance-to-mean ratio, but the Lorenz based measures decrease.

Unfortunately, the population dynamics of parasites are often more complicated than what is represented in the Tallis-Leyton model. Some parasites need multiple hosts to complete its life cycle. Once a parasite finds a host it may be subject to intraspecific and interspecific competition for resources. Furthermore, parasites often interact with the host either by stimulating an immune response from the host or by increasing the host’s mortality rate.

Isham (1995) proposed a simple stochastic model that incorporates parasite induced host mortality. In Isham’s model, the host acquires parasites following the same dynamics as the Tallis-Leyton model and parasite lifetimes are assumed exponentially distributed. The important difference in Isham’s model is that each parasite present in the host increases the host’s death rate by a fixed amount Inline graphic. A complete analysis of Isham’s model in terms of the Lorenz order is beyond the scope of this paper. In a special case, however, we see that parasite induced host mortality increases aggregation of the parasite distribution in the sense of the Lorenz order. When the number of parasites that enter the host at an infectious contact follows a geometric distribution, an explicit expression for the limiting distribution is possible. Specifically, if Inline graphic, then

graphic file with name d33e4055.gif 75

As the negative binomial distribution is decreasing in Lorenz order in both mean and k, it follows that indices respecting the Lorenz order are increasing in the parasite induced host mortality rate. In contrast, the variance-to-mean ratio is Inline graphic so it is not affected by the parasite induced mortality.

A complete examination Isham’s model in terms of the Lorenz order may prove challenging. Even computing the Gini and Pietra indices may present difficulties since they require absolute moments, which are often not easily evaluated. In that case, the coefficient of variation may prove useful since it respects the Lorenz order, is easily evaluated, and can be used to approximate the Gini and Pietra indices when the distribution is approximately normal.

Acknowledgements

The author express his thanks to the associate editor and two referees for their detailed and thoughtful comments on the original version of the paper.

Funding

Open Access funding enabled and organized by CAUL and its Member Institutions

Data availability

The code used to generate the figures in this paper are publicly available on Zenodo (McVinish 2025).

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Abate J, Whitt W (1992) Numerical inversion of probability generating functions. Oper Res Lett 12:245–251. 10.1016/0167-6377(92)90050-D [Google Scholar]
  2. Adell JA, De La Cal J (1994) Approximating gamma distributions by normalized negative binomial distributions. J Appl Probab 31:391–400. 10.2307/3215032 [Google Scholar]
  3. Anderson RM, May RM (1978a) Regulation and stability of host-parasite population interactions: I. Regulatory processes. J Anim Ecol 47:219–247. 10.2307/3933 [Google Scholar]
  4. Anderson RM, May RM (1978b) Regulation and stability of host-parasite population interactions: II. Destabilizing processes. J Anim Ecol 47:249–267. 10.2307/3934 [Google Scholar]
  5. Arnold BC, Sarabia JM (2018) Majorization and the Lorenz order with applications in applied mathematics and economics. Springer, New York. 10.1007/978-3-319-93773-1 [Google Scholar]
  6. Barbour AD, Pugliese A (2000) On the variance-to-mean ratio in models of parasite distributions. Adv Appl Probab 32:701–719. 10.1239/aap/1013540240 [Google Scholar]
  7. Gastwirth JL (1971) A general definition of the Lorenz curve. Econometrica 39:1037–1039. 10.2307/1909675 [Google Scholar]
  8. Gini C (1914) Sulla misura della concentrazione e della variabilità dei caratteri. Atti Ist Ven 73:1203–1248 [Google Scholar]
  9. Gini C (2005) On the measurement of concentration and variability of characters. METRON Int J Stat 63:3–38 [Google Scholar]
  10. Herbert J, Isham V (2000) Stochastic host-parasite interaction models. J Math Biol 40:343–371. 10.1007/s002850050184 [DOI] [PubMed] [Google Scholar]
  11. Herrero-Cófreces S, Flechoso MF, Rodríguez-Pastor R, Luque-Larena JJ, Mougeot F (2021) Patterns of flea infestation in rodents and insectivores from intensified agro-ecosystems. Parasites Vectors 14:16. 10.1186/s13071-020-04492-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Holman DF, Chaudhry ML, Kashyap BRK (1983) On the service system Inline graphic. Eur J Oper Res 13:142–145. 10.1016/0377-2217(83)90075-9 [Google Scholar]
  13. Isham V (1995) Stochastic models of host-macroparasite interaction. Ann Appl Probab 5:720–740. 10.1214/aoap/1177004702 [Google Scholar]
  14. Jovani R, Tella JL (2006) Parasite prevalence and sample size: misconceptions and solutions. Trends Parasitol 22:214–218. 10.1016/j.pt.2006.02.011 [DOI] [PubMed] [Google Scholar]
  15. Kura K, Truscott JE, Collyer BS, Phillips A, Garba A, Anderson RM (2022) The observed relationship between the degree of parasite aggregation and the prevalence of infection within human host populations for soil-transmitted helminth and schistosome infections. Trans R Soc Trop Med Hyg 116:1226–1229. 10.1093/trstmh/trac033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Lorenz MO (1905) Methods of measuring the concentration of wealth. Publ Am Stat Assoc 9:209–219. 10.2307/2276207 [Google Scholar]
  17. McPherson NJ, Norman RA, Hoyle AS, Bron JE, Taylor NGH (2012) Stocking methods and parasite-induced reductions in capture: modelling Argulus foliaceus in trout fisheries. J Theor Biol 312:22–33. 10.1016/j.jtbi.2012.07.017 [DOI] [PubMed] [Google Scholar]
  18. McVinish R (2025) Numerical evaluation of aggregation indices for the Tallis-Leyton model. Zenodo. 10.5281/zenodo.15293018 [Google Scholar]
  19. McVinish R, Lester RJG (2020) Measuring aggregation in parasite populations. J R Soc Interface 17:20190886. 10.1098/rsif.2019.0886 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. McVinish R, Lester RJG (2024) A graphical exploration of the relationship between parasite aggregation indices. PLoS ONE 19:e0315756. 10.1371/journal.pone.0315756 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Morrill A, Poulin R, Forbes MR (2023) Interrelationships and properties of parasite aggregation measures: a user’s guide. Int J Parasitol 53:763–776. 10.1016/j.ijpara.2023.06.004 [DOI] [PubMed] [Google Scholar]
  22. Peacock SJ, Bouhours J, Lewis MA, Molnár PK (2018) Macroparasite dynamics of migratory host populations. Theor Popul Biol 120:29–41. 10.1016/j.tpb.2017.12.005 [DOI] [PubMed] [Google Scholar]
  23. Pielou EC (1977) Mathematical ecology, Chapter 8. Wiley, New York [Google Scholar]
  24. Pietra G (1915) Delle relazioni tra gli indici di variabilità (nota I). Atti Ist Ven 74:775–792 [Google Scholar]
  25. Pietra G (2014) On the relationships between variability indices (note I). METRON 72:5–16. 10.1007/s40300-014-0034-3 [Google Scholar]
  26. Poulin R (1993) The disparity between observed and uniform distributions: a new look at parasite aggregation. Int J Parasitol 23:931–944. 10.1016/0020-7519(93)90060-C [DOI] [PubMed] [Google Scholar]
  27. Poulin R (2007) Are there general laws in parasite ecology? Parasitology 134:763–776. 10.1017/S0031182006002150 [DOI] [PubMed] [Google Scholar]
  28. Poulin R (2011) Evolutionary ecology of parasites, Chapter 6, 2nd edn. Princeton University Press, Princeton [Google Scholar]
  29. Ramasubban TA (1958) The mean difference and the mean deviation of some discontinuous distributions. Biometrika 45:549–556. 10.1093/biomet/45.3-4.549 [Google Scholar]
  30. Reiczigel J, Marozzi M, Fábián I, Rózsa L (2019) Biostatistics for parasitologists—a primer to quantitative parasitology. Trends Parasitol 35:277–281. 10.1016/j.pt.2019.01.003 [DOI] [PubMed] [Google Scholar]
  31. Rodríguez-Hernández K, Álvarez Mendizábal P, Chapa-Vargas L, Escobar F, González-García F, Santiago-Alarcon D (2021) Haemosporidian prevalence, parasitaemia and aggregation in relation to avian assemblage life history traits at different elevations. Int J Parasitol 51:365–378. 10.1016/j.ijpara.2020.10.006 [DOI] [PubMed] [Google Scholar]
  32. Rosà R, Pugliese A (2002) Aggregation, stability, and oscillations in different models for host-macroparasite interactions. Theor Popul Biol 61(3):319–334. 10.1006/tpbi.2002.1575 [DOI] [PubMed] [Google Scholar]
  33. Schreiber SJ (2006) Host-parasitoid dynamics of a generalized Thompson model. J Math Biol 52:719–732. 10.1007/s00285-005-0346-2 [DOI] [PubMed] [Google Scholar]
  34. Schrock SAR, Walsman JC, DeMarchi J, LeSage EH, Ohmer MEB, Rollins-Smith LA, Briggs CJ, Richards-Zawacki CL, Woodhams DC, Knapp RA, Smith TC, Haddad CFB, Becker CG, Johnson PTJ, Wilber MQ (2025) Do fungi look like macroparasites? Quantifying the patterns and mechanisms of aggregation for host-fungal parasite relationships. Proc R Soc B 292:20242013. 10.1098/rspb.2024.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Schweder T (1982) On the dispersion of mixtures. Scand J Stat 9:165–169 [Google Scholar]
  36. Shaked M, Shanthikumar JG (2007) Stochastic orders. Springer, New York. 10.1007/978-0-387-34675-5 [Google Scholar]
  37. Shaw DJ, Dobson AP (1995) Patterns of macroparasite abundance and aggregation in wildlife populations: a quantitative review. Parasitology 111:S111–S133. 10.1017/s0031182000075855 [DOI] [PubMed] [Google Scholar]
  38. Taguchi T (1968) Concentration-curve methods and structures of skew populations. Ann Inst Stat Math 20:107–141. 10.1007/BF02911628 [Google Scholar]
  39. Tallis GM, Leyton MK (1969) Stochastic models of populations of helminthic parasites in the definitive host. I. Math Biosci 4:39–48. 10.1016/0025-5564(69)90006-6 [Google Scholar]
  40. The MathWorks Inc. (2022a) Matlab version: 9.13.0.2080170 (r2022b) update 1
  41. The MathWorks Inc. (2022b) Symbolic math toolbox (r2022b)
  42. Williams D (1991) Probability with martingales. Cambridge University Press, Cambridge. 10.1017/CBO9780511813658 [Google Scholar]
  43. Wilson K, Bjørnstad O, Dobson A, Merler S, Poglayen G, Randolph S, Read A, Skorping A (2001) Heterogeneities in macroparasite infections: patterns and processes. In: Hudson P, Rizzoli A, Grenfell B, Heesterbeek H, Dobson A (eds) The ecology of wildlife diseases. Oxford University Press, Oxford, pp 6–44. 10.1093/oso/9780198506201.003.0002 [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The code used to generate the figures in this paper are publicly available on Zenodo (McVinish 2025).


Articles from Journal of Mathematical Biology are provided here courtesy of Springer

RESOURCES