Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 Jan 27.
Published in final edited form as: Commun Stat Theory Methods. 2012 Dec 14;42(2):201–213. doi: 10.1080/03610926.2011.581787

A Note on the Complementary Mixture Pareto II Distribution

YIMEI LI 1, DANIEL F HEITJAN 2
PMCID: PMC12834256  NIHMSID: NIHMS2139592  PMID: 41602778

Abstract

We introduce a new survival distribution, of Pareto type, that arises from a cure-mixture frailty model. We describe its properties and demonstrate connections with familiar distributions including the Pareto and exponential. We derive its characteristic function and moments.

Keywords: Characteristic function, Moments, Pareto distribution, Survival distribution

Mathematics Subject Classification: 62E99

1. Introduction

Recently, Leemis and McQueston (2008) presented a graphic describing the relationships among a large number of known univariate distributions. A distribution that we encountered in our applied work, which we denote the Complementary Mixture Pareto II (CMPII), was not included in the summary, nor does it appear in a popular compendium of univariate distributions (Johnson et al., 1995). Therefore, we believe that CMPII is new to the statistical literature and in this note we present our derivation of it, describe connections with known distributions, and derive its characteristic function and moments.

We came upon CMPII while developing a model to accommodate the unique characteristics of data from smoking cessation clinical trials (Li et al., 2010). As the durations of quit attempts in such trials are characterized by heterogeneity and the possibility of permanent success, an appropriate statistical model should incorporate both latent frailties and the probability of cure. Our construction of such a model led us directly to CMPII, as we demonstrate below.

Let X denote an event time and S(x) its survival function. We assume that with probability ρ the event time is infinite, so that S(x) takes the form

Sx=ρ+1-ρS*x,ρ0,1,

where S*(x) is a proper survival function. We assume moreover that, conditional on a latent frailty g>0, the cure probability and survival function given not cured take the forms

ρ(g)=exp-ge-ηρ,S*(xg)=exp-xgeησ, (1.1)

where ηρ is a linear predictor for the cure probability and ησ is a linear predictor for event rate given not cured. That is, we assume that: (i) conditionally on the frailty g the probability of cure follows a generalized linear model with a complementary log-log (cloglog) link; (ii) survival given not cured is exponential with a proportional hazard regression; and (iii) the natural log of g is an offset in both linear predictors. Moreover, we assume that g is a random draw from a gamma distribution with shape 1/θ and scale θ, ensuring that E(g)=1 and Var(g)=θ(θ>0). If the cure probability were fixed at 0, this would be the standard model of a gamma-distributed exponential hazard (Duchateau and Janssen, 2008).

This specification is unique among cure-mixture frailty models in using a cloglog rather than a logistic link for the cure probability and in allowing a common frailty to affect both the cure probability and the survival given not cured. Integrating out the frailty, we obtain the marginal survival function

S~x=SxdFg=ρ~+1-ρ~S*~x,

where

ρ~=11+θe-ηρ1/θ,

and

S*~(x)=1+θxeησ-1/θ-1+θe-ηρ+θxeησ-1/θ1-1+θe-ηρ-1/θ.

S*~(x) is a proper survival function in that it is monotone decreasing with S*~(0)=1 and S*~(+)=0. Therefore, the marginal cure probability is ρ~, and the marginal survival function given not cured is S*~(x). Note that the marginal cure probability ρ~ differs from the conditional cure probability ρ, and the marginal non cured survival S*~(x) differs from S*(x)dF(g)—the integral of the conditional non cured survival. The distribution S*~(x) is the subject of this article.

As we indicated, our research suggests that S*~(x) has not previously been identified as a distribution. We therefore devote the remainder of this note to an investigation of its properties and its relationship to other distributions. Note that our purpose here is simply to elucidate basic theoretical properties; we refer readers to Li et al. (2010) for an application.

Our argument invokes the following densities.

Exponential (λ):

f(x)=λe-λx,x0,λ(0,);

Pareto I (α,σ):

f(x)=ασxσ-(α+1),xσ,α,σ(0,);

Pareto II (μ,σ,ξ):

f(x)=1σ1+ξx-μσ-(1/ξ+1),xμ(-,),σ,ξ(0,).

2. General Characteristics

For simplicity, we denote a=eησ,b=eηρ,c=1-(1+θ/b)-1/θ; then,

S*~(x)=1c(1+θax)-1/θ-(1+θ/b+θax)-1/θ,x0,a,b,θ(0,). (2.1)

Here, a is a parameter associated with the conditional hazard (conditional on the frailty) given not cured; a larger a implies a higher conditional hazard of experiencing the event. Parameter b is associated with the conditional cure probability, where a larger value implies a larger conditional cure probability. The parameter θ is the variance of the frailty, and c is a normalizing constant. The corresponding marginal density is

f*~(x)=ac(1+θax)-1/θ-1-(1+θ/b+θax)-1/θ-1,x(0,), (2.2)

and the hazard function is

h*~(x)=a(1+θax)-1/θ-1-(1+θ/b+θax)-1/θ-1(1+θax)-1/θ-(1+θ/b+θax)-1/θx(0,). (2.3)

We rewrite (2.2) as

f*~(x)=αf1(x)-βf2(x), (2.4)

where

α=1c=11-(1+θ/b)-1/θ,β=1-1c=(1+θ/b)-1θ1-(1+θ/b)-1/θ;f1~ParetoII(μ=0,σ=1/a,ξ=θ),f2~ParetoII(μ=0,σ=(1+θ/b)/a,ξ=θ).

and

α>1,β>0,α-β=1.

Because the density is a weighted difference of Pareto II densities, we denote our model as the Complementary Mixture Pareto II (CMPII) distribution.

Note that the first derivative of the density is

f~*(x)=-a2(1+θ)c(1+θax)-1θ+2-(1+θax+θ/b)-1θ+2<0,a,b,θ,x(0,)

so that the density function is always decreasing with time x.

Figure 1 plots the log density and log hazard for different values of a and θ when b=1. Figure 2 depicts how the density and hazard vary with b when a and θ are held fixed. Note that the hazard increases with b and appears to converge to a distribution as b. In fact, by L’Hôpital’s rule,

limbS~*(x)=(1+θax)-1θ-1=ParetoII(μ=0,σ=1/[a(1+θ)],ξ=θ/(1+θ)).

Figure 1.

Figure 1.

Log density and hazard function when b=1.

Figure 2.

Figure 2.

Log density and hazard function when a=1 and θ=1.

As the hazard function (2.3) is analytically complicated, we explored permissible shapes by computing its derivative numerically over a wide range of values of a,b, and θ. In all cases the derivative was negative for all x, suggesting that the hazard is non increasing with time under a wide range of parameter values. This is not surprising, as the special case of the Pareto II can be shown to have a non-increasing hazard (Johnson et al., 1995).

3. The Special Case of b0

Letting b0 in (2.1), the marginal non cured survival function becomes

limb0S*~(x)=(1+θax)-1θ,

the Pareto II (μ=0,σ=1/a,ξ=θ) that arises from the standard frailty model with exponential baseline hazard (Duchateau and Janssen, 2008). This is because as b0, the conditional cure probability in (1.1) goes to 0, so that our model reduces to a standard frailty model with constant hazard conditional on the frailty. The density is then

f*~(x)=a(1+θax)-1θ+1,

with first derivative

f~*(x)=-a2(1+θ)(1+θax)-1θ+2<0,a,θ,x(0,)

so that the density function is always decreasing with time x.

Figures 3 and 4 depict the log density and hazard at various values of a and θ for b0. We see that when θ is fixed and a decreases, the density declines at a slower rate, and the hazard is smaller at any given time. This is because a smaller value of a implies a smaller individual-level conditional hazard, which manifests itself as a smaller population-level marginal hazard. In particular, when a0, the marginal hazard also goes to 0. When a, the marginal non-cured survival becomes the Pareto I (α=1/θ,σ=1/(θa)):

limb0,aS*~(x)=(θax)-1θ.

Figure 3.

Figure 3.

Log density function when b0.

Figure 4.

Figure 4.

Hazard function when b0.

Similarly, the density and hazard function vary with θ for fixed a. When θ is small, the log density approaches a linear function of time and the hazard approaches a constant. This is because θ0 means that the variance of the frailty approaches 0, which implies that there is no between-subject heterogeneity, and therefore the population survival function equals the individual survival function, which is exponential:

limb0,θ0S*~x=e-ax.

We summarize the relationships of CMPII with other known distributions in Fig. 5.

Figure 5.

Figure 5.

Relationships of CMPII to other distributions.

4. Characteristic Function and Moments

We first derive the expectation and variance of CMPII.

Theorem 4.1. The expectation of a variate X following the CMPII distribution is

EX=1a1-θ1-(1+θ/b)-1θ+11-(1+θ/b)-1θ,θ<1;

the second moment is

EX2=2a2(1-θ)(1-2θ)1-(1+θ/b)-1θ+21-(1+θ/b)-1θ,θ<1/2;

and therefore the variance is

Var(X)=2a2(1-θ)(1-2θ)1-(1+θ/b)-1θ+21-(1+θ/b)-1θ-1a2(1-θ)21-(1+θ/b)-1θ+11-(1+θ/b)-1θ2,θ<1/2.

See Appendix A for a proof. We note that the second factor in the first moment formula is always smaller than 1, giving

EX<1a1-θ=EX1~f1.

We next derive the characteristic function ϕ(t)=EeitX.

Theorem 4.2. The characteristic function of CMPII is

ϕ(t)=1cθ-itθa1θexp-itθaΓ-1θ,-itθa-exp-it(1+θ/b)θaΓ-1θ,-it(1+θ/b)θa.

Here, Γ(,) is the upper incomplete Gamma function with complex arguments:

Γ(s,z)=zts-1e-tdt

(Abramowitz and Stegun, 1965; DiDonato and Morris, 1986; Temme, 1996). See Appendix B for a proof. Repeated differentiation gives

Theorem 4.3. The nth moment of the CMPII distribution is

EXn=n!can(1-θ)(1-2θ)(1-nθ)-n!(1+θ/b)n-1θcan(1-θ)(1-2θ)(1-nθ),θ<1n.

See Appendix C for a proof. One can readily verify that Theorem 4.3 implies Theorem 4.1.

5. Discussion

We have described CMPII, a three-parameter family that arises as the marginal distribution of survival given not cured in a cure-mixture frailty model. Several known distributions are limiting cases.

The CMPII has the curious property that its density equals a linear combination of two Pareto II densities, which would appear to offer a clear path to its characteristic function and moments. Ironically, we were unable to find formulas for the Pareto II characteristic function, so we derived terms for CMPII directly (see Appendices B and C). The derivations are tedious but may serve as templates for similar derivations of other distributions.

Pareto distributions have been known for several decades (Arnold and Laguna, 1977; Arnold, 1983) and have found application in various areas of science, especially in economics to describe income and in medical statistics to describe survival. Naturally, one can apply CMPII in these areas as well. Our applied experience (Li et al., 2010) suggests its usefulness specifically in smoking cessation research.

Appendix A: Proof of Theorem 4.1

Johnson et al. (1995) showed that if U is Pareto II (μ,σ,ξ), then

E(U)=σ1-ξ,ξ<1

and

EU2=2σ2(1-ξ)(1-2ξ),ξ<1/2.

Recall that the density of a CMPII variate X is

f*~(x)=αf1(x)-βf2(x),

where

α=11-(1+θ/b)-1/θ=1-β,

f1 is the density of a Pareto II (0,1/a,θ), and f2 is the density of a Pareto II (0,(1+θ/b)/a,θ). One can therefore derive E(X) and EX2 by taking linear combinations of the moments of Pareto II, and Var(X) by subtraction.

Appendix B: Proof of Theorem 4.2

Here, we use the fact that

0(x+w)ve-uxdx=u-v-1ewuΓv+1,wu,

where u,v,w are complex numbers with |argw|<π,Reu>0; and Γ(,) is the upper incomplete Gamma function (Gradshteyn and Ryzhik, 1980).

Equation (2.4) implies that

ϕ(t)=EeitX=αEeitU,U~f1-βEeitV,V~f2.

Now

EeitU,U~f1=a0eitu(1+θau)-1θ-1du=a(θa)-1θ-10u+1θa-1θ-1eitudu=a(θa)-1θ-1(-it)1θexp-itθaΓ-1θ,-itθa,=1θ-itθa1θexp-itθaΓ-1θ,-itθa,

and similarly,

EeitV,V~f2=a1+θ/b0eitv1+θa1+θ/bv-1θ-1dv=1θ-itθa1θ(1+θ/b)1θexp-it(1+θ/b)θaΓ-1θ,-it(1+θ/b)θa;

Therefore,

ϕt=1cθitθa1θ[expitθaΓ1θ,itθaexpit1+θ/bθaΓ1θ,it1+θ/bθa]

Appendix C: Proof of Theorem 4.3

We first note a series of Lemmas.

Lemma A.1.

nz-sΓ(s,z)nz=(-1)nz-s-nΓ(s+n,z)

Proof. See Gradshteyn and Ryzhik (1980). □

Lemma A.2.

-z-sΓs,zz=0=1s,s<0.

Proof.

z-sΓ(s,z)=z-szts-1e-tdt=1ys-1ezydy,-z-sΓ(s,z)z=0=1ys-1dy=-1s,s<0.

Lemma A.3. Define g(z,s)=ezz-sΓ(s,z). Then,

g(z,s)z=-g(z,s+1)+g(z,s);g(z,s)zz=0=-1s(s+1),(s<-1).

Proof.

g(z,s)z=ezz-sΓ(s,z)z=ezz-sΓ(s,z)z+ezz-sΓ(s,z)=ez-z-s-1Γ(s+1,z)+ezz-sΓ(s,z)(byLemmaA.1)=-g(z,s+1)+g(z,s);g(z,s)zz=0=-z-s-1Γ(s+1,z)z=0+z-sΓ(s,z)z=0=1s+1-1s(byLemmaA.2)=-1s(s+1),s<-1.

Lemma A.4.

gn(z,s)zn=k=0n(-1)knkg(z,s+k).

Proof. Denoting An=gn(z,s)/zn, we prove the lemma by induction. Supposing it is true for n, then for n+1,

An+1=Anz=k=0n(1)knkgz,s+kz=k=0n(1)knkgz,s+kgz,s+k+1byLemmaA.3=k=0n(1)knkgz,s+k+l=1n+1(1)lnl1gz,s+l=gz,s+k=1n(1)knkgz,s+k+l=1n(1)lnl1gz,s+l+(1)n+1gz,s+n+1=gz,s+k=1n(1)knkgz,s+k+l=1n(1)lnllnl+1gz,s+l+(1)n+1gz,s+n+1=gz,s+k=1n(1)kn+1kgz,s+k+(1)n+1g(z,s+n+1)=k=0n+1(1)kn+1kgz,s+k,

so it is true for n+1 as well. □

Lemma A.5.

gn(z,s)znz=0=k=0n-1(-1)kn-1k1s+ks+k+1,s+n<0.

Proof.

gn(z,s)znz=0=zgn-1(z,s)zn-1z=0=zk=0n-1(-1)kn-1kg(z,s+k)z=0(byLemmaA.4)=k=0n-1(-1)kn-1kg(z,s+k)zz=0=k=0n-1(-1)kn-1k-1s+ks+k+1byLemmaA.3.

Lemma A.6.

gn(z,s)znz=0=-n!s(s+1)(s+n),s+n<0.

Proof. Setting Bn(s)=gn(z,s)/znz=0, we again prove the lemma using induction. If it is true for n, then for n+1,

Bn+1(s)=k=0n(1)k(nk)1(s+k)(s+k+1)(byLemmaA.5)=1s(s+1)+k=1n1(1)k(nk)1(s+k)(s+k+1)+(1)n1(s+n)(s+n+1)=1s(s+1)+k=1n1(1)k[(n1k)+(n1k)knk]1(s+k)(s+k+1)+(1)n(s+n)(s+n+1)=1s(s+1)+k=1n1(1)k(n1k)1(s+k)(s+k+1)+k=1n1(1)k(n1k)knk1(s+k)(s+k+1)+(1)n1(s+n)(s+n+1)=Bn(s)+k=1n1(1)k(n1k1)1(s+k)(s+k+1)+(1)n1(s+n)(s+n+1)=Bn(s)l=0n2(1)l(n1l)1(s+l+1)(s+l+2)+(1)n1(s+n)(s+n+1)=Bn(s)l=0n1(1)l(n1l)1(s+l+1)(s+l+2)=Bn(s)Bn(s+1)=n!s(s+1)(s+n)n!(s+1)(s+2)(s+n+1)(becauseitistrueforn)=(n+1)!s(s+1)(s+n+1),

so it is true for n+1 as well. □

Based on these Lemmas we can prove Theorem 4.3 easily. Denote z1=-(it)/(θa), z2=-[it(1+θ/b)]/(θa) and s=-1/θ, then

ϕ(t)=αθz1-sez1Γs,z1-βθz2-sez2Γs,z2=αθgz1,s-βθgz2,s,
ϕntnt=0=αθgnz1,sz1nz1=0z1tn-βθgnz2,sz2nz2=0z2tn=αθ-n!s(s+1)(s+n)-iθan-βθ-n!s(s+1)(s+n)-i(1+θ/b)θan(byLemmaA.6)=αinann!(1-θ)(1-2θ)(1-nθ)-βinan(1+θ/b)nn!(1-θ)(1-2θ)(1-nθ),
EXn=i-nϕntnt=0=α1ann!(1-θ)(1-2θ)(1-nθ)-β1an(1+θ/b)nn!(1-θ)(1-2θ)(1-nθ)=n!can(1-θ)(1-2θ)(1-nθ)-n!(1+θ/b)n-1θcan(1-θ)(1-2θ)(1-nθ),θ<1n.

References

  1. Abramowitz M, Stegun IA (1965). Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables. New York: Dover Publications, Inc. [Google Scholar]
  2. Arnold BC (1983). Pareto Distributions. Fairland, MD: International Cooperative Publishing House, [Google Scholar]
  3. Arnold BC, Laguna L (1977). On Generalized Pareto Distributions with Applications to Income Data. International Studies in Economics, Monograph #10, Dept. of Economics, Iowa State University, Ames, IA. [Google Scholar]
  4. Duchateau L, Janssen P (2008). The Frailty Model. Springer: New York. [Google Scholar]
  5. DiDonato AR, Morris AH (1986). Computation of the incomplete gamma function ratios and their inverse. ACM Trans. Mathemat. Software 12:377–393. [Google Scholar]
  6. Gradshteyn IS, Ryzhik IM (1980). Tables of Integrals, Series, and Products. New York: Acedemic Press. [Google Scholar]
  7. Johnson NL, Kotz S, Balakrishnan N (1995). Continuous Univariate Distributions. New York: Wiley. [Google Scholar]
  8. Leemis LM, McQueston JT (2008). Univariate distribution relationships. The Amer. Statistician 62:45–53. [Google Scholar]
  9. Li Y, Wileyto EP, Heitjan DF (2010). Modeling smoking cessation data with alternating states and a cure fraction using frailty models. Statist. Med 29:627–638. [Google Scholar]
  10. Temme NM (1996). Uniform asymptotics for the incomplete gamma functions starting from negative values of the parameters. Meth. Applic. Anal 3:335–344. [Google Scholar]

RESOURCES