Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2010 Feb 22.
Published in final edited form as: Biometrika. 2005 Jun 1;92(2):271–282. doi: 10.1093/biomet/92.2.271

Empirical-likelihood-based semiparametric inference for the treatment effect in the two-sample problem with censoring

YONG ZHOU 1, HUA LIANG 2
PMCID: PMC2825713  NIHMSID: NIHMS175605  PMID: 20179776

Summary

To compare two samples of censored data, we propose a unified semiparametric inference for the parameter of interest when the model for one sample is parametric and that for the other is nonparametric. The parameter of interest may represent, for example, a comparison of means, or survival probabilities. The confidence interval derived from the semiparametric inference, which is based on the empirical likelihood principle, improves its counterpart constructed from the common estimating equation. The empirical likelihood ratio is shown to be asymptotically chi-squared. Simulation experiments illustrate that the method based on the empirical likelihood substantially outperforms the method based on the estimating equation. A real dataset is analysed.

Keywords: Estimating equation, Confidence interval, Coverage, Kaplan-Meier estimation, Empirical likelihood ratio, Empirical likelihood function

1 Introduction

Let X and Y be random variables representing some characteristic of two treatments, where X corresponds to the new treatment and Y to the control treatment. One may wish to compare the average treatment effects, so that Δ = E(X) − E(Y) is of interest. Alternatively, one may wish to consider the probability of one treatment being better than the other, which is equivalent to considering the survival competition probability, so that Δ = pr(X < Y) or Δ = pr(X > Y), or to compare the probability of death before a given time t0 in survival analysis, so that Δ = pr(X < t0) − pr(Y < t0).When X and Y are both sampled from parametric families, standard statistical approaches such as maximum likelihood can be used (Brownie et al., 1986; Campbell & Ratnarkhi, 1993; Goddard & Hinberg, 1990; Hsieh & Turnbull, 1996). There are also many nonparametric approaches for comparing the two unknown continuous distributions F and G of X and Y , respectively, based on independent samples; see for example Gastwirth & Wang (1988), Hollander & Korwar (1982), and Li et al. (1996).

Sometimes enough is known about one treatment, usually the control treatment, for a parametric form to be assumed for the distribution G. However, this may not hold for F. This leads to a semiparametric two-sample model. Recently much attention has been paid to semiparametric inference. Li et al. (1999) proposed a semiparametric estimator for quantile comparison. They studied two-sample inference through the quantile comparison function G{F−1(p)} assuming that G is known and F is unknown. A simpler case where Y is normally distributed has been studied by Hsieh & Turnbull (1996).

The contribution of this article is to develop a unified approach to inference about the parameter Δ, introduced in the first paragraph. Although a common estimating equation for Δ and the corresponding confidence interval are presented below, we focus on semiparametric inference about Δ by using empirical likelihood ratio methods; see for example Owen (1988, 1990, 2000), Qin (1994, 1999) and Qin & Lawless (1994). The empirical likelihood ratio has a limiting chi-squared distribution, leading to tests and confidence intervals ( Thomas and Grunkemeier, 1975) for a variety of problems, including linear models, generalised linear models and estimating equations. The empirical likelihood method has many advantages over competitors such as the normal-approximation-based method and the bootstrap method (Hall & La Scala, 1990). The appealing features of the empirical likelihood include improvement of the confidence region, increased accuracy of coverage through the use of auxiliary information, and easy implementation. We will derive the asymptotic distribution of the resulting empirical likelihood function in our context, explain how to establish the corresponding confidence interval, and illustrate the advantages numerically.

2 Estimation methods for a two-sample model with censored data

2.1 General framework

Assume that the variables {X10,X20,,Xn0} and {Y10,Y20,Ym0} are randomly censored by two sequences of random variables {U1, U2, · · ·, Un} and {V1, V2, · · ·, Vm}, respectively. The members of each sequence of random variables {Xi0}, {Yi0}, {Ui} and {Vi} are independent and identically distributed with survival functions S(x), D(y), K(u), and Q(v), respectively, where S(x) and K(u) are two unknown functions. The distribution function G(y) = 1 − D(y) is of known form and depends on the unknown d-dimensional parameter θ, i.e., G(y) = Gθ(y). We assume that Gθ(y) has the density function gθ(y). The observed sample are the pairs (Xi, δi), for i = 1, 2, · · · n, and the pairs (Yj, ηj), for j = 1, · · ·, m, where Xi=min(Xi0,Ui), δi=I(Xi0Ui),i=1,2,,n and Yj=min(Yj0,Vj), ηj=I(Yj0Vj),j=1,2,,m in which I(·) denotes an indicator function. We assume that the two samples of pairs are independent.

The parameter of interest, Δ, may be a functional of the functions S(x), D(y), K(u) and Q(v). The estimation of the unknown functions S(x) and K(u) and the parameter θ is a secondary goal; these are considered as nuisance parameters. For any measurement Δ of the difference between the two samples, we assume that the information about θ, Δ and F(x) = 1 − S(x) is available in the known form of an unbiased estimation function, i.e.,

EFψ(X0,θ0,Δ)=0, (2.1)

where θ0 is the true value of θ and ψ, which may be a vector of functions, is assumed to be a real function for the sake of simplicity. As mentioned before plausible candidates for Δ are the following: (i) the difference in the means of the two censored samples, so that Δ = E(X0) − θ0 and ψ(X0, θ0, Δ) = X0θ0 − Δ, where θ0 = EY0; (ii) the difference of probabilities for the events X0 < t0 and Y0 < t0 for a given constant t0, so that Δ = F(t0) − Gθ0(t0) and ψ(X0, θ0, Δ) = I(X0 < t0) − Gθ0(t0) − Δ; or (iii) the probability of the event I(X0 > Y0), so that Δ = E{1 − Gθ0(X0)} and ψ(X0, θ0, Δ) = 1 − Gθ0(X0) − Δ. Another candidate is (iv) a value on the receiver operating characteristic curve, so that Δ=1F{Gθ01(1p)} and ψ(X0,θ0,Δ)=1I{X0Gθ01(1p)}Δ for a given p ∈ (0, 1).

For the case of complete data, Qin (1997) used an empirical likelihood ratio statistic to test the hypothesis that the two populations have the same mean and constructed a confidence interval for the mean difference of two populations in a semiparametric model. This case and Li et al.’s (1999) work on the vertical quantile comparison function for two-sample tests can be viewed as special cases of ψ(·, ·, ·).

2.2 Estimation of Δ based on the common estimating equation

A naive and direct method of estimating Δ is to solve equation (2.1 ) with F and θ replaced by estimates, which may be obtained by maximum likelihood estimation.

The likelihood function based on the samples {(Xi, δi), i = 1, 2, · · ·, n} and {(Yj, ηj), j = 1, 2, · · ·, m} is

L(F,θ)=i=1n{F(Xi)F(Xi)}δi{1F(Xi)}1δij=1mgθηj(Yj){1Gθ(Yj)}1ηj,

provided Q(v) does not depend on θ. We assume that θ ∈ Θ ⊂ Rd. It can be shown that L(F, θ) attains its maximum value at the point (F^,θ^MLE), where θ^MLE is the maximum likelihood estimator based on the sample (Yj, ηj), and F^ is the Kaplan-Meier estimator:

F^(t)=1ut{1dN(u)Y(u)},

where N(u) = Σ I(Xiu, δi = 1) and Y(u) = Σ I(Xiu). The estimator of Δ, denoted by Δ^, is defined as the solution of the estimating equation 0τ1ψ(x,θ^MLE,Δ)dF^(x)=0, which is equivalent to

i=1nδiK^(Xi)ψ(Xi,θ^MLE,Δ)=0, (2.2)

where K^ is the Kaplan-Meier estimator of K, i.e.,

K^(t)=ut{1dNc(u)Y(u)},

in which Nc(u) = Σ I(Xiu, δi = 0), and τ1 = sup{t : S(t)K(t) > 0}.

We assume that m/nζ > 0 as n, m → ∞. The asymptotic normality of the estimator Δ^ is given in the following theorem.

THEOREM 1

Suppose that Assumptions 1-4 in the Appendix hold. Then

1ni=1nWniψ(Xi,θ^MLE,Δ)N(0,σψ2),

in distribution, where Wni=δiK^(Xi), σψ2=σ2(θ,Δ)+β0T(θ,Δ)Σ1β0(θ,Δ), Σ = ζ−1I(θ) with I(θ) given in Assumption 2, σ2(θ,Δ)=var{ψ(X,θ,Δ)}+E0τ1{ψ(X,θ,Δ)G(θ,u)}2I(Xu)λc(u)K(u)du, G(θ, u) = S−1(u)E{ψ(X, θ, Δ)I(Xu)}, β0(θ,Δ)=0τ1ψ.θ(u,θ,Δ)dF(u), and ψ.θ(u,θ,Δ)=ψ(x,θ,Δ)θ, λc(t) is the hazard function of U. Furthermore, if Assumption 5 holds, then

n(Δ^Δ)N(0,σΔ2),

in distribution, where σΔ2=σψ2γ2, and γ(θ,Δ)=E{ψ.Δ(X,θ,Δ)}, in which ψ.Δ(x,θ,Δ)=ψ(x,θ,Δ)Δ.

One application of Theorem 1 is to construct the confidence interval for Δ, for which we need to estimate σΔ2. This can be done by using a plug-in method. Let

σ^12(θ^MLE,Δ^)=1ni=1nδiψ2(Xi,θ^MLE,Δ^)K^(Xi)+1n0ΛdNc(u)K^2(u){G^1(θ^MLE,u)G^2(θ^MLE,u)},

where Λ is the maximum of the uncensored observations,

G^1(θ,u)=1nS^(u)i=1nδiψ2(Xi,θ,Δ^)I(Xiu)K^(Xi), (2.3)
G^2(θ,u)={1nS^(u)i=1nδiψ(Xi,θ,Δ^)I(Xiu)K^(Xi)}2, (2.4)
β^0(θ^MLE,Δ^)=1ni=1nWniψ.θ(Xi,θ^MLE,Δ^),γ^(θ^MLE,Δ^)=1ni=1nWniψ.Δ(Xi,θ^MLE,Δ^). (2.5)

Replace σ2(θ, Δ), β0(θ, Δ), γ(θ, Δ) and Σ in σΔ2 by their estimators σ^12(θ^MLE,Δ^), β^0(θ^MLE,Δ^), γ^(θ^MLE,Δ^) and Σ^=ζn2g(θ^MLE)θ2, to obtain an estimator σ^Δ2 of σΔ2. It is easy to prove that the estimator is consistent. As a consequence, the approximate 100(1 − α)% confidence interval of Δ is (Δ^z1α2σ^Δ2,Δ^+z1α2σ^Δ2), where z1−α/2 denotes the 1 −α/2 quantile of the standard normal distribution.

2.3 Empirical likelihood inference

In this section, we apply the empirical likelihood principle to obtain a confidence interval for Δ. To deal with the censoring we propose an adjusted empirical likelihood function and prove that the resulting log likelihood ratio is still asymptotically chi-squared under mild conditions.

Let θ0 be the true value of parameter θ. Note that the auxiliary information depends only on the unknown distributions F and K and the parameter θ0, since E {δψ(X, θ0, Δ)/K(X−)} = 0. If the function K(·) were known, then we could infer about Δ based on the observation sample {δi/K(Xi−), i = 1, 2, · · ·, n} and the sample {(Yj, ηj), j = 1, 2, · · ·, m}. Let Fp be the distribution function that assigns probabilities pi at points δi/K(Xi−). Then the adjusted empirical likelihood is Ladj(θ,Δ)=i=1npij=1mgθηj(Yj){1Gθ(Yj)}1ηj, where i=1npi=1 and pi ≥ 0 for each i, and the auxiliary information (2.1) is expressed as

i=1nδiK(Xi)ψ(Xi,θ,Δ)=0. (2.6)

Again, we replace K(·) with its Kaplan-Meier estimator K^(), and our adjusted empirical likelihood, evaluated at Δ and θ, is defined by Ladj(θ, Δ) subject to the constraints

i=1npi=1,pi0,fori=1,2,,n, (2.7)
i=1npiδiK^(Xi)ψ(Xi,θ,Δ)=0. (2.8)

Without the restrictions (2.7) and (2.8), the adjusted empirical likelihood has the maximum value nnj=1mgθ^MLEηj(Yj){1Gθ^MLE(Yj)}1ηj, where θ^MLE is the maximum likelihood estimate of θ based on the sample {(Yj, ηj), j = 1, 2, · · ·, m}, as in §2.2. Our adjusted semiparametric empirical likelihood ratio statistic is

R(Δ)=supP,θi=1n(npi)j=1mgθηj(Yj){1Gθ(Yj)}1ηj[j=1mgθ^MLEηj(Yj){1Gθ^MLE(Yj)}1ηj]1,

where P denotes the region of possible pi subject to the constraints (2.7) and (2.8). We first investigate R(Δ) for a given θ, which we denote by R(θ, Δ). The empirical log likelihood ratio of R(θ, Δ) is described as L(θ,Δ)=supPi=1nlog(npi)+g(θ)g(θ^MLE), where g(θ)=j=1mlog[gθηj(Yj){1Gθ(Yj)}1ηj]. Provided that i=1nψ2(Xi,θ,Δ)Wni2>0 and the origin is inside the convex hull of {ψ(Xi, θ, Δ)Wni, i = 1, 2, · · · n}, where Wni=δiK^(Xi), there exits a unique maximum value of L(θ,Δ) or R(θ, Δ) (Qin & Lawless, 1994). By the Lagrange multiplier method, we conclude that L(θ,Δ) attains its maximum value at p^i=n1{1+λ(θ)Wniψ(Xi,θ,Δ)}1, for i = 1, 2, · · ·, n. Alternatively, the estimated empirical log likelihood ratio is

L(θ,Δ)=i=1nlog{1+λ(θ)Wniψ(Xi,θ,Δ)}+g(θ)g(θ^MLE), (2.9)

and the adjusted empirical log likelihood is defined by

EL(θ,Δ)=i=1nlog{1+λ(θ)Wniψ(Xi,θ,Δ)}+g(θ), (2.10)

where λ(θ) is determined by the equation:

1ni=1nψ(Xi,θ,Δ)Wni1+λ(θ)Wniψ(Xi,θ,Δ)=0. (2.11)

In the following, we use ∥ · ∥ to denote the Euclidean norm.

THEOREM 2

Suppose that Assumptions 1-4 in the Appendix hold. Then (i) ℓEL(θ, Δ) attains its maximum value at some point θ^EL in the interior of the ballθθ0∥ < nϱ (1/3 < ϱ < 1/2) such that θ^EL and λ^=λ(θ^EL) satisfy EL(θ^EL,Δ)θ=0 and (2.11); and (ii) the maximiser θ^EL of R(θ, Δ) is strongly consistent and such that 2ρ(θ,Δ)L(θ^EL,Δ)χ(1)2, in distribution, where

ρ(θ,Δ)=σ12(θ,Δ)+β0T(θ,Δ)Σ1β0(θ,Δ)σ02(θ,Δ)+β0T(θ,Δ)Σ1β0(θ,Δ),

in which Σ = ζ−1I(θ) with I(θ) given in Theorem 1, and σ02(θ,Δ)=0τ1ψ2(u,θ,Δ)K(u)dF(u).

Were it not for the term ρ(θ, Δ), the confidence interval could be easily derived without estimating variance as in traditional empirical likelihood (Owen, 2000, p.23). With censorship, the situation becomes complex, and we need to estimate ρ(θ, Δ). As in §2.2, we replace σ2(θ, Δ), β0(θ, Δ) and Σ in ρ(θ, Δ) by their estimators σ^2(θ^EL,Δ^) and β^0(θ^EL,Δ^), which are defined in a similar way to (2.3)-(2.5), and Σ^=(ζn)2g(θ^EL)θ2. The quantity σ02(θ,Δ) is estimated by σ^02(θ^EL,Δ^)=n1i=1n{Wniψ(Xi,θ^EL,Δ^)}2, where Δ^ is a consistent estimator of Δ. The resulting ρ^(θ^EL,Δ^) is a consistent estimator of ρ(θ, Δ). The confidence interval of Δ based on empirical likelihood can then be established. Another approach is not to estimate Δ in the adjusted factor ρ(θ, Δ). A confidence interval can then be constructed on the basis of the quantity 2ρ(θ^EL,Δ)L(θ^,Δ).

Corollary 1

Suppose that the Assumptions of Theorem 2 hold. Then limnpr(ΔIα)=1α, where Iα={Δ:2ρ(θ^EL,Δ)L(θ^,Δ)cα}, and pr(χ(1)2cα)=1α.

Theorem 2 showed that the maximum point is close to the true value, and the adjusted empirical log likelihood ratio is distributed approximately as chi-squared at the maximum point. One way of obtaining this maximum point is to consider ∂ℓEL(θ, Δ)/∂θ = 0, which results in the score equation of semiparametric empirical likelihood, i.e.,

1nλ(θ)i=1nψ.θ(Xi,θ,Δ)Wni1+λ(θ)ψ(Xi,θ,Δ)Wni=1mj=1mlog[gθηj(Yj){1Gθ(Yj)}ηj]θ. (2.12)

There may be multiple solutions of (2.12) but, if (2.12) has a unique root, this root is θ^EL.

An obvious result holds when we use θ^MLE in (2.9), as stated in the following theorem.

THEOREM 3

Suppose that Assumptions 1-4 hold. Then 2r(θ,Δ)L(θ^MLE,Δ)χ(1)2, in distribution, where r(θ,Δ)=σ12(θ,Δ)σ02(θ,Δ).

Theorem 3 can be proved in a similar way to Theorem 2 by using Lemma A.3 and Taylor expansion. We omit the details.

Theorem 3 indicates that the empirical likelihood function based on the estimator θ^MLE is still asymptotically chi-squared, but its asymptotic variance is bigger than that of θ^EL (see Lemma A.5). A confidence interval is available, based on the result of Theorem 3.

3 Simulation Study

In our simulation experiments, we consider only one-dimensional θ for the sake of simplicity. We mainly compare the lengths and coverages of confidence intervals based on θ^EL and the estimating equation. Note that closed-form solutions of (2.11) and (2.12) rarely exist, and we use the Newton-Raphson algorithm for implementation. Codes in S-plus are developed which are modified from El.s, a function written by A.B.Owen.

Example 1

Let X0 and Y0 be independent Un(0, 6) and Un(0, 10) random variables. The censored random variables U and V are generated by the uniform distributions Un(0, cx) and Un(0, cy), respectively. The parameters cx and cy are used to control the rate of censored data; the larger the cx and cy, the lighter the censorship. The four choices used for (cx, cy) are a combination of cx = 12.5 and cx = 15 with cy = 15 and cy = 20. We are concerned with semiparametric inference for Δ = E(X) − θ0 = −2. The simulation study is conducted as follows. For each of the 4 censoring configurations, sample sizes (n, m) = (30, 30), (50, 50) and (100, 100) are considered. For each censoring configuration and sample size, 2000 independent sets of data are generated. The confidence interval is based on the 95% level, and Table 1 summarises the results. The confidence interval for Δ based on the adjusted empirical likelihood is almost symmetric about the true value. Coverages based on the empirical likelihood method are consistently close to the nominal level, and the coverages of the normal approximation are all larger than that of the empirical likelihood method and the nominal level. The lengths of the confidence intervals are inversely proportional to square root of the sample size and censoring degree of sample X, and the confidence interval based on the empirical likelihood method is consistently and substantially shorter than its normal counterpart.

Table 1.

Example 1. Confidence intervals and the corresponding coverage based on the empirical likelihood (EL) and normal approximation (Norm) methods

Sample size CI(95%) coverage
n m cx cy EL Norm EL Norm
30 30 12.5 15 (−2.686, −1.311) (−3.153, −0.847) 94.46 97.39
20 (−2.682, −1.311) (−3.354, −0.646) 94.13 98.60
15 15 (−2.652, −1.313) (−3.106, −0.894) 94.66 97.72
20 (−2.688, −1.347) (−3.399, −0.601) 95.28 98.15
50 50 12.5 15 (−2.531, −1.457) (−2.753, −1.247) 95.02 97.84
20 (−2.534, −1.46) (−2.904, −1.096) 95.68 98.54
15 15 (−2.522, −1.469) (−2.751, −1.249) 95.07 97.53
20 (−2.530, −1.477) (−2.926, −1.074) 94.9 98.34
100 100 12.5 15 (−2.386, −1.619) (−2.442, −1.558) 94.35 96.60
20 (−2.379, −1.613) (−2.519, −1.481) 93.78 97.64
15 15 (−2.374, −1.625) (−2.433, −1.567) 94.45 97.35
20 (−2.369, −1.619) (−2.498, −1.502) 94.98 97.69

CI(95%), approximate 95% confidence interval.

Example 2

We use a scenario similar to that in Example 1, except that X0 and Y0 are exponential random variables with density functions 1/6exp(−x/6) and 1/10 exp(−x/10), respectively. Here, we consider Δ = pr(X0 > Y0). Consequently, θ0 = 10, ψ(x, θ0, Δ) = exp(−x/θ0) − Δ, and Δ = 0.625. The simulation results are given in Table 2. The conclusions are the same as those in Example 1, in general. However, the confidence interval based on the empirical likelihood method is not symmetric about the true value in this example. These confidence intervals also depend more upon the censorship of X than did those in Example 1.

Table 2.

Example 2. Confidence intervals and the corresponding coverage based on the empirical likelihood (EL) and normal approximation (Norm) methods

Sample size CI(95%) coverage
n m cx cy EL Norm EL Norm
30 30 12.5 15 (0.403, 0.777) (0.245, 1.005) 96.90 100
20 (0.411, 0.785) (0.274, 0.976) 97.20 100
15 15 (0.46, 0.792) (0.254, 0.996) 98.05 100
20 (0.455, 0.789) (0.282, 0.968) 98.35 100
50 50 12.5 15 (0.460, 0.755) (0.333, 0.917) 96.55 100
20 (0.461, 0.758) (0.355, 0.895) 97.40 100
15 15 (0.506, 0.769) (0.341, 0.909) 98.15 100
20 (0.506, 0.768) (0.363, 0.887) 98.35 100
100 100 12.5 15 (0.507, 0.723) (0.420, 0.830) 96.05 100
20 (0.508, 0.722) (0.435, 0.815) 96.65 100
15 15 (0.553, 0.742) (0.426, 0.824) 98.10 100
20 (0.551, 0.740) (0.440, 0.810) 98.30 100

CI(95%), approximate 95% confidence interval.

4 Real Data Analysis

Mantel et al. (1977) reported a litter-matched study of the tumourigenesis of a drug. One rat was randomly selected from each of 50 litters and given the drug. Two rats from each litter were selected as controls and given a placebo; see Mantel et al. (1977) for the details of the study. We are interested in comparing the average survival times of the treatment and control groups. Let X and Y be the survival times of the treatment and control groups. We first calculate the Kaplan-Meier estimate of the survival distribution of Y. A Q-Q plot then shows that the exponential-distribution assumption for Y is appropriate. Consequently, the estimated average survival time of the control group is 476, and the estimate of the difference of the averages, Δ = E(X) − E(Y), of the survival times is −240. The 95% confidence intervals for Δ based on the normal approximation and our empirical likelihood method are (−676.5, 195.4) and (−258.8, −68.7), respectively. The conclusions drawn from these two confidence intervals are puzzling because the first interval suggests that there is no significant difference, but the second interval implies the contrary. Recalling the performance of the two methods in the simulation experiment, we prefer the conclusion based on the empirical likelihood method. For comparison, we also conducted a log-rank test on the dataset and obtained a p–value less than 10−3, which reinforces our conclusion based on the empirical likelihood method. Moreover, the 99% confidence interval (empirical likelihood) is (−304.7, −47.4), which indicates a significant difference at the 1% significance level.

5 Conclusion

The finite-sample behaviour of our method seems superior to that of the common estimating equation. Furthermore implementation of the normal-approximation method requires ψ(x, θ0, Δ) to be continuously differentiable in Δ, whereas, this assumption is unnecessary for constructing the confidence interval in the empirical likelihood framework. In addition, in absence of censoring, the factors ρ(θ, Δ) and r(θ, Δ) equal 1, so that the confidence interval based on empirical likelihood does not require the calculation of any covariance or variance. The factors ρ(θ, Δ) and r(θ, Δ) can therefore be viewed as the cost of censoring.

If the function ψ(x, θ0, Δ) in equation (2.1 ) is discontinuous at θ0, then neither the normal-approximation-based method nor the empirical likelihood method is applicable even in the absence of censoring. In particular, this happens if ψ(x, θ0, Δ) includes the term I{X0 < Gθ0(p)}. Hence, the current version is not applicable to the problems of two-sample quantile comparison, and inference for the ROC curve. In principle, the method proposed in this paper can be extended to cover these cases by incorporating smoothing techniques. Detailed investigation of these issues will be discussed in further research.

Acknowledgement

The authors thank the Editor, an associate editor and a referee for their helpful suggestions and constructive comments. Zhou’s research was partially supported by the National Natural Science Foundation of China. Liang’s research was partially supported by the American Lebanese Syrian Associated Charities and a grant from the National Institute of Allergy and Infectious Diseases.

APPENDIX

Theory

Assumption 1

We assume that τSτK, where τS = sup{x : S(x) > 0} and τK = sup{x : K(x) > 0}, and

0τ1ψ2(u,θ,Δ)K(u)dF(u)<.

Assumption 2

The functions ψ(x, θ, Δ) and ψ.θ(x,θ,Δ) are continuous and bounded by some function M(x) in a neighbourhood of the true value θ0 such that, 0τ1M(u)K2(u)dF(u)<, and EFψ.θ(X,θ,Δ)0.

Assumption 3

The density function gθ(y) is three times differentiable with respect to θ on A = {y : gθ(y) > 0}, which is assumed to be independent of θ. There exists a function M1(y) such that, for any yA and θ in the neighbourhood of θ0, Eθ|M1(Y)| < ∞.

Assumption 4

The information matrix I(θ) with entries

Iij=0τ22loggθ(y)θiθj{1Q(y)}gθ(y)dy0τ22log{1Gθ(y)}θiθj{1Gθ(y)}dQ(y),

for i, j = 1, · · ·, d, which are continuous, is positive definite. Here τ2 = sup{t : D(t)Q(t) > 0}.

Assumption 5

The function ψ(x, θ, Δ) is continuously differentiable in Δ, and E{ψ.Δ(X,θ,Δ)}0.

We next state preliminary lemmas. Their detailed proofs are referred to Zhou and Liang (2004).

Lemma A.1

Under Assumptions 1 and 2, we have that

1ni=1nWniψ(Xi,θ0,Δ)N{0,σ2(θ0,Δ)},

in distribution, where σ2(θ0, Δ) is defined in Theorem 2.

Lemma A.2

Suppose that Assumptions 1, 2 and 5 hold. Then

1ni=1n{ψ(Xi,θ,Δ)δiK^(Xi)}2=σ02(θ,Δ)+oP(1),1ni=1nψ.θ(Xi,θ,Δ)δiK^(Xi)=β0(θ,Δ)+oP(1),and1ni=1nδiψ.Δ(Xi,θ,Δ)K^(Xi)=γ(θ,Δ)+oP(1).
Lemma A.3

Suppose that Assumptions 1-4 hold. Then λ(θ) = OP(nϱ) uniformly on {θ : ∥θθ0∥ ≤ cnϱ}, where 1/3 < ϱ < 1/2, c is some constant, and

λ(θ)=i=1nψ(Xi,θ,Δ)Wnii=1n{ψ(Xi,θ,Δ)Wni}2+oP(nϱ)

uniformly on {θ : ∥θθ0∥ ≤ cnϱ}, where λ(θ) satisfies (2.11).

Lemma A.4

Assume that the conditions of Theorem 2 hold, and that ϱ is defined as in Lemma A.1 . Then R(θ, Δ) attains its local maximum value at θ^EL such that θ^ELθ0=O(nϱ), almost surely and θ^EL is a root of (2.12).

Lemma A.5

Suppose that Assumptions 1-4 hold. Then n(θ^ELθ0λ^)N(0,Σ1), in distribution, where

Σ1={Cθσ2Σ1β0β0TΣ1+(Σβ0σ02β0T)1Σ(Σβ0σ02β0T)1Cθβ0T(Σβ0σ02β0T)1Cθ2σ2β0TΣ1Cθ2σ2+Cθ2β0Tσ02β0},

Cθ=(σ02+β0TΣ1β0)1, σ02=σ02(θ,Δ) and β02=β02(θ,Δ) as given in Theorem 2.

Proof of Theorem 1

By Assumption 5, we have

1ni=1nψ(Xi,θ^MLE,Δ)=1ni=1nWniψ(Xi,θ,Δ)+γ(θ,Δ)n(θ^MLEθ)+oP(1).

Theorem 1 follows from Lemmas A.1 and A.2 .

Proof of Theorem 2

The result in (i) is a direct consequence of Lemma A.3 . We now prove result (ii). By using Lemma A.4 and Taylor expansion, we can show that

2L(θ^EL,Δ)=2λ^i=1nψ(Xi,θ^EL,Δ)Wni+λ^2i=1n{ψ(Xi,θ^EL,Δ)Wni}22gT(θ^MLE)θ(θ^ELθ^MLE)+(θ^ELθ^MLE)T2g(θ^MLE)θθT(θ^ELθ^MLE)+oP(1). (A.1)

Note that (2.11) implies that

1ni=1nψ(Xi,θ^EL,Δ)Wni=λ^ni=1n{ψ(Xi,θ^EL,Δ)Wni}2+oP(nϱ); (A.2)

see Lemma A.3 . Recall that θ^MLE is the maximiser of g(θ) and that g(θ^MLE)θ=0. This implies that

1ngT(θ^EL)θ=1n2gT(θ^MLE)θθT(θ^ELθ^MLE)+oP(nϱ).

On the other hand, (2.12) and Lemma A.2 yield that

1ng(θ^EL)θ=λ^ni=1nψ.θ(Xi,θ^EL,Δ)Wni1+λ^ψ(Xi,θ^EL,Δ)Wni=λ^β0+oP(1).

As a result,

θ^ELθ^MLE=λ^{1n2g(θ^MLE)θθT}1β0+oP(1). (A.3)

Noting the fact that 1n2g(θ^MLE)θθTζ1I(θ), together with (A.1)-(A.3), we obtain that

2L(θ^EL,Δ)=(nλ^)2[1ni=1n{ψ(Xi,θ^EL,Δ)Wni}2+β0TΣ1β0]+oP(1).

Using Lemmas A.2 and A.5 , we complete the proof of Theorem 2.

Contributor Information

YONG ZHOU, Institute of Applied Mathematics, Chinese Academy of Sciences, Beijing, 100080, China, yzhou@amss.ac.cn.

HUA LIANG, Department of Biostatistics, St. Jude Children’s Research Hospital, Memphis, Tennessee 38105, USA, hua.liang@stjude.org.

References

  1. Brownie C, Habicht JP, Cogill B. Comparing indicators of health or nutritional status. Am. J. Epidemiol. 1986;124:1031–44. doi: 10.1093/oxfordjournals.aje.a114473. [DOI] [PubMed] [Google Scholar]
  2. Campbell G, Ratnaparkhi MV. An application of Lomax distributions in receiver operating characteristic (ROC) curve analysis. Comp. Statist. 1993;22:1681–97. [Google Scholar]
  3. Gastwirth JL, Wang JL. Control percentile test for censored data. J. Statist. Plan. Infer. 1988;18:267–76. [Google Scholar]
  4. Goddard MJ, Hinberg I. Receiver operating characteristic (ROC) curves and non-normal data: an empirical study. Statist. Med. 1990;9:325–37. doi: 10.1002/sim.4780090315. [DOI] [PubMed] [Google Scholar]
  5. Hall P, La Scala B. Methodology and algorithms of empirical likelihood. Int. Statist. Rev. 1990;58:109–27. [Google Scholar]
  6. Hsieh F, Turnbull BW. Nonparametric and semiparametric estimation of the receiver operating characteristic curve. Ann. Statist. 1996;24:25–40. [Google Scholar]
  7. Hollander M, Korwar RW. Nonparametric Bayesian estimation of the horizontal distance between two populations. In: Gnedenko BV, Puri ML, Vincze I, editors. Nonparametric Statistical Inference I. North Holland; New York: 1982. pp. 409–16. [Google Scholar]
  8. Kaplan EL, Meier P. Nonparametric estimation from incomplete observations. J. Am. Statist. Assoc. 1958;53:457–81. [Google Scholar]
  9. Li G, Tiwari RC, Wells MT. Quantile comparison function in two-sample problems: with applications to comparisons of diagnostic markers. J. Am. Statist. Assoc. 1996;91:689–98. [Google Scholar]
  10. Li G, Tiwari RC, Wells MT. Semiparametric inference for a quantile comparison function with application to receiver operating characteristic curves. Biometrika. 1999;86:487–502. [Google Scholar]
  11. Mantel N, Bohidar NR, Ciminera JL. Mantel-Haenszel analysis of litter-matched time-to-response data, with modifications for recovery of interlitter information. Cancer Res. 1977;37:3863–8. [PubMed] [Google Scholar]
  12. Owen AB. Empirical likelihood ratio confidence intervals for a single functional. Biometrika. 1988;75:237–49. [Google Scholar]
  13. Owen AB. Empirical likelihood confidence regions. Ann. Statist. 1990;18:90–120. [Google Scholar]
  14. Owen AB. Empirical Likelihood. Chapman and Hall; London: 2000. [Google Scholar]
  15. Qin J. Semi-empirical likelihood ratio confidence intervals for the difference of two sample means. Ann. Statist. 1994;46:117–26. [Google Scholar]
  16. Qin J. Empirical likelihood ratio based confidence intervals for mixture proportions. Ann. Statist. 1999;27:1368–84. [Google Scholar]
  17. Qin J, Lawless J. Empirical likelihood and general estimating equations. Ann. Statist. 1994;22:300–25. [Google Scholar]
  18. Thomas DR, Grunkemeier GL. Confidence interval estimation of survival probability for censored data. J. Am. Statist. Assoc. 1975;70:865–71. [Google Scholar]
  19. Zhou Y, Liang H. Discussion paper. Department of Biostatistics, St Jude Children’s Research Hospital; Memphis, USA: 2004. Empirical-likelihood-based semiparametric inference for the treatment effect in the two-sample problem with censoring. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES