Empirical-Likelihood-Based Inferences for Generalized Partially Linear Models

Hua Liang; Yongsong Qin; Xinyu Zhang; David Ruppert

. Author manuscript; available in PMC: 2010 Nov 26.

Published in final edited form as: Scand Stat Theory Appl. 2009 Sep;36(3):433–443.

Empirical-Likelihood-Based Inferences for Generalized Partially Linear Models

Hua Liang ¹, Yongsong Qin ², Xinyu Zhang ³, David Ruppert ⁴

PMCID: PMC2992389 NIHMSID: NIHMS175584 PMID: 21113340

Abstract

This paper considers generalized partially linear models. We propose empirical likelihood based statistics to construct confidence regions for the parametric and nonparametric componenets. The resulting statistics are shown to be asymptotically chi-squared distributed. Finite sample performance of the proposed statistics is assessed by simulation experiments. The proposed methods are applied to a dataset from an AIDS clinical trial.

Keywords: Confidence region, generalized additive models, least favorable curve, local linear regression, semiparametric estimation

1 Introduction

Generalized partially linear models (GPLM), a generalization of partially linear models to possibly non-Gaussian responses, assume that the conditional expectation of the response variables given the covariates can be represented as

E (Y | X, T) = μ {X' β + θ (T)}, Var (Y | X, T) = σ^{2} V (μ),

(1)

where μ = μ{X′β + θ(T)}, μ(·) is a known link function, V (·) is a known function, β is an unknown p × 1 vector, T ∈ R^q, θ is an unknown smooth function, and σ² is an unknown scalar parameter. Assume that T takes values in 𝒯, a closed rectangle in R^q. We assume that (Y_i, X_i, T_i), i = 1, 2, ⋯, n, are independent and identically distributed data from model (1). GPLMs allow easier interpretation of the effect of each variables and are preferable to general nonparametric models (Stone, 1980) since they provides a partial remedy to the “curse of dimensionality,” especially when q is small as is often the case. GPLMs are more flexible than the standard GLM because they combine both parametric and nonparametric components when it is believed that the E(Y|X, T) depends on variable X in a linear way but is nonlinearly related to other independent variables, T.

A special class of the GPLMs, partially linear models, have been intensively studied in literature. See for example, Engle et al. (1986), Speckman, (1988), Härdle, Liang & Gao (2000) and references therein. For GPLMs, Severini & Staniswalis (1994) applied the quasilikelihood principle proposed by Severini & Wong (1992), and Carroll et al. (1997) proposed two different estimation algorithms based on quasilikelihood and local kernel methods. Related topics have recently been studied by Lin & Carroll (2001) for longitudinal data, and Liang & Ren (2005) for measurement errors. It is worth pointing out that the quasilikelihood approach for the GPLM is different from the kernel-based smoothing method for partially linear models. The latter is simple and noniterative because the closed form of the estimators is available, while the former needs an iterative algorithm and an undersmoothing bandwidth.

Under mild regularity conditions, Severini & Staniswalis (1994) derived the asymptotics for the estimators of β and θ(t) that they proposed. In principle, these asymptotic results can be used to construct asymptotically correct confidence intervals of the parameters and pointwise confidence intervals for the nonparametric function. The finite-sample performance of the resulting confidence intervals may not be appealing because the complex structure of the covariance matrix, which needs to be estimated with estimates plugged-in for several parameters. In this paper, we propose an alternative for constructing regions for β and θ(t) using the empirical likelihood principle, which was originally studied by Hartley & Rao (1968) for sample surveys and by Thomas & Grunkemeier (1975) for survival analysis. Owen (2001) gave a comprehensive survey for empirical likelihood methods and related topics. The empirical likelihood method has many advantages over its competitors such as the normal-approximation-based method and the bootstrap method (see Hall & La Scala, 1990). These advantages include improvement of the confidence region, increase of accuracy of coverage because of using auxiliary information, easy implementation, avoiding estimating variances, and studentising automatically. Because of these features, the applications of empirical likelihood in parametric and nonparametric models have received a great amount of attention.

More recently, empirical likelihood based inference has been developed for semiparametric models, e.g., by Zhu & Xue (2006) who developed empirical likelihood confidence regions for the parameters of partially linear single-indexmodels. However, most research on empirical likelihood inference for semiparametric models has focused on the finite-dimensional parameter and has assumed a continuous response variables. In this paper, we study empirical likelihood inference for both the finite-dimensional parameters and the nonparametric functions in semiparametric models and we allow the response variable to be discrete. Our procedure is a generalization of empirical likelihood procedure to a combination of generalized linear models and nonparametric regression. This generalization is by no means straightforward. In Section 2, we will define the empirical likelihood ratio statistics for β and θ(t), derive the asymptotic distributions of the resulting empirical likelihood statistics, and explain how to establish the corresponding CI. In Section 3 we report the results of a simulation experiment to explore the finite sample performance of the proposed confidence intervals. The proposed methods will used to analyze a real dataset in Section 4. Section 5 gives a discussion. All technical derivations are given in the Appendix.

2 Empirical Likelihood Methods

Several authors have applied empirical likelihood to partially linear models, a special case of the GPLM. For example, Shi & Lau (1999) proposed an empirical likelihood based confidence interval for the parameters of a partially linear model. Qin & Jing (2001) and Wang & Li (2002) considered the case in which the response variables Y_i are random censored. These authors proposed an empirical likelihood ratio for β and derived its asymptotic distribution, which is a sum of independent chi-squared distributions with unknown weights.

We first review briefly the quasi-likelihood estimators of β and θ(t) proposed by Severini & Staniswalis (1994). Denote the quasi-likelihood function by

Q (μ, y) = \int_{μ}^{y} \frac{s - y}{V (s)} ds .

Under some regularity conditions, ∑_i Q(μ, Y_i) behaves like a log-likelihood function for μ based on Y₁, ⋯, Y_n and Q(μ, y) behaves like the logarithm of a density function for Y. Let K denote a kernel on R^q, and h = h_n denote a sequence of bandwidths. For each fixed t and β, let θ̂_β(t) denote the solution in η of

\sum_{i = 1}^{n} K (\frac{t - T_{i}}{h}) \frac{\partial}{\partial η} Q {μ (η + X_{i}^{'} β), Y_{i}} = 0 .

(2)

Let 𝒯₀ denote a compact subset of int(T) and let I_i = 1 if T_i ∈ 𝒯₀ and 0 otherwise. Given the estimator θ̂_β(t), an estimator of β, β̂ is then obtained by solving

\sum_{i = 1}^{n} I_{i} \frac{\partial}{\partial β} Q {μ ({\hat{θ}}_{β} (T_{i}) + X_{i}^{'} β), Y_{i}} = 0 .

(3)

The quasi-likelihood estimator of θ(t) is given by θ̂_β̂(t). The trimming by I_i of data near the boundary is employed to reduce boundary bias, which, for kernel regression estimators, can be quite serious and converges to zero at a slower rate than in the interior. In the univariate case, when q = 1, either a boundary-corrected kernel estimator or locally linear kernel estimator may be used instead. Although either of these methods may be extended to the multivariate case, the resulting technical details for the development of the asymptotic theory become cumbersome. For ease of notation, we present our results for the case q = 1 in the remainder of this paper.

2.1 Confidence region for β

Let β₀ denote the true value of β. Write

ω_{1} {β, {\hat{θ}}_{β} (T_{i}), Y_{i}, X_{i}, T_{i}} = I_{i} \frac{\partial}{\partial β} Q [μ {{\hat{θ}}_{β} (T_{i}) + X_{i}^{'} β}, Y_{i}] .

Based on the estimating equation (3) for β, we propose the empirical likelihood ratio statistic for β as follows.

ℓ_{1} (β) = - 2 sup_{p_{1}, \dots, p_{n}} \sum_{i = 1}^{n} log ({np}_{i}),

where p_i, i = 1, ⋯, n, are nonnegative numbers which satisfy

\sum_{i = 1}^{n} p_{i} = 1, \sum_{i = 1}^{n} p_{i} ω_{1} {β, {\hat{θ}}_{β} (T_{i}), Y_{i}, X_{i}, T_{i}} = 0 .

By the Lagrange multiplier method, it can be shown that

ℓ_{1} (β) = 2 \sum_{i = 1}^{n} log [1 + λ_{1}^{'} ω_{1} {β, {\hat{θ}}_{β} (T_{i}), Y_{i}, X_{i}, T_{i}}]

where λ₁ is determined by

\frac{1}{n} \sum_{i = 1}^{n} \frac{ω_{1} {β, {\hat{θ}}_{β} (T_{i}), Y_{i}, X_{i}, T_{i}}}{1 + λ_{1}^{'} ω_{1} {β, {\hat{θ}}_{β} (T_{i}), Y_{i}, X_{i}, T_{i}}} = 0 for β in a neighbourhood of β_{0} .

(4)

The asymptotic distribution of the empirical likelihood ratio statistic ℓ₁(β₀) is established in Theorem 2.1. Its proof is given in the Appendix.

Theorem 2.1

Suppose that nh⁴ → 0 and the conditions (a)–(e) in the Appendix are satisfied. Then, as n → ∞,

ℓ_{1} (β_{0}) \overset{d}{\to} χ_{p}^{2},

where β₀ is the true parameter value and $χ_{p}^{2}$ is a chi-square distributed random variable with p degrees of freedom.

Therefore, CI_β = {β|ℓ₁(β) ≤ c_α} is a 1 − α confidence region for β₀ where c_α satisfies $P (χ_{p}^{2} \leq c_{α}) = 1 - α$ .

2.2 Pointwise confidence region for θ(t)

Let η = θ(t) for fixed t ∈ 𝒯₀, and β̂ be a $\sqrt{n}$ −consistent estimator of β₀. Denote

ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i}) = K (\frac{t - T_{i}}{h}) \frac{\partial}{\partial η} Q {μ (η + X_{i}^{'} \hat{β}), Y_{i}},

where K(·) is a kernel function and h is a bandwidth. Based on the estimating equation (2) for η, we propose the empirical likelihood ratio statistic for η:

ℓ_{2} (η) = - 2 sup_{p_{1}, \dots, p_{n}} \sum_{i = 1}^{n} log ({np}_{i}),

where p_i, i = 1, ⋯, n, are nonnegative numbers which satisfy

\sum_{i = 1}^{n} p_{i} = 1, \sum_{i = 1}^{n} p_{i} ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i}) = 0 .

A direct calculation implies that

ℓ_{2} (η) = 2 \sum_{i = 1}^{n} log {1 + λ_{2} ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i})},

where λ₂ is determined by

\frac{1}{nh} \sum_{i = 1}^{n} \frac{ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i})}{1 + λ_{2} ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i})} = 0 for η in a neighbourhood of η (t) .

(5)

The asymptotic distribution of the empirical likelihood ratio statistic ℓ₂(η) is given in Theorem 2.2. Its proof is given in the Appendix.

Theorem 2.2

Suppose that nh⁴ → 0 and that the conditions (a)–(e) in the Appendix are satisfied. Then, as n →∞,

ℓ_{2} (η) \overset{d}{\to} χ_{1}^{2} .

From Theorem 2.2, the confidence region for η with coverage probability 1 − α(0 < α < 1) can be constructed by CI_η = {η|ℓ₂(η) ≤ c_α}, where c_α satisfies $P (χ_{1}^{2} \leq c_{α}) = 1 - α$ .

Remark 1

Theorems 2.1 and 2.2 indicate that undersmoothing is still needed as for the normal approximation theory (Carroll et al., 1997). To meet this requirement, we use existing bandwidth selection techniques to obtain the optimal bandwidth, ĥ_opt, An ad hoc bandwidth is generated by ĥ_opt × n^−1/20 log^−1/5 n, which ensures the bandwidth has correct order required in Theorems 2.1 and 2.2.

3 Simulations

To illustrate the numerical performance for the proposed method, we conducted a small simulation experiment in which n = 80, 100, 120. We generated data from a logistic model

logit {Y_{i} = 1 | X_{i}, T_{i}} = X_{i} β + θ (T_{i}),

where X_i is independent uniform (−0.5, 0.5) component and T_i is uniformly distributed on (0, 2). The parameter β is equal to 1, and the nonparametric function is θ(z) = sin{(z − a)/(b − a)π} with $a = \sqrt{3} / 2 - 1.645 / \sqrt{12} and b = \sqrt{3} / 2 + 1.645 / \sqrt{12}$ .

In our nonparametric estimation implementation, to save computational time, we tried the simple bandwidth h = an^−1/4(log n)^−1/5 for a = 0.75, 1, 1.25, 1.5, 2, which satisfy the condition in Theorems 1 and 2. We finally selected bandwidth via h = 1.5n^−1/4(log n)^−1/5. The numerical results are fairly stable against shifting values of the selected bandwidth. We used the quartic kernel, K(u) = 15/16(1 − u²)²I_(|u|≤1). We generated 200 data sets in each configuration. The empirical likelihood-based and normal approximation based confidence intervals for β are reported in Table 1. The lower and upper values are the averages of 200 simulated lower and upper values. The columns “AL” give the average length of the confidence intervals, while the column “CP(%)” gives the corresponding coverage probabilities of the 200 simulated datasets. The pointwise confidence intervals for the nonparametric function θ(t) at the selected four points t = 0.3, 0.8, 1.5 and 1.9 are presented in Table 2. A referee has asked us how the confidence intervals proposed compare to bootstrap confidence intervals, for which we used the naive bootstrap, i.e., resampled (X, Y, T), for 500 times. We provided the results for β in Table 1 and for the nonparametric component θ(t) in Table 2. These results basically coincide with those the results based on the normal approximation, and slightly deviate from those based on the proposed method in this paper. But the bootstrap implementation took a significantly longer amount of time compared to the empirical likelihood method. From the tables we may conclude that the coverage probabilities based on empirical likelihood method are mostly closer to the nominal level than those based on the normal approximation method, while the lengths of the empirical likelihood based intervals are slightly shorter than those based on the normal approximation method. The length of the estimated confidence intervals of β decreases with the increase of sample size.

Table 1.

95% normal approximation (Normal), bootstrap (BC) and empirical likelihood (EL) based confidence intervals for β = 1 along with the averages of the left (LE) and right endpoints (RE) of the confidence intervals, their average lengths (AL), and the associated coverage probabilities (CP) for the simulated data.

	Normal				BC				EL

n	CP(%)	LE	RE	AL	CP(%)	LE	RE	AL	CP(%)	LE	RE	AL

80	93.0	−0.568	2.889	3.457	97.0	−0.547	2.904	3.451	96.0	−0.604	2.823	3.427
100	97.0	−0.563	2.493	3.056	96.5	−0.541	2.415	2.955	95.0	−0.514	2.468	2.982
120	97.0	−0.352	2.384	2.736	97.0	−0.362	2.385	2.747	94.5	−0.295	2.398	2.692

Open in a new tab

Table 2.

95% normal approximation (Normal), bootstrap (BC) and empirical likelihood (EL) based pointwise confidence intervals for nonparametric function θ(t) at the four selected points along with the averages of the left (LE) and right endpoints (RE) of the confidence intervals, their average lengths (AL), and the associated coverage probabilities (CP) for the simulated data.

		Normal				BC				EL

t	n	CP(%)	LE	RE	AL	CP(%)	LE	RE	AL	CP(%)	LE	RE	AL

0.3	80	94.5	−1.091	0.793	1.885	95.0	−1.038	0.829	1.867	94.0	−0.841	0.914	1.756
	100	97.5	−0.995	0.586	1.581	97.0	−0.999	0.587	1.586	96.0	−0.893	0.589	1.482
	120	98.5	−0.844	0.418	1.262	96.5	−0.840	0.428	1.267	96.5	−0.823	0.442	1.265
0.8	80	94.7	−0.057	1.711	1.768	95.5	−0.044	1.671	1.715	95.5	−0.033	1.648	1.681
	100	96.5	0.144	1.613	1.469	97.0	0.133	1.614	1.482	95.5	0.093	1.613	1.519
	120	97.0	0.103	1.448	1.345	96.5	0.100	1.427	1.327	94.5	0.110	1.336	1.225
1.5	80	94.0	−1.064	0.431	1.495	94.5	−1.096	0.404	1.500	96.5	−1.032	0.417	1.449
	100	97.5	−1.070	0.278	1.348	97.0	−1.048	0.283	1.332	93.5	−0.786	0.303	1.090
	120	96.5	−1.037	0.236	1.273	97.0	−1.002	0.265	1.267	96.0	−0.965	0.259	1.223
1.9	80	96.0	−2.030	0.157	2.187	96.5	−1.974	0.172	2.146	96.0	−1.962	0.160	2.123
	100	97.0	−1.957	0.043	2.000	96.5	−1.973	0.030	2.003	95.5	−1.629	0.284	1.913
	120	97.5	−1.963	0.024	1.987	97.0	−1.953	0.032	1.985	95.5	−1.630	0.102	1.732

Open in a new tab

4 Real Data Analysis

In recent years, one of the areas focused upon by AIDS researchers has been the relationship between viral load and CD4+ cell counts (Liang, Wu & Carroll, 2003; Liang, et al., 2004). This relationship is used to investigate the concordance and discordance between virologic and immunologic variables, which may help clinicians more deeply understand AIDS pathogenesis and improve therapy. Although antiretroviral therapy for HIV-1 infected patients has greatly improved in recent years, and administration of drug cocktails consisting of three or more drugs can reduce and maintain the viral load below the detection limit in many patients, it is unlikely that any combination of therapies can eradicate HIV in infected patients because of the existence of long-lived infected cells and sites within the body where drugs may not be effective. With the success of highly active antiretroviral therapy (HAART) against HIV infection, viral load (measured as viral RNA copies/mL) is suppressed and maintained at magnitudes that are below the limit of quantification, and the infection is considered chronic. Clinicians and patients are therefore nowadays more interested in achieving a viral load that is below the detection limit and in monitoring the immunologic system (measured by CD4+ cell counts).

In this section we analyze a dataset from the AIDS study PACTG 345 (Scott et al., 2001). Let Y be the indicator of a undetectable viral load level, let X be the CD4 cell count, and let T be the treatment time. In this study, 33 patients were enrolled as cohort II. Specimens were obtained on days 0, 1, 3, 7, 14, 28, 56, then irregularly through to the day 1155. A total of 559 HID-1 RNA measurements were obtained with 256 of these below the detection limit of 400 copies/mL. Thus, 45% of the viral loads were observed to be suppressed below the detection limit. Figure 1 presents the individual observations of plasma HID RNA concentration (viral load) after initial antiretroviral treatments. A main objective of the treatment is to suppress the viral load below the limit of detection.

Viral load measurements of plasma HID RNA concentration in the PACTG 345 study. The detection limit of 400 copies of HID RNA per mL of plasma is indicated by the horizontal line.

We are interested in the relationship between the binary viral load measurement and CD4+ cell counts. A parsimonious model of this relationship is biologically and clinically important because these variables are good biomarkers for anti-HIV treatment and may be used to evaluate antiretroviral therapies. An obvious model is logistic regression, with X and T having linear effects on the logit scale, because it is easily implemented and interpreted. A concern, however, is whether this model can appropriately capture curvature in the effect of T due to drug resistance or noncompliance. To address this concern, we used the method of Härdle, Mammen & Müller (1998) to check if a logistic model is appropriate, and obtained a p-value less than 10⁻⁴, which reflects that the traditional logistic regression is not flexible enough to fit this data set well. We therefore used a partially logistic model, described in (6), to fit the dataset and use the proposed method to obtain the confidence intervals for parametric and nonparametric components.

logit {E (Y | X, T)} = X β + θ (T),

(6)

where θ(t) is a unknown smooth function. The estimate of β is 0.216, the positive value of which reflects the increased chance of RNA below the detection limit at higher levels of CD4+ counts. The 95%confidence intervals for β based on the normal approximation and the proposed empirical likelihood methods are (−0.202, 0.634) and (0.081, 0.514). These two confidence intervals convey different messages. The former interval indicates that the chance of RNA below the detection limit is not statistically significantly related to CD4 cell count, but the latter interval yields an inverse impression. We prefer to the conclusion based on the empirical likelihood method according to biological meanings and the simulation performance of this method. The pointwise estimates of θ(t) and associated confidence regions based on these two methods are shown in Figure 2, in which the solid line is the estimated pattern of θ(t), the dotted lines and broken lines are the confidence regions based on empirical likelihood and normal approximation methods. The former gives a narrower region than the latter.

Confidence intervals of θ(t) obtained by using the normal approximation and empirical likelihood methods. The solid line is the estimated curve of θ(t), while the dotted and broken lines are the pointwise confidence intervals based on empirical likelihood and normal approximation methods.

5 Discussion

To simply inference for GPLMs, we proposed an empirical likelihood-based approach to constructing confidence regions for β and θ(t). The proposed approach is remarkably simpler than its counterpart based on the asymptotic normality of quasilikelihood estimators (Severini & Staniswalis, 1994) and easily executable. The finite-sample performance of the proposed statistics shows promise. In this article, we used local linear regression when we handled nonparametric function θ(t). There are many different alternatives to the local constant kernel regression in (2), including higher degree local polynomial kernel methods, smoothing splines, and regression splines. The details for these methods need further investigation in our setting. We chose the constant kernel regression because theoretical results can be derived (Severini & Staniswalis, 1994).

Model (1) may be extended to a generalized additive partially linear model in the form of

E (y_{i} | X_{i}, T_{i}) = μ {X_{i}^{'} β + \sum_{k = 1}^{K} θ_{k} (T_{k, i})},

where T_i = (T_1,i, …, T_K,i)′ is a K-dimensional vector. The study of this model is interesting and requires additional efforts, but it is beyond the scope of this paper.

Acknowledgements

The research of Liang and Qin was supported by NIH/NIAID grants. Zhang’s research was partially supported by grants from the National Natural Science Foundation of China. Ruppert’s research was supported by NSF and NIH grants. The authors thank the Editor and two referees for their insightful comments that improved an earlier version of this paper.

Appendix

Conditions

The following assumptions are standard in studies of GPLMs, and we assume these hold throughout the article. Write ρ₁(u) = {dμ(u)/du} V⁻¹{μ(u)}, and q₁(u, y) = {y−μ(u)}ρ₁(u).

The density function f(t) of T is positive and continuous at the point t₀ ∈ 𝒯.
The function μ(u) is twice differentiable in u.
The function θ⁽²⁾(t) is continuous at the point t₀ ∈ 𝒯.
With $R = θ (T) + X' β, E {q_{1}^{2} (R, Y) | T = t}, E {q_{1}^{2} (R, Y) X | T = t}$ , and $E {q_{1}^{2} (R, Y) X X' | T = t}$ are twice differentiable in t.
$E {q_{1}^{2 + δ} (R, Y)} < \infty$ , for some δ > 2.

Proof of Theorem 2.1

Denote AA′ by A^⊗2, $Λ_{β_{0}, i} = θ_{β_{0}} (T_{i}) + X_{i}^{'} β_{0} and Λ_{i} = θ (T_{i}) + X_{i}^{'} β_{0}$ for i = 1, …, n. Ξ_n = max_1≤i≤n ‖ω₁{β₀, θ̂_β₀(T_i), Y_i, X_i, T_i}‖. We first show that

n^{- 1 / 2} \sum_{i = 1}^{n} ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}} \overset{d}{\to} N (0, Г),

(7)

n^{- 1} \sum_{i = 1}^{n} {[ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}}]}^{\otimes 2} = Г + o_{p} (1),

(8)

and

Ξ_{n} = o (n^{1 / 2}), a . s .,

(9)

where Г is a positive definite matrix in form of

σ^{2} E [I_{1} {X_{1} + \frac{\partial θ_{β} (T_{1})}{\partial β} |_{β = β_{0}}}^{\otimes 2} \frac{{μ' (Λ_{1})}^{2}}{V {μ (Λ_{1})}}] .

Recall that Q(μ, y) behaves like the logarithm of a density function for Y, and that θ_β(t) is a least favorable curve and thus proposition 2 of Severini &Wong (1992) holds, which are shown in the proof of proposition 1 of Severini and Staniswalis (1994). Accordingly, applying (2) in Section 6 of Severini & Wong (1992) (here our $\sum_{i = 1}^{n} ω_{1} {β, θ_{β} (T_{i}), Y_{i}, X_{i}, T_{i}}$ corresponds to $\frac{{dL}_{n} (θ, λ_{θ})}{d θ}$ of Severini & Wong (1992)), we obtain

n^{- 1 / 2} \sum_{i = 1}^{n} ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}} = n^{- 1 / 2} \sum_{i = 1}^{n} ω_{1} {β_{0}, θ_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}} + o_{p} (1) .

(10)

Furthermore,

\begin{matrix} \sum_{i = 1}^{n} ω_{1} {β_{0}, θ_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}} & = \sum_{i = 1}^{n} I_{i} {X_{i} + \frac{\partial θ_{β} (T_{i})}{\partial β} |_{β = β_{0}}} \frac{μ' (Λ_{β_{0}, i})}{V {μ (Λ_{β_{0}, i})}} {Y_{i} - μ' (Λ_{β_{0}, i})} \\ = \sum_{i = 1}^{n} I_{i} {X_{i} + \frac{\partial θ_{β} (T_{i})}{\partial β} |_{β = β_{0}}} \frac{μ' (Λ_{i})}{V {μ (Λ_{i})}} {Y_{i} - μ' (Λ_{i})} . \end{matrix}

(11)

(7) follows from (10), (11) and a central limit theorem. The proofs of (8) and (9) are trivial.

From (7), (8) and (9), and the arguments similar to the proof of (2.14) in Owen (1990), we can show that

λ_{1} = O_{p} (n^{- 1 / 2}) .

(12)

Recall (4). It is readily seen by a direct calculation and (12) that

\begin{matrix} λ_{1} = & {(n^{- 1} \sum_{i = 1}^{n} {[ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}}]}^{\otimes 2})}^{- 1} \\ \times [n^{- 1} \sum_{i = 1}^{n} ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}}] + o_{p} (n^{- 1 / 2}) . \end{matrix}

Thus, using Taylor expansion, we have

\begin{matrix} ℓ_{1} (β_{0}) = & n {[\frac{1}{n} \sum_{i = 1}^{n} ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}}]}^{'} \\ \times {(\frac{1}{n} \sum_{i = 1}^{n} {[ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}}]}^{\otimes 2})}^{- 1} \\ \times [\frac{1}{n} \sum_{i = 1}^{n} ω_{1} {β_{0}, {\hat{θ}}_{β_{0}} (T_{i}), Y_{i}, X_{i}, T_{i}}] + o_{p} (1) . \end{matrix}

The proof is complete from (7) and (8).

Proof of Theorem 2.2

Denote by f₀(·) the probability density function of T. Write

H (Y, X, T) = \frac{μ' {θ (T) + X' β_{0}}}{V [μ {θ (T) + X' β_{0}}]} [Y - μ' {θ (T) + X' β_{0}}] .

We first show that

{(nh)}^{- 1 / 2} \sum_{i = 1}^{n} ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i}) \overset{d}{\to} N (0, Г_{0}),

(13)

{(nh)}^{- 1} \sum_{i = 1}^{n} ω_{2}^{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i}) = Г_{0} + o_{p} (1),

(14)

where Г₀ = ∫ K²(u)du · f₀(t)E{H²(Y, X, T)|T = t}.

From Taylor expansion and the fact that

\sqrt{n} (\hat{β} - β_{0}) = O_{p} (1), \frac{1}{nh} \sum_{i = 1}^{n} \frac{\partial}{\partial β} ω_{2} (η, β, Y_{i}, X_{i}, T_{i}) |_{β = β_{0}} = O_{p} (1),

it can been shown that

\begin{matrix} {(nh)}^{- 1 / 2} \sum_{i = 1}^{n} ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i}) = {(nh)}^{- 1 / 2} \sum_{i = 1}^{n} ω_{2} (η, β_{0}, Y_{i}, X_{i}, T_{i}) \\ + \sqrt{nh} (\hat{β} - β_{0})' {(nh)}^{- 1} \sum_{i = 1}^{n} \frac{\partial}{\partial β} ω_{2} (η, β, Y_{i}, X_{i}, T_{i}) |_{β = β_{0}} {1 + o_{p} (1)} \\ = {(nh)}^{- 1 / 2} \sum_{i = 1}^{n} ω_{2} (η, β_{0}, Y_{i}, X_{i}, T_{i}) + o_{p} (1) . \end{matrix}

(15)

Moreover,

\begin{matrix} \sum_{i = 1}^{n} ω_{2} (η, β_{0}, Y_{i}, X_{i}, T_{i}) & = \sum_{i = 1}^{n} K (\frac{t - T_{i}}{h}) \frac{μ' (Λ_{β_{0}, i})}{V {μ (Λ_{β_{0}, i})}} {Y_{i} - μ' (Λ_{β_{0}, i})} \\ = \sum_{i = 1}^{n} K (\frac{t - T_{i}}{h}) \frac{μ' (Λ_{i})}{V {μ (Λ_{i})}} {Y_{i} - μ' (Λ_{i})} . \end{matrix}

(16)

Thus (13) follows from (15), (16) and a central limiting theorem. The proof of (14) is trivial.

Write

Ū_{j} = {(nh)}^{- 1} \sum_{i = 1}^{n} ω_{2}^{j} (η, \hat{β}, Y_{i}, X_{i}, T_{i}), for j = 1, 2 .

From (15), using a central limiting theorem, it can be shown that

Ū_{1} = O_{p} {{(nh)}^{- 1 / 2} + h^{2}} .

Combining with (5), (13) and (14), we have

λ_{2} = {(Ū_{2})}^{- 1} Ū_{1} + O_{p} [{{(nh)}^{- 1 / 2} + h^{2}}^{2}] .

Furthermore, by Taylor expansion, we obtain

\begin{matrix} ℓ_{2} (η) = & 2 λ_{2} \sum_{i = 1}^{n} ω_{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i}) \\ - λ_{2}^{2} \sum_{i = 1}^{n} ω_{2}^{2} (η, \hat{β}, Y_{i}, X_{i}, T_{i}) + O_{p} [{{(nh)}^{- 1 / 2} + h^{2}}^{3}] \\ = & (nh) Ū_{1}^{2} {(Ū_{2})}^{- 1} + O_{p} [{{(nh)}^{- 1 / 2} + h^{2}}^{3}] \end{matrix}

(17)

Theorem 2.2 follows from (13), (14), and (17).

Contributor Information

Hua Liang, University of Rochester.

Yongsong Qin, Guangxi Normal University.

Xinyu Zhang, Chinese Academy of Sciences.

David Ruppert, Cornell University.

References

Carroll RJ, Fan J, Gijbels I, Wand MP. Generalized partially linear single-index models. J. Am. Statist. Assoc. 1997;92:477–489. [Google Scholar]
Chen SX, Qin YS. Empirical likelihood confidence intervals for local linear smoothers. Biometrika. 2000;87:946–953. [Google Scholar]
Engle RF, Granger CWJ, Rice J, Weiss A. Semiparametric estimates of the relation between weather and electricity sales. J. Am. Statist. Assoc. 1986;81:310–320. [Google Scholar]
Hall P, La Scala B. Methodology and algorithms of empirical likelihood. Int. Statist. Rev. 1990;58:109–127. [Google Scholar]
Hartley HO, Rao JNK. A new estimation theory for sample surveys. Biometrika. 1968;55:547–557. [Google Scholar]
Härdle W, Mammen E, Müller M. Testing parametric versus semiparametric modeling in generalized linear models. J. Am. Statist. Assoc. 1998;93:1461–1474. [Google Scholar]
Härdle W, Liang H, Gao J. Partially Linear Models. Heidelberg: Springer Physica-Verlag; 2000. [Google Scholar]
Liang H, Ren HB. Generalized partially linear measurement error models. J. Comp. Graph. Statist. 2005;14:237–250. [Google Scholar]
Liang H, Wu HL, Carroll RJ. The relationship between virologic and immunologic responses in AIDS clinical research using mixed-effect varying-coefficient semiparametric models with measurement error. Biostatistics. 2003;4:297–312. doi: 10.1093/biostatistics/4.2.297. [DOI] [PubMed] [Google Scholar]
Liang H, Wang S, Robins JM, Carroll RJ. Estimation in partially linear models with missing covariates. J. Am. Statist. Assoc. 2004;99:357–367. [Google Scholar]
Lin XH, Carroll RJ. Semiparametric regression for clustered data using generalized estimating equations. J. Am. Statist. Assoc. 2001;96:1045–1056. [Google Scholar]
Owen AB. Empirical likelihood ratio confidence regions. Ann. Statist. 1990;18:90–120. [Google Scholar]
Owen AB. Empirical likelihood. New York: Chapman and Hall; 2001. [Google Scholar]
Qin GS, Jing BY. Censored partial linear models and empirical likelihood. J. Mult. Anal. 2001;78:37–61. [Google Scholar]
Qin J. Empirical likelihood ratio based confidence intervals for mixture proportions. Ann. Statist. 1999;27:1368–1384. [Google Scholar]
Qin J, Lawless J. Empirical likelihood and general estimating equations. Ann. Statist. 1994;22:300–325. [Google Scholar]
Scott ZA, Chadwick EG, Gibson LL, et al. Infrequent detection of HIV-1-specific, but not cytomegalovirus-specific, CD8+T cell responses in young HIV-1-infected infants. J. Immunology. 2001;167:7134–7140. doi: 10.4049/jimmunol.167.12.7134. [DOI] [PubMed] [Google Scholar]
Severini TA, Staniswalis JG. Quasilikelihood estimation in semiparametric models. J. Am. Statist. Assoc. 1994;89:501–511. [Google Scholar]
Severini TA, Wong WH. Profile likelihood and conditionally parametric models. Ann. Statist. 1992;20:1768–1802. [Google Scholar]
Shi J, Lau TS. Empirical likelihood for partially linear models. J. Mult. Anal. 1999;72:132–148. [Google Scholar]
Speckman P. Kernel smoothing in partial linear models. J. R. Statist. Soc. B. 1988;50:413–436. [Google Scholar]
Stone CJ. Optimal rates of convergence for nonparametric estimators. Ann. Statist. 1980;8:1348–1360. [Google Scholar]
Thomas DR, Grunkemeier GL. Confidence interval estimation of survival probabilities for censored data. J. Am. Statist. Assoc. 1975;70:865–871. [Google Scholar]
Wang QH, Li G. Empirical likelihood semiparametric regression analysis under random censorship. J. Mult. Anal. 2002;83:469–486. [Google Scholar]
Zhu LX, Xue LG. Empirical likelihood confidence regions in a partially linear single-index model. J. R. Statist. Soc. B. 2006;68:549–570. [Google Scholar]

[R1] Carroll RJ, Fan J, Gijbels I, Wand MP. Generalized partially linear single-index models. J. Am. Statist. Assoc. 1997;92:477–489. [Google Scholar]

[R2] Chen SX, Qin YS. Empirical likelihood confidence intervals for local linear smoothers. Biometrika. 2000;87:946–953. [Google Scholar]

[R3] Engle RF, Granger CWJ, Rice J, Weiss A. Semiparametric estimates of the relation between weather and electricity sales. J. Am. Statist. Assoc. 1986;81:310–320. [Google Scholar]

[R4] Hall P, La Scala B. Methodology and algorithms of empirical likelihood. Int. Statist. Rev. 1990;58:109–127. [Google Scholar]

[R5] Hartley HO, Rao JNK. A new estimation theory for sample surveys. Biometrika. 1968;55:547–557. [Google Scholar]

[R6] Härdle W, Mammen E, Müller M. Testing parametric versus semiparametric modeling in generalized linear models. J. Am. Statist. Assoc. 1998;93:1461–1474. [Google Scholar]

[R7] Härdle W, Liang H, Gao J. Partially Linear Models. Heidelberg: Springer Physica-Verlag; 2000. [Google Scholar]

[R8] Liang H, Ren HB. Generalized partially linear measurement error models. J. Comp. Graph. Statist. 2005;14:237–250. [Google Scholar]

[R9] Liang H, Wu HL, Carroll RJ. The relationship between virologic and immunologic responses in AIDS clinical research using mixed-effect varying-coefficient semiparametric models with measurement error. Biostatistics. 2003;4:297–312. doi: 10.1093/biostatistics/4.2.297. [DOI] [PubMed] [Google Scholar]

[R10] Liang H, Wang S, Robins JM, Carroll RJ. Estimation in partially linear models with missing covariates. J. Am. Statist. Assoc. 2004;99:357–367. [Google Scholar]

[R11] Lin XH, Carroll RJ. Semiparametric regression for clustered data using generalized estimating equations. J. Am. Statist. Assoc. 2001;96:1045–1056. [Google Scholar]

[R12] Owen AB. Empirical likelihood ratio confidence regions. Ann. Statist. 1990;18:90–120. [Google Scholar]

[R13] Owen AB. Empirical likelihood. New York: Chapman and Hall; 2001. [Google Scholar]

[R14] Qin GS, Jing BY. Censored partial linear models and empirical likelihood. J. Mult. Anal. 2001;78:37–61. [Google Scholar]

[R15] Qin J. Empirical likelihood ratio based confidence intervals for mixture proportions. Ann. Statist. 1999;27:1368–1384. [Google Scholar]

[R16] Qin J, Lawless J. Empirical likelihood and general estimating equations. Ann. Statist. 1994;22:300–325. [Google Scholar]

[R17] Scott ZA, Chadwick EG, Gibson LL, et al. Infrequent detection of HIV-1-specific, but not cytomegalovirus-specific, CD8+T cell responses in young HIV-1-infected infants. J. Immunology. 2001;167:7134–7140. doi: 10.4049/jimmunol.167.12.7134. [DOI] [PubMed] [Google Scholar]

[R18] Severini TA, Staniswalis JG. Quasilikelihood estimation in semiparametric models. J. Am. Statist. Assoc. 1994;89:501–511. [Google Scholar]

[R19] Severini TA, Wong WH. Profile likelihood and conditionally parametric models. Ann. Statist. 1992;20:1768–1802. [Google Scholar]

[R20] Shi J, Lau TS. Empirical likelihood for partially linear models. J. Mult. Anal. 1999;72:132–148. [Google Scholar]

[R21] Speckman P. Kernel smoothing in partial linear models. J. R. Statist. Soc. B. 1988;50:413–436. [Google Scholar]

[R22] Stone CJ. Optimal rates of convergence for nonparametric estimators. Ann. Statist. 1980;8:1348–1360. [Google Scholar]

[R23] Thomas DR, Grunkemeier GL. Confidence interval estimation of survival probabilities for censored data. J. Am. Statist. Assoc. 1975;70:865–871. [Google Scholar]

[R24] Wang QH, Li G. Empirical likelihood semiparametric regression analysis under random censorship. J. Mult. Anal. 2002;83:469–486. [Google Scholar]

[R25] Zhu LX, Xue LG. Empirical likelihood confidence regions in a partially linear single-index model. J. R. Statist. Soc. B. 2006;68:549–570. [Google Scholar]

PERMALINK

Empirical-Likelihood-Based Inferences for Generalized Partially Linear Models

Hua Liang

Yongsong Qin

Xinyu Zhang

David Ruppert

Abstract

1 Introduction

2 Empirical Likelihood Methods

2.1 Confidence region for β

Theorem 2.1

2.2 Pointwise confidence region for θ(t)

Theorem 2.2

Remark 1

3 Simulations

Table 1.

Table 2.

4 Real Data Analysis

Figure 1.

Figure 2.

5 Discussion

Acknowledgements

Appendix

Conditions

Proof of Theorem 2.1

Proof of Theorem 2.2

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Empirical-Likelihood-Based Inferences for Generalized Partially Linear Models

Hua Liang

Yongsong Qin

Xinyu Zhang

David Ruppert

Abstract

1 Introduction

2 Empirical Likelihood Methods

2.1 Confidence region for β

Theorem 2.1

2.2 Pointwise confidence region for θ(t)

Theorem 2.2

Remark 1

3 Simulations

Table 1.

Table 2.

4 Real Data Analysis

Figure 1.

Figure 2.

5 Discussion

Acknowledgements

Appendix

Conditions

Proof of Theorem 2.1

Proof of Theorem 2.2

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases