Summary
The primary goal of a randomized clinical trial is to make comparisons among two or more treatments. For example, in a two-arm trial with continuous response, the focus may be on the difference in treatment means; with more than two treatments, the comparison may be based on pairwise differences. With binary outcomes, pairwise odds-ratios or log-odds ratios may be used. In general, comparisons may be based on meaningful parameters in a relevant statistical model. Standard analyses for estimation and testing in this context typically are based on the data collected on response and treatment assignment only. In many trials, auxiliary baseline covariate information may also be available, and it is of interest to exploit these data to improve the efficiency of inferences. Taking a semiparametric theory perspective, we propose a broadly-applicable approach to adjustment for auxiliary covariates to achieve more efficient estimators and tests for treatment parameters in the analysis of randomized clinical trials. Simulations and applications demonstrate the performance of the methods.
Keywords: Covariate adjustment, Hypothesis test, k-arm trial, Kruskal-Wallis test, Log-odds ratio, Longitudinal data, Semiparametric theory
1. Introduction
In randomized clinical trials, the primary objective is to compare two or more treatments on the basis of an outcome of interest. Along with treatment assignment and outcome, baseline auxiliary covariates may be recorded on each subject, including demographic and physiological characteristics, prior medical history, and baseline measures of the outcome. For example, the international Platelet Glycoprotein IIb/IIIa in Unstable Angina: Receptor Suppression Using Integrilin Therapy (PURSUIT) study (Harrington, 1998) in subjects with acute coronary syndromes compared the antiplatelet agent Integrilin plus heparin and aspirin to heparin and aspirin alone (control) on the basis of the binary endpoint death or myocardial infarction at 30 days. Similarly, AIDS Clinical Trials Group (ACTG) 175 (Hammer et al., 1996) randomized HIV-infected subjects to four antiretroviral regimens with equal probabilities, and an objective was to compare measures of immunological status under the three newer treatments to those under standard zidovudine (ZDV) monotherapy. In both studies, in addition to the endpoint, substantial auxiliary baseline information was collected.
Ordinarily, the primary analysis is based only on the data on outcome and treatment assignment. However, if some of the auxiliary covariates are associated with outcome, precision may be improved by “adjusting” for these relationships (e.g., Pocock et al., 2002), and there is an extensive literature on such covariate adjustment (e.g., Senn, 1989; Hauck, Anderson, and Marcus, 1998; Koch et al., 1998; Tangen and Koch, 1999; Lesaffre and Senn, 2003; Grouin, Day, and Lewis, 2004). Much of this work focuses on inference on the difference of two means and/or on adjustment via a regression model for mean outcome as a function of treatment assignment and covariates. In the special case of the difference of two treatment means, Tsiatis et al. (2007) proposed an adjustment method that follows from application of the theory of semiparametrics (e.g., van der Laan and Robins, 2003; Tsiatis, 2006) by Leon, Tsiatis, and Davidian (2003) to the related problem of “pretest-posttest” analysis, from which the form of the “optimal” (most precise) estimator for the treatment mean difference, adjusting for covariates, emerges readily. This approach separates estimation of the treatment difference from the adjustment, which may lessen concerns over bias that could result under regression-based adjustment because of the ability to inspect treatment effect estimates obtained simultaneously with different combinations of covariates and “to focus on the covariate model that best accentuates the estimate” (Pocock et al., 2002, p. 2925).
In this paper, we expand on this idea by developing a broad framework for covariate adjustment in settings with two or more treatments and general outcome summary measures (e.g., log-odds ratios) by appealing to the theory of semiparametrics. The resulting methods seek to use the available data as efficiently as possible while making as few assumptions as possible. In Section 2, we present a semiparametric model framework involving parameters relevant to making general treatment comparisons. Using the theory of semiparametrics, we derive the class of estimating functions for these parameters in Section 3 and in Section 4 demonstrate how these results lead to practical estimators. This development suggests a general approach to adjusting any test statistic for making treatment comparisons to increase efficiency, described in Section 5. Performance of the proposed methods is evaluated in simulation studies in Section 6 and is shown in representative applications in Section 7.
2. Semiparametric Model Framework
Denote the data from a k-arm randomized trial, k ≥ 2, as (Yi, Xi, Zi), i = 1, . . . , n, independent and identically distributed (iid) across i, where, for subject i, Yi is outcome; Xi is the vector of all available auxiliary baseline covariates; and Zi = g indicates assignment to treatment group g with known randomization probabilities P(Z = g) = πg, g = 1, . . . , k. Randomization ensures that Z⊥X, where “⊥” means “independent of.”
Let β denote a vector of parameters involved in making treatment comparisons under a specified statistical model. For example, in a two-arm trial with continuous real-valued response Y, a natural basis for comparison is the difference in treatment means, E(Y | Z = 2) − E(Y | Z = 1), represented directly as β2 in the model
E(Y | Z) = β1 + β2I(Z = 2). (1)
In a three-arm trial, we may consider the model
E(Y | Z) = β1I(Z = 1) + β2I(Z = 2) + β3I(Z = 3). (2)
In contrast to (1), we have parameterized (2) equivalently in terms of the three treatment means rather than differences relative to a reference treatment, and treatment comparisons may be based on pairwise contrasts among elements of β. For binary outcome Y = 0 or 1, where Y = 1 indicates the event of interest, we may consider for a k-arm trial
logit{P(Y = 1 | Z)} = β1 + β2I(Z = 2) + · · · + βkI(Z = k), (3)
where logit(p) = log{p/(1 − p)}; β = (β1, . . . , βk)T ; and the log-odds ratio for treatment g relative to treatment 1 is βg, g = 2, . . . , k.
If Yi is a vector of continuous longitudinal responses Yij, j = 1, . . . , mi, at times ti1, . . . , timi, response-time profiles in a two-arm trial might be described by the simple linear mixed model
Yij = α + b0i + {β1 + β2I(Zi = 2) + b1i}tij + eij, (b0i, b1i)T ∼ N(0, D), eij ∼ N(0, σe²), (4)
where β = (β1, β2)T, and β2 is the difference in mean slope between the two treatments; extension to k > 2 treatment groups is straightforward. Alternatively, instead of considering the fully parametric model (4), one might make no assumption beyond
E(Yij | Zi) = α + {β1 + β2I(Zi = 2)}tij, (5)
leaving remaining features of the distribution of Y given Z unspecified. For binary Yij, the marginal model logit{E(Yij | Zi)} = α + {β1 + β2I(Zi = 2)}tij might be adopted.
In all of (1)-(5), β (p × 1) is a parameter involved in making treatment comparisons in a model describing aspects of the conditional distribution of Y given Z and is of central interest. In addition to β, models like (4) and (5) depend on a vector of parameters γ, say; e.g., in (4), γ comprises α and the covariance parameters of the random effects and errors, and γ = α in (5). In general, we define θ = (βT , γT)T (r × 1), recognizing that models like (1)-(3) do not involve an additional γ, so that θ = β.
For these and similar models, consistent, asymptotically normal estimators for θ, and hence for β and functions of its elements reflecting treatment comparisons, based on the data (Yi, Zi), i = 1, . . . , n, only and thus “unadjusted” for covariates, are readily available. Unadjusted, large-sample tests of null hypotheses of “no treatment effects” are also well-established. The difference of sample means is the obvious such estimator for β2 in (1) and is efficient (i.e., has smallest asymptotic variance) among estimators depending only on these data, and a test of H0 : β2 = 0 may be based on the usual t statistic. Similarly, the maximum likelihood estimator (MLE) for β2 in (4) and associated tests may be obtained from standard mixed model software. For k > 2, pairwise and global comparisons are possible; e.g., in (2), the sample means are efficient estimators for each element of β, and a test of H0 : β1 = β2 = β3 may be based on the corresponding two-degree-of-freedom Wald statistic.
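To fix ideas, the sketch below (Python; our own code, not part of the original analyses, with illustrative function and variable names) carries out the unadjusted analysis for the three-arm model (2): per-arm sample means and the two-degree-of-freedom Wald test of H0 : β1 = β2 = β3.

```python
import numpy as np
from scipy import stats

def unadjusted_wald_3arm(y, z):
    """Sample means per arm and the 2-df Wald test of H0: beta1 = beta2 = beta3 in model (2).

    y: (n,) continuous outcomes; z: (n,) treatment labels in {1, 2, 3}.
    """
    groups = [y[z == g] for g in (1, 2, 3)]
    beta_hat = np.array([g.mean() for g in groups])                  # unadjusted estimates of beta
    var_beta = np.array([g.var(ddof=1) / len(g) for g in groups])    # variances of the sample means
    C = np.array([[1.0, -1.0, 0.0],                                  # contrasts defining H0
                  [1.0, 0.0, -1.0]])
    Cb = C @ beta_hat
    wald = Cb @ np.linalg.solve(C @ np.diag(var_beta) @ C.T, Cb)     # 2-df Wald statistic
    return beta_hat, wald, stats.chi2.sf(wald, df=2)
```

Any standard software produces the same quantities; the sketch is included only so that the "augmented" versions developed later can be compared against it directly.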
As noted in Section 1, the standard approach in practice for covariate adjustment, thus using all of (Yi, Xi, Zi), i = 1, . . . , n, is based on regression models for mean outcome as a function of X and Z. E.g., for k = 2 and continuous Y, a popular such estimator for β2 in (1) is the ordinary least squares (OLS) estimator for φ in the analysis of covariance model
E(Y | X, Z) = ξ0 + φI(Z = 2) + ξTX; (6)
extension to k > 2 treatments is immediate. See Tsiatis et al. (2007, Section 3) for discussion of related estimators for β2 in the particular case of (1). If (6) is the correct model for E(Y | X, Z), then φ and β2 in (1) coincide, and, moreover, the OLS estimator for φ in (6) is a consistent estimator for β2 that is generally more precise than the usual unadjusted estimator, even if (6) is not correct (e.g., Yang and Tsiatis, 2001). For binary Y, covariate adjustment is often carried out based on the logistic regression model
logit{P(Y = 1 | X, Z)} = ξ0 + φI(Z = 2) + ξTX, (7)
where the MLE of φ is taken as the adjusted estimator for the log-odds ratio β2 in (3) with k = 2. In (7), φ is the log-odds ratio conditional on X, assuming this quantity is constant for all X. This assumption may or may not be correct; even if it were, φ is generally different from β2 in (3). Tsiatis et al. (2007, Section 2) discuss this point in more detail.
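To see numerically why the conditional log-odds ratio φ in (7) and the unconditional β2 in (3) differ, the following sketch (our own; it assumes (7) holds exactly with a single covariate X ∼ N(0, 1) and purely illustrative coefficient values) computes the unconditional log-odds ratio implied by a conditional model by averaging over X.

```python
import numpy as np

# Suppose (7) holds exactly with one covariate: logit P(Y=1 | X, Z) = a0 + phi*I(Z=2) + a1*X,
# with X ~ N(0, 1) independent of Z.  The values of a0, phi, a1 below are illustrative only.
a0, phi, a1 = -0.5, 0.7, 1.5
expit = lambda u: 1.0 / (1.0 + np.exp(-u))
logit = lambda p: np.log(p / (1.0 - p))

x = np.random.default_rng(1).normal(size=2_000_000)       # Monte Carlo integration over X
p1 = expit(a0 + a1 * x).mean()                             # P(Y = 1 | Z = 1), averaged over X
p2 = expit(a0 + phi + a1 * x).mean()                       # P(Y = 1 | Z = 2), averaged over X
beta2 = logit(p2) - logit(p1)                              # unconditional log-odds ratio, as in (3)
print(f"conditional phi = {phi:.3f}, unconditional beta2 = {beta2:.3f}")
```

With these values the unconditional log-odds ratio is noticeably attenuated toward zero relative to φ, consistent with the distinction discussed above and in Robinson et al. (1998).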
To derive alternative methods, we begin by describing our assumed semiparametric statistical model for the full data (Y, X, Z), which is a characterization of the class of all joint densities for (Y, X, Z) that could have generated the data. We seek methods that perform well over as large a class as possible; thus, we assume that densities in this class involve no restrictions beyond the facts that Z⊥X, guaranteed by randomization; that πg = P(Z = g), g = 1, . . . , k, are known; and that β is defined through a specification on the conditional distribution of Y given Z as in (1)-(5). We thus first describe the conditional density of Y given Z. Under (3) and (4), this density is completely specified in terms of θ, while (5) describes only one aspect of the conditional distribution, the mean, in terms of θ, and (1) and (2) make no restrictions on the conditional distribution of Y given Z. To represent all such situations, we assume that this conditional density may be written as pY|Z (y|z; θ, η), where η is an additional nuisance parameter possibly needed to describe the density fully. For (3) and (4), η is null, as the density is already entirely characterized. For (1), (2), and (5), η is infinite-dimensional, as these specifications do not impose any additional constraints on what the density might be, so any density consistent with these models is possible.
Under the above conditions, we assume that all joint densities for (Y, X, Z) may be written, in obvious notation, as pY,X,Z(y, x, z; θ, η, ψ, π) = pY,X|Z(y, x | z; θ, η, ψ)pZ(z; π), where pZ(z; π) is completely specified, as π = (π1, . . . , πk)T is known, and satisfy the constraints
∫ pY,X|Z(y, x | z; θ, η, ψ) dx = pY|Z(y | z; θ, η), (8)
∫ pY,X|Z(y, x | z; θ, η, ψ) dy = pX(x). (9)
The joint density involves an additional, possibly infinite-dimensional nuisance parameter ψ, needed to include in the class all joint densities satisfying (8) and (9). Here, pX(x) is any arbitrary marginal density for the covariates, and (9) follows because Z⊥X. In Web Appendix A, we demonstrate that a rich class of joint distributions for (Y, X, Z) may be identified in which X is correlated with Y and Z⊥X [condition (9)] and which also satisfy condition (8). Because the joint density involves both finite (parametric) and infinite-dimensional components, it represents a semiparametric statistical model (see Tsiatis, 2006, Section 1.2).
3. Estimating Functions for Treatment Parameters Using Auxiliary Covariates
We now derive consistent, asymptotically normal estimators for θ, and hence β, for a given pY|Z(y|z; θ, η), using the iid data (Yi, Xi, Zi), i = 1, . . . , n, under the semiparametric framework satisfying (8) and (9). To do this, we identify the class of all estimating functions for θ based on (Y, X, Z) leading to all estimators for θ that are consistent and asymptotically normal under this framework. An estimating function is a function of a single observation and parameters used to construct estimating equations yielding an estimator for the parameters.
When the data on auxiliary covariates X are not taken into account, estimating functions for θ based only on (Y, Z) in models like those in (1)-(5) leading to consistent, asymptotically normal estimators are well known. For example, the OLS estimator for θ = β in the linear regression model (1) may be obtained by considering the estimating function
m(Y, Z; θ) = {1, I(Z = 2)}T{Y − β1 − β2I(Z = 2)} (10)
and solving the estimating equation ∑i m(Yi, Zi; θ) = 0 in θ. The OLS estimator for β2 so obtained equals the usual difference in sample means. Likewise, with θ = β = (β1, . . . , βk)T and expit(u) = exp(u)/{1+exp(u)}, the usual logistic regression MLE for β in (3) is obtained by solving ∑i m(Yi, Zi; θ) = 0, where the estimating function m(Y, Z; θ) is equal to
{1, I(Z = 2), . . . , I(Z = k)}T[Y − expit{β1 + β2I(Z = 2) + · · · + βkI(Z = k)}]. (11)
The estimating functions (10) and (11) are unbiased; i.e., have mean zero assuming that (1) and (3), respectively, are correct. Under regularity conditions, unbiased estimating functions lead to consistent, asymptotically normal estimators (e.g., Carroll et al., 2006, Section A.6).
Our key result is that, given a semiparametric model pY,X,Z(y, x, z; θ, η, ψ, π) based on a specific pY|Z(y|z; θ, η) and satisfying (8) and (9), and given a fixed unbiased estimating function m(Y, Z; θ) (r × 1) for θ, such as (10) or (11), members of the class of all unbiased estimating functions for θ, and hence β, using all of (Y, X, Z) may be written as
m(Y, Z; θ) − ∑g {I(Z = g) − πg}ag(X), (12)
where ag(X), g = 1, . . . , k, are arbitrary r-dimensional functions of X. Because Z⊥X, the second term in (12) has mean zero; thus, (12) is an unbiased estimating function based on (Y, X, Z). When ag(X) ≡ 0, g = 1, . . . , k, (12) reduces to the original estimating function, which does not take account of auxiliary covariates, and solving ∑i m(Yi, Zi; θ) = 0 leads to the unadjusted estimator, θ̂ say, to which it corresponds. Otherwise, (12) “augments” m(Y, Z; θ) by the second term. With appropriate choice of the ag(X), the augmentation term exploits correlations between Y and elements of X to yield an estimator for θ, solving the corresponding augmented estimating equation, that is relatively more efficient than θ̂. The proof of (12) is based on applying principles of semiparametric theory and is given in Web Appendix B.
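Spelling out the mean-zero property of the augmentation term (our notation, matching (12)):

```latex
E\Bigl[\sum_{g=1}^{k}\{I(Z=g)-\pi_g\}\,a_g(X)\Bigr]
  \;=\; \sum_{g=1}^{k} E\{I(Z=g)-\pi_g\}\,E\{a_g(X)\}
  \;=\; \sum_{g=1}^{k} (\pi_g-\pi_g)\,E\{a_g(X)\} \;=\; 0 ,
```

where the first equality uses Z⊥X, so that I(Z = g) and ag(X) are independent, and the second uses E{I(Z = g)} = πg; this holds for any choice of ag(X) with finite mean.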
Full advantage of this result may be taken by identifying the optimal estimating function within class (12), that for which the elements of the corresponding estimator for θ have smallest asymptotic variance. This estimator for β thus yields the greatest efficiency gain over the unadjusted estimator among all estimators with estimating functions in class (12) and hence more efficient inferences on treatment comparisons. By standard arguments for M-estimators (e.g., Stefanski and Boos, 2002), an estimator for θ corresponding to an estimating function of form (12) is consistent and asymptotically normal with asymptotic covariance matrix
A−1Γ(A−1)T, (13)
where θ0 is the true value of θ, A = E{∂m(Y, Z; θ0)/∂θT}, and Γ = E([m(Y, Z; θ0) − ∑g {I(Z = g) − πg}ag(X)][m(Y, Z; θ0) − ∑g {I(Z = g) − πg}ag(X)]T). Thus, to find the optimal estimating function, one need only consider Γ in (13) and determine ag(X), g = 1, . . . , k, leading to Γopt, say, such that Γ − Γopt is nonnegative definite. For given m(Y, Z; θ), it is shown in Web Appendix C that Γopt takes ag(X) = E{m(Y, Z; θ) | X, Z = g}, g = 1, . . . , k. Thus, in general, the optimal estimator in class (12) is the solution to
∑i [m(Yi, Zi; θ) − ∑g {I(Zi = g) − πg}E{m(Y, Z; θ) | Xi, Z = g}] = 0. (14)
In the case of β2 in (1), (14) yields the optimal estimator in (16) of Tsiatis et al. (2007).
4. Implementation of Improved Estimators
The optimal estimator in class (12) solving (14) depends on the conditional expectations E{m(Y, Z; θ) | Xi, Z = g}, g = 1, . . . , k, the forms of which are of course unknown. Thus, to obtain practical estimators, we first consider a general adaptive strategy based on postulating regression models for these conditional expectations, which involves the following steps:
(1) Solve the original estimating equation ∑i m(Yi, Zi; θ) = 0 to obtain the unadjusted estimator θ̂. For each subject i, obtain the values m(Yi, g; θ̂) for each g = 1, . . . , k.
(2) Note that the m(Yi, g; θ̂) are (r × 1). For each treatment group g = 1, . . . , k separately, based on the r-variate “data” m(Yi, g; θ̂) for i in group g, develop a parametric regression model qg(X, ζg) = {qg1(X, ζg1), . . . , qgr(X, ζgr)}T for E{m(Y, Z; θ) | X, Z = g}; i.e., such that qgu(X, ζgu), u = 1, . . . , r, are regression models for each component of m(Yi, g; θ̂). We recommend an approach analogous to that in Leon et al. (2003, Section 4), where the qgu(X, ζgu) are represented as linear combinations qgu(X, ζgu) = ζguTcgu(X), u = 1, . . . , r, and cgu(X) are vectors of basis functions in X that may include polynomial terms in elements of X, interaction terms, splines, and so on. This offers considerable latitude for achieving representations that can approximate the true conditional expectations, and hence predictions derived from them, well. We also recommend obtaining estimates ζ̂gu via OLS separately for each u = 1, . . . , r, as, by a generalization of the argument in Leon et al. (2003, Section 4), this will yield the most efficient estimator for θ in step (3) below when the qg(X, ζg) are of this form. For each subject i = 1, . . . , n, form predicted values qg(Xi, ζ̂g) for each g = 1, . . . , k.

(3) Using the predicted values from step (2), form the augmented estimating equation
∑i [m(Yi, Zi; θ) − ∑g {I(Zi = g) − πg}qg(Xi, ζ̂g)] = 0, (15)
and solve for θ to obtain the final, adjusted estimator θ̃. We recommend substituting π̂g = ng/n, where ng is the number of subjects in group g, for πg in (15).
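As a concrete illustration, the sketch below (Python; our own code with hypothetical function names, not the authors' software) carries out steps (1)-(3) for the two-arm log-odds-ratio model (3) with k = 2, taking the cg(X) to contain an intercept and linear terms in X and fitting the qg by OLS.

```python
import numpy as np

expit = lambda u: 1.0 / (1.0 + np.exp(-u))

def m_fun(y, z, theta):
    """Estimating function (11) for k = 2: {1, I(Z=2)}^T {Y - expit(beta1 + beta2 I(Z=2))}."""
    d = np.column_stack([np.ones(len(y)), (z == 2).astype(float)])
    return d * (y - expit(d @ theta))[:, None]            # (n, 2): row i is m(Y_i, Z_i; theta)

def solve_score(y, z, offset, theta0, n_iter=50):
    """Solve sum_i m(Y_i, Z_i; theta) - offset = 0 by Newton's method (offset a fixed 2-vector)."""
    d = np.column_stack([np.ones(len(y)), (z == 2).astype(float)])
    theta = theta0.astype(float).copy()
    for _ in range(n_iter):
        p = expit(d @ theta)
        score = d.T @ (y - p) - offset
        info = (d * (p * (1 - p))[:, None]).T @ d          # minus the Jacobian of the score
        theta = theta + np.linalg.solve(info, score)
    return theta

def adjusted_log_odds_ratio(y, z, X):
    """Three-step adjusted estimator of theta = (beta1, beta2) in (3), k = 2; c_g(X) = (1, X)."""
    n = len(y)
    # Step (1): unadjusted estimator (logistic regression of Y on I(Z=2)) and values m(Y_i, g; theta_hat).
    theta_hat = solve_score(y, z, offset=np.zeros(2), theta0=np.zeros(2))
    m_g = {g: m_fun(y, np.full(n, g), theta_hat) for g in (1, 2)}
    # Step (2): within each arm, OLS of each component of m(Y_i, g; theta_hat) on (1, X); predict for all i.
    c = np.column_stack([np.ones(n), X])
    q_pred = {}
    for g in (1, 2):
        in_g = (z == g)
        coef, *_ = np.linalg.lstsq(c[in_g], m_g[g][in_g], rcond=None)
        q_pred[g] = c @ coef
    # Step (3): augmented estimating equation (15) with pi_g estimated by n_g / n.
    pi_hat = {g: np.mean(z == g) for g in (1, 2)}
    offset = sum((((z == g).astype(float) - pi_hat[g])[:, None] * q_pred[g]).sum(axis=0)
                 for g in (1, 2))
    return solve_score(y, z, offset=offset, theta0=theta_hat)
```

For instance, given outcomes y, treatment labels z in {1, 2}, and a covariate matrix X, adjusted_log_odds_ratio(y, z, X)[1] returns the adjusted estimate of β2.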
The foregoing three-step algorithm applies to very general m(Y, Z; θ). Often,
m(Y, Z; θ) = A(Z, θ){Y − f(Z, θ)} (16)
for some A(Z, θ) with r rows and some f(Z, θ), as in (10) and (11). Here, a simpler, “direct” implementation strategy is possible. Note that E{m(Y, Z; θ) | X, Z = g} = A(g, θ){E(Y | X, Z = g) − f(g, θ)}; thus, for each g = 1, . . . , k, based on the data (Yi, Xi) for i in group g, we may postulate parametric regression models for E(Y | X, Z = g) of the form ζgTcg(X), for a vector of basis functions cg(X), and obtain OLS estimators ζ̂g. Then form for each i = 1, . . . , n the corresponding predicted values for E{m(Y, Z; θ) | X, Z = g} as qg(Xi, ζ̂g; θ) = A(g, θ){ζ̂gTcg(Xi) − f(g, θ)}, where we emphasize that, here, the qg(Xi, ζ̂g; θ), g = 1, . . . , k, are functions of θ. Substituting the qg(Xi, ζ̂g; θ) in (15), solve the resulting equation in θ directly to obtain θ̃.
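For the k-arm means model (2), m(Y, Z; θ) is of form (16) with A(Z, θ) = {I(Z = 1), . . . , I(Z = k)}T and f(Z, θ) = ∑g βgI(Z = g), and the direct strategy reduces to a closed-form expression. A sketch follows (our code and naming; working models linear in X fitted by OLS within each arm).

```python
import numpy as np

def adjusted_arm_means(y, z, X, k):
    """Direct-strategy adjusted estimator of beta_g = E(Y | Z = g), g = 1,...,k, as in model (2).

    Within arm g, E(Y | X, Z = g) is modeled as linear in X and fit by OLS; the fitted values
    enter (15), which solves in closed form because m(Y, Z; theta) is linear in theta here.
    """
    n = len(y)
    c = np.column_stack([np.ones(n), X])                 # basis functions c_g(X): intercept + linear terms
    beta_adj = np.empty(k)
    for g in range(1, k + 1):
        ind = (z == g).astype(float)
        pi_hat = ind.mean()                              # n_g / n
        coef, *_ = np.linalg.lstsq(c[z == g], y[z == g], rcond=None)
        h_g = c @ coef                                   # predicted E(Y | X_i, Z = g) for every subject
        # g-th component of (15): sum_i [I(Z_i=g)(Y_i - b_g) - {I(Z_i=g) - pi_hat}{h_g(X_i) - b_g}] = 0
        beta_adj[g - 1] = (ind * y - (ind - pi_hat) * h_g).sum() / (n * pi_hat)
    return beta_adj
```

The closed form arises because the βg enter (15) linearly; solving the gth component gives the expression coded above, namely the arm-g sample mean minus a mean-zero covariate correction.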
Several observations follow from semiparametric theory. Although we advocate representing E{m(Y, Z; θ) | X, Z = g} or E(Y | X, Z = g), g = 1, . . . , k, by parametric models, consistency and asymptotic normality of θ̃ hold regardless of whether or not these models are correct specifications of the true E{m(Y, Z; θ) | X, Z = g} or E(Y | X, Z = g). Thus, the proposed methods are not parametric, as their validity does not depend on parametric assumptions. The theory also shows that, in either implementation strategy, if the qg are specified and fitted via OLS as described above, then, by an argument similar to that in Leon et al. (2003, Section 4), θ̃ is guaranteed to be relatively more efficient than the corresponding unadjusted estimator θ̂. Moreover, under these conditions, although ζg and πg, g = 1, . . . , k, are estimated, θ̃ will have the same properties asymptotically as the estimator that could be obtained if the limits in probability of the ζ̂g were known and substituted in (14) and if the true πg were substituted, regardless of whether the qg are correct or not. In the direct strategy, if Y is discrete, it is natural to instead posit the models for E(Y | X, Z = g) as generalized linear models, e.g., logistic regression for binary Y, and fit these using iteratively reweighted least squares (IRWLS). Although the previous statements do not necessarily hold exactly in that case, in our experience, they hold approximately. Regardless of whether or not the qg are represented by parametric linear models and fitted by OLS, if the representation chosen contains the true form of E{m(Y, Z; θ) | X, Z = g} or E(Y | X, Z = g), respectively, then θ̃ is asymptotically equivalent to the optimal estimator solving (14). In general, the closer the predictions from these models are to the true functions of X, the closer θ̃ will come to achieving the precision of the optimal estimator. Because β is contained in θ, all of these results apply equally to β̃.
Because θ̃ in either implementation strategy solves (15), it is an M-estimator, and the sandwich method (e.g., Stefanski and Boos, 2002) may be used to obtain a sampling covariance matrix for θ̃, from which standard errors for functions of β̃ may be derived. This matrix is of form (13), with expectations replaced by sample averages evaluated at the estimates and ag(X) replaced by the predicted values based on the fitted qg, g = 1, . . . , k.
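In code, the sandwich computation can be written generically; the sketch below (ours, not tied to any particular software) takes a user-supplied function returning the per-subject values of the augmented estimating function in (15) and forms (13) with sample averages, using a numerical derivative for the “bread” matrix.

```python
import numpy as np

def sandwich_cov(psi, theta_hat, eps=1e-6):
    """Sandwich covariance for an M-estimator solving sum_i psi_i(theta) = 0.

    psi(theta) must return an (n, r) array whose i-th row is the augmented estimating function
    for subject i (the bracketed term in (15)) at theta.  Returns an (r, r) covariance estimate
    for theta_tilde: form (13) with expectations replaced by sample averages, divided by n.
    """
    psi0 = psi(theta_hat)
    n, r = psi0.shape
    gamma = psi0.T @ psi0 / n                            # sample analogue of Gamma in (13)
    A = np.empty((r, r))
    for j in range(r):                                    # numerical Jacobian of the average estimating function
        step = np.zeros(r)
        step[j] = eps
        A[:, j] = (psi(theta_hat + step).mean(axis=0) - psi(theta_hat - step).mean(axis=0)) / (2 * eps)
    A_inv = np.linalg.inv(A)
    return A_inv @ gamma @ A_inv.T / n
```

For estimators available in closed form, analytic derivatives exist and would normally be preferred; the numerical version is shown only for generality.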
The regression models qg in either implementation, which are the mechanism by which covariate adjustment is incorporated, are determined separately by treatment group and are developed without reference to the adjusted estimator β̃. Thus, estimation of β could be carried out by a generalization of the “principled” strategy proposed by Tsiatis et al. (2007, Section 4) in the context of a two-arm trial and inference on β2 in (1), where development of the models qg would be undertaken by analysts different from those responsible for obtaining β̃, to lessen concerns over possible bias, as discussed in Section 1.
5. Improved Hypothesis Tests
The principles in Section 3 may be used to construct more powerful tests of null hypotheses of no treatment effects by exploiting auxiliary covariates. The key is that, under a general null hypothesis H0 involving s degrees of freedom, a usual test statistic Tn, say, based on the data (Yi, Zi), i = 1, . . . , n, only is asymptotically equivalent to a quadratic form; i.e.,
Tn = {n−1/2 ∑i t(Yi, Zi)}T Σ−1{n−1/2 ∑i t(Yi, Zi)} + op(1), (17)
where t(Y, Z) is an (s × 1) function of (Y, Z), discussed further below, such that E0{t(Y, Z)} = 0, with E0 denoting expectation under H0; and Σ = E0{t(Y, Z)t(Y, Z)T}.
When the notion of “treatment effects” may be formulated in terms of β in a model like (1)-(5), the null hypothesis is typically of the form H0 : Cβ = 0, where C is an (s × p) contrast matrix. E.g., in (2), C is (2 × 3) with rows (1, −1, 0) and (1, 0, −1). Inference on H0 is often based on a Wald test of the form Tn = n(Cβ̂)T(CΣ̂βCT)−1Cβ̂, where β̂ is the unadjusted estimator corresponding to an estimating function m(Y, Z; θ), and Σ̂β is an estimator for the covariance matrix of n1/2(β̂ − β). In this case, (17) holds with t(Y, Z) = CBm(Y, Z; θ0), where B is the (p × r) matrix equal to the first p rows of [E{−∂m(Y, Z; θ0)/∂θT}]−1, and θ0 is the value of θ under H0.
In other situations, the null hypothesis may not refer to a parameter like β in a given model. For example, the null hypothesis for a k-arm trial may be H0 : S1(u) = · · · = Sk(u) = S(u), where Sg(u) = 1 − P(Y ≤ u|Z = g), and S(u) = 1 − P(Y ≤ u). A popular test in this setting is the Kruskal-Wallis test, which reduces to the Wilcoxon rank sum test for k = 2. Letting ng denote the number of subjects in group g and R̄g the average of the overall ranks for subjects in group g, the test statistic is Tn = [12/{n(n + 1)}] ∑g ng{R̄g − (n + 1)/2}². By results in van der Vaart (1998, Section 12.2), it may be shown that Tn is asymptotically equivalent to a statistic of the form (17), where t(Y, Z) is {(k − 1) × 1} with gth element {I(Z = g) − πg}{S(Y) − 1/2}, g = 1, . . . , k − 1.
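For reference, the Kruskal-Wallis statistic in this form (without the usual tie correction) can be computed as follows (our code; scipy's rankdata assigns mid-ranks to ties).

```python
import numpy as np
from scipy import stats

def kruskal_wallis(y, z, k):
    """Unadjusted Kruskal-Wallis statistic Tn = 12/{n(n+1)} sum_g n_g {Rbar_g - (n+1)/2}^2."""
    n = len(y)
    ranks = stats.rankdata(y)                # overall ranks (mid-ranks assigned to ties)
    t_n = 0.0
    for g in range(1, k + 1):
        r_g = ranks[z == g]                  # ranks of subjects in arm g
        t_n += len(r_g) * (r_g.mean() - (n + 1) / 2) ** 2
    return 12.0 / (n * (n + 1)) * t_n
```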
To motivate the proposed more powerful tests, we consider the behavior of Tn in (17) under a sequence of local alternatives H1n converging to H0 at rate n−1/2. Typically, under suitable regularity conditions, n−1/2 ∑i t(Yi, Zi) in (17) converges under the sequence H1n to a normal random vector with mean τ and covariance matrix Σ for some τ, so that Tn has asymptotically a noncentral χ2 distribution with s degrees of freedom and noncentrality parameter τTΣ−1τ. To obtain a more powerful test, then, we wish to construct a test statistic with noncentrality parameter as large as possible. Based on the developments in Section 3, we consider test statistics of the form
Tn* = {n−1/2 ∑i t*(Yi, Zi, Xi)}T Σ*−1{n−1/2 ∑i t*(Yi, Zi, Xi)}, (18)
t*(Y, Z, X) = t(Y, Z) − ∑g {I(Z = g) − πg}ag(X), (19)
where Σ* = E0{t*(Y, Z, X)t*(Y, Z, X)T}. The second term in (19) has mean zero by randomization under H0 or any alternative. Accordingly, it follows under the sequence of alternatives H1n that n−1/2 ∑i t*(Yi, Zi, Xi) converges in distribution to a normal random vector with mean τ and covariance matrix Σ*, so that Tn* in (18) has an asymptotic noncentral χ2 distribution with s degrees of freedom and noncentrality parameter τTΣ*−1τ.
These results suggest that, to maximize the noncentrality parameter and thus power, we wish to find the particular Σ*, Σ*opt, say, that makes Σ* − Σ*opt non-negative definite for all Σ*, which is equivalent to making Σ*opt−1 − Σ*−1 non-negative definite for all Σ*. This corresponds to finding the optimal choice of ag(X), g = 1, . . . , k, in (19). By an argument similar to that leading to (14), the optimal choice is ag(X) = E{t(Y, Z) | X, Z = g}, g = 1, . . . , k.
These developments suggest an implementation strategy analogous to that in Section 4:
(1) For the test statistic Tn, determine t(Y, Z) and substitute sample quantities for any unknown parameters to obtain t̂(Y, Z). E.g., for H0 : Cβ = 0 in model (2), with C (2 × 3) as above, m(Y, Z; θ) = {I(Z = 1), I(Z = 2), I(Z = 3)}T{Y − β1I(Z = 1) − β2I(Z = 2) − β3I(Z = 3)} and θ = (β1, β2, β3)T. Under H0, θ0 = (μ, μ, μ)T, say, so that m(Y, Z; θ0) = {I(Z = 1), I(Z = 2), I(Z = 3)}T(Y − μ), and
t(Y, Z) = C{I(Z = 1)/π1, I(Z = 2)/π2, I(Z = 3)/π3}T(Y − μ). (20)
As μ is unknown, t̂(Y, Z) is obtained by substituting the overall sample mean Ȳ for μ. We recommend substituting π̂g = ng/n for πg, g = 1, 2, 3, in (20), as above. Similarly, for the Kruskal-Wallis test, t̂(Y, Z) is obtained by substituting the empirical survival function Ŝ(Y) for S(Y) and π̂g for πg.
(2) For each treatment group g = 1, . . . , k separately, treating the t̂(Yi, Zi) for subjects i in group g as s-variate “data,” develop a regression model qg(X, ζg) for E{t(Y, Z) | X, Z = g} by representing each component qgu(X, ζgu), u = 1, . . . , s, by the parametric “basis function” approach in Section 4; estimate each ζgu by OLS to obtain ζ̂gu; and form predicted values qg(Xi, ζ̂g), i = 1, . . . , n.
(3) Using the predicted values from step (2), form
t̂*i = t̂(Yi, Zi) − ∑g {I(Zi = g) − π̂g}qg(Xi, ζ̂g), i = 1, . . . , n, (21)
and substitute these values into (18). Estimate Σ* in (18) by n−1 ∑i t̂*i t̂*iT.
Compare the resulting test statistic to the χ2 distribution with s degrees of freedom. As in Section 4, there is no effect asymptotically of estimating ζg and πg, g = 1, . . . , k, so that the adjusted test will achieve the same power asymptotically as if the limits in probability of the ζ̂g and the true πg were substituted. Notably, the test based on the adjusted statistic will be asymptotically more powerful than the corresponding unadjusted test against any sequence of alternatives.
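Putting steps (1)-(3) together for the Kruskal-Wallis case gives the following sketch (our code and naming; the qg are linear in X and fit by OLS, and π̂g = ng/n).

```python
import numpy as np
from scipy import stats

def adjusted_kruskal_wallis(y, z, X, k):
    """Covariate-adjusted Kruskal-Wallis test following steps (1)-(3) of Section 5.

    y: (n,) outcomes; z: (n,) labels in {1,...,k}; X: (n, q) baseline covariates.
    The q_g are linear models in X fit by OLS within each arm; pi_g is estimated by n_g/n.
    Returns the adjusted statistic, referred to a chi-squared(k - 1) distribution.
    """
    n = len(y)
    s = k - 1
    # Step 1: plug-in t_hat(Y, Z): gth element {I(Z=g) - pi_hat_g}{S_hat(Y) - 1/2}, g = 1,...,k-1.
    S_hat = 1.0 - stats.rankdata(y) / n                   # empirical survival function at each Y_i
    pi_hat = np.array([np.mean(z == g) for g in range(1, k + 1)])
    t_hat = np.column_stack([((z == g).astype(float) - pi_hat[g - 1]) * (S_hat - 0.5)
                             for g in range(1, s + 1)])
    # Step 2: arm-specific OLS models q_g for E{t(Y, Z) | X, Z = g}, predicted for all subjects.
    c = np.column_stack([np.ones(n), X])
    t_star = t_hat.copy()
    for g in range(1, k + 1):
        in_g = (z == g)
        coef, *_ = np.linalg.lstsq(c[in_g], t_hat[in_g], rcond=None)
        q_g = c @ coef                                    # (n, s) predictions q_g(X_i)
        # Step 3 (augmentation, eq. (21)): subtract {I(Z=g) - pi_hat_g} q_g(X_i).
        t_star -= (in_g.astype(float) - pi_hat[g - 1])[:, None] * q_g
    mean_t = t_star.sum(axis=0) / np.sqrt(n)              # n^{-1/2} sum_i t*_i
    sigma_star = t_star.T @ t_star / n                    # estimate of Sigma* in (18)
    return mean_t @ np.linalg.solve(sigma_star, mean_t)
```

The returned value is referred to a χ2 distribution with k − 1 degrees of freedom; setting the augmentation predictions to zero recovers a statistic asymptotically equivalent to the unadjusted Tn.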
The approach of Tangen and Koch (1999) to modifying the Wilcoxon test for two treatments is in a similar spirit to this general approach.
6. Simulation Studies
6.1 Estimation
We report results of several simulations, each based on 5000 Monte Carlo data sets. Tsiatis et al. (2007, Section 6) carried out extensive simulations in the particular case of (1); thus, we focus here on estimation of quantities other than differences of treatment means.
In the first set of simulations, we considered k = 2, a binary response Y, and
logit{P(Y = 1 | Z)} = β1 + β2I(Z = 2), (22)
so that β2 is the log-odds ratio for treatment 2 relative to treatment 1, the parameter of interest; and θ = β = (β1, β2)T. For each scenario, we generated Z taking values 1 and 2 with P(Z = 1) = P(Z = 2) = 0.5 and covariates X = (X1, . . . , X8)T such that X1, X3, X8 ∼ N(0, 1); X4 and X6 were Bernoulli with P(X4 = 1) = 0.3 and P(X6 = 1) = 0.5; and X2 = 0.2X1 + 0.98U1, X5 = 0.1X1 + 0.2X3 + 0.97U2, and X7 = 0.1X3 + 0.99U3, where Ul ∼ N(0, 1), l = 1, 2, 3. We then generated Y as Bernoulli according to P(Y = 1 | X, Z = g) = expit(α0g + αgTX), g = 1, 2, with α0g and αg chosen to yield mild, moderate, and strong association between Y and X within each treatment, as follows. Using the coefficient of determination R2 to measure the strength of association, R2 = (0.18, 0.16) for treatments (1,2) in the “mild” scenario, with (α01, α02) = (0.25, −0.8), α1 = (0.8, 0.5, 0, 0, 0, 0, 0, 0)T , and α2 = (0.3, 0.7, 0.3, 0.8, 0, 0, 0, 0)T ; R2 = (0.32, 0.33) in the “moderate” scenario, with (α01, α02) = (0.38, −0.8), α1 = (1.2, 1.0, 0, 0, 0, 0, 0, 0)T , and α2 = (0.5, 1.3, 0.5, 1.5, 0, 0, 0, 0)T ; and R2 = (0.43, 0.41) in the “strong” scenario, with (α01, α02) = (0.8, −0.8), α1 = (1.5, 1.8, 0, 0, 0, 0, 0, 0)T and α2 = (1.0, 1.3, 0.8, 2.5, 0, 0, 0, 0)T . Thus, in all cases, X1, . . . , X4 are covariates “important” for adjustment while X5, . . . , X8 are “unimportant.” For each data set, n = 600, and we fitted (22) by IRWLS to (Yi, Zi), i = 1, . . . , n, to obtain the unadjusted estimate of β. We also estimated β by the proposed methods using the direct implementation strategy, where the models for E(Y | X, Z = g) for each g = 1, 2 in the augmentation term were developed six ways:
Aug. 1 linear models with cg(X) containing an intercept and the “true” covariates, fit by OLS
Aug. 2 linear models with cg(X) containing an intercept and all of X1, . . . , X8, fit by OLS
Aug. 3 logistic regression models in the “true” covariates, fit by IRWLS
Aug. 4 logistic regression models in all of X1, . . . , X8, fit by IRWLS
Aug. 5 linear models determined by OLS with forward selection
Aug. 6 logistic regression models determined by IRWLS with forward selection
where “true” means that cg(X) contained only those Xj for which the corresponding element of αg was not zero (i.e., using the “true important covariates” for each g); and in Aug. 5 and 6 forward selection from linear terms in X1, . . . , X8 for linear or logistic regression was used to determine each model, with entry criterion 0.05. Aug. 3, 4, and 6 demonstrate performance when nonlinear models and methods other than OLS are used. We also estimated β2 by estimating φ in (7) via IRWLS two ways: Usual 1, where only the “important” covariates X1, . . . , X4 were included in the model; and Usual 2, where the subset of X1, . . . , X8 to include was identified via forward selection with entry criterion 0.05.
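For concreteness, one data set from the “mild” scenario could be generated as in the sketch below (our code; the coefficient values are those quoted above, and taking X1, X3, X8 and the Ul to be standard normal is our assumption, consistent with the stated construction). The unadjusted and augmented estimates can then be obtained with the functions sketched in Section 4, e.g., adjusted_log_odds_ratio(*simulate_binary_trial()).

```python
import numpy as np

def simulate_binary_trial(n=600, seed=0):
    """One data set from the 'mild association' scenario of Section 6.1 (k = 2, binary Y)."""
    rng = np.random.default_rng(seed)
    expit = lambda u: 1.0 / (1.0 + np.exp(-u))
    z = rng.integers(1, 3, size=n)                        # P(Z = 1) = P(Z = 2) = 0.5
    x1, x3, x8 = rng.normal(size=(3, n))                  # assumed standard normal
    u1, u2, u3 = rng.normal(size=(3, n))
    x2 = 0.2 * x1 + 0.98 * u1
    x5 = 0.1 * x1 + 0.2 * x3 + 0.97 * u2
    x7 = 0.1 * x3 + 0.99 * u3
    x4 = rng.binomial(1, 0.3, size=n)
    x6 = rng.binomial(1, 0.5, size=n)
    X = np.column_stack([x1, x2, x3, x4, x5, x6, x7, x8])
    alpha0 = {1: 0.25, 2: -0.8}                           # "mild" scenario intercepts (from the text)
    alpha = {1: np.array([0.8, 0.5, 0, 0, 0, 0, 0, 0]),
             2: np.array([0.3, 0.7, 0.3, 0.8, 0, 0, 0, 0])}
    p = np.where(z == 1, expit(alpha0[1] + X @ alpha[1]), expit(alpha0[2] + X @ alpha[2]))
    y = rng.binomial(1, p)
    return y, z, X
```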
Table 1 shows modest to considerable gains in efficiency for the proposed estimators, depending on the strength of the association. The estimators are unbiased, and associated confidence intervals achieve the nominal level. In contrast, the usual adjustment based on (7) leads to biased estimation of β2, considerable efficiency loss, and unreliable intervals. This is a consequence of the fact that β2 is an unconditional measure of treatment effect while φ is defined conditional on X; this distinction does not matter when the model for Y is linear but is important when it is nonlinear, as is (7) (see, e.g., Robinson et al., 1998).
Table 1. Simulation results for estimation of the log-odds ratio β2 in (22), based on 5000 Monte Carlo (MC) data sets with n = 600. True, true value of β2; Ave. SE, average of estimated standard errors; Cov. Prob., MC coverage of nominal 0.95 confidence intervals; Rel. Eff., MC mean squared error of the unadjusted estimator divided by that of the indicated estimator.
Method | True | MC Bias | MC SD | Ave. SE | Cov. Prob | Rel. Eff. |
---|---|---|---|---|---|---|
Mild Association | ||||||
Unadjusted | −0.494 | 0.002 | 0.168 | 0.166 | 0.948 | 1.00 |
Aug. 1 | −0.494 | −0.001 | 0.156 | 0.153 | 0.948 | 1.16 |
Aug. 2 | −0.494 | 0.000 | 0.156 | 0.153 | 0.944 | 1.15 |
Aug. 3 | −0.494 | 0.000 | 0.156 | 0.153 | 0.946 | 1.16 |
Aug. 4 | −0.494 | 0.000 | 0.156 | 0.152 | 0.943 | 1.15 |
Aug. 5 | −0.494 | −0.001 | 0.156 | 0.153 | 0.945 | 1.16 |
Aug. 6 | −0.494 | 0.000 | 0.156 | 0.153 | 0.946 | 1.16 |
Usual 1 | −0.494 | −0.091 | 0.185 | 0.182 | 0.922 | 0.66 |
Usual 2 | −0.494 | −0.090 | 0.185 | 0.182 | 0.922 | 0.66 |
Moderate Association | ||||||
Unadjusted | −0.490 | 0.001 | 0.165 | 0.165 | 0.948 | 1.00 |
Aug. 1 | −0.490 | −0.002 | 0.140 | 0.139 | 0.950 | 1.39 |
Aug. 2 | −0.490 | −0.002 | 0.141 | 0.139 | 0.949 | 1.38 |
Aug. 3 | −0.490 | −0.001 | 0.139 | 0.138 | 0.948 | 1.41 |
Aug. 4 | −0.490 | −0.001 | 0.140 | 0.137 | 0.945 | 1.40 |
Aug. 5 | −0.490 | −0.002 | 0.140 | 0.139 | 0.949 | 1.39 |
Aug. 6 | −0.490 | −0.001 | 0.140 | 0.138 | 0.946 | 1.40 |
Usual 1 | −0.490 | −0.218 | 0.203 | 0.201 | 0.813 | 0.31 |
Usual 2 | −0.490 | −0.219 | 0.204 | 0.201 | 0.813 | 0.31 |
Strong Association | ||||||
Unadjusted | −0.460 | 0.004 | 0.164 | 0.165 | 0.954 | 1.00 |
Aug. 1 | −0.460 | 0.000 | 0.132 | 0.131 | 0.952 | 1.55 |
Aug. 2 | −0.460 | 0.000 | 0.132 | 0.131 | 0.950 | 1.54 |
Aug. 3 | −0.460 | 0.001 | 0.129 | 0.128 | 0.948 | 1.61 |
Aug. 4 | −0.460 | 0.001 | 0.130 | 0.127 | 0.945 | 1.60 |
Aug. 5 | −0.460 | 0.000 | 0.132 | 0.131 | 0.951 | 1.55 |
Aug. 6 | −0.460 | 0.001 | 0.129 | 0.127 | 0.947 | 1.61 |
Usual 1 | −0.460 | −0.321 | 0.223 | 0.220 | 0.695 | 0.18 |
Usual 2 | −0.460 | −0.322 | 0.224 | 0.220 | 0.695 | 0.17 |
In the second set of simulations, we again took k = 2 and focused on β2, the difference in treatment slopes in the linear mixed model (4). In each scenario, we generated for each i = 1, . . . , n = 200 Zi with P(Z = 1) = P(Z = 2) = 0.5; X1i, X2i, X3i as above; and subject-specific intercept β0i = 0.5 + 0.2X1i + 0.5X2i + b0i and slope β1i = α0g + α1gX1i² + α2gX2i + α3gX3i + b1i for subjects with Zi = g, where (α01, α02) = (1.0, 1.3) and (b0i, b1i)T was bivariate normal with mean zero and covariance matrix D, with D11 = 1, D12 = 0.2, and D22 = 0.4, so that corr(b0i, b1i) = 0.5. We generated mi = 9, 10, 11 with equal probabilities; took tij = 2(j − 1) for j = 1, . . . , mi; and generated Yij = β0i + β1itij + eij, j = 1, . . . , mi, where the eij were independent mean-zero normal errors. Writing αg = (α1g, α2g, α3g)T, we took α1 = (0.2, 0.2, 0)T and α2 = (0.2, 0, 0.2)T, yielding R2 values between subject-specific slopes and covariates of (0.11, 0.14) in the two groups, for “mild” association; α1 = (0.13, 0.1, 0)T and α2 = (0.13, 0, 0.15)T , R2 = (0.24, 0.24), for “moderate” association; and α1 = (0.28, 0.25, 0)T and α2 = (0.28, 0, 0.25)T , R2 = (0.36, 0.36), for “strong” association. For each data set, we obtained the unadjusted estimate for θ by fitting (4) using SAS proc mixed (SAS Institute, 2006). For (4), m(Y, Z; θ) has components of form (16) for α and β and more complicated components quadratic in Y for D and σe². For simplicity, because the estimators for (α, β) and (D, σe²) are uncorrelated, we fixed D and σe² at the unadjusted analysis estimates in the components of m(Y, Z; θ) for (α, β), as asymptotically this will not impact precision of the estimators for (α, β), and used the direct implementation strategy based on the components for (α, β) only. We considered three variants on the proposed methods, all with each element of qg(X, ζg) fitted by OLS: Aug. 1, taking cg(X) = (1, X1², X2, X3)T, corresponding to the form of the true relationship; Aug. 2, with cg(X) = (1, X1, X2, X3)T , so not exploiting the quadratic relationship in X1; and Aug. 3, with cg(X) = (1, X1, X1², X2, X3)T, including an unneeded linear effect of X1. Writing now Xi = (X1i, X2i, X3i)T, we also estimated β2 by the estimate of φ from fitting via proc mixed a linear mixed model of the form (4) augmented with linear main effects of Xi in both the intercept and the slope, with φ the coefficient of I(Zi = 2)tij, denoted as Usual; such a model, with linear covariate effects only, might be prespecified in a trial protocol (e.g., Grouin et al., 2004). Table 2 shows that the proposed methods lead to relatively more efficient estimators when quadratic terms in X1 are included in the qg(X, ζg).
Table 2. Simulation results for estimation of the treatment slope difference β2 in (4), based on 5000 Monte Carlo (MC) data sets with n = 200; entries are defined as in Table 1.
Method | True | MC Bias | MC SD | Ave. SE | Cov. Prob | Rel. Eff. |
---|---|---|---|---|---|---|
Mild Association | ||||||
Unadjusted | 0.300 | 0.000 | 0.100 | 0.099 | 0.951 | 1.00 |
Aug. 1 | 0.300 | −0.001 | 0.095 | 0.094 | 0.951 | 1.10 |
Aug. 2 | 0.300 | −0.001 | 0.100 | 0.097 | 0.945 | 1.00 |
Aug. 3 | 0.300 | −0.001 | 0.096 | 0.094 | 0.950 | 1.08 |
Usual | 0.300 | −0.001 | 0.100 | 0.097 | 0.944 | 1.00 |
Moderate Association | ||||||
Unadjusted | 0.300 | 0.000 | 0.107 | 0.106 | 0.949 | 1.00 |
Aug. 1 | 0.300 | −0.001 | 0.097 | 0.095 | 0.951 | 1.22 |
Aug. 2 | 0.300 | 0.000 | 0.106 | 0.103 | 0.945 | 1.02 |
Aug. 3 | 0.300 | −0.001 | 0.097 | 0.095 | 0.952 | 1.21 |
Usual | 0.300 | −0.001 | 0.105 | 0.101 | 0.946 | 1.04 |
Strong Association | ||||||
Unadjusted | 0.300 | 0.000 | 0.116 | 0.115 | 0.950 | 1.00 |
Aug. 1 | 0.300 | −0.001 | 0.098 | 0.096 | 0.951 | 1.41 |
Aug. 2 | 0.300 | 0.000 | 0.114 | 0.111 | 0.943 | 1.03 |
Aug. 3 | 0.300 | −0.001 | 0.098 | 0.096 | 0.951 | 1.39 |
Usual | 0.300 | −0.001 | 0.113 | 0.109 | 0.944 | 1.06 |
6.2 Testing
We carried out simulations based on 10,000 Monte Carlo data sets involving k = 3 and the Kruskal-Wallis test. For each data set, we generated for each of n = 200 or 400 subjects Z with P(Z = g) = 1/3, g = 1, 2, 3, and (Y, X) with joint distribution of (Y, X) given Z bivariate normal with mean {β1I(Z = 1) + β2I(Z = 2), 0}T and covariance matrix with vech (1, ρ, 1)T (i.e., unit variances and correlation ρ), where ρ = 0.25, 0.50, 0.75 corresponds to mild, moderate, and strong association between covariate and response. Under the null hypothesis, we set β1 = β2 = 0; simulations under the alternative involved β1 = 0.25, β2 = 0.4. For each data set, we calculated the unadjusted Kruskal-Wallis test statistic Tn and the proposed adjusted statistic (18) using the strategy in Section 5, with each component of the s = 2-dimensional models qg(X, ζg) in (21) represented as a linear function of X. Each statistic was compared to the 0.95 quantile of the χ2 distribution with 2 degrees of freedom. Table 3 shows that the proposed procedure yields greater power than the unadjusted test while achieving the nominal level, where the extent of improvement depends on the strength of the association between Y and X, as expected.
Table 3. Monte Carlo size (Null) and power (Alternative) of the unadjusted Kruskal-Wallis test Tn and the proposed adjusted test at nominal level 0.05, based on 10,000 Monte Carlo data sets.

ρ | n | Tn, Null | Adjusted, Null | Tn, Alternative | Adjusted, Alternative |
---|---|---|---|---|---|
0.25 | 200 | 0.05 | 0.05 | 0.51 | 0.54 |
0.25 | 400 | 0.05 | 0.05 | 0.83 | 0.85 |
0.50 | 200 | 0.05 | 0.05 | 0.51 | 0.64 |
0.50 | 400 | 0.05 | 0.05 | 0.83 | 0.92 |
0.75 | 200 | 0.05 | 0.05 | 0.51 | 0.85 |
0.75 | 400 | 0.05 | 0.05 | 0.83 | 0.99 |
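A usage sketch tying this simulation design to the functions sketched in Section 5 (our code; one data set from the ρ = 0.50, n = 400 alternative scenario, with kruskal_wallis and adjusted_kruskal_wallis as defined there):

```python
import numpy as np

rng = np.random.default_rng(7)
n, rho = 400, 0.50
beta = [0.25, 0.4, 0.0]                                   # alternative from the text; third arm mean 0
z = rng.integers(1, 4, size=n)                            # P(Z = g) = 1/3, g = 1, 2, 3
x = rng.normal(size=n)
mu = np.select([z == 1, z == 2, z == 3], beta)            # E(Y | Z)
y = mu + rho * x + np.sqrt(1 - rho ** 2) * rng.normal(size=n)   # corr(Y, X | Z) = rho

print(kruskal_wallis(y, z, k=3))                          # unadjusted statistic
print(adjusted_kruskal_wallis(y, z, x[:, None], k=3))     # covariate-adjusted statistic
```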
7. Applications
7.1 PURSUIT Clinical Trial
We consider data from 5,710 patients in the PURSUIT trial introduced in Section 1 and focus on the log-odds ratio for Integrilin relative to control. The 35 baseline auxiliary covariates are listed in Web Appendix D.
The unadjusted estimate of the log-odds ratio β2 based on (22) is −0.174 with standard error 0.073. To calculate the augmented estimator based on (22), we used the direct implementation strategy and took linear models for E(Y | X, Z = g), g = 1, 2, with cg(X) including main effects of all 35 covariates, and fitted the models by OLS. The resulting adjusted estimate has standard error 0.071. For these data, the relative efficiency of the proposed estimator to the unadjusted, computed as the square of the ratio of the estimated standard errors, is 1.06. For binary response, substantial increases in efficiency via covariate adjustment are not likely; thus, this admittedly modest improvement is encouraging.
7.2 AIDS Clinical Trials Group Protocol 175
We consider data on 2139 subjects from ACTG 175, discussed in Section 1, where the k = 4 treatments were zidovudine (ZDV) monotherapy (g = 1), ZDV+didanosine (ddI, g = 2), ZDV+zalcitabine (g = 3), and ddI monotherapy (g = 4). The continuous response is CD4 count (cells/mm3, Y ) at 20±5 weeks, and we focus on the four treatment means, with the same 12 auxiliary covariates considered by Tsiatis et al. (2007, Section 5).
We consider the extension of model (2) to k = 4 treatments, so that θ = β = (β1, . . . , β4)T, βg = E(Y | Z = g), g = 1, . . . , 4. The standard unadjusted estimator for β is the vector of sample averages; these are (336.14, 403.17, 372.04, 374.32)T for g = (1, 2, 3, 4), with standard errors (5.68, 6.84, 5.90, 6.22)T . Using the direct implementation strategy with each element of qg(X, ζg) represented using cg(X) containing all linear terms in the 12 covariates, the proposed methods yield adjusted estimates (333.85, 403.83, 370.43, 376.45)T , with standard errors obtained via the sandwich method as (4.61, 5.93, 4.89, 5.11)T . This is of course one realization of data; however, it is noteworthy that the standard errors for the proposed estimator correspond to relative efficiencies of 1.51, 1.33, 1.46 and 1.48, respectively.
We also carried out the standard unadjusted three-degree-of-freedom Wald test for H0 : β1 = β2 = β3 = β4 and Kruskal-Wallis test for H0 : S1(u) = · · · = S4(u) = S(u), as well as their adjusted counterparts using cgu(X) containing linear and quadratic terms in the continuous components of X and linear terms in the binary elements. The unadjusted and adjusted Wald statistics are 59.40 and 109.58, respectively; the unadjusted and adjusted Kruskal-Wallis statistics are 49.04 and 100.53; and all are to be compared to χ2 critical values with 3 degrees of freedom. Again, although the evidence against the null hypotheses is overwhelming even without adjustment, the proposed test statistics are considerably larger.
See Web Appendix D for further results for these data.
8. Discussion
We have proposed a general approach to using auxiliary baseline covariates to improve the precision of estimators and tests for general measures of treatment effect and general null hypotheses in the analysis of randomized clinical trials by using semiparametric theory.
We identify the optimal estimating function involving covariates within the class of such estimating functions based on a given m(Y, Z; θ). For differences of treatment means or measures of treatment effect for binary outcomes, this estimating function in fact leads to the efficient estimator for the treatment effect. In more complicated models, e.g., repeated measures models, we do not identify the optimal estimating function among all possible. Our experience in other problems suggests that gains over the methods here would be modest.
The use of model selection techniques, such as forward selection in our simulations, to determine covariates to include in the augmentation term models should have no effect asymptotically on the properties of the estimators for θ. However, such effects may be evident in smaller samples, requiring a “correction” to account for failure of the asymptotic theory to represent faithfully the uncertainty due to model selection. Investigation of how approaches to inference after model selection (e.g., Hjort and Claeskens, 2003; Shen, Huang and Ye, 2004) may be adapted to this setting would be a fruitful area for future research.
Acknowledgements
This work was supported by NIH grants R37 AI031789, R01 CA051962, and R01 CA085848.
Supplementary Material
References
- Carroll RJ, Ruppert D, Stefanski LA, Crainiceanu CM. Measurement Error in Nonlinear Models: A Modern Perspective, Second Edition. Chapman and Hall/CRC; Boca Raton: 2006.
- Grouin JM, Day S, Lewis J. Adjustment for baseline covariates: An introductory note. Statistics in Medicine. 2004;23:697–699. doi: 10.1002/sim.1646.
- Hammer SM, Katzenstein DA, Hughes MD, Gundaker H, Schooley RT, Haubrich RH, Henry WK, Lederman MM, Phair JP, Niu M, Hirsch MS, Merigan TC, for the AIDS Clinical Trials Group Study 175 Study Team. A trial comparing nucleoside monotherapy with combination therapy in HIV-infected adults with CD4 cell counts from 200 to 500 per cubic millimeter. New England Journal of Medicine. 1996;335:1081–1089. doi: 10.1056/NEJM199610103351501.
- Harrington RA, for the PURSUIT Investigators. Inhibition of platelet glycoprotein IIb/IIIa with eptifibatide in patients with acute coronary syndromes without persistent ST-segment elevation. New England Journal of Medicine. 1998;339:436–443. doi: 10.1056/NEJM199808133390704.
- Hauck WW, Anderson S, Marcus SM. Should we adjust for covariates in nonlinear regression analyses of randomized trials? Controlled Clinical Trials. 1998;19:249–256. doi: 10.1016/s0197-2456(97)00147-5.
- Hjort NL, Claeskens G. Frequentist model average estimators. Journal of the American Statistical Association. 2003;98:879–899.
- Koch GG, Tangen CM, Jung JW, Amara IA. Issues for covariance analysis of dichotomous and ordered categorical data from randomized clinical trials and non-parametric strategies for addressing them. Statistics in Medicine. 1998;17:1863–1892. doi: 10.1002/(sici)1097-0258(19980815/30)17:15/16<1863::aid-sim989>3.0.co;2-m.
- Leon S, Tsiatis AA, Davidian M. Semiparametric efficient estimation of treatment effect in a pretest-posttest study. Biometrics. 2003;59:1046–1055. doi: 10.1111/j.0006-341x.2003.00120.x.
- Lesaffre E, Senn S. A note on non-parametric ANCOVA for covariate adjustment in randomized clinical trials. Statistics in Medicine. 2003;22:3586–3596. doi: 10.1002/sim.1583.
- Pocock SJ, Assmann SE, Enos LE, Kasten LE. Subgroup analysis, covariate adjustment, and baseline comparisons in clinical trial reporting: Current practice and problems. Statistics in Medicine. 2002;21:2917–2930. doi: 10.1002/sim.1296.
- Robinson LD, Dorroh JR, Lein D, Tiku ML. The effects of covariate adjustment in generalized linear models. Communications in Statistics, Theory and Methods. 1998;27:1653–1675.
- SAS Institute, Inc. SAS Online Doc 9.1.3. SAS Institute, Inc.; Cary, NC: 2006.
- Senn S. Covariate imbalance and random allocation in clinical trials. Statistics in Medicine. 1989;8:467–475. doi: 10.1002/sim.4780080410.
- Shen X, Huang HC, Ye J. Inference after model selection. Journal of the American Statistical Association. 2004;99:751–762.
- Stefanski LA, Boos DD. The calculus of M-estimation. The American Statistician. 2002;56:29–38.
- Tangen CM, Koch GG. Nonparametric analysis of covariance for hypothesis testing with logrank and Wilcoxon scores and survival-rate estimation in a randomized clinical trial. Journal of Biopharmaceutical Statistics. 1999;9:307–338. doi: 10.1081/BIP-100101179.
- Tsiatis AA. Semiparametric Theory and Missing Data. Springer; New York: 2006.
- Tsiatis AA, Davidian M, Zhang M, Lu X. Covariate adjustment for two-sample treatment comparisons in randomized clinical trials: A principled yet flexible approach. Statistics in Medicine. 2007, in press. doi: 10.1002/sim.3113.
- van der Laan MJ, Robins JM. Unified Methods for Censored Longitudinal Data and Causality. Springer; New York: 2003.
- van der Vaart AW. Asymptotic Statistics. Cambridge University Press; Cambridge: 1998.
- Yang L, Tsiatis AA. Efficiency study for a treatment effect in a pretest-posttest trial. The American Statistician. 2001;55:314–321.