SUMMARY
A biomarker (S) measured after randomization in a clinical trial can often provide information about the true endpoint (T) and hence the effect of treatment (Z). It can usually be measured earlier and more easily than T and as such may be useful in shortening the trial length. One potential use of S is to completely replace T as a surrogate endpoint for evaluating whether the treatment is effective. Another is to serve as an auxiliary variable that provides information and improves inference on the treatment effect when T is not completely observed. The objective of this report is to focus on the role of S as an auxiliary variable and to identify situations in which S can increase efficiency in predicting the treatment effect in a new trial in a multiple-trial setting. Both S and T are continuous. We find that when only S, but not T, is measured in a new trial, higher efficiency gain is associated with higher trial-level correlation but not with individual-level correlation; however, the amount of information recovered from S is usually negligible. In contrast, when T is partially observed in the new trial and the individual-level correlation is relatively high, there is substantial efficiency gain from using S. For design purposes, our results suggest that it is often important to collect markers that have high adjusted individual-level correlation with T and to collect at least a small amount of data on T. The results are illustrated using simulations and an example from a glaucoma clinical trial.
Keywords: auxiliary variables, biomarker, clinical trials, meta-analysis, mixed model, surrogate
1. INTRODUCTION
A biomarker (S) in a clinical trial is a variable intended to provide information about the true endpoint (T) and the effect of treatment (Z). It is often an intermediate physical or laboratory indicator in a disease progression process, can be measured earlier, and is often easier to collect than T. Examples of biomarkers include CD4 counts in AIDS, blood pressure and serum cholesterol level in cardiovascular disease, and prostate-specific antigen in prostate cancer studies. Early measurements are also used as biomarkers for later measurements, such as an earlier vision test result serving as a biomarker for a later result in a study of patients with age-related macular degeneration [1]. Different investigators use different terminology for the roles of biomarkers [2]. In this paper, we call S a surrogate endpoint when the potential use of S is to completely replace T in evaluating whether the treatment is effective [3]. Alternatively, when S is used to provide information or enhance the efficiency of the estimator of the treatment effect on T when T is not completely observed, we call S an auxiliary variable [4]. When the true endpoints are rare, later-occurring or costly to obtain, the proper use of good biomarkers can substantially reduce the trial size and duration, lowering the expense and leading to earlier decision making.
Previous research on the biomarker has often focused on the potential role of S as a surrogate endpoint for T. In a landmark article, Prentice [3] proposed a formal definition for perfect surrogacy and provided validation criteria for a single-trial setting. The criteria require that changes in S fully capture the effect of treatment on T. This paper inspired much research in the field, but the criteria are considered too restrictive for practical use. To relax the criteria, a surrogacy measure based on the proportion of the treatment effect explained (PTE) by S was proposed [5] and further studied and extended by several other authors (e.g., [7, 8, 9]). Freedman [5] also suggested that the PTE confidence interval’s lower bound be > 0.75 for a marker to be acceptable as a surrogate endpoint. However, this requires the treatment effect on T to be very strong, which is rarely observed in practice [8, 10]. The PTE estimator is also highly variable and can be out of the [0,1] range [7, 11]; hence, its practical use is limited.
From a biological standpoint, there are often multiple causal pathways leading to disease and complex mechanisms through which a treatment functions; hence, a biomarker may or may not mediate the effect of the treatment on T, and surrogacy measures are often not directly transferable from one study to another. Another problem is that S may not capture harmful side effects of the treatment. These uncertainties in the use of S to replace T in testing a new treatment can lead to incorrect, even harmful, conclusions [11, 12]. As a result, very few biomarkers have been accepted as valid surrogate endpoints for T and their potential use as substitutes has been less than promising.
With new biomarkers being discovered and developed at a phenomenal rate, the clinical research community continues to be extremely interested in biomarkers in clinical trials. In this paper, we focus on the use of S as an auxiliary variable in helping predict the treatment effect on T. As we shall see, this role of a biomarker proves to be more promising. One of the most common scenarios in which S is useful as an auxiliary outcome is when one has more information on S than on T for a study population. This occurs often in practice, since patients are usually recruited into a trial sequentially in calendar time and S is observed more often and earlier than T, particularly on those enrolled early. Previous surrogacy measures were often proposed based on summary statistics in order to identify a replacement for T, and they are not usually suggested explicitly for the purpose of prediction. In the presence of individual-level data, a biomarker may actually be effective as an auxiliary outcome in enhancing inference, yet not be identified as such by existing surrogacy measures. A strong association between S and T does not suffice for S to be a substitute for T; as Baker et al [13] stated, “a correlate does not make a surrogate”. However, when individual data on T exist, a strong association can inform and increase the efficiency of treatment effect prediction, as we shall demonstrate.
A number of authors have explored the role of biomarkers as auxiliary variables. However, opinions on their value have been mixed, as noted by [14]. In much of the previous work, the information recovered from S appears to be very small [18, 4, 19] except in rare situations where S and T are very highly correlated; however, when there is a stronger structural relationship between S and T, significant efficiency gain from using S is more likely [14]. Most of the work mentioned above focused on the situation where T is the time to an event. When S and T are continuous, Venkatraman and Begg [21] proposed fully nonparametric tests that incorporate the information from S and found that the efficiency gain through S for these tests is small except on rare occasions when the correlation between S and T is extremely high. A homogeneous sample, such as the single-trial setting, has often been considered in previous work. When we can identify a group of trials with similar treatment groups and patient populations, it is natural to use a meta-analytic approach to predict the treatment effect in a new trial. This approach allows one to account for the heterogeneity among different trials and to borrow information from previous trials to improve efficiency.
In this paper, we focus on examining the extent of information gain from S in a multiple-trial setting. We examine the situation where T is either completely or partially missing in a new trial while information on S, T and Z is available from the previous trials. The objective is to predict the treatment effect on T in the new trial when S and T are continuous and Z is binary. We examine the factors, particularly the correlation between S and T and the fraction of missing T, that determine how much the precision of the treatment effect estimate increases when S is utilized, so as to identify the situations in which S can be beneficial. The results are intended to be of practical value and directly applicable to clinical trials.
In Section 2, we introduce a commonly used bivariate mixed model. In Section 3, we summarize several related methods used to predict the effect of Z on T in a new trial when T is either completely missing or partially missing in the new trial. The methods include those proposed by Buyse et al [10], Gail et al [22] and Henderson [23]. In Section 4, we examine the extent of information recovery from S and its relation to the correlation between S and T. In Section 5, we evaluate the methods and efficiency gain through simulations. In Section 6, we give a data example. In Section 7, we present conclusions.
2. The Model
Suppose we have n randomized trials, i = 1, …, n, where the nth trial is labeled as new and there are mi patients in the ith trial. Let Z = 0, 1 denote the placebo and treatment groups, respectively, and let (Sij, Tij, Zij) represent S, T, and Z for individual j in trial i. We are interested in predicting the actual treatment effect on T in the new trial (δTn) based on the previous (n − 1) existing trials and whatever data are available in the nth trial. A commonly used bivariate mixed model describing the joint distribution of Sij and Tij [10] is:
$$
\begin{aligned}
S_{ij} &= \alpha_0 + a_{0i} + (\alpha_1 + a_{1i})\,Z_{ij} + \varepsilon_{Sij},\\
T_{ij} &= \gamma_0 + r_{0i} + (\gamma_1 + r_{1i})\,Z_{ij} + \varepsilon_{Tij},
\end{aligned}
\tag{1}
$$

where the trial-level random effects are

$$
\eta_i = (a_{0i}, r_{0i}, a_{1i}, r_{1i})^T \sim N(0, D), \qquad
D = \begin{pmatrix}
d_{ss} & d_{st} & d_{sa} & d_{sr}\\
d_{st} & d_{tt} & d_{ta} & d_{tr}\\
d_{sa} & d_{ta} & d_{aa} & d_{ar}\\
d_{sr} & d_{tr} & d_{ar} & d_{rr}
\end{pmatrix},
\tag{2}
$$

and the individual-level errors are

$$
\varepsilon_{ij} = (\varepsilon_{Sij}, \varepsilon_{Tij})^T \sim N(0, \sigma), \qquad
\sigma = \begin{pmatrix}
\sigma_{ss} & \sigma_{st}\\
\sigma_{st} & \sigma_{tt}
\end{pmatrix}.
\tag{3}
$$
The treatment effect in the nth trial is $\delta_{Tn} = \gamma_1 + r_{1n}$. Let $\eta_i = (a_{0i}, r_{0i}, a_{1i}, r_{1i})^T$, $\beta^T = (\alpha_0, \gamma_0, \alpha_1, \gamma_1)$ and $\varepsilon_{ij} = (\varepsilon_{Sij}, \varepsilon_{Tij})^T$. The model (1) can be written in general mixed model notation as $Y_i = X_i\beta + U_i\eta_i + \varepsilon_i$, where $\beta$ denotes the fixed effects, $\eta_i$ denotes the random effects, and $X_i$ and $U_i$ are the corresponding design matrices. The vector $Y_i$ follows a multivariate normal distribution with mean $X_i\beta$ and variance $V_i = U_i D U_i^T + \Sigma_i$, where $\Sigma_i$ is a $2m_i \times 2m_i$ block-diagonal matrix with $m_i$ blocks of $\sigma$ on the main diagonal and zeros elsewhere.
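To make the general notation concrete, the two rows contributed by subject j in trial i can be written, under the orderings of $\beta$ and $\eta_i$ above, as

$$
Y_{ij} = \begin{pmatrix} S_{ij}\\ T_{ij} \end{pmatrix}, \qquad
X_{ij} = U_{ij} = \begin{pmatrix} 1 & 0 & Z_{ij} & 0\\ 0 & 1 & 0 & Z_{ij} \end{pmatrix},
$$

so that $X_{ij}\beta + U_{ij}\eta_i = \big(\alpha_0 + a_{0i} + (\alpha_1 + a_{1i})Z_{ij},\; \gamma_0 + r_{0i} + (\gamma_1 + r_{1i})Z_{ij}\big)^T$, and $X_i$ and $U_i$ stack these rows over $j = 1, \ldots, m_i$.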
3. Methods for Predicting the Treatment Effect δTn in the New Trial
In this section, we introduce several related methods used to predict the effect of Z on T in a new trial when T is either completely missing or partially missing in the new trial.
3.1. Buyse et al Method
Buyse et al (BMBRG) [10] assumed the same model and suggested a method to estimate δTn when T is completely unobserved in the nth trial. First, they fit the bivariate mixed model to the data from trials 1 through (n−1) to obtain estimates of D, α0, γ0, α1 and γ1, denoted by $\hat D$, $\hat\alpha_0$, $\hat\gamma_0$, $\hat\alpha_1$ and $\hat\gamma_1$, respectively. Second, they fit a linear regression $S_{nj} = \mu_{0Sn} + \delta_{Sn} Z_{nj} + \varepsilon_{Snj}$ in the nth trial. One then obtains $\hat a_{0n} = \hat\mu_{0Sn} - \hat\alpha_0$ and $\hat a_{1n} = \hat\delta_{Sn} - \hat\alpha_1$, where $\hat\mu_{0Sn}$ and $\hat\delta_{Sn}$ are estimates of μ0Sn and δSn based on data from the nth trial. Given that β, D, σ, a0n and a1n are known, BMBRG showed that δTn follows a normal distribution with conditional mean
$$
E(\delta_{Tn} \mid a_{0n}, a_{1n}) = \gamma_1 +
(d_{sr}\;\; d_{ar})
\begin{pmatrix} d_{ss} & d_{sa}\\ d_{sa} & d_{aa} \end{pmatrix}^{-1}
\begin{pmatrix} a_{0n}\\ a_{1n} \end{pmatrix}
\tag{4}
$$

and conditional variance

$$
\operatorname{Var}(\delta_{Tn} \mid a_{0n}, a_{1n}) = d_{rr} -
(d_{sr}\;\; d_{ar})
\begin{pmatrix} d_{ss} & d_{sa}\\ d_{sa} & d_{aa} \end{pmatrix}^{-1}
\begin{pmatrix} d_{sr}\\ d_{ar} \end{pmatrix}.
\tag{5}
$$
While various methods can be used to obtain the estimate of δTn, denoted by $\hat\delta_{Tn}$, in our simulations we replace β, D, σ, a0n and a1n with their estimates in equations (4) and (5), as is often done in practice. Specifically, we estimate β, D and σ by restricted maximum likelihood using PROC MIXED in SAS. We estimate μ0Sn and δSn using PROC GLM in SAS and then obtain the estimates of a0n and a1n. However, this often leads to underestimation of $\operatorname{Var}(\hat\delta_{Tn})$.
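To make the two-stage computation concrete, here is a minimal sketch in Python/NumPy; every numerical value is a hypothetical placeholder standing in for the REML and regression estimates described above, and the ordering of D follows $\eta_i = (a_{0i}, r_{0i}, a_{1i}, r_{1i})$:

```python
import numpy as np

# Hypothetical REML estimates (illustrative values only); in practice these
# come from fitting model (1) to trials 1..(n-1), e.g. via PROC MIXED.
# D is the between-trial covariance of eta_i = (a0, r0, a1, r1).
D = np.array([[0.50, 0.10, 0.20, 0.15],
              [0.10, 0.20, 0.10, 0.12],
              [0.20, 0.10, 3.50, 1.50],
              [0.15, 0.12, 1.50, 1.60]])
gamma1 = 1.0          # fixed treatment effect on T
a0n, a1n = 0.4, -0.3  # new-trial random effects for S, from regressing S on Z

idx_a = [0, 2]                        # positions of (a0, a1) in eta
D_aa = D[np.ix_(idx_a, idx_a)]        # Cov((a0, a1))
d_ra = D[3, idx_a]                    # Cov(r1, (a0, a1))
d_rr = D[3, 3]                        # Var(r1)

w = np.linalg.solve(D_aa, d_ra)       # regression weights of r1 on (a0, a1)
mean_dTn = gamma1 + w @ np.array([a0n, a1n])  # conditional mean (4)
var_dTn = d_rr - w @ d_ra                     # conditional variance (5)
print(mean_dTn, var_dTn)
```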
3.2. Gail et al Method
Gail et al (GPHC) [22] proposed to estimate δTn without modeling the joint distribution of (Sij, Tij) at the individual level. The method applies to the situation where T is completely unobserved in the nth trial. Let μ0Ti and μ1Ti represent the marginal means of T in the Z = 0 and Z = 1 groups in the ith trial, and define μ0Si and μ1Si similarly for S. GPHC assume that $\mu_i = (\mu_{1Ti}, \mu_{1Si}, \mu_{0Ti}, \mu_{0Si})^T$ follows a multivariate normal distribution with covariance φ, where φ is a 4 × 4 matrix representing the between-trial variance; hence, its estimate $\hat\mu_i$ follows a multivariate normal distribution with covariance φ + ωi, where ωi is a 4 × 4 matrix with two diagonal blocks denoting the within-trial variance for each treatment group. The elements of μTi, μSi, and φ are connected with the parameters of model (1) as follows: μ0Ti = γ0 + r0i, μ1Ti = γ0 + r0i + γ1 + r1i, μ0Si = α0 + a0i, μ1Si = α0 + a0i + α1 + a1i, φ11 = dtt + drr + 2dtr, φ12 = dts + dar + dta + dsr, φ13 = dtt + dtr, φ14 = dts + dsr, φ22 = dss + daa + 2dsa, φ23 = dst + dta, φ24 = dss + dsa, φ33 = dtt, φ34 = dst, and φ44 = dss.
GPHC show that $\mu_{Tn} = (\mu_{1Tn}, \mu_{0Tn})^T$ given $\hat\mu_{Sn} = (\hat\mu_{1Sn}, \hat\mu_{0Sn})^T$ (and β, φ and ω) follows a normal distribution with mean

$$
E(\mu_{Tn} \mid \hat\mu_{Sn}) =
\begin{pmatrix} \gamma_0 + \gamma_1\\ \gamma_0 \end{pmatrix}
+ \phi_{TS}\,(\phi_{SS} + \omega_{Sn})^{-1}
\left[\hat\mu_{Sn} - \begin{pmatrix} \alpha_0 + \alpha_1\\ \alpha_0 \end{pmatrix}\right]
$$

and variance

$$
\operatorname{Var}(\mu_{Tn} \mid \hat\mu_{Sn}) = \phi_{TT} - \phi_{TS}\,(\phi_{SS} + \omega_{Sn})^{-1}\,\phi_{TS}^T,
$$

where $\phi_{TT}$, $\phi_{SS}$ and $\phi_{TS}$ are the 2 × 2 blocks of φ corresponding to the T means, the S means and their cross-covariance, $\omega_{Sn} = \operatorname{diag}(\omega_{22n}, \omega_{44n})$, and $\omega_{22n}$ denotes the variance of $\hat\mu_{1Sn}$ and $\omega_{44n}$ that of $\hat\mu_{0Sn}$.

The treatment effect on T in the new trial, $\delta_{Tn} = \mu_{1Tn} - \mu_{0Tn}$, therefore has mean

$$
\hat\delta_{Tn} = (1\;\; -1)\, E(\mu_{Tn} \mid \hat\mu_{Sn})
\tag{6}
$$

and variance

$$
\operatorname{Var}(\hat\delta_{Tn}) = (1\;\; -1)\, \operatorname{Var}(\mu_{Tn} \mid \hat\mu_{Sn}) \begin{pmatrix} 1\\ -1 \end{pmatrix}.
\tag{7}
$$
If we drop the terms ω22n and ω44n from the above expressions, we obtain expressions identical to the BMBRG mean and variance. The GPHC formulas take into account the uncertainty associated with estimating a0n and a1n, while BMBRG does not. Similar to BMBRG, GPHC also assume that β, D and σ are known in deriving equations (6) and (7). Since the uncertainties in β, D and σ are not accounted for, $\operatorname{Var}(\hat\delta_{Tn})$ is often underestimated. Gail et al (2000) noted that this method is analogous to generalized estimating equations (GEE) [6]. We note that the GEE approach can handle the situation where T is partially observed in the new trial; thus the GPHC method could be generalized and would be worthy of further investigation.
To estimate δTn and $\operatorname{Var}(\hat\delta_{Tn})$ in our simulations, we first calculate the empirical covariance matrix of the treatment- and trial-specific means $\hat\mu_i$. We then calculate the treatment-specific covariances of S and T within each trial and average them over trials to estimate the within-trial variances; from these, we calculate $\hat\phi$, where $\hat\phi$ and $\hat\omega_i$ denote the estimates of φ and ωi, respectively. We calculate the overall treatment-specific means of T (i.e., the estimates of γ0 and γ1) and the variances of the treatment group means of S in the new trial (i.e., the estimates of ω22n and ω44n). We estimate μ0Sn and μ1Sn and then calculate $\hat a_{0n}$ and $\hat a_{1n}$. We then plug these estimates into (6) and (7) to obtain the mean and variance of $\hat\delta_{Tn}$.
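The moment-based pipeline can be sketched as follows. All inputs below are hypothetical placeholders: the simulated trial means stand in for real trial summaries, and the crude overall means stand in for the fixed-effect estimates:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical trial- and treatment-specific mean estimates from the
# (n-1) = 39 previous trials, ordered (mu_1T, mu_1S, mu_0T, mu_0S),
# drawn here from an assumed between-trial covariance for illustration.
phi_true = np.array([[1.8, 0.9, 0.2, 0.1],
                     [0.9, 2.6, 0.1, 0.5],
                     [0.2, 0.1, 0.2, 0.1],
                     [0.1, 0.5, 0.1, 0.5]])
mu_hat = rng.multivariate_normal([3.0, 2.0, 2.0, 1.0], phi_true, size=39)
omega_bar = np.diag([0.02, 0.05, 0.02, 0.05])   # assumed within-trial covariance

phi = np.cov(mu_hat, rowvar=False) - omega_bar  # moment estimate of phi

# New-trial inputs: estimated S means and their sampling variances
mu_S_new = np.array([2.1, 0.9])                 # (mu_1Sn_hat, mu_0Sn_hat)
omega_S_new = np.diag([0.05, 0.05])             # (omega_22n, omega_44n)

iT, iS = [0, 2], [1, 3]                         # T and S positions in mu_i
beta_T = mu_hat[:, iT].mean(axis=0)             # crude fixed-effect estimates
beta_S = mu_hat[:, iS].mean(axis=0)

# Conditional normal: T means given the new trial's estimated S means
W = phi[np.ix_(iT, iS)] @ np.linalg.inv(phi[np.ix_(iS, iS)] + omega_S_new)
mu_T_new = beta_T + W @ (mu_S_new - beta_S)     # conditional T means
V_T_new = phi[np.ix_(iT, iT)] - W @ phi[np.ix_(iT, iS)].T

c = np.array([1.0, -1.0])                       # delta_Tn = mu_1Tn - mu_0Tn
print(c @ mu_T_new, c @ V_T_new @ c)            # estimates (6) and (7)
```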
3.3. Henderson Method (HD)
While both the BMBRG and GPHC methods apply only when T is completely missing in the new trial, the HD method applies when T is completely missing, partially missing, or completely observed in the new trial. Using the general mixed model notation, we can obtain the estimates of β and ηn (denoted by $\hat\beta$ and $\hat\eta_n$) by solving the mixed model equation described by Henderson [23] (details in Appendix A). Their sum estimates the vector of trial-specific parameters $\beta + \eta_n = (\mu_{0Sn}, \mu_{0Tn}, \delta_{Sn}, \delta_{Tn})^T$ and follows a normal distribution with mean

$$
E(\hat\beta + \hat\eta_n) = \beta + \eta_n
\tag{8}
$$

and variance

$$
\Lambda_n = (I\;\; I)\, C_n\, (I\;\; I)^T,
$$

where $C_n$ denotes the block of the covariance matrix C (given in Appendix A) corresponding to $(\hat\beta - \beta, \hat\eta_n - \eta_n)$. The predicted treatment effect for the nth trial therefore has mean

$$
\hat\delta_{Tn} = \hat\gamma_1 + \hat r_{1n}
\tag{9}
$$

and variance

$$
\operatorname{Var}(\hat\delta_{Tn} - \delta_{Tn}) = (0\;\,0\;\,0\;\,1)\, \Lambda_n\, (0\;\,0\;\,0\;\,1)^T.
\tag{10}
$$
Note that $\hat\delta_{Tn}$ is the best linear unbiased predictor (BLUP) and can be derived as an empirical Bayes estimator [25, 26]. When T is completely missing in the nth trial, the expression for $\hat\delta_{Tn}$ in (9) is exactly the same as the GPHC estimate in (6). Unlike GPHC and BMBRG, the variance formula in (10) accounts for the uncertainty associated with estimating β, but it still treats D and σ as known quantities. In the implementations, we obtain these estimates using PROC MIXED in SAS.
3.4. Empirical Bayes Estimate and Conditional Posterior Variance (EB-CPV)
Let r be the number of patients in the new trial for whom we have information on both S and T. The empirical Bayes estimate of δTn can be obtained as the posterior mode when we assume flat priors for the fixed effects and multivariate normal priors for the random effects [26]. Its expression is identical to the HD estimate in equation (9) [26]. When β, D and σ are known, the conditional posterior variance (CPV) of δTn can approximate the variance of $\hat\delta_{Tn}$ [28]. We obtain the CPV of δTn as (details in Appendix C):

$$
\mathrm{CPV} = (0\;\;1)\left(\Psi_d^{-1} + \Phi_e^{-1}\right)^{-1}(0\;\;1)^T,
\tag{11}
$$

where $\Psi_d$ is a function only of the between-trial covariances, given by the conditional covariance of $(r_{0n}, r_{1n})$ given $(a_{0n}, a_{1n})$,

$$
\Psi_d = \begin{pmatrix} d_{tt} & d_{tr}\\ d_{tr} & d_{rr} \end{pmatrix} -
\begin{pmatrix} d_{st} & d_{ta}\\ d_{sr} & d_{ar} \end{pmatrix}
\begin{pmatrix} d_{ss} & d_{sa}\\ d_{sa} & d_{aa} \end{pmatrix}^{-1}
\begin{pmatrix} d_{st} & d_{sr}\\ d_{ta} & d_{ar} \end{pmatrix},
$$

and $\Phi_e$ is a function only of the within-trial covariances, given by the covariance matrix of the complete-case estimates of $(\mu_{0Tn}, \delta_{Tn})$ based on the r paired observations after adjusting for S; its simplified form under equal allocation is given in Section 4.2.
When T is completely missing in the nth trial, i.e., r = 0, the CPV simplifies to

$$
\mathrm{CPV} = (0\;\;1)\,\Psi_d\,(0\;\;1)^T = d_{rr} -
(d_{sr}\;\; d_{ar})
\begin{pmatrix} d_{ss} & d_{sa}\\ d_{sa} & d_{aa} \end{pmatrix}^{-1}
\begin{pmatrix} d_{sr}\\ d_{ar} \end{pmatrix},
\tag{12}
$$
an expression equivalent to the BMBRG variance formula in (5). The CPV formula can thus be viewed as a generalization of the BMBRG variance formula. Note that the CPV underestimates the prediction variance because it treats β, D, σ, a0n and a1n as known quantities. Morris [29] and Ghosh and Rao [28] showed that a better estimator of the prediction variance can be obtained by adding to the CPV a second term that accounts for the uncertainty about all parameters.
3.5. Bayesian Estimation (denoted by Bayes)
An alternative way to obtain the distributions of the parameters of interest is a fully Bayesian estimation method, which is also applicable when T is either partially or completely missing. We assume flat priors for the fixed effects, i.e., p(α0) ∝ 1, p(γ0) ∝ 1, p(α1) ∝ 1 and p(γ1) ∝ 1, and vague priors for the remaining parameters; specifically, $\sigma^{-1} \sim W(a, E)$ and $D^{-1} \sim W(c, F)$, where W denotes the Wishart distribution. We use a = 3, c = 5, E = (a + 1)−1I2 and F = (c + 1)−1I4. A data augmentation method is used to implement the procedure (details in Appendix B). The Bayesian estimation method naturally takes into account the uncertainty associated with estimating every parameter [27], but it can be sensitive to the prior specification. While it is computationally intensive to evaluate the properties of this method through extensive simulations, it is quite feasible for analyzing a given data set.
4. Efficiency Gain and Correlation
In this section, we study the precision of the predicted treatment effect ($\hat\delta_{Tn}$) and the factors that impact the precision, particularly the correlation between S and T and the fraction of missingness.
4.1. Correlation
In a multiple-trial setting, under the bivariate mixed model assumption, the treatment-adjusted individual-level (or within-trial) correlation between S and T is

$$
R_{\text{indiv}} = \frac{\sigma_{st}}{\sqrt{\sigma_{ss}\,\sigma_{tt}}}.
$$

The trial-level correlation between S and T is defined by Buyse et al [10] through

$$
R_{\text{trial}}^2 = \frac{
(d_{sr}\;\; d_{ar})
\begin{pmatrix} d_{ss} & d_{sa}\\ d_{sa} & d_{aa} \end{pmatrix}^{-1}
\begin{pmatrix} d_{sr}\\ d_{ar} \end{pmatrix}}{d_{rr}}.
$$

The between-trial correlation assesses how well the treatment effect on T in the new trial can be predicted by that on S. While $R_{\text{trial}}$ is identified as the key factor impacting the degree of efficiency gain from S in the work of Buyse et al [10] and Gail et al [22], as we shall see below, $R_{\text{indiv}}$ plays an even more important role than $R_{\text{trial}}$ in obtaining substantial efficiency gain from S with respect to the estimated treatment effect on T when T is partially observed.
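In code, both correlations are simple functions of the fitted covariance components. A minimal sketch, with σ and D as hypothetical fitted values (D ordered as (a0, r0, a1, r1)):

```python
import numpy as np

def individual_level_corr(sigma):
    """Treatment-adjusted within-trial correlation of S and T."""
    return sigma[0, 1] / np.sqrt(sigma[0, 0] * sigma[1, 1])

def trial_level_corr(D):
    """R_trial: how well (a0, a1) predict the treatment effect r1."""
    D_aa = D[np.ix_([0, 2], [0, 2])]
    d_ra = D[3, [0, 2]]
    r2 = d_ra @ np.linalg.solve(D_aa, d_ra) / D[3, 3]
    return np.sqrt(r2)

sigma = np.array([[1.00, 0.45],
                  [0.45, 0.30]])           # hypothetical within-trial covariance
D = np.array([[0.50, 0.10, 0.20, 0.15],
              [0.10, 0.20, 0.10, 0.12],
              [0.20, 0.10, 3.50, 1.50],
              [0.15, 0.12, 1.50, 1.60]])   # hypothetical between-trial covariance
print(individual_level_corr(sigma), trial_level_corr(D))
```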
4.2. Prediction Precision and Correlation
We examine the impacts of $R_{\text{trial}}$ and $R_{\text{indiv}}$ on the prediction precision using the CPV formula in equation (11). We note that when there is an equal number of patients per treatment group among the r complete observations in the new trial, the elements of $\Phi_e$ in the CPV simplify to

$$
\Phi_e = \frac{\sigma_{tt}\,(1 - R_{\text{indiv}}^2)}{r}
\begin{pmatrix} 2 & -2\\ -2 & 4 \end{pmatrix}.
$$
When T is completely missing in the new trial, the CPV simplifies to $d_{rr}(1 - R_{\text{trial}}^2)$; hence, the factors that determine the precision of the predictor of the treatment effect on T are $R_{\text{trial}}$ and $d_{rr}$, which are between-trial quantities. When T is partially observed, the additional important factors are within-trial quantities, including $R_{\text{indiv}}$, $\sigma_{tt}$ and r. Since the within-trial covariances in $\Phi_e$ are usually significantly smaller than the between-trial covariances in $\Psi_d$, we find that $\Phi_e$ usually dominates and $\Psi_d$ has a negligible impact on the CPV. Although the CPV usually underestimates the prediction variance, our simulation studies show that it usually accounts for the majority of the total variance, and a comparison between (11) and (12) should suffice to provide algebraic intuition about the prediction variance.
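A small numerical sketch of this comparison, evaluating formula (11) with the $\Psi_d$ form from Section 3.4 and the equal-allocation $\Phi_e$ above (both as reconstructed here; the $\Psi_d$ values below are hypothetical):

```python
import numpy as np

def cpv_delta(Psi_d, sigma_tt, rho2, r):
    """CPV (11) for delta_Tn, using the equal-allocation Phi_e sketched above."""
    if r == 0:
        V = Psi_d                          # reduces to (12): no T data
    else:
        s2 = sigma_tt * (1 - rho2)         # residual variance of T given S
        Phi_e = s2 / r * np.array([[2.0, -2.0],
                                   [-2.0, 4.0]])
        V = np.linalg.inv(np.linalg.inv(Psi_d) + np.linalg.inv(Phi_e))
    return V[1, 1]

# Hypothetical between-trial conditional covariance of (r0n, r1n) | (a0n, a1n)
Psi_d = np.array([[0.15, 0.05],
                  [0.05, 1.20]])
for rho in (0.1, 0.5, 0.9):                # individual-level correlation
    print(rho, [round(cpv_delta(Psi_d, 0.3, rho**2, r), 4)
                for r in (0, 10, 30, 100)])
```

Running this shows the pattern discussed above: at r = 0 the CPV is governed entirely by $\Psi_d$, while even modest r combined with a high individual-level correlation shrinks the CPV toward the within-trial term.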
5. Simulations
5.1. The Setup
We conduct simulation studies to evaluate the bias, efficiency and coverage rates of the confidence intervals for the predicted treatment effect in a new trial using the above methods. For comparison purposes, we also estimate δTn based on the observed T using the simple difference in means without any distributional assumption (denoted by SIMPLE). That is,

$$
\hat\delta_{Tn} = \frac{1}{m_{n1}}\sum_{k=1}^{m_{n1}} T_{nk1} - \frac{1}{m_{n0}}\sum_{l=1}^{m_{n0}} T_{nl0},
$$

where Tnk1 represents T for patient k in the Z = 1 group of the nth trial and similarly for Tnl0, and mn1 represents the number of patients in the Z = 1 group of the nth trial and similarly for mn0.
We generate 500 data sets from the bivariate mixed model in (1). We assume an equal number of patients per trial and let mi = m. The parameter specifications are: βT = (1, 2, 1, 1), dss = 0.5, dtt = 0.2, daa = 3.5, drr = 1.6, σss = 1 and σtt = 0.3. To examine the impact of the trial-level correlation, we vary the correlation matrix of the random effects so that $R_{\text{trial}}$ equals 0.1, 0.5 and 0.8, respectively. To examine the impact of the individual-level correlation, we vary $R_{\text{indiv}}$ from 0.1, 0.5, to 0.9. We vary n, m, and the percentage of missingness in the new trial (denoted by p). Each data set has a different underlying true treatment effect δTn, because δTn is not fixed but follows a known distribution; its average across the 500 data sets is denoted by $\bar\delta_{Tn}$. For each data set and each method, we obtain $\hat\delta_{Tn}$, its standard error, its 95% CI as $\hat\delta_{Tn} \pm 1.96\,\mathrm{SE}$, and an indicator of whether the 95% CI contains δTn. Let $\bar{\hat\delta}_{Tn}$ denote the average of $\hat\delta_{Tn}$ across the 500 data sets. We evaluate each method by its average bias $\bar{\hat\delta}_{Tn} - \bar\delta_{Tn}$, the average standard error (SE), the root mean squared error $\mathrm{RMSE} = \big\{\frac{1}{500}\sum (\hat\delta_{Tn} - \delta_{Tn})^2\big\}^{1/2}$, and the coverage rate (CR) over all simulated data sets. As we will see, all estimates are unbiased, so the relative efficiency (RE) of two estimators can be approximated by the inverse of the ratio of their squared RMSEs.
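A condensed sketch of generating one data set under these specifications follows; the off-diagonal structure of D below is a hypothetical choice that yields $R_{\text{trial}} = 0.5$ (only a1 and r1 are correlated), and σst is set so that $R_{\text{indiv}} \approx 0.5$:

```python
import numpy as np

rng = np.random.default_rng(2024)
n, m = 10, 100                               # trials and patients per trial
beta = np.array([1.0, 2.0, 1.0, 1.0])        # (alpha0, gamma0, alpha1, gamma1)
sigma = np.array([[1.00, 0.27],
                  [0.27, 0.30]])             # sigma_st gives R_indiv ~ 0.49
d_diag = np.array([0.5, 0.2, 3.5, 1.6])      # (dss, dtt, daa, drr)
corr = np.eye(4)
corr[2, 3] = corr[3, 2] = 0.5                # hypothetical: corr(a1, r1) = 0.5,
                                             # so R_trial = 0.5 here
D = np.sqrt(np.outer(d_diag, d_diag)) * corr

data = []
for i in range(n):
    eta = rng.multivariate_normal(np.zeros(4), D)     # (a0, r0, a1, r1)
    Z = np.repeat([0, 1], m // 2)
    mu_S = beta[0] + eta[0] + (beta[2] + eta[2]) * Z
    mu_T = beta[1] + eta[1] + (beta[3] + eta[3]) * Z
    eps = rng.multivariate_normal(np.zeros(2), sigma, size=m)
    data.append(np.column_stack([np.full(m, i), Z,
                                 mu_S + eps[:, 0], mu_T + eps[:, 1]]))
data = np.vstack(data)   # columns: trial, Z, S, T
```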
5.2. Method Evaluation
In Table I, we present the Bias, RMSE, SE and CR of $\hat\delta_{Tn}$ for the respective methods (SIMPLE, HD, BMBRG, GPHC, Bayes and EB-CPV) from simulations with various combinations of n, m, and the percentage of missingness. We fix $R_{\text{trial}}$ and $R_{\text{indiv}}$ for these comparisons. When T is completely or 50% observed in the new trial, SIMPLE and HD generate estimates that are unbiased, have similar RMSEs, and have confidence intervals with nominal or close-to-nominal coverage rates; on the other hand, CPV consistently gives underestimated prediction variances (i.e., SE < RMSE).
Table I.

| n | m | % Missing | Method | Bias | RMSE | SE | CR |
|---|---|---|---|---|---|---|---|
| 10 | 100 | 0% | SIMPLE | −0.005 | 0.111 | 0.109 | 95.0 |
| | | | HD | −0.005 | 0.112 | 0.106 | 94.8 |
| | | | Bayes | −0.006 | 0.112 | 0.109 | 94.8 |
| | | | EB-CPV | −0.005* | 0.112* | 0.093† | 90.8 |
| | | 50% | SIMPLE | −0.002 | 0.149 | 0.139 | 95.4 |
| | | | HD | 0.000 | 0.143 | 0.139 | 94.0 |
| | | | Bayes | −0.004 | 0.145 | 0.145 | 94.8 |
| | | | EB-CPV | 0.000* | 0.143* | 0.132† | 91.2 |
| | | 100% | BMBRG | −0.013 | 1.193 | 0.736 | 76.6 |
| | | | GPHC | −0.010 | 1.125 | 0.755 | 80.2 |
| | | | HD | −0.011 | 1.125 | 0.795 | 82.0 |
| | | | Bayes | −0.037 | 1.139 | 1.105 | 94.8 |
| 40 | 100 | 0% | SIMPLE | −0.005 | 0.111 | 0.109 | 95.0 |
| | | | HD | −0.005 | 0.110 | 0.108 | 95.0 |
| | | | EB-CPV | −0.005* | 0.110* | 0.093† | 91.4 |
| | | 50% | SIMPLE | −0.002 | 0.149 | 0.155 | 95.4 |
| | | | HD | 0.003 | 0.140 | 0.142 | 95.8 |
| | | | EB-CPV | 0.003* | 0.140* | 0.130† | 93.2 |
| | | 100% | BMBRG | −0.008 | 0.965 | 0.887 | 92.8 |
| | | | GPHC | −0.008 | 0.965 | 0.877 | 92.6 |
| | | | HD | −0.008 | 0.965 | 0.869 | 92.4 |
| 40 | 300 | 0% | SIMPLE | 0.002 | 0.063 | 0.063 | 94.2 |
| | | | HD | 0.002 | 0.063 | 0.063 | 94.4 |
| | | | EB-CPV | 0.002* | 0.063* | 0.066† | 90.9 |
| | | 50% | SIMPLE | −0.002 | 0.089 | 0.090 | 94.4 |
| | | | HD | 0.000 | 0.085 | 0.083 | 93.6 |
| | | | EB-CPV | 0.000* | 0.085* | 0.077† | 91.4 |
| | | 100% | BMBRG | −0.002 | 0.920 | 0.868 | 93.8 |
| | | | GPHC | −0.002 | 0.919 | 0.871 | 93.8 |
| | | | HD | −0.002 | 0.919 | 0.882 | 94.0 |
| 55 | 100 | 0% | SIMPLE | −0.005 | 0.111 | 0.109 | 95.0 |
| | | | HD | −0.005 | 0.110 | 0.108 | 95.0 |
| | | | EB-CPV | −0.005* | 0.110* | 0.094† | 90.8 |
| | | 50% | SIMPLE | −0.002 | 0.149 | 0.155 | 95.4 |
| | | | HD | 0.002 | 0.140 | 0.142 | 95.8 |
| | | | EB-CPV | 0.002* | 0.140* | 0.131† | 93.4 |
| | | 100% | BMBRG | −0.033 | 0.950 | 0.883 | 93.8 |
| | | | GPHC | −0.033 | 0.948 | 0.898 | 94.0 |
| | | | HD | −0.033 | 0.948 | 0.898 | 94.2 |
When T is completely missing in the new trial, BMBRG, GPHC and HD all underestimate the variance of $\hat\delta_{Tn}$. When the number of trials is relatively large (n = 40, 55), the extent of underestimation is minor; however, with a small number of trials (n = 10), it can be more severe and the coverage rates can fall below 85%. Although HD is expected to have better CR than GPHC, and GPHC better than BMBRG, because they account for progressively more of the parameter uncertainty, the advantages of HD and GPHC over BMBRG are small and all three methods give similar CRs. The Bayes method gives more precise estimates of the variances, with coverage rates around the 95% nominal level.
The SIMPLE and HD methods give estimates with similar precision, which shows that the efficiency gain from the bivariate normality assumption is small. When T is partially or completely observed, increasing m improves the precision of the estimates, while a larger n does not necessarily improve the precision much. When T is completely missing, there is a minor gain in precision as n and m increase.
5.3. $R_{\text{trial}}$, $R_{\text{indiv}}$, Percentage of Missingness and Information Recovery from S
Figure 1A shows the relative efficiency of $\hat\delta_{Tn}$ when T is completely missing in the new trial, compared with the estimate before any deletion of T, using the HD method. Relative efficiency is defined as the inverse of the ratio of the two variances. We vary $R_{\text{trial}}$ and $R_{\text{indiv}}$ and let n = 40 and m = 100. We find that while increases in $R_{\text{indiv}}$ have a negligible impact on the precision, an increase in $R_{\text{trial}}$ improves the precision more than any other factor. These findings agree with the algebraic intuition from the CPV variance formula in (12). Relative to the estimate based on completely observed data, the relative efficiency rises from 0.7% to 1.2% to 3.4% as we increase $R_{\text{trial}}$ from 0.1 to 0.5 to 0.8. As a result, when we rely entirely on S and summary statistics from previous trials to predict δTn, the extent of information recovery is often limited and the precision of $\hat\delta_{Tn}$ is usually insufficient to be clinically useful.
Figure 1B presents the relative efficiency of $\hat\delta_{Tn}$ when T is 50% missing, compared with the estimate before any deletion of T, using the HD method. We find that a high $R_{\text{indiv}}$ can lead to a large efficiency gain from the use of S. When $R_{\text{indiv}}$ is large (e.g., 0.7 or 0.9), most of the information on δTn is recovered from S and the precision of the estimate is close to that when T is completely observed. On the other hand, the magnitude of $R_{\text{trial}}$ does not have much impact on the amount of efficiency gain from S. These observations agree with the CPV variance formula in (11).
Figure 1C shows the relative efficiency of $\hat\delta_{Tn}$ when T is partially or completely missing, compared with the estimate before any deletion of T. Naturally, the higher the proportion of available T, the smaller the RMSE, and thus the greater the precision of the treatment effect prediction. Interestingly, we find that there is a substantial efficiency gain from the information on S with even a small fraction of observed T, particularly when $R_{\text{indiv}}$ is high. For example, when 30% of T are observed, the information lost due to missingness is almost completely recovered from S when $R_{\text{indiv}}$ is high (e.g., 0.9).
6. Data Analysis: a Glaucoma Study
The evaluation of the extent of information recovery from S in predicting the treatment effect on T in a new trial is illustrated using the Collaborative Initial Glaucoma Treatment Study (CIGTS) [30]. Glaucoma is a group of diseases that cause vision loss and is a leading cause of blindness. High pressure in the eye, i.e., intraocular pressure (IOP), is a major risk factor for glaucoma. The CIGTS is a randomized multi-center clinical trial comparing the effects of two treatments, surgery and medicine, on reducing IOP among glaucoma patients. Patients were enrolled between 1993 and 1997. A total of 607 patients were included in the study, 307 of whom were randomly assigned to the medicine group. IOP (recorded in mmHg) was measured at different time points following treatment. For the purpose of this paper, we take the IOP measurement at month 96 as T and that at month 12 as S. We assume that the IOP measurements are normally distributed. To emulate a meta-analysis in which data come from different trials, we treat the different centers in the CIGTS as independent trials testing a similar group of treatments. A preliminary analysis of these data shows that the estimate of the between-trial covariance matrix, $\hat D$, is non-positive definite. Mimicking the approach of Gail et al [22], we scale up the data by simulating Sij and Tij from bivariate normal distributions for each trial and treatment group with the trial-specific and treatment-specific means and variance-covariances from the real data. Nonetheless, our results are generalizable. The CIGTS includes 14 centers, from which we delete five (centers 5, 7, 12, 13 and 14) either because they had too few observations or because of non-positive definite covariance matrices within center. We also delete two outliers greater than 35 mmHg. For the centers included (n = 9), we increase the sample sizes to 335, 176, 385, 264, 539, 368, 286, 528, and 319. The trial-specific and treatment-specific means and correlations for S and T are listed in Table II.
Table II.

| Center | Sample Size | Medicine (Means of S, T) | Surgery (Means of S, T) | Individual-level Correlation (Medicine) | Individual-level Correlation (Surgery) |
|---|---|---|---|---|---|
| 1 | 670 | (17.63, 16.52) | (13.76, 14.59) | 0.367 | 0.608 |
| 2 | 352 | (17.22, 16.42) | (14.63, 12.98) | −0.455 | 0.467 |
| 3 | 770 | (19.27, 17.58) | (15.81, 16.17) | 0.589 | 0.548 |
| 4 | 528 | (17.17, 15.51) | (10.93, 12.88) | 0.176 | 0.540 |
| 5 | 1078 | (18.52, 18.67) | (14.99, 15.32) | 0.435 | 0.407 |
| 6 | 736 | (18.62, 18.89) | (15.13, 17.11) | −0.16 | −0.0056 |
| 7 | 572 | (18.35, 15.34) | (14.59, 14.53) | 0.177 | 0.396 |
| 8 | 1056 | (18.59, 16.16) | (13.60, 13.72) | 0.31 | 0.95 |
| 9 | 638 | (17.56, 16.82) | (14.19, 14.61) | 0.042 | 0.756 |
The HD method is used to fit the rescaled data, for which $\hat D$ is positive definite, and the estimates of the trial-level and individual-level correlations are obtained as 0.25 and 0.15, respectively. We randomly select Center 8 as the new trial and delete a proportion of T in Center 8 to examine the extent of efficiency gain through the use of S. The missingness mechanism is missing completely at random [24]. The results are listed in Table III. Without missing T, $\hat\delta_{Tn}$ is −2.45 with a standard error of 0.29. When T is completely missing, $\hat\delta_{Tn}$ is −1.58 with a standard error of 0.79. When 20% or 50% of T are missing, the precision of $\hat\delta_{Tn}$ using S is comparable to that based on completely observed T. Even with 80% missing, the SE is substantially smaller than when 100% of T is missing. For further illustration, we treat Center 9 as a new trial and obtain similar results. With the rescaled data, we have artificially increased the sample size approximately five-fold for each trial; hence the power to detect the treatment effect is much larger than with the original data. When δTn is predicted solely from S, the relative efficiency is only about 10% compared with that when T is not missing; the estimate reaches the 0.05 significance level for Center 9 and is not quite significant for Center 8. In this particular study, S was completely observed by the end of 1998 while T only began to be observed in 2001; thus, by relying solely on S to predict δTn, we could significantly shorten the trial length, but the result would be only of borderline significance. In practice, many trials do not have such a strong effect, so when δTn is predicted solely from S, the substantial loss in precision often results in failure to detect any real treatment effect. In the CIGTS, by October 2002, about 20% of T would have been observed and the treatment effect is clearly significant, illustrating the benefit of the significant increase in the precision of $\hat\delta_{Tn}$ obtained by utilizing a small fraction of T. There is also a considerable time saving compared with collecting T on all subjects, which would have required follow-up to 2005.
Table III.

| p (% of T missing) | Estimate | Standard Error | p-value |
|---|---|---|---|
| center = 8 | | | |
| SIMPLE† | −2.45 | 0.29 | < .0001 |
| No missing‡ | −2.33 | 0.22 | < .0001 |
| 100% missing‡ | −1.58 | 0.79 | 0.063 |
| 90% missing‡ | −1.50 | 0.47 | 0.0059 |
| 80% missing‡ | −2.37 | 0.39 | < .0001 |
| 50% missing‡ | −2.61 | 0.29 | < .0001 |
| 20% missing‡ | −2.19 | 0.23 | < .0001 |
| center = 9 | | | |
| SIMPLE† | −2.21 | 0.30 | < .0001 |
| No missing‡ | −2.32 | 0.27 | < .0001 |
| 100% missing‡ | −2.68 | 0.82 | 0.0053 |
| 90% missing‡ | −2.19 | 0.61 | 0.0023 |
| 80% missing‡ | −2.30 | 0.49 | < .0002 |
| 50% missing‡ | −2.04 | 0.36 | < .0001 |
| 20% missing‡ | −2.15 | 0.30 | < .0001 |

† Based on complete data before any deletion.
‡ HD method was used.
7. DISCUSSION
In this report, we examine the role of biomarkers as auxiliary variables in predicting the treatment effect and identify situations in which biomarkers can be beneficial in a multiple-trial setting. While previous literature on the use of biomarkers as substitutes for true endpoints has been mostly negative and the proposed surrogacy measures are often not useful in practice, we show that S can be useful as an auxiliary variable that provides information and enhances the inference on T. Although a high correlation between S and T does not qualify S as a good surrogate [13], we show that the correlation is a critical measure in determining the extent of information recovery from S.
In a multiple-trial setting, when T is completely unobserved, $R_{\text{indiv}}$ has little impact on the amount of information recovered from S; on the other hand, the higher $R_{\text{trial}}$, the higher the efficiency gain from S. However, even with a relatively high $R_{\text{trial}}$, the treatment effect predicted solely from data in other trials and the biomarker in the new trial is usually too imprecise to be clinically useful. On the other hand, when the treatment effect on T predicted solely from S would be sufficient to detect the treatment effect difference, the benefit of reducing the trial length can be enormous. Examples include situations where the statistical power to detect the treatment effect is very large or where $R_{\text{trial}}$ is close to 1, such as the ovarian cancer example in [10]. However, these cases are rare in practice. On the contrary, when T is partially observed in the new trial, we find that a high $R_{\text{indiv}}$ is a very important determinant of the precision gain from S, whereas the impact of $R_{\text{trial}}$ is negligible. With even a small fraction of T and a high $R_{\text{indiv}}$, the information on the treatment effect is mostly recovered and the prediction precision is close to that when T is completely observed. Some data on T appear essential to provide the basis for individual-level predictions of T from S and to take advantage of the distributional assumption between S and T, and hence to give a much more efficient treatment effect estimate.
We compare the BMBRG, GPHC and HD methods when T is completely missing. Each method gives unbiased estimates, but the variances are underestimated, particularly when the number of trials is small. A bootstrap [22], a fully Bayesian approach, or a measurement-error approach [15] could remedy this problem. When T is partially observed, we use two methods: HD and EB-CPV. We find that the underestimation of the variance by the HD method becomes negligible, but the CPV consistently underestimates the variance. We note that we only consider the case of T being missing completely at random, and that all the methods are applicable when the missingness mechanism is missing at random [24].
In conclusion, biomarkers appear to have a useful role as auxiliary variables. Future research should focus on this role and identify scenarios in which biomarkers can increase the precision of the treatment effect estimate. For design purposes, our results suggest that it is often important to collect at least some data on the true endpoint together with richer information on biomarkers that have high adjusted individual-level correlations with the true endpoint. With appropriate utilization of high-quality biomarkers in estimating the treatment effect when the true endpoint is not completely observed, one can reach a desired level of precision earlier, hence shortening the study period and reducing the cost. In our study, we consider continuous S and T. For future research, it would also be interesting to investigate the factors that impact the efficiency gain, and its extent, when S and T are other types of data such as binary, categorical, or time-to-event outcomes.
ACKNOWLEDGEMENTS
The authors would like to thank Dr. Brenda Gillespie for providing us with the CIGTS data. This research was supported by National Institutes of Health Grant CA129102.
8. APPENDIX A: Henderson Method [23]
Let Y = Xβ + Uη + ε, where the vectors Y, η and ε and the matrix X are obtained by stacking the vectors Yi, ηi and εi and the matrices Xi, respectively, underneath each other, and where U is the block-diagonal matrix with blocks Ui on the main diagonal and zeros elsewhere. Let $\mathcal{D}$ and Σ be block-diagonal with blocks D and Σi, respectively, on the main diagonal and zeros elsewhere. We have the following relationships: $\operatorname{var}(\eta) = \mathcal{D}$, $\operatorname{var}(\varepsilon) = \Sigma$, $\operatorname{cov}(\eta, \varepsilon) = 0$, and we let $V = \operatorname{var}(Y) = U\mathcal{D}U^T + \Sigma$. The estimates of $\mathcal{D}$, Σ and V are denoted by $\hat{\mathcal{D}}$, $\hat\Sigma$ and $\hat V$. Henderson [23] proposed to obtain estimates of β and η by solving the mixed model equation:

$$
\begin{pmatrix}
X^T\Sigma^{-1}X & X^T\Sigma^{-1}U\\
U^T\Sigma^{-1}X & U^T\Sigma^{-1}U + \mathcal{D}^{-1}
\end{pmatrix}
\begin{pmatrix} \hat\beta\\ \hat\eta \end{pmatrix}
=
\begin{pmatrix} X^T\Sigma^{-1}Y\\ U^T\Sigma^{-1}Y \end{pmatrix}.
$$
The solution can be written as:

$$
\hat\beta = (X^T V^{-1} X)^{-1} X^T V^{-1} Y, \qquad
\hat\eta = \mathcal{D}\,U^T V^{-1} (Y - X\hat\beta).
$$
The covariance matrix of $(\hat\beta - \beta,\; \hat\eta - \eta)$ is

$$
C = \begin{pmatrix}
X^T\Sigma^{-1}X & X^T\Sigma^{-1}U\\
U^T\Sigma^{-1}X & U^T\Sigma^{-1}U + \mathcal{D}^{-1}
\end{pmatrix}^{-1}.
$$
McLean and Sanders (1988) [33] and McLean, Sanders and Stroup (1991) [34] show that C can also be written as

$$
C = \begin{pmatrix} C_{11} & C_{12}\\ C_{21} & C_{22} \end{pmatrix},
$$

where

$$
\begin{aligned}
C_{11} &= (X^T V^{-1} X)^{-1},\\
C_{12} &= C_{21}^T = -C_{11}\, X^T V^{-1} U \mathcal{D},\\
C_{22} &= (U^T \Sigma^{-1} U + \mathcal{D}^{-1})^{-1} + \mathcal{D} U^T V^{-1} X\, C_{11}\, X^T V^{-1} U \mathcal{D}.
\end{aligned}
$$
In practice, the estimate $\hat C$ is often obtained by substituting $\mathcal{D}$ and Σ in C with their estimates, as we have done in this paper. From the above, we can obtain the mean and variance of $\hat\delta_{Tn}$ as given in equations (9) and (10).
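A compact numerical sketch of solving the mixed model equation follows (Python/NumPy, with a tiny made-up design; `Dcal` plays the role of $\mathcal{D}$):

```python
import numpy as np

def henderson_mme(X, U, Sigma, Dcal, Y):
    """Solve Henderson's mixed model equation for (beta_hat, eta_hat)
    and return the solution together with C, the covariance matrix of
    (beta_hat - beta, eta_hat - eta)."""
    Si = np.linalg.inv(Sigma)
    top = np.hstack([X.T @ Si @ X, X.T @ Si @ U])
    bot = np.hstack([U.T @ Si @ X, U.T @ Si @ U + np.linalg.inv(Dcal)])
    M = np.vstack([top, bot])
    rhs = np.concatenate([X.T @ Si @ Y, U.T @ Si @ Y])
    sol = np.linalg.solve(M, rhs)
    C = np.linalg.inv(M)
    p = X.shape[1]
    return sol[:p], sol[p:], C

# Tiny illustrative example: 6 observations, 2 fixed effects, 2 random effects
rng = np.random.default_rng(1)
X = np.column_stack([np.ones(6), np.repeat([0.0, 1.0], 3)])
U = np.kron(np.eye(2), np.ones((3, 1)))        # group indicators
Sigma = np.eye(6)
Dcal = 0.5 * np.eye(2)
Y = X @ np.array([1.0, 2.0]) + U @ rng.normal(0, 0.7, 2) + rng.normal(0, 1, 6)
beta_hat, eta_hat, C = henderson_mme(X, U, Sigma, Dcal, Y)
```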
9. APPENDIX B: BAYESIAN ESTIMATION
Iterate the following two steps until convergence:

Step 1: Impute each missing Tnj from a normal distribution with mean and variance

$$
E(T_{nj}\mid S_{nj}, \cdot) = \gamma_0 + r_{0n} + (\gamma_1 + r_{1n})Z_{nj}
+ \frac{\sigma_{st}}{\sigma_{ss}}\big(S_{nj} - \alpha_0 - a_{0n} - (\alpha_1 + a_{1n})Z_{nj}\big),
\qquad
\operatorname{Var}(T_{nj}\mid S_{nj}, \cdot) = \sigma_{tt} - \sigma_{st}^2/\sigma_{ss}.
$$
Step 2: Apply Gibbs sampling to the completed data to update the parameters: draw β from its multivariate normal full conditional, each ηi from its multivariate normal full conditional, and $\sigma^{-1}$ and $D^{-1}$ from their Wishart full conditionals.
From the distributions of β and ηi, we can obtain the distribution of δTn.
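Step 1 in code: a sketch of drawing the missing Tnj from the conditional normal above, with all parameter values serving as placeholder current draws from the sampler:

```python
import numpy as np

rng = np.random.default_rng(7)

def impute_T(S, Z, mu0S, dS, mu0T, dT, sigma):
    """Draw missing T values from N(E[T|S], Var[T|S]) under model (1),
    at the current parameter draws (placeholder values here)."""
    slope = sigma[0, 1] / sigma[0, 0]           # sigma_st / sigma_ss
    cond_mean = mu0T + dT * Z + slope * (S - mu0S - dS * Z)
    cond_var = sigma[1, 1] - sigma[0, 1] ** 2 / sigma[0, 0]
    return rng.normal(cond_mean, np.sqrt(cond_var))

S = np.array([2.3, 1.1, 2.8])                   # S for subjects missing T
Z = np.array([1.0, 0.0, 1.0])
sigma = np.array([[1.0, 0.45], [0.45, 0.3]])
T_imputed = impute_T(S, Z, mu0S=1.0, dS=1.0, mu0T=2.0, dT=1.0, sigma=sigma)
```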
10. APPENDIX C: CONDITIONAL POSTERIOR VARIANCE OF δTn
Let α0 + a0i = μ0Si, γ0 + r0i = μ0Ti, α1 + a1i = δSi and γ1 + r1i = δTi. We can rewrite model (1) as

$$
S_{ij} = \mu_{0Si} + \delta_{Si} Z_{ij} + \varepsilon_{Sij}, \qquad
T_{ij} = \mu_{0Ti} + \delta_{Ti} Z_{ij} + \varepsilon_{Tij}.
$$

Assume there are r observations with both S and T observed and mn − r observations with only S observed in the nth trial. The likelihood can be written as:

$$
L \propto \prod_{j=1}^{r} f(S_{nj}, T_{nj} \mid \mu_{0Sn}, \mu_{0Tn}, \delta_{Sn}, \delta_{Tn})
\prod_{j=r+1}^{m_n} f(S_{nj} \mid \mu_{0Sn}, \delta_{Sn})\; f(\eta_n),
$$
which is proportional to the posterior density when we assume flat priors for the fixed effects and multivariate normal distributions for the random effects. The conditional posterior distribution of μ0Tn and δTn given the data and all other parameters is proportional to

$$
\underbrace{\prod_{j=1}^{r} f\big(T_{nj} \mid S_{nj}, \mu_{0Tn}, \delta_{Tn}\big)}_{A}\;
\underbrace{f\big(\mu_{0Tn}, \delta_{Tn} \mid a_{0n}, a_{1n}\big)}_{B},
\tag{13}
$$

where term B is the conditional density of (μ0Tn, δTn) given the random effects of S in the new trial. The covariance contribution for μ0Tn and δTn from term B is $\Psi_d$, the conditional covariance of $(r_{0n}, r_{1n})$ given $(a_{0n}, a_{1n})$ displayed in Section 3.4. From (13), term A is proportional to a bivariate normal density in (μ0Tn, δTn); its covariance contribution is defined as $\Phi_e$, the covariance matrix of the complete-case estimates of (μ0Tn, δTn) based on the r paired observations after adjusting for S. Combining the variance contributions from terms A and B, we obtain the conditional posterior covariance of (μ0Tn, δTn) as $(\Psi_d^{-1} + \Phi_e^{-1})^{-1}$. The corresponding conditional posterior variance of δTn is $(0\;\,1)\,(\Psi_d^{-1} + \Phi_e^{-1})^{-1}(0\;\,1)^T$.
REFERENCES
- 1. Buyse M, Molenberghs G. Criteria for the validation of surrogate endpoints in randomized experiments. Biometrics. 1998;54:1014–1029.
- 2. Baker SG, Kramer BS. Biomarker, Surrogate Endpoints, and Early Detection Imaging Tests: Reducing Confusion. http://www.icsa.org/bulletin/Bulletin-1-2004-Contents/A3-25-controverstial-issues-v4.doc.
- 3. Prentice RL. Surrogate endpoints in clinical trials: definition and operational criteria. Statistics in Medicine. 1989;8:431–440.
- 4. Hsu C, Taylor JMG, Murray S, Commenges D. Survival analysis using auxiliary variables via nonparametric multiple imputation. Statistics in Medicine. 2006;25:3503–3517.
- 5. Freedman LS, Graubard BI, Schatzkin A. Statistical validation of intermediate endpoints for chronic disease. Statistics in Medicine. 1992;11:167–178.
- 6. Liang KY, Zeger SL. Longitudinal data analysis using generalized linear models. Biometrika. 1986;73:13–22.
- 7. Lin DY, Fleming TR, DeGruttola V. Estimating the proportion of treatment effect captured by a surrogate marker. Statistics in Medicine. 1997;16:1515–1527.
- 8. Bycott PW, Taylor JMG. An evaluation of a measure of the proportion of the treatment effect explained by a surrogate marker. Controlled Clinical Trials. 1998;19:555–568.
- 9. Wang Y, Taylor JMG. A measure of the proportion of treatment effect explained by a surrogate marker. Biometrics. 2002;58:803–812.
- 10. Buyse M, Molenberghs G, Burzykowski T, Renard D, Geys H. The validation of surrogate endpoints in meta-analyses of randomized experiments. Biostatistics. 2000;1:49–67.
- 11. De Gruttola V, Fleming T, Lin DY, Coombs R. Perspective: validating surrogate markers - are we being naïve? The Journal of Infectious Diseases. 1997;175:237–246.
- 12. Fleming TR, DeMets DL. Surrogate endpoints in clinical trials: are we being misled? Annals of Internal Medicine. 1996;125:605–613.
- 13. Baker SG, Kramer BS. A perfect correlate does not a surrogate make. BMC Medical Research Methodology. 2003;3:16.
- 14. Cook RJ, Lawless JF. Some comments on efficiency gains from auxiliary information for right-censored data. Journal of Statistical Planning and Inference. 2001;96:191–202.
- 15. Burzykowski T, Molenberghs G, Buyse M. The Evaluation of Surrogate Endpoints. Springer; New York: 2004. Chapter 18.
- 16. Pepe MS, Reilly M, Fleming TR. Auxiliary outcome and the mean score method. Journal of Statistical Planning and Inference. 1994;43:137–160.
- 17. Robins JM, Rotnitzky A. Recovery of information and adjustment for dependent censoring using surrogate markers. In: Jewell N, Dietz K, Farewell V, editors. AIDS Epidemiology: Methodological Issues. Birkhäuser; Boston: 1992. pp. 297–331.
- 18. Malani HM. A modification of the re-distribution to the right algorithm using disease markers. Biometrika. 1995;82:515–526.
- 19. Murray S, Tsiatis AA. Nonparametric survival estimation using prognostic longitudinal covariates. Biometrics. 1996;52:137–151.
- 20. Kosorok MR, Fleming TR. Using surrogate failure time data to increase cost effectiveness in clinical trials. Biometrika. 1993;80:823–833.
- 21. Venkatraman ES, Begg CB. Properties of a nonparametric test for early comparison of treatments in clinical trials in the presence of surrogate endpoints. Biometrics. 1999;55:1171–1176.
- 22. Gail MH, Pfeiffer R, van Houwelingen HC, Carroll RJ. On meta-analytic assessment of surrogate outcomes. Biostatistics. 2000;1:231–246.
- 23. Henderson CR. Best linear unbiased estimation and prediction under a selection model. Biometrics. 1975;31:423–447.
- 24. Little RJA, Rubin DB. Statistical Analysis with Missing Data. 2nd edition. Wiley; New York: 2002.
- 25. Laird NM, Ware JH. Random-effects models for longitudinal data. Biometrics. 1982;38:963–974.
- 26. Robinson GK. That BLUP is a good thing: the estimation of random effects. Statistical Science. 1991;6:15–51.
- 27. Louis TA, Zelterman D. Bayesian approaches to research synthesis. In: Cooper H, Hedges LV, editors. The Handbook of Research Synthesis. Russell Sage Foundation; New York: 1994.
- 28. Ghosh M, Rao JNK. Small area estimation: an appraisal. Statistical Science. 1994;9:55–76.
- 29. Morris CN. Parametric empirical Bayes inference: theory and applications (with discussion). Journal of the American Statistical Association. 1983;78:47–65.
- 30. Musch DC, Lichter PR, Guire KE, Standardi CL, CIGTS Investigators. The Collaborative Initial Glaucoma Treatment Study (CIGTS): study design, methods, and baseline characteristics of enrolled patients. Ophthalmology. 1999;106:653–662.
- 31. Searle SR. Linear Models. Wiley; New York: 1971.
- 32. SAS Institute Inc. Cary, NC, USA; 2003.
- 33. McLean RA, Sanders WL. Approximating degrees of freedom for standard errors in mixed linear models. Proceedings of the Statistical Computing Section, American Statistical Association. New Orleans; 1988. pp. 50–59.
- 34. McLean RA, Sanders WL, Stroup WW. A unified approach to mixed linear models. The American Statistician. 1991;45:54–64.