Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2011 Dec 14.
Published in final edited form as: JP J Biostat. 2011 May 1;5(2):73–96.

A Joint Model for Prognostic Effect of Biomarker Variability on Outcomes: long-term intraocular pressure (IOP) fluctuation on the risk of developing primary open-angle glaucoma (POAG)

Feng Gao 1, J Philip Miller 1, Stefano Miglior 3, Julia A Beiser 2, Valter Torri 4, Michael A Kass 2, Mae O Gordon 1,2
PMCID: PMC3237682  NIHMSID: NIHMS341362  PMID: 22180704

Abstract

Primary open-angle glaucoma (POAG) is among the leading causes of blindness in the United States and worldwide. While numerous prospective clinical trials have convincingly shown that elevated intraocular pressure (IOP) is a leading risk factor for the development of POAG, an increasingly debated issue in recent years is the effect of IOP fluctuation on the risk of developing POAG. In many applications, this question is addressed via a “naïve” two-step approach where some sample-based estimates (e.g., standard deviation) are first obtained as surrogates for the “true” within-subject variability and then included in Cox regression models as covariates. However, estimates from two-step approach are more likely to suffer from the measurement error inherent in sample-based summary statistics. In this paper we propose a joint model to assess the question whether individuals with different levels of IOP variability have different susceptibility to POAG. In our joint model, the trajectory of IOP is described by a linear mixed model that incorporates patient-specific variance, the time to POAG is fit using a semi-parametric or parametric distribution, and the two models are linked via patient-specific random effects. Parameters in the joint model are estimated under Bayesian framework using Markov chain Monte Carlo (MCMC) methods with Gibbs sampling. The method is applied to data from the Ocular Hypertension Treatment Study (OHTS) and the European Glaucoma Prevention Study (EGPS), two large-scale multi-center randomized trials on the prevention of POAG.

Keywords: Joint model, Longitudinal data, Markov chain Monte Carlo (MCMC), Patient-specific variance, Survival data

1. Introduction

In many clinical trials and epidemiologic studies, investigators are interested in whether the variability of a biomarker is independently predictive of clinical outcomes. In the Honolulu Heart Program, for example, Iribarren et al (1995) reported that the subjects whose weight fluctuated most had a significantly higher risk of death from cardiovascular causes, non-cardiovascular and non-cancerous causes, as well as all causes. Grove et al (1997) and Rothwell et al (2010) found that the variability of systolic blood pressure is an independent risk factor for cardiovascular disease. In many of these studies regarding biomarker variability, the analysis is performed via a two-step approach where some sample-based descriptive statistics (e.g., standard deviation) of the longitudinal data are first calculated for each individual and then, as covariates assuming freedom from measurement error, used in the regression models to correlate with clinical outcomes. One limitation of this simplified approach is that it does not use longitudinal information efficiently. To deal with this concern, Hathaway and D’Agostino (1993) proposed an appealing approach that decomposes longitudinal data into 4 summary statistics including one component reflecting biomarker variability. Unfortunately this method requires two rather strong assumptions (namely, no missing observations and no intermediate endpoint events during the collection of longitudinal data) and this precludes its practical use in most longitudinal studies.

Another serious drawback of the two-step approach is that it ignores the measurement error inherent in sample-based summary statistics. Most clinical trials only have a handful of follow-up visits and thus the individual-level summary statistics are very unstable. In a regression analysis, it is well known that the measurement error in covariates can bias parameter estimates downward to null values and thus may mislead one into believing that a good predictive biomarker is poorly related to the outcome (Hughes, 1993; Prentice, 1982). To address this problem, Lyles et al (1999) proposed a model-based approach to estimate the “true” within-subject variability. In a study on subjects with human immunodeficiency virus (HIV) infection, Lyles et al used a full maximum likelihood method to estimate the variability of CD4 cell counts prior to infection and to assess its relationship with the CD4 level at time of infection. Specifically, CD4 measurements prior to infection were described using a linear mixed model that accounts for both random intercept and subject-specific variance,

CD4ij=β0+Ii+eij,withIiN(0,σI2),eijN(0,Vi),andlog(Vi)N(μv,σv2),

where β0 + Ii defines the “true” CD4 level for the ith individual as in the conventional linear mixed model (Laird and Ware, 1982), and Vi represents the “true” within-subject variability which follows a log-normal distribution with mean μv and variance σv2. When the parameter σv2 becomes 0, the above model reduces to a linear mixed model with homogenous variance.

In this paper, we extend the work of Lyles et al (1999) to the censored data setting. Our research is motivated by the Ocular Hypertension Treatment Study (OHTS), a multi-center randomized trial to evaluate the safety and efficacy of topical ocular hypotensive medication to prevent the development of primary open-angle glaucoma (POAG) (Gordon et al, 2002, 2007). POAG is among the leading causes of blindness in the United States and worldwide and, as of today, intraocular pressure (IOP) is the only factor that can be modified by the current treatment. While numerous randomized clinical trials have convincingly shown that elevated IOP is the leading risk factor and that IOP reduction can significantly reduce the incidence of POAG, an increasingly debated topic is the effect of IOP fluctuation, either diurnally (short-term) or visit to visit (long-term), on the risk of developing POAG (Singh and Shrivastava, 2009; Sultan et al, 2009). The aim of this paper is to seek an answer to this question via a joint analysis of follow-up IOP and time to POAG. We also use the data from another large-scale randomized clinical trial, the European Glaucoma Prevention Study (EGPS) (Miglior et al, 2007a, 2007b), as an independent external validation.

The task of using a series of IOP measurements for prediction is definitely more complicated than using simple summary statistics. However, since the parameters describing longitudinal process and time-to-event process are estimated simultaneously in a joint modeling setting, we expect a more accurate estimation of this relationship. Our joint model is factorized as two sub-models, a marginal longitudinal model for follow-up IOP and a conditional (given the longitudinal data) survival model for the risk of developing POAG. Specifically, the IOP trajectory is described by a linear mixed model similar to Lyles et al (1999), the time to POAG is fit using a piecewise exponential or Weibull distribution, and the two sub-models are linked via shared covariates and patient-specific random effects. The parameters in the proposed joint model will be estimated under Bayesian framework using Markov chain Monte Carlo (MCMC) methods with Gibbs sampling. This paper is organized as follows. In Section 2, we outline the data from OHTS and EGPS, two randomized clinical trials that facilitate our research. Section 3 describes model construction, parameter estimation and model selection for the proposed method. In Section 4 we analyze the OHTS and EGPS data and in Section 5 we conclude with a discussion.

2. Two real-world studies: OHTS and EGPS

2.1 Ocular Hypertension Treatment Study (OHTS)

OHTS was a multi-center randomized trial to evaluate the safety and efficacy of topical ocular hypotensive medication in preventing the onset of POAG. From 1994 to 1996, 1636 participants with ocular hypertension and no evidence of glaucomatous damage were randomized to either observation or treatment with ocular hypotensive medication. The primary outcome was the time to the development of POAG. Follow-up visits which included assessment of IOP and POAG status were scheduled every 6 months and the median follow-up time was 78 months. At the end of study, 104 (12.7%) out of 819 participants in the observational group developed POAG while the incidence rate was 5.4% (44 of 817) among those in the treatment group. Baseline factors that were predictive of POAG included older age, higher IOP, larger cup to disc ratio, greater pattern standard deviation, and thinner central corneal measurements (Gordon et al, 2002).

2.2 European Glaucoma Prevention Study (EGPS)

The EGPS was a double-masked, placebo controlled clinical trial conducted in 4 European countries with 1077 participants randomized to either an active treatment (dorzolamide) or placebo. IOP and POAG status were assessed every 6 months and the median follow-up in EGPS was 55 months. At the completion of the study, the cumulative probability of developing POAG was 13.4% in the dorzolamide group and 14.1% in the placebo group. Though the rate of progression to POAG was not statistically different between dorzolamide and placebo, EGPS identified similar baseline risk factors as in OHTS (Miglior et al, 2007a). EGPS and OHTS also shared some key similarities in the study protocols and a meta-analysis was performed using the pooled data from the OHTS observation and the EGPS placebo (Gordon et al, 2007). A new predictive model, Glaucoma 5-year Risk Calculator, was developed and well accepted by the ophthalmology community.

The purpose of this paper is to assess the effect of long-term IOP fluctuation on the time to POAG. Because treatment to lower IOP is usually initiated or intensified following POAG diagnosis and thus increases IOP variability, our study only includes IOP values prior to POAG onset. The primary endpoint is time from randomization to the onset of POAG, and those subjects without POAG are censored at the date of study closeout. Besides the post-randomization data, 6 baseline demographic and clinical characteristics are also included in this paper: the treatment indicator (TRT, 1 for active treatment and 0 otherwise), age (AGE, decade), central corneal thickness (CCT, μm), baseline IOP (IOP0, mmHg), pattern standard deviation (PSD, dB), and vertical cup/disc ratio (VCD). All the baseline factors except CCT were recorded at randomization. The CCT measurements (which turned out to be one of the most important predictors for POAG) were not included in the original protocols of both studies. However, because CCT is a life-long stable measurement, in all the analyses CCT was treated as a baseline factor though it was collected after randomization (Gordon et al, 2002, 2007; Miglior et al, 2007a, 2007b). Values for the eye-specific variables (CCT, IOP, PSD and VCD) for each participant are the average of two eyes (with the exception of the EGPS participants with only one eye eligible for the study).

In this paper, we exclude 34 participants from EGPS with Pigment Dispersion and Exfoliation Syndromes (an exclusion criterion in OHTS). We also exclude those participants without any follow-up data (18 in OHTS and 47 in EGPS), those without CCT measurements (185 in OHTS and 171 in EGPS), those with 2 or less follow-up visits (9 in OHTS and 13 in EGPS), and those with missing values in any other baseline factors (6 in EGPS). Therefore, these participants with complete baseline data and at least 3 follow-up visits constitute our study cohort (1424 from OHTS and 806 from EGPS). Table 1 presents the summary statistics of baseline covariates and follow-up IOP for our study cohort. Note that,

Table 1.

Summary statistics of baseline predictors and follow-up IOP for OHTS and EGPS, where categorical data are summarized as counts and frequencies (%), and continuous data are summarized in means and standard deviations (SD).

Variables OHTS (N=1424) EGPS (N=806)
Baseline predictors
 TRT (N, %)
  Medication 713 (50.1%) 408 (50.6%)
  Observation/Placebo 711 (49.9%) 398 (49.4%)
 AGE (decades) 5.55 (0.95) 5.72 (0.98)
 CCT (μm) 573.3 (38.5) 573.4 (37.4)
 IOP0 (mmHg) 24.9 (2.71) 23.4 (1.60)
 PSD (dB) 1.91 (0.21) 2.00 (0.52)
 VCD 0.39 (0.19) 0.32 (0.14)
Follow-up IOP
 Mean (mmHg) 21.36 (3.44) 19.53 (2.46)
 SD (mmHg) 2.27 (0.98) 2.22 (0.92)
 Variance of residuals* 4.73 (4.50) 3.73 (3.53)
Follow-up visits (N, %)
  3 11 ( 0.8%) 19 ( 2.4%)
  4 16 ( 1.1%) 14 ( 1.7%)
  ≥5 1397 (98.1%) 773 (95.9%)
POAG (N, %) 134 (9.4%) 88 (10.9%)
*

From simple linear regression models fit to each individual

  • Since these continuous baseline covariates are measured in quite different scales, they are standardized to have mean 0 and variance 1 throughout the remainder of this paper. As such, their coefficients in the regression models represent the effect per 1-SD change.

  • Informative dropout may be an issue in both studies because participants with higher risk of POAG tend to drop out earlier due to diagnosis of POAG. For example, a great majority (26 out of 27 in OHTS, and 27 out of 33 in EGPS) of subjects with 3 or 4 visits have developed POAG. Although these with 3 or 4 visits are only a small fraction of the overall data (1.9% for OHTS and 4.1% for EGPS), excluding them could result in serious biased estimates because they account for 19.4% and 30.6% of the individuals developed POAG in OHTS and EGPS respectively.

3. Joint Models of follow-up IOP and time to POAG

3.1 Notation

Suppose there are N subjects and each subject has ni pre-POAG follow-up visits. For longitudinal data, let Yij denote the post-randomization IOP measured at time tij for ith individual, with i =1, 2, …N, and j=1, 2, …ni. Because high IOP is an eligibility criterion in both studies, the intercept (initial level) in mixed model is more likely suffering from “regression to the mean”. To alleviate such an influence, the time 0 is chosen as 1 year after randomization in this paper. Because post-randomization IOPs are measured approximately every 6 months in both studies, a common measuring time is used for all individuals, with{tij}= {−0.5, 0, 0.5,1, …}. For survival data, let Ti=minimum(Di, Ci) be the observed time for ith subject, where Di is the time to POAG and Ci represents the censoring time which is assumed independent of Di. Let Δi be the corresponding event indicator, with Δi=1 if POAG is observed and Δi=0 otherwise. Finally, we denote the baseline covariates predictive of longitudinal and survival processes as Xi and Zi respectively, which may or may not be the same.

3.2 Joint models

Our goal is to jointly model follow-up IOP and time to POAG, with a special attention to the effect of IOP fluctuation on the risk of developing POAG. In the past decade, many authors have studied the joint analysis of longitudinal and survival data (Pauler and Finkelstein, 2002; Tsiatis and Davidian, 2004; Vonesh et al, 2006; Wulfsohn and Tsiatis, 1997). In most joint models, longitudinal data are delineated by a conventional linear mixed model assuming homogeneous within-subject variance. However, such a homogeneity assumption automatically precludes the assessment of the question whether individuals with different levels of IOP variability have different susceptibility to POAG.

In the proposed model, the IOP trajectory is described by a linear mixed model that incorporates patient-specific variance (Lyles et al, 1999),

Yij=f1(β,Xi,t)+Ii+Sitij+eij=f1(β,Xi,t)+W1i+eij,withIiN(0,σI2),SiN(0,σS2),eijN(0,Vi),andlog(Vi)N(μv,σv2), (1)

where β is a vector of parameters for fixed effects, Ii and Si are the patient-specific intercept and slope as in the conventional linear mixed model (Laird and Ware, 1982), and Vi represents the “true” within-subject variability which follows a log-normal distribution with mean μv and variance σv2. The term f1(β, Xi, t) is a known function describing the mean response, W1i = Ii+ Sitij can be viewed as the “true” individual-level trajectory after adjusting the overall mean response, and the patient-specific variance Vi reflects individual’s fluctuation around the predicted (possibly non-linear) trend over time.

Turning to the survival sub-model, the risk of developing POAG for individual i at time t is specified as a proportional hazards form,

λi(t)=λ0(t)exp(f2(α,Zi)+W2i(γ,Ii,Si,log(Vi),t)), (2)

where λ0(t) is the baseline hazard when all covariates are 0, f2(α, Zi) is a known function representing the effects of baseline covariates, and the term W2i(γ, Ii, Si, log(Vi), t) incorporates patient-specific random effects from model (1). To obtain a fairly flexible λ0(t), we break the survival time annually into K subintervals and assume λ0(t) being a step-function with height λk at each interval (tk, tk+1), for k=1, 2, …, K. Hence, model (2) yields a semi-parametric piecewise exponential distribution for the time to POAG. Regarding the function W2i(γ, Ii, Si, log(Vi), t), 7 different forms are considered in this paper, namely,

  • W2i = 0 (association induced only via shared baseline covariates);

  • W2i =γ1Ii;

  • W2i =γ1Ii +γ2Si;

  • W2i=γ3Yik(t);

  • W2i =γ1Ii +γ4 log(Vi);

  • W2i=γ3Yik(t)+γ4log(Vi);

  • W2i =γ1Ii +γ2Si +γ4 log(Vi);

where Yik(t)=Ii+Si(tk1) is a time-dependent variable and represents individuals’ deviation from the systematic trend at each subinterval (tk, tk+1), for k=1, 2, …, K.

The piecewise exponential model permits an easy incorporation of time-dependent effects. In the event that only time-independent variables are involved in the best model, however, a full parametric model of Weibull (λ0(t) = ρt (ρ−1)) or Exponential (λ0(t) = 1) distribution is also explored, where ρ is the Weibull shape parameter with ρ>1 for a monotone increasing hazard and ρ<1 for a monotone decreasing hazard. We seek this change because, if it works, a parametric λ0(t) will be more convenient for computation and more efficient for prediction. It is also scientifically reasonable for λ0(t) to take a smooth form. In addition, our previous analysis (unpublished study on parametric survival) has shown that, without time-dependent covariates, a parametric Weibull model can adequately fit the OHTS data.

Finally, to facilitate an easy prediction of future cases (whose treatment information may be absent), we assume that the effect of treatment is only signified via post-randomization IOP. That is, the hypotensive medication modifies IOP evolution which in turn influences the time to POAG. Therefore, the latent quantities Ii and Si within W2i(γ, Ii, Si, log(Vi), t) have to be adjusted according to the functional form of f1(β, Xi, t) in model (1). In the presence of a significant interaction (β2, say) between randomization group (TRT) and time, for example, a treatment-modified slope Si rather than Si will be actually used in model (2), with Siβ2TRTi+Si.

3.3 Parameter estimation and model selection

The joint distribution of follow-up IOP, Y, and time to POAG, T, can be specified in the form of

f(Y,Tθ1,θ2)=f(Yθ1,ηi)f(TY,ηi,θ2)f(ηi)dηi,

with the corresponding likelihood function being

L(Y,Tθ1,θ2)=i=1Nf(Yθ1,ηi)f(TY,ηi,θ2)Δi(1F(TY,ηi,θ2))1Δif(ηi)dηi,

where ηi = {Ii, Si, Vi} represents the shared underlying stochastic process, θ1={β,σI2,σS2,μv,σv2} and θ2={α,γ, λk} are the population parameters as given in models (1) and (2), and f ( ) and F ( ) denote the probability density function and cumulative distribution function respectively. The parameters in the model are estimated using Markov chain Monte Carlo (MCMC) method with Gibbs sampling (Gelman, 2003). Specifically, a random sample for each unknown parameter is generated from the full conditional posterior distribution given all other parameters, and the process will be iterated till its convergence to the target distribution.

To analyze data from a Bayesian perspective, the prior distributions for all unknown parameters must be specified. We use a proper prior distribution for each parameter, but the hyper-parameters in a prior are chosen so that the prior will have high variability (non-informative prior). In this paper, the choice of priors is aided by a preliminary data analysis in OHTS (Gao et al, 2011). Following prior distributions are specified and all the unknown parameters are assumed to be mutually independent,

α,β,γ,μvN(0,1000),λkG(0.1,1),ρG(1,1),andσI2,σS2,σv2IG(0.01,0.01),

where N (a, b) denotes a Normal distribution with mean a and variance b, G(a, b) represents a Gamma distribution with mean a/b and variance a/b2, and x ~ IG(a, b) means that x follows an Inverse Gamma distribution or 1/x has a Gamma distribution with meann a/b and variance a/b2.

The best model is selected using Deviance Information Criterion (DIC) which is a Bayesian generalization of Akaike’s Information Criterion (Spiegelhalter et al, 2002). DIC consists of two components, DIC= + ρD. The first term is the posterior mean of deviance and measures the goodness-of-fit. The second term ρD measures model complexity and is defined as the difference between the posterior mean of deviance ( ) and the point estimate of deviance obtained at the posterior mean of parameters{θ̄1,θ̄2}. In normal hierarchical model, ρD is approximately the trace of the “hat” matrix that maps the observed data to their fitted values and thus can be viewed as the expected effective number of parameters in the model. Since a smaller indicates a better fit and a smaller ρD indicates a parsimonious model, the model with smallest DIC is preferred and the best model is selected following the guidelines by Gelman et al (2003).

The parameter estimation is implemented using the noncommercial statistical software WinBUGS (http://www.mrc-bsu.cam.ac.uk/bugs/). We use three parallel MCMC sampling chains with different starting values. The convergence of each chain is monitored by the trace plots and the diagnostic statistics of Gelman et al (2003). The posterior mean of the parameters and the 95% credible intervals are based on 15,000 iterations following a 15,000-iteration of burn-in period.

4.Results

4.1 Separate analysis of longitudinal data

In order to fully specify the mean response f (β, Xi, t), first we analyze the longitudinal data using model (1) alone. Taking advantage of the fact that the conventional linear mixed model and model (1) produce almost identical estimates for fixed effects (Manatunga et al, 2005), we initially analyze IOP using SAS PROC Mixed to avoid intensive computation of MCMC methods. The results show that all the second-order interactions between treatment and the other 5 covariates plus time are not statistically significant (with the likelihood ratio tests of X2=10.2, df=6, p=0.12, and X2=5.7, df=6, p=0.46 for OHTS and EGPS, respectively). The results also reveal that adding quadratic time can significantly improve the model fit in the OHTS (X2=49, df=1, p<0.001) but not in the EGPS (X2=0.5, df=1, p=0.48). Therefore, the mean response f (β, Xi, t) is chosen as f1=β0+β1TRTi+β2tij+β3tij2+β4AGEi+β5CCTi+β6IOP0i+β7PSDi+β8VCDi

and f10+β1TRTi+β2tij+β3AGEi+β4CCTi+β5IOP0i+β6PSDi+β7VCDi for OHTS and EGPS respectively.

Table 2 presents the posterior means and 95% credible intervals for the population parameters in model (1). In OHTS, the initial (more precisely, 1-year after randomization) IOP level in the treatment group is significantly lower than that in the observation group (19.7 versus 24.3 mmHg). The estimates of β2 and β3 show that, on average, the follow-up IOP decreases over time but the trend is not linear. It displays a fast decrease initially and then slows down gradually as time goes by. Among the 5 baseline covariates, only CCT and IOP0 are significantly associated with follow-up IOP. It is noteworthy that the estimated σv2 (0.53 with a 95% interval of 0.48–0.59) supports the assumption of heterogeneous variance for follow-up IOP. For the EGPS data, the difference of initial IOP levels between active treatment and placebo is much smaller (19.7 versus 20.8 mmHg) and the IOP decreases over time linearly about 0.5 mmHg per year. The result also reveals an ample dissimilarity between OHTS and EGPS with respect to the follow-up IOP. For example, baseline CCT is a significant predictor in OHTS but not in EGPS.

Table 2.

Estimated regression coefficients (β) and 95% credible intervals (CI) for IOP trajectory in OHTS and EGPS, where only the longitudinal data are analyzed.

Parameters OHTS
EGPS
β 95% CI β 95% CI
Fixed Effects
 Intercept 24.297 (24.130, 24.460) 20.794 (20.600, 20.990)
 TRT −4.629 (−4.840, −4.401) −1.137 (−1.407, −0.866)
 Time −0.410 (−0.459, −0.362) −0.477 (−0.533, −0.419)
 Time*Time 0.035 (0.027,0.043) -- (--, --)
 AGE 0.010 (−0.105, 0.124) 0.029 (−0.114, 0.175)
 CCT 0.172 (0.060, 0.286) 0.038 (−0.098, 0.171)
 IOP0 1.151 (1.043, 1.263) 1.185 (1.043, 1.327)
 PSD 0.053 (−0.061, 0.175) −0.180 (−0.328, −0.035)
 VCD −0.076 (−0.189, 0.040) 0.058 (−0.084, 0.200)
Random Effects
σI2 3.739 (3.396, 4.091) 3.183 (2.801, 3.595)
σS2 0.111 (0.096, 0.127) 0.409 (0.345, 0.480)
μv 1.421 (1.377, 1.466) 1.275 (1.213, 1.335)
σv2 0.531 (0.478, 0.585) 0.452 (0.374, 0.541)

The posterior distributions for patient-specific random effects are also obtained from model (1) and the posterior means are calculated as a point estimate for each individual. Although it is almost certain that these point estimates tend to have a smaller variability than they should have (over-shrinkage), they provide tangible quantities to explore the relationship between longitudinal and survival processes and thus facilitate investigations to gain insight with respect to joint analysis.

4.2 Separate analysis of survival data

Next we analyze survival data using model (2) alone, with λ0(t) totally unspecified (Cox model). We take the widely used two-step approach to determine whether IOP fluctuation is independently predictive of POAG. Specifically, simple summary statistics (mean and variance) of the post-randomization IOP are calculated for each individual and used in the Cox model as if they were observed at baseline. Since the conventional sample-based variance is more likely influenced by the systematic trend over time, we use the variance of residuals, from the linear regression model fit to each individual, as a surrogate for the “true” within-subject variability. The covariate effect f2(α, Zi) is chosen after the well- established meta-analysis of OHTS and EGPS (Gordon et al, 2007), with the same functional form f2(α, Zi)=α1TRTi+α2 AGEi+α3CCTi+α4IOP0i+α5PSDi+α6VCDi for both studies.

Table 3 shows the estimated hazard ratios (HRs) and their 95% confidence intervals from the Cox models. In OHTS, results show that the mean post-randomization IOP is a significant risk factor to POAG, with high IOP level indicating a poor prognosis. After incorporating follow-up IOP, as expected, the effect of treatment is masked and no longer independently predictive of POAG. Possibly due to the same reason, the effect of baseline IOP is not significant either. Similar results are obtained in the analysis of EGPS data. A rather surprising observation is that the HRs of IOP variability in both OHTS (HR=0.79) and EGPS (HR=0.74) are significantly smaller than 1, indicating that subjects with higher IOP fluctuation are less likely to develop POAG. Although this finding is contradictory to the usual clinical expectation, it is consistent with the trend of a previous analysis in EGPS (HR=0.87, 95% CI 0.70–1.09), which took a two-step approach and used standard deviation as a time-dependent surrogate for the “true” within-subject variability (Miglior et al, 2007b).

Table 3.

Estimated hazard ratios (HR) and 95% confidence intervals (CI) for time to POAG in OHTS and EGPS, where only the survival data are analyzed and the effect of IOP variability is assessed using a naïve two-step approach.

Parameters OHTS
EGPS
HR 95% CI HR 95% CI
Effects of Baseline Covariates
 TRT 1.458 (0.843, 2.521) 1.178 (0.754, 1.840)
 AGE 1.245 (1.037, 1.494) 1.441 (1.142, 1.820)
 CCT 0.542 (0.451, 0.651) 0.747 (0.609, 0.917)
 IOP0 0.893 (0.742, 1.075) 1.071 (0.854, 1.342)
 PSD 1.157 (0.967, 1.384) 1.199 (0.984, 1.460)
 VCD 1.725 (1.417, 2.101) 1.531 (1.234, 1.898)
Effects of Follow-up IOP
  Mean 1.316 (1.228, 1.410) 1.183 (1.078, 1.298)
  log(Vi)* 0.794 (0.663, 0.951) 0.738 (0.607, 0.897)
*

logarithm transformed variance of residuals from simple linear models fit to each individual, one subject (ID=505) excluded from EGPS data due to zero variance.

Figure 1 graphically explores the relationship between IOP variability and time to POAG in OHTS and EGPS. For each study, subjects are first ranked into 4 equal-size groups by the variance of residuals from the ordinary linear model fit to each individual, and then a Kaplan-Meier curve of time to POAG is drawn for each group (Upper panel of Figure 1). The survival curves in the group with least variability (Q1) show a sharp decrease shortly after randomization, whereas the curves in other 3 groups are in an acceptable (though not perfect) order (i.e., the higher the IOP variability, the greater the risk of developing POAG). This phenomenon is mainly caused by the informative dropout. As mentioned earlier, our data only include pre-POAG IOP values. Consequently, these subjects with the highest risk are more likely to fall out of study earlier and thus deprived the chance to show fluctuation. Actually this is exactly one of the major reasons to call for joint analysis, known for its ability to handle informative dropout (Henderson et al, 2000). Similar Kaplan-Meier curves are also prepared using the model-based variance (the posterior mean of patient-specific variance from the mixed models in Section 4.1). Interestingly, the problem of informative dropout is alleviated dramatically even before we attack this issue via joint analysis (Lower panel Figure 1).

Figure 1.

Figure 1

The Kaplan-Meier curves of time to POAG stratified by the estimated variance, where “Sample-based variance” means the variance of residuals from the ordinary linear model fit to each individual, “Model-based variance” refers to the posterior means of the patient-specific variances from linear mixed models, and Q1–Q4 represent the 4 groups based on quartile of variance within each study.

4.3 Joint analysis of post-randomization IOP and time to POAG

Table 4 reports the total DIC from the 9 candidate joint models for each study. Among the first 7 models with a piecewise exponential sub-model, Model 3 and Model 7 have similar performance and provide the best fit to the OHTS data. Because γ4 (which is absent in Model 3) is significantly different from 0 and the two models produce almost identical estimates in all other parameters, Model 7 is chosen as the best fit to the OHTS data. For the EGPS data, all the first 7 models have similar performance and none of the 4 parameters (γ1, γ2, γ3, γ4) for follow-up IOP is significantly different from 0. For comparison with OHTS, however, we choose Model 7 as the best fit to EGPS. Since only time-independent covariates get involved in the best models, two additional joint models are further constructed, one with λ0(t) being a Weibull distribution (Model 8) and the other using an Exponential distribution (Model 9). Interestingly, though Model 8 is chosen as the final model based on DIC, Models 7–9 produce similar results for these parameters of special interest (α, β, γ).

Table 4.

Deviance Information Criterion (DIC) of the candidate joint models for the OHTS and EGPS data.

Model ID λ0(t) W1i W2ia Total DIC
OHTS EGPS

1 λk Ii, Si, Vi 0 80967.9 32194.2
2 λk Ii, Si, Vi γ1I*i 80929.7 32197.3
3 λk Ii, Si, Vi γ1I*i + γ2Si 80890.3 32193.0
4 λk Ii, Si, Vi γ3Y*ik 80909.4 32196.2
5 λk Ii, Si, Vi γ1I*i + γ4log(Vi) 80927.0 32193.3
6 λk Ii, Si, Vi γ3Y*ik + γ4log(Vi) 80907.8 32194.7
7 λk Ii, Si, Vi γ1I*i + γ2Si + γ4log(Vi) 80888.2 32194.4
8 ρt (ρ−1) Ii, Si, Vi γ1I*i + γ2Si + γ4log(Vi) 80494.1 32015.7
9 1 Ii, Si, Vi γ1I*i + γ2Si + γ4log(Vi) 80559.9 32050.7
a

: Ii=β1TRTi+Ii, treatment-modified random intercept

a

: Yik=I1+Si(tk1), deviation from the systematic trend at interval (tk, tk+1)

Table 5 presents the posterior means and 95% credible intervals for the population parameters in Model 8. For the longitudinal component, the parameter estimates from joint models are almost identical as these shown in Table 2. For the survival component, the joint model and two-step analysis reach consistent conclusion for most baseline covariates. Regarding the impact of IOP variability on POAG, however, the joint models result in a conclusion quite different from that in Table 3. In OHTS, the joint model shows that IOP fluctuation (γ4 =0.379, 95% CI 0.100 – 0.688), as well as the initial level (γ1 =0.184, 95% CI 0.121 – 0.246) and the trend over time (γ2 =1.576, 95% CI 0.796 – 2.388), are all independently predictive of POAG. In EGPS, the effect of IOP variability on POAG is not significantly different from 0 (with γ4 =0.396, 95% CI −0.095 – 0.870).

Table 5.

Posterior means and 95% credible intervals (CI) of parameters from the final joint model of longitudinal and survival data for the OHTS and EGPS data.

Parameters OHTS
EGPS
Estimates 95% CI Estimates 95% CI
Longitudinal Sub-model
  Intercept (β0) 24.301 (24.130, 24.460) 20.797 (20.590, 21.020)
  TRT (β1) −4.631 (−4.854, −4.415) −1.148 (−1.433, −0.863)
  Time (β2) −0.405 (−0.452, −0.358) −0.473 (−0.532, −0.418)
  Time*Time (β3) 0.036 (0.028, 0.044) -- (--, --)
  AGE (β4) 0.012 (−0.108, 0.129) 0.029 (−0.122, 0.185)
  CCT (β5) 0.166 (0.054, 0.278) 0.036 (−0.098, 0.179)
  IOP0(β6) 1.160 (1.051, 1.270) 1.188 (1.036, 1.335)
  PSD (β7) 0.048 (−0.068, 0.167) −0.181 (−0.323, −0.035)
  VCD (β8) −0.065 (−0.176, 0.047) 0.065 (−0.073, 0.207)
   σI2 3.741 (3.410, 4.096) 3.182 (2.805, 3.588)
   σS2 0.113 (0.097, 0.131) 0.406 (0.3474, 0.472)
  μv 1.424 (1.379, 1.469) 1.281 (1.219, 1.343)
   σv2 0.533 (0.478, 0.590) 0.449 (0.376, 0.533)
Survival Sub-model
 Shape (ρ) 2.041 (1.791, 2.317) 2.101 (1.664, 2.543)
 Intercept(α0) −7.233 (−7.966, −6.434) −6.303 (−7.398, −5.293)
  AGE (α1) 0.231 (0.043, 0.427) 0.331 (0.100, 0.566)
  CCT (α2) −0.620 (−0.817, −0.421) −0.321 (−0.549, −0.102)
  IOP0(α3) 0.198 (0.025, 0.370) 0.145 (−0.063, 0.349)
  PSD (α4) 0.207 (0.010, 0.396) 0.145 (−0.060, 0.340)
  VCD (α5) 0.564 (0.351, 0.772) 0.413 (0.184, 0.634)
  F/U IOP
   Ii(γ1) 0.184 (0.121, 0.246) 0.067 (−0.055, 0.186)
  Si2) 1.576 (0.796, 2.388) 0.089 (−0.453, 0.624)
  log(Vi) (γ4) 0.379 (0.100, 0.688) 0.396 (−0.095, 0.870)

Ii=β1TRTi+Ii: treatment-modified random intercept

5. Discussion

In this paper, we propose a joint analysis approach to assess whether variability of a biomarker is independently predictive of clinical outcomes. Using data from two long-term clinical trials of the efficacy of IOP lowering medication in the prevention of glaucoma, we determine if long-term IOP fluctuation is independently predictive of POAG. The trajectory of IOP is described by a linear mixed model incorporating patient-specific variance, and its association with the time to POAG is assessed using both semi-parametric and full parametric survival models. Interestingly, the results show that the Weibul model produces similar estimates of the relative risk as those in the piecewise exponential model. This will reduce the computational burden and facilitate an easy prediction.

The joint analysis offers attractive features for modeling the effect of biomarker variability. First, compared to the frailty framework for patient heterogeneity (Henderson et al, 2000), such a regression approach is relatively straightforward to interpret and the effect of biomarker variability on outcome can be readily quantified as hazard ratio which is familiar to clinical investigators. Second, since the parameters describing longitudinal data and time-to-event data are estimated simultaneously, informative dropout and measurement error can be accounted and the relationship between these two stochastic processes can be more accurately estimated. Thirdly, the availability of the full posterior estimates of the random effects provides a useful tool to gain insight into the model construction and permits the possibility of a patient-specific prediction (Pauler and Finkelstein, 2002). Finally, the Deviance Information Criterion (DIC) facilitates an easier comparison among various complex but realistic models that do not need to be nested.

The naïve two-step approach, though convenient to use and widely applied, should be used with caution. Possibly because of measurement error and/or informative dropout, the estimates based on two-step analysis are very sensitive to the data selection (personal communications with Dr. David Musch on his unpublished data on IOP variability). In the OHTS and EGPS data, for example, the current two-step analysis shows that subjects with a higher IOP variability are significantly less likely to develop POAG, but another analysis taking the two-step approach (result not presented in the paper) produces non-significant effects in both studies after we exclude those subjects with 3 visits (who only account for less than 2% of the total study cohort). In contrast, the joint models produce consistent results across different datasets.

Substantively, our results show that IOP variability is independently predictive of POAG in OHTS, and the subjects with high IOP fluctuation have an increased risk of developing POAG. Though this conclusion is not justified by the data from EGPS, the result in EGPS shows a trend consistent with that in OHTS. Note that, in both studies, the estimated within-subject variability actually consists of two components, the biologic fluctuation and the pure measurement error. In the current analysis we have not sought to make such a distinction, but the non-specificity of fluctuation may attenuate the potential association between biomarker variability and clinical outcome.

A limitation of the joint model is its intensive computation. The analysis of OHTS data (N=1424) with a parametric Weibull baseline hazards, for example, required approximately 5 hours per model when using 3 parallel chains with 30,000 iterations each. This makes it difficult to use the joint model as a real-time, web-based tool to predict prognosis. One possible solution is the joint latent class model (Proust-Lima and Taylor, 2009), where the latent classes can be determined by the patterns of IOP trajectory. In future work, we will investigate ways to incorporate the post-treatment IOP and to enhance the predictive accuracy of the existing web-based OHTS-EGPS Glaucoma Risk Calculator.

Acknowledgments

This study is supported by grants from the National Eye Institute National Institute of Health, Bethesda, MD (EY091369 and EY09341).

References

  • 1.Gao F, Miller JP, Xiong C, Huecker J, Gordon MO. A joint-modeling approach to assess the impact of biomarker variability on the risk of developing clinical outcome. Statistical Methods and Applications. 2011;20(1):83–100. doi: 10.1007/s10260-010-0150-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gelman A, Carlin JB, Stern HS, Rubin DB. Bayesian Data Analysis. 2. CRC Press; 2003. [Google Scholar]
  • 3.Gordon MO, Beiser JA, Brandt JD, et al. The Ocular Hypertension Treatment Study: baseline factors that predict the onset of primary open-angle glaucoma. Arch Ophthalmol. 2002;120:714–20. doi: 10.1001/archopht.120.6.714. [DOI] [PubMed] [Google Scholar]
  • 4.Gordon MO, Torri V, Miglior S, Beiser JA, Floriani I, Miller JP, Gao F, Adamsons I, Poli D, D’Agostino RB, Kass MA The Ocular Hypertension Treatment Study Group and the European Glaucoma Prevention Study Group. A Validated Prediction Model for the Development of Primary Open Angle Glaucoma in Individuals with Ocular Hypertension. Ophthalmology. 2007;114(1):10–19. doi: 10.1016/j.ophtha.2006.08.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Grove JS, Reed DM, Yano K, Hwang LJ. Variability in systolic blood pressure -A risk factor for coronary heart disease? American Journal of Epidemiology. 1997;145:771–776. doi: 10.1093/oxfordjournals.aje.a009169. [DOI] [PubMed] [Google Scholar]
  • 6.Hathaway DK, D’Agostino RB. A technique for summarizing longitudinal data. Statistics in Medicine. 1993;12:2169–2178. doi: 10.1002/sim.4780122303. [DOI] [PubMed] [Google Scholar]
  • 7.Henderson R, Diggle P, Dobson A. Joint modeling of longitudinal measurements and event time data. Biostatistics. 2000;4:465–480. doi: 10.1093/biostatistics/1.4.465. [DOI] [PubMed] [Google Scholar]
  • 8.Hughes MD. Regressin dilution in the proportional hazards model. Biometrics. 1993;49:1056–1066. [PubMed] [Google Scholar]
  • 9.Iribarren C, Sharp DS, Burchfiel CM, Petrovitch H. Association of weight loss and weight fluctuation with mortality among Japanese American men. New England Journal of Medicine. 1995;333:686–692. doi: 10.1056/NEJM199509143331102. [DOI] [PubMed] [Google Scholar]
  • 10.Laird NM, Ware JH. Random-effects models for longitudinal data. Biometrics. 1982;38:963–974. [PubMed] [Google Scholar]
  • 11.Lyles RH, Munõz A, Xu J, Taylor JMG, Chmiel JS. Adjusting for measurement error to assess health effects of variability in biomarkers. Statistics in Medicine. 1999;18:1069–1086. doi: 10.1002/(sici)1097-0258(19990515)18:9<1069::aid-sim97>3.0.co;2-7. [DOI] [PubMed] [Google Scholar]
  • 12.Manatunga A, Schmotzer B, Lyles RH, Small C, Guo Y, Marcus M. Statistical issues related to modeling menstrual length. Proceedings of the American Statistical Association, Section on Statistics in Epidemiology; 2005. [Google Scholar]
  • 13.Miglior S, Pfeiffer N, Torri V, Zeyen T, Cunha-Vaz J, Adamsons I. Predictive factors for open-angle glaucoma among patients with ocular hypertension in the European Glaucoma Prevention Study. Ophthalmology. 2007a;114 (1):3–9. doi: 10.1016/j.ophtha.2006.05.075. [DOI] [PubMed] [Google Scholar]
  • 14.Miglior S, Torri V, Zeyen T, Pfeiffer N, Cunha-Vaz J, Adamsons I. Intercurrent factors associated with the development of open-angle glaucoma in the European Glaucoma Prevention Study. American Journal of Ophthalmology. 2007b;144:266–275. doi: 10.1016/j.ajo.2007.04.040. [DOI] [PubMed] [Google Scholar]
  • 15.Pauler DK, Finkelstein DM. Predicting time to prostate cancer recurrence based on joint models for non-linear longitudinal biomarkers and event time outcomes. Statistics in Medicine. 2002;21:3897–3911. doi: 10.1002/sim.1392. [DOI] [PubMed] [Google Scholar]
  • 16.Prentice RL. Covariate measurement errors and parameter estimation in failure time regression model. Biometrika. 1982;69:331–342. [Google Scholar]
  • 17.Proust-Lima C, Taylor JMG. Development and validation of a dynamic prognostic tool for prostate cancer recurrence using repeated measures of posttreatment PSA: a joint modeling approach. Biostatistics. 2009;10:535–549. doi: 10.1093/biostatistics/kxp009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Rothwell PM, Howard SC, Dolan E, O’Brien E, Dobson JE, Dahlof B, Sever PS, Poulter N. Prognostic significance of visit-to-visit variability, maximum systolic blood pressure, and episodic hypertension. Lancet. 2010;375:895–905. doi: 10.1016/S0140-6736(10)60308-X. [DOI] [PubMed] [Google Scholar]
  • 19.Singh K, Shrivastava A. Intraocular pressure fluctuations: how much do they matter? Curr Opin Ophthalmol. 2009;20:84–87. doi: 10.1097/icu.0b013e328324e6c4. [DOI] [PubMed] [Google Scholar]
  • 20.Spiegelhalter DJ, Best NG, Carlin BP, Van der Linde A. Bayesian Measures of Model Complexity and Fit (with Discussion) Journal of the Royal Statistical Society, Series B. 2002;64:583–616. [Google Scholar]
  • 21.Sultan MB, Mansberger SL, Lee PP. Understanding the importance of IOP variables in glaucoma: a systematic review. Survey of Ophthalmology. 2009;54:643–662. doi: 10.1016/j.survophthal.2009.05.001. [DOI] [PubMed] [Google Scholar]
  • 22.Tsiatis AA, Davidian M. Joint modeling of longitudinal and time-to-event data: an overview. Statistica Sinica. 2004;14:809–834. [Google Scholar]
  • 23.Vonesh EF, Greene T, Schluchter MD. Shared parameter models for the joint analysis of longitudinal data and event times. Statistics in Medicine. 2006;25:143–163. doi: 10.1002/sim.2249. [DOI] [PubMed] [Google Scholar]
  • 24.Wulfsohn M, Tsiatis A. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53:330–339. [PubMed] [Google Scholar]

RESOURCES