Summary
Progressive and insidious cognitive decline that interferes with daily life is the defining characteristic of Alzheimer’s disease (AD). Epidemiological studies have found that the pathological process of AD begins years before a clinical diagnosis is made and can be highly variable within a given population. Characterizing cognitive decline in the preclinical phase of AD is critical for the development of early intervention strategies when disease-modifying therapies may be most effective. In the last decade, there has been an increased interest in the application of change-point models to longitudinal cognitive outcomes prior to and after diagnosis. Most of the proposed statistical methodology for describing decline relies upon distributional assumptions that may not hold. In this paper, we introduce a quantile regression with a change-point model for longitudinal data of cognitive function in persons bound to develop AD. A change-point in our model reflects the transition from the cognitive decline due to normal aging to the accelerated decline due to disease progression. Quantile regression avoids common distributional assumptions on cognitive outcomes and allows the covariate effects and the change-point to vary for different quantiles of the response. We provided an approach for estimating the model parameters, including the change-point, and presented inferential procedures based on the asymptotic properties of the estimators. A simulation study showed that the estimation and inferential procedures perform reasonably well in finite samples. The practical use of our model was illustrated by an application to longitudinal episodic memory outcomes from two cohort studies of aging and AD.
Keywords: Alzheimer’s disease, Change-point model, Cognitive aging, Disease progression, Longitudinal data, Quantile regression
1. Introduction
Alzheimer’s disease (AD) is an age-associated chronic neurodegenerative disorder, clinically characterized by a progressive loss of memory and other cognitive and functional abilities. Impairments in episodic memory (the ability to learn and retain new information) are among the earliest signs and symptoms of AD. Characterizing the trajectories of cognitive decline in persons bound to develop AD and the risk factors affecting rate of decline is critical for developing strategies for early diagnosis and potential treatments for delaying or preventing the onset of dementia. In the preclinical stage of AD, the cognitive decline is gradual and difficult to differentiate from normal aging or mild affective disorders (Sliwinski et al., 1996; Becker et al., 2009), but as the disease progresses, the decline in cognitive function accelerates and becomes distinguishable from the profile of typical cognitive aging. Prior literature suggests that the acceleration of cognitive decline might begin months or even years before a clinical diagnosis of AD is made (Amieva et al., 2005; Johnson et al., 2009; Grober et al., 2008; Howieson et al., 2008). Numerous random effects single change-point models have been formulated to examine this acceleration of cognitive loss and individual-level variations in the change trajectories in the pre-diagnosis phase of AD (see, e.g., Hall et al., 2000, 2003, Jacqmin-Gadda et al., 2006, Yu et al., 2012, van den Hout et al., 2010, and more recently, Yang and Gao, 2013). However, the conditional distribution of the response variable in a random effects change-point model is often assumed to be normal even though the response is discrete and/or shows boundary effects. To avoid this shortcoming, we propose a quantile regression change-point model to describe changes in cognitive function in the preclinical and prodromal stages of AD. In addition to being distribution free, as in other situations, rank-based methods are more useful than those which estimate means in change-point models when there may be outliers and, especially, when the scientific questions focus on extremes.
Quantile regression (QR) models the effects of covariates on a conditional quantile of the response variable. Unlike mean regression, QR can delineate the entire conditional distribution of a response given covariates, doesn’t have to specify the error distribution for estimation and inference, and is robust against response outliers (for an overview, see Koenker, 2005). In the past two decades, QR has been extended to longitudinal data modeling in several ways. For example, approaches based on marginal models include those proposed by Jung (1996), He et al. (2002), Wang and Fygenson (2009), and Wang and Zhu (2011). Others have proposed methodological approaches based on subject-specific models (Koenker, 2004; Kim and Yang, 2011) or using Bayesian methods for model estimation (Reich et al., 2010). Recently, Li et al. (2011) studied a piecewise linear QR model with a single change-point. In this paper, we extend this model to the longitudinal framework and present an application to study the progression of cognitive impairment in incident AD subjects participating in two ongoing clinical-pathological cohort studies. Similar to Li et al. (2011), this work focuses on dealing with single change-point situations. Extensions to handle two or more change-points in the model are prohibited by algorithmic issues and merit future research.
The rest of the paper is organized as follows. Section 2 presents the methodology and the associated asymptotic theory. Section 3 evaluates the finite sample performance of the methodology through simulation studies. In Section 4, we apply the proposed methodology to a real data set to examine the longitudinal course of cognitive function in preclinical AD and how decline varies with age, education, and genetic measures. We summarize the paper and discuss future research directions in Section 5. The proofs of the statistical properties of the model parameter estimators are given in the online supplementary material.
2. Quantile Regression with a Change-point Model for Longitudinal Data
2.1 Model Setup and Parameter Estimation
We consider the following general quantile regression with a change-point model (QRCPM) for longitudinal data:
| (1) |
where i indexes subjects, j indexes observations within a subject, Yij is the response variable, Xij is the scalar covariate whose slope changes at an unknown change-point tτ, zij is a q-dimensional vector of covariates with constant slopes, the τth conditional quantile of given Xij and zij is 0, and are independent across i (subjects) but are dependent within a subject. Model (1) with the above assumptions is a population-averaged model for longitudinal data. It means that in the population with covariate values Xij and zij, 100τ % of its subjects have an outcome value no larger than . All model parameters (ατ, β1, τ, β2, τ, tτ, γτ) depend on τ. In data analysis, τ is usually determined based on the subject-matter interest. Throughout the rest of the paper we assume that ni (the number of observations within a subject) is bounded by some positive constant K, but N (the number of subjects) grows. Let be the true parameter vector. We define , θ = (α, β1, β2, t, γ⊤)⊤, η = (α, β1, β2, γ⊤)⊤ , and
To derive the estimator of θ0(τ), θ̂n(τ), we adopt the working assumption of independence between observations and thus consider minimizing the objective function
| (2) |
where ρτ (u) = u(τ − I{u < 0}) is the quantile loss function. Due to the non-convexity of Qn,τ (θ), θ̂n(τ) is obtained via profile estimation. A profile estimator of η at a fixed t is given by
which can be obtained by the function ‘rq’ in R package ‘quantreg’. An estimator of the change-point tτ is given by
where a and b are two constants such that tτ is thought to be in (a, b), usually determined graphically. θ̂n(τ) is obtained from η̂n,τ (t̂n,τ) and t̂n,τ . The optimization method used to minimize Qn,τ (η̂n,τ (t), t) is a combination of golden section search and successive parabolic interpolation, implemented by the function ‘optimize’ in R package ‘stats’. On a PC with a 3.40GHz CPU and 8GB RAM, the profile estimation method takes 0.73 seconds to fit a change-point quantile regression model with two covariates to a longitudinal data set of sample size 8000 (1600 subjects each having 5 observations).
2.2 Large Sample Properties
To derive the asymptotic properties of θ̂n(τ), we introduce some notations:
Note that if fYij is the conditional density of Yij given wij, we have . The large sample properties of θ̂n(τ) are presented in the following two theorems. The full proofs are deferred to Web Appendix A.
Theorem 1
Under assumptions (A1)–(A5) in Web Appendix A and the condition that β1,τ ≠ β2,τ, θ̂n(τ) is a strongly consistent estimator of θ0(τ).
Theorem 2
Under assumptions (A1)–(A6), given in Web Appendix A, and the condition that β1,τ ≠ β2,τ, we have
| (3) |
where
It is easy to extend the foregoing result to QRCPM at multiple quantile levels. For 0 < τ1 < τ2 < ···< τm < 1, m ∈
, we set
, 1 ≤ j ≤ m. We assume that model (1) is true for all the τj’s. Let ϑ̂n = (θ̂n(τ1)⊤, θ̂n(τ2)⊤, ···, θ̂n(τm)⊤)⊤ be the vector of estimated parameters with ϑ0 = (θ0(τ1)⊤, θ0(τ2)⊤, ···, θ0(τm)⊤)⊤. To describe the asymptotic properties of ϑ̂n, we introduce three more notations: Hn = diag[Dn,τ1, ···, Dn,τm],
for j ≠ j′ and k ≠ l, and
By Lemma 1 in Web Appendix A, we have with probability one that
| (4) |
where φτ (r) = τ − I{r < 0}. Stacking (4) from k = 1 to m and then applying the multivariate central limit theorem, we obtain
| (5) |
where Λn is a m(4 + q) × m(4 + q) matrix with klth block being Jn(τk, τl) (k, l = 1, ···, m).
2.3 Asymptotic Covariance Matrix Estimation
If the asymptotic covariance matrix in (5) can be consistently estimated, confidence intervals and hypothesis tests on ϑ0 can be derived when the sample size n is large. The asymptotic covariance matrix estimation involves estimating and . The density estimation method we use is the so-called Hendricks-Koenker Sandwich described in Section 3.4.2 of Koenker (2005). and are estimated in different ways depending on the assumed correlation structure of Yi = (Yi1, ···, Yini)⊤.
We consider the estimation of and for four specific correlation structures of Yi:
(1) Compound Symmetry Form
The correlation matrix of Yi is of the compound symmetry form, i.e.,
where ρ is constant across subjects. Under this form of correlation matrix, it is reasonable to assume that and for i = 1, ···, N, j, j′ = 1, ···, ni and j ≠ j′. This assumption holds if, e.g., , where ’s are i.i.d. random subject effects that are independent of the i.i.d. within-subject random errors ’s. So two naturally reasonable estimators of δ(τk) and ζ(τk,τl) are respectively
| (6) |
and
| (7) |
where ’s are the fitted residuals from the QRCPM at quantile level τ.
(2) AR(1) Structure
The correlation matrix of Yi has AR(1) structure, i.e.,
where ρ is constant across subjects. Under this correlation structure, it is reasonable to assume that and for i = 1, ···, N, j, j′ = 1, ···, ni and j ≠ j′. This assumption holds if, e.g., , where v(·, ·) is a bivariate linear function and uij (j = 1, …, ni) follow a stationary AR(1) time series model. Therefore, two naturally reasonable estimators of and are respectively
| (8) |
and
| (9) |
(3) Heteroscedastic Correlation
, where g(·) is a non-constant function, ’s are independent and identically distributed, ’s are independent and identically distributed, and and are independent. Under this assumption, it is reasonable to estimate and with generalized additive (GA) models (Hastie and Tibshirani, 1986). Specifically, we fit a GA logistic model to the binary response for estimating :
| (10) |
where S1(·) and S2(·) are two unspecified functions that are related with g(·). This form of GA model embodies the feature that . S1(·) and S2(·) are estimated by smoothing splines with four degrees of freedom. Therefore, the estimator of is
| (11) |
where α̂, Ŝ1(·) and Ŝ2(·) are the estimators of α, S1(·) and S2(·), respectively. We also fit a GA logistic model to the binary response for estimating :
| (12) |
where Q1(·) and Q2(·) are another two unspecified functions that are related with g(·). Note that we don’t necessarily have . Again, Q1(·) and Q2(·) are estimated by smoothing splines with four degrees of freedom. Hence, the estimator of is
| (13) |
where η̂, Q̂1(·) and Q̂2(·) are the estimators of η, Q1(·) and Q2(·), respectively.
(4) Unstructured Correlation
For an unstructured correlation matrix of Yi, we propose two general estimators for and respectively:
| (14) |
and
| (15) |
These two general estimators can also be applied to the previous three special cases.
It is worth pointing out that the above covariance matrix estimation methods are also applicable to the following linear quantile regression model for longitudinal data,
| (16) |
where Xij ∈ ℝp, i indexes subjects, j indexes observations within a subject, the tth conditional quantile of given Xij is 0, and ’s are independent across i (subjects) but are dependent within a subject. The simplest estimation approach for (16) is to compute the quantile regression as if the data were all independent, i.e., estimating βτ by
| (17) |
He et al. (2002) and Wang and Fygenson (2009), among others, showed that this simple method still consistently estimates βτ and that the estimator is asymptotically normal with a covariance matrix containing the same terms and as defined for (5).
3. Simulation study of QRCPM for longitudinal data
We investigated the finite sample performance of the estimators as well as Wald confidence intervals based on (5) for slopes and change-point in QRCPM through a simulation study. The simulated data were generated from the following three scenarios:
-
Scenario of Compound Symmetry Correlation Structure:
(18) where and ;
-
Scenario of AR(1) Correlation Structure:
(19) where v(x) = 3.2 – 0.2x, uij = 0.5ui,j−1 + εij and ;
-
Scenario of Heteroscedastic Correlation Structure:
(20) where and .
The three corresponding covariance matrix estimators described in Section 2.3 were used in the above three scenarios respectively. As a comparison, the general estimator of the covariance matrix for the unstructured correlation of responses was also used in each scenario. To add some variation to the number of observations per subject in all three scenarios, we set ni = 5 for i = 1, ···, N – 2, nN−1 = 4 and nN = 6, where N was the number of subjects in a data set. The number of observations n in a data set was thus 5N. In the scenarios of compound symmetry and heteroscedastic correlation structures, and . In the scenario of AR(1) correlation structure, , xij = xi,j−1 + 0.5 for j > 1, and . The number of simulation runs was 1000. The simulation results are presented in Table 1. In all cases, we reported results for Wald confidence intervals with a nominal coverage probability of 0.95.
Table 1.
Simulation results for estimating β1,τ, β2,τ, tτ and γτ
| n | Corr. Struct. | Parameter | τ | Bias | SE | SEE
|
CP
|
||
|---|---|---|---|---|---|---|---|---|---|
| Specific | General | Specific | General | ||||||
| 2000 | Comp. Sym. | β1,τ | 0.5 | 0.002 | 0.041 | 0.040 | 0.040 | 0.941 | 0.943 |
| 0.7 | 0.003 | 0.043 | 0.042 | 0.042 | 0.948 | 0.948 | |||
| β2,τ | 0.5 | 0.001 | 0.038 | 0.038 | 0.038 | 0.939 | 0.938 | ||
| 0.7 | −0.001 | 0.040 | 0.040 | 0.040 | 0.951 | 0.951 | |||
| tτ | 0.5 | −0.008 | 0.169 | 0.159 | 0.159 | 0.937 | 0.936 | ||
| 0.7 | −0.003 | 0.177 | 0.167 | 0.168 | 0.938 | 0.939 | |||
| γτ | 0.5 | 0.000 | 0.013 | 0.014 | 0.014 | 0.960 | 0.961 | ||
| 0.7 | 0.000 | 0.014 | 0.015 | 0.015 | 0.953 | 0.954 | |||
|
| |||||||||
| AR(1) | β1,τ | 0.5 | 0.018 | 0.153 | 0.130 | 0.129 | 0.914 | 0.913 | |
| 0.7 | 0.015 | 0.147 | 0.134 | 0.134 | 0.923 | 0.919 | |||
| β2,τ | 0.5 | −0.010 | 0.098 | 0.089 | 0.089 | 0.932 | 0.928 | ||
| 0.7 | −0.009 | 0.100 | 0.093 | 0.093 | 0.927 | 0.927 | |||
| tτ | 0.5 | −0.002 | 0.429 | 0.317 | 0.317 | 0.865 | 0.860 | ||
| 0.7 | 0.005 | 0.423 | 0.331 | 0.331 | 0.875 | 0.874 | |||
| γτ | 0.5 | 0.000 | 0.025 | 0.024 | 0.024 | 0.932 | 0.932 | ||
| 0.7 | 0.000 | 0.026 | 0.025 | 0.025 | 0.944 | 0.936 | |||
|
| |||||||||
| Hetero. | β1,τ | 0.5 | 0.003 | 0.080 | 0.075 | 0.075 | 0.929 | 0.930 | |
| 0.7 | 0.005 | 0.085 | 0.079 | 0.079 | 0.929 | 0.930 | |||
| β2,τ | 0.5 | −0.006 | 0.048 | 0.046 | 0.046 | 0.936 | 0.934 | ||
| 0.7 | −0.005 | 0.053 | 0.048 | 0.048 | 0.925 | 0.922 | |||
| tτ | 0.5 | 0.014 | 0.282 | 0.247 | 0.247 | 0.906 | 0.908 | ||
| 0.7 | 0.009 | 0.305 | 0.261 | 0.260 | 0.894 | 0.895 | |||
| γτ | 0.5 | 0.001 | 0.019 | 0.020 | 0.020 | 0.951 | 0.956 | ||
| 0.7 | 0.000 | 0.020 | 0.021 | 0.021 | 0.957 | 0.953 | |||
|
| |||||||||
| 4000 | Comp. Sym. | β1,τ | 0.5 | 0.001 | 0.028 | 0.028 | 0.028 | 0.956 | 0.953 |
| 0.7 | 0.000 | 0.031 | 0.029 | 0.029 | 0.935 | 0.939 | |||
| β2,τ | 0.5 | −0.002 | 0.028 | 0.027 | 0.027 | 0.943 | 0.944 | ||
| 0.7 | −0.003 | 0.029 | 0.029 | 0.029 | 0.955 | 0.959 | |||
| tτ | 0.5 | 0.003 | 0.124 | 0.112 | 0.113 | 0.923 | 0.922 | ||
| 0.7 | 0.007 | 0.128 | 0.118 | 0.119 | 0.927 | 0.926 | |||
| γτ | 0.5 | 0.001 | 0.010 | 0.010 | 0.010 | 0.956 | 0.954 | ||
| 0.7 | 0.000 | 0.010 | 0.010 | 0.010 | 0.938 | 0.942 | |||
|
| |||||||||
| AR(1) | β1,τ | 0.5 | 0.004 | 0.097 | 0.090 | 0.090 | 0.925 | 0.923 | |
| 0.7 | 0.008 | 0.102 | 0.093 | 0.093 | 0.930 | 0.932 | |||
| β2,τ | 0.5 | −0.004 | 0.068 | 0.062 | 0.062 | 0.924 | 0.926 | ||
| 0.7 | −0.004 | 0.069 | 0.064 | 0.064 | 0.925 | 0.929 | |||
| tτ | 0.5 | 0.002 | 0.272 | 0.221 | 0.221 | 0.894 | 0.897 | ||
| 0.7 | 0.002 | 0.280 | 0.232 | 0.232 | 0.897 | 0.900 | |||
| γτ | 0.5 | 0.001 | 0.017 | 0.017 | 0.017 | 0.949 | 0.951 | ||
| 0.7 | −0.001 | 0.018 | 0.018 | 0.018 | 0.951 | 0.953 | |||
|
| |||||||||
| Hetero. | β1,τ | 0.5 | 0.003 | 0.053 | 0.053 | 0.053 | 0.948 | 0.947 | |
| 0.7 | 0.002 | 0.058 | 0.055 | 0.055 | 0.935 | 0.936 | |||
| β2,τ | 0.5 | 0.000 | 0.033 | 0.033 | 0.033 | 0.944 | 0.943 | ||
| 0.7 | −0.002 | 0.036 | 0.034 | 0.035 | 0.943 | 0.942 | |||
| tτ | 0.5 | −0.005 | 0.190 | 0.176 | 0.175 | 0.920 | 0.920 | ||
| 0.7 | 0.004 | 0.200 | 0.184 | 0.184 | 0.922 | 0.926 | |||
| γ τ | 0.5 | 0.000 | 0.014 | 0.014 | 0.014 | 0.953 | 0.957 | ||
| 0.7 | 0.000 | 0.014 | 0.015 | 0.015 | 0.943 | 0.948 | |||
|
| |||||||||
| 8000 | Comp. Sym. | β1,τ | 0.5 | 0.000 | 0.020 | 0.019 | 0.019 | 0.936 | 0.939 |
| 0.7 | 0.000 | 0.020 | 0.020 | 0.020 | 0.941 | 0.942 | |||
| β2,τ | 0.5 | −0.001 | 0.020 | 0.019 | 0.019 | 0.949 | 0.947 | ||
| 0.7 | −0.001 | 0.021 | 0.021 | 0.021 | 0.939 | 0.938 | |||
| tτ | 0.5 | 0.004 | 0.084 | 0.079 | 0.079 | 0.944 | 0.945 | ||
| 0.7 | 0.000 | 0.086 | 0.084 | 0.084 | 0.937 | 0.935 | |||
| γτ | 0.5 | 0.000 | 0.007 | 0.007 | 0.007 | 0.935 | 0.933 | ||
| 0.7 | 0.000 | 0.007 | 0.007 | 0.007 | 0.946 | 0.947 | |||
|
| |||||||||
| AR(1) | β1,τ | 0.5 | 0.003 | 0.068 | 0.066 | 0.065 | 0.933 | 0.934 | |
| 0.7 | 0.004 | 0.073 | 0.068 | 0.068 | 0.926 | 0.925 | |||
| β2,τ | 0.5 | −0.003 | 0.044 | 0.043 | 0.043 | 0.952 | 0.951 | ||
| 0.7 | −0.004 | 0.046 | 0.045 | 0.045 | 0.951 | 0.951 | |||
| tτ | 0.5 | 0.003 | 0.172 | 0.151 | 0.151 | 0.910 | 0.908 | ||
| 0.7 | 0.005 | 0.181 | 0.158 | 0.158 | 0.909 | 0.910 | |||
| γτ | 0.5 | 0.000 | 0.011 | 0.012 | 0.012 | 0.958 | 0.958 | ||
| 0.7 | 0.000 | 0.012 | 0.012 | 0.012 | 0.960 | 0.958 | |||
|
| |||||||||
| Hetero. | β1,τ | 0.5 | 0.000 | 0.038 | 0.037 | 0.037 | 0.947 | 0.944 | |
| 0.7 | 0.001 | 0.039 | 0.039 | 0.039 | 0.951 | 0.949 | |||
| β2,τ | 0.5 | −0.001 | 0.024 | 0.023 | 0.023 | 0.942 | 0.943 | ||
| 0.7 | −0.001 | 0.025 | 0.024 | 0.024 | 0.942 | 0.946 | |||
| tτ | 0.5 | 0.000 | 0.131 | 0.124 | 0.124 | 0.935 | 0.938 | ||
| 0.7 | −0.004 | 0.140 | 0.130 | 0.130 | 0.928 | 0.927 | |||
| γτ | 0.5 | 0.000 | 0.010 | 0.010 | 0.010 | 0.944 | 0.945 | ||
| 0.7 | 0.000 | 0.010 | 0.010 | 0.010 | 0.959 | 0.961 | |||
n: total number of observations, n = 5N with N being the number of subjects; Corr. Struct.: true correlation structure of Yi; Comp. Sym.: compound symmetry; Hetero.: heteroscedastic. Bias and SE are bias and standard error of the parameter estimator, SEE is the mean of the standard error estimator, and CP is the coverage rate of the 95% confidence interval. Specific means using the estimator of that is specific to the correlation structure of Yi. General means using the estimator of that does not depend on the correlation structure of Yi.
As shown in Table 1, the average of the standard error estimates is smaller than the empirical standard error for the change-point estimator in every case. As a result, the coverage rate of every Wald confidence interval for the change-point is lower than the nominal level of 95%. This might be due to the density estimation involved in estimating the asymptotic covariance matrix, which is not as efficient as the estimation of θ0(τ). Another explanation could be related to the finding by Hinkley (1969), based on an empirical study, that the asymptotic normality of the change-point estimator is not a good approximation for small sample sizes in two-phase broken line regression, although it was proved that, in this special case of segmented regression, the change-point estimator does have an asymptotic normal distribution (Feder, 1975). Notwithstanding the underestimation of the change-point estimator’s stand error for small sample sizes, our simulation showed that, as the sample size increases, the estimate of theoretical standard error approaches the empirical standard error and the empirical coverage probability of Wald confidence interval goes up towards 95%, especially in the compound symmetry scenario. The estimators and Wald confidence intervals for regression coefficients performed reasonably well except a little underestimation of the standard errors in AR(1) and heteroscedastic scenarios when the number of observations is smaller than or equal to 4000. Again, this underestimation might be due to the less efficient density estimation involved in estimating the asymptotic covariance matrix. Similar findings (Wald confidence intervals’ coverage rates being smaller than the nominal level) were reported by Koenker (2005), page 111, for linear quantile regression with a sample size of 500. In every correlation structure scenario, the standard error estimate based on the general estimator of is equivalent to the one based on its specific estimator suitable for that scenario, so is the empirical coverage probability of Wald confidence interval. This suggests that the general estimator of might be used universally.
4. Application to the Longitudinal Assessment of Cognitive Function in Preclinical Alzheimer’s Disease
4.1 Background and Data
Section 4 presents an application of the QRCPM to the combined data from two ongoing cohort studies of incident Alzheimer’s disease: the Religious Orders Study (ROS) (Bennett et al., 2012) and the Memory and Aging Project (MAP) (Bennett et al., 2012). Recruitment, exclusion, and inclusion criteria for these studies and subject evaluations have been previously described in detail (Wilson et al., 2002). Briefly, both studies recruit older individuals without dementia who agree to receive clinical and psychological evaluation each year and to donate their brain for postmortem examination. Enrollment for ROS began in 1994 and includes the participation of over 1,100 older religious clergy (priests, brothers, and nuns). MAP study began in 1997 and currently includes more than 1,400 participants from about 40 retirement communities and senior housing facilities in the Chicago metropolitan area. Besides sharing similar clinical and pathologic findings, these studies also share a common 17- test neuropsychological test battery and follow the same standard protocol and criteria for clinical diagnosis. Both studies were approved by the Institutional Review Board of Rush University Medical Center. Written informed consent was obtained from all study participants. We used data from MAP and ROS studies to estimate the time point at which the acceleration of cognitive decline began in different cognition quantile groups of persons who developed AD and to study whether the shape of cognitive decline depends on gender, education level, and genetic risk factors.
Outcome Measure
The dependent variable of interest in this illustration, episodic memory, was a composite score comprised of six widely-used clinical neuropsychological tests intended for measuring conscious retrieval or recognition of information acquired at a particular time and place (Wilson et al., 2002). Although other psychological functions also may be affected in the early stages of the disease process, episodic memory has been frequently associated with cognitive deficits and early brain changes in preclinical AD (Salmon and Bondi, 1999). Indeed, progressive impairment in episodic memory lasting more than six months is part of the diagnostic criteria for probable AD (Dubois et al., 2007). Therefore, we focused on composite score outcomes defining the domain of episodic memory collected annually on both studies for up to 12 years. The target population for analysis was defined as subjects who were diagnosed with AD during the follow-up visits.
Covariates
Explanatory covariates were included in the model to study predictors of inter-individual differences in change-points and trajectories of cognition quantile over time at different quantile levels. Participant’s gender, years of education (categorized into two levels: ≤ 12 years and > 12 years), and the presence of the apolipoprotein E ε4 allele (ApoE4), a genetic risk factor for cognitive decline and AD, were included in the analysis as time-independent covariates. All these variables have been previously shown to be associated with changes in cognitive function.
Clinical Diagnoses
Annual diagnostic classifications followed a three step process. First, a neuropsychological test battery was administered and scored. Second, a clinical neu-ropsychologist assessed the presence of cognitive impairment after a careful review of all cognitive scores results and relevant demographic information. Third, a clinician determined whether standard criteria for dementia and AD (McKhann et al., 1984) were met. At this step, individuals were classified with respect to AD, mild cognitive impairment, and other neurologic disorders.
The sample for analysis consisted of 429 incident AD cases with at least one follow-up measure of episodic memory and at least one non-missing explanatory covariate. Participant’s age at baseline ranged from 64.78 yrs to 102.10 yrs. A summary of characteristics of the sample is presented in Table 2.
Table 2.
Characteristics of the sample of 429 subjects for analysis
| Characteristic | Analysis |
|---|---|
| Gender—no. (%) | |
| Male | 115 (26.8%) |
| Female | 314 (73.2%) |
| Years of education—no. (%) | |
| > 12 (attended college) | 89 (20.7%) |
| ≤ 12 (not attended college) | 340 (79.3%) |
| ApoE4 status—no. (%) | |
| carrier | 127 (29.6%) |
| non-carrier | 274 (63.9%) |
| unknown | 28 (6.5%) |
| Study—no. (%) | |
| MAP | 189 (44.1%) |
| ROS | 240 (55.9%) |
| Age at baseline (yrs)—mean (sd) | 81.48 (6.57) |
| Age at AD diagnosis (yrs)—mean (sd) | 85.79 (6.62) |
| No. of cognitive evaluations—mean (sd) | 7.1 (2.8) |
4.2 Model Building
As in modeling the change of mean cognitive score over time in AD patients (Yu et al., 2012; Hall et al., 2000), we characterized the evolution of episodic memory quantile by a piecewise linear trajectory with a change-point in the time at which the decline of episodic memory quantile accelerated. The inspection of spaghetti plots of episodic memory score against age and time to AD diagnosis (Figures 1(a) and 1(b)) suggested that the change-point might be different for different individuals on the scale of age, but common on the scale of time to AD diagnosis.
Figure 1.
Spaghetti plots of episodic memory score against age (a) and time to AD diagnosis (b)
Thus, we fitted the following model to the data of the 429 incident AD cases:
| (21) |
where Yij, Xij and Zij are episodic memory score, the time to AD diagnosis and the age of subject i at the jth measurement, respectively, tτ is the time point when the decline of episodic memory quantile begins to accelerate on the scale of time to diagnosis, and has τth quantile at zero. γτ represents the normal aging effect on episodic memory. βτ reflects the change in decline rate of episodic memory quantile after the change-point. The subtraction of 65 in the age terms was done for interpretability of the intercept. Model (21) is equivalent to the following piecewise linear model on the age scale with a random change-point depending on the age of AD occurrence.
where Ri is subject i’s age of AD occurrence. Model (21) is also the same as the final statistical model used in Hall et al. (2000) except that the former models the quantile and the latter models the mean. We performed statistical inference assuming an unstructured correlation matrix for Yi = (Yi1, ···, Yini), that is, we used the covariance matrix estimator for the unspecified correlation structure described in Section 2.3. All the hypothesis tests in the analysis were two-sided with a significance level of 0.05.
To study whether the shape of episodic memory decline was related to gender, education level or genetic risk factors and whether there was a cohort effect from the two different study populations, we fitted model (21) with τ = 0.5 to the data of the subsets of males, females, subjects who attended college, subjects who didn’t attend college, subjects with ApoE4, subjects without ApoE4, subjects in MAP study and subjects in ROS study separately. Here we assume that the quantile functions (21) at different levels are affected by the same set of significant covariates in a common way across quantile levels. Because the model at τ = 0.5 is the most robust against outliers and measurement errors in the models of all quantile levels, it was used to determine the significant covariates and how they affect the quantile curves. Through the 4-degrees-of-freedom Wald test and the 1-degree-of-freedom Wald test, we tested the difference in the full set of parameters and the individual parameters of model (21) between males and females, subjects who attended college and subjects who didn’t attend college, subjects with ApoE4 and subjects without ApoE4, and subjects in MAP study and subjects in ROS study. We found that ApoE4 status produced significant differences in only α0.5 and γ0.5, but gender and study cohort didn’t yield any significant difference in any model parameter. In particular, the change-point t(0.5) doesn’t differ significantly for different cohorts and gender/education/genetic risk groups. Therefore, we decided to fit the following model to the data
| (22) |
where Vi is the indicator of whether to carry ApoE4.
4.3 Analysis Results
Model (22) was estimated at three quantile levels: 0.2, 0.5, and 0.8. The parameter estimates and their asymptotic confidence intervals are tabulated in Table 3. The 95% confidence intervals for the parameters showed that after the change-point, episodic memory declined significantly faster than before the onset of change at all the three quantile levels. The confi-dence intervals also showed that the presence of at least one ApoE4 allele significantly lowered the 20th and 50th percentiles of the episodic memory score and significantly decreased the decline rate of the 20th percentile. This finding may be due to the fact that subjects with episodic memory score at 20th percentile in the ApoE4 carrier group started follow-up with lower scores than their counterparts in the ApoE4 non-carrier group.
Table 3.
Estimates and 95% CIs for the Parameters of Model (22)
|
τ
|
|||
|---|---|---|---|
| 0.2 | 0.5 | 0.8 | |
| ατ | 0.12 [−0.09, 0.33] | 0.39 [0.24, 0.54] | 0.70 [0.59, 0.80] |
| βτ | −0.28 [−0.32, −0.25] | −0.23 [−0.26, −0.21] | −0.16 [−0.19, −0.13] |
| tτ | −4.4 [−4.9, −3.9] | −4.6 [−5.0, −4.1] | −5.0 [−5.7, −4.2] |
| γτ | −0.025 [−0.037, −0.013] | −0.013 [−0.021, −0.005] | −0.010 [−0.016, −0.004] |
| aτ | −0.73 [−1.13, −0.33] | −0.53 [−0.92, −0.15] | −0.18 [−0.55, 0.20] |
| bτ | 0.023 [0.002, 0.044] | 0.011 [−0.008, 0.031] | 0.000 [−0.021, 0.021] |
The three change-points for the 20th, 50th and 80th percentiles of the episodic memory score were 4.4, 4.6 and 5.0 years prior to the diagnosis of AD respectively. These results were similar to the estimates of change-points for the Buschke Selective Reminding Test score (Masur et al., 1994) obtained by Hall et al. (2000) (4.3 and 5.1 years prior to diagnosis in AD patients and dementia patients respectively). For comparison purposes, we also applied Hall et al. (2000) change-point model to our example data using the same outcome, explanatory variables, and general model set-up. The change-point estimate was 3.4 years prior to diagnosis; closest to the lowest (20th) percentile change-point obtained with the QRCPM.
Results of Wald tests for the QRCPM didn’t show that change-points were significantly different across the three quantile levels. Furthermore, we compared the decline rate of episodic memory quantile across the three quantile levels using Wald tests. The decline rates before the change-point were not significantly different across the quantile levels for either AD patients with ApoE4 or AD patients without ApoE4. The decline rates after the change-point were significantly different across the quantile levels for both AD patients with ApoE4 and AD patients without ApoE4 (both p ≈ 0). As illustrated in Figure 2, after the acceleration of episodic memory decline starts, the lower percentiles of the episodic memory score decline faster than the higher percentiles for both ApoE4 carriers and non-carriers.
Figure 2.
Episodic memory score against time to diagnosis of AD conditional on ApoE4 status with fitted 80th (dotted), 50th (dashed) and 20th (solid) quantile regression lines. The fitted quantile regression lines are for the sample mean of age at diagnosis in the sample used to fit model (22): 86.1 years old.
5. Discussion
In this paper, we proposed a quantile regression with a change-point modeling framework for longitudinal data. The proposed methodology is useful in describing the trajectories of disease progression and studying the effect of potential correlates of observed clinical outcomes. We applied the model to data from two longitudinal clinical-pathologic cohort studies of aging and AD to study cognitive decline in preclinical AD. The model presented avoids the possibly unrealistic normality assumptions in the distribution of cognitive scores, which is required by random effects change-point models. It also allows risk factors or disease biomarkers to have different effects on different quantiles of the distribution of cognition. We provided the asymptotic properties of the model parameter estimators as well as the covariance matrix estimation methods for statistical inference. The proposed methodology has numerous applications in epidemiology and medicine to study the course of changes in disease progression. Other examples include the growth pattern in body mass index of children (Sorva et al., 1990) and T4 cell counts prior to AIDS (Kiuchi et al., 1995).
There are at least three future research topics on quantile regression change-point models. One is to model the effects of covariates on the change-point and its associated covariate’s coefficients explicitly. This will lead to a unified model for the entire data rather than separate models for data sets separated by covariates that significantly affect the change-point and/or its associated covariate’s coefficients. However, we expect that many research efforts are needed to develop the asymptotic results for the unified model. Instead, one could make inferences by bootstrapping individuals. Another direction is to study subject-specific quantile regression change-point models in which the within-subject dependence among data is taken into account through the incorporation of random effects. Accounting for random effects in quantile regression is difficult due to the lack of a likelihood. However, using the connection between the asymmetric Laplace distribution (ALD) and quantile regression (Yu and Moyeed, 2001), one could develop a likelihood-based inferential approach by assuming an ALD for residual errors and a symmetric multivariate Laplace distribution (Kotz et al., 2001) for random effects. This working assumption has been adopted by Liu and Bottai (2009) for mixed effects quantile regression models without a change-point. The analysis of the data from ROS and MAP studies implicitly assumed that all the missing outcome measures were missing at random. This assumption is possibly not true. Therefore, the third research topic would be how to treat missing data in quantile regression with a change-point model for longitudinal data, especially when the missing data are nonignorable.
Supplementary Material
Acknowledgments
The data for the example presented in this work were provided by these two studies supported by the NIA: the Religious Orders Study (P30AG10161, R01AG15819), and the Rush Memory and Aging Project (NIA Grant R01AG17917). We are grateful to the principal investigator of these two projects, David A. Bennett, MD, for the authorization to use the data. This work is supported in part by grants from the Alzheimer’s Association (NIRG-12-242799, Dowling) and the NIA (5 P50 AG033514-05, Dowling and Chappell).
Footnotes
Web Appendix A, referenced in Section 2.2, R code implementing the methods and a simulated data set are available with this paper at the Biometrics website on Wiley Online Library.
References
- Amieva H, Jacqmin-Gadda H, Orgogozo JM, Le Carret N, Helmer C, Letenneur C, Barberger-Gateau P, Fabrigoule C, Dartigues JF. The 9 year cognitive decline before dementia of the Alzheimer’s type: A prospective population-based study. Brain. 2005;128:1093–1101. doi: 10.1093/brain/awh451. [DOI] [PubMed] [Google Scholar]
- Becker J, Chang Y, Lopez O, Dew M, Sweet R, Barnes D, Yaffe K, Young J, Kuller L, Reynolds CF. Depressed mood is not a risk factor for incident dementia in a community-based cohort. American Journal of Geriatric Psychiatry. 2009;17:1653–663. doi: 10.1097/jgp.0b013e3181aad1fe. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bennett D, Schneider J, Arvanitakis Z, Wilson R. Overview and findings from the Religious Orders study. Current Alzheimer Research. 2012;9:628–645. doi: 10.2174/156720512801322573. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bennett D, Schneider J, Buchman A, Barnes L, Boyle AP, Wilson R. Overview and findings from the Rush Memory and Aging project. Current Alzheimer Research. 2012;9:646–663. doi: 10.2174/156720512801322663. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dubois B, Feldman H, Jacova C, DeKosky S, Barberger-Gateau P, Cummings J, Delacourte A, Galasko D, Gauthier G, Jicha G, Meguro J, O’Brien J, Pasquier F, Robert P, Rossor M, Salloway S, Stern Y, Visser P, Scheltens P. Research criteria for the diagnosis of Alzheimer’s disease: Revising the nincdscadrda criteria. The Lancet Neurology. 2007;6:734–746. doi: 10.1016/S1474-4422(07)70178-3. [DOI] [PubMed] [Google Scholar]
- Feder PI. On asymptotic distribution theory in segmented regression problems—identified case. Annals of Statistics. 1975;3:49–83. [Google Scholar]
- Grober E, Hall CB, Lipton RB, Zonderman AB, Resnick SM, Kawas C, Albert M, Moss M, Tanzi R, Jones K, et al. Memory impairment, executive dysfunction, and intellectual decline in preclinical Alzheimer’s disease. Journal of the International Neuropsychological Society. 2008;14:266–278. doi: 10.1017/S1355617708080302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hall C, Ying J, Kuo L, Lipton R. Bayesian and profile likelihood change point methods for modeling cognitive function over time. Computational Statistics and Data Analysis. 2003;42:91–109. [Google Scholar]
- Hall CB, Lipton RB, Sliwinski M, Stewart WF. A change point model for estimating the onset of cognitive decline in preclinical Alzheimer’s disease. Statistics in Medicine. 2000;19:1555–1566. doi: 10.1002/(sici)1097-0258(20000615/30)19:11/12<1555::aid-sim445>3.0.co;2-3. [DOI] [PubMed] [Google Scholar]
- Hastie T, Tibshirani R. Generalized additive models. Statistical Science. 1986;1:297–318. doi: 10.1177/096228029500400302. [DOI] [PubMed] [Google Scholar]
- He X, Zhu ZY, Fung WK. Estimation in a semiparametric model for longitudinal data with unspecified dependence structure. Biometrika. 2002;89:579–590. [Google Scholar]
- Hinkley DV. Inference about the intersection in two-phase regression. Biometrika. 1969;56:495–504. [Google Scholar]
- Howieson DB, Carlson NE, Moore MM, Wasserman D, Abendroth CD, Payne-Murphy J, Kaye JA. Trajectory of mild cognitive impairment onset. Journal of the International Neuropsychological Society. 2008;14:192–198. doi: 10.1017/S1355617708080375. [DOI] [PubMed] [Google Scholar]
- Jacqmin-Gadda H, Commenges D, Dartigues JF. Random changepoint model for joint modeling of cognitive decline and dementia. Biometrics. 2006;62:254–260. doi: 10.1111/j.1541-0420.2005.00443.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson D, Storandt M, Morris J, Galvin J. Longitudinal study of the transition from healthy aging to Alzheimer’s disease. Archives of Neurology. 2009;66:1254–1259. doi: 10.1001/archneurol.2009.158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jung S. Quasi-likelihood for median regression models. Journal of the American Statistical Association. 1996;91:251–257. [Google Scholar]
- Kim MO, Yang Y. Semiparametric approach to a random effects quantile regression model. Journal of the American Statistical Association. 2011;106:1405–1417. doi: 10.1198/jasa.2011.tm10470.. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kiuchi AS, Hartigan JA, Holford TR, Rubinstein P, Stevens CE. Change points in the series of t4 counts prior to aids. Biometrics. 1995;51:236–248. [PubMed] [Google Scholar]
- Koenker R. Quantile regression for longitudinal data. Journal of Multivariate Analysis. 2004;91:74–89. [Google Scholar]
- Koenker R. Quantile Regression. Cambridge; New York: 2005. [Google Scholar]
- Kotz S, Kozubowski TJ, Podgorski K. The Laplace Distribution and Generalizations: A Revisit with Applications to Communications, Exonomics, Engineering, and Finance. Birkhäuser; Boston: 2001. [Google Scholar]
- Li C, Wei Y, Chappell R, He X. Bent line quantile regression with application to an allometric study of land mammals’ speed and mass. Biometrics. 2011;67:242–249. doi: 10.1111/j.1541-0420.2010.01436.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liu Y, Bottai M. Mixed-effects models for conditional quantiles with longitudinal data. The International Journal of Biostatistics. 2009;5:1–24. doi: 10.2202/1557-4679.1186. [DOI] [PubMed] [Google Scholar]
- Masur D, Sliwinski M, Lipton R, Blau A, Crystal H. Neuropsychological prediction of dementia and the absence of dementia in healthy elderly persons. Neurology. 1994;44:1427. doi: 10.1212/wnl.44.8.1427. [DOI] [PubMed] [Google Scholar]
- McKhann G, Drachman D, Folstein M, Katzman R, Price D, Stadlan E. Clinical diagnosis of Alzheimer’s disease report of the Nincds–Adrda work group under the auspices of department of health and human services task force on Alzheimer’s disease. Neurology. 1984;34:939–939. doi: 10.1212/wnl.34.7.939. [DOI] [PubMed] [Google Scholar]
- Reich BJ, Bondell HD, Wang HJ. Flexible bayesian quantile regression for independent and clustered data. Biostatistics. 2010;11:337–352. doi: 10.1093/biostatistics/kxp049. [DOI] [PubMed] [Google Scholar]
- Salmon D, Bondi M. Neuropsychology of alzheimer disease. In: Terry R, Katzman R, Bick K, Sisodia S, editors. Alzheimer disease. 2 Lippincott Williams & Wilkins; Philadelphia: 1999. [Google Scholar]
- Sliwinski M, Lipton R, Buschke H, Stewart W. The effects of preclinical dementia on estimates of normal cognitive functioning in aging. The Journals of Gerontology Series B: Psychological Sciences and Social Sciences. 1996;51:217–225. doi: 10.1093/geronb/51b.4.p217. [DOI] [PubMed] [Google Scholar]
- Sorva R, Lankinen S, Tolpanen E, Perheentupa J. Variation of growth in height and weight of children after infancy. Acta Paediatrica. 1990;79:498–506. doi: 10.1111/j.1651-2227.1990.tb11503.x. [DOI] [PubMed] [Google Scholar]
- van den Hout A, Muniz-Terrera G, Matthews F. Smooth random change point models. Statistics in Medicine. 2010;30:599–610. doi: 10.1002/sim.4127. [DOI] [PubMed] [Google Scholar]
- Wang H, Fygenson M. Inference for censored quantile regression models in longitudinal studies. Annals of Statistics. 2009;37:756–781. [Google Scholar]
- Wang HJ, Zhu Z. Empirical likelihood for quantile regression models with longitudinal data. Journal of Statistical Planning and Inference. 2011;141:1603–1615. [Google Scholar]
- Wilson R, Beckett L, Barnes L, Schneider JA, Bach J, Evans D, Bennett D. Individual differences in rates of change in cognitive abilities of older persons. Psychology and Aging. 2002;17:179. [PubMed] [Google Scholar]
- Wilson R, De Leon CF, Barnes L, Schneider JA, Bienias JL, Evans DA, Bennett D. Participation in cognitively stimulating activities and risk of incident Alzheimer’s disease. Journal of the American Medical Association. 2002;287:742–748. doi: 10.1001/jama.287.6.742. [DOI] [PubMed] [Google Scholar]
- Yang L, Gao S. Bivariate random change point models for longitudinal outcomes. Statistics in Medicine. 2013;32:1038–1053. doi: 10.1002/sim.5557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu K, Moyeed RA. Bayesian quantile regression. Statistics and Probability Letters. 2001;54:437–447. [Google Scholar]
- Yu L, Boyle P, Wilson RS, Segawa E, Leurgans S, De Jager PL, Bennett DA. A random change point model for cognitive decline in Alzheimer’s disease and mild cognitive impairment. Neuroepidemiology. 2012;39:73–83. doi: 10.1159/000339365. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.


