A computational method for computing an Alzheimer’s Disease Progression Score; experiments and validation with the ADNI dataset

Bruno M Jedynak; Bo Liu; Andrew Lang; Yulia Gel; Jerry L Prince

doi:10.1016/j.neurobiolaging.2014.03.043

. Author manuscript; available in PMC: 2016 Jan 1.

Published in final edited form as: Neurobiol Aging. 2014 Oct 17;36 Suppl 1:S178–S184. doi: 10.1016/j.neurobiolaging.2014.03.043

A computational method for computing an Alzheimer’s Disease Progression Score; experiments and validation with the ADNI dataset

Bruno M Jedynak ^a,^b,^e,^*, Bo Liu ^a, Andrew Lang ^c, Yulia Gel ^a,^d, Jerry L Prince ^c, for the Alzheimer’s Disease Neuroimaging Initiative

PMCID: PMC4267989 NIHMSID: NIHMS638524 PMID: 25444605

Abstract

Understanding the time-dependent changes of biomarkers related to Alzheimer’s disease (AD) is a key to assessing disease progression and to measuring the outcomes of disease-modifying therapies. In this paper, we validate an Alzheimer’s disease progression score model which uses multiple biomarkers to quantify the AD progression of subjects following three assumptions: (1) there is a unique disease progression for all subjects, (2) each subject has a different age of onset and rate of progression, and (3) each biomarker is sigmoidal as a function of disease progression. Fitting the parameters of this model is a challenging problem which we approach using an alternating least squares optimization algorithm. In order to validate this optimization scheme under realistic conditions, we use the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort. With the help of Monte Carlo simulations, we show that most of the global parameters of the model are tightly estimated, thus enabling an ordering of the biomarkers that fit the model well, ordered as: the Rey auditory verbal learning test with 30 minutes delay, the sum of the two lateral hippocampal volumes divided by the intra-cranial volume, followed by (the clinical dementia rating sum of boxes score and the mini mental state examination score) in no particular order and lastly the Alzheimer’s disease assessment scale-cognitive subscale.

Keywords: Alzheimer’s disease, biomarkers, progression score, sampling from the residuals

1. Introduction

The ability to precisely identify the stage of disease, predict the rate of disease progression, and accurately measure the outcomes of potential therapies [8, 11] is critical to the successful management of Alzheimer’s disease (AD). The classical characterization of late-onset Alzheimer’s disease progression is a time-ordered succession of three stages: normal (N), mild cognitive impairment (MCI), and AD. Physical measurements of disease progression, i.e., biomarkers, are used to classify patients into these three stages, but it has been challenging to reliably define finer stages of the disease. In [5], we have proposed a method for computing an Alzheimer’s Disease Progression Score (ADPS) by computationally combining seven biomarkers of AD. A bi-product of this study was a temporal ordering of the biomarkers. Experiments conducted on the combined AD Neuroimaging Initiative (ADNI) I, GO, and II datasets provided an ordering of the biomarkers which was consistent with [4] except for a cognitive test, the Rey Auditory Verbal Learning Test, 30 minutes recall (RAVLT30), which was found to become dynamic very early in the development of the disease. The statistical validation of this ordering was performed by resampling from the collection of subjects. Since the statistical model used is a regression, it can be argued that an additional validation would be obtained by sampling from the residuals.

In this paper we validate the methodology presented in [5] using the technique of sampling from the residuals. We adopt the model-based bootstrap that is widely employed in regression analysis (see, [14, 7], and references therein) and time series ([1, 12]) and has become increasingly popular for inference of longitudinal processes ([15, 10]). The benefit of the proposed approach is that it enables us to quantify estimation uncertainty and impute missing values [3]. The rest of this paper is organized as follows: in Section 2.1 we present the statistical model. The optimization algorithm used for fitting the parameters is presented in Section 2.2. The ADNI dataset and the selection of the biomarkers is presented in Sections 2.3 and 2.4 respectively. Section 2.5 motivates and then describes the sampling method. The results are presented in Section 3 followed by a discussion of our findings in Section 4. Concluding remarks are presented in Section 5.

2. Method

Our research first describes then evaluates an algorithm for computing an Alzheimer’s disease progression score (ADPS), which assigns a time dependent score to each subject.

2.1. Statistical Model

The method we use is based on three assumptions:

All subjects follow a common disease progression but differ in their age of onset and rate of progression;
As the disease progresses, each biomarker changes continuously and monotonically following a sigmoid shaped curve; and
In the longitudinal period over which biomarkers are observed, the rate of progression for a given subject is constant.

The proposed computation positions the longitudinal measurements of each subject on a common disease progression scale. Since it is considered to be a common scale, all subjects are expected to undergo the same biological and cognitive changes when they reach the same value (or score) on this scale. Thus, individuals are generally mapped to different positions on this scale and they progress at different rates regardless of their age of disease onset.

The age t of subject i is to be transformed into the ADPS s_i as

s_{i} (t) = α_{i} t + β_{i}

(1)

after estimation of the subject dependent parameters α_i and β_i, which indicate rate and onset of disease, respectively. A linear transformation is justified because the interval over which longitudinal observations of the ADNI subjects occur is short relative to the overall disease duration. Our objective is to compute a score for all I subjects in the ADNI database by estimating α = (α₁, …, α_I) and β = (β₁, …, β_I). The subject dependent parameters α and β are deliberately modeled as fixed effects, not random effects, as the ADPS may ultimately be used as a covariate.

The longitudinal dynamic of each biomarker is assumed to be the same across the population and can be represented as a sigmoidal function f of ADPS s. Using θ_k = (a_k, b_k, c_k, d_k) to represent the vector of sigmoid function parameters for the k-th biomarker, we can write the form of the the k-th biomarker as

f (s; θ_{k}) = a_{k} {(1 + e^{- b_{k} (s - c_{k})})}^{- 1} + d_{k} .

(2)

The minimum and maximum values of the sigmoid function are d_k and d_k + a_k, and the value of s for which the biomarker is the most dynamic, having maximum slope a_kb_k/4 corresponding to its inflection point, is c_k. Sigmoids offer a parsimonious parametric model which is often a better fit than linear models for biomarkers [2, 13]. They are also similar in form to the conceptual evolution of biomarkers envisioned by Jack et al. [4] (Fig. 1). A comparison of different shapes (linear, sigmoid, quadratic, splines) of various biomarkers as function of ADAS-COG is presented in [9]

The values of seven biomarkers, measured at all visits of all ADNI subjects, are plotted on the normalized ADPS. Each connected polyline represents the consecutive visits of a single subject, and each line segment is colored according to the subject’s clinical diagnoses between visits (see legend). The gray curves are the sigmoid functions representing the fitted behavior of each biomarker in the normalized space. (Reproduced from [5])

The ADNI database contains measurements y_ijk of biomarker k for subject i at visit j. Since there are irregularities in data collection, we use I to denote the set of triples (i, j, k) for which measurements are available. Each biomarker observation can be written as

y_{ijk} = f (α_{i} t_{ij} + β_{i}; θ_{k}) + σ_{k} ε_{ijk}, (i, j, k) \in I,

(3)

where t_ij is the age of subject i at visit j. Observation noise in each biomarker is modeled for simplicity by the product of ε_ijk, which are independent random variables with zero mean and unit variance, and σ_k, which is the standard deviation of biomarker k. The collection of standard deviations σ = (σ₁, …, σ_K) comprise another unknown that must be estimated.

The unknowns in this problem are α, β, θ, and σ and the least squares problem associated with the observation model in (3) is

\begin{array}{l} l (α, β, θ, σ) \\ = \sum_{(i, j, k) \in I} \log σ_{k} + \frac{1}{2 σ_{k}^{2}} {(y_{ijk} - f (α_{i} t_{ij} + β_{i}; θ_{k}))}^{2} . \end{array}

(4)

Necessary conditions on the available data I for guaranteeing the identifiability of the parameters are as follows:

For each biomarker, there is at least one subject i with α_i ≠ 0 and with at least four distinct time-points in I.
For each subject, there is at least one biomarker which is available at two time points in I

In practice, a sufficient number of data points per parameter is needed in order to obtain tight estimators. Examining first the case with no missing data, the number of equations in (3) is IJK where I is the number of subjects, J is the number of time-points and K is the number of biomarkers. The number of parameters is 2I + 5K, counting two parameters per subject, and five per biomarker (four for the sigmoid and one for the standard deviation). In applications where I is large compared to K, the number of data points per parameter is close to JK/2. Note that longitudinal data (J > 1) is critical for such modeling. However, a small number J of time-points together with a small number K of biomarkers is in principle acceptable. The subset of ADNI presented in Section 2.4 has numerous missing data points. We will use simulations to study the quality of the estimation of the parameter c for each biomarker. This parameter is critical for ordering the biomarkers.

2.2. Parameter Fitting

Parameter fitting is performed using alternating least squares wherein the parameters θ, α, β, and σ are optimized iteratively starting from the values computed in the previous step. The details of the fitting algorithm are shown in Alg. 1. The initial values (Line 1) are α⁽⁰⁾ ≡ 1 and β⁽⁰⁾ ≡ 0. Because of the additive form of (4), optimization over θ is done serially over each of the K biomarkers while keeping (α and β fixed). Similarly, optimization over (α, β) is performed serially over each of the I subjects while keeping θ fixed. Fitting of θ, α, and β requires optimization of continuously differentiable nonconvex functions, which is carried out using the Levenberg-Marquardt algorithm [6] (Lines 4 and 8). I_k (line 4) is the number of subjects and visits available for biomarker k. The denominator in the equation of Line 5 is the number of degrees of freedom. A canonical way to parameterize sigmoid functions is to constrain the parameter b_k in (2) to be non-negative. This is enforced with the loop over biomarkers (Lines 12–16), which does not modify the objective function in (4). Our experiments confirm that successful fitting is accomplished in 30 iterations; i.e., L = 30 on Line 2.

Algorithm 1.

Algorithm for the fitting of the parameters

1: Inititialize α⁽⁰⁾, β⁽⁰⁾

2: for l = 1 to L do

3: for k = 1 to K do

θ_{k}^{(1)} = \arg \min_{θ_{k}} \sum_{(i, j) \in I_{k}} {(y_{ijk} - f (α_{i}^{(0)} t_{ij} + β_{i}^{(0)}; θ_{k}))}^{2}

σ_{k}^{{(1)}^{2}} = \frac{1}{| I_{k} - 2 I - 4 |} \sum_{(i, j) \in I_{k}} {(y_{ijk} - f (α_{i}^{(0)} t_{ij} + β_{i}^{(0)}; θ_{k}^{(1)}))}^{2}

6: end for

7: for i = 1 to I do

(α_{i}^{(1)}, β_{i}^{(1)}) = \arg \min_{α_{i}, β_{i}} \sum_{(j, k) \in I_{i}} \frac{1}{σ_{k}^{{(1)}^{2}}} {(y_{ijk} - f (α_{i} t_{ij} + β_{i}; θ_{k}^{(1)}))}^{2}

9: end for

10: α⁽⁰⁾ = α⁽¹⁾; β⁽⁰⁾ = β⁽¹⁾

11: end for

12: for k = 1 to K do

13: if b_k < 0 then

14:

a_{k}^{(1)} = - a_{k}^{(1)}, b_{k}^{(1)} = - b_{k}^{(1)}, d_{k}^{(1)} = d_{k}^{(1)} + a_{k}^{(1)}

15: end if

16: end for

17: for i = 1 to I do

18:

α_{i}^{(1)} = \frac{α_{i}^{(1)}}{σ_{N}}, β_{i}^{(1)} = \frac{β_{i}^{(1)} - m_{N}}{σ_{N}}

19: end for

Open in a new tab

The units of ADPS are arbitrarily defined, which implies that we must choose two specific numerical values in order to fully specify the ADPS. This situation is analogous to the selection of a scale for temperature, where the numerical values of the freezing and boiling points of water determine the scale. In our experiments we chose to fix the ADPS such that after computation over the entire population, the computed ADPS for all visits of subjects with normal clinical assessment had a trimmed mean value (m_N) and a trimmed standard deviation (σ_N) which are set respectively to zero and one. This is accomplished in Lines 17–19.

2.3. The Alzheimer’s Disease Neuroimaging Initiative cohort

Data used in the preparation of this article were obtained from the Alzheimers Disease Neuroimaging Initiative (ADNI) database (http://adni.loni.ucla.edu/). The ADNI was launched in 2003 by the National Institute on Aging (NIA), the National Institute of Biomedical Imaging and Bioengineering (NIBIB), the Food and Drug Administration (FDA), private pharmaceutical companies and non-profit organizations, as a $60 million, 5-year public private partnership. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimers disease (AD). Determination of sensitive and specific markers of very early AD progression is intended to aid researchers and clinicians to develop new treatments and monitor their effectiveness, as well as lessen the time and cost of clinical trials. The Principal Investigator of this initiative is Michael W. Weiner, MD, VA Medical Center and University of California San Francisco. ADNI is the result of efforts of many coinvestigators from a broad range of academic institutions and private corporations, and subjects have been recruited from over 50 sites across the U.S. and Canada. The initial goal of ADNI was to recruit 800 adults, ages 55 to 90, to participate in the research, approximately 200 cognitively normal older individuals to be followed for 3 years, 400 people with MCI to be followed for 3 years and 200 people with early AD to be followed for 2 years. For up-to-date information, see www.adni-info.org.

2.4. Biomarker selection

The available data in ADNI are measurements in elderly humans of multiple biomarkers associated with Alzheimer’s disease. Hundreds of subjects, categorized into N, MCI, and AD, were examined at baseline and with repeat visits every 6 to 12 months for a period of up to 60 months.

The following seven biomarkers were selected for use based on their relevance in assessing the progression of AD.

the sum of the two lateral hippocampal volumes¹ normalized by dividing by the intra-cranial volume (HIPPO);
the Alzheimer’s Disease Assessment Scale-cognitive subscale (ADAS);
the Mini-Mental State Examination score (MMSE);
the Aβ₄₂ protein level measured from the cerebrospinal fluid (ABETA);
the Tau protein level measured from the cerebrospinal fluid (TAU);
the Clinical Dementia Rating Sum of Boxes score (CDRSB);
the Rey Auditory Verbal Learning Test, 30 minutes recall (RAVLT30)

A detailed description of the ADNI population, protocols and biomarkers is provided at http://adni.loni.ucla.edu/.

The ADNI, ADNI GO, and ADNI 2 biomarker datasets were downloaded from the ADNI server (http://adni.loni.ucla.edu/) on November 24, 2011. All visits without date information were removed. Subjects not having at least two measurements for at least one of the seven biomarkers were also removed. Subjects not having at least two measurements of the HIPPO biomarker were removed. The total number of subjects remaining was 687, where 389 were male, 275 were female, and 23 had unknown sex. The total number of visits was 3658, and the clinical diagnoses at these visits were 1103 normal, 1513 MCI, and 1010 AD² Of the seven biomarkers considered, only ADAS and RAVLT30 were available at the time of download from the ADNI 2/GO dataset. The protocol for these biomarkers is the same in ADNI, ADNI 2, and ADNI GO.

2.5. Sampling from the residuals

The analysis of a longitudinal, simultaneously acquired collection of biomarkers of ADNI dataset is a complex task for several reasons. First, for each biomarker, the sequences of measurements obtained across time point are correlated. Table 1 shows the correlation between measurements taken at baseline and at one year after baseline. The correlation is larger than 0.7 for all the biomarkers and is as high as 0.98 in case of the volume of the hippocampus over all subjects.

Table 1.

Correlation coefficient for each biomarker between the measurements at baseline and 1 year after baseline

Biomarker	Correlation coef.	Biomarker	Correlation coef.
HIPPO	0.98	ADAS	0.83
MMSE	0.74	TAU	0.93
ABETA	0.94	CDRSB	0.85
RAVLT30	0.8

Open in a new tab

Secondly, the structure of the missing data in ADNI is complex. The schedule of visits depends on the status (N, MCI, AD) at baseline and is different for different biomarkers³. Moreover, the subjects are recruited over a relatively long period; hence the number of available measurements at an earlier visit (e.g. baseline) is significantly larger than at a later one (e.g. baseline plus 36 months). Table 2 shows the proportion of available measurements relative to the total number of subjects for each of the 7 selected biomarkers and each time-point.

Table 2.

Proportion in % of available data for each biomarker and each time point. The denominator is the number of subjects with ADAS measurement at baseline. Note that only MCI subjects have a visit at 18 month after baseline which explains why the numbers are lower for all the biomarkers at the 18 months visit.

Time-point	HIPPO	ADAS	MMSE	TAU	ABETA	CDRSB	RAVLT30
baseline	99	100	100	53	53	100	99
+6 months	97	99	99	0	0	99	99
+12 months	96	99	99	45	45	98	98
+18 months	40	45	45	0	0	45	45
+24 months	73	88	88	13	13	87	86
+36 months	20	62	64	3	3	62	63

Open in a new tab

We now describe the sampling from the residual algorithm which we employ. The key idea is to construct a bootstrap distribution of y_ijk while taking into account its dependence structure, which then can be employed for inference and imputation purposes. In particular, the residuals ε_ijk are ideally independent and identically distributed (i.i.d.). However, since measurements are taken consecutively over time, this is not the case, as Table 1 indicates. Thus, we can attempt to filter out temporal dependence among ε_ijk by applying an appropriate filter (e.g., an autoregressive model), and the resulting new filtered residuals η_ijk should be close to the i.i.d. assumption. Hence, given conditional independence of η_ijk, we can employ the classical bootstrap and sample new realizations of $η_{ijk}^{*}$ from the estimated empirical probability distribution of η_ijk. Substituting in the sampled $η_{ijk}^{*}$ into the filter yields new bootstrap residuals, $ε_{ijk}^{*}$ which in turn can be plugged-in into the estimated model (3) and the new proxy bootstrap values $y_{ijk}^{*}$ are thus obtained. The resulting bootstrap distribution of $y_{ijk}^{*}$ serves as a proxy to the unknown distribution of y_ijk, and can be employed to assess errors in parameter estimation and imputation of missing values as described in the Algorithm 2 below.

Algorithm 2.

Algorithm for sampling from the residuals

1: Sample η_ijk independently from a standard Normal distribution, assuming no missing data;

2: Compute ε_ijk using the AR(1) model, imputing the values η_ijk as residuals;

3: Compute y_ijk using the model (3), imputing ε_ijk as residuals;

4: Determine the missing data for each biomarker with the following sampling procedure: Sample from a 2 state (not available (NA), not NA) non-homogeneous Markov chain indexed by the successive visits. The transition matrix was estimated separately for each status at baseline (N, MCI and AD) and before hand.

5: Estimate all the parameters from the simulated dataset with missing data obtained at the last step using Alg. 1. Notate

c_{k}^{(m)}

the estimated parameter c for biomarker k at iteration m.

Open in a new tab

In order to assess the quality of the parameter fitting algorithm presented in Alg. 1, we generated a large number (M = 40, 000) of simulated datasets by sampling from the residuals as follows. First, we estimated all of the parameters in the model using Alg. 1 and the ADNI dataset. Second, we computed the residuals ε_ijk from (3). Since these residuals showed correlation over time for each biomarker, we fit an autoregressive AR(1) model for the residuals of each biomarker. We then repeated the procedure described in Alg. 2 M = 40, 000 times.

3. Results

3.1. ADPS computed for ADNI subjects

The ADPS was computed for all subject visits in the combined ADNI, ADNI 2, and ADNI GO data sets (with minimal exclusions as described in Section 2.4). Results are presented in Fig. 1. Overall, Normal subjects (black) have the smallest ADPS, MCI subjects (red) have moderate ADPS, and AD subjects (green) have the largest ADPS. Lower ADPS are therefore consistent with the normal population and higher ADPS are indicative of increased presence of dementia. Those subjects whose clinical status changes from MCI to AD (blue) are found mostly between the red and green colors. The estimated sigmoidal behaviors of each biomarker are shown in gray.

3.2. Assesment of the quality of the estimation of the inflection point of each biomarker curve

The estimator of the parameter c_k for biomarker k using the ADNI dataset is denoted $c_{k}^{T}$ , where “T” stands for “Target”. Each sampling from the residuals, indexed by m, produces a simulated dataset from which an estimator of c_k is computed using Alg. 1 and is notated $c_{k}^{(m)}$ . The histograms for the data ${c_{k}^{(m)} - c_{k}^{T}}_{m \in {1; \dots; 40, 000}}$ for each biomarker k are presented in Fig 2. In comparing k these histograms, be advised that the scale of the horizontal axis is not the same for each biomarker. Recall that the parameter c_k, the inflection point of each sigmoid, records the moment in the disease progression where the biomarker k changes the most (is the most dynamic). First, consider the biomarkers HIPPO, TAU, ABETA, and RAVLT30. The mean square error for these estimators is small. Indeed, the histograms obtained by sampling from the residuals are centered close to the origin with a small standard deviation. These results give validity to the choices and the settings of the optimization algorithm. Second, in the case of MMSE and CDRSB, some bias is observed and the standard deviation is moderated. Finally, in the case of ADAS, the bias as well as the standard deviation are larger. Note that the fitted sigmoids for these three biomarkers do not level off in the later stages of the disease (see Fig 1), which might explain why the fit is less stable (large bootstrap standard deviation). One remedy to stabilize the estimation could be to constrain the extremal values, d_k and d_k + a_k. For example, the maximum value for ADAS is 70 which could be enforced during the optimization process.

Distribution of the errors in estimating the parameter c of the sigmoid (see (2)) as function of the progression time-line of the disease.

3.3. Ordering of the biomarkers

For each simulation, the parameter c_k, i.e. the inflection point of the sigmoid fitted to biomarker k, was obtained and the summary of the ordering of the c_k values for each of the 40,000 simulations is presented in Table 3. Clearly, the ordering the most prevalent is as follows: RAVLT30, HIPPO, ABETA, TAU, then (CDRSB, MMSE) in no particular order and then ADAS. This result is consistent with our results obtained in [5] where the statistical method used to assess this ordering, i.e., resampling from the subjects, was different than the method used here, i.e, sampling from the residuals. Care should be taken in interpreting this result in term of the ordering of the biomarkers. In particular, ABETA and TAU biomarkers are not well explained by the model, see Fig. 1. One can visually see that there is a large residual variance in the case of these two biomarkers by comparing the spread of the data points around the sigmoid curves in grey. As a consequence, we prefer to eliminate these biomarkers from our analysis and propose the following ordering of the biomarkers: RAVLT30, HIPPO, then (CDRSB and MMSE) in no particular order then ADAS.

Table 3.

Ordering of the biomarkers according to the location of the inflection point. For each biomarker (line), the value recorded in column j is the number of times, in percent, this biomarker has an inflection point which is the j^th smallest out of the seven biomarkers. This was computed using 40,000 independent samples. The results are truncated to the nearest tenth of a percent.

	#1	#2	#3	#4	#5	#6	#7
RAVLT30	99.7	0.3	0	0	0	0	0
HIPPO	0	99.4	0.4	0.1	0	0.1	0
ABETA	0.1	0.2	95.9	3.7	0	0	0
TAU	0.1	0.1	3.8	96	0	0	0
CDRSB	0	0	0	0.1	65.4	33.5	1
MMSE	0	0	0	0	32.3	66	1.7
ADAS	0	0	0	0	2.2	0.4	97.3

Open in a new tab

4. Discussion

Amyloid concentration in the brain is known to change very early in the disease process [17, 18]. Why is it then that we do not detect an early change of ABETA? There are several non-exclusive possible explanations. One is that ABETA is a noisy measurement of the amyloid burden, eventually contaminated by one or several physiological covariates. A complementary explanation is that there are several paths to disease within the ADNI subjects. For example, there might be MCI or even AD subjects who are impaired for reasons unrelated to Alzheimer’s (depression, tau only pathologies, vascular dementia). Recall that the first assumption in our model is that there is a unique disease progression for all the subjects. A violation of this hypothesis would result in a lack of fit for some of the biomarkers and could also modify our conclusions about the ordering of the biomarkers.

The fact that the RAVLT30 biomarker is dynamic early in the disease process is an interesting result which deserves further investigations in ADNI. Using data from the Canadian Study of Health and Aging, it was found in [16] that the RAVLT was predictive of neurodegenerative changes up to 10 years prior to diagnosis Also, early changes in the hippocampus volume might occur very early in the disease process while being too subtle to be detected with the current protocol and that progress in the acquisition and/or image processing technology might reveal these subtleties. Finally, the reader is reminded that the results were obtained for the ADNI dataset as of November 4th 2011 and do not necessarily extrapolate to a larger or different population.

5. Conclusion

We have presented a validation of a multi-biomarker, data-driven approach to assess time-dependent changes of biomarkers in AD and to localize subjects on a common scale of disease progression over the entire range of progression represented in the ADNI cohort. The sampling from the residuals analysis show that the inflection point of the biomarker sigmoid curves are well estimated for most biomarkers. Our presented model and subsequent validation argue that the following ordering of the biomarkers should be considered: RAVLT30, HIPPO, then (CDRSB and MMSE) in no particular order and lastly ADAS.

Acknowledgments

Personnel costs for this research were partially supported by a grant from Pfizer Inc. as well as from an Ossoff scholar award. Other support came from grants numbered P41EB015909 and R01EB012547 from the National Institute Of Biomedical Imaging And Bioengineering. Data collection and sharing for this project was funded by the Alzheimer’s Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: Abbott; Alzheimers Association; Alzheimers Drug Discovery Foundation; Amorfix Life Sciences Ltd.; AstraZeneca; Bayer Health-Care; BioClinica, Inc.; Biogen Idec Inc.; Bristol-Myers Squibb Company; Eisai Inc.; Elan Pharmaceuticals Inc.; Eli Lilly and Company; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; GE Healthcare; Innogenetics, N.V.; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Medpace, Inc.; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Servier; Synarc Inc.; and Takeda Pharmaceutical Company. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer’s Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of California, Los Angeles. This research was also supported by NIH grants P30 AG010129 and K01 AG030514.

Footnotes

Freesurfer version 4.4.0 for longitudinal data http://surfer.nmr.mgh.harvard.edu)

Note that we used ABETA and TAU data from files UPENNBIOMK, UPENNBIOMK2, UPENNBIOMK3 and UPENNBIOMK4. For each subject, we use the latest available file (batch), i.e. if UPENNBIOMK4 data is available for this subject, we use it. Otherwise we use UPENNBIOMK3, and so on. For each subject, all his/her ABETA and TAU data always come from the same file (batch).

See ADNI General Procedures Manual, pages 6, 7, and 8

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

1.Bühlman P. Bootstraps for time series. Statistical Science. 2002;17(1):52–72. [Google Scholar]
2.Caroli A, Frisoni G the Alzheimer’s Disease Neuroimaging Initiative. The dynamics of Alzheimer’s disease biomarkers in the alzheimer’s disease neuroimaging initiative cohort. Neurobiol Aging. 2010;31(8):1263–74. doi: 10.1016/j.neurobiolaging.2010.04.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Graham J. Missing Data: Analysis and Design. Springer Verlag; 2012. [Google Scholar]
4.Jack CR, Jr, Knopman DS, Jagust WJ, Shaw LM, Aisen PS, Weiner MW, Petersen RC, Trojanowski JQ. Hypothetical model of dynamic biomarkers of the Alzheimer’s pathological cascade. The Lancet Neurology. 2010;9(1):119–128. doi: 10.1016/S1474-4422(09)70299-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Jedynak B, Lang A, Liu B, Katz E, Zhang Y, Wyman B, Rauning D, Jedynak CP, Caffo B, Prince J. Staging for neurodegenerative diseases: Validation with the alzheimer’s disease neuroimaging initiative cohortIn preparation. 2012 doi: 10.1016/j.neuroimage.2012.07.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Levenberg K. A method for the solution of certain non-linear problems in least squares. The Quarterly of Applied Mathematic. 1944;2:164–168. [Google Scholar]
7.M C. Bootstrap Methods: A Guide for Practitioners and Researchers. Wiley; 2007. [Google Scholar]
8.Morris J, Price J. Pathologic correlates of nondemented aging, mild cognitive impairment, and early-stage Alzheimer’s disease. Journal of Molecular Neuroscience. 2001;17:101–118. doi: 10.1385/jmn:17:2:101. [DOI] [PubMed] [Google Scholar]
9.Mouiha A, Duchesne S. Toward a dynamic biomarker model in alzheimer’s disease. Journal of Alzheimers Disease. 2012;30(1):91–100. doi: 10.3233/JAD-2012-111367. [DOI] [PubMed] [Google Scholar]
10.Newsom J, Jones R, Hofer S. Aging, Health, and Social Sciences. Routledge; 2013. Longitudinal Data Analysis: A Practical Guide for Researchers. [Google Scholar]
11.Perrin RJ, Craig-Schapiro R, Malone JP, Shah AR, Gilmore P, Davis AE, Roe CM, Peskind ER, Li G, Galasko DR, Clark CM, Quinn JF, Kaye JA, Morris JC, Holtzman DM, Townsend RR, Fagan AM. Identification and validation of novel cerebrospinal fluid biomarkers for staging early Alzheimer’s disease. PLoS ONE. 2011;6(1):e16032. doi: 10.1371/journal.pone.0016032. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Politis D. The impact of bootstrap methods on time series analysis. Statistical Science. 2003;18:219–230. [Google Scholar]
13.Sabuncu MR, Desikan RS, Sepulcre J, Yeo BTT, Liu H, Schmansky NJ, Reuter M, Weiner MW, Buckner RL, Sperling RA, Fischl B ADNI. The dynamics of cortical and hippocampal atrophy in Alzheimer disease. Arch Neurol. 2011;68(8):1040–1048. doi: 10.1001/archneurol.2011.167. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Shao J, Tu D. The Jackknife and Bootstrap. springer; New York: 1995. [Google Scholar]
15.Skrondal A, Rabe-Hesketh S. Multilevel and Related Models for Longitudinal Data. Springer Verlag; 2008. [Google Scholar]
16.Tierney MC, Yao C, Kiss A, McDowell I. Neuropsychological tests accurately predict incident alzheimer disease after 5 and 10 years. Neurology. 2005;64(11):1853–1859. doi: 10.1212/01.WNL.0000163773.21794.0B. URL http://www.neurology.org/content/64/11/1853.abstract. [DOI] [PubMed] [Google Scholar]
17.Vemuri P, Wiste HJ, Weigand SD, Knopman DS, Trojanowski JQ, Shaw LM, Bernstein MA, Aisen PS, Weiner M, Petersen RC, Jack CR Alzheimer’s Disease Neuroimaging Initiative. Serial MRI and CSF biomarkers in normal aging, MCI, and AD. Neurology. 2010 Jul;75(2):143–151. doi: 10.1212/WNL.0b013e3181e7ca82. URL http://dx.doi.org/10.1212/wnl.0b013e3181e7ca82. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Villemagne V, Burnham S, Bourgeat P, Brown B, Ellis KA, Salvado O, Szoeke C, Macaulay L, Martins R, Maruff P, Ames D, Rowe C, Masters C. Amyloid-beta deposition, neurodegeneration, and cognitive decline in sporadic alzheimer’s disease: a prospective cohort study. The Lancet Neurology. 2013:357–367. doi: 10.1016/S1474-4422(13)70044-9. [DOI] [PubMed] [Google Scholar]

[R1] 1.Bühlman P. Bootstraps for time series. Statistical Science. 2002;17(1):52–72. [Google Scholar]

[R2] 2.Caroli A, Frisoni G the Alzheimer’s Disease Neuroimaging Initiative. The dynamics of Alzheimer’s disease biomarkers in the alzheimer’s disease neuroimaging initiative cohort. Neurobiol Aging. 2010;31(8):1263–74. doi: 10.1016/j.neurobiolaging.2010.04.024. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Graham J. Missing Data: Analysis and Design. Springer Verlag; 2012. [Google Scholar]

[R4] 4.Jack CR, Jr, Knopman DS, Jagust WJ, Shaw LM, Aisen PS, Weiner MW, Petersen RC, Trojanowski JQ. Hypothetical model of dynamic biomarkers of the Alzheimer’s pathological cascade. The Lancet Neurology. 2010;9(1):119–128. doi: 10.1016/S1474-4422(09)70299-6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] 5.Jedynak B, Lang A, Liu B, Katz E, Zhang Y, Wyman B, Rauning D, Jedynak CP, Caffo B, Prince J. Staging for neurodegenerative diseases: Validation with the alzheimer’s disease neuroimaging initiative cohortIn preparation. 2012 doi: 10.1016/j.neuroimage.2012.07.059. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Levenberg K. A method for the solution of certain non-linear problems in least squares. The Quarterly of Applied Mathematic. 1944;2:164–168. [Google Scholar]

[R7] 7.M C. Bootstrap Methods: A Guide for Practitioners and Researchers. Wiley; 2007. [Google Scholar]

[R8] 8.Morris J, Price J. Pathologic correlates of nondemented aging, mild cognitive impairment, and early-stage Alzheimer’s disease. Journal of Molecular Neuroscience. 2001;17:101–118. doi: 10.1385/jmn:17:2:101. [DOI] [PubMed] [Google Scholar]

[R9] 9.Mouiha A, Duchesne S. Toward a dynamic biomarker model in alzheimer’s disease. Journal of Alzheimers Disease. 2012;30(1):91–100. doi: 10.3233/JAD-2012-111367. [DOI] [PubMed] [Google Scholar]

[R10] 10.Newsom J, Jones R, Hofer S. Aging, Health, and Social Sciences. Routledge; 2013. Longitudinal Data Analysis: A Practical Guide for Researchers. [Google Scholar]

[R11] 11.Perrin RJ, Craig-Schapiro R, Malone JP, Shah AR, Gilmore P, Davis AE, Roe CM, Peskind ER, Li G, Galasko DR, Clark CM, Quinn JF, Kaye JA, Morris JC, Holtzman DM, Townsend RR, Fagan AM. Identification and validation of novel cerebrospinal fluid biomarkers for staging early Alzheimer’s disease. PLoS ONE. 2011;6(1):e16032. doi: 10.1371/journal.pone.0016032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Politis D. The impact of bootstrap methods on time series analysis. Statistical Science. 2003;18:219–230. [Google Scholar]

[R13] 13.Sabuncu MR, Desikan RS, Sepulcre J, Yeo BTT, Liu H, Schmansky NJ, Reuter M, Weiner MW, Buckner RL, Sperling RA, Fischl B ADNI. The dynamics of cortical and hippocampal atrophy in Alzheimer disease. Arch Neurol. 2011;68(8):1040–1048. doi: 10.1001/archneurol.2011.167. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] 14.Shao J, Tu D. The Jackknife and Bootstrap. springer; New York: 1995. [Google Scholar]

[R15] 15.Skrondal A, Rabe-Hesketh S. Multilevel and Related Models for Longitudinal Data. Springer Verlag; 2008. [Google Scholar]

[R16] 16.Tierney MC, Yao C, Kiss A, McDowell I. Neuropsychological tests accurately predict incident alzheimer disease after 5 and 10 years. Neurology. 2005;64(11):1853–1859. doi: 10.1212/01.WNL.0000163773.21794.0B. URL http://www.neurology.org/content/64/11/1853.abstract. [DOI] [PubMed] [Google Scholar]

[R17] 17.Vemuri P, Wiste HJ, Weigand SD, Knopman DS, Trojanowski JQ, Shaw LM, Bernstein MA, Aisen PS, Weiner M, Petersen RC, Jack CR Alzheimer’s Disease Neuroimaging Initiative. Serial MRI and CSF biomarkers in normal aging, MCI, and AD. Neurology. 2010 Jul;75(2):143–151. doi: 10.1212/WNL.0b013e3181e7ca82. URL http://dx.doi.org/10.1212/wnl.0b013e3181e7ca82. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Villemagne V, Burnham S, Bourgeat P, Brown B, Ellis KA, Salvado O, Szoeke C, Macaulay L, Martins R, Maruff P, Ames D, Rowe C, Masters C. Amyloid-beta deposition, neurodegeneration, and cognitive decline in sporadic alzheimer’s disease: a prospective cohort study. The Lancet Neurology. 2013:357–367. doi: 10.1016/S1474-4422(13)70044-9. [DOI] [PubMed] [Google Scholar]

PERMALINK

A computational method for computing an Alzheimer’s Disease Progression Score; experiments and validation with the ADNI dataset

Bruno M Jedynak

Bo Liu

Andrew Lang

Yulia Gel

Jerry L Prince

Abstract

1. Introduction

2. Method

2.1. Statistical Model

Figure 1.

2.2. Parameter Fitting

Algorithm 1.

2.3. The Alzheimer’s Disease Neuroimaging Initiative cohort

2.4. Biomarker selection

2.5. Sampling from the residuals

Table 1.

Table 2.

Algorithm 2.

3. Results

3.1. ADPS computed for ADNI subjects

3.2. Assesment of the quality of the estimation of the inflection point of each biomarker curve

Figure 2.

3.3. Ordering of the biomarkers

Table 3.

4. Discussion

5. Conclusion

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

A computational method for computing an Alzheimer’s Disease Progression Score; experiments and validation with the ADNI dataset

Bruno M Jedynak

Bo Liu

Andrew Lang

Yulia Gel

Jerry L Prince

Abstract

1. Introduction

2. Method

2.1. Statistical Model

Figure 1.

2.2. Parameter Fitting

Algorithm 1.

2.3. The Alzheimer’s Disease Neuroimaging Initiative cohort

2.4. Biomarker selection

2.5. Sampling from the residuals

Table 1.

Table 2.

Algorithm 2.

3. Results

3.1. ADPS computed for ADNI subjects

3.2. Assesment of the quality of the estimation of the inflection point of each biomarker curve

Figure 2.

3.3. Ordering of the biomarkers

Table 3.

4. Discussion

5. Conclusion

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases