Abstract
We introduce a mass-univariate framework for the analysis of whole-brain structural trajectories using longitudinal Voxel-Based Morphometry data and Bayesian inference. Our approach to developmental and aging longitudinal studies characterizes heterogeneous structural growth/decline between and within groups. In particular, we propose a probabilistic generative model that parameterizes individual and ensemble average changes in brain structure using linear mixed-effects models of age and subject-specific covariates. Model inversion uses Expectation Maximization (EM), while voxelwise (empirical) priors on the size of individual differences are estimated from the data. Bayesian inference on individual and group trajectories is realized using Posterior Probability Maps (PPM). In addition to parameter inference, the framework affords comparisons of models with varying combinations of model order for fixed and random effects using model evidence. We validate the model in simulations and real MRI data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) project. We further demonstrate how subject specific characteristics contribute to individual differences in longitudinal volume changes in healthy subjects, Mild Cognitive Impairment (MCI), and Alzheimer's Disease (AD).
Keywords: Brain morphology, Lifespan brain aging, Dementia, Longitudinal analysis, Multi-level models, Bayesian inference
Highlights
-
•
We introduce a framework for structural trajectories using Longitudinal MRI.
-
•
Bayesian inference on trajectories is realized using Posterior Probability Maps (PPM).
-
•
We validate the model in simulations and real MRI data from the ADNI project.
Introduction
Magnetic Resonance Imaging (MRI) and computational morphometry have become important tools for in-vivo analysis of changes in healthy and pathological brain development and aging (Frisoni et al., 2010; Fjell and Walhovd, 2010). One of the most exciting research questions is the nature of variability in aging brain structure (Raz et al., 2005, 2010; Raz and Rodrigue, 2006) and function (Pudas et al., 2013; Grady, 2012) observed across individuals. Most aging studies apply cross-sectional designs, providing estimates of population average, age-related, differences via pooling within cohorts (Ziegler et al., 2012a). However, exploring the large heterogeneity of true within-subject brain changes necessarily requires repeated measures and longitudinal designs (Raz and Lindenberger, 2011).
Longitudinal assessments offer significant advantages over cross-sectional studies (for an introduction see e.g. Fitzmaurice et al., 2008). A longitudinal study is more powerful for a fixed number of subjects. It permits separation of within- and between-subject variability, and helps to ameliorate confounds. Another important advantage is that in addition to providing estimates of population average brain changes it enables a characterization of systematic differences in longitudinal trajectories among individuals. This allows researchers to identify adverse as well as protective factors that may influence healthy and pathological changes in brain anatomy and function over time (see e.g. Taki et al., 2013; Thambisetty et al., 2012; Smith et al., 2010; Debette et al., 2011; den Heijer et al., 2012). Moreover, individual subjects' trajectories are promising biomarkers for early stage diagnosis (Chetelat and Baron, 2003), tracking of disease progression (Fonteijn et al., 2012; Jedynak et al., 2012; Sabuncu et al., 2014; Donohue et al., 2014; Young et al., 2014) and monitoring of potential treatments (Douaud et al., 2013).
Crucially, longitudinal MR-based morphometry is prone to artifacts due to scanner inhomogeneities, registration inconsistency, and subtle scanner-positioning or hydration-related deformations of the brains (Schnaudigel et al., 2010; Littmann et al., 2006; Kempton et al., 2009). Sophisticated within-subject registration pipelines have been introduced recently to parameterize structural changes in an unbiased fashion (Ashburner and Ridgway, 2013; Leung et al., 2012; Lorenzi and Pennec, 2013; Holland et al., 2011; Reuter et al., 2010, 2012).
An essential difference between longitudinal and cross-sectional analysis lies in the modeling assumptions about each individual. With a single observation per subject one has to assume the process of interest is identical across subjects (using fixed-effects assumptions). In contrast, longitudinal designs allow one to parameterize individual variations in the process by including random effects (or random coefficients). Modeling repeated measurements of behavior is well established in psychology and psychometry (for review see McArdle, 2009). In the last decade, there has been a growing interest in applications of mixed-effects models in the context of neuroimaging of development (Shaw et al., 2006, 2008; Raznahan et al., 2011a,b, 2014; Schumann et al., 2010) and aging neuroscience (Lerch et al., 2005; Lau et al., 2008; Carmichael et al., 2010). Additional articles focus specifically on methods for analysis of longitudinal MRI (Resnick et al., 2000; Chan et al., 2003; Frost et al., 2004; Bernal-Rusiel et al., 2012) and voxel-wise or vertex-wise longitudinal modeling (Guillaume et al., 2014; Li et al., 2013; Skup et al., 2012; Chen et al., 2013; Bernal-Rusiel et al., 2013).
Bayesian inference has been successfully applied to functional brain scans in multiple domains, ranging from general linear models, group analysis, spatial models, analysis of connectivity, to model comparisons (for extensive review see Woolrich, 2012). Bayesian inference typically exploits hierarchical observation models that take into account different levels of observations (e.g. scans and subjects), allows for the inclusion of biologically informed prior-beliefs about parameters, and affords comparisons among competing (nested or non-nested) models. Bayesian treatment of whole-brain neuroimaging data might also increase the sensitivity by finessing the problem of multiple comparison (Friston and Penny, 2003; Schwartzman et al., 2009). In contrast to classical inference, it also enables the assessment of evidence in favor of the null hypothesis; i.e., no aging-related change or preservation of structural integrity. These issues speak to a Bayesian framework for modeling structural change trajectories. However, there are currently only a few existing studies that consider longitudinal structural MRI (Schmid et al., 2009; Chen et al., 2012).
Here, we propose a generic modeling framework for longitudinal morphometric brain changes in development and aging studies. After diffeomorphic registration on the within-subject (Ashburner and Ridgway, 2013) and between-subject (Ashburner and Friston, 2011) level, we build a generative linear mixed-effects model of repeated observations. The model inversion flexibly accommodates unbalanced and sparse designs with potentially different numbers of follow up scans per subject. Using Expectation Maximization (EM) we obtain voxelwise individual and group level change parameters and compute Posterior Probability Maps (PPM) (Friston and Penny, 2003) for inference about regionally specific effects. In other words, we focus on making regionally specific inferences about longitudinal changes in anatomy, that properly account for both within and between subject variability in neurodevelopmental trajectories.
We validate the model using simulated data and a large MRI sample from the ADNI cohort. We then demonstrate a parametric analysis of subject specific covariates and explore the model space to optimize explanations of individual trajectory differences.
Methods
In this section, we introduce a generative model of local structural trajectories using random and fixed effects; i.e., a mixed effect, hierarchical or multilevel model. We describe the Bayesian formulation, the implicit (empirical) prior covariance components and their estimation using EM. We extend this framework to modeling of trajectories over multiple groups and review the use of PPMs for inference on model parameters. We conclude this section with a treatment of Bayesian model selection of ensemble trajectory models.
A generative model of local structural trajectories
The model for age-related changes of local brain structure (per voxel or region) is based upon the following generative model, which comprises a likelihood and prior. The model is an application of the Bayesian linear hierarchical observation framework introduced by Friston et al. (2002a) (for application in the context of fMRI see also Friston et al., 2002b).
We here consider the special case of a two level model, one for individual structural trajectories and a second level for an ensemble of trajectories, denoted by ε. The first level likelihood model is based on the assumption that the trajectory of underlying volumetric changes is sampled from subject-specific functions of age or time
(1) |
where the measurement yij is the j-th of mi observations (e.g. of gray matter density at a single voxel) obtained from the i-th of N subjects at age tij, and ϵij(1) denotes an i.i.d. Gaussian measurement error with variance σ2. In what follows we use time centered tij in order to develop trajectories around the reference age, i.e. tr, which typically is chosen as the mean age of the sample. Individual differences of trajectories are thus encoded by subject-specific change parameters θi(1) resulting in an ensemble of age-related trajectories ε = {g(t, θi(1))}i = 1N for a sample of individuals. In particular, we parameterize the function describing the trajectory using a D degree polynomial expansion of age
(2) |
with coefficients θi(1) = [θ1,i(1), …, θD + 1,i(1)]T. For example, for D = 2 we have 3 coefficients per subject, encoding the intercept, slope and quadratic terms. We can easily write these linear models using compact matrix notation with individual design matrices and change parameters as gi = Xi(1)θi(1). Then, the model for all subjects follows
(3) |
(4) |
with subject i's observations , M = ∑ mi concatenated observations y, first level design matrix X(1), concatenated change parameters θ(1), and first level Gaussian errors ϵ(1). Vectorizing observations yij in ‘person-scan’ format, i.e. the successive scans are grouped by subjects (all from subject 1, all from subject 2, etc.), is a natural way to arrange longitudinal data with missing scans and varying number of follow ups. This additionally simplifies the structure of the first level design matrix, which then takes a block-diagonal form. Note, that this first level model explicitly accommodates unbalanced designs, i.e. Xi(1) ≠ Xj(1), with varying ages and numbers of scans per subject.
The sample change parameters of the trajectory functions are determined by (primarily non-age-dependent) subject specific effects. Note that these second level regressors can be chosen to model covariates of interest, e.g. IQ scores, genetic markers, or symptom severity, as well as purely confounding variables, e.g. global brain parameters. These measures are summarized in a centered N × R between-subject covariates matrix Z with entries zir. For example, in the results section below, we use a genetic risk score as a covariate of interest and test to see how this predicts first level parameters. Now, we adopt the following linear second level model
(5) |
(6) |
with D + 1 dimensional identity matrix I, second level design matrix X(2), concatenated parameters θ(2), and zero mean multivariate Gaussian errors ϵ(2) respectively (for distributions details see also the section on Covariance component specification). Note, that we can further simplify the structure of the design matrix by writing it as a Kronecker product [[1NZ] ⊗ ID + 1] using N dimensional column vector of ones 1N. Although one could choose a separate set of covariates for each trajectory parameter, we here consider the common exploratory situation where one is interested in potential effects of a small set of covariates on all trajectory properties, i.e. intercept, slope, etc.
Due to the particular choice of a column of ones in the second level design, it follows that θ1(2) parameterizes the sample average change in terms of a mean trajectory, which is the expectation for every subject's trajectory parameters after accounting for covariate effects. The remaining second level parameters θ2(2), …, θR + 1(2) become the coefficients of each covariate's contribution to individual trajectory differences.
Combining fixed and random effects
The above model with degree zero might be referred to as the random intercept model without slope. Using this model in the context of longitudinal MRI assumes variability of structure across subjects but no changes over time. If we chose model degree one, the model now includes a random slope parameter for every subject. One might argue that the first (or higher) degree(s) can also enter as fixed (as opposed to random effects); e.g., assuming the same rate of change (or quadratic effect) for all subjects. The above framework naturally extends to modeling these additional fixed effects of degree d by appending column vectors xfd with entries to the first level design matrix . In this case we need to extend first level parameters accordingly, i.e. θ(1) = [θ1(1), …, θN(1), θf(1)]. In presence of these fixed effects the second level design follows as
(7) |
If we now constrain the second level errors for fixed effects parameters to be zero, we can perform second level group inference for random and fixed effects parameters in a similar way (as will be shown in the Bayesian perspective section). In what follows we use D to denote the degree of random effects and Df for the degree of fixed effects. Please note that entering fixed effects, in addition to random effects of the same degree, would result in redundant parameters for the average trajectory. Thus, one might prefer only using additional fixed effects with higher degrees Df > D. This parametrization of fixed and random effects is motivated by our hierarchical formulation of the model and might slightly differ from standard mixed-effects textbooks.
Covariance component specification
In order to estimate the above model, we need to fully specify all covariance constraints for first and second level errors, further denoted with Cϵ(1) and Cϵ(2) respectively. Given an unknown covariance structure C we use a small set of covariance basis functions Qk and estimate the corresponding coefficients or hyperparameters λk
(8) |
More generally, this can be motivated by a first-order Taylor expansion of the covariances with respect to their hyperparameters (for details see e.g. Friston et al., 2002a). This idea will be now outlined for all covariance components of the above model. We begin with a single group design, and extend this to modeling of multiple groups in later sections. It is important to note that the covariance could be specified being linear in the hyperparameters (as seen in Eq. (8)); however, this does not preclude negative definite covariances (Harville, 1977). In contrast to optimizing linear coefficients λ, in what follows, we optimize log-covariance parameters, ie. eλ. This forces the hyperparameters to be positive and at the same time increases the stability of the subsequent optimization scheme.
In particular, as mentioned above, we specify the first level error covariance using an isotropic noise model
(9) |
with IM denoting an identity matrix and noise variance σ2. This models unstructured errors of measurement, e.g. due to MRI noise and random errors or minor inaccuracies during preprocessing. Furthermore we recall that every subject is fully described by its parameter vector θi(1). Considering the population, however, there is unknown variability of individual parameters across subjects, which is either explicitly modeled by covariates (or group structures) in design matrix X(2) or captured by the second level error covariance Cϵ(2). The unexplained individual differences might differentially affect all trajectory coefficients and thus (at least) one further hyperparameter for each of the trajectory parameters is required. We therefore use λ1, λ2, etc. to describe unexplained individual differences of intercept, slope etc. For that purpose we use Ri to denote the covariance matrix of residual parameter vectors Cov(ϵi) and we suppose
(10) |
Typically, having only very sparse observations in longitudinal MRI designs prevents us from estimating Ri on the individual level. For reasons of identifiability in a wide range of designs, we therefore assume the same residual covariance across all subjects, i.e. Ri = R. The full second level error covariance can be therefore specified as follows
(11) |
where covariance basis functions Qd can be efficiently implemented exploiting the Kronecker product. Taken together [σ2, λ1, …, λD + 1] fully parameterize the covariance components of the model in its simples form; resulting, e.g. in three voxelwise hyperparameters for single ensembles of linear trajectories. Please note that the above framework nicely extends to more complex models, e.g. with first level covariates and correlated residuals at the second level.
Finally, we finesse the covariance components to account for any fixed effects as discussed in the section on Combining fixed and random effects. This means we consider the case when the degree of fixed effects exceeds the degree of random effects and we apply extended design matrices and parameters (Eq. (7)). In order to perform similar inference for second level fixed effects parameters like group average parameters of random effects we enforce identity of first and second level fixed effects parameters, i.e. θf(1) = θf(2). This can be easily implemented by choosing a hyperparameter of second level fixed effects errors with a very small variance, i.e. with e.g. σf2 = e− 32.
Bayesian perspective
We now explore the Bayesian perspective on the above model for an ensemble of trajectories (defined by Eqs. (4) and (6)). A key aspect of this formulation is that we can consider the second level to furnish an empirical prior for the first level parameters, as follows
(12) |
(13) |
with error covariances Cϵ(k), k = 1, 2. The first level error covariance corresponds to measurement noise.
Finally, we assume second level priors on the ensemble change parameters. At the end of this section we will briefly discuss promising choices of priors which might be relevant for potential applications:
(14) |
The hierarchical structure of the trajectory model implies that the joint probability factorizes as
(15) |
rendering the data conditionally independent of the second level parameters given the first level parameters (Bishop, 2006).
In this framework, hierarchical model inversion corresponds to estimating covariance components Cϵ(1), Cϵ(2) and Cθ(2) respectively. For this purpose, the model can be further rearranged in a non-hierarchical form (see also Friston et al., 2002a)
(16) |
Exploiting the Bayesian perspective, we treat the second level errors as additional model parameters, which will be estimated in subsequent steps.
To ensure all covariance components are evaluated simultaneously, we further augment the model by adding rows that correspond to the prior expectation E[ϵ(2)] = 0 and E[θ(2)] = ηθ(2) respectively
(17) |
(18) |
with augmented data , augmented design and parameters θ and augmented errors . Note that in contrast to the models considered above the augmented error contains all covariance components of the two-level model. One further benefit of augmentation is that it allows formulating the Gaussian likelihood and prior of the ensemble trajectories in a pleasingly compact form
(19) |
(20) |
with expectation and covariance components
(21) |
Longitudinal MRI studies of healthy and pathological development rest on inferences about first or second level parameters of the above model. The posterior density over parameters, given a particular sample of observations, is also Gaussian and can be written using the compact Gauss–Markov form
(22) |
(23) |
(24) |
The ensuing model inversion can be performed in a fully Bayesian way, i.e. using an informative prior on top level parameters; i.e., with given ηθ(2) and Cϵ(2). These prior distributions can be specified based on expectations from the literature or as suggested in Friston and Penny (2003) one might apply empirically derived prior distributions using the data at hand, e.g. obtained from a pooled covariance estimate. Moreover, if one does not have explicit prior assumptions about the local patterns of change, one can treat these parameters as unknown, thus using uninformative priors.
In this particular study we apply uninformative or flat priors with Cθ(2) = ∞ (or equivalently (Cθ(2))− 1 = 0), with the prior expectation ηθ(2) set to zero. In order to obtain the posterior over all trajectory parameters, we estimate the covariance components using an EM scheme. As described above, the top level prior covariance is unknown, realized by setting it to an arbitrarily high value, in particular we choose Cθ(2) = e32I. A simple illustration of the applied model is shown in Fig. 1.
Model estimation using Expectation Maximization
As proposed by Friston et al. (2002a) we adapt an EM algorithm (Dempster et al., 1977) to obtain all covariance components and the posterior of the change parameters. EM iteratively refines a lower bound F on the log-likelihood of the data given the hyperparameters, i.e. ln p(y|λ) ≥ F(q(θ), λ), where q(θ) is any distribution of the change parameters. Using iterative alternation between E and M steps (see later), one performs a coordinate ascent on F, and thus implicitly increases the log-likelihood.
E-Step
Under the above Gaussian assumptions, each E-step maximizes F(q(θ), λ) with respect to the distribution q(θ). Here, this simply corresponds to obtaining sufficient statistics for the posterior of the parameters, i.e. F is maximized by q(θ) = p(θ|y, λ). Using the covariance parametrization of the augmented model and Eqs. (23) and (24) the posterior is given by
(25) |
(26) |
(27) |
M-Step
Here, we optimize F(q(θ), λ) with respect to the covariance hyperparameters, in a maximum likelihood sense, using the posterior distribution obtained during the preceding E-step. In particular, F during the M-step is given by
(28) |
with residuals (for an exact derivation see Friston et al., 2002a). The first term decreases F with a larger number and size of hyperparameters, while the second term increases F with smaller precision weighted residuals corresponding to a better model fit.
Note also, that during the M-step the posterior covariance Cθ|y is a fixed result from the preceding E-step, while Cϵ = Cϵ(λ) depends on the hyperparameters and will be optimized. Thus in general the third term of F is not the trace of an identity matrix. The last term, which stems from the entropy of the distribution over change parameters q(θ), can be neglected, because it does not depend on the hyperparameters.
To update the hyper parameters we adopt a Fisher scoring algorithm, using the first derivative (or gradient) g and the expected second partial derivatives (or Fisher's Information matrix) H:
(29) |
(30) |
(31) |
(32) |
The updated hyperparameters re-enter into the estimation of the posterior in the next E-step.2 Finally, after appropriate initialization of the hyperparameters λ, the full algorithm alternates between the E- and M-steps until convergence.
Multiple groups
Longitudinal studies of development and aging often aim at inference about differences among average population trajectories. Typically, this involves comparing change rates (or slope differences) in healthy vs. pathological development, specific treatment conditions, or groups following specific lifestyle patterns. Although the ongoing structural change is well characterized by the slope parameters, the current framework also supports inference about other aspects of trajectory shape; e.g., intercepts or higher order non-linearities.
We therefore generalize the above model to situations where one observes M scans of N = N1 + N2 + … + NG subjects, who are individuals from G different populations with (mainly non-age dependent) subject-specific covariates Z1, …, ZG. For example, we might consider three groups of subjects in the ADNI dataset, including controls, MCI and AD, with Mini Mental State Examination scores as covariates. The subsamples are further used to estimate independent ensembles of trajectories ε1, …, εG using the same trajectory parametrization (Eq. (4)), e.g. quadratic curves. The assumptions about the group structure of trajectories can be realized by modifying the second level design matrix (in Eq. (6)) appropriately
(33) |
In addition to allowing for different average trajectories in different groups, the amount of individual differences within each ensemble εg might also differ across populations. This is easily achieved by adapting the model of the second level covariance components to include independent hyperparameters for each group.
We suppose the covariance structure of second level residuals to be Cov(ϵi(2)) = Rg for subject i from group g. We again exploit diagonal covariance basis functions Qd (see the section on Covariance component specification) to parameterize the variability of all change parameters in all groups resulting in G(D + 1) hyperparameters for the second level model
(34) |
Note that one could include fixed effects of time or age. In many practical applications these would enter as group specific fixed effects for each group and trajectory parameter. Finally, having specified a single or multi-group trajectory model, the estimation of parameters and covariance components proceeds using EM as described above.
Inference about group differences and analysis of individual differences of change
To facilitate practical applications to longitudinal MRI studies, we also need to consider Bayesian inference about population differences and subject specific covariate effects on individual trajectories. These effects can be characterized using the usual approach of defining contrasts for linear models as commonly used in Statistical Parametric Mapping (SPM) (Friston et al., 1995). In particular, single contrast vectors are used to specify a single hypothesis about first or second level change parameters. For example, let us suppose a design with linear trajectories (first level) and two groups and no covariates (second level). If we use contrast vector c = [0, 1, 0, − 1]T, then cTθ(2) = 0 tests the (null) hypothesis that the rate of change (slope) in group one is equal to the slope in group two. Moreover, multiple contrast vectors can be used to specify compound hypotheses. If c1 = [1, 0, − 1, 0]T, and c2 = [0, 1, 0, − 1]T then [c1, c2]Tθ(2) = 0 assumes both intercepts and slopes to be same across groups.
PPMs were introduced for Bayesian inference on mass-univariate general linear models used in neuroimaging (Friston and Penny, 2003). When applying PPMs, one is often interested in the probability of linear contrasts c = cTθ(2) exceeding a certain threshold, e.g. γ = 0. One can additionally specify a nonzero probability threshold, typically pt = 0.95. We are now in a position to construct PPMs for Bayesian inference on arbitrary trajectory parameter contrasts by voxelwise evaluation of the posterior
(35) |
with the cumulative density function of the unit normal distribution ϕ. Similarly, this framework affords comparison of structural change of single individuals using the first level individual change parameters θ(1).
Comparison of different trajectory models
The above framework for individual trajectory estimation requires an a-priori assumption about the polynomial order of random or fixed effects. Generally, comparing various trajectory models corresponds to the evaluation of competing hypotheses about trajectories in development, aging and pathology or about nonlinear changes during the lifespan. One can also use model comparison to test for differences among groups, e.g. H0: all subjects in same group vs. H1: subjects in control, MCI and AD groups. Crucially, one can use Bayesian model comparison to optimize aspects of the models about which one is uncertain such as the degree order of the polynomial above. Practically, Bayesian model comparison rests upon model evidence that is approximated by the free energy obtained from EM. This (lower bound) approximation to log model evidence is used to monitor convergence during parameter estimation of any particular model and optimize the model per se.
Bayesian model comparison has been suggested as a principled approach for inference about nested and non-nested models of neuroimaging data (Penny et al., 2004; Penny, 2012). Assuming the same prior probability for both model orders, different orders can compared using the difference in free energy or log evidence. This corresponds to the log Bayes factor (Kass and Raftery, 1995). Local voxelwise evaluation of this probability ratio compares model evidences of models with different degrees
(36) |
or models with and without some additional fixed effects. Observing BF > 1 in the above example indicates that it is more likely that individual differences of change are better captured by order i compared to j.
Questions about model order can be addressed flexibly using nested model comparisons. Two models are nested when the smaller (e.g. linear) model is obtained by setting some parameters of the larger (e.g. quadratic) model to zero. Note however, that this comparison of model evidence naturally extends to non-nested models, e.g. comparing two models with two different sets of covariates.
Summary of methods
In summary, we propose a hierarchical generative model to infer families of (nonlinear) trajectories reported by longitudinal changes in local brain volumes (or tissue densities). The key aspect of this model is its hierarchical structure, wherein the first (within-subject) level accommodates longitudinal effects whose trajectory depends upon group average parameters at the second (between-subject) level. Crucially, this level includes differences in subjects that may be of interest; for example, group differences or diagnosis, behavior or genetic variables (see later). Alternatively, second level effects may be considered as confounds; for example, the age of a subject (e.g., at baseline), their gender, or brain/head size (Barnes et al., 2010). By modeling nonlinear trajectories in this fashion, one can easily accommodate unbalanced designs, while exploiting the efficiency of mixed-effects inference and associated parameter estimates.
Results
Validation using simulated structural trajectories
In what follows, we address the face validity of the above approach using simulated data generated by the model with linear trajectories drawn from the range of design and (hyper-) parameter specifications typical of longitudinal MRI and VBM preprocessing. The simulated data were entered into EM to compare parameter estimates with the ground truth. This basically establishes that the model inversion can recover veridical parameter estimates. This validation procedure followed two steps.
Firstly, simulation of an ensemble of trajectories corresponding to a set of parameters θ(1) with specified average and individual trajectory differences. In the above generative model, this corresponds to the case of having only a column of ones in the second level design. Second level average change parameters were fixed to θ(2) = [1.2, − 5 ⋅ 10− 3]T, i.e. the mean intercept is 1.2 and mean slope is − 5 ⋅ 10− 3. No subject covariates were included in these simulations. To evaluate model performance in different contexts, the individual differences of the intercept and the slope, i.e. [λ1, λ2], were either assumed to be large [10− 2, 10− 4] or small [10− 4, 10− 6] respectively. Illustrations of simulated trajectories are shown in Fig. 2A.
Secondly, performing longitudinal MRI acquisition is equivalent to sparse temporal sampling of the unknown ground truth trajectories. The sampling process is specified by the first level design matrix. However, longitudinal MRI studies might vary substantially with respect to two main design characteristics. Designs can be more or less balanced with respect to age and differ with respect to the number of follow up measures per subject, i.e. more or less sparse. The simulation of MRI sampling and other design factors are illustrated in Fig. 2B.
Fig. 3 shows the root mean squared error of the first and second level intercept and slope parameters comparing the ground truth and the model estimations.
In general the change parameter estimates obtained from EM were found to be highly accurate, supporting the validity of the proposed method for different designs. As expected for a hierarchical model, the second level (group) parameter estimates were generally closer to the ground truth than first level (individual) change parameters. In our simulations, higher noise levels (or first level errors) primarily impaired first level parameter estimation accuracy.
To a minor extent, the first level noise also significantly affected the second level slope estimates, especially in sparse balanced designs. Similarly, larger individual differences (or second level errors) were found to increase estimation errors of the second level. Interestingly, larger individual differences also resulted in increased first level parameter errors, especially for less balanced designs.
We further found that having fewer follow up scans (or higher sparsity) in longitudinal designs broadly compromises individual and group level parameter estimates. Sparsity particularly affected all first level parameters in balanced and less balanced designs and the second level slope estimates; especially in balanced designs with more observational noise. In contrast, using more or less balanced designs had differential effects on estimation accuracy. Trajectory intercept errors were increased by more balanced designs, while slope estimates seemed at least in part to be improved.
As our model is based on assumptions about Gaussian distributions, the model inversion and inference might be affected by any violation of this assumption. A second set of simulations was conducted to test the validity of our model inversion in the presence of non-Gaussian error distributions (Fig. 4). We explicitly manipulated skewness and kurtosis of the first and second level errors and assessed the stability and accuracy of group trajectory rate of change (slope) and the corresponding variability hyperparameter, i.e. λ2. Interestingly, we observed that rates of change in terms of group slope parameters were highly accurately reconstructed over a wide range of non-Gaussian distributions. Therefore, the main model parameter seemed to be almost completely unbiased. Furthermore, our results show a slightly biased estimation of the hyperparameters under higher values of skewness and especially large values of kurtosis, i.e. peaked or super-Gaussian distributions. However, in our experience, strongly super-Gaussian data is rather unlikely in deformation based morphometric features, while more often slightly skewed data due to modulations from jacobian determinants is observed. Additionally, given the empirical results, the posterior uncertainty was more strongly affected by the total variance differences of first and second level errors than by presence of significant higher central moments.
Finally, another possibility for evaluation and validation of our approach was used. We compared the linear Bayesian mixed-effects model to a simple summary statistic approach. The latter is generally valid if the design is balanced across subjects. That means that in this case the summary statistic approach should perform optimally, so we tested if our approach provides comparable results in this ideal scenario. As illustrated in Fig. 5, this idea was confirmed using a simulation framework with balanced and not age-balanced designs additionally varying the error variances. Our approach performed similarly to summary statistics for balanced designs over a wide variety of first and second level error variances. We also observed that Bayesian mixed-effects models appeared more powerful than summary statistics when the latter is expected to be sub-optimal, i.e. in unbalanced designs. A similar result was obtained comparing balanced designs and varying timing of observations on the within-subject level (not shown).
Validation using real MRI data
Sample
In a second validation analysis, we provide a provisional assessment in terms of predictive validity by seeing if we could detect group differences (that we assume to be present). In this instance, we analyzed empirical data: The Bayesian mixed-effects models were applied and validated with a large longitudinal sample of healthy and pathological aging from the Alzheimer's Disease Neuroimaging Initiative (ADNI, http://www.adni-info.org)3 (see also Mueller et al., 2005).
We analyzed a subsample of the ADNI1 stage of the study, focusing on T1-weighted images acquired on 1.5 T scanners. After downloading and preprocessing 2397 scans of 474 participants, we excluded 39 subjects with 127 scans (due to substantial artifacts appearing in quality checks and errors during preprocessing). Apart from image sequence and preprocessing parameters (see also image preprocessing section), we did not apply any additional inclusion criteria.
The analyzed sample contained 2146 scans of 435 subjects, 181/254 female/male, ages 56.5–91.1, mean 76.4, std 6.7 years). The sample contains 10, 16, 31, 126, 113, 94, 43 and 2 subjects with ages 56–60, 60–65, 65–70, 70–75, 75–80, 80–85, 85–90 and 90–92 years respectively.
According to ADNI diagnostic criteria, the sample contained 688 scans of 140 healthy elderly subjects (further denoted as NO), 552 scans of 108 subjects with stable diagnosis of MCI during the whole ADNI study (denoted as sMCI), 530 scans of 92 subjects converting from original MCI diagnosis at baseline to AD during the ADNI study (pMCI), and 376 scans of 95 patients of patients diagnosed with AD.
The sample is less balanced with respect to age and the number of MRI acquisitions per subjects varies from 1 to 9 with 4.93 scans per subject on average. There were 34, 131, 122, 119 and 28 subjects having ≤ 3, 4, 5, 6 and ≥ 7 scans respectively. Most MRI acquisitions were performed at baseline or 6, 12, 18, 24, 36, 48, and 60 months of the within subject study time. The sample maps within subject healthy and pathological aging from 17, 32, 126, 218, 40 and 2 elderly subjects over 0–1, 1–2, 2–3, 3–4, 4–5 and 5–6 years respectively. A more detailed description of the ADNI study design and sample selection procedures can be found at http://adni.loni.usc.edu/data-samples/.
Symmetric diffeomorphic registration and image preprocessing
ADNI provides preprocessed T1-weighted images that have undergone specific correction steps to reduce scanner induced biases. To reduce these influences and minimize effects due to heterogeneity of protocols, all included images were chosen to match the MPRAGE with Gradwarp, B1 correction and N3 specification (see http://adni.loni.usc.edu/methods/mri-analysis/mri-pre-processing/). For further details about the applied ADNI MRI protocols please see http://adni.loni.usc.edu/methods/documents/mri-protocols/.
All further preprocessing steps were performed in SPM12b r6080 (Wellcome Trust Centre for Neuroimaging, London, UK, http://www.fil.ion.ucl.ac.uk/spm). Because longitudinal MR-based morphometry is particularly prone to artifacts due to scanner inhomogeneities, registration inconsistency, and subtle age-related deformations of the brains, it requires sophisticated preprocessing pipelines in order to detect the changes of interest and achieve unbiased results (Ashburner and Ridgway, 2013; Reuter and Fischl, 2011).
Thus, at first we applied the symmetric diffeomorphic registration for longitudinal MRI (Ashburner and Ridgway, 2013). In particular, this rests on a intra-subject modeling framework that combines non-linear diffeomorphic and rigid-body registration and further corrects for intensity inhomogeneity artifacts. The optimization is realized in a single generative model and provides internally consistent estimates of within-subject brain deformations during the study period. The registration model creates an average T1-image for each subject and the corresponding deformation fields for every individual scan.
Second, we applied SPM12b's unified segmentation to each subject's average T1-image, which assumes every voxel to be drawn from an unknown mixture of six distinct tissue classes: gray matter (GM), white matter (WM), and cerebrospinal fluid (CSF), bone, other tissue and air (see also Ashburner and Friston, 2005).
Third, all voxels within-subject average tissue maps were multiplied by the Jacobian determinants from the above longitudinal registration. Note, that this within-subject modulation is expected to encode all local individual volume changes during the study period.
Fourthly, nonlinear template generation and image registration was performed on the individual average GM and WM tissue maps using a geodesic shooting procedure (Ashburner and Friston, 2011). This defined the template space for all subsequent mixed-modeling steps.
Fifthly, the within-subject modulated (native space) segment images were subsequently deformed to this study template space. Note that only within- but no between-subject modulation was applied. We further quality checked the ensuing images manually and using covariance-based inhomogeneity measures as implemented in the VBM8 toolbox for SPM (http://www.neuro.uni-jena.de/vbm/).
Finally, images were smoothed using Gaussian kernels of 4 mm full width at half maximum. Subsequent modeling and analysis was performed for all tissue classes within corresponding binary masks. The masks were defined by a voxelwise sample mean of GM, WM and CSF tissue maps exceeding an absolute threshold of 0.1, 0.4, and 0.2 respectively.
All mixed-effects modeling steps were performed on 1.5 mm resolution images of ADNI subsamples using the above steps. The resulting images are assumed to reflect age-related effects, as well as healthy and pathological individual variability in terms of fine-grained maps of local gray matter (GMV), white matter (WMV) and cerebrospinal fluid volume (CSFV) content.
Computation time
Mass-univariate EM for Bayesian mixed-effects model inversion is computationally expensive. Single voxel computation time was found to depend on number of subjects, scans, groups, polynomial model degree and number of covariates. Subgroup models using linear trajectories (N = 60, M = 300, G = 1, D = 1, R = 1) took 4 h for local estimation in whole brain gray matter regions (0.05 s per voxel) on a desktop machine (6.5 CentOS Linux, Intel Xeon CPU, 3.20GHz, 12GB, Matlab R2013b). Using large sample data like ADNI with many hyperparameters, a single voxel inversion can take up to 30 s. However, mass-univariate estimation lends itself nicely to parallel computation. Using cluster computing facilities most model estimations were achieved within 1–3 days.
Normal aging and comparison of clinical group trajectories
First, we characterized trajectories in normal aging subjects. Fig. 6 shows PPMs of linear (i.e. slope) coefficients of the ensemble average trajectory in our normal aging group. In particular the PPMs indicated widespread decline of local volumes in GM and WM regions and substantial growth of CSF volume in the ventricles and sulcal regions. Using this sensitive longitudinal design, almost all regions were found to be affected by aging. Although the presented framework exploits linear mixed-effects models, one can explore nonlinear age-related effects by inclusion of quadratic terms and model. Assuming a quadratic model for every subject, we observe accelerated volume loss within many regions from all lobes. Most prominent accelerations were found in temporal GM and even more evident in the expansion of the lateral ventricle.
To further validate our model, we next compared local structural trajectories in clinical groups of the ADNI sample. Fig. 7 shows the PPMs of slope comparisons of the sMCI, pMCI and AD groups against the NO group. The comparisons of clinical and normal aging groups clearly indicate a region specific, temporo-parietal pattern of increased rates of atrophy in GM and WM volumes. This pattern is complemented by an increased rate of ventricle expansion in the disease groups. Groups that develop a full AD pathology (pMCI and AD) also show more negative rates of atrophy in frontal, occipital and cerebellar regions. Additionally to the more widespread spatial extent of the pathology in pMCI and AD compared to sMCI groups, the average rate of volume loss in terms of slope differences indicates a faster decline of regional temporo-parietal volume.
This conversion effect can be seen by evaluation of slope differences in pMCI and sMCI groups (Fig. 8). According to our sample, the conversion from MCI to AD at some point during the study also seems to be reflected in differential rates of local brain volume changes. Due to limitations of space, we restrict our presentation of comparisons to second level slope parameters. It is worth mentioning that the model supports similar comparisons for the trajectory intercepts, which mainly reflect existing differences before the study, as opposed to ongoing changes of brain structure during the study. Three examples of individual structural trajectories are shown in Fig. (9).
Examples of subject level and group level trajectories in NO, sMCI, and pMCI groups are displayed in Fig. 10. As expected for a hierarchical model, the posterior trajectory precision (or inverse variance) is found to be much smaller for the group level compared to the individual level. Ensemble trajectory estimates in groups are more precise and inference therefore more sensitive for detecting developmental differences.
Analysis of individual differences of trajectories
In contrast to typical cross-sectional MRI studies of brain development and aging, individual trajectory models, based on repeated measures MRI, also afford analysis of within-subject change variability. A strength of our approach is that we can explore effects of risk or protective factors on ongoing structural decline. This could involve lifestyle parameters, genetic profiles, cognitive test scores etc. or any stable between-subject variable of interest.
To demonstrate the potential of this method, we focused on explaining variability of local rates of atrophy based on the E4 allele of the Apolipoprotein gene (further denoted as ApoE4), an established risk factor for increased lifetime prevalence of AD. We define this score as the number of ApoE4 alleles of an individual, which can have either zero, one or two copies. This risk score was entered as a predictor Z (in Eq. (6)) for slope variability. Fig. 11 shows the PPMs for voxels showing steeper decline of GMV (or growth of CSFV) with higher ApoE4 risk in the group of NO and sMCI.
We observed localized effects indicating faster volume loss in anterior medial temporal lobe regions and lateral ventricle growth in normal subjects with higher ApoE4 risk scores. More widespread effects were found in temporo-parieto-frontal GM regions of stable MCI subjects. In addition to the above group differences of change, these results demonstrate the sensitivity of our method for analysis of additional within-group heterogeneity of change.
Comparing models of different degrees
Here we demonstrate examples for evidence-based model comparison within our generative trajectory modeling framework. There are many questions in the context of longitudinal MRI studies that can be elegantly framed in terms of model comparisons.
Firstly, one might aim at inference about different parametrizations, particularly the choice of a certain polynomial model degree of random and fixed effects of the trajectory models, i.e. the choice of [D, Df]. This is crucial in light of evidence for nonlinearities in brain maturation (Shaw et al., 2008), accelerated gray matter loss in healthy aging (Fjell et al., 2013) and other nonlinearities in clinical groups (Leung et al., 2013).
Secondly, one might also be interested in comparing generative models using different sets of covariates, e.g. by including informative predictors for individual differences of change. Fig. 12 gives an example of log Bayes factors for linear and second order models obtained from independent EM optimization for each model. Bayes factors in our normal aging group clearly favor a linear random effects model over alternative models in most gray matter regions. Introducing age as a fixed effect increased model evidence. Model evidence was further improved by allowing for random slope variability in most gray matter regions, especially in medial temporal lobe regions. According to the same comparison, individual differences among structural brain changes are most pronounced in the lateral ventricle regions.
Interestingly, parts of the ventricles exhibited further increased model evidence by additional inclusion of random quadratic growth effects. This was found to be emphasized for the lateral ventricle which in parts borders on the hippocampus. We further evaluated models with all combinations of fixed and random effects up to second order.
The overall winning model in most gray matter includes random effects for intercepts and slopes. Exceptions were found in right temporo-parietal and postcentral gray matter regions, and in left inferior frontal gyrus. Here and within parts of the lateral ventricle a quadratic random effects model was better able to capture individual differences of change in normal aging.
Permutation testing for empirical false positive rate
Finally, to ensure our voxelwise estimation scheme does not produce spurious or misleading conclusions we repeated a similar analysis under random permutations. We focussed on a subsample of 60 normal subjects with 300 MRI scans. Similar effect maps as shown for the whole group of normals were obtained. Then the data was randomly permuted 100 times and we reran the Bayesian model inversion outlined above. Posterior probability maps were calculated exactly as outlined in the full ADNI model. We hoped to see that the number of voxels in the ensuing PPMs (thresholded at 95%) was 5% of the search volume or less. The mean false positive rate was found to be 2.85%. The distribution of % suprathreshold voxels over 100 presentations (with replacement) is shown in Fig. 13 (right). More generally, no indication for increased false positive rates was found for other probability thresholds as well (see Fig. 13 left).
Discussion
We have described, validated and applied a powerful framework for analysis of brain morphometry in longitudinal MRI data using Bayesian inference. The emphasis is on the analysis of individual differences of brain changes in one or more samples and subsequent inference about the contribution of subject specific covariates such as cognitive abilities, behavior, psychopathology, health, and lifestyle factors.
In particular, the approach exploits algorithms for within- (Ashburner and Ridgway, 2013) and between- (Ashburner and Friston, 2011) subject diffeomorphic registration in order to generate non-linearly registered tissue volume images of subjects and scans using Jacobian determinants of deformations. The resulting (modulated) tissue segment maps are subjected to mass-univariate generative mixed-effects modeling.
EM is used for Bayesian inversion of the generative model by estimating variance components and empirical Gaussian priors on individual differences of change. The model is hierarchical and provides estimates of local individual change trajectories over the whole study period, even for variable numbers of scans per subject or for less balanced designs.
Our approach is similar to recently proposed iterative schemes for surface-based cortical thickness analysis in longitudinal MRI data (Bernal-Rusiel et al., 2012, 2013) and fMRI group analysis (Chen et al., 2013). We also briefly compared our EM algorithm to the openly available mass-univariate mixed-effects algorithms from Freesurfer (https://surfer.nmr.mgh.harvard.edu/fswiki/LinearMixedEffectsModels) (Bernal-Rusiel et al., 2012) (not shown in results) using synthetic longitudinal data (from the section on Validation using simulated structural trajectories ) with linear models and balanced data with Gaussian errors. We found convergence to the same group trajectory parameter estimates suggesting the validity of the applied iterative mixed-model schemes. However, a detailed evaluation of multiple approaches in multiple settings (a) including non-Gaussian error distributions (b) with both balanced and unbalanced designs (c) with linear and non-linear ground truth trajectories, is left for future work.
In contrast to the above methods, our model focuses on Bayesian inference on fixed- and random-effect parameters for individual and group trajectories, as well as Bayesian model selection. Our focus on Bayesian inference and random effects models overcomes some limitations of classical inference (for discussion see Friston et al., 2002a) and the proposed evidence based comparison of models allows one to navigate a rich model space.
Other Bayesian approaches for longitudinal MRI have been proposed with emphasis on either classification, evaluation of treatment effects, and dynamical networks (Elliott et al., 2010; Schmid et al., 2009; Chen et al., 2012). Our approach complements these methods by providing single subject trajectories and a model of their heterogeneity in aging samples. Notably, our iterative EM based random effects estimation also substantially differs from non-iterative marginal modeling using generalized estimating equations (for introduction see eg. Fitzmaurice et al., 2008) with recent application to longitudinal data using sandwich estimators (Guillaume et al., 2014) and adaptive multiscale methods (Skup et al., 2012).
We extensively validated our method using ground truth comparison with simulated longitudinal data. The model consistently reproduced veridical estimates across study designs with different characteristics. A design with fewer scans per subject was found to substantially reduce parameter accuracy, especially for the rates of change (or slopes). This result favors less sparse designs for efficient analysis of individual differences of change. Less balanced designs were also found to increase deviations from ground truth with some exceptions, especially for second level slope estimates and higher noise levels.
Notably, by construction, the design variability is part of the likelihood model and these effects are fully accounted for in the posterior parameter uncertainty (or credible intervals). Thus, using PPMs is expected to provide valid inference about individual, group and covariate parameters across a wide variety of study designs. Moreover, using non-Gaussian distributions, we have revealed some evidence for the robustness of the method under potential violations of the normality assumption. Mean parameter estimates were found to be unaffected from non-Gaussianity, hyperparameters were rather mildly affected by skewness and more biased by very large values of kurtosis. Comparison to summary statistics showed that posterior probabilities perform similarly in balanced designs and are likely to improve inference in typical unbalanced observational designs.
We further validated our approach using real MRI data from a large subsample of the available ADNI dataset. The spatio-temporal pattern of structural trajectories in subsamples for Normal Aging, stable MCI, progressive MCI and AD was found to be consistent with existing neuroimaging evidence (Driscoll et al., 2009; Misra et al., 2009; Anderson et al., 2012; Barnes et al., 2008; Vemuri et al., 2010b). Applying linear models of trajectories, PPMs of clinical groups indicated substantially increased rates of local brain atrophy and ventricle growth. The spatial pattern clearly emphasizes temporo-parietal regions in stable MCIs compared to normal aging, while higher rates of atrophy in pMCI and AD were also found in frontal gray matter regions. The sensitivity of this longitudinal mixed-modeling method was further demonstrated by observing differential rates of atrophy in progressive MCI compared to stable MCI. In line with recent evidence in healthy aging (Fjell et al., 2014), we also found additional accelerated decline (i.e. reverse U-shaped trajectories) of cortical and subcortical gray matter regions and accelerated growth (i.e. U-shaped trajectories) of lateral ventricle using quadratic models. As suggested by the study of Holland et al. (2012), different patterns of change of rates of atrophy might apply to pathological compared to healthy aging groups. We will focus on the volume dynamics during disease progression in a separate paper.
Using ADNI data we also aimed to explore the strength of mixed-effects models to identify the effects of covariates of interest. For that particular purpose, we chose to analyze the effects of a genetic risk score based on the number of ApoE4 alleles, i.e. 0, 1, or 2, a well known and established risk factor for development of AD and corresponding signs of atrophy in MRI (Vemuri et al., 2010a; Risacher et al., 2010; Morgen et al., 2013; Taylor et al., 2014; Tosun et al., 2010; Moffat et al., 2000; Hostage et al., 2014). Although one could have alternatively used group comparisons based on number of ApoE4 alleles, we preferred to include this risk score as an example for an additional predictor of within group variability around the mean changes in normal aging and stable MCI groups.
The PPMs of ApoE4 risk's second level contrast indicated effects on variability of ventricle growth in normal aging and widespread effects on gray matter rate of atrophy in stable MCI. This emphasizes the risk score as an important contributing factor to local structural aging. Similarly, this technique could be used for parametric analysis of other risk scores or continuous behavioral variables thought to be involved in development and aging.
Candidate hypotheses about brain development and aging can be framed in terms of specific trajectory models. These hypotheses might involve (A) the inclusion of certain degrees of fixed or random effects of time, nonlinearities etc. and (B) explicitly modeling brain–behavior relationships by inclusion of behavioral covariates. Scientists can then use Bayesian inference to update their beliefs about the respective hypotheses, in light of new (neuroimaging) data.
Bayesian model selection has been introduced to identify the most likely of a set of hypotheses e.g. using log model evidence ratios or Bayes factors (Kass and Raftery, 1995). Evidence comparisons of nested models are analogous to the F-tests commonly used in SPM (Friston et al., 1995). However, a major advantage of applying Bayesian instead of frequentist inference to trajectory models is that evidence based comparison extends to non-nested models. This is useful because different combinations of random and fixed effects or covariates are not necessarily nested. For instance first order random effects models cannot be reduced to a zero order random effects model with first order fixed effects by setting some variables to zero.
Voxelwise model evidence maps were previously introduced for efficient group level inference in fMRI using random effects (Rosa et al., 2010). Our models extend this idea to mixed-effects models for longitudinal MRI. Using a normal aging sample, we here demonstrate that Bayesian model selection can be also used for particular choices of combinations of random and fixed effects in normal aging-related structural changes. We explored a model space with all combinations of fixed and random effects up to order three. Pairwise comparisons of models were illustrated using Bayes factor maps.
The model with the most evidence was found to vary over regions and tissue classes. For most gray matter regions a random intercept and slope model was found to be most likely, with exceptions of a left prefrontal and a postcentral region, and regions adjacent to the ventricles. These were found to show second order random effects with individual differences of accelerations.
The second order random effects model was also more likely for the lateral ventricle adjacent to the hippocampus and its posterior parts. Although we only found accelerated lateral ventricle volume increases, this is in line with recent observations of late accelerated aging in hippocampal gray matter in normal aging (Ziegler et al., 2012b; Fjell et al., 2014). Disregarding potential segmentation difficulties of hippocampal gray matter, one also might expect that the spatial regularization of the within-subject deformations is slightly biased towards the adjacent ventricle growth. This might have reduced the sensitivity for detection of second order individual decline differences in hippocampal regions.
At the same time, our results extend existing fixed effects findings. Similar to a recent study using ROIs from manual volumetry (Raz et al., 2010), mixed-effects models go beyond testing for (nonlinear) fixed effects of aging because they explicitly model heterogeneity of structural changes.
In contrast to findings of Raz et al. (2010), where some regions did exhibit age-related change, but without any sign of individual differences, here Bayesian model selection showed the highest model evidence for linear or even quadratic random effects. In fact, in our voxelwise whole brain search we did not observe any brain region in all three analyzed tissue classes that exhibited most evidence for a model with random intercepts and linear fixed effects, i.e. showing uniform aging across subjects. These deviations of semi-automated and manual longitudinal volumetry might be further addressed.
Our ADNI sample findings suggest substantial heterogeneity among local structural brain changes in normal aging subjects without (or prior to) signs of dementia. Similar questions might be addressed about the heterogeneity of trajectories in disease states and during treatment processes.
It is also worth mentioning that the optimal degree of random effects (from evidence based comparison) specifies the dimensionality of individual differences in aging brain structure. This degree determines the complexity of a sufficient individual model of change rather than only quantifying the smoothness of the temporal dynamic on the group level. This idea nicely connects to the multivariate perspective on cognitive ability differences (see e.g. Ziegler et al., 2013).
Future studies might focus on Bayesian model selection in larger random effects model spaces using additional sets of genetic, physiological, and behavioral predictors. After sufficiently capturing the complexity of individual differences of aging-related brain changes as random effects (or hidden variables), one might aim to explain latent variables based on other observations, such as behavior, genes, and other MRI modalities.
Here we applied uninformative priors in the presented results. However, the proposed framework enables flexible specification of prior structures at the top-level of parameters, which can be used to implement prior knowledge about the process of interest, e.g. in terms of expected growth or decline rates in development or aging. The framework (and the corresponding implementation in SPM) will provide the choice of top-level priors being either uninformative (i.e. flat) or informative. Uninformative priors can be used for exploratory research similar to other standard mixed-effects models, while otherwise, informative priors can be chosen to be either fixed (for fully Bayesian inference) or estimated from the data using empirical Bayes. In particular, further extensions aim to include examples of empirical priors, e.g. global shrinkage or atlas-based regional shrinkage priors which regularize all voxelwise trajectory estimates after estimating a top level prior on the whole brain or regional ROI level respectively. The use of empirical priors in context of neuroimaging data was recently motivated by machine learning applications showing the potential for probabilistic single case inference given the ‘prior-knowledge’ of a large MRI database (Ziegler et al., 2014). Although we did not observe any evidence for increased rates of false positives during permutation testing, it is worth mentioning that empirical priors have also been discussed in the context of control of false discovery rate (FDR) (Schwartzman et al., 2009).
We finally like to mention some limitations and possible extensions of this work. Firstly, Bayesian model reduction has been recently proposed for efficient inference on general linear models and dynamical systems models of neuroimaging data (Friston and Penny, 2011; Penny and Ridgway, 2013). Using model reduction, posterior estimates and model evidences might be accurately approximated for large model spaces using only the optimized full model (instead of inverting every reduced model). Future studies might therefore work on efficient approximation techniques to improve the efficiency of Bayesian model comparison across wider spaces of mixed-effects models.
Secondly, our presented model applied group specific priors with independent estimation of multiple ensembles of trajectories. However, the hierarchical modeling framework naturally extends to higher levels. These could be extended to model individual differences of changes in multiple clinical groups of a joint population, the inclusion of multi-center scanner levels, and the variance across birth cohorts.
Thirdly, the mass-univariate Bayesian model inversion is computationally very expensive and does not account for spatial correlations among the voxels. As in recent work, the model might be extended to combine priors on heterogeneity and image space using full spatio-temporal-models or adaptive smoothing techniques (see e.g. Bernal-Rusiel et al., 2013; Skup et al., 2012).
Finally, recently developed techniques in quantitative imaging provide biologically relevant properties, e.g. about brain myelination and iron levels (Draganski et al., 2011; Callaghan et al., 2014; Lambert et al., 2013). Following quantitative biomarkers in healthy and pathological development might be expected to provide biologically meaningful models of developmental heterogeneity while reducing the potential influence of anatomical shape variability.
Acknowledgments
We are grateful to John Ashburner for discussions, support, comments and corrections to the manuscript. This work was supported in part by the German Academic Exchange Service (DAAD). Data collection and sharing for this project was funded by the Alzheimer's Disease Neuroimaging Initiative (ADNI) (National Institutes of Health Grant U01 AG024904) and DOD ADNI (Department of Defense award number W81XWH-12-2-0012). ADNI is funded by the National Institute on Aging, the National Institute of Biomedical Imaging and Bioengineering, and through generous contributions from the following: Alzheimer's Association; Alzheimer's Drug Discovery Foundation; Araclon Biotech; BioClinica, Inc.; Biogen Idec Inc.; Bristol-Myers Squibb Company; Eisai Inc.; Elan Pharmaceuticals Inc.; Eli Lilly and Company; EuroImmun; F. Hoffmann-La Roche Ltd and its affiliated company Genentech, Inc.; Fujirebio; GE Healthcare; IXICO Ltd.; Janssen Alzheimer Immunotherapy Research & Development, LLC.; Johnson & Johnson Pharmaceutical Research & Development LLC.; Medpace, Inc.; Merck & Co., Inc.; Meso Scale Diagnostics, LLC.; NeuroRx Research; Neurotrack Technologies; Novartis Pharmaceuticals Corporation; Pfizer Inc.; Piramal Imaging; Servier; Synarc Inc.; and Takeda Pharmaceutical Company. The Canadian Institutes of Health Research is providing funds to support ADNI clinical sites in Canada. Private sector contributions are facilitated by the Foundation for the National Institutes of Health (http://www.fnih.org). The grantee organization is the Northern California Institute for Research and Education, and the study is coordinated by the Alzheimer's Disease Cooperative Study at the University of California, San Diego. ADNI data are disseminated by the Laboratory for Neuro Imaging at the University of Southern California. Dr Ridgway is supported by the Medical Research Council [grant number MR/J014257/2]. The Wellcome Trust Centre for Neuroimaging is supported by core funding from the Wellcome Trust [grant number 091593/Z/10/Z].
Conflict of interest statement
The authors declare that there are neither actual nor potential conflicts of interest.
Footnotes
Please note that the actual implementation uses with and hyperpriors Ph = 1/32. The treatment of hyperparameters using a probabilistic perspective is motivated within the variational Bayes framework (Friston et al., 2007) and increases numerical stability of the optimization scheme.
Data used in the preparation of this article were obtained from the Alzheimer's Disease Neuroimaging Initiative (ADNI) database (adni.loni.usc.edu). The ADNI was launched in 2003 by the National Institute on Aging (NIA), the National Institute of Biomedical Imaging and Bioengineering (NIBIB), the Food and Drug Administration (FDA), private pharmaceutical companies and non-profit organizations, as a $60 million, 5-year public–private partnership. The primary goal of ADNI has been to test whether serial magnetic resonance imaging (MRI), positron emission tomography (PET), other biological markers, and clinical and neuropsychological assessment can be combined to measure the progression of mild cognitive impairment (MCI) and early Alzheimer's disease (AD). Determination of sensitive and specific markers of very early AD progression is intended to aid researchers and clinicians to develop new treatments and monitor their effectiveness, as well as lessen the time and cost of clinical trials. The Principal Investigator of this initiative is Michael W. Weiner, MD, VA Medical Center and University of California – San Francisco. ADNI is the result of efforts of many co-investigators from a broad range of academic institutions and private corporations, and subjects have been recruited from over 50 sites across the U.S. and Canada. The initial goal of ADNI was to recruit 800 subjects but ADNI has been followed by ADNI-GO and ADNI-2. To date these three protocols have recruited over 1500 adults, ages 55 to 90, to participate in the research, consisting of cognitively normal older individuals, people with early or late MCI and people with early AD. The follow up duration of each group is specified in the protocols for ADNI-1, ADNI-2 and ADNI-GO. Subjects originally recruited for ADNI-1 and ADNI-GO had the option to be followed in ADNI-2. For up-to-date information, see http://www.adni-info.org.
References
- Anderson V.M., Schott J.M., Bartlett J.W., Leung K.K., Miller D.H., Fox N.C. Gray matter atrophy rate as a marker of disease progression in AD. Neurobiol. Aging. 2012;33(7):1194–1202. doi: 10.1016/j.neurobiolaging.2010.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ashburner J., Friston K.J. Unified segmentation. NeuroImage. 2005;26(3):839–851. doi: 10.1016/j.neuroimage.2005.02.018. [DOI] [PubMed] [Google Scholar]
- Ashburner J., Friston K.J. Diffeomorphic registration using geodesic shooting and Gauss–Newton optimisation. NeuroImage. 2011;55(3):954–967. doi: 10.1016/j.neuroimage.2010.12.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ashburner J., Ridgway G.R. Symmetric diffeomorphic modeling of longitudinal structural MRI. Front. Neurosci. 2013;6(197) doi: 10.3389/fnins.2012.00197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barnes J., Scahill R.I., Frost C., Schott J.M., Rossor M.N., Fox N.C. Increased hippocampal atrophy rates in AD over 6 months using serial MR imaging. Neurobiol. Aging. 2008;29(8):1199–1203. doi: 10.1016/j.neurobiolaging.2007.02.011. [DOI] [PubMed] [Google Scholar]
- Barnes J., Ridgway G.R., Bartlett J., Henley S.M.D., Lehmann M., Hobbs N., Clarkson M.J., MacManus D.G., Ourselin S., Fox N.C. Head size, age and gender adjustment in MRI studies: a necessary nuisance? NeuroImage. 2010;53(4):1244–1255. doi: 10.1016/j.neuroimage.2010.06.025. [DOI] [PubMed] [Google Scholar]
- Bernal-Rusiel J.L., Greve D.N., Reuter M., Fischl B., Sabuncu M.R., for the Alzheimer's Disease Neuroimaging Initiative Statistical analysis of longitudinal neuroimage data with Linear Mixed Effects models. NeuroImage. 2012;66C:249–260. doi: 10.1016/j.neuroimage.2012.10.065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bernal-Rusiel J.L., Reuter M., Greve D.N., Fischl B., Sabuncu M.R., the Alzheimer's Disease Neuroimaging Initiative Spatiotemporal linear mixed effects modeling for the mass-univariate analysis of longitudinal neuroimage data. NeuroImage. 2013;81:358–370. doi: 10.1016/j.neuroimage.2013.05.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bishop C.M. Springer; 2006. Pattern Recognition and Machine Learning. [Google Scholar]
- Callaghan M.F., Freund P., Draganski B., Anderson E., Cappelletti M., Chowdhury R., Diedrichsen J., Fitzgerald T.H.B., Smittenaar P., Helms G., Lutti A., Weiskopf N. Widespread age-related differences in the human brain microstructure revealed by quantitative magnetic resonance imaging. Neurobiol. Aging. 2014;35(8):1862–1872. doi: 10.1016/j.neurobiolaging.2014.02.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carmichael O., Schwarz C., Drucker D., Fletcher E., Harvey D., Beckett L., Jack C.R., Weiner M., DeCarli C., the Alzheimer Disease Neuroimaging Initiative Longitudinal changes in white matter disease and cognition in the first year of the Alzheimer disease neuroimaging initiative. Arch. Neurol. 2010;67(11):1370–1378. doi: 10.1001/archneurol.2010.284. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chan D., Janssen J.C., Whitwell J.L., Watt H.C., Jenkins R., Frost C., Rossor M.N., Fox N.C. Change in rates of cerebral atrophy over time in early-onset Alzheimer's disease: longitudinal MRI study. Lancet. 2003;362(9390):1121–1122. doi: 10.1016/S0140-6736(03)14469-8. [DOI] [PubMed] [Google Scholar]
- Chen R., Resnick S.M., Davatzikos C., Herskovits E.H. Dynamic Bayesian network modeling for longitudinal brain morphometry. NeuroImage. 2012;59(3):2330–2338. doi: 10.1016/j.neuroimage.2011.09.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen G., Saad Z.S., Britton J.C., Pine D.S., Cox R.W. Linear mixed-effects modeling approach to FMRI group analysis. NeuroImage. 2013;73:176–190. doi: 10.1016/j.neuroimage.2013.01.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chetelat G. a, Baron J.-C. Early diagnosis of Alzheimer's disease: contribution of structural neuroimaging. NeuroImage. 2003;18(2):525–541. doi: 10.1016/s1053-8119(02)00026-5. [DOI] [PubMed] [Google Scholar]
- Debette S., Seshadri S., Beiser A., Au R., Himali J.J., Palumbo C., Wolf P.A., DeCarli C. Midlife vascular risk factor exposure accelerates structural brain aging and cognitive decline. Neurology. 2011;77(5):461–468. doi: 10.1212/WNL.0b013e318227b227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dempster A.P., Laird N.M., Rubin D.B. Maximum likelihood from incomplete data via the EM algorithm. J. R. Stat. Soc. Ser. B (Stat Methodol.) 1977;39(1):1–38. [Google Scholar]
- den Heijer T., van der Lijn F., Ikram A., Koudstaal P.J., van der Lugt A., Krestin G.P., Vrooman H.A., Hofman A., Niessen W.J., Breteler M.M.B. Vascular risk factors, apolipoprotein E, and hippocampal decline on magnetic resonance imaging over a 10-year follow-up. Alzheimers Dement. 2012;8(5):417–425. doi: 10.1016/j.jalz.2011.07.005. [DOI] [PubMed] [Google Scholar]
- Donohue M.C., Jacqmin-Gadda H., Le Goff M., Thomas R.G., Raman R., Gamst A.C., Beckett L.A., Jack C.R., Weiner M.W., Dartigues J.-F., Aisen P.S., the Alzheimer's Disease Neuroimaging Initiative Estimating long-term multivariate progression from short-term data. Alzheimers Dement. 2014 doi: 10.1016/j.jalz.2013.10.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Douaud G., Refsum H., de Jager C.A., Jacoby R., Nichols T.E., Smith S.M., Smith A.D. Preventing Alzheimer's disease-related gray matter atrophy by B-vitamin treatment. Proc. Natl. Acad. Sci. U. S. A. 2013;110(23):9523–9528. doi: 10.1073/pnas.1301816110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Draganski B., Ashburner J., Hutton C., Kherif F., Frackowiak R.S.J., Helms G., Weiskopf N. Regional specificity of MRI contrast parameter changes in normal ageing revealed by voxel-based quantification (VBQ) NeuroImage. 2011;55(4):1423–1434. doi: 10.1016/j.neuroimage.2011.01.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Driscoll I., Davatzikos C., An Y., Wu X., Shen D., Kraut M., Resnick S.M. Longitudinal pattern of regional brain volume change differentiates normal aging from MCI. Neurology. 2009;72(22):1906–1913. doi: 10.1212/WNL.0b013e3181a82634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elliott C., Francis S.J., Arnold D.L., Collins D.L., Arbel T. Bayesian classification of multiple sclerosis lesions in longitudinal MRI using subtraction images. Med. Image Comput. Comput. Assist. Interv. 2010;13(Pt 2):290–297. doi: 10.1007/978-3-642-15745-5_36. [DOI] [PubMed] [Google Scholar]
- Fitzmaurice G., Davidian M., Verbeke G., Molenberghs G. CRC Press; 2008. Longitudinal Data Analysis. [Google Scholar]
- Fjell A.M., Walhovd K.B. Structural brain changes in aging: courses, causes and cognitive consequences. Rev. Neurosci. 2010;21(3):187–221. doi: 10.1515/revneuro.2010.21.3.187. [DOI] [PubMed] [Google Scholar]
- Fjell A.M., Westlye L.T., Grydeland H., Amlien I., Espeseth T., Reinvang I., Raz N., Dale A.M., Walhovd K.B., the Alzheimer's Disease Neuroimaging Initiative Accelerating cortical thinning: unique to dementia or universal in aging? Cereb. Cortex. 2014;24:919–934. doi: 10.1093/cercor/bhs379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fjell A.M., Westlye L.T., Grydeland H., Amlien I., Espeseth T., Reinvang I., Raz N., Holland D., Dale A.M., Walhovd K.B., Alzheimer's Disease Neuroimaging Initiative Critical ages in the life course of the adult brain: nonlinear subcortical aging. Neurobiol. Aging. 2013;34(10):2239–2247. doi: 10.1016/j.neurobiolaging.2013.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fonteijn H.M., Modat M., Clarkson M.J., Barnes J., Lehmann M., Hobbs N.Z., Scahill R.I., Tabrizi S.J., Ourselin S., Fox N.C., Alexander D.C. An event-based model for disease progression and its application in familial Alzheimer's disease and Huntington's disease. NeuroImage. 2012;60(3):1880–1889. doi: 10.1016/j.neuroimage.2012.01.062. [DOI] [PubMed] [Google Scholar]
- Frisoni G.B., Fox N.C., Jack C.R., Scheltens P., Thompson P.M. The clinical use of structural MRI in Alzheimer disease. Nature reviews. Neurology. 2010;6(2):67–77. doi: 10.1038/nrneurol.2009.215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston K.J., Penny W. Posterior probability maps and SPMs. NeuroImage. 2003;19(3):1240–1249. doi: 10.1016/s1053-8119(03)00144-7. [DOI] [PubMed] [Google Scholar]
- Friston K.J., Penny W.D. Post hoc Bayesian model selection. NeuroImage. 2011;56(4):2089–2099. doi: 10.1016/j.neuroimage.2011.03.062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston K.J., Holmes A., Worsley K.J. Statistical parametric maps in functional imaging: a general linear approach. Hum. Brain Mapp. 1995;2:189–210. [Google Scholar]
- Friston K.J., Penny W., Phillips C., Kiebel S., Hinton G., Ashburner J. Classical and Bayesian inference in neuroimaging: theory. NeuroImage. 2002;16(2):465–483. doi: 10.1006/nimg.2002.1090. [DOI] [PubMed] [Google Scholar]
- Friston K.J., Glaser D.E., Henson R.N.A., Kiebel S., Phillips C., Ashburner J. Classical and Bayesian inference in neuroimaging: applications. NeuroImage. 2002;16(2):484–512. doi: 10.1006/nimg.2002.1091. [DOI] [PubMed] [Google Scholar]
- Friston K., Mattout J., Trujillo-Barreto N., Ashburner J., Penny W. Variational free energy and the Laplace approximation. NeuroImage. 2007;34(1):220–234. doi: 10.1016/j.neuroimage.2006.08.035. [DOI] [PubMed] [Google Scholar]
- Frost C., Kenward M.G., Fox N.C. The analysis of repeated ‘direct’ measures of change illustrated with an application in longitudinal imaging. Stat. Med. 2004;23(21):3275–3286. doi: 10.1002/sim.1909. [DOI] [PubMed] [Google Scholar]
- Grady C. The cognitive neuroscience of ageing. Nat. Rev. Neurosci. 2012;13(7):491–505. doi: 10.1038/nrn3256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guillaume B., Hua X., Thompson P.M., Waldorp L., Nichols T.E., the Alzheimer's Disease Neuroimaging Initiative Fast and accurate modelling of longitudinal and repeated measures neuroimaging data. NeuroImage. 2014;94:287–302. doi: 10.1016/j.neuroimage.2014.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harville D.A. Maximum likelihood approaches to variance component estimation and to related problems. J. Am. Stat. Assoc. 1977;72(358):320–338. [Google Scholar]
- Holland D., Dale A.M., the Alzheimer's Disease Neuroimaging Initiative Nonlinear registration of longitudinal images and measurement of change in regions of interest. Med. Image Anal. 2011;15(4):489–497. doi: 10.1016/j.media.2011.02.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Holland D., Desikan R.S., Dale A.M., McEvoy L.K., the Alzheimer's Disease Neuroimaging Initiative Rates of decline in Alzheimer disease decrease with age. PLoS One. 2012;7(8):e42325. doi: 10.1371/journal.pone.0042325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hostage C.A., Choudhury K.R., Murali Doraiswamy P., Petrella J.R., the Alzheimer's Disease Neuroimaging Initiative Mapping the effect of the apolipoprotein E genotype on 4-year atrophy rates in an Alzheimer disease-related brain network. Radiology. 2014;271(1):211–219. doi: 10.1148/radiol.13131041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jedynak B.M., Lang A., Liu B., Katz E., Zhang Y., Wyman B.T., Raunig D., Jedynak C.P., Caffo B., Prince J.L., the Alzheimer's Disease Neuroimaging Initiative A computational neurodegenerative disease progression score: method and results with the Alzheimer's disease Neuroimaging Initiative cohort. NeuroImage. 2012;63(3):1478–1486. doi: 10.1016/j.neuroimage.2012.07.059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kass R.E., Raftery A.E. Bayes factors. J. Am. Stat. Assoc. 1995;90(430):773–795. [Google Scholar]
- Kempton M.J., Ettinger U., Schmechtig A., Winter E.M., Smith L., McMorris T., Wilkinson I.D., Williams S.C.R., Smith M.S. Effects of acute dehydration on brain morphology in healthy humans. Hum. Brain Mapp. 2009;30(1):291–298. doi: 10.1002/hbm.20500. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lambert C., Chowdhury R., FitzGerald T., Fleming S.M., Lutti A., Hutton C., Draganski B., Frackowiak R., Ashburner J. Characterizing aging in the human brainstem using quantitative multimodal MRI analysis. Front. Hum. Neurosci. 2013;7(462) doi: 10.3389/fnhum.2013.00462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lau J.C., Lerch J.P., Sled J.G., Henkelman R.M., Evans A.C., Bedell B.J. Longitudinal neuroanatomical changes determined by deformation-based morphometry in a mouse model of Alzheimer's disease. NeuroImage. 2008;42(1):19–27. doi: 10.1016/j.neuroimage.2008.04.252. [DOI] [PubMed] [Google Scholar]
- Lerch J.P., Pruessner J.C., Zijdenbos A., Hampel H., Teipel S.J., Evans A.C. Focal decline of cortical thickness in Alzheimer's disease identified by computational neuroanatomy. Cereb. Cortex. 2005;15(7):995–1001. doi: 10.1093/cercor/bhh200. [DOI] [PubMed] [Google Scholar]
- Leung K.K., Ridgway G.R., Ourselin S., Fox N.C., the Alzheimer's Disease Neuroimaging Initiative Consistent multi-time-point brain atrophy estimation from the boundary shift integral. NeuroImage. 2012;59(4):3995–4005. doi: 10.1016/j.neuroimage.2011.10.068. [DOI] [PubMed] [Google Scholar]
- Leung K.K., Bartlett J.W., Barnes J., Manning E.N., Ourselin S., Fox N.C., the Alzheimer's Disease Neuroimaging Initiative Cerebral atrophy in mild cognitive impairment and Alzheimer disease: rates and acceleration. Neurology. 2013;80(7):648–654. doi: 10.1212/WNL.0b013e318281ccd3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., Gilmore J.H., Shen D., Styner M., Lin W., Zhu H. Multiscale adaptive generalized estimating equations for longitudinal neuroimaging data. NeuroImage. 2013;72:91–105. doi: 10.1016/j.neuroimage.2013.01.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Littmann A., Guehring J., Buechel C., Stiehl H.-S. Acquisition-related morphological variability in structural MRI. Acad. Radiol. 2006;13(9):1055–1061. doi: 10.1016/j.acra.2006.05.001. [DOI] [PubMed] [Google Scholar]
- Lorenzi M., Pennec X. Efficient parallel transport of deformations in time series of images: from Schild's to pole ladder. J. Math. Imaging Vision. 2013;50(1–2):5–17. doi: 10.1007/978-3-642-22092-0_38. [DOI] [PubMed] [Google Scholar]
- McArdle J.J. Latent variable modeling of differences and changes with longitudinal data. Annu. Rev. Psychol. 2009;60:577–605. doi: 10.1146/annurev.psych.60.110707.163612. [DOI] [PubMed] [Google Scholar]
- Misra C., Fan Y., Davatzikos C. Baseline and longitudinal patterns of brain atrophy in MCI patients, and their use in prediction of short-term conversion to AD: results from ADNI. NeuroImage. 2009;44(4):1415–1422. doi: 10.1016/j.neuroimage.2008.10.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moffat S.D., Szekely C.A., Zonderman A.B., Kabani N.J. Longitudinal change in hippocampal volume as a function of apolipoprotein E genotype. Neurology. 2000;55(1):134–136. doi: 10.1212/wnl.55.1.134. [DOI] [PubMed] [Google Scholar]
- Morgen K., Frölich L., Tost H., Plichta M.M., Kölsch H., Rakebrandt F., Rienhoff O., Jessen F., Peters O., Jahn H., Luckhaus C., Hüll M., Gertz H.-J., Schröder J., Hampel H., Teipel S.J., Pantel J., Heuser I., Wiltfang J., Rüther E., Kornhuber J., Maier W., Meyer-Lindenberg A. APOE-dependent phenotypes in subjects with mild cognitive impairment converting to Alzheimer's disease. J. Alzheimer Res. 2013;37(2):389–401. doi: 10.3233/JAD-130326. [DOI] [PubMed] [Google Scholar]
- Mueller S.G., Weiner M.W., Thal L.J., Petersen R.C., Jack C.R., Jagust W., Trojanowski J.Q., Toga A.W., Beckett L. Ways toward an early diagnosis in Alzheimer's disease: the Alzheimer's Disease Neuroimaging Initiative (ADNI) Alzheimers Dement. 2005;1(1):55–66. doi: 10.1016/j.jalz.2005.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Penny W.D. Comparing dynamic causal models using AIC, BIC and free energy. NeuroImage. 2012;59(1):319–330. doi: 10.1016/j.neuroimage.2011.07.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Penny W.D., Ridgway G.R. Efficient posterior probability mapping using Savage-Dickey ratios. PLoS One. 2013;8(3):e59655. doi: 10.1371/journal.pone.0059655. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Penny W.D., Stephan K.E., Mechelli A., Friston K.J. Comparing dynamic causal models. NeuroImage. 2004;22(3):1157–1172. doi: 10.1016/j.neuroimage.2004.03.026. [DOI] [PubMed] [Google Scholar]
- Pudas S., Persson J., Josefsson M., de Luna X., Nilsson L.-G., Nyberg L. Brain characteristics of individuals resisting age-related cognitive decline over two decades. J. Neurosci. 2013;33(20):8668–8677. doi: 10.1523/JNEUROSCI.2900-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raz N., Lindenberger U. Only time will tell: Cross-sectional studies offer no solution to the age–brain–cognition triangle: Comment on Salthouse (2011) Psychol. Bull. 2011;137(5):790–795. doi: 10.1037/a0024503. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raz N., Rodrigue K.M. Differential aging of the brain: patterns, cognitive correlates and modifiers. Neurosci. Biobehav. Rev. 2006;30(6):730–748. doi: 10.1016/j.neubiorev.2006.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raz N., Lindenberger U., Rodrigue K.M., Kennedy K.M., Head D., Williamson A., Dahle C., Gerstorf D., Acker J.D. Regional brain changes in aging healthy adults: general trends, individual differences and modifiers. Cereb. Cortex. 2005;15(11):1676–1689. doi: 10.1093/cercor/bhi044. [DOI] [PubMed] [Google Scholar]
- Raz N., Ghisletta P., Rodrigue K.M., Kennedy K.M., Lindenberger U. Trajectories of brain aging in middle-aged and older adults: regional and individual differences. NeuroImage. 2010;51(2):501–511. doi: 10.1016/j.neuroimage.2010.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raznahan A., Lerch J.P., Lee N., Greenstein D., Wallace G.L., Stockman M., Clasen L., Shaw P.W., Giedd J.N. Patterns of coordinated anatomical change in human cortical development: a longitudinal neuroimaging study of maturational coupling. Neuron. 2011;72(5):873–884. doi: 10.1016/j.neuron.2011.09.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raznahan A., Shaw P.W., Lalonde F., Stockman M., Wallace G.L., Greenstein D., Clasen L., Gogtay N., Giedd J.N. How does your cortex grow? J. Neurosci. 2011;31(19):7174–7177. doi: 10.1523/JNEUROSCI.0054-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raznahan A., Shaw P.W., Lerch J.P., Clasen L.S., Greenstein D., Berman R., Pipitone J., Chakravarty M.M., Giedd J.N. Longitudinal four-dimensional mapping of subcortical anatomy in human development. Proc. Natl. Acad. Sci. U. S. A. 2014;111(4):1592–1597. doi: 10.1073/pnas.1316911111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Resnick S.M., Goldszal A.F., Davatzikos C., Golski S., Kraut M.A., Metter E.J., Bryan R.N., Zonderman A.B. One-year age changes in MRI brain volumes in older adults. Cereb. Cortex. 2000;10(5):464–472. doi: 10.1093/cercor/10.5.464. [DOI] [PubMed] [Google Scholar]
- Reuter M., Fischl B. Avoiding asymmetry-induced bias in longitudinal image processing. NeuroImage. 2011;57(1):19–21. doi: 10.1016/j.neuroimage.2011.02.076. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reuter M., Rosas H.D., Fischl B. Highly accurate inverse consistent registration: a robust approach. NeuroImage. 2010;53(4):1181–1196. doi: 10.1016/j.neuroimage.2010.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reuter M., Schmansky N.J., Rosas H.D., Fischl B. Within-subject template estimation for unbiased longitudinal image analysis. NeuroImage. 2012;61(4):1402–1418. doi: 10.1016/j.neuroimage.2012.02.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Risacher S.L., Shen L., West J.D., Kim S., McDonald B.C., Beckett L.A., Harvey D.J., Jack C.R., Weiner M.W., Saykin A.J., the Alzheimer's Disease Neuroimaging Initiative Longitudinal MRI atrophy biomarkers: relationship to conversion in the ADNI cohort. Neurobiol. Aging. 2010;31(8):1401–1418. doi: 10.1016/j.neurobiolaging.2010.04.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rosa M.J., Bestmann S., Harrison L., Penny W. Bayesian model selection maps for group studies. NeuroImage. 2010;49(1):217–224. doi: 10.1016/j.neuroimage.2009.08.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sabuncu M.R., Bernal-Rusiel J.L., Reuter M., Greve D.N., Fischl B., for the Alzheimer's Disease Neuroimaging Initiative Event time analysis of longitudinal neuroimage data. NeuroImage. 2014;97:9–18. doi: 10.1016/j.neuroimage.2014.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmid V.J., Whitcher B., Padhani A.R., Taylor N.J., Yang G.-Z. A Bayesian hierarchical model for the analysis of a longitudinal dynamic contrast-enhanced MRI oncology study. Magn. Reson. Med. 2009;61(1):163–174. doi: 10.1002/mrm.21807. [DOI] [PubMed] [Google Scholar]
- Schnaudigel S., Preul C., Ugur T., Mentzel H.J., Witte O.W., Tittgemeyer M., Hagemann G. Positional brain deformation visualized with magnetic resonance morphometry. Neurosurgery. 2010;66(2):376–384. doi: 10.1227/01.NEU.0000363704.74450.B4. [DOI] [PubMed] [Google Scholar]
- Schumann C.M., Bloss C.S., Barnes C.C., Wideman G.M., Carper R.A., Akshoomoff N., Pierce K., Hagler D., Schork N., Lord C., Courchesne E. Longitudinal magnetic resonance imaging study of cortical development through early childhood in autism. J. Neurosci. 2010;30(12):4419–4427. doi: 10.1523/JNEUROSCI.5714-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwartzman A., Dougherty R.F., Lee J., Ghahremani D., Taylor J.E. Empirical null and false discovery rate analysis in neuroimaging. NeuroImage. 2009;44(1):71–82. doi: 10.1016/j.neuroimage.2008.04.182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shaw P.W., Greenstein D., Lerch J.P., Clasen L., Lenroot R., Gogtay N., Evans A.C., Rapoport J., Giedd J.N. Intellectual ability and cortical development in children and adolescents. Nature. 2006;440(7084):676–679. doi: 10.1038/nature04513. [DOI] [PubMed] [Google Scholar]
- Shaw P.W., Kabani N.J., Lerch J.P., Eckstrand K., Lenroot R., Gogtay N., Greenstein D., Clasen L., Evans A.C., Rapoport J.L., Giedd J.N., Wise S.P. Neurodevelopmental trajectories of the human cerebral cortex. J. Neurosci. 2008;28(14):3586–3594. doi: 10.1523/JNEUROSCI.5309-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Skup M., Zhu H., Zhang H. Multiscale adaptive marginal analysis of longitudinal neuroimaging data with time-varying covariates. Biometrics. 2012;68(4):1083–1092. doi: 10.1111/j.1541-0420.2012.01767.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith A.D., Smith S.M., de Jager C.A., Whitbread P., Johnston C., Agacinski G., Oulhaj A., Bradley K.M., Jacoby R., Refsum H. Homocysteine-lowering by B vitamins slows the rate of accelerated brain atrophy in mild cognitive impairment: a randomized controlled trial. PLoS One. 2010;5(9):e12244. doi: 10.1371/journal.pone.0012244. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taki Y., Thyreau B., Kinomura S., Sato K., Goto R., Wu K., Kawashima R., Fukuda H. A longitudinal study of the relationship between personality traits and the annual rate of volume changes in regional gray matter in healthy adults. Hum. Brain Mapp. 2013;34(12):3347–3353. doi: 10.1002/hbm.22145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taylor J.L., Scanlon B.K., Farrell M., Hernandez B., Adamson M.M., Ashford J.W., Noda A., Murphy G.M., Weiner M.W. APOE-epsilon4 and aging of medial temporal lobe gray matter in healthy adults older than 50 years. Neurobiol. Aging. 2014 doi: 10.1016/j.neurobiolaging.2014.05.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thambisetty M., An Y., Kinsey A., Koka D., Saleem M., Guentert A., Kraut M., Ferrucci L., Davatzikos C., Lovestone S., Resnick S.M. Plasma clusterin concentration is associated with longitudinal brain atrophy in mild cognitive impairment. NeuroImage. 2012;59(1):212–217. doi: 10.1016/j.neuroimage.2011.07.056. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tosun D., Schuff N., Truran-Sacrey D., Shaw L.M. Relations between brain tissue loss, CSF biomarkers, and the ApoE genetic profile: a longitudinal MRI study. Neurobiol. Aging. 2010;31(8):1340–1354. doi: 10.1016/j.neurobiolaging.2010.04.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vemuri P., Wiste H.J., Weigand S.D., Knopman D.S., Shaw L.M., Trojanowski J.Q., Aisen P.S., Weiner M., Petersen R.C., Jack C.R., the Alzheimer's Disease Neuroimaging Initiative Effect of apolipoprotein E on biomarkers of amyloid load and neuronal pathology in Alzheimer disease. Ann. Neurol. 2010;67(3):308–316. doi: 10.1002/ana.21953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vemuri P., Wiste H.J., Weigand S.D., Knopman D.S., Trojanowski J.Q., Shaw L.M., Bernstein M.A., Aisen P.S., Weiner M., Petersen R.C., Jack C.R., the Alzheimer's Disease Neuroimaging Initiative Serial MRI and CSF biomarkers in normal aging, MCI, and AD. Neurology. 2010;75(2):143–151. doi: 10.1212/WNL.0b013e3181e7ca82. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Woolrich M.W. Bayesian inference in FMRI. NeuroImage. 2012;62(2):801–810. doi: 10.1016/j.neuroimage.2011.10.047. [DOI] [PubMed] [Google Scholar]
- Young A.L., Oxtoby N.P., Daga P., Cash D.M., Fox N.C., Ourselin S., Schott J.M., Alexander D.C., the Alzheimer's Disease Neuroimaging Initiative A data-driven model of biomarker changes in sporadic Alzheimer's disease. Brain. 2014;137(Pt 9):2564–2577. doi: 10.1093/brain/awu176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ziegler G., Dahnke R., Gaser C., the Alzheimer's Disease Neuroimaging Initiative Models of the aging brain structure and individual decline. Front. Neuroinformatics. 2012;6:3. doi: 10.3389/fninf.2012.00003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ziegler G., Dahnke R., Jäncke L., Yotter R.A., May A., Gaser C. Brain structural trajectories over the adult lifespan. Hum. Brain Mapp. 2012;33:2377–2389. doi: 10.1002/hbm.21374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ziegler G., Dahnke R., Winkler A.D., Gaser C. Partial least squares correlation of multivariate cognitive abilities and local brain structure in children and adolescents. NeuroImage. 2013;82:284–294. doi: 10.1016/j.neuroimage.2013.05.088. [DOI] [PubMed] [Google Scholar]
- Ziegler G., Ridgway G.R., Dahnke R., Gaser C., the Alzheimer's Disease Neuroimaging Initiative A.D.N.I. Individualized Gaussian process-based prediction and detection of local and global gray matter abnormalities in elderly subjects. NeuroImage. 2014;97:333–348. doi: 10.1016/j.neuroimage.2014.04.018. [DOI] [PMC free article] [PubMed] [Google Scholar]