eLife. 2021 Nov 10;10:e69995. doi: 10.7554/eLife.69995

Individual variations in ‘brain age’ relate to early-life factors more than to longitudinal brain change

Didac Vidal-Pineiro 1, Yunpeng Wang 1, Stine K Krogsrud 1, Inge K Amlien 1, William FC Baaré 2, David Bartres-Faz 3, Lars Bertram 1,4, Andreas M Brandmaier 5,6, Christian A Drevon 7, Sandra Düzel 6, Klaus Ebmeier 8, Richard N Henson 9, Carme Junqué 3,10, Rogier Andrew Kievit 9,11, Simone Kühn 12,13, Esten Leonardsen 1, Ulman Lindenberger 5,6, Kathrine S Madsen 2,14, Fredrik Magnussen 1, Athanasia Monika Mowinckel 1, Lars Nyberg 15, James M Roe 1, Barbara Segura 3,10, Stephen M Smith 6, Øystein Sørensen 1, Sana Suri 16,17, Rene Westerhausen 18, Andrew Zalesky 19, Enikő Zsoldos 17, Kristine Beate Walhovd 1,20, Anders Fjell 1,20
Editors: Juan Zhou 21, Christian Büchel 22
PMCID: PMC8580481  PMID: 34756163

Abstract

Brain age is a widely used index for quantifying individuals’ brain health as deviation from a normative brain aging trajectory. Higher-than-expected brain age is thought partially to reflect above-average rate of brain aging. Here, we explicitly tested this assumption in two independent large test datasets (UK Biobank [main] and Lifebrain [replication]; longitudinal observations ≈ 2750 and 4200) by assessing the relationship between cross-sectional and longitudinal estimates of brain age. Brain age models were estimated in two different training datasets (n ≈ 38,000 [main] and 1800 individuals [replication]) based on brain structural features. The results showed no association between cross-sectional brain age and the rate of brain change measured longitudinally. Rather, brain age in adulthood was associated with the congenital factors of birth weight and polygenic scores of brain age, assumed to reflect a constant, lifelong influence on brain structure from early life. The results call for nuanced interpretations of cross-sectional indices of the aging brain and question their validity as markers of ongoing within-person changes of the aging brain. Longitudinal imaging data should be preferred whenever the goal is to understand individual change trajectories of brain and cognition in aging.

Research organism: Human

eLife digest

Scientists who study the brain and aging are keen to find an effective way to measure brain health, which could help identify people at risk for dementia or memory problems. One popular marker is ‘brain age’. This measurement uses a brain scan to estimate a person’s chronological age, then compares the estimated brain age to the person’s actual age to determine whether their brain is aging faster or slower than expected for their age.

However, since brain age relies on one brain scan taken at one point in time, it is not clear whether it really measures brain aging or if it might capture brain differences that have been present throughout the individual’s life. Studies comparing individual brain scans over several years would be necessary to know for sure.

Now, Vidal-Piñeiro et al. show that the brain-age measurement does not reflect faster brain aging. In the experiments, the researchers compared repeated brain scans of thousands of individuals over 40 years of age. The experiments showed that deviations from normative brain age detected in a single scan reflected early life differences more than changes in the brain over time. For example, people with older-looking brains were more likely to have had a low birth weight or to have a combination of genes associated with having an older looking brain.

Vidal-Piñeiro et al. show that brain age mostly reflects a pre-existing brain condition rather than brain aging. The experiments also suggest that genetics and early brain development likely have a strong impact on brain health throughout life. Future studies trying to test or develop brain-aging measurements should use serial measurements to track brain changes over time.

Introduction

The concept of brain age is increasingly used to capture interindividual differences in the structure, function, and neurochemistry of the aging brain (Cole and Franke, 2017). The biological age of the brain is estimated typically by applying machine learning to magnetic resonance imaging (MRI) data to predict chronological age. The difference between predicted brain age and actual chronological age (brain age delta) reflects the deviation from the expected norm and is often used to index brain health. Brain age delta has been related to brain, mental, and cognitive health, and proved valuable in predicting outcomes such as mortality (Cole et al., 2018; Cole and Franke, 2017; Elliott et al., 2019). To different degrees, it is assumed that brain age delta reflects past and ongoing neurobiological aging processes (Cole and Franke, 2017; Elliott et al., 2019; Franke and Gaser, 2019; Smith et al., 2020). Hence, it is common to interpret positive brain age deltas as reflecting a steeper rate of brain aging; often dubbed as accelerated aging (here both terms are used interchangeably) (Cole and Franke, 2017; Franke and Gaser, 2019; Smith et al., 2019).

The assumption that brain age delta reflects an ongoing process of faster or slower neurobiological aging implies that there should be a relationship between cross-sectional and longitudinal estimates of brain age. Alternatively, individual deviations from the expected brain age could capture constant interindividual differences in brain structure that remain stable throughout the lifespan, reflecting early genetic and environmental influences (Deary, 2012; Elliott et al., 2019; Walhovd et al., 2016). These perspectives offer fundamentally divergent interpretations of higher brain age (delta) in groups experiencing specific life events, brain disorders, and other medical problems. Here, we tested whether brain age – derived from structural T1-weighted (T1w) morphological features – is related to accelerated brain aging, early-life factors, or a combination of both.

If interindividual variations of brain age reflect variations in rates of ongoing brain aging (Figure 1a), cross-sectional brain age delta should be positively associated with brain decline measured longitudinally. Here, we quantified individual brain change as the annual rate of change of brain age delta (brain age deltalong). In addition, we also assessed brain change with a composite score of structural brain change as obtained using principal component (PC) analysis of change and change in the different raw structural brain features. These analyses were performed in two independent cohorts, both divided into a cross-sectional model generation (training) and a longitudinal, hypothesis testing (test) dataset. If cross-sectional variations in brain age reflect differences in brain structure established early in life, one should observe a relationship between brain age and influences associated with stable, lifelong effects on brain structure. Here, we selected two congenital factors: self-reported birth weight and polygenic scores for brain age (PGS-BA), for which lifelong effects on age-related phenotypes have been shown (Walhovd et al., 2012; Walhovd et al., 2020; Figure 1b). Birth weight reflects normal variation in body (and brain) size as well as prenatal conditions, whereas PGS-BA quantifies genetic liability of having a higher brain age.

Figure 1. Theoretical expectations and study characteristics.

(a) Three hypothetical trajectories leading to higher brain age delta. Higher brain age delta can be explained by a steeper rate of neurobiological aging (green), distinct events that led to the accumulation of brain damage in the past (yellow), or early-life genetic and developmental factors (purple). The black arrow represents normative values of brain age through the lifespan. (b) Brain aging (green) vs. early-life (blue-purple) accounts of brain age in older age. For the brain aging notion, cross-sectional brain age (points) relates to the slope of brain age as assessed by two or more observations across time (continuous line), reflecting ongoing differences in the rate of aging (dashed line, green scale). For the early-life notion, cross-sectional brain age (points) relates to early environmental, genetic, and/or developmental differences such as birth weight (blue-purple scale). (c) Relative age distribution for the UK Biobank test and training datasets. (d) Age variance explained (r2) for each MRI feature in the training dataset. Features are grouped by modality and ordered by the variance explained. (e) Brain age model as estimated on the training (n = 38,682), and (f) test datasets (participants = 1372; two observations each). In (e) and (f), lines represent the identity (gray; i.e., f(x) = x or diagonal fit), the linear (green), and the generalized additive models (GAM; orange) fits of chronological age to brain age. Confidence intervals (CIs) around the GAM fit represent 99.9% CIs for the mean. In (d), gwc = gray-white matter contrast, (c) = cortical, and (s) = subcortical.

Figure 1—figure supplement 1. Age distribution for the Lifebrain replication dataset.

(a) Relative age distribution for the Lifebrain training and test datasets. Relative age distribution for the different cohorts of the Lifebrain (b) training and (c) test datasets.

Figure 1—figure supplement 2. Brain age model predictions.

Brain age model prediction (i.e., on test data) as estimated (a) using LASSO in the UK Biobank dataset and (b) extreme gradient boosting in the Lifebrain sample. Gray, green, and orange lines represent the identity, the linear, and the generalized additive model (GAM) functions fitting brain age on chronological age.

Results

Brain age prediction

Chronological age (Figure 1c) was predicted based on regional and global features from structural T1w MRI, including cortical thickness, area, volume, and gray-white matter contrast, as well as subcortical volume and intensity imaging-derived phenotypes (|N| = 365). See a list of the different structural features used in the model in Supplementary files 1 and 2, and Figure 1d for pairwise correlations with age. The model was trained on 38,682 participants (age range = 44.8–82.6 years) with a single MRI from the UK Biobank (Miller et al., 2016) using gradient boosting as implemented in XGBoost (https://xgboost.readthedocs.io) and optimized using 10-fold cross-validation and a randomized hyperparameter search. The trained model (Figure 1e) was then used to predict brain age for an independent test dataset of 1372 participants with two MRIs each (age range = 47.2–80.6 years, mean [SD] follow-up = 2.3 [0.1] years). The predictions – applied to the longitudinal test set – revealed a high correlation between chronological and brain age (r = 0.82) with mean absolute error (MAE) = 3.31 years and root mean squared error (RMSE) = 4.14 years (Figure 1f), comparable to other brain age models using UK Biobank MRI data (Cole, 2020a). We used generalized additive models (GAM) to correct for the brain age bias, that is, the underestimation of brain age in older individuals and vice versa; a regression-to-the-mean bias (Smith et al., 2019). Brain age delta was calculated as the residual from the GAM fit. Brain age delta at baseline and follow-up were strongly correlated (r = 0.81). To establish generalizability, we replicated our results using a different machine learning algorithm – a LASSO-based approach (Cole, 2020a) – and an independent training and test (longitudinal) dataset from the Lifebrain consortium (Walhovd et al., 2018) with up to 11.2 years of follow-up (3292 unique participants, age range = 18.0–94.4 years; technical and biological replication). 
See Figure 1—figure supplement 1 and Supplementary file 3 for additional demographic information. All code used to generate the results is available alongside the article and at https://github.com/LCBC-UiO/VidalPineiro_BrainAge (Vidal-Piñeiro, 2021; copy archived at swh:1:rev:2044c6ca40e0b8f99c9190c6edfde8ca76b559ac).
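
The delta computation described in this section can be sketched as follows. This is a minimal illustration on simulated data rather than the published pipeline: simulated predictions stand in for the XGBoost output, a straight-line fit stands in for the GAM used for bias correction, and all function and variable names are hypothetical.

```python
import math
import random


def fit_line(x, y):
    """Ordinary least squares fit of y = a + b * x; returns (a, b)."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    b = sxy / sxx
    return my - b * mx, b


def prediction_errors(age, predicted):
    """Mean absolute error and root mean squared error of predicted age."""
    err = [p - a for a, p in zip(age, predicted)]
    mae = sum(abs(e) for e in err) / len(err)
    rmse = math.sqrt(sum(e * e for e in err) / len(err))
    return mae, rmse


def brain_age_delta(age, predicted):
    """Delta = residual of predicted brain age regressed on chronological age.

    Assumption: a linear fit replaces the GAM described in the text.
    """
    a, b = fit_line(age, predicted)
    return [p - (a + b * x) for x, p in zip(age, predicted)]


# Simulated predictions showing regression to the mean (slope below 1).
rng = random.Random(0)
age = [rng.uniform(45, 82) for _ in range(2000)]
predicted = [63 + 0.7 * (a - 63) + rng.gauss(0, 3) for a in age]

mae, rmse = prediction_errors(age, predicted)
delta = brain_age_delta(age, predicted)
_, slope_after = fit_line(age, delta)  # close to 0 after correction
```

Because delta is defined as a regression residual, it is uncorrelated with chronological age by construction, which removes the over- and underestimation pattern at the extremes of the age range.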

Brain age does not strongly relate to the rate of brain aging

First, we tested whether cross-sectional brain age delta was associated with brain age deltalong – that is, annual rate of change in brain age delta – using linear models controlling for age, sex, scanning site, and estimated intracranial volume (eICV). We selected the centercept (brain age delta at mean chronological age), instead of baseline brain age delta, to avoid statistical dependency between indices. Cross-sectional and brain age deltalong were weakly, but negatively, associated in the UK Biobank (β = –0.016 [±0.008] delta/year, t(p) = –2.0 (.04), r2 = 0.002, Figure 2a). Cross-sectional and brain age deltalong were unrelated using a LASSO regression approach (β = –0.003 [±0.006] delta/year, t(p) = –0.5 (.65), r2 = 0.001, Figure 2b), and in the Lifebrain replication sample (β = –0.007 [±0.01] delta/year, t(p) = –0.6 (.53), r2 = 0.001, Figure 2c). Post-hoc equivalence tests showed that positive relationships with β > 0.010 delta/year would be rejected in all three analyses, thus confirming a lack of a meaningful relationship between cross-sectional and longitudinal brain age (Materials and methods and Figure 2—figure supplement 1). UK Biobank (gradient boosting) results remained non-significant when brain age delta was derived using time points 1 and 2 as two independent training sets (10-fold cross-validation; uncorrected delta values), thus avoiding potential confounds with age-bias correction (t(p) = 0.3 (.76)). Lifebrain results remained unaffected after including follow-up interval as an additional covariate or restricting the analysis to participants with long follow-up intervals (>4 years; n = 424). The relationship between cross-sectional and brain age deltalong was not significant in either case (β = –0.008 [±0.01] delta/year, t(p) = –0.7 (.45); β = –0.008 [±0.007] delta/year, t(p) = –1.1 (.26)).
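
Two steps in this paragraph lend themselves to a short sketch. The first function computes, per participant, the annual change in delta and the centercept. It takes the centercept to be the fitted delta at the participant's own mean age, which is one possible reading of the definition above. The second function is a one-sided inferiority test under a normal approximation. It treats the bracketed ± values as standard errors, an assumption that is consistent with the reported t statistics. Function names and the example participant are hypothetical.

```python
import math


def slope_and_centercept(ages, deltas):
    """Per-participant least-squares line through (age, delta) observations.

    Returns (annual change in delta, delta at the participant's mean age).
    """
    n = len(ages)
    mean_age = sum(ages) / n
    mean_delta = sum(deltas) / n
    sxx = sum((a - mean_age) ** 2 for a in ages)
    sxy = sum((a - mean_age) * (d - mean_delta) for a, d in zip(ages, deltas))
    # A least-squares line passes through (mean_age, mean_delta), so the
    # fitted value at the mean age equals the mean of the observed deltas.
    return sxy / sxx, mean_delta


def inferiority_p(beta, se, margin):
    """One-sided p-value for H0: true effect >= margin (normal approximation)."""
    z = (beta - margin) / se
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))  # P(Z <= z)


# Hypothetical participant scanned at 60.0 and 62.3 years of age.
slope, centercept = slope_and_centercept([60.0, 62.3], [1.50, 1.04])

# Rounded estimates from the text; second value assumed to be a standard error.
reported = {
    "UK Biobank, XGBoost": (-0.016, 0.008),
    "UK Biobank, LASSO": (-0.003, 0.006),
    "Lifebrain": (-0.007, 0.010),
}
p_values = {name: inferiority_p(b, se, 0.010) for name, (b, se) in reported.items()}
```

Using the centercept matters because measurement error in the baseline delta enters the change score with the opposite sign, which by itself induces a negative correlation between baseline and change. With the rounded values above and a margin of 0.010 delta/year, all three one-sided p-values fall below 0.05.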

Figure 2. Relationship between cross-sectional and longitudinal brain age delta.

(a) Main analysis using the UK Biobank dataset and boosting gradient (n = 1372, p=0.04, r2 = 0.002). (b) Replication analyses using a different training algorithm (LASSO; n = 1372, p=0.65, r2 = 0.001) and (c) an independent dataset (Lifebrain; n = 1500, p=0.53, r2 = 0.001). XGB = boosting gradient as implemented in XGBoost. Confidence intervals (CIs) represent 99.9% CI for the fit. Longitudinal brain age delta (brain age deltalong) refers to the rate of change in delta between baseline and follow-up MRI measurements. Cross-sectional brain age delta (brain age deltacross) refers to centercept brain age delta; that is, at mean age.

Figure 2—figure supplement 1. Equivalence tests.

Inferiority tests for the three main models used to assess the relationship between cross-sectional and brain age deltalong. Inferiority tests assess whether a null hypothesis of an effect as large as Δ can be rejected. On the x-axis, Δ reflects the null hypothesis as β values (years/delta). A null hypothesis of an effect at least as large as 0.11 years/delta can be rejected (p<0.05) in all three tests. Δ has been evaluated at [–0.02, 0.05, 0.001]. The dashed red line indicates a p=0.05 criterion for the null hypothesis rejection.
Figure 2—figure supplement 2. Relationship between brain age delta and composite measures of change.

Relationship between a composite measure of change as captured by the first principal component on feature change and cross-sectional brain age delta in (a) the UK Biobank and boosting gradient, (b) the UK Biobank and the LASSO algorithm, and (c) the Lifebrain dataset. Relationship between the composite measure of change and (longitudinal) brain age deltalong in (d) the UK Biobank and boosting gradient, (e) the UK Biobank and the LASSO algorithm, and (f) the Lifebrain dataset. Negative values in the principal component reflect brain decline (e.g., steeper cortical thinning, higher ventricle volume, etc.). n = 1369 and 1497 for the UK Biobank and the Lifebrain datasets.
Figure 2—figure supplement 3. Relationship between brain age delta and change in raw features.

Feature change over time in the (a) UK Biobank and (b) Lifebrain datasets. Signed relationship between cross-sectional brain age delta and longitudinal change in the raw features in (c) the UK Biobank using a boosting gradient algorithm, (d) the UK Biobank using a LASSO algorithm, and (e) the Lifebrain dataset using the boosting gradient algorithm. Signed relationship between change in brain age delta (brain age deltalong) and longitudinal change in the raw features in (f) the UK Biobank using a boosting gradient algorithm, (g) the UK Biobank using a LASSO algorithm, and (h) the Lifebrain dataset using the boosting gradient algorithm. Dashed lines represent a Bonferroni-corrected significance threshold (|n| = 365 and 372 features for UK Biobank and Lifebrain datasets, respectively). The solid line represents an uncorrected p=0.05 significance threshold. n = 1372 and 1500 for the UK Biobank and the Lifebrain datasets.

We additionally tested whether cross-sectional and longitudinal brain age delta (brain age deltalong) were associated with a composite measure of longitudinal brain change or with change in any of the structural MRI features. See Materials and methods for details. Cross-sectional brain age delta was unrelated to a principal component of change (β = –0.009 [±0.01] year, t(p) = –0.7 (.46), r2 = 0.001). We also found no significant relationship when brain age delta was computed with the LASSO algorithm or in the Lifebrain sample (β = –0.02 [±0.01] year, t(p) = −1.7 (0.09), r2 = 0.002; β = 0.007 [±0.006] year, t(p) = 1.3 (0.2), r2 = 0.001). In contrast, brain age deltalong was associated with a principal component of change in the UK Biobank dataset as well as in both replication analyses (all tests p<0.001). See Figure 2—figure supplement 2 for a visual representation. For specific features, cross-sectional brain age delta was significantly related to change – in the expected direction – of features capturing lateral ventricle expansion and white matter hypointensities (p<0.05 Bonferroni-corrected). Brain age deltalong related to change in 45 of the features pertaining to four different modalities. The results were replicated both using the LASSO algorithm and the Lifebrain dataset (Figure 2—figure supplement 3 and Supplementary file 4).
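
A composite measure of change of this kind can be illustrated with a small sketch that extracts the first principal component from per-feature change values. The data are simulated, power iteration is used here only to keep the example dependency-free, and the names are hypothetical. The preprocessing applied in the study is described in Materials and methods and is not reproduced.

```python
import math
import random


def zscore(values):
    """Center a feature and scale it to unit sample standard deviation."""
    m = sum(values) / len(values)
    sd = math.sqrt(sum((v - m) ** 2 for v in values) / (len(values) - 1))
    return [(v - m) / sd for v in values]


def first_pc_scores(rows, n_iter=200):
    """Participant scores on the first principal component of z-scored features."""
    cols = [zscore(list(c)) for c in zip(*rows)]
    p, n = len(cols), len(cols[0])
    # Feature-by-feature correlation matrix.
    corr = [[sum(a * b for a, b in zip(ci, cj)) / (n - 1) for cj in cols]
            for ci in cols]
    w = [1.0 / math.sqrt(p)] * p
    for _ in range(n_iter):  # power iteration towards the leading eigenvector
        w = [sum(corr[i][j] * w[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(v * v for v in w))
        w = [v / norm for v in w]
    return [sum(w[j] * cols[j][i] for j in range(p)) for i in range(n)]


def pearson(x, y):
    """Pearson correlation between two equally long sequences."""
    zx, zy = zscore(x), zscore(y)
    return sum(a * b for a, b in zip(zx, zy)) / (len(x) - 1)


# Simulated annual change in six features driven by one shared process.
rng = random.Random(1)
latent_change = [rng.gauss(0, 1) for _ in range(300)]
feature_change = [[0.8 * g + rng.gauss(0, 0.6) for _ in range(6)]
                  for g in latent_change]
scores = first_pc_scores(feature_change)
r = pearson(scores, latent_change)
```

The sign of a principal component is arbitrary, so its direction has to be fixed by convention before scores are interpreted as decline or preservation.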

Finally, we estimated rate-of-aging effects from cross-sectional data by estimating how the size of delta scales with age, as defined in Smith et al., 2019. The scaling of brain age delta (δ) across each dataset’s age range was 0.14 and 0.09 for the UK Biobank and the Lifebrain datasets, respectively. This corresponds to an increase in the spread of brain age delta of |δ| = 0.38 and 0.37 years – when moving from youngest to oldest – in the UK Biobank and the Lifebrain datasets, suggesting that brain age delta only modestly reflects rate of aging effects.
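
To put these numbers in context, the sketch below assumes a simple multiplicative model in which delta equals a baseline delta times (1 + scaling × a), with a running from 0 at the youngest to 1 at the oldest age. This form is an assumed simplification of the cited approach, and the function name is hypothetical. Under that assumption, the reported scaling and spread increase imply the baseline spread against which the increase can be judged.

```python
def implied_baseline_spread(scaling, spread_increase):
    """Baseline spread implied by delta = delta0 * (1 + scaling * a), a in [0, 1].

    Under this assumed model the spread grows by scaling * baseline spread
    between the youngest (a = 0) and oldest (a = 1) participants.
    """
    return spread_increase / scaling


uk_biobank = implied_baseline_spread(0.14, 0.38)  # about 2.7 years
lifebrain = implied_baseline_spread(0.09, 0.37)   # about 4.1 years
```

Read this way, the age-related widening amounts to roughly a seventh to a tenth of the spread already present at the lower bound of each sample.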

Brain age delta is associated with congenital factors on brain structure

Next, we tested whether birth weight was associated with brain age delta or change in brain age delta. Linear mixed models were used to fit time (from baseline; years), birth weight, and its interaction on brain age delta using age at baseline, sex, scanning site, and eICV as covariates. Birth weight was significantly related to brain age delta (β = –0.70 [±0.30] year/kg, t(p) = −2.3 (0.02), r2 = 0.009, Figure 3a), but not to delta change (β = 0.02 [±0.09] year/kg, t(p) = 0.3 (0.79), Figure 3c). Birth weights were limited to normal variations at full term (from 2.5 to 4.5 kg; n = 770 unique individuals) but see Figure 3—figure supplement 1 for results with varying cutoffs. The results were not affected by excluding individuals being part of multiple births (p=0.02) and were replicated using the LASSO approach (β = –0.79 [±0.29] year/kg, t(p) = −2.8 (0.006), r2 = 0.009, Figure 3b and d).
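
The logic of this analysis can be illustrated with a simplified two-step version: regress baseline delta on birth weight (the main effect) and regress the annual change in delta on birth weight (the interaction with time). This is not the linear mixed model used in the study. Covariates are omitted, the data are simulated under an early-life account in which birth weight shifts the level of delta but not its change, and all names are hypothetical.

```python
import random


def ols_slope(x, y):
    """Slope of the least-squares regression of y on x."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    return sxy / sxx


rng = random.Random(2)
n = 5000
birth_weight = [rng.uniform(2.5, 4.5) for _ in range(n)]  # kg

# Simulated early-life scenario: a stable offset in delta, no effect on change.
baseline_delta = [-0.7 * (bw - 3.5) + rng.gauss(0, 3) for bw in birth_weight]
annual_change = [rng.gauss(0, 0.5) for _ in range(n)]

main_effect = ols_slope(birth_weight, baseline_delta)  # near -0.7 year/kg
interaction = ols_slope(birth_weight, annual_change)   # near 0
```

A brain aging account would instead predict a non-zero slope in the second regression, since birth weight would then have to relate to how fast delta changes.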

Figure 3. Relationship between cross-sectional brain age delta and birth weight.

(a) Main effect of birth weight on brain age delta using the UK Biobank dataset and boosting gradient (n = 770, p=0.02, r2 = 0.009). (b) This effect was replicated using a different training algorithm (LASSO) (n = 770, p=0.005, r2 = 0.009). Relationship between longitudinal change in brain age delta and birth weight was not significant either (c) in the main test or (d) in the LASSO replication analysis (p>0.5). Note that we used delta at time point 1 to illustrate the main effect of birth weight at time 0 and brain age deltalong to represent the birth weight × time interaction of the linear mixed models. Confidence intervals (CIs) represent 99.9% CI for the fit. XGB = boosting gradient as implemented in XGBoost.

Figure 3—figure supplement 1. Robust effects of birth weight on brain age delta.

β estimates showing the relationship between brain age delta and birth weight with variable minimum and maximum birth weight exclusion thresholds. Note the negative β estimates irrespective of the minimum and maximum self-reported birth weight thresholds.

Finally, we tested whether PGS-BA related to brain age delta and change in brain age delta (n = 1339). PGS-BA was computed using a mixture-normal model based on a genome-wide association study (GWAS) of the brain age delta phenotype in the UK Biobank training dataset. To test the association, linear mixed models were used as above along with the top 10 genetic PCs to account for population structure. PGS-BA was positively associated with brain age delta (β = 0.54 [±0.09] year, t(p) = 9.4 (<0.001), r2 = 0.02, Figure 4a) and negatively associated with brain age delta change (β = –0.06 [±0.03] year, t(p) = −2.4 (0.02), Figure 4c) in the independent test dataset. Likewise, PGS-BA was associated with brain age delta derived from the LASSO algorithm (β = 0.53 [±0.09] year, t(p) = 10.4 (<0.001), r2 = 0.02, Figure 4b) but not with brain age delta change (β = –0.001 [±0.02] year, t(p) = 0.0 (1.0), Figure 4d). See Figure 4—figure supplement 1 for GWAS results. The association between PGS-BA and brain age delta remained significant when using as covariates the top 10 genetic components derived from the full UK Biobank sample (p<0.001 in both analyses).
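
For readers unfamiliar with polygenic scores, the final scoring step can be sketched as a weighted sum of effect-allele dosages. The sketch does not implement the mixture-normal model that produced the weights in this study. The variant names, weights, and genotypes below are hypothetical.

```python
def polygenic_score(dosages, weights):
    """Weighted sum of effect-allele dosages (0, 1, or 2 copies per variant).

    Variants without a weight are ignored.
    """
    return sum(weights[v] * d for v, d in dosages.items() if v in weights)


# Hypothetical per-variant weights, e.g. shrunken GWAS effect sizes.
weights = {"variant_a": 0.12, "variant_b": -0.05, "variant_c": 0.30}

# Hypothetical genotype of one participant.
person = {"variant_a": 2, "variant_b": 1, "variant_c": 0}
score = polygenic_score(person, weights)  # 0.12 * 2 - 0.05 * 1 + 0.30 * 0
```

Because genotypes are fixed at conception, a score of this kind can only index influences that are present from early life onward.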

Figure 4. Relationship between cross-sectional brain age delta and polygenic scores of brain age delta (PGS-BA).

(a) Main effect of PGS-BA on brain age delta using the UK Biobank dataset and boosting gradient (n = 1339, p<0.001, r2 = 0.02). (b) This effect was replicated using a different training algorithm (LASSO) (n = 1339, p<0.001, r2 = 0.02). (c) We found a negative association between longitudinal change in brain age delta and PGS-BA (p=0.02; higher genetic liability to brain age related to negative change in brain age delta), which was not found (d) in the LASSO replication analysis (p=1.0). Note that we used delta at time point 1 to illustrate the main effect of PGS-BA at time 0 and brain age deltalong to represent the PGS-BA × time interaction of the linear mixed models. Confidence intervals (CIs) represent 99.9% CI for the fit. XGB = boosting gradient as implemented in XGBoost.

Figure 4—figure supplement 1. Brain age delta genome-wide association study (GWAS).

(a) Manhattan plot of the GWAS results for the training set on brain age delta (38,163 individuals). The horizontal line represents the threshold for genome-wide significance. (b) Quantile-quantile (QQ) plot illustrating the deviation of the observed p-values from the null hypothesis.

Discussion

Altogether, these findings do not support the claim that individual variation in the cross-sectional brain age metric captures across-subject differences in the ongoing rate of brain aging. Rather, brain age seems to reflect early-life influences on brain structure, and only to a very modest degree reflects actual rate of brain change in middle and old adulthood. A lack of relationship between brain age and rate of brain aging could potentially be explained – although not investigated in the present study – by the effect of circumscribed events such as isolated insults or detrimental lifestyles that occurred in the past, resulting in higher, but not accelerating, brain age. Yet, variations in brain age could equally reflect congenital and early-life differences and show lifelong stability. Cross-sectional brain age studies are ill-suited to disentangle these sources of variation but are often interpreted in line with the former. This assumes that variation in brain age largely results from the accumulation of damage and insults during the lifespan, with similar starting points for everyone. An exception is Elliott et al., 2019, who found that middle-aged individuals with higher brain age already exhibited poorer cognitive function and brain health at age 3 years. This fits a robust corpus of literature showing effects of lifelong, stable influences as indexed by childhood IQ (Karama et al., 2014), genetics (Walhovd et al., 2020), and neonatal characteristics (Walhovd et al., 2016) on brain and cognitive variation in old age.

It has been argued that at a population level brain age captures modest rate of aging effects because brain age delta spreads with increasing age (Smith et al., 2019). Here, we found a similar degree of delta spreading in our brain age metrics. Likewise, our secondary analyses suggested brain age related to change in a few specific neuroimaging features, that is, ventricular expansion and white matter hypointensities, though not to any composite score. Thus, both results are compatible and converge towards brain age as a real but relatively modest metric for capturing ongoing brain change. Most interindividual variation in brain age delta instead originates before the sample lower bound (⪝ 18 and 45 years for the Lifebrain and UK Biobank datasets). Also, associations of brain age with other bodily markers of aging or with cognitive decline have yielded mixed support for cross-sectional brain age as a marker of individual differences in brain aging (Cole et al., 2018; Elliott et al., 2019; Franke and Gaser, 2012). Other multivariate approaches might be better equipped for capturing the dynamics of the aging brain. Using independent component analysis, a recent study found that – compared to a single brain age score – distinct modes of multimodal brain variation better reflect both the genetic make-up and ongoing aging effects, with a subset of modes showing significant spreading of delta with age (Smith et al., 2020). The degree to which brain age reflects ongoing effects of aging likely depends on the specific features, modalities, and algorithms employed, and is constrained by model properties such as prediction accuracy and homoscedasticity. Yet, without longitudinal imaging, one should not interpret brain age as accelerated aging.

Our results align with theoretical claims and empirical observations that covariance structures capturing differences between individuals do not necessarily generalize to covariance structures within individuals (Molenaar, 2004; Schmiedek et al., 2020). From a measurement theory perspective, our results suggest that cross-sectional brain age has low validity as an index of brain aging – despite having high reliability (Franke and Gaser, 2012) – as only a small portion of variance is associated with the trait of interest alone (Zuo et al., 2019). Most variance is rather associated with other factors that vary systematically across individuals, some of which are already present at birth.

The results further showed that birth weight, which reflects differences in genetic propensities and prenatal environment (Gielen et al., 2008), explained a modest portion of the variance in brain age. Subtle variations in birth weight are associated with brain structure early in life and present throughout the lifespan (Walhovd et al., 2016). This association should be considered as proof of concept that the metric of brain age reflects the past more than presently ongoing events in the morphological structure of the brain. This was confirmed by the consistent association between PGS-BA and brain age delta but not with brain age delta change. Since PGS-BA was computed based on cross-sectional brain age delta, this relationship may not be surprising, but still suggests a different genetic foundation for longitudinal brain age. These findings link with evidence that brain development is strongly influenced by a genetic architecture that, in interaction with environmental factors, leads to substantial, long-lasting effects on brain structure. By contrast, aging mechanisms seem to be more related to limitations of maintenance and repair functions and have a more stochastic nature (Kirkwood, 2005).

Limitations and technical considerations

We used large training datasets to estimate the brain age models and the polygenic scores, leading to robust PGS-BA and brain age estimates. Self-reported birth weight (Nilsen et al., 2017) and cross-sectional brain age (Franke and Gaser, 2012) are highly reliable measures; thus, our analyses are well-powered to detect small effects (Zuo et al., 2019). The reliability of brain age deltalong is, however, unknown. Strictly speaking, brain age delta is a prediction error from a model that maximizes the prediction of age in cross-sectional data and thus partially also reflects noise. Given that deltalong is estimated as the difference between two deltacross estimates, it will hence have higher noise than the cross-sectional estimates, reducing the power in identifying potential associations between longitudinal and cross-sectional delta. This may be compounded by the relatively short interscan interval in the UK Biobank (≈2 years). However, our sample size (n > 1200) ensures that the tests performed in this study are well-powered to detect small effects, even if deltalong has mediocre reliability (Zuo et al., 2019). Further, replication of our null results in the Lifebrain sample with more observations and longer follow-up times reduces the likelihood of noise as the main factor behind the lack of relationship. Furthermore, previous studies have found that changes in brain age are partly heritable (Brouwer et al., 2021) and relate to, for instance, cardiometabolic risk factors (Beck, 2021), suggesting that it captures biologically relevant signals (i.e., has predictive validity), although with substantially different origins from cross-sectional brain age. Although the reliability of deltalong needs to be formally tested, the null relationship between deltacross and deltalong does not seem to be a result of a low-powered test.
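
The concern about noise in difference scores can be made concrete with the classical formula for the reliability of a difference between two measures that share the same variance and reliability. In the sketch below, the correlation of 0.81 between baseline and follow-up delta is taken from the Results, whereas the cross-sectional reliability of 0.90 is a hypothetical value chosen for illustration. The function name is also hypothetical.

```python
def difference_score_reliability(reliability, correlation):
    """Reliability of (x2 - x1) for two measures with equal variance.

    reliability: reliability of each single measurement
    correlation: observed correlation between the two measurements
    """
    return (reliability - correlation) / (1.0 - correlation)


# 0.81 is the reported baseline/follow-up correlation; 0.90 is assumed.
example = difference_score_reliability(0.90, 0.81)  # about 0.47
```

Under these assumed values, less than half of the variance in the change score would be true change, which is why sample size and replication carry much of the weight in the argument above.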

We speculate that our results partially generalize to other normative and residual-based modeling approaches, as well as to developmental samples. There is considerable evidence in the literature that birth weight and genetic risk for neurodegenerative conditions affect brain structure from early life (Raznahan et al., 2012; Walhovd et al., 2020; Walhovd et al., 2016). Brain age models are related to other models such as normative brain charts (Bethlehem, 2021; Dong et al., 2020) – akin to normative anthropometric charts – the main difference being that brain age models predict, rather than control for, age (Marquand et al., 2019). Both types of models produce normative brain scores, which are uncorrelated with age (Butler et al., 2021). Thus, caution is required when interpreting these scores as indices of brain aging without availability of longitudinal data. Developmental samples may, however, reflect slightly stronger relationships between cross-sectional brain age delta and ongoing brain change as brain changes during early-life development typically occur at a faster pace than in middle or later life. Similarly, for specific disease groups such as Alzheimer’s disease patients (Franke and Gaser, 2012), interindividual variation in brain age might reflect to a greater extent prevailing loss of brain structure. Moreover, the variance associated with factors other than ongoing development/aging might be more limited in early than later age since influences leading to interindividual variations in brain structure have a shorter span to accumulate. That is, as time from birth increases, chronological age becomes a weaker marker of individual development.

Finally, many genetic and environmental factors relate to lifelong stable differences in brain age beyond birth weight and PGS-BA. However, both variables are congenital and show stable associations through the lifespan (Raznahan et al., 2012; Walhovd et al., 2020) without strong evidence that they relate to brain change after adolescence. Thus, birth weight and PGS-BA are paradigmatic for showing how interindividual differences in brain age emerge early in life. The present study does not provide a systematic understanding of these influences but presents a framework for interpreting the impact such measures may exert on age-related phenotypes.

Conclusions

The results call for caution in interpreting brain-derived indices of aging based on cross-sectional MRI data and underscore the need to rely on longitudinal data whenever the goal is to understand the trajectories of brain and cognition in aging.

Materials and methods

Key resources table.

Reagent type (species) or resource | Designation | Source or reference | Identifiers | Additional information
Software, algorithm | R Project for Statistical Computing | https://www.r-project.org/ | RRID:SCR_001905 | Version 3.6.3
Software, algorithm | FreeSurfer | https://surfer.nmr.mgh.harvard.edu/ | RRID:SCR_001847 | Version 6.0

Participants and samples

The main sample was drawn from the UK Biobank neuroimaging branch (https://www.ukbiobank.ac.uk/ Miller et al., 2016). 38,682 individuals had MRI available at a single time point and were used as the training dataset. 1372 individuals had longitudinal data and were used as the test dataset. The present analyses were conducted under data application number 32048. The Lifebrain dataset (Walhovd et al., 2018) included datasets from five different major European Lifespan cohorts: the Center for Lifespan Changes in Brain and Cognition cohort (LCBC, Oslo; Walhovd et al., 2016), the Cambridge Center for Aging and Neuroscience study (Cam-CAN; Shafto et al., 2014; Taylor et al., 2017), the Berlin Study of Aging-II (Base-II; Bertram et al., 2014), the University of Barcelona cohort (UB; Rajaram et al., 2016; Vidal-Piñeiro et al., 2014), and the BETULA project (Umeå; Nilsson et al., 2010). Furthermore, we included data from the Australian Imaging Biomarkers and Lifestyle flagship study of ageing (AIBL; Ellis et al., 2009). In addition to cohort-specific inclusion and exclusion criteria, individuals aged <18 years, or with evidence of mild cognitive impairment, or Alzheimer’s disease were excluded from the analyses. 1792 individuals with only one available scan were used for the Lifebrain training dataset. 1500 individuals with available follow-up of >0.4 years were included in the test dataset. Individuals had between 2 and 8 available scans each. Sample demographics for the UK Biobank and the Lifebrain samples are provided in Supplementary file 3. See also Figure 1c and Figure 1—figure supplement 1 for a visual representation of the age distribution in the UK Biobank and the Lifebrain datasets. 
UK Biobank (North West Multi-Center Research Ethics Committee [MREC]; see also https://www.ukbiobank.ac.uk/the-ethics-and-governance-council) and the different cohorts of the Lifebrain replication dataset (Supplementary file 5) have ethical approval from the respective regional ethics committees. All participants provided informed consent.

MRI acquisition and preprocessing

See https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf for details on the UK Biobank T1w MRI acquisition. UK Biobank and Lifebrain MRI data were acquired with 3 and 10 different scanners, respectively. T1w MRI acquisition parameters for both the Lifebrain and the UK Biobank are summarized in Supplementary file 6.

We used summary regional and global metrics derived from T1w data. For UK Biobank, we used the imaging-derived phenotypes developed centrally by UK Biobank researchers (Miller et al., 2016) and distributed via the data showcase (http://biobank.ctsu.ox.ac.uk/crystal/index.cgi). See preprocessing details in https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf. This procedure yielded 365 structural MRI features, partitioned into 68 features each of cortical thickness, area, and gray-white matter contrast, 66 features of cortical volume, 41 features of subcortical intensity, and 54 features of subcortical volume. See the list of features in Supplementary files 1 and 2. Lifebrain data were processed on the Colossus processing cluster, University of Oslo. Similar to the UK Biobank pipeline, we used the fully automated longitudinal FreeSurfer v.6.0 pipeline (Reuter et al., 2012) for cortical reconstruction and subcortical segmentation of the structural T1w data (http://surfer.nmr.mgh.harvard.edu/fswiki; Dale et al., 1999; Fischl et al., 1999; Fischl and Dale, 2000) and used similar atlases for structural segmentation and feature extraction.

Birth weight

We used birth weight (kg) from the UK Biobank (field #20022). Participants were asked to enter their birth weight at the initial assessment visit, the first repeat assessment visit, or the first imaging visit. In the case of multiple birth weight instances, we used the latest available input. n = 894 participants from the test dataset had available data on birth weight. The main analysis was constrained to normal variations in birth weight between 2.5 and 4.5 kg (n = 770; Walhovd et al., 2012) due to lower reliability of extreme scores and to tentatively remove participants potentially with severe medical complications associated with prematurity.
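The selection rule described above can be sketched as follows. This is a minimal illustration, not the study code: the function names and participant records are hypothetical, and treating the 2.5 and 4.5 kg limits as inclusive is an assumption.

```python
from typing import Optional, Sequence

NORMAL_RANGE_KG = (2.5, 4.5)  # limits stated in the text

def latest_birth_weight(reports: Sequence[Optional[float]]) -> Optional[float]:
    """Return the most recent non-missing report; reports are in visit order."""
    for value in reversed(reports):
        if value is not None:
            return value
    return None

def in_normal_range(weight_kg: Optional[float], limits=NORMAL_RANGE_KG) -> bool:
    """True when a birth weight is available and within the inclusive limits."""
    low, high = limits
    return weight_kg is not None and low <= weight_kg <= high

# Hypothetical participants: (initial visit, repeat visit, imaging visit).
participants = {
    "p1": (3.4, None, 3.5),     # two reports: the latest one is used
    "p2": (None, None, None),   # no birth weight available
    "p3": (1.9, None, None),    # outside the normal-variation range
}
selected = {
    pid: latest_birth_weight(reports)
    for pid, reports in participants.items()
    if in_normal_range(latest_birth_weight(reports))
}
```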

Genetic preprocessing

Detailed information on genotyping, imputation, and quality control was published by Bycroft et al., 2018. For genetic analyses, we only included participants with both genotypes and MRI scans. Following the recommendations from the UK Biobank website, we excluded individuals with failed genotyping, who had abnormal heterozygosity status, or who withdrew their consent. We also removed participants who were genetically related, up to the third degree, to at least one other participant, as estimated by the kinship coefficients implemented in PLINK (Chang et al., 2015). For the GWAS, we used 38,163 individuals from the training dataset. Polygenic risk scores were computed using the test dataset consisting of 1339 individuals with longitudinal MRI.

Genome-wide association study (GWAS)

We performed GWAS analysis on the training dataset and the brain age delta-semi-corrected phenotype using the imputed UK Biobank genotypes. To control for subtle effects of population stratification in the dataset, we computed the top 10 principal components (PCs) using the PLINK command --pca on a decorrelated set of autosomal single-nucleotide polymorphisms (SNPs). The set of SNPs (n = 101,797) was generated using the PLINK options --maf 0.05, --hwe 1e-6, and --indep-pairwise 100 50 0.1. The --glm function from PLINK was used to perform GWAS on about 9 million autosomal SNPs, including age, sex, and the top 10 PCs as covariates. See Manhattan and quantile-quantile (QQ) plots in Figure 4—figure supplement 1. Note that our results corroborated the same association region reported in Jonsson et al., 2019 with a smaller sample.

Polygenic scores (PGS)

The GWAS results for the training dataset were used to compute PGS (PGS-BA) in the independent test dataset (n = 1339 participants). We used the recently developed method PRS-CS (Ge et al., 2019) to estimate the posterior effect sizes of SNPs that were shown to have high quality in the HapMap data (International HapMap 3 Consortium et al., 2010). Rather than estimating the polygenicity of brain age delta from our data, we assumed a highly polygenic architecture for brain age delta by setting the parameter --phi = 0.01 (Boyle et al., 2017). The remaining parameters of PRS-CS were set to the default values. PGS was based on 654,725 SNPs and was computed on the independent test data using the --score function from PLINK. SNPs were aligned with HapMap 3 SNPs (autosomes only, as provided by PRS-CS) and posterior effects were estimated. We also computed the population structure PCs in the test dataset using the same procedure as in the training dataset.
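Conceptually, a polygenic score is a weighted sum of effect-allele dosages, with the posterior SNP effect sizes as weights. The sketch below illustrates only that arithmetic. The SNP identifiers, weights, and dosages are hypothetical, skipping SNPs absent from the genotype data is an assumption, and the scaling that PLINK applies to its reported score is not reproduced here.

```python
from typing import Mapping

def polygenic_score(dosages: Mapping[str, float], weights: Mapping[str, float]) -> float:
    """Sum of effect-allele dosages weighted by posterior SNP effect sizes.

    SNPs missing from the genotype data are skipped (an assumption of this sketch).
    """
    return sum(beta * dosages[snp] for snp, beta in weights.items() if snp in dosages)

# Hypothetical posterior effects and one participant's dosages (0, 1, or 2 copies).
weights = {"rs1": 0.02, "rs2": -0.01, "rs3": 0.005}
dosages = {"rs1": 2, "rs2": 1, "rs3": 0}
score = polygenic_score(dosages, weights)  # 2*0.02 + 1*(-0.01) + 0*0.005
```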

Statistical analyses

All statistical analyses were run with R version 3.6.3 https://www.r-project.org/. We used the UK Biobank as the main sample and the Lifebrain cohort for independent replication. The main description refers to the UK Biobank pipeline, though Lifebrain replication followed identical steps unless otherwise stated. For replication across machine learning pipelines, we used a LASSO regression approach for age prediction, adapted from (Cole, 2020b). See more details in Cole, 2020a. The correlation between LASSO-based and Gradient Boosting-based brain age deltas was 0.80.

Brain age prediction

We used machine learning to estimate each individual’s brain age based on a set of regional and global features extracted from T1w sequences. We estimated brain age using gradient tree boosting (https://xgboost.readthedocs.io). We used participants with only one MRI scan for the training dataset (n = 38,682) and participants with longitudinal data as test dataset (n = 1372). All variables were scaled prior to any analyses using the training dataset metrics as reference.
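Scaling with the training dataset as reference means that the centering and scaling parameters are estimated once on the training data and then reused unchanged for the test data. A minimal sketch follows, assuming that scaling means standardization to zero mean and unit standard deviation (the text does not specify the exact transform). The function names and feature values are hypothetical.

```python
from statistics import mean, stdev
from typing import List, Tuple

def fit_scaler(train_values: List[float]) -> Tuple[float, float]:
    """Mean and sample SD of one feature, estimated on the training set only."""
    return mean(train_values), stdev(train_values)

def apply_scaler(values: List[float], params: Tuple[float, float]) -> List[float]:
    """Standardize values with parameters learned from the training set."""
    center, scale = params
    return [(v - center) / scale for v in values]

# Hypothetical values of a single feature.
train_feature = [2.0, 4.0, 6.0]
test_feature = [4.0, 8.0]
params = fit_scaler(train_feature)               # mean 4.0, SD 2.0
test_scaled = apply_scaler(test_feature, params)
```

Because the test data never contribute to the parameters, the scaled test features need not have zero mean.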

The model was optimized in the training set using a 10-fold cross-validation randomized hyperparameter search (50 iterations). The hyperparameters explored were number of estimators [seq(100:600, by = 50)], learning rate (0.01, 0.05, 0.1, 0.15, 0.2), maximum depth [seq(2:8, by = 1)], gamma regularization parameter [seq(0.5:1.5, by = 0.5)], and min child weight [seq(1:4, by = 1)]. The remaining parameters were left at their defaults. The optimal parameters were number of estimators = 500, learning rate = 0.1, maximum depth = 5, gamma = 1, and min child weight = 4. With these parameters, the model explained r2 = 0.68 of the variance in chronological age, with MAE = 3.41 and RMSE = 4.29. See visual representation in Figure 1f.
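The sampling step of a randomized search over this space can be sketched with the standard library alone. The sketch assumes that each listed range expands to evenly spaced values including both endpoints. The parameter names follow common XGBoost naming but are used here only as labels, and the function name and seed are hypothetical. Scoring each candidate with 10-fold cross-validation is not shown.

```python
import itertools
import random

# Search space as listed in the text (ranges expanded into explicit values).
SEARCH_SPACE = {
    "n_estimators": list(range(100, 601, 50)),
    "learning_rate": [0.01, 0.05, 0.1, 0.15, 0.2],
    "max_depth": list(range(2, 9)),
    "gamma": [0.5, 1.0, 1.5],
    "min_child_weight": list(range(1, 5)),
}

def sample_configurations(space, n_iter, seed=0):
    """Draw n_iter distinct hyperparameter combinations uniformly at random."""
    names = list(space)
    grid = list(itertools.product(*(space[name] for name in names)))
    rng = random.Random(seed)
    return [dict(zip(names, combo)) for combo in rng.sample(grid, n_iter)]

candidates = sample_configurations(SEARCH_SPACE, n_iter=50)

# Size of the full grid, for comparison with the 50 sampled candidates.
grid_size = 1
for values in SEARCH_SPACE.values():
    grid_size *= len(values)
```

Under this reading of the ranges, the 50 iterations cover only a small fraction of the full grid, which is the usual motivation for a randomized rather than exhaustive search.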

Next, we recomputed the machine learning model using the entire training dataset and the optimal hyperparameters and used it to predict brain age for the test dataset (Figure 1e). These metrics are similar or better than other brain age models using UK Biobank MRI data (Cole, 2020a; de Lange et al., 2019) and the cross-validation diagnostics. We used GAM to correct for the brain age bias estimation (Smith et al., 2019); r = –0.54 for the test dataset. Note that we used GAM fittings as estimated in the training dataset so delta values in the test dataset are not centered to 0. Brain age delta was estimated as the GAM residual. The correlation between brain age delta corrected based on the training vs. the test fit was r > 0.99. Also, GAM-based bias correction led to similar brain age delta estimations to linear and quadratic-based corrections (r > 0.99). The diagnostics for the LASSO-based model were as follows: variance explained (r2) = 0.69/0.69; MAE = 3.36/3.28; RMSE = 4.21/4.04; age bias = –0.56/–0.52 for the training and predicted datasets. See representation of the brain age prediction in Figure 2—figure supplement 2.
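The bias-correction logic can be illustrated with a straight-line fit standing in for the GAM; the text reports that linear and GAM-based corrections gave nearly identical deltas (r > 0.99). In this sketch, predicted brain age is regressed on chronological age in the training data, and delta is the residual from that fit. Fitting in this direction is an assumption of the sketch, and all names and values are hypothetical.

```python
from typing import List, Tuple

def fit_line(x: List[float], y: List[float]) -> Tuple[float, float]:
    """Ordinary least squares intercept and slope of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope

def brain_age_delta(age: List[float], predicted: List[float],
                    fit: Tuple[float, float]) -> List[float]:
    """Delta = predicted brain age minus the value expected at that age."""
    intercept, slope = fit
    return [p - (intercept + slope * a) for a, p in zip(age, predicted)]

# Hypothetical training predictions that regress toward the mean age.
train_age = [50.0, 60.0, 70.0, 80.0]
train_pred = [56.0, 62.0, 68.0, 74.0]
fit = fit_line(train_age, train_pred)  # estimated on the training data only

# The training fit is reused for test participants, so test deltas
# are not forced to be centered on 0.
test_delta = brain_age_delta([55.0, 75.0], [61.0, 69.0], fit)
```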

Higher level analysis

Relationship between cross-sectional and longitudinal brain age

For each participant, we computed the mean brain age delta across the two MRI time points and the yearly rate of change (brain age deltalong). We selected mean, instead of baseline brain age delta, to avoid statistical dependency between both indices (Rogosa and Willett, 1985; Wainer, 2000). Brain age deltalong was fitted by mean brain age delta using a linear regression model, which accounted for age, sex, site, and eICV. We used mean eICV across both time points.
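For two time points, the two indices reduce to simple arithmetic, sketched below. The function name and the example values are hypothetical.

```python
from typing import Tuple

def mean_and_rate(delta_t1: float, delta_t2: float,
                  age_t1: float, age_t2: float) -> Tuple[float, float]:
    """Mean brain age delta and yearly rate of change across two scans."""
    interval = age_t2 - age_t1
    if interval <= 0:
        raise ValueError("the second scan must be later than the first")
    mean_delta = (delta_t1 + delta_t2) / 2.0
    delta_long = (delta_t2 - delta_t1) / interval
    return mean_delta, delta_long

# Hypothetical participant scanned at ages 60 and 62.
mean_delta, delta_long = mean_and_rate(1.0, 2.0, 60.0, 62.0)
```

Using the mean rather than the baseline value matters because the baseline delta enters the change score with a negative sign, so noise in the baseline alone would induce a spurious negative association between baseline and change.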

Relationship between brain age delta and change in brain features

For each participant, we computed the yearly rate of change in all the raw neuroimaging features and tested whether change was significantly different from 0 (one-sample t-test, p<0.05, Bonferroni-corrected; Figure 2—figure supplement 3, Supplementary file 4). Features with significant change over time were fed into a PC analysis (uncentered). The first component, explaining ≃20% of the variance both in the UK Biobank and the Lifebrain datasets, was selected for further analysis. Although it did not qualitatively affect the results, we removed two and three extreme outliers from the UK Biobank and Lifebrain datasets, respectively (score >10). See Supplementary file 4 for component weights. Finally, we tested whether cross-sectional brain age delta and brain age deltalong predicted brain change, as quantified both by the first component and by change in each of the raw neuroimaging features (p<0.05, Bonferroni-corrected), using the same models described above.

Spreading of brain age delta with age

Further, we estimated the degree to which brain age delta reflects rate of aging using a cross-sectional model proposed by Smith et al., 2019, which estimates the scaling of brain age delta through the datasets’ age range. The scaling is estimated by λ in δ = δ₀(1 + λY₀), where δ is brain age delta, Y₀ is a linear mapping of chronological age into the range 0:1, and |δ₀| relates to brain age delta distribution in the youngest participants. The spread of brain age delta throughout the datasets’ age range can then be expressed as |δ₀|λ (years).
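A worked example of the scaling formula may help. The sketch below only evaluates the formula for given values; it does not estimate λ from data. The function names, the age range of 45 to 80 years, and the values of the baseline delta and λ are hypothetical.

```python
def scaled_age(age: float, age_min: float, age_max: float) -> float:
    """Linear mapping of chronological age into the range 0 to 1."""
    return (age - age_min) / (age_max - age_min)

def delta_at_age(delta0: float, lam: float, y0: float) -> float:
    """Brain age delta under the scaling model: delta0 * (1 + lam * y0)."""
    return delta0 * (1.0 + lam * y0)

def spread_years(delta0: float, lam: float) -> float:
    """Spread of delta across the full age range: |delta0| * lam."""
    return abs(delta0) * lam

# Hypothetical values: a baseline delta of 2 years and lam = 0.5.
delta0, lam = 2.0, 0.5
youngest = delta_at_age(delta0, lam, scaled_age(45.0, 45.0, 80.0))
oldest = delta_at_age(delta0, lam, scaled_age(80.0, 45.0, 80.0))
spread = spread_years(delta0, lam)
```

In this example a delta of 2 years in the youngest participants grows to 3 years in the oldest, a spread of 1 year; λ = 0 would mean that deltas do not fan out with age.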

Relationship between brain age PGS and cross-sectional and longitudinal brain age

This association was tested using linear mixed models with time from baseline (years), PGS-BA, and their interaction as predictors of brain age delta. Age at baseline, sex, site, eICV, and the 10 first PCs for population structure were used as covariates. The PCs of population structure were added to minimize false positives associated with any form of relatedness within the sample.

Effects of birth weight on brain age

Linear mixed models were used to fit time, birth weight, and their interaction on brain age delta, using age at baseline, sex, site, and eICV as covariates. We explored the consistency of the results by modifying the birth weight limits in a grid-like fashion [0.5, 2.7, 0.025] and [4.2, 6.5, 0.025] for minimum and maximum birth weight (Figure 3—figure supplement 1). Self-reported birth weight is a reliable estimate of actual birth weight. However, extreme values are either misestimated or reflect profound gestational abnormalities (Nilsen et al., 2017; Tehranifar et al., 2009).
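The grid of birth weight limits can be enumerated as sketched below, reading each bracketed triple as [start, stop, step] in kg; that reading is an assumption. Refitting the model for each pair of limits is not shown, and the function name is hypothetical.

```python
import itertools
from typing import List

def grid(start: float, stop: float, step: float) -> List[float]:
    """Evenly spaced values from start to stop (inclusive), rounded to grams."""
    n_steps = round((stop - start) / step)
    return [round(start + i * step, 3) for i in range(n_steps + 1)]

min_limits = grid(0.5, 2.7, 0.025)  # candidate minimum birth weights (kg)
max_limits = grid(4.2, 6.5, 0.025)  # candidate maximum birth weights (kg)

# Every (minimum, maximum) pair defines one subsample for a refitted model.
limit_pairs = list(itertools.product(min_limits, max_limits))
```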

Equivalence tests

Post-hoc equivalence tests were performed to test for the absence of a relationship between cross-sectional and brain age deltalong (Lakens et al., 2018). Specifically, we used inferiority tests to test whether a null hypothesis of an effect at least as large as Δ (in years/delta) could be rejected. We reran the three main models assessing a relationship between cross-sectional and longitudinal brain age delta (UK Biobank trained with gradient boosting, UK Biobank trained with LASSO, and Lifebrain trained with gradient boosting) varying the right-hand-side test (Δ) [–0.02, 0.05, 0.001] (p<0.05, one-tailed; Figure 2—figure supplement 1).
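The logic of the inferiority test can be sketched as a one-tailed test of the estimate against each candidate bound. The sketch uses a normal approximation for the test statistic, which is a simplification swapped in for illustration; the regression and mixed models in the study would rely on their own reference distributions. The estimate, standard error, and function names are hypothetical.

```python
from statistics import NormalDist
from typing import List, Optional

def inferiority_p(estimate: float, se: float, bound: float) -> float:
    """One-tailed p-value for H0: true effect >= bound (normal approximation)."""
    z = (estimate - bound) / se
    return NormalDist().cdf(z)

def smallest_rejected_bound(estimate: float, se: float, bounds: List[float],
                            alpha: float = 0.05) -> Optional[float]:
    """Smallest bound for which an effect at least that large is rejected."""
    rejected = [b for b in bounds if inferiority_p(estimate, se, b) < alpha]
    return min(rejected) if rejected else None

# Candidate bounds from -0.02 to 0.05 in steps of 0.001, as in the text.
bounds = [round(-0.02 + i * 0.001, 3) for i in range(71)]

# Hypothetical slope estimate of 0 with a standard error of 0.01.
limit = smallest_rejected_bound(0.0, 0.01, bounds)
```

With these hypothetical inputs, effects of 0.017 years/delta or larger would be rejected at p<0.05 (one-tailed), while smaller effects could not be ruled out.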

Assumptions were checked for the main statistical tests using plot diagnostics. Variance explained for single terms refers to unique variance explained (UVE), which is defined as the difference in explained variance between the full model and the model without the term of interest. For linear mixed models, UVE was estimated as implemented in the MuMIn R package.

Lifebrain-specific steps

Features

The Lifebrain cohort included |N| = 372 features. It included eight new features compared to the UK Biobank dataset, whereas one feature was excluded (new features: left and right temporal pole area, volume, and thickness, cerebral white matter volume, and cortex volume; excluded feature: ventricle choroid). See age variance explained in each feature in Supplementary files 1 and 2 as estimated with GAMs.

Quality control

Prior to any analysis, we tentatively removed observations for which > 5% of the features fell above or below 5 SD from the sample mean. The application of this arbitrary high threshold led to the removal of 10 observations. We considered these MRI data to be extreme outliers and likely to be artifactual and/or contaminated by important sources of noise. Also, before brain prediction, we tentatively removed variance associated with the different scanners using generalized additive mixed models (GAMM) and controlling for age as a smooth factor and a subject identifier as random intercept. This correction was performed due to differences in age distribution by scanner and lack of across scanner calibration.
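The outlier rule (observations with more than 5% of features beyond 5 SD from the sample mean) can be sketched as follows. The function name and the toy data are hypothetical, and using the sample standard deviation is an assumption. The scanner correction with GAMMs is not shown.

```python
from statistics import mean, stdev
from typing import List

def flag_outlier_observations(data: List[List[float]], sd_limit: float = 5.0,
                              max_fraction: float = 0.05) -> List[int]:
    """Indices of observations with too many extreme feature values.

    data holds one list of feature values per observation. A value is extreme
    when it lies more than sd_limit SDs from that feature's sample mean.
    """
    n_features = len(data[0])
    columns = list(zip(*data))
    centers = [mean(col) for col in columns]
    scales = [stdev(col) for col in columns]
    flagged = []
    for index, row in enumerate(data):
        extreme = sum(
            1 for value, center, scale in zip(row, centers, scales)
            if scale > 0 and abs(value - center) > sd_limit * scale
        )
        if extreme / n_features > max_fraction:
            flagged.append(index)
    return flagged

# Toy data: 39 identical observations and one grossly deviant observation.
observations = [[0.0, 0.0] for _ in range(39)] + [[100.0, 100.0]]
outliers = flag_outlier_observations(observations)
```

Note that a single value can only exceed 5 sample SDs when the sample is reasonably large (roughly 28 observations or more), so the rule is meaningful only for datasets of realistic size.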

Hyperparameter search and model diagnostics

The optimal parameters for the Lifebrain replication sample were number of estimators = 600, learning rate = 0.05, maximum depth = 4, gamma = 1.5, and min child weight = 1. Using cross-validation, the model predicted r2 = 0.92 of the age variance with MAE = 4.75 and RMSE = 6.31. Brain age was underestimated in older age (bias r = –0.33).

Model prediction

The age variance explained by brain age was r2 = 0.90 with MAE = 4.68 and RMSE = 6.06. Brain age was underestimated in older age (bias r = –0.25; Figure 1—figure supplement 2).

Higher level analysis

For each individual, mean brain age delta was considered as the grand mean brain age delta across the different MRI time points. To compute brain age deltalong , we set for each participant a linear regression model with observations equal to the number of time points that fitted brain age delta by time since the initial visit. Slope indexed change in brain age delta/year. The relationship between mean and brain age deltalong was tested using linear mixed models controlling for age, sex, and eICV as fixed effects, and using a site identifier as a random intercept. Likewise, linear mixed models were used to test the relationship between brain age delta and change in brain features. Note that eICV was identical across time points as a result of being estimated through the longitudinal FreeSurfer pipeline. We could not obtain the required information on genetics and birth weight to replicate the analyses supporting the early-life account.
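With more than two time points, the per-participant change estimate is the slope of an ordinary least squares fit of brain age delta on time since the initial visit. A minimal sketch, with hypothetical names and values:

```python
from typing import List

def delta_slope(times: List[float], deltas: List[float]) -> float:
    """OLS slope of brain age delta on time since the initial visit (delta/year)."""
    n = len(times)
    if n < 2:
        raise ValueError("at least two time points are required")
    mean_t, mean_d = sum(times) / n, sum(deltas) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (d - mean_d) for t, d in zip(times, deltas))
    return sxy / sxx

# Hypothetical participant with three scans at 0, 1.5, and 3 years.
times = [0.0, 1.5, 3.0]
deltas = [1.0, 1.3, 1.6]
delta_long = delta_slope(times, deltas)  # change in delta per year
mean_delta = sum(deltas) / len(deltas)   # grand mean across time points
```

With exactly two time points, this slope equals the difference between the two deltas divided by the interval.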

Data and code availability

The raw data were gathered from the UK Biobank, the Lifebrain cohort, and the AIBL. Raw data requests are specific to each cohort. UK Biobank and AIBL data are available upon application to UK Biobank and at https://aibl.csiro.au upon corresponding approvals. For the Lifebrain cohorts, requests for raw MRI data should be submitted to the corresponding principal investigator. See contact details in Supplementary file 5. MRI data are not openly available as participants did not consent to share their data publicly. Access to data is available upon reasonable request and transfer agreements. Different sample agreements are required for each dataset. The code for the statistical analyses in this article is available alongside the article and will be available at https://github.com/LCBC-UiO/VidalPineiro_BrainAge. All analyses were performed in R 3.6.3. The scripts were run on the Colossus processing cluster, University of Oslo. UK Biobank’s data acquisition, MRI preprocessing, and feature generation pipelines are freely available (https://www.fmrib.ox.ac.uk/ukbiobank). For the Lifebrain cohorts, the image acquisition details are summarized in Supplementary file 6. MRI preprocessing and feature generation were performed with the freely available FreeSurfer software (https://surfer.nmr.mgh.harvard.edu/). For bash-sourcing scripts, please contact the corresponding author.

Acknowledgements

BASE-II has been supported by the German Federal Ministry of Education and Research under grant numbers 16SV5537/16SV5837/16SV5538/16SV5536K/01UW0808/01UW0706/01GL1716A/01GL1716B. Part of the computation was performed on the Norwegian high-performance computation resources, sigma2, through the project no. nn9769k. The Wellcome Centre for Integrative Neuroimaging is supported by core funding from award 203139/Z/16/Z from the Wellcome Trust. Data used in the preparation of this article were partially obtained from the AIBL funded by the Commonwealth Scientific and Industrial Research Organisation (CSIRO), which was made available at the ADNI database (http://www.loni.usc.edu/ADNI). UK Biobank is generously supported by its founding funders the Wellcome Trust and UK Medical Research Council, as well as the Department of Health, Scottish Government, the Northwest Regional Development Agency, British Heart Foundation and Cancer Research UK. The organization has over 150 dedicated members of staff, based in multiple locations across the UK.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Didac Vidal-Pineiro, Email: d.v.pineiro@psykologi.uio.no.

Juan Zhou, National University of Singapore, Singapore.

Christian Büchel, University Medical Center Hamburg-Eppendorf, Germany.

Funding Information

This paper was supported by the following grants:

  • H2020 European Research Council 732592 to Kristine Beate Walhovd.

  • H2020 European Research Council 283634 725025 to Anders Fjell.

  • H2020 European Research Council 313440 to Kristine Beate Walhovd.

  • Norges Forskningsråd to Anders Fjell.

  • Max Planck Institute for Dynamics of Complex Technical Systems Magdeburg to Andreas M Brandmaier.

  • María de Maeztu Unit of Excellence (Institute of Neurosciences,University of Barcelona) MDM-2017-0729 to Barbara Segura, Carme Junqué, David Bartres-Faz.

  • European Research Council 677804 to Simone Kühn.

  • UK Medical Research Council G1001354 to Sana Suri, Enikő Zsoldos.

  • Charitable Trust 1117747 to Enikő Zsoldos, Sana Suri.

  • Alzheimer’s Research UK 441 to Sana Suri.

  • NIHR Biomedical Research Centre, Oxford to Sana Suri.

  • Knut and Alice Wallenberg Foundation to Lars Nyberg.

  • ICREA Academia Award to David Bartres-Faz.

  • Norges Forskningsråd 324882 to Didac Vidal-Pineiro.

  • Medical Research Council SUAG/046 G101400 to Richard N Henson.

Additional information

Competing interests

No competing interests declared.

CAD: Is an employee of Vitas Ltd.

Author contributions

Conceptualization, Formal analysis, Visualization, Writing – original draft, Writing – review and editing.

Conceptualization, Formal analysis, Visualization, Writing – review and editing.

Data curation, Writing – review and editing.

Data curation, Resources, Software.

Data curation, Funding acquisition, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Funding acquisition, Methodology, Writing – review and editing.

Data curation, Resources, Writing – review and editing.

Data curation, Writing – review and editing.

Funding acquisition, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Formal analysis, Methodology, Writing – review and editing.

Conceptualization, Data curation, Funding acquisition, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Data curation, Methodology, Resources, Writing – review and editing.

Resources, Visualization, Writing – review and editing.

Conceptualization, Data curation, Funding acquisition, Writing – review and editing.

Formal analysis, Writing – review and editing.

Data curation, Writing – review and editing.

Conceptualization, Methodology, Writing – review and editing.

Formal analysis, Methodology, Writing – review and editing.

Data curation, Funding acquisition, Writing – review and editing.

Data curation, Writing – review and editing.

Conceptualization, Writing – review and editing.

Data curation, Writing – review and editing.

Conceptualization, Funding acquisition, Supervision, Writing – review and editing.

Conceptualization, Funding acquisition, Supervision, Writing – original draft, Writing – review and editing.

Ethics

UK Biobank (North West Multi-Center Research Ethics Committee [MREC]; see also https://www.ukbiobank.ac.uk/the-ethics-and-governance-council) and the different cohorts of the Lifebrain replication dataset (listed below) have ethical approval from the respective regional ethics committees. All participants provided informed consent.

  • LCBC: Norwegian Regional Committee for Medical and Health Research Ethics, Regional Ethical Committee of South Norway.

  • BETULA: Regional Ethical Vetting Board at Umeå University.

  • BASE-II: Ethics committee of the Charité-Universitätsmedizin Berlin.

  • Cam-CAN: Cambridgeshire 2 Research Ethics Committee.

  • UB: Comisión de Bioética de la Universidad de Barcelona and Hospital Clinic.

  • AIBL: Institutional ethics committees of Austin Health, St Vincent's Health, Hollywood Private Hospital, and Edith Cowan University.

Additional files

Supplementary file 1. List of cortical brain features.

List of cortical features included in the brain age model and age variance explained in the UK Biobank and the Lifebrain training datasets. Vol = volume; GWC = gray-white matter contrast; Cth = cortical thickness.

Supplementary file 2. List of subcortical brain features.

List of subcortical features included in the brain age model and age variance explained in the UK Biobank and the Lifebrain training datasets. Vol = volume; Int = intensity; hemi = hemisphere.

Supplementary file 3. Sociodemographics.

Main sample descriptives for the training and test datasets. Obs = mean number of observations per participant (SD). Follow-up = mean time (years) between the first and the last MRI observation (SD). For the test datasets, age and age range refer to age at baseline. *AIBL does not belong to the Lifebrain consortium but was included to enrich the replication sample.

Supplementary file 4. Relationship between brain age delta and change in brain features.

Long. change = longitudinal change in the raw neuroimaging features (mean change [log10(p)]). PC1 load = feature loadings on the first component of longitudinal change. Deltacross = relationship between cross-sectional brain age delta and feature change (r2 [log10(p)]). Deltalong = relationship between longitudinal brain age delta and feature change (r2 [log10(p)]). GWC = gray-white matter contrast. Cth = cortical thickness. Bil = bilateral. Subc = subcortical. n = 1372 and 1500 for the UK Biobank and the Lifebrain datasets. |N| = 365 and 372 features in the UK Biobank and the Lifebrain datasets. XGB = boosting gradient as implemented in XGBoost.

Supplementary file 5. Contact information.

Contact information and ethical committees for the different cohorts.

Supplementary file 6. Data acquisition parameters.

Data acquisition parameters for the T1w sequences. *UK Biobank employed three scanners of the same model and with equivalent parameters (Cheadle, Reading, and Newcastle centers). **AIBL does not belong to the Lifebrain consortium but was included in the Lifebrain replication dataset.

Transparent reporting form
Source code 1. Analysis Code.

Data availability

The raw data were gathered from the UK Biobank, the Lifebrain cohort, and the AIBL. Raw data requests are specific to each cohort. UK Biobank and AIBL data are available upon application to UK Biobank and at https://aibl.csiro.au upon corresponding approvals. For the Lifebrain cohorts, requests for raw MRI data should be submitted to the corresponding principal investigator. See contact details in Supplementary File 5. Different agreements are required for each dataset. Statistical analyses in this manuscript are available alongside the manuscript and will be made available at https://github.com/LCBC-UiO/VidalPineiro_BrainAge (copy archived at swh:1:rev:2044c6ca40e0b8f99c9190c6edfde8ca76b559ac). All analyses were performed in R 3.6.3. The scripts were run on the Colossus processing cluster, University of Oslo. UK Biobank's data acquisition, MRI preprocessing, and feature generation pipelines are freely available (https://www.fmrib.ox.ac.uk/ukbiobank). For the Lifebrain cohorts, the image acquisition details are summarized in Supplementary File 6. MRI preprocessing and feature generation were performed with the freely available FreeSurfer software (https://surfer.nmr.mgh.harvard.edu/).

References

  1. Beck D. Cardiometabolic Risk Factors Associated with Brain Age and Accelerate Brain Ageing. medRxiv. 2021 doi: 10.1101/2021.02.25.21252272. [DOI] [PMC free article] [PubMed]
  2. Bertram L, Böckenhoff A, Demuth I, Düzel S, Eckardt R, Li SC, Lindenberger U, Pawelec G, Siedler T, Wagner GG, Steinhagen-Thiessen E. Cohort Profile: The Berlin Aging Study II (BASE-II)†. Ternational Journal of Epidemiology. 2014;43:703–712. doi: 10.1093/ije/dyt018. [DOI] [PubMed] [Google Scholar]
  3. Bethlehem RA. Brain Charts for the Human Lifespan. bioRxiv. 2021 doi: 10.1101/2021.06.08.447489. [DOI]
  4. Boyle EA, Li YI, Pritchard JK. An Expanded View of Complex Traits: From Polygenic to Omnigenic. Cell. 2017;169:1177–1186. doi: 10.1016/j.cell.2017.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Brouwer RM, Schutte J, Janssen R, Boomsma DI, Hulshoff Pol HE, Schnack HG. The Speed of Development of Adolescent Brain Age Depends on Sex and Is Genetically Determined. Cerebral Cortex. 2021;31:1296–1306. doi: 10.1093/cercor/bhaa296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Butler ER, Chen A, Ramadan R, Le TT, Ruparel K, Moore TM, Satterthwaite TD, Zhang F, Shou H, Gur RC, Nichols TE, Shinohara RT. Pitfalls in brain age analyses. Human Brain Mapping. 2021;42:4092–4101. doi: 10.1002/hbm.25533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bycroft C, Freeman C, Petkova D, Band G, Elliott LT, Sharp K, Motyer A, Vukcevic D, Delaneau O, O’Connell J, Cortes A, Welsh S, Young A, Effingham M, McVean G, Leslie S, Allen N, Donnelly P, Marchini J. The UK Biobank resource with deep phenotyping and genomic data. Nature. 2018;562:203–209. doi: 10.1038/s41586-018-0579-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. 2015;4:7. doi: 10.1186/s13742-015-0047-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Cole JH, Franke K. Predicting Age Using Neuroimaging: Innovative Brain Ageing Biomarkers. Trends in Neurosciences. 2017;40:681–690. doi: 10.1016/j.tins.2017.10.001. [DOI] [PubMed] [Google Scholar]
  10. Cole JH, Ritchie SJ, Bastin ME, Valdés Hernández MC, Muñoz Maniega S, Royle N, Corley J, Pattie A, Harris SE, Zhang Q, Wray NR, Redmond P, Marioni RE, Starr JM, Cox SR, Wardlaw JM, Sharp DJ, Deary IJ. Brain age predicts mortality. Molecular Psychiatry. 2018;23:1385–1392. doi: 10.1038/mp.2017.62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cole JH. Multimodality neuroimaging brain-age in UK biobank: relationship to biomedical, lifestyle, and cognitive factors. Neurobiology of Aging. 2020a;92:34–42. doi: 10.1016/j.neurobiolaging.2020.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Cole J. UK Biobank-Brain-Age. 6127347GitHub. 2020b https://github.com/james-cole/UKBiobank-Brain-Age
  13. Dale AM, Fischl B, Sereno MI. Cortical surface-based analysis. NeuroImage. 1999;9:179–194. doi: 10.1006/nimg.1998.0395. [DOI] [PubMed] [Google Scholar]
  14. de Lange AMG, Kaufmann T, van der Meer D, Maglanoc LA, Alnæs D, Moberget T, Douaud G, Andreassen OA, Westlye LT. Population-based neuroimaging reveals traces of childbirth in the maternal brain. PNAS. 2019;116:22341–22346. doi: 10.1073/pnas.1910666116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Deary IJ. Looking for “system integrity” in cognitive epidemiology. Gerontology. 2012;58:545–553. doi: 10.1159/000341157. [DOI] [PubMed] [Google Scholar]
  16. Dong HM, Castellanos FX, Yang N, Zhang Z, Zhou Q, He Y, Zhang L, Xu T, Holmes AJ, Thomas Yeo BT, Chen F, Wang B, Beckmann C, White T, Sporns O, Qiu J, Feng T, Chen A, Liu X, Chen X, Weng X, Milham MP, Zuo XN. Charting brain growth in tandem with brain templates at school age. Science Bulletin. 2020;65:1924–1934. doi: 10.1016/j.scib.2020.07.027. [DOI] [PubMed] [Google Scholar]
  17. Elliott ML, Belsky DW, Knodt AR, Ireland D, Melzer TR, Poulton R, Ramrakha S, Caspi A, Moffitt TE, Hariri AR. Brain-age in midlife is associated with accelerated biological aging and cognitive decline in a longitudinal birth cohort. Molecular Psychiatry. 2019;10:e626. doi: 10.1038/s41380-019-0626-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Ellis KA, Bush AI, Darby D, De Fazio D, Foster J, Hudson P, Lautenschlager NT, Lenzo N, Martins RN, Maruff P, Masters C, Milner A, Pike K, Rowe C, Savage G, Szoeke C, Taddei K, Villemagne V, Woodward M, Ames D, Group AR. The Australian Imaging, Biomarkers and Lifestyle (AIBL) study of aging: methodology and baseline characteristics of 1112 individuals recruited for a longitudinal study of Alzheimer’s disease. International Psychogeriatrics. 2009;21:672–687. doi: 10.1017/S1041610209009405. [DOI] [PubMed] [Google Scholar]
  19. Fischl B, Sereno MI, Dale AM. Cortical surface-based analysis II: Inflation, flattening, and a surface-based coordinate system. NeuroImage. 1999;9:195–207. doi: 10.1006/nimg.1998.0396. [DOI] [PubMed] [Google Scholar]
  20. Fischl B, Dale AM. Measuring the thickness of the human cerebral cortex from magnetic resonance images. PNAS. 2000;97:11050–11055. doi: 10.1073/pnas.200033797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Franke K, Gaser C. Longitudinal changes in individual BrainAGE in healthy aging, mild cognitive impairment, and Alzheimer’s disease. GeroPsych. 2012;25:235–245. doi: 10.1024/1662-9647/a000074. [DOI] [Google Scholar]
  22. Franke K, Gaser C. Ten Years of BrainAGE as a Neuroimaging Biomarker of Brain Aging: What Insights Have We Gained? Frontiers in Neurology. 2019;10:789. doi: 10.3389/fneur.2019.00789. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Ge T, Chen CY, Ni Y, Feng YCA, Smoller JW. Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nature Communications. 2019;10:1776. doi: 10.1038/s41467-019-09718-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Gielen M, Lindsey PJ, Derom C, Smeets HJM, Souren NY, Paulussen ADC, Derom R, Nijhuis JG. Modeling genetic and environmental factors to increase heritability and ease the identification of candidate genes for birth weight: a twin study. Behavior Genetics. 2008;38:44–54. doi: 10.1007/s10519-007-9170-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. International HapMap 3 Consortium. Altshuler DM, Gibbs RA, Peltonen L, Altshuler DM, Gibbs RA, Peltonen L, Dermitzakis E, Schaffner SF, Yu F, Peltonen L, Dermitzakis E, Bonnen PE, Altshuler DM, Gibbs RA, de Bakker PIW, Deloukas P, Gabriel SB, Gwilliam R, Hunt S, Inouye M, Jia X, Palotie A, Parkin M, Whittaker P, Yu F, Chang K, Hawes A, Lewis LR, Ren Y, Wheeler D, Gibbs RA, Muzny DM, Barnes C, Darvishi K, Hurles M, Korn JM, Kristiansson K, Lee C, McCarrol SA, Nemesh J, Dermitzakis E, Keinan A, Montgomery SB, Pollack S, Price AL, Soranzo N, Bonnen PE, Gibbs RA, Gonzaga-Jauregui C, Keinan A, Price AL, Yu F, Anttila V, Brodeur W, Daly MJ, Leslie S, McVean G, Moutsianas L, Nguyen H, Schaffner SF, Zhang Q, Ghori MJR, McGinnis R, McLaren W, Pollack S, Price AL, Schaffner SF, Takeuchi F, Grossman SR, Shlyakhter I, Hostetter EB, Sabeti PC, Adebamowo CA, Foster MW, Gordon DR, Licinio J, Manca MC, Marshall PA, Matsuda I, Ngare D, Wang VO, Reddy D, Rotimi CN, Royal CD, Sharp RR, Zeng C, Brooks LD, McEwen JE. Integrating common and rare genetic variation in diverse human populations. Nature. 2010;467:52–58. doi: 10.1038/nature09298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Jonsson BA, Bjornsdottir G, Thorgeirsson TE, Ellingsen LM, Walters GB, Gudbjartsson DF, Stefansson H, Stefansson K, Ulfarsson MO. Brain age prediction using deep learning uncovers associated sequence variants. Nature Communications. 2019;10:5409. doi: 10.1038/s41467-019-13163-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Karama S, Bastin ME, Murray C, Royle NA, Penke L, Muñoz Maniega S, Gow AJ, Corley J, Valdés Hernández M, Lewis JD, Rousseau MÉ, Lepage C, Fonov V, Collins DL, Booth T, Rioux P, Sherif T, Adalat R, Starr JM, Evans AC, Wardlaw JM, Deary IJ. Childhood cognitive ability accounts for associations between cognitive ability and brain cortical thickness in old age. Molecular Psychiatry. 2014;19:555–559. doi: 10.1038/mp.2013.64. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kirkwood TBL. Understanding the odd science of aging. Cell. 2005;120:437–447. doi: 10.1016/j.cell.2005.01.027. [DOI] [PubMed] [Google Scholar]
  29. Lakens D, Scheel AM, Isager PM. Equivalence Testing for Psychological Research: A Tutorial. Advances in Methods and Practices in Psychological Science. 2018;1:259–269. doi: 10.1177/2515245918770963. [DOI] [Google Scholar]
  30. Marquand AF, Kia SM, Zabihi M, Wolfers T, Buitelaar JK, Beckmann CF. Conceptualizing mental disorders as deviations from normative functioning. Molecular Psychiatry. 2019;24:1415–1424. doi: 10.1038/s41380-019-0441-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Miller KL, Alfaro-Almagro F, Bangerter NK, Thomas DL, Yacoub E, Xu J, Bartsch AJ, Jbabdi S, Sotiropoulos SN, Andersson JLR, Griffanti L, Douaud G, Okell TW, Weale P, Dragonu I, Garratt S, Hudson S, Collins R, Jenkinson M, Matthews PM, Smith SM. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nature Neuroscience. 2016;19:1523–1536. doi: 10.1038/nn.4393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Molenaar PCM. A Manifesto on Psychology as Idiographic Science: Bringing the Person Back Into Scientific Psychology, This Time Forever. Measurement. 2004;2:201–218. doi: 10.1207/s15366359mea0204_1. [DOI] [Google Scholar]
  33. Nilsen TS, Kutschke J, Brandt I, Harris JR. Validity of Self-Reported Birth Weight: Results from a Norwegian Twin Sample. Twin Research and Human Genetics. 2017;20:406–413. doi: 10.1017/thg.2017.44. [DOI] [PubMed] [Google Scholar]
  34. Nilsson LG, Adolfsson R, Bäckman L, de Frias CM, Molander B, Nyberg L. Betula: A Prospective Cohort Study on Memory, Health and Aging. Aging, Neuropsychology, and Cognition. 2010;11:134–148. doi: 10.1080/13825580490511026. [DOI] [Google Scholar]
  35. Rajaram S, Valls-Pedret C, Cofán M, Sabaté J, Serra-Mir M, Pérez-Heras AM, Arechiga A, Casaroli-Marano RP, Alforja S, Sala-Vila A, Doménech M, Roth I, Freitas-Simoes TM, Calvo C, López-Illamola A, Haddad E, Bitok E, Kazzi N, Huey L, Fan J, Ros E. The Walnuts and Healthy Aging Study (WAHA): Protocol for a Nutritional Intervention Trial with Walnuts on Brain Aging. Frontiers in Aging Neuroscience. 2016;8:333. doi: 10.3389/fnagi.2016.00333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Raznahan A, Greenstein D, Lee NR, Clasen LS, Giedd JN. Prenatal growth in humans and postnatal brain maturation into late adolescence. PNAS. 2012;109:11366–11371. doi: 10.1073/pnas.1203350109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Reuter M, Schmansky NJ, Rosas HD, Fischl B. Within-subject template estimation for unbiased longitudinal image analysis. NeuroImage. 2012;61:1402–1418. doi: 10.1016/j.neuroimage.2012.02.084. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Rogosa DR, Willett JB. Understanding correlates of change by modeling individual differences in growth. Psychometrika. 1985;50:203–228. doi: 10.1007/BF02294247. [DOI] [Google Scholar]
  39. Schmiedek F, Lövdén M, von Oertzen T, Lindenberger U. Within-person structures of daily cognitive performance differ from between-person structures of cognitive abilities. PeerJ. 2020;8:e9290. doi: 10.7717/peerj.9290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Shafto MA, Tyler LK, Dixon M, Taylor JR, Rowe JB, Cusack R, Calder AJ, Marslen-Wilson WD, Duncan J, Dalgleish T, Henson RN, Brayne C, Matthews FE, Cam-CAN. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: a cross-sectional, lifespan, multidisciplinary examination of healthy cognitive ageing. BMC Neurology. 2014;14:204. doi: 10.1186/s12883-014-0204-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Smith SM, Vidaurre D, Alfaro-Almagro F, Nichols TE, Miller KL. Estimation of brain age delta from brain imaging. NeuroImage. 2019;200:528–539. doi: 10.1016/j.neuroimage.2019.06.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Smith SM, Elliott LT, Alfaro-Almagro F, McCarthy P, Nichols TE, Douaud G, Miller KL. Brain aging comprises many modes of structural and functional change with distinct genetic and biophysical associations. eLife. 2020;9:e52677. doi: 10.7554/eLife.52677. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Taylor JR, Williams N, Cusack R, Auer T, Shafto MA, Dixon M, Tyler LK, Henson RN. The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample. NeuroImage. 2017;144:262–269. doi: 10.1016/j.neuroimage.2015.09.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Tehranifar P, Liao Y, Flom JD, Terry MB. Validity of Self-reported Birth Weight by Adult Women: Sociodemographic Influences and Implications for Life-Course Studies. American Journal of Epidemiology. 2009;170:910–917. doi: 10.1093/aje/kwp205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Vidal-Piñeiro D, Martin-Trias P, Arenaza-Urquijo EM, Sala-Llonch R, Clemente IC, Mena-Sánchez I, Bargalló N, Falcón C, Pascual-Leone Á, Bartrés-Faz D. Task-dependent activity and connectivity predict episodic memory network-based responses to brain stimulation in healthy aging. Brain Stimulation. 2014;7:287–296. doi: 10.1016/j.brs.2013.12.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Vidal-Piñeiro D. VidalPineiro_BrainAge. Software Heritage. swh:1:rev:2044c6ca40e0b8f99c9190c6edfde8ca76b559ac. 2021. https://archive.softwareheritage.org/swh:1:dir:b64b1dc0cb2de452fe9677a2b45a895aa9171a21;origin=https://github.com/LCBC-UiO/VidalPineiro_BrainAge;visit=swh:1:snp:18701519e2e25bcfc5dbd55aafa7ad7439bf78f4;anchor=swh:1:rev:2044c6ca40e0b8f99c9190c6edfde8ca76b559ac
  47. Wainer H. The centercept: an estimable and meaningful regression parameter. Psychological Science. 2000;11:434–436. doi: 10.1111/1467-9280.00284. [DOI] [PubMed] [Google Scholar]
  48. Walhovd KB, Fjell AM, Brown TT, Kuperman JM, Chung Y, Hagler DJ, Roddey JC, Erhart M, McCabe C, Akshoomoff N, Amaral DG, Bloss CS, Libiger O, Schork NJ, Darst BF, Casey BJ, Chang L, Ernst TM, Frazier J, Gruen JR, Kaufmann WE, Murray SS, van Zijl P, Mostofsky S, Dale AM, Pediatric Imaging, Neurocognition, and Genetics Study. Long-term influence of normal variation in neonatal characteristics on human brain development. PNAS. 2012;109:20089–20094. doi: 10.1073/pnas.1208180109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Walhovd KB, Krogsrud SK, Amlien IK, Bartsch H, Bjørnerud A, Due-Tønnessen P, Grydeland H, Hagler DJ, Håberg AK, Kremen WS, Ferschmann L, Nyberg L, Panizzon MS, Rohani DA, Skranes J, Storsve AB, Sølsnes AE, Tamnes CK, Thompson WK, Reuter C, Dale AM, Fjell AM. Neurodevelopmental origins of lifespan changes in brain and cognition. PNAS. 2016;113:9357–9362. doi: 10.1073/pnas.1524259113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Walhovd KB, Fjell AM, Westerhausen R, Nyberg L, Ebmeier KP, Lindenberger U, Bartrés-Faz D, Baaré WFC, Siebner HR, Henson R, Drevon CA, Strømstad Knudsen GP, Ljøsne IB, Penninx B, Ghisletta P, Rogeberg O, Tyler L, Bertram L, Lifebrain Consortium. Healthy minds 0-100 years: Optimising the use of European brain imaging cohorts (“Lifebrain”). European Psychiatry. 2018;50:47–56. doi: 10.1016/j.eurpsy.2017.12.006. [DOI] [PubMed] [Google Scholar]
  51. Walhovd KB, Fjell AM, Sørensen Ø, Mowinckel AM, Reinbold CS, Idland AV, Watne LO, Franke A, Dobricic V, Kilpert F, Bertram L, Wang Y. Genetic risk for Alzheimer disease predicts hippocampal volume through the human lifespan. Neurology Genetics. 2020;6:e506. doi: 10.1212/NXG.0000000000000506. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Zuo XN, Xu T, Milham MP. Harnessing reliability for neuroscience research. Nature Human Behaviour. 2019;3:768–771. doi: 10.1038/s41562-019-0655-x. [DOI] [PubMed] [Google Scholar]

Decision letter

Editor: Juan Zhou
Reviewed by: Xi-Nian Zuo

Our editorial process produces two outputs: (i) public reviews designed to be posted alongside the preprint for the benefit of readers; (ii) feedback on the manuscript for the authors, including requests for revisions, shown below. We also include an acceptance summary that explains what the editors found interesting or important about the work.

Acceptance summary:

This paper is of interest to scientists within the field of lifespan developmental neuroscience. It revealed that BrainAge scores did not predict within-person aging of the brain as measured from longitudinal data, but did correlate with metrics already present at birth, namely birth weight and polygenic scores. This calls for caution in interpreting cross-sectional BrainAge indices and in drawing conclusions about their validity as markers of the individual-level brain aging process.

Decision letter after peer review:

Thank you for submitting your article "Individual variations in "Brain age" relate to early life factors more than to longitudinal brain change" for consideration by eLife. Your article has been reviewed by 2 peer reviewers, and the evaluation has been overseen by a Reviewing Editor and Christian Büchel as the Senior Editor. The following individual involved in review of your submission has agreed to reveal their identity: Xi-Nian Zuo (Reviewer #2).

The reviewers have discussed their reviews with one another, and the Reviewing Editor has drafted this to help you prepare a revised submission.

Essential Revisions:

1) Improve terminology precision and interpretation (e.g. early life influence).

2) Improve manuscript readability significantly (including figures).

3) Add study limitations and future work (including developing brain).

4) Introduce measurement theory of individual differences.

5) Clarify the relationship between brain age and other metrics.

6) Discuss/clarify the effect size and the possible implications.

Reviewer #1 (Recommendations for the authors):

Beyond what was summarized in the Public Review, here I note some errors in figures and text so that the authors can fix those, and suggestions for edits that should help with clarity.

As already mentioned, the purpose of this study is to be applauded as mis- and over-interpretation of cross-sectional data findings as generalizing to true longitudinal change within individuals is a major problem in our field. The increasingly popular use of BrainAge as a metric has only increased this problem. The authors' attempt to test these assumptions was much needed.

The format of the submission made the manuscript very difficult to parse (and I work directly in this exact field). The main obstacles to understanding the details of the paper are (1) the extreme brevity of the background/introduction and (2) the unfortunate reverse order of the manuscript sections; it is always difficult when the results come before the methods or description of variables, but in this particular case, it was extremely disadvantageous. If this is the required journal format, then I guess not much can be done, but it is a hindrance to understanding the study. I found I could not read the Results section on its own and had to flip through to simultaneously read the Methods and Supplements. This was exhausting. Some readers may simply give up. One suggestion (highly recommended) is to use the Abstract to at least introduce some specifics about the study, like the variables to be used (morphological structural features instead of brain integrity or brain decline – no one can tell what the study will use as written). Similarly, I did not know what polygenic scores were based on or what genetic amalgamation they represented until very far into the paper. Also, portions of the intro were not clear on the first read, and only made sense after I finished the whole paper and supplemental info. Some word changes may help?

Discussion: I thought that calling birth weight and genetics "early-life influences" throughout the paper was a bit of an overstatement, because this calls to mind many environmental things one experiences as a developing child/adolescent and was misleading (especially since things are not defined until very late in the sections, like the Methods at the end). A more accurate term would make it clear that these variables represent "at birth indices". I do think this is a VERY important distinction, and it keeps the authors from leading the readers into a concept that is vague and perhaps not what is expected from that terminology.

Some study limitations are always warranted and a thoughtful inclusion. There really aren't any provided here to speak of. As a small example, I wondered what the impact is of the very different sample sizes across tests when comparisons are then made across those analyses (as in the 770 for the birth weight, vs the 38k and 1372 for the other tests).

Reviewer #2 (Recommendations for the authors):

1. Measurement theory of individual differences should be included somewhere to clarify the different focuses of its two components: inter-individual (between-subject) variability and intra-individual (within-subject) variability. In theory, cross-sectional data are not enough to separate the two components and thus not good enough for any study of individual differences, which calls for longitudinal data in an ideal experimental design to ensure both reliability and validity. Please see some comments in a recent publication (https://www.nature.com/articles/s41562-019-0655-x).

2. BrainAge is a recently developed method for normative aging research. What is the relationship between BrainAge and the metrics derived from growth charts (e.g., height and weight)?

3. I noticed that the effect sizes detected are all small (e.g., R2 < 0.01). In such a large sample, how should these weak associations be interpreted in terms of measurement reliability and validity? What are the potential factors that affect the ability to detect such small effect sizes?

4. This work is demonstrated for aging samples, but should be generalizable to developing samples. Please discuss this point more comprehensively. Some recent work (e.g., https://www.sciencedirect.com/science/article/pii/S2095927320304965) may be of value.

eLife. 2021 Nov 10;10:e69995. doi: 10.7554/eLife.69995.sa2

Author response


Essential Revisions:

1) Improve terminology precision and interpretation (e.g. early life influence).

Clarification of terminology is discussed in our response to Reviewer #1, comment #8. Briefly, in the revised version of the manuscript, the term “early-life influences” is replaced with “congenital factors” when referring to birth weight and polygenic scores. Further, in the revised version, we discuss in the Limitations section the extent to which the relationship between cross-sectional brain age and the congenital factors can be extrapolated to other congenital and early-life variables.

2) Improve manuscript readability significantly (including figures).

We have carefully gone through the whole manuscript and revised the text to improve readability. This is explained in detail in the response to Reviewer #1, comment #1. To ease readability, the manuscript now includes additional details regarding the study both in the abstract and in the introduction. We also followed the reviewer’s suggestion of including additional plots for clarity and transparency and we have modified the existing plots following his/her suggestions.

3) Add study limitations and future work (including developing brain).

The limitations of the study are discussed in depth in response to Reviewer #1, comment #11. The revised manuscript now includes a “Limitations” section in which we discuss:

1) Statistical power;

2) The extent to which our results can be generalized to other normative modeling approaches and developmental samples, and

3) Whether the relationship between brain age and the congenital factors polygenic scores for brain age (PGS-BA) and birth weight can be extended to other congenital and early life factors.

4) Introduce measurement theory of individual differences.

This issue is discussed in response to Reviewer #2, comment #1. The relationship between cross-sectional and longitudinal brain age can be understood in the context of the measurement theory of individual differences. Following this account, the results indicate that cross-sectional brain age has low validity as a marker of ongoing brain change, despite high reliability; it instead reflects variance related to factors that vary systematically across individuals and that are present early in life.

5) Clarify the relationship between brain age and other metrics.

This issue is discussed in response to Reviewer #2, comment #2. Normative brain aging charts are analogous to normative anthropometric growth charts used in pediatrics. Brain age models can be considered a special case of normative brain modeling. The main difference between brain age and normative modeling is that the latter uses demographic variables to predict a priori-defined brain features, whereas brain age models use brain features to predict age. The degree to which our results can be extended to normative brain charts is discussed in detail in the revised version of the manuscript.

6) Discuss/clarify the effect size and the possible implications.

This issue is discussed in response to Reviewer #2, comment #3. All the main tests are well-powered to detect small effect sizes and use variables with high reliability. The reliability of longitudinal brain age delta is, however, unknown (and it must be lower than that of cross-sectional brain age delta). Even assuming mediocre reliabilities, the tests would have enough power to detect small effect sizes, leaving aside that most research assumes a moderate-to-high relationship between cross-sectional brain age delta and brain aging. We now discuss the effect sizes and possible power issues in detail.
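The reasoning about reliability and power can be sketched numerically. The example below is not the power analysis from the manuscript; it combines three textbook approximations: the reliability of a difference score (assuming equal variances and equal reliability at both time points), classical attenuation of a correlation by unreliability, and power for a correlation test via the Fisher z transformation. The reliabilities, the between-timepoint correlation, and the true effect size are hypothetical values chosen only for illustration; the sample size follows the "n > 1,200" mentioned for the main test sample.

```python
from math import atanh, sqrt
from statistics import NormalDist


def difference_reliability(rel, r12):
    """Reliability of a difference score, assuming equal variances and
    equal reliability `rel` at both time points, correlated `r12`."""
    return (rel - r12) / (1.0 - r12)


def attenuated_r(true_r, rel_x, rel_y):
    """Expected observed correlation under classical attenuation."""
    return true_r * sqrt(rel_x * rel_y)


def power_correlation(r, n, alpha=0.05):
    """Approximate two-sided power to detect correlation `r` with `n`
    subjects, using the Fisher z transformation."""
    z = NormalDist()
    z_crit = z.inv_cdf(1.0 - alpha / 2.0)
    shift = abs(atanh(r)) * sqrt(n - 3)
    return z.cdf(shift - z_crit) + z.cdf(-shift - z_crit)


# Hypothetical inputs (assumptions, not estimates from the study).
rel_cross = 0.9   # reliability of cross-sectional delta
r_between = 0.8   # correlation between deltas at the two time points
true_r = 0.15     # true association between cross-sectional delta and change

rel_long = difference_reliability(rel_cross, r_between)
observed_r = attenuated_r(true_r, rel_cross, rel_long)
power = power_correlation(observed_r, n=1200)

print(f"reliability of longitudinal delta: {rel_long:.2f}")
print(f"expected observed correlation:     {observed_r:.3f}")
print(f"approximate power at n = 1200:     {power:.2f}")
```

Varying `true_r`, `rel_cross` and `r_between` shows how quickly power changes with the assumed effect size and with the reliability of the change score, which is why the reliability of longitudinal delta would ideally be tested formally.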

Reviewer #1 (Recommendations for the authors):

Beyond what was summarized in the Public Review, here I note some errors in figures and text so that the authors can fix those, and suggestions for edits that should help with clarity.

As already mentioned, the purpose of this study is to be applauded as mis- and over-interpretation of cross-sectional data findings as generalizing to true longitudinal change within individuals is a major problem in our field. The increasingly popular use of BrainAge as a metric has only increased this problem. The authors' attempt to test these assumptions was much needed.

The format of the submission made the manuscript very difficult to parse (and I work directly in this exact field). The main obstacles to understanding the details of the paper are (1) the extreme brevity of the background/introduction and (2) the unfortunate reverse order of the manuscript sections; it is always difficult when the results come before the methods or description of variables, but in this particular case, it was extremely disadvantageous. If this is the required journal format, then I guess not much can be done, but it is a hindrance to understanding the study. I found I could not read the Results section on its own and had to flip through to simultaneously read the Methods and Supplements. This was exhausting. Some readers may simply give up. One suggestion (highly recommended) is to use the Abstract to at least introduce some specifics about the study, like the variables to be used (morphological structural features instead of brain integrity or brain decline – no one can tell what the study will use as written). Similarly, I did not know what polygenic scores were based on or what genetic amalgamation they represented until very far into the paper. Also, portions of the intro were not clear on the first read, and only made sense after I finished the whole paper and supplemental info. Some word changes may help?

Thank you for the comment. We apologize that the manuscript was difficult to parse. The format of the journal specifies an Introduction-Results-Discussion-Methods format. However, the revised version of the manuscript includes your suggestions in the abstract and introduction sections as an attempt to increase readability. The abstract now defines the variables used (“brain structure features”), identifies the test and training datasets, and defines birth weight and polygenic scores of brain age as two congenital factors thought to reflect constant, lifelong influences emerging in early life. The introduction section also introduces specific references to the method and variables used to ease understanding of the results. See below for the main changes in both sections.

p.5 (Abstract): “Here, we explicitly tested this assumption in two independent large test datasets (UK Biobank [main] and Lifebrain [replication]; longitudinal observations ≈ 2,750 and 4,200)”

p.5 (Abstract): “Brain age models were estimated in two different training datasets (n ≈ 38,000 and 1,800 individuals [replication]) based on brain structural features.”

p.5 (Abstract): “Rather, brain age in adulthood was associated with the congenital factors of birth weight and polygenic scores of brain age, assumed to reflect a constant, lifelong influence on brain structure from early life.”

p.6 (Introduction): “Alternatively, individual deviations from the expected brain age could capture constant interindividual differences in brain structure that remain stable throughout the lifespan, reflecting early genetic and environmental influences (Deary, 2012; Elliott et al., 2019; Walhovd et al., 2016).”

p.6 (Introduction): “Here we tested whether brain age – derived from structural T1-weighted (T1w) morphological features – is related to accelerated brain aging, early-life factors, or a combination of both.”

p.7 (Introduction): “In addition, we also assessed brain change with a composite score of structural brain change as obtained using principal component analysis and change in the different raw structural brain features. These analyses were performed in two independent cohorts, both divided into a cross-sectional model generation (training) and a longitudinal, hypothesis testing (test) dataset.”

p.7 (Introduction): “If cross-sectional variations in brain age reflect differences in brain structure established early in life, one should observe a relationship between brain age and influences associated with stable, lifelong effects on brain structure. Here, we selected two congenital traits: self-reported birth weight and polygenic scores for brain age (PGS-BA), for which lifelong effects on age-related phenotypes have been shown (Walhovd et al., 2020, 2012) (Figure 1b).”

p.7 (Introduction): “Birth weight reflects normal variation in body (and brain) size as well as prenatal conditions, whereas PGS-BA quantifies genetic liability of having a higher brain age.”

Discussion: I thought that calling birth weight and genetics "early-life influences" throughout the paper was a bit of an overstatement, because this calls to mind many environmental things one experiences as a developing child/adolescent and was misleading (especially since things are not defined until very late in the sections, like the Methods at the end). A more accurate term would make it clear that these variables represent "at birth indices". I do think this is a VERY important distinction, and it keeps the authors from leading the readers into a concept that is vague and perhaps not what is expected from that terminology.

Thank you for the comment. We agree it is important to disambiguate both terms and that “early-life influences” may be vague. We have replaced it with “congenital factors/indices” as they refer to traits present from birth. However, both indices represent a proof-of-concept that interindividual variations in cross-sectional brain age delta reflect lifelong influences rooted in the distant past more than presently ongoing events, and thus the findings can to some extent be generalized to other non-studied factors that may exert a stable influence on age-related brain phenotypes rather than affect the slope of decline. The revised version disambiguates both concepts and discusses the extent to which the relationship between cross-sectional brain age and the congenital factors can be extrapolated to other congenital and early-life variables.

In p. 5 (Abstract): “Rather, brain age in adulthood was associated with the congenital factors of birth weight and polygenic scores of brain age, assumed to reflect a constant, lifelong influence on brain structure from early life.”

In p. 7 (Introduction): “If cross-sectional variations in brain age reflect differences in brain structure established early in life, one should observe a relationship between brain age and influences associated with stable, lifelong effects on brain structure. Here, we selected two congenital factors: self-reported birth weight and polygenic scores for brain age (PGS-BA), for which lifelong effects on age-related phenotypes have been shown (Walhovd et al., 2020, 2012) (Figure 1b).”

In p. 12 (Results): “Brain age delta is associated with congenital factors on brain structure”

In p. 15 (Discussion): “Rather, brain age seems to reflect early-life influences on brain structure, and only to a very modest degree reflects actual rate of brain change in middle and old adulthood. A lack of relationship between brain age and rate of brain aging can potentially be explained - although not investigated in the present study - by the effect of circumscribed events such as isolated insults or detrimental lifestyles that occurred in the past resulting in higher, but not accelerating, brain age. Yet, variations in brain age can equally reflect congenital and early-life differences and show lifelong stability.”

In p. 19 (Discussion): “Finally, many genetic and environmental factors relate to lifelong stable differences in brain age beyond birth weight and PGS-BA. However, both variables are congenital and show stable associations through the lifespan (Raznahan et al., 2012; Walhovd et al., 2020, 2016) without strong evidence that they relate to brain change after adolescence. Thus, birth weight and PGS-BA are paradigmatic for showing how interindividual differences in brain age emerge early in life. The present study does not provide a systematic understanding of these influences, but presents a framework for interpreting the impact such measures may exert on age-related phenotypes.”

Some study limitations are always warranted and a thoughtful inclusion. There really aren't any provided here to speak of. As a small example, I wondered what the impact is of the very different sample sizes across tests when comparisons are then made across those analyses (as in the 770 for the birth weight, vs the 38k and 1372 for the other tests).

We now include a “Limitations” section. In this section, we discuss the following issues: (1) whether the analyses are well-powered; (2) the extent to which our results can be generalized to other normative modeling approaches and developmental samples (see Reviewer #2, comments #2, #4 for a wider discussion), and (3) whether the relationship between brain age and congenital factors PGS-BA and birth weight can be extended to other congenital and early life factors.

In pp. 17-9 (Discussion): “We used large training datasets to estimate the brain age models and the PGS scores leading to robust PGS-BA and brain age estimates. Self-reported birth weight (Nilsen et al., 2017) and cross-sectional brain age (Franke and Gaser, 2012) are highly reliable measures; thus, our analyses are well powered to detect small effects (Zuo et al., 2019). The reliability of brain age deltalong is, however, unknown. Strictly speaking, brain age delta is a prediction error from a model that maximizes the prediction of age in cross-sectional data and thus partially also reflects noise. Given that deltalong is estimated as the difference between two deltacross estimates, it will hence have higher noise than the cross-sectional estimates reducing the power in identifying potential associations between longitudinal and cross-sectional delta; note also the relatively short interscan interval in UK Biobank (≈2y). However, our sample size (n > 1,200) ensures that the tests performed in this study are well-powered to detect small effects, even if deltalong has mediocre reliability (Zuo et al., 2019). Further, replication of our null results in the Lifebrain sample with more observations and longer follow-up times reduces the likelihood of noise as the main factor behind the lack of relationship. Furthermore, previous studies have found that changes in brain age are partly heritable (Brouwer et al., 2021) and relate to for instance cardiometabolic risk factors (Beck et al., 2021), suggesting that it captures biologically relevant signals (i.e. has predictive validity), although with substantially different origins from cross-sectional brain age. Although the reliability of deltalong needs to be formally tested, the null relationship between deltacross and deltalong does not seem to be a result of a low-powered test.

We speculate that our results partially generalize to other normative and residual-based modeling approaches as well as to developmental samples. There is considerable evidence in the literature that birth weight and genetic risk for neurodegenerative conditions affect brain structure from early life (Raznahan et al., 2012; Walhovd et al., 2020, 2016, 2012b). Brain age models are related to other models such as normative brain charts (Bethlehem et al., 2021; Dong et al., 2020) - akin to normative anthropometric charts - the main difference being that brain age models predict, rather than control for, age (Marquand et al., 2019). Both types of models produce normative brain scores, which are uncorrelated with age (Butler et al., 2021). Thus, caution is required when interpreting these scores as indices of brain aging without availability of longitudinal data. Developmental samples may, however, reflect slightly stronger relationships between cross-sectional brain age delta and ongoing brain change as brain changes during early-life development typically occur at a faster pace than in middle or later life. Similarly, for specific disease groups such as Alzheimer’s disease patients (Franke and Gaser, 2012), interindividual brain variation in brain age might reflect to a greater extent prevailing loss of brain structure. Moreover, the variance associated with factors other than ongoing development/aging might be more limited in early than later age, since influences leading to interindividual variations in brain structure have a shorter span to accumulate. That is, as time from birth increases, chronological age as a marker of individual development is reduced.

Finally, many genetic and environmental factors relate to lifelong stable differences in brain age beyond birth weight and PGS-BA. However, both variables are congenital and show stable associations through the lifespan (Raznahan et al., 2012; Walhovd et al., 2020, 2016) without strong evidence that they relate to brain change after adolescence. Thus, birth weight and PGS-BA are paradigmatic for showing how interindividual differences in brain age emerge early in life. The present study does not provide a systematic understanding of these influences, but presents a framework for interpreting the impact such measures may exert on age-related phenotypes.”

Reviewer #2 (Recommendations for the authors):

1. Measurement theory of individual differences should be included somewhere to clarify the different focuses of its two components: inter-individual (between-subject) variability and intra-individual (within-subject) variability. In theory, cross-sectional data are not enough to separate the two components and thus not good enough for any study of individual differences, which calls for longitudinal data in an ideal experimental design to ensure both reliability and validity. Please see some comments in a recent publication (https://www.nature.com/articles/s41562-019-0655-x).

Thanks for the comment. Indeed, the relationship between cross-sectional and longitudinal brain age can be understood in the context of the measurement theory of individual differences (Brandmaier et al., 2018; Zuo et al., 2019). Previous work has shown that cross-sectional brain age is highly reliable (Franke and Gaser, 2012) but its validity – i.e. the proportion of the total variance attributed to the trait of interest alone, that is brain aging – has only been indirectly assessed. In this account, longitudinal brain change can be considered a “gold standard” criterion. We now discuss the findings under the framework of the measurement theory of individual differences.

p. 16 (discussion): “From a measurement theory perspective, our results suggest that cross-sectional brain age has low validity as an index of brain aging – despite having high reliability (Franke and Gaser, 2012) – as only a small portion of variance is associated with the trait of interest alone (Zuo et al., 2019). Most variance is rather associated with other factors that vary systematically across individuals, some of which are already present at birth.”

2. BrainAge is a recently developed method for normative aging research. What is the relationship between BrainAge and those metrics derived from the growth charts (e.g., height and weight)?

Normative brain aging charts are analogous to the normative anthropometric growth charts used in pediatrics. Brain age models can be considered a special case of normative brain modeling. The main difference between brain age and normative modeling is that the latter uses demographic variables to predict a priori-defined brain features. Brain age models instead invert the approach, predicting age from several brain measures. Traditional normative charts are more easily interpretable than brain age, whereas brain age models lead to simple outputs as they condense brain data into a single score (Marquand et al., 2019).

Both methods lead to scores that characterize the brain features of a given participant with respect to his/her peers (i.e. normative scores). Because age is invariably used in any normative model, both lead to brain measures that are uncorrelated with age (Butler et al., 2021). Researchers often interpret these measures as markers of ongoing brain change. We focused on brain age models because the interpretation of norm-deviation variations as accelerated/delayed aging is more pervasive for them, possibly due to:

a) Semantics, and

b) The fact that the biological variables are selected and weighted based on their association with age.

Thus, it is important to determine to which degree all these different metrics quantify advanced or delayed aging. Moreover, our call for caution in interpreting these measures is generalizable. We now acknowledge the similarity between brain age models and normative brain models and discuss to what extent our word of caution can be extrapolated across models.
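The relationship between the two kinds of score can be illustrated with a minimal simulation. The sketch below is not the analysis code of the study: it assumes a single hypothetical brain feature, simulated data, and ordinary least squares in place of the models used in the paper. It also assumes that the age dependence of the raw brain age delta is removed by residualizing it on age, which is one common correction. All names and numeric values are illustrative.

```python
import random

def ols(x, y):
    """Return (intercept, slope) of the least-squares line y ~ x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope

def residuals(x, y):
    """Residuals of y after removing its linear dependence on x."""
    intercept, slope = ols(x, y)
    return [yi - (intercept + slope * xi) for xi, yi in zip(x, y)]

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5

rng = random.Random(0)
n = 2000
age = [rng.uniform(45, 80) for _ in range(n)]
# Hypothetical feature: declines with age, plus person-specific variation.
feature = [100 - 0.5 * a + rng.gauss(0, 5) for a in age]

# Normative model: predict the feature from age; the score is the residual.
normative_score = residuals(age, feature)

# Brain age model: predict age from the feature; delta = predicted - actual.
intercept, slope = ols(feature, age)
predicted_age = [intercept + slope * f for f in feature]
raw_delta = [p - a for p, a in zip(predicted_age, age)]
# Raw delta still depends on age, so it is residualized on age (assumption).
delta = residuals(age, raw_delta)

r_norm_age = pearson(age, normative_score)
r_raw_age = pearson(age, raw_delta)
r_delta_age = pearson(age, delta)
r_delta_norm = pearson(delta, normative_score)
print(r_norm_age, r_raw_age, r_delta_age, r_delta_norm)
```

In this single-feature linear setting, the corrected delta is simply a rescaled (here sign-flipped) normative residual, so neither score carries information about age or about within-person change by construction. With many features and non-linear models the two scores are no longer identical, but both remain cross-sectional deviations from a norm.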

p. 18 (discussion): “We speculate that our results partially generalize to other normative and residual-based modeling approaches as well as to developmental samples. There is considerable evidence in the literature that birth weight and genetic risk for neurodegenerative conditions affect brain structure from early life (Raznahan et al., 2012; Walhovd et al., 2020, 2016, 2012b). Brain age models are related to other models such as normative brain charts (Bethlehem et al., 2021; Dong et al., 2020) – akin to normative anthropometric charts – the main difference being that brain age models predict, rather than control for, age (Marquand et al., 2019). Both types of models produce normative brain scores, which are uncorrelated with age (Butler et al., 2021). Thus, caution is required when interpreting these scores as indices of brain aging without proper assessment of longitudinal data.”

3. I noticed that the effect sizes detected are all small (e.g., R2 < 0.01). In such a large sample, how should these weak associations be interpreted in terms of measurement reliability and validity? What are the potential factors that affect the ability to detect such small effect sizes?

Thank you for the question. Your assessment is correct. The significant relationships of birth weight and polygenic risk scores with brain age delta had effect sizes of r2 ≈ 0.009 and 0.02. The effect size of the non-significant relationship between cross-sectional and longitudinal brain age delta was r2 ≤ 0.001. We have little doubt that these values stem from true positive and true negative tests, for the following reasons:

1) We used cross-sectional brain age delta, polygenic scores (PGS) of brain age, and self-reported birth weight as independent variables. Brain age delta and self-reported birth weight are highly reliable (Franke and Gaser, 2012; Nilsen et al., 2017); thus, there is little risk of regression dilution effects.

2) We used cross-sectional and longitudinal brain age delta as dependent variables. The cross-sectional brain age delta has very high reliability. To our knowledge, there are no reports of the reliability of the longitudinal brain age delta. However, even if longitudinal brain age delta has mediocre reliability (≈ 0.4) – reliability for delta change will certainly be lower than for cross-sectional data – our test would still have enough power to detect small effects owing to the relatively large sample size in both the main and the replication analyses (n > 1,200) (Zuo et al., 2019). The remaining tests use relatively large samples and involve measures with high reliability and thus are well-powered to detect very small effects.
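The reasoning in points 1) and 2) can be made concrete with a small power calculation. The following is a hedged sketch, not the procedure used in the study: it assumes the classical attenuation formula (observed r equals true r times the square root of the product of the two reliabilities) and the Fisher z approximation for the power of a two-sided correlation test. The sample size (1,200) and the reliability of 0.4 come from the text above; the reliability of the cross-sectional delta (0.9) and the true effect size (r = 0.15) are hypothetical values chosen only for illustration.

```python
from math import atanh, sqrt
from statistics import NormalDist

def attenuated_r(true_r, rel_x, rel_y):
    """Expected observed correlation given the reliabilities of both measures."""
    return true_r * sqrt(rel_x * rel_y)

def power_correlation(r, n, alpha=0.05):
    """Approximate power of a two-sided test of H0: rho = 0 (Fisher z)."""
    nd = NormalDist()
    z_crit = nd.inv_cdf(1 - alpha / 2)
    shift = atanh(r) * sqrt(n - 3)  # expected test statistic under the alternative
    return (1 - nd.cdf(z_crit - shift)) + nd.cdf(-z_crit - shift)

N = 1200          # from the text: n > 1,200
REL_LONG = 0.4    # from the text: "mediocre" reliability of longitudinal delta
REL_CROSS = 0.9   # hypothetical reliability of cross-sectional delta
TRUE_R = 0.15     # hypothetical true correlation

observed_r = attenuated_r(TRUE_R, REL_CROSS, REL_LONG)
power_attenuated = power_correlation(observed_r, N)
power_perfect = power_correlation(TRUE_R, N)
print(f"observed r = {observed_r:.3f}")
print(f"power with attenuation = {power_attenuated:.3f}")
print(f"power without attenuation = {power_perfect:.3f}")
```

Changing `TRUE_R`, `REL_LONG`, or `N` shows how quickly power falls as the true effect shrinks or reliability drops, which is the trade-off discussed by Zuo et al. (2019).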

Regarding the validity of the variables, longitudinal brain age delta has high validity as a measure of brain change while the validity of cross-sectional brain age delta as an index of brain aging is precisely a research question of the present study as discussed in Reviewer #1, comment #1. Likewise, the validity of PGS of brain age and self-reported birth weight as congenital factors is also high. Of course, both indices only capture a tiny fraction of the genetic and environmental influences that lead to interindividual differences in brain structure already in early life. Thus, the PGS and birth weight tests offer proof-of-concept that interindividual variance in brain age delta is more influenced by early life factors than by ongoing processes, rather than quantifying or approximating the total amount of variance in brain age delta that is explained by congenital factors. We now include a succinct summary of this explanation in the limitations section.

In pp 17-8 (discussion): “We used large training datasets to estimate the brain age models and the PGS scores leading to robust PGS-BA and brain age estimates. Self-reported birth weight (Nilsen et al., 2017) and cross-sectional brain age (Franke and Gaser, 2012) are highly reliable measures; thus, our analyses are well-powered to detect small effects (Zuo et al., 2019). The reliability of brain age deltalong is, however, unknown. Strictly speaking, brain age delta is a prediction error from a model that maximizes the prediction of age in cross-sectional data and thus partially also reflects noise. Given that deltalong is estimated as the difference between two deltacross estimates, it will hence have higher noise than the cross-sectional estimates reducing the power in identifying potential associations between longitudinal and cross-sectional δ; note also the relatively short interscan interval in UK Biobank (≈2y). However, our sample size (n > 1,200) ensures that the tests performed in this study are well-powered to detect small effects, even if deltalong has mediocre reliability (Zuo et al., 2019). Further, replication of our null results in the Lifebrain sample with more observations and longer follow-up times reduces the likelihood of noise as the main factor behind the lack of relationship. Furthermore, previous studies have found that changes in brain age are partly heritable (Brouwer et al., 2021) and relate to for instance cardiometabolic risk factors (Beck et al., 2021), suggesting that it captures biologically relevant signals (i.e. has predictive validity), although with substantially different origins from cross-sectional brain age. Although the reliability of deltalong needs to be formally tested, the null relationship between deltacross and deltalong does not seem to be a result of a low-powered test.”
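The statement that a difference between two cross-sectional estimates is noisier than either estimate can be sketched with a simple variance decomposition. This is an illustration under stated assumptions, not a reliability estimate for the study: each observed delta is assumed to be the sum of a stable person-specific component, a true change component (second timepoint only), and independent measurement error with equal variance at both timepoints. The variance values are hypothetical.

```python
def reliability_cross(var_stable, var_error):
    """Reliability of a single cross-sectional score: true / (true + error)."""
    return var_stable / (var_stable + var_error)

def reliability_change(var_change, var_error):
    """Reliability of the difference between two scores.

    The stable component cancels in the difference, while the two
    independent measurement errors add, doubling the error variance.
    """
    return var_change / (var_change + 2 * var_error)

# Hypothetical variances (arbitrary units): large stable differences,
# small true change over a short interval, modest measurement error.
VAR_STABLE = 9.0
VAR_CHANGE = 1.0
VAR_ERROR = 1.0

rel_cross = reliability_cross(VAR_STABLE, VAR_ERROR)
rel_change = reliability_change(VAR_CHANGE, VAR_ERROR)
print(f"cross-sectional reliability = {rel_cross:.2f}")
print(f"change-score reliability = {rel_change:.2f}")
```

Under these assumptions, a short interscan interval (small true change variance) lowers the reliability of the change score even when each cross-sectional estimate is highly reliable.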

In p. 19 (discussion): “Finally, many genetic and environmental factors relate to lifelong stable differences in brain age beyond birth weight and PGS-BA. However, both variables are congenital and show stable associations through the lifespan (Raznahan et al., 2012; Walhovd et al., 2020, 2016) without strong evidence that they relate to brain change after adolescence. Thus, birth weight and PGS-BA are paradigmatic for showing how interindividual differences in brain age emerge early in life. The present study does not provide a systematic understanding of these influences, but presents a framework for interpreting the impact such measures may exert on age-related phenotypes.”

4. This work is demonstrated for aging samples, but should be generalizable to developing samples. Please discuss this point more comprehensively. Some recent work (e.g., https://www.sciencedirect.com/science/article/pii/S2095927320304965) may be of value.

Indeed. Both congenital factors (PGS of brain age δ and birth weight) will be associated with cross-sectional brain age delta in developmental samples. Previous research has found that other congenital indices such as preterm birth relate to brain age during childhood and adolescence. Other studies have found stable, lifelong effects of birth weight on brain structural features (Raznahan et al., 2012; Walhovd et al., 2012; Wheater et al., 2021), and stable effects of genetic factors of Alzheimer’s Disease on hippocampus volume during the entire lifespan (Walhovd et al., 2020).

Likewise, we also believe our caution when interpreting cross-sectional residual-based indices as accelerated/delayed maturation is extendable to developmental research. Yet, it is also likely that in developmental samples, brain age reflects ongoing changes to a greater degree than in aging, for the following reasons: (1) During development, interindividual differences in brain change are higher than in middle age or aging; thus, changes in brain structure are generally steeper in development than in aging. This feature might also apply to specific disease groups in aging, such as Alzheimer’s disease patients. (2) Variance associated with variables other than ongoing brain change that systematically vary across individuals should be lower than in older adults, in whom a longer lifespan should lead to a wider and larger accumulation of effects on the individuals’ brain structure. Thus, as time from birth increases, the value of chronological age as a marker of individual development is reduced. Whereas present knowledge allows us to extend our call for caution to a developmental context, the degree to which brain age delta reflects ongoing development in younger samples needs to be formally tested. The revised version of the manuscript includes a brief version of the present argumentation.

In p. 18 (discussion): “We speculate that our results partially generalize to other normative and residual-based modeling approaches as well as to developmental samples. There is considerable evidence in the literature that birth weight and genetic risk for neurodegenerative conditions affect brain structure from early life (Raznahan et al., 2012; Walhovd et al., 2020, 2016, 2012b). Brain age models are related to other models such as normative brain charts (Bethlehem et al., 2021; Dong et al., 2020) – akin to normative anthropometric charts – the main difference being that brain age models predict, rather than control for, age (Marquand et al., 2019). Both types of models produce normative brain scores, which are uncorrelated with age (Butler et al., 2021). Thus, caution is required when interpreting these scores as indices of brain aging without availability of longitudinal data. Developmental samples may, however, reflect slightly stronger relationships between cross-sectional brain age delta and ongoing brain change as brain changes during early-life development typically occur at a faster pace than in middle or later life. Similarly, for specific disease groups such as Alzheimer’s disease patients (Franke and Gaser, 2012), interindividual brain variation in brain age might reflect to a greater extent prevailing loss of brain structure. Moreover, the variance associated with other factors than ongoing aging/development might be more limited in early than later age as factors leading to interindividual variations in brain structure have a shorter span to accumulate. That is, as time from birth increases, chronological age as a marker of individual development is reduced.”

References

Beck D, Lange A-MG de, Pedersen ML, Alnæs D, Maximov II, Voldsbekk I, Richard G, Sanders A-M, Ulrichsen KM, Dørum ES, Kolskår KK, Høgestøl EA, Steen NE, Djurovic S, Andreassen OA, Nordvik JE, Kaufmann T, Westlye LT. 2021. Cardiometabolic risk factors associated with brain age and accelerate brain ageing. medRxiv 2021.02.25.21252272. doi:10.1101/2021.02.25.21252272

Bethlehem RAI, Seidlitz J, White SR, Vogel JW, Anderson KM, Adamson C, Adler S, Alexopoulos GS, Anagnostou E, Areces-Gonzalez A, Astle DE, Auyeung B, Ayub M, Ball G, Baron-Cohen S, Beare R, Bedford SA, Benegal V, Beyer F, Bae JB, Blangero J, Cábez MB, Boardman JP, Borzage M, Bosch-Bayard JF, Bourke N, Calhoun VD, Chakravarty MM, Chen C, Chertavian C, Chetelat G, Chong YS, Cole JH, Corvin A, Courchesne E, Crivello F, Cropley VL, Crosbie J, Crossley N, Delarue M, Desrivieres S, Devenyi G, Biase MAD, Dolan R, Donald KA, Donohoe G, Dunlop K, Edwards AD, Elison JT, Ellis CT, Elman JA, Eyler L, Fair DA, Fletcher PC, Fonagy P, Franz CE, Galan-Garcia L, Gholipour A, Giedd J, Gilmore JH, Glahn DC, Goodyer I, Grant PE, Groenewold NA, Gunning FM, Gur RE, Gur RC, Hammill CF, Hansson O, Hedden T, Heinz A, Henson R, Heuer K, Hoare J, Holla B, Holmes AJ, Holt R, Huang H, Im K, Ipser J, Jack CR, Jackowski AP, Jia T, Johnson KA, Jones PB, Jones DT, Kahn R, Karlsson H, Karlsson L, Kawashima R, Kelley EA, Kern S, Kim K, Kitzbichler MG, Kremen WS, Lalonde F, Landeau B, Lee S, Lerch J, Lewis JD, Li J, Liao W, Linares DP, Liston C, Lombardo MV, Lv J, Lynch C, Mallard TT, Marcelis M, Markello RD, Mazoyer B, McGuire P, Meaney MJ, Mechelli A, Medic N, Misic B, Morgan SE, Mothersill D, Nigg J, Ong MQW, Ortinau C, Ossenkoppele R, Ouyang M, Palaniyappan L, Paly L, Pan PM, Pantelis C, Park MM, Paus T, Pausova Z, Binette AP, Pierce K, Qian X, Qiu J, Qiu A, Raznahan A, Rittman T, Rollins CK, Romero-Garcia R, Ronan L, Rosenberg MD, Rowitch DH, Salum GA, Satterthwaite TD, Schaare HL, Schachar RJ, Schultz AP, Schumann G, Schöll M, Sharp D, Shinohara RT, Skoog I, Smyser CD, Sperling RA, Stein DJ, Stolicyn A, Suckling J, Sullivan G, Taki Y, Thyreau B, Toro R, Tsvetanov KA, Turk-Browne NB, Tuulari JJ, Tzourio C, Vachon-Presseau É, Valdes-Sosa MJ, Valdes-Sosa PA, Valk SL, Amelsvoort T van, Vandekar SN, Vasung L, Victoria LW, Villeneuve S, Villringer A, Vértes PE, Wagstyl K, Wang YS, Warfield SK, Warrier V, Westman E, Westwater ML, Whalley HC, Witte AV, Yang N, Yeo BTT, Yun HJ, Zalesky A, Zar HJ, Zettergren A, Zhou JH, Ziauddeen H, Zugman A, Zuo XN, AIBL, Initiative ADN, Investigators ADRWB, ASRB, Team C, Cam-CAN, Ccnp 3r-Brain, COBRE, Group EDBA working, FinnBrain, Study HAB, Imagen K, NSPN, OASIS-3, Project O, POND, The PREVENT-AD Research Group V, Alexander-Bloch AF. 2021. Brain charts for the human lifespan. bioRxiv 2021.06.08.447489. doi:10.1101/2021.06.08.447489

Brandmaier AM, Wenger E, Bodammer NC, Kühn S, Raz N, Lindenberger U. 2018. Assessing reliability in neuroimaging research through intra-class effect decomposition (ICED). eLife 7:e35718. doi:10.7554/eLife.35718

Brouwer RM, Schutte J, Janssen R, Boomsma DI, Hulshoff Pol HE, Schnack HG. n.d. The Speed of Development of Adolescent Brain Age Depends on Sex and Is Genetically Determined. Cereb Cortex. doi:10.1093/cercor/bhaa296

Butler ER, Chen A, Ramadan R, Le TT, Ruparel K, Moore TM, Satterthwaite TD, Zhang F, Shou H, Gur RC, Nichols TE, Shinohara RT. 2021. Pitfalls in brain age analyses. Human Brain Mapping 42:4092–4101. doi:10.1002/hbm.25533

Deary IJ. 2012. Looking for “system integrity” in cognitive epidemiology. Gerontology 58:545–553. doi:10.1159/000341157

Dong H-M, Castellanos FX, Yang N, Zhang Z, Zhou Q, He Y, Zhang L, Xu T, Holmes AJ, Thomas Yeo BT, Chen F, Wang B, Beckmann C, White T, Sporns O, Qiu J, Feng T, Chen A, Liu X, Chen X, Weng X, Milham MP, Zuo X-N. 2020. Charting brain growth in tandem with brain templates at school age. Science Bulletin 65:1924–1934. doi:10.1016/j.scib.2020.07.027

Elliott ML, Belsky DW, Knodt AR, Ireland D, Melzer TR, Poulton R, Ramrakha S, Caspi A, Moffitt TE, Hariri AR. 2019. Brain-age in midlife is associated with accelerated biological aging and cognitive decline in a longitudinal birth cohort. Mol Psychiatry. doi:10.1038/s41380-019-0626-7

Franke K, Gaser C. 2012. Longitudinal changes in individual BrainAGE in healthy aging, mild cognitive impairment, and Alzheimer’s disease. GeroPsych: The Journal of Gerontopsychology and Geriatric Psychiatry 25:235–245. doi:10.1024/1662-9647/a000074

Marquand AF, Kia SM, Zabihi M, Wolfers T, Buitelaar JK, Beckmann CF. 2019. Conceptualizing mental disorders as deviations from normative functioning. Mol Psychiatry 24:1415–1424. doi:10.1038/s41380-019-0441-1

Nilsen TS, Kutschke J, Brandt I, Harris JR. 2017. Validity of Self-Reported Birth Weight: Results from a Norwegian Twin Sample. Twin Res Hum Genet 20:406–413. doi:10.1017/thg.2017.44

Raznahan A, Greenstein D, Lee NR, Clasen LS, Giedd JN. 2012. Prenatal growth in humans and postnatal brain maturation into late adolescence. PNAS 109:11366–11371. doi:10.1073/pnas.1203350109

Walhovd KB, Fjell AM, Brown TT, Kuperman JM, Chung Y, Hagler DJ, Roddey JC, Erhart M, McCabe C, Akshoomoff N, Amaral DG, Bloss CS, Libiger O, Schork NJ, Darst BF, Casey BJ, Chang L, Ernst TM, Frazier J, Gruen JR, Kaufmann WE, Murray SS, van Zijl P, Mostofsky S, Dale AM, Pediatric Imaging, Neurocognition, and Genetics Study. 2012. Long-term influence of normal variation in neonatal characteristics on human brain development. Proc Natl Acad Sci USA 109:20089–20094. doi:10.1073/pnas.1208180109

Walhovd KB, Fjell AM, Sørensen Ø, Mowinckel AM, Reinbold CS, Idland A-V, Watne LO, Franke A, Dobricic V, Kilpert F, Bertram L, Wang Y. 2020. Genetic risk for Alzheimer disease predicts hippocampal volume through the human lifespan. Neurology Genetics 6. doi:10.1212/NXG.0000000000000506

Walhovd KB, Krogsrud SK, Amlien IK, Bartsch H, Bjørnerud A, Due-Tønnessen P, Grydeland H, Hagler DJ, Håberg AK, Kremen WS, Ferschmann L, Nyberg L, Panizzon MS, Rohani DA, Skranes J, Storsve AB, Sølsnes AE, Tamnes CK, Thompson WK, Reuter C, Dale AM, Fjell AM. 2016. Neurodevelopmental origins of lifespan changes in brain and cognition. Proc Natl Acad Sci USA 113:9357–9362. doi:10.1073/pnas.1524259113

Wheater E, Shenkin SD, Muñoz Maniega S, Valdés Hernández M, Wardlaw JM, Deary IJ, Bastin ME, Boardman JP, Cox SR. 2021. Birth weight is associated with brain tissue volumes seven decades later but not with MRI markers of brain ageing. NeuroImage: Clinical 31:102776. doi:10.1016/j.nicl.2021.102776

Zuo X-N, Xu T, Milham MP. 2019. Harnessing reliability for neuroscience research. Nat Hum Behav 3:768–771. doi:10.1038/s41562-019-0655-x

Associated Data

    Supplementary Materials

    Supplementary file 1. List of cortical brain features.

    List of cortical features included in the brain age model and age variance explained in the UK Biobank and the Lifebrain training datasets. Vol = volume; GWC = gray-white matter contrast; Cth = cortical thickness.

    elife-69995-supp1.docx (23.3KB, docx)
    Supplementary file 2. List of subcortical brain features.

    List of subcortical features included in the brain age model and age variance explained in the UK Biobank and the Lifebrain training datasets. Vol = volume; Int = intensity; hemi = hemisphere.

    elife-69995-supp2.docx (22KB, docx)
    Supplementary file 3. Sociodemographics.

    Main sample descriptives for the training and test datasets. Obs = mean number of observations per participant (SD). Follow-up = mean time (years) between the first and the last MRI observation (SD). For the test datasets, age and age range refer to age at baseline. *AIBL does not belong to the Lifebrain consortium but was included to enrich the replication sample.

    elife-69995-supp3.docx (14.8KB, docx)
    Supplementary file 4. Relationship between brain age delta and change in brain features.

    Long. change = longitudinal change in the raw neuroimaging features (mean change [log10(p)]). PC1 load = feature loadings on the first component of longitudinal change. Deltacross = relationship between cross-sectional brain age delta and feature change (r2 [log10(p)]). Deltalong = relationship between longitudinal brain age delta and feature change (r2 [log10(p)]). GWC = gray-white matter contrast. Cth = cortical thickness. Bil = bilateral. Subc = subcortical. n = 1372 and 1500 for the UK Biobank and the Lifebrain datasets. |N| = 365 and 372 features in the UK Biobank and the Lifebrain datasets. XGB = gradient boosting as implemented in XGBoost.

    elife-69995-supp4.docx (120.3KB, docx)
    Supplementary file 5. Contact information.

    Contact information and ethical committees for the different cohorts.

    elife-69995-supp5.docx (14.8KB, docx)
    Supplementary file 6. Data acquisition parameters.

    Data acquisition parameters for the T1w sequences. *UK Biobank employed three scanners of the same model and with equivalent parameters (Cheadle, Reading, and Newcastle centers). **AIBL does not belong to the Lifebrain consortium but was included in the Lifebrain replication dataset.

    elife-69995-supp6.docx (14.9KB, docx)
    Transparent reporting form
    Source code 1. Analysis Code.
    elife-69995-code1.zip (51.7KB, zip)

    Data Availability Statement

    The raw data were gathered from the UK Biobank, the Lifebrain cohort, and the AIBL. Raw data requests are specific to each cohort. UK Biobank and AIBL data are available upon application to UK Biobank and at https://aibl.csiro.au upon corresponding approvals. For the Lifebrain cohorts, requests for raw MRI data should be submitted to the corresponding principal investigator. See contact details in Supplementary file 5. MRI data are not openly available as participants did not consent to share their data publicly. Access to data is available upon reasonable request and transfer agreements. Different sample agreements are required for each dataset. Statistical analyses in this article are available alongside the article and will be available at https://github.com/LCBC-UiO/VidalPineiro_BrainAge. All analyses were performed in R 3.6.3. The scripts were run on the Colossus processing cluster, University of Oslo. UK Biobank’s data acquisition, MRI preprocessing, and feature generation pipelines are freely available (https://www.fmrib.ox.ac.uk/ukbiobank). For the Lifebrain cohorts, the image acquisition details are summarized in Supplementary file 6. MRI preprocessing and feature generation were performed with the freely available FreeSurfer software (https://surfer.nmr.mgh.harvard.edu/). For bash-sourcing scripts, please contact the corresponding author.

    The raw data were gathered from the UK Biobank, the Lifebrain cohort, and the AIBL. Raw data requests are specific to each cohort. UK Biobank and AIBL data are available upon application to UK Biobank and at https://aibl.csiro.au upon corresponding approvals. For the Lifebrain cohorts, requests for raw MRI data should be submitted to the corresponding principal investigator. See contact details in Supplementary File 5. Different agreements are required for each dataset. Statistical analyses in this manuscript are available alongside the manuscript and will be made available at https://github.com/LCBC-UiO/VidalPineiro_BrainAge (copy archived at swh:1:rev:2044c6ca40e0b8f99c9190c6edfde8ca76b559ac). All analyses were performed in R 3.6.3. The scripts were run on the Colossus processing cluster, University of Oslo. UK Biobank's data acquisition, MRI preprocessing, and feature generation pipelines are freely available (https://www.fmrib.ox.ac.uk/ukbiobank). For the Lifebrain cohorts, the image acquisition details are summarized in Supplementary File 6. MRI preprocessing and feature generation were performed with the freely available FreeSurfer software (https://surfer.nmr.mgh.harvard.edu/).


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd
