Abstract
Brain structure in later life reflects both influences of intrinsic aging and those of lifestyle, environment and disease. We developed a deep neural network model trained on brain MRI scans of healthy people to predict “healthy” brain age. Brain regions most informative for the prediction included the cerebellum, hippocampus, amygdala and insular cortex. We then applied this model to data from an independent group of people not stratified for health. A phenome-wide association analysis of over 1,410 traits in the UK Biobank with differences between the predicted and chronological ages for the second group identified significant associations with over 40 traits including diseases (e.g., type I and type II diabetes), disease risk factors (e.g., increased diastolic blood pressure and body mass index), and poorer cognitive function. These observations highlight relationships between brain and systemic health and have implications for understanding contributions of the latter to late life dementia risk.
Subject terms: Machine learning, Cognitive ageing, Risk factors
Introduction
Chronological age is a major risk factor for poorer physical and mental health and chronic later life neurodegenerative diseases1–3. Brain structures and functions show considerable heterogeneity, suggesting that they change at different rates between individuals as a consequence of differences in genotype, environment or lifestyle and disease4. We therefore hypothesised that age-related differences between brains relative to changes in a “healthy” normative population may provide an index of disease or disease risk.
A variety of approaches have been used for multi-dimensional modelling of “brain age” from brain MRI images and for assessing associations and differences between modelled brain ages and specific health outcomes, exposures or traits5. However, most prior studies have had relatively small sample sizes and have been applied in populations selected for a specific clinical pathology or outcome, as large-scale MRI phenotyping of large general populations has not been performed until recently6. Previous work also has relied on linear methods that cannot capture non-linear relationships within the data or “black box” machine learning methods unable to provide information concerning which brain image features were predictive, limiting the interpretability of findings.
More recently, studies have taken advantage of large-scale population-based data including those in UK Biobank and have used neural networks and other advanced methods to analyse MRI imaging data7–13. Here we build on these recent studies using a deep convolutional neural network (CNN) with T1-weighted brain MRI data from 21,382 volunteers in the UK Biobank. Our CNN model for predicting brain age was trained and validated on brain images from sub-groups selected for their relative health. We were able to assess the relative contributions of features from different brain regions to identify those that were most informative for the model using permutation importance. We then tested for the potential clinical meaningfulness of differences between modelled and chronological brain age differences by exploring associations of these differences with over 1,400 clinical, lifestyle and environmental characteristics for individuals in a different group of 12,296 of the UK Biobank volunteers that had not been stratified for relative health.
Results
Brain age prediction using the optimised neural network
The model-defined and chronological ages were strongly correlated in the training set of healthy individuals (N = 3,067, Pearson correlation = 0.97, mean absolute error = 1.71 years). However, the neural network showed a linear bias for age; individuals older/younger than the cohort average were predicted to be younger/older than they are. Similar bias has been reported and investigated in previous studies12. After linearly adjusting the model output for chronological age using data from the training set, the model showed no significant bias on the validation set (N = 3,926). We explored the model’s accuracy on a test set containing only healthy individuals (N = 2,057), on which it achieved a mean absolute error of 2.87 years.
We then applied the model to the larger test set who had not been stratified for health (N = 12,196). The model achieved a mean absolute error of 3.42 years and results described a unimodal distribution of differences between predicted and chronological ages (Fig. 1). There was a strong direct relationship (Fig. 2, Pearson correlation coefficient 0.82, P = 2.67 × 10–242).
Contributions of different brain regions to brain age predictions
We attempted to partially explain those image features contributing most to the age prediction model. We assessed differential contributions of brain regions at the level of major white and grey matter regions by serial inference of the model with image regions permuted between individuals. Six brain regions (left cerebellar lobules I-IV and the left crus and vermis, the right hippocampus, left amygdala, and left insular cortex) were found to contribute most to the accuracy of age prediction (Fig. 3 and Supplementary Fig. 1). Removing the information contributions of any of those regions caused the mean average error to increase by more than 0.10 years.
Phenome-wide association study
We then explored the potential meaningfulness of differences between model predicted and chronological ages in the unstratified test population. We explored associations of these differences with more than 1,410 of the International Statistical Classification of Diseases and Related Health Problems (ICD) codes, self-reported clinical conditions and physical, lifestyle and environmental phenotypes. Of these, 24 were found to be significantly associated (corrected P < 2.35 × 10–5) with brain age differences (20 direct associations; 4 inverse associations) (Fig. 4 and Table 1).
Table 1.
Categorical and ordered traits | Category | Odds ratio (95% CI) | p-value | Rate of incidence (cases/controls) |
---|---|---|---|---|
Multiple sclerosis | Diagnoses | 4.04 (2.37, 6.93) | 2.85 × 10–07 | 18/12,278 |
Type 1 diabetes | Diagnoses | 2.39 (1.73, 3.29) | 1.17 × 10–07 | 49/12,247 |
Taking insulin | Touchscreen | 2.22 (1.55, 3.14) | 2.22 × 10–05 | 36/1633 |
Type 2 diabetes | Diagnoses | 1.42 (1.25, 1.61) | 7.77 × 10–08 | 334/11,962 |
Essential (primary) hypertension | Diagnoses | 1.22 (1.15, 1.29) | 1.22 × 10–10 | 1820/10,476 |
Number of treatments/medications taken | Verbal interview | 1.13 (1.09, 1.18) | 6.02 × 10–10 | 12,294 |
Continuous traits | Category | Beta (95% CI) | p-value | N samples |
---|---|---|---|---|
Systolic brachial blood pressure during PWA | Heart MRI | 0.08 (0.06, 0.10) | 2.20 × 10–13 | 10,338 |
Diastolic blood pressure | Physical measures | 0.08 (0.06, 0.10) | 2.20 × 10–14 | 11,830 |
Central systolic blood pressure during PWA | Heart MRI | 0.08 (0.06, 0.10) | 7.59 × 10–13 | 10,337 |
Systolic blood pressure | Physical measures | 0.07 (0.06, 0.09) | 1.91 × 10–14 | 11,830 |
End systolic pressure during PWA | Heart MRI | 0.07 (0.05, 0.09) | 3.04 × 10–10 | 10,306 |
Peripheral pulse pressure during PWA | Heart MRI | 0.07 (0.05, 0.09) | 3.25 × 10–10 | 10,335 |
Time taken to enter values in symbol-digit matching test | Cognitive function | 0.07 (0.04, 0.09) | 2.67 × 10–07 | 6592 |
Central pulse pressure during PWA | Heart MRI | 0.07 (0.04, 0.09) | 7.47 × 10–10 | 10,335 |
Cardiac output during PWA | Heart MRI | 0.07 (0.04, 0.09) | 1.47 × 10–09 | 10,188 |
Cardiac index during PWA | Heart MRI | 0.07 (0.05, 0.09) | 1.94 × 10–10 | 10,046 |
Stroke volume during PWA | Heart MRI | 0.05 (0.03, 0.07) | 4.36 × 10–06 | 10,190 |
Ventricular rate | Physical measures | 0.05 (0.03, 0.07) | 2.23 × 10–05 | 10,656 |
Central augmentation pressure during PWA | Heart MRI | 0.05 (0.03, 0.07) | 1.39 × 10–05 | 10,333 |
Body mass index | Physical measures | 0.04 (0.02, 0.06) | 3.33 × 10–05 | 12,260 |
Hand grip strength (right) | Physical measures | − 0.03 (− 0.05, − 0.02) | 3.51 × 10–06 | 12,267 |
Hand grip strength (left) | Physical measures | − 0.04 (− 0.05, − 0.03) | 1.72 × 10–08 | 6581 |
Number of symbol digit matches attempted | Cognitive function | − 0.07 (− 0.09, − 0.04) | 4.08 × 10–07 | 6581 |
Number of symbol digit matches made correctly | Cognitive function | − 0.07 (− 0.10, − 0.05) | 1.82 × 10–08 | 10,338 |
The diagnoses and traits associated with brain age differences included cardiovascular and metabolic diseases and risk factors, cognitive function and physical strength. For example, there was a four-fold higher risk of having been diagnosed with multiple sclerosis (OR 4.04, 95% CI 2.38–6.93) for each positive SD difference between predicted and chronological age (3.7 years). Measures of blood pressure also showed positive associations with brain age difference including the self-reported diagnosis of hypertension (OR 1.22, 95% CI 1.15–1.29 per SD increase in brain age difference) and measured systolic (beta 0.07, 95% CI 0.06–0.09) and diastolic (beta 0.08, 95% CI 0.06–0.10) blood pressures. Direct associations were found between brain age difference and metabolic traits such as type I (OR 2.39, 95% CI 1.73–3.29) and type II (OR 1.42, 95% CI 1.25–1.61) diabetes and participants taking insulin (OR 2.22, 95% CI 1.55–3.14) (Table 1).
Conversely, individuals with a predicted age younger than their chronological age were found to have greater physical strength, as reflected in hand grip strength (beta − 0.03, 95% CI − 0.05 to − 0.02). Brain age differences were inversely associated with improved performance in tests included in the UK Biobank cognitive battery, including time taken to enter values in a digit-symbol matching test (beta 0.07, 95% CI 0.04–0.09) and the numbers of symbols matched correctly (beta − 0.07, 95% CI − 0.10 to − 0.05).
With a less stringent threshold of 5% FDR (P = 1.45 × 10–3), 20 additional associations with brain age difference were observed (16 direct associations, 4 inverse associations, Supplementary Table 2). Additional positive associations with brain age difference included having had a depressive episode (OR 3.33, 95% CI 1.85–6.01), history of a prior psychiatric episode (OR 1.97, 95% CI 1.35–2.87) and higher neuroticism score (OR 1.07, 95% CI 1.03–1.12).
Mendelian randomisation analysis of selected traits
We used Mendelian randomisation to investigate the effects of genetic determinants for a range of traits on brain age differences in order to explore the potential for causality (Supplementary Table 3). The association of a higher genetically determined diastolic blood pressure with higher brain age difference in main and sensitivity regression analyses (inverse variance weighted: beta 0.06, p-value 0.01, weighted median: beta 0.07, p-value 0.02, MR Egger: beta 0.14, p-value 0.05) ( Supplementary Table 3 and Supplementary Fig. 2) provided evidence in support of a causal relationship of blood pressure. The MR-Egger intercept, a measure of pleiotropy that may bias the main inverse variance weighted estimate, did not reach statistical significance, further supporting this conclusion. There was also no evidence for heterogeneity (Cochrane’s Q for inverse variant weighted median: 0.14, Cochrane’s Q for MR Egger: 0.15). By contrast, we did not find evidence for a causal influence of diabetes on brain age differences.
Discussion
In this large study of 21,382 middle and older aged participants with rich brain MRI data, we developed a deep learning approach to calculate brain age difference with respect to a healthy reference population. Brain age difference should reflect cumulative effects on brain structure associated with effects of environmental, lifestyle and disease exposures, as well as individual differences in genotype. We have approached the “explainability” of this measure by characterising the brain regions whose features made the greatest contributions to brain age difference, which were discovered to be the cerebellum, hippocampus, amygdala and insular cortex. Finally, we conducted an exploration of over 1,400 phenotypes and traits and demonstrated associations between brain age difference and clinically meaningful traits related to cardiovascular, metabolic and brain diseases.
Previous studies have used measures of difference or changes in brain or grey matter volumes over time to provide indirect surrogate measures of the relative rates of brain neuronal volume loss during life14. Recent studies use MRI features to predict age-related changes in the brain9–12,15 and have applied multivariate methods such as lasso16 or independent component analysis17, to prospectively derived image features. These studies have described structural and functional changes distinguishing individuals in a dataset 11, and revealed that many informative features can be related to brain volume and white-matter microstructure differences10. However, many of these previous models were trained on unselected populations that included both people who are healthy and those with disease or disease risk factors. We selected a relatively healthy training population, as also described in two recent studies10,15, to fit the model so that positive deviations between predicted and calendar ages can be interpreted as potential signals of disease or disease risk. Given that a hypothesis derived from previous studies is that disease or disease risk factors modify brain aging measures, previous approaches may reduce sensitivity to pathology, although strong influences may be captured by even using heterogeneous training sets or model constructs, e.g., the relationship between elevated blood pressure and greater brain age reported by us and in earlier work10. It may be one factor accounting for the heterogeneity of brain aging trajectories elegantly described earlier11. For our analyses, we selected a relatively healthy training population to fit the model so that positive deviations between predicted and calendar ages can be interpreted as potential signals of disease or disease risk. Additionally, relationships between brain structures and relative signal intensities (e.g., those reflected in MRI “texture” measures) also change18. Unlike simple scalar volume measures, these structural and tissue image texture changes are highly multi-dimensional. Linear models may be relatively insensitive to complex modes of variation or interventions. Our approach uses a deep non-linear CNN that enables capture of complex differences between brain structure and a normative model built from features that are not able to be defined a priori. CNNs achieve state-of-the-art performance on many tasks, including age prediction from brain MRI8,13,15. CNNs are prone to overfitting, which likely explains the difference between train and test accuracy observed. This highlights the importance of the completely held-out test set such as has been used here for generalisability and comparisons with other work. 2D CNNs have been used effectively8 and trade spatial information for the ability to leverage models pre-trained on natural images. Here, we opted for a 3D CNN that retains and leverages the entire MRI volume.
We also identified regions most important for model predictions using permutation importance, an emerging method for interpreting neural network models on brain MRI 19. This highlighted brain regions playing central roles in cognition and memory (hippocampus), emotional regulation and salience (amygdala) and physiological homeostasis (insula). All of these regions have been recognised previously as having a functional role or showing population differences relevant to brain health20–22. The importance of the cerebellum for brain age is of particular interest, as relationships between cerebellar pathology and cognitive dysfunction or late life neurodegenerative diseases remain poorly described23,24. All other regions did have a smaller, but measurable, impact on model predictions (supplementary Fig. 1). This suggests that the model might have learned a global representation along with more local, region-based patterns. Our observations point to the importance of future work in applying emerging approaches that combine principles from statistics and machine learning for models based on learned features that can be described with greater granularity. Their discovery could contribute to better understanding of the mechanisms underlying relationships between brain structure and disease risk.
Brain structural changes learned by model correlated with clinical diagnoses and phenotypic characteristics, as well as cognitive function, extending prior studies based on different models and using different training sets that also described deviations of predicted ages from chronological ages (similar to the brain age difference metric described here) amongst pathological or “at risk” subgroups of the larger cohort 8–11,25–29. Here, individuals with higher brain age difference performed worse in cognitive tests for fluid intelligence, giving support to the index as an informative metric. In terms of diseases, participants with multiple sclerosis had higher brain age differences, revealing changes to brain in addition to those that are age-related. Indeed, multiple sclerosis is associated with macro- and microscopic inflammatory and demyelinating pathology in both white and grey matter30 and has previously been associated with increased brain age differences31,32. In contrast, we did not observe a statistically significant association between brain age difference and cerebral infarction. This is possibly due to cerebral infarction causing heterogenic brain changes between individuals that are not consistent enough to be associated with a measure across a population. Alternatively, our stringent multiple testing corrections to guard against false positives can potentially cause some true positives to be missed; a limitation of the approach. Analysis showed associations between higher brain age difference and type I and type II diabetes, a finding previously observed in other imaging studies33 including analysis on UK biobank10 and supported by previously recognised effects of diabetes on brain structure34,35. This study also revealed a direct association between brain age differences and vascular disease risk factors, particularly blood pressure. Associations between brain age deviations using distinct approaches to ours but similar data from UK biobank also revealed associated between this phenotype and blood pressure adding internal validity to these results10. Although a relationship between hypertension and both cognitive decline and brain atrophy has already been established35,36, and prolonged hypertension is recognised to be associated with increased white matter pathology37,38 the mechanisms of these associations are not well defined39. Physical fitness was associated with lower brain age difference highlighting the association of physical fitness not only to functional40, but also to structural changes of the brain.
We examined the potential for causality in the associations between traits and brain age differences. MR analysis demonstrated a likely causal relationship between increased diastolic blood pressure and increased brain age. This implies that reducing diastolic blood pressure would have an impact on the relative brain age, broadly consistent with clinical evidence that reducing or preventing hypertension reduces the risk of strokes41. By contrast, our MR analysis did not provide evidence for a causal effect of diabetes on brain age differences, suggesting common, pleotropic factors (pleiotropy) may contribute causally to both, consistent with the possibility that treatments for diabetes mellitus also may have an independent impact on late-life neurodegenerative processes42. Future prospective studies relating brain age differences and the incidence of cognitive impairments would add to confidence that the measure could be used as a risk stratification tool for late life cognitive impairments or other brain disorders.
Nonetheless, although we have adopted use of the concept of “brain age” proposed previously, we believe that the term should be used cautiously. Aging is referenced to time since birth, but incorporates concepts of time-dependent intrinsic biological processes and individually specific influences acting on a tissue or person43; changes to the brain during the life course are more than solely a consequence of time (age) alone. Our use of a healthy population for training is intended to maximise the interpretability of brain age difference as an index of risk of dysfunction or disease. As noted above, the sensitivity of the approach to factors affecting individuals is limited, as the interpretability analysis is based on population characteristics. Further work could focus on developing descriptions and explanatory hypotheses at an individual level (e.g., using methods such as Shapely Additive Explanations44). Another important limitation is the accuracy of region definitions. Although the images were carefully aligned as part of UK Biobank pre-processing, imperfect alignment would cause boundary effects. These were partially mitigated by running the permutation multiple times and averaging the results. However, the relative accuracy of detection of disease or disease risk associations depends on the population sample size and structure; our detections of associations are, for example, impacted by the relatively low prevalence of stroke, multiple sclerosis and diabetes in the population studied. We are making our model openly available for others but need to highlight that it was developed and validated with UK Biobank data; generalisation would require extending training to include new target populations or data acquired using different MRI platforms or sequences. Finally, we analysed the associations between genetic variants and brain age differences in order to perform MR analysis; however, an in-depth investigation of genetic determinants of brain age was beyond the scope of this work and has been covered elsewhere9,13. We adopted a hypothesis free approach to investigate a range of phenotypes in relation to brain age differences with adjustment for multiple comparisons. However, different associations had different sample sizes and therefore power to detect associations which should be taken into account when interpreting the associations.
Our results add to a growing literature showing that brain structural differences provide a general marker of systemic health. They suggest that brain age difference may be an index of health or risk of later life metabolic, cardiovascular and brain diseases and functional traits relevant to health. Consistent with conclusions from large cohort treatment studies41, our results suggest a direct causal link between higher diastolic blood pressure and brain age difference. With larger populations and further refinement of methods, the approach may help to better define risk factors of late life brain disease. Stratifying people on this (or a similar) index may help identify those who could benefit most from interventions for risk factor reduction for future brain health. Finally, this work further adds to the literature on the potential use of AI as a decision-support tool to enhance the information available from neuroradiological reporting.
Methods
Description of UK Biobank and imaging data
UK Biobank is a population-based cohort study of ~ 500,000 participants, who were recruited from the UK general population between 2006 and 2010. At baseline, participants, who were between 40–69 years old, provided blood samples for biochemical tests and genotyping and a wide range of self-reported information and physical measurements, and consented for their data to be linked to Hospital Episode Statistics (HES). Detailed protocols for obtaining the measurements from participants have been described6. The UK Biobank resource is open to all bona-fide researchers anywhere in the world, including those funded by academia and industry. An imaging extension to the existing UK Biobank study was initiated in 2016 with plans to scan 100,000 individuals from the cohort by 2022–23. Here we have analysed the interim release of T1-weighted structural brain MRI on 21,382 participants from this imaging sub-study45. The images were captured on a 3 T Siemens Skyra scanner (software platform VD13). Each image was single channel with dimensions 182 × 218 × 182 at 1 mm3 resolution. Images used here were aligned to the MNI152 template46 as part of UK Biobank pre-processing.
Study design
To set up a supervised machine learning framework, we split the data from 21,382 individuals into four sets: a training (N = 3,067), validation (N = 3,962), healthy test (N = 2,057) and unselected (general) population test (N = 12,296) set. Individuals were divided randomly between the four sets, subject to restrictions on the number of apparently healthy and unhealthy individuals in each set. Healthy participants were defined as those who did not report health conditions or chronic treatments for health conditions and hospital ICD coding available through UK Biobank did not identify chronic disease at the time of, or preceding, baseline assessment. The training set was used to optimise parameters of the neural network and learn relationships between healthy brain structure and age. This allowed the model to learn a representation of the physiological age-related modifications in brains of healthy individuals as a proxy for changes occurring with normal aging processes. Including brains from unhealthy individuals would have forced the model to learn structural changes related to disease and correct its predictions accordingly. The validation set was used to tune and assess the model without exposing it to individuals reserved for the final analysis.
We used two separate hold-out test sets to evaluate model performance. The former containing only healthy individuals to test the model on unseen healthy individuals. The latter test set was used for conducting the phenome-wide association study and included individuals who had not been stratified for health status. See Supplementary Table 1 for descriptive characteristics of each data set.
Deep neural network for brain age difference modelling
The objective was to determine the brain age difference for an individual, given an MRI of their brain and their chronological age. We defined brain age difference as the difference between an individual’s chronological age and the age predicted by the deep learning model from T1-weighted MRI.
The age prediction model was a convolutional neural network47,48 that took as input a T1-weighted structural image with dimensions 182 × 218 × 182 and output the predicted age of the person. Our network used 3D convolutional operations, batch-normalisation49 and activations with a rectified linear unit, residual blocks, based on those in residual networks (ResNets)50. For a detailed description of the model, please refer to the supplementary methods. Training was run by minimising the mean squared error between the true and predicted values and optimized using Adam51. We used 3D convolutions throughout the network to leverage the full structure of the MRI. We built the model in PyTorch52 and TensorLy53. Models were trained on an NVIDIA P100 GPU.
We used a permutation importance approach19,54,55 to analyse the importance of different brain regions by quantifying their contribution to model predictions. The method works by repeatedly permuting a specific feature, in this case a region of the brain, between individuals. Serial repetition across the population develops a model of the distribution of predictions without the region that can be compared with that before data removal. For a detailed description of the approach, please refer to the supplementary methods.
Statistical analysis
We performed a phenome-wide association study to test for associations between brain age difference and 1,410 phenotypic characteristics measured on UK Biobank participants through clinical assessments, record linkage and health and lifestyle questionnaires. We used the PHEASANT analysis method to perform this agnostic scan as previously described56. Briefly, the software uses a series of regression analyses (linear for continuous traits and logistic for binary traits) to associate traits with the exposure of interest (i.e. brain age difference). For each trait, samples with missing values were excluded from that analysis. All analyses were adjusted for age, sex and assessment centre. To account for multiple testing, we used Bonferroni correction (P = 2.35 × 10–05). In sensitivity analyses, in order to account for the correlation between the measured phenotypes in UK Biobank we also used a false discovery rate (FDR) of 5% using the Benjamini–Hochberg procedure to account for multiple testing57.
Mendelian randomisation
Causality of associations between brain age differences and traits is difficult to infer. Mendelian randomisation provides a method for assessing the potentially causal nature of some associations58. We performed two-sample Mendelian randomisation analysis of selected traits (exposure) that had summary GWAS data available (diastolic and systolic blood pressure, pulse pressure, Alzheimer’s disease and diabetes) to explore the causality of the reported associations. Genetic variants used as instruments were obtained from DIAGRAM 1000G study59 for type 2 diabetes, from the International Genomics of Alzheimer's Project60 for Alzheimer’s diseases, and from a recent study61 for blood pressure. For the latter, we used the allele effects from the International Consortium for Blood Pressure62 to avoid bias due to overlapping samples. For the association of the genetic variants with brain age difference, we used the data from UK Biobank. We matched and harmonised the effective allele for each set of instruments with brain age difference and removed the correlated variants using LD clumping (r2 < 0.1). We estimated the causal effects using the inverse variance weighted method. Potential pleiotropic effect was detected using heterogeneity tests and sensitivity analysis was done using weighted median and MR Egger regression methods to rule out pleiotropic effects. All analyses were done using the Two Sample MR package63.
Supplementary information
Acknowledgements
The authors thank Professor Korbinian Strimmer (University of Manchester) for early discussion on the project, Dr. Suzuki Hideaki (Tohoku University) for suggestions on brain region interpretation, Dr. Jean Kossaifi (NVIDIA AI, Imperial College London) for suggestions on the deep neural network, Dr. Kanta Chechi (Imperial College London) for helping set up GWAS analysis and Dr. Jian Huang (Imperial College London) for assisting with Mendelian randomisation analysis. This research has been conducted using the UK Biobank Resource under Application Number 18545. AK is funded by a Medical Research Council studentship. PMM also has been in receipt of generous personal and research support from the Edmond J Safra Foundation and Lily Safra, an NIHR Senior Investigator’s Award. PE receives support from the Medical Research Council for the MRC Centre for Environment and Health (MR/S019669/1). This work is supported by the UK Dementia Research Institute which receives its funding from UK DRI Ltd, funded by the UK Medical Research Council, Alzheimer's Society and Alzheimer's Research UK. This work has been supported by the Imperial College NIHR Biomedical Research Centre.
Author contributions
A.K. designed the study, developed methods, analysed the data, drafted and led in revision of the manuscript for intellectual content. S.F. designed study, developed methods and revised the manuscript for intellectual content. Y.P. developed methods and revised the manuscript for intellectual content. P.M.M. supported design of the study, interpreted the data and revised the manuscript for intellectual content. P.E. interpreted the data and revised the manuscript for intellectual content. A.D. guided design of the study, interpreted the data and revised the manuscript for intellectual content. I.T. supervised A.K., jointly led in design of the study, interpreted the data and revised the manuscript for intellectual content.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
is available for this paper at 10.1038/s41598-020-76518-z.
References
- 1.Sowell ER, et al. Mapping cortical change across the human life span. Nat. Neurosci. 2003;6:309. doi: 10.1038/nn1008. [DOI] [PubMed] [Google Scholar]
- 2.Stern Y. Cognitive reserve in ageing and Alzheimer's disease. Lancet Neurol. 2012;11:1006–1012. doi: 10.1016/S1474-4422(12)70191-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Matthews FE, et al. A two-decade comparison of prevalence of dementia in individuals aged 65 years and older from three geographical areas of England: results of the Cognitive Function and Ageing Study I and II. Lancet. 2013;382:1405–1412. doi: 10.1016/S0140-6736(13)61570-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Raz N, et al. Regional brain changes in aging healthy adults: general trends, individual differences and modifiers. Cereb. Cortex. 2005;15:1676–1689. doi: 10.1093/cercor/bhi044. [DOI] [PubMed] [Google Scholar]
- 5.Cole JH, Marioni RE, Harris SE, Deary IJ. Brain age and other bodily ‘ages’: implications for neuropsychiatry. Mol. Psychiatry. 2019;24:266. doi: 10.1038/s41380-018-0098-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Sudlow C, et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med. 2015;12:e1001779. doi: 10.1371/journal.pmed.1001779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Dinsdale, N. K. et al. Learning patterns of the ageing brain in MRI using deep convolutional networks. NeuroImage, 117401 (2020). [DOI] [PubMed]
- 8.Bashyam VM, et al. MRI signatures of brain age and disease over the lifespan based on a deep brain network and 14 468 individuals worldwide. Brain. 2020;143:2312–2324. doi: 10.1093/brain/awaa160. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Ning K, Zhao L, Matloff W, Sun F, Toga AW. Association of relative brain age with tobacco smoking, alcohol consumption, and genetic variants. Sci. Rep. 2020;10:1–10. doi: 10.1038/s41598-019-56847-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Cole JH. Multi-modality neuroimaging brain-age in UK Biobank: relationship to biomedical, lifestyle and cognitive factors. Neurobiol. Aging. 2020;92:34–42. doi: 10.1016/j.neurobiolaging.2020.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Smith SM, et al. Brain aging comprises many modes of structural and functional change with distinct genetic and biophysical associations. Elife. 2020;9:e52677. doi: 10.7554/eLife.52677. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Smith SM, Vidaurre D, Alfaro-Almagro F, Nichols TE, Miller KL. Estimation of brain age delta from brain imaging. NeuroImage. 2019;200:528–539. doi: 10.1016/j.neuroimage.2019.06.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Jónsson BA, et al. Brain age prediction using deep learning uncovers associated sequence variants. Nat. Commun. 2019;10:1–10. doi: 10.1038/s41467-018-07882-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Lemaitre H, et al. Normal age-related brain morphometric changes: nonuniformity across cortical thickness, surface area and gray matter volume? Neurobiol Aging. 2012;33:617e611–617e619. doi: 10.1016/j.neurobiolaging.2010.07.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cole JH, et al. Predicting brain age with deep learning from raw imaging data results in a reliable and heritable biomarker. NeuroImage. 2017;163:115–124. doi: 10.1016/j.neuroimage.2017.07.059. [DOI] [PubMed] [Google Scholar]
- 16.Tibshirani R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 1996;58:267–288. [Google Scholar]
- 17.Comon P. Independent component analysis, a new concept? Signal Process. 1994;36:287–314. doi: 10.1016/0165-1684(94)90029-9. [DOI] [Google Scholar]
- 18.Kovalev VA, Kruggel F, von Cramon DY. Gender and age effects in structural brain asymmetry as measured by MRI texture analysis. NeuroImage. 2003;19:895–905. doi: 10.1016/S1053-8119(03)00140-X. [DOI] [PubMed] [Google Scholar]
- 19.Hu, D. et al. In International Conference on Medical Image Computing and Computer-Assisted Intervention. 149–157 (Springer, Berlin).
- 20.Gunning-Dixon FM, et al. Age-related differences in brain activation during emotional face processing. Neurobiol. Aging. 2003;24:285–295. doi: 10.1016/S0197-4580(02)00099-4. [DOI] [PubMed] [Google Scholar]
- 21.Hartley T, Lever C, Burgess N, O'Keefe J. Space in the brain: how the hippocampal formation supports spatial cognition. Philos. Trans. R. Soc. B Biol. Sci. 2014;369:20120510. doi: 10.1098/rstb.2012.0510. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Menon V, Uddin LQ. Saliency, switching, attention and control: a network model of insula function. Brain Struct. Funct. 2010;214:655–667. doi: 10.1007/s00429-010-0262-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Jacobs HI, et al. The cerebellum in Alzheimer’s disease: evaluating its role in cognitive decline. Brain. 2017;141:37–47. doi: 10.1093/brain/awx194. [DOI] [PubMed] [Google Scholar]
- 24.Hoche F, Guell X, Vangel MG, Sherman JC, Schmahmann JD. The cerebellar cognitive affective/Schmahmann syndrome scale. Brain. 2018;141:248–270. doi: 10.1093/brain/awx317. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cole JH, et al. Brain age predicts mortality. Mol. Psychiatry. 2018;23:1385. doi: 10.1038/mp.2017.62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kolenic M, et al. Obesity, dyslipidemia and brain age in first-episode psychosis. J. Psychiatr. Res. 2018;99:151–158. doi: 10.1016/j.jpsychires.2018.02.012. [DOI] [PubMed] [Google Scholar]
- 27.Franke K, Gaser C, Roseboom TJ, Schwab M, de Rooij SR. Premature brain aging in humans exposed to maternal nutrient restriction during early gestation. NeuroImage. 2018;173:460–471. doi: 10.1016/j.neuroimage.2017.10.047. [DOI] [PubMed] [Google Scholar]
- 28.Cole JH, Leech R, Sharp DJ, Initiative ASDN. Prediction of brain age suggests accelerated atrophy after traumatic brain injury. Ann. Neurol. 2015;77:571–581. doi: 10.1002/ana.24367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kaufmann T, et al. Common brain disorders are associated with heritable patterns of apparent aging of the brain. Nat. Neurosci. 2019;22:1617–1623. doi: 10.1038/s41593-019-0471-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Filippi M, et al. Changes in the normal appearing brain tissue and cognitive impairment in multiple sclerosis. J. Neurol. Neurosurg. Psychiatry. 2000;68:157–161. doi: 10.1136/jnnp.68.2.157. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Cole PhD, J. H., et al. Longitudinal assessment of multiple sclerosis with the brain-age paradigm. Ann. Neurol. 2020;88:93–105. doi: 10.1002/ana.25746. [DOI] [PubMed] [Google Scholar]
- 32.Høgestøl EA, et al. Cross-sectional and longitudinal MRI brain scans reveal accelerated brain aging in multiple sclerosis. Front. Neurol. 2019;10:450. doi: 10.3389/fneur.2019.00450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Franke K, Gaser C, Manor B, Novak V. Advanced BrainAGE in older adults with type 2 diabetes mellitus. Front. Aging Neurosci. 2013;5:90. doi: 10.3389/fnagi.2013.00090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kodl CT, Seaquist ER. Cognitive dysfunction and diabetes mellitus. Endocr. Rev. 2008;29:494–511. doi: 10.1210/er.2007-0034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Suzuki H, et al. Associations of regional brain structural differences with aging, modifiable risk factors for dementia, and cognitive performance. JAMA Network Open. 2019;2:e1917257–e1917257. doi: 10.1001/jamanetworkopen.2019.17257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Swan GE, et al. Association of midlife blood pressure to late-life cognitive decline and brain morphology. Neurology. 1998;51:986–993. doi: 10.1212/WNL.51.4.986. [DOI] [PubMed] [Google Scholar]
- 37.De Leeuw FE, et al. Hypertension and cerebral white matter lesions in a prospective cohort study. Brain. 2002;125:765–772. doi: 10.1093/brain/awf077. [DOI] [PubMed] [Google Scholar]
- 38.Suzuki, H. et al. Abnormal brain white matter microstructure is associated with both pre-hypertension and hypertension. PloS ONE12 (2017). [DOI] [PMC free article] [PubMed]
- 39.Qiu C, Fratiglioni L. A major role for cardiovascular burden in age-related cognitive decline. Nat. Rev. Cardiol. 2015;12:267. doi: 10.1038/nrcardio.2014.223. [DOI] [PubMed] [Google Scholar]
- 40.Angevaren M, Aufdemkampe G, Verhaar H, Aleman A, Vanhees L. Physical activity and enhanced fitness to improve cognitive function in older people without known cognitive impairment. Cochrane Database Syst. Rev. 2008 doi: 10.1002/14651858. [DOI] [PubMed] [Google Scholar]
- 41.Howard G, et al. Is blood pressure control for stroke prevention the correct goal? The lost opportunity of preventing hypertension. Stroke. 2015;46:1595–1600. doi: 10.1161/STROKEAHA.115.009128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Meng L, Li X-Y, Shen L, Ji H-F. Type 2 diabetes mellitus drugs for Alzheimer’s disease: current evidence and therapeutic opportunities. Trends Mol. Med. 2020;26:597–614. doi: 10.1016/j.molmed.2020.02.002. [DOI] [PubMed] [Google Scholar]
- 43.Rowe JW, Kahn RL. Human aging: usual and successful. Science. 1987;237:143–149. doi: 10.1126/science.3299702. [DOI] [PubMed] [Google Scholar]
- 44.Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. Adv. Neural. Inf. Process. Syst. 2017;30:4765–4774. [Google Scholar]
- 45.Miller KL, et al. Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat. Neurosci. 2016;19:1523. doi: 10.1038/nn.4393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Jenkinson M, Bannister P, Brady M, Smith S. Improved optimization for the robust and accurate linear registration and motion correction of brain images. Neuroimage. 2002;17:825–841. doi: 10.1006/nimg.2002.1132. [DOI] [PubMed] [Google Scholar]
- 47.LeCun Y, et al. Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989;1:541–551. doi: 10.1162/neco.1989.1.4.541. [DOI] [Google Scholar]
- 48.Fukushima K. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biol. Cybern. 1980;36:193–202. doi: 10.1007/BF00344251. [DOI] [PubMed] [Google Scholar]
- 49.Ioffe, S. & Szegedy, C. Batch normalization: Accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167 (2015).
- 50.He, K., Zhang, X., Ren, S. & Sun, J. in Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778.
- 51.Kingma, D. P. & Ba, J. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
- 52.Paszke A, et al. PyTorch: an imperative style, high-performance deep learning library. Adv. Neural. Inf. Process. Syst. 2019;32:8024–8035. [Google Scholar]
- 53.Kossaifi J, Panagakis Y, Anandkumar A, Pantic M. Tensorly: Tensor learning in python. J. Mach. Learn. Res. 2019;20:925–930. [Google Scholar]
- 54.Breiman L. Random forests. Mach. Learn. 2001;45:5–32. doi: 10.1023/A:1010933404324. [DOI] [Google Scholar]
- 55.Fisher A, Rudin C, Dominici F. All models are wrong, but many are useful: learning a variable's importance by studying an entire class of prediction models simultaneously. J. Mach. Learn. Res. 2019;20:1–81. [PMC free article] [PubMed] [Google Scholar]
- 56.Millard LA, Davies NM, Gaunt TR, Davey Smith G, Tilling K. Software Application Profile: PHESANT: a tool for performing automated phenome scans in UK Biobank. Int. J. Epidemiol. 2017;47:29–35. doi: 10.1093/ije/dyx204. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B (Methodol.) 1995;57:289–300. [Google Scholar]
- 58.Davey Smith G, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? Int. J. Epidemiol. 2003;32:1–22. doi: 10.1093/ije/dyg070. [DOI] [PubMed] [Google Scholar]
- 59.Morris AP, et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nat. Genet. 2012;44:981. doi: 10.1038/ng.2383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Lambert J-C, et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer's disease. Nat. Genet. 2013;45:1452. doi: 10.1038/ng.2802. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Evangelou E, et al. Genetic analysis of over 1 million people identifies 535 new loci associated with blood pressure traits. Nat. Genet. 2018;50:1412. doi: 10.1038/s41588-018-0205-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ehret GB, et al. Genetic variants in novel pathways influence blood pressure and cardiovascular disease risk. Nature. 2011;478:103. doi: 10.1038/nature10405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Hemani G, et al. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;7:e34408. doi: 10.7554/eLife.34408. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.