Skip to main content
Springer logoLink to Springer
. 2026 Mar 11;231(3):37. doi: 10.1007/s00429-025-03070-9

Predicting brain volumes from anthropometric and demographic features: insights from UK biobank neuroimaging data

Kimia Nazarzadeh 1,2,3,, Simon B Eickhoff 2,4, Georgios Antonopoulos 2,4, Lukas Hensel 1, Caroline Tscherpel 1,5, Vera Komeyer 2,4,6, Federico Raimondo 2,4, Christian Grefkes 5, Kaustubh R Patil 2,4,
PMCID: PMC12979296  PMID: 41811501

Abstract

Brain size measures are well-studied and often treated as a confound in volumetric neuroimaging analyses. Yet their relationship with body anthropometric measures and demographics remains underexplored. In this study, we examined those relationships alongside age- and sex-related differences in global brain volumes. Using brain magnetic resonance imaging (MRI) of healthy participants in the UK Biobank, we derived global measures of brain morphometry, including total intracranial volume (TIV), total brain volume (TBV), gray matter volume (GMV), white matter volume (WMV), and cerebrospinal fluid (CSF). We extracted these measures using the Computational Anatomy Toolbox (CAT) and FreeSurfer. Our analyses were structured in three approaches: across-sex analysis, sex-specific analysis, and impact of age analysis. Employing machine learning (ML), we found that TIV was strongly predicted by sex (across-sex Inline graphic 0.68), reflecting sex difference. On the other hand, TBV, GMV, WMV, and CSF were more sensitive to age, with higher prediction accuracy when age was included as a feature, highlighting age-related changes in the brain structure, such as fluid expansion. Sex-specific models showed reduced TIV prediction (Inline graphic 0.25) but improved TBV accuracy (Inline graphic 0.44), underscoring sex-specific body-brain relationships. Anthropometric measures, particularly seated height and weight, improved prediction of TIV and TBV, while waist and hip circumference showed negative associations, though their effects generally remained secondary to age and sex. These findings advance our understanding of brain-body scaling relationships and underscore the necessity of accounting for age and sex in neuroimaging studies of brain morphology.

Supplementary Information

The online version contains supplementary material available at 10.1007/s00429-025-03070-9.

Keywords: Brain volume, Sex difference, Machine learning, UK Biobank, Structural MRI, Anthropometrics

Introduction

The intricate interplay between brain morphology and body characteristics represents a fundamental aspect of neuroimaging research, revealing how brain structure varies with respect to factors such as anthropometrics, age, sex, genetics, and environmental conditions (Gurholt et al. 2021). Understanding these complex body-brain relationships is critical for advancing our knowledge of brain structure and function and for improving the precision and utility of neuroimaging studies, particularly in the context of health, aging, and disease (Gurholt et al. 2021). Key global volumetric measures of brain morphology, including total intracranial volume (TIV), representing the total cranial capacity; total brain volume (TBV), the sum of whole-brain gray matter volume (GMV) and whole-brain white matter volume (WMV), and cerebrospinal fluid volume (CSF) are critical for understanding how brain structure varies with individual factors such as age, sex, and body characteristics (Raz and Rodrigue 2006; Bayat et al. 2012). Multiple studies have demonstrated that these brain volumes are associated with anthropometric features such as height, weight, and body mass index (BMI), waist circumference (WC), and hip circumference (HC), with varying effects across sexes and age groups. For instance, height and weight are positively associated with brain volumes, suggesting that larger body sizes correspond to larger brain volumes. Conversely, higher WC, an indicator of central obesity, is associated with reduced TBV and GMV, particularly in frontal, temporal, and limbic regions (Ward et al. 2005; Bayat et al. 2012). Similarly, higher HC, which reflects lower-body fat distribution, has been associated with larger brain volumes, although the underlying mechanisms and implications of this relationship have not been fully understood (Eboh et al. 2016).

Age, as a ubiquitous biological process, is a pivotal factor shaping brain volumes across the lifespan, driving significant structural changes, making it a primary risk factor for various brain disorders (Smith et al. 2007). These age-related alterations, modulated by other factors such as sex, genetics, and environmental factors, manifest distinctly across brain compartments. Evidence reveals distinct patterns of change for each brain volume (Smith et al. 2007; Chen et al. 2007; Ruigrok et al. 2014). TIV increases during childhood, then stabilizes in adulthood due to the fixed cranial structure. In contrast, TBV and its components decline markedly with aging (Mills et al. 2016). GMV decreases linearly from early adulthood, with studies showing pronounced reductions in the frontal cortex by middle age and loss across multiple regions over time. WMV follows a different trajectory, initially increasing into early adulthood before declining later in life, with evidence indicating deterioration beginning in young adulthood and progressing across various brain areas. Meanwhile, CSF volume increases with age, a consequence of brain tissue atrophy. These widespread and progressive age-related changes emphasize the dynamic and differential effects of aging on brain structure. Sex, further, as a fundamental biological factor, significantly influences patterns of brain structure and aging. Sex difference in brain morphology is well-established, with males generally exhibiting larger absolute global brain volumes than females (McCarthy et al. 2012; DeCasien et al. 2022). These differences, observed across multiple neuroanatomical features, may be influenced by genetic, hormonal, and developmental factors. On average, males have approximately greater TIV (12%), TBV (11%), GMV (9%), WMV (13%), and CSF (11.5%) compared to females (Chen et al. 2007; Ruigrok et al. 2014; Eliot et al. 2021). These percentages reflect larger brain volumes in males, which generally align with greater body size during development, as larger bodies typically correspond to proportionally larger brains. However, when controlling for overall body size, these differences diminish significantly (Ruigrok et al. 2014; Mills et al. 2016; Eliot et al. 2021). Regionally, females tend to have a larger fronto-parietal cortex, while males show a larger occipito-temporal cortex (Ritchie et al. 2018). In contrast, findings for subcortical structures, such as the hippocampus, remain inconsistent across studies (Chen et al. 2007; Ruigrok et al. 2014; Lotze et al. 2019). Understanding such sex-specific differences in global brain volumes is essential for accurately interpreting neuroimaging data and could inform clinical practices regarding neurodegenerative diseases that exhibit sex-specific patterns of progression (Ferretti et al. 2018).

Despite these insights, a critical gap remains regarding which anthropometric measures, alongside age and sex, are predictive of brain volumes at the individual level. Addressing this gap is vital for mapping the normative body-brain relationships and identifying potential biomarkers of brain health and aging, which could guide early detection and intervention strategies for neurodegenerative conditions with sex- and age-specific profiles. Motivated by this need, our study employs Machine learning (ML) to predict brain volumes derived from structural magnetic resonance imaging (MRI) data using anthropometric and demographic features in a large cohort from the UK Biobank (Inline graphic 21,807). We asked two key questions: to what degree can anthropometric features predict brain volumes compared to the dominant influences of age and sex, and how do age and sex modulate the predictive power of anthropometrics across different brain volumes? These questions matter because unraveling the relative contributions of these factors could refine our understanding of brain allometry—the proportional scaling of brain and body—and reveal association of demographic and physical traits with brain structure across the lifespan. We hypothesize that TIV will be more strongly predicted by sex due to dimorphism in body proportions, while TBV, GMV, WMV, and CSF are expected to exhibit greater sensitivity to age, reflecting their susceptibility to age-related atrophy. The potential contribution of anthropometric measures to the prediction of these brain volumes will also be explored. By applying ML to assess prediction accuracies in age- and sex-related analyses, we seek to provide a comprehensive framework that clarifies these relationships and sets the groundwork for future studies linking brain morphology to health outcomes, such as cognitive decline or disease susceptibility.

Materials and methods

Participants and data

Data for this study were obtained from the UK Biobank, a comprehensive database with over 500,000 adult participants recruited from 22 assessment centers across the United Kingdom (UK) (Sudlow et al. 2015). The baseline assessment took place between 2006 and 2010. In 2014, a subset of participants was invited to undergo brain imaging (https://www.ukbiobank.ac.uk/). This imaging assessment included a wide range of demographic and anthropometric measurements. The healthy individuals were defined based on inclusion and exclusion criteria consistent with the International Classification of Diseases, 10th revision (ICD-10), and the detailed exclusion criteria listed in Table S1. To ensure the completeness of the dataset, only participants with complete data for demographics, anthropometrics, and brain imaging were included in the final sample (Inline graphic 21,807; females Inline graphic 11,576, males Inline graphic 10,231).

Demographic and anthropometric assessments

Sex was determined based on self-reported information (Data-Field 31). Age was calculated as the difference between the participant’s birth and assessment dates. Anthropometric measurements were collected during the physical measures phase of the assessment. These included weight (Data-Field 21002), WC (Data-Field 48), HC (Data-Field 49), seated height (Data-Field 51), seating box height (Data-Field 3077), and standing height (Data-Field 50).

Brain volume

We calculated absolute brain volumes using T1-weighted (T1w) MRI data from the UK Biobank first imaging assessment visit, acquired with 3T scanners following the UK Biobank Imaging Protocol (UK Biobank Imaging Protocols, https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf). T1w images, initially preprocessed with quality control by the UK Biobank (Alfaro-Almagro et al. 2018). T1w images were segmented into gray matter, white matter, and CSF and normalised by using the default preprocessing of the CAT12.8.1 (r2042, 1.5 mm isotropic resolution) Toolbox (Gaser et al. 2024) as standalone version compiled under MATLAB 2019b and SPM12 (Wellcome Centre for Human Neuroimaging, https://www.fil.ion.ucl.ac.uk/spm/). For non-linear registration, the Shooting method (Ashburner and Friston 2011) with modulation was used. GMV, WMV, and CSF volumes were extracted from the normalised segments. Supplementary analyses were conducted using brain volumes derived from FreeSurfer ASEG segmentation, provided by the UK Biobank. We compared brain volumes derived from CAT and FreeSurfer to investigate how these distinct preprocessing approaches might influence brain volume estimates and subsequent predictive modeling outcomes. CAT employs a voxel-based morphometry framework with an explicit head model, which may improve spatial normalization accuracy and sensitivity to subtle structural variations, potentially benefiting models reliant on precise anatomical alignment. In contrast, FreeSurfer uses a surface-based approach with detailed tissue segmentation, which could enhance volume accuracy for tissue-specific predictions. By examining these differences, we aimed to assess their impact on the reliability and generalizability of our predictive models. The following absolute volumes were analyzed from FreeSurfer-processed UK Biobank fields: total intracranial volume (TIV, Data-Field 26521), gray matter volume (GMV, Data-Field 25006), white matter volume (WMV, Data-Field 25008), total brain volume (TBV, Data-Field 25010), which is defined as the sum of GMV and WMV (TBV = GMV + WMV), and cerebrospinal fluid volume (CSF, Data-Field 26527).

ML analysis

Feature configurations

In this study, we examined the relationships between brain morphology (i.e., TIV, TBV, GMV, WMV, CSF) and various anthropometric measurements, including weight, WC, HC, seated height, and seating box height, and demographic factors such as sex and age. We utilized a factorial design, incorporating two primary independent factors, sex and age, structured across multiple analytical dimensions. The sex factor was evaluated in two distinct configurations: an across-sex analysis, where features of males and females were combined, and sex was included as an additional feature, and a within-sex analysis, where data were analyzed separately for each sex. This dual approach enabled us to assess both the independent and interactive effects of anthropometric features, age, and sex on brain volumes from both CAT and FreeSurfer pipelines (see Sect. 2.3). For the age factor, we employed three analytical strategies to examine its impact: (1) including age as a feature, (2) excluding age from the feature list, and (3) removing age as a confound. These strategies were applied consistently in both across-sex and within-sex analyses. The across-sex analysis facilitated the exploration of age- and sex-related effects on brain morphology in the entire study population, while the within-sex analysis allowed us to identify sex-specific patterns and investigate how age modulates the relationships between anthropometric measurements and brain volumes within each sex group. By integrating these factorial dimensions, we aimed to provide a comprehensive evaluation of how anthropometric features, age, and sex interact to shape brain morphology in our study cohort.

Model training and performance evaluation

To predict brain volumes from anthropometric and demographic features, the data from both CAT and FreeSurfer preprocessing pipelines (see Sect. 2.3) were split into training (90%) and test (10%) datasets for both across-sex and within-sex analyses (Table 1). We employed linear support vector machine (SVM) and random forest (RF) regression models to predict brain volumes using feature combinations of anthropometric measurements (weight, WC, HC, seated height, and seating box height), age, and sex as structured by the factorial design detailed in Sect. 3.1. Model performance was evaluated using Pearson’s correlation coefficient (Inline graphic), the coefficient of determination (Inline graphic), and mean absolute error (Inline graphic). To ensure robust generalization estimates, we implemented 10 times repeated 10-fold (10Inline graphic10-fold) cross-validation (CV) using the Julearn machine learning library (version 0.2.7; https://juaml.github.io/julearn/) (Hamdan et al. 2023), which builds on the scikit-learn library (sklearn) (Pedregosa et al. 2011). For the linear SVM, the hyperparameter Inline graphic was determined heuristically as Inline graphic where​ Inline graphic represents the number of subjects, Inline graphic is the feature matrix of the dataset, Inline graphic denotes the value of feature Inline graphic for subject Inline graphic. This ensures C scales proportionally with the size and norm of the input data. As a nonlinear method, we also trained an RF regression model using the sklearn (version 1.2.1), with 100 trees, a minimum of 2 samples per split, the square root (sqrt) of the total number of features as the maximum number of features considered for the best split and bootstrapping of the training samples (true) as the hyperparameters (default settings in this version of sklearn). We compared the performance of the linear SVM and RF models from the 10 × 10-fold CV procedure using the training dataset, with each iteration involving random shuffling of the dataset prior to fold assignment to ensure unbiased partitioning. This procedure yielded 100 independent training–test cycles, providing a robust estimate of model generalizability. To formally test whether observed differences between models were statistically significant, we applied the corrected repeated k-fold paired t-test (Nadeau and Bengio 2003; Bouckaert and Frank 2004), with results reported in Supplementary Table S2. The model exhibiting superior performance in CV was selected for further analysis. Feature importance (FI) scores were then derived from the final models trained on the full training dataset (90% of the data) to quantify the contribution of individual features to the model outputs. These FI values are unitless, reflecting relative weights of predictors in the linear SVM model, and should therefore be interpreted in terms of relative rather than absolute magnitude (Guyon et al. 2002). For the RF models, FI scores were obtained from the scikit-learn attribute “feature_importances_”, which reflects the mean decrease in impurity (average variance reduction) contributed by each feature across all trees. For the linear SVM, FI scores were based on the model coefficient parameter (scikit-learn model’s “.coef_”), which represents the weights used for prediction. While these coefficients provide relative indicators of feature contribution, their interpretation can be influenced by correlations among features (Haufe et al. 2014). To address this, we also computed Haufe-transformed patterns, which re-express the model coefficients in the space of the original features by incorporating the covariance structure of the data providing values that more directly reflect the association between each feature and the target. Finally, we used the 10% hold-out test set to predict brain volumes and compared the predicted values to the true values, confirming the model’s performance on unseen data beyond the CV framework.

Table 1.

Summary of the imaging population characteristics

Dataset Train dataset Test dataset
Sex Both sexes Female Male Both sexes Female Male
Number 19,625 10,418 9,207 2,182 1,158 1,024
Age, mean (SD) 63.32 (7.5) 62.66 (7.28) 64.07 (7.68) 63.27 (7.44) 62.29 (7.22) 64.37 (7.54)
Weight, mean (SD) 75.11 (14.74) 68.20 (12.53) 82.93 (13.06) 74.84 (14.6) 68.13 (12.48) 82.43 (13.01)
WC, mean (SD) 87.18 (12.36) 81.72 (11.39) 93.35 (10.35) 86.97 (12.24) 81.63 (11.18) 93.01 (10.45)
HC, mean (SD) 100.32 (8.52) 100.25 (9.56) 100.39 (7.16) 100.19 (8.37) 100.22 (9.26) 100.16 (7.23)
Seated height, mean (SD) 131.47 (7.26) 127.33 (4.86) 136.16 (6.65) 131.54 (7.33) 127.46 (5.05) 136.15 (6.76)
Box height, mean (SD) 42.11 (4.23) 40.81 (3.09) 43.59 (4.82) 42.16 (4.34) 40.77 (3.22) 43.73 (4.88)

WC: Waist circumference

HC: Hip circumference

SD: Standard deviation

Results and discussion

In this manuscript, the results and discussion are presented in a combined section to facilitate a more direct interpretation of the findings. This integrated format allows each result to be immediately contextualized and discussed, improving clarity and avoiding redundancy.

Association between age and body-brain structure

Correlation analyses between age, anthropometric measurements, and brain volumes in female and male cohorts revealed complex age-related relationships in body-brain associations (Fig. 1). Pearson’s correlation coefficient was used to assess linear relationships between continuous variables. WC (Fig. 1B) showed weak positive correlations with age (females: Inline graphic = 0.06, Inline graphic < 0.001; males: Inline graphic = 0.04, Inline graphic = 0.0003), suggesting a subtle increase in visceral fat distribution as individuals age. Although statistically significant, these correlations were small in magnitude, suggesting limited effect sizes. The statistical significance of these correlations with small effect sizes likely reflects the large sample size rather than a meaningful biological association. Despite their weakness, these trends align with prior research documented a tendency for abdominal fat accumulation with advancing age, attributed to metabolic and hormonal changes (Kuk et al. 2009; Frank et al. 2019; Ponti et al. 2020). Caution is warranted in interpreting these weak associations, and further research may clarify their relevance. In contrast, weight, HC, seated height, and standing height (Fig. 1A and C-E) exhibited negative correlations. Among these, seated height (females: Inline graphic = -0.25, Inline graphic < 0.001; males: Inline graphic = -0.23, Inline graphic < 0.001) and standing height (females: Inline graphic = -0.20, Inline graphic < 0.001; males: Inline graphic = -0.20, Inline graphic < 0.001) exhibited the strongest declines, consistent with established patterns of height loss due to vertebral compression, bone mass reduction, and an excessive forward rounding of the upper back (kyphosis) (Cummings and Melton 2002; Volpi et al. 2004). On the other hand, weight demonstrated slightly lower negative correlations (females: Inline graphic -0.10, Inline graphic < 0.001; males: Inline graphic -0.16, Inline graphic < 0.001), likely reflecting the progressive loss of muscle mass (sarcopenia) that occurs with aging, as muscle, being denser than fat, contributes significantly to body weight (Volpi et al. 2004; Kuk et al. 2009). HC showed a much weaker negative correlations (females: Inline graphic = -0.03, Inline graphic = 0.0026; males: Inline graphic = -0.06, Inline graphic < 0.001) suggesting that fat redistribution in the hips occurs more slower than in the abdomen, potentially preserving hip shape in aging relative to some body regions with more pronounced fat accumulation (Kuk et al. 2009). These findings illustrate a complex interplay of anthropometric changes that may influence age-related alterations, highlight the importance of considering a range of body measurements when assessing the impact of aging on brain morphology.

Fig. 1.

Fig. 1

Association between age and anthropometric measurements and brain volumes derived from the CAT data, separated by sex. Panels (A-E) display the relationships between age and anthropometric measurements: (A) weight, (B) waist circumference (WC), (C) hip circumference (HC), (D) seated height, and (E) standing height. Panels (FJ) show the correlations between age and brain volumes: (F) total intracranial volume (TIV), (G) total brain volume (TBV), (H) gray matter volume (GMV), (I) white matter volume (WMV), and (J) cerebrospinal fluid volume (CSF). Analyses were performed separately for females (N = 11,576) and males (N = 10,231)

Brain volumes displayed marked age-related trajectories (Fig. 1F-J). TIV (Fig. 1F) showed a small negative correlation with age (females: Inline graphic = -0.07, Inline graphic < 0.001; males: Inline graphic = -0.04, Inline graphic < 0.001), indicating that overall cranial capacity remains relatively stable across the lifespan. This stability is expected, as TIV includes bone structure, which is developed during childhood and remains constant throughout adulthood despite age-related changes in brain volumes (Smith et al. 2007; Fjell et al. 2009; Blinkouskaya et al. 2021; Cox 2024). In contrast, TBV (Fig. 1G), GMV (Fig. 1H), and WMV (Fig. 1I) demonstrated more pronounced decline with age. GMV exhibited the strongest negative correlations (females: Inline graphic = -0.34, Inline graphic < 0.001; males: Inline graphic = -0.39, Inline graphic < 0.001), suggesting a decline likely due to neuronal loss and synaptic pruning associated with normal aging (Gennatas et al. 2017). WMV displayed moderate negative correlations (females: Inline graphic = -0.26, Inline graphic < 0.001; males: Inline graphic = -0.31, Inline graphic < 0.001), less pronounced than GMV but still significant, potentially impacting neural communication efficiency in older adults. TBV, the sum of GMV and WMV, mirrored the declines in these volumes. Conversely, CSF (Fig. 1J) increased sharply (females: Inline graphic = 0.41, Inline graphic < 0.001; males: Inline graphic = 0.57, Inline graphic < 0.001), more prominently in males, reflecting fluid expansion as brain tissue atrophies (Yamada et al. 2023). This increase in CSF volume may support intracranial pressure maintenance as GMV and WMV shrink with age-related atrophy. This observation aligns with the Monro-Kellie doctrine, which states that within the rigid skull, the sum of brain tissue, CSF, and blood volume is constant. As GMV and WMV decrease, CSF volume expand to compensate, thereby stabilizing intracranial pressure (Benson et al. 2023).

Brain volume predictive modelling

The primary analysis utilized CAT-derived data with the linear SVM model to predict brain volumes using anthropometric measurements and age. This approach was applied to both across-sex and within-sex analyses, with age included as a feature in the model (see Sect. 3.1). The linear SVM demonstrated robust predictive performance across all brain volumes (Table 2). Direct statistical comparisons with the RF model (Supplementary Table S2), confirmed that the linear SVM outperformed the RF model, yielding higher Pearson’s r, R², and lower MAE in 10 × 10-fold CV. Supplementary analyses extended these findings by applying full results for both linear SVM and RF models to CAT and FreeSurfer-derived volumes, again maintaining age as a feature (see Supplementary Table S3).

Table 2.

The performance of linear SVM models on the training dataset for brain volumes

Linear SVM
Data Brain volume Scores (mean ± SD) Both sexes Female Male
CAT TIV Pearson’s r 0.67 ± 0.01 0.24 ± 0.03 0.24 ± 0.03
R2 0.44 ± 0.01 0.06 ± 0.01 0.06 ± 0.02
MAE 88.03 ± 1.35 82.18 ± 1.67 94.81 ± 2.49
TBV Pearson’s r 0.64 ± 0.01 0.39 ± 0.02 0.43 ± 0.02
R2 0.41 ± 0.01 0.15 ± 0.02 0.19 ± 0.02
MAE 70.57 ± 1.17 66.02 ± 1.31 75.54 ± 1.92
GMV Pearson’s r 0.62 ± 0.01 0.41 ± 0.02 0.45 ± 0.02
R2 0.38 ± 0.01 0.17 ± 0.02 0.2 ± 0.02
MAE 36.37 ± 0.59 34. 4 ± 0.74 38.51 ± 0.99
WMV Pearson’s r 0.6 ± 0.01 0.33 ± 0.02 0.37 ± 0.02
R2 0.36 ± 0.02 0.11 ± 0.02 0.13 ± 0.02
MAE 40.19 ± 0.7 37.33 ± 0.81 43.37 ± 1.04
CSF Pearson’s r 0.67 ± 0.01 0.42 ± 0.03 0.58 ± 0.02
R2 0.44 ± 0.02 0.18 ± 0.02 0.33 ± 0.02
MAE 38.2 ± 0.72 36.72 ± 0.87 39.16 ± 1.00

Full results, including RF models and CAT- and FreeSurfer-derived volumes are provided in supplementary Table S3

Prediction of head size (TIV)

The prediction of TIV using anthropometric measurements, age, and sex revealed intricate relationships shaped by biological and developmental factors. Using CAT-derived data with the linear SVM model, including age as a feature, we analyzed both across-sex and within-sex cohorts (Fig. 2; Table 3). In the across-sex model (first column in Fig. 2), predictive accuracy was moderate (Inline graphic = 0.68), suggesting that anthropometric measurements serve as reasonable proxies for TIV. In contrast, the within-sex models (females: second and males: third columns in Fig. 2) demonstrated a lower predictive accuracy (females: Inline graphic = 0.26, males: Inline graphic = 0.25), indicating the challenges of estimating TIV for each sex separately. This decrease in accuracy suggests that much of the predictive power in the across-sex model is driven by sex difference. Sex emerged as the most significant predictor in the across-sex model (Feature Importance; FIsex = 71.05), reflecting differences in cranial capacity, which are likely influenced by hormonal factors, particularly testosterone, during development. Males, on average, exhibit 12% larger skulls and TIV compared to females, explaining the pronounced influence of sex on TIV prediction (Ruigrok et al. 2014; Eliot et al. 2021). However, body size reduces sex-based differences in TIV; when adjusted for height and weight, these differences are smaller, indicating TIV is linked to both body proportions and skull size. In addition, the high Haufe-transform value for sex (0.96) indicated a direct continuous association with TIV, consistent with sex acting mainly as a categorical proxy for body-size features. Age showed a smaller importance for TIV prediction (FIage: across-sex: 2.86, females: 0.56, males: 5.68), aligning with the stability of cranial structure across adulthood, unlike brain volumes that decline with age-related atrophy. The slightly higher FI of age in males may reflect sex-specific changes in body composition or posture indirectly affecting TIV estimation. The Haufe pattern for age (across-sex: 0.07, females: -0.30, males: -0.12) showed minimal direct association between age and TIV. Among anthropometric predictors, seated height exhibited moderate positive associations with TIV (FIseated height: across-sex: 42.18, females: 29.72, males: 35.9; Haufe: across-sex: 0.73, female: 0.74, male: 0.72), supporting a close link between vertical skeletal growth and cranial development. Weight also emerged as another important predictor (FIweight: across-sex: 26.2, females: 20.62, males: 27.65; Haufe: across-sex: 0.57, female: 0.33, male: 0.48), likely due to its association with overall body size, though its relationship with TIV is weaker and more complex than that of height, potentially confounded by factors like adiposity or muscle mass. In addition, WC (FIWC: across-sex: -14.47, females: -8.77, males: -18.48; Haufe: across-sex: 0.45, female: -0.04, male: 0.07) and HC (FIHC: across-sex: -9.88, females: -13.38, males: -6.64; Haufe: across-sex: 0.04, female: 0.10, male: 0.20) displayed weak negative FI but small positive Haufe values, suggesting that central adiposity has a limited influence on TIV. The results for the across-sex context using FreeSurfer, within-sex context, and RF model analysis are provided in the supplementary file (Figures S1, S2, S3).

Fig. 2.

Fig. 2

Estimation of head size (TIV) on CAT data using linear SVM. A Prediction accuracy using the Pearson correlation coefficient (r) from CV analysis on the training dataset (90%) for both sexes (left, = 19,625), females (middle, = 10,418), and males (right, = 9,207). B Association between predicted and true TIV values on the test dataset (10%) for both sexes (N  = 2,182), females (N  = 1,158), and males (N  = 1,024). The scatter plot of predicted versus true TIV: for both sexes (Inline graphic = 0.68, Inline graphic= 0.46, Inline graphic = 86.94), for females (Inline graphic= 0.26, Inline graphic= 0.07, Inline graphic = 84.27), and for males (Inline graphic = 0.25, Inline graphic= 0.06, Inline graphic = 89.78). Although the point distributions may appear visually similar because they are derived from the same subjects, the predicted values for each individual differ depending on the model used, illustrating how sex-related variance influences model performance. C Feature importance (FI) scores from the final models trained on the training dataset (90%). For both sexes (FISex = 71.05, FIAge = 2.86, FIWeight = 26.2, FIWC = -14.47, FIHC = -9.88, FISeated height = 42.18, FIBox height = -23.56; Haufe: Sex: 0.96, Age: 0.07, Weight: 0.57, WC: 0.45, HC: 0.04, Seated height: 0.73, Box height: 0.38), for females (FIAge = 0.56, FIWeight = 20.62, FIWC = -8.77, FIHC = -13.38, FISeated height = 29.72, FIBox height = -20.19; Haufe: Age: -0.30, Weight: 0.33, WC: -0.04, HC: 0.10, Seated height: 0.74, Box height: 0.10), and for males (FIAge = 5.68, FIWeight = 27.65, FIWC = -18.48, FIHC = -6.64, FISeated height = 35.90, FIBox height = -23.57; Haufe: Age: -0.12, Weight: 0.48, WC: 0.07, HC: 0.20, Seated height: 0.72, Box height: 0.25)

Table 3.

Feature importance (FI) and Haufe transform of features

TIV TBV
Both sexes Female Male Both sexes Female Male
FI FIsex 71.05 -- -- 45.6 -- --
FIage 2.86 0.56 5.68 -26.38 -21.99 -30.45
FIweight 26.2 20.62 27.65 21.31 16.03 22.26
FIWC -14.47 -8.77 -18.48 -13.93 -8.34 -17.46
FIHC -9.88 -13.38 -6.64 -5.63 -7.7 -3.55
FIseated height 42.18 29.72 35.9 35.15 26.06 29.49
FIbox height -23.56 -20.19 -23.57 -19.14 -17.58 -19.02
Haufe transform HTsex 0.96 -- -- 0.83 -- --
HTage 0.07 -0.30 -0.12 -0.34 -0.80 -0.84
HTweight 0.57 0.33 0.48 0.57 0.30 0.35
HTWC 0.45 -0.04 0.07 0.37 -0.04 -0.01
HTHC 0.04 0.10 0.20 0.07 0.12 0.15
HTseated height 0.73 0.74 0.72 0.76 0.63 0.57
HTbox height 0.38 0.10 0.25 0.39 0.12 0.23

Prediction of brain size (TBV)

TBV estimation using the linear SVM model on CAT-derived data with age included as a feature was employed across both across-sex and within-sex cohorts (Fig. 3; Table 3). In the across-sex model (first column in Fig. 3), TBV prediction achieved moderate accuracy (Inline graphic 0.65), slightly lower than TIV (Inline graphic 0.68) but the within-sex models (Fig. 3; females: second and males: third column) outperformed their TIV counterparts (females: Inline graphic 0.44, males: Inline graphic 0.45). This suggests that anthropometric measurements better capture brain size variation within sex, unlike TIV, where sex difference dominates. Sex remained a significant predictor of TBV (FIsex = 45.6; Haufe: Sex: 0.83), yet its influence was notably weaker than for TIV (FIsex = 71.05). This reduced sex difference in TBV likely stems from the brain-to-cranium ratio: while males have larger skulls, their brain size does not scale proportionally with TIV (Ruigrok et al. 2014). Additionally, regional brain composition (e.g., such as females’ relatively higher GMV) may further moderate sex differences in TBV, making its relationship with body size more complex than for TIV (Ruigrok et al. 2014). A striking contrast with TIV emerged in the role of age. While TIV showed minimal age association, age was a strong negative predictor of TBV (FIage: across-sex: -26.38; females: -21.99; males: -30.45; Haufe: across-sex: -0.34; females: -0.80; males: -0.84), indicating that older age is associated with lower TBV. This susceptibility to aging, potentially more pronounced in males, aligns with established patterns of tissue loss and further diminishes sex difference in TBV over time. Seated height and weight, like for TIV, remained important predictors of TBV. Seated height showed a strong positive predictor of TBV (FIseated height: across-sex: 35.15, females: 26.06, males: 29.49; Haufe: across-sex: 0.76; females: 0.63; males: 0.57), suggesting that, like TIV, taller individuals tend to have larger brain volumes. This positive relationship likely reflects a stronger connection to brain tissue development rather than cranial capacity, as TBV aligns more closely with neurodevelopmental scaling, whereas TIV is influenced by broader skeletal factors. Thus, seated height may be a more direct predictor of brain volume. However, the predictive power of weight (FIweight: across-sex: 21.31, females: 16.03, males: 22.26; Haufe: across-sex: 0.57; females: 0.30; males: 0.35) was somewhat less pronounced than for TIV, indicating that body size may have a more complex relationship with brain size due to factors like body composition, such as fat and muscle distribution, which differs between sexes. WC (FIWC = -13.93, females: -8.34, males: -17.46; Haufe: across-sex: 0.37; females: -0.04; males: -0.01) and HC (FIHC = -5.63, females: -7.7, males: -3.55; Haufe: across-sex: 0.07; females: 0.12; males: 0.15) also showed weak association with TBV. However, TBV negative associations were slightly weaker in magnitude and more variable across sexes, possibly due to tissue-specific factors such as age-related GMV loss, metabolic effects of adiposity, and sex differences in brain composition. These factors may influence TBV more heterogeneously than the fixed cranial structure of TIV. The results for the across-sex context using FreeSurfer, within-sex context, and RF model analysis are provided in the supplementary file (Figures S4, S5, S6). Taken together, these results demonstrate that while age and sex remain dominant predictors of global brain measures, anthropometric features such as seated height, weight, and WC provide additional predictive value, revealing specific body–brain associations that persist even when age and sex are considered.

Fig. 3.

Fig. 3

Estimation of brain size (TBV) on CAT data using linear SVM. A Prediction accuracy using the Pearson correlation coefficient (r) from CV analysis on the training dataset (90%) for both sexes (left, = 19,625), females (middle, = 10,418), and males (right, = 9,207). B Association between predicted and true TBV values on the test dataset (10%) for both sexes (N  = 2,182), females (N  = 1,158), and males (N  = 1,024). The scatter plot of predicted versus true TBV: for both sexes (Inline graphic = 0.65, Inline graphic= 0.43, Inline graphic = 68.46), for females (Inline graphic = 0.44, Inline graphic= 0.19, Inline graphic = 65.86), and for males (Inline graphic = 0.45, Inline graphic= 0.20, Inline graphic = 71.31). Although the point distributions may appear visually similar because they are derived from the same subjects, the predicted values for each individual differ depending on the model used, illustrating how sex-related variance influences model performance. C Feature importance (FI) scores from the final models trained on the training dataset (90%). For both sexes (FISex = 45.6, FIAge = -26.38, FIWeight = 21.31, FIWC = -13.93, FIHC = -5.63, FISeated height = 35.15, FIBox height = -19.14; Haufe: Sex: 0.83, Age: -0.34, Weight: 0.57, WC: 0.37, HC: 0.07, Seated height: 0.76, Box height: 0.39), for females (FIAge = -21.99, FIWeight = 16.03, FIWC = -8.34, FIHC = -7.7, FISeated height = 26.06, FIBox height = -17.58; Haufe: Age: -0.80, Weight: 0.30, WC: -0.04, HC: 0.12, Seated height: 0.63, Box height: 0.12), and for males (FIAge = -30.45, FIWeight = 22.26, FIWC = -17.46, FIHC = -3.55, FISeated height = 29.49, FIBox height = -19.02; Haufe: Age: -0.84, Weight: 0.35, WC: -0.01, HC: 0.15, Seated height: 0.57, Box height: 0.23)

The Haufe-transformed patterns showed that seated height and weight were the strongest positive contributors to both TIV and TBV, with sex also strongly associated with TIV in both sexes models. Age showed a negative association with TBV. WC and HC had weaker effects on both TIV and TBV in sex separated models. To further assess the interpretability of the ML-derived feature patterns, we compared the Haufe-transform values with classical analysis of variance (ANOVA) results computed on the training data (Supplementary Table S4). We found a consistent structure for both TIV and TBV: sex exhibited the strongest association, followed by seated height and box height, while weight contributed moderately and WC and HC showed smaller effects. The direction and magnitude of Haufe-transformed patterns were largely consistent with the ANOVA F-values in both sexes models with disagreements appearing only at lower effect sizes.

Comparison between standing height and seated height

The comparison between seated height and standing height is crucial for understanding the complex relationships between body measurements and brain morphology (Masanovic et al. 2020). Standing height includes both trunk and leg length, with the femur significantly contributing to overall height (Bogin and Varela-Silva 2010). In contrast, seated height isolates trunk growth, providing a more direct assessment of anatomical factors that might be related to brain size. This distinction is important due to the differing growth patterns of the femur compared to the trunk and skull, which can introduce confounding variables when using standing height as a predictor of brain size. Therefore, in this analysis, we compared the predictive power of standing height and seated height for the head size (TIV) and brain size (TBV) by including standing height as an additional feature (Fig. 4). Although standing height exhibited a higher FI value than seated height for both TIV (FIstanding height = 30.79, FIseated height = 13.84) and TBV (FIstanding height = 20.72, FIseated height = 16.02). However, despite these higher FI scores, the addition of standing height did not improve overall prediction accuracy (TIV: Inline graphic 0.68, TBV: Inline graphic 0.65), indicating redundancy between the two measures. The lack of improvement in prediction accuracy, despite higher FI scores for standing height, can be explained by developmental and anatomical factors. The trunk and skull, which are reflected in seated height, follow a growth trajectory that aligns more closely with brain development, reaching near-adult proportions earlier than appendicular structures like the femur (Kawada et al. 2022). In contrast, leg length, a significant component of standing height, exhibits more prolonged and variable growth, which may be influenced by environmental, nutritional, or postural factors (Fredriks et al. 2005; Bogin and Varela-Silva 2010). As a result, seated height, focusing on the upper body, may serve as an anatomically relevant proxy for cranial and brain size, potentially minimizing the noise introduced by leg length variability. Thus, while standing height captures greater variability as reflected in its higher FI scores, it does not translate into better predictive performance, reinforcing the idea that trunk-based measures better capture anatomical scaling relationships between body and brain size, reflected by TIV and TBV.

Fig. 4.

Fig. 4

Comparison between head size and brain size including standing height as a feature. A Estimation of head size (TIV). Left plot: Prediction accuracy using the Pearson correlation coefficient (r) from CV analysis on the training dataset (90%) for both sexes (𝑁 = 19,625: females (𝑁 = 10,418), male (𝑁 = 9,207)). Middle plot: Association between predicted and true TIV values on the test dataset for both sexes (𝑁 = 2,182: females (𝑁 = 1,158), males (𝑁 = 1,024)). The scatter plot of predicted versus true TIV for both sexes (Inline graphic = 0.68, R2= 0.46, MAE = 86.94). Right plot: Feature importance (FI) scores from the final models trained on the training dataset (90%). For both sexes (FISex = 71.05, FIAge = 2.86, FIWeight = 26.20, FIWC = -14.47, FIHC = -9.88, FISeated height = 42.18, FIBox height = -23.56). B Estimation of brain size (TBV) on CAT data using linear SVM. Left plot: Prediction accuracy using the Pearson correlation coefficient (r) from CV analysis on the training dataset (90%) for both sexes (𝑁 = 19,625: females (𝑁 = 10,418), male (𝑁 = 9,207)). Middle plot: Association between predicted and true TBV values on the test dataset for both sexes (𝑁 = 2,182: females (𝑁 = 1,158), males (𝑁 = 1,024)). The scatter plot of predicted versus true TBV for both sexes (Inline graphic= 0.65, R2= 0.43, MAE = 68.46). Right plot: Feature importance (FI) scores from the final models trained on the training dataset (90%). For both sexes (FISex = 45.60, FIAge = -26.38, FIWeight = 21.31, FIWC = -13.93, FIHC = -5.63, FISeated height = 35.15, FIBox height = -19.14)

Impact of age on brain volumes

The comparison of brain volume predictions across three age strategies—including age as a feature, excluding it from the feature set, or removing it as a confound (see Sect. 3.1) provides valuable insights into how age influences various brain volumes, particularly when considering anthropometric features. The results were obtained using CAT-derived data in an across-sex context (Fig. 5). The analysis highlights the intricate relationship between age, brain structure, and anthropometric measurements. Distinct age-related patterns emerged in brain volume predictions, emphasizing the complex interplay of these factors. TIV and TBV predictions showed distinct patterns with age. TIV predictions remained remarkably stable (Inline graphic 0.66) across all configurations, indicating a minimal impact of age. This stability aligns with the understanding that, on a population level, TIV remains fixed throughout adulthood, largely due to the stable nature of the bony structure of the cranium, which maintains its size despite age-related brain atrophy (Smith et al. 2007). The consistent prediction accuracy suggests that TIV is primarily influenced by genetic or developmental factors, rather than age or anthropometrics. In contrast, TBV exhibited marked sensitivity to age. When age was included as a feature in the prediction model, the accuracy of TBV prediction (Inline graphic = 0.64) reflected the strong relationship between age and changes in GMV and WMV over the lifespan. As individuals age, GMV generally decreases, and WMV may also experience a decline, both of which contribute to overall reductions in TBV (Mills et al. 2016; Irimia 2021; Yamada et al. 2023). In this context, the inclusion of age as a feature captured not only these structural changes but also provided critical insight into how TBV changes over time. Excluding age from the feature set or removing it as a confound led to decreased accuracy (Inline graphic = 0.6 and Inline graphic = 0.59, respectively), confirming that age is a critical factor in TBV prediction, even when accounting for other variables like anthropometric measurements or sex. The similar accuracies in the age-excluded and age-confound-removed conditions suggest that age-related effects on TBV are primarily direct rather than merely confounding influences. This direct effect of age on TBV highlights how aging affects the brain overall volume in a measurable, predictable way, unlike TIV, which remains stable.

Fig. 5.

Fig. 5

Impact of age on brain volumes on CAT data using linear SVM. Comparison of brain volumes concerning age-related effects across different age-related features. The analysis includes three conditions: including age as a feature (Included Age), where age is included as a predictor in the model; excluding age from the feature set (Excluded Age), where age is omitted from the model as a predictor, and impact of age after confounder removal (Age Confound Removed), where age-related effects are assessed following the removal of potential confounding factors associated with age. Analyses were performed on the total study sample (Inline graphic = 19,625; females = 10,418; males = 9,207)

GMV and WMV both showed reduced prediction accuracies when age was excluded from the feature set or removed as a confounder, with a more pronounced impact for GMV. For GMV, the prediction accuracy dropped when age was excluded from the feature set (Inline graphic 0.57) or removed as a confound (Inline graphic 0.55), compared to when age was included as a feature (Inline graphic 0.62). These reductions indicate that age plays a critical role in predicting GMV beyond what can be accounted for by anthropometric features. This finding is consistent with the understanding that GMV atrophy is more pronounced with age due to processes such as neuronal loss, dendritic pruning, and synaptic changes, which occur more significantly in certain brain regions (Gennatas et al. 2017; Irimia 2021; Yamada et al. 2023). Age, therefore, serves as a proxy for these ongoing structural changes, and removing it as a confounder may mask important age-specific variations in GMV. This loss of predictive accuracy suggests that age-related changes in brain structure are not entirely captured by other variables, such as anthropometric features, implying the importance of age as a crucial marker for predicting GMV changes (Gennatas et al. 2017). In contrast, the impact of age on WMV prediction was less pronounced, though still significant. Prediction accuracy dropped moderately when age was excluded from feature set (Inline graphic = 0.58) or removed as a confound (Inline graphic = 0.57), compared to when age was included as a feature (Inline graphic = 0.6). This suggests that, while anthropometric features are important to WMV prediction, age still adds predictive value. The modest decrease in prediction accuracy when age is excluded or removed as a confound suggests that anthropometric features may be more strongly associated with WMV than with GMV. The minimal difference between the age-excluded and age-confound removed conditions suggests that age-related effects on WMV are direct, with limited confounding influences from other predictors. Interestingly, these findings suggest that age-related atrophy is more pronounced in GMV than in WMV, which is consistent with the established pattern of brain shrinkage in normal aging (Smith et al. 2007; Irimia 2021). GMV atrophy tends to occur earlier and more extensively than WMV loss, which likely explains the stronger impact of age on GMV predictions. However, this differential atrophy could also be influenced by segmentation bias. Variations in the methods used to segment GMV and WMV could contribute to the observed differences in volume and prediction accuracy (Smith et al. 2007; Irimia 2021).

For the CSF volume, age had a strong impact on prediction accuracy. When age was excluded from the feature set or removed as a covariate, accuracy dropped substantially (Inline graphic = 0.51, Inline graphic = 0.46, respectively) compared to when age was included in the feature set (Inline graphic = 0.66). These large differences highlight that age is a critical factor in predicting CSF volume and are consistent with previous research showing an approximate 30 ml (2%) increase per decade from the 20s to 488 ml (33.7%) in individuals over 80 years old, due to tissue loss and fluid expansion with aging (Yamada et al. 2023). The further reduction in accuracy when age was removed as a confounder suggests that age-related effects on CSF volume prediction involve both direct and indirect contributions. The direct effect refers to the structural changes, particularly the atrophy of GMV and WMV, which leads to an increase in CSF volume. The indirect effect includes other factors, such as anthropometric features and sex, that can influence brain structure and the relationship between age and CSF. This is consistent with the known relationship between GMV and WMV atrophy and the increase in CSF volume with age (Gennatas et al. 2017; Irimia 2021; Yamada et al. 2023). As both GMV and WMV decrease, CSF volume increases with aging, providing a clearer and more consistent indicator of age-related structural changes. These findings underscore the importance of including age as a factor in neuroimaging studies, particularly when predicting brain volumes, as it captures both direct and indirect age-related changes in brain structure. It is important to note that the observed changes in prediction accuracy (e.g., Pearson’s r) when including or excluding age quantify its added predictive value. This reflects both unique and shared variance with anthropometric features. In parallel, the magnitude of the model coefficients allows comparison of age’s relative contribution against other predictors. Considered together, these results demonstrate that age contributes predictive information beyond anthropometric features.

The results for the across-sex context using FreeSurfer, within-sex context, and RF model analysis are provided in the supplementary file (Figures S7, S8, S9, S10, S11, S12, and S13). These additional analyses corroborate the main findings, supporting the consistency and robustness of the results across different methods and contexts.

Conclusions

This study provides a comprehensive analysis of how brain volumes are associated with anthropometric features, age, and sex, in a large UK Biobank cohort using advanced machine learning methods. Our findings reveal that TIV is predominantly driven by sex, reflecting a strong sex difference in cranial development. In contrast, TBV, GMV, WMV, and CSF are more sensitive to age, consistent with well-established patterns of brain volumes loss and compensatory fluid expansion across adulthood, with the greatest age-related sensitivity observed in CSF and GMV. Anthropometric measures, including weight, height, waist circumference, and hip circumference, provide additional predictive value, particularly for TBV and TIV. However, their influences are secondary to those of age and sex. Notably, central adiposity (as indexed by waist circumference) is negatively associated with brain volumes, likely reflecting the metabolic impact of visceral fat, while height and weight are positively linked to cranial and brain size, especially earlier in life. However, these relationships are complex and can vary across brain volumes and demographic groups. The differential association of age, sex, and anthropometric features across brain compartments highlights the necessity of including both age and sex as core covariates in neuroimaging research to ensure accurate interpretation of volumetric differences. These results also underscore the value of integrating detailed anthropometric data to refine models of brain structure variation and to better understand the biological and environmental factors underlying brain morphology. From a clinical perspective, these findings may aid the interpretation of brain imaging by helping distinguish normative, age-related brain volume changes from neurodegenerative disease. Importantly, these findings have potential implications for early detection of abnormal aging and disease risk. Deviations from typical body-brain scaling patterns might serve as accessible indicators of neurodegenerative or systemic conditions, highlighting individuals at risk for both brain- and body-related health issues. Such deviations could guide future work on personalized monitoring and interventions aimed at promoting both healthy brain and bodily aging. Overall, this work advances our understanding of the factors driving variation in brain volumes and provides a foundation for future longitudinal studies to unravel the mechanisms linking physical development, aging, and neuroanatomy. Such insights are crucial for interpreting neuroimaging findings in both healthy and clinical populations and for informing strategies aimed at promoting brain health across the lifespan.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary Material 1 (3.8MB, docx)

Acknowledgements

This research has been conducted using data from UK Biobank resources (application number 41655). All data used in this study are publicly accessible from UK Biobank via their standard data access procedure (http://www.ukbiobank.ac.uk/).

Author contributions

All authors read and approved the final manuscript. We use the CRediT contributor role taxonomy to describe individual contributions to the paper. Conceptualization: K.N., S.B.E., C.G., K.R.P.; Data curation: K.N., V.K., K.R.P.; Formal analysis: K.N.; Methodology: K.N., S.B.E., C.G., K.R.P.; Supervision: C.G., S.B.E., K.R.P.; Visualization: K.N., Writing—original draft: K.N.; Writing—review & editing: K.N., S.B.E., G.A., L.H., C.T., V.K., F.R., C.G., K.R.P.

Funding

Open Access funding enabled and organized by Projekt DEAL. This research was funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) - project-ID 431549029 - Collaborative Research Centre CRC1451 on motor performance project B05.

Data availability

All data used in this study are publicly available through the UK Biobank, accessible via their standard data access procedure at (http://www.ukbiobank.ac.uk/). The code used in the current study is available from the authors on GitHub (https://github.com/KNazarzadeh/brainsize) upon reasonable request.

Declarations

Conflict of interest

The authors declare no competing interests.

Ethics approval and consent to participate

The UK Biobank study was approved by the North West Multicenter Research Ethics Committee (No. 16/NW/0274), with written informed consent obtained from all participants. A re-analysis of the anonymized data was approved by the ethics committee of the Heinrich Heine University Düsseldorf (2018-317-RetroDEuA).

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Kimia Nazarzadeh, Email: k.nazarzadeh@fz-juelich.de.

Kaustubh R. Patil, Email: k.patil@fz-juelich.de

References

  1. Alfaro-Almagro F, Jenkinson M, Bangerter NK et al (2018) Image processing and quality control for the first 10,000 brain imaging datasets from UK biobank. NeuroImage 166:400–424. 10.1016/j.neuroimage.2017.10.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Ashburner J, Friston KJ (2011) Diffeomorphic registration using geodesic shooting and Gauss–Newton optimisation. NeuroImage 55:954–967. 10.1016/j.neuroimage.2010.12.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bayat P-D, Ghanbari A, Sohouli P et al (2012) Correlation of skull size and brain volume, with age, weight, height and body mass index of Arak medical sciences students. Int J Morphol 30:157–161. 10.4067/S0717-95022012000100028 [Google Scholar]
  4. Benson JC, Madhavan AA, Cutsforth-Gregory JK et al (2023) The Monro-Kellie doctrine: a review and call for revision. AJNR Am J Neuroradiol 44:2–6. 10.3174/ajnr.A7721 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Blinkouskaya Y, Caçoilo A, Gollamudi T et al (2021) Brain aging mechanisms with mechanical manifestations. Mech Ageing Dev 200:111575. 10.1016/j.mad.2021.111575 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bogin B, Varela-Silva MI (2010) Leg Length, body Proportion, and health: a review with a note on beauty. IJERPH 7:1047–1075. 10.3390/ijerph7031047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bouckaert RR, Frank E (2004) Evaluating the replicability of significance tests for comparing learning algorithms. In: Dai H, Srikant R, Zhang C (eds) Advances in knowledge discovery and data mining. Springer, Berlin, Heidelberg, pp 3–12 [Google Scholar]
  8. Chen X, Sachdev PS, Wen W, Anstey KJ (2007) Sex differences in regional Gray matter in healthy individuals aged 44–48 years: a voxel-based morphometric study. NeuroImage 36:691–699. 10.1016/j.neuroimage.2007.03.063 [DOI] [PubMed] [Google Scholar]
  9. Cox SR (2024) Neurocognitive aging. Annual Rev Dev Psychol 6:505–527. 10.1146/annurev-devpsych-010923-102441 [Google Scholar]
  10. Cummings SR, Melton LJ (2002) Epidemiology and outcomes of osteoporotic fractures. Lancet 359:1761–1767. 10.1016/S0140-6736(02)08657-9 [DOI] [PubMed] [Google Scholar]
  11. DeCasien AR, Guma E, Liu S, Raznahan A (2022) Sex differences in the human brain: a roadmap for more careful analysis and interpretation of a biological reality. Biol Sex Differ 13:43. 10.1186/s13293-022-00448-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Eboh DE, Okoro EC, Iteire KA (2016) A Cross-sectional anthropometric study of cranial capacity among Ukwuani people of South Nigeria. Malays J Med Sci 23:72–82. 10.21315/mjms2016.23.5.10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Eliot L, Ahmed A, Khan H, Patel J (2021) Dump the dimorphism: comprehensive synthesis of human brain studies reveals few male-female differences beyond size. Neurosci Biobehav Rev 125:667–697. 10.1016/j.neubiorev.2021.02.026 [DOI] [PubMed] [Google Scholar]
  14. Ferretti MT, Iulita MF, Cavedo E et al (2018) Sex differences in alzheimer disease — the gateway to precision medicine. Nat Rev Neurol 14:457–469. 10.1038/s41582-018-0032-9 [DOI] [PubMed] [Google Scholar]
  15. Fjell AM, Westlye LT, Amlien I et al (2009) High consistency of regional cortical thinning in aging across multiple samples. Cereb Cortex 19:2001–2012. 10.1093/cercor/bhn232 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Frank AP, de Souza Santos R, Palmer BF, Clegg DJ (2019) Determinants of body fat distribution in humans May provide insight about obesity-related health risks. J Lipid Res 60:1710–1719. 10.1194/jlr.R086975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fredriks A, van Buuren S, van Heel WJM et al (2005) Nationwide age references for sitting height, leg length, and sitting height/height ratio, and their diagnostic value for disproportionate growth disorders. Arch Dis Child 90:807–812. 10.1136/adc.2004.050799 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gaser C, Dahnke R, Thompson PM et al (2024) CAT: a computational anatomy toolbox for the analysis of structural MRI data. GigaScience 13:giae049. 10.1093/gigascience/giae049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gennatas ED, Avants BB, Wolf DH et al (2017) Age-Related effects and sex differences in Gray matter Density, Volume, Mass, and cortical thickness from childhood to young adulthood. J Neurosci 37:5065–5073. 10.1523/JNEUROSCI.3550-16.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gurholt TP, Kaufmann T, Frei O et al (2021) Population-based body–brain mapping links brain morphology with anthropometrics and body composition. Transl Psychiatry 11:295. 10.1038/s41398-021-01414-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Guyon I, Weston J, Barnhill S, Vapnik V (2002) Gene selection for cancer classification using support vector machines. Mach Learn 46:389–422. 10.1023/A:1012487302797 [Google Scholar]
  22. Hamdan S, More S, Sasse L et al (2023) Julearn: an easy-to-use library for leakage-free evaluation and inspection of ML models
  23. Haufe S, Meinecke F, Görgen K et al (2014) On the interpretation of weight vectors of linear models in multivariate neuroimaging. NeuroImage 87:96–110. 10.1016/j.neuroimage.2013.10.067 [DOI] [PubMed] [Google Scholar]
  24. Irimia A (2021) Cross-sectional volumes and trajectories of the human brain, Gray matter, white matter and cerebrospinal fluid in 9,473 typically aging adults. Neuroinformatics 19:347–366. 10.1007/s12021-020-09480-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Kawada M, Nakatsukasa M, Nishimura T et al (2022) Human shoulder development is adapted to obstetrical constraints. Proceedings of the National Academy of Sciences 119:e2114935119. 10.1073/pnas.2114935119
  26. Kuk JL, Saunders TJ, Davidson LE, Ross R (2009) Age-related changes in total and regional fat distribution. Ageing Res Rev 8:339–348. 10.1016/j.arr.2009.06.001 [DOI] [PubMed] [Google Scholar]
  27. Lotze M, Domin M, Gerlach FH et al (2019) Novel findings from 2,838 adult brains on sex differences in Gray matter brain volume. Sci Rep 9:1671. 10.1038/s41598-018-38239-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Masanovic B, Arifi F, Gardasevic J (2020) Relationship between sitting height measurements and standing height: a prospective regional study among adolescents in the Southern region of Kosovo. Int J Morphology 38:1681–1685. 10.4067/S0717-95022020000601681 [Google Scholar]
  29. McCarthy MM, Arnold AP, Ball GF et al (2012) Sex differences in the brain: the not so inconvenient truth. J Neurosci 32:2241–2247. 10.1523/JNEUROSCI.5372-11.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Mills KL, Goddings A-L, Herting MM et al (2016) Structural brain development between childhood and adulthood: convergence across four longitudinal samples. NeuroImage 141:273–281. 10.1016/j.neuroimage.2016.07.044 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Nadeau C, Bengio Y (2003) Inference for the generalization error. Mach Learn 52:239–281. 10.1023/A:1024068626366 [Google Scholar]
  32. Pedregosa F, Varoquaux G, Gramfort A et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830 [Google Scholar]
  33. Ponti F, Santoro A, Mercatelli D et al (2020) Aging and imaging assessment of body composition: from fat to facts. Front Endocrinol 10. 10.3389/fendo.2019.00861
  34. Raz N, Rodrigue KM (2006) Differential aging of the brain: Patterns, cognitive correlates and modifiers. Neurosci Biobehav Rev 30:730–748. 10.1016/j.neubiorev.2006.07.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Ritchie SJ, Cox SR, Shen X et al (2018) Sex differences in the adult human brain: evidence from 5216 UK biobank participants. Cereb Cortex 28:2959–2975. 10.1093/cercor/bhy109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Ruigrok ANV, Salimi-Khorshidi G, Lai M-C et al (2014) A meta-analysis of sex differences in human brain structure. Neurosci Biobehav Rev 39:34–50. 10.1016/j.neubiorev.2013.12.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Smith CD, Chebrolu H, Wekstein DR et al (2007) Age and gender effects on human brain anatomy: a voxel-based morphometric study in healthy elderly. Neurobiol Aging 28:1075–1087. 10.1016/j.neurobiolaging.2006.05.018 [DOI] [PubMed] [Google Scholar]
  38. Sudlow C, Gallacher J, Allen N et al (2015) UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Med 12:e1001779. 10.1371/journal.pmed.1001779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Volpi E, Nazemi R, Fujita S (2004) Muscle tissue changes with aging: current opinion in clinical nutrition. Metabolic Care 7:405–410. 10.1097/01.mco.0000134362.76653.b2 [Google Scholar]
  40. Ward MA, Carlsson CM, Trivedi MA et al (2005) The effect of body mass index on global brain volume in middle-aged adults: a cross sectional study. BMC Neurol 5:23. 10.1186/1471-2377-5-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Yamada S, Otani T, Ii S et al (2023) Aging-related volume changes in the brain and cerebrospinal fluid using artificial intelligence-automated segmentation. Eur Radiol 33:7099–7112. 10.1007/s00330-023-09632-x [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material 1 (3.8MB, docx)

Data Availability Statement

All data used in this study are publicly available through the UK Biobank, accessible via their standard data access procedure at (http://www.ukbiobank.ac.uk/). The code used in the current study is available from the authors on GitHub (https://github.com/KNazarzadeh/brainsize) upon reasonable request.


Articles from Brain Structure & Function are provided here courtesy of Springer

RESOURCES