Skip to main content
Human Brain Mapping logoLink to Human Brain Mapping
. 2021 Jul 16;42(14):4623–4642. doi: 10.1002/hbm.25572

Neuroanatomical norms in the UK Biobank: The impact of allometric scaling, sex, and age

Camille Michèle Williams 1,, Hugo Peyre 1,2,3, Roberto Toro 4,5,6, Franck Ramus 1
PMCID: PMC8410561  PMID: 34268815

Abstract

Few neuroimaging studies are sufficiently large to adequately describe population‐wide variations. This study's primary aim was to generate neuroanatomical norms and individual markers that consider age, sex, and brain size, from 629 cerebral measures in the UK Biobank (N = 40,028). The secondary aim was to examine the effects and interactions of sex, age, and brain allometry—the nonlinear scaling relationship between a region and brain size (e.g., total brain volume)—across cerebral measures. Allometry was a common property of brain volumes, thicknesses, and surface areas (83%) and was largely stable across age and sex. Sex differences occurred in 67% of cerebral measures (median |β| = .13): 37% of regions were larger in males and 30% in females. Brain measures (49%) generally decreased with age, although aging effects varied across regions and sexes. While models with an allometric or linear covariate adjustment for brain size yielded similar significant effects, omitting brain allometry influenced reported sex differences in variance. Finally, we contribute to the reproducibility of research on sex differences in the brain by replicating previous studies examining cerebral sex differences. This large‐scale study advances our understanding of age, sex, and brain allometry's impact on brain structure and provides data for future UK Biobank studies to identify the cerebral regions that covary with specific phenotypes, independently of sex, age, and brain size.

Keywords: aging, allometry, brain volumes, cerebral norms, cortical surface area, cortical thickness, sex differences


This study's primary aim was to generate neuroanatomical norms and individual markers that consider age, sex, and brain size, from 629 cerebral measures in the UK Biobank (N = 40,028). Allometry was a common property of brain volumes, thicknesses, and surface areas (83%) and we found sex differences in 67% of the investigated brain regions. This large‐scale study advances our understanding of age, sex, and brain allometry's impact on brain structure and provides data for future UK Biobank studies to identify the cerebral regions that covary with specific phenotypes, independently of sex, age, and brain size.

graphic file with name HBM-42-4623-g005.jpg


Abbreviations

GMV

gray matter volume

MCT

total mean cortical thickness

TBV

total brain volume

TCMs

total cerebral measures

TSA

total surface area

WMV

white matter volume

1. INTRODUCTION

Although all humans share a common brain structure and organization, they also vary in terms of the size and shape of their brain and its subcomponents. These neuroanatomical variations are thought to partly underlie differences in cognitive and behavioral traits and in the risk of developing psychiatric and neurological disorders (for review, see Dallaire‐Théroux, Callahan, Potvin, Saikali, & Duchesne, 2017; Deary, 2010; Jumah, Ghannam, Jaber, Adeeb, & Tubbs, 2016; Oakes, Loukas, Oskouian, & Tubbs, 2017; Schmidt, Khalid, Loukas, & Tubbs, 2018). Yet, most of these studies rely on relatively small samples and suffer from high sampling variability. When the sample is too small to accurately represent the control or target population, spurious neuroanatomical markers may be reported. Moreover, if true effects are observed in small samples, their size would be exaggerated as only large effects would pass a conventional statistical significance threshold (e.g., p < .05) with few degrees of freedom (for review, see Szucs & Ioannidis, 2017, 2020). Thus, despite hundreds of studies, few neuroanatomical measures can be declared as robust markers of cognitive traits or psychiatric and neurological disorders (Gong et al., 2019; Marek et al., 2020; Matsuo et al., 2019; Peyre et al., 2020; Ramus, Altarelli, Jednoróg, Zhao, & Scotto di Covella, 2018; Williams, Peyre, Toro, Beggiato, & Ramus, 2020).

An alternative approach to comparing clinical and healthy groups would be to compare clinical groups with population norms, as is done with well‐established cognitive dimensions, such as general intelligence, personality, and psychopathology scales (Beck, Steer, & Brown, 1988; Costa & McCrae, 2008; Wechsler, Psychological Corporation, & PsychCorp (Firm), 2008). If neuroanatomical norms for a population were available, then comparing any clinical group to these norms would overcome the issue of sampling variability in the control group. However, neuroanatomical norms are not easy to establish, as they require large populations, and neuroanatomical measures often depend on magnetic resonance imaging (MRI) scanner characteristics and acquisition sequences. Valid norms would therefore only be established within a given study in a single scanner, or in a small set of comparable scanning sites with similar acquisition protocols as done by the UK Biobank.

For this reason, the UK Biobank, the largest neuroimaging dataset available to date (N ~ 40,000), is an ideal candidate to create neuroanatomical norms that could be reused for multiple studies of neurological and psychiatric disorders. These norms should be sex‐specific, given that the two sexes differ on several neuroanatomical brain measures (Kaczkurkin, Raznahan, & Satterthwaite, 2019; Lotze et al., 2019; Ritchie et al., 2018; Ruigrok et al., 2014; Sanchis‐Segura et al., 2019), and have different risks of developing certain neurological and psychiatric disorders (Beam et al., 2018; Boyd et al., 2015; Seedat et al., 2009). And age should also be considered, as it is associated with variations in neuroanatomical measures (Fjell et al., 2013; Hurtz et al., 2014; Narvacan, Treit, Camicioli, Martin, & Beaulieu, 2017; Vinke et al., 2018; Wierenga et al., 2014), cognitive function (Simon R. Cox et al., 2018; S.R. Cox, Ritchie, Fawns‐Ritchie, Tucker‐Drob, & Deary, 2019), and disease risk (Fiske, Wetherell, & Gatz, 2009; Fjell et al., 2009; Jellinger & Attems, 2015).

Finally, global brain measures should be taken into account to create norms that are independent of variations in brain size. Although there is mounting evidence that brain allometry—the nonlinear scaling relationship between regional and global brain dimensions—is an inherent property of the brain (Finlay, Darlington, & Nicastro, 2001; Jäncke, Mérillat, Liem, & Hänggi, 2015; D. Liu, Johnson, Long, Magnotta, & Paulsen, 2014; Mankiw et al., 2017; Reardon et al., 2018; Toro et al., 2009; Williams et al., 2020), standard modes of adjustment for individual differences in global measures, such as the proportion method, or linear covariate adjustment, omit brain allometry. To this day, numerous studies have shown that different methods of adjustment for brain size contribute to the variability of reported volumetric group differences (Lefebvre, Beggiato, Bourgeron, & Toro, 2015; O'Brien et al., 2006, 2011; Reardon et al., 2016; Sanchis‐Segura et al., 2019) and some specifically suggest that omitting brain allometry leads to spurious group differences (Mankiw et al., 2017; Reardon et al., 2016; Williams et al., 2020). Since regional/global relationships follow a power function in a majority of regions, it is recommended to log‐transform regional and total cerebral measures (TCMs; i.e., total brain volume [TBV], total mean cortical thickness [MCT], or total surface area [TSA]) to account for allometric scaling and to obtain an accurate description of the relationship between brain regions and TCMs.

Thus, the present study's first aim is to create neuroanatomical norms and individual markers in the UK Biobank that take into account sex, age, and the allometric relationships between regional and global brain measures. The second goal is to investigate the extent to which neuroanatomical variations depend on sex, age (linear and quadratic), and brain allometry effects and their interactions. Finally, the third aim is to examine whether omitting brain allometry systematically biases reported results by comparing the results of models with differing TCM adjustment techniques. By generating neuroanatomical markers across volumes, mean thicknesses, and surface areas available in the UK Biobank, we provide population norms for future UK Biobank studies that aim to link regional neuroanatomical markers to specific cognitive and behavioral traits or neurological and psychiatric disorders.

2. METHODS

2.1. Participants

Participants were drawn from the UK Biobank, an open‐access large prospective study with phenotypic, genotypic, and neuroimaging data from 500,000 participants recruited between 2006 and 2011 at 40–69 years old in Great Britain (Sudlow et al., 2015). All participants provided informed consent (“Resources tab” at https://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=200). The UK Biobank received ethical approval from the Research Ethics Committee (reference 11/NW/0382) and the present study was conducted based on application 46 007.

Currently, MRI data and imaging‐derived phenotypes (IDPs) are available for about 41,000 participants. This study analyzed the IDPs from the first imaging visit generated by an image‐processing pipeline developed and run by the UK Biobank Imaging team (Alfaro‐Almagro et al., 2018; Miller et al., 2016).

2.1.1. Brain image acquisition and processing

A standard Siemens Skyra 3 T running VD13A SP4 with a standard Siemens 32‐channel RF receive head coil was used to collect data. The 3D MPRAGE T1‐weighted volumes were analyzed by the UK Biobank Imaging team with pipeline scripts that primarily call for FSL and Freesurfer tools. Details of the acquisition protocols, image processing pipeline, image data files and derived measures IDPs of brain structure and function are available in the UK Biobank Imaging Protocols.

2.1.2. Total brain volume

TBV was calculated as the sum of the total gray matter volume (GMV; i.e., sum of cortical and subcortical GMV, data‐field 26,518), cerebellum white matter volume (WMV, data‐fields 26,556 for left and 26,587 for right), and cerebral WMV (data‐fields 26,553 for left and 26,584 for right) from the UK Biobank ASEG Freesurfer segmentations. Refer to Supplemental Info 1 for more on the choice of TBV. Individuals with missing data for these regions were excluded from the analyses, yielding 40,055 participants.

2.1.3. Scanner site

The age and sex of participants differed across the three scanner sites located in Cheadle (Site 11025); Reading (Site 11026); and Newcastle (Site 11027; See Supplemental Info 1). One individual without scanner site was removed from the analyses yielding 40,054 participants.

2.1.4. Sex

Participants who did not self‐report as male or female or whose self‐reported sex and genetic sex differed were also excluded from the analyses (N = 26). When genetic sex was not available, reported‐sex was used to define the sex of the participant. Of the 40,028 participants included in the analyses, there were more females (N = 21,142) than males (N = 18,886, χ 2[1] = 127.15, p < 2.2e‐16).

2.1.5. Age

To obtain a continuous and more precise measure of age, age was calculated based on the year and month of birth of the participant and the day, month, and year of their MRI visit. Mean age was 64.13 years old (SD = 7.54). Males (M = 64.81 years, SD = 7.64) were older than females (t (39,186) = 17.17, p < 2.2e‐16, M = 63.51 years, SD = 7.39).

2.2. Image‐derived phenotypes

The descriptive statistics of all global and regional IDPs analyzed in the present study and their respective data‐fields and segmentation origin are listed in Supplemental Table A1. The majority of IDPs correspond to gray matter, since WMVs were not segmented by the UK Biobank Imaging team.

2.2.1. Global IDPs

A total of nine global IDPs were investigated: TBV, total MCT, TSA, subcortical GMV, cortical GMV, cerebral WMV, cerebellar GMV, cerebellar WMV, and the brainstem volume.

WMV measures were obtained by summing left and right global measures from Freesurfer ASEG segmentations (data‐field 190). Cerebellum GMV was calculated as the sum of the cerebellar volumes from the FAST segmentations (data‐field 1101). Total MCT and TSA were respectively calculated as the sum of the mean cortical thickness and surface area measures from the Freesurfer a2009s segmentations (data‐field 197). The whole brain stem measure was taken from the Freesurfer subsegmentations (data‐field 191) and the subcortical GMV measure was calculated as the sum of the left and right whole amygdala, hippocampus, and thalamus volumes from the Freesurfer subsegmentations (data‐field 191) and the left and right caudate, accumbens, pallidum, putamen, and ventral diencephalon of the Freesurfer ASEG segmentations (data‐field 190).

Based on the recommendations from the UK Biobank Imaging Protocols, we excluded Freesurfer IDPs when T2‐FLAIR was not used in addition to the T1 images to obtain the segmentations from Freesurfer a2009s (volume, surface area, and mean thickness) and Freesurfer subsegmentations. Moreover, 790 individuals had missing values for all FAST cerebellum segmentations and were excluded from the FAST segmentation analyses.

Thus, while 40,028 individuals were included in the analyses for TBV, cerebellum WM, and cerebral WM, 39,238 individuals were included in the analyses with FAST segmentations and 38,710 were included in the analyses of the Freesurfer subsegmentations and Freesurfer a2009s segmentations. Missing values and null segmentations (e.g., 0 mm3) for a region were replaced by the mean of that region when calculating global measures. See Supplemental Info 3 for correlations between the global measures provided by Freesurfer ASEG and those calculated from the FAST, Freesurfer subsegmentations, Freesurfer ASEG, and Freesurfer a2009s segmentations.

2.2.2. Regional IDPs

A case‐wise participant exclusion strategy was applied to each IDP for the regional analyses: participants with a missing value or a segmentation error for a region were excluded from the analyses of that region but were maintained in the analyses of other IDP. Following visual examination of the distribution of regional cerebral measures, values three times the inter‐quartile range for a region were considered to be segmentation errors and were removed from the analyses of that region.

A total of 620 regional IDPs were investigated: 444 cortical regions (148 volumes, 148 surface areas, and 148 cortical thicknesses) from the Freesurfer a2009s segmentations (Destrieux Atlas, data‐field 197), 116 whole segmentations and subsegmentations of the amygdala, hippocampus, and thalamus and subsegmentations of the brainstem (Freesurfer subsegmentations, data‐field 191), 28 cerebellum GMV segmentations from the FAST segmentations (data‐field 1101), and 32 subcortical, white matter, and ventricle volumes from the Freesurfer ASEG segmentations (data‐field 190). Freesurfer subcortical segmentations for the caudate, putamen, accumbens, and pallidum were used instead of the preregistered FIRST volumes, for segmentation consistency with the other subcortical and cortical volume which were segmented from Freesurfer.

2.3. Statistical analyses

Analyses were preregistered on OSF and run using R (R Core Team, 2019). The preregistration and code are on OSF (https://osf.io/s4qc5/). Used packages are listed in Supplemental Info 7.

2.3.1. Data transformation

To examine allometric scaling, regional and global measures were log10 transformed. All continuous variables were centered around the mean in order to examine the effects of a variable when other variables are at their mean value. The categorical sex variable was coded −0.5 for males and 0.5 for females. The scanner site variable was dummy coded as a fixed effect with the largest site, Cheadle (Site 1102), as reference.

2.3.2. Analyses

Global and regional analyses were performed twice. Once without standardizing (dividing by 1 SD) to obtain the allometric scaling coefficient for each brain region and once with standardizing to report standardized betas as effect sizes. Nonlinear effects of age were modeled with quadratic age over spline regression as spline regressions do not yield interpretable beta coefficients of age. Isometry was tested using the linear hypothesis function which tests if scaling coefficients obtained without scaling variables differ from 1. Scanner site (Cheadle—Site 1102, Reading—Site 11026, and Newcastle—Site 11027) was additionally added as a covariate, although it was omitted from the preregistration.

2.3.3. Global analyses

Global analyses were conducted to evaluate how TBV varies with age, age2, and sex (Equation (1)) and how TSA, total MCT, cerebral GMV, cerebral WMV, total subcortical GMV, cerebellum GMV, cerebellum WMV, and the brainstem volume vary with TBV, age, age2, and sex (Equation (2)).

Log10TBV=Intercept+β1*Age+β2*Sex+β3*Age2+β4*Age×Sex+β5*Age2×Sex+β6*Scanner Site+Error. (1)
Log10Global Measure=Intercept+β1*Log10TBV+β2*Age+β3*Sex+β4*Age2+β5*Log10TBV×Sex+β6*Log10TBV×Age+β7*Log10TBV×Age2+β8*Age×Sex+β9*Age2×Sex+β10*Log10TBV×Age×Sex+β11*Log10TBV×Age2×Sex+β12*Scanner Site+Error. (2)

2.3.4. Regional analyses

Regional analyses were conducted to evaluate how regional volumes, surface areas, and cortical thicknesses vary with TCM, age, age2, and sex with Equation (3). The TCMs were TBV for volumes, total MCT for mean thicknesses, and TSA for surface areas.

Log10Region=Intercept+β1*Log10TCM+β2*Age+β3*Sex+β4*Age2+β5*Log10TCM×Sex+β6*Log10TCM×Age+β7*Log10TCM×Age2+β8*Age×Sex+β9*Age2×Sex+β10*Log10TCM×Age×Sex+β11*Log10TCM×Age2×Sex+β12*Scanner Site+Error. (3)

2.3.5. Regional neuroanatomical deviance markers

To obtain a global and regional marker of an individual's deviance from the norm in terms of volume, mean thickness, and surface area, we extracted the residuals from each dependent variable. The residuals were obtained from the model where continuous variables were centered but not divided by 1 SD to maintain differences in magnitude across regions for the calculation of the global deviance markers. Residuals of a region were then divided by the SD of that region's residuals to obtain a Z‐score for each individual and region. Each Z‐score corresponds to a person's regional deviance marker and reflects an individual's deviance from the norm of a region, given their age, sex, and TCM.

2.3.6. Global neuroanatomical deviance markers

From the regional neuroanatomical deviance markers, we generated four person‐level global neuroanatomical deviance markers with Equation (4): one for volumes, one for mean thicknesses, one for surface areas, and one for all regions. The person‐level global neuroanatomical deviance markers correspond to a person's global neuroanatomical deviance from the norm. Although we preregistered Equation (4) without dividing by the total number of investigated regions for a global measure (N), we did so to obtain a value reflecting mean deviation relative to the norm. Considering that all IDPs were not available for all individuals, we excluded participants with more than 10% of missing data across regional IDPs. Global deviance markers for each individual were divided by the SD of the global deviance marker to obtain a Z‐score for each individual and global marker. The cerebral marker across brain measures, which was not preregistered, was calculated by averaging the Z‐score of the volumetric, cortical mean thickness, and cortical surface area global neuroanatomical deviance marker.

Global neuroanatomical deviance=Regional neuroanatomical deviance markers2/N (4)

2.3.7. Exploratory analyses

These analyses were not preregistered unless otherwise stated (for more details see Supplemental Info 4). In brief, we first examined whether global and regional measures were allometric with the linearHypothesis function from the car R package (Fox et al., 2020) with the null hypothesis: “The slope of log10(TCM) is equal to 1”. Then, we examined whether the scaling coefficient of mean thicknesses with TBV differed from ⅓ and whether the scaling coefficient of surface areas with TBV differed from ⅔ with the linearHypothesis function (Fox et al., 2020) and appropriate null (e.g., null hypothesis for mean thicknesses: “The slope of log10(TBV) is equal to 1/3”). We would expect these coefficients if brain growth was proportional (similar to a sphere) and if larger brains were scaled‐up versions of smaller brains. Third, we examined sex differences in variance with a Levene's test (F‐test) and calculated as the variance ratio as Female SD/male SD. Fourth, we compared the results of our main analysis to those obtained when using a linear covariate TCM adjustment (i.e., Equation (4) without the log transformation) and when using the proportion adjustment for TCM (i.e., by first dividing a region by TCM to obtain an adjusted regional measure and then running Equation (4) on the adjusted region without the log transformation and the main effect of TBV). Fifth, as preregistered, we attempted to replicate previous studies on cerebral sex differences that considered brain allometry. Sixth, we replicated Ritchie et al.'s (2018) analyses of the Desikan–Killiany cortical measures and FIRST subcortical volumes with the linear covariate TCM adjustment. We additionally ran the same analyses with the allometric TCM adjustment to examine whether we observed the same effects of omitting brain allometry when investigating sex differences in the UK Biobank using different cerebral segmentations and statistical analyses.

2.3.8. Multiple comparison corrections

Since 11 beta coefficients were examined for each of the 620 regional IDPs, three thresholds of significance were used: 0.05/11, 0.05/620, and 0.05/(11 × 620). The same rationale was applied to the global IDPs, with the following thresholds for TBV 0.05/5, 0.05/9, and 0.05/(5 × 9) and for the remaining global measures: 0.05/11, 0.05/9, and 0.05/(11 × 9). Significant variance ratio differences are reported at p < .05/629 (sum of global and regional cerebral measures).

3. RESULTS AND DISCUSSION

To avoid redundancy, results are sequentially discussed. Only results at the strictest level of significance are reported in text: p < .05/ (5 × 9) for TBV, 0.05/ (11 × 9) for other global measures, p < .05/ (11 × 620) for regional measures, and p < .05/629 for sex differences in variance. Scaling coefficients (α) correspond to the estimate when continuous variables are centered, and standardized betas (β) reflect the estimate when continuous variables are centered and divided by 1 SD.

Descriptive statistics are available in Supplemental Tables A2–A14. Regression results by region are available in Supplemental Tables B1–B10 and by main effects or interaction, in Supplemental Tables C1–C21. See Supplemental Tables D2–D5 for statistics on regional deviance from isometry. Correlations between each left and right region in terms of the scaling coefficient with the TCM, the age standardized betas, and sex standardized betas are available on OSF, in Supplemental Figure File 1 (https://osf.io/s4qc5/) and correlations between cortical scaling, sex, and age coefficients across cortical measures are available in Supplemental Table C1.

3.1. Neuroanatomical norms

Neuroanatomical norms were generated for 40,028 UK Biobank participants at two levels: for global brain measures (total cerebral and cerebellar GM and WM volumes, TSA, total MCT, and total subcortical and brainstem volumes) relative to TBV, and for each regional measure relative to its corresponding TCM (i.e., TBV, TSA, or total MCT). We provided a simplified version of these norms by reporting the predicted mean of a region for the mean TBV and mean age by sex in the reference scanner site (Cheadle, Site 11025). To estimate variability around these norms, we also provided the observed 10th, 25th, 50th,75th, and 90th percentiles, the mean, and the SD of each brain measure for each sex in participants with a TCM and age between the 40th and 60th percentiles. We chose the 40th and 60th percentile cutoffs to obtain at least 1,000 participants for each norm value by sex (Supplemental Tables H1–H4).

We additionally created neuroanatomical deviance markers that correspond to the standardized residuals of the full statistical models. These residuals reflect the extent to which the cerebral measures of an individual deviate from those of other individuals with the same sex, age, and total brain size. The global and regional neuroanatomical markers will be returned to the UK Biobank and made available to UK Biobank researchers for future use. When additional neuroimaging data becomes available, we recommend that future researchers run our code available on OSF to obtain neuroanatomical norms and regional and deviance markers for their participants to obtain more accurate estimates of the neuroanatomical norms and deviance markers of the UK Biobank participants.

Finally, in addition to creating global and regional neuroanatomical norms for each of the 629 brain measures, we computed global neuroanatomical deviance markers, which reflect the deviance of an individual from the norm across all brain regions. This was done separately for volumes, surfaces and thicknesses, as well as across the three types of measures.

An individual with a “normal” brain—where all regional measures are at the expected value given this individual's sex, age, and total cerebral size—should have a global neuroanatomical deviance of 0 for volume, surface, and thickness. However, values ranged from 0.57 to 1.53 for volumes, 0.02 to 0.05 for mean thicknesses, and 0.04 to 0.13 for surface areas, suggesting that it is “normal” to deviate from the cerebral norms (Table 1). In turn, individuals with a global neuroanatomical deviance above the mean had brain regions that deviated from the regional norms more than most individuals, while values below the mean represent reduced deviance from the neuroanatomical norm.

TABLE 1.

Mean and variance sex differences in across global deviance markers

Global deviance markers Min Max Male Female d p VR p
μ SD μ SD
Volumes 0.57 1.53 0.85 0.96 0.82 0.92 0.27 2.20E‐16 0.95 1.56E‐12
Mean thicknesses 0.02 0.05 0.03 0.04 0.03 0.04 0.13 2.20E‐16 1.00 .712
Surface areas 0.04 0.13 0.07 0.01 0.07 0.01 0.12 2.20E‐16 0.98 .027
All measures −2.14 4.74 0.05 0.74 −0.04 0.71 0.12 2.20E‐16 0.97 1.12E‐14

Note: d: Cohen's d. The global volumes, mean thicknesses, and surface areas deviance markers are calculated from Equation (4) and the global neuroanatomical deviance marker (all measures) corresponds to the average of the Z‐score of the three global deviance markers. Significance set to 0.05/6, as six tests were performed. Global volumetric deviance was positively correlated with global surface area deviance (r = .35, p < 2.2e‐16) and global mean thickness deviance (r = .33, p < 2.2e‐16). Global surface deviance and global mean thickness deviance were also positively correlated (r = .17, p < 2.2e‐16).

The means of the volumetric, surface area, and neuroanatomical (all cerebral measures combined) global neuroanatomical deviance were larger in males, suggesting that male volumes and surface areas deviated more from their sex‐specific norm than females. However, females deviated more from their norm in the mean thickness global allometry marker. Males additionally had more variable volumetric and neuroanatomical global allometric markers (Table 1, Figure 1). Thus, investigations of global as well as regional neuroanatomical deviance should always take sex into account.

FIGURE 1.

FIGURE 1

Sex differences in global neuroanatomical deviances across volumes (a), cortical surface areas (b), cortical mean thicknesses (c), and all volumes, cortical surface areas, and cortical mean thicknesses (d). Mean (dashed lines) differences were found across measures, while variance differed between sexes only across volumes. Global allometry marker corresponds to the square root of the sum of squared residuals divided by the number of regions for that measure from the model with age, total brain volume, and sex as well as age2, total brain volume, and sex interactions. Global neuroanatomical deviance marker corresponds to the average of the Z‐score of the three global deviance markers. Significance set to 0.05/6, as six tests were performed

With the world's largest neuroimaging dataset, we created neuroanatomical norms in the UK Biobank, to which any UK Biobank individual can be compared, in the same way that scores from intelligence tests or personality questionnaires can be compared to population norms. With these norms, future UK Biobank studies will be able to examine whether individuals that deviate from the norm on a global or regional brain measure also deviate from the norm in terms of cognitive and behavioral traits or of risk for neurological and psychiatric disorders. Having brain markers that are relative to TCMs (rather than raw measures) will make it easier to distinguish the specific contribution of each regional brain measure from that of more global brain measures. As for global neuroanatomical deviance markers, future studies will be able to investigate the extent to which global neuroanatomical deviance reflects disruptions of brain development or serves as a risk factor for neurodevelopmental or psychiatric disorders. For studies examining the associations of specific regional brain measures with cognitive phenotypes, it may be useful to adjust on global neuroanatomical deviance, on top of total brain size, to fully dissociate regional from global effects.

3.2. Allometry

3.2.1. Global allometry

All global scaling coefficients were hypoallometric (α ranging from .03 to .91), suggesting that these regions increase less than TBV as TBV increases, except for cerebral WMV, which was hyperallometric (α = 1.21). The scaling coefficient of TSA with TBV significantly differed from the theoretical value 2/3 (α = .89) and the scaling coefficient of total MCT with TBV differed from 1/3 (α = .03). In cerebellar GMV, all regions were hypoallometric (α = .50–.95), except for four isometric regions (α = .93–.95). Ventricles and the cerebral spinal fluid were hypoallometric (α = .40–1.01), except for the lateral ventricles (left α = 1.01 and right α = .96). The optic chiasm, corpus callosum, cerebellum WMV, and ventral diencephalon measures were hypoallometric (α = .55–.93), whereas for the mid‐anterior segmentation of the corpus callosum which was isometric (α = .99).

Allometric coefficients were generally consistent with previous studies that report hyperallometry in cerebral WMV (de Jong et al., 2017; Toro et al., 2009) and hypoallometry in the majority of cerebellar (Mankiw et al., 2017), subcortical (de Jong et al., 2017; D. Liu et al., 2014; Reardon et al., 2016; Williams et al., 2020), and corpus callosum volumes (Lefebvre et al., 2015). See Supplemental Table D1 for global deviance from allometry.

3.2.2. Cortical allometry

TBV positively predicted all volumes, TSA positively predicted all surface areas, and total MCT positively predicted all mean thicknesses. Scaling coefficients varied across regions and measures (Table 1). The scaling coefficients of cortical volumes were highly correlated to those of cortical surface areas (r = .85, p < 2.2e‐16) but were not correlated to those of cortical mean thicknesses (r = −.05, p = .535). In cortical regions, 98 volumes, 62 areas, and 63 mean thicknesses were hypoallometric (α = .60–.96), while 18 volumes, 50 areas, and 58 mean thicknesses were hyperallometric (α = 1.04–1.62), and 32 volumes, 36 areas, and 27 mean thicknesses were isometric (α = .94–1.11; Figure 2).

These findings mirror those reported by Liu et al. (2014) and Reardon et al. (2018), who found that scaling across cortical regions is heterogeneous, covering a wide range of hypoallometry, isometry, and hyperallometry values. Similarly to Reardon et al. (2018), who studied the scaling relationship between vertex area and cortical surface area, surface areas were hyperallometric in the middle frontal and supramarginal gyri and sulci, and hypoallometric in the sensorimotor cortices (precentral, paracentral, and postcentral gyri and sulci), occipital temporal regions, and some cingulate (anterior, mid‐anterior, and post dorsal) and callosal (sub and pericallosal) regions.

FIGURE 2.

FIGURE 2

Scaling coefficients of cortical surface areas, mean thicknesses, and volumes with total brain volume (TBV). Values are the scaling coefficients of a region with TBV and range from 0.61 (volume of the right posterior ramus of lateral sulcus) to 1.63 (mean thickness of the left paracentral gyrus and sulcus). The flat representation corresponds to the flattened image of the superior view with the midline of the circle reflecting regions within the sagittal plane and circle edges reflecting inferior regions. Figures made with https://neuroanatomy.github.io/cortex/ (Toro, 2020) [Correction added on 26 July 2021, after first online publication: Figure 2 has been replaced.]

3.2.3. Subcortical allometry

All subcortical volumes were hypoallometric (α = .49–.92) except for the right lateral posterior and the right limitans suprageniculate nuclei, which were isometric (α = .95, α = 1.08, respectively), and the left limitans suprageniculate nuclei and left accumbens area, which were hyperallometric (α = 1.17, α = 1.11, respectively).

In line with previous findings (Jäncke et al., 2015; D. Liu et al., 2014; Williams et al., 2020), whole subcortical structures were generally hypoallometric. So far, only two other studies have examined and reported variations in scaling within subcortical structures. Whereas the present study examined scaling between subcortical subsegmentation volumes and TBV, Reardon et al. (2018) studied the scaling relationship of the vertex area of subcortical subsegmentations with total subcortical area. van Eijke et al. (2020) examined sex differences in hippocampal subsegmentations when considering the allometric relationship between subsegmentations and either TBV or total hippocampal volume, but they did not report scaling coefficients. The sparse literature on regional to global scaling within subcortical regions suggests that various global measures can be considered. However, the choice of global measure depends on the question at hand. Given that the majority of studies, including ours, compare regional cerebral measures between populations that differ in total brain size, we felt that TCM was the most relevant measure in our study to yield unbiased estimates of regional group differences. While our finding that allometry varies within subcortical volumes extends our understanding of the cerebral scaling relationships between regional and total brain measures, further studies should investigate whether additional global measures should be taken into account when examining regional group differences.

3.2.4. Mean thickness and surface area scaling with TBV

Exploratory analyses on the scaling relationship between TBV and cortical mean thicknesses or surface areas revealed that the scaling coefficients of all mean thicknesses with TBV were different from one‐third, whereas the scaling coefficients of 19 regional surface areas with TBV did not differ from two‐thirds (Supplemental Tables D6 and D7).

As the first study to examine scaling coefficients of regional cortical mean thicknesses with TBV that go beyond lobar segmentations, this study adds the literature supporting that larger brains are not simply a scaled‐up version of smaller brains (Finlay et al., 2001; Im et al., 2008; Toro et al., 2008). If the brain regions grew proportionally to brain size, total MCT would scale to the power of one‐third with TBV, and TSA to the power of two‐thirds with TBV (Finlay et al., 2001). Yet, we find that total MCT scaled to the power of 0.03 with TBV and that the majority of cortical mean thicknesses had allometric scaling coefficients close to 0, reflecting the stability of cortical mean thicknesses with TBV growth. Moreover, TSA scaled to the power of 0.89 with TBV and the majority of cortical surface areas had hypoallometric scaling coefficients that were larger than two‐thirds. The greater than geometrically expected hypoallometry across surfaces can be explained by the dramatic increase in gyrification (Fish et al., 2017), and more specifically, in sulcal convolution, which occurs with the expansion of TBV (Im et al., 2008; Toro et al., 2008). Finally, the heterogeneity of allometric patterns observed across the cortex may be explained by the nonuniform gyrification of the cortical surface (Fish et al., 2017).

3.2.5. Sex‐ or age‐dependent allometry

Allometry depended on sex (0.2%, 12/628) or age (11%, 71/628) in relatively few regions (details in Supplemental Info 5.1). Sex differences in allometry were less frequent, although larger (|α| = .05–.12) than age differences in allometry (|α| = .01–.04). Regional volumes generally increased less with TBV in females, while mean thicknesses increased less with total MCT in males. For instance, the right fimbria volume increases less with TBV in females (α = .29) compared to males (α = .50, β = −.12, SE = 0.02), while the left surface area of the pericallosal sulcus increases more with TSA in females (α = .71) compared to males (α = .59, β = .09, SE = 0.01; Figure S3). As for age‐dependent changes in allometry, the right fimbria volume increases less with TBV in younger (α = .74) compared to older individuals (α = 1.13, β = .03, SE = 0.01), while the mean thickness of the left transverse temporal sulcus increases more with the TBV in younger (α = 1.49) compared to older individuals (α = 1.22, β = −.04, SE = 0.00; Figure S4).

Although sparse, significant interaction between TCM and age or sex suggests that matching individuals between groups by TCM may not be appropriate for all regions. Cerebral sex differences may reflect sex‐dependent distributions of tissues instead of individual differences in brain size (Luders, Gaser, Narr, & Toga, 2009). To our knowledge, age‐dependent allometry has not been investigated by previous studies. Of the two other studies that examined sex‐dependent allometry (Reardon et al., 2016; van Eijk et al., 2020), neither Eijke sex‐dependent allometry in the subcortical regions they examined. Similarly to Reardon et al. (2016), we did not find a sex‐dependent allometric relationship in the pallidum. However, we reported a significant sex‐dependent allometric relationship in hippocampal fimbria that was not observed by van Eijk et al. (2020). Considering the small effect size of these interactions on the fimbria, we speculate that the sample size of van Eijk et al.'s (2020) study was insufficient to detect these interactions. However, another explanation could be that sex‐dependent allometric relationships within the hippocampus are absent in the adolescent and younger adult population. Since interactions with brain size are often overlooked in studies examining sex and age differences (e.g., Ritchie et al., 2018; Vinke et al., 2018), the presence of some significant interactions in our study suggests that they should be considered across regions to obtain unbiased estimates of age and sex effects on the brain and to accurately identify associations between brain regions and behavioral or cognitive traits.

3.2.6. Conclusion on allometry

All brain regions varied with global brain measures. The majority (86%) of brain regions were allometric as they scaled nonlinearly with their TCM. Of the regions exhibiting allometry, hypoallometry occurred in 93% of volumes, 55% of the cortical surface areas, and 52% of the cortical mean thicknesses (Table 2). While the association between scaling and cognition and behavior remains unknown, our study adds to the literature (de Jong et al., 2017; Finlay et al., 2001; Jäncke et al., 2015; Jäncke, Liem, & Merillat, 2019; Toro et al., 2009) supporting allometric scaling as an inherent property of the brain that varies across regions and cerebral measures, and provides scaling coefficients for regions that were not previously investigated (e.g., subcortical subsegmentations, ventricles, etc.).

TABLE 2.

Cerebral regions exhibiting brain allometry

Measure N % Min First Qu. Median Mean Third Qu. Max
Cortical surface areas (N = 148) Hypoallometric 62 42 0.64 0.76 0.83 0.82 0.91 0.95
Hyperallometric 50 34 1.06 1.1 1.13 1.16 1.21 1.39
Cortical mean thicknesses (N = 148) Hypoallometric 63 43 0.66 0.78 0.84 0.83 0.89 0.96
Hyperallometric 58 39 1.04 1.09 1.14 1.2 1.31 1.62
Cortical volumes (N = 148) Hypoallometric 98 66 0.6 0.73 0.82 0.8 0.89 0.96
Hyperallometric 18 12 1.05 1.07 1.1 1.15 1.16 1.59
Freesurfer subcortical subsegmentation volumes (N = 116) Hypoallometric 113 97 0.49 0.68 0.75 0.74 0.8 0.92
Hyperallometric 1 1 1.17 1.17 1.17 1.17 1.17 1.17
FAST cerebellar volumes (N = 28) Hypoallometric 24 86 0.5 0.68 0.74 0.75 0.82 0.94
ASEG subcortical volumes (N = 8) Hypoallometric 7 88 0.69 0.73 0.75 0.78 0.81 0.92
Hyperallometric 1 13 1.11 1.11 1.11 1.11 1.11 1.11
ASEG ventricle and CSF volumes (N = 10) Hypoallometric 7 70 0.4 0.47 0.49 0.51 0.55 0.62
ASEG cerebellum, corpus callosum, ventral DC, optic chiasm volumes (N = 14) Hypoallometric 12 86 0.55 0.66 0.78 0.76 0.85 0.93
Hyperallometric 1 7 1.21 1.21 1.21 1.21 1.21 1.21
Global measures (N = 9) Hypoallometric 8 89 0.03 0.77 0.73 0.83 0.9 0.91
Hyperallometric 1 11 1.21 1.21 1.21 1.21 1.21 1.21

Note: Values displayed are allometric scaling coefficients (α). Hyperallometric α > 1, and hypoallometric: α <1.

Abbreviations: CSF, cerebral spinal fluid; DC, diencephalon.

3.3. Sex differences

3.3.1. Global measures

TBV was significantly larger in males than in females (β = −1.14). Once TBV was adjusted for with the allometric adjustment (Equation (3)), the cerebellar WMV (β = .27), cerebellar GMV (β = .25), total MCT (β = .12), and cerebral WMV (β = .06) were greater in females, while brainstem volume (β = −.21), total subcortical volumes (β = −.08), TSA (β = −.07), and cerebral GMV (β = −.02) were greater in males.

Consistent with previous studies, males had a larger TBV (e.g., Ritchie et al., 2018; Ruigrok et al., 2014), while females had a relatively larger total MCT (Im et al., 2008, p. 200; van Velsen et al., 2013). However, we did not find that males and females had a similar TSA (Im et al., 2008), nor did we observe greater cerebral WMV in males relative to brain size (Chen, Sachdev, Wen, & Anstey, 2007; Gur et al., 1999). Instead, our analyses revealed that males have a larger TSA and females a greater cerebral WMV. In light of the small magnitude of these sex differences (<0.1) and the sample size of previous studies (N < 150), we speculate that past studies were not sufficiently powered to reliably estimate these effects.

3.3.2. Cerebellar GMV

The Freesurfer ASEG cerebellum GMV—used to calculate TBV—was larger in males (β = − 0.36), whereas the FAST cerebellum GMV—calculated as the sum of the cerebellar lobes and vermes from FAST Diedrichsen Cerebellar Atlas—was larger in females (β = .25).

Discrepancies in results between segmentation algorithms may stem from Diedrichsen's segmentation algorithm ignoring individual WMV and GMV intensities, which are taken into account by Freesurfer. Or, differences may be due to Freesurfer overlabeling peripheral tissue as it is more sensitive in regions of low contrast between tissue types (Carass et al., 2018). Although the cerebellar GMV of the FAST and ASEG segmentations only correlated at 76%, differences across cerebellar segmentations did not influence our measure of TBV, as the TBV calculated by summing regional segmentations (including the FAST GMV) correlated at 99.8% with our measure of TBV (derived from the ASEG segmentations).

3.3.3. Regional cerebellar GMVs

When examining sex differences in the cerebellum with the FAST cerebellum regional segmentations, females had larger cerebellar GMV in 82% of regions (23/28) with coefficients ranging from 0.08 (right crus I) to 0.64 (vermis X). The crus I vermis and the left and right lobule V did not differ between sexes and the left and right lobule X were larger in males (β = −.17, β = −.12, respectively).

Our findings contrast with the literature on sex differences in cerebellar regions (review in S. Han, An, Carass, Prince, Resnick (2020) and S. Han, Carass, He, Prince (2020)). For instance, S. Han, An, et al. (2020) and S. Han, Carass, et al. (2020) reported that the VIIIa lobules were larger in males and that the right I–III lobules, and IX and X vermes were larger in females when adjusting for intracranial volume with the linear covariate method. In our replication of Mankiw et al.'s (2017) study, we found that, instead of being greater in males, the cerebellum, flocculus, cerebellar lobule VIIb, VIIb, and VIIa, and crus II volumes were greater in females and that the flocculus volume did not vary between sexes. Although both studies used segmentation algorithms that have a better parcellation accuracy than the SUIT segmentation (Diedrichsen, Balsters, Flavell, Cussans, & Ramnani, 2009) of the present study (Carass et al., 2018; S. Han, Carass, et al., 2020), these discrepancies may also stem from differences in sample age (mean age = 12.5 and 70 years old, respectively) and size (N = 116 and 2,023, respectively).

In light of the difficulty of segmenting the cerebellum and the differences in regional specificity and accuracy across segmentations, we suggest that future studies take advantage of the large UK Biobank dataset to apply and compare cerebellar segmentation algorithms.

3.3.4. Whole subcortical volumes

The thalamus (right β = −.15, left β = −.08), putamen (left and right β = −0.18), left pallidum (β = −.08), and left amygdala (β = −.12) were larger in males and the hippocampus (right β = .07, left β = .06) and left accumbens were larger in females (β = .10). The right pallidum volume, the caudate volumes, and the right accumbens area volume did not differ between sexes.

Our findings are consistent with previous studies reporting greater putamen volume in males (S. Liu, Seidlitz, Blumenthal, Clasen, & Raznahan, 2020; Lotze et al., 2019; Ritchie et al., 2018), larger thalamic volumes in males (Lotze et al., 2019), and bigger hippocampal volumes in females (Malykhin, Huang, Hrybouski, & Olsen, 2017; Nordenskjöld et al., 2015). However, our results contrast with research supporting the presence of sex differences in the right amygdala (S. Liu et al., 2020; Lotze et al., 2019; Ritchie et al., 2018) and the left hippocampus (Ritchie et al., 2018; Tan, Ma, Vira, Marwha, & Eliot, 2016) or greater male hippocampal volume (Lotze et al., 2019; Pintzka, Hansen, Evensmoen, & Håberg, 2015). Finally, we similarly report that males have larger putamen, pallidum, and left amygdala volumes in our replication of studies examining sex differences when considering brain allometry (Reardon et al., 2016; Sanchis‐Segura et al., 2019). See Supplemental Figure File 6 for a comparison of sex differences in effect sizes across large‐scale studies.

We additionally replicated Ritchie et al.'s (2018) findings that males had greater pallidum, putamen, and amygdala volumes, when adjusting for TBV with the linear covariate adjustment and analyzing FIRST subcortical segmentations. For a detailed analysis of the replication by region, see Supplemental Info 6.4.2 and Supplemental Tables G5–G9.

When adjusting for TBV with the linear covariate or the allometric approach in the replication models with the FIRST segmentations, we found similar subcortical sex differences to those reported in our main analyses with the Freesurfer segmentations with some exceptions. Specifically, instead of being larger in males, the left thalamus was larger in females and right thalamus did not show sex differences in the replication models. Moreover, the right accumbens area was larger in females and the right pallidum was larger in males in the replication analyses, although they did not differ between sexes in the main analyses. These discrepancies may stem from the different terms and interactions included in the main and replication analyses or from differences between the FIRST and Freesurfer subcortical segmentations. For instance, FIRST provides a segmentation of the amygdala more similar to that of manual tracing than Freesurfer (Morey et al., 2009), although the amygdala agreement with manual segmentation is relatively poor compared to other regions such as the hippocampus (Morey et al., 2010), potentially due to the complexity of the structure (Schoemaker et al., 2016).

Sex differences from our main analyses additionally varied across hemispheres. For instance, the amygdala volume was larger for males in the left hemisphere and similar across sexes in the right hemisphere. However, seemingly inconsistent results on whole subcortical structures may be illuminated by examining their subcomponents, as provided by the Freesurfer subcortical subsegmentations.

3.3.5. Subcortical subsegmentations

Sex differences were found in 67% of the subcortical subsegmentations (74/110), with greater male volume in 42 (38%) regions and greater female volume in 32 (29%) regions (details in Supplemental Info 5.2.3).

The magnitude of the subsegmentation subcortical sex differences was not perceptible at the whole subcortical level due to the presence of sex differences in opposite directions. For instance, medial and lateral regions of the thalamus were considerably larger in females (β ranging from .06 to .25), although the whole thalamus volume was greater in males (right β = −.15, left β = −.08). Moreover, the right cortical nucleus (β = .13) was larger in females even though sex differences were absent in the whole right segmentation of amygdala.

The only other study taking into account brain allometry when examining sex differences across subcortical subsegmentations examined sex effects across hippocampal subsegmentations while adjusting for TBV with the allometric and linear covariate approach (van Eijk et al., 2020). When combining data from the Queensland Twin Imaging study (N = 727) and the Human Connectome Project (N = 960), van Eijk et al. (2020) similarly reported a larger hippocampal fissure in males. However, in contrast to our finding that females had a larger presubiculum in the left hemisphere, they found that males had larger presubiculum volumes. While males had larger fimbria volumes in their linear model, females in our study had a greater fimbria volume in the left hemisphere in both linear and allometric models. The hippocampal subfields from the UK Biobank and the Queensland Twin Imaging cohorts were extracted using the T1‐weighted subcortical images, whereas T2‐weighted images were also used for the Human Connectome Project. Although a recent study suggests that T1‐weighted images may not be adequate enough to determine the internal boundaries of the hippocampal subfields (Wisse et al., 2021), there is currently no consensus on the matter. Since our results were not more similar to those of the Queensland Twin Imaging cohort, we speculate that these differences stem from a combination of factors, including differences in sample size and age, as their participants were between 21 and 36 years old, and differences in the MRI acquisition parameters (Stonnington et al., 2008; Takao, Hayashi, & Ohtomo, 2013) as well as the T images used to extract the hippocampal subfields (Wisse et al., 2021).

Taken together, these findings support a high variability of sex differences within subcortical structures and highlight the importance of conducting more analyses on fine‐grained segmentations to better understand where cerebral sex differences lie.

3.3.6. Cortical regions

Cortical sex differences were present in 57% of surface areas (84/148) and 66% of volumes (97/148) and mean thicknesses (98/148). No clear spatial trend in sex differences was apparent across lobes. In terms of cortical volumes, 62 regions (42%) were larger in males, ranging from −0.27 (right occipital pole) to −0.06 (right inferior temporal sulcus), and 35 regions (24%) were larger in females, ranging from 0.06 (right precentral gyrus) to 0.26 (right postcentral gyrus). As for cortical mean thicknesses, 50 regions (34%) were greater in males and 48 (32%) were greater in females. Greater mean thicknesses in males varied from −0.39 (left medial orbital—olfactory sulcus) to −0.05 (parieto‐occipital sulcus or fissure), while greater mean thicknesses in females ranged from 0.06 (left transverse frontopolar gyri and sulci) to 0.31 (left transverse temporal sulcus). Finally, in terms of cortical surface areas, males had larger surface areas in 50 regions (34%) ranging from −0.22 (right medial occipito‐temporal‐collateral—sulcus and lingual sulcus) to −0.06 (left precentral gyrus), and females had larger surface areas in 34 regions (23%) with coefficients ranging from 0.06 (left superior frontal sulcus) to 0.21 (right anterior transverse temporal gyrus of Heschl; Figure 3).

FIGURE 3.

FIGURE 3

Sex differences across cortical measures. Sex effects differences from −0.40 (mean thickness of the left medial orbital sulcus) to 0.32 (mean thickness of the left transverse temporal sulcus). Negative effects reflect greater male than female volumes. The flat representation corresponds to the flattened image of the superior view with the midline of the circle reflecting regions within the sagittal plane and circle edges reflecting inferior regions. Figures made with https://neuroanatomy.github.io/cortex/ (Toro, 2020)

We replicated the majority of sex differences (93%, 84/94) reported by Ritchie et al. (2018) with the Desikan–Killiany Cortical segmentations when adjusting for TCM with the linear covariate approach. We observed significant sex differences in 69 additional regions, possibly due to our larger sample size. Sex differences from the main analyses with the Destrieux segmentation and the replication analyses with the Desikan–Killiany appeared to be generally consistent.

In line with a study of cortical volumetric sex differences in 411 middle aged participants (Chen et al., 2007), we found that males had a larger left inferior temporal gyrus and larger right occipital lingual and right middle temporal gyri, while females had a larger right inferior parietal gyrus, right postdorsal part of the cingulate gyrus and sulcus, and left and right mid‐anterior and postventral parts of the cingulate gyrus. Consistent with a study of 2,838 middle aged adults (Lotze et al., 2019), we found that females have larger volumes in the superior parietal lobe and right orbitofrontal cortex, whereas males have larger volumes in the left temporal pole and right fusiform gyrus. Yet, in contrast with the previous literature on sex differences in GMV (Lotze et al., 2019; Ritchie et al., 2018; Ruigrok et al., 2014), we reported that females have a larger precentral gyri than males and that males have a larger right anterior part of the cingulate gyrus and sulcus than females. Moreover, we did not find sex differences in the left anterior part of the cingulate gyrus and sulcus, although past studies suggest that this volume was larger in females (Lotze et al., 2019; Ruigrok et al., 2014). The divergence in some of the presented results can partly be attributed to differences in sample size, varying neuroimaging techniques, and to the small size of these effects (median |β| = .09) as well as to differences in sample age range. And yet, we observed a similar pattern of sex differences in our replication of Ritchie et al.'s (2018) analyses with the allometric adjustment for TCM, suggesting that the different terms included in our models and the differences between the Destrieux and the Desikan–Killiany Atlases had little influence on the majority reported cortical sex differences.

3.3.7. Sex differences in variance

In addition to observing mean sex differences in two‐thirds of regions, we found sex differences in variance in 49% (306/629) of regions. A total of 253 (40%) regions exhibited greater male variability and 56 (9%) exhibited greater female variability (Table 3). Sex differences in variance ranged from 0.82 (for the right cerebellar lobule VIIIa, implying greater male variability) to 1.17 (for the optic chiasm; Supplemental Info 6.1 and Supplemental Tables E1–E7).

TABLE 3.

Variance ratios of sex differences across segmentations

Segmentation Sex differences in variance ratio N % Min First Qu. Median Mean Third Qu. Max
Cortical volumes (N = 148) M > F 63 43 0.91 0.94 0.95 0.95 0.96 0.97
M < F 5 3 1.02 1.04 1.05 1.07 1.08 1.14
Cortical surface areas (N = 148) M > F 71 48 0.90 0.94 0.95 0.95 0.96 0.97
M < F 8 5 1.03 1.04 1.04 1.06 1.08 1.14
Cortical mean thicknesses (N = 148) M > F 26 18 0.90 0.95 0.95 0.95 0.96 0.97
M < F 36 24 1.03 1.05 1.06 1.06 1.07 1.15
Cerebellar volumes (N = 28) M > F 23 16 0.82 0.90 0.94 0.92 0.95 0.97
M < F 2 1 1.04 1.04 1.04 1.04 1.04 1.04
Subcortical subsegmentations (N = 116) M > F 64 55 0.88 0.93 0.94 0.94 0.96 0.97
ASEG subcortical volumes (N = 8) M > F 2 25 0.95 0.96 0.96 0.96 0.96 0.96
ASEG ventricle and CSF volumes (N = 10) M > F 2 20 0.95 0.95 0.95 0.95 0.95 0.96
M < F 2 20 1.07 1.08 1.09 1.09 1.10 1.11
ASEG cerebellum, corpus callosum, ventral DC, optic chiasm volumes (N = 14) M > F 5 36 0.93 0.94 0.95 0.95 0.95 0.96
M < F 3 21 1.02 1.03 1.04 1.08 1.10 1.17
Global measures (N = 9) M > F 1 11 0.90 0.90 0.90 0.90 0.90 0.90

Note: Values represent variance ratios. Variance ratio = Female SD/male SD.

Abbreviations: CSF, cerebral spinal fluid; F, female; M, male; Qu., quartile.

As reported by previous studies (Ritchie et al., 2018; Wierenga et al., 2018, 2020; Wierenga, Bos, van Rossenberg, & Crone, 2019), the majority of brain regions with sex differences in variance were more variable in males (82%) compared to females (18%). Overall, cerebellar lobes and vermes were also more variable in males. Sex differences in variance across subcortical subsegmentations were also greater in males in the hippocampus, amygdala, and thalamus. However, while volumes and surface areas were more variable in males, mean thicknesses were generally more variable in females. When examining the correlation between a region's sex effect standardized beta and variance ratio, we found that larger regions in one sex were typically more variable in the other sex (r = −.33). There were some exceptions. For instance, the left and right post central gyrus were larger and more variable in females while the caudate, which did not show mean sex differences, was more variable in males.

Our findings thus generally support greater male variability, a phenomenon which is known to extend beyond brain measures. The greater male variability hypothesis states that males are more variable than females across a variety of psychological and physical characteristics. The greater male variability hypothesis states that males are more variable than females across a variety of psychological and physical characteristics (Ellis, 1894) and is widely supported by a range of human (e.g., Johnson, Carothers, & Deary, 2008; Ju, Duan, & You, 2015; Karwowski et al., 2016; Lehre, Lehre, Laake, & Danbolt, 2009; Wierenga et al., 2019) and animal (e.g., Branch et al., 2020; DeCasien, Sherwood, Schapiro, & Higham, 2020) studies. Although the mechanisms behind the greater male variability hypothesis exceed the scope of the present study, our findings further support greater male variability, which extends well beyond brain measures.

3.3.8. Conclusion on sex differences

Overall, we found that sex differences in the brain are the rule rather than the exception, affecting two‐thirds (418/629) of the investigated brain measures, with 231 regions relatively greater in males. The standardized coefficients (β) of the sex effect of cortical volumes were highly correlated to those of cortical surface areas (r = .77, p = 2.28e‐43) and moderately so for cortical mean thicknesses (r = .45, p = 1.23e‐08).

Although many of the regional sex differences had very small effect sizes (β < .1) and were only significant due to the large sample size, 46% of these cerebral measures (292/629) had a sex difference above 0.1. Specifically, sex differences in cerebellar GMV and WMV, total MCT, the corpus callosum, and the ventricles were generally greater than 0.1 (details in Supplemental Info 5.2), whereas sex differences in cerebral GMV and WMV, TSA, 51% of subcortical regions and 35–45% of the cortical regions were under 0.1.

3.4. Age effects

3.4.1. Global measures

All cerebral measures decreased linearly with age (β ranging from −.04 in to −.33), except for the brainstem volume, TSA, and cerebral WMV which increased (relatively to TBV) with age (β = .13, β = .11, β = .04, respectively). The quadratic age term did not significantly predict cerebral GMV and WMV or the brainstem volume. However, total subcortical volume (β = .01) and TSA (β = .02) positively, and TBV (β = −.05), total MCT (β = −.04), cerebellar GMV (β = −.05), and cerebellar WMV (β = −.02) negatively varied with quadratic age.

In line with the literature, TBV and total MCT decreased with linear and quadratic age (Ritchie et al., 2018; van Velsen et al., 2013; Vinke et al., 2018) and total MCT decreased more rapidly with linear age in males (van Velsen et al., 2013). Yet, our finding that TSA increased with linear and quadratic age contrasts with previous reports of a decrease in surface area across the lifespan (Hogstrom, Westlye, Walhovd, & Fjell, 2013; Lemaitre et al., 2012; Long et al., 2012). Divergent results between our study and those of Hogstrom et al. (2013) and Long et al. (2012) can be explained by their omission of brain size, as we similarly observed a decrease of TSA with age when excluding TBV from our models. However, when applying the proportion TCM adjustment, as done by Lemaitre et al. (2012), we observed an increase in TSA with age, suggesting that differences between our studies may instead stem from differences in segmentation or sample characteristics (e.g., smaller sample [N = 216], wider age range (18–87 years old)). Based on our findings, the relative expansion of TSA in older adults with age may reflect a global spread of the sulci, which appears to occur more rapidly in males than females.

3.4.2. Cerebellar volumes

There was a linear decline with age of the cerebellar GMV for the FAST (β = −.20) and the Freesurfer ASEG (β = −.02) segmentations. However, the FAST cerebellar GMV was also negatively predicted by quadratic age (β = −.05) and its linear decline with age was quicker in males compared to females (β = .03). As for the regional FAST cerebellar GMVs, 27 out of 28 (96%) cerebellar volumes decreased linearly with age. The age effect in the cerebellar IX vermis (β = −.02) did not reach significance. Linear age effects ranged from −0.20 (right crus I) to −0.05 (vermis crus I). We found a negative quadratic age effect in 74% (20/27) of regions with a linear age effect, which ranged from −0.07 (left cerebellar lobule VIIb) to −0.03 (right cerebellar lobule VIIb). The cerebellar IX vermis was the only area with a quadratic age but no linear age effect (β = −.04).

The linear decrease with age of the cerebellum mirrors previous findings (for review, see Bernard & Seidler, 2013). The literature also similarly reports a linear rather than a nonlinear cerebellar change with age (for review, see Fjell et al., 2013; Fjell & Walhovd, 2010) and the absence of an age by sex interaction in cerebellar volumes (Hoogendam et al., 2012; Raz et al., 2005) when examining the Freesurfer segmentation of the cerebellum. The discrepancies in results between cerebellar segmentations further highlight the non‐negligible impact that the type of cerebellar segmentation algorithm has on reported results (as discussed in Sections 3.2.2 and 3.2.3).

Although our findings add to the scarce literature on age related changes within the cerebellum, age effects in these GMVs remain highly inconsistent (Bernard & Seidler, 2013; S. Han, An, et al., 2020; Koppelmans et al., 2017). We speculate that these differences in reported results can be attributed to the insufficient sample size of previous studies to investigate the these effects (our median |β| = .17; N = 54 for Bernard and Seidler (2013) and N = 213, for Koppelmans et al., 2017) and differences in segmentation algorithms, which vary in accuracy and in the number of segmented cerebellar regions (Carass et al., 2018; L. Han et al., 2019).

3.4.3. Whole subcortical and subcortical subsegmentation volumes

The putamen (β = −.09 for both), accumbens area (left β = −.33, right β = −.24), amygdala (left β = −.16, right β = −.14), and hippocampus (left β = −.23, right β = −.19) decreased with age. The pallidum (left β = .03, right β = .06) and caudate (left β = .12, right β = .15) volumes increased with linear age, whereas the thalamus did not show significant linear age effects. The accumbens area, amygdala, and hippocampus volumes had a negative quadratic age effect, ranging from −0.03 (right accumbens area) to −0.09 (left presubiculum head), and the thalamus, caudate, and right putamen all showed positive quadratic age effects, ranging from 0.02 (right putamen) to 0.06 (left caudate).

Linear age was a significant predictor of 95% (104/110) of subsegmentations and quadratic age was a significant predictor of 88 (104/110) subsegmentations. Although there were no age effects at the whole subcortical level of the thalamus, we found 24 linear age effects with an absolute effect size greater than 0.1 across thalamic subsegmentations. More positive linear and quadratic age effects were found in ventral and intralaminar thalamic volumes. On the other hand, the majority of the amygdala and the hippocampal subsegmentations (>92%) decreased with age. The direction of the quadratic age effects on subcortical subsegmentations was similar to that of the whole subcortical volumes, as we found negative quadratic age effects across the amygdala and hippocampal subsegmentations and positive quadratic age effects across the thalamic subsegmentations.

The volumetric decline in the amygdala, hippocampus, putamen, and nucleus accumbens with age mirrors previous findings (Hogstrom et al., 2013; Kurth, Cherbuin, & Luders, 2017; Sele, Liem, Mérillat, & Jäncke, 2020; Vinke et al., 2018). However, our finding of an increase in pallidum volume with age contrasts with past studies reporting a small decrease with age in this region (Sele et al., 2020; Vinke et al., 2018). Moreover, while we add to the literature reporting an expansion of the caudate with age (Vinke et al., 2018), a small decrease has also been reported (Sele et al., 2020). In terms of the discrepancy in the caudate results, we speculate that Sele et al.'s (2020) sample (N = 231) was insufficient to observe such small changes with age (β = .05–.06). As for the discrepancies in the pallidum results, differences may be attributed to the different terms included in the regressions, as Vinke et al. (2018) modeled nonlinear changes with splines instead of a quadratic age.

3.4.4. Cortical regions

Cortical regions increased with linear age in 22% of volumes, 35% of mean thicknesses, 24% of surface areas, and declined with linear age in 41% of volumes, 52% of mean thicknesses, and 33% of surface areas (Figure 4). Cortical volumes decreased linearly with age in 60 regions, ranging from −0.13 (right precentral gyrus) to −0.02 (right parieto‐occipital sulcus), and increased with age in 33 regions, ranging from 0.03 (right superior segment of circular sulcus of the insula) to 0.13 (left central sulcus). Cortical mean thicknesses decreased with linear age in 77 regions, ranging from −0.14 (right inferior circular sulcus of the insula) to −0.02 (left posterior dorsal cingulate gyrus), and increased with age in 52 regions, ranging from 0.02 (left middle occipital gyrus) to 0.22 (right cuneus gyrus). Surface areas linearly increased with age in 36 regions, ranging from 0.02 (left superior frontal gyrus) to 0.15 (right central sulci), and decreased with age in 49 regions, ranging from −0.11 (right the orbital sulci) to −0.02 (left lateral aspect of the superior temporal gyrus). As for quadratic age effects, we found a positive quadratic age effect in 5% of volumes, 6% of mean thicknesses, 3% of surface areas, and a negative quadratic age effect in 1% of volumes, 9% of mean thicknesses, 3% of surface areas. For further details on cortical age effects, see Supplemental Word Document Section 5.3.

FIGURE 4.

FIGURE 4

Linear age effects across cortical measures. Age effects ranged from −0.40 (mean thickness of the right inferior segment of the circular sulcus of the insula) to 0.22 (mean thickness of the right cuneus gyrus, O6). The flat representation corresponds to the flattened image of the superior view with the midline of the circle reflecting regions within the sagittal plane and circle edges reflecting inferior regions. Figures made with https://neuroanatomy.github.io/cortex/ (Toro, 2020)

Our findings support and extend the literature reporting large age‐related changes across the cortical measures (e.g., Fjell et al., 2009; Lotze et al., 2019; Pintzka et al., 2015; Salat et al., 2004; Storsve et al., 2014). The majority of frontal volumes decreased with linear age, while frontal surface areas linearly decreased with age in the orbital gyri and sulci and the inferior frontal gyrus. Frontal mean thicknesses decreased with linear age in the frontal medial and frontal superior regions. Temporal surface areas and volumes generally decreased with linear age, whereas age effects on temporal mean thicknesses were more variable. For instance, we observed a mean thickness thinning in the lateral aspect of the superior temporal gyrus and the middle temporal gyrus and an increase in thickness in the planum polare, Heschl's gyrus, lingual gyrus, and the temporal pole. Occipital regions generally decreased with linear age in surface areas and volumes, and increased with linear age in mean thicknesses, while parietal regions mainly decreased with linear age across cerebral measures.

Our findings additionally shed a light on the inconsistent age‐related changes reported in motor, somatosensory, and visual cortices (Hogstrom et al., 2013; for review, see McGinnis, Brickhouse, Pascual, & Dickerson, 2011). We found the largest surface area expansions with age in the paracentral lobule and sulcus and the precentral and postcentral gyrus (motor and somatosensory cortices) and a reduction of a similar size in the mean thicknesses and volumes of these regions with age. As for the visual cortices, the largest mean thickness expansions with age occurred in the cuneus gyrus, the lingual gyrus, occipital pole, the inferior, middle, and superior, occipital gyri. The surface areas of these regions generally decreased with age and their volumes were either unaffected by age or showed a slight increase with age.

3.4.5. Conclusion of age effects

We observed a linear change with age in 76% (480/629) and a quadratic one in 25% (159/629) of cerebral measures. About 49% of regions decreased with linear age and 28% increased with linear age. Regions with quadratic age effects typically displayed linear age effects. There were significant sex by age interactions in 14% of regions (87/629), ranging from −0.16 (right cerebellar lobule X) to 0.19 (left fimbria). See Supplemental Info 5.4 for the results and discussion of the sex by age and sex by age2 interactions and Supplemental Info 5.3 for further details on the ASEG and Freesurfer subsegmentations linear and quadratic age effects.

3.5. Does brain allometry influence reported results?

We examined the effects of omitting brain allometry and adjusting for TCM with different methods. To do so, we compared the number of significant results from our main analyses obtained with the allometric TCM adjustment to the number of significant results obtained when using the linear covariate or proportion TCM adjustment. The models with the linear covariate TCM adjustment yielded similar significant effects and interactions (i.e., age and sex effects and their interactions with and without TCM) to the allometric models: The linear covariate model underestimated or overestimated effects in 2.35% (102/4340) of statistical tests and these differences occurred in regions near significance with small effect sizes (Supplemental Figures File 3 and 4, https://osf.io/s4qc5/). In contrast, the proportion adjustment for TCM overestimated 14.24% (618/4340) of effects and interactions reported in our main analyses compared to the allometric TCM adjustment (see Supplemental Info 6.2.1 and Supplemental Table F1 for details).

In our replication of the FIRST subcortical and Desikan–Killiany cortical sex differences reported by Ritchie et al. (2018), models with the linear covariate and the allometric approach additionally yielded consistent results. Finally, based on our correlational analyses of a region's deviance from isometry (i.e., |1—scaling coefficient|) and the difference in the effect size of a term between models with varying TCM adjustments (e.g., |proportion sex β ‐ allometric sex β|), we found that discrepancies in significance between the linear and allometric models were accentuated in more allometric regions, specifically for the proportion models (see Supplemental Info 6.2.3, Supplemental Table F5, and Supplemental Figures File 5 https://osf.io/s4qc5/).

Sex differences in variance differed in 38.06% (236/620) of regions between the allometric and the linear covariate approach and in 29.03% (180/620) of regions between the allometric and the proportion approach. These discrepancies additionally lead to a change in the direction of the correlation between a region's sex standardized beta and variance ratio. When adjusting for TCM with the proportion or the linear covariate approach, regions that were larger in males became more variable in males, instead of being more variable in females (see Supplemental Info 6.2.2 and Supplemental Tables F2–F4). We found a similar change in the direction of the correlations between the sex effect Cohen's d and variance ratios in our replication of Ritchie et al.'s (2018) study with the cortical Desikan–Killiany and subcortical FIRST segmentations (Supplemental Table G10).

In line with previous research, results from the linear covariate and allometric approach are more similar than the results of the proportion and allometric approach (Mankiw et al., 2017; Reardon et al., 2016; Sanchis‐Segura et al., 2019). We find that the major source of variation in mean results across models with differing TCM adjustments is due to the omission of the intercept of the relationship between a region and its TCM (as in the proportion method), rather than the omission of its nonlinear relationship (as in the proportion and linear covariate methods). Moreover, we are the first to find that omitting brain allometry leads to over or underestimating sex differences in variance in some regions. In light of our findings, we suggest that brain allometry be considered to provide unbiased estimates of age, sex, and TBV effects and interactions across all brain regions, and suggest that previous sex differences in variance relative to brain size be reexamined with brain allometry (Ritchie et al., 2018; Wierenga et al., 2018, 2019, 2020).

3.6. Limitations

In light of the “healthy volunteer” selection bias and older age range of the UK Biobank (Fry et al., 2017), this article is limited in its capacity to generalize its findings and neuroanatomical norms and markers to other age groups or to the UK population. These norms and markers are also not independent of ethnicity or highest level of education attained, factors thought to influence neuroanatomical measures (Shen et al., 2017; Tang et al., 2018). Future studies should investigate whether the present neuroanatomical markers vary as a function of ethnicity and level of education. If the aim is to generalize these findings to the UK population, we suggest that neuroanatomical markers be created with a representative sample or that weights be used to adjust the phenotypic measures of the UK Biobank to match those of the UK population.

While numerous studies focus on developing machine learning algorithms (e.g., SVM classifications) to generate neuroanatomical markers, we were interested in identifying sex, age, and TCM effects while considering their potential interactions, which are often omitted in the literature. By opting for a regression approach, we were able to quantify effects and interactions for each region, which would have been lost with machine learning. Future studies would nevertheless benefit from examining the age, sex, and global brain effects and interactions of other anatomical measures, such as diffusion tractography, and functional measures in the UK Biobank.

While the present study modeled age as a linear and quadratic function, other studies examining sex and age interactions used the nonparametric local smoothing technique (i.e., smoothing splines; Fjell et al., 2013; Vinke et al., 2018), which are thought to be more predictive of individual trajectories and less vulnerable to sampling range (Fjell & Walhovd, 2010). However, splines have a tendency to overfit the data by modeling inflections where there is no reason to expect any and these inflections may not be replicable in a different sample. Thus, we avoided the risk of overfitting by taking nonlinearities into account with quadratic age. We additionally reported relatively few cases of nonlinear age effects, both visually and as significant quadratic effects. While 25% of regions significantly varied with quadratic age, only 1% of all global and regional measures had an age2 effect sizes over 1, suggesting that significant quadratic effects deviate little from linearity. In light of the risk of overfitting with spline regressions and the lack of observable complex nonlinear variations in cerebral measures with age, modeling nonlinear age effects with age2 appears to be sufficient to address our paper's goal and its simplicity makes it the most parsimonious choice.

4. CONCLUSION

The present study is the first to generate neuroanatomical norms in the UK Biobank for other regions than the hippocampus (Nobis et al., 2019) and the largest analysis to date of the age, sex, and TCM effects and interactions on global and regional brain volumes, cortical mean thicknesses, and cortical surface areas. We provide further evidence that brain allometry is a common property of the brain that should be considered to report unbiased estimates of age, sex, and TCM effects and interactions. By generating volumetric and allometric norms in the UK Biobank, we pave the way for future research to examine the associations between these markers and the behavioral and cognitive traits available in the UK Biobank. Once associations between these UK Biobank neuroanatomical norms and cognitive and behavioral measures are established, researchers will be able to examine the extent to which these neuroanatomical markers mediate the effect that genes and the environment have on these traits. This line of research will play critical role in our understanding of the influence that neuroanatomy has on who we are.

CONFLICT OF INTERESTS

On behalf of all authors, the corresponding author states that there is no conflict of interest.

ETHICS STATEMENT

The UK Biobank received ethical approval from the Research Ethics Committee (reference 11/NW/0382). More information can be found here: https://www.ukbiobank.ac.uk/learn‐more‐about‐uk‐biobank/about‐us/ethics. All participants provided informed consent (“Resources tab” at https://biobank.ctsu.ox.ac.uk/crystal/field.cgi?id=200). The present study was conducted based on application 46 007.

Supporting information

AppendixS1: Supporting information

Table A

Table B

Table C

Table D

Table E

Table F

Table G

Table H

ACKNOWLEDGMENTS

This work received support under the program “Investissements d'Avenir” launched by the French Government and implemented by l'Agence Nationale de la Recherche (ANR) with the references ANR‐17‐EURE‐0017 and ANR‐10‐IDEX‐0001‐02 PSL. Funding was also obtained from Fondation pour l'Audition (FPA RD‐2016‐8 research grant). This research has been conducted using the UK Biobank Resource.

Williams, C. M., Peyre, H., Toro, R., & Ramus, F. (2021). Neuroanatomical norms in the UK Biobank: The impact of allometric scaling, sex, and age. Human Brain Mapping, 42(14), 4623–4642. 10.1002/hbm.25572

Funding information Agence Nationale de la Recherche, Grant/Award Numbers: ANR‐10‐IDEX‐0001‐02 PSL, ANR‐17‐EURE‐0017; Fondation pour l'Audition, Grant/Award Number: FPA RD‐2016‐8

DATA AVAILABILITY STATEMENT

This research has been conducted using data from UK Biobank, a major biomedical database (http://www.ukbiobank.ac.uk/). Restrictions apply to the availability of these data, which were used under license for this study: application 46007. Preregistration and code are available on OSF: https://osf.io/s4qc5/.

REFERENCES

  1. Alfaro‐Almagro, F., Jenkinson, M., Bangerter, N. K., Andersson, J. L. R., Griffanti, L., Douaud, G., … Smith, S. M. (2018). Image processing and quality control for the first 10,000 brain imaging datasets from UK Biobank. NeuroImage, 166, 400–424. 10.1016/j.neuroimage.2017.10.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Beam, C. R., Kaneshiro, C., Jang, J. Y., Reynolds, C. A., Pedersen, N. L., & Gatz, M. (2018). Differences between women and men in incidence rates of dementia and Alzheimer's disease. Journal of Alzheimer's Disease, 64(4), 1077–1083. 10.3233/JAD-180141 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Beck, A. T., Steer, R. A., & Garbin, M. G. (1988). Psychometric properties of the Beck Depression Inventory: Twenty‐five years of evaluation. Clinical Psychology Review, 8(1), 77–100. [Google Scholar]
  4. Bernard, J. A., & Seidler, R. D. (2013). Relationships between regional cerebellar volume and sensorimotor and cognitive function in young and older adults. The Cerebellum, 12(5), 721–737. 10.1007/s12311-013-0481-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Boyd, A., van de Velde, S., Vilagut, G., de Graaf, R., O'Neill, S., Florescu, S., … Kovess‐Masfety, V. (2015). Gender differences in mental disorders and suicidality in Europe: Results from a large cross‐sectional population‐based study. Journal of Affective Disorders, 173, 245–254. 10.1016/j.jad.2014.11.002 [DOI] [PubMed] [Google Scholar]
  6. Branch, C. L., Sonnenberg, B. R., Pitera, A. M., Benedict, L. M., Kozlovsky, D. Y., Bridge, E. S., & Pravosudov, V. V. (2020). Testing the greater male variability phenomenon: Male mountain chickadees exhibit larger variation in reversal learning performance compared with females. Proceedings of the Royal Society B: Biological Sciences, 287(1931), 20200895. 10.1098/rspb.2020.0895 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Carass, A., Cuzzocreo, J. L., Han, S., Hernandez‐Castillo, C. R., Rasser, P. E., Ganz, M., … Prince, J. L. (2018). Comparing fully automated state‐of‐the‐art cerebellum parcellation from magnetic resonance images. NeuroImage, 183, 150–172. 10.1016/j.neuroimage.2018.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chen, X., Sachdev, P. S., Wen, W., & Anstey, K. J. (2007). Sex differences in regional gray matter in healthy individuals aged 44–48 years: A voxel‐based morphometric study. NeuroImage, 36(3), 691–699. 10.1016/j.neuroimage.2007.03.063 [DOI] [PubMed] [Google Scholar]
  9. Costa, P. T., Jr., & McCrae, R. R. (2008). The revised NEO Personality Inventory (NEO‐PI‐R). In The SAGE handbook of personality theory and assessment, Vol 2: Personality measurement and testing (pp. 179–198). Thousand Oaks, CA, US: Sage Publications. 10.4135/9781849200479.n9 [DOI] [Google Scholar]
  10. Cox, S. R., Ritchie, S. J., Fawns‐Ritchie, C., Tucker‐Drob, E. M., & Deary, I. J. (2019). Structural brain imaging correlates of general intelligence in UK Biobank. Intelligence, 76, 101376. 10.1016/j.intell.2019.101376 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Cox, S. R., Bastin, M. E., Ritchie, S. J., Dickie, D. A., Liewald, D. C., Muñoz Maniega, S., … Deary, I. J. (2018). Brain cortical characteristics of lifetime cognitive ageing. Brain Structure and Function, 223(1), 509–518. 10.1007/s00429-017-1505-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Dallaire‐Théroux, C., Callahan, B. L., Potvin, O., Saikali, S., & Duchesne, S. (2017). Radiological‐pathological correlation in Alzheimer's disease: Systematic review of antemortem magnetic resonance imaging findings. Journal of Alzheimer's Disease, 57(2), 575–601. 10.3233/JAD-161028 [DOI] [PubMed] [Google Scholar]
  13. de Jong, L. W., Vidal, J.‐S., Forsberg, L. E., Zijdenbos, A. P., Haight, T., Sigurdsson, S., … Launer, L. J. (2017). Allometric scaling of brain regions to intra‐cranial volume: An epidemiological MRI study. Human Brain Mapping, 38(1), 151–164. 10.1002/hbm.23351 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Deary, I. J. (2010). Cognitive epidemiology: Its rise, its current issues, and its challenges. Personality and Individual Differences, 49(4), 337–343. 10.1016/j.paid.2009.11.012 [DOI] [Google Scholar]
  15. DeCasien, A. R., Sherwood, C. C., Schapiro, S. J., & Higham, J. P. (2020). Greater variability in chimpanzee (Pan troglodytes) brain structure among males. Proceedings of the Royal Society B: Biological Sciences, 287(1925), 20192858. 10.1098/rspb.2019.2858 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Diedrichsen, J., Balsters, J. H., Flavell, J., Cussans, E., & Ramnani, N. (2009). A probabilistic MR atlas of the human cerebellum. NeuroImage, 46(1), 39–46. 10.1016/j.neuroimage.2009.01.045 [DOI] [PubMed] [Google Scholar]
  17. Ellis, H. (1894). The variational tendency of men. In Man and woman: A study of human secondary sexual characters (1st ed., pp. 358–372). London, England: Walter Scott; Retrieved from https://archive.org/details/manandwomanastu00elligoog/page/n16/mode/2up [Google Scholar]
  18. Finlay, B. L., Darlington, R. B., & Nicastro, N. (2001). Developmental structure in brain evolution. The Behavioral and Brain Sciences, 24(2), 263–278; discussion 278‐308. 10.1017/S0140525X01003958 [DOI] [PubMed] [Google Scholar]
  19. Fish, A. M., Cachia, A., Fischer, C., Mankiw, C., Reardon, P. K., Clasen, L. S., … Raznahan, A. (2017). Influences of brain size, sex, and sex chromosome complement on the architecture of human cortical folding. Cerebral Cortex, 27(12), 5557–5567. 10.1093/cercor/bhw323 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Fiske, A., Wetherell, J. L., & Gatz, M. (2009). Depression in older adults. Annual Review of Clinical Psychology, 5(1), 363–389. 10.1146/annurev.clinpsy.032408.153621 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Fjell, A. M., & Walhovd, K. B. (2010). Structural brain changes in aging: Courses, causes and cognitive consequences. Reviews in the Neurosciences, 21(3), 187–221. 10.1515/revneuro.2010.21.3.187 [DOI] [PubMed] [Google Scholar]
  22. Fjell, A. M., Westlye, L. T., Amlien, I., Espeseth, T., Reinvang, I., Raz, N., … Walhovd, K. B. (2009). Minute effects of sex on the aging brain: A multisample magnetic resonance imaging study of healthy aging and Alzheimer's disease. Journal of Neuroscience, 29(27), 8774–8783. 10.1523/JNEUROSCI.0115-09.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fjell, A. M., Westlye, L. T., Grydeland, H., Amlien, I., Espeseth, T., Reinvang, I., … Walhovd, K. B. (2013). Critical ages in the life course of the adult brain: Nonlinear subcortical aging. Neurobiology of Aging, 34(10), 2239–2247. 10.1016/j.neurobiolaging.2013.04.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Fox, J., Weisberg, S., Price, B., Adler, D., Bates, D., Baud‐Bovy, G., … R‐Core . (2020). car: Companion to Applied Regression (3.0‐10) [Computer software]. Retrieved from https://CRAN.R-project.org/package=car
  25. Fry, A., Littlejohns, T. J., Sudlow, C., Doherty, N., Adamska, L., Sprosen, T., … Allen, N. E. (2017). Comparison of sociodemographic and health‐related characteristics of UK Biobank participants with those of the general population. American Journal of Epidemiology, 186(9), 1026–1034. 10.1093/aje/kwx246 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Gong, Q., Scarpazza, C., Dai, J., He, M., Xu, X., Shi, Y., … Mechelli, A. (2019). A transdiagnostic neuroanatomical signature of psychiatric illness. Neuropsychopharmacology, 44(5), 869–875. 10.1038/s41386-018-0175-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gur, R. C., Turetsky, B. I., Matsui, M., Yan, M., Bilker, W., Hughett, P., & Gur, R. E. (1999). Sex differences in brain gray and white matter in healthy Young adults: Correlations with cognitive performance. Journal of Neuroscience, 19(10), 4065–4072. 10.1523/JNEUROSCI.19-10-04065.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Han, L., Chen, L., Bin, Y., Cun, D., Yu‐chen, H., & Xin, J. (2019). Deep feature combination based multi‐model wind power prediction. 2019 IEEE 2nd International Conference on Computer and Communication Engineering Technology (CCET) . 10.1109/CCET48361.2019.8989358 [DOI]
  29. Han, S., An, Y., Carass, A., Prince, J. L., & Resnick, S. M. (2020). Longitudinal analysis of regional cerebellum volumes during normal aging. NeuroImage, 220, 117062. 10.1016/j.neuroimage.2020.117062 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Han, S., Carass, A., He, Y., & Prince, J. L. (2020). Automatic cerebellum anatomical parcellation using U‐net with locally constrained optimization. NeuroImage, 218, 116819. 10.1016/j.neuroimage.2020.116819 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hogstrom, L. J., Westlye, L. T., Walhovd, K. B., & Fjell, A. M. (2013). The structure of the cerebral cortex across adult life: Age‐related patterns of surface area, thickness, and gyrification. Cerebral Cortex, 23(11), 2521–2530. 10.1093/cercor/bhs231 [DOI] [PubMed] [Google Scholar]
  32. Hoogendam, Y. Y., van der Geest, J. N., van der Lijn, F., van der Lugt, A., Niessen, W. J., Krestin, G. P., … Ikram, M. A. (2012). Determinants of cerebellar and cerebral volume in the general elderly population. Neurobiology of Aging, 33(12), 2774–2781. 10.1016/j.neurobiolaging.2012.02.012 [DOI] [PubMed] [Google Scholar]
  33. Hurtz, S., Woo, E., Kebets, V., Green, A. E., Zoumalan, C., Wang, B., … Apostolova, L. G. (2014). Age effects on cortical thickness in cognitively normal elderly individuals. Dementia and Geriatric Cognitive Disorders EXTRA, 4(2), 221–227. 10.1159/000362872 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Im, K., Lee, J.‐M., Lyttelton, O., Kim, S. H., Evans, A. C., & Kim, S. I. (2008). Brain size and cortical structure in the adult human brain. Cerebral Cortex, 18(9), 2181–2191. 10.1093/cercor/bhm244 [DOI] [PubMed] [Google Scholar]
  35. Jäncke, L., Liem, F., & Merillat, S. (2019). Scaling of brain compartments to brain size. Neuroreport, 30(8), 573–579. 10.1097/WNR.0000000000001249 [DOI] [PubMed] [Google Scholar]
  36. Jäncke, L., Mérillat, S., Liem, F., & Hänggi, J. (2015). Brain size, sex, and the aging brain. Human Brain Mapping, 36(1), 150–169. 10.1002/hbm.22619 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Jellinger, K. A., & Attems, J. (2015). Challenges of multimorbidity of the aging brain: A critical update. Journal of Neural Transmission (Vienna, Austria: 1996), 122(4), 505–521. 10.1007/s00702-014-1288-x [DOI] [PubMed] [Google Scholar]
  38. Johnson, W., Carothers, A., & Deary, I. J. (2008). Sex differences in variability in general intelligence: A new look at the old question. Perspectives on Psychological Science, 3(6), 518–531. 10.1111/j.1745-6924.2008.00096.x [DOI] [PubMed] [Google Scholar]
  39. Ju, C., Duan, Y., & You, X. (2015). Retesting the greater male variability hypothesis in mainland China: A cross‐regional study. Personality and Individual Differences, 72, 85–89. 10.1016/j.paid.2014.07.021 [DOI] [Google Scholar]
  40. Jumah, F., Ghannam, M., Jaber, M., Adeeb, N., & Tubbs, R. S. (2016). Neuroanatomical variation in autism spectrum disorder: A comprehensive review. Clinical Anatomy, 29(4), 454–465. 10.1002/ca.22717 [DOI] [PubMed] [Google Scholar]
  41. Kaczkurkin, A. N., Raznahan, A., & Satterthwaite, T. D. (2019). Sex differences in the developing brain: Insights from multimodal neuroimaging. Neuropsychopharmacology, 44(1), 71–85. 10.1038/s41386-018-0111-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Karwowski, M., Jankowska, D. M., Gajda, A., Marczak, M., Groyecka, A., & Sorokowski, P. (2016). Greater male variability in creativity outside the WEIRD world. Creativity Research Journal, 28(4), 467–470. 10.1080/10400419.2016.1229978 [DOI] [Google Scholar]
  43. Koppelmans, V., Hoogendam, Y. Y., Hirsiger, S., Mérillat, S., Jäncke, L., & Seidler, R. D. (2017). Regional cerebellar volumetric correlates of manual motor and cognitive function. Brain Structure and Function, 222(4), 1929–1944. 10.1007/s00429-016-1317-7 [DOI] [PubMed] [Google Scholar]
  44. Kurth, F., Cherbuin, N., & Luders, E. (2017). The impact of aging on subregions of the hippocampal complex in healthy adults. NeuroImage, 163, 296–300. 10.1016/j.neuroimage.2017.09.016 [DOI] [PubMed] [Google Scholar]
  45. Lefebvre, A., Beggiato, A., Bourgeron, T., & Toro, R. (2015). Neuroanatomical diversity of corpus callosum and brain volume in autism: Meta‐analysis, analysis of the autism brain imaging data exchange project, and simulation. Biological Psychiatry, 78(2), 126–134. 10.1016/j.biopsych.2015.02.010 [DOI] [PubMed] [Google Scholar]
  46. Lehre, A.‐C., Lehre, K. P., Laake, P., & Danbolt, N. C. (2009). Greater intrasex phenotype variability in males than in females is a fundamental aspect of the gender differences in humans. Developmental Psychobiology, 51(2), 198–206. 10.1002/dev.20358 [DOI] [PubMed] [Google Scholar]
  47. Lemaitre, H., Goldman, A., Sambataro, F., Verchinski, B., Meyer‐Lindenberg, A., Weinberger, D., & Mattay, V. (2012). Normal age‐related brain morphometric changes: Nonuniformity across cortical thickness, surface area and grey matter volume? Neurobiology of Aging, 33(3), 617.e1–617.e9. 10.1016/j.neurobiolaging.2010.07.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Liu, D., Johnson, H. J., Long, J. D., Magnotta, V. A., & Paulsen, J. S. (2014). The power‐proportion method for intracranial volume correction in volumetric imaging analysis. Frontiers in Neuroscience, 8, 356. 10.3389/fnins.2014.00356 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Liu, S., Seidlitz, J., Blumenthal, J. D., Clasen, L. S., & Raznahan, A. (2020). Integrative structural, functional, and transcriptomic analyses of sex‐biased brain organization in humans. Proceedings of the National Academy of Sciences of the United States of America, 117(31), 18788–18798. 10.1073/pnas.1919091117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Long, X., Liao, W., Jiang, C., Liang, D., Qiu, B., & Zhang, L. (2012). Healthy aging: An automatic analysis of global and regional morphological alterations of human brain. Academic Radiology, 19(7), 785–793. 10.1016/j.acra.2012.03.006 [DOI] [PubMed] [Google Scholar]
  51. Lotze, M., Domin, M., Gerlach, F. H., Gaser, C., Lueders, E., Schmidt, C. O., & Neumann, N. (2019). Novel findings from 2,838 adult brains on sex differences in gray matter brain volume. Scientific Reports, 9(1), 1–7. 10.1038/s41598-018-38239-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Luders, E., Gaser, C., Narr, K. L., & Toga, A. W. (2009). Why sex matters: Brain size independent differences in gray matter distributions between men and women. The Journal of Neuroscience, 29(45), 14265–14270. 10.1523/JNEUROSCI.2261-09.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Malykhin, N. V., Huang, Y., Hrybouski, S., & Olsen, F. (2017). Differential vulnerability of hippocampal subfields and anteroposterior hippocampal subregions in healthy cognitive aging. Neurobiology of Aging, 59, 121–134. 10.1016/j.neurobiolaging.2017.08.001 [DOI] [PubMed] [Google Scholar]
  54. Mankiw, C., Park, M. T. M., Reardon, P. K., Fish, A. M., Clasen, L. S., Greenstein, D., … Raznahan, A. (2017). Allometric analysis detects brain size‐independent effects of sex and sex chromosome complement on human cerebellar organization. The Journal of Neuroscience, 37(21), 5221–5231. 10.1523/JNEUROSCI.2158-16.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Marek, S., Tervo‐Clemmens, B., Calabro, F. J., Montez, D. F., Kay, B. P., Hatoum, A. S., … Dosenbach, N. U. F. (2020). Towards reproducible brain‐wide association studies. BioRxiv. 2020.08.21.257758. 10.1101/2020.08.21.257758 [DOI]
  56. Matsuo, K., Harada, K., Fujita, Y., Okamoto, Y., Ota, M., Narita, H., … Watanabe, Y. (2019). Distinctive neuroanatomical substrates for depression in bipolar disorder versus major depressive disorder. Cerebral Cortex, 29(1), 202–214. 10.1093/cercor/bhx319 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. McGinnis, S. M., Brickhouse, M., Pascual, B., & Dickerson, B. C. (2011). Age‐related changes in the thickness of cortical zones in humans. Brain Topography, 24(3), 279–291. 10.1007/s10548-011-0198-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Miller, K. L., Alfaro‐Almagro, F., Bangerter, N. K., Thomas, D. L., Yacoub, E., Xu, J., … Smith, S. M. (2016). Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nature Neuroscience, 19(11), 1523–1536. 10.1038/nn.4393 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Morey, R. A., Petty, C. M., Xu, Y., Hayes, J. P., Wagner, H. R., Lewis, D. V., … McCarthy, G. (2009). A comparison of automated segmentation and manual tracing for quantifying hippocampal and amygdala volumes. NeuroImage, 45(3), 855–866. 10.1016/j.neuroimage.2008.12.033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Morey, R. A., Selgrade, E. S., Wagner, H. R., Huettel, S. A., Wang, L., & McCarthy, G. (2010). Scan‐rescan reliability of subcortical brain volumes derived from automated segmentation. Human Brain Mapping, 31(11), 1751–1762. 10.1002/hbm.20973 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Narvacan, K., Treit, S., Camicioli, R., Martin, W., & Beaulieu, C. (2017). Evolution of deep gray matter volume across the human lifespan. Human Brain Mapping, 38(8), 3771–3790. 10.1002/hbm.23604 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Nobis, L., Manohar, S. G., Smith, S. M., Alfaro‐Almagro, F., Jenkinson, M., Mackay, C. E., & Husain, M. (2019). Hippocampal volume across age: Nomograms derived from over 19,700 people in UK Biobank. NeuroImage: Clinical, 23, 101904. 10.1016/j.nicl.2019.101904 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Nordenskjöld, R., Malmberg, F., Larsson, E.‐M., Simmons, A., Ahlström, H., Johansson, L., & Kullberg, J. (2015). Intracranial volume normalization methods: Considerations when investigating gender differences in regional brain volume. Psychiatry Research: Neuroimaging, 231(3), 227–235. 10.1016/j.pscychresns.2014.11.011 [DOI] [PubMed] [Google Scholar]
  64. Oakes, P., Loukas, M., Oskouian, R. J., & Tubbs, R. S. (2017). The neuroanatomy of depression: A review. Clinical Anatomy, 30(1), 44–49. 10.1002/ca.22781 [DOI] [PubMed] [Google Scholar]
  65. O'Brien, L. M., Ziegler, D. A., Deutsch, C. K., Frazier, J. A., Herbert, M. R., & Locascio, J. J. (2011). Statistical adjustments for brain size in volumetric neuroimaging studies: Some practical implications in methods. Psychiatry Research, 193(2), 113–122. 10.1016/j.pscychresns.2011.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. O'Brien, L. M., Ziegler, D. A., Deutsch, C. K., Kennedy, D. N., Goldstein, J. M., Seidman, L. J., … Herbert, M. R. (2006). Adjustment for whole brain and cranial size in volumetric brain studies: A review of common adjustment factors and statistical methods. Harvard Review of Psychiatry, 14(3), 141–151. 10.1080/10673220600784119 [DOI] [PubMed] [Google Scholar]
  67. Peyre, H., Mohanpuria, N., Jednoróg, K., Heim, S., Grande, M., van Ermingen‐Marbach, M., … Ramus, F. (2020). Neuroanatomy of dyslexia: An allometric approach. European Journal of Neuroscience, 52(6), 3595–3609. [DOI] [PubMed] [Google Scholar]
  68. Pintzka, C. W. S., Hansen, T. I., Evensmoen, H. R., & Håberg, A. K. (2015). Marked effects of intracranial volume correction methods on sex differences in neuroanatomical structures: A HUNT MRI study. Frontiers in Neuroscience, 9, 238. 10.3389/fnins.2015.00238 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. R Core Team . (2019). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; Retrieved from https://www.R-project.org/ [Google Scholar]
  70. Ramus, F., Altarelli, I., Jednoróg, K., Zhao, J., & Scotto di Covella, L. (2018). Neuroanatomy of developmental dyslexia: Pitfalls and promise. Neuroscience & Biobehavioral Reviews, 84, 434–452. 10.1016/j.neubiorev.2017.08.001 [DOI] [PubMed] [Google Scholar]
  71. Raz, N., Lindenberger, U., Rodrigue, K. M., Kennedy, K. M., Head, D., Williamson, A., … Acker, J. D. (2005). Regional brain changes in aging healthy adults: General trends, individual differences and modifiers. Cerebral Cortex, 15(11), 1676–1689. 10.1093/cercor/bhi044 [DOI] [PubMed] [Google Scholar]
  72. Reardon, P. K., Clasen, L., Giedd, J. N., Blumenthal, J., Lerch, J. P., Chakravarty, M. M., & Raznahan, A. (2016). An allometric analysis of sex and sex chromosome dosage effects on subcortical anatomy in humans. The Journal of Neuroscience, 36(8), 2438–2448. 10.1523/JNEUROSCI.3195-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Reardon, P. K., Seidlitz, J., Vandekar, S., Liu, S., Patel, R., Park, M. T. M., … Raznahan, A. (2018). Normative brain size variation and brain shape diversity in humans. Science, 360(6394), 1222–1227. 10.1126/science.aar2578 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Ritchie, S. J., Cox, S. R., Shen, X., Lombardo, M. V., Reus, L. M., Alloza, C., … Deary, I. J. (2018). Sex differences in the adult human brain: Evidence from 5216 UK Biobank participants. Cerebral Cortex, 28(8), 2959–2975. 10.1093/cercor/bhy109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Ruigrok, A. N. V., Salimi‐Khorshidi, G., Lai, M.‐C., Baron‐Cohen, S., Lombardo, M. V., Tait, R. J., & Suckling, J. (2014). A meta‐analysis of sex differences in human brain structure. Neuroscience & Biobehavioral Reviews, 39, 34–50. 10.1016/j.neubiorev.2013.12.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Salat, D. H., Buckner, R. L., Snyder, A. Z., Greve, D. N., Desikan, R. S. R., Busa, E., … Fischl, B. (2004). Thinning of the cerebral cortex in aging. Cerebral Cortex, 14(7), 721–730. 10.1093/cercor/bhh032 [DOI] [PubMed] [Google Scholar]
  77. Sanchis‐Segura, C., Ibañez‐Gual, M. V., Adrián‐Ventura, J., Aguirre, N., Gómez‐Cruz, Á. J., Avila, C., & Forn, C. (2019). Sex differences in gray matter volume: How many and how large are they really? Biology of Sex Differences, 10(1), 32. 10.1186/s13293-019-0245-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Schmidt, C. K., Khalid, S., Loukas, M., & Tubbs, R. S. (2018). Neuroanatomy of anxiety: A brief review. Cureus, 10(1), e2055. 10.7759/cureus.2055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Schoemaker, D., Buss, C., Head, K., Sandman, C. A., Davis, E. P., Chakravarty, M. M., … Pruessner, J. (2016). Hippocampus and amygdala volumes from magnetic resonance images in children: Assessing accuracy of Freesurfer and FSL against manual segmentation. NeuroImage, 129, 1–14. 10.1016/j.neuroimage.2016.01.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Seedat, S., Scott, K. M., Angermeyer, M. C., Berglund, P., Bromet, E. J., Brugha, T. S., … Kessler, R. C. (2009). Cross‐national associations between gender and mental disorders in the World Health Organization World Mental Health Surveys. Archives of General Psychiatry, 66(7), 785–795. 10.1001/archgenpsychiatry.2009.36 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Sele, S., Liem, F., Mérillat, S., & Jäncke, L. (2020). Decline variability of cortical and subcortical regions in aging: A longitudinal study. Frontiers in Human Neuroscience, 14, 363. 10.3389/fnhum.2020.00363 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Shen, X., Reus, L. M., Cox, S. R., Adams, M. J., Liewald, D. C., Bastin, M. E., … McIntosh, A. M. (2017). Subcortical volume and white matter integrity abnormalities in major depressive disorder: Findings from UK Biobank imaging data. Scientific Reports, 7(1), 5547. 10.1038/s41598-017-05507-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Stonnington, C. M., Tan, G., Klöppel, S., Chu, C., Draganski, B., Jack, C. R., … Frackowiak, R. S. J. (2008). Interpreting scan data acquired from multiple scanners: A study with Alzheimer's disease. NeuroImage, 39(3), 1180–1185. 10.1016/j.neuroimage.2007.09.066 [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Storsve, A. B., Fjell, A. M., Tamnes, C. K., Westlye, L. T., Overbye, K., Aasland, H. W., & Walhovd, K. B. (2014). Differential longitudinal changes in cortical thickness, surface area and volume across the adult life span: Regions of accelerating and decelerating change. The Journal of Neuroscience, 34(25), 8488–8498. 10.1523/JNEUROSCI.0391-14.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Sudlow, C., Gallacher, J., Allen, N., Beral, V., Burton, P., Danesh, J., … Collins, R. (2015). UK Biobank: An open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS Medicine, 12(3), e1001779. 10.1371/journal.pmed.1001779 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Szucs, D., & Ioannidis, J. P. A. (2017). When null hypothesis significance testing is unsuitable for research: A reassessment. Frontiers in Human Neuroscience, 11, 390. 10.3389/fnhum.2017.00390 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Szucs, D., & Ioannidis, J. P. A. (2020). Sample size evolution in neuroimaging research: An evaluation of highly‐cited studies (1990–2012) and of latest practices (2017–2018) in high‐impact journals. NeuroImage, 221, 117–164. 10.1016/j.neuroimage.2020.117164 [DOI] [PubMed] [Google Scholar]
  88. Takao, H., Hayashi, N., & Ohtomo, K. (2013). Effects of the use of multiple scanners and of scanner upgrade in longitudinal voxel‐based morphometry studies. Journal of Magnetic Resonance Imaging, 38(5), 1283–1291. 10.1002/jmri.24038 [DOI] [PubMed] [Google Scholar]
  89. Tan, A., Ma, W., Vira, A., Marwha, D., & Eliot, L. (2016). The human hippocampus is not sexually‐dimorphic: Meta‐analysis of structural MRI volumes. NeuroImage, 124(Pt A), 350–366. 10.1016/j.neuroimage.2015.08.050 [DOI] [PubMed] [Google Scholar]
  90. Tang, Y., Zhao, L., Lou, Y., Shi, Y., Fang, R., Lin, X., … Toga, A. (2018). Brain structure differences between Chinese and Caucasian cohorts: A comprehensive morphometry study. Human Brain Mapping, 39(5), 2147–2155. 10.1002/hbm.23994 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Toro, R. (2020). Cortical figures generator. Retrieved from https://r03ert0.github.io/cortex/
  92. Toro, R., Chupin, M., Garnero, L., Leonard, G., Perron, M., Pike, B., … Paus, T. (2009). Brain volumes and Val66Met polymorphism of the BDNF gene: Local or global effects? Brain Structure & Function, 213(6), 501–509. 10.1007/s00429-009-0203-y [DOI] [PubMed] [Google Scholar]
  93. Toro, R., Perron, M., Pike, B., Richer, L., Veillette, S., Pausova, Z., & Paus, T. (2008). Brain size and folding of the human cerebral cortex. Cerebral Cortex, 18(10), 2352–2357. 10.1093/cercor/bhm261 [DOI] [PubMed] [Google Scholar]
  94. van Eijk, L., Hansell, N. K., Strike, L. T., Couvy‐Duchesne, B., de Zubicaray, G. I., Thompson, P. M., … Wright, M. J. (2020). Region‐specific sex differences in the hippocampus. NeuroImage, 215, 116781. 10.1016/j.neuroimage.2020.116781 [DOI] [PubMed] [Google Scholar]
  95. van Velsen, E. F. S., Vernooij, M. W., Vrooman, H. A., van der Lugt, A., Breteler, M. M. B., Hofman, A., … Ikram, M. A. (2013). Brain cortical thickness in the general elderly population: The Rotterdam Scan Study. Neuroscience Letters, 550, 189–194. 10.1016/j.neulet.2013.06.063 [DOI] [PubMed] [Google Scholar]
  96. Vinke, E. J., de Groot, M., Venkatraghavan, V., Klein, S., Niessen, W. J., Ikram, M. A., & Vernooij, M. W. (2018). Trajectories of imaging markers in brain aging: The Rotterdam Study. Neurobiology of Aging, 71, 32–40. 10.1016/j.neurobiolaging.2018.07.001 [DOI] [PubMed] [Google Scholar]
  97. Wechsler, D., Psychological Corporation , & PsychCorp (Firm) . (2008). WAIS‐IV technical and interpretive manual. San Antonio, TX: Pearson. [Google Scholar]
  98. Wierenga, L. M., Bos, M. G. N., van Rossenberg, F., & Crone, E. A. (2019). Sex effects on development of brain structure and executive functions: Greater variance than mean effects. Journal of Cognitive Neuroscience, 31(5), 730–753. 10.1162/jocn_a_01375 [DOI] [PubMed] [Google Scholar]
  99. Wierenga, L. M., Doucet, G. E., Dima, D., Agartz, I., Aghajani, M., Akudjedu, T. N., … Tamnes, C. K. (2020). Greater male than female variability in regional brain structure across the lifespan. In BioRxiv (p. 2020.02.17.952010). [DOI] [PMC free article] [PubMed]
  100. Wierenga, L. M., Langen, M., Ambrosino, S., van Dijk, S., Oranje, B., & Durston, S. (2014). Typical development of basal ganglia, hippocampus, amygdala and cerebellum from age 7 to 24. NeuroImage, 96, 67–72. 10.1016/j.neuroimage.2014.03.072 [DOI] [PubMed] [Google Scholar]
  101. Wierenga, L. M., Sexton, J. A., Laake, P., Giedd, J. N., Tamnes, C. K., & Pediatric Imaging, Neurocognition, and Genetics Study . (2018). A key characteristic of sex differences in the developing brain: Greater variability in brain structure of boys than girls. Cerebral Cortex, 28(8), 2741–2751. 10.1093/cercor/bhx154 [DOI] [PMC free article] [PubMed] [Google Scholar]
  102. Williams, C. M., Peyre, H., Toro, R., Beggiato, A., & Ramus, F. (2020). Adjusting for allometric scaling in ABIDE I challenges subcortical volume differences in autism spectrum disorder. Human Brain Mapping, 41, 4610–4629. 10.1002/hbm.25145 [DOI] [PMC free article] [PubMed] [Google Scholar]
  103. Wisse, L. E. M., Chételat, G., Daugherty, A. M., de Flores, R., la Joie, R., Mueller, S. G., … Carr, V. A. (2021). Hippocampal subfield volumetry from structural isotropic 1 mm3 MRI scans: A note of caution. Human Brain Mapping, 42(2), 539–550. 10.1002/hbm.25234 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

AppendixS1: Supporting information

Table A

Table B

Table C

Table D

Table E

Table F

Table G

Table H

Data Availability Statement

This research has been conducted using data from UK Biobank, a major biomedical database (http://www.ukbiobank.ac.uk/). Restrictions apply to the availability of these data, which were used under license for this study: application 46007. Preregistration and code are available on OSF: https://osf.io/s4qc5/.


Articles from Human Brain Mapping are provided here courtesy of Wiley

RESOURCES