Abstract
There remains little consensus about the relationship between sex and brain structure, particularly in childhood. Moreover, few pediatric neuroimaging studies have analyzed both sex and gender as variables of interest - many of which included small sample sizes and relied on binary definitions of gender. The current study examined gender diversity with a continuous felt-gender score and categorized sex based on X and Y allele frequency in a large sample of children ages 9–11 years-old (N=7693). Then, a statistical model-building approach was employed to determine whether gender diversity and sex independently or jointly relate to brain morphology, including subcortical volume, cortical thickness, gyrification, and white matter microstructure. The model with sex, but not gender diversity, was the best-fitting model in 75% of gray matter regions and 79% of white matter regions examined. The addition of gender to the sex model explained significantly more variance than sex alone with regard to bilateral cerebellum volume, left precentral cortical thickness, as well as gyrification in the right superior frontal gyrus, right parahippocampal gyrus, and several regions in the left parietal lobe. For mean diffusivity in the left uncinate fasciculus, the model with sex, gender, and their interaction captured the most variance. Nonetheless, the magnitude of variance accounted for by sex was small in all cases and felt-gender score was not a significant predictor on its own for any white or gray matter regions examined. Overall, these findings demonstrate that at ages 9–11 years-old, sex accounts for a small proportion of variance in brain structure, while gender diversity is not directly associated with neurostructural diversity.
Keywords: sex, gender, neuroimaging, neurodevelopment, brain structure
1. Introduction
Although scientific institutions increasingly advise that sex and gender should be disaggregated and studied separately in all human research (Heidari et al., 2016; Mauvais-Jarvis et al., 2020; National Academies of Sciences, 2022; Nielsen et al., 2018; Tannenbaum et al., 2019), few neuroscientific studies to date have included both as variables of interest. As a result, in contemporary human neuroscience literature it remains impossible to parse the extent to which differences between males and females are ascribable to sex versus social-environmental factors like gender (Eliot et al., 2021). In acknowledgement of the current inability to discern the source of the observed variance, many authors have begun to use combined terms such as “gender/sex” or “sex/gender” (Eliot et al., 2021; Franconi et al., 2019; Friedrichs & Kellmeyer, 2022; Hines, 2020; van Anders, 2022). Yet, despite this uncertainty, neuroscientific investigations often portray sex differences as “hardwiring” and imply a causal link between brain structure and genetic or hormonal sex differences (Friedrichs & Kellmeyer, 2022; Persson & Pownall, 2021). Therefore, more research is needed to disentangle the relationships between sex, gender, and brain structure.
Many researchers distinguish sex from gender according to biological versus sociocultural etiology, but this distinction is imperfect (Hyde et al., 2018). For instance, societal norms can affect how sex is interpreted or assigned (Carpenter, 2018; Griffiths, 2018), and gendered behavioral norms - such as those around diet, exercise, and sleep - can impact biology via epigenetics (Mauvais-Jarvis et al., 2020). To standardize research methods, the National Academies of Science, Engineering, and Medicine recently released a report on the measurement of sex and gender, defining sex as: “a multidimensional construct based on a cluster of anatomical and physiological traits that include external genitalia, secondary sex characteristics, gonads, chromosomes, and hormones.” Gender is defined as: “a multidimensional construct that links gender identity, which is a core element of a person’s individual identity; gender expression, which is how a person signals their gender to others through their behavior and appearance (such as hair style and clothing); and cultural expectations about social status, characteristics, and behavior that are associated with sex traits” (emphasis in original; National Academies of Sciences, 2022). Notably, these definitions eschew the typical binary categorizations of sex (male/female) and gender (masculine/feminine) and depart from normative views of gender as an immutable characteristic based on sex, as determined through physical examination (National Academies of Sciences, 2022). The shift in terminology reflects the increased visibility of transgender and intersex communities and the growing use of multi-dimensional gender assessments in research (Jacobson & Joel, 2018; Joel, 2011; Joel et al., 2014, 2015; Llaveria Caselles, 2021; Pelletier et al., 2015; Rippon et al., 2014). Despite these important efforts to discriminate sex- versus gender-related impacts in the biological sciences, few pediatric neuroscience studies to date have included both variables.
Many pediatric neuroimaging studies have reported sex differences in brain structure, albeit with little consensus on the location, magnitude, and potential functional relevance of such findings (DeCasien et al., 2022; Kaczkurkin et al., 2019; Raznahan et al., 2011). Subcortical brain regions such as the hippocampus, amygdala, and thalamus synthesize testosterone and estradiol (Ahmadpour & Grange-Messent, 2021; Azcoitia et al., 2011) and undergo volumetric changes during puberty (Vijayakumar et al., 2016). Nevertheless, meta-analyses of studies on hippocampus and amygdala volume have found no significant sex differences after normalizing for total brain volume (Marwha et al., 2017; A. Tan et al., 2016). Cortical thickness (cortical thickness) - the distance between the white matter boundary and the pial surface (Fischl & Dale, 2000) - decreases during child and adolescent development (Bethlehem et al., 2022), while regional surface area changes via decreases in gyrification index (GI) - the proportion of the gray matter surface located within sulcal folds (Alemán-Gómez et al., 2013; Schaer et al., 2012). Research on the role of sex on these cortical developmental outcomes has yielded conflicting results. Though some authors found sex differences in cortical thickness (Herting et al., 2015; Raznahan et al., 2016; Sepehrband et al., 2018) and gyrification (Fish et al., 2017; G. Li et al., 2014) others have reported no significant sex differences in children and adolescents (Alemán-Gómez et al., 2013; Mills et al., 2014; Vijayakumar et al., 2016; Wierenga et al., 2018). White matter microstructural differences, such as fractional anisotropy (FA) and mean diffusivity (MD), have also been extensively studied between the sexes during childhood. MD measures the average diffusion in three directions, while FA, on the other hand, describes the extent to which local brain structure restricts diffusion (O’Donnell & Westin, 2011). Males generally appear to have higher FA, but lower MD than females throughout the lifespan (Kaczkurkin et al., 2019; Raznahan et al., 2011), though some authors have found no adolescent sex differences in FA and MD (Giorgio et al., 2008, 2010; Lebel et al., 2008; Seunarine et al., 2016). Potential contrasting sex effects in pediatric neuroimaging studies could be due to small sample sizes, wide age ranges, or differences in sample composition. In this regard, the landmark Adolescent Brain Cognitive Development℠ Study (ABCD Study®), the largest long-term study of brain development in the United States (ABCD Study, 2022), enrolled over 11,000 9- and 10-year-old children to further study biological and social factors, like gender on brain development (Volkow et al., 2018). As such, several recent studies using ABCD data have reported significant neuroanatomical sex differences across numerous domains, including subcortical volume (Adeli et al., 2020), cortical thickness (Brennan et al., 2021; Tomasi & Volkow, 2023; Wiglesworth et al., 2023), gray matter density (Murray et al., 2022), white matter microstructure (Lawrence et al., 2021; Tomasi & Volkow, 2023), and differences in the association between brain structure and behavior (Chen et al., 2022; Kim et al., 2022). However, none of these previous studies on sex differences utilizing the ABCD study examined the potential role of gender.
In contrast to sex, gender has infrequently been used as a variable of interest in neuroscience research, let alone in studies focused on structural brain development. Articles mentioning gender often use the term interchangeably with sex, neglect to describe how gender was measured, and use binary, sex-related terminology, such as “male” and ”female”, making it difficult to evaluate (Backeljauw et al., 2015; Bhargava et al., 2021; Coviello et al., 2018; Fassett-Carman et al., 2022; Levman et al., 2017; Ou et al., 2015; Szulc-Lerch et al., 2018; Urben et al., 2017). This conflation exemplifies the lack of a priori model building in the existing neuroimaging literature on gender, which can lead to faulty assumptions of causality (Edmiston & Juster, 2022). For instance, when group differences in brain structure between transgender and cisgender participants are depicted as the result of atypical sexual differentiation (Kreukels & Guillamon, 2016; Kurth et al., 2022; Levin et al., 2022), it demonstrates both the assumption that sex causes any observed differences between sexes, and that gender identity is akin to brain sex. This reduces gender to a binary distinction between cisgender and transgender (Llaveria Caselles, 2021). In this dichotomy, studying the relationship between gender and brain structure in normative development is superfluous since sex and gender are assumed to be synonymous in the typically developing brain. It is therefore unsurprising that the handful of pediatric neuroimaging studies investigating gender and brain structure have largely focused exclusively on transgender children and utilized binary gender classification (Beking et al., 2020; Burke et al., 2014, 2016; Hoekzema et al., 2015; Nota et al., 2017; Skorska et al., 2021; Soleman et al., 2013; Staphorsius et al., 2015).
Moving beyond the cisgender/transgender dichotomy, gender diversity is used to describe variations of gender identity and expression that differ from those stereotypically associated with an individual’s sex assigned at birth (Bölte et al., 2023). Non-dichotomous assessments of both adults and children have demonstrated that gender diversity also exists among cisgender people (Joel et al., 2014; A. Potter et al., 2021). Yet, few pediatric neuroimaging studies to date have used a non-dichotomous measure of gender (Rauch & Eliot, 2022). Using the Child Sex Role Inventory, Wood et al. (2008) found femininity - but not masculinity - was associated with volume in portions of the frontal cortex (n = 74, ages 7.8–17.0) while Belfi et al. (2014), found masculinity - but not femininity - was associated with frontal and temporal cortex volume (n = 108, ages 7–17). In both studies, however, the results were only significant among female participants. Moreover, the Child Sex Role Inventory is adapted for children based on the Bem Sex Role Inventory, which was developed using ratings of the desirability of traits by 100 college students in the 1970s, making its contemporary validity questionable (Choi et al., 2008; Hoffman & Borders, 2001). Recently, Xerxa et al. asked participants whether they “wish to be the opposite sex” and whether they would “rather be treated as someone from the opposite sex?” with responses coded as ordinal variables as well as a binary categorization (e.g., adolescents who reported gender diversity and those who did not). While they found no association between gender diversity and gray matter volume, white matter volume, total brain volume, or regional subcortical volume, they did find that gender diverse male adolescents on average had thicker left inferior temporal cortices (Xerxa et al., 2023). Thus, despite more recent calls for increased attention to disentangling sex and gender in neuroscience research, their respective relationships with typical neurodevelopment remain equivocal. Sex differences studies consistently fail to account for gender as a possible confound, while neuroimaging studies of gender are rare, small, and concentrated on a subpopulation - limiting generalizability. Therefore, future research is needed to determine whether sex, gender, and/or their combination relate to neurodevelopmental outcomes, such as structural brain morphology.
To unify the previously separate lines of inquiry and address the common limitations of the existing sex and gender-based neuroimaging research, this cross-sectional study utilized data from the baseline ABCD Study visit, including multiple neuroimaging modalities, genetic data to assess X and Y allele frequency, and a continuous measure of gender. In order to establish clear, concise, and scientifically sound conceptions of sex and gender, we chose to limit our analysis to one dimension of sex - chromosomal sex - and one continuous measure of gender diversity that assesses how much a participant feels like a boy and how much they feel like a girl. The goal of the current study was to utilize a statistical modeling building approach to systematically examine sex and gender diversity as they relate to brain structure in children ages 9 and 10 years old. Specifically, we examined if sex, gender, or their combination captured the greatest amount of variance in common gray matter macrostructure outcomes, including cortical thickness, subcortical volumes, and gyrification, as well as white matter microstructure, including FA and MD. We hypothesized that we would be able to successfully capture variance uniquely associated with gender in the brain at ages 9–11 years old. Additionally, we anticipated that the best model for capturing outcome variance would differ by region and feature. In particular, since the few non-dichotomous gender studies to date have reported positive associations with the cortex and null findings with regard to the subcortex and global white matter volume, we expected gender to account for more regional variance in cortical architecture than in subcortical volume and white matter microstructure. Due to its documented association with sex hormones (Ahmadpour & Grange-Messent, 2021; Pletzer et al., 2018; Vijayakumar et al., 2021), we expected the subcortex to be primarily associated with sex alone.
2. Materials and Methods
2.1. Participants
Data for this analysis were acquired from the larger ongoing ABCD Study®. The Adolescent Brain Cognitive Development (ABCD) study enrolled 11,880 children from 21 different sites around the United States from 2016 through 2018 (ABCD Study, 2022; Casey et al., 2018; Hagler et al., 2019). Exclusion criteria for the ABCD study included lack of English proficiency, severe sensory, neurological, medical, or intellectual limitations, and inability to complete an MRI scan. The heterogeneity of sites captured geographic, demographic, and socioeconomic diversity of participants (Garavan et al., 2018; Heeringa & Berglund, 2020). In terms of age, sex, and household size, the ABCD cohort closely matches the distribution of 9- and 10-year-olds in the American Community Survey (ACS), a large probability sample survey of U.S. households conducted annually by the U.S. Bureau of Census (Heeringa & Berglund, 2020).
In the interest of reproducibility, the ABCD Study publicly shares its raw and processed data, as well as its processing pipelines. For the current study, we elected to use a combination of raw and tabulated questionnaire and neuroimaging data from the study baseline and one-year follow-up visit as obtained from the NDA 3.0 (raw T1 and T2 structural MRI files) and 4.0 (tabulated questionnaire and diffusion MRI) releases (NDA 3.0 and 4.0 data release 2021; https://dx.doi.org/10.15154/1523041). The rationale for performing our own preprocessing for gray matter macrostructure using raw T1w and T2w images, as compared to using the tabulated data for these outcomes, was that adding an additional contrast image improves pial surface parcellation (Lindroth et al., 2019; Misaki et al., 2015).
After obtaining the data, we performed a series of steps to determine the final sample for our analyses (Supplemental Figure 1). Participants were excluded if: data was collected outside the 21 primary research sites; incidental neurological findings were noted by a radiologist (Y. Li et al., 2021); participant was reported taking hormone medications; X and Y allele frequency ratio did not match their reported sex at birth; participant did not complete both felt-gender questions; quality control standards of the ABCD consortium for raw imaging or FreeSurfer reconstruction were not met (Hagler et al., 2019); or if errors occurred during execution of our pre-processing or processing pipelines. In addition, we elected to restrict our sample to one child per family to simplify the structure of our multilevel model and improve model fit. During the selection of a single child, to increase variability and improve our ability to assess associations between gender diversity and brain structure, we opted to include the child with the lowest felt-gender score from each family. In families where siblings had the same felt-gender score, one child was chosen randomly. Full description of the final sample for the current study can be found in Table 1.
Table 1.
Sample Demographics
ABCD (N=11876) |
Sample (N=7693) |
|
---|---|---|
| ||
Sex | ||
Female | 5680 (47.8%) | 3729 (48.5%) |
Male | 6196 (52.2%) | 3964 (51.5%) |
| ||
Felt-Gender* | ||
Mean (SD) | 4.74 (0.595) | 4.72 (0.612) |
Median [Min, Max] | 5.00 [1.00, 5.00] | 5.00 [1.00, 5.00] |
Missing | 840 (7.1%) | 0 (0%) |
| ||
Age (months) | ||
Mean (SD) | 119 (7.50) | 119 (7.46) |
Median [Min, Max] | 119 [107, 133] | 119 [107, 133] |
| ||
Pubertal Development | ||
Pre | 5837 (49.1%) | 3805 (49.5%) |
Early | 2713 (22.8%) | 1765 (22.9%) |
Mid/Late | 2854 (24.0%) | 1849 (24.0%) |
Missing | 472 (4.0%) | 274 (3.6%) |
| ||
Race* | ||
White | 7517 (63.3%) | 5009 (65.1%) |
Black | 1868 (15.7%) | 1060 (13.8%) |
Multiracial (Black) | 649 (5.5%) | 401 (5.2%) |
Multiracial (Non-black) | 785 (6.6%) | 508 (6.6%) |
Other a | 874 (7.4%) | 597 (7.8%) |
Missing | 183 (1.5%) | 118 (1.5%) |
| ||
Ethnicity | ||
Non-Hispanic | 9312 (78.4%) | 5989 (77.9%) |
Hispanic | 2411 (20.3%) | 1611 (20.9%) |
Missing | 153 (1.3%) | 93 (1.2%) |
| ||
Parent Education* | ||
< HS Diploma | 578 (4.9%) | 334 (4.3%) |
HS orGED | 1110 (9.3%) | 643 (8.4%) |
Some College | 3058 (25.7%) | 1922 (25.0%) |
Bachelor | 3010 (25.3%) | 1963 (25.5%) |
Post Graduate Degree | 4041 (34.0%) | 2777 (36.1%) |
Missing | 79 (0.7%) | 54 (0.7%) |
Difference between research sample and ABCD Study sample is statistically significant at the p < 0.05 level.
The “Other” race/ethnicity category includes participants who were parent-identified as Asian Indian, Chinese, Filipino/a, Japanese, Korean, Vietnamese, Other Asian, American Indian/Native American, Alaska Native, Native Hawaiian, Guamanian, Samoan, Other Pacific Islander, or Other Race
After performing stringent data cleaning and removing siblings, our final sample still closely matched the ABCD sample in terms of sex, age, pubertal development, and ethnicity, but differed significantly in terms of race and parental education. While these differences do not impact the internal validity of the study, it could impact the generalizability of our results. To ensure our purposeful selection to increase greater gender diversity did not confound our results, we compared our final analytic sample to one in which one child per family was selected at random (rather than by felt-gender score), which showed similar distributions of all other covariates included in our models. This suggests the other factors in subset selection (i.e., image quality (Cosgrove et al., 2022)) possibly led to sample differences rather than the oversampling for gender diversity.
2.2. Neuroimaging Data
A harmonized data collection protocol was utilized across sites with either a Siemens, Phillips, or GE 3T MRI scanner. Motion compliance training, as well as real-time, prospective motion correction was used to reduce motion distortion. T1w images were acquired using a magnetization-prepared rapid acquisition gradient echo (MPRAGE) sequence (TR=2500, TE=2.88, flip angle=8) and T2w images were obtained with fast spin echo sequence (TR=3200, TE=565, variable flip angle), with 176 slices with 1mm3 isotropic resolution (Casey et al., 2018). Diffusion MRI data was acquired in the axial plane at 1.7 mm3 isotropic resolution with multiband acceleration factor 3. Ninety-six non-collinear gradient directions were collected with seven b0 images and four non-zero b-values (b=500, b=1000, b=2000, b=3000). Trained technicians inspected all T1w, T2w, and dMRI images and all images underwent a centralized quality control process in order to identify severe artifacts or irregularities (Hagler et al., 2019).
To assess gray matter macrostructure, we obtained the raw baseline T1w and T2w images from the ABCD 3.0 release (NDA 3.0 data release 2020; https://dx.doi.org/10.15154/1520591) and implemented the Human Connectome Project minimal preprocessing pipeline (Glasser et al., 2013) at the Stevens Institute of Neuroimaging and Informatics. Regional parcellation and segmentation was then performed based on the Desikan-Killiany atlas using FreeSurfer 7.1.1 for each participant using T1w and T2w images (Desikan et al., 2006). Calculation of lGI was accomplished using the additional FreeSurfer function “lgi” (Schaer et al., 2012), which compares total surface area within sulcal folds to visible pial surface area. This function is sensitive to topological defects in the FreeSurfer parcellations; therefore, in cases where lGI calculation failed due to aberrant results, participants were excluded from all analyses. Primary outcomes of interest included 68 cortical thickness regions, 22 subcortical volume regions, 68 lGI regions, 21 white matter regions for FA and MD. For a complete list of regions by feature, please see Supplemental Table 3.
Tabulated white matter microstructure results and demographic data from the baseline study visit were obtained from the 4.0 data release via the NIMH Data Archive (https://nda.nih.gov/abcd/; http://dx.doi.org/10.15154/1523041). ABCD diffusion processing employs five iterations of eddy current correction and robust tensor fitting to minimize gradient distortions and motion (Hagler et al., 2019; Hagler Jr et al., 2009). The b=0 images are coarsely registered to a diffusion atlas before being registered to T1w images using mutual information. DMRI images are then resampled and registered using the transform from rigid registration of the T1w image to the diffusion atlas. Finally, the diffusion gradient matrix is adjusted for head rotation. Probabilistic atlas-based tractography is then performed with AtlasTrack using a priori tract location probabilities to inform fiber selection (Hagler Jr et al., 2009). For this study, we utilized the tabulated FA and MD data from the AtlasTrack fiber atlas. Specifically, we selected the fornix, cingulate cingulum, parahippocampal cingulum, uncinate fasciculus, superior longitudinal fasciculus, inferior longitudinal fasciculus, inferior fronto-occipital fasciculus, anterior thalamic radiations, corticospinal tracts, forceps major, forceps minor, and corpus callosum as regions of interest (ROIs).
2.3. Independent variables
2.3.1. Sex
The ABCD Study collects parent-reported sex assigned at birth. However, due to the multidimensional nature of sex, assignment at birth is not always an accurate reflection of chromosomal sex. Therefore, we also chose to use the frequency ratio of X and Y alleles from blood samples to detect the presence of a Y chromosome and ascertain the genetic sex of participants. Children whose assigned sex and genetic sex did not match (n = 9) were excluded from analysis. Furthermore, children who were reported to be taking hormone medications at the baseline assessment were excluded from analysis (for a list of excluded medications, see Supplemental Table 1).
2.3.2. Gender Diversity
At the one-year visit, children completed a gender questionnaire (A. Potter et al., 2021; A. S. Potter et al., 2022). The first two questions of the questionnaire asked how much the participant feels like a boy and how much they feel like a girl, on a 5-point Likert scale (“Not at all,” “A little,” “Somewhat,” “Mostly,” “Totally”). Scoring was based on sex assigned at birth, with questions reverse-coded for the gender that was culturally incongruent with the participant’s sex assigned at birth (e.g., how much an XX child feels like a boy and how much an XY child feel like a girl) (see Supplemental Table 2). Then the two scores were averaged to create a “felt-gender” score which represented the level of congruence between the participant’s personal feelings of gender and the sex/gender assigned at birth). In the current study, we used the term “gender diversity” to refer to deviation from the maximum felt-gender score of 5 (e.g., feeling “totally” congruent with the gender assigned to one’s birth sex and “not at all” having incongruent gender feelings). As such, lower felt-gender scores reflect higher levels of gender diversity.
2.3.3. Other Predictors
Based on our questions of interest, we identified a number of demographic variables in the ABCD dataset as important predictors of brain structure. Age at scan date was measured in months and rounded to the nearest whole month. PDS was calculated based on the parent-report version of the Pubertal Development Scale and categorized as prepuberty, early puberty, mid puberty, late puberty, and post-puberty (Cheng et al., 2021; Herting et al., 2021; Petersen et al., 1988; Thijssen et al., 2020). Since few children in this age range were in late puberty or post-pubertal, we combined the mid, late, and post-puberty groups into a single category (mid/late puberty). We chose to include measures of race, ethnicity, and socioeconomic status in our models because human neurodevelopment is sensitive to myriad ecological factors which, due to systemic social injustice, are correlated with sociocultural variables, like race, ethnicity, and socioeconomic status (Nketia et al., 2021; Werchan & Amso, 2017). Race was determined via parent report, with parents encouraged to select all answers that apply. For participants who selected more than one race, we categorized them as multiracial Black if one of their selections was “Black” or multiracial non-Black. We combined the categories of Asian Indian, Chinese, Filipino/a, Japanese, Korean, Vietnamese, Other Asian, American Indian/Native American, Alaska Native, Native Hawaiian, Guamanian, Samoan, other Pacific Islander, and “other race” into a single category (“other race”), due to small group numbers. Ethnicity was also parent-reported as either Hispanic or non-Hispanic. Multiple measurements of socioeconomic status - such as parental educational attainment, household income, income-to-needs ratio, neighborhood disadvantage - have been linked to brain structure in studies of ABCD data (Rakesh et al., 2022). Due to its low non-response rate, we chose to use educational attainment, which was defined as the highest level of education achieved by any parent in the household and binned into the following categories: less than high school diploma, high school diploma or GED, some college, bachelor’s degree, or postgraduate degree. Scanner manufacturer was included to account for differences in data collection caused by MRI software and hardware. Although intracranial volume is often used account for differences in brain size during brain volume analyses, actual brain volume varies throughout the course of the day (Karch et al., 2019; Nakamura et al., 2015; Trefler et al., 2016). Thus, we chose to also include total brain volume (TBV) as a predictor in our subcortical volume models to account for the relationship between regional and whole-brain volume. TBV was calculated by FreeSurfer, then scaled by the sample root-mean-square.
2.4. Analyses
Mixed effects modeling is a statistical approach for handling nested data structures, such as in the case of multi-site studies that need to account for site clustering that may result from unmeasured sociocultural or environmental factors of participants and/or the data collection process (Heeringa & Berglund, 2020; Owens et al., 2021). The term “mixed” refers to the fact that it combines traditional fixed effect modeling with addition of random effects - variables whose effect on the outcome is not uniform across groups (Kreft & De Leeuw, 1998). Our study implemented the lmer() and glmer() functions for generalized linear mixed modeling from the lme4 package in R (Bates et al., 2015; R Core Team, 2019) to examine the effects of sex, felt-gender and their interaction on gray matter macrostructure and white matter microstructure. Residuals from all models were visually and numerically inspected to rule out violations of model assumptions. If a model failed to meet distributional assumptions (i.e., normality assumption) and/or experienced convergence issues with lmer(), we implemented glmer() to fit the model with gamma distribution assumption but keeping the identity link to preserve interpretability. In order to improve the accuracy of our models, we sought to account for additional variables that influence neurodevelopment, and to account for nesting of subjects within sites, we included the data collection site as a random effect. For our fixed effects, we added age, PDS, race, ethnicity, parental education, and scanner as fixed effects (all collected at the baseline visit), as well as TBV for the models of subcortical volume.
For this study, we relied on a model building strategy to inform our understanding of how sex, gender, and their combination (i.e., main effects and/or interaction) relate to gray matter macrostructure and white matter microstructure. For each ROI outcome, we fit a null model without sex or gender, along with four models of sex and/or gender. The null model (M0) included only our covariates (e.g., ROI ~ age + PDS score + race + ethnicity + parental education + scanner + (1 | site)) (and when applicable, TBV for subcortical outcomes). Next, sex was added to the M0 model as a fixed effect to create our sex model (M1), whereas felt-gender score was added to the M0 model as a fixed effect to create a gender model (M2). Next, we created two additional models that included both sex and gender as main effects in a sex and gender model (M3), as well a fourth model that included both main effects and a sex-by-gender interaction term for a sex-by-gender model (M4). The point biserial correlation between sex and felt-gender scores in our analytic sample was −0.24. To rule out multicollinearity amongst our covariates, we calculated the variance inflation ratio (vif) of each model. Importantly, all variance inflation factor values were below 3, suggesting no multicollinearity between variables in any model. Given that very large samples tend to transform small differences into statistically significant effects, we chose to calculate Cohen’s f2 to quantify the effect size and aid in our interpretation of the regional relationships detected between structure and sex, gender, and interaction.
For each ROI, we then used pairwise ANOVA tests in order to determine whether each model captured more variance than the null model as well as to compare Akaike Information Criterion (AIC) between models with differing numbers of independent variables (see Supplemental Figure 2). In cases where no model fit the outcome variance better than the null model (M0), the null model was considered the best fit for the data. If the interaction model (M4) had a p-value < 0.05 in pairwise ANOVA comparisons with all three other models (M1, M2, and M3) and the null model (M0), it was chosen as the best model. Otherwise, if the sex + gender model (M3) had a p-value < 0.05 in pairwise comparisons with the sex model (M1), the gender model (M2), and the null model (M0) it was chosen as the best model. If neither the interaction nor the sex + gender model was chosen as the best and only one of the two remaining models - sex (M1) and gender (M2) - significantly improved fit compared to the null model, then the model that performed better than the null was considered the best-fitting model. If both the sex (M1) and gender (M2) models improved fit compared to the null model, then the model with the lowest AIC was chosen as the best model for explaining outcome variance. Finally, to gauge the magnitude of the difference in model fit, we calculated the range of model r2 values for each ROI.
3. Results
Below, we report model comparison results by macrostructural (i.e., subcortical volume, cortical thickness, and gyrification index) and microstructural outcomes of interest, respectively. A summary of which model was deemed to account for the greatest amount of variance by macrostructural gray matter outcomes is presented in Table 2 (as well as in Supplemental Tables 5, 6, & 7), whereas white matter microstructural outcomes are presented in Table 3 (as well as Supplemental Tables 8–9). In the majority of both gray matter and white matter ROIs, structural variance was best accounted for by the sex model (M1). The gender model (M2) was not the best model for capturing structural variance in any white or gray matter regions; however, the sex and gender model (M3) captured more variance than sex alone in eight gray matter regions and the interaction model (M4) captured the most variance in one white matter region.
Table 2:
Number and frequency (%) of the best model for each gray matter macrostructural ROI in terms of outcome variance captured (R2). Abbreviations: M0 = null model (i.e., covariates only); M1 = null model + sex; M2 = null model + gender; M3 = null model + sex + gender; M4 = null model + sex + gender + sex-by-gender interaction; ROI = regions of interest.
Model | ||||||
---|---|---|---|---|---|---|
|
||||||
Total ROIs | M0 | M1 | M2 | M3 | M4 | |
|
||||||
Gray Matter | ||||||
158 | 31 (19.6%) | 119 (75.3%) | 0 (0.0%) | 8 (5.1%) | 0 (0.0%) | |
| ||||||
Subcortical Volume | 22 | 5 (22.7%) | 15 (68.2%) | - | 2 (9.1%) | - |
| ||||||
Basal Ganglia | 8 | 1 (12.5%) | 7 (87.5%) | - | - | - |
Limbic | 6 | 4 (66.7%) | 2 (33.3%) | - | - | - |
Corpus Callosum | 5 | - | 5 (100%) | - | - | - |
Other | 3 | 1 (33.3%) | - | 2 (66.7%) | - | |
| ||||||
Cortical Thickness | 68 | 23 (33.8%) | 44 (64.7%) | - | 1 (1.5%) | - |
| ||||||
Frontal | 22 | 8 (36.4%) | 13 (59.1%) | - | 1 (4.5%) | - |
Temporal | 18 | 3 (16.7%) | 15 (83.3%) | - | - | - |
Parietal | 10 | 6 (60.0%) | 4 (40.0%) | - | - | - |
Occipital | 8 | 4 (50.0%) | 4 (50.0%) | - | - | - |
Cingulate | 8 | 2 (25.0%) | 6 (75.0%) | - | - | - |
Insula | 2 | - | 2 (100%) | - | - | - |
| ||||||
Local Gyrification Index | 68 | 3 (4.4%) | 60 (88.2%) | - | 5 (7.4%) | - |
| ||||||
Frontal | 22 | 3 (13.6%) | 18 (81.8%) | - | 1 (4.5%) | - |
Temporal | 18 | - | 17 (94.4%) | - | 1 (5.6%) | - |
Parietal | 10 | - | 7 (70.0%) | - | 3 (30.0%) | - |
Occipital | 8 | - | 8 (100%) | - | - | - |
Cingulate | 8 | - | 8 (100%) | - | - | - |
Insula | 2 | - | 2 (100%) | - | - | - |
Table 3:
Number and frequency (%) of the best model for each white matter microstructural ROI in terms of variance explained. Abbreviations: M0 = null model (i.e., covariates only); M1 = null model + sex; M2 = null model + gender; M3 = null model + sex + gender; M4 = null model + sex + gender + sex-by-gender interaction; ROI = regions of interest.
Model | ||||||
---|---|---|---|---|---|---|
|
||||||
Total ROIs | M0 | M1 | M2 | M3 | M4 | |
|
||||||
White Matter | ||||||
42 | 8 (19.1%) | 33 (78.6%) | 0 (0%) | 0 (0%) | 1 (2.4%) | |
| ||||||
Fractional Anisotropy | 21 | 6 (28.6%) | 15 (71.4%) | - | - | - |
| ||||||
Association Fibers | 6 | - | 6 (100%) | - | - | - |
Projection Fibers | 4 | 2 (50.0%) | 2 (50.0%) | - | - | - |
Limbic Fibers | 8 | 3 (37.5%) | 5 (62.5%) | - | - | - |
Callosal Fibers | 3 | 1 (33.3%) | 2 (66.7%) | - | - | - |
Mean Diffusivity | 21 | 2 (9.5%) | 18 (85.7%) | - | - | 1 (4.8%) |
| ||||||
Association Fibers | 6 | 1 (16.7%) | 5 (83.3%) | - | - | - |
Projection Fibers | 4 | - | 4 (100%) | - | - | - |
Limbic Fibers | 8 | - | 7 (87.5%) | - | - | 1 (12.5%) |
Callosal Fibers | 3 | 1 (33.3%) | 2 (66.7%) | - | - | - |
3.1. Gray Matter Macrostructure
3.1.1. Subcortical Volume
All subcortical volume averages were larger in males than in females. Using the modeling approach to determine which factors best explained the variance subcortical volume, we found that in the majority of subcortical ROIs (15/22; 68%), the sex model (M1) captured more variance than the other models (Figure 1). However, the direction of the relationship between sex and volume varied by region (Figure 2). The model with both sex and gender (M3) was determined to best capture the variance in the left and right cerebellum; albeit sex was a significant predictor in these models, while gender diversity was not statistically significant. The gender model (M2) and the interaction model (M3) were not the best-fitting model for any subcortical region. In the left and right thalamus, left and right hippocampus, and right pallidum, neither the sex model (M1), the gender model (M2), the sex + gender model (M3), nor the interaction model (M4) explained more variance than the null model; therefore we consider the null model (M0; covariates only) to be the best-fitting model in 23% of subcortical regions examined, including the right pallidum, bilateral hippocampus, and bilateral thalamus. Effect sizes of individual sex, gender, and interaction beta coefficients in each model were small (max f2 = 0.028). Additionally, the difference in R2 between the best model and the null model was universally small (mean ΔR2 = 0.004, max ΔR2 = 0.014).
Figure 1.
Coronal and sagittal slices depicting the best model for capturing variance in gray matter structure, including subcortical volume (A), cortical thickness (B), and local gyrification index (C). Each ROI is colored according to the best-fitting model: gray = null model (i.e., covariates only); purple= M1 (null model + sex; green = M2 (null model + gender); pink = M3 = (null model + sex + gender); orange= M4 (null model + sex + gender + sex-by-gender interaction).
Figure 2.
Beta coefficient estimates and 95% confidence intervals (CI) for the best-fitting model for each subcortical region. Abbreviations: M1 = null model (covariates) + sex; M3 = null model (covariates) + sex + gender; R=right; L=left; * = FDR-corrected p-value for sex < 0.05
3.1.2. Cortical Thickness
With regard to variance in regional cortical thickness, we found that the sex model (M1) captured the most variance in 65% (44/68) of ROIs examined (Figure 1). The model with a combination of sex and gender (M3) was the best fit for the variance in thickness of the left precentral gyrus. In this region, males showed thinner cortex than females (β = −0.012, p < 0.001), while the association between gender and thickness was not significant. For 34% of regions, none of the models performed better than the null model (M0; covariates only) - including bilaterally in the rostral middle frontal gyrus, medial orbitofrontal cortex, transverse temporal gyrus, postcentral gyrus, precuneus, lateral occipital cortex, lingual gyrus, and posterior cingulate cortices, the null model was the best model bilaterally. Where significant, the relationship between sex and cortical thickness was heterogeneous (see Figure 3): in the left middle temporal, left and right fusiform gyrus, left parahippocampal gyrus, left and right superior parietal lobe, and right caudal anterior cingulate cortex, the mean cortical thickness among males was higher than the mean among females; in the other 34 regions where the difference between sexes was significant, the mean cortical thickness measurement for females was higher than the mean for males. The overlap between the male and female confidence intervals was large in every region. Across all models and ROIs, the effect sizes of our sex and gender estimates were very small (max f2 = 0.020), and the difference in explained variance between models was minimal (as measured with conditional R2 range per ROI; mean ΔR2 = 0.002, max ΔR2 = 0.018).
Figure 3.
Beta coefficient estimates and 95% confidence intervals (CI) for the best-fitting model for each cortical thickness region. Abbreviations: M1 = null model (covariates); M3 = null model (covariates) + sex + gender; R=right; L=left; Cing = cingulate; * = FDR-corrected p-value for sex < 0.05
3.1.3. Local Gyrification Index
The sex model (M1) best described the variance in gyrification in 88% of regions, while the sex + gender model (M3) was the best predictor of variance in the right superior frontal cortex, right parahippocampal gyrus, left superior parietal cortex, left inferior parietal cortex, and left postcentral gyrus (Figure 1). In all 68 ROIs, the mean local gyrification index among males was higher than the mean among females (Figure 4). The association between felt-gender score and lGI was not significant in any region. Sex and gender estimate effect sizes were again small according to Cohen’s guidelines (Cohen, 1988) for interpretation of effect size (max f2 = 0.077). However, the maximum effect size for lGI - which was observed in the right superior temporal gyrus - was also the maximum effect size in this study, suggesting that sex accounts for a larger proportion of the variance in lGI than it does in any other metric examined. Furthermore, lGI outcomes had the largest average difference in explained variance between the best and worst model (mean ΔR2 = 0.023, max ΔR2 = 0.064).
Figure 4.
Beta coefficient estimates and 95% confidence intervals (CI) for the best-fitting model for local gyrification of each region. Abbreviations: M1 = null model (covariates) + sex; M3 = null model (covariates) + sex + gender; R=right; L=left; * = FDR-corrected p-value for sex < 0.05
3.2. White Matter Microstructure
3.2.1. Fractional Anisotropy
Model building results showed that mean FA was best explained by the sex model (M1) in 71% of tracts (15/21; Figure 5). In half of fiber tracts, the male FA average was higher than the female average, while the female average was higher than males in the other half (Figure 6). For 29% of regions (6/21), none of the models accounted for significantly more variance than the null model with covariates (M0), including the forceps major, left uncinate fasciculus, bilateral superior longitudinal fasciculus, and bilateral corticospinal tracts. Despite the sex model accounting for the most variance of the four models, only a small proportion of FA variance was attributable to sex (max f2 = 0.017) and the difference between models in terms of explained variance was small (mean ΔR2 = 0.003, max ΔR2 = 0.012).
Figure 5.
T1-weighted images overlaid with examined white matter tracts colored according to the best-fitting model. White = null model (i.e., covariates only); purple= M1 null model (covariates) + sex; green = M2 (null model + gender); pink = M3 (null model + sex + gender); orange= M4 (null model + sex + gender + sex-by-gender interaction). The gender model (M2) and the sex + gender model (M3) were not the best-fitting model in any WM ROIs.
Figure 6.
Beta coefficient estimates and 95% confidence intervals (CI) for the best-fitting model for FA in each white matter tract. Abbreviations: M1 = null model (covariates) + sex; R=right; L=left; ILF = inferior longitudinal fasciculus; IFOF = inferior fronto-occipital fasciculus
3.2.2. Mean Diffusivity
In modeling analyses, the sex model (M1) best described the variance in MD in 86% of regions (18/21) including limbic, projection, association, and callosal white matter fiber tracts (Figure 5). The interaction model (M4) best accounted for MD variability in the left uncinate, and sex - but not gender diversity or the interaction term - was independently significantly associated with MD in the model. In 10% of tracts (2/21), MD variance was best explained by the null model (M0), including the left cingulate cingulum, and forceps major. In all regions where sex was a significant predictor of MD, the male average MD was higher than the female average (Figure 7). Effect sizes were small (max f2 = 0.035), and so was the difference in variance explained by each model, as measured by conditional R2 range (mean ΔR2 = 0.008, max ΔR2 = 0.026).
Figure 7.
Beta coefficient estimates and 95% confidence intervals (CI) for the best-fitting model for MD in each white matter tract. Abbreviations: M1 = null model (covariates) + sex; M4 = null model (covariates) + sex + gender + sex-by-gender interaction); R=right; L=left; CST = corticospinal tracts; ILF = inferior longitudinal fasciculus; IFOF = inferior fronto-occipital fasciculus; SLF = superior longitudinal fasciculus
4. Discussion
This study is the largest neuroimaging study to date using a non-dichotomous assessment of gender (Rauch & Eliot, 2022), and the first to examine the relationship between diversity of felt–gender and diversity of brain structure - while also considering sex - in a nationwide pediatric sample. The classical model (M1) - with sex (but not gender) - explained the most variance in 75% of gray matter ROIs and 79% of white matter ROIs. This suggests a widespread association between sex and both gray and white matter structures in youth ages 9–11 years-old - although the effect sizes for sex estimates were small in all cases. In contrast, the gender model (M2) did not account for more variance than the other models in any of the regions examined. The sex + gender model (M3) captured the most variance in 8 gray matter ROIs (left and right cerebellum volume, left precentral cortical thickness, and local gyrification index in the left superior and inferior parietal cortex, left precentral gyrus, and right parahippocampal gyrus), while the interaction model (M4) captured the most variance in left uncinate fasciculus MD. Nonetheless, the β coefficient estimates for gender and the sex-by-gender interaction term from the M4 model were not statistically significant in the best model for any region. Taken together, these results demonstrate that, in 9–11-year-olds, felt-gender diversity is not directly associated with brain structure, while sex accounts for a small amount of variance in brain structure across a wide array of regions and measurements.
Our findings align with previous work involving the ABCD dataset where different types of machine learning techniques were implemented to probe potential discriminative sex differences in the adolescent brain (Adeli et al., 2020; Brennan et al., 2021; Kim et al., 2021, 2022). For example, deep learning identified the subcortex as the most sex-discriminative area of the brain (Adeli et al., 2020). The authors further probed the ROIs associated with the most-discriminative voxels using two-sample t-tests and reported sex differences in volume in the pallidum, some parts of the cerebellum, hippocampus, and putamen - but not the amygdala or insula (Adeli et al., 2020). In another ABCD analysis, machine learning analysis utilizing various brain features predicted sex with 70.8% accuracy with as few as 13 attributes (select ROIs pertaining to cortical thickness, cortical volume, sulcal depth, gray matter intensity, white matter intensity, and gray/white intensity contrast)(Brennan et al., 2021). Moreover, these analyses did not account for other possible sources of variance between males and females, such as pubertal development, which differs significantly between males and females at ages 9–11 years in the ABCD sample (Herting et al., 2021), nor did they consider the potential role of gender.
Through our model building approach and adjusting for these key covariates, our analysis supported previous findings of significant sex differences in both gray and white matter structures as well as highlighted a number of brain regions that are similar between the sexes (i.e., best explained by the null model) at ages 9–11 years old. Similar to Adeli and colleagues (Adeli et al., 2020), we confirmed significant sex differences bilaterally in the caudate, putamen, pallidum, cerebellum. We found additional sex differences in the brain stem, corpus callosum, nucleus accumbens, and the amygdala. Experiments in mice using the “Four Core Genotypes” model (XX and XY gonadal males, and XX and XY gonadal females) demonstrate that the X chromosome and gonadal hormones influence the volume of the cerebellum and corpus callosum, providing a potential biological explanation for our findings (Corre et al., 2016). In contrast to Adeli, we found no significant association between sex and hippocampal volume in any model, despite the known role of the hippocampus in the synthesis of both testosterone and estradiol (Ahmadpour & Grange-Messent, 2021; Azcoitia et al., 2011) as well as its enrichment in estrogen receptors (Bean et al., 2014). Research in murine models of sex chromosome aneuploidies has found that, rather than increasing differences, X-chromosome gene dosage effects and androgen effects counteract each other in some brain regions and reduce sex differences in volume, (Raznahan et al., 2013). This demonstrates that a regional association with sex hormones does not necessitate a sex difference in structure. Instead, sex interacts with neuroanatomy through multiple mechanisms, including gene expression, X-chromosome inactivation and inactivation escape, endocrine regulation, and receptor density (Raznahan & Disteche, 2021; Sigurdardottir et al., 2020). The relationship between cortical thickness and sex is also complex. Cortical thickness does not appear to be related to X-chromosome inactivation (Mallard et al., 2021), and findings regarding its association with androgen levels in children are mixed (Herting et al., 2015; Nguyen, 2018). However, cortical thickness has been linked to the length of CAG repeats in the androgen receptor gene - a measure which influences androgen receptor sensitivity (Raznahan et al., 2010; Zitzmann, 2009). In our analysis, cortical thickness was lower in males in portions of the frontal, temporal and cingulate cortices. On average, the brains of males also expressed a significantly higher gyrification index than brains of females in all but 4 regions examined (the left cuneus cortex, left pericalcarine cortex, and left and right frontal poles) indicating a global, consistent association between sex and gyrification in adolescence. Neuroimaging of adolescents with sex chromosome aneuploidies has revealed both sex differences in gyrification and sex chromosome dosage effects on gyrification, offering insight into a potential genetic explanation for our findings (Fish et al., 2017).
In addition to gray matter macrostructure, the current study examined white matter microstructure. While previous studies have reported sex differences in FA and MD in the ABCD sample (Lawrence et al., 2021), the current study aimed to further investigate the degree to which gender may have unique or combined effects in addition to the noted sex effects. The sex model (M1) captured the most variance in 79% of white matter regions. Previous research suggests sex differences in white matter microstructure may be related to sex differences in androgen and estrogen. In adolescents, regional FA is correlated with circulating androgen and estradiol (Herting et al., 2012). Additionally, studies of hormone replacement therapy in adults have reported reductions in MD and increases in FA with testosterone administration, while estradiol and antiandrogen therapy resulted in increases in MD and decreases in FA (Kranz et al., 2017). On the other hand, one of the genes involved in myelination, proteolipid protein (PLP1), is located on the X chromosome, is subject to X-inactivation, and has been implicated in multiple sclerosis - a demyelinating disease with a known sex disparity (Cloake et al., 2018). Therefore, more research is needed to elucidate the mechanism through which sex differences in white matter develop in childhood.
These current results also highlight the importance of thoughtful model design in neuroscientific studies of sex and gender. The largest effect sizes for sex were seen for lGI suggesting that sex accounts for a larger proportion of the variance in lGI than it does in any other metric examined. The maximum proportion of variance accounted for by sex was 7.7% in the right superior temporal gyrus. The amount of additional variance captured compared to the null model was also relatively modest, with a median of 0.3% more variance captured by the best model across all ROIs. Furthermore, all of our effect sizes are considered small according to Cohen’s heuristic (e.g., small: f2≥ 0.02, medium: f2≥ 0.15, large: f2 ≥ 0.35; Cohen, 1988), and most are below the average effect size in the ABCD study (0.05; Owens et al., 2021). Therefore, in contrast to the conclusion of Brennan et al., our results indicate that sex differences account for a small amount of variance in brain structure, rather than ‘sexual dimorphism’, at ages 9–11 years old. As such, our findings bolster previous assertions that sex differences in the brain do exist, but do not amount to two distinct, non-overlapping sex phenotypes in the brain (Eliot et al., 2021; Friedrichs & Kellmeyer, 2022; Hyde et al., 2018; Rippon et al., 2014). In contrast to sex-related variance, we did not discover any significant neuroanatomical variability related to gender diversity alone. That is, while gender improved model-fit in some regions by accounting for more variance than sex alone, felt-gender was not found to be statistically significant in those models. Similarly, a prior study in adolescents aged 12–17 found no relationship between gender dysphoria and cortical morphometry (Skorska et al., 2021). Though these findings suggest that there is not a direct association between felt-gender or dysphoria and brain structure, the relationship between gender diversity and health outcomes is theorized to be complex and modified by distal and proximal stressors, as well as protective factors (K. K. Tan et al., 2020; Testa et al., 2015). Therefore, further study is needed to explore whether other facets of gender and gender-related aspects of the social environment interact with neurological development. Collectively, our analyses demonstrate that sex - but not felt-gender - accounts for a small fraction of the total structural variance at ages 9–11 years-old.
4.1. Limitations
The generalizability of our results is limited by non-response and selection bias. The group of participants excluded due to missing data on the gender questionnaire was more frequently Hispanic, Black, had more advanced pubertal development, and had lower parental education than the included sample. Other authors using ABCD data have also demonstrated that sample sociodemographic composition can be affected by common exclusion criteria, such as image quality, limiting the generalizability of results (Cosgrove et al., 2022). Hence, our exclusion of participants based on image quality, atypical neuroanatomical findings, or missing data may have contributed to bias in our final sample. Finally, because the gender questionnaire was not administered at the baseline imaging visit, we relied on responses collected approximately one year after images were collected. Therefore, our models assume that felt-gender is relatively stable between the ages of 9–11 years.
Furthermore, this study is a cross-sectional, observational study of sex and gender in humans, and therefore, we cannot ascertain the direction, mechanism, or cause of our results. As such, our findings should not be interpreted as evidence that sex causes structural differences. The concept of permanent, dimorphic sex or gender phenotypes in the brain is at odds with the dynamic nature of sex, gender, and the brain (Friedrichs & Kellmeyer, 2022). Brain structure in general is plastic and can be altered temporarily or permanently by various experiences and environmental factors (Kolb & Gibb, 2011). Social factors like family and school support are also known to mitigate the relationship between gender diversity and mental health problems (Ross-Reed et al., 2019). Our research was limited in scope and not designed to encapsulate all the myriad dimensions of sex and gender that could interact with brain structure. While androgens and estrogens have a documented relationship with brain structure, levels fluctuate with time of day, menstrual cycle phase, and contraceptive use (Uban et al., 2018). Given that hormone collection in ABCD is not consistent with regard to time of day or menstrual cycle phase (Herting et al., 2021; Uban et al., 2018), we chose not to include measures of circulating hormones in our analysis. In terms of genetics, although we were able to detect the presence of a Y chromosome through use of the X and Y allele frequency ratio, we did not examine other sources of sex chromosome variability - including aneuploidy, mosaicism, X-chromosome inactivation, and inactivation escape (Raznahan & Disteche, 2021). Though we failed to reject the null hypothesis of no relationship between felt-gender diversity and brain structure, our results cannot rule out an association between brain structure and other aspects of gender. Moreover, it may be necessary to consider moderating factors such as stress, social exclusion, and victimization (Loso et al., 2023; K. K. Tan et al., 2020). Future work should explore how other gender-related variables - such as gender expression, gender discrimination, and social support - may contribute to brain structure during development. Thus, our analyses should be viewed as an initial step towards disentangling the sex and gender in neurodevelopmental research.
4.2. Conclusions
This study demonstrated that variance in white matter microstructure, subcortical volume, cortical thickness and gyrification was related to sex, but not gender diversity in children aged 9–11. Although adding gender diversity increased the amount of outcome variance explained by the model in 8 gray matter ROIs and the addition of the sex-by-gender diversity interaction term improved the model fit in one white matter ROI, neither gender diversity, nor the interaction term were significant in any of these regions. In contrast, the sex model captured the most variance in 76% of the ROIs examined. However, the effect sizes of these sex differences across a variety of macro and microstructural measures were universally small. This suggests that sex accounts for a small proportion of neuroanatomical differences in late childhood.
Supplementary Material
Highlights.
Sex-related variance in regional human brain structure is widespread at ages 9–11
Together, sex and gender diversity accounted for more variance in only a few regions
Felt-gender diversity itself was not significantly related to any outcome at ages 9–11
Effect sizes for sex and felt-gender estimates were small
Acknowledgments
Data used in the preparation of this article were obtained from the Adolescent Brain Cognitive DevelopmentSM (ABCD) Study (https://abcdstudy.org), held in the NIMH Data Archive (NDA). This is a multisite, longitudinal study designed to recruit more than 10,000 children ages 9–11 and follow them over 10 years into early adulthood. The ABCD Study® is supported by the National Institutes of Health and additional federal partners under award numbers U01DA041048, U01DA050989, U01DA051016, U01DA041022, U01DA051018, U01DA051037, U01DA050987, U01DA041174, U01DA041106, U01DA041117, U01DA041028, U01DA041134, U01DA050988, U01DA051039, U01DA041156, U01DA041025, U01DA041120, U01DA051038, U01DA041148, U01DA041093, U01DA041089, U24DA041123, U24DA041147. A full list of supporters is available at https://abcdstudy.org/federal-partners.html. A listing of participating sites and a complete listing of the study investigators can be found at https://abcdstudy.org/consortium_members/. ABCD consortium investigators designed and implemented the study and/or provided data but did not necessarily participate in the analysis or writing of this report. This manuscript reflects the views of the authors and may not reflect the opinions or views of the NIH or ABCD consortium investigators. This study was also funded by RF1MH123223.
Abbreviations:
- ABCD
Adolescent Brain Cognitive Development Study
- MRI
magnetic resonance imaging
- DTI
diffusion tensor imaging
- FA
fractional anisotropy
- MD
mean diffusivity
- ABCD
Adolescent Brain Cognitive Development
- WM
white matter
- GM
gray matter
- T1w
T1-weighted
- T2w
T2-weighted
- ROI
region of interest
- PDS
pubertal development scale
Footnotes
Declaration of Interests
JC declares status as employee of the private company NeuroScope Inc (NY, USA).
Data and Code Availability Statement
Data from the ABCD Study Data Release is available for access through the NIMH Data Archive (NDA 3.0 data release 2020: https://dx.doi.org/10.15154/1520591; NDA 4.0 data release 2021; https://dx.doi.org/10.15154/1523041). Only researchers with an approved NDA Data Use Certification (DUC) can obtain ABCD Study data.
References
- ABCD Study. (2022). ABCD Study. https://abcdstudy.org/about/
- Adeli E., Zhao Q., Zahr N. M., Goldstone A., Pfefferbaum A., Sullivan E. V., & Pohl K. M. (2020). Deep learning identifies morphological determinants of sex differences in the pre-adolescent brain. NeuroImage, 223, 117293. 10.1016/j.neuroimage.2020.117293 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ahmadpour D., & Grange-Messent V. (2021). Involvement of testosterone signaling in the integrity of the neurovascular unit in the male: Review of evidence, contradictions, and hypothesis. Neuroendocrinology, 111(5), 403–420. [DOI] [PubMed] [Google Scholar]
- Alemán-Gómez Y., Janssen J., Schnack H., Balaban E., Pina-Camacho L., Alfaro-Almagro F., Castro-Fornieles J., Otero S., Baeza I., & Moreno D. (2013). The human cerebral cortex flattens during adolescence. Journal of Neuroscience, 33(38), 15004–15010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Azcoitia I., Yague J. G., & Garcia-Segura L. M. (2011). Estradiol synthesis within the human brain. Neuroscience, 191, 139–147. 10.1016/j.neuroscience.2011.02.012 [DOI] [PubMed] [Google Scholar]
- Backeljauw B., Holland S. K., Altaye M., & Loepke A. W. (2015). Cognition and brain structure following early childhood surgery with anesthesia. Pediatrics, 136(1), e1–e12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bates D., Maechler M., Bolker B., & Walker S. (2015). Fitting Linear Mixed-Effects Models Using lme4. Journal of Statistical Software, 67(1), 1–48. 10.18637/jss.v067.i01. [DOI] [Google Scholar]
- Beking T., Burke S. M., Geuze R. H., Staphorsius A. S., Bakker J., Groothuis A. G. G., & Kreukels B. P. C. (2020). Testosterone effects on functional amygdala lateralization: A study in adolescent transgender boys and cisgender boys and girls. Psychoneuroendocrinology, 111, 104461. [DOI] [PubMed] [Google Scholar]
- Belfi A. M., Conrad A. L., Dawson J., & Nopoulos P. (2014). Masculinity/femininity predicts brain volumes in normal healthy children. Developmental Neuropsychology, 39(1), 25–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bethlehem R. a. I., Seidlitz J., White S. R., Vogel J. W., Anderson K. M., Adamson C., Adler S., Alexopoulos G. S., Anagnostou E., Areces-Gonzalez A., Astle D. E., Auyeung B., Ayub M., Bae J., Ball G., Baron-Cohen S., Beare R., Bedford S. A., Benegal V., … Alexander-Bloch A. F. (2022). Brain charts for the human lifespan. Nature, 604(7906), Article 7906. 10.1038/s41586-022-04554-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bhargava A., Arnold A. P., Bangasser D. A., Denton K. M., Gupta A., Hilliard Krause L. M., Mayer E. A., McCarthy M., Miller W. L., Raznahan A., & Verma R. (2021). Considering Sex as a Biological Variable in Basic and Clinical Studies: An Endocrine Society Scientific Statement. Endocrine Reviews, 42(3), 219–258. 10.1210/endrev/bnaa034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bölte S., Neufeld J., Marschik P. B., Williams Z. J., Gallagher L., & Lai M.-C. (2023). Sex and gender in neurodevelopmental conditions. Nature Reviews Neurology, 19(3), Article 3. 10.1038/s41582-023-00774-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brennan D., Wu T., & Fan J. (2021). Morphometrical Brain Markers of Sex Difference. Cerebral Cortex, 31. 10.1093/cercor/bhab037 [DOI] [PubMed] [Google Scholar]
- Burke S. M., Cohen-Kettenis P. T., Veltman D. J., Klink D. T., & Bakker J. (2014). Hypothalamic Response to the Chemo-Signal Androstadienone in Gender Dysphoric Children and Adolescents. Frontiers in Endocrinology, 5, 60. 10.3389/fendo.2014.00060 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burke S. M., Kreukels B. P., Cohen-Kettenis P. T., Veltman D. J., Klink D. T., & Bakker J. (2016). Male-typical visuospatial functioning in gynephilic girls with gender dysphoria—Organizational and activational effects of testosterone. Journal of Psychiatry and Neuroscience, 41(6), 395–404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Carpenter M. (2018). Intersex variations, human rights, and the International classification of diseases. Health and Human Rights, 20(2), 205. [PMC free article] [PubMed] [Google Scholar]
- Casey B. J., Cannonier T., Conley M. I., Cohen A. O., Barch D. M., Heitzeg M. M., Soules M. E., Teslovich T., Dellarco D. V., & Garavan H. (2018). The adolescent brain cognitive development (ABCD) study: Imaging acquisition across 21 sites. Developmental Cognitive Neuroscience, 32, 43–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen Y., Ide J. S., Li C. S., Chaudhary S., Le T. M., Wang W., Zhornitsky S., Zhang S., & Li C.-S. R. (2022). Gray matter volumetric correlates of dimensional impulsivity traits in children: Sex differences and heritability. Human Brain Mapping, 43(8), 2634–2652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng T. W., Magis-Weinberg L., Guazzelli Williamson V., Ladouceur C. D., Whittle S. L., Herting M. M., Uban K. A., Byrne M. L., Barendse M. E., & Shirtcliff E. A. (2021). A researcher’s guide to the measurement and modeling of puberty in the ABCD Study® at baseline. Frontiers in Endocrinology, 471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Choi N., Fuqua D. R., & Newman J. L. (2008). The Bem Sex-Role Inventory: Continuing Theoretical Problems. Educational and Psychological Measurement, 68(5), 881–900. 10.1177/0013164408315267 [DOI] [Google Scholar]
- Cloake N. C., Yan J., Aminian A., Pender M. P., & Greer J. M. (2018). PLP1 mutations in patients with multiple sclerosis: Identification of a new mutation and potential pathogenicity of the mutations. Journal of Clinical Medicine, 7(10), 342. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen J. (1988). Statistical power analysis for the behavioral sciences. Erlbaum. [Google Scholar]
- Corre C., Friedel M., Vousden D. A., Metcalf A., Spring S., Qiu L. R., Lerch J. P., & Palmert M. R. (2016). Separate effects of sex hormones and sex chromosomes on brain structure and function revealed by high-resolution magnetic resonance imaging and spatial navigation assessment of the Four Core Genotype mouse model. Brain Structure and Function, 221(2), 997–1016. 10.1007/s00429-014-0952-0 [DOI] [PubMed] [Google Scholar]
- Cosgrove K. T., McDermott T. J., White E. J., Mosconi M. W., Thompson W. K., Paulus M. P., Cardenas-Iniguez C., & Aupperle R. L. (2022). Limits to the generalizability of resting-state functional magnetic resonance imaging studies of youth: An examination of ABCD Study® baseline data. Brain Imaging and Behavior, 16(4), 1919–1925. 10.1007/s11682-022-00665-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coviello C., Keunen K., Kersbergen K. J., Groenendaal F., Leemans A., Peels B., Isgum I., Viergever M. A., de Vries L. S., & Buonocore G. (2018). Effects of early nutrition and growth on brain volumes, white matter microstructure, and neurodevelopmental outcome in preterm newborns. Pediatric Research, 83(1), 102–110. [DOI] [PubMed] [Google Scholar]
- DeCasien A. R., Guma E., Liu S., & Raznahan A. (2022). Sex differences in the human brain: A roadmap for more careful analysis and interpretation of a biological reality. Biology of Sex Differences, 13(1), 43. 10.1186/s13293-022-00448-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- Desikan R. S., Ségonne F., Fischl B., Quinn B. T., Dickerson B. C., Blacker D., Buckner R. L., Dale A. M., Maguire R. P., Hyman B. T., Albert M. S., & Killiany R. J. (2006). An automated labeling system for subdividing the human cerebral cortex on MRI scans into gyral based regions of interest. NeuroImage, 31(3), 968–980. 10.1016/j.neuroimage.2006.01.021 [DOI] [PubMed] [Google Scholar]
- Edmiston E. K., & Juster R.-P. (2022). Refining Research and Representation of Sexual and Gender Diversity in Neuroscience. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging. 10.1016/j.bpsc.2022.07.007 [DOI] [PubMed] [Google Scholar]
- Eliot L., Ahmed A., Khan H., & Patel J. (2021). Dump the “dimorphism”: Comprehensive synthesis of human brain studies reveals few male-female differences beyond size. Neuroscience & Biobehavioral Reviews, 125, 667–697. 10.1016/j.neubiorev.2021.02.026 [DOI] [PubMed] [Google Scholar]
- Fassett-Carman A. N., Link to external site, this link will open in a new window, Smolker H., Hankin B. L., Snyder H. R., & Banich M. T. (2022). Major gender differences in relations between life stressor frequency and gray matter in adolescence and emerging adulthood. Developmental Psychology. 10.1037/dev0001489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fischl B., & Dale A. M. (2000). Measuring the thickness of the human cerebral cortex from magnetic resonance images. Proceedings of the National Academy of Sciences, 97(20), 11050–11055. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fish A. M., Cachia A., Fischer C., Mankiw C., Reardon P. K., Clasen L. S., Blumenthal J. D., Greenstein D., Giedd J. N., & Mangin J.-F. (2017). Influences of Brain Size, Sex, and Sex Chromosome Complement on the Architecture of Human Cortical Folding. Cerebral Cortex (New York, NY), 27(12), 5557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Franconi F., Campesi I., Colombo D., & Antonini P. (2019). Sex-Gender Variable: Methodological Recommendations for Increasing Scientific Value of Clinical Studies. Cells, 8(5), Article 5. 10.3390/cells8050476 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friedrichs K., & Kellmeyer P. (2022). Neurofeminism: Feminist critiques of research on sex/gender differences in the neurosciences. European Journal of Neuroscience, 56(11), 5987–6002. 10.1111/ejn.15834 [DOI] [PubMed] [Google Scholar]
- Garavan H., Bartsch H., Conway K., Decastro A., Goldstein R. Z., Heeringa S., Jernigan T., Potter A., Thompson W., & Zahs D. (2018). Recruiting the ABCD sample: Design considerations and procedures. Developmental Cognitive Neuroscience, 32, 16–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giorgio A., Watkins K. E., Chadwick M., James S., Winmill L., Douaud G., De Stefano N., Matthews P. M., Smith S. M., & Johansen-Berg H. (2010). Longitudinal changes in grey and white matter during adolescence. Neuroimage, 49(1), 94–103. [DOI] [PubMed] [Google Scholar]
- Giorgio A., Watkins K. E., Douaud G., James A. C., James S., De Stefano N., Matthews P. M., Smith S. M., & Johansen-Berg H. (2008). Changes in white matter microstructure during adolescence. Neuroimage, 39(1), 52–61. [DOI] [PubMed] [Google Scholar]
- Glasser M. F., Sotiropoulos S. N., Wilson J. A., Coalson T. S., Fischl B., Andersson J. L., Xu J., Jbabdi S., Webster M., & Polimeni J. R. (2013). The minimal preprocessing pipelines for the Human Connectome Project. Neuroimage, 80, 105–124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Griffiths D. A. (2018). Shifting syndromes: Sex chromosome variations and intersex classifications. Social Studies of Science, 48(1), 125–148. 10.1177/0306312718757081 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hagler D. J., Hatton S., Cornejo M. D., Makowski C., Fair D. A., Dick A. S., Sutherland M. T., Casey B. J., Barch D. M., & Harms M. P. (2019). Image processing and analysis methods for the Adolescent Brain Cognitive Development Study. Neuroimage, 202, 116091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hagler D. J. Jr, Ahmadi M. E., Kuperman J., Holland D., McDonald C. R., Halgren E., & Dale A. M. (2009). Automated white-matter tractography using a probabilistic diffusion tensor atlas: Application to temporal lobe epilepsy. Human Brain Mapping, 30(5), 1535–1547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heeringa S. G., & Berglund P. A. (2020). A guide for population-based analysis of the Adolescent Brain Cognitive Development (ABCD) Study baseline data. BioRxiv. [Google Scholar]
- Heidari S., Babor T. F., De Castro P., Tort S., & Curno M. (2016). Sex and Gender Equity in Research: Rationale for the SAGER guidelines and recommended use. Research Integrity and Peer Review, 1(1), 2. 10.1186/s41073-016-0007-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herting M. M., Gautam P., Spielberg J. M., Dahl R. E., & Sowell E. R. (2015). A Longitudinal Study: Changes in Cortical Thickness and Surface Area during Pubertal Maturation. PLOS ONE, 10(3), e0119774. 10.1371/journal.pone.0119774 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herting M. M., Maxwell E. C., Irvine C., & Nagel B. J. (2012). The Impact of Sex, Puberty, and Hormones on White Matter Microstructure in Adolescents. Cerebral Cortex, 22(9), 1979–1992. 10.1093/cercor/bhr246 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herting M. M., Uban K. A., Gonzalez M. R., Baker F. C., Kan E. C., Thompson W. K., Granger D. A., Albaugh M. D., Anokhin A. P., Bagot K. S., Banich M. T., Barch D. M., Baskin-Sommers A., Breslin F. J., Casey B. J., Chaarani B., Chang L., Clark D. B., Cloak C. C., … Sowell E. R. (2021). Correspondence Between Perceived Pubertal Development and Hormone Levels in 9–10 Year-Olds From the Adolescent Brain Cognitive Development Study. Frontiers in Endocrinology, 11. 10.3389/fendo.2020.549928 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hines M. (2020). Neuroscience and Sex/Gender: Looking Back and Forward. Journal of Neuroscience, 40(1), 37–43. 10.1523/JNEUROSCI.0750-19.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoekzema E., Schagen S. E. E., Kreukels B. P. C., Veltman D. J., Cohen-Kettenis P. T., Delemarre-van de Waal H., & Bakker J. (2015). Regional volumes and spatial volumetric distribution of gray matter in the gender dysphoric brain. Psychoneuroendocrinology, 55, 59–71. 10.1016/j.psyneuen.2015.01.016 [DOI] [PubMed] [Google Scholar]
- Hoffman R. M., & Borders L. D. (2001). Twenty-five years after the Bem Sex-Role Inventory: A reassessment and new issues regarding classification variability. Measurement and Evaluation in Counseling and Development, 34(1), 39–55. [Google Scholar]
- Hyde J. S., Bigler R. S., Joel D., Tate C. C., & van Anders S. M. (2018). The Future of Sex and Gender in Psychology: Five Challenges to the Gender Binary. [DOI] [PubMed]
- Jacobson R., & Joel D. (2018). An Exploration of the Relations Between Self-Reported Gender Identity and Sexual Orientation in an Online Sample of Cisgender Individuals. Archives of Sexual Behavior, 47(8), 2407–2426. 10.1007/s10508-018-1239-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joel D. (2011). Male or Female? Brains are Intersex. Frontiers in Integrative Neuroscience, 5. 10.3389/fnint.2011.00057 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joel D., Berman Z., Tavor I., Wexler N., Gaber O., Stein Y., Shefi N., Pool J., Urchs S., Margulies D. S., Liem F., Hänggi J., Jäncke L., & Assaf Y. (2015). Sex beyond the genitalia: The human brain mosaic. Proceedings of the National Academy of Sciences, 112(50), 15468–15473. 10.1073/pnas.1509654112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joel D., Tarrasch R., Berman Z., Mukamel M., & Ziv E. (2014). Queering gender: Studying gender identity in ‘normative’ individuals. Psychology & Sexuality, 5(4), 291–321. 10.1080/19419899.2013.830640 [DOI] [Google Scholar]
- Kaczkurkin A. N., Raznahan A., & Satterthwaite T. D. (2019). Sex differences in the developing brain: Insights from multimodal neuroimaging. Neuropsychopharmacology, 44(1), 71–85. 10.1038/s41386-018-0111-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karch J. D., Filevich E., Wenger E., Lisofsky N., Becker M., Butler O., Mårtensson J., Lindenberger U., Brandmaier A. M., & Kühn S. (2019). Identifying predictors of within-person variance in MRI-based brain volume estimates. NeuroImage, 200, 575–589. [DOI] [PubMed] [Google Scholar]
- Kim K., Joo Y., Ahn G., & Cha J. (2021). Sex, Brains, and Individuals: Machine—Learned Brain Sex Explains Individual-Specific Cognitive Development and its Genetic Factors in Prepuberty Children. Biological Psychiatry, 89(9), S372–S373. [Google Scholar]
- Kim K., Joo Y. Y., Ahn G., Wang H.-H., Moon S.-Y., Kim H., Ahn W.-Y., & Cha J. (2022). The sexual brain, genes, and cognition: A machine-predicted brain sex score explains individual differences in cognitive intelligence and genetic influence in young children. Human Brain Mapping. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kolb B., & Gibb R. (2011). Brain Plasticity and Behaviour in the Developing Brain. Journal of the Canadian Academy of Child and Adolescent Psychiatry, 20(4), 265–276. [PMC free article] [PubMed] [Google Scholar]
- Kranz G. S., Seiger R., Kaufmann U., Hummer A., Hahn A., Ganger S., Tik M., Windischberger C., Kasper S., & Lanzenberger R. (2017). Effects of sex hormone treatment on white matter microstructure in individuals with gender dysphoria. NeuroImage, 150, 60–67. 10.1016/j.neuroimage.2017.02.027 [DOI] [PubMed] [Google Scholar]
- Kreft I. G., & De Leeuw J. (1998). Introducing multilevel modeling. Sage. [Google Scholar]
- Kreukels B. P., & Guillamon A. (2016). Neuroimaging studies in people with gender incongruence. International Review of Psychiatry, 28(1), 120–128. [DOI] [PubMed] [Google Scholar]
- Kurth F., Gaser C., Sánchez F. J., & Luders E. (2022). Brain Sex in Transgender Women Is Shifted towards Gender Identity. Journal of Clinical Medicine, 11(6), 1582. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lawrence K. E., Abaryan Z., Laltoo E., Hernandez L. M., Gandal M., McCracken J. T., & Thompson P. M. (2021). White matter microstructure shows sex differences in late childhood: Evidence from 6,797 children. BioRxiv. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lebel C., Walker L., Leemans A., Phillips L., & Beaulieu C. (2008). Microstructural maturation of the human brain from childhood to adulthood. Neuroimage, 40(3), 1044–1055. [DOI] [PubMed] [Google Scholar]
- Levin R. N., Erickson-Schroth L., Mak K., & Edmiston E. K. (2022). Biological studies of transgender identity: A critical review. Journal of Gay & Lesbian Mental Health, 1–30. [Google Scholar]
- Levman J., MacDonald P., Lim A. R., Forgeron C., & Takahashi E. (2017). A pediatric structural MRI analysis of healthy brain development from newborns to young adults. Human Brain Mapping, 38(12), 5931–5942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li G., Wang L., Shi F., Lyall A. E., Lin W., Gilmore J. H., & Shen D. (2014). Mapping Longitudinal Development of Local Cortical Gyrification in Infants from Birth to 2 Years of Age. The Journal of Neuroscience, 34(12), 4228–4238. 10.1523/JNEUROSCI.3976-13.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y., Thompson W. K., Reuter C., Nillo R., Jernigan T., Dale A., Sugrue L. P., Brown J., Dougherty R. F., & Rauschecker A. (2021). Rates of incidental findings in brain magnetic resonance imaging in children. JAMA Neurology, 78(5), 578–587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindroth H., Nair V. A., Stanfield C., Casey C., Mohanty R., Wayer D., Rowley P., Brown R., Prabhakaran V., & Sanders R. D. (2019). Examining the identification of age-related atrophy between T1 and T1 + T2-FLAIR cortical thickness measurements. Scientific Reports, 9, 11288. 10.1038/s41598-019-47294-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Llaveria Caselles E. (2021). Epistemic Injustice in Brain Studies of (Trans)Gender Identity. Frontiers in Sociology, 6, 63. 10.3389/fsoc.2021.608328 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loso H. M., Dube S. L., Chaarani B., Ivanova M., Garavan H., Johns M. M., & Potter A. S. (2023). Associations Between Gender Nonconformity, School Environments, Family Conflict, and Emotional and Behavioral Health Among Children Ages 10–11. Journal of Adolescent Health. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mallard T. T., Liu S., Seidlitz J., Ma Z., Moraczewski D., Thomas A., & Raznahan A. (2021). X-chromosome influences on neuroanatomical variation in humans. Nature Neuroscience, 24(9), 1216–1224. [DOI] [PubMed] [Google Scholar]
- Marwha D., Halari M., & Eliot L. (2017). Meta-analysis reveals a lack of sexual dimorphism in human amygdala volume. Neuroimage, 147, 282–294. [DOI] [PubMed] [Google Scholar]
- Mauvais-Jarvis F., Bairey Merz N., Barnes P. J., Brinton R. D., Carrero J.-J., DeMeo D. L., De Vries G. J., Epperson C. N., Govindan R., Klein S. L., Lonardo A., Maki P. M., McCullough L. D., Regitz-Zagrosek V., Regensteiner J. G., Rubin J. B., Sandberg K., & Suzuki A. (2020). Sex and gender: Modifiers of health, disease, and medicine. The Lancet, 396(10250), 565–582. 10.1016/S0140-6736(20)31561-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mills K. L., Lalonde F., Clasen L. S., Giedd J. N., & Blakemore S.-J. (2014). Developmental changes in the structure of the social brain in late childhood and adolescence. Social Cognitive and Affective Neuroscience, 9(1), 123–131. 10.1093/scan/nss113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Misaki M., Savitz J., Zotev V., Phillips R., Yuan H., Young K. D., Drevets W. C., & Bodurka J. (2015). Contrast Enhancement by Combining T1- and T2-weighted Structural Brain MR Images. Magnetic Resonance in Medicine, 74(6), 1609–1620. 10.1002/mrm.25560 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Murray S. B., Diaz-Fong J. P., Duval C. J., Balkchyan A. A., Nagata J. M., Lee D. J., Ganson K. T., Toga A. W., Siegel S. J., & Jann K. (2022). Sex differences in regional gray matter density in pre-adolescent binge eating disorder: A voxel-based morphometry study. Psychological Medicine, 1–13. [DOI] [PubMed] [Google Scholar]
- Nakamura K., Brown R. A., Narayanan S., Collins D. L., Arnold D. L., & Initiative A. D. N. (2015). Diurnal fluctuations in brain volume: Statistical analyses of MRI from large populations. Neuroimage, 118, 126–132. [DOI] [PubMed] [Google Scholar]
- National Academies of Sciences, E. (2022). Measuring Sex, Gender Identity, and Sexual Orientation (Becker T., Ed.; p. 200). The National Academies Press. https://www.nap.edu/catalog/26424/measuring-sex-gender-identity-and-sexual-orientation [PubMed] [Google Scholar]
- Nguyen T.-V. (2018). Developmental effects of androgens in the human brain. Journal of Neuroendocrinology, 30(2), e12486. 10.1111/jne.12486 [DOI] [PubMed] [Google Scholar]
- Nielsen M. W., Bloch C. W., & Schiebinger L. (2018). Making gender diversity work for scientific discovery and innovation. Nature Human Behaviour, 2(10), 726–734. [DOI] [PubMed] [Google Scholar]
- Nketia J., Amso D., & Brito N. H. (2021). Towards a more inclusive and equitable developmental cognitive neuroscience. Developmental Cognitive Neuroscience, 52, 101014. 10.1016/j.dcn.2021.101014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nota N. M., Kreukels B. P. C., den Heijer M., Veltman D. J., Cohen-Kettenis P. T., Burke S. M., & Bakker J. (2017). Brain functional connectivity patterns in children and adolescents with gender dysphoria: Sex-atypical or not? Psychoneuroendocrinology, 86, 187–195. 10.1016/j.psyneuen.2017.09.014 [DOI] [PubMed] [Google Scholar]
- O’Donnell L. J., & Westin C.-F. (2011). An introduction to diffusion tensor image analysis. Neurosurgery Clinics, 22(2), 185–196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ou X., Andres A., Pivik R. T., Cleves M. A., & Badger T. M. (2015). Brain gray and white matter differences in healthy normal weight and obese children. Journal of Magnetic Resonance Imaging, 42(5), 1205–1213. [DOI] [PubMed] [Google Scholar]
- Owens M. M., Potter A., Hyatt C. S., Albaugh M., Thompson W. K., Jernigan T., Yuan D., Hahn S., Allgaier N., & Garavan H. (2021). Recalibrating expectations about effect size: A multi-method survey of effect sizes in the ABCD study. PLOS ONE, 16(9), e0257535. 10.1371/journal.pone.0257535 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pelletier R., Ditto B., & Pilote L. (2015). A composite measure of gender and its association with risk factors in patients with premature acute coronary syndrome. Psychosomatic Medicine, 77(5), 517–526. [DOI] [PubMed] [Google Scholar]
- Persson S., & Pownall M. (2021). Can open science be a tool to dismantle claims of hardwired brain sex differences? Opportunities and challenges for feminist researchers. Psychology of Women Quarterly, 45(4), 493–504. [Google Scholar]
- Petersen A. C., Crockett L., Richards M., & Boxer A. (1988). A self-report measure of pubertal status: Reliability, validity, and initial norms. Journal of Youth and Adolescence, 17(2), 117–133. [DOI] [PubMed] [Google Scholar]
- Pletzer B., Harris T., & Hidalgo-Lopez E. (2018). Subcortical structural changes along the menstrual cycle: Beyond the hippocampus. Scientific Reports, 8(1), Article 1. 10.1038/s41598-018-34247-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Potter A., Dube S., Allgaier N., Loso H., Ivanova M., Barrios L. C., Bookheimer S., Chaarani B., Dumas J., Feldstein-Ewing S., Freedman E. G., Garavan H., Hoffman E., McGlade E., Robin L., & Johns M. M. (2021). Early adolescent gender diversity and mental health in the Adolescent Brain Cognitive Development study. Journal of Child Psychology and Psychiatry, 62(2), 171–179. 10.1111/jcpp.13248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Potter A. S., Dube S. L., Barrios L. C., Bookheimer S., Espinoza A., Ewing S. W. F., Freedman E. G., Hoffman E. A., Ivanova M., & Jefferys H. (2022). Measurement of gender and sexuality in the Adolescent Brain Cognitive Development (ABCD) study. Developmental Cognitive Neuroscience, 53, 101057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team. (2019). R: A language and environment for statistical computing. [Computer software]. R Foundation for Statistical Computing. https://www.R-project.org/ [Google Scholar]
- Rakesh D., Zalesky A., & Whittle S. (2022). Assessment of Parent Income and Education, Neighborhood Disadvantage, and Child Brain Structure. JAMA Network Open, 5(8), e2226208. 10.1001/jamanetworkopen.2022.26208 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rauch J. M., & Eliot L. (2022). Breaking the binary: Gender versus sex analysis in human brain imaging. NeuroImage, 264, 119732. 10.1016/j.neuroimage.2022.119732 [DOI] [PubMed] [Google Scholar]
- Raznahan A., & Disteche C. M. (2021). X-chromosome regulation and sex differences in brain anatomy. Neuroscience & Biobehavioral Reviews, 120, 28–47. 10.1016/j.neubiorev.2020.10.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raznahan A., Lee N. R., Greenstein D., Wallace G. L., Blumenthal J. D., Clasen L. S., & Giedd J. N. (2016). Globally Divergent but Locally Convergent X- and Y-Chromosome Influences on Cortical Development. Cerebral Cortex, 26(1), 70–79. 10.1093/cercor/bhu174 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raznahan A., Lee Y., Stidd R., Long R., Greenstein D., Clasen L., Addington A., Gogtay N., Rapoport J. L., & Giedd J. N. (2010). Longitudinally mapping the influence of sex and androgen signaling on the dynamics of human cortical maturation in adolescence. Proceedings of the National Academy of Sciences, 107(39), 16988–16993. 10.1073/pnas.1006025107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raznahan A., Shaw P., Lalonde F., Stockman M., Wallace G. L., Greenstein D., Clasen L., Gogtay N., & Giedd J. N. (2011). How Does Your Cortex Grow? Journal of Neuroscience, 31(19), 7174–7177. 10.1523/JNEUROSCI.0054-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rippon G., Jordan-Young R., Kaiser A., & Fine C. (2014). Recommendations for sex/gender neuroimaging research: Key principles and implications for research design, analysis, and interpretation. Frontiers in Human Neuroscience, 8, 650. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ross-Reed D. E., Reno J., Peñaloza L., Green D., & FitzGerald C. (2019). Family, school, and peer support are associated with rates of violence victimization and self-harm among gender minority and cisgender youth. Journal of Adolescent Health, 65(6), 776–783. [DOI] [PubMed] [Google Scholar]
- Schaer M., Cuadra M. B., Schmansky N., Fischl B., Thiran J.-P., & Eliez S. (2012). How to measure cortical folding from MR images: A step-by-step tutorial to compute local gyrification index. JoVE (Journal of Visualized Experiments), 59, e3417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sepehrband F., Lynch K. M., Cabeen R. P., Gonzalez-Zacarias C., Zhao L., D’Arcy M., Kesselman C., Herting M. M., Dinov I. D., & Toga A. W. (2018). Neuroanatomical morphometric characterization of sex differences in youth using statistical learning. Neuroimage, 172, 217–227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seunarine K. K., Clayden J. D., Jentschke S., Munoz M., Cooper J. M., Chadwick M. J., Banks T., Vargha-Khadem F., & Clark C. A. (2016). Sexual dimorphism in white matter developmental trajectories using tract-based spatial statistics. Brain Connectivity, 6(1), 37–47. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sigurdardottir H. L., Lanzenberger R., & Kranz G. S. (2020). Genetics of sex differences in neuroanatomy and function. Handbook of Clinical Neurology, 175, 179–193. [DOI] [PubMed] [Google Scholar]
- Skorska M. N., Chavez S., Devenyi G. A., Patel R., Thurston L. T., Lai M.-C., Zucker K. J., Chakravarty M. M., Lobaugh N. J., & VanderLaan D. P. (2021). A multi-modal MRI analysis of cortical structure in relation to gender dysphoria, sexual orientation, and age in adolescents. Journal of Clinical Medicine, 10(2), 345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soleman R. S., Schagen S. E., Veltman D. J., Kreukels B. P., Cohen-Kettenis P. T., Lambalk C. B., Wouters F., & Delemarre-van de Waal H. A. (2013). Sex differences in verbal fluency during adolescence: A functional magnetic resonance imaging study in gender dysphoric and control boys and girls. The Journal of Sexual Medicine, 10(8), 1969–1977. [DOI] [PubMed] [Google Scholar]
- Staphorsius A. S., Kreukels B. P., Cohen-Kettenis P. T., Veltman D. J., Burke S. M., Schagen S. E., Wouters F. M., Delemarre-van De Waal H. A., & Bakker J. (2015). Puberty suppression and executive functioning: An fMRI-study in adolescents with gender dysphoria. Psychoneuroendocrinology, 56, 190–199. [DOI] [PubMed] [Google Scholar]
- Szulc-Lerch K. U., Timmons B. W., Bouffet E., Laughlin S., de Medeiros C. B., Skocic J., Lerch J. P., & Mabbott D. J. (2018). Repairing the brain with physical exercise: Cortical thickness and brain volume increases in long-term pediatric brain tumor survivors in response to a structured exercise intervention. NeuroImage: Clinical, 18, 972–985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tan A., Ma W., Vira A., Marwha D., & Eliot L. (2016). The human hippocampus is not sexually-dimorphic: Meta-analysis of structural MRI volumes. Neuroimage, 124, 350–366. [DOI] [PubMed] [Google Scholar]
- Tan K. K., Treharne G. J., Ellis S. J., Schmidt J. M., & Veale J. F. (2020). Gender minority stress: A critical review. Journal of Homosexuality, 67(10), 1471–1489. [DOI] [PubMed] [Google Scholar]
- Tannenbaum C., Ellis R. P., Eyssel F., Zou J., & Schiebinger L. (2019). Sex and gender analysis improves science and engineering. Nature, 575(7781), 137–146. [DOI] [PubMed] [Google Scholar]
- Testa R. J., Habarth J., Peta J., Balsam K., & Bockting W. (2015). Development of the Gender Minority Stress and Resilience Measure. Psychology of Sexual Orientation and Gender Diversity, 2(1). [Google Scholar]
- Thijssen S., Collins P. F., & Luciana M. (2020). Pubertal development mediates the association between family environment and brain structure and function in childhood. Development and Psychopathology, 32(2), 687–702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tomasi D., & Volkow N. D. (2023). Measures of Brain Connectivity and Cognition by Sex in US Children. JAMA Network Open, 6(2), e230157. 10.1001/jamanetworkopen.2023.0157 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trefler A., Sadeghi N., Thomas A. G., Pierpaoli C., Baker C. I., & Thomas C. (2016). Impact of time-of-day on brain morphometric measures derived from T1-weighted magnetic resonance imaging. NeuroImage, 133, 41–52. 10.1016/j.neuroimage.2016.02.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uban K. A., Horton M. K., Jacobus J., Heyser C., Thompson W. K., Tapert S. F., Madden P. A. F., & Sowell E. R. (2018). Biospecimens and the ABCD study: Rationale, methods of collection, measurement and early data. Developmental Cognitive Neuroscience, 32, 97–106. 10.1016/j.dcn.2018.03.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Urben S., Van Hanswijck De Jonge L, Barisnikov K., Pizzo R., Monnier M., Lazeyras F., Borradori Tolsa C., & Hüppi P. S. (2017). Gestational age and gender influence on executive control and its related neural structures in preterm-born children at 6 years of age. Child Neuropsychology, 23(2), 188–207. [DOI] [PubMed] [Google Scholar]
- van Anders S. M. (2022). Gender/sex/ual diversity and biobehavioral research. Psychology of Sexual Orientation and Gender Diversity. [Google Scholar]
- Vijayakumar N., Allen N. B., Youssef G., Dennison M., Yücel M., Simmons J. G., & Whittle S. (2016). Brain development during adolescence: A mixed-longitudinal investigation of cortical thickness, surface area, and volume. Human Brain Mapping, 37(6), 2027–2038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vijayakumar N., Youssef G., Allen N. B., Anderson V., Efron D., Mundy L., Patton G., Simmons J. G., Silk T., & Whittle S. (2021). The effects of puberty and its hormones on subcortical brain development. Comprehensive Psychoneuroendocrinology, 7, 100074. 10.1016/j.cpnec.2021.100074 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Volkow N. D., Koob G. F., Croyle R. T., Bianchi D. W., Gordon J. A., Koroshetz W. J., Pérez-Stable E. J., Riley W. T., Bloch M. H., Conway K., Deeds B. G., Dowling G. J., Grant S., Howlett K. D., Matochik J. A., Morgan G. D., Murray M. M., Noronha A., Spong C. Y., … Weiss S. R. B. (2018). The conception of the ABCD study: From substance use to a broad NIH collaboration. Developmental Cognitive Neuroscience, 32, 4–7. 10.1016/j.dcn.2017.10.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Werchan D. M., & Amso D. (2017). A novel ecological account of prefrontal cortex functional development. Psychological Review, 124(6), 720. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wierenga L. M., Sexton J. A., Laake P., Giedd J. N., Tamnes C. K., & Pediatric Imaging N. (2018). A key characteristic of sex differences in the developing brain: Greater variability in brain structure of boys than girls. Cerebral Cortex, 28(8), 2741–2751. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wiglesworth A., Fiecas M. B., Xu M., Neher A. T., Padilla L., Carosella K. A., Roediger D. J., Mueller B. A., Luciana M., & Klimes-Dougan B. (2023). Sex and age variations in the impacts of puberty on cortical thickness and associations with internalizing symptoms and suicidal ideation in early adolescence. Developmental Cognitive Neuroscience, 101195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wood J. L., Murko V., & Nopoulos P. (2008). Ventral frontal cortex in children: Morphology, social cognition and femininity/masculinity. Social Cognitive and Affective Neuroscience, 3(2), 168–176. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xerxa Y., White T., Busa S., Trasande L., Hillegers M. H. J., Jaddoe V. W., Castellanos F. X., & Ghassabian A. (2023). Gender Diversity and Brain Morphology Among Adolescents. JAMA Network Open, 6(5), e2313139. 10.1001/jamanetworkopen.2023.13139 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zitzmann M. (2009). The role of the CAG repeat androgen receptor polymorphism in andrology. Advances in the Management of Testosterone Deficiency, 37, 52–61. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data from the ABCD Study Data Release is available for access through the NIMH Data Archive (NDA 3.0 data release 2020: https://dx.doi.org/10.15154/1520591; NDA 4.0 data release 2021; https://dx.doi.org/10.15154/1523041). Only researchers with an approved NDA Data Use Certification (DUC) can obtain ABCD Study data.