Skip to main content
Science Advances logoLink to Science Advances
. 2022 May 18;8(20):eabm2923. doi: 10.1126/sciadv.abm2923

Human brain anatomy reflects separable genetic and environmental components of socioeconomic status

Hyeokmoon Kweon 1, Gökhan Aydogan 2, Alain Dagher 3, Danilo Bzdok 3,4,5,6, Christian C Ruff 2, Gideon Nave 7, Martha J Farah 8,*, Philipp D Koellinger 1,9,*
PMCID: PMC9116589  PMID: 35584223

Abstract

Socioeconomic status (SES) correlates with brain structure, a relation of interest given the long-observed relations of SES to cognitive abilities and health. Yet, major questions remain open, in particular, the pattern of causality that underlies this relation. In an unprecedently large study, here, we assess genetic and environmental contributions to SES differences in neuroanatomy. We first establish robust SES–gray matter relations across a number of brain regions, cortical and subcortical. These regional correlates are parsed into predominantly genetic factors and those potentially due to the environment. We show that genetic effects are stronger in some areas (prefrontal cortex, insula) than others. In areas showing less genetic effect (cerebellum, lateral temporal), environmental factors are likely to be influential. Our results imply a complex interplay of genetic and environmental factors that influence the SES-brain relation and may eventually provide insights relevant to policy.


Socioeconomic status is linked with brain anatomy through a regionally varying balance of genetic and environmental influences.

INTRODUCTION

Socioeconomic status (SES), typically measured by income, education, occupation, and neighborhood quality, is a powerful predictor of important life outcomes including physical and mental health, academic achievement, and cognitive abilities (15). The brain plays a central role not only in these relations, most obviously in mental health and intellectual capabilities, but also in physical health through neuroendocrine and inflammatory pathways (6, 7). Thus, neuroscience provides a window on the biosocial pathways linking SES and human health and capabilities.

Neuroscience research on SES has revealed a generally positive relation with overall brain volume, as well as with regional cortical and subcortical volumes and cortical surface areas (810). We note variability across studies in the regions most associated with SES, which may be due, in part, to the relatively small samples studied, to differences in the ways SES has been measured and analyzed (e.g., choices of covariates) (10, 11), and to different environments with different levels of assistance to individuals of low SES (12, 13). One of the goals of the present study is to establish the relation of SES to regional gray matter volumes (GMVs) in the largest sample so far examined for voxel-level data, using a comprehensive measure of SES and controls for a number of potential confounds, based on a well-powered, preregistered analysis plan.

The second goal of the study is to differentiate genetic from environmental causes of the SES-GMV relation. The role of genes and environment in various outcomes associated with SES has been debated for decades and has provoked controversy in part because of perceived implications for policy (14).

Here, we pursue these two goals using data from the UK Biobank (UKB), a large-scale prospective epidemiological study of individuals aged 40 to 69 years at recruitment (15, 16). We conducted voxel-based morphometry (VBM) analysis to investigate GMV associations with SES, which was measured by a rich set of SES indicators. To probe the genetic basis of the SES-GMV relation, we constructed a polygenic index of SES from multiple genome-wide association study (GWAS) results (effective N = 849,744), which included a large-scale meta-analysis of educational attainment (17). We then examined to which extent the estimated SES-GMV associations can be attributed to the shared common genetic architectures of SES.

RESULTS

After selecting participants who had undergone both magnetic resonance imaging (MRI) and genotyping, and had complete SES information related to occupation, income, education, and neighborhood quality, we excluded participants with clinical diagnoses related to brain pathology, morbid obesity, heavy alcohol drinking, or low data quality. The resulting sample was 23,931 individuals, with a mean age of 62, 57% of whom were female. This sample size provides 90% statistical power to detect effects as small as R2 > 0.17% at the 5% significance level (corrected for multiple testing by permutation testing; uncorrected P < 2.19 × 10−6; see section S4.1). T1 images were preprocessed with Computational Anatomy Toolbox (CAT) 12, and anatomical regions were labeled according to the Neuromorphometrics atlas.

SES was represented in the analyses to follow by two summary measures, derived from available SES variables using a generalized version of principal components analysis (PCA; Fig. 1 and fig. S2). This approach better accommodates measurement error and allows us to appreciate the multidimensional nature of SES with just two components. PC1SES mainly captures the positive correlations between the different SES measures and is most strongly influenced by occupations, occupational wages, and education. PC2SES primarily reflects occupations and neighborhood qualities that are not strongly linked with educational attainment or income, e.g., individuals who live in relatively poor neighborhoods despite having high educational attainment. As shown later, PC2SES contributes to capturing nongenetic variation in SES.

Fig. 1. Measures of SES and PCA.

Fig. 1.

(A) On the left, a distance correlation matrix is plotted for seven indices of SES. On the right, the squared loadings for each PC are indicated. (B) Scatter plots of the first PC (PC1SES) against the second component (PC2SES). The points in different colors represent four SES groups defined by National Statistics Socio-economic Classification, which are approximately clustered by the two PCs. On the right, the same scatter plots are presented for each SES group. The mean values of each PC are indicated for each group. The regression lines are plotted to describe that SES is more complex for the lower SES groups.

SES and GMV

We first examined the relation between total intracranial volume (TIV) and SES by regressing TIV on PC1SES and PC2SES, controlling for sex, age, genetic population structure, and a number of image-related technical covariates (see section S3.3). PC1SES is positively associated with TIV {standardized β = 0.10, P = 1.1 × 10−87, 95% confidence interval (CI) [0.09, 0.11]}, while for PC2SES the relation is statistically indistinguishable from zero (standardized β = 0.01, P = 0.14, 95% CI [−0.00, 0.02]). The two PCs together explain 1.6% of the variance of interest in TIV beyond the covariates (partial R2)—slightly higher than TIV’s relation to educational attainment (1.4%) and lower than its relation to fluid intelligence (2.6%) (18).

Next, we conducted VBM analysis to test the association of these two PCs with regional GMV across the brain using the same set of covariates. Higher SES is associated with larger GMV across the brain (Fig. 2A). In total, 89.5% of the voxels have a statistically significant association with SES at a family-wise error (FWE) rate of 5%, all of which are positive. For statistically significant voxels, the average partial R2 is 0.4% and the highest is 1.2%, with the strongest associations in the left ventral striatum and the right frontal pole. Thus, the positive relation between total brain volume and SES arises from many relatively small sources of structural variation that are widespread across the brain.

Fig. 2. Manhattan plots: VBM of GMV and SES.

Fig. 2.

(A) Univariate VBM results on the two PCs for SES. These regressions did not control for TIV. P values on a log10 scale (top) and partial R2 (bottom) are plotted for each voxel. The sign of the association is that of the first PC. The voxels were anatomically labeled according to the Neuromorphometrics atlas and grouped by the labeled regions. Within each region, the voxels were ordered by their distance to the medoid of their region. (B) Univariate VBM results with TIV controlled for.

Accordingly, when TIV is controlled for, just 8.5% of the voxels have a statistically significant association with SES and the average effect size in partial R2 is reduced by over half to 0.17% for the statistically significant voxels (see section S4.1.2). As shown in Fig. 2B, the strongest positive associations between SES and relative GMV fall in the prefrontal, insular, frontal opercular, lateral parietal, and lateral temporal regions, as well as in subcortical areas including the cerebellum, striatum, and thalamus. While SES-GMV associations are mainly driven by PC1SES, PC2SES contributes relatively more in lateral temporal, cerebellar, and ventromedial prefrontal regions than in other regions (Fig. 2B and fig. S4A).

The regions implicated in these analyses include many reported in previous studies of SES and brain structure. While the cerebellum has not often been linked to SES, this may reflect its omission from many morphometric studies [but see (19), for a study of SES and cerebellar volume specifically, with positive findings]. Conversely, hippocampus volume is often noted to correlate with SES. Although this was also found in the present study, it was not among the strongest relations.

We also explored the influence of individual aspects of SES, such as education and income, by conducting a cluster-based analysis (figs. S8 and S9) as well as VBM on each measure separately (figs. S5 and S11). The overall pattern of results is similar, with years of schooling being most strongly associated.

SES-health relations are often stronger at lower levels of SES, where more extreme deprivation may impose unique effects on health (20, 21), and this pattern is also seen in SES effects on the cortex in children (22). Stronger SES-GMV associations were found here in the lower SES participants of our sample as well (fig. S6) (23). Regionally, this is particularly apparent in the striatum (low SES, N = 15,611, max partial R2 = 0.65%, TIV adjusted; high SES, N = 8320, max partial R2 = 0.17%, TIV adjusted).

An alternative measure of the strength of the SES-GMV relation is the ability of aggregate GMV measures to predict SES. The small effect sizes for individual voxels do not imply that the association between SES and overall GMV structure is also small. To show this, we constructed brain-wide GMV scores to predict PC1SES and PC2SES via a stacked block ridge regression (24) with fivefold cross-validation. These scores predict ΔR2 = 4.9% (95% CI [4.4, 5.4]) of out-of-sample variation in PC1SES and ΔR2 = 0.5% (95% CI [0.3, 0.7]) in PC2SES (see section S4.2 for details).

Genetic and environmental components of SES-GMV relation

The second question to be addressed is the contribution of genetic and environmental influence to the SES-GMV relations reported here. We approached this by first estimating the single-nucleotide polymorphism (SNP)–based heritability of SES and brain measures as well as the pairwise genetic correlations among them, which indicated that the genetic architectures of SES and brain structure are partly overlapping (section S6.1). We then constructed a polygenic index for SES (PGISES) using the results of the GWAS. In view of the sensitivity of GWAS results to differences in ancestry, we derived the index from UKB participants of European ancestry only, excluding the scanned participants and other participants genetically related to them. The genetic data consisted of relatively common genetic variants (SNPs) with minor allele frequency ≥1%, which were related to educational attainment, occupational wages, household income, local average income, and neighborhood quality, combined using Genomic Structural Equation Modeling (SEM) (17, 25) (effective N = 849,744). PGISES is strongly associated with PC1SESR2 = 7.1%, P < 10−300) and weakly associated with PC2SESR2 = 0.02%, P = 0.03) (see section S3.4 for details). PGISES could then be used with images from participants of European ancestry (N = 20,799) to help discriminate genetic from environmental causes of GMV differences.

PGISES was then used to predict TIV (ΔR2 = 0.8%, P = 7.4 × 10−64) and GMV across the entire brain via VBM. The latter analysis revealed positive associations in widely distributed voxels (Fig. 3A, row b), with the most pronounced associations in the anterior insula, frontal operculum, prefrontal, anterior cingulate, and striatum. There is substantial overlap between the neuroanatomical correlates of SES and PGISES. Controlling for TIV, approximately 41% of the GMV voxels associated with SES are also associated with PGISES. This overlap is especially apparent in the insular and prefrontal cortices, with roughly 96 and 64% of the voxels associated with PCSES also associated with PGISES, respectively.

Fig. 3. VBM of SES and its genetic and environmental components.

Fig. 3.

(A) Univariate VBM results, with GMV as the dependent variable. Voxels significant at FWE rate of 5% level are plotted for (a) the two PCs measuring SES, (b) PGISES, and (c) SES while controlling for PGISES. (B) Percent reduction in the association between GMV and the two PCs for SES due to controlling for PGISES. (C) Percent reduction due to controlling for BMI in the residual association between GMV and the two PCs for SES after controlling for PGISES. The figures plot only voxels that had significant SES-GMV association before PGISES and BMI were controlled for. Montreal Neurological Institute (MNI) coordinates are indicated for (A) and (B). Measurement error in PGISES was adjusted for with genetic instrument variable regression for (B) and (C). The sample was restricted to individuals of European ancestry.

We then examined to which extent the shared common genetic architectures of SES and GMV account for the observed phenotypic associations by comparing TIV-adjusted regression results of GMV on SES with and without controlling for PGISES. For 13% of the voxels significantly associated with SES before PGISES is controlled for, there is a statistically significant change in at least one of the coefficients for PC1SES and PC2SES after accounting for PGISES. Controlling for PGISES reduces the SES-GMV associations across the entire brain, with the greatest reduction in the anterior insula, frontal operculum, ventrolateral prefrontal cortex, and ventral striatum of both hemispheres, consistent with VBM of PGISES mentioned earlier (Fig. 3B). When we correct for measurement error in PGISES using genetic instrumental variable (GIV) regression (26), we estimate that PGISES accounts for more than half of the SES-GMV associations for many of these regions. On average, 38% of the SES-GMV associations (min = 3%, max = 87%) can be statistically attributed to PGISES (see section S4.3 for details).

The remaining associations between GMV and SES could be either due to environmental influences on both or due to rare SNPs, structural variants (e.g., inversions and deletions), or interactions among genes (i.e., epistasis) that PGISES does not fully account for. Forty-three percent of the voxels significantly associated with SES fall into this category, remaining associated with SES after controlling for PGISES (Fig. 3A, row c). The SES-GMV association is least attenuated by genetic controls in the cerebellum and lateral temporal, lateral parietal, posterior cingulate, and primary motor regions, as well as some areas of the dorsolateral and ventromedial prefrontal cortex (vmPFC) and the thalamus. Controlling for PGISES accounts for less than 30% of the SES-GMV association in many of these regions. These results suggest that the aforementioned regions may be particularly susceptible to the influence of the socioeconomic environment. This is consistent with the relatively stronger association of PC2SES to GMV in many of these areas, as PC2SES was found to be barely heritable (see section S6.7). In summary, a substantial portion of the SES-GMV relation is attributable to known genetics, and that portion varies according to region of the brain. The remaining portion of this relation is also substantial and likely includes the effects of the environment.

Next, we sought to extend our evidence concerning environmental influences through the study of a specific environmental factor. Numerous environmental exposures are associated with SES and are plausible causal contributors to the SES-GMV relation found here. These include prenatal and childhood factors with lifelong effects, as well as adulthood exposures such as chronic life stress, nutritional status, physical exercise, environmental toxins, smoking, and other substance use. Experimental research with animals and human research with longitudinal, quasi-experimental, or experimental studies show that these are all capable of affecting the brain. On the basis of recent research with the same sample relating mid-life obesity to cognitive and brain aging (27), we chose to extend our analyses by including body mass index (BMI) as a marker for a set of behavioral factors that could mediate the SES-GMV relation, including nutrition, physical activity, and obesity, which can affect the brain through their downstream effects on blood pressure, blood lipids, glucose metabolism, and inflammation. In addition to the logical point that PGISES controls would account for genetic influences of BMI on the SES-GMV relation, there is also experimental evidence of SES affecting BMI through the environment: Increasing SES causes BMI to decrease (28).

BMI accounts for an average of 44% of the SES-GMV associations that remain after controlling for PGISES (Figs. 3C and 4). This result is not due to neurological disease associated with BMI, such as stroke or neurodegenerative disease, because neurological disease was an exclusionary criterion for our sample. The effect is particularly large in the thalamus and the cerebellum as well as the lateral temporal region and some areas of the vmPFC. Furthermore, for 91% of the voxels with significant SES-GMV association in the European ancestry sample, at least 50% of the estimated SES-GMV association can be statistically attributed to PGISES and BMI combined, with 67% on average.

Fig. 4. Genetic and environmental components in the association between SES and GMV of voxel clusters.

Fig. 4.

Associations in partial R2 between the two PCs for SES and GMV in voxel clusters attributable to PGISES and BMI. The numbers in the bars report the percent share in the SES-GMV association statistically attributable to PGISES or BMI partialled out of PGISES. The clusters were formed from the VBM results plotted in Fig. 3Aa. See table S9 for more information about the clusters. Measurement error in PGISES was adjusted for with genetic instrument variable regression. The sample was restricted to individuals of European ancestry.

We then explored the possible functional implications of the volumetric differences observed here by relating them spatially to the results of meta-analyzed functional MRI (fMRI) studies, based on NeuroQuery and 492 cognitive concepts from the Cognitive atlas knowledge base (Fig. 5 and fig. S15) (29, 30). The neuroanatomical correlates of SES are most strongly expressed in language, perceptual cognitive functions, self-monitoring, and communication with statistical significance at the false discovery rate of 5%. These functional associations of SES appear to be driven by genetic influences (PGISES), while PGISES also distinctly reflects functions related to decision-making (risk and uncertainty), altruism, and empathy as well as broader categories of concepts as shown. The regions presumed to be more environmentally susceptible (Fig. 5C) tend to relate more to functions pertaining to executive control and learning and memory, none of which, however, were statistically significant at the false discovery rate of 5%.

Fig. 5. Functional annotation of brain regions associated with SES.

Fig. 5.

A total of 492 cognitive concepts, belonging to 10 categories, were taken from Cognitive Atlas, and their predicted fMRI meta-analysis results were generated by NeuroQuery. For each concept, we computed the difference in mean chi-square between voxels statistically significant at nominal 1% level and the rest of the voxels. Then, a pseudo–T score for the difference was computed and its P value was obtained by 10,000 spatial permutations. This procedure was carried out for the VBM results for (A) two PCs of SES, (B) PGISES, and (C) SES controlling for PGISES. The P values of the top five concepts from each category were plotted on a log10 scale, where each category was ordered by the average of the top five concepts. The asterisk indicates the significance at the false discovery rate (FDR) of 5% level. See table S17 and fig. S15 for the full results.

DISCUSSION

In summary, our results show that SES is linked with brain anatomy through a regionally varying balance of genetic and environmental influences. The functions of the implicated brain regions span many cognitive and affective capacities. A measurement error–corrected polygenic index enabled us to separate regions whose correlations with SES can be partly attributed to common genetic variants, at least in individuals of European ancestry, from other regions more susceptible to environmental and behavioral exposures that correlate with SES, notably BMI. Our results suggest that brain health is more susceptible to SES-related environmental stressors in specific regions, including reduced GMV in the cerebellum among individuals with low SES.

Our study is not the first to introduce the genetic aspect in neuroscience of SES (3133). Notably, global and regional measures of cortical regions have been found to have association with a polygenic index for educational attainment (32, 33). Total surface area has also been shown to correlate independently with both parental education and a polygenic index for educational attainment (33). To our knowledge, our study is the first to show varying degrees of the genetic contribution to the relationship between SES and brain-wide regional measures, including subcortical regions. Specifically, we identified many regions that remained associated with SES even after adjusting for genetic controls (PGISES).

In an age of growing inequality and socioeconomic disparities in health, achievement, and wellbeing, understanding the neural embedding of SES has social as well as scientific relevance. Poverty and social deprivation are associated with widespread regional reductions of GMV, which the present results confirm with unprecedented certainty and anatomical specificity. A novel implication of our findings is that this association can be explained in part, but only in part, by genetic predisposition to different degrees across the brain. It has been argued that genetically caused disadvantages cannot, at present, be ameliorated by policies that improve the social and economic environment (34). However, this reasoning is invalid for at least two reasons. First, even entirely genetic conditions can be treated with environmental interventions, for example, phenylketonuria (35). Second, genetic contributions to complex behavioral outcomes such as SES are likely to work via environmental channels that can be influenced (36, 37). In particular, the variance captured by PGISES is expected to contain indirect genetic effects such as genetic nurture (38) that work via different family environments, including family-specific differences in child-rearing and neighborhood quality. An extensive note in section S5 concerns the interpretation and limitations of our results.

For policy purposes, genetic influences should not be taken as a sign of intractability (36, 39). Rather, our findings imply that biological and social factors both contribute to neural disparities and that policy interventions may influence and interact with biological factors. While it would be premature to base specific policies on our results, future research in this direction could provide insights that can be translated into targeted interventions [see (40) for an in-depth discussion]. For example, further insights into whether cognitive stimulation during early life or anti-poverty policies (13, 41, 42) reduce neural disparities would be valuable.

MATERIALS AND METHODS

Sample description

We used publicly available data from the UKB, which recruited approximately 500,000 participants from the general population of the United Kingdom (15, 43). Study participants were 40 to 69 years old at recruitment between 2006 and 2010. Our study sample originates from 40,681 individuals whose structural T1 MRI images were available in January 2020 (data field 20252). To derive voxel-level GMVs, we processed T1 images from 38,545 genotyped individuals with CAT12 for Statistical Parametric Mapping (SPM). We then applied several filters to ensure data quality and avoid spurious findings, which concern clinical diagnoses related to brain pathology, morbid obesity, heavy alcohol drinking, or low data quality. The details for these filters can be found in the Supplementary Materials.

After applying these exclusion criteria, 31,330 individuals remained in our sample. A total of 7215 individuals were further excluded due to missing data for the variables used in the analyses. To rule out that our results are influenced by shared family environments among related individuals, we also removed close relatives by randomly dropping one from each pair of siblings or parent-offsprings (see section S2 for details). Our final sample for the main analysis included N = 23,931 individuals. In the analyses that used genetic data, we included N = 20,799 individuals of European ancestry from this sample.

Measures

Brain imaging data

We extracted GMV on the voxel level from T1-weighted structural brain MRI images provided in the NIFTI (Neuroimaging Informatics Technology Initiative) format (data field 20252). The scanning and processing protocols are detailed in the UKB’s brain imaging documentation (https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf) as well as in publications (15, 44). We preprocessed the T1 images with CAT12 for SPM (www.fil.ion.ucl.ac.uk/spm/software/spm12/). A detailed description on how we derived voxel-level GMV data is available in section S3.1.

SES measures

We collected and constructed an extensive set of SES measures: occupation, occupational wages, household income, housing type, local average household income, and neighborhood SES score. We derived some of the variables by relying on external data sources or aggregating several measures. Specific definitions of these measures can be found in section S3.2.

We then reduced the dimensions of the data by extracting the first two PCs, which represented overall SES implied by the available indicators. To account for the fact that we have both noncategorical and categorical SES indicators, we used a method that is often called factorial analysis of mixed data, which is essentially a generalization of PCA that can handle such mixed data (45, 46). This method combines ordinary PCA for noncategorical data with multiple correspondence analysis for categorical data and is implemented in the R package PCAmix (47).

Control variables

As control variables, we included age, sex, TIV, genetic population structure, and a set of image-related technical covariates such as site and time of acquisition. The full description of control variables and their details can be found in section S3.3.

Polygenic index for SES

We constructed a PGI for SES (PGISES) by combining multiple GWAS results of SES indicators, which included educational attainment, occupational wages, household income, local average income, and neighborhood score. We conducted GWAS on each of these measures with the UKB participants of European ancestry, excluding those in the analysis sample of this study as well as their close relatives (up to the third degree of relatedness). We ran each GWAS with a linear mixed model, estimated by BOLT-LMM (48). See section S3.4 for more details about these GWAS.

We combined these GWAS results to represent general SES by the common factor GWAS function of Genomic SEM (25). The effective sample size of this common-factor SES GWAS amounts to 849,744 (49). We then constructed the PGI for SES for those of European ancestry in the analysis sample (N = 20,799). To adjust for the correlation between the SNPs, we used a Bayesian approach called LDpred (50, 51) with a reference panel from the Haplotype Reference Consortium (version 1.1) (52). The SNPs included in the PGISES were limited to the autosomal biallelic SNPs established by the International HapMap 3 Consortium (53), which are known to work well for phenotype predictions (17, 54). The SNPs were also filtered to ensure minor allele frequency > 0.01, imputation score > 0.7, and missing rate < 0.05. As a result, 1,020,632 SNPs were used for PGISES. PGISES was standardized to have zero mean and unit variance.

Statistical analysis

A detailed description of the analysis methods used is available in the Supplementary Materials. Only the overall summary was provided here.

Voxel-based morphometry

Our baseline analysis estimated the associations between voxel-level GMV and the two SES PCs. For each voxel, GMV was regressed on the two SES PCs along with the control variables. An F test was conducted for each voxel to test whether there was significant association between its GMV and the SES PCs jointly. We used permutation testing to correct for multiple hypothesis testing across voxels and used P = 2.193 × 10−6 as the 5% significance level corrected for FWE (see section S4.1.3 for details).

VBM with PGISES

Using PGISES, we conducted the following additional VBM analyses: (i) VBM of SES PCs only with individuals of European ancestry, (ii) VBM of PGISES, and (iii) VBM of the SES PCs controlling for PGISES. These VBMs were carried out in the same way as the baseline analysis described above. We then examined which GMV voxels are significantly associated with the SES PCs and/or PGISES. Furthermore, we tested whether there were statistically significant changes in SES-GMV associations before and after PGISES was controlled for, by using the Wald test (see section S4.3.2).

Measurement error correction for polygenic index

PGISES is a noisy proxy of true linear effects of common genetic variants that are linked to SES because GWAS estimates of individual SNP effects are obtained from finite sample sizes. The difference between the true PGI and the available PGI can be viewed as the classic measurement error, which leads to an attenuation bias in the coefficient estimate for the PGISES. We addressed this attenuation bias by using GIV regression (26). The essential idea is that the true PGISES can be recovered from a noisy PGISES(1) by using another PGISES(2) as an instrumental variable that was derived from a different GWAS sample. The crucial assumption here is that the noise in PGISES(1) and PGISES(2) is uncorrelated to each other. GIV regression can address the measurement error in PGISES to the extent that this assumption holds.

To obtain PGISES(1) and PGISES(2), we randomly split the UKB GWAS sample into two subsamples (N = 105,517 to 170,945) such that each subsample has the same male-female ratio and no individuals in one subsample are related to anyone in the other subsample with more than the third degree of relatedness. With each subsample, GWAS was run for the five numerical SES measures and the results were combined with Genomic SEM. Then, PGISES(1) and PGISES(2) were constructed from one of the two independent GWAS subsample results (see section S4.3.4).

Functional annotations

We connected our anatomical findings to known functional localizations by leveraging Cognitive Atlas and the extrapolatable meta-analysis tool NeuroQuery (29, 30). We first took the 518 cognitive concepts from Cognitive Atlas, which were categorized into 10 functional categories. Then, for each concept, we generated a meta-analyzed z-score brain map using NeuroQuery. This toolbox allows users to generate a predictive MRI-derived spatial distribution for any term, based on very large-scale meta-analyses containing mostly functional MRI studies. We excluded 26 concepts containing a term for which NeuroQuery failed to generate a brain map. As a result, 492 concepts remained. For each concept-associated brain map, we calculated the difference in mean chi-square between voxels statistically significant nominally at 1% level and the rest of voxels in the VBM results. We then computed a pseudo–T score for the difference in mean chi-square and obtained its P value from 10,000 spatial permutations of the F statistics map from the VBM. We used an approach developed by Burt et al. (55) for permutation, which allowed us to permute the volumetric brain map with subcortical regions while preserving spatial autocorrelation. We used these permutation-based P values as a summary measure to evaluate the strength of signal for a given functional concept in relation to SES (see section S4.4 for details).

Acknowledgments

This research was carried out under the auspices of the Brain Imaging and Genetics in Behavioral Research Consortium (https://big-bear-research.org/), using UK Biobank resources under application 11425. We thank J. Gee and P. Cook of the Penn Image Computing and Science Laboratory for valuable advice on image analysis. S. Hyman provided comments on an earlier draft of this manuscript.

Funding: This work made use of the Dutch National e-Infrastructure with the support of the SURF Cooperative using grant no. EINF-2327. The study was supported by funding from an ERC Consolidator Grant to P.D.K. (647648, EdGe), an NSF Faculty Early Career Development Program grant (1942917) to G.N., the School of Arts and Sciences at the University of Pennsylvania to M.J.F., a Swiss National Science Foundation grant (no. 100019L_173248) to C.C.R., and the Wisconsin Alumni Research Foundation, provided to P.D.K. by the Office of the Vice Chancellor for Research and Graduate Education at the University of Wisconsin-Madison. G.N. thanks Carlos and Rosa de la Cruz for ongoing support.

Author contributions: H.K., G.N., M.J.F., and P.D.K. designed the research plan. G.N., M.J.F., and P.D.K. oversaw the study. H.K. analyzed the data. H.K., M.J.F., and P.D.K. wrote the paper and the Supplementary Materials. G.A., A.D., D.B., and C.C.R. provided helpful advice and feedback on the study design. All authors contributed to and critically reviewed the manuscript.

Competing interests: The authors declare that they have no competing interests.

Data and materials availability: All data needed to evaluate the conclusions in the paper are present in the paper and/or the Supplementary Materials. Interactive brain maps for the baseline results are available at https://github.com/hmkweon/SES_VBM. The UK Biobank data can be provided pending a scientific review and a completed material transfer agreement. Requests for the UK Biobank data should be submitted to: www.ukbiobank.ac.uk/enable-your-research/apply-for-access. The specific data fields are provided in the Supplementary Materials. CAT12 toolbox is available at www.neuro.uni-jena.de/cat. The preregistration document, the codes for the data construction and the analyses, and the additional data used to construct SES variables are publicly available at https://osf.io/kg29c.

Supplementary Materials

This PDF file includes:

Sections S1 to S7

Figs. S1 to S19

References

Other Supplementary Material for this manuscript includes the following:

Data S1, containing tables S1 to S25

View/request a protocol for this paper from Bio-protocol.

REFERENCES AND NOTES

  • 1.Adler N. E., Stewart J., Health disparities across the lifespan: Meaning, methods, and mechanisms. Ann. N. Y. Acad. Sci. 1186, 5–23 (2010). [DOI] [PubMed] [Google Scholar]
  • 2.Sacks D. W., Stevenson B., Wolfers J., The new stylized facts about income and subjective well-being. Emotion 12, 1181–1187 (2012). [DOI] [PubMed] [Google Scholar]
  • 3.Chetty R., Stepner M., Abraham S., Lin S., Scuderi B., Turner N., Bergeron A., Cutler D., The association between income and life expectancy in the United States, 2001-2014. JAMA 315, 1750–1766 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Stringhini S., Carmeli C., Jokela M., Avendaño M., Muennig P., Guida F., Ricceri F., d’Errico A., Barros H., Bochud M., Chadeau-Hyam M., Clavel-Chapelon F., Costa G., Delpierre C., Fraga S., Goldberg M., Giles G. G., Krogh V., Kelly-Irving M., Layte R., Lasserre A. M., Marmot M. G., Preisig M., Shipley M. J., Vollenweider P., Zins M., Kawachi I., Steptoe A., Mackenbach J. P., Vineis P., Kivimäki M.; LIFEPATH consortium , Socioeconomic status and the 25 × 25 risk factors as determinants of premature mortality: A multicohort study and meta-analysis of 1.7 million men and women. Lancet 389, 1229–1237 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ridley M., Rao G., Schilbach F., Patel V., Poverty, depression, and anxiety: Causal evidence and mechanisms. Science 370, eaay0214 (2020). [DOI] [PubMed] [Google Scholar]
  • 6.McEwen B. S., Gianaros P. J., Central role of the brain in stress and adaptation: Links to socioeconomic status, health, and disease. Ann. N. Y. Acad. Sci. 1186, 190–222 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Muscatell K. A., Socioeconomic influences on brain function: Implications for health. Ann. N. Y. Acad. Sci. 1428, 14–32 (2018). [DOI] [PubMed] [Google Scholar]
  • 8.Farah M. J., The neuroscience of socioeconomic status: Correlates, causes, and consequences. Neuron 96, 56–71 (2017). [DOI] [PubMed] [Google Scholar]
  • 9.Merz E. C., Wiltshire C. A., Noble K. G., Socioeconomic inequality and the developing brain: Spotlight on language and executive function. Child Dev. Perspect. 13, 15–20 (2019). [Google Scholar]
  • 10.Noble K. G., Giebler M. A., The neuroscience of socioeconomic inequality. Curr. Opin. Behav. Sci. 36, 23–28 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Button K. S., Ioannidis J. P. A., Mokrysz C., Nosek B. A., Flint J., Robinson E. S. J., Munafò M. R., Power failure: Why small sample size undermines the reliability of neuroscience. Nat. Rev. Neurosci. 14, 365–376 (2013). [DOI] [PubMed] [Google Scholar]
  • 12.Walhovd K. B., Fjell A. M., Wang Y., Amlien I. K., Mowinckel A. M., Lindenberger U., Düzel S., Bartrés-Faz D., Ebmeier K. P., Drevon C. A., Baaré W. F. C., Ghisletta P., Johansen L. B., Kievit R. A., Henson R. N., Madsen K. S., Nyberg L., Harris J. R., Solé-Padullés C., Pudas S., Sørensen Ø., Westerhausen R., Zsoldos E., Nawijn L., Lyngstad T. H., Suri S., Penninx B., Rogeberg O. J., Brandmaier A. M., Education and income show heterogeneous relationships to lifespan brain and cognitive differences across European and US cohorts. Cereb. Cortex 32, 839–854 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Weissman D. G., Hatzenbuehler M., Cikara M., Barch D., McLaughlin K. A., Antipoverty programs mitigate socioeconomic disparities in brain structure and psychopathology among U.S. youths. PsyArXiv 10.31234/osf.io/8nhej (2021). [Google Scholar]
  • 14.R. J. Herrnstein, C. A. Murray, The Bell Curve: Intelligence and Class Structure in American Life (Free Press, 1994). [Google Scholar]
  • 15.Miller K. L., Alfaro-Almagro F., Bangerter N. K., Thomas D. L., Yacoub E., Xu J., Bartsch A. J., Jbabdi S., Sotiropoulos S. N., Andersson J. L. R., Griffanti L., Douaud G., Okell T. W., Weale P., Dragonu I., Garratt S., Hudson S., Collins R., Jenkinson M., Matthews P. M., Smith S. M., Multimodal population brain imaging in the UK Biobank prospective epidemiological study. Nat. Neurosci. 19, 1523–1536 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bycroft C., Freeman C., Petkova D., Band G., Elliott L. T., Sharp K., Motyer A., Vukcevic D., Delaneau O., O’Connell J., Cortes A., Welsh S., Young A., Effingham M., McVean G., Leslie S., Allen N., Donnelly P., Marchini J., The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Lee J. J., Wedow R., Okbay A., Kong E., Maghzian O., Zacher M., Nguyen-Viet T. A., Bowers P., Sidorenko J., Linnér R. K., Fontana M. A., Kundu T., Lee C., Li H., Li R., Royer R., Timshel P. N., Walters R. K., Willoughby E. A., Yengo L.; 23andMe Research Team; COGENT (Cognitive Genomics Consortium); Social Science Genetic Association Consortium, Alver M., Bao Y., Clark D. W., Day F. R., Furlotte N. A., Joshi P. K., Kemper K. E., Kleinman A., Langenberg C., Mägi R., Trampush J. W., Verma S. S., Wu Y., Lam M., Zhao J. H., Zheng Z., Boardman J. D., Campbell H., Freese J., Harris K. M., Hayward C., Herd P., Kumari M., Lencz T., Luan J., Malhotra A. K., Metspalu A., Milani L., Ong K. K., Perry J. R. B., Porteous D. J., Ritchie M. D., Smart M. C., Smith B. H., Tung J. Y., Wareham N. J., Wilson J. F., Beauchamp J. P., Conley D. C., Esko T., Lehrer S. F., Magnusson P. K. E., Oskarsson S., Pers T. H., Robinson M. R., Thom K., Watson C., Chabris C. F., Meyer M. N., Laibson D. I., Yang J., Johannesson M., Koellinger P. D., Turley P., Visscher P. M., Benjamin D. J., Cesarini D., Gene discovery and polygenic prediction from a genome-wide association study of educational attainment in 1.1 million individuals. Nat. Genet. 50, 1112–1121 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Nave G., Jung W. H., Linnér R. K., Kable J. W., Koellinger P. D., Are bigger brains smarter? Evidence from a large-scale preregistered study. Psychol. Sci. 30, 43–54 (2019). [DOI] [PubMed] [Google Scholar]
  • 19.Cavanagh J., Krishnadas R., Batty G. D., Burns H., Deans K. A., Ford I., McConnachie A., McGinty A., McLean J. S., Millar K., Sattar N., Shiels P. G., Tannahill C., Velupillai Y. N., Packard C. J., McLean J., Socioeconomic status and the cerebellar grey matter volume. Data from a well-characterised population sample. Cerebellum 12, 882–891 (2013). [DOI] [PubMed] [Google Scholar]
  • 20.Adler N. E., Ostrove J. M., Socioeconomic status and health: What we know and what we don’t. Ann. N. Y. Acad. Sci. 896, 3–15 (1999). [DOI] [PubMed] [Google Scholar]
  • 21.Schnittker J., Education and the changing shape of the income gradient in health. J. Health Soc. Behav. 45, 286–305 (2004). [DOI] [PubMed] [Google Scholar]
  • 22.Noble K. G., Houston S. M., Brito N. H., Bartsch H., Kan E., Kuperman J. M., Akshoomoff N., Amaral D. G., Bloss C. S., Libiger O., Schork N. J., Murray S. S., Casey B. J., Chang L., Ernst T. M., Frazier J. A., Gruen J. R., Kennedy D. N., Van Zijl P., Mostofsky S., Kaufmann W. E., Kenet T., Dale A. M., Jernigan T. L., Sowell E. R., Family income, parental education and brain structure in children and adolescents. Nat. Neurosci. 18, 773–778 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.D. Rose, D. J. Pevalin, A Researcher’s Guide to the National Statistics Socio-Economic Classification (SAGE, 2003). [Google Scholar]
  • 24.Mbatchou J., Barnard L., Backman J., Marcketta A., Kosmicki J. A., Ziyatdinov A., Benner C., O’Dushlaine C., Barber M., Boutkov B., Habegger L., Ferreira M., Baras A., Reid J., Abecasis G., Maxwell E., Marchini J., Computationally efficient whole-genome regression for quantitative and binary traits. Nat. Genet. 53, 1097–1103 (2021). [DOI] [PubMed] [Google Scholar]
  • 25.Grotzinger A. D., Rhemtulla M., de Vlaming R., Ritchie S. J., Mallard T. T., Hill W. D., Ip H. F., Marioni R. E., McIntosh A. M., Deary I. J., Koellinger P. D., Harden K. P., Nivard M. G., Tucker-Drob E. M., Genomic structural equation modelling provides insights into the multivariate genetic architecture of complex traits. Nat. Hum. Behav. 3, 513–525 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.DiPrete T. A., Burik C. A. P., Koellinger P. D., Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data. Proc. Natl. Acad. Sci. U.S.A. 115, E4970–E4979 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Morys F., Dadar M., Dagher A., Association between midlife obesity and its metabolic consequences, cerebrovascular disease, and cognitive decline. J. Clin. Endocrinol. Metab. 106, e4260–e4274 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Ludwig J., Sanbonmatsu L., Gennetian L., Adam E., Duncan G. J., Katz L. F., Kessler R. C., Kling J. R., Lindau S. T., Whitaker R. C., McDade T. W., Neighborhoods, obesity, and diabetes—A randomized social experiment. N. Engl. J. Med. 365, 1509–1519 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Poldrack R. A., Kittur A., Kalar D., Miller E., Seppa C., Gil Y., Parker D. S., Sabb F. W., Bilder R. M., The cognitive atlas: Toward a knowledge foundation for cognitive neuroscience. Front. Neuroinform. 5, 17 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Dockès J., Poldrack R. A., Primet R., Gözükan H., Yarkoni T., Suchanek F., Thirion B., Varoquaux G., NeuroQuery, comprehensive meta-analysis of human brain mapping. Elife 9, e53385 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Raffington L., Czamara D., Mohn J. J., Falck J., Schmoll V., Heim C., Binder E. B., Shing Y. L., Stable longitudinal associations of family income with children’s hippocampal volume and memory persist after controlling for polygenic scores of educational attainment. Dev. Cogn. Neurosci. 40, 100720 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mitchell B. L., Cuéllar-Partida G., Grasby K. L., Campos A. I., Strike L. T., Hwang L.-D., Okbay A., Thompson P. M., Medland S. E., Martin N. G., Wright M. J., Rentería M. E., Educational attainment polygenic scores are associated with cortical total surface area and regions important for language and memory. Neuroimage 212, 116691 (2020). [DOI] [PubMed] [Google Scholar]
  • 33.Judd N., Sauce B., Wiedenhoeft J., Tromp J., Chaarani B., Schliep A., van Noort B., Penttilä J., Grimmer Y., Insensee C., Becker A., Banaschewski T., Bokde A. L. W., Quinlan E. B., Desrivières S., Flor H., Grigis A., Gowland P., Heinz A., Ittermann B., Martinot J.-L., Martinot M.-L. P., Artiges E., Nees F., Orfanos D. P., Paus T., Poustka L., Hohmann S., Millenet S., Fröhner J. H., Smolka M. N., Walter H., Whelan R., Schumann G., Garavan H., Klingberg T., Cognitive and brain development is independently influenced by socioeconomic status and polygenic scores for educational attainment. Proc. Natl. Acad. Sci. U.S.A. 117, 12411–12418 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.C. Murray, Human Diversity: The Biology of Gender, Race, and Class (Grand Central Publishing, 2020). [Google Scholar]
  • 35.D. B. Paul, How PKU became a genetic disease, in Human Heredity in the Twentieth Century, B. Gausemeier, Ed. (Routledge, 2015), pp. 197–210. [Google Scholar]
  • 36.Jencks C., Heredity, environment, and public policy reconsidered. Am. Sociol. Rev. 45, 723–736 (1980). [PubMed] [Google Scholar]
  • 37.Harden K. P., Koellinger P. D., Using genetics for social science. Nat. Hum. Behav. 4, 567–576 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kong A., Thorleifsson G., Frigge M. L., Vilhjalmsson B. J., Young A. I., Thorgeirsson T. E., Benonisdottir S., Oddsson A., Halldorsson B. V., Masson G., Gudbjartsson D. F., Helgason A., Bjornsdottir G., Thorsteinsdottir U., Stefansson K., The nature of nurture: Effects of parental genotypes. Science 359, 424–428 (2018). [DOI] [PubMed] [Google Scholar]
  • 39.Goldberger A. S., Heritability. Economica 46, 327–347 (1979). [Google Scholar]
  • 40.Farah M. J., Socioeconomic status and the brain: Prospects for neuroscience-informed policy. Nat. Rev. Neurosci. 19, 428–438 (2018). [DOI] [PubMed] [Google Scholar]
  • 41.Farah M. J., Sternberg S., Nichols T. A., Duda J. T., Lohrenz T., Luo Y., Sonnier L., Ramey S. L., Montague R., Ramey C. T., Randomized manipulation of early cognitive experience impacts adult brain structure. J. Cogn. Neurosci. 33, 1197–1209 (2021). [DOI] [PubMed] [Google Scholar]
  • 42.Noble K. G., Magnuson K., Gennetian L. A., Duncan G. J., Yoshikawa H., Fox N. A., Halpern-Meekin S., Baby’s first years: Design of a randomized controlled trial of poverty reduction in the United States. Pediatrics 148, e2020049702 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Littlejohns T. J., Holliday J., Gibson L. M., Garratt S., Oesingmann N., Alfaro-Almagro F., Bell J. D., Boultwood C., Collins R., Conroy M. C., Crabtree N., Doherty N., Frangi A. F., Harvey N. C., Leeson P., Miller K. L., Neubauer S., Petersen S. E., Sellors J., Sheard S., Smith S. M., Sudlow C. L. M., Matthews P. M., Allen N. E., The UK Biobank imaging enhancement of 100,000 participants: Rationale, data collection, management and future directions. Nat. Commun. 11, 2624 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Alfaro-Almagro F., Jenkinson M., Bangerter N. K., Andersson J. L. R., Griffanti L., Douaud G., Sotiropoulos S. N., Jbabdi S., Hernandez-Fernandez M., Vallee E., Vidaurre D., Webster M., McCarthy P., Rorden C., Daducci A., Alexander D. C., Zhang H., Dragonu I., Matthews P. M., Miller K. L., Smith S. M., Image processing and quality control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage 166, 400–424 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hill M. O., Smith A. J. E., Principal component analysis of taxonomic data with multi-state discrete characters. Taxon 25, 249–255 (1976). [Google Scholar]
  • 46.J. Pagès, Multiple Factor Analysis by Example using R (CRC Press, 2014). [Google Scholar]
  • 47.M. Chavent, V. Kuentz-Simonet, A. Labenne, J. Saracco, Multivariate analysis of mixed data: The R package PCAmixdata. arXiv:1411.4911 [stat.CO] (18 November 2017).
  • 48.Loh P.-R., Tucker G., Bulik-Sullivan B. K., Vilhjálmsson B. J., Finucane H. K., Salem R. M., Chasman D. I., Ridker P. M., Neale B. M., Berger B., Patterson N., Price A. L., Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nat. Genet. 47, 284–290 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.T. T. Mallard, R. K. Linnér, A. Okbay, A. D. Grotzinger, R. de Vlaming, S. F. W. Meddens, E. M. Tucker-Drob, K. S. Kendler, M. C. Keller, P. D. Koellinger, K. Paige Harden, Not just one p: Multivariate GWAS of psychiatric disorders and their cardinal symptoms reveal two dimensions of cross-cutting genetic liabilities. bioRxiv 603134 [Preprint]. 8 September 2020. 10.1101/603134. [DOI]
  • 50.Vilhjálmsson B. J., Yang J., Finucane H. K., Gusev A., Lindström S., Ripke S., Genovese G., Loh P.-R., Bhatia G., Do R., Hayeck T., Won H.-H.; Schizophrenia Working Group of the Psychiatric Genomics Consortium, Discovery, Biology, and Risk of Inherited Variants in Breast Cancer (DRIVE) study, Kathiresan S., Pato M., Pato C., Tamimi R., Stahl E., Zaitlen N., Pasaniuc B., Belbin G., Kenny E. E., Schierup M. H., De Jager P., Patsopoulos N. A., McCarroll S., Daly M., Purcell S., Chasman D., Neale B., Goddard M., Visscher P. M., Kraft P., Patterson N., Price A. L., Modeling linkage disequilibrium increases accuracy of polygenic risk scores. Am. J. Hum. Genet. 97, 576–592 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Privé F., Arbel J., Vilhjálmsson B. J., LDpred2: Better, faster, stronger. Bioinformatics 36, 5424–5431 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.McCarthy S., Das S., Kretzschmar W., Delaneau O., Wood A. R., Teumer A., Kang H. M., Fuchsberger C., Danecek P., Sharp K., Luo Y., Sidore C., Kwong A., Timpson N., Koskinen S., Vrieze S., Scott L. J., Zhang H., Mahajan A., Veldink J., Peters U., Pato C., van Duijn C. M., Gillies C. E., Gandin I., Mezzavilla M., Gilly A., Cocca M., Traglia M., Angius A., Barrett J. C., Boomsma D., Branham K., Breen G., Brummett C. M., Busonero F., Campbell H., Chan A., Chen S., Chew E., Collins F. S., Corbin L. J., Smith G. D., Dedoussis G., Dorr M., Farmaki A.-E., Ferrucci L., Forer L., Fraser R. M., Gabriel S., Levy S., Groop L., Harrison T., Hattersley A., Holmen O. L., Hveem K., Kretzler M., Lee J. C., McGue M., Meitinger T., Melzer D., Min J. L., Mohlke K. L., Vincent J. B., Nauck M., Nickerson D., Palotie A., Pato M., Pirastu N., McInnis M., Richards J. B., Sala C., Salomaa V., Schlessinger D., Schoenherr S., Slagboom P. E., Small K., Spector T., Stambolian D., Tuke M., Tuomilehto J., Van den Berg L. H., Van Rheenen W., Volker U., Wijmenga C., Toniolo D., Zeggini E., Gasparini P., Sampson M. G., Wilson J. F., Frayling T., de Bakker P. I. W., Swertz M. A., McCarroll S., Kooperberg C., Dekker A., Altshuler D., Willer C., Iacono W., Ripatti S., Soranzo N., Walter K., Swaroop A., Cucca F., Anderson C. A., Myers R. M., Boehnke M., McCarthy M. I., Durbin R.; Haplotype Reference Consortium , A reference panel of 64,976 haplotypes for genotype imputation. Nat. Genet. 48, 1279–1283 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.The International HapMap 3 Consortium , Integrating common and rare genetic variation in diverse human populations. Nature 467, 52–58 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Okbay A., Beauchamp J. P., Fontana M. A., Lee J. J., Pers T. H., Rietveld C. A., Turley P., Chen G.-B., Emilsson V., Meddens S. F. W., Oskarsson S., Pickrell J. K., Thom K., Timshel P., de Vlaming R., Abdellaoui A., Ahluwalia T. S., Bacelis J., Baumbach C., Bjornsdottir G., Brandsma J. H., Concas M. P., Derringer J., Furlotte N. A., Galesloot T. E., Girotto G., Gupta R., Hall L. M., Harris S. E., Hofer E., Horikoshi M., Huffman J. E., Kaasik K., Kalafati I. P., Karlsson R., Kong A., Lahti J., van der Lee S. J., deLeeuw C., Lind P. A., Lindgren K.-O., Liu T., Mangino M., Marten J., Mihailov E., Miller M. B., van der Most P. J., Oldmeadow C., Payton A., Pervjakova N., Peyrot W. J., Qian Y., Raitakari O., Rueedi R., Salvi E., Schmidt B., Schraut K. E., Shi J., Smith A. V., Poot R. A., Pourcain B. S., Teumer A., Thorleifsson G., Verweij N., Vuckovic D., Wellmann J., Westra H.-J., Yang J., Zhao W., Zhu Z., Alizadeh B. Z., Amin N., Bakshi A., Baumeister S. E., Biino G., Bønnelykke K., Boyle P. A., Campbell H., Cappuccio F. P., Davies G., De Neve J.-E., Deloukas P., Demuth I., Ding J., Eibich P., Eisele L., Eklund N., Evans D. M., Faul J. D., Feitosa M. F., Forstner A. J., Gandin I., Gunnarsson B., Halldórsson B. V., Harris T. B., Heath A. C., Hocking L. J., Holliday E. G., Homuth G., Horan M. A., Hottenga J.-J., de Jager P. L., Joshi P. K., Jugessur A., Kaakinen M. A., Kähönen M., Kanoni S., Keltigangas-Järvinen L., Kiemeney L. A. L. M., Kolcic I., Koskinen S., Kraja A. T., Kroh M., Kutalik Z., Latvala A., Launer L. J., Lebreton M. P., Levinson D. F., Lichtenstein P., Lichtner P., Liewald D. C. M.; LifeLines Cohort Study, Loukola A., Madden P. A., Mägi R., Mäki-Opas T., Marioni R. E., Marques-Vidal P., Meddens G. A., McMahon G., Meisinger C., Meitinger T., Milaneschi Y., Milani L., Montgomery G. W., Myhre R., Nelson C. P., Nyholt D. R., Ollier W. E. R., Palotie A., Paternoster L., Pedersen N. L., Petrovic K. E., Porteous D. J., Räikkönen K., Ring S. M., Robino A., Rostapshova O., Rudan I., Rustichini A., Salomaa V., Sanders A. R., Sarin A.-P., Schmidt H., Scott R. J., Smith B. H., Smith J. A., Staessen J. A., Steinhagen-Thiessen E., Strauch K., Terracciano A., Tobin M. D., Ulivi S., Vaccargiu S., Quaye L., van Rooij F. J. A., Venturini C., Vinkhuyzen A. A. E., Völker U., Völzke H., Vonk J. M., Vozzi D., Waage J., Ware E. B., Willemsen G., Attia J. R., Bennett D. A., Berger K., Bertram L., Bisgaard H., Boomsma D. I., Borecki I. B., Bültmann U., Chabris C. F., Cucca F., Cusi D., Deary I. J., Dedoussis G. V., van Duijn C. M., Eriksson J. G., Franke B., Franke L., Gasparini P., Gejman P. V., Gieger C., Grabe H.-J., Gratten J., Groenen P. J. F., Gudnason V., van der Harst P., Hayward C., Hinds D. A., Hoffmann W., Hyppönen E., Iacono W. G., Jacobsson B., Järvelin M.-R., Jöckel K.-H., Kaprio J., Kardia S. L. R., Lehtimäki T., Lehrer S. F., Magnusson P. K. E., Martin N. G., McGue M., Metspalu A., Pendleton N., Penninx B. W. J. H., Perola M., Pirastu N., Pirastu M., Polasek O., Posthuma D., Power C., Province M. A., Samani N. J., Schlessinger D., Schmidt R., Sørensen T. I. A., Spector T. D., Stefansson K., Thorsteinsdottir U., Thurik A. R., Timpson N. J., Tiemeier H., Tung J. Y., Uitterlinden A. G., Vitart V., Vollenweider P., Weir D. R., Wilson J. F., Wright A. F., Conley D. C., Krueger R. F., Smith G. D., Hofman A., Laibson D. I., Medland S. E., Meyer M. N., Yang J., Johannesson M., Visscher P. M., Esko T., Koellinger P. D., Cesarini D., Benjamin D. J., Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539–542 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Burt J. B., Helmer M., Shinn M., Anticevic A., Murray J. D., Generative modeling of brain maps with spatial autocorrelation. Neuroimage 220, 117038 (2020). [DOI] [PubMed] [Google Scholar]
  • 56.Aydogan G., Daviet R., Karlsson Linnér R., Hare T. A., Kable J. W., Kranzler H. R., Wetherill R. R., Ruff C. C., Koellinger P. D., Nave G., Genetic underpinnings of risky behaviour relate to altered neuroanatomy. Nat. Hum. Behav. 5, 787–794 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Daviet R., Aydogan G., Jagannathan K., Spilka N., Koellinger P. D., Kranzler H. R., Nave G., Wetherill R. R., Associations between alcohol consumption and gray and white matter volumes in the UK Biobank. Nat. Commun. 13, 1175 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Anderson M. L., Multiple inference and gender differences in the effects of early intervention: A reevaluation of the abecedarian, perry preschool, and early training projects. J. Am. Stat. Assoc. 103, 1481–1495 (2008). [Google Scholar]
  • 59.Alfaro-Almagro F., McCarthy P., Afyouni S., Andersson J. L. R., Bastiani M., Miller K. L., Nichols T. E., Smith S. M., Confound modelling in UK Biobank brain imaging. Neuroimage 224, 117002 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Turley P., Walters R. K., Maghzian O., Okbay A., Lee J. J., Fontana M. A., Nguyen-Viet T. A., Wedow R., Zacher M., Furlotte N. A.; 23andMe Research Team; Social Science Genetic Association Consortium, Magnusson P., Oskarsson S., Johannesson M., Visscher P. M., Laibson D., Cesarini D., Neale B. M., Benjamin D. J., Multi-trait analysis of genome-wide association summary statistics using MTAG. Nat. Genet. 50, 229–237 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Nichols T., Hayasaka S., Controlling the familywise error rate in functional neuroimaging: A comparative review. Stat. Methods Med. Res. 12, 419–446 (2003). [DOI] [PubMed] [Google Scholar]
  • 62.Winkler A. M., Ridgway G. R., Webster M. A., Smith S. M., Nichols T. E., Permutation inference for the general linear model. Neuroimage 92, 381–397 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Freedman D., Lane D., A nonstochastic interpretation of reported significance levels. J. Bus. Econ. Stat. 1, 292–298 (1983). [Google Scholar]
  • 64.MacKinnon D. P., Krull J. L., Lockwood C. M., Equivalence of the mediation, confounding and suppression effect. Prev. Sci. 1, 173–181 (2000). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.J. M. Wooldridge, Econometric Analysis of Cross Section and Panel Data (MIT Press, 2010). [Google Scholar]
  • 66.Alexander-Bloch A. F., Shou H., Liu S., Satterthwaite T. D., Glahn D. C., Shinohara R. T., Vandekar S. N., Raznahan A., On testing for spatial correspondence between maps of human brain structure and function. Neuroimage 178, 540–551 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Greenland S., Pearl J., Robins J. M., Causal diagrams for epidemiologic research. Epidemiology 10, 37–48 (1999). [PubMed] [Google Scholar]
  • 68.Shrier I., Platt R. W., Reducing bias through directed acyclic graphs. BMC Med. Res. Methodol. 8, 70 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Tennant P. W. G., Murray E. J., Arnold K. F., Berrie L., Fox M. P., Gadd S. C., Harrison W. J., Keeble C., Ranker L. R., Textor J., Tomova G. D., Gilthorpe M. S., Ellison G. T. H., Use of directed acyclic graphs (DAGs) to identify confounders in applied health research: Review and recommendations. Int. J. Epidemiol. (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Selzam S., Ritchie S. J., Pingault J.-B., Reynolds C. A., O’Reilly P. F., Plomin R., Comparing within- and between-family polygenic score prediction. Am. J. Hum. Genet. 105, 351–363 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Yang J., Benyamin B., McEvoy B. P., Gordon S., Henders A. K., Nyholt D. R., Madden P. A., Heath A. C., Martin N. G., Montgomery G. W., Goddard M. E., Visscher P. M., Common SNPs explain a large proportion of the heritability for human height. Nat. Genet. 42, 565–569 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Lee S. H., Yang J., Goddard M. E., Visscher P. M., Wray N. R., Estimation of pleiotropy between complex diseases using single-nucleotide polymorphism-derived genomic relationships and restricted maximum likelihood. Bioinformatics 28, 2540–2542 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Evans L. M., Tahmasbi R., Vrieze S. I., Abecasis G. R., Das S., Gazal S., Bjelland D. W., de Candia T. R.; Haplotype Reference Consortium, Goddard M. E., Neale B. M., Yang J., Visscher P. M., Keller M. C., Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits. Nat. Genet. 50, 737–745 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Loh P.-R., Bhatia G., Gusev A., Finucane H. K., Bulik-Sullivan B. K., Pollack S. J.; Schizophrenia Working Group of Psychiatric Genomics Consortium, de Candia T. R., Lee S. H., Wray N. R., Kendler K. S., O’Donovan M. C., Neale B. M., Patterson N., Price A. L., Contrasting genetic architectures of schizophrenia and other complex diseases using fast variance-components analysis. Nat. Genet. 47, 1385–1392 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Collins S. E., Associations between socioeconomic factors and alcohol outcomes. Alcohol Res. 38, 83–94 (2016). [PMC free article] [PubMed] [Google Scholar]
  • 76.Fry A., Littlejohns T. J., Sudlow C., Doherty N., Adamska L., Sprosen T., Collins R., Allen N. E., Comparison of sociodemographic and health-related characteristics of UK biobank participants with those of the general population. Am. J. Epidemiol. 186, 1026–1034 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Sections S1 to S7

Figs. S1 to S19

References

Data S1, containing tables S1 to S25


Articles from Science Advances are provided here courtesy of American Association for the Advancement of Science

RESOURCES