Abstract
Recent work leveraging artificial intelligence has offered promise to dissect disease heterogeneity by identifying complex intermediate brain phenotypes, called dimensional neuroimaging endophenotypes (DNEs). We advance the argument that these DNEs capture the degree of expression of respective neuroanatomical patterns measured, offering a dimensional neuroanatomical representation for studying disease heterogeneity and similarities of neurologic and neuropsychiatric diseases. We investigate the presence of nine DNEs derived from independent yet harmonized studies on Alzheimer’s disease, autism spectrum disorder, late-life depression and schizophrenia in the UK Biobank study. Phenome-wide associations align with genome-wide associations, revealing 31 genomic loci (P < 5 × 10−8/9) associated with the nine DNEs. The nine DNEs, along with their polygenic risk scores, significantly enhanced the predictive accuracy for 14 systemic disease categories, particularly for conditions related to mental health and the central nervous system, as well as mortality outcomes. These findings underscore the potential of the nine DNEs to capture the expression of disease-related brain phenotypes in individuals of the general population and to relate such measures with genetics, lifestyle factors and chronic diseases.
Disease heterogeneity1–8 has been a critical challenge for precision medicine9. A new era powered by artificial intelligence (AI) and large-scale, multi-omics biomarkers may enable us to quantify individualized liability for various brain diseases10,11. Recent work has leveraged semi-supervised clustering methods (Fig. 1a and Supplementary Method 1) to tackle this challenge. These methods characterize disease heterogeneity by constructing a mapping or transformation from a reference group, or healthy controls, to a target group, such as patients with a specific disease. In clinical neuroscience, these methods can quantify deviation from typical brain structure measured by T1-weighted magnetic resonance imaging (MRI)1,2,12,13. They represent disease-related neuroanatomical heterogeneity with multiple low-dimensional categorical subtypes associated with specific patterns of brain change relative to the reference group. Instead of focusing on the categorical subtypes, we investigated their corresponding continuous phenotypes (that is, dimensions)14, given that many chronic brain diseases develop along a continuous spectrum. Each neuroanatomical pattern’s level of expression, therefore, serves as a dimensional AI-derived biomarker pertinent to the respective disease.
Fig. 1 |. Study workflow.

a, The concept of semi-supervised learning methods used in this study. These AI methods model the patterns and transformations from the healthy control to the patient domain, thus capturing variations related to underlying disease pathology. Nine DNEs previously published1,2,12,13 from four disease-focused, case–control studies were investigated. b, The expression of the nine DNEs in the UKBB general population. The trained models were then applied to the UKBB population to quantify the expression of the neuroanatomical patterns of the nine DNEs at individual levels; a higher DNE score indicates a greater expression (manifestation/presence) of the respective neuroanatomical pattern. For example, the blue samples express predominantly AD2, whereas the pink samples express predominantly SCZ2. The kernel density estimate for each DNE is shown. It is worth noting that AD1–2 DNEs are from the Surreal-GAN14 model and others from the HYDRA86 model, resulting in varying DNE score ranges by modelling. Overall, lower scores imply milder imaging pattern expressions. c, Phenome- and genome-wide analyses were performed on the nine DENs. PWAS were conducted to associate the nine DNEs with phenotypes across nine organ systems, cognition and lifestyle factors. GWAS were performed to investigate associations between the nine DNEs and common genetic variants (SNPs). Finally, the nine DNEs and their PRSs predicted 14 disease categories (ICD-10-based), 8 cognitive scores and mortality. CN, healthy control; PT, patient.
Previous heterogeneity research has primarily focused on within-disease heterogeneity1–8. However, this approach neglects the shared disease mechanisms, genetics and clinical manifestations among different brain diseases. Conversely, while several studies have investigated the shared genetic components across various brain diseases, they have overlooked the important aspect of disease heterogeneity within each condition15,16. As such, a broad perspective is required to investigate disease heterogeneity simultaneously, spanning multiple neurodegenerative and neuropsychiatric disorders. This holistic approach aids in understanding the commonalities and interrelationships between these brain diseases and the multi-organ systems of humans17,18. Such an effort could simultaneously capture neurobiological heterogeneity within disorders and explain shared features, mechanisms and risk factors across disorders. Ultimately, unravelling neurobiological heterogeneity within neuropsychiatric syndromes and explaining co-morbidity among them promises to accelerate more effective diagnosis, treatment and prevention strategies.
These AI-derived biomarkers capture disease-specific neuroanatomic heterogeneity. However, whether these biomarkers are present in the general population, potentially simultaneously, remains unknown. In this Article, we sought to measure the presence of multiple AI-based signatures in the general population, delineate common mechanisms among them and shed light on their relationship with other human organ systems14,19,20. To do this, we capitalized on nine imaging biomarkers recently derived from regional grey matter (GM) volumetrics derived from several large-scale disease-focused consortia, including Alzheimer’s Disease Neuroimaging Initiative (ADNI)21 for Alzheimer’s disease (AD1–2 (ref. 12)), Autism Brain Imaging Data Exchange (ABIDE)22 for autism spectrum disorder (ASD1–3 (ref. 1)), late-life depression (LLD) for late-life depression older than 60 years old (LLD1–2 (ref. 2)) and Psychosis Heterogeneity Evaluated via Dimensional Neuroimaging (PHENOM13) for schizophrenia (SCZ1–2 (ref. 13)). Using semi-supervised clustering and representation learning methodologies23, we obtained nine imaging biomarkers capturing the neuroanatomical diversity across the four distinct brain diseases. AD1 and AD2 illustrate distinct atrophy patterns in the overall brain and medial temporal lobe, respectively12. ASD1, ASD2 and ASD3 showcase reduced overall volume, expanded subcortical volume and increased cortical volume1. LLD1 and LLD2 (ref. 2) show augmented subcortical volume and overall brain atrophy, respectively. Finally, SCZ1 and SCZ2 (ref. 13) show global brain atrophy and expanded basal ganglia volume. We first conceptualized these biomarkers as the dimensional neuroimaging endophenotypes (DNE), seeking to test the endophenotype hypothesis in psychiatry24–26, which suggests that such measurable intermediate biomarkers (that is, the endophenotypes) bridge genetics and clinical symptoms and the onset of the disease. They are thought to be more closely related to the underlying aetiology and genetics than the complex clinical symptoms or the disease itself. Furthermore, the concept of dimensional representation reinforces the idea that diseases such as AD progress along a continuous spectrum. As such, categorizing disease heterogeneity into distinct subtypes overlooks the presence of co-existing patterns within the same individuals (refer to Supplementary Method 1a for details).
We evaluated the manifestation of the nine DNEs in the general population using the neuroimaging and genetic data available in the UK Biobank study27 (UKBB). The four pre-trained disease-specific AI models were applied to the 39,178 UKBB participants with both brain MRI28 and genetic29 data. To delineate the phenotypic landscape of the nine DNEs, we first tested whether the neuroanatomical patterns of the nine DNEs are present in the UKBB general population. Subsequently, we conducted a phenome-wide association study (PWAS) to establish associations between the nine DNEs and 611 UKBB phenotypes, including brain imaging-derived phenotypes (IDPs), traits related to multiple human organ systems, cognition and lifestyle factors. We performed a genome-wide association study (GWAS) linking the nine DNEs to 6,477,810 quality-checked common single-nucleotide polymorphisms (SNPs) to depict their genetic architecture. Furthermore, we conducted analyses to investigate genetic correlations, colocalization and causal relationships between the nine DNEs, nine human organ systems17 and several chronic diseases. Finally, we assessed the ability of the nine DNEs and their corresponding polygenic risk score (PRS) to predict 14 systemic disease categories, 8 cognitive scores, and mortality.
Results
Our analytic framework involves computational genomics, statistical methods and machine learning to elucidate the phenotypic landscape and genetic architecture of the nine DNEs, as illustrated in Fig. 1.
All nine DNEs are evident in the general population
We tested whether the neuroanatomical patterns defined in the four disease populations could be found in the general population. We applied the DNE models pre-trained for each disease population to the UKBB general population to measure the degree of expression of each DNE at the individual level. The imaging data from the four disease populations and the UKBB general population were first statistically harmonized via the imaging-based SysTem for AGing and NeurodeGenerative diseases (iSTAGING) consortium30 and then linearly corrected in the AI models for common confounders such as demographics to alleviate potential domain shifts31 (Supplementary Text 1).
We first summarize the neuroanatomical patterns of the nine DNEs (Fig. 2a). Overall, the original patterns identified in the disease populations1,2,12,13 manifest in the general population. AD1 shows a pattern of brain atrophy (that is, negative correlation) across various brain volumes, while AD2 involves focal atrophy of the medial temporal lobe. ASD1 captures a pattern of lower GM volumes in several subcortical regions, including the pallidum, amygdala and putamen, whereas ASD2 reflects a pattern of relatively larger GM volumes (that is, positive correlation) in subcortical regions. ASD3, conversely, is characterized by relatively larger GM volumes in several cortical areas, including the insula. LLD1 (positive correlation) and LLD2 (negative correlation) are characterized by concomitant patterns of regional GM volumes, prominently involving the middle frontal gyrus, the insula and the thalamus. For SCZ, a widespread pattern of reduced brain volumes (for example, insula) is associated with SCZ1, whereas SCZ2 shows increased volumes of the putamen and pallidum. The details of the P value, sample sizes and β values of the linear regression are presented in Supplementary Files 1 and 2.
Fig. 2 |. Phenome-wide associations of the nine DNEs.

. a, The neuroanatomical patterns of the nine DNEs were manifested in the UKBB general population and were concordant with the patterns initially derived from the original disease populations1,2,12,13. A linear regression model was applied to the 119 GM ROIs derived from T1-weighted MRI data while accounting for various covariates. We present the β coefficients of the ROIs that withstood the Bonferroni correction. Positive correlations are depicted using warm reddish colours, while cold blue colours represent negative correlations. For AD2, we showed the sagittal view to visualize the hippocampus and medial temporal lobe. Sample sizes range from 40,534 to 40,981 to derive these results. b, The nine DNEs are over-expressed (that is, a higher mean of the DNE score in the population) and under-expressed (that is, a lower mean of the DNE score) in the general population compared to the disease populations. The kernel density estimates of the nine DNEs are shown for both the training dataset (grey-coloured in patients) and the independent test dataset from the UKBB. Significant differences that survived the Bonferroni corrections between the training and independent test datasets (two-sampled t-test) are denoted with the symbol *. Sample sizes range from 38,534 to 38,981 to derive these results in UKBB and from 307 to 1,510 in the original diseased populations. c, Phenome-wide associations (PWAS) between the nine DNEs (left panel) and 611 phenotypes (middle panel) are dominated by brain phenotypic measures. The right panel shows representative phenotypes linked to multiple phenotype categories with the highest statistical significance after the Bonferroni correction (two-sided t-test P < 0.05/611 based on all 611 phenotypes tested). A thicker coloured line corresponds to a higher value of −log10(P value). The symbols ‘+’ and ‘−’ represent positive and negative correlations. OD, orientation dispersion; FA, fractional anisotropy; fMRI, functional magnetic resonance imaging; FC, functional connectivity; ICVF, intracellular volume fraction; DSST, digit symbol substitution test. Sample sizes range from 722 to 39,174 after merging the DNEs and these phenotypes. Credit: human body in c, NIAID NIH BIOART.
The nine DNEs showed different strengths of expression in the general population relative to what is seen in the disease populations from which they were derived. These differences were statistically significant except for ASD2, after correcting for multiple comparisons using the Bonferroni method (Fig. 2b). For instance, AD1, characterized by diffuse brain atrophy in the ADNI data, showed a significant under-expression (that is, a smaller mean of the DNE score) in the general population. Conversely, the subcortical atrophy pattern originally identified in ASD1 from the ABIDE data showed a significant over-expression (that is, a larger mean of the DNE score) in the participants from the general population.
We then tested whether the nine DNEs differed between the healthy control and disease groups (AD, ASD, LLD and SCZ) in the general UKBB population. Compared to the healthy control group (N = 6,390), a small proportion of patients with the four brain diseases showed significant differences (two-sample t-test) for AD1–2 (N = 23), LLD1–2 (N = 1,329) and SCZ1–2 (N = 23), except for ASD1–3 (only 6 ASD patients in this population). For example, AD2, characterized by focal medial temporal lobe atrophy, had a significantly lower mean of DNE in the healthy control (0.29 ± 0.19) compared to the AD patient groups (0.45 ± 0.27; P = 1.1 × 10−4). Detailed statistics for all nine DNEs are presented in Supplementary Table 1.
These results proved that the nine DNEs manifest in the general population and convey disease-specific information about the four brain diseases. The contrast in their expression (that is, over- and under-expression) between the disease-specific and UKBB general populations is expected and underscores their potential relevance as sub-clinical or vulnerability quantitative indices.
The nine DNEs show phenotypic associations with traits beyond the brain
To delineate their phenotypic landscape, we associated the nine DNEs with 611 phenotypes in UKBB. To avoid circularity, the PWAS did not include the 119 GM regions of interest (ROIs) derived from T1-weighted MRI, from which the nine DNEs were derived. Out of the 611 additional clinical traits spanning multiple organ systems, cognition and lifestyle factors, we discovered 1,818 significant associations after applying the Bonferroni correction (P < 0.5/611 based on all 611 phenotypes tested) (Fig. 2c and Supplementary File 3).
Of the 1,818 significant associations, 91% were related to the brain. For example, the mean intracellular volume fraction in the superior frontal–occipital fasciculus derived from the multi-shell neurite orientation dispersion and density imaging32 model was significantly associated with AD1 (β = −0.67 ± 0.02, −log10(P value) > 300). Multiple DNEs were significantly associated with the biological age gap (BAG, AI-predicted age minus chronological age) of the brain (for example, SCZ2: β = 0.19 ± 0.01, −log10(P value) = 50.10). Furthermore, 2% of the phenotypes related to the musculoskeletal system were associated with the nine DNEs. The nine DNEs were also largely associated with many phenotypes related to mental health (1%). For example, the neuroticism score was significantly associated with LLD2 (β = −1.09 × 10−2 ± 2.42 × 10−3, −log10(P value) = 5.20) (Supplementary File 3). A previous study suggests distinct neural mechanisms between older individuals with neurotic and non-neurotic depression, indicating diverse biological pathologies contributing to varying clinical presentations of LLD33.
We conducted two sensitivity analyses to validate the main PWAS results. We obtained high concordance rates in split-sample (98.03%) and sex-stratified analyses (93.98%). Detailed results can be found in Supplementary Text 2 and Supplementary Files 4 and 5 for split-sample and sex-stratified analyses.
As anticipated, 91% of the significant associations were linked to the brain, given that the DNEs were derived from regional brain volumetrics in brain disease-specific populations. However, it is noteworthy that these phenotypic associations extended beyond the brain, providing evidence for the significant associations between the brain and other organ systems. This brain–body connection was consistent with previous literature on multi-organ research using imaging and genetic data17,34,35.
Genome-wide associations identify 66 genomic loci
At the genome-wide significance level (P < 5 × 10−8), GWAS using PLINK (v2.0) for a linear regression model identified 10, 8, 5, 21, 9, 1, 3, 3 and 6 genomic loci significantly associated with AD1, AD2, ASD1, ASD2, ASD3, LLD1, LLD2, SCZ1 and SCZ2, respectively (66 in total; Fig. 3a and Supplementary File 6). At a more stringent significance level (P < 5 × 10−8/9 based on the 9 DNEs tested), 31 loci passed the Bonferroni correction, as annotated in Fig. 3a (query date, 2 June 2023, via FUMA version v1.5.4). To support the robustness of our GWAS, we estimated the intercept of linkage disequilibrium score regression (LDSC)36 and obtained intercepts of 0.9997 ± 0.0093, 1.0325 ± 0.0091, 0.9962 ± 0.0095, 1.0172 ± 0.0099, 1.0117 ± 0.0085, 1.0135 ± 0.008, 1.0162 ± 0.0094, 1.0101 ± 0.0087 and 1.0124 ± 0.0095 for the nine DNEs. All intercepts were close to 1, indicating no substantial genomic inflation in our GWASs. The Manhattan and quantile–quantile (QQ) plots of the nine GWASs are presented in Supplementary Figs. 1–9 and are also publicly available on the Multi-organ biomEDIcal sCIeNcE (MEDICINE) knowledge portal at https://labs-laboratory.com/medicine/.
Fig. 3 |. Genome-wide associations of the nine DNEs.

a, Genome-wide associations identified 66 (10, 8, 5, 21, 9, 1, 3, 3, 6 for the nine DNEs) genomic loci (P < 5 × 10−8) associated with the nine DNEs. Using the top lead SNP, we denoted each genomic loci linked to the 9 DNEs. Red * symbols indicate that the locus linkage disequilibrium has not been previously associated with any trait in the EMBL-EBI GWAS catalogue. The left legend indicates the significant SNP-based heritability (h2) for the nine DNEs; the right legend represents the SNP density of our genetic data throughout the human genome. GWAS was performed using the Genome Reference Consortium Human Build 37 (GRCh37). These GWASs included 31,976 participants of European ancestry. b, Phenome-wide association query of the previously identified genomic loci (left panel) in the EMBL-EBI GWAS catalogue (via FUMA 1.4.2) shows a brain-dominant genetic architecture. We categorized all clinical traits (middle panel) into several high-level categories linked to multiple organ systems, neurodegenerative and neuropsychiatric disorders, lifestyle factors and so on. We then show the keyword cloud plots for each category (right panel). Credit: human body in b, NIAID NIH BIOART.
All DNEs are significantly heritable (0.24 < h2 < 0.66, P < 1 × 10−10) after Bonferroni correction (Fig. 3a and Supplementary Table 2). We used the GCTA37 software (v1.94.1) to estimate h2, acknowledging that previous research has demonstrated variations in the magnitude of h2 estimates based on the choice of methods. The h2 estimates obtained through GCTA or similar methods may underestimate SNP-based heritability (h2) (that is, missing heritability), potentially due to unaccounted factors such as gene–environment interactions, epistasis and rare variant effects.
We further investigated the significant genomic loci by mapping them to protein-encoding genes and examining their functional implications through expression quantitative trait loci (eQTL) mapping. Supplementary Fig. 10 presents the regional Manhattan plot for the most significant genomic locus associated with each DNE. For example, we identified a locus associated with ASD2 (top lead SNP, rs3068507 at 20q11.21) and a neighbouring locus associated with SCZ1 (top lead SNP, rs6088962 at 20q11.21), both of which mapped to the MYLK2 gene (Supplementary Fig. 10d,h). The MYLK2 gene encodes a myosin light chain kinase primarily expressed in adult skeletal muscle.
We conducted seven sensitivity analyses to validate the main GWAS results. We obtained perfect concordant rates (100%) and similar genomic inflation factors using fastGWA38,39 (v1.94.1) for a generalized linear mixed model in the nine GWAS. Specifically, we excluded the related individuals (up to second degree) in our genetic quality check pipeline; mixed effect models such as fastGWA can detect additional genetic relatedness without excluding individuals by using a sparse genomic relationship matrix, reducing the computational burden for large-scale GWAS. This sensitivity check further supports that no substantial genomic inflation exists in our main GWASs. We also observed high concordance rates in split-sample, sex-stratified (63.26–92.54%) and longitudinal GWAS analyses (100%, N = 1,116), but the concordance rates were relatively low in non-European ancestry GWAS, independent ADNI whole-genome sequencing GWAS and six case–control GWAS40–45 of neurodegenerative and neuropsychiatric disorders from the Psychiatric Genomics Consortium (PGC)46 (Supplementary Table 3a). The sample sizes for the non-European (N = 4,783) and ADNI (N = 1,555) samples are small; the case–control GWAS from the PGC may overlook the heterogeneity within each disease. Detailed results are presented in Supplementary Text 3 for the sensitivity results, Supplementary Files 7–13 for replicated SNPs/loci and Supplementary Figs. 1–9 for Manhattan and QQ plots and the LDSC intercepts.
The genetic associations of the nine DNEs parallel their phenotypic associations
We performed a phenome-wide look-up analysis to understand the phenotypic associations in the literature for the identified genomic loci in our GWAS.
In total, 2,525 clinical traits were associated with genetic variants in our GWAS, including traits linked to multiple organ systems, cognition and lifestyle factors (Fig. 3b and Supplementary File 14). The genomic loci were largely associated with clinical traits of the brain (53%), musculoskeletal (17%), immune system (6%), neurodegenerative (1%) and neuropsychiatric (1%) diseases. For example, AD2 genomic loci were largely associated with traits related to the brain (565 out of 781, for example, brain IDPs), musculoskeletal (133/781, for example, standing height), immune (17/781, for example, reticulocyte count), cognition (15/781, for example, cognitive performance), lifestyle factors (13/781, for example, smoking) and neurodegenerative traits (1/781, that is, neurofibrillary tangles).
The findings closely align with the phenotypic associations observed in Fig. 2c, reinforcing that the DNEs share genetic determinants linked to organs beyond the brain, lifestyle factors and cognition.
The genetic correlation of the nine DNEs
To understand the shared genetic underpinnings, we estimated the genetic correlation36 (gc) between the nine DNEs, the BAG of nine human organ systems17 and six brain diseases (AD, attention-deficit/hyperactivity disorder, ASD, bipolar disorder (BIP), obsessive–compulsive disorder (OCD), SCZ; Supplementary Table 3a) from PGC and four lifestyle factors and cognitive scores (Supplementary Table 3b).
We first estimated the gc between each pair of DNEs (Fig. 4a). Numerous DNEs showed strong genetic correlations among each other. The highest positive genetic correlation was obtained between ASD2 and SCZ1 (gc = 0.57 ± 0.04); the highest negative genetic correlations were obtained between ASD2 and ASD1 (gc = −0.55 ± 0.04) and between ASD3 and SCZ1 (gc = −0.51 ± 0.05). We also observed a substantial alignment between the phenotypic correlation (pc) and the genetic correlation of pairwise DNEs, supporting the long-standing Cheverud’s conjecture47. However, we identified two exceptions where the observed phenotypic and genetic correlations showed opposite directions. ASD1 and ASD3 showed a negative phenotypic (pc = −0.39 ± 0.08) but a positive genetic correlation (gc = 0.21 ± 0.05); ASD1 and LLD1 showed a negative phenotypic (pc = −0.34 ± 0.09) but a positive genetic correlation (gc = 0.16 ± 0.07) (Supplementary Table 4). This implies that non-genetic factors, such as lifestyle and environmental factors, may exert opposite influences on the two DNEs. We performed additional MAGMA (v1.10) gene-set analysis48 to test the genetic similarity between ASD2 and SCZ1. The most significant biological pathway underlying ASD2 is the negative regulation of locomotion (GO 0040013, P = 2.27 × 10−5, β = 0.22 ± 0.02), implicated in biological processes that stop, prevent or reduce the frequency, rate or extent of locomotion of a cell or organism. The most significant biological pathway for SCZ1 is the negative regulation of neurotransmitter transport (GO 0051589, P = 1.41 × 10−5, β = 0.22 ± 0.02), involved in biological processes that downregulate the directed movement of a neurotransmitter into, out of or within a cell. In particular, the latter supports the involvement of dopamine and glutamate, two major neurotransmitters in the central nervous system, in SCZ49.
Fig. 4 |. The genetic correlation, colocalization, and causal networks of the nine DNEs.

a, The genetic correlation between two DNEs (gc, lower triangle) mirrors their phenotypic correlation (pc, upper triangle). Red-shadowed rectangles highlight two exceptions. The symbol * indicates significant results after the Benjamini–Hochberg correction. The symbol # indicates nominal significance. b, Genetic correlations between the nine DNEs and nine BAGs for nine human organ systems17. c, Genetic correlations between the nine DNEs and six neurodegenerative and neuropsychiatric disorders. The bar plots show the estimated mean genetic correlation along with its standard error. d, Genetic correlations between the nine DNEs and four traits related to lifestyle factors and cognition. The bar plots show the estimated mean genetic correlation along with its standard error. A two-sided P value is used to determine statistical significance. e, Genetic colocalization was evidenced at one locus (6p21.1) between ASD2 and SCZ1. The signed PP.H4.ABF (0.92) denotes the PP of hypothesis H4, which suggests that both traits share the same causal SNP (rs2790099). A positive PP indicates concordant β values for both DNEs, while a negative PP implies opposite β values. f, Genetic colocalization was evidenced at one locus (3p.22.1) between ASD2 and brain BAG: PP.H4.ABF = 0.95 with the cause SNP rs5848503. g, Genetic colocalization was evidenced at one locus (6p.22.1) between ASD3 and SCZ case–control GWAS44 from PGC (European ancestry): PP.H4.ABF = 0.82 with the cause SNP rs9257566. h, The causal network of the nine DNEs with the eight multi-organ BAGs. Solid arrow lines (from the exposure to the outcome variables) indicate significant causal relationships after the Benjamini–Hochberg correction; dotted arrow lines show nominal significance (P < 0.05). The symbols ‘+’ (OR > 1 and gc > 0) and ‘−’ (OR < 1 and gc < 0) represent a positive relationship between the two traits. i, the causal network of the 9 DNEs with the 11 chronic diseases (for example, AD, ADHD, BIP, and SCZ from PGC). WM, white matter; ADHD, attention-deficit/hyperactivity disorder; CD, Crohn’s disease; IBD, inflammatory bowel disease; PBC, primary biliary cirrhosis; T2D, type 2 diabetes; RA, rheumatoid arthritis. Credit: human body in b,h, NIAID NIH BIOART.
Between the nine DNEs and the BAGs across nine human organ systems, we found significant genetic correlations between AD1 (gc = 0.23 ± 0.05), ASD1 (gc = 0.44 ± 0.05), LLD2 (gc = 0.24 ± 0.07), SCZ1 (gc = 0.26 ± 0.06) and the brain BAG, and between ASD1 and the eye BAG (gc = 0.19 ± 0.07) (Fig. 4b and Supplementary Table 5).
Finally, we also found a marginally significant genetic correlation between AD2 and AD (gc = 0.22 ± 0.12), AD1 and BIP (gc = −0.08 ± 0.04) and ASD3 and BIP (gc = 0.09 ± 0.04) using GWAS summary statistics from PGC (Fig. 4c and Supplementary Table 6a). We observed a nominal genetic correlation signal between ASD1 and reaction time (gc = 0.35 ± 0.15; Fig. 4d and Supplementary Table 6b).
In summary, the nine DNEs show substantial genetic correlations among themselves and with organ systems beyond the brain. These findings highlight the interconnectedness of the neuroanatomical patterns and genetic determinants across multiple body systems and diseases, suggesting shared underlying disease mechanisms and potential pleiotropic effects.
The genetic colocalization of the nine DNEs
To seek the shared causal variants between two clinical traits (for example, AD1 versus LLD2), we performed approximate Bayes factor colocalization50 analyses between the nine DNEs (Fig. 4e), with the nine BAGs (Fig. 4f) and the six brain disorders from PGC46 (Fig. 4g).
Among the nine DNEs, we detected 44 causal variants (SNPs) showing significant colocalization signals. We showcased the shared causal variant (rs2790099 at 6p21.1) between ASD2 and SCZ2 with a PP.H4. ABF (approximate Bayes factor) = 0.92 (Fig. 4e), which examines the posterior probability (PP) to evaluate the hypothesis, the presence of a single shared causal variant associated with both traits within a specific genomic locus. This causal SNP was mapped to the RUNX2 gene. The loss of function in RUNX2 causes a rare autosomal dominant skeletal disorder—cleidocranial dysplasia51—but it was implicated in ASD or SCZ in previous literature.
Between the nine DNEs and nine BAGs, we identified 13 causal variants (SNPs) showing significant colocalization signals. We showcased the shared causal variant (rs5848503 at 3p22.1) between ASD2 and the brain BAG with a PP.H4.ABF = 0.95 (Fig. 4f). One mapped gene in this locus is the MOPB gene, which encodes the myelin-associated oligodendrocytes basic protein and is actively involved in the structural constituent of the myelin sheath and nervous system development. This gene was previously implicated in ASD using single-cell genomics52, SCZ53, amyotrophic lateral sclerosis and Parkinson’s disease54.
Between the nine DNEs and six brain diseases from PGC, we identified six causal variants (SNPs) showing significant colocalization signals. We showcased the shared causal variant (rs9257566 at 6p22.1) between ASD3 and SCZ with a PP.H4.ABF = 0.82 (Fig. 4g). In this locus, multiple olfactory receptor genes and the dysfunction of the olfactory system were implicated in SCZ55,56 and ASD57,58. For instance, the olfactory receptor genes OR2J2 and OR2J3 are two protein-coding genes in copy number variants associated with SCZ using microRNA data59. The causal SNP (rs9257566) was associated with SCZ and brain IDP, such as white matter microstructural measures (Supplementary Fig. 11). Besides those two conditions, AD has also been widely associated with early olfactory dysfunction60,61. The sensitivity checks on the prior probability (P12) for the three illustrations are shown in Supplementary Fig. 12a–c. The causal variant, cytogenetic region and their colocation signal direction (based on β coefficients) are presented in Supplementary Fig. 13a–c. Detailed results are shown in Supplementary Files 15–17.
The genetic colocalization of the nine DNEs revealed causal genetic variants, indicating that the same genomic regions may causally influence the expression of these AI-derived endophenotypes.
The causal relationship of the nine DNEs
We applied bidirectional two-sample Mendelian randomization analyses62 to depict a causal network between the 9 DNEs, the 8 BAGs (excluding the brain BAG) and 11 chronic diseases spanning the whole-body system.
For each pair of DNEs, as the GWAS populations completely overlapped, conducting two-sample Mendelian randomization was not feasible63. Alternatively, the split-sample GWAS did not yield sufficient statistical power due to the limited number of instrumental variables available (<6 SNPs).
Among the nine DNEs and eight BAGs, we found potential causal effects of the eye BAG on LLD1 (P = 4.57 × 10−3, odds ratio (OR) (95% confidence interval) = 1.14 (1.04, 1.24), number of SNPs = 16), the cardiovascular BAG on ASD3 (P = 6.04 × 10−3, OR (95% confidence interval) = 1.16 (1.04, 1.29), number of SNPs = 34) and the pulmonary BAG on LLD2 (P = 1.98 × 10−3, OR (95% confidence interval) = 1.14 (1.04, 1.25), number of SNPs = 49). No significant causal signals persisted after the Benjamini–Hochberg correction in the inverse analyses (Fig. 4h). Details of the results, including all five different Mendelian randomization estimators, are shown in Supplementary File 18.
Between the 9 DNEs and 11 chronic diseases, encompassing brain-related conditions and diseases affecting other organs, we identified a potential causal effect from AD2 to AD using the GWAS summary statistics from PGC—the largest sample size (N = 1,126,536 individuals) in the AD case–control study (P = 1.74 × 10−4, OR (95% confidence interval) = 1.25 (1.11, 1.40), number of SNPs = 7) (Fig. 4i). We did not detect a causal link in the opposite direction from AD to AD2. This aligns with the endophenotype hypothesis, suggesting that DNEs are situated within the causal pathway from genetics to external phenotypes, such as AD diagnosis and cognitive decline. Details of the results, including all five different Mendelian randomization estimators, are shown in Supplementary File 19. The results of the sensitivity check are presented in Supplementary Figs. 14–17.
The Mendelian randomization results further emphasize the potential benefits of overall organ health for brain-related conditions. This highlights the interconnectedness between various organ systems and the brain, underscoring the significance of a holistic health and disease prevention approach.
The nine DNEs predict 14 systemic diseases and mortality
We investigated the added prediction power of the 9 DNEs and their respective PRS for 14 systemic diseases based on the ICD-10 code (Supplementary Tables 7 and 8), 8 cognitive scores (Supplementary Table 9) and mortality outcomes (that is, the date of death). It is worth noting that we did not perform prediction on the four brain diseases (AD, ASD, LLD and SCZ) due to the small and highly imbalanced sample sizes in the UKBB general population. As anticipated, the prediction performance across all tasks was modest, considering that the DNEs were derived from specific disease populations.
In addition to commonly available features, such as age and sex, we found that AD1, ASD1, LLD1, SCZ1 and SCZ2 provided additional prediction power (that is, incremental R2) for many disease categories. Across the 14 disease categories, the DNEs showed higher incremental R2 in mental and behavioural disorders (ICD-10 code: F group) and diseases linked to the central nervous system (ICD-10 code: G group) than other disease categories, proving that the nine DNEs in the general population capture disease-related effects of the four brain diseases. Combining all nine DNEs further improved the incremental R2, especially in mental and behavioural disorders (R2 = 1.01%, P = 1.74 × 10−5) and diseases linked to the central nervous system (R2 = 0.63%, P = 1.33 × 10−5) (Fig. 5a). The incremental R2 values for all tests are shown in Supplementary Table 7. All metrics, encompassing sample sizes and incremental R2 for both the null and alternative models, are outlined in Supplementary File 20. Results using only the PRS target population are presented in Supplementary Fig. 18.
Fig. 5 |. Additional prediction power of the nine DNEs and PRSs for 14 systemic diseases, cognition, and mortality outcomes.

a, The incremental R2 values of the nine DNEs for predicting 14 systemic disease categories were assessed using the entire UKBB sample, with N = 39,178 participants as independent test data. The results focusing only on the PRS target population (N = 15,891) can be found in Supplementary Fig. 18. ‘ALL’ indicates the incremental R2 contributed by combining the nine DNEs. b, The incremental R2 of the PRS of the nine DNEs to predict 14 systemic diseases based on the ICD-10 code using only the PRS target sample. c, In the PRS target sample, disease classification accuracy from the independently hold-out test data (N = 5,581) was assessed using nested cross-validated SVMs in the training/validation/test data (N = 10,000) by fitting various sets of features (Cov indicates age and sex). d, In the PRS target sample, cognitive score prediction accuracy (Pearson’s r) from the independently hold-out test data (3,632 < N < 5,570) was assessed using nested cross-validated support vector regression models. e, The SCZ1, SCZ1-PRS, AD1-PRS and ASD1 show significant associations with the risk of mortality in the PRS target sample. Age and sex were included as covariates in the Cox proportional hazard model. f, The nine DNEs and PRSs were cumulatively included as features in cross-validation for mortality risk prediction. The symbol * indicates significant results that survived the Benjamini–Hochberg correction. The symbol # indicates nominal significance. All P values are two-sided. CNS, central nervous system; HR, hazard ratio; CI, concordance index; TMT, trail-making test. The box plots show the mean of the CIs, while the violin plots illustrate their distribution during cross-validation.
Compared to the nine DNEs, the nine PRSs provided smaller additional prediction power. For example, the PRS for ASD3 explained an additional 0.05% of the variance (incremental R2) in diseases associated with the blood and immune systems (P = 0.03), as well as neoplasms (P = 0.04). Combining all nine PRSs improves the incremental R2, particularly in mental and behavioural disorders (R2 = 0.3%, P = 0.047). No results survived multiple comparisons using the Benjamini–Hochberg method (Fig. 5b). Detailed results are shown in Supplementary Table 8 and File 21.
We assessed the prediction ability of support vector machines (SVM) at the individual level to classify the 14 disease categories. The highest performance was observed for eye diseases (ICD-codes: H0–5). Age and sex significantly enhanced the classification accuracy. Including age and sex alongside the 9 DNEs and 9 PRSs boosted the classification accuracy from 0.55 to 0.65. The inclusion of PRSs, DNEs and the combination of both, along with age and sex as features, resulted in improved classification accuracy for mental and behavioural disorders. For example, the accuracy increased from 0.51 to 0.55 and 0.57 for features of age and sex, 9 DNEs and 9 PRSs, incrementally (Fig. 5c). These findings highlight the added value of incorporating the nine DNEs and PRSs in predicting these disease categories. Detailed results are shown in Supplementary Table 9a. The full evaluation metrics of the cross-validated and independent test results are presented in Supplementary File 22.
We examined the predictive capability of these feature sets in estimating 8 cognitive scores. The digit symbol substitution test showed the highest performance, reaching a Pearson’s r of 0.46 (for example, P = 3.9 × 10−189 and mean absolute error = 3.75 for the feature set of 9 DNE plus covariates) in the independent test dataset using all different sets of features (Fig. 5d). The combination of DNEs and PRSs did not notably enhance prediction accuracy. However, contrary to the 14 systemic disease classifications, incorporating the 119 GM IDPs alongside DNEs and PRSs resulted in a 2% increase in the prediction accuracy for fluid intelligence, measured by Pearson’s r (Supplementary Table 9b). The full evaluation metrics of the cross-validated and independent test results are presented in Supplementary File 23.
Finally, we evaluated the prediction power of the nine DNEs and PRSs for mortality risk prediction using the Cox regression. Among these, SCZ1, SCZ1-PRS, ASD1 and AD1-PRS were significantly associated with the risk of mortality (Fig. 5e and Supplementary Table 10a). Adding SCZ1, AD1-PRS, ASD1 and SCZ1-PRS to age and sex further improved the prediction, but the performance decreased afterward (Fig. 5f). Lastly, incorporating the nine DNEs from the second scan of 1,348 participants into the model slightly increased the statistical significance and the hazard ratio (Supplementary Table 10b and c).
Discussion
This study investigated the manifestation of nine disease-related brain endophenotypes—derived from four case–control studies via semi-supervised AI methods—in the general population of 39,178 participants in the UKBB. We assessed commonalities and differences among the nine DNEs, their genetic correlates in the general population, their relationships with the multiple human organ systems and their predictive capacity for 14 systemic disease categories and mortality. Our findings demonstrate the potential of the nine AI-derived DNEs in identifying high-risk individuals within the general population prone to developing the four major brain disorders.
Shared neuroanatomical patterns and genetic determinants
Understanding the shared disease mechanism of neurodegenerative and neuropsychiatric diseases is a complex and ongoing challenge in medical research15,16,40,42–45,64. Our results suggest that shared underlying mechanisms and genetic factors may contribute to the development and progression of these disorders. This notion of shared disease mechanisms across the four major neurodegenerative and neuropsychiatric diseases, namely, ASD, SCZ, LLD and AD, has garnered considerable attention and reshaped our understanding of these conditions15,16.
Despite the inherent heterogeneity among neuroanatomical patterns observed in different brain diseases (Fig. 2a), a notable commonality exists regarding their manifestations, which might emanate from underlying mechanisms sharing neuropathologic characteristics and pathways. As an illustration, AD1, LLD2 and SCZ1 showed a negative correlation (brain atrophy) with widespread cortical volumes, such as the bilateral insula and middle frontal gyrus. This aligns with expectations, considering that the UKBB population includes individuals primarily from mid to late life (above 45 years). From an aetiological standpoint, various factors can contribute to the global cortical volume reduction within the general population, with late-onset neurodegenerative and neuropsychiatric disorders and ageing exerting a significant impact. Likewise, ASD2 and SCZ2 showed a positive association with the basal ganglia, including the globus pallidum. This could imply the existence of potential protective genetic or environmental factors that collectively contribute to the concept of ‘brain reserve’, which might mitigate volume loss in a portion of the general population. It is worth noting that we previously revealed that individuals predominantly expressing SCZ2 showed higher levels of education13 and higher remission rates than those primarily influenced by SCZ1. Alternatively, these volume increases might reflect neuropathologic mechanisms, such as disrupted connectivity, which are not necessarily associated with neurodegenerative and neurodevelopmental components related to relatively lower brain volumes. Ultimately, another significant aspect to consider involves the impact of medications prescribed by clinicians, affecting both disease-specific and general populations. Research indicates that antipsychotic medications, commonly prescribed for most individuals with SCZ and some with ASD and AD, have been shown to delay brain volume loss in the basal ganglia65 and pallidum66.
The commonalities in neuroanatomical patterns across brain diseases can be attributed to several factors. First, shared genetic factors may influence brain structure and function29,30, contributing to similar neuroanatomical alterations across different diseases. Genetically, this was largely evidenced by our genetic correlation (Fig. 4a) and colocalization results (Fig. 4e). For instance, ASD2 showed prominent positive genetic overlap with SCZ1. Historically, there has been a long-standing association between ASD and SCZ, leading to the notion that autism could be a form of ‘childhood schizophrenia’67. This conceptual link between the two conditions has been debated and discussed. Previous case–control neuroimaging studies demonstrated divergent structural and functional brain patterns in individuals with ASD compared to those with SCZ68,69, largely ignoring the neuroanatomical heterogeneity within each condition. Genetic variants that impact key signalling pathways, synaptic function and neuronal connectivity70,71 could influence multiple disease phenotypes, leading to overlapping neuroanatomical patterns.
AD1 also shared genetic similarities with LLD1, both characterized by spatially extensive brain atrophy and increased brain age. Previous studies found a higher prevalence of depressive symptoms and LLD in individuals with AD compared to the general population72–74. The relationship between AD and LLD likely involves multiple factors and may be bidirectional. On the one hand, LLD may increase the risk of developing AD or accelerate the progression of cognitive decline in individuals already affected by AD. On the other hand, AD-related changes in the brain, such as neuroinflammation and neurochemical imbalances, may worsen depressive symptoms in individuals with LLD.
Recognizing the shared disease mechanisms across the four brain diseases underscores the importance of a broader perspective on clinical presentations and underlying biological mechanisms. This understanding is important for developing targeted and personalized approaches to patient care, leading to more effective treatments and interventions. Future research may explore disease heterogeneity to transcend traditional classification boundaries and identify data-driven subtypes or dimensions within a transdiagnostic framework75.
Beyond the brain
Our findings strongly concur with a paradigm shift in treating brain diseases. While the conventional approach has predominantly concentrated on interventions targeting the brain, emerging evidence highlights the critical importance of considering the broader systemic and environmental factors that influence disease onset and progression.
Unravelling the intricate interconnections between the brain and other organ systems is crucial in broadening our understanding of brain diseases, as demonstrated by our findings and those of others. The brain does not function in isolation but interacts with and is influenced by various physiological systems throughout the body. Our results showed a close genetic association and causality between the DNEs and the eye, cardiovascular and pulmonary systems (Fig. 4b,f,h). These findings parallel previous literature. For instance, eye-related pathological changes have been revealed to mirror early signs of neurological and neuropsychiatric conditions76. The nervous and cardiovascular systems—the heart–brain axis—are intricately linked, with brain regions controlling heart function via sympathetic and parasympathetic pathways77. Dysfunctions in one system can affect the other’s function, resulting in brain and cardiovascular diseases. The immune system plays a crucial role in modulating inflammation and neuroinflammation, which are implicated in many brain disorders, such as AD78, SCZ79 and depression80. Similarly, the gut–brain axis highlights the bidirectional communication between the gut microbiota and the brain, with emerging evidence linking alterations in the gut microbiome to brain diseases such as Parkinson’s disease81 and depression82. Understanding and targeting these systemic interactions can modulate disease processes and improve treatment outcomes.
Furthermore, considering environmental and lifestyle factors is essential in treating brain diseases83. Our previous work17 has shown that the BAGs of nine human organ systems comply with Cheverud’s conjecture: the phenotypic correlation of two BAGs mirrors their genetic correlations. However, herein we showed that the phenotypic correlation between two DNEs (for example, ASD1 versus LLD1) did not reflect their underlying genetic correlation (Fig. 4a), indicating potentially strong environmental and lifestyle factors that exert opposite effects on the two DNEs. Furthermore, an interesting observation from our study was that the heritability estimate (h2) of early-onset diseases, such as ASD, was higher than that of late-onset diseases, such as LLD, within the three neuropsychiatric disorders. This finding suggests that genetic factors play a more prominent role in developing ASD at a younger age. By contrast, other factors, such as environmental influences, socioeconomic factors and lifestyle choices, may have a stronger impact on developing LLD later in life. These differential heritability patterns shed light on the complex interplay between genetic and non-genetic factors in the underlying disease mechanism of neurodegenerative and neuropsychiatric disorders across different stages of life. These heritability patterns aligned with a previous study that examined multiple GWAS drawn from more than 200,000 patients for 25 brain-associated disorders and 17 phenotypes16.
In conclusion, going beyond the brain is crucial for understanding and treating brain diseases. By considering the connections between the brain and other organ systems, understanding the impact of environmental and lifestyle factors and harnessing the power of advanced AI technologies, we can develop more effective and personalized approaches to prevent, diagnose and treat brain diseases.
AI-derived DNEs for precision diagnostics
The present study leverages cutting-edge, semi-supervised AI methods23 and open science advancements to enhance our understanding of disease heterogeneity in neurodegenerative and neuropsychiatric disorders. In this context, implementing these AI-derived DNEs at early disease or preclinical stages—in the general population—may facilitate the identification of individuals at risk and the initiation of proactive interventions before the onset of noticeable symptoms, likely leading to more effective treatments and interventions and better outcomes.
The proposed AI-derived DNEs capture intricate brain structure and function variations, often subtle and spatially complex, which traditional diagnostic methods and case–control studies may overlook. By quantifying the neuroanatomical patterns associated with specific brain disorders, DNEs may offer a personalized disease vulnerability assessment, inform interventions at early preclinical stages and potentially prevent or delay the onset of symptoms. At the individual level, integrating genetic information (that is, PRSs) with DNEs significantly improves prediction performance for 14 systemic diseases and mortality outcomes (Fig. 5). In addition, our Mendelian randomization analyses supported the well-established endophenotype hypothesis in genetic psychiatry24—endophenotype in psychiatric disorders resides inside the causal pathway from underlying genetics to their exo-phenotypes (that is, the disease syndrome itself), thereby being closer to its disease mechanisms. We found that AD2, characterized by focal medial temporal lobe atrophy, exerted a causal relationship with AD. However, we did not find evidence of a reverse causal relationship, suggesting that the underlying genetics may influence the development of AD through the DNE, although it may not be the exclusive pathway contributing to the disease. This highlights the role of genetics in influencing the disease process, particularly through the identified DNE, shedding light on potential pathways and mechanisms involved in AD development.
Limitations
The present study has several limitations. First, the genetic analysis focused exclusively on common genetic variants. Future investigations should explore the contribution of rare variants to these brain diseases. Secondly, it is important to recognize that our GWAS analyses predominantly involved participants of European ancestry, limiting the generalizability of the genetic findings to other populations with different ancestral backgrounds. Further research efforts are necessary to collect more diverse genetic data and include underrepresented racial and ethnic groups to enhance the generalizability of the findings. In addition, the validation of the nine DNEs would benefit from additional longitudinal analyses. Fortunately, ongoing efforts to collect longitudinal brain MRI data in UKBB84 hold promise for providing valuable insights to the scientific community and advancing the field of precision medicine. Furthermore, more advanced methodological developments will be developed and integrated within our semi-supervised clustering learning framework to alleviate potential domain shifts85. Finally, our future research will incorporate multimodal imaging to generate DNEs sensitive to a broader range of brain alterations, including macrostructural volumetric changes, functional disruptions and structural network abnormalities.
Outlook
Together, our AI-derived DNEs have emerged as novel instruments for precision medicine. By capturing the complexity and heterogeneity of brain disorders, DNEs provide a better understanding of disease pathology, facilitate personalized risk assessment and hold promise for targeted interventions and population selection.
Methods
Study populations
Our previous studies used semi-supervised AI models to define the nine DNEs from four disease case–control populations. These populations consisted of 865 healthy controls, 1,096 individuals with mild cognitive impairment, and 414 AD patients from ADNI21 for AD1 and AD2; 362 typically developing controls and 307 patients with ASD from ABIDE22 for ASD1–3; 495 healthy controls and 501 LLD patients from the LLD study2 for LLD1–2; and 364 healthy controls and 307 SCZ patients from PHENOM13 for SCZ1–2. For more detailed information about the characteristics of the study populations, please refer to the original papers.
The trained AI models were then applied to the UKBB general population as independent data. UKBB is a population-based study of approximately 500,000 people recruited from the United Kingdom between 2006 and 2010. The UKBB study has ethical approval, and the ethics committee is detailed at https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/governance/ethics-advisory-committee. The protocols of this study were approved by the University of Pennsylvania institutional review board. The current study analysed 39,178 multimodal brain MRI data from UKBB. T1-weighted MRI data were locally processed at the University of Pennsylvania; IDP from diffusion and resting-state functional MRI were downloaded from UKBB. In addition, we processed the imputed genotype data27 from UKBB for GWAS analyses. Last, other clinical traits were also analysed, including phenotypes related to nine human organ systems. The current work was performed under application numbers 35148 and 60698. We detail our dataset splits in Supplementary Text 4.
Semi-supervised AI methods to derive the nine DNEs
The methodologies used in the current study to derive the nine DNEs belong to the semi-supervised learning algorithms (Fig. 1a) pioneered by our group. Refer to a review for details of this type of modelling. In particular, the current study used the Heterogeneity through Discriminative Analysis (HYDRA86) and Semi-SUpeRvised ReprEsentAtion Learning via Generative Adversarial Network (Surreal-GAN14) models.
Surreal-GAN.
Surreal-GAN14 dissects underlying disease-related heterogeneity via a deep representation learning approach, instead of the discriminative SVM, under the principle of semi-supervised clustering—the ‘1-to-k’ mapping. The methodological advance of this method is that Surreal-GAN models disease heterogeneity as a continuous dimensional representation, enforces monotone disease severity in each dimension and allows the non-exclusive manifestation of all dimensions in the same participant (Supplementary Method 1a).
HYDRA.
HYDRA leverages a widely used discriminative method, that is, SVM, to seek the ‘1-to-k’ mapping. The novelty is that HYDRA extends multiple linear SVMs to the nonlinear case piecewise, thereby simultaneously serving for classification and clustering. Specifically, it constructs a convex polytope by combining the hyperplane from k linear SVMs, separating the healthy control group from the k subpopulation of the patient group. Intuitively, each face of the convex polytope can be regarded to encode each subtype, capturing a distinct disease effect (Supplementary Method 1b). Supplementary Text 5 details the original image patterns of the diseased population in our previous study.
Imaging analyses
T1-weighted MRI processing.
All images were first corrected for magnetic field intensity inhomogeneity87. A deep learning-based skull stripping algorithm was applied to remove extra-cranial material. In total, 145 IDPs were generated in GM (119 ROIs), white matter (20 ROIs) and ventricles (6 ROIs) using a multi-atlas label fusion method from the MUlti-atlas region Segmentation utilizing Ensembles (MUSE) atlas88. The ROIs were fit to the four machine learning models to derive the nine DNEs. The imaging quality check is detailed in Supplementary Method 2.
Neuroanatomical pattern of the nine DNEs.
We assessed the neuroanatomical patterns shown by the nine DNEs within the general population. As the DNEs were defined based on the 119 GM ROIs obtained from T1-weighted MRI scans, we aimed to test whether these patterns observed in the disease populations were manifested in the general population. To this end, we used a linear regression model in which each DNE was treated as the dependent variable, while the ROI, age, age-squared, sex, age × sex interaction, age-squared × sex interaction, intracranial volume, brain positions in the scanner (lateral, transverse and longitudinal; field ID, 25756–25758) and head motion were considered independent variables and covariates.
PWAS for the nine DNEs.
We performed PWAS to associate the nine DNEs to each of the 611 additional phenotypes. PWAS excluded the 119 GM ROIs used to derive the nine DNEs to prevent any potential circular effects. Instead, the analysis incorporated IDPs from other modalities, such as diffusion and resting-state functional MRI. The same linear regression models and multiple comparison corrections were used.
To check the robustness of our PWAS results, we also performed two sensitivity checks: (1) sex-stratified PWAS for males and females and (2) split-sample PWAS by randomly dividing the entire population into two splits (sex and age did not significantly differ).
Genetic analyses
We used the imputed genotype data for all genetic analyses, and our quality check pipeline resulted in 31,929 participants with European ancestry and 6,477,810 SNPs. First, we excluded 36,394 related individuals (up to second degree) from the full UKBB sample (N ≈ 500,000) using the KING software (v2.2.7) for family relationship inference89. We then removed duplicated variants from all 22 autosomal chromosomes. Individuals whose genetically identified sex did not match their self-acknowledged sex were removed. Other excluding criteria were as follows: (1) individuals with more than 3% of missing genotypes; (2) variants with minor allele frequency of less than 1%; (3) variants with larger than 3% missing genotyping rate; (4) variants that failed the Hardy–Weinberg test at 1 × 10−10. The Hardy–Weinberg test removes the SNPs that deviate from the expected Hardy–Weinberg equilibrium (HWE) frequency, ensures the integrity of the genetic data, reduces false-positive findings and improves the overall robustness of the GWAS results. To adjust for population stratification90, we derived the first 40 genetic principal components using the FlashPCA (v2.0) software91. Details of the genetic quality check protocol are described elsewhere.
GWAS.
For GWAS, we ran a linear regression using PLINK92 for each DNE, controlling for confounders of age, age-squared, sex, age × sex interaction, age-squared × sex interaction, the first 40 genetic principal components, total intracranial volume, three brain position parameters in the scanner and head motion, which were included as suggested by a previous study29. We adopted the genome-wide P value threshold (5 × 10−8) and annotated independent genetic signals considering linkage disequilibrium (see ‘Annotation of genomic loci’).
To check the robustness of our GWAS results, we also performed several sensitivity checks: (1) fastGWA38,39 for a generalized mix effect model, (2) sex-stratified GWAS for males and females, (3) split-sample GWAS by randomly dividing the entire population into two splits (sex and age did not significantly differ), (4) comparison of the GWAS results using the 1,348 participants (that is, 1,116 European ancestry) that were collected for baseline and longitudinal scans from UKBB, (5) non-European GWAS (N = 4,783), (6) independent GWAS on ADNI whole-genome sequencing data (N = 1,555) on AD1 and AD2, and (7) concordance with six European ancestry GWAS from PGC, including AD, attention-deficit/hyperactivity disorder, ASD, BIP, OCD and SCZ (Supplementary Table 3a).
SNP-based heritability.
We estimated the SNP-based heritability (h2) using GCTA37 with the same covariates as in GWAS. GCTA estimates the SNP-based heritability using a method called restricted maximum likelihood to quantify the proportion of phenotypic variance in a trait that the additive effects of all common SNPs can explain, which was claimed to address the ‘missing heritability’. The main steps involved in GCTA include constructing the genetic relationship matrix, modelling phenotypic variance and using restricted maximum likelihood to estimate the h2.
Annotation of genomic loci.
The annotation of genomic loci and mapped genes was performed via FUMA93. For the annotation of genomic loci, FUMA first defined lead SNPs (correlation r2 ≤ 0.1, distance < 250 kilobases) and assigned them to a genomic locus (non-overlapping); the lead SNP with the lowest P value (that is, the top lead SNP) was used to represent the genomic locus. For gene mappings, three different strategies were considered. First, positional mapping assigns the SNP to its physically nearby genes (a 10 kb window by default). Second, eQTL mapping annotates SNPs to genes based on eQTL associations using the GTEx v8 data94. Finally, chromatin interaction mapping annotates SNPs to genes when there is a significant chromatin interaction between the disease-associated regions and nearby or distant genes93. The definition of top lead SNP, lead SNP, independent significant SNP and candidate SNP can be found in Supplementary Method 3.
To determine whether a genomic locus is new (that is, newly discovered in our study), we used FUMA to query the candidate SNPs, independent significant SNPs and top lead SNP in the European Molecular Biology Laboratory’s European Bioinformatics Institute (EMBL-EBI) GWAS catalogue, taking linkage disequilibrium into account. For these new loci, we further manually queried the non-significant SNPs in linkage disequilibrium (r2 > 0.1) with these SNPs directly on the GWAS catalogue platform on 29 October 2024. A locus is considered new if no previous GWAS has identified associations with any of these SNPs. It is worth noting that the current definition of new loci is not completely exhaustive and objective, as new GWAS data are being collated into the GWAS catalogue, while other public data platforms (such as the GWAS catalogue and the Integrative Epidemiology Unit (IEU) Open GWAS Project) are not considered.
Phenome-wide association queries for the identified loci in the GWAS catalogue.
We queried the candidate and significant independent SNPs, considering linkage disequilibrium (Supplementary Method 3), within each locus in the EMBL-EBI GWAS catalogue (query date, 2 June 2023 and also via FUMA version v1.5.4) to determine their previously identified associations with any other traits. For these associated traits, we further mapped them into several high-level categories for visualization purposes. We categorized these traits based on their primary affected organ systems, with examples including blood pressure under cardiovascular, brain MRI-derived measures under brain and AD under neurodegenerative disorders. Other platforms, such as the GWAS Atlas (https://atlas.ctglab.nl/PheWAS), used a similar categorization approach; we followed a similar approach to curating the high-level categories for the trait-category mapping.
Genetic correlation.
We used the LDSC36 software (git version aa33296) to estimate the pairwise genetic correlation (gc) between each pair of DNEs, with 9 BAGs of multiple organ systems from our previous work17, 6 neurodegenerative and neuropsychiatric disorders from PGC and 4 lifestyle and cognitive traits (Supplementary Table 3a). We used the precomputed linkage disequilibrium scores from the 1000 Genomes of European ancestry. To ensure the suitability of the GWAS summary statistics, we first checked that the selected study’s population was European ancestry; we then guaranteed a moderate SNP-based heritability h2 estimate. Notably, LDSC corrects for sample overlap and provides an unbiased estimate of genetic correlation95. Benjamini–Hochberg procedure was performed to account for multiple comparisons.
Bayesian colocalization.
We used the R package (coloc; v5) to investigate the genetic colocalization signals between two traits at each genomic locus. We used the fully Bayesian colocalization analysis using Bayes factors (coloc.abf). This method examines the PP (PP.H4.ABF: approximate Bayes factor) to evaluate hypothesis H4, which suggests the presence of a single shared causal variant associated with both traits within a specific genomic locus. To determine the significance of the H4 hypothesis, we set a threshold of PP.H4.ABF > 0.8 (ref. 50). All other parameters (for example, the prior probability of P12) were set as default. We also performed relevant sensitivity analyses to check the robustness of our findings. For each pair of traits, the genomic locus (N > 100 SNPs) was defined by default from FUMA on the nine DNEs, and then the coloc package extracted and harmonized the GWAS summary statistics within this locus for the other trait.
Two-sample bidirectional Mendelian randomization.
We did not perform causal inference between each pair of DNEs due to the overlapped populations and low sample sizes in split-sample analyses.
We used a bidirectional, two-sample Mendelian randomization using the TwoSampleMR package62 (v0.5.6) to infer the causal relationships between the nine DNEs and the eight BAGs across nine human organ systems17 (excluding the brain). The forward and inverse Mendelian randomization was performed between each trait pair by switching the exposure and outcome variables. We applied five different Mendelian randomization methods and reported the results of inverse variance weighting in the main text and the four others (that is, Egger, weighted median, simple mode and weighted mode estimators) in Supplementary Files 18 and 19.
We then performed Mendelian randomization between the 9 DNEs and 11 chronic diseases spanning the whole-body system. These diseases include four diseases from PGC (the ASD and OCD GWAS summary statistics did not provide the allele frequency information) and seven diseases unbiasedly curated in our previous work17, which detailed a systematic procedure to choose the appropriate traits. After harmonizing their GWAS summary statistics (using the function harmonise_data from 2SampleMR), this resulted in 11 clinical traits with at least six valid instrumental variables (that is, SNPs). The clinical traits included in our Mendelian randomization are presented in Supplementary Table 11. Benjamini–Hochberg correction was performed for all tested traits.
We performed several sensitivity analyses. First, a heterogeneity test was performed to check for violating the instrumental variable assumptions. Horizontal pleiotropy was estimated to navigate the violation of the instrumental variable’s exclusivity assumption96 using a funnel plot, single-SNP Mendelian randomization approaches and Mendelian randomization Egger estimator97. Moreover, the leave-one-out analysis excluded one instrument (SNP) at a time and assessed the sensitivity of the results to individual SNPs.
PRS calculation for the nine DNEs.
We calculated the PRS using the GWAS results from the split-sample analyses. The weights of the PRS were defined based on split1 data (base data), and the split2 GWAS summary statistics were used as the target data for PRS calculation. The quality control steps for the base data are as follows: (1) removal of duplicated and ambiguous SNPs for the base data, (2) clumping the base GWAS data, (3) pruning to remove highly correlated SNPs in the target data, (4) removal of high heterozygosity samples in the target data and (5) removal of duplicated, mismatching and ambiguous SNPs in the target data. After rigorous quality control, we used PLINK to generate PRS for the split2 population by adopting the classic C + T method (clumping + thresholding). To determine the ‘best-fit’ PRS, we performed a linear regression using the PRS calculated at different P-value thresholds (0.001, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5), controlling for age, sex, intracellular volume and the first 40 genetic principal components. For each DNE-PRS, we chose the P-value threshold with the highest incremental R2 (Supplementary Fig. 19). The comparison between PLINK and PRS-CS98 is presented in Supplementary Text 6 and Supplementary Figures 20–22.
Disease, cognition and mortality outcome prediction
We used logistic regression to calculate the incremental R-squared (R2) statistics of the nine DNEs and PRSs to predict 14 disease categories (a), SVMs to classify the healthy control participants from the disease groups (b) and Cox proportional hazard model to predict mortality outcomes (c). The patients for the 14 disease categories were defined based on the ICD-10 code from the UKBB website (data field, 41270). The healthy control group included participants without any ICD-10-based disease diagnoses. The cognitive function data included 8 cognitive scores (category, 100026), which are detailed in Supplementary File 23. The mortality outcome refers to the date of death at https://biobank.ndph.ox.ac.uk/ukb/field.cgi?id=40000.
Pseudo R2 statistics of the logistic regression.
We built a null model by including age, sex and intracranial volume as predictors and the disease as the outcome variable. The alternative model took the disease-specific DNE or PRS as one additional predictor. The incremental R2 was calculated as the difference between the pseudo R2 of the alternative model and that of the null model, implemented by the PseudoR2 function from the DescTools R package (v 0.99.38). For the nine PRSs, we used the PRS target sample (N = 15,891). For the nine DNEs, we calculated the incremental R2 using the entire UKBB sample (N = 39,178) and the PRS target sample (N = 15,891). We reported the incremental R2 of the DNE/PRS in the main manuscript (Fig. 5a,b). Other metrics, including R2 of the null and the alternative model, sample sizes and the β values of other covariates, are presented in Supplementary Files 20 and 21.
SVMs to classify patients versus controls.
Using the PRS target sample (N = 15,891), we used 10,000 participants in a nested cross-validation procedure (that is, cross-validation training/validation/test datasets) to select the hyperparameter C in SVM. In addition, we held out 5,581 participants as an independent test dataset. In Fig. 5c, we only presented the classification accuracy from the independent test dataset. The nested cross-validation procedure99 involved an outer loop repeated 50 times, where 80% of the data were randomly selected for training/validation and 20% for testing. Within each outer loop iteration, an inner loop used 80% of the training/validation data for a 10-fold training/validation split. Critically, the model trained on the training/validation/test datasets generalized to the independent test dataset (Fig. 5c). Supplementary File 22 contains various metrics, including balanced accuracy, sensitivity, specificity, negative predictive value, positive predictive value and sample sizes for the independent dataset (Ind. accuracy) and the cross-validation training/validation/test datasets (cross-validation accuracy). We did not statistically compare the performance between different machine learning models as this is a complex matter without a universal solution. A standard t-test on cross-validation results is too liberal and should not be applied, as shown by Nadeau and Bengio100. The corrected resampled t-test100 was proposed, but it depends on the data and the cross-validation set-up.
Support vector regression to predict cognitive scores.
Using the PRS target sample (N = 15,891), we used 10,000 participants in a nested cross-validation procedure (that is, cross-validation training/validation/test datasets) to select the hyperparameter C in SVM. In addition, we held out 5,581 participants (some had missing values depending on specific cognitive scores) as an independent test dataset (Ind. r). In Fig. 5d, we only presented Pearson’s r from the independent test dataset. The same nested cross-validation procedure was used, as mentioned above. Likewise, Supplementary File 23 encompasses a range of metrics, including mean absolute error, P values and sample sizes, pertaining to the independent dataset (Ind. r) and the cross-validation training/validation/test datasets (cross-validation r).
Cox proportional hazard model to predict the date of death.
To evaluate the predictive capacity of individual DNE and PRS for mortality risk, we used a Cox proportional hazard model while adjusting for covariates such as age and sex. The hazard ratio was calculated and reported as the effect size measure that indicates the influence of each DNE or PRS on mortality risk. Furthermore, we incrementally added the most predictive DNE or PRS to the Cox model to determine when the model’s performance reached saturation. The concordance index was used to assess the model’s performance using a 5-fold cross-validation procedure. All survival analyses were conducted using the lifelines (v0.25.7) Python package.
Supplementary Material
Supplementary information The online version contains supplementary material available at https://doi.org/10.1038/s41551-025-01412-w.
Acknowledgements
We express our sincere gratitude to the UK Biobank team (https://www.ukbiobank.ac.uk/) for their invaluable contribution to advancing clinical research in our field. We thank the PGC (https://pgc.unc.edu/) for generously sharing the GWAS summary statistics with the scientific community. This study used the UK Biobank resource under application numbers 35148 (C.D.) and 60698 (A.Z.). J.W. leads the MULTI consortium under UK Biobank Application number 647044; J.W. and the Laboratory of AI and Biomedical Science (LABS) are supported by the start-up funding from Columbia University. We also gratefully acknowledge the support of the imaging-based SysTem for AGing and NeurodeGenerative diseases (iSTAGING) consortium, funded by the National Institute on Aging through grant RF1 AG054409 at the University of Pennsylvania (C.D.). In addition, we acknowledge the funding program from the Rebecca L. Cooper Foundation at the University of Melbourne (A.Z.). Y.E.T. is supported by a National Health and Medical Research Council Investigator Grant (APP2026413).
Footnotes
Competing interests
The authors declare no competing interests.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
The GWAS summary statistics and pre-trained AI models from this study are publicly accessible via the MEDICINE Knowledge Portal (https://labs-laboratory.com/medicine/) and Synapse (https://www.synapse.org/Synapse:syn64923248/wiki/630992)101. Genomic loci annotation used data from FUMA (https://fuma.ctglab.nl/). UKBB data can be requested at https://www.ukbiobank.ac.uk/. GWAS summary data from the PGC can be accessed at https://pgc.unc.edu/. MUSE atlas is generated via the pipeline at https://github.com/CBICA/MUSE. The raw data for the main figures and Supplementary Files 1–23 are publicly available via Zenodo at https://zenodo.org/records/15238099 (ref. 102).
Code availability
The software and resources used in this study are all publicly available via MLNI103 (HYDRA) at https://github.com/anbai106/mlni (DNEs for ASD1–3, LLD1–2, SCZ1–2) and Surreal-GAN104 at https://github.com/zhijian-yang/SurrealGAN (DNEs for AD1–2).
References
- 1.Hwang G et al. Assessment of neuroanatomical endophenotypes of autism spectrum disorder and association with characteristics of individuals with schizophrenia and the general population. JAMA Psychiatry 80, 498–507 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wen J et al. Characterizing heterogeneity in neuroimaging, cognition, clinical symptoms, and genetics among patients with late-life depression. JAMA Psychiatry 79, 464–474 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Young AL et al. Uncovering the heterogeneity and temporal complexity of neurodegenerative diseases with Subtype and Stage Inference. Nat. Commun. 9, 4273 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Yang Z et al. A deep learning framework identifies dimensional representations of Alzheimer’s disease from brain structure. Nat. Commun. 12, 7065 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhang X et al. Bayesian model reveals latent atrophy factors with dissociable cognitive trajectories in Alzheimer’s disease. Proc. Natl Acad. Sci. USA 113, E6535–E6544 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Vogel JW et al. Four distinct trajectories of tau deposition identified in Alzheimer’s disease. Nat. Med. 27, 871–881 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wen J et al. Multi-scale semi-supervised clustering of brain images: deriving disease subtypes. Med. Image Anal. 75, 102304 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ferreira D, Nordberg A & Westman E Biological subtypes of Alzheimer disease: a systematic review and meta-analysis. Neurology 94, 436–448 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hodson R Precision medicine. Nature 537, S49–S49 (2016). [DOI] [PubMed] [Google Scholar]
- 10.Davatzikos C et al. Precision diagnostics based on machine learning-derived imaging signatures. Magn. Reson. Imaging 64, 49–61 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Leonenko G et al. Identifying individuals with high risk of Alzheimer’s disease using polygenic risk scores. Nat. Commun. 12, 4506 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wen J et al. Genetic and clinical correlates of two neuroanatomical AI dimensions in the Alzheimer’s disease continuum. Transl. Psychiatry 14, 1–14 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chand GB et al. Two distinct neuroanatomical subtypes of schizophrenia revealed using machine learning. Brain 143, 1027–1038 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Yang Z, Wen J & Davatzikos C Surreal-GAN: semi-supervised representation learning via GAN for uncovering heterogeneous disease-related imaging patterns. International Conference on Learning Representations (2021). [Google Scholar]
- 15.Wingo TS et al. Shared mechanisms across the major psychiatric and neurodegenerative diseases. Nat. Commun. 13, 4314 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Anttila V Analysis of shared heritability in common disorders of the brain. Science 360, eaap8757 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wen J et al. The genetic architecture of biological age in nine human organ systems. Nat. Aging 4, 1290–1307 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wen J Multiorgan biological age shows that no organ system is an island. Nat. Aging 4, 1182–1183 (2024). [DOI] [PubMed] [Google Scholar]
- 19.Wen J et al. The genetic architecture of multimodal human brain age. Nat. Commun. 15, 2604 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Boquet-Pujadas A et al. MUTATE: a human genetic atlas of multiorgan artificial intelligence endophenotypes using genome-wide association summary statistics. Brief. Bioinform. 26, bbaf125 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Petersen RC et al. Alzheimer’s Disease Neuroimaging Initiative (ADNI): clinical characterization. Neurology 74, 201–209 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Di Martino A et al. Enhancing studies of the connectome in autism using the autism brain imaging data exchange II. Sci. Data 4, 170010 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Wen J et al. Subtyping brain diseases from imaging data. In Machine Learning for Brain Disorders (ed. Colliot O.) 491–510 (Springer, 2023); 10.1007/978-1-0716-3195-9_16 [DOI] [Google Scholar]
- 24.Kendler K & Neale M Endophenotype: a conceptual analysis. Mol. Psychiatry 15, 789–797 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Cannon TD & Keller MC Endophenotypes in the genetic analyses of mental disorders. Annu. Rev. Clin. Psychol. 2, 267–290 (2006). [DOI] [PubMed] [Google Scholar]
- 26.Gottesman II & Gould TD The endophenotype concept in psychiatry: etymology and strategic intentions. Am. J. Psychiatry 160, 636–645 (2003). [DOI] [PubMed] [Google Scholar]
- 27.Bycroft C et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Alfaro-Almagro F et al. Image processing and quality control for the first 10,000 brain imaging datasets from UK Biobank. Neuroimage 166, 400–424 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Elliott LT et al. Genome-wide association studies of brain imaging phenotypes in UK Biobank. Nature 562, 210–216 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Wen J et al. Genomic loci influence patterns of structural covariance in the human brain. Proc. Natl Acad. Sci. USA 120, e2300842120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Guan H & Liu M Domain adaptation for medical image analysis: a survey. IEEE Trans. Biomed. Eng. 69, 1173–1185 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Zhang H, Schneider T, Wheeler-Kingshott CA & Alexander DC NODDI: practical in vivo neurite orientation dispersion and density imaging of the human brain. NeuroImage 61, 1000–1016 (2012). [DOI] [PubMed] [Google Scholar]
- 33.Joseph C, Wang L, Wu R, Manning KJ & Steffens DC Structural brain changes and neuroticism in late-life depression: a neural basis for depression subtypes. Int. Psychogeriatr. 33, 515–520 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Tian YE et al. Heterogeneous aging across multiple organ systems and prediction of chronic disease and mortality. Nat. Med. 29, 1221–1231 (2023). [DOI] [PubMed] [Google Scholar]
- 35.McCracken C et al. Multi-organ imaging demonstrates the heart-brain-liver axis in UK Biobank participants. Nat. Commun. 13, 7839 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Bulik-Sullivan BK et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Genet. 47, 291–295 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Yang J, Lee SH, Goddard ME & Visscher PM GCTA: a tool for genome-wide complex trait analysis. Am. J. Hum. Genet 88, 76–82 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Jiang L, Zheng Z, Fang H & Yang J A generalized linear mixed model association tool for biobank-scale data. Nat. Genet. 53, 1616–1621 (2021). [DOI] [PubMed] [Google Scholar]
- 39.Jiang L et al. A resource-efficient tool for mixed model association analysis of large-scale data. Nat. Genet. 51, 1749–1755 (2019). [DOI] [PubMed] [Google Scholar]
- 40.Wightman DP et al. A genome-wide association study with 1,126,563 individuals identifies new risk loci for Alzheimer’s disease. Nat. Genet. 53, 1276–1282 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Demontis D et al. Genome-wide analyses of ADHD identify 27 risk loci, refine the genetic architecture and implicate several cognitive domains. Nat. Genet. 55, 198–208 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Grove J et al. Identification of common genetic risk variants for autism spectrum disorder. Nat. Genet. 51, 431–444 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Mullins N et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat. Genet. 53, 817–829 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Trubetskoy V et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature 604, 502–508 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.International Obsessive Compulsive Disorder Foundation Genetics Collaborative (IOCDF-GC) and OCD Collaborative Genetics Association Studies (OCGAS) Revealing the complex genetic architecture of obsessive-compulsive disorder using meta-analysis. Mol. Psychiatry 23, 1181–1188 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.O’Donovan MC What have we learned from the Psychiatric Genomics Consortium. World Psychiatry 14, 291–293 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Cheverud JM A comparison of genetic and pehnotypic correlations. Evolution 42, 958–968 (1988). [DOI] [PubMed] [Google Scholar]
- 48.de Leeuw CA, Mooij JM, Heskes T & Posthuma D MAGMA: generalized gene-set analysis of GWAS data. PLoS Comput. Biol. 11, e1004219 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.McCutcheon RA, Krystal JH & Howes OD Dopamine and glutamate in schizophrenia: biology, symptoms and treatment. World Psychiatry 19, 15–33 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Giambartolomei C et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 10, e1004383 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Hordyjewska-Kowalczyk E et al. Functional analysis of novel RUNX2 mutations identified in patients with cleidocranial dysplasia. Clin. Genet. 96, 429–438 (2019). [DOI] [PubMed] [Google Scholar]
- 52.Wamsley B et al. Molecular cascades and cell type–specific signatures in ASD revealed by single-cell genomics. Science 384, eadh2602 (2024). [DOI] [PubMed] [Google Scholar]
- 53.Ayalew M et al. Convergent functional genomics of schizophrenia: from comprehensive understanding to genetic risk prediction. Mol. Psychiatry 17, 887–905 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Siokas V et al. Myelin-associated oligodendrocyte basic protein rs616147 polymorphism as a risk factor for Parkinson’s disease. Acta Neurol. Scand. 145, 223–228 (2022). [DOI] [PubMed] [Google Scholar]
- 55.Arnold SE et al. Dysregulation of olfactory receptor neuron lineage in schizophrenia. Arch. Gen. Psychiatry 58, 829–835 (2001). [DOI] [PubMed] [Google Scholar]
- 56.Turetsky BI, Moberg PJ, Arnold SE, Doty RL & Gur RE Low olfactory bulb volume in first-degree relatives of patients with schizophrenia. Am. J. Psychiatry 160, 703–708 (2003). [DOI] [PubMed] [Google Scholar]
- 57.Almandil NB et al. Exome-wide analysis identify multiple variations in olfactory receptor genes (OR12D2 and OR5V1) associated with autism spectrum disorder in Saudi females. Front. Med. 10, 1051039 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Kuo P-H et al. Genome-wide association study for autism spectrum disorder in Taiwanese Han population. PLoS ONE 10, e0138695 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Warnica W et al. Copy number variable microRNAs in schizophrenia and their neurodevelopmental gene targets. Biol. Psychiatry 77, 158–166 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Christen-Zaech S et al. Early olfactory involvement in Alzheimer’s disease. Can. J. Neurol. Sci. 30, 20–25 (2003). [DOI] [PubMed] [Google Scholar]
- 61.Murphy C Olfactory and other sensory impairments in Alzheimer disease. Nat. Rev. Neurol. 15, 11–24 (2019). [DOI] [PubMed] [Google Scholar]
- 62.Hemani G et al. The MR-Base platform supports systematic causal inference across the human phenome. eLife 7, e34408 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Sanderson E et al. Mendelian randomization. Nat. Rev. Methods Prim. 2, 1–21 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.23andMe Research Team. et al. Genome-wide association study of depression phenotypes in UK Biobank identifies variants in excitatory synaptic pathways. Nat. Commun. 9, 1470 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Navari S & Dazzan P Do antipsychotic drugs affect brain structure? A systematic and critical review of MRI findings. Psychol. Med. 39, 1763–1777 (2009). [DOI] [PubMed] [Google Scholar]
- 66.Chopra S et al. Differentiating the effect of antipsychotic medication and illness on brain volume reductions in first-episode psychosis: a longitudinal, randomised, triple-blind, placebo-controlled MRI study. Neuropsychopharmacology 46, 1494–1501 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Zeldovich L Cold parenting? Childhood schizophrenia? How the diagnosis of autism has evolved over time. Science 10.1126/science.aau1206 (2018). [DOI] [Google Scholar]
- 68.Moreau CA et al. Dissecting autism and schizophrenia through neuroimaging genomics. Brain 144, 1943–1957 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Fu Z et al. Dynamic functional network reconfiguration underlying the pathophysiology of schizophrenia and autism spectrum disorder. Hum. Brain Mapp. 42, 80–94 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Bereczki E et al. Synaptic markers of cognitive decline in neurodegenerative diseases: a proteomic approach. Brain 141, 582–595 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Jiang C-C et al. Signalling pathways in autism spectrum disorder: mechanisms and therapeutic implications. Signal Transduct. Target. Ther. 7, 1–36 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Chi S, Yu J-T, Tan M-S & Tan L Depression in Alzheimer’s disease: epidemiology, mechanisms, and management. J. Alzheimers Dis. 42, 739–755 (2014). [DOI] [PubMed] [Google Scholar]
- 73.Dafsari FS & Jessen F Depression—an underrecognized target for prevention of dementia in Alzheimer’s disease. Transl. Psychiatry 10, 1–13 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Ly M et al. Late-life depression and increased risk of dementia: a longitudinal cohort study. Transl. Psychiatry 11, 1–10 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Lalousis PA et al. Neurobiologically based stratification of recent-onset depression and psychosis: identification of two distinct transdiagnostic phenotypes. Biol. Psychiatry 92, 552–562 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Woo M Eyes hint at hidden mental-health conditions. Nature https://www.nature.com/articles/d41586-019-01114-9 (2019). [DOI] [PubMed]
- 77.Tahsili-Fahadan P & Geocadin RG Heart–brain axis. Circ. Res. 120, 559–572 (2017). [DOI] [PubMed] [Google Scholar]
- 78.Leng F & Edison P Neuroinflammation and microglial activation in Alzheimer disease: where do we go from here? Nat. Rev. Neurol. 17, 157–172 (2021). [DOI] [PubMed] [Google Scholar]
- 79.Murphy CE, Walker AK & Weickert CS Neuroinflammation in schizophrenia: the role of nuclear factor kappa B. Transl. Psychiatry 11, 1–13 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Miller AH & Raison CL The role of inflammation in depression: from evolutionary imperative to modern treatment target. Nat. Rev. Immunol. 16, 22–34 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Tan AH, Lim SY & Lang AE The microbiome–gut–brain axis in Parkinson disease — from basic research to the clinic. Nat. Rev. Neurol. 18, 476–495 (2022). [DOI] [PubMed] [Google Scholar]
- 82.Morais LH, Schreiber HL & Mazmanian SK The gut microbiota–brain axis in behaviour and brain disorders. Nat. Rev. Microbiol. 19, 241–255 (2021). [DOI] [PubMed] [Google Scholar]
- 83.Tost H, Champagne FA & Meyer-Lindenberg A Environmental influence in the brain, human welfare and mental health. Nat. Neurosci. 18, 1421–1431 (2015). [DOI] [PubMed] [Google Scholar]
- 84.Ambitious Project Announced to Create the World’s Largest Longitudinal Imaging Dataset (UK Biobank, 2022); https://www.ukbiobank.ac.uk/learn-more-about-uk-biobank/news/ambitious-project-announced-to-create-the-world-s-largest-longitudinal-imaging-dataset [Google Scholar]
- 85.Wang R, Chaudhari P & Davatzikos C Embracing the disharmony in medical imaging: a simple and effective framework for domain adaptation. Med. Image Anal. 76, 102309 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Varol E, Sotiras A & Davatzikos C HYDRA: revealing heterogeneity of imaging and genetic patterns through a multiple max-margin discriminative analysis framework. NeuroImage 145, 346–364 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Tustison NJ et al. N4ITK: improved N3 bias correction. IEEE Trans. Med. Imaging 29, 1310–1320 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Doshi J et al. MUSE: MUlti-atlas region segmentation utilizing ensembles of registration algorithms and parameters, and locally optimal atlas selection. Neuroimage 127, 186–195 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Manichaikul A et al. Robust relationship inference in genome-wide association studies. Bioinformatics 26, 2867–2873 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Price AL, Zaitlen NA, Reich D & Patterson N New approaches to population stratification in genome-wide association studies. Nat. Rev. Genet. 11, 459–463 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Abraham G, Qiu Y & Inouye M FlashPCA2: principal component analysis of Biobank-scale genotype datasets. Bioinformatics 33, 2776–2778 (2017). [DOI] [PubMed] [Google Scholar]
- 92.Purcell S et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet 81, 559–575 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Watanabe K, Taskesen E, van Bochoven A & Posthuma D Functional mapping and annotation of genetic associations with FUMA. Nat. Commun. 8, 1826 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.The GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Bulik-Sullivan B et al. An atlas of genetic correlations across human diseases and traits. Nat. Genet. 47, 1236–1241 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Bowden J et al. A framework for the investigation of pleiotropy in two-sample summary data Mendelian randomization. Stat. Med. 36, 1783–1802 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 97.Bowden J, Davey Smith G & Burgess S Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 44, 512–525 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Ge T, Chen C-Y, Ni Y, Feng Y-CA & Smoller JW Polygenic prediction via Bayesian regression and continuous shrinkage priors. Nat. Commun. 10, 1776 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Wen J et al. Convolutional neural networks for classification of Alzheimer’s disease: overview and reproducible evaluation. Med. Image Anal. 63, 101694 (2020). [DOI] [PubMed] [Google Scholar]
- 100.Nadeau C & Bengio Y Inference for the generalization error. Mach. Learn. 52, 239–281 (2003). [Google Scholar]
- 101.Wen J MEDICINE: AI-derived GWAS summary statistics sharing portal. Preprint at 10.7303/syn64923248 (2025). [DOI] [Google Scholar]
- 102.Wen J DNE_Figures_Dataset. Zenodo 10.5281/zenodo.15238098 (2025). [DOI] [Google Scholar]
- 103.Wen J MLNI (version 0.1.4). GitHub https://github.com/anbai106/mlni (2025). [Google Scholar]
- 104.Yang Z Surreal-GAN (version 0.1.1). GitHub https://github.com/zhijian-yang/SurrealGAN (2025). [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The GWAS summary statistics and pre-trained AI models from this study are publicly accessible via the MEDICINE Knowledge Portal (https://labs-laboratory.com/medicine/) and Synapse (https://www.synapse.org/Synapse:syn64923248/wiki/630992)101. Genomic loci annotation used data from FUMA (https://fuma.ctglab.nl/). UKBB data can be requested at https://www.ukbiobank.ac.uk/. GWAS summary data from the PGC can be accessed at https://pgc.unc.edu/. MUSE atlas is generated via the pipeline at https://github.com/CBICA/MUSE. The raw data for the main figures and Supplementary Files 1–23 are publicly available via Zenodo at https://zenodo.org/records/15238099 (ref. 102).
The software and resources used in this study are all publicly available via MLNI103 (HYDRA) at https://github.com/anbai106/mlni (DNEs for ASD1–3, LLD1–2, SCZ1–2) and Surreal-GAN104 at https://github.com/zhijian-yang/SurrealGAN (DNEs for AD1–2).
