Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

Research Square logoLink to Research Square
[Preprint]. 2026 Jan 22:rs.3.rs-8311177. [Version 1] doi: 10.21203/rs.3.rs-8311177/v1

Deep-learning-derived glaucoma-related endophenotypes enable novel genome-wide genetic and functional discovery

Liyin Chen 1, Yan Zhao 2, Saber Kazeminasab Hashemabad 3, Tobias Elze 4, Mohammad Eslami 5, Mengyu WANG 6, Janey Wiggs 7, Ayellet Segre 8, Nazlee Zebardast 9
PMCID: PMC12869636  PMID: 41646398

Abstract

The genetic architecture of primary open-angle glaucoma (POAG), a leading cause of irreversible blindness, remains largely unexplained due to the reliance of previous genome-wide association studies (GWAS) on imprecise phenotypes from electronic health records. Here, we overcome this with a disease-trained, task-transfer machine learning (ML) framework that learns glaucoma-related damage patterns from a large clinical repository of 8,323 glaucoma patients. We showed that ML optical coherence tomography (OCT)-derived endophenotypes trained on 18,985 OCT scans from these patients identified novel loci associated with POAG. By applying the derived endophenotypes to 47,908 UK Biobank participants, we performed GWAS in European, African, and Asian ancestral groups followed by cross-ancestry meta-analyses. In total, we identified 36 and 43 LD-independent GWAS loci that passed genome-wide significance in the EUR and cross-ancestry meta-analysis, respectively. About two thirds of the identified loci overlapped with previously reported POAG related associations, demonstrating the validity of our approach. Importantly, more than a third (21) of the loci were novel to glaucoma. Extensive functional analyses, including Bayesian colocalization analysis, gene-based association tests, Mendelian randomization, and single-cell enrichment analysis, converged on 11 high-confidence gene effectors, five of which were novel to glaucoma. These genes support Wnt-mediated outflow dysfunction and retinal ganglion cell vulnerability in POAG pathogenesis and are potential actionable drug targets. Our findings expanded POAG genetic associations, provided mechanistic insights at cell-type resolution, and proposed plausible putative causal genes. This study provides a powerful, generalizable ML-driven strategy for accelerating the discovery of disease mechanisms and therapeutic targets for complex diseases.

INTRODUCTION

Glaucoma is a leading cause of irreversible blindness worldwide, affecting over 80 million people worldwide and is projected to reach > 110 million by 2040 (1, 2). Because current treatments primarily reduce intraocular pressure (IOP), deeper molecular insight is essential to shift glaucoma care from damage limitation to neuroprotection and prevention. As glaucoma is a highly heritable disease, especially its most common form, primary open angle glaucoma (POAG), numerous genome-wide association studies (GWAS) have been carried out for this disease over the last two decades (3, 4, 411). Despite these efforts, currently known POAG-associated loci only explain about 9.4% of the disease heritability (5, 12).

Accurate disease phenotyping is necessary for successful GWAS, and a limitation of prior glaucoma GWAS is their reliance on diagnostic codes and case-control definitions. These conventional phenotyping methods suffer from noise inherent to diagnostic codes in electronic health records (13, 14). By contrast, studies based on clinically examined cases and controls are often underpowered due to smaller sample sizes. Moreover, the case-control phenotyping approach fails to capture of full spectrum of disease heterogeneity. To address these limitations, intermediate quantitative endophenotypes are often used, because they are measured on a continuous scale, offering greater statistical power than dichotomous labels. Using a multi-trait analysis, prior work has demonstrated the utility of endophenotypes such as intraocular pressure (IOP) (15) and vertical cup-to-disk-ratio (vCDR) for genome-wide discovery for POAG (16). However, while these endophenotypes significantly improved GWAS power, they are indirect and imperfect surrogate for glaucomatous damage(17, 18).

An objective indicator of glaucomatous change include thinning of the retinal nerve fiber layer (RNFL) and the ganglion cell complex (GCC) (19) as measured by optical coherence tomography (OCT) scans, routinely used in clinical practice. Several studies have performed GWAS on average retinal layer thicknesses calculated from OCT scans (2022). The association signals from these studies spanned both loci previously implicated in ocular or systemic disease and loci that modulate normal inter-individual variation in retinal morphology, demonstrating that retinal layer thicknesses capture functional and structural variations in the retina. However, the use of global average thicknesses, which compress information from 2-D thickness maps to a singular value, miss spatially heterogeneous patterns of damage that are typical of glaucomatous eyes. For example, Currant et al., 2021 attempted to use the average inner retinal layer thickness as an endophenotype for POAG but only replicated one known POAG loci.

With the recent advances in machine learning (ML) methods, nuanced endophenotypes from medical images, including OCT-derived retinal thickness maps, have been extracted for enhanced genome-wide association studies (2327). Recent studies sought to enhance genome-wide discovery by performing GWAS on latent representations of retinal structural variability extracted from OCT-derived thickness maps (28, 29). While these studies extracted richer features beyond global average layer thicknesses, they fail to capture disease-specific variation when trained entirely on a largely healthy UKBB population. As a result, identified endophenotypes reflect general morphological variation of the human retina rather than disease-relevant changes.

To allow for disease-specific genome-wide discovery, here, we developed a task-transfer approach to improve quantitative phenotyping for POAG. We used unsupervised machine learning and a large clinical repository of OCT scans from 8,323 open-angle glaucoma (OAG) patients to capture clinically-derived glaucoma-related damage patterns that are not captured in largely healthy population-based cohorts. We subsequently mapped our trained ML endophenotypes onto the OCT scans from 47,908 participants in the UKBB, a large biobank where genotype data is available. This approach leverages the large number of OAG patients in our clinical cohort while bridging both the paucity of glaucoma cases in UKBB and the absence of genotypes in the clinical cohort, enabling well-powered, disease-specific genetic studies.

We identified 36 and 43 GWAS-specific LD independent loci with genome-wide significance from the EUR and cross-ancestry meta-analysis, respectively. Importantly, 21 loci have not been previously reported to be associated with glaucoma or well-established glaucoma-related traits. We further explored the functional significance of these previously unidentified loci by performing gene-based association tests, colocalization analysis with expression and splicing quantitative trait loci (eQTL and sQTL), mendelian randomization, and cell type enrichment analysis. By integrating multiple lines of genetic evidence, we highlight 11 gene associations, including 5 novel ones, for their functional relevance to glaucoma, which can be further investigated as clinically interpretable therapeutic leads. Overall, by uniting disease-focused deep representation learning with cross-ancestry genetics and functional single-cell omics, we validated and extended the putative causal variants and genes for POAG. Beyond ophthalmology, our study establishes a scalable, generalizable framework for ML-driven phenotyping of complex diseases and novel genome-wide discovery.

RESULTS

Study Design

We previously developed an ML method to extract quantitative endophenotypes from GCC and RNFL thickness maps derived from macular OCT scans(30). Briefly, two deep learning models, an unsupervised autoencoder (AUTO) and a self-supervised momentum contrastive learning-based method (MOCO), were used to learn the latent representations of the GCC and the RNFL thickness maps. The latent representations then underwent dimensionality reduction and gaussian mixture modeling for clustering. This method allows for the identification of distinct retinal layer endophenotypes for subsequent genetic studies. We have previously shown the clinical correlations of these endophenotypes.

The study design of this work is illustrated in Fig. 1. Specifically, we trained the two models described above (MOCO and AUTO) using 18,985 macular OCT scans of 8,323 glaucoma patients (99% with open-angle glaucoma [OAG]) treated at Massachusetts Eye and Ear (MEE). The trained model was then mapped to 86,115 OCT scans from 35,739 European (EUR), 1,677 Asian (ASN), and 1,730 African (AFR) participants with genotype array data in the UKBB (1,074 [0.027%] were diagnosed with glaucoma by ICD code and self-reports). Twenty-one endophenotypes, 14 from the AUTO model and 7 from the MOCO model, were found in UKBB.

Figure 1. Overview of the study design.

Figure 1

The flowchart illustrates the main steps of the study. First, ML models were trained on 18,985 OCT scans from 8,323 glaucoma patients at MEE to derive glaucoma-related endophenotypes. These trained models were then applied to 86,115 OCT scans from 47,908 UKBB participants to generate quantitative traits for genetic analysis. GWAS were performed separately on participants of European, African, and Asian ancestry using a 75% discovery and 25% replication set. The results were combined through ancestry-specific and subsequent cross-ancestry GWAS meta-analyses for each ML-OCT-derived endophenotype. Finally, a suite of functional analyses, including gene-based tests, colocalization, Mendelian randomization, and cell-type enrichment, was conducted to interpret the genetic findings and prioritize causal genes.

The genotype arrays in the UKBB were imputed and underwent quality controls, yielding an average of 8,590,260 common variants (minor allele frequency [MAF] >0.01) in each population for genetic association tests. Because of the lack of a separate cohort with both macular OCT scans and genotyping data for validation, to identify a high confidence set of variants, we divided the UKBB dataset from each population into a discovery and replication subset in a 3:1 ratio. GWAS was performed on each endophenotype and study subsets using a fixed effects linear regression model, adjusting for age at recruitment, sex, refractive error (measured as spherical equivalent) and top 10 genotype principal components (PCs), separately for each ancestral group, followed by meta-analyses of the discovery and replication subsets. A final cross-ancestry meta-analysis was conducted to enhance statistical power and identify shared genetic risk factors across populations. Post-GWAS functional analytical approaches were then applied to prioritize the genes and potential underlying causal mechanisms of the genetic findings for these endophenotypes.

GWAS and meta-analysis on ML-OCT-derived glaucoma endophenotypes

We performed GWAS of 21 total endophenotypes derived using our two ML methods in the EUR subsets. In the EUR discovery GWAS, 637 and 23 unique LD independent (r2 < 0.2) variants passed suggestive (P ≤ 1×10−5) and genome-wide significance (P ≤ 5×10−8), respectively (SI File 1). Of these, 63 and 14 unique variants passed nominal significance (P ≤ 0.05) and Bonferroni-correction in the replication GWAS, respectively. In the EUR meta-analysis of the discovery and replication subsets, 36 GWAS-specific LD-independent variants passed genome-wide significance (P ≤ 5×10−8) across the 21 endophenotypes (Fig. 2a, SI File 2). Among them, 17 passed the Bonferroni-corrected genome-wide significance adjusting for the number of endophenotypes (P ≤ 5×10−8 / 21). The effect sizes of these variants show high concordance (R2 = 0.93) between the EUR discovery and replication subsets (Fig. 3a).

Figure 2. Overlayed Manhattan plots of GWAS meta-analyses for 21 ML-OCT-derived endophenotypes.

Figure 2

Panel (a) displays the results from the EUR meta-analysis, while panel (b) shows the results from the cross-ancestry meta-analysis. In both plots, the y-axis represents the −log10(P-value) of association, and the x-axis shows the chromosomal position. The red dashed line indicates the threshold for genome-wide significance (P ≤ 5 × 10−8). Each point represents a genetic variant, with colors indicating the number of endophenotypes for which the variant reached significance. The shape of the points corresponds to the ML model and retinal layer for the most significant association at that locus. Nearest genes of the genome-wide significant loci are labeled.

Figure 3. ML-OCT-derived endophenotypes replicated and expanded glaucoma genetic associations with consistent effect sizes across cohort subsets and ancestries.

Figure 3

a) The effect sizes of significant loci in the EUR discovery GWAS stayed highly concordant in the replication GWAS. b) Representative forest plot to show that the effect sizes of genome-wide significant loci were relatively consistent across meta-analysis with different ancestries. The large standard errors in AFR and AMR were due to the small sample sizes. c) and d) Pie plots showing the proportions of EUR and cross-ancestry meta-analysis genome-wide significant loci that were previously reported in glaucoma, open-angle glaucoma (OAG), intraocular pressure (IOP), retinal nerve fiber layer thickness (RNFL), retinal layer thickness (RLT), and macular thickness (MT).

Due to the small sample sizes of the AFR and ASN groups, only 4 and 3 LD independent loci were identified, respectively for these population meta-analyses (SI Sheet 2). Nevertheless, the cross-ancestry meta-analysis identified 43 unique LD independent genome-wide significant loci across the 21 endophenotypes, with 20 passing the Bonferroni-corrected genome-wide significance (P ≤ 5×10−8 / 21) (Fig. 2b) (SI File 2). Of the 36 unique LD independent variants identified in the EUR meta-analysis, 49% (18) replicated in the cross-ancestry meta-analysis at genome-wide significance.

Effect-size heterogeneity (I2) for the genome-wide significant loci was minimal in both the EUR and cross-ancestry meta-analyses, indicating consistent genetic associations across GWAS subsets and across populations. We visualized the consistency of effect sizes across EUR and cross-ancestry meta-analyses using forest plots (Fig. 3b, SI File 3). All loci demonstrated highly consistent effect sizes between EUR and cross-ancestry analyses (Pearson correlation coefficient r = 0.998, P = 1.341 × 10−52). Most loci also exhibited consistency within AFR and ASN populations, although these effect estimates had larger standard errors and did not reach statistical significance, as expected due to the smaller sample sizes. None of the GWAS or meta-analyses showed any evidence of population structure inflation (l=1.000–1.028) (SI File 4).

Genetic associations for glaucoma were replicated and expanded

We systematically compared the genome-wide significant loci identified in our EUR and cross-ancestry meta-analyses to previously reported associations with POAG and established glaucoma-related traits, including IOP, RNFL thickness and macular thickness. Of the LD independent genome-wide significant loci, 23 out of 36 (64%) in the EUR meta-analysis and 27 out of 43 (63%) in the cross-ancestry meta-analysis replicated known glaucoma-related genome-wide significant genetic associations (including LD proxies with r2 > 0.1) (Fig. 3c, 3d) (SI File 5). The novel glaucoma loci found in this study were listed in Table 1. Multiple of them were overlapped between the EUR and the cross-ancestry meta-analyses, yielding 21 unique novel loci.

Table 1.

Novel loci from EUR and cross-ancestry ML-OCT endophenotype GWAS meta-analyses.

Population Locus Chromosome Position (hg38) Effect allele Beta Standard error P Direction* Nearest gene
Cross-ancestry 1:161386627_AT_A 1 161416837 A −0.2308 0.0408 1.60E-08 −−−−−− ENSG00000283360
Cross-ancestry rs3767216 1 19813847 A −0.0437 0.0076 8.66E-09 −+−−−+ RNF186
EUR rs3767216 1 19813847 G −0.0449 7.90E-03 1.48E-08 −− RNF186
EUR 2:145315140_GGTGT_G 2 144557573 G −0.0453 8.00E-03 1.86E-08 −− lnc-ZEB2–3/LINC01412
Cross-ancestry 2:145315140_GGTGT_G 2 144557573 G −0.0426 0.0077 3.56E-08 −+−−−+ ENSG00000235435
Cross-ancestry rs62172787 2 173960997 A −0.1375 0.0251 4.59E-08 ??−−?? SP3
EUR rs62172787 2 173960997 G −0.1375 2.51E-02 4.59E-08 −− SP3
Cross-ancestry rs61416912 3 194092883 T −0.075 0.0127 3.06E-09 −−−−+− NA
Cross-ancestry rs10933712 3 194093395 A −0.0712 0.0127 1.92E-08 −−−−+− ENSG00000229964
Cross-ancestry rs11728835 4 4361027 A −0.091 0.0162 2.02E-08 −−−−−− NSG1
Cross-ancestry rs56834952 4 147970941 A 0.0679 0.0119 1.14E-08 ++++++ ARHGAP1
Cross-ancestry 8:109194601_CA_C 8 108182372 CA −0.0498 0.0085 4.75E-09 −−−−−− NA
EUR 8:109194601_CA_C 8 108182372 CA −0.0483 8.80E-03 4.70E-08 −− EIF3E
EUR rs4750249 10 12706817 C 0.0507 9.00E-03 1.93E-08 ++ CAMK1D
EUR rs112115087 14 60341147 G −0.0512 7.80E-03 4.76E-11 −− NA
Cross-ancestry rs141927261 14 74148741 G 0.0661 0.0112 3.32E-09 ++++++ NA
EUR rs72766456 16 8622271 G −0.0653 1.18E-02 3.50E-08 −− METTL22
EUR 16:80430159_CG_C 16 80396262 C 0.1751 3.05E-02 9.48E-09 ++ MAF
EUR rs72855624 17 81562724 A −0.0781 0.0116 1.61E-11 −− NPLOC4
Cross-ancestry rs72855624 17 81562724 A −0.0732 0.011 2.32E-11 −−−−−+ NPLOC4
Cross-ancestry 17:79614932_TTAAC_T 17 81647906 T 0.0722 0.0076 2.37E-21 −+++++ NPLOC4
EUR 17:79614932_TTAAC_T 17 81647906 TTAAC −0.0621 7.80E-03 1.71E-15 −− NPLOC4

The table lists the 21 novel, LD-independent loci that reached genome-wide significance (P ≤ 5 × 10−8) and have not been previously associated with glaucoma or its related traits. For each lead variant, the table details the population in which it was identified, its chromosomal position (genome build hg38), the effect allele, the effect size (beta) and standard error and association P-value from the corresponding meta-analysis, and the nearest mapped gene.

*

The direction of effect signs corresponds to the discovery and replication GWAS in the EUR meta-analysis, and discovery and replication for each of the populations in the following order: AFR, EUR and ASN.

Gene-based association test

We performed gene-based association analysis for each ML-OCT-derived endophenotype using MAGMA and identified 28 and 27 genes that passed the gene-based Bonferroni-corrected threshold (P < 0.05/18,658 genes tested) for both the EUR and cross-ancestry meta-analyses, respectively. Among them, 68% (19) of the genes significant in EUR were also significant in the cross-ancestry meta-analyses. Full gene-based tests results are in SI File 6.

Colocalization analysis of GWAS significant loci with cis eQTLs and cis sQTLs

To further propose putative causal genes that may underlie genome-wide significant loci for these endophenotypes through regulatory effects, we used e/sQTLs in brain, nerve, artery, and fibroblast tissues from the Genotype-Tissue Expression (GTEx) project (31), as well as peripheral retina eQTLs (32). We applied eCAVIAR, a Bayesian colocalization method, to the genome-wide significant loci from each of the endophenotypes’ meta-analyses that had at least one genome-wide significant hit. e/sQTLs from the selected tissues that overlapped each GWAS locus LD interval were included for colocalization.

We found that 72% (26/36) of EUR GWAS loci and 40% (17/42) of cross-ancestry GWAS loci identified in our study significantly colocalized with at least one eQTL and/or sQTL (colocalization posterior probability [CLPP] >0.01). The retinal eQTLs that significantly colocalized largely overlapped with significant e/sQTLs identified in GTEx tissues. In total, 25 genes from the EUR meta-analysis and 24 genes from the cross-ancestry meta-analysis showed significant colocalization with the identified genome-wide significant loci (SI File 7). Of these, 95% (20/25) and 79% (19/24) of the colocalizing genes in EUR and cross-ancestry meta-analysis were protein-coding, respectively.

Figure 4 and Fig. S1 summarizes the e/sGenes that colocalized significantly with at least one of the genome-wide significant loci from the cross-ancestry and EUR meta-analysis, respectively. Several e/sGenes, including EIF3E, PCNX4, PBLD, TSPAN10, and DNAJC12, colocalized with the novel loci in this study. In addition, 36% (10) and 30% (8) of the significant genes from the EUR and cross-ancestry gene-based tests overlapped with the significant e/sGenes identified through the colocalization analysis, respectively.

Figure 4. Colocalizing e/sQTLs in GTEx tissues and retina with genome-wide significant loci in the cross-ancestry meta-analyses.

Figure 4

Shown are genes with at least one significant colocalization (CLPP > 0.01) between e/sQTLs tested across selected GTEx tissues and peripheral retina and genome-wide significant loci identified in the cross-ancestry meta-analysis. Bubble size represents the maximum colocalization posterior probability among all e/sVariants tested for each gene, QTL type, and tissue combination. Points are color-coded according to the relative direction of effect between the QTL and the GWAS meta-analysis.

Cell-type enrichment analysis

To connect the identified genes to involved cell types and disease mechanisms, we applied ECLIPSER (33, 34) to test whether the expression of the identified colocalizing e/sGenes was enriched in specific cell types in key eye tissues relevant to the pathophysiology of glaucoma, the outflow pathways, retina, and optic nerve head and surrounding tissues. Consistent with Hamel et al. (2024) (35), we observed nominal enrichment (P<0.05, FDR<0.2) of RNFL endophenotype mapped genes in fibroblasts in both the unconventional (ciliary body and iris) and conventional (trabecular meshwork/Schlemm’s canal) outflow pathways in the anterior segment, and fibroblasts and astrocytes in the optic nerve head (Table 2, SI File 8, Fig. 5a, Fig. S2, and Fig. S3). These recurring cell-type signals underscore the central importance of outflow pathway fibroblasts, retinal support cells, and ONH macroglia in glaucoma pathogenesis and validate the robustness of our analysis.

Table 2.

Summary of the ocular cell types showing significant enrichment of expression of e/sQTL-mapped genes to endophenotype loci.

Tissue Cross-ancestry, GCC Cross-ancestry, RNFL EUR GCC EUR RNFL
Anterior segment (trabecular meshwork/Schlemm's canal, iris, central cornea, corneoscleral wedge, lens) Iris sphincter*, CB_NPCE, goblet, lens_fiber, Uveal_Melanocyte goblet, CB_NPCE, iris_fibro, ciliary_fibro, K_fibro, iris_RPE, TM_fibro, K_Endo CB_NPCE, Lens_Fiber, Uveal_Melanocyte, CB_PCE, Iris_PPE, Conj_Melanocyte, Iris_APE, Lens_EarlyFiber, Iris_Sphincter, Pericytel CB_NPCE, Iris_Fibro, Ciliary_Fibro, Iris_PPE, TM_Fibro, Goblet
Retina RPE*, Both_AC, ON_MGC, OFF_MGC Astrocyte RPE*, Both_AC, Other_RGC, Cone
Optic nerve head, optic nerve and surrounding posterior tissues Fibroblasts, melanocytes, immune cells, 6_Astro, 8_Astro Fibroblasts, astrocytes, 6_Astro, 8_Astro 3-Melanocyte*, 14-Melanocyte* 17-Fibro, 28-Fibro, 5-Fibro, 10-Fibro, 22-Fibro, 22-Astro
Macula RPE*, H2 RPE*, H2

An asterisk (*) denotes cell types that passed the stringent tissue-wide significance threshold (BH FDR<0.1), while all other listed cell types passed nominal significance (P<0.05). Cell types are listed in order of significance in GCC and RNFL endophenotypes. Cell type descriptions are available in Tables S19 (cross-ancestry meta-analyses) and S20 (EUR meta-analyses).

Figure 5. Cell type enrichment analysis of the cross-ancestry meta-analysis in the optic nerve head and retina.

Figure 5

Panels (a) and (c) show the fold-enrichment of cell type-specific expression for genes mapped from the cross-ancestry GWAS meta-analysis loci in the optic nerve head for RNFL-derived endophenotypes and retina for GCC-derived endophenotypes, respectively. Cell types are ranked by their enrichment score (x-axis). Points are colored by statistical significance: red indicates tissue-wide significance (FDR < 0.1), while grey indicates nominal significance (P < 0.05). Panels (b) and (d) highlight the specific genes driving the most significant enrichment signals from the corresponding plots on the left. Panel (b) shows that RSPO2 is the primary driver of the enrichment in optic nerve head fibroblasts. Panel (d) shows that DYNC1I2, NPLOC4, and SLC25A12collectively drive the enrichment in RGC clusters. The horizontal dashed line represents log2(Fold-change) of 0.375 (FC = 1.3) and FDR < 0.1 used as the cell type-specificity enrichment cutoff. FC, fold-change of gene expression in given cell type relative to all other cells in the given tissue.

Importantly, our analysis highlighted several enriched cell types and lead genes that were previously unreported but may mediate glaucoma risk. Nominal enrichment was observed in fibroblasts of several anterior and posterior segment cell types in the EUR and cross-ancestry meta-analyses, which were all driven by RSPO2 (Fig. 5a, 5b). Notably, RSPO2 appeared repeatedly in our functional analyses. One of the most significant cell type enrichments found for GCC endophenotype genes (tissue-wide Benjamini-Hochberg adj. P<0.05) was in the retinal ganglion cell (RGC) cluster of all RGCs other than ON-MGC and OFF-MGC cells in the retina, that was primarily driven by DYNC1I2, NPLOC4, and SLC25A12 (Fig 5c, 5d). Additional cell types showing tissue-wide significance for the GCC endophenotype loci included optic nerve head melanocytes, retinal/macular RPE cells (Fig. 5c) and iris sphincter muscle cells. The pigmented cell type enrichments were driven by genes (OCA2, NPLOC4 and TSPAN10) associated with myopia(36), a risk factor for POAG.

Mendelian randomization of putative causal genes

To provide additional support for a causal relationship between e/sQTLs and our glaucoma endophenotype associations, we applied Generalized Summary-data-based Mendelian Randomization (GSMR) to the 12 genes that show statistical significance in at least two of the three functional analyses we applied to the endophenotype GWAS: colocalization analysis, gene-based association testing, and cell-type enrichment analysis (Table 3). After removing variants with horizontal pleiotropy, 92% (11) of genes showed evidence of causality using a Bonferroni-corrected threshold (P < 0.05/13) accounting for the number of genes tested. Complete MR results are available in the SI File 9.

Table 3.

Summary of functional evidence for prioritized genes.

Gene Locus Gene-based test Colocalization Cell-type enrichment analysis MR
ATOH7 chr10:68281826:C:T + + +
EIF3E chr8:108279863:C:T + + +
LIN52 chr14:74199522:T:C + + +
MYPN chr10:68281826:C:T + +
NPLOC4 chr14:74199522:T:C + + + +
PBLD chr10:68281826:C:T + + +
RSPO2 chr8:108170015:T:C + + + +
SLC25A16 chr10:68263296:A:G + + + +
TSPAN10 chr17:81644245:A:G + + + +
C14orf39 chr14:60490453:G:A + + +
SLC25A12 chr2:171739103:T:TG + + +
DYNC1I2 chr2:171739103:T:TG + + +

The table highlights genes that were statistically significant in at least two of the primary functional analyses performed. A "+" indicates that a gene passed the significance threshold for the corresponding analysis; A "−" indicates the gene did not meet the significance threshold for that analysis. MR, Mendelian randomization.

Gene-set enrichment analysis

Gene-set enrichment analysis of the endophenotype GWAS using MAGMA recapitulated known POAG biology while sharpening the axes involved. In both the EUR and cross-ancestry meta-analyses, we observed robust evidence for lipid/cholesterol regulation and innate inflammatory programs, highlighted by enrichment of SREBF1A targets (P = 8.31 × 10−3 , SI File 10), response to oxidized phospholipids (P = 1.22 × 10−4), IL-17/IL-22 signaling (P = 2.07 × 10−2) and suppression of phagosome maturation (P = 2.80 × 10−5). Vascular and hypoxic responses were prominent (VEGFR1/2 signaling, P = 1.54 × 10−5; HIF1/2 targets, P = 2.66 × 10−3), as well as developmental/ECM-remodeling cascades relevant to aqueous outflow and retinal support tissues (WNT signaling and Notch activation). We also observed the calcium signaling pathway enriched in both analyses (P = 1.98 × 10−4). Our results reinforced and extended the pathway landscape reported by recent large POAG GWAS(12, 35), providing support for altered lipid handling, para-inflammation, vascular/hypoxic stress, and tissue-remodeling signaling as biological processes influencing POAG.

DISCUSSION

Our study shows that disease-focused, deep-learning phenotypes extracted from macular OCT can expand the genetic understanding of POAG. Our European and cross-ancestry meta-analyses uncovered 36 and 43 genome-wide significant loci, respectively. Among these loci, roughly 65% were previously reported to be associated with OAG or at least one well-established glaucoma-related trait, such as IOP and retinal layer thickness. The concordance of our GWAS hits with established glaucoma loci suggest that our ML-OCT-derived endophenotypes are disease specific and able to capture contributions of glaucomatous structural damage. More importantly, however, we identified 21 novel genome-wide significant loci, where several of the mapped genes are highly expressed in relevant eye tissues, such as ONH and retina. Biological relevance of the identified loci is further supported by multiple lines of evidence.

Using systematic functional analyses, we advance from association to higher-resolution gene and cell-type inference, providing a basis for mechanistic hypotheses that can be followed up for POAG. Bayesian colocalization with multiple GTEx tissues that contain relevant cell types plus retinal eQTLs linked 72% of European association signals to likely regulatory gene/s in tissues with eye-relevant cell types. MAGMA gene-based tests uncovered 28 significant genes from the European meta-analysis, 68% of which overlapped with those from the cross-ancestry analysis. Importantly, 36% of MAGMA hits in EUR cohort overlapped colocalizing e/sGenes, providing further statistical support. Cell-type specific enrichment analysis connected these genes to the cellular architecture of the eye, where pigmented cells in the retina and ONH, ONH and outflow pathway fibroblasts, and ONH astrocytes that emerged as the most over-represented populations, echoing the cell-types produced by Hamel et al. (2024). The enrichment in outflow pathway cells suggested that elevated IOP also plays a role in affecting glaucomatous damage in the retina. Not previously observed with POAG GWAS loci was enrichment in non-midget ganglion cell RGCs driven by NPLOC4, SLC25A12, and DYNC1I2. Mendelian randomization further strengthened the role of the identified genes in glaucoma pathogenesis. This layered approach prioritized 11 putative causal genes (EIF3E, LIN52, NPLOC4, PBLD, RSPO2, SLC25A16, TSPAN10, DYNC1I2, SLC25A12, ATOH7, and C14orf39).

Among the prioritized genes, some are known to be associated with POAG related traits, while others are novel. Specifically, ATOH7 and C14orf39 are genes that have been previously proposed to potentially contribute to POAG risk (37, 38). SLC25A16, and PBLD are genes near ATOH7 and have been reported to be associated with VDCR. TSPAN10 and NPLOC4 have been reported to be associated with ocular traits like macular thickness and optic disc parameters. Importantly, five genes (RSPO2, EIF3E, LIN52, DYNC1I2, SLC25A12) that showed strong supportive causal evidence in our studies have not been reported previously. For example, EIF3E, a gene from one of our novel loci, highly significant in the gene-based association test (P = 4.52 × 10−13), showed a significant colocalization signal with multiple brain and artery tissues, and had elevated expression across several outflow pathway, ONH and retinal cell types (Fig. S4). As another example, RSPO2 was significant in gene-based association tests (P = 2.13 × 10− 9) and colocalized with eQTLs in brain cortex. Single-cell expression in ocular cell types showed elevated RSPO2 expression in the fibroblasts of ONH and multiple cell types in the anterior segment, with the highest expression levels in fibroblasts in the trabecular meshwork, iris and ciliary body. Cell-type enrichment analysis further supported RSPO2 as the main driver of fibroblast enrichment in these regions.

The genes and ocular cell types identified in this study point to two connected biologic mechanisms of POAG risk. RSPO2 is a potentiator of canonical Wnt signaling (39); its high expression in anterior-segment and ONH fibroblasts suggest a role in extracellular-matrix remodeling and aqueous outflow regulation that potentially leads to elevated IOP. On the neuronal side, EIF3E, an initiator for protein synthesis and DNA damage repair (40), shows strong gene-based and colocalization support with broad retinal expressions, implicating a potential role in RGC resilience. DYNC1I2 underlies enrichment in an RGC subtype clusters and encodes a dynein-1 intermediate chain; dynein-dependent axonal transport is known to fail early in experimental glaucoma (41), potentially linking this locus to RGC transport vulnerability. SLC25A12, a mitochondrial aspartate–glutamate carrier, contributes to the same RGC enrichment and supports neuronal energy metabolism. This data supports a model where Wnt-tuned fibroblast programs in the ONH and outflow pathway modulate IOP and biomechanical stress, which then interface with RGC-intrinsic capacities for protein synthesis/repair, axonal transport, and mitochondrial bioenergetics to determine susceptibility to glaucomatous injury.

While these novel genes do not have established therapeutic agents, publicly available data suggests they represent highly tractable targets for future drug discovery programs(42). For instance, RSPO2, EIF3E, and DYNC1I2 possess known protein-ligand structures that can guide small molecule development. Specifically, the existence of a co-crystal structure of RSPO2 with a small molecule ligand provides a direct blueprint for structure-based drug design. Furthermore, as RSPO2 is a secreted potentiator of canonical Wnt signaling highly expressed in anterior-segment fibroblasts, developing a neutralizing antibody represents a highly feasible and specific therapeutic strategy. These results demonstrated that our genetic findings not only contributed to the understanding of POAG biology, but also proposed high-confidence starting points for functional validations with patient-derived cells and animal models, ultimately paving the way for novel IOP-lowering and neuroprotective treatments for POAG.

There are several limitations in this work. While our ML models were trained on a large group of patients with POAG related ICD codes, inherent coding errors may introduce some noise. While not feasible at this scale, focusing on patients with confirmed POAG, as verified by chart review, may have resulted in more specific POAG endophenotypes. Nevertheless, we expect the size of our dataset largely overcomes this limitation. Additionally, due to the lack of an external cohort with available OCT and genotype data, GWAS replication was performed by a 75:25 split of the UK Biobank samples, a less stringent approach than true external replication, but represents the most rigorous validation currently feasible at this scale. Importantly, the strong concordance of effect sizes between our discovery and replication sets, combined with extensive validation from multiple downstream functional analyses, provides high confidence in our findings and establishes a powerful phenotyping framework for future replication efforts. Regarding the association analyses, although the first ten genotype principal components were included as covariates, subtle population substructure may persist and inflate association signals. Finally, the generalizability of UK Biobank results is limited due to its European-centric sample. Confirmation of the smaller but directionally aligned effects seen in our African and Asian groups is needed using larger non-European OCT cohorts.

In conclusion, by integrating disease-trained ML-OCT-derived endophenotypes with systematic functional genomics, we have successfully expanded the number of glaucoma associated loci, linked them to plausible causal genes, and mapped them to specific eye cell types. Beyond advancing our understanding of glaucoma’s genetic mechanisms, this study demonstrates the effectiveness of enhancing disease phenotyping by transfer learning from a clinical dataset to a population cohort, a methodology applicable to the investigation of other complex diseases.

MATERIALS AND METHODS

MEE cohort definition

OCT scans from glaucoma patients seen in MEE between October 1, 2020 to June 1, 2023 were used for model training. Glaucoma Individuals were defined based on the International Classification of Diseases (ICD) code in the electronic health record (i.e ICD9: 365.x, ICD10: H40.x). Specifically, 99% (8207) of these patients contain at least one of the OAG ICD codes. The other 1% are patients with other types of glaucoma, including angle-closure glaucoma and secondary glaucoma.

Quality control of UKBB genotype data

Genotype array quality control steps were applied with PLINK v1.9 and PLINK v2.0 (https://www.cog-genomics.org/plink/) on directly genotyped and imputed variants of 488,395 UKBB samples. The hard call sample quality control (QC) was conducted to remove withdrawn samples (N=107), to exclude samples that are not in impute data (N=969), to exclude samples with no sample QC data (N=112), to remove ambiguous sex (N=1930), to remove heterozygosity outliers by using 5 standard deviations from the mean heterozygosity rate within each inferred ancestry subpopulation (N=978) and to remove related individuals with high cryptic relatedness (> 0.1875 Pi-hat) (N=35,113). To infer ancestry, Principal Component Analysis (PCA) was first applied to linkage disequilibrium (LD)-pruned (r2<0.2 in 200kb windows) genetic markers with MAF > 5%. The SHARDS algorithm selected the first 5 important PCs and then implemented k-nearest neighbors classification to predict the ancestral background of participants using labels from the 1000 Genomes Project Phase 3 reference panel. Participants were then categorized based on primary genetic ancestry. In terms of the relative removal algorithm, the graph theory was also applied to keep the maximal independent set. The application of maximal independent set contributed to keep the as many samples as possible. Concurrently, another relative removal algorithm was also conducted to keep highest ratio of the number of cases versus the number of controls to improve the power.

Regarding the imputed variant QC, the pipeline iteratively examines variants, INFO score, allele frequencies and tests of Hardy-Weinberg equilibrium. More specifically, the missingness rate per sample using posterior probability>0.9 for genotype calling was inspected. A small missingness rate of autosomes was observed and no significant difference was observed between male and female. Monomorphic sites (M=552,024) were removed if only one allele occurs at a site or locus in the population. We further filtered out the variants with indels length>50 (M=2,312) and subsequently, the variants with INFO score<0.6 (M=45,927,754) were excluded. The variants with allele frequency<0.1% for each ancestry subpopulation (EUR: M=35,850,305; AMR: M=28,792,905; AFR: M=24,954,341; SAS: M=35,437,737; EAS: M=37,236,001) were further excluded. Then the Hardy-Weinberg equilibrium was calculated and inspected for each ancestry subpopulation. The mid p-value was utilized in exact tests for Hardy-Weinberg equilibrium analyses. Therefore, a total of 227,173 variants in EUR, 98 variants in AMR, 32,317 variants in AFR, 38,173 variants in SAS, and 3,711 variants in EAS of autosomes failed Hardy-Weinberg and were removed. To be conservative, a variant was kept if it passed the Hardy-Weinberg for all ancestry subpopulations. Another MAF filter was applied to keep the common variants with MAF>1% for further analyses. After QC, the number of variants in EUR, AFR, and ASN are 9,329,555, 8,477,280, and 7,963,945, respectively.

GWAS and meta-analysis

Macular GCC and RNFL endophenotypes were derived from 18,985 OCT scans of 8,323 glaucoma patients treated at MEE. These endophenotypes were mapped to OCT scans from the UK Biobank, including 35,739 participants of European ancestry, 1,730 of African ancestry, and 1,677 of Asian ancestry. For patients with scans from both eyes and multiple visits, we prioritized the right eye. If no scans were available for the right eye, the left eye was used instead. When multiple scans existed for the same eye, the most recent scan was used as the endo phenotype.

Genome-wide association studies (GWAS) were conducted on a total of 21 endo phenotypes, including 7 AUTO GCC phenotypes, 2 MOCO GCC phenotypes, 7 AUTO RNFL phenotypes, and 5 MOCO RNFL phenotypes. Each endophenotype was inverse rank normalized prior to performing GWAS testing. GWAS was performed separately for each ancestry group (AFR, Asian, and EUR). Within each ancestry, samples were randomly divided into discovery (75%) and replication (25%) sets, with matched age distributions and female/male proportions across the two subsets. Analyses were performed using PLINK v2.0, adjusting for age, sex, the first 10 principal components (PCs), and refractive error. PCs were calculated separately within each ancestry group. For meta-analysis, we used METAL (version 2011–03-25), applying a fixed-effects model based on inverse-variance weighting, under the assumption that all samples came from the same biobank. Heterogeneity statistics were computed, and beta coefficients and standard errors were used as inputs. Meta-analyses were firstly conducted within each ancestry, followed by a trans-ancestry meta-analysis.

LD clumping was performed with PLINK v2.0 to identify independent loci in each meta-analysis. LD calculations were based on population-specific samples in the UKBB. For genome-wide significant signals, the LD threshold r2 was set to be 0.2 and the LD radius was set to be 500kb for LD clumping.

None of the GWAS or meta-analyses showed any evidence of population structure inflation, with the genomic lambda values around 1. Effect-size heterogeneity (I2) for the genome-wide significant loci was minimal in both the EUR and cross-ancestry meta-analyses, with most loci showing I2 values near zero. Additionally, heterogeneity p-values for all tested loci were greater than 0.05, indicating consistent effects across populations (SI File 2).

Comparison with previously reported associations

We compared our identified independent genome-wide significant loci to genome-wide significant variants previously reported to be associated with glaucoma and glaucoma-related traits, which are retinal layer thickness, IOP, CDR. The set of variants used for comparison were downloaded from GWAS Catalog (MONDO_0005041, EFO_0004190, OBA_2050110, EFO_0004695) and from three POAG GWAS studies (35, 43, 44). To ensure that our classification of novel vs previously reported loci is LD-aware, we included LD proxies (r2 > 0.1, UKBB as reference panel) of the identified loci in this comparison.

Colocalization analysis with expression and splicing QTL datasets

We conducted a Bayesian colocalization analysis using eCAVIAR (https://github.com/fhormoz/caviar) to pinpoint candidate genes and regulatory elements (e/sQTLs) potentially involved in how glaucoma-linked genetic variations exert their effects. This method estimates the probability that overlapping GWAS and e/sQTL signals share the same causal variant or haplotype. This method considers local LD and the presence of multiple causal variants (allelic heterogeneity).

We used cis-expression quantitative trait loci (cis-eQTLs) and cis-splicing quantitative trait loci (cis-sQTLs) data from multiple tissues in the GTEx project (v8 release)(45) and peripheral retina eQTLs from the EyeGEx dataset(46). The GTEx data was based on GENCODE v26 annotations, while EyeGEx data used GENCODE v25. Significant e/sQTLs (FDR < 0.05), gene expression levels, and intron excision ratios (LeafCutter values) were downloaded from the GTEx portal (https://gtexportal.org/home/datasets) for GTEx and EyeGEx. These summary statistics for all tested variant-gene pairs were then used for colocalization analysis.

Colocalization was performed on the genome-wide significant LD-independent loci for each GWAS meta-analysis. Every GWAS locus was assessed for colocalization with all eQTLs and sQTLs containing at least five e/sVariants (FDR < 0.05) located within the corresponding GWAS LD interval. For each locus, LD windows were defined by selecting variants in LD (r2 > 0.1) with the lead GWAS variant, extending 50 kb upstream and downstream. For loci derived from European populations, LD calculations were based on European samples from the 1000 Genomes Project (47), while for the cross-ancestry loci, LD was computed using the European, African, and Asian samples in 1000 Genomes, using PLINK v1.9. When the lead variant was missing from the 1000 Genomes dataset, a high LD proxy (r2 > 0.8) or the nearest variant was used. Z-scores for GWAS and e/sQTL summary statistics were computed as the effect size divided by its standard error for each variant. For retina-specific eQTLs, Z-scores were derived from association p-values, assuming a chi-square distribution with 1 degree of freedom. We tested all variants with MAF > 1% that were common to the GWAS and e/sQTL datasets. Variants in GWAS datasets were aligned to the alternative allele (ALT) reported in GTEx and EyeGEx datasets. Only eQTLs were analyzed for retina-specific data, while both eQTLs and sQTLs were analyzed for GTEx tissues. A locus was considered to exhibit significant colocalization when the colocalization posterior probability (CLPP) exceeded 0.01, following eCAVIAR’s recommendations. To reduce false-positive findings, we excluded any variant–gene–tissue–trait combinations in which the colocalized e/sVariant had a GWAS p-value > 0.05, an e/sQTL p-value > 1 × 10−4, or failed to meet the FDR < 0.05 criterion (i.e., labeled “FALSE” in the column ‘QC_name_QTL_FDR05_P1E04_GWAS_P5E02’ of SI Sheet 6).

Gene-based tests and gene set enrichment analyses

We conducted gene-based association tests and gene set enrichment analyses with MAGMA (v1.10, https://ctg.cncr.nl/software/magma). In this gene-based analysis, genetic variants were mapped to 18,685 genes based on distance (+/−100kb) to compute a gene-level z-score, which was then tested for association with the ML-derived endophenotypes adjusting for cofounding factors. The derived gene-based P values were adjusted using the Bonferroni method to account for multiple testing (P < 0.05/18,685 = 2.68 × 10−6). In the pathway analysis, a list of 17,023 curated gene sets from MSigDB (https://www.gsea-msigdb.org/gsea/msigdb/) human database were used, and the P values of pathway analysis were adjusted by applying a Bonferroni correction.

Mendelian randomization

For each gene identified by our colocalization or our gene-based analysis, we performed summary-data Mendelian randomization using Generalized Summary-data-based Mendelian Randomization (GSMR, https://yanglab.westlake.edu.cn/software/gsmr/) to test for a putative causal effect of cis-regulated expression on the endophenotypes. For each target gene, we first extracted independent cis-eQTL instruments (defined as SNPs within ±1 Mb of the transcription start site with P < 5 × 10−8) from the GTEx and retina eQTL summary statistics. Instruments were LD-pruned at r2 < 0.05 in 1 Mb windows using a European-ancestry reference panel (n = 5,000 unrelated UK Biobank participants). To guard against horizontal pleiotropy, we applied the HEIDI-outlier test implemented in GSMR with a PHEIDI threshold of 0.01, removing SNPs inconsistent with a single causal pathway. GSMR then estimated the causal effect of gene expression on the endophenotype with a generalized least-squares framework that accounts for sampling variance and residual LD among instruments. Effect estimates and their standard errors were computed under the default GSMR settings.

Ocular single-cell expression

Publicly available single-cell RNA-seq datasets from the posterior segment, anterior segment, and retina were used to plot the single-cell expression patterns of our prioritized genes across a variety of ocular cell types (4850).

Cell type-specific enrichment

To identify ocular cell types that are enriched for cell type-specific expression of genes that mapped to GWAS loci based on colocalizing e/sQTLs, we used a previously described pipeline, ECLIPSER (34). Briefly, ECLIPSER operates by determining if genes mapped to GWAS loci for a specific complex disease or trait based on e/sQTLs or other functional genomic data show greater cell type-specific expression compared to genes linked to a background set of GWAS loci from thousands of unrelated traits. The fundamental principle of ECLIPSER is that in a relevant tissue, multiple disease-associated genes will exhibit elevated expression in pathogenic cell types relative to non-pathogenic cells, to a greater extent than genes associated with unrelated traits.

The cell type-specific expression data we used was previously described in Hamel et. al, 2024 (35). It is single-nucleus RNA-seq of 13 tissues dissected from non-diseased human eyes, including central cornea, corneoscleral wedge (CSW), trabecular meshwork (TM) including Schlemm’s canal, iris, ciliary body (CB), lens (all from anterior segment(51)), peripheral and macular retina(52, 53), the optic nerve head (ONH), optic nerve (ON), peripapillary sclera (PPS), peripheral sclera, and choroid (all from posterior segment(54)). GWAS-specific genome-wide significant loci from the EUR and cross-ancestry meta-analysis were used as input for enrichment.

Supplementary Material

This is a list of supplementary files associated with this preprint. Click to download.

SupplementalInformation.zip

Acknowledgments:

We are grateful to the patients of the MEE glaucoma clinic and the participants of the UK Biobank whose contributions were essential for this research. We would also like to acknowledge Yuyang Luo, Puja Mehta, and Susan Xu who provided technical assistance in some of the computational tools.

Funding:

National Institutes of Health grants R01 EY036518 (NZ), R01 EY036222 (MW), R01 EY030575 (TE), NIH/NEI R01EY031424 (AVS) P30EY003790 (TE), P30EY014104 (AVS)

Research to Prevent Blindness (NZ, TE)

Footnotes

Declarations

Competing interests: N. Zebardast serves as a consultant for Sanofi/Genzyme.

Ethics statement

This study was conducted in accordance with the tenets of the Declaration of Helsinki. The use of OCT scans acquired at MEE was approved by the Harvard Longwood Campus (HLC) IRB (protocol 2024P003507), with a waiver of informed consent granted owing to the retrospective use of de-identified data. Access to UKBB OCT and genetic data was obtained with approval from the UKBB Committee. All participants in UKBB provided informed consent at recruitment. All data were handled in accordance with applicable data protection and privacy regulations.

Contributor Information

Liyin Chen, Massachusetts Eye and Ear Infirmary.

Yan Zhao, Massachusetts Eye and Ear Infirmary.

Saber Kazeminasab Hashemabad, Massachusetts Eye and Ear Infirmary.

Tobias Elze, Massachusetts Eye and Ear Infirmary.

Mohammad Eslami, Massachusetts Eye and Ear Infirmary.

Mengyu WANG, Massachusetts Eye and Ear Infirmary.

Janey Wiggs, Massachusetts Eye and Ear Infirmary.

Ayellet Segre, Massachusetts Eye and Ear Infirmary.

Nazlee Zebardast, Massachusetts Eye and Ear Infirmary.

Data and materials availability:

The GWAS summary statistics generated in this study are available in Figshare (10.6084/m9.figshare.30227764). UK Biobank data are available by request through the UK Biobank Access Management System https://www.ukbiobank.ac.uk/. The 1000 Genomes phase 3 data is available at https://www.internationalgenome.org/. The GTEx protected data are available through the database of Genotypes and Phenotypes (dbGaP) (accession no. phs000424.v8). The processed GTEx eQTL and sQTL summary statistics are available on the GTEx portal (https://gtexportal.org/home/datasets). The EyeGEx retina eQTL summary statistics is available at https://visiongenomics.org. The snRNA-seq data for the anterior segment and macula are available in Gene Expression Omnibus (GEO) accession number GSE199013, for the optic nerve head and posterior tissues in GSE236566, and for the retina in GSE226108. All the results from our analyses (e.g., GWAS, MAGMA, colocalization, Mendelian randomization, cell-type enrichment) can be found in the Supplementary Data. This study was conducted in accordance with the tenets of the Declaration of Helsinki. The use of OCT scans acquired at MEE was approved by the Harvard Longwood Campus (HLC) IRB (protocol 2024P003507), with a waiver of informed consent granted owing to the retrospective use of de-identified data. Access to UKBB OCT and genetic data was obtained with approval from the UKBB Committee. All participants in UKBB provided informed consent at recruitment. All data were handled in accordance with applicable data protection and privacy regulations.

References

  • 1.Davuluru S. S., Jess A. T., Kim J. S. B., Yoo K., Nguyen V., Xu B. Y., Identifying, Understanding, and Addressing Disparities in Glaucoma Care in the United States. Transl Vis Sci Technol 12, 18 (2023). [Google Scholar]
  • 2.Tham Y.-C., Li X., Wong T. Y., Quigley H. A., Aung T., Cheng C.-Y., Global Prevalence of Glaucoma and Projections of Glaucoma Burden through 2040. Ophthalmology 121, 2081–2090 (2014). [DOI] [PubMed] [Google Scholar]
  • 3.Bailey J. N. C., Loomis S. J., Kang J. H., Allingham R. R., Gharahkhani P., Khor C. C., Burdon K. P., Aschard H., Chasman D. I., Igo R. P., Hysi P. G., Glastonbury C. A., Ashley-Koch A., Brilliant M., Brown A. A., Budenz D. L., Buil A., Cheng C.-Y., Choi H., Christen W. G., Curhan G., De Vivo I., Fingert J. H., Foster P. J., Fuchs C., Gaasterland D., Gaasterland T., Hewitt A. W., Hu F., Hunter D. J., Khawaja A. P., Lee R. K., Li Z., Lichter P. R., Mackey D. A., McGuffin P., Mitchell P., Moroi S. E., Perera S. A., Pepper K. W., Qi Q., Realini T., Richards J. E., Ridker P. M., Rimm E., Ritch R., Ritchie M., Schuman J. S., Scott W. K., Singh K., Sit A. J., Song Y. E., Tamimi R. M., Topouzis F., Viswanathan A. C., Verma S. S., Vollrath D., Wang J. J., Weisschuh N., Wissinger B., Wollstein G., Wong T. Y., Yaspan B. L., Zack D. J., Zhang K., Study E.-N. E., ANZRAG Consortium, Weinreb R. N., Pericak-Vance M. A., Small K., Hammond C. J., Aung T., Liu Y., Vithana E. N., MacGregor S., Craig J. E., Kraft P., Howell G., Hauser M. A., Pasquale L. R., Haines J. L., Wiggs J. L., Genome-wide association analysis identifies TXNRD2, ATXN2 and FOXC1 as susceptibility loci for primary open-angle glaucoma. Nat Genet 48, 189–194 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Gharahkhani P., Burdon K. P., Fogarty R., Sharma S., Hewitt A. W., Martin S., Law M. H., Cremin K., Bailey J. N. C., Loomis S. J., Pasquale L. R., Haines J. L., Hauser M. A., Viswanathan A. C., McGuffin P., Topouzis F., Foster P. J., Graham S. L., Casson R. J., Chehade M., White A. J., Zhou T., Souzeau E., Landers J., Fitzgerald J. T., Klebe S., Ruddle J. B., Goldberg I., Healey P. R., Wellcome Trust Case Control Consortium 2, NEIGHBORHOOD consortium, Mills R. A., Wang J. J., Montgomery G. W., Martin N. G., RadfordSmith G., Whiteman D. C., Brown M. A., Wiggs J. L., Mackey D. A., Mitchell P., MacGregor S., Craig J. E., Common variants near ABCA1, AFAP1 and GMDS confer risk of primary open-angle glaucoma. Nat Genet 46, 1120–1125 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Gharahkhani P., Jorgenson E., Hysi P., Khawaja A. P., Pendergrass S., Han X., Ong J. S., Hewitt A. W., Segrè A. V., Rouhana J. M., Hamel A. R., Igo R. P., Choquet H., Qassim A., Josyula N. S., Cooke Bailey J. N., Bonnemaijer P. W. M., Iglesias A., Siggs O. M., Young T. L., Vitart V., Thiadens A. A. H. J., Karjalainen J., Uebe S., Melles R. B., Nair K. S., Luben R., Simcoe M., Amersinghe N., Cree A. J., Hohn R., Poplawski A., Chen L. J., Rong S.-S., Aung T., Vithana E. N., NEIGHBORHOOD consortium, Allingham R. R., Brilliant M., Budenz D. L., Bailey J. N. C., Fingert J. H., Gaasterland D., Gaasterland T., Haines J. L., Hauser M., Lee R. K., Lichter P. R., Liu Y., Moroi S., Myers J., Pericak-Vance M., Realini A., Rhee D., Richards J. E., Ritch R., Schuman J. S., Scott W. K., Singh K., Sit A. J., Vollrath D., Weinreb R. N., Wollstein G., Zack D. J., ANZRAG consortium, Sharma S., Martin S., Zhou T., Souzeau E., Landers J., Fitzgerald J. T., Mills R. A., Craig J., Burdon K., Graham S. L., Casson R. J., Goldberg I., White A. J., Healey P. R., Mackey D. A., Hewitt A. W., Biobank Japan project, Shiono M., Misumi K., Kaieda R., Harada H., Minami S., Emi M., Emoto N., Daida H., Miyauchi K., Murakami A., Asai S., Moriyama M., Takahashi Y., Fujioka T., Obara W., Mori S., Ito H., Nagayama S., Miki Y., Masumoto A., Yamada A., Nishizawa Y., Kodama K., Kutsumi H., Sugimoto Y., Koretsune Y., Kusuoka H., Yanaiag H., Nagai A., Hirata M., Kamatani Y., Muto K., Matsuda K., Kiyohara Y., Ninomiya T., Tamakoshi A., Yamagata Z., Mushiroda T., Murakami Y., Yuji K., Furukawa Y., Zembutsu H., Tanaka T., Ohnishi Y., Nakamura Y., FinnGen study, Jalanko A., Kaprio J., Donner K., Kaunisto M., Mars N., Dada A., Shcherban A., Ganna A., Lehisto A., Kilpeläinen E., Brein G., Awaisa G., Harju J., Pärn K., Della Briotta Parolo P., Kajanne R., Lemmelä S., Sipilä T. P., Sipilä T., Lyhs U., Llorens V., Niiranen T., Kristiansson K., Männikkö L., Jiménez M. G., Perola M., Wong R., Kilpi T., Hiekkalinna T., Järvensivu E., Kaiharju E., Mattsson H., Laukkanen M., Laiho P., Lähteenmäki S., Sistonen T., Soini S., Ziemann A., Lehtonen A., Lertratanakul A., Georgantas B., Riley-Gillis B., Quarless D., Rahimov F., Heap G., Jacob H., Waring J., Davis J. W., Smaoui N., Popovic R., Esmaeeli S., Waring J., Matakidou A., Challis B., Close D., Petrovski S., Karlsson A., Schleutker J., Pulkki K., Virolainen P., Kallio L., Mannermaa A., Heikkinen S., Kosma V.-M., Chen C.-Y., Runz H., Liu J., Bronson P., John S., Lahdenperä S., Eaton S., Zhou W., Hendolin M., Tuovila O., Pakkanen R., Maranville J., Usiskin K., Hochfeld M., Plenge R., Yang R., Biswas S., Greenberg S., Laakkonen E., Kononen J., Paloneva J., Kujala U., Kuopio T., Laukkanen J., Kangasniemi E., Savinainen K., Laaksonen R., Arvas M., Ritari J., Partanen J., Hyvärinen K., Wahlfors T., Peterson A., Oh D., Chang D., Teng E., Strauss E., Kerchner G., Chen H., Chen H., Schutzman J., Michon J., Hunkapiller J., McCarthy M., Bowers N., Lu T., Bhangale T., Pulford D., Waterworth D., Kulkarni D., Xu F., Betts J., Gordillo J. E., Hoffman J., Auro K., McCarthy L., Ghosh S., Ehm M., Pitkänen K., Mäkelä T., Loukola A., Joensuu H., Sinisalo J., Eklund K., Aaltonen L., Färkkilä M., Carpen O., Kauppi P., Tienari P., Ollila T., Tuomi T., Meretoja T., Pitkäranta A., Turunen J., Hannula-Jouppi K., Pikkarainen S., Seitsonen S., Koskinen M., Palomäki A., Rinne J., Metsärinne K., Elenius K., Pirilä L., Koulu L., Voutilainen M., Juonala M., Peltonen S., Aaltonen V., Loboda A., Podgornaia A., Chhibber A., Chu A., Fox C., Diogo D., Holzinger E., Eicher J., Gormley P., Mehta V., Wang X., Kettunen J., Pylkäs K., Kalaoja M., Karjalainen M., Hinttala R., Kaarteenaho R., Vainio S., Mantere T., Remes A., Huhtakangas J., Junttila J., Tasanen K., Huilaja L., Luodonpää M., Hautala N., Karihtala P., Kauppila S., Harju T., Blomster T., Soininen H., Harvima I., Pihlajamäki J., Kaarniranta K., Pelkonen M., Laakso M., Hiltunen M., Kiviniemi M., Kaipiainen-Seppänen O., Auvinen P., Kälviäinen R., Julkunen V., Malarstig A., Hedman Å., Marshall C., Whelan C., Lehtonen H., Parkkinen J., Linden K., Kalpala K., Miller M., Bing N., McDonough S., Chen X., Hu X., Wu Y., Auranen A., Jussila A., Uusitalo-Järvinen H., Kankaanranta H., Uusitalo H., Peltola J., Kähönen M., Isomäki P., Laitinen T., Salmi T., Muslin A., Wang C., Chatelain C., Xu E., Auge F., Call K., Klinger K., Crohns M., Gossel M., Palin K., Rivas M., Siirtola H., Tabuenca J. G., UK Biobank Eye and Vision Consortium, Aslam T., Barman S., Barrett J., Bishop P., Bunce C., Carare R., Chakravarthy U., Chan M., Cipriani V., Day A., Desai P., Dhillon B., Dick A., Egan C., Ennis S., Foster P., Fruttiger M., Gallacher J., Garway-Heath D., Gibson J., Gore D., Guggenheim J., Hardcastle A., Harding S., Hogg R., Keane P. A., Khaw P. T., Khawaja A., Lascaratos G., Lotery A. J., Luthert P., Macgillivray T., Mackie S., Mcguinness B., Mckay G., Mckibbin M., Mitry D., Moore T., Morgan J., Muthy Z., O’Sullivan E., Owen C., Patel P., Paterson E., Peto T., Petzold A., Rahi J., Rudnicka A., Self J., Sivaprasad S., Steel D., Stratton I., Strouthidis N., Sudlow C., Thaung C., Thomas D., Trucco E., Tufail A., Vernon S., Viswanathan A., Williams C., Williams K., Woodside J., Yates M., Yip J., Zheng Y., Tapp R., Atan D., Doney A., Allen N., Littlejohns T., Sergouniotis P., Black G., GIGA study group, Kanyaro N., Ntomoka C., Massaga J. J., Ikungura J. K., 23 and Me Research Team, Agee M., Aslibekyan S., Bell R. K., Bryc K., Clark S. K., Elson S. L., Fletez-Brant K., Fontanillas P., Furlotte N. A., Gandhi P. M., Heilbron K., Hicks B., Hinds D. A., Huber K. E., Jewett E. M., Jiang Y., Kleinman A., Lin K.-H., Litterman N. K., McCreight J. C., McIntyre M. H., McManus K. F., Mountain J. L., Mozaffari S. V., Nandakumar P., Noblin E. S., Northover C. A. M., O’Connell J., Pitts S. J., Poznik G. D., Sathirapongsasuti J. F., Shastri A. J., Shelton J. F., Shringarpure S., Tian C., Tung J. Y., Tunney R. J., Vacic V., Zare A. S., Tamiya G., Shiga Y., Yamamoto M., Nakazawa T., Currant H., Birney E., Wang X., Auton A., Lupton M. K., Martin N. G., Ashaye A., Olawoye O., Williams S. E., Akafo S., Ramsay M., Hashimoto K., Kamatani Y., Akiyama M., Momozawa Y., Foster P. J., Khaw P. T., Morgan J. E., Strouthidis N. G., Kraft P., Kang J. H., Pang C. P., Pasutto F., Mitchell P., Lotery A. J., Palotie A., Van Duijn C., Haines J. L., Hammond C., Pasquale L. R., Klaver C. C. W., Hauser M., Khor C. C., Mackey D. A., Kubo M., Cheng C.-Y., Craig J. E., MacGregor S., Wiggs J. L., Genome-wide meta-analysis identifies 127 open-angle glaucoma loci with consistent effect across ancestries. Nat Commun 12, 1258 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Han X., Steven K., Qassim A., Marshall H. N., Bean C., Tremeer M., An J., Siggs O. M., Gharahkhani P., Craig J. E., Hewitt A. W., Trzaskowski M., MacGregor S., Automated AI labeling of optic nerve head enables insights into cross-ancestry glaucoma risk and genetic discovery in >280,000 images from UKB and CLSA. The American Journal of Human Genetics 108, 1204–1216 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hysi P. G., Cheng C.-Y., Springelkamp H., Macgregor S., Bailey J. N. C., Wojciechowski R., Vitart V., Nag A., Hewitt A. W., Höhn R., Venturini C., Mirshahi A., Ramdas W. D., Thorleifsson G., Vithana E., Khor C.-C., Stefansson A. B., Liao J., Haines J. L., Amin N., Wang Y. X., Wild P. S., Ozel A. B., Li J. Z., Fleck B. W., Zeller T., Staffieri S. E., Teo Y.-Y., Cuellar-Partida G., Luo X., Allingham R. R., Richards J. E., Senft A., Karssen L. C., Zheng Y., Bellenguez C., Xu L., Iglesias A. I., Wilson J. F., Kang J. H., van Leeuwen E. M., Jonsson V., Thorsteinsdottir U., Despriet D. D. G., Ennis S., Moroi S. E., Martin N. G., Jansonius N. M., Yazar S., Tai E.-S., Amouyel P., Kirwan J., van Koolwijk L. M. E., Hauser M. A., Jonasson F., Leo P., Loomis S. J., Fogarty R., Rivadeneira F., Kearns L., Lackner K. J., de Jong P. T. V. M., Simpson C. L., Pennell C. E., Oostra B. A., Uitterlinden A. G., Saw S.-M., Lotery A. J., Bailey-Wilson J. E., Hofman A., Vingerling J. R., Maubaret C., Pfeiffer N., Wolfs R. C. W., Lemij H. G., Young T. L., Pasquale L. R., Delcourt C., Spector T. D., Klaver C. C. W., Small K. S., Burdon K. P., Stefansson K., Wong T.-Y., BMES GWAS Group, NEIGHBORHOOD Consortium, Wellcome Trust Case Control Consortium 2, Viswanathan A., Mackey D. A., Craig J. E., Wiggs J. L., van Duijn C. M., Hammond C. J., Aung T., Genome-wide analysis of multi-ancestry cohorts identifies new loci influencing intraocular pressure and susceptibility to glaucoma. Nat Genet 46, 1126–1130 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Pasquale L. R., Loomis S. J., Kang J. H., Yaspan B. L., Abdrabou W., Budenz D. L., Chen T. C., Delbono E., Friedman D. S., Gaasterland D., Gaasterland T., Grosskreutz C. L., Lee R. K., Lichter P. R., Liu Y., McCarty C. A., Moroi S. E., Olson L. M., Realini T., Rhee D. J., Schuman J. S., Singh K., Vollrath D., Wollstein G., Zack D. J., Allingham R. R., Pericak-Vance M. A., Weinreb R. N., Zhang K., Hauser M. A., Richards J. E., Haines J. L., Wiggs J. L., CDKN2B-AS1 genotype-glaucoma feature correlations in primary open-angle glaucoma patients from the United States. Am J Ophthalmol 155, 342–353.e5 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Springelkamp H., Iglesias A. I., Mishra A., Höhn R., Wojciechowski R., Khawaja A. P., Nag A., Wang Y. X., Wang J. J., Cuellar-Partida G., Gibson J., Bailey J. N. C., Vithana E. N., Gharahkhani P., Boutin T., Ramdas W. D., Zeller T., Luben R. N., Yonova-Doing E., Viswanathan A. C., Yazar S., Cree A. J., Haines J. L., Koh J. Y., Souzeau E., Wilson J. F., Amin N., Müller C., Venturini C., Kearns L. S., Kang J. H., NEIGHBORHOOD Consortium, Tham Y. C., Zhou T., van Leeuwen E. M., Nickels S., Sanfilippo P., Liao J., van der Linde H., Zhao W., van Koolwijk L. M. E., Zheng L., Rivadeneira F., Baskaran M., van der Lee S. J., Perera S., de Jong P. T. V. M., Oostra B. A., Uitterlinden A. G., Fan Q., Hofman A., Tai E.-S., Vingerling J. R., Sim X., Wolfs R. C. W., Teo Y. Y., Lemij H. G., Khor C. C., Willemsen R., Lackner K. J., Aung T., Jansonius N. M., Montgomery G., Wild P. S., Young T. L., Burdon K. P., Hysi P. G., Pasquale L. R., Wong T. Y., Klaver C. C. W., Hewitt A. W., Jonas J. B., Mitchell P., Lotery A. J., Foster P. J., Vitart V., Pfeiffer N., Craig J. E., Mackey D. A., Hammond C. J., Wiggs J. L., Cheng C.-Y., van Duijn C. M., MacGregor S., New insights into the genetics of primary open-angle glaucoma based on meta-analyses of intraocular pressure and optic disc characteristics. Hum Mol Genet 26, 438–453 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wiggs J. L., Kang J. H., Yaspan B. L., Mirel D. B., Laurie C., Crenshaw A., Brodeur W., Gogarten S., Olson L. M., Abdrabou W., DelBono E., Loomis S., Haines J. L., Pasquale L. R., GENEVA Consortium, Common variants near CAV1 and CAV2 are associated with primary open-angle glaucoma in Caucasians from the USA. Hum Mol Genet 20, 4707–4713 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wiggs J. L., Yaspan B. L., Hauser M. A., Kang J. H., Allingham R. R., Olson L. M., Abdrabou W., Fan B. J., Wang D. Y., Brodeur W., Budenz D. L., Caprioli J., Crenshaw A., Crooks K., Delbono E., Doheny K. F., Friedman D. S., Gaasterland D., Gaasterland T., Laurie C., Lee R. K., Lichter P. R., Loomis S., Liu Y., Medeiros F. A., McCarty C., Mirel D., Moroi S. E., Musch D. C., Realini A., Rozsa F. W., Schuman J. S., Scott K., Singh K., Stein J. D., Trager E. H., Vanveldhuisen P., Vollrath D., Wollstein G., Yoneyama S., Zhang K., Weinreb R. N., Ernst J., Kellis M., Masuda T., Zack D., Richards J. E., Pericak-Vance M., Pasquale L. R., Haines J. L., Common variants at 9p21 and 8q22 are associated with increased susceptibility to optic nerve degeneration in glaucoma. PLoS Genet 8, e1002654 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Han X., Gharahkhani P., Hamel A. R., Ong J. S., Rentería M. E., Mehta P., Dong X., Pasutto F., Hammond C., Young T. L., Hysi P., Lotery A. J., Jorgenson E., Choquet H., Hauser M., Cooke Bailey J. N., Nakazawa T., Akiyama M., Shiga Y., Fuller Z. L., Wang X., Hewitt A. W., Craig J. E., Pasquale L. R., Mackey D. A., Wiggs J. L., Khawaja A. P., Segrè A. V., 23andMe Research Team, International Glaucoma Genetics Consortium, MacGregor S., Large-scale multitrait genome-wide association analyses identify hundreds of glaucoma risk loci. Nat Genet 55, 1116–1125 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Johnson K. B., Wei W.-Q., Weeraratne D., Frisse M. E., Misulis K., Rhee K., Zhao J., Snowdon J. L., Precision Medicine, AI, and the Future of Personalized Health Care. Clin Transl Sci 14, 86–93 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Shickel B., Tighe P. J., Bihorac A., Rashidi P., Deep EHR: A Survey of Recent Advances in Deep Learning Techniques for Electronic Health Record (EHR) Analysis. IEEE J Biomed Health Inform 22, 1589–1604 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Khawaja A. P., Cooke Bailey J. N., Wareham N. J., Scott R. A., Simcoe M., Igo R. P., Song Y. E., Wojciechowski R., Cheng C.-Y., Khaw P. T., Pasquale L. R., Haines J. L., Foster P. J., Wiggs J. L., Hammond C. J., Hysi P. G., UK Biobank Eye and Vision Consortium, NEIGHBORHOOD Consortium, Genome-wide analyses identify 68 new loci associated with intraocular pressure and improve risk prediction for primary open-angle glaucoma. Nat Genet 50, 778–782 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Han X., Steven K., Qassim A., Marshall H. N., Bean C., Tremeer M., An J., Siggs O. M., Gharahkhani P., Craig J. E., Hewitt A. W., Trzaskowski M., MacGregor S., Automated AI labeling of optic nerve head enables insights into cross-ancestry glaucoma risk and genetic discovery in >280,000 images from UKB and CLSA. Am J Hum Genet 108, 1204–1216 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hoffmann E. M., Zangwill L. M., Crowston J. G., Weinreb R. N., Optic disk size and glaucoma. Surv Ophthalmol 52, 32–49 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Weinreb R. N., Aung T., Medeiros F. A., The pathophysiology and treatment of glaucoma: a review. JAMA 311, 1901–1911 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Scuderi G., Fragiotta S., Scuderi L., Iodice C. M., Perdicchi A., <p>Ganglion Cell Complex Analysis in Glaucoma Patients: What Can It Tell Us?</p>. EB Volume 12, 33–44 (2020). [Google Scholar]
  • 20.Currant H., Hysi P., Fitzgerald T. W., Gharahkhani P., Bonnemaijer P. W. M., Senabouth A., Hewitt A. W., UK Biobank Eye and Vision Consortium, International Glaucoma Genetics Consortium, Atan D., Aung T., Charng J., Choquet H., Craig J., Khaw P. T., Klaver C. C. W., Kubo M., Ong J.-S., Pasquale L. R., Reisman C. A., Daniszewski M., Powell J. E., Pébay A., Simcoe M. J., Thiadens A. A. H. J., van Duijn C. M., Yazar S., Jorgenson E., MacGregor S., Hammond C. J., Mackey D. A., Wiggs J. L., Foster P. J., Patel P. J., Birney E., Khawaja A. P., Genetic variation affects morphological retinal phenotypes extracted from UK Biobank optical coherence tomography images. PLoS Genet 17, e1009497 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Gao X. R., Huang H., Kim H., Genome-wide association analyses identify 139 loci associated with macular thickness in the UK Biobank cohort. Hum Mol Genet 28, 1162–1172 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Zekavat S. M., Jorshery S. D., Rauscher F. G., Horn K., Sekimitsu S., Koyama S., Nguyen T. T., Costanzo M. C., Jang D., Burtt N. P., Kühnapfel A., Shweikh Y., Ye Y., Raghu V., Zhao H., Ghassemi M., Elze T., Segrè A. V., Wiggs J. L., Del Priore L., Scholz M., Wang J. C., Natarajan P., Zebardast N., Phenome- and genome-wide analyses of retinal optical coherence tomography images identify links between ocular and systemic health. Sci Transl Med 16, eadg4517 (2024). [DOI] [PubMed] [Google Scholar]
  • 23.Kirchler M., Konigorski S., Norden M., Meltendorf C., Kloft M., Schurmann C., Lippert C., transferGWAS: GWAS of images using deep transfer learning. Bioinformatics 38, 3621–3628 (2022). [DOI] [PubMed] [Google Scholar]
  • 24.Xie Z., Zhang T., Kim S., Lu J., Zhang W., Lin C.-H., Wu M.-R., Davis A., Channa R., Giancardo L., Chen H., Wang S., Chen R., Zhi D., iGWAS: Image-based genome-wide association of self-supervised deep phenotyping of retina fundus images. PLoS Genet 20, e1011273 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Poplin R., Varadarajan A. V., Blumer K., Liu Y., McConnell M. V., Corrado G. S., Peng L., Webster D. R., Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning. Nat Biomed Eng 2, 158–164 (2018). [DOI] [PubMed] [Google Scholar]
  • 26.Wu J., Chen Z., Xiao S., Liu G., Wu W., Wang S., DeepMoIC: multi-omics data integration via deep graph convolutional networks for cancer subtype classification. BMC Genomics 25, 1209 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Chen L., Can classical statistics and deep learning converge on explainable, causally driven target discovery? DNA Research , dsaf024 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Jackson V. E., Wu Y., Bonelli R., Owen J. P., Scott L. W., Farashi S., Kihara Y., Gantner M. L., Egan C., Williams K. M., Ansell B. R. E., Tufail A., Lee A. Y., Bahlo M., Multi-omic spatial effects on high-resolution AI-derived retinal thickness. Nat Commun 16, 1317 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sergouniotis P. I., Diakite A., Gaurav K., UK Biobank Eye and Vision Consortium, Birney E., Fitzgerald T., Autoencoder-based phenotyping of ophthalmic images highlights genetic loci influencing retinal morphology and provides informative biomarkers. Bioinformatics 41, btae732 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kazeminasab S., Sekimitsu S., Fazli M., Eslami M., Shi M., Tian Y., Luo Y., Wang M., Elze T., Zebardast N., An Artificial Intelligence Method for Phenotyping of OCT-Derived Thickness Maps Using Unsupervised and Self-supervised Deep Learning. J Imaging Inform Med (2025), doi: 10.1007/s10278-025-01539-x. [DOI] [Google Scholar]
  • 31.Consortium GTEx, The Genotype-Tissue Expression (GTEx) project. Nat Genet 45, 580–585 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Ratnapriya R., Sosina O. A., Starostik M. R., Kwicklis M., Kapphahn R. J., Fritsche L. G., Walton A., Arvanitis M., Gieser L., Pietraszkiewicz A., Montezuma S. R., Chew E. Y., Battle A., Abecasis G. R., Ferrington D. A., Chatterjee N., Swaroop A., Retinal transcriptome and eQTL analyses identify genes associated with age-related macular degeneration. Nat Genet 51, 606–610 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Eraslan G., Drokhlyansky E., Anand S., Fiskin E., Subramanian A., Slyper M., Wang J., Van Wittenberghe N., Rouhana J. M., Waldman J., Ashenberg O., Lek M., Dionne D., Win T. S., Cuoco M. S., Kuksenko O., Tsankov A. M., Branton P. A., Marshall J. L., Greka A., Getz G., Segrè A. V., Aguet F., Rozenblatt-Rosen O., Ardlie K. G., Regev A., Single-nucleus cross-tissue molecular reference maps toward understanding disease gene function. Science 376, eabl4290 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Rouhana J. M., Wang J., Eraslan G., Anand S., Hamel A. R., Cole B., Regev A., Aguet F., Ardlie K. G., Segrè A. V., ECLIPSER: identifying causal cell types and genes for complex traits through single cell enrichment of e/sQTL-mapped genes in GWAS loci (2021), doi: 10.1101/2021.11.24.469720. [DOI] [Google Scholar]
  • 35.Hamel A. R., Yan W., Rouhana J. M., Monovarfeshani A., Jiang X., Mehta P. A., Advani J., Luo Y., Liang Q., Rajasundaram S., Shrivastava A., Duchinski K., Mantena S., Wang J., van Zyl T., Pasquale L. R., Swaroop A., Gharahkhani P., Khawaja A. P., MacGregor S., International Glaucoma Genetics Consortium (IGGC), Chen R., Vitart V., Sanes J. R., Wiggs J. L., Segrè A. V., Integrating genetic regulation and single-cell expression with GWAS prioritizes causal genes and cell types for glaucoma. Nat Commun 15, 396 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Shah R. L., Guggenheim J. A., UK Biobank Eye and Vision Consortium, Genome-wide association studies for corneal and refractive astigmatism in UK Biobank demonstrate a shared role for myopia susceptibility loci. Hum Genet 137, 881–896 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ramdas W. D., van Koolwijk L. M. E., Ikram M. K., Jansonius N. M., de Jong P. T. V. M., Bergen A. A. B., Isaacs A., Amin N., Aulchenko Y. S., Wolfs R. C. W., Hofman A., Rivadeneira F., Oostra B. A., Uitterlinden A. G., Hysi P., Hammond C. J., Lemij H. G., Vingerling J. R., Klaver C. C. W., van Duijn C. M., A genome-wide association study of optic disc parameters. PLoS Genet 6, e1000978 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Ramdas W. D., van Koolwijk L. M. E., Cree A. J., Janssens A. C. J. W., Amin N., de Jong P. T. V. M., Wolfs R. C. W., Gibson J., Kirwan J. F., Hofman A., Rivadeneira F., Oostra B. A., Uitterlinden A. G., Ennis S., Lotery A. J., Lemij H. G., Klaver C. C. W., Vingerling J. R., Jansonius N. M., van Duijn C. M., Clinical implications of old and new genes for open-angle glaucoma. Ophthalmology 118, 2389–2397 (2011). [DOI] [PubMed] [Google Scholar]
  • 39.Srivastava A., Rikhari D., Srivastava S., RSPO2 as Wnt signaling enabler: Important roles in cancer development and therapeutic opportunities. Genes Dis 11, 788–806 (2024). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Masutani M., Sonenberg N., Yokoyama S., Imataka H., Reconstitution reveals the functional core of mammalian eIF3. EMBO J 26, 3373–3383 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Martin K. R. G., Quigley H. A., Valenta D., Kielczewski J., Pease M. E., Optic nerve dynein motor protein distribution changes with intraocular pressure elevation in a rat model of glaucoma. Experimental Eye Research 83, 255–262 (2006). [DOI] [PubMed] [Google Scholar]
  • 42.Buniello A., Suveges D., Cruz-Castillo C., Llinares M. B., Cornu H., Lopez I., Tsukanov K., Roldán-Romero J. M., Mehta C., Fumis L., McNeill G., Hayhurst J. D., Martinez Osorio R. E., Barkhordari E., Ferrer J., Carmona M., Uniyal P., Falaguera M. J., Rusina P., Smit I., Schwartzentruber J., Alegbe T., Ho V. W., Considine D., Ge X., Szyszkowski S., Tsepilov Y., Ghoussaini M., Dunham I., Hulcoop D. G., McDonagh E. M., Ochoa D., Open Targets Platform: facilitating therapeutic hypotheses building in drug discovery. Nucleic Acids Res 53, D1467–D1475 (2025). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Gharahkhani P., Jorgenson E., Hysi P., Khawaja A. P., Pendergrass S., Han X., Ong J. S., Hewitt A. W., Segrè A. V., Rouhana J. M., Hamel A. R., Igo R. P., Choquet H., Qassim A., Josyula N. S., Cooke Bailey J. N., Bonnemaijer P. W. M., Iglesias A., Siggs O. M., Young T. L., Vitart V., Thiadens A. A. H. J., Karjalainen J., Uebe S., Melles R. B., Nair K. S., Luben R., Simcoe M., Amersinghe N., Cree A. J., Hohn R., Poplawski A., Chen L. J., Rong S.-S., Aung T., Vithana E. N., Tamiya G., Shiga Y., Yamamoto M., Nakazawa T., Currant H., Birney E., Wang X., Auton A., Lupton M. K., Martin N. G., Ashaye A., Olawoye O., Williams S. E., Akafo S., Ramsay M., Hashimoto K., Kamatani Y., Akiyama M., Momozawa Y., Foster P. J., Khaw P. T., Morgan J. E., Strouthidis N. G., Kraft P., Kang J. H., Pang C. P., Pasutto F., Mitchell P., Lotery A. J., Palotie A., van Duijn C., Haines J. L., Hammond C., Pasquale L. R., Klaver C. C. W., Hauser M., Khor C. C., Mackey D. A., Kubo M., Cheng C.-Y., Craig J. E., MacGregor S., Wiggs J. L., Genome-wide meta-analysis identifies 127 open-angle glaucoma loci with consistent effect across ancestries. Nat Commun 12, 1258 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Han X., Gharahkhani P., Hamel A. R., Ong J. S., Rentería M. E., Mehta P., Dong X., Pasutto F., Hammond C., Young T. L., Hysi P., Lotery A. J., Jorgenson E., Choquet H., Hauser M., Cooke Bailey J. N., Nakazawa T., Akiyama M., Shiga Y., Fuller Z. L., Wang X., Hewitt A. W., Craig J. E., Pasquale L. R., Mackey D. A., Wiggs J. L., Khawaja A. P., Segrè A. V., 23andMe Research Team, International Glaucoma Genetics Consortium, MacGregor S., Large-scale multitrait genome-wide association analyses identify hundreds of glaucoma risk loci. Nat Genet 55, 1116–1125 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.GTEx Consortium, The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science 369, 1318–1330 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Ratnapriya R., Sosina O. A., Starostik M. R., Kwicklis M., Kapphahn R. J., Fritsche L. G., Walton A., Arvanitis M., Gieser L., Pietraszkiewicz A., Montezuma S. R., Chew E. Y., Battle A., Abecasis G. R., Ferrington D. A., Chatterjee N., Swaroop A., Retinal transcriptome and eQTL analyses identify genes associated with age-related macular degeneration. Nat Genet 51, 606–610 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Fairley S., Lowy-Gallego E., Perry E., Flicek P., The International Genome Sample Resource (IGSR) collection of open human genomic variation resources. Nucleic Acids Res 48, D941–D947 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.van Zyl T., Yan W., McAdams A. M., Monavarfeshani A., Hageman G. S., Sanes J. R., Cell atlas of the human ocular anterior segment: Tissue-specific and shared cell types. Proc Natl Acad Sci U S A 119, e2200914119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Monavarfeshani A., Yan W., Pappas C., Odenigbo K. A., He Z., Segrè A. V., van Zyl T., Hageman G. S., Sanes J. R., Transcriptomic analysis of the ocular posterior segment completes a cell atlas of the human eye. Proc Natl Acad Sci U S A 120, e2306153120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Hahn J., Monavarfeshani A., Qiao M., Kao A. H., Kölsch Y., Kumar A., Kunze V. P., Rasys A. M., Richardson R., Wekselblatt J. B., Baier H., Lucas R. J., Li W., Meister M., Trachtenberg J. T., Yan W., Peng Y.-R., Sanes J. R., Shekhar K., Evolution of neuronal cell classes and types in the vertebrate retina. Nature 624, 415–424 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.van Zyl T., Yan W., McAdams A. M., Monavarfeshani A., Hageman G. S., Sanes J. R., Cell atlas of the human ocular anterior segment: Tissue-specific and shared cell types. Proc Natl Acad Sci U S A 119, e2200914119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.van Zyl T., Yan W., McAdams A. M., Monavarfeshani A., Hageman G. S., Sanes J. R., Cell atlas of the human ocular anterior segment: Tissue-specific and shared cell types. Proc Natl Acad Sci U S A 119, e2200914119 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Liang Q., Cheng X., Wang J., Owen L., Shakoor A., Lillvis J. L., Zhang C., Farkas M., Kim I. K., Li Y., DeAngelis M., Chen R., A multi-omics atlas of the human retina at single-cell resolution. Cell Genom 3, 100298 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Monavarfeshani A., Yan W., Pappas C., Odenigbo K. A., He Z., Segrè A. V., van Zyl T., Hageman G. S., Sanes J. R., Transcriptomic analysis of the ocular posterior segment completes a cell atlas of the human eye. Proc Natl Acad Sci U S A 120, e2306153120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The GWAS summary statistics generated in this study are available in Figshare (10.6084/m9.figshare.30227764). UK Biobank data are available by request through the UK Biobank Access Management System https://www.ukbiobank.ac.uk/. The 1000 Genomes phase 3 data is available at https://www.internationalgenome.org/. The GTEx protected data are available through the database of Genotypes and Phenotypes (dbGaP) (accession no. phs000424.v8). The processed GTEx eQTL and sQTL summary statistics are available on the GTEx portal (https://gtexportal.org/home/datasets). The EyeGEx retina eQTL summary statistics is available at https://visiongenomics.org. The snRNA-seq data for the anterior segment and macula are available in Gene Expression Omnibus (GEO) accession number GSE199013, for the optic nerve head and posterior tissues in GSE236566, and for the retina in GSE226108. All the results from our analyses (e.g., GWAS, MAGMA, colocalization, Mendelian randomization, cell-type enrichment) can be found in the Supplementary Data. This study was conducted in accordance with the tenets of the Declaration of Helsinki. The use of OCT scans acquired at MEE was approved by the Harvard Longwood Campus (HLC) IRB (protocol 2024P003507), with a waiver of informed consent granted owing to the retrospective use of de-identified data. Access to UKBB OCT and genetic data was obtained with approval from the UKBB Committee. All participants in UKBB provided informed consent at recruitment. All data were handled in accordance with applicable data protection and privacy regulations.


Articles from Research Square are provided here courtesy of American Journal Experts

RESOURCES