Abstract
Age-related macular degeneration (AMD) is one of the most common causes of visual impairment in the elderly, with a complex and still poorly understood etiology. Whole-genome association studies have discovered 34 genomic regions associated with AMD. However, the genes and cognate proteins that mediate the risk, are largely unknown. In the current study, we integrate levels of 4782 human serum proteins with all genetic risk loci for AMD in a large population-based study of the elderly, revealing many proteins and pathways linked to the disease. Serum proteins are also found to reflect AMD severity independent of genetics and predict progression from early to advanced AMD after five years in this population. A two-sample Mendelian randomization study identifies several proteins that are causally related to the disease and are directionally consistent with the observational estimates. In this work, we present a robust and unique framework for elucidating the pathobiology of AMD.
Subject terms: Eye diseases, Computational biology and bioinformatics, Predictive markers
Age related macular degeneration is a common cause of visual impairment in the elderly, but the etiology is not fully understood. Here, the authors use genetic data, serum proteomics, and AMD phenotypic data from a large Icelandic cohort to discover proteins altered in, causally related to AMD or signifying progression of advanced AMD.
Introduction
AMD is a progressive late-onset disease that primarily affects the macular area of the retina and is a common cause of permanent loss of vision in the elderly population1. The clinical hallmark of early AMD is an accumulation of extracellular protein and lipid-containing deposits between the retinal pigment epithelium (RPE) and Bruch´s membrane, termed drusen. Advanced AMD can be either neovascular (nAMD) associated with blood vessel growth and leakage, or geographic atrophy (GA/dry AMD) characterized by patches of RPE cell and photoreceptor cell loss in the macula. Anti-VEGF therapy is highly effective in controlling the abnormal vessel growth and leakage in nAMD, however, the disease nevertheless progresses and by seven years, 98% of nAMD patients have atrophy2. Patients with GA have marked visual disability with relentless visual deterioration and progression to legal blindness. There are currently no approved therapies for GA or early AMD3,4. Elucidation of AMD pathobiology and identification of modifiable targets are critical for identifying clinically relevant biomarkers and designing therapeutics.
The first common genetic risk factor reported for AMD was the missense variant rs1061170 in CFH on chromosome 1q31.35–8. Many genes encoding other proteins involved in the complement cascade reside within the 1q31.3 region, including complement factors H related to 1 to 5 (CFHR1-5). Two independent variants at 1q31.3, rs1061170 and intronic variant rs1410996, account for 17% of AMD risk9. In the most recent GWAS examining 16,144 patients with advanced AMD, 52 independent variants were found at 34 different genomic loci explaining 46.7% of the variability in AMD risk10. The risk variants with the largest difference between late AMD patients and healthy controls reside within the CFH genomic region and the ARMS2/HTRA1 locus on chromosome 10q26, although for most variants, the effect size was small10. While some recognized AMD causal candidates are found at these genomic risk locations, the vast majority are yet to be identified.
Proteins are undeniably the key players in all life processes, with changes in their function and/or regulation influencing disease and well-being. As a result, changes in protein regulation and function, as well as their related networks, are most likely to mediate the genetic risk of complex diseases11–14. Serum proteins have the desired attributes required for a comprehensive and unified approach to measuring an individual’s global molecular status, as they capture information across many tissues and show a direct link to disease-related molecular pathways and activities11,12. Furthermore, serum proteins participate in cross-tissue regulatory loops, and therefore tissue-specific disease progression emerges from an integration of local and systemic signals12. Recent developments in high-throughput measurements of thousands of proteins in a single sample have aided this work11,15–17, and aptamer-based affinity methods in particular, have been a driver of recent discoveries11,18–21. In this study, the serum levels of 4782 proteins, encoded by 4137 human genes, measured in 5457 individuals from the prospective population-based AGES-RS cohort were examined for association to different stages of AMD, disease progression and prediction, and the extent to which they mediated the genetics of the disease.
Results
An examination of 4782 serum proteins for links to various stages of AMD
Descriptive statistics on the appearance of AMD in the 5457 person AGES-RS cohort ages 67 years and older (mean age 76.6 ± 5.6 years; 57.3% female) are shown in Supplementary Data 1. Using sex and age-adjusted logistic regression analysis of 4782 proteins (4137 gene symbols) and two distinct definitions of early-stage AMD (see Methods section for details), we discovered that 28 serum proteins were associated with different stages of AMD using a study-wide significance threshold (Fig. 1a–c, Table 1, Supplementary Fig. 1A-C, and Supplementary Data 2 and 3). This included 15 proteins associated with early-stage AMD identified using both Holliday et al.22 and Jonasson et al.23 classification criteria, with two proteins unique to Jonasson et al.23 and 10 proteins unique to Holliday et al.22 (Fig. 1a, Supplementary Fig. 1A, and Supplementary Data 2 and 3). The relationship between the quintiles of the 28 protein levels and AMD outcomes revealed that some were only associated with the early or late stages of AMD, such as CEBPB and CFHR5, respectively (Fig. 2a, b), whereas proteins such as FUT5 were associated with all stages of AMD (Fig. 2a, b). We further stratified late AMD into GA or nAMD and compared protein quintiles of all AMD-associated proteins with the different AMD-related outcomes. Levels of some AMD-associated proteins, such as CFHR1, BPIFB1, and CFHR5, increased almost continuously from no AMD to advanced nAMD (Supplementary Fig. 2A–C). Supplementary Figure 3 depicts the distribution and relationship between the different quintiles of all 28 AMD-associated protein levels and AMD-related outcomes, while Supplementary Fig. 4 compares the top and bottom quintiles. Again, it is evident that while some proteins were associated with all AMD stages (e.g., CFHR1, NDUFS4), some were more, or only associated with early (e.g., LINGO1, RAB17) or late-stage AMD (e.g., BIRC2) (Supplementary Figs. 3 and 4). We note that the aptamer specificity for 15 of the AMD-associated proteins with sufficient pull-down mass spectrometry sensitivity was fully validated using orthogonal analysis (Methods section and Supplementary Data 4).
Table 1.
Protein | AMD any | AMD earlya | AMD late | ||||||
---|---|---|---|---|---|---|---|---|---|
β-value | P-value | P adj.b | β-value | P-value | P adj.b | β-value | P-value | P adj.b | |
NDUFS4 | −0.298 | 1.4E-17 | 7.3E-14 | −0.260 | 3.0E-12 | 1.4E-08 | −0.562 | 2.7E-14 | 1.4E-10 |
CFHR1 | 0.194 | 4.8E-09 | 2.5E-05 | 0.157 | 3.7E-06 | 0.01617 | 0.483 | 1.0E-09 | 5.0E-06 |
FUT5 | 0.180 | 1.2E-08 | 0.00016 | 0.160 | 1.7E-06 | 0.00747 | 0.352 | 7.5E-07 | 0.00329 |
TST | −0.235 | 2.6E-11 | 1.3E-07 | −0.218 | 8.1E-09 | 3.5E-05 | −0.353 | 1.2E-06 | 0.00544 |
BAMBI | 0.177 | 5.0E-07 | 0.00255 | 0.158 | 8.2E-07 | 0.00357 | 0.138 | 0.06560 | ns |
BPIFB1 | 0.193 | 3.1E-08 | 0.00016 | 0.176 | 1.9E-06 | 0.00733 | 0.271 | 0.00022 | ns |
CEBPB | 0.173 | 9.7E-07 | 0.00489 | 0.171 | 2.6E-06 | 0.01239 | 0.189 | 0.01393 | ns |
ST6GALNAC1 | 0.161 | 1.1E-06 | 0.00591 | 0.167 | 1.1E-06 | 0.00465 | 0.102 | 0.13511 | ns |
DLL3 | 0.190 | 1.8E-07 | 0.00093 | 0.188 | 1.4E-06 | 0.00536 | 0.170 | 0.01847 | ns |
UTS2 | 0.158 | 4.8E-06 | 0.02449 | 0.157 | 2.3E-05 | ns | 0.157 | 0.03319 | ns |
TNFRSF14 | 0.147 | 5.3E-06 | 0.02696 | 0.158 | 2.2E-06 | 0.00977 | 0.123 | 0.09463 | ns |
CXCL17 | 0.146 | 5.3E-06 | 0.02700 | 0.145 | 1.18E-05 | 0.04882 | 0.138 | 0.03687 | ns |
RGS8 | 0.149 | 5.4E-06 | 0.02762 | 0.153 | 7.4E-06 | 0.03254 | 0.137 | 0.04946 | ns |
RAB17 | 0.152 | 7.1E-06 | 0.03613 | 0.147 | 2.5E-05 | ns | 0.186 | 0.01027 | ns |
HNRNPC | 0.143 | 7.4E-06 | 0.03769 | 0.137 | 3.0E-05 | ns | 0.163 | 0.01434 | ns |
CRABP1 | 0.156 | 8.7E-06 | 0.04415 | 0.159 | 9.5E-06 | 0.04178 | 0.148 | 0.05272 | ns |
CFHR5 | 0.161 | 1.1E-05 | 0.0463 | 0.077 | 0.01843 | ns | 0.406 | 3.3E-08 | 0.00014 |
RPS6KB1 | 0.169 | 2.3E-06 | 0.00538 | 0.103 | 0.00093 | ns | 0.155 | 0.02414 | ns |
ZPBP | 0.159 | 1.1E-05 | 0.04729 | 0.155 | 2.6E-05 | ns | 0.171 | 0.02920 | ns |
KREMEN2 | 0.183 | 3.2E-06 | 0.01627 | 0.188 | 7.3E-06 | 0.02791 | 0.168 | 0.03047 | ns |
CCL1 | 0.142 | 2.2E-05 | ns | 0.154 | 7.4E-06 | 0.03254 | 0.177 | 0.01066 | ns |
CFP | −0.158 | 1.1E-05 | 0.04689 | −0.103 | 0.00079 | ns | −0.108 | 0.13210 | ns |
GHR | −0.156 | 1.1E-05 | 0.04343 | −0.072 | 0.02341 | ns | −0.145 | 0.02935 | ns |
LINGO1 | 0.162 | 8.0E-06 | 0.04634 | 0.120 | 0.00051 | ns | 0.172 | 0.01456 | ns |
B3GNT6 | 0.142 | 1.0E-05 | 0.04602 | 0.104 | 7.2E-04 | ns | 0.201 | 0.00291 | ns |
C2orf40 | 0.142 | 1.2E-05 | 0.04882 | 0.118 | 1.5E-04 | ns | 0.153 | 0.02070 | ns |
BCL2L1 | 0.147 | 1.2E-05 | 0.04782 | 0.127 | 4.9E-05 | ns | 0.137 | 0.06061 | ns |
BIRC2 | 0.107 | 0.00057 | ns | 0.086 | 0.00693 | ns | 0.321 | 5.0E-06 | 0.02279 |
ns not significant.
aEarly stage AMD was defined by either Holliday et al.22 or Jonasson et al.23. Based on either definition of early AMD, only the most significant association is reported. For more information, see Supplementary Data 2 and 3.
bThe term P adj. refers to the P-value adjusted for the number of comparisons. All reported P-values are two-sided and derived from sex and age-adjusted logistic regression analysis.
The serum protein network, closely linked to many common diseases and under genetic control, was recently identified11. This work showed that serum proteins exist in 27 coregulatory network modules, most of which are arranged in larger clusters. Many of the 28 AMD-associated proteins were correlated to each other (Supplementary Fig. 5), and in fact, 12 of these 28 proteins were enriched in a single module of the serum protein network (P = 2.1 × 10−8) (Fig. 2c and Supplementary Data 5). This module, known as serum protein module 13 (PM13), was not linked to any of the previously studied cardio-metabolic traits11. The Eigenprotein (the first principal component) of PM13 was found to be significantly associated with AMD-related outcomes (Fig. 2d). Serum protein modules included proteins synthesized in all solid tissues and may serve as an integration of the body’s tissue-specific networks. Similarly, some AMD-related proteins were abundantly expressed in AMD-relevant ocular cells (Supplementary Fig. 6), whereas others were enriched in tissues such as the liver and brain (Supplementary Data 6).
Proteins that indicate the progression of or predict late-stage AMD
Using single point sex and age-adjusted logistic regression analysis, we examined which, if any, of the 4137 serum proteins in early AMD subjects only anticipated advancement to late AMD (pure GA or nAMD) in the same people over a 5-year follow-up period. Considering multiple comparisons, a single protein, PRMT3, showed significantly increased levels (OR = 1.88, P = 5.3 × 10−6) in early AMD at the baseline exam and prior to progression to pure GA at follow-up (Fig. 3a and Supplementary Fig. 7A). Figure 3b and Supplementary Fig. 7B show the observed five-year incidence of pure GA AMD across baseline PRMT3 quartiles, with the highest quartile having a significant increase in the number of new patients compared to the lowest (P = 0.00047), using an age and sex-adjusted logistic model. Following that, we looked at how the effect on progression varied across PRMT3 quartiles. As shown in Fig. 3c and Supplementary Fig. 7C, the effect appears to be additive from the lowest to the highest levels of PRMT3 (F-test P = 0.001). No baseline protein changes predicted progression of early AMD to nAMD between first visit and five-year follow-up.
Age has been linked to the progression of both nAMD and GA AMD, whereas higher HDL cholesterol levels have been linked to the progression of GA and sex (females) with nAMD only23. Additionally, smoking has been linked to the rapid progression of GA AMD24. According to a recent study, known AMD susceptibility variants on chromosomes 1 and 10 can predict the progression of GA AMD25. The area under the curve (AUC) was estimated for pure GA prediction comparing models with and without PRMT3 as a predictor to determine if the protein improves prediction beyond known risk factors. Consequently, we examined all known risk factors for prediction of pure GA AMD development using estimates of AUC and included high impact genetic markers on chromosomes 1 (rs10922109 and rs570618) and 10 (rs3750846). This revealed that PRMT3 significantly improved the prediction of pure GA (AUC = 0.76, P = 0.0249) and was additive to all the parameters tested (Fig. 3d and Supplementary Fig. 7D).
Next, we applied a data-driven nonparametric bootstrap26 and least absolute shrinkage and selection operator (LASSO)27 to estimate the sampling distribution of logistic regression coefficients for all 4782 proteins in order to identify independent protein predictors of advanced AMD (Methods). The 21 proteins that appeared in at least 80% of the 500 iterations included the AMD-associated proteins CFHR1, CFHR5, FUT5, BIRC2, and NDUFS4 (Fig. 4). Here, CFHR1 and NDUFS4 had coefficients that were always non-zero in predicting late-stage AMD (Fig. 4). Interestingly, the complement factors CFH and CFB are among the protein predictors not listed in Table 1. Supplementary Figure 8 depicts a receiver operating characteristic curve (ROC) for the diagnostic ability of the 21 protein predictors to classify late-stage AMD, demonstrating a significant difference between the two ROC curves, that is the demographics versus the demographics plus proteins ROC curves (F-test of equality P = 1.4 × 10−17). Overall, these findings highlight the additional benefit of using LASSO regression to uncover new aspects of the relationship between global serum proteins and AMD.
Serum proteins link genetics to AMD, revealing its pathobiology
In the most recent GWAS meta-analysis of advanced AMD, Fritsche et al.10 examined 16,144 AMD patients, identifying 52 independent common and rare variants across 34 distinct genomic loci (Supplementary Data 7). We tested each of these variants for an effect on the 4782 proteins with the aim to narrow down the causal candidates at these genetic risk loci. Here, 18 of the risk loci for AMD were associated with a total of 340 serum proteins (Supplementary Data 7 and 8). In addition, the variant rs11080055, linked to the previously described11 hotspot rs704 (r2 = 0.784) was associated with several hundred serum proteins (Supplementary Data 7 and 9). Intriguingly, 22 out of 28 proteins found to be linked to AMD above were associated with one or more of six AMD susceptibility variants (Table 2). The AMD associated variant rs10922109 in the CFH gene at 1q31.3, for example, was significantly associated with AMD in the AGES-RS (P = 2.6 × 10−16) (Supplementary Data 7) and influenced the levels of 40 serum proteins (Fig. 5 and Supplementary Data 7 and 8), including the AMD-associated proteins CFHR1, TST, DLL3, ST6GALNAC1, CFP, and NDUFS4 (Fig. 5, Table 1, and Supplementary Data 8). Among the proteins associated with rs10922109, complement system proteins are significantly over-represented (Supplementary Data 10). These included the cis (proximal) associated proteins CFHR1, CFHR4, and CFH (Figs. 5 and 6a–c), as well as the trans (distal) associated proteins C3 (Fig. 5), CFP (Figs. 5 and 6d), and CFB (Fig. 5). According to the GTEx database28, rs10922109 is associated with mRNA levels of the genes encoding CFHR1, CFHR4, and CFH (Supplementary Fig. 9A), with directionally consistent effects on the transcripts and cognate proteins (Supplementary Fig. 9A, B). CFHR4 and CFH were not significantly associated with AMD using stringent multiple testing corrections, though the risk allele effect was directionally consistent with their relationships seen using the top and bottom quintiles and various stages of AMD (Fig. 6b, c). C3 and CFB, the two trans regulated proteins, were not significantly linked to any AMD-related outcomes (data not shown). Finally, out of the 40 proteins associated with rs10922109, 11 proteins cluster in protein module PM13 (Supplementary Data 11 and 12).
Table 2.
Protein | AMD varianta | Effect of variant on serum protein levelsb | |||
---|---|---|---|---|---|
SNP | Most proximal gene | β-value | P-value | pQTL | |
NDUFS4 | rs10922109 | CFH | 0.5854 | 9.86E-223 | Trans |
rs570618 | CFH | −0.8904 | 1.20E-566 | Trans | |
CFHR1 | rs10922109 | CFH | −0.5848 | 1.16E-234 | Cis |
rs570618 | CFH | 0.4675 | 1.37E-139 | Cis | |
FUT5 | rs12019136 | FUT6 | −0.9998 | 6.51E-86 | Cis |
TST | rs10922109 | CFH | 0.5241 | 3.69E-178 | Trans |
rs570618 | CFH | −0.7866 | 1.36E-424 | Trans | |
BAMBI | rs570618 | CFH | 0.1364 | 1.03E-14 | Trans |
BPIFB1 | None | N/A | N/A | N/A | N/A |
CEBPB | rs570618 | CFH | 0.0959 | 4.59E-08 | Trans |
ST6GALNAC1 | rs10922109 | CFH | −0.1018 | 3.12E-08 | Trans |
rs570618 | CFH | 0.1488 | 2.00E-15 | Trans | |
rs11080055 | TMEM97 | −0.2254 | 7.21E-37 | Trans | |
DLL3 | rs10922109 | CFH | −0.1219 | 1.98E-11 | Trans |
rs570618 | CFH | 0.1593 | 7.24E-18 | Trans | |
UTS2 | rs570618 | CFH | 0.1301 | 3.88E-13 | Trans |
rs11080055 | TMEM97 | −0.1874 | 3.84E-28 | Trans | |
TNFRSF14 | rs570618 | CFH | 0.1272 | 2.14E-11 | Trans |
rs2230199 | C3 | 0.1693 | 3.96E-17 | Trans | |
CXCL17 | None | N/A | N/A | N/A | N/A |
RGS8 | rs570618 | CFH | 0.1711 | 7.43E-20 | Trans |
RAB17 | rs570618 | CFH | 0.1631 | 6.24E-19 | Trans |
HNRNPC | rs11080055 | TMEM97 | −0.0957 | 2.18E-07 | Trans |
CRABP1 | rs570618 | CFH | 0.1003 | 1.74E-08 | Trans |
rs11080055 | TMEM97 | −0.1854 | 5.16E-28 | Trans | |
CFHR5 | rs570618 | CFH | 0.2284 | 4.53E-32 | Cis |
RPS6KB1 | None | N/A | N/A | N/A | N/A |
ZPBP | rs570618 | CFH | 0.1016 | 3.81E-09 | Trans |
rs11080055 | TMEM97 | −0.1301 | 2.27E-15 | Trans | |
KREMEN2 | rs570618 | CFH | 0.0866 | 3.55E-07 | Trans |
CCL1 | rs570618 | CFH | 0.1820 | 1.71E-22 | Trans |
CFP | rs10922109 | CFH | 0.3049 | 2.64E-62 | Trans |
rs570618 | CFH | −0.3028 | 3.82E-59 | Trans | |
GHR | None | N/A | N/A | N/A | N/A |
LINGO1 | None | N/A | N/A | N/A | N/A |
B3GNT6 | None | N/A | N/A | N/A | N/A |
C2orf40 | rs570618 | CFH | 0.1704 | 4.25E-19 | Trans |
rs11080055 | TMEM97 | −0.1099 | 1.68E-09 | Trans | |
BCL2L1 | rs570618 | CFH | 0.1611 | 3.94E-18 | Trans |
BIRC2 | rs429358 | APOE | −0.5793 | 1.54E-106 | Trans |
Cis proximal effect, Trans distal effect, N/A not applicable.
aAMD associated variants are from Fritsche et al.10 and highlighted in Supplementary Data 7.
bReported P-values are two-sided derived from sex and age-adjusted linear regression analysis. The effect alleles for each corresponding pQTL13 are shown in Supplementary Data 8 and 9. To account for multiple comparisons, significant associations with P-values of <1 × 10−6 are reported.
Aside from the expected enrichment of the complement system among the 340 proteins associated with AMD-linked variants (Supplemental data 8 and 10), there were many previously unknown links, including SLC5A8 (aka SMCT1). SLC5A8 is a Na+-coupled transporter located in the retina and notable for its ability to transport the prodrug 2-oxothiazolidine-4-carboxylate (OTC) across the retina29,30. This protein was found to be linked to two AMD-associated variants, rs2070895 and rs2043085 in or proximal to the LIPC gene (Supplementary Data 8 and Fig. 6e), as well as the APOB intron variant rs2678379 (Fig. 6e), which is a well-established regulator of plasma lipoprotein (HDL, LDL, and triglycerides) levels31. The genetic influence and disease association show that serum levels of SLC5A8 are inversely related to AMD (Fig. 6f), which is consistent with the protein’s proposed protective function30. The proteins linked to AMD-related genetic variants map to pathways with both known and previous unknown association to the disease (Supplementary Note 1 and Supplementary Data 10).
We examined the relationship between all known AMD-related GWAS variants and the 27 serum protein network modules through their Eigenproteins (1st and 2nd principal components). The modules in supercluster III (modules PM11 to PM15)11, were associated with the largest number of AMD-causing genetic variants (Supplementary Data 13). For example, the CFH variant rs570618 at 1q31.3, which is a proxy for the CFH Y402H (aka rs1061170) missense mutation, was linked to AMD risk (Supplementary Data 7) and is associated with 217 serum proteins, 100 of which are found in PM13 (Supplementary Data 8 and 11). Also, rs570618 was significantly associated with the Eigenprotein for the AMD-associated protein module PM13 (Supplementary Data 13), further reinforcing its connection to AMD. In passing, rs570618 was associated with 33 of the 40 proteins linked to rs10922109 (see above) and are both intronic SNPs in the CFH gene in a moderate linkage disequilibrium (r2 = 0.41), but the latter is also associated with 184 other serum proteins (Supplementary Data 8).
The earliest recognized and most well-studied genetic factors for AMD were variants on chromosome 1 (1q31.3) spanning the CFH gene5–8. Further investigation into the AMD-related genetic effects of 1q31.3 variants, which included haplotype analysis, found a common deletion across the CFHR3 and CFHR1 genes that protects against AMD32. The eight variants across the CFH gene10 are not completely independent of the CFHR3-CFHR1 deletion33, which is tagged by the variant rs6677604 (allele A)34. We looked at the effect of rs6677604 on global serum proteins as a proxy for the CFHR3-CFHR1 deletion. The protective allele A for rs6677604 was significantly associated with reduced risk of late AMD in the AGES-RS cohort (β = −0.04, P = 3.0 × 10−7), and was also strongly linked with lower serum levels of CFHR1 (β = −1.33, P = 2.7 × 10−890; Supplementary Data 14). Overall, rs6677604 was linked to 22 proteins in serum (Supplementary Data 14), including six proteins that were not associated with any of the other AMD genetic markers listed in Supplementary Data 8 and 9. For instance, calnexin (CANX) was previously implicated in ARMS2 secretion35 and was one of the six novel proteins associated with rs6677604 (Supplementary Data 14).
Mendelian randomization analysis identifies proteins causally related to AMD
To determine if any of the 28 AMD-associated proteins might be causally related to the disease, a two-sample Mendelian randomization (MR) study was performed using cis-acting genetic variants as instruments. BPIFB1, CFHR1, CFHR5, FUT5, and GHR were found to have such instruments13. However, the genetic instruments for BPIFB1 and GHR do not overlap with previously known risk loci for AMD (Table 2). The causal estimate for CFHR1, CFHR5, and FUT5 was determined using the generalized weighted least squares method (GWLS)36, where CFHR1, CFHR5, and FUT5 were found to be significant (FDR < 0.05; Fig. 7a). For both definitions of AMD (Methods), the causal estimate agrees with the observed relationship between these proteins and AMD (Fig. 7a), highlighting their role as risk factors for AMD development. Of the three proteins, CFHR1 was detected with ELISA in a much smaller sample of an independent study of healthy volunteers and AMD patients, confirming elevated circulating levels of CFHR1 in the disease’s most advanced stage (Methods section and Supplementary Fig. 10). Significant increases in all factor H-related proteins, including CFHR1, have also been linked to AMD in recent studies33,37. Figure 7b–d shows scatter plots with the generalized weighted causal estimate and MR-Egger regression for CFHR1, CFHR5, and FUT5, which were all identified as causal candidates in the MR analysis. The P-values for the Egger intercept were >0.05, indicating that there was not statistically significant evidence of pleiotropy (Fig. 7b–d). Because the loci containing the genes encoding CFHR1, CFHR5, and FUT5 are saturated with multiple independent variants for both AMD (Supplementary Data 7) and proteins13,14, colocalization analysis becomes difficult and thus inconclusive38,39. As a result, colocalization analyses were omitted from the current study.
In a secondary analysis, we repeated the MR analysis, but this time we included 1327 aptamers with cis-acting genetic instruments. In this study, eight additional proteins, including C3, CFI, AIF1, and VTN, were found to have a significant causal estimate for AMD (Supplementary Fig. 11). Here, the allograft inflammatory factor 1 (AIF1) was found to be significantly associated with AMD outcomes in this population and directionally consistent with the causal estimate after multiple testing correction (Supplementary Data 15). These proteins were collectively enriched for the complement and coagulation cascade (FDR = 0.0002). CFHR1 remained statistically significant at this more stringent multiple testing correction threshold (Supplementary Fig. 11).
Discussion
Homeostasis has long been recognized as a property of living systems and is maintained by integrating local and global signals so that each tissue does not act independently, but rather influences and responds to preserve the systemic equilibrium and coordination. The measurement of the levels of thousands of proteins in serum of thousands of individuals has begun to illuminate the process by which homeostasis may occur11,12,19. In this study we document how serum proteins report on and influence AMD appearance and progression in the eye. Previous studies of AMD patient plasma and urine have revealed changes in lipid and energy metabolites; however, sample size has limited detection of serological protein changes, or stratification of late AMD into GA and nAMD40–45. The population-based AGES-RS study with its array of biomarkers, clinical profiles, and genetic risk factors collected prospectively from participants who were aged over 67 at baseline visit, and a follow-up visit after five-years has enabled us to identify circulating proteins and protein networks in patients that are associated with AMD stage and progression. Several grading systems have been used to classify the disease stage of AMD. In our study, we used the classification described by Holliday et al.22 as well as that of Jonasson et al.23. The distinction between these two systems is that one uses a stricter criterion for diagnosis of early AMD (Methods section), resulting in fewer cases. An additional less stringent system22 was included to explore protein associations more fully. It is hoped that using the different criterion will facilitate cross-referencing to various studies in the public domain.
Consistent with the known links between AMD and the complement pathway, three complement pathway proteins were associated with AMD (CFHR1, CFHR5, and CFP). Six additional proteins known to modulate both the innate and adaptive immune response were also associated with AMD (TNFRSF14, CCL1, CXCL17, BPIFB1, BIRC2, and CEBPB), implicating induction of inflammatory processes in AMD. Reductions in two mitochondrial proteins, the complex 1 protein NDUFS4 and the sulfotransferase protein TST were also associated with AMD, consistent with structural, functional, and genetic mitochondrial changes in AMD46–50. Inactivation of the NDUFS4 gene causes a severe form of the vision loss disease Leigh syndrome, and mice lacking this gene die prematurely at the age of 50-60 days51, with compromised photoreceptor function52 and excessive lipid droplet formation53. Elevation of the ribosomal S6 protein kinase 1 (RPS6KB1), a component of the nutrient-responsive mTOR (mammalian target of rapamycin) signaling pathway was also associated with AMD, consistent with the AMD-like phenotype in mice with RPE mTOR overactivation54–56. The pro-longevity effects of RPS6KB1 gene deletion57, or mTORC1 inhibition across multiple species58, are also intriguing when considering AMD as a disorder of aging. Some of these proteins, such as CFHR1, CFHR5, BIRC2, and NDUFS4, were found to be among the 21 independent predictors of late-stage AMD.
The rate of progression from early to late AMD in the AGES-RS was ~4.5% per year23. Serum levels of PRMT3 protein were elevated in early AMD patients who subsequently progressed to GA, but not in those progressing to nAMD. PRMT3 controls ribosomal activity via arginine methylation of the 40S ribosomal subunit protein RPS259–61. Interestingly, control of mRNA translation occurs via phosphorylation of another component of the 40S ribosomal subunit, RPS6, by the previously mentioned kinase RPS6KB1. Also, a third ribosomal 40S subunit RPS10 was controlled by the AMD-associated variants rs570618 and rs429358. Alterations in post-translational regulation of protein synthesis by both phosphorylation and arginine methylation could be a key early driver of AMD pathogenesis and progression.
AMD is associated with 34 distinct genomic loci, accounting for nearly half of AMD’s genetic variance10. Integrating intermediate traits such as mRNA and/or protein levels with genetics and disease traits helps in identifying the causal candidates11,62–65. The present study integrated the most recent findings of AMD genetic risk factors with 4137 human serum proteins, providing mechanistic insights into previously described AMD SNPs. Many of the variants that explained the largest fraction of the genetic susceptibility for AMD, were associated with the levels of numerous circulating proteins. Interestingly, of the AMD-associated proteins identified, the majority were linked to one or more of only six AMD susceptibility variants. The CFH variant rs10922109 (allele A) for example, confers AMD protection (OR = 0.51) and is associated with lower CFHR1 and CFHR4 protein levels and higher CFH levels. Consistent with these changes in protein levels, rs10922109 (allele C) has previously been associated with increased CFHR133, activation of the complement cascade in AMD patients66 and also increased serum CFHR467. AMD variants had both overlapping and distinct effects on serum proteins and were associated with numerous proteins in the complement cascade. For instance, the CFH variant rs570618 influenced 33 of the 40 proteins associated with the co-localized variant rs10922109 (e.g., CFHR1/4, C3, CFB, and CFP), but rs570618 is also associated with 184 other serum proteins (e.g., CFHR5, C1S and C8) (Supplementary Data 8). As previously noted, rs570618 is a surrogate for the CFH missense mutation Y402H. It has been claimed that this mutation causes CFH to bind less tightly to CRP68, impairing debris clearance and increasing retinal inflammation. According to the current study, Y402H in CFH is associated with variations in blood levels of 217 proteins, which suggests a more complex explanation of the substantial risk of rs570618 for AMD. A previous study has connected the rs570618 AMD risk allele T to elevated CFHR1 and CFHR5 levels33, which is consistent with our findings (Supplementary Data 8). Finally, we examined the association of protein networks with AMD, since deep profiling has revealed the modular structure of the serum proteome containing 27 distinct protein modules11. Interestingly, the protein module PM13 is directly associated with AMD outcomes and is enriched for individual proteins linked to AMD (12 of 28) and AMD risk variants.
CFHR1 and CFHR5 are presumed pathogenic since they impede CFH binding to pro-inflammatory lipid peroxidation products69,70, and induce inflammasome activation71, whereas CFHR1 gene deletion is known to be protective for AMD69. Indeed, the two-sample MR test analysis revealed that both proteins were causally linked to AMD, consistent with prior MR analyses of factor H-related proteins found causally related to AMD37. However, because their cis regions overlap (Supplementary Data 8), from which the genetic instruments are selected, it is impossible to say whether one or both are the causal effector. In addition, FUT5 was supported as causal candidate in the MR analysis. As 23 of the AMD-associated proteins did not have any cis-acting instruments, they could not be tested in the MR analyses, and we thus cannot exclude their candidacy as causal proteins. In the extended MR analysis, eight additional proteins were supported as causal, including other members of the complement system (CFI, C3, VTN), further highlighting the role of this pathway in the development of AMD. In contrast to these strong genetic associations driving systemic protein changes discussed above, rs3750846 in the ARMS2/HTRA1 locus on chromosome 10 was not associated with serum protein levels in the present study. One reason could be that the current platform lacks aptamers that detect the ARMS2 and HTRA1 proteins encoded by genes near rs3750846, but the variant is known to regulate the transcription of both ARMS2 and HTRA1 in solid tissues72. Alternatively, this variant may alter ocular specific protein changes that are not apparent in serum. It should be noted, however, that CANX, which was previously implicated in ARMS2 secretion35, was linked to the AMD-associated variant rs6677604 on chromosome 1 (Supplementary Data 14).
AMD is a multifactorial age-related disease with a complex pathobiology in which systemic and local inflammatory and other effectors play significant roles. Recent evidence indicates that thousands of proteins present in serum participate in cross tissue regulation that connects all parts of the body11,12. This occurs by tissues secreting protein(s) into the bloodstream which control biological processes in other physically distant tissues, resulting in a network of cross-tissue regulatory loops12. For example, the expression of the proteins CFHR1 and CFHR5, which have been causally linked to AMD in this study, are highly specific to the liver, whereas FUT5 is expressed in the bone marrow and testis (Supplementary Data 6). More research is required to determine the roles of the numerous serum proteins discussed in this study and their potential effect on various pathophysiological processes that lead to different stages of AMD in the local eye setting. Some of the serum proteins associated with AMD in this study are characterized as intracellular proteins, and the significance of their presence in serum, remains to be determined. Our findings provide a comprehensive and unique framework for understanding the pathobiology of AMD, which may lead to the discovery of systemic biomarkers and therapeutic targets for the disease.
Methods
Study population
Participants aged 66 through 96 were from the Age, Gene/Environment Susceptibility Reykjavik Study (AGES-RS) cohort73. The AGES-RS was approved by the NBC in Iceland (approval number VSN-00-063), and by the National Institute on Aging Intramural Institutional Review Board, and the Data Protection Authority in Iceland. AGES-RS is a single-center prospective population-based study of highly phenotyped subjects (5764, mean age 76.6 ± 5.6 years) and survivors of the 40-year-long prospective Reykjavik study (N ∼ 18,000), an epidemiologic study aimed to understand aging in the context of gene/environment interaction by focusing on four biologic systems: vascular, neurocognitive (including sensory), musculoskeletal, and body composition/metabolism. Descriptive statistics of this cohort as well as detailed definition of the various disease endpoints and relevant phenotypes measured have been published11,73. Of the 5764 AGES-RS participants 3411 attended a 5-year follow-up visit (AGES-RS II).
Detailed description of AMD diagnosis and the baseline characteristics of the AMD population in AGES-RS has previously been described in two separate publications22,23, which vary in terms of the medical definition of early stage AMD. In Holliday et al.22, early stage AMD (n = 1755) was defined as the presence of soft drusen (>63 µm) alone, retinal pigment epithelium (RPE) depigmentation alone or a combination of soft drusen with increased retinal pigment and/or depigmentation in the absence of late AMD, while Jonasson et al.23 defined early AMD (n = 1054) by the presence of any soft drusen and pigmentary abnormalities (increased or decreased retinal pigment) or the presence of large soft drusen ≥125 µm in diameter with a large drusen area >500 µm in diameter, or large ≥125 µm indistinct soft drusen in the absence of signs of late AMD23. Late AMD (n = 272) was defined by the presence of any of the following: geographic atrophy (GA) or exudative AMD including subretinal hemorrhage, subretinal fibrous scar, RPE detachment, or serous detachment of the sensory retina or signs of treatment for neovascular AMD23. Also, early and late AMD was combined as AMD any. Late AMD was separated into GA pure (n = 112) or neovascular AMD (nAMD, n = 160). GA with possibly some exudative AMD was also defined (GA + nAMD, n = 183). We used both medical definitions of early AMD (and AMD any) separately to analyze the association of serum proteins with AMD.
Protein measurements via SOMAmers
For the AGES-RS, we used a distinct version of the SomaScan platform (Novartis V3-5K), based on the slow-off rate modified aptamer (SOMAmer) protein profiling technology11,12,74. The aptamers are small single-stranded 40-mer DNA oligomers with modified nucleic acids selected to specifically recognize target proteins in their native three-dimensional state and show slow dissociation kinetics (t1/2 > 30 min) which in combination with stringent wash steps impedes nonspecific binding74. The custom-design SOMAscan platform was built to quantify 5034 protein analytes in a single serum sample with a focus on proteins that are known or expected to be present extracellularly or on the surface of cells, 4782 of which SOMAmers directly bind to 4137 different human proteins. Blood samples were collected at the AGES-RS baseline, after an overnight fast. Serum was prepared using a standardized protocol75, stored in 0.5 ml aliquots at −80 °C and serum samples that had not been previously thawed were used for the protein measurements. To avoid batch or time of processing biases, the order of sample collection and processing for protein measurements were randomized and all samples run as a single set at SomaLogic Inc. (Boulder, CO, US). All SOMAmers that passed quality control had median intra-assay and inter-assay coefficient of variation (CV) < 5% or equivalent to reported variability76. Several metrics, including aptamer specificity through direct tandem mass spectrometry (MS) analysis and inferential assessment via genetic analysis, have been used to determine the performance of the proteomic platform, suggesting strong target specificity throughout the platform11. Hybridization controls were used to correct for systematic variability in detection and calibrator samples of three dilution sets (40%, 1%, and 0.005%) were included so that the degree of fluorescence was a quantitative reflection of protein concentration. Box-Cox transformation was applied on the protein data77, and extreme outlier values excluded, defined as values above the 99.5th percentile of the distribution of 99th percentile cutoffs across all proteins after scaling, resulting in the removal of an average 11 samples per SOMAmer. Previous studies have shown that protein quantitative trait loci (QTLs) replicate well across different study populations as well as proteomic platforms11,21. While a recent comparison of protein measurements across different platforms showed a wide range of correlations78, cis pQTLs detection and validation by orthogonal MS-based measures were predictive of a strong correlation across platforms and were great indicators of platform specificity when protein concentrations obtained by orthogonal methods differ. The aptamer specificity of six of the AMD-associated proteins listed in the main text has already been validated by orthogonal mass spectrometry (MS)-based approach11. Nine additional aptamers were confirmed with SOMAmer pull down and MS (SP-MS) using AMD patient serum samples (Supplementary Data 4). Aside from low abundance affecting pull-down, there are many reasons why some aptamer-enriched proteins are not detected by SP-MS. For instance, MS analyses have different sensitivity than that of SOMAmer scan and is dependent on the protein sequence. Factors such as ion suppression of individual peptides caused by instrument/interfering molecules, incomplete trypsin digestion, and modifications of peptides either naturally or artifactually can limit the detection.
Protein measurements via ELISA
Participants aged 55 and over (up to n = 60, 15 per group), enrolled in a prospective study by Ophthalmic Consultants of Boston, to measure complement and disease-related protein biomarkers in blood. AMD patients with Age-Related Eye Disease Study (AREDS)79 grade 2–4 had best corrected visual acuity (BCVA) of at least 20/200. Healthy volunteers were aged matched (±2 to AMD patients) with BCVA of 20/40 or better in both eyes, and a comprehensive eye examination within previous 12 months revealing no diagnosis other than refractive error, mild cataract, dry eye, or AREDS grade 1. AREDS grading criteria for grade 2 included mild changes including multiple small drusen, non-extensive intermediate drusen, and/or pigment abnormalities. AREDS grade 3 included at least 1 large drusen of at least 125 µm in diameter, extensive intermediate drusen, and/or noncentral geographic atrophy. AREDS grade 4 included advanced AMD central geographic atrophy in one or both eyes. Qualifying subjects completed an informed consent form, and no identifying patient information was included. A blood sample of 10 ml was collected by routine phlebotomy, in EDTA-coated, lavender-topped collection tubes. Plasma was isolated after centrifugation at 3184 × g for 30 min, frozen and stored at −80 °C. CFH antibody was purchased from Quidel (cat. #A255). Rabbits were immunized with recombinant human CFHR1 protein at Covance (Princeton, NJ), and CFHR1 cross-reactive antibodies were purified from serum using CFHR1-conjugated CNBr-activated Sepharose 4B resin (Cytiva, Marlborough, MA). Additionally, antibodies showing cross-reactivity to Factor H were removed by several rounds of depletion using Factor H-conjugated resin. CFHR1 levels in patient plasma were measured in a Meso Scale Discovery assay (Rockville, MD) using the selective antibodies.
We investigated any possible cross-reactivity of CFHR1 antibodies with the CFH and CFHR proteins. For this, recombinant CFH protein was obtained from Complement Technology (TX, US), and recombinant CFH and CFHR proteins were expressed in HEK cells. More specifically, anti-CFH antibodies demonstrated no cross-reactivity to CFHR proteins as measured by ELISA, whereas anti-CFHR1 antibodies showed trace cross-reactivity to CFHR2, however, with CFHR2 binding signal below the signal level of CFHR1 used to extrapolate unknown plasma levels.
Analysis of gene expression in single-cell RNA sequencing data from eye tissues
Two separate single-cell RNA-sequencing experiments on seven human donor eyes, five controls and two AMD patients, yielded the data for single-cell gene expression80, also found at the GEO database (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE135922). The first study assessed single cells from the RPE/choroid (2 controls and 1 AMD), whereas the second evaluated the endothelial population after a CD31 antibody enrichment step (3 controls and 1 AMD)80. Each of the AMD donors was labeled as having “neovascular AMD,” with no other phenotypic information provided. The normalized single-cell data was downloaded from GEO and was analyzed with the R package Seurat (v.3.0.0) in R 3.6.3 environment. The final dataset contained 4335 cells after filtering. Variable genes were identified using Seurat with default parameters and Principal Components Analysis (PCA) was performed on these variable genes. First 11 PCs of the single-cell data (resolution = 0.2) were used for clustering cells with similar gene expression profile. Clusters were identified using FindNeighbors and FindClusters functions from Seurat package and UMAP dimensionality reduction was utilized for cluster visualization. The cell clusters were then manually annotated based on the markers reported in the paper80.
Statistical and genetic analysis
We used linear or logistic regression for the associations of individual proteins as well as Eigenvectors of protein modules with various phenotype and genotype measures, depending on the result being continuous or binary. Statistical results were obtained using linear models for continuous outcomes and generalized logistic models for binary outcomes for comparison of protein quintiles and AMD-related clinical traits. The models were fit using the outcome phenotypes as dependent variables and protein quintiles as predictor variables along with the adjustment variables age and sex. The protein quintiles were treated as factor variables to avoid underlying assumptions regarding linearity of effects. Continuous outcomes were standardized using z-scores prior to model fitting. Thus, coefficient estimates should be interpreted on the standard deviation scale. That is an estimated mean difference of 1 between protein quintiles translates to a one-standard-deviation difference between groups after adjusting for other included variables. The expected means were obtained as linear predictions from the fitted models along with the fitted confidence intervals around the mean. The linear predictions for qualitative phenotypes are shown on the log-odds scale. The difference between the 5th and 1st protein quintiles was obtained as the expected marginal difference between those groups. Thus, for continuous outcomes, they are the optimal linear estimator with corresponding confidence intervals and P-values, but for discrete outcomes they are obtained using the commonly applied asymptotic approximations. We applied linear regression using an additive genetic model for all single-point SNP association analyses for different disease-related outcomes as well as for proteins. The results of associating genetic variants with serum protein levels were obtained using a GWAS of 4782 human proteins measured in serum using 7,506,463 assayed and imputed genetic variants from 5368 AGES-RS individuals13.
For identification of protein predictors for late-stage AMD, we approximated the sampling distribution of logistic regression coefficients for all 4782 protein variables through the nonparametric bootstrap26 and least absolute shrinkage and selection operator (LASSO)27, estimated using the glmnet package for R81. We summarized our results after 500 bootstrap iterations by calculating the coefficients mean, 95% confidence intervals by calculating the 2.5% and 97.5% quantiles, and the percentage of iterations in which they were non-zero. We adjusted the LASSO models for age and sex in our analysis by leaving their coefficients unpenalized. Here, we fit the logistic regression LASSO model for prevalent late-stage AMD and compare the odds of late AMD to no AMD. All statistical analyses were conducted using the software environment R statistical package version 3.6.0 (2019-04-26) and and RStudio (1.1.456).
Two-sample Mendelian randomization analysis
In a two-sample Mendelian randomization study, genetic variants for each trait, the protein-encoding gene (exposure) X and AMD (outcome) Y are found in two distinct samples. A genetic variant (SNP) Z is used in the analysis provided it fulfills the three assumptions of instrumental variables:
There exists a significant association between SNP Z and exposure X.
SNP Z is independent of any confounder U which might influence exposure X and outcome Y.
SNP Z is independent of outcome conditional on exposure X and confounder U. This assumption is usually referred to as the exclusion criterion.
The first assumption is readily tested by setting a threshold on the significance level on the SNP-exposure association. Unfortunately, it remains a challenge to examine the validity of the second and third assumptions. When a SNP violates the third assumption, we generally speak of pleiotropy. To obtain the SNP-exposure associations, all genetic instruments within a 1 Mb (±500 kb) cis window for the protein-encoding gene were obtained for a given SOMAmer. A cis-window-wide significance level Pb = 0.05/N, where N was the number of SNPs within a given cis-window, was computed. Genetic instruments within the cis window for each SOMAmer were then clumped such that variants in high linkage disequilibrium (LD) (r2 ≥ 0.2) within a 1 Mb region were combined. The list of variants was then further pruned by removing all instruments with P ≥ Pb.
Summary statistics on the genetic risk on AMD were obtained from a GWAS provided by the IAMDGC consortium10. Any SNP in the cis window-wide significant data set not found in the AMD GWAS data set were replaced by proxy SNPs (r2 > 0.8) when possible, to maximize SNP coverage. Causal estimate for each protein was obtained by the generalized weighted least squares (GWLS) method36, which accounts for the correlation that can exist between instruments. Let be an index set of all SNPs associated with a protein-encoding gene and AMD. For , denote the j-th SNP-protein association and SNP-AMD association as and respectively, each with their corresponding standard error and . Then, the causal estimate is found by evaluating:
1 |
where and and is the inverse of the weighting matrix whose (i,j)-th entry is and is the correlation between SNPs . The standard error of is , where with:
2 |
reflects our uncertainty about the weights by assuming that we only know their relative magnitudes.
Finally, each estimate was subjected to a two-step sensitivity analysis. Any protein with a causal estimate found to be significant after adjusting the P-value with the Benjamini–Hochberg method was reassessed with the weighted median estimator which allows for up to 50% of the genetic instruments to violate any of the three assumptions of instrumental variables82. If the direction of the weighted median estimator was consistent with the GWLS estimate and remained significant at P < 0.05 it was subjected to the second stage of the sensitivity analysis but otherwise removed. Any protein which passed the first step was then reexamined with MR-Egger estimator which replaces the exclusion criterion with a weaker condition, the InSIDE assumption83, which allows SNPs to exhibit pleiotropic effects provided that the pleiotropic effect is uncorrelated with the SNP-AMD effect . As with the first step, any protein whose MR-Egger estimate was directionally consistent with the GWLS estimate and had a P-value exceeding 0.05 for the intercept coefficient was kept and considered a causal candidate. Causality for proteins with single cis-acting variants was assessed with the Wald ratio estimator .
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
The authors acknowledge the contribution of the Icelandic Heart Association (IHA) staff to the AGES-RS, as well as the involvement of all study participants. We thank the IAMDGC consortium for supplying us with their GWAS summary statistics data. National Institute on Aging (NIA) contracts N01-AG-12100 and HHSN271201200022C for V.G. financed the AGES study; retinal image collection and AMD readings were funded by the NIH Intramural Research Program (ZIAEY000401). V.G. received a funding from the NIA (1R01AG065596), and IHA received a support from Althingi (the Icelandic Parliament). The Icelandic Research Fund (IRF) funded V.E. and Va.G. with grants 195761-051, 184845-053, and 206692-051, while Va.G. received a postdoctoral research grant from the University of Iceland Research Fund
Author contributions
V.E. and T.E.W. designed and supervised the study. V.E., T.E.W., Va.G., T.J., E.F.G., B.G.J., T.A., M.T., N.F., S.P., X.L., R.E., Y.Z., S.J., C.L.H., S.M.L., J.L., C.L.G., A.A.N., B.L., R.P., T.E.W., Vi.G., M.F.C., Z.L., L.J.L., L.L.J., F.J., Q.Z., and Q.H. performed data analysis and/or contributed to the acquisition of data. N.F., R.P., X.L., J.R.L., and L.L.J. provided expertize on proteomics data. V.E. and T.E.W. wrote the first draft of the manuscript, with all coauthors contributing to data interpretation and manuscript editing. All coauthors have approved the submitted version of the paper.
Peer review
Peer review information
Nature Communications thanks Anneke den Hollander and Sirui Zhou for their contribution to the peer review of this work. Peer reviewer reports are available.
Data availability
The custom-design Novartis SOMAscan data are available through a collaboration agreement with the Novartis Institutes for BioMedical Research (lori.jennings@novartis.com). Data from the AGES Reykjavik study are available through collaboration (AGES_data_request@hjarta.is) under a data usage agreement with the IHA. All access to data is controlled via the use of a subject-signed informed consent authorization. The time it takes to respond to requests varies depending on their nature and circumstances of the request, but it will not exceed 14 working days. All data supporting the conclusions of the paper are presented in the main text and freely available as a supplement to this manuscript (Supplementary Information and Supplementary Data 1–15). The GTEx database (https://gtexportal.org/home/) was used to obtain the eQTL data.
Competing interests
The study was supported by the Novartis Institute for Biomedical Research. M.T., N.F., S.P., X.L., R.E., Y.Z., S.J., C.L.H., S.M.L., J.L., C.L.G., A.A.N., B.L., R.P., Z.L., L.L.J., T.E.W., Q.Z., Q.H., and J.R.L. are employees and stockholders of Novartis. All other authors have no conflict of interests to declare.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Valur Emilsson, Email: valur@hjarta.is.
Tony E. Walshe, Email: tony.walshe@agios.com
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-022-31085-x.
References
- 1.Wong WL, et al. Global prevalence of age-related macular degeneration and disease burden projection for 2020 and 2040: a systematic review and meta-analysis. Lancet Glob. Health. 2014;2:e106–e116. doi: 10.1016/S2214-109X(13)70145-1. [DOI] [PubMed] [Google Scholar]
- 2.Rofagha S, et al. Seven-year outcomes in ranibizumab-treated patients in ANCHOR, MARINA, and HORIZON: a multicenter cohort study (SEVEN-UP) Ophthalmology. 2013;120:2292–2299. doi: 10.1016/j.ophtha.2013.03.046. [DOI] [PubMed] [Google Scholar]
- 3.Jaffe, G. J. et al. Imaging features associated with progression to geographic atrophy in age-related macular degeneration: CAM report 5. Ophthalmol. Retina5, 855–867 (2020). [DOI] [PubMed]
- 4.Chakravarthy U, et al. Characterizing disease burden and progression of geographic atrophy secondary to age-related macular degeneration. Ophthalmology. 2018;125:842–849. doi: 10.1016/j.ophtha.2017.11.036. [DOI] [PubMed] [Google Scholar]
- 5.Edwards AO, et al. Complement factor H polymorphism and age-related macular degeneration. Science. 2005;308:421–424. doi: 10.1126/science.1110189. [DOI] [PubMed] [Google Scholar]
- 6.Hageman GS, et al. A common haplotype in the complement regulatory gene factor H (HF1/CFH) predisposes individuals to age-related macular degeneration. Proc. Natl Acad. Sci. USA. 2005;102:7227–7232. doi: 10.1073/pnas.0501536102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Haines JL, et al. Complement factor H variant increases the risk of age-related macular degeneration. Science. 2005;308:419–421. doi: 10.1126/science.1110359. [DOI] [PubMed] [Google Scholar]
- 8.Klein RJ, et al. Complement factor H polymorphism in age-related macular degeneration. Science. 2005;308:385–389. doi: 10.1126/science.1109557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Raychaudhuri S, et al. A rare penetrant mutation in CFH confers high risk of age-related macular degeneration. Nat. Genet. 2011;43:1232–1236. doi: 10.1038/ng.976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Fritsche LG, et al. A large genome-wide association study of age-related macular degeneration highlights contributions of rare and common variants. Nat. Genet. 2016;48:134–143. doi: 10.1038/ng.3448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Emilsson V, et al. Co-regulatory networks of human serum proteins link genetics to disease. Science. 2018;361:769–773. doi: 10.1126/science.aaq1327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Lamb, J. R., Jennings, L. L., Gudmundsdottir, V., Gudnason, V. & Emilsson, V. It’s in our blood: a glimpse of personalized medicine. Trends Mol. Med.27, 20–30 (2020). [DOI] [PMC free article] [PubMed]
- 13.Gudjonsson A, et al. A genome-wide association study of serum proteins reveals shared loci with common diseases. Nat. Commun. 2022;13:480. doi: 10.1038/s41467-021-27850-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Emilsson V, et al. Coding and regulatory variants are associated with serum protein levels and disease. Nat. Commun. 2022;13:481. doi: 10.1038/s41467-022-28081-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Geyer PE, et al. Plasma Proteome Profiling to detect and avoid sample-related biases in biomarker studies. EMBO Mol. Med. 2019;11:e10427. doi: 10.15252/emmm.201910427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Enroth S, et al. Systemic and specific effects of antihypertensive and lipid-lowering medication on plasma protein biomarkers for cardiovascular diseases. Sci. Rep. 2018;8:5531. doi: 10.1038/s41598-018-23860-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Schwenk JM, et al. The Human Plasma Proteome Draft of 2017: building on the human plasma peptideatlas from mass spectrometry and complementary assays. J. Proteome Res. 2017;16:4299–4310. doi: 10.1021/acs.jproteome.7b00467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Williams SA, et al. Plasma protein patterns as comprehensive indicators of health. Nat. Med. 2019;25:1851–1857. doi: 10.1038/s41591-019-0665-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lehallier B, et al. Undulating changes in human plasma proteome profiles across the lifespan. Nat. Med. 2019;25:1843–1850. doi: 10.1038/s41591-019-0673-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Emilsson V, Gudnason V, Jennings LL. Predicting health and life span with the deep plasma proteome. Nat. Med. 2019;25:1815–1816. doi: 10.1038/s41591-019-0677-y. [DOI] [PubMed] [Google Scholar]
- 21.Sun BB, et al. Genomic atlas of the human plasma proteome. Nature. 2018;558:73–79. doi: 10.1038/s41586-018-0175-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Holliday EG, et al. Insights into the genetic architecture of early stage age-related macular degeneration: a genome-wide association study meta-analysis. PLoS ONE. 2013;8:e53830. doi: 10.1371/journal.pone.0053830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Jonasson F, et al. Five-year incidence, progression, and risk factors for age-related macular degeneration: the age, gene/environment susceptibility study. Ophthalmology. 2014;121:1766–1772. doi: 10.1016/j.ophtha.2014.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Joachim N, et al. Incidence and progression of geographic atrophy: observations from a population-based cohort. Ophthalmology. 2013;120:2042–2050. doi: 10.1016/j.ophtha.2013.03.029. [DOI] [PubMed] [Google Scholar]
- 25.Perlee LT, et al. Inclusion of genotype with fundus phenotype improves accuracy of predicting choroidal neovascularization and geographic atrophy. Ophthalmology. 2013;120:1880–1892. doi: 10.1016/j.ophtha.2013.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Efron B. Bootstrap methods: another look at the jackknife. Ann. Stat. 1979;7:1–26. doi: 10.1214/aos/1176344552. [DOI] [Google Scholar]
- 27.Tibshirani R. Regression shrinkage and selection via the lasso. J. R. Stat. Soc. Ser. B (Methodol.) 1996;58:267–288. [Google Scholar]
- 28.GTEx Consortium. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Babu E, et al. Transport via SLC5A8 (SMCT1) is obligatory for 2-oxothiazolidine-4-carboxylate to enhance glutathione production in retinal pigment epithelial cells. Invest. Ophthalmol. Vis. Sci. 2011;52:5749–5757. doi: 10.1167/iovs.10-6825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Martin PM, et al. Expression of the sodium-coupled monocarboxylate transporters SMCT1 (SLC5A8) and SMCT2 (SLC5A12) in retina. Invest. Ophthalmol. Vis. Sci. 2007;48:3356–3363. doi: 10.1167/iovs.06-0888. [DOI] [PubMed] [Google Scholar]
- 31.Hoffmann TJ, et al. A large electronic-health-record-based genome-wide study of serum lipids. Nat. Genet. 2018;50:401–413. doi: 10.1038/s41588-018-0064-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hughes AE, et al. A common CFH haplotype, with deletion of CFHR1 and CFHR3, is associated with lower risk of age-related macular degeneration. Nat. Genet. 2006;38:1173–1177. doi: 10.1038/ng1890. [DOI] [PubMed] [Google Scholar]
- 33.Lorés-Motta L, et al. Common haplotypes at the CFH locus and low-frequency variants in CFHR2 and CFHR5 associate with systemic FHR concentrations and age-related macular degeneration. Am. J. Hum. Genet. 2021;108:1367–1384. doi: 10.1016/j.ajhg.2021.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Keilhauer CN, Fritsche LG, Guthoff R, Haubitz I, Weber BH. Age-related macular degeneration and coronary heart disease: evaluation of genetic and environmental associations. Eur. J. Med. Genet. 2013;56:72–79. doi: 10.1016/j.ejmg.2012.10.005. [DOI] [PubMed] [Google Scholar]
- 35.Kortvely E, Hauck SM, Behler J, Ho N, Ueffing M. The unconventional secretion of ARMS2. Hum. Mol. Genet. 2016;25:3143–3151. doi: 10.1093/hmg/ddw162. [DOI] [PubMed] [Google Scholar]
- 36.Burgess S, Dudbridge F, Thompson SG. Combining information on multiple instrumental variables in Mendelian randomization: comparison of allele score and summarized data methods. Stat. Med. 2016;35:1880–1906. doi: 10.1002/sim.6835. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Cipriani V, et al. Beyond factor H: the impact of genetic-risk variants for age-related macular degeneration on circulating factor-H-like 1 and factor-H-related protein concentrations. Am. J. Hum. Genet. 2021;108:1385–1400. doi: 10.1016/j.ajhg.2021.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Wallace C. A more accurate method for colocalisation analysis allowing for multiple causal variants. PLoS Genet. 2021;17:e1009440. doi: 10.1371/journal.pgen.1009440. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Giambartolomei C, et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Lynch AM, et al. Plasma biomarkers of reticular pseudodrusen and the risk of progression to advanced age-related macular degeneration. Transl. Vis. Sci. Technol. 2020;9:12. doi: 10.1167/tvst.9.10.12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Lynch AM, et al. Proteomic profiles in advanced age-related macular degeneration using an aptamer-based proteomic technology. Transl. Vis. Sci. Technol. 2019;8:14. doi: 10.1167/tvst.8.1.14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Laíns, I. et al. Human plasma metabolomics in age-related macular degeneration: meta-analysis of two cohorts. Metabolites9, 127 (2019). [DOI] [PMC free article] [PubMed]
- 43.Laíns I, et al. Human plasma metabolomics study across all stages of age-related macular degeneration identifies potential lipid biomarkers. Ophthalmology. 2017;125:245–254. doi: 10.1016/j.ophtha.2017.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Laíns I, et al. Human plasma metabolomics in age-related macular degeneration (AMD) using nuclear magnetic resonance spectroscopy. PLoS ONE. 2017;12:e0177749. doi: 10.1371/journal.pone.0177749. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Laíns I, et al. Metabolomics in the study of retinal health and disease. Prog. Retinal Eye Res. 2018;69:57–79. doi: 10.1016/j.preteyeres.2018.11.002. [DOI] [PubMed] [Google Scholar]
- 46.Jones MM, et al. Mitochondrial DNA haplogroups and age-related maculopathy. Arch. Ophthalmol. 2007;125:1235–1240. doi: 10.1001/archopht.125.9.1235. [DOI] [PubMed] [Google Scholar]
- 47.Feher J, et al. Mitochondrial alterations of retinal pigment epithelium in age-related macular degeneration. Neurobiol. Aging. 2006;27:983–993. doi: 10.1016/j.neurobiolaging.2005.05.012. [DOI] [PubMed] [Google Scholar]
- 48.Udar N, et al. Mitochondrial DNA haplogroups associated with age-related macular degeneration. Invest. Ophthalmol. Vis. Sci. 2009;50:2966–2974. doi: 10.1167/iovs.08-2646. [DOI] [PubMed] [Google Scholar]
- 49.Nordgaard CL, Karunadharma PP, Feng X, Olsen TW, Ferrington DA. Mitochondrial proteomics of the retinal pigment epithelium at progressive stages of age-related macular degeneration. Invest. Ophthalmol. Vis. Sci. 2008;49:2848–2855. doi: 10.1167/iovs.07-1352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Ferrington DA, et al. Altered bioenergetics and enhanced resistance to oxidative stress in human retinal pigment epithelial cells from donors with age-related macular degeneration. Redox Biol. 2017;13:255–265. doi: 10.1016/j.redox.2017.05.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Jain IH, et al. Hypoxia as a therapy for mitochondrial disease. Science. 2016;352:54–61. doi: 10.1126/science.aad9642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Gospe SM, 3rd, et al. Photoreceptors in a mouse model of Leigh syndrome are capable of normal light-evoked signaling. J. Biol. Chem. 2019;294:12432–12443. doi: 10.1074/jbc.RA119.007945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Liu L, MacKenzie KR, Putluri N, Maletic-Savatic M, Bellen HJ. The glia-neuron lactate shuttle and elevated ROS promote lipid synthesis in neurons and lipid droplet accumulation in glia via APOE/D. Cell Metab. 2017;26:719–737.e716. doi: 10.1016/j.cmet.2017.08.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Go YM, et al. MTOR-initiated metabolic switch and degeneration in the retinal pigment epithelium. FASEB J. 2020;34:12502–12520. doi: 10.1096/fj.202000612R. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Zhao C, et al. mTOR-mediated dedifferentiation of the retinal pigment epithelium initiates photoreceptor degeneration in mice. J. Clin. Invest. 2011;121:369–383. doi: 10.1172/JCI44303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Huang J, et al. Abnormal mTORC1 signaling leads to retinal pigment epithelium degeneration. Theranostics. 2019;9:1170–1180. doi: 10.7150/thno.26281. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Selman C, et al. Ribosomal protein S6 kinase 1 signaling regulates mammalian life span. Science. 2009;326:140–144. doi: 10.1126/science.1177221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Lopez-Otin C, Galluzzi L, Freije JMP, Madeo F, Kroemer G. Metabolic control of longevity. Cell. 2016;166:802–821. doi: 10.1016/j.cell.2016.07.031. [DOI] [PubMed] [Google Scholar]
- 59.Swiercz R, Cheng D, Kim D, Bedford MT. Ribosomal protein rpS2 is hypomethylated in PRMT3-deficient mice. J. Biol. Chem. 2007;282:16917–16923. doi: 10.1074/jbc.M609778200. [DOI] [PubMed] [Google Scholar]
- 60.Choi S, Jung CR, Kim JY, Im DS. PRMT3 inhibits ubiquitination of ribosomal protein S2 and together forms an active enzyme complex. Biochim. Biophys. Acta. 2008;1780:1062–1069. doi: 10.1016/j.bbagen.2008.05.010. [DOI] [PubMed] [Google Scholar]
- 61.Dionne KL, Bergeron D, Landry-Voyer AM, Bachand F. The 40S ribosomal protein uS5 (RPS2) assembles into an extraribosomal complex with human ZNF277 that competes with the PRMT3-uS5 interaction. J. Biol. Chem. 2019;294:1944–1955. doi: 10.1074/jbc.RA118.004928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Schadt EE. Molecular networks as sensors and drivers of common human diseases. Nature. 2009;461:218–223. doi: 10.1038/nature08454. [DOI] [PubMed] [Google Scholar]
- 63.Zhang B, et al. Integrated systems approach identifies genetic nodes and networks in late-onset Alzheimer’s disease. Cell. 2013;153:707–720. doi: 10.1016/j.cell.2013.03.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Emilsson V, et al. Genetics of gene expression and its effect on disease. Nature. 2008;452:423–U422. doi: 10.1038/nature06758. [DOI] [PubMed] [Google Scholar]
- 65.Chen YQ, et al. Variations in DNA elucidate molecular networks that cause disease. Nature. 2008;452:429–435. doi: 10.1038/nature06757. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Heesterbeek TJ, et al. Complement activation levels are related to disease stage in AMD. Invest. Ophthalmol. Vis. Sci. 2020;61:18. doi: 10.1167/iovs.61.3.18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Cipriani V, et al. Increased circulating levels of Factor H-Related Protein 4 are strongly associated with age-related macular degeneration. Nat. Commun. 2020;11:778. doi: 10.1038/s41467-020-14499-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Skerka C, et al. Defective complement control of factor H (Y402H) and FHL-1 in age-related macular degeneration. Mol. Immunol. 2007;44:3398–3406. doi: 10.1016/j.molimm.2007.02.012. [DOI] [PubMed] [Google Scholar]
- 69.Alic L, et al. A genome-wide association study identifies key modulators of complement factor H binding to malondialdehyde-epitopes. Proc. Natl Acad. Sci. USA. 2020;117:9942–9951. doi: 10.1073/pnas.1913970117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Rudnick RB, et al. FHR5 binds to laminins, uses separate C3b and surface-binding sites, and activates complement on malondialdehyde-acetaldehyde surfaces. J. Immunol. 2018;200:2280–2290. doi: 10.4049/jimmunol.1701641. [DOI] [PubMed] [Google Scholar]
- 71.Irmscher S, et al. Serum FHR1 binding to necrotic-type cells activates monocytic inflammasome and marks necrotic sites in vasculopathies. Nat. Commun. 2019;10:2961. doi: 10.1038/s41467-019-10766-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Consortium GT. The GTEx Consortium atlas of genetic regulatory effects across human tissues. Science. 2020;369:1318–1330. doi: 10.1126/science.aaz1776. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Harris TB, et al. Age, Gene/Environment Susceptibility-Reykjavik Study: multidisciplinary applied phenomics. Am. J. Epidemiol. 2007;165:1076–1087. doi: 10.1093/aje/kwk115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Gold L, et al. Aptamer-based multiplexed proteomic technology for biomarker discovery. PLoS ONE. 2010;5:e15004. doi: 10.1371/journal.pone.0015004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.Tuck MK, et al. Standard operating procedures for serum and plasma collection: early detection research network consensus statement standard operating procedure integration working group. J. Proteome Res. 2009;8:113–117. doi: 10.1021/pr800545q. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Hathout Y, et al. Large-scale serum protein biomarker discovery in Duchenne muscular dystrophy. Proc. Natl Acad. Sci. USA. 2015;112:7153–7158. doi: 10.1073/pnas.1507719112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 77.Max Kuhn, K. J. Applied Predictive Modeling (Springer, 2013).
- 78.Raffield LM, et al. Comparison of proteomic assessment methods in multiple cohort studies. Proteomics. 2020;20:e1900278. doi: 10.1002/pmic.201900278. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Liew G, Joachim N, Mitchell P, Burlutsky G, Wang JJ. Validating the AREDS simplified severity scale of age-related macular degeneration with 5- and 10-year incident data in a population-based sample. Ophthalmology. 2016;123:1874–1878. doi: 10.1016/j.ophtha.2016.05.043. [DOI] [PubMed] [Google Scholar]
- 80.Voigt AP, et al. Single-cell transcriptomics of the human retinal pigment epithelium and choroid in health and macular degeneration. Proc. Natl Acad. Sci. USA. 2019;116:24100–24107. doi: 10.1073/pnas.1914143116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 2010;33:1–22. doi: 10.18637/jss.v033.i01. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 82.Bowden J, Davey Smith G, Haycock PC, Burgess S. Consistent estimation in mendelian randomization with some invalid instruments using a weighted median estimator. Genet. Epidemiol. 2016;40:304–314. doi: 10.1002/gepi.21965. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Bowden J, Davey Smith G, Burgess S. Mendelian randomization with invalid instruments: effect estimation and bias detection through Egger regression. Int. J. Epidemiol. 2015;44:512–525. doi: 10.1093/ije/dyv080. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The custom-design Novartis SOMAscan data are available through a collaboration agreement with the Novartis Institutes for BioMedical Research (lori.jennings@novartis.com). Data from the AGES Reykjavik study are available through collaboration (AGES_data_request@hjarta.is) under a data usage agreement with the IHA. All access to data is controlled via the use of a subject-signed informed consent authorization. The time it takes to respond to requests varies depending on their nature and circumstances of the request, but it will not exceed 14 working days. All data supporting the conclusions of the paper are presented in the main text and freely available as a supplement to this manuscript (Supplementary Information and Supplementary Data 1–15). The GTEx database (https://gtexportal.org/home/) was used to obtain the eQTL data.