Abstract
Genome-wide association studies (GWAS) have revealed sixteen risk loci for endoemtrial cancer but the identification of candidate susceptibility genes remains challenging. Here, we perform transcriptome-wide association study (TWAS) analyses using the largest endometrial cancer GWAS and gene expression from six relevant tissues, prioritizing eight candidate endometrial cancer susceptibility genes, one of which (EEFSEC) is located at a potentially novel endometrial cancer risk locus. We also show evidence of biologically relevant tissue-specific expression associations for CYP19A1 (adipose), HEY2 (ovary) and SKAP1 (whole blood). A phenome-wide association study demonstrates associations of candidate susceptibility genes with anthropometric, cardiovascular, diabetes, bone health and sex hormone traits that are related to endometrial cancer risk factors. Lastly, analysis of TWAS data highlights candidate compounds for endometrial cancer repurposing. In summary, this study reveals endometrial cancer susceptibility genes, including those with evidence of tissue specificity, providing insights into endometrial cancer aetiology and avenues for therapeutic development.
Subject terms: Cancer genetics, Endometrial cancer
Pik Fang Kho et al. conduct multi-tissue transcriptome-wide association studies of endometrial cancer risk. Their results identify potential susceptibility genes for endometrial cancer, and provide avenues for the development of future treatments for this disease.
Introduction
Endometrial cancer is the most commonly diagnosed gynaecological cancer in developed countries, with incidence expected to continue to increase over the next decade (reviewed in Morice et al.1). To explore the role of common genetic variation in endometrial cancer susceptibility, we recently conducted a genome-wide association study (GWAS) meta-analysis and identified 16 susceptibility loci2. Despite the success of GWAS in identifying genetic variants underlying endometrial cancer risk, biological interpretation remains challenging given that the majority of the GWAS risk variation is located in non-protein-coding regions (reviewed in O’Mara et al.3).
Trait-associated loci identified by GWAS have demonstrated enrichment for expression quantitative trait loci (eQTLs), suggesting that changes in gene expression link genetic variation and phenotypes4. As eQTL data are not typically available for GWAS samples, the transcriptome-wide association study (TWAS) approach has been developed to integrate eQTL and GWAS data from independent sample sets5. TWAS uses effect estimates from eQTLs to impute gene expression in GWAS samples, and subsequently associate the imputed gene expression with phenotypes. TWAS has an additional advantage of alleviating the multiple testing penalty of GWAS in statistical inference by testing the imputed expression of thousands of genes rather than millions of genetic variants.
TWAS has identified candidate causal genes for many complex traits, using eQTL data from disease-relevant tissues or cell types6–9. However, endometrial or uterine eQTL data are less commonly available than for other tissues or cell types (e.g., whole blood). Nevertheless, Mortlock et al.10 have shown that 85% of eQTLs found in the endometrium are shared across multiple tissues, suggesting that data from other tissues could be incorporated to perform endometrial cancer TWAS. Recently, innovative multi-tissue TWAS approaches that leverage shared eQTLs across multiple tissues have become available, and provide increased statistical power to identify candidate causal genes11,12. These multi-tissue TWAS approaches account for the correlation of gene expression between tissues and perform multivariate regression to predict gene expression from multiple tissues simultaneously.
In the current study, we identify eight candidate endometrial cancer susceptibility genes by conducting multi-tissue TWAS analyses, integrating GWAS data from the Endometrial Cancer Association Consortium2 and eQTLs from six related or biologically relevant tissues from the Genotype-Tissue Expression (GTEx) project4: two adipose tissue types (subcutaneous adipose and visceral omentum adipose), three gynaecological tissues (ovary, uterus and vagina) and whole blood. Two of the candidate susceptibility genes (AC021755.3 and EEFSEC) have not been previously reported and EEFSEC is located outside established endometrial cancer GWAS risk loci. Notably, follow-up analyses suggest evidence of tissue-specific gene expression associations. We also perform a phenome-wide association analysis to uncover traits that relate to candidate endometrial cancer susceptibility genes and provide insights into endometrial cancer aetiology. Finally, we explore drug repurposing opportunities for endometrial cancer by analysing TWAS findings using the Connectivity Map database13 and Open Target platform14, and highlight candidate compounds for endometrial cancer treatment
Results
Multi-tissue TWAS analyses identify eight candidate susceptibility genes for endometrial cancer
We performed two multi-tissue TWAS analyses using eQTL data from six related or biologically relevant tissues. The first multi-tissue TWAS analysis using S-MultiXcan11 revealed seven genes whose imputed expression associated with endometrial cancer susceptibility (Table 1). Given that the TWAS associations could result from linkage disequilibrium (LD) between genetic variants independently affecting gene expression and endometrial cancer risk, we performed colocalization analysis to reduce LD contamination15. The associations of CYP19A1, SKAP1, HEY2, EEFSEC, and EVI2A were supported by colocalization of eQTL and GWAS signals in at least one of the relevant tissues (Table 1); thus, these five genes were prioritized as candidate endometrial cancer susceptibility genes. The colocalization data also suggested tissue-specific associations for CYP19A1 (subcutaneous adipose), HEY2 (ovary) and SKAP1 (whole blood) (Table 1).
Table 1.
Locus | Gene | Ntissue | P-value | Zmin | Zmax | Zmean | ZSD | PPAdipose Sub. | PPAdipose Vis. | PPOvary | PPVagina | PPUterus | PPWB |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
3q21.3 | EEFSEC | 3 | 1.10E−06 | 2.98 | 5.07 | 4.08 | 1.05 | 0.85 | 0.85 | 0.01 | 0.09 | 0.02 | 0.88 |
6q22.31 | HEY2 | 1 | 9.94E−09 | −5.73 | −5.73 | −5.73 | NA | 0.00 | 0.00 | 0.89 | 0.01 | 0.02 | 0.00 |
15q21.2 | GLDN | 4 | 1.34E−12 | −2.02 | 3.65 | −0.23 | 2.62 | 0.00 | 0.00 | 0.01 | 0.02 | 0.00 | 0.00 |
15q21.2 | CYP19A1 | 2 | 9.52E−12 | 5.75 | 6.17 | 5.96 | 0.30 | 0.82 | 0.02 | 0.00 | 0.01 | NA | 0.28 |
17q11.2 | EVI2A | 3 | 1.50E−06 | −5.14 | −3.45 | −4.51 | 0.92 | 0.90 | 0.88 | 0.03 | 0.02 | 0.01 | 0.88 |
17q21.32 | SKAP1 | 2 | 7.27E−09 | −6.11 | −0.15 | −3.13 | 4.22 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.95 |
17q21.32 | SNX11 | 4 | 5.41E−07 | 3.49 | 5.38 | 4.63 | 0.83 | 0.03 | 0.03 | 0.03 | 0.02 | 0.03 | 0.02 |
The Bonferroni corrected P value threshold is 3.8 × 10−6 (accounting for 13,182 genes tested). Bolded findings have evidence of colocalization (PP > 0.8) in at least one tissue.
Ntissue number of tissues with available expression weights, Zmin minimum Z score from single tissue S-PrediXcan result, Zmax maximum Z score from single tissue S-PrediXcan result, Zmean mean Z score from single tissue S-PrediXcan result, ZSD standard deviation of Z score from single tissue S-PrediXcan result, PP posterior probability that endometrial cancer risk and eQTL variants from respective tissue colocalize, Adipose Sub. subcutaneous adipose, Adipose Vis. visceral omentum adipose, WB whole blood, NA not available.
The second multi-tissue TWAS analysis using joint-tissue imputation (JTI)16 revealed 11 unique genes whose imputed expression associated with endometrial cancer susceptibility in at least one of the six tissues studied (Table 2). Five of the seven genes (CYP19A1, SKAP1, HEY2, SNX11, and EEFSEC) initially revealed by S-MultiXcan (i.e. before prioritisation by colocalization analysis) were also identified using JTI. To prioritise the genes identified by JTI, we performed a Mendelian randomization (MR) causal inference analysis in the respective tissues using MR-JTI, providing five candidate endometrial cancer susceptibility genes: CYP19A1, SNX11, AC021755.3, EEFSEC, and EIF2AK4 (Table 2). These analyses indicated some tissue specificity in the predicted gene expression associations: CYP19A1 in adipose (subcutaneous and visceral omentum), AC021755.3 in uterus, and EEFSEC and EIF2AK4 in vagina. However, the colocalization analysis of the EEFSEC S-MultiXcan result suggested the involvement of other tissues (Table 1). Two candidate endometrial cancer susceptibility genes (CYP19A1 and EEFSEC) were prioritized using both S-MultiXcan-colocalization and MR-JTI approaches.
Table 2.
Tissue | Locus | Gene | JTI | MR-JTI | ||
---|---|---|---|---|---|---|
Z-score | P-value | Beta | 95% CI | |||
Subcutaneous adipose | 15q21.2 | CYP19A1 | 7.03 | 2.00E−12 | 0.35 | (0.04, 0.57) |
16p12.1 | NPIPB6 | 4.78 | 1.74E−06 | −0.07 | (−0.23, 0.33) | |
17q21.32 | SNX11 | 4.90 | 9.45E−07 | 0.35 | (0.04, 0.69) | |
17q21.32 | AC004477.3 | −4.89 | 1.03E−06 | −0.33 | (−0.66, 0.04) | |
Visceral omentum adipose | 15q21.2 | CYP19A1 | 6.40 | 1.51E−10 | 0.29 | (0.04, 0.50) |
17q21.32 | SNX11 | 4.86 | 1.19E−06 | 0.58 | (0.38, 0.83) | |
Ovary | 6q22.31 | HEY2 | −5.82 | 5.89E−09 | 0.13 | (−0.12, 0.36) |
15q15.1 | AC021755.2 | 4.74 | 2.09E−06 | −0.04 | (−0.27, 0.37) | |
15q15.1 | AC021755.3 | 4.92 | 8.61E−07 | −0.01 | (−0.31, 0.35) | |
15q15.1 | EIF2AK4 | 5.09 | 3.64E−07 | −0.08 | (−0.36, 0.24) | |
17q21.32 | AC004477.1 | 4.56 | 5.20E−06 | 0.02 | (−0.34, 0.25) | |
17q21.32 | SNX11 | 4.64 | 3.43E−06 | 0.52 | (0.25B 0.81) | |
Uterus | 15q15.1 | AC021755.3 | 4.78 | 1.77E−06 | 0.32 | (0.10, 0.54) |
Vagina | 3q21.3 | EEFSEC | 4.61 | 4.00E−06 | 0.36 | (0.20, 0.62) |
15q15.1 | EIF2AK4 | 5.31 | 1.07E−07 | 0.29 | (0.09, 0.49) | |
17q21.32 | SNX11 | 4.52 | 6.14E−06 | 0.08 | (−0.17, 0.35) | |
Whole blood | 3q21.3 | EEFSEC | 4.66 | 3.20E−06 | 0.13 | (−0.11, 0.40) |
15q21.2 | CYP19A1 | 6.03 | 1.61E−09 | 0.09 | (−0.08, 0.42) | |
17q21.32 | SKAP1 | −4.63 | 3.61E−06 | −0.09 | (−0.37, 0.26) | |
17q21.32 | SNX11 | 4.73 | 2.22E−06 | 0.40 | (0.20, 0.81) |
Bonferroni corrected P value thresholds were: 3.7 × 10−6 (13,526 genes tested) in JTIsubcutaneous adipose; 3.9 × 10−6 (12,949 genes tested) in JTIvisceral omentum adipose; 5.8 × 10−6 (8615 genes tested) in JTIOvary; 7.1 × 10−6 (7049 genes tested) in JTIuterus; 7.0 × 10−6 (7191 genes tested) in JTIvagina and 4.9 × 10−6 (10,140 genes tested) in JTIwhole blood. Bolded findings have evidence of causality in MR-JTI analyses.
Secondary TWAS analysis of the endometrioid endometrial cancer subtype using S-MultiXcan identified five genes, three of which (CYP19A1, HEY2, and SKAP1) passed the colocalization threshold (Supplementary Table 1); all three genes had also been identified in the main S-MultiXcan-colocalization analysis using all endometrial cancer cases (Table 1). Analysis of endometrioid endometrial cancer using JTI identified seven unique genes (Supplementary Table 2). Subsequent causal inference test by MR-JTI revealed two candidate genes for endometrioid endometrial cancer susceptibility (SNX11 and EIF2AK4 (Supplementary Table 2)), both of which had been identified in the main MR-JTI analysis using all endometrial cancer cases (Table 2). Thus, the two multi-tissue TWAS and associated analyses prioritized eight candidate endometrial cancer susceptibility genes: AC021755.3, EEFSEC, EIF2AK4, CYP19A1, HEY2, SNX11, EVI2A, and SKAP1. Gene expression of the candidate susceptibility genes was generally positively correlated across the solid tissues studied (Fig. 1a–h). However, gene expression in whole blood was occasionally inversely, or less well correlated, with expression levels in other tissues (e.g., EIF2AK4 (Fig. 1c), EVI2A (Fig. 1d), and SNX11 (Fig. 1h)).
Phenome-wide association study (PheWAS) analysis of candidate susceptibility genes highlights associations with traits related to endometrial cancer risk
PheWAS analysis of the eight candidate susceptibility genes revealed that four candidate genes (CYP19A1, EIF2AK4, EVI2A and SNX11) associated with 56 traits related to seven phenotypic categories: anthropometric, bone health, cardiovascular, diabetes, hematopoiesis, liver function, and sex hormones (Supplementary Table 3). Five of these categories (anthropometric, bone health, cardiovascular, diabetes, and sex hormones) contain traits that relate to or include genetically established endometrial cancer risk factors (e.g. body mass index2, LDL and HDL cholesterol17, insulinemia18, estradiol19, sex hormone-binding globulin (SHBG) and testosterone20). Of the PheWAS traits with available GWAS data, 13 demonstrated evidence of a significant genetic correlation with endometrial cancer, including SHBG, testosterone, the impedance of arm (related to body mass index) and apolipoprotein A (a key constituent of HDL) (Supplementary Table 4; Supplementary Fig. 1).
Drug repurposing analysis reveals candidates for endometrial cancer therapy
We integrated data from S-MultiXcan and JTI analyses with drug-induced gene expression profiles from the Connectivity Map database, identifying four compounds with profiles opposing the expression of genes associated with endometrial cancer risk (Table 3). None of these drugs have been approved for endometrial cancer treatment; however, enzastaurin has anti-cancer effects across endometrial cancer cell lines (mean IC50 = 30 μM) in the Genomics of Drug Sensitivity in Cancer database21. We also explored drug repurposing opportunities using the Open Targets platform to assess the eight candidate susceptibility genes. Only the protein encoded by CYP19A1 (aromatase) is targeted by clinically approved drugs, with indications for breast cancer and Cushing syndrome (Supplementary Table 5).
Table 3.
Connectivity score | Compound | Drug class | Clinically tested* (highest phase) | Indications |
---|---|---|---|---|
−93.60 | Methyl-2,5-dihydroxycinnamate | EGFR inhibitor | No | – |
−91.07 | Enzastaurin | Protein kinase C inhibitor | Yes (Phase III) | B-cell lymphoma and glioblastoma |
−90.09 | Prunetin | ABCG2 inhibitor | No | – |
−90.00 | Nocodazole | Tubulin inhibitor | No | – |
*ClinicalTrials.gov database (https://www.clinicaltrials.gov) accessed June 2021.
Discussion
Two multi-tissue TWAS approaches prioritized eight candidate susceptibility genes for endometrial cancer. Two of these genes (CYP19A1 and EEFSEC) were identified by both sets of analyses and four candidate susceptibility genes (AC021755.3, CYP19A1, EIF2AK4, HEY2, and SKAP1) demonstrated evidence of tissue specificity. Secondary analysis revealed that five of the eight candidate genes (CYP19A1, EIF2AK4, HEY2, SKAP1, and SNX11) were also candidate susceptibility genes for the endometrioid endometrial cancer subtype. All candidate susceptibility genes except for EEFSEC (3q21.3) are located at known endometrial cancer GWAS risk loci. EEFSEC encodes a eukaryotic elongation factor that is involved in the incorporation of selenocysteine into proteins. This gene has been shown to be necessary for the viability for a number of cancer cell lines, including two derived from endometrial tumors (https://depmap.org/portal/), but its specific function in endometrial carcinogenesis is unknown. The 3q21.3 locus has been preliminarily reported as a risk locus for endometrial cancer in a cross-cancer GWAS meta-analysis22 and may provide a genome-wide significant risk loci in future larger endometrial cancer GWAS.
Three of the TWAS candidate susceptibility genes (EIF2AK4, SKAP1, and SNX11) have been recently identified as potential regulatory targets of endometrial cancer GWAS risk variation through enhancer-promoter chromatin looping studies23. Although these data did not support the candidate susceptibility gene HEY2, bioinformatic analysis has previously suggested its targeting by endometrial cancer GWAS risk variation19.
In terms of the relationships between the candidate susceptibility genes and endometrial cancer, the best-characterized gene is CYP19A1. We have previously linked CYP19A1 to endometrial cancer susceptibility through genetic association with circulating estrogen19. CYP19A1 encodes the aromatase enzyme, responsible for a rate-limiting step in the synthesis of estrogen and unopposed estrogen exposure is one of the most well-established risk factors for endometrial cancer24. Consistent with its function, increased CYP19A1 expression was associated with increased endometrial cancer risk in our study. The S-MultiXcan-colocalization and MR-JTI analyses provided evidence that adipose-specific expression of CYP19A1 is linked to endometrial cancer risk, concordant with the production of estrogen by aromatase in the adipose of postmenopausal women25. Furthermore, drug repurposing analysis linked CYP19A1 to aromatase inhibitors (letrozole, anastrozole, exemestane and aminoglutethimide) which are used to treat breast cancer. Notably, letrozole has shown efficacy in treating endometrial hyperplasia26,27, a precursor to endometrial cancer; however, there is less evidence for the use of aromatase inhibitors to treat advanced endometrial cancer28–30.
S-MultiXcan-colocalization and MR-JTI analyses also suggested tissue-specific associations for expression of AC021755.3, EIF2AK4 HEY2, and SKAP1. The association of AC021755.3 with endometrial cancer risk was specific to predicted gene expression in the uterus, an organ whose lining consists of the endometrium. AC021755.3, a long non-coding RNA with unknown function, is located ~43 kb downstream of EIF2AK4 and has not been previously related to endometrial cancer. The MR-JTI association of EIF2AK4 was specific to predicted gene expression in the vagina although it should be noted that the JTI analysis showed nominally significant associations in both uterus and ovary (Supplementary Table 6). The S-MultiXcan-colocalization result for HEY2 was only significant in the ovary and it is interesting to note that HEY2 may play a role in the development of ovarian follicles31,32, which are an important source of sex hormones in premenopausal women. Lastly, the colocalisation of the S-MultiXcan SKAP1 association in whole blood is consistent with the function of the encoded protein which regulates T-cell receptor signaling33 and enhances conjugation of T-cells and antigen-presenting cells34.
PheWAS analysis of the candidate endometrial cancer susceptibility genes revealed associations with traits belonging to seven phenotypic categories, potentially yielding insight into endometrial cancer aetiology. Indeed, five of the categories relate to genetically established endometrial cancer risk factors2,17–20. The remaining categories (hematopoiesis and liver function) have not been previously linked with endometrial cancer risk. However, evidence suggests that traits related to red blood cells and platelets, including red blood cell distribution width (found in the current study), may provide diagnostic markers for endometrial cancer35. Furthermore, it was notable to find alkaline phosphatase among the liver traits as intestinal-type alkaline phosphatase levels have recently been predicted to be causally associated with endometrial cancer risk36 and we also observed a nominally significant genetic corrleation between alkaline phosphatase and endometrial cancer risk. GWAS data for such traits could be used to assess their effects on endometrial cancer risk using similar MR approaches to our studies of body mass index and blood lipids2,17. The PheWAS analysis may also provide information about the function of the candidate endometrial cancer susceptibility genes. For example, little is known about EVI2A but it was associated with traits such as SHBG and testosterone levels, which have been genetically related to endometrial cancer risk20, suggesting a role in hormone regulation.
We found four compounds that may counteract gene expression changes associated with endometrial cancer susceptibility. As drug targets supported by genetic evidence of disease association are more likely to receive clinical approval37,38, there is a strong rationale for future pre-clinical endometrial cancer sensitivity studies of candidate compounds identified in this analysis. One of the compounds, enzastaurin, has already demonstrated anti-cancer effects in endometrial cancer cell lines and thus may have the potential for drug repurposing. Indeed, it has shown clinical benefit in the treatment of serous endometrial cancer in a phase I study of solid tumors39. It is also noteworthy that the tubulin inhibitor nocodazole was identified, as the tubulin inhibitor paclitaxel is currently used to treat endometrial cancer patients with advanced disease40.
We acknowledge some limitations to this study such as small eQTL sample sizes, in particular for uterus, arguably the most relevant tissue for investigating endometrial cancer. However, we have used two approaches that leverage genetic regulation shared across tissues to improve the prediction of gene expression in relevant tissues. Furthermore, we have applied colocalization and MR-JTI analyses to identify candidate causal gene expression associations with endometrial cancer risk, two of which were prioritized by both analyses, and reduce false-positive findings. Differences in the results between the two analyses could be related to the genetic variants used: colocalization analysis was performed using all variants in the gene vicinity whereas MR-JTI analysis was performed using independent variants (r2 < 0.01) in the vicinity. Thus, the extent to which the colocalization/MR-JTI results represent genuine causal relationships remains unclear. Further functional studies will be necessary to determine the biological effects of the candidate susceptibility genes identified in this study on experimental phenotypes related to endometrial cancer risk.
In summary, our study has highlighted the value of using a multi-tissue TWAS, with priortization for cauaslity, to identify candidate endometrial cancer susceptibility genes, including biologlically relevant tissue-specific findings. As well as generating insights into biological mechanisms underlying endometrial cancer risk, and identifying a potential novel risk locus, we have highlighted compounds with potential for endometrial cancer repurposing and provided avenues for future studies of endometrial cancer aetiology.
Methods
Endometrial cancer risk GWAS data
We used summary data from the largest endometrial cancer GWAS meta-analysis to date, comprising 12,906 cases and 108,979 controls of European ancestry2,41. All participants provided informed consent and these studies are approved by research ethics committees from QIMR Berghofer Medical Research Institute, University-Clinic Erlangen, Karolinska Institutet, UZ Leuven, The Mayo Clinic, The Hunter New England Health District, The Regional Committees for Medical and Health Research Ethics Norway, and the UK National Research Ethics Service (04/Q0803/148 and 05/MRE05/1). Endometrial cancer subtype analyses were additionally performed using GWAS summary data restricting cases to endometrioid endometrial cancer patients (8758 cases and 46,126 controls)42. Detailed descriptions of the quality control procedures and GWAS analysis can be found in the corresponding publication2. Briefly, genetic variants with MAF < 1%, genotyping calling rate <98%, genotype missingness above 10%, and deviation from Hardy-Weinberg equilibrium (P < 10−12 in cases and P < 10−7 in controls) were excluded from the GWAS analysis. After quality control exclusion, 9.5 million variants remained in the GWAS analysis. For TWAS analysis, only variants located outside the major histocompatibility complex region (chr6:26Mb–34 Mb), were used.
eQTL data
We used cis-eQTL summary statistics and gene expression data from subcutaneous adipose, visceral omentum adipose, ovary, uterus, vagina and whole blood that are available in the GTEx Project (version 8). Cis-eQTL summary statistics for these tissues and information about respective sample sizes are available from the GTEx website (https://gtexportal.org/home/).
Multi-tissue TWAS analysis
S-MultiXcan aggregates single-tissue TWAS results to test the joint effects of gene expression variations from multiple tissues on the trait. To perform S-MultiXcan, we used S-PrediXcan11 to incorporate endometrial cancer GWAS and eQTL data from each of the six relevant GTEx tissues (version 8) and identify genes associated with endometrial cancer risk. Pre-specified weights based on covariance of genetic variants were used in S-PrediXcan analysis and obtained from PredictDB data repository (http://predictdb.org/). S-PrediXcan data from all tissues were then jointly analysed using multivariate regression in S-MultiXcan11. JTI leverages shared genetic regulation across tissues to improve the prediction performance of gene expression in each tissue16. For this analysis, we incorporated eQTL data from each of the six relevant GTEx tissues using pre-computed weights based on the covariance of genetic variants43. Bonferroni correction was applied to identify statistically significant TWAS genes from S-MultiXcan and JTI. S-PrediXcan, S-MultiXcan and JTI analyses of endometrial cancer GWAS were performed using SPrediXcan.py that was made available on Github (https://github.com/hakyimlab/MetaXcan).
Colocalization and MR analyses
As the residual LD amongst genetic variants used to generate gene expression weights in S-MultiXcan and JTI could induce spurious association on nearby non-causal genes, we performed analyses to assess the co-occurrence of endometrial cancer risk and eQTL signals. These subsequent analyses enabled us to tease apart pleiotropic associations (where the same genetic variant is associated with gene expression and disease risk) from linkage (where correlated genetic variants are associated with gene expression and disease risk independently). For S-MultiXcan associations, colocalization analysis was performed using the COLOC package with default parameters in R15 and the input was identical to the TWAS analysis but restricted to variants within 1 Mb up- and down-stream of the S-MultiXcan identified genes. Full summary statistics of these genes from relevant tissues were retrieved from GTEx portal (version 8 release; https://gtexportal.org/home/). COLOC provided posterior probabilities for the GWAS and eQTL signals to be explained by the same genetic variant (Hypothesis 4, colocalization). We considered a posterior probability for colocalization >0.80 for pleiotropic association as evidence of colocalization within a locus. For JTI associations we performed MR-JTI16, using the code made available by Gamazon Lab on Github (https://github.com/gamazonlab/MR-JTI). The input was similar to colocalization analysis but further restricted to independent variants (LD < 0.01) that are associated with the expression of JTI identified genes. Full summary statistics for these JTI identified genes were also retrieved from the GTEx portal. LD scores for each independent variant were obtained from pre-computed LD scores of 1000 Genome European ancestry samples that are made available in BOLT-LMM44. Bonferroni adjustment was applied for multiple test correction. Genes that passed S-MultiXcan-colocalization or MR-JTI were prioritized as candidate endometrial cancer susceptibility genes.
Cross-tissue gene expression correlations
Gene expression (transcripts per million) data from subcutaneous adipose, visceral omentum adipose, ovary, uterus, vagina, and whole blood were downloaded from the GTEx (v8) project. Pairwise correlations of expression of individual genes across tissues were calculated by the base R function “cor” using pairwise complete observations (i.e., use = “pairwise.complete.obs”).
PheWAS analyses
We assessed pleiotropic associations of the candidate endometrial cancer susceptibility genes by performing a PheWAS analysis in the CTG-VIEW database45. The PheWAS analsyis was performed using summary-data-based MR46 results from ~1600 phenotypes that are available from this platform. Analyses were performed using eQTL data from relevant or well-powered tissues (i.e., subcutaneous adipose, visceral omentum adipose, ovary, uterus, vagina, and whole blood) and included a heterogeneity test (HEIDI) to identify candidate genes that are affected by the same GWAS risk variant, comparable to our colocalization analysis. Genes passing a Bonferroni correction (PSMR < 3.1 × 10−5 accounting for 1600 phenotypes examined) and no evidence of heterogeneity (PHEIDI > 0.05) were defined as having PheWAS associations with candidate susceptibility genes. To explore the shared genetics between endometrial cancer and PheWAS traits, pairwise genetic correlations were performed using LD score regression. Publicly available GWAS summary statistics for endometrial cancer and PheWAS traits were used in this analysis and were derived from O’Mara et al.2 and the Neale Laboratory (http://www.nealelab.is/uk-biobank), respectively. Bonferroni correction was used to evaluate the statistical significance of the genetic correlation result.
Computational drug repurposing analysis
We used the Connectivity Map platform13 to identify compounds with gene expression profiles opposing the endometrial cancer risk TWAS data. As Connectivity Map analysis requires at least ten upregulated and ten downregulated coding genes as input, we relaxed the threshold in S-MultiXcan and JTI associations to a false discovery rate (FDR)-adjusted P-value < 0.1 to include sufficient unique upregulated and downregulated (n = 10 of each) genes for Connectivity Map analysis (Supplementary Tables 6 and 7). Connectivity scores were generated based on a modified Kolmogorov-Smirnov score, which summarises the relationship of the endometrial cancer TWAS gene expression profile with the drug-induced gene expression profile across cancer cell types in the Connectivity Map database. A connectivity score ≤ −90 suggest that the expression of the query genes opposes the drug-induced gene expression profile. We also used the Open Targets platform14 to determine if candidate endometrial cancer susceptibility genes encode known targets for drugs that have been clinically studied.
Statistics and reproducibility
Statistical analyses were primarily performed using Python (https://www.python.org/), R (https://www.r-project.org/), and Plink (https://www.cog-genomics.org/plink/). Bonferroni correction was used to account for multiple testing in multi-tissue TWAS analyses, colocalization and MR analyses, and PheWAS analyses.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Supplementary information
Acknowledgements
We thank members of the the Endometrial Cancer Association Consortium for contributing their study data, the many women who participated in these studies, and the numerous institutions and their staff who supported recruitment. A full list of consortium members and acknowledgements can be found in the Supplementary Note. We thank Dr Dan Zhou for his help in interpreting MR-JTI results of this study. This work was supported by a National Health and Medical Research Council (NHMRC) Project Grant (APP1109286). P.F.K. is supported by an Australian Government Research Training Program PhD Scholarship and QIMR Berghofer Postgraduate Top-Up Scholarship. T.O.M. and A.B.S. are supported by an NHMRC Investigator Fellowships (APP1173170 & APP177524).
Author contributions
P.F.K., T.O.M. and D.M.G. designed the study. P.F.K. and X.W. conducted analyses. P.F.K., T.O.M. and D.M.G. interpreted the results. G.C.P. provided resources for phenome-wide analyses. T.D., E.L.G., R.J.S., A.B.S., T.O.M. and the Endometrial Cancer Association Consortium provided endometrial cancer risk GWAS resources. P.F.K., T.O.M. and D.M.G. drafted the manuscript. T.O.M. and D.M.G. supervised the study. A.B.S. and T.O.M. acquired funding. All authors provided critical review of the manuscript.
Data availability
Summary-level GWAS meta-analysis results for endometrial cancer that support the findings of this study are available at the NHGRI-EBI GWAS Catalog (endometrial cancer: https://www.ebi.ac.uk/gwas/studies/GCST006464; endometrioid endometrial cancer: https://www.ebi.ac.uk/gwas/studies/GCST006465). Cis-eQTL summary statistics are accessible from the GTEx website (https://gtexportal.org/home/) and gene expresion data for candidate susceptibility genes are available in Supplementary Data 1. Pre-specified gene expression weights were obtained from PredictDB data repository (http://predictdb.org/). Other data generated and/or analyzed during this study are included in this article and its supplementary information files or are available on reasonable request.
Code availability
We used publicly available software (URLs listed below) in this research: S-PrediXcan and S-MultiXcan (https://github.com/hakyimlab/MetaXcan); COLOC (https://cran.r-project.org/web/packages/coloc/index.html); JTI and MR-JTI (https://github.com/gamazonlab/MR-JTI); and Connectivity Map (https://clue.io/cmap). The computer code used in this research is available at https://github.com/pikfang/TWAS.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Communications Biology thanks Arjun Bhattacharya and the other, anonymous, reviewers for their contribution to the peer review of this work. Primary Handling Editor: George Inglis. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Tracy A. O’Mara, Dylan M. Glubb.
Supplementary information
The online version contains supplementary material available at 10.1038/s42003-021-02745-3.
References
- 1.Morice P, Leary A, Creutzberg C, Abu-Rustum N, Darai E. Endometrial cancer. Lancet. 2016;387:1094–1108. doi: 10.1016/S0140-6736(15)00130-0. [DOI] [PubMed] [Google Scholar]
- 2.O’Mara TA, et al. Identification of nine new susceptibility loci for endometrial cancer. Nat. Commun. 2018;9:3166. doi: 10.1038/s41467-018-05427-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.O’Mara TA, Glubb DM, Kho PF, Thompson DJ, Spurdle AB. Genome-wide association studies of endometrial cancer: latest developments and future directions. Cancer Epidemiol. Biomark. Prev. 2019;28:1095–1102. doi: 10.1158/1055-9965.EPI-18-1031. [DOI] [PubMed] [Google Scholar]
- 4.Consortium GT, et al. Genetic effects on gene expression across human tissues. Nature. 2017;550:204–213. doi: 10.1038/nature24277. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gusev A, et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 2016;48:245–252. doi: 10.1038/ng.3506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Wu L, et al. A transcriptome-wide association study of 229,000 women identifies new candidate susceptibility genes for breast cancer. Nat. Genet. 2018;50:968–978. doi: 10.1038/s41588-018-0132-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Gusev A, et al. A transcriptome-wide association study of high-grade serous epithelial ovarian cancer identifies new susceptibility genes and splice variants. Nat. Genet. 2019;51:815–823. doi: 10.1038/s41588-019-0395-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bien SA, et al. Genetic variant predictors of gene expression provide new insight into risk of colorectal cancer. Hum. Genet. 2019;138:307–326. doi: 10.1007/s00439-019-01989-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Lamontagne M, et al. Leveraging lung tissue transcriptome to uncover candidate causal genes in COPD genetic associations. Hum. Mol. Genet. 2018;27:1819–1829. doi: 10.1093/hmg/ddy091. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mortlock S, et al. Tissue specific regulation of transcription in endometrium and association with disease. Hum. Reprod. 2020;35:377–393. doi: 10.1093/humrep/dez279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Barbeira AN, et al. Integrating predicted transcriptome from multiple tissues improves association detection. PLoS Genet. 2019;15:e1007889. doi: 10.1371/journal.pgen.1007889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hu Y, et al. A statistical framework for cross-tissue transcriptome-wide association analysis. Nat. Genet. 2019;51:568–576. doi: 10.1038/s41588-019-0345-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Subramanian A, et al. A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell. 2017;171:1437–1452.e1417. doi: 10.1016/j.cell.2017.10.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Carvalho-Silva D, et al. Open targets platform: new developments and updates two years on. Nucleic Acids Res. 2019;47:D1056–D1065. doi: 10.1093/nar/gky1133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Giambartolomei C, et al. Bayesian test for colocalisation between pairs of genetic association studies using summary statistics. PLoS Genet. 2014;10:e1004383. doi: 10.1371/journal.pgen.1004383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Zhou D, et al. A unified framework for joint-tissue transcriptome-wide association and Mendelian randomization analysis. Nat. Genet. 2020;52:1239–1246. doi: 10.1038/s41588-020-0706-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Kho PF, et al. Mendelian randomization analyses suggest a role for cholesterol in the development of endometrial cancer. Int J. Cancer. 2021;148:307–319. doi: 10.1002/ijc.33206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Nead, K. T. et al. Evidence of a causal association between insulinemia and endometrial cancer: a Mendelian randomization analysis. J. Natl. Cancer Inst.10.1093/jnci/djv178 (2015). [DOI] [PMC free article] [PubMed]
- 19.Thompson DJ, et al. CYP19A1 fine-mapping and Mendelian randomization: estradiol is causal for endometrial cancer. Endocr. Relat. Cancer. 2016;23:77–91. doi: 10.1530/ERC-15-0386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ruth KS, et al. Using human genetics to understand the disease impacts of testosterone in men and women. Nat. Med. 2020;26:252–258. doi: 10.1038/s41591-020-0751-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Yang W, et al. Genomics of drug sensitivity in cancer (GDSC): a resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 2013;41:D955–D961. doi: 10.1093/nar/gks1111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Kar, S. P. et al. Combining genome-wide studies of breast, prostate, ovarian and endometrial cancers maps cross-cancer susceptibility loci and identifies new genetic associations. Preprint at https://www.biorxiv.org/content/10.1101/2020.06.16.146803v1 (2020).
- 23.O’Mara, T. A., Spurdle, A. B., Glubb, D. M. & Endometrial Cancer Association, C. Analysis of promoter-associated chromatin interactions reveals biologically relevant candidate target genes at endometrial cancer risk loci. Cancers10.3390/cancers11101440 (2019). [DOI] [PMC free article] [PubMed]
- 24.Ignatov, A. & Ortmann, O. Endocrine risk factors of endometrial cancer: polycystic ovary syndrome, oral contraceptives, infertility, tamoxifen. Cancers10.3390/cancers12071766 (2020). [DOI] [PMC free article] [PubMed]
- 25.Zhao H, Zhou L, Shangguan AJ, Bulun SE. Aromatase expression and regulation in breast and endometrial cancer. J. Mol. Endocrinol. 2016;57:R19–R33. doi: 10.1530/JME-15-0310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Moradan S, Nikkhah N, Mirmohammadkhanai M. Comparing the Administration of Letrozole and Megestrol Acetate in the Treatment of Women with Simple Endometrial Hyperplasia without Atypia: A Randomized Clinical Trial. Adv. Ther. 2017;34:1211–1220. doi: 10.1007/s12325-017-0509-8. [DOI] [PubMed] [Google Scholar]
- 27.Tabatabaie A, et al. Comparing letrozole with medroxyprogesterone acetate (MPA) as hormonal therapy for simple endometrial hyperplasia without atypia in adult and middle-aged women. Eur. J. Gynaecol. Oncol. 2013;34:552–555. [PubMed] [Google Scholar]
- 28.Rose PG, et al. A phase II trial of anastrozole in advanced recurrent or persistent endometrial carcinoma: a Gynecologic Oncology Group study. Gynecol. Oncol. 2000;78:212–216. doi: 10.1006/gyno.2000.5865. [DOI] [PubMed] [Google Scholar]
- 29.Ma BB, et al. The activity of letrozole in patients with advanced or recurrent endometrial cancer and correlation with biological markers−a study of the National Cancer Institute of Canada Clinical Trials Group. Int J. Gynecol. Cancer. 2004;14:650–658. doi: 10.1111/j.1048-891X.2004.14419.x. [DOI] [PubMed] [Google Scholar]
- 30.Lindemann K, et al. Examestane in advanced or recurrent endometrial carcinoma: a prospective phase II study by the Nordic Society of Gynecologic Oncology (NSGO) BMC Cancer. 2014;14:68. doi: 10.1186/1471-2407-14-68. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Trombly DJ, Woodruff TK, Mayo KE. Suppression of Notch signaling in the neonatal mouse ovary decreases primordial follicle formation. Endocrinology. 2009;150:1014–1024. doi: 10.1210/en.2008-0213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Terauchi KJ, Shigeta Y, Iguchi T, Sato T. Role of Notch signaling in granulosa cell proliferation and polyovular follicle induction during folliculogenesis in mouse ovary. Cell Tissue Res. 2016;365:197–208. doi: 10.1007/s00441-016-2371-4. [DOI] [PubMed] [Google Scholar]
- 33.Wu L, Fu J, Shen SH. SKAP55 coupled with CD45 positively regulates T-cell receptor-mediated gene transcription. Mol. Cell Biol. 2002;22:2673–2686. doi: 10.1128/MCB.22.8.2673-2686.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Wang H, et al. SKAP-55 regulates integrin adhesion and formation of T cell-APC conjugates. Nat. Immunol. 2003;4:366–374. doi: 10.1038/ni913. [DOI] [PubMed] [Google Scholar]
- 35.Zhang H, Liang K, Ke L, Tang S. Clinical application of red cell distribution width, mean platelet volume, and cancer antigen 125 detection in endometrial cancer. J. Clin. Lab Anal. 2020;34:e23309. doi: 10.1002/jcla.23309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Zhu, J. et al. Associations between genetically predicted circulating protein concentrations and endometrial cancer risk. Cancers10.3390/cancers13092088 (2021). [DOI] [PMC free article] [PubMed]
- 37.Nelson MR, et al. The support of human genetic evidence for approved drug indications. Nat. Genet. 2015;47:856–860. doi: 10.1038/ng.3314. [DOI] [PubMed] [Google Scholar]
- 38.King EA, Davis JW, Degner JF. Are drug targets with genetic support twice as likely to be approved? Revised estimates of the impact of genetic support for drug mechanisms on the probability of drug approval. PLoS Genet. 2019;15:e1008489. doi: 10.1371/journal.pgen.1008489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Nwankwo N, et al. Phase I study of enzastaurin and bevacizumab in patients with advanced cancer: safety, efficacy and pharmacokinetics. Investigational N. Drugs. 2013;31:653–660. doi: 10.1007/s10637-012-9850-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Burke WM, et al. Endometrial cancer: a review and current management strategies: Part II. Gynecologic Oncol. 2014;134:393–402. doi: 10.1016/j.ygyno.2014.06.003. [DOI] [PubMed] [Google Scholar]
- 41.O’Mara, T. A. et al. Endometrial cancer GWAS dataset. GWAS Cataloghttps://www.ebi.ac.uk/gwas/studies/GCST006464 (2018).
- 42.O’Mara, T. A. et al. Endometrioid endometrial cancer GWAS dataset. GWAS Cataloghttps://www.ebi.ac.uk/gwas/studies/GCST006465 (2018).
- 43.Gamazon, E. R. & Zhou, D. C. JTI. 10.5281/zenodo.3842289 (2020).
- 44.Loh PR, Kichaev G, Gazal S, Schoech AP, Price AL. Mixed-model association for biobank-scale datasets. Nat. Genet. 2018;50:906–908. doi: 10.1038/s41588-018-0144-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Cuéllar-Partida, G. et al. Complex-traits genetics virtual lab: a community-driven web platform for post-GWAS analyses. Preprint at bioRxiv10.1101/518027 (2019).
- 46.Zhu Z, et al. Integration of summary data from GWAS and eQTL studies predicts complex trait gene targets. Nat. Genet. 2016;48:481–487. doi: 10.1038/ng.3538. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Summary-level GWAS meta-analysis results for endometrial cancer that support the findings of this study are available at the NHGRI-EBI GWAS Catalog (endometrial cancer: https://www.ebi.ac.uk/gwas/studies/GCST006464; endometrioid endometrial cancer: https://www.ebi.ac.uk/gwas/studies/GCST006465). Cis-eQTL summary statistics are accessible from the GTEx website (https://gtexportal.org/home/) and gene expresion data for candidate susceptibility genes are available in Supplementary Data 1. Pre-specified gene expression weights were obtained from PredictDB data repository (http://predictdb.org/). Other data generated and/or analyzed during this study are included in this article and its supplementary information files or are available on reasonable request.
We used publicly available software (URLs listed below) in this research: S-PrediXcan and S-MultiXcan (https://github.com/hakyimlab/MetaXcan); COLOC (https://cran.r-project.org/web/packages/coloc/index.html); JTI and MR-JTI (https://github.com/gamazonlab/MR-JTI); and Connectivity Map (https://clue.io/cmap). The computer code used in this research is available at https://github.com/pikfang/TWAS.