Skip to main content
Scientific Reports logoLink to Scientific Reports
. 2024 May 18;14:11370. doi: 10.1038/s41598-024-62178-w

Potential drug targets for tumors identified through Mendelian randomization analysis

Na Song 1,2,#, Pingyu Shi 2,#, Kai Cui 2,#, Liqun Zeng 2, Ziwei Wang 2, Wenyu Di 1, Jinsong Li 1, Yanwu Fan 2, Zhanjun Li 3, Jinghang Zhang 1, Wei Su 1,, Haijun Wang 1,2,
PMCID: PMC11102463  PMID: 38762700

Abstract

According to the latest cancer research data, there are a significant number of new cancer cases and a substantial mortality rate each year. Although a substantial number of clinical patients are treated with existing cancer drugs each year, the efficacy is unsatisfactory. The incidence is still high and the effectiveness of most cancer drugs remains unsatisfactory. Therefore, we evaluated the human proteins for their causal relationship to for cancer risk and therefore also their potential as drug targets. We used summary tumors data from the FinnGen and cis protein quantitative trait loci (cis-pQTL) data from a genome-wide association study, and employed Mendelian randomization (MR) to explore the association between potential drug targets and nine tumors, including breast, colorectal, lung, liver, bladder, prostate, kidney, head and neck, pancreatic caners. Furthermore, we conducted MR analysis on external cohort. Moreover, Bidirectional MR, Steiger filtering, and colocalization were employed to validate the main results. The DrugBank database was used to discover potential drugs of tumors. Under the threshold of False discovery rate (FDR) < 0.05, results showed that S100A16 was protective protein and S100A14 was risk protein for human epidermal growth factor receptor 2-positive (HER-positive) breast cancer, phosphodiesterase 5A (PDE5A) was risk protein for colorectal cancer, and melanoma inhibitory activity (MIA) was protective protein for non-small cell lung carcinoma (NSCLC). And there was no reverse causal association between them. Colocalization analysis showed that S100A14 (PP.H4.abf = 0.920) and S100A16 (PP.H4.abf = 0.932) shared causal variation with HER-positive breast cancer, and PDE5A (PP.H4.abf = 0.857) shared causal variation with colorectal cancer (CRC). The MR results of all pQTL of PDE5A and MIA were consistent with main results. In addition, the MR results of MIA and external outcome cohort were consistent with main results. In this study, genetic predictions indicate that circulating S100 calcium binding protein A14 (S100A14) and S100 calcium binding protein A16 (S100A16) are associated with increase and decrease in the risk of HER-positive breast cancer, respectively. Circulating PDE5A is associated with increased risk of CRC, while circulating MIA is associated with decreased risk of NSCLC. These findings suggest that four proteins may serve as biomarkers for cancer prevention and as potential drug targets that could be expected for approval.

Keywords: Drug target, Biomarker, Plasma protein, Tumor, Mendelian randomization

Subject terms: Cancer, Drug discovery


According to the latest cancer survey data, there are a significant number of new cancer cases and substantial mortality rates each year. Malignant tumors such as lung cancer, colorectal cancer, and pancreatic cancer rank in the top ten in both incidence and mortality rates for both male and female1. Furthermore, prostate cancer remains the leading cause of death among men. Despite a significant decline in prostate cancer mortality rates, it still stands as the second leading cause of cancer-related deaths2. Similarly, although survival rates for breast cancer have improved significantly over the past few decades, it remains the most common cancer and the second deadliest cancer among female3. Therefore, cancer prevention and treatment are important in the field of scientific research. However, the identification of biomarkers for cancer prevention is still insufficient, and the unique mechanisms within the tumor microenvironment pose challenges for drug treatments46. Currently, commonly used anti-cancer drugs include 5-fluorouracil, capecitabine, paclitaxel, gemcitabine, and others. However, the effectiveness of these drugs remains limited to a small subset of patients7,8. In recent years, studies have attempted to discover novel and efficient cancer therapeutic drugs9,10. However, some findings encountered obstacles in the process of translating into clinical outcomes. For example, some anti-cancer drugs may encounter issues such as insufficient efficacy, ineffectiveness, or adverse reactions11. Revealing precancerous markers and identifying potential therapeutic targets may offer new insights into the future prevention and treatment of cancer.

Proteins play crucial roles in various biological processes in the human body, as well as in the progression of cancer12,13. Currently, due to the vital functions of proteins, multiple drug targets are primarily concentrated on proteins. In recent years, MR studies have been widely employed in drug target screening and drug repurposing14,15. MR is a recently emerging research method that employs genetic instrumental variables for causal association analysis16,17. Single nucleotide polymorphisms (SNPs) identified through genome-wide association studies are used as instrumental variables to assess the causal association between exposure and outcome. Here, the study of human plasma protein data using MR may help to uncover underlying genetic factors in tumors.

In our study, we first performed a two-sample MR analysis with plasma protein as exposure and 9 tumors in the FinnGen database as outcomes. Our results were then further explored by external queue validation, reverse MR, and co-localization analysis. Finally, four plasma proteins are being considered as potential drug targets for tumors, and Drugs that identify potential targets based on DrugBank database are considered potential therapeutics for tumors.

Materials and methods

Mendelian randomization (MR)

Mendelian Randomization (MR) is a novel analytical method that has emerged in recent years, utilizing statistical techniques to assess the impact of a specific factor on human diseases18. It utilizes instrumental variable to analysis, aiming to assess causal relationships between exposures and outcomes using non-experimental data. In MR analysis, genetic variation SNPs are considered instrumental variables. When DNA is passed from parents to offspring, alleles segregate independently. This method is akin to random assignment in randomized controlled trials (RCTs), with the purpose of simulating RCTs to minimize the risk of confounding19. However, before conducting MR analysis, three assumptions need to be met. First, the instrumental variables must be strongly correlated with the exposure factor. Second, the instrumental variables must be independent of confounding factors. Third, the instrumental variables should only affect the outcomes through the exposure factor20. In accordance with the principles and assumptions of Mendelian randomization, we have crafted a flowchart outlining the analysis procedure for this study (Fig. 1).

Figure 1.

Figure 1

Study design for identification of plasma proteins causally associated with tumors. pQTL: protein quantitative trait loci.

pQTL and GWAS data

The study design was shown in Fig. 1. We conducted a MR analysis to explore the causal association between plasma protein concentration (n = 4917) and nine types of tumors, including breast, colorectal, lung, liver, bladder, prostate, kidney, head and neck, pancreatic caners. Summary-level data of plasma pQTL was obtained from a comprehensive genome-wide association study (GWAS) published by Ferkingstad et al.21. In addition, we performed the analysis with cis-pQTLs as they are known for their resistance to horizontal pleiotropy22. Previously, a study has identified several potential drug targets for acute pancreatitis through MR analysis of cis-pQTLs for 4719 plasma proteins23. Here, we employed the same plasma protein screening approach as follows: Firstly, a rigorous association screening was performed on 4917 plasma protein pQTL (p < 5e-8), which be advantageous for excluding weak instrumental variables. Secondly, SNPs with linkage disequilibrium (LD) were eliminated based on the criteria of r2 = 0.001. Thirdly, to ensure that SNPs are not pleiotropy, the retained SNPs are limited to those with cis (Gene locus ± 1Mb). In addition, due to algorithmic reasons, SNPs with a low minor allele frequency (MAF) exhibit higher error rates, hence it is necessary to set the MAF threshold to 0.01. Additionally, we utilized PhenoScanner, a database containing published literature on GWAS data information, to control for confounding factors24. F-value is an indicator of the correlation between an instrumental variable and exposure, and the SNPs (F-value > 10) are strongly correlated with exposure25.

The tumor GWAS data were obtained from the R9 version of the FinnGen database (https://www.finngen.fi/en/access_results) (Supplement Table S1)26, including malignant neoplasm of breast (HER-negative) (num_cases = 5965, num_controls = 167,017), malignant neoplasm of breast (HER-positive) (num_cases = 9698, num_controls = 167,017), colorectal cancer (num_cases = 6509, num_controls = 287,137), non-small cell lung cancer (num_cases = 4901, num_controls = 287,137), small cell lung cancer (num_cases = 676, num_controls = 167,017), hepatocellular carcinoma (num_cases = 453, num_controls = 287,137), clear cell adenocarcinoma of kidney (num_cases = 901, num_controls = 167,017), malignant neoplasm of bladder (num_cases = 2053, num_controls = 167,017), malignant neoplasm of head and neck (num_cases = 2131, num_controls = 287,137), adenocarcinoma and ductal carcinoma of pancreas (num_cases = 692, num_controls = 287,137), malignant neoplasm of prostate (num_cases = 13,216, num_controls = 119,948). All data were derived from public databases and approved by the ethics review committee.

Data for validation

The GWAS summary data for colorectal cancer and lung cancer were derived from the UK Biobank database and were obtained from the Open GWAS online database (https://gwas.mrcieu.ac.uk). The study included a total of 5,657 cases and 372,016 controls for colorectal cancer, as well as 2,671 cases and 372,016 controls for lung cancer. Moreover, the GWAS summary data of breast cancer were obtained from the Breast Cancer Association Consortium (BCAC)27, encompassing a total of 69,501 cases and 105,974 controls. These data were used to validate the primary results.

MR analysis

We conducted a two-sample MR analysis using the retained SNPs to identify potential therapeutic targets for tumors. In this study, plasma protein concentration was employed as the exposure factor, while tumor served as the outcome factor. The methods of Inverse-variance weighted (IVW) and the Wald ratio were employed to assess the causal association between plasma protein concentration and tumors28. The Wald ratio was employed for the analysis of a single SNP, while the IVW method was utilized to evaluate multiple SNPs. The main results were screened based on the threshold condition of false discovery rate (FDR) < 0.0529, and the Benjamini–Hochberg method was employed for FDR calculation. In addition, we utilized the phenoscanner platform (http://www.phenoscanner.medschl.cam.ac.uk/) to fulfill the assumptions of independence and exclusivity in MR analysis30. SNPs associated with tumor risk factors and those directly associated with tumors were eliminated. Afterwards, we utilized the ‘harmonise_data’ and ‘mr’ function in TwoSampleMR to extract the same SNPs in the outcome and obtained the results. And the accuracy of the direction was verified by steiger test. Furthermore, we validated the main results with additional tumor GWAS data. For external queue validation results, the threshold was adjusted to p < 0.05. Finally, MR Analysis of all pQTL data (cis-pQTL and trans-pQTL) was also used as a sensitivity analysis in order to verify the main results. The all methods used for the sensitivity analysis are shown in Fig. 1. Additionally, to verify the reliability of the results, we conducted a statistical power analysis for the main outcomes using the online website https://sb452.shinyapps.io/power.

Reverse MR analysis

In order to exclude the bias caused by reverse causality, we conducted a bidirectional MR analysis. According to the same screening conditions (p < 5e-8, kb = 10,000, r2 = 0.001), SNPs were extracted from the GWAS data of breast tumor, colorectal cancer, and non-small cell lung cancer in the FinnGen database, with tumor as exposure and plasma protein concentration as outcome. IVW, MR-Egger, weighted median, simple mode, and weighted mode were used as the primary analysis methods3133. p < 0.05 was considered statistically significant.

Colocalization analysis

Colocalization analysis is a method of investigating whether a causal variation is shared between two traits and is used to enhance the results of MR Analysis34. The Bayesian colocalization method provides posterior probabilities of five hypotheses35. To assess the presence of shared causal variation in a specific genomic region between the two phenotypes, Bayesian colocalization analysis (pQTL-GWAS) was performed using SNPs located within 500 Mb from the lead SNP of all cis-pQTL with MAF > 0.0123. There are four hypotheses for co-localization analysis. H0: Phenotype 1 and Phenotype 2 are not significantly associated with any SNP loci in a specific genomic region. H1/H2: Phenotype 1 or Phenotype 2 is significantly associated with SNP loci in a specific genomic region. H3: Both Phenotype 1 and Phenotype 2 are significantly associated with SNP loci in a specific genomic region, but driven by different causal variant positions. H4: Both Phenotype 1 and Phenotype 2 are significantly associated with SNP loci in a specific genomic region, and driven by the same causal variant position. If H4 has a higher probability in statistical terms, it suggests that the significant signal locus can influence the phenotype. In this study, based on the coloc package, we evaluated the posterior probability of hypothesis 4 (PP.H4). PP.H4 > 0.8 was identified as being associated with a specific genomic region through shared causal variation between the protein and the tumor36, which is strong evidence for co-localization. 0.4 < PP.H4 < 0.8 was regarded as moderate co-localization.

Potential target drugs

Plasma proteins with strong co-localization and favorable results in sensitivity analyses were selected as first-class targets. In addition, the STRING database (https://string-db.org/) and DrugBank (https://go.drugbank.com/) were utilized to investigate whether discovered cancer therapeutic drug targets interact with the first-class targets37. furthermore, the DrugBank database is being used separately to explore potential cancer therapy drugs for first-class targets. The molecular docking of potential target drugs was performed on MCULE online platform with default settings (https://mcule.com/apps/1-click-docking/)3841. The platform will generate top four ranked of molecular docking results according to the drug molecules and targets provided, and we selected Rank 1 to prepare the figures.

Results

Proteomic screening of tumor-causing proteins

In our results, 5365 cis-pQTL of a total of 1743 proteins were used for follow-up analysis after a series of screening (Supplement Table S2). However, after extracting SNPs in the outcome, some lacked SNPs, leaving SNPs with only 1653 plasma proteins in every tumor. A total of 1005 proteins were causally associated with tumors at the p < 0.05 threshold (Supplement Table S3), while there were 6 significant outcomes at the FDR < 0.05 threshold (NCAN, GREM1, S100A16, S100A14, PDE5A, MIA), and only the following 4 proteins (S100A16, S100A14, PDE5A, MIA) were causally associated with tumors after removing the SNPs associated with the outcomes by phenoscanner (Fig. 2). An instrumental variable of NCAN, rs2228603, was associated with nonalcoholic fatty liver disease. The instrumental variables of GREM1, rs4779584 and rs144674978, were associated with colorectal cancer. The statistical significance of NCAN and GREM1 did not reach the threshold of FDR < 0.05. In non-small cell lung cancer, rs2279011 was excluded because of its association with smoking. There were causal associations between S100A14 (Wald ratio, OR = 2.11, 95% CI, 1.50–2.96, FDR = 0.0144) and S100A16 (IVW, OR = 0.70, 95% CI, 0.60–0.82, FDR = 0.0144) and HER-positive breast tumors, between PDE5A (IVW, OR = 1.53, 95% CI, 1.25–1.87, FDR = 0.0304) and colorectal cancer, and between MIA (IVW, OR = 0.82, 95% CI, 0.76–0.88, FDR = 0.0005) and non-small cell lung cancer. And they had the correct causal direction (p < 0.05). There was no heterogeneity (SNP > 2) and pleiotropy (SNP > 3) in results (Supplement Table S3). These results suggest that S100A16 may be a risk protein for HER-positive breast cancer, S100A14 may be a protective protein for HER-positive breast cancer, PDE5A may be a risk protein for colorectal cancer, and MIA may be a risk protein for non-small cell lung cancer. Additionally, the results have been validated as reliable through statistical power analysis (Supplement Table S4).

Figure 2.

Figure 2

MR results for plasma proteins on the risk of tumors. Causal association of four plasma proteins (S100A14, S100A16, PDE5A, MIA) with tumors based on MR analysis with Wald ratio and Inverse Variance Weighted methods.

Sensitivity analysis of tumor causal proteins

In our findings, four proteins were ultimately identified as potential drug targets for tumors, including S100A16, S100A14, PDE5A, and MIA. In addition, Reverse validation, Bayesian colocalization, MR analysis of the total pQTL, and validation of external queue were used as sensitivity analyses. In external queue validation, the results for S100A16 (IVW, OR = 0.99, 95% CI, 0.91–1.07, p = 0.722), S100A14 (Wald ratio, OR = 1.08, 95% CI, 0.90–1.30, p = 0.406) and PDE5A (Wald ratio, OR = 0.99, 95% CI, 0.98–1.00, p = 0.105) were not replicated, while the results for MIA (IVW, OR = 0.90, 95% CI, 0.83–0.97, p = 0.009) were replicated in non-small cell lung cancer (Fig. 3). And there was no heterogeneity and pleiotropy (p > 0.05) (Supplement Table S3). Moreover, in reverse MR Analysis, tumors were considered exposure, and plasma proteins were considered outcome. No causal association was found between tumors and plasma protein (Fig. 4). Meanwhile, to validate our main results, all pQTL data of four plasma proteins (cis-pQTL and trans-pQTL) were used for MR Analysis, the same findings were observed for plasma proteins PDE5A (IVW, OR = 1.39, 95% CI, 1.13–1.71, p = 2e-3) and MIA (IVW, OR = 0.84, 95% CI, 0.79–0.90, p = 2e-7) (Fig. 5). Interestingly, the results of colocalization analysis showed that S100A16 (coloc, PP.H4.abf = 0.933), S100A14 (coloc, PP.H4.abf = 0.913), and PDE5A (coloc, PP.H4.abf = 0.855) had the shared causal variation with HER-positive breast and colorectal cancers, respectively (Fig. 2). These results further validated the causal association between four plasma proteins and tumors.

Figure 3.

Figure 3

External queue validation of MR results for plasma proteins on the risk of tumors. External data for the four plasma proteins came from the UK Biobank and the Breast Cancer Association Consortium (BCAC).

Figure 4.

Figure 4

Reverse MR results for plasma proteins on the risk of tumors. There was no reverse causal effect on the primary MR results.

Figure 5.

Figure 5

MR results of all pQTL for plasma proteins on the risk of tumors. The results of PDE5A and MIA were replicated in all PQTL (cis-pQTL and Trans-pQTL) data as exposed.

Potential therapeutic drugs for tumors

Based on the above results, we considered PDE5A as a first-class therapeutic target, S100A16, S100A14 and MIA as second-class targets, and the other proteins with p < 0.05 threshold as third-class targets. While indirect potential drug targets were not identified through STRING, we discovered drugs in the medication database that may interact with the targets. Through the search of DrugBank database, we found that target of the marketed drugs Sildenafil, Vardenafil, Dipyridamole, Theophylline, Tadalafil and Avanafil was PDE5A, and they were used as inhibitors of PDE5A (Fig. 6). These results suggest that PDE5A inhibitor drugs may have a preventive effect as well as a potential therapeutic effect on colorectal cancer. However, there were no suitable drugs targeting S100A16, S100A14 and MIA. These findings suggest that targeting PDE5A holds significant potential as a therapeutic target for colorectal cancer in future clinical investigations.

Figure 6.

Figure 6

Potential target drugs for colorectal cancer based on MR analysis. Potential target drugs came from DrugBank database. (A) Potential drugs for the target PDE5A in colorectal cancer; (B-G) Molecular docking results of Sildenafil, Vardenafil, Dipyridamole, Theophylline, Tadalafil and Avanafil based on MCULE online platform.

Discussion

In this study, we primarily used the two-sample MR method to investigate the causal association between human plasma protein concentration and tumors, aiming to discover potential drug therapeutic targets and cancer prevention biomarkers. Based on our research, we set a criterion: Proteins with strong colocalization and robust sensitivity were identified as potential first-class targets, while proteins that met only one condition were identified as second-class potential targets. Ultimately, based on our analysis, S100A16 and S100A14 were validated as potential second-class drug targets for HER-positive breast cancer, PDE5A as potential first-class drug targets for colorectal cancer, and MIA as potential second-class drug targets for non-small cell lung cancer. In these findings, the causal relationship between MIA and lung cancer was further confirmed in external queue (ieu-b-4954), providing additional credibility to the potential drug targets. Our analysis also revealed that MR Analyses of trans-pQTL and cis-pQTL for PDE5A and MIA were consistent with the main results.

Firstly, in order to eliminate the effects of confounding factors and horizontal pleiotropy, PhenoScanner was used to remove SNPs associated with outcomes and confounding factors before main analysis. Secondly, evaluation results based on data from previous drug development programs showed that target indication pairings identified by MR and co-localization were more likely to be approved42. To assess the potential of these findings as targets for tumor drug therapy, we further evaluated the causal association between plasma proteins and tumors through colocalization analysis43. HH.P4 > 0.8 was considered evidence of strong colocalization. Three proteins identified by MR, S100A16, S100A14 and PDE5A, might share causal variables with HER-positive breast cancer and colorectal cancer. In addition, Reverse causation MR analysis was performed to remove the effects of reverse causation. Results showed no reverse causal association between plasma proteins and tumors.

In our analysis, S10014, S100A16 and MIA were regarded as second-class target of HER-positive breast cancer. S100A14 and S100A16 both belong to calcium-binding protein. These proteins were involved in various biological functions of the human body, such as cell signal transduction, cell proliferation, and apoptosis44,45. Studies had shown that S100A14 might be related to the breast cancer metastasis by promoting expression of chemokines CCL2 and CXCL546. Mechanistically, S100A14 activated NF-κB signaling to upregulate chemokine expression. This was consistent with our findings that S100A14 was considered a risk protein for HER-positive breast cancer. Similarly, another study showed that S100A14 was an independent prognostic factor in triple-negative breast cancer47. In addition, expression of S100A16 activated epithelial-mesenchymal transition and promoted breast cancer progression48, which was contrary to our findings. Interestingly, a study had shown that co-expression of S100A16 and S100A14 in breast cancer promoted the invasive ability of cancer cells; overexpression and knockdown of S100A14 could affect the expression of S100A1649. Therefore, we speculated that S100A14 might play a leading role in breast cancer. In addition, MIA was regarded as second-class target of non-small cell lung cancer in our analysis. MIA was a small secreted protein that was primarily secreted by but not limited to melanoma50. Overexpression of MIA not only promoted melanoma metastasis but also activated the invasive ability of pancreatic cancer cells. However, the association between MIA and lung cancer had not been studied so far. Moreover, PDE5A were regarded as first-class target of colorectal cancer. So far, multiple studies had shown PDE5A as a potential therapeutic target for tumors. PDE5A was overexpressed in breast cancer stroma and affected the growth of breast cancer cells by inducing the expression of chemokine CXCL1651. In addition, the use of the PDE5A inhibitors sildenafil or vardenafil enhanced apoptosis of human castration-resistant prostate cancer cells52. In colorectal cancer, the use of PDE5A inhibitors had been found to be associated with a favorable prognosis and lower metastasis rate in male with colorectal cancer53. To our knowledge, no previous studies had used MR combined with colocalization to explore the causal association between PDE5A and colorectal cancer. Our results provide evidence for the clinical translation of PDE5 inhibitors in treatment of colorectal cancer.

Although the number of potential therapeutic targets identified had increased in recent years, that had not translated into clinical outcomes. Based on the phenomenon, we used cis-pQTL for MR and colocalization analysis to identify potential therapeutic targets for tumors, which will be more likely to be approved.

Despite our study identifying potential targets and drugs advantageous for clinical cancer treatment, however, there are some limitations in this analysis. The limitations of this study are as follows: First, we extracted 4917 protein pQTLs from Ferkingstad et al.21 for exposure in MR analysis. After preliminary processing, only 1653 plasma proteins remained for analysis, and many potential plasma proteins still await validation. In addition, external data of exposure were not further verified. The results may be subject to some bias. Second, cis-pQTL was reserved for analysis, and some pQTL had only one instrumental variable, so it could not be analyzed for heterogeneity or pleiotropy. Although we subsequently added trans-pQTL, there may be some bias in the results. However, we subsequently enhanced the accuracy of the results by employing colocalization analysis. Thirdly, our study data originated from the decode data and the FinnGen cohort, both of which consist of European populations. Therefore, our results may be applicable primarily to European populations. In addition, it is well known that different tumor subtypes have different therapeutic strategies based on different information such as tumor marker expression, driver gene mutation, and tumor stage and grade, etc. In this study, because FinnGen database did not provide further clinical classification based on clinicopathological information such as gene alteration and tumor stages/grade, etc., we could not analyze the correlation between clinical subtypes of related tumors. Fourth, as we focused on exploring potential drug targets for cancer, the conclusions drawn have not been validated through clinical trials. Therefore, these results require further investigation.

Conclusions

Overall, our study employed MR analysis to investigate causal relationships between plasma proteins and nine types of cancer. We found that elevated S100A14 in circulation increased the risk of HER-positive breast cancer, while S100A16 decreased the risk of HER-positive breast cancer. PDE5A was associated with an increased risk of CRC, and MIA was linked to a reduced risk of SCLC. Additionally, we identified PDE5A, a target for erectile dysfunction drugs, as a potential therapeutic target for CRC, suggesting that such medications may serve as novel treatments by inhibiting PDE5A. However, further research is needed to validate these findings and understand their specific roles in tumor diagnosis and drug target development.

Supplementary Information

Acknowledgements

The authors are grateful to the participants and investigators of the FinnGen and decode study, as well as the open GWAS for providing the research information. We also thanks to Dr. Zhanjun Li for the help and guidance in molecular docking.

Author contributions

H.W and N.S conceived and coordinated the study. K.C and N.S wrote the paper. K.C, L.Z, J.Z and Y.F analyzed the data and Figures & Tables, P.S, Z.W, W.D, J.L and W.S for the data processing, N.S, K.C and H.W reviewed and revised the article. H.W and Z.L for the molecular docking work. All authors contributed to interpretation of data, manuscript revision, and critical discussion. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the Program for Young Key Teachers in Colleges and Universities in Henan Province (2020GGJS150, 2021GGJS104), and Key Medical Science and Technology Research Program Project of Henan Province (No. 20232028).

Data availability

All data for this study is available from the corresponding author upon reasonable request.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Na Song, Pingyu Shi and Kai Cui.

Contributor Information

Wei Su, Email: weisu_xxmu@163.com.

Haijun Wang, Email: 151044@xxmu.edu.cn.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-62178-w.

References

  • 1.Siegel RL, et al. Cancer statistics, 2023. CA Cancer J. Clin. 2023;73:17–48. doi: 10.3322/caac.21763. [DOI] [PubMed] [Google Scholar]
  • 2.Pinsky PF, et al. Screening for prostate cancer. N. Engl. J. Med. 2023;388:1405–1414. doi: 10.1056/NEJMcp2209151. [DOI] [PubMed] [Google Scholar]
  • 3.Nolan E, et al. Deciphering breast cancer: from biology to the clinic. Cell. 2023;186:1708–1728. doi: 10.1016/j.cell.2023.01.040. [DOI] [PubMed] [Google Scholar]
  • 4.Sherman MH, et al. Tumor microenvironment in pancreatic cancer pathogenesis and therapeutic resistance. Annu. Rev. Pathol. 2023;18:123–148. doi: 10.1146/annurev-pathmechdis-031621-024600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Cai M, et al. Current therapy and drug resistance in metastatic castration-resistant prostate cancer. Drug. Resist. Updat. 2023;68:100962. doi: 10.1016/j.drup.2023.100962. [DOI] [PubMed] [Google Scholar]
  • 6.Li F, et al. Regulation of cisplatin resistance in bladder cancer by epigenetic mechanisms. Drug Resist Updat. 2023;68:100938. doi: 10.1016/j.drup.2023.100938. [DOI] [PubMed] [Google Scholar]
  • 7.Vodenkova S, et al. 5-fluorouracil and other fluoropyrimidines in colorectal cancer: Past, present and future. Pharmacol. Ther. 2020;206:107447. doi: 10.1016/j.pharmthera.2019.107447. [DOI] [PubMed] [Google Scholar]
  • 8.Ashrafizadeh M, et al. New insight towards development of paclitaxel and docetaxel resistance in cancer cells: EMT as a novel molecular mechanism and therapeutic possibilities. Biomed. Pharmacother. 2021;141:111824. doi: 10.1016/j.biopha.2021.111824. [DOI] [PubMed] [Google Scholar]
  • 9.Verma S, et al. 'Targeting' improved outcomes with antibody-drug conjugates in non-small cell lung cancer-an updated review. Curr. Oncol. 2023;30:4329–4350. doi: 10.3390/curroncol30040330. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Desai A, et al. Antibody-drug conjugates: A promising novel therapeutic approach in lung cancer. Lung Cancer. 2022;163:96–106. doi: 10.1016/j.lungcan.2021.12.002. [DOI] [PubMed] [Google Scholar]
  • 11.Moreau Bachelard C, et al. Risks and benefits of anticancer drugs in advanced cancer patients: A systematic review and meta-analysis. EClinicalMedicine. 2021;40:101130. doi: 10.1016/j.eclinm.2021.101130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Gong Y, et al. PUMILIO proteins promote colorectal cancer growth via suppressing p21. Nat. Commun. 2022;13:1627. doi: 10.1038/s41467-022-29309-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wu J, et al. CMTM family proteins 1–8: Roles in cancer biological processes and potential clinical value. Cancer Biol. Med. 2020;17:528–542. doi: 10.20892/j.issn.2095-3941.2020.0032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Lin J, et al. Potential drug targets for multiple sclerosis identified through Mendelian randomization analysis. Brain. 2023;146:3364–3372. doi: 10.1093/brain/awad070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Rosoff DB, et al. Mendelian randomization study of PCSK9 and HMG-CoA reductase inhibition and cognitive function. J. Am. Coll. Cardiol. 2022;80:653–662. doi: 10.1016/j.jacc.2022.05.041. [DOI] [PubMed] [Google Scholar]
  • 16.Zhu G, et al. Chickenpox and multiple sclerosis: A Mendelian randomization study. J. Med. Virol. 2023;95:e28315. doi: 10.1002/jmv.28315. [DOI] [PubMed] [Google Scholar]
  • 17.Sun JX, et al. The association between human papillomavirus and bladder cancer: Evidence from meta-analysis and two-sample mendelian randomization. J. Med. Virol. 2023;95:e28208. doi: 10.1002/jmv.28208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Birney, E. Mendelian Randomization. Cold Spring Harb Perspect Med. 12, (2022). [DOI] [PMC free article] [PubMed]
  • 19.Larsson SC, et al. Mendelian randomization for cardiovascular diseases: Principles and applications. Eur. Heart J. 2023;44:4913–4924. doi: 10.1093/eurheartj/ehad736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Choi KW, et al. Assessment of bidirectional relationships between physical activity and depression among adults: A 2-sample Mendelian randomization study. JAMA Psychiatry. 2019;76:399–408. doi: 10.1001/jamapsychiatry.2018.4175. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Ferkingstad E, et al. Large-scale integration of the plasma proteome with genetics and disease. Nat. Genet. 2021;53:1712–1721. doi: 10.1038/s41588-021-00978-w. [DOI] [PubMed] [Google Scholar]
  • 22.Yang C, et al. Mendelian randomization and genetic colocalization infer the effects of the multi-tissue proteome on 211 complex disease-related phenotypes. Genome. Med. 2022;14:140. doi: 10.1186/s13073-022-01140-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Bourgault J, et al. Proteome-wide Mendelian randomization identifies causal links between blood proteins and acute pancreatitis. Gastroenterology. 2023;164:953–965.e953. doi: 10.1053/j.gastro.2023.01.028. [DOI] [PubMed] [Google Scholar]
  • 24.Staley JR, et al. PhenoScanner: a database of human genotype-phenotype associations. Bioinformatics. 2016;32:3207–3209. doi: 10.1093/bioinformatics/btw373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Bottigliengo D, et al. A Mendelian randomization study investigating the causal role of inflammation on Parkinson's disease. Brain. 2022;145:3444–3453. doi: 10.1093/brain/awac193. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Kurki MI, et al. FinnGen provides genetic insights from a well-phenotyped isolated population. Nature. 2023;613:508–518. doi: 10.1038/s41586-022-05473-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Michailidou K, et al. Association analysis identifies 65 new breast cancer risk loci. Nature. 2017;551:92–94. doi: 10.1038/nature24284. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Chen D, et al. Assessing causality between osteoarthritis with urate levels and gout: a bidirectional Mendelian randomization study. Osteoarthritis Cartilage. 2022;30:551–558. doi: 10.1016/j.joca.2021.12.001. [DOI] [PubMed] [Google Scholar]
  • 29.Gossmann A, et al. FDR-corrected sparse canonical correlation analysis with applications to imaging genomics. IEEE Trans. Med. Imaging. 2018;37:1761–1774. doi: 10.1109/TMI.2018.2815583. [DOI] [PubMed] [Google Scholar]
  • 30.Freuer D, et al. Association between inflammatory bowel disease and both psoriasis and psoriatic arthritis: A bidirectional 2-sample mendelian randomization study. JAMA Dermatol. 2022;158:1262–1268. doi: 10.1001/jamadermatol.2022.3682. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Huang D, et al. Association between COVID-19 and telomere length: A bidirectional Mendelian randomization study. J. Med. Virol. 2022;94:5345–5353. doi: 10.1002/jmv.28008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wu F, et al. Mendelian randomization study of inflammatory bowel disease and bone mineral density. BMC Med. 2020;18:312. doi: 10.1186/s12916-020-01778-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Li P, et al. Association between gut microbiota and preeclampsia-eclampsia: A two-sample Mendelian randomization study. BMC Med. 2022;20:443. doi: 10.1186/s12916-022-02657-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bouras E, et al. Circulating inflammatory cytokines and risk of five cancers: A Mendelian randomization analysis. BMC Med. 2022;20:3. doi: 10.1186/s12916-021-02193-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Han X, et al. Integrating genetics and metabolomics from multi-ethnic and multi-fluid data reveals putative mechanisms for age-related macular degeneration. Cell Rep. Med. 2023;4:101085. doi: 10.1016/j.xcrm.2023.101085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Chen Y, et al. Genomic atlas of the plasma metabolome prioritizes metabolites implicated in human diseases. Nat. Genet. 2023;55:44–53. doi: 10.1038/s41588-022-01270-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wishart, D. S. et al. DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res. 46, D1074-d1082 (2018). [DOI] [PMC free article] [PubMed]
  • 38.Muteeb, G. et al. Identification of a Potential Inhibitor (MCULE-8777613195-0-12) of New Delhi Metallo-beta-Lactamase-1 (NDM-1) Using In Silico and In Vitro Approaches. Molecules. 27, (2022). [DOI] [PMC free article] [PubMed]
  • 39.Odhar HA, et al. Molecular docking and dynamics simulation analysis of the human FXIIa with compounds from the Mcule database. Bioinformation. 2023;19:160–166. doi: 10.6026/97320630019160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Odhar HA, et al. Molecular docking enabled updated screening of the matrix protein VP40 from Ebola virus with millions of compounds in the MCULE database for potential inhibitors. Bioinformation. 2019;15:627–632. doi: 10.6026/97320630015627. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Onawole AT, et al. Identification of potential inhibitors against the Zika virus using consensus scoring. J. Mol. Graph. Model. 2017;73:54–61. doi: 10.1016/j.jmgm.2017.01.018. [DOI] [PubMed] [Google Scholar]
  • 42.Zheng J, et al. Phenome-wide Mendelian randomization mapping the influence of the plasma proteome on complex diseases. Nat. Genet. 2020;52:1122–1131. doi: 10.1038/s41588-020-0682-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Lin, Y. et al. Circulating inflammatory cytokines and female reproductive diseases: a Mendelian randomization analysis. J Clin Endocrinol Metab. (2023). [DOI] [PubMed]
  • 44.Zhu M, et al. Calcium-binding protein S100A14 induces differentiation and suppresses metastasis in gastric cancer. Cell Death Dis. 2017;8:e2938. doi: 10.1038/cddis.2017.297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Zhang WS, et al. S100a16 deficiency prevents hepatic stellate cells activation and liver fibrosis via inhibiting CXCR4 expression. Metabolism. 2022;135:155271. doi: 10.1016/j.metabol.2022.155271. [DOI] [PubMed] [Google Scholar]
  • 46.Li X, et al. A S100A14-CCL2/CXCL5 signaling axis drives breast cancer metastasis. Theranostics. 2020;10:5687–5703. doi: 10.7150/thno.42087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Ehmsen S, et al. S100A14 is a novel independent prognostic biomarker in the triple-negative breast cancer subtype. Int. J. Cancer. 2015;137:2093–2103. doi: 10.1002/ijc.29582. [DOI] [PubMed] [Google Scholar]
  • 48.Zhou W, et al. Up-regulation of S100A16 expression promotes epithelial-mesenchymal transition via Notch1 pathway in breast cancer. J. Biomed. Sci. 2014;21:97. doi: 10.1186/s12929-014-0097-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Tanaka M, et al. Co-expression of S100A14 and S100A16 correlates with a poor prognosis in human breast cancer and promotes cancer cell invasion. BMC Cancer. 2015;15:53. doi: 10.1186/s12885-015-1059-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.El Fitori J, et al. Melanoma Inhibitory Activity (MIA) increases the invasiveness of pancreatic cancer cells. Cancer Cell. Int. 2005;5:3. doi: 10.1186/1475-2867-5-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Catalano, S. et al. Phosphodiesterase 5 (PDE5) Is Highly Expressed in Cancer-Associated Fibroblasts and Enhances Breast Tumor Progression. Cancers. 11, (2019). [DOI] [PMC free article] [PubMed]
  • 52.Chang JF, et al. Phosphodiesterase type 5 (PDE5) inhibitors sensitize topoisomerase II inhibitors in killing prostate cancer through PDE5-independent impairment of HR and NHEJ DNA repair systems. Front Oncol. 2018;8:681. doi: 10.3389/fonc.2018.00681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Huang W, et al. Phosphodiesterase-5 inhibitors use and risk for mortality and metastases among male patients with colorectal cancer. Nat. Commun. 2020;11:3191. doi: 10.1038/s41467-020-17028-4. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data Availability Statement

All data for this study is available from the corresponding author upon reasonable request.


Articles from Scientific Reports are provided here courtesy of Nature Publishing Group

RESOURCES