Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

medRxiv logoLink to medRxiv
[Preprint]. 2025 Jul 11:2024.05.29.24308170. Originally published 2024 May 31. [Version 2] doi: 10.1101/2024.05.29.24308170

Large-scale integration of omics and electronic health records to identify potential risk protein biomarkers and therapeutic drugs for cancer prevention

Qing Li, Qingyuan Song, Zhishan Chen, Jungyoon Choi, Victor Moreno, Jie Ping, Wanqing Wen, Chao Li, Xiang Shu, Jun Yan, Xiao-ou Shu, Qiuyin Cai, Jirong Long, Jeroen R Huyghe, Rish Pai, Stephen B Gruber, Yaohua Yang, Graham Casey, Xusheng Wang, Adetunji T Toriola, Li Li, Bhuminder Singh, Ken S Lau, Li Zhou, Zichen Zhang, Chong Wu, Ulrike Peters, Wei Zheng, Quan Long, Zhijun Yin, Xingyi Guo
PMCID: PMC11160851  PMID: 38853880

Abstract

Identifying risk protein targets and their therapeutic drugs is crucial for effective cancer prevention. Here, we conduct integrative and fine-mapping analyses of large genome-wide association studies data for breast, colorectal, lung, ovarian, pancreatic, and prostate cancers, and characterize 710 lead variants independently associated with cancer risk. Through mapping protein quantitative trait loci (pQTL) for these variants using plasma proteomics data from over 75,000 participants, we identify 365 proteins associated with cancer risk. Subsequent colocalization analysis identifies 101 proteins, including 74 not reported in previous studies. We further characterize 36 potential druggable proteins for cancers or other disease indications. Analyzing >3.5 million electronic health records, we conducted analyses of emulated trials for 11 drugs across 290 comparisons and identified three drugs significantly associated with reduced colorectal cancer risk: caffeine vs. paroxetine (HR, 0.51; 95% CI, 0.41–0.64), haloperidol vs. prochlorperazine (HR, 0.47; 95% CI, 0.33–0.68), and trazodone hydrochloride vs. paroxetine (HR, 0.49; 95% CI, 0.38–0.63). Conversely, caffeine was associated with increased cancer risk in comparisons with finasteride (colorectal cancer) and fluoxetine (breast cancer). Meta-analysis identified six drugs significantly associated with cancer risk, including acetazolamide, which was associated with reduced colorectal cancer risk (HR, 0.79; 95% CI, 0.72–0.87). This study identifies novel protein biomarkers and candidate drug targets across six major cancer types and highlights several approved drugs with potential chemopreventive effects.

Full Text Availability

The license terms selected by the author(s) for this preprint version do not permit archiving in PMC. The full text is available from the preprint server.


Articles from medRxiv are provided here courtesy of Cold Spring Harbor Laboratory Preprints

RESOURCES