Skip to main content
Frontiers in Molecular Biosciences logoLink to Frontiers in Molecular Biosciences
editorial
. 2024 Mar 13;11:1395607. doi: 10.3389/fmolb.2024.1395607

Editorial: Medical big data in cancer research

Jingxin Mao 1,2,*, Ke Zheng 3, Xiong Weng 4
PMCID: PMC10965761  PMID: 38545415

According to the data of the Global Cancer Report, the incidence rate and mortality rate of cancer was reached 19.29 million and 9.96 million respectively in 2020. Cancer has become the primary problem affecting the life and health of all human beings, posing a serious threat to human life and health (Xi and Xu, 2021). With the increase in the number of cancer patients, the clinical medical data generated is also becoming increasingly large. As a traditional medical record model, doctor workstations can no longer meet the needs of researchers in mining, analyzing, and utilizing clinical data (Bray et al., 2018). In the field of cancer research, big data analysis plays a crucial role. Its profound heritage lies in its ability to provide rich case data, enabling medical researchers to understand the hidden connections and patterns between cases (Hassan et al., 2022). Moreover, big data analysis can provide strong support for medical researchers to explore potential new drugs and treatment methods. Through deep mining and analysis of the vast disease database, we can discover biomarkers, gene variations, and drug responses closely related to diseases, providing a solid foundation for the develop varies of new drugs and the innovation of treatment methods (Mao et al., 2023).

In addition, big data analysis has demonstrated excellent capabilities in drug safety and efficacy evaluation (Krallinger et al., 2010). By deeply analyzing a large amount of clinical trial data and real-world treatment data, we can accurately evaluate the side effects, efficacy, and differences in treatment efficacy of drugs in different populations, providing valuable guidance for rational drug use and personalized treatment (Tabib et al., 2020). Taken together, the development of medical big data enables molecular biological analysis at the micro level, such as metabolomics, proteomics, epigenetics and other technologies (Figure 1, by Figdraw, ID:TUSIYd5de0). Combined with the molecular mechanism of disease, the relationship between gene protein (enzymes, receptors, channels, signaling molecules, etc.) and cancer can be summarized from multi sample and big data analysis, providing a new scientific basis for the creation of new drugs and therapies. Therefore, totally 5 original research manuscripts by universally acknowledged authors which was collected on the related fields.

FIGURE 1.

FIGURE 1

The relationship of medical big data and cancer research (By Figdraw).

Borisov et al. proposed a novel gene expression data normalization method called Shambhala, which can present normalized data in a unified format, transforming each expression profile into a predefined universal format. It was found that the generalization phenomenon of human ribonucleic acid (RNA) profiles, which showed a significant decrease in platform specific clustering while strongly preserving the overall tissue-specific clustering features.

Eng et al. investigated the association between oxidative stress, deoxyribonucleic acid (DNA) damage, and genetic changes in benign thyroid lesion in one lobe (BTG) to papillary thyroid cancer (PTC) transformation. Due to the increase in reactive oxygen species (ROS) levels, the oxidative DNA damage intensifies, which may indicate the transformation of BTG to PTC. This may be achieved through mutations in genes related to the mitogen-activated protein kinase (MAPK) signaling pathway and stress activated MAPK/c-Jun N-terminal kinases (JNK) cascade reactions.

Kang et al. established a model based on immune related gene expression using the expression of immune related genes and calculated the scores for each sample. Analyzed the correlation between the model and clinical information, immune infiltration, drug response, and biological pathways. It was revealed that cancer related pathways, including immune related pathways, are significantly activated in high scoring samples, and some drugs have significantly lower IC50 values than low scoring drugs. The model developed based on immune related genes is robust and reflects various states of colorectal carcinoma (CRC), which may be a potential clinical indicator.

Zhou et al. used machine learning models to screen metabolomic features from cross queue datasets of colorectal cancer (CRC) and colorectal adenoma (CRA). Then select a CRC and CRA dataset from the CuratedMetagenomicData database to meet the requirement of having both metabolomic and clinical data. Integrating cross queue common identification of microbial characteristics with clinical features helps to construct a stable diagnostic model for early non-invasive screening of CRC and colorectal adenoma CRA.

Shi et al. investigated the shared genes between rheumatoid arthritis (RA) and lung adenocarcinoma (LUAD) through differential analysis and weighted gene co expression network analysis (WGCNA). Subsequently, COX regression and least absolute shrinkage and selection operator (LASSO) analysis were used to screen for genes significantly associated with survival. Verify the expression levels of candidate genes using qRT-PCR and Western blot techniques. The research results indicate that RA and LUAD share common physiological and pathological processes and molecular characteristics. Rheumatoid arthritis and lung adenocarcinoma (RALUADS) is expected to become an excellent prognostic factor and immune related biomarker, which can be applied to screen potential effective drugs and target LUAD patients with RA.

In summary, the aforementioned studies yield intriguing findings and contribute fresh insights to the field of big data in biomedical research on cancer. We are appreciative of “Frontiers in Molecular Biosciences” for facilitating this research theme and assembling a diverse group of authors. We also extend our gratitude to the editor for ensuring the quality of the manuscript, the reviewers for their diligent contributions and insightful recommendations, and the journal for providing a vital platform for academic exchange.

Funding Statement

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by Chongqing Natural Science Foundation General Project (2023NSCQ-MSX1633, CSTB2023NSCQ-MSX0393), Key Scientific and Technological Research Project of Chongqing Municipal Education Commission (KJ202302884457913, KJZD-K202302801), 2022 scientific research project of Chongqing Medical and Pharmaceutical College (ygz2022104), and Scientific research and seedling breeding project of Chongqing Medical Biotechnology Association (cmba2022kyym-zkxmQ0003) respectively.

Author contributions

JM: Writing–review and editing, Writing–original draft. KZ: Writing–review and editing. XW: Writing–review and editing.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

  1. Bray F., Ferlay J., Soerjomataram I., Siegel R. L., Torre L. A., Jemal A. (2018). Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA a cancer J. Clin. 68 (6), 394–424. 10.3322/caac.21492 [DOI] [PubMed] [Google Scholar]
  2. Hassan M., Awan F. M., Naz A., deAndrés-Galiana E. J., Alvarez O., Cernea A., et al. (2022). Innovations in genomics and big data analytics for personalized medicine and health care: a review. Int. J. Mol. Sci. 23 (9), 4645. 10.3390/ijms23094645 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Krallinger M., Leitner F., Valencia A. (2010). Analysis of biological processes and diseases using text mining approaches. Bioinforma. Methods Clin. Res. 593, 341–382. 10.1007/978-1-60327-194-3_16 [DOI] [PubMed] [Google Scholar]
  4. Mao J., Meng F., Wang G. (2023). Editorial: big data for biomedical research of inflammatory diseases. Front. Pharmacol. 14, 1287616. 10.3389/fphar.2023.1287616 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Tabib N. S. S., Madgwick M., Sudhakar P., Verstockt B., Korcsmaros T., Vermeire S. (2020). Big data in IBD: big progress for clinical practice. Gut 69 (8), 1520–1532. 10.1136/gutjnl-2019-320065 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Xi Y., Xu P. (2021). Global colorectal cancer burden in 2020 and projections to 2040. Transl. Oncol. 14 (10), 101174. 10.1016/j.tranon.2021.101174 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Frontiers in Molecular Biosciences are provided here courtesy of Frontiers Media SA

RESOURCES