Abstract
Precision medicine aims to empower clinicians to predict the most appropriate course of action for patients with complex diseases like cancer, diabetes, cardiomyopathy, and COVID-19. With a progressive interpretation of the clinical, molecular, and genomic factors at play in diseases, more effective and personalized medical treatments are anticipated for many disorders. Understanding patient’s metabolomics and genetic make-up in conjunction with clinical data will significantly lead to determining predisposition, diagnostic, prognostic, and predictive biomarkers and paths ultimately providing optimal and personalized care for diverse, and targeted chronic and acute diseases. In clinical settings, we need to timely model clinical and multi-omics data to find statistical patterns across millions of features to identify underlying biologic pathways, modifiable risk factors, and actionable information that support early detection and prevention of complex disorders, and development of new therapies for better patient care. It is important to calculate quantitative phenotype measurements, evaluate variants in unique genes and interpret using ACMG guidelines, find frequency of pathogenic and likely pathogenic variants without disease indicators, and observe autosomal recessive carriers with a phenotype manifestation in metabolome. Next, ensuring security to reconcile noise, we need to build and train machine-learning prognostic models to meaningfully process multisource heterogeneous data to identify high-risk rare variants and make medically relevant predictions. The goal, today, is to facilitate implementation of mainstream precision medicine to improve the traditional symptom-driven practice of medicine, and allow earlier interventions using predictive diagnostics and tailoring better-personalized treatments. We strongly recommend automated implementation of cutting-edge technologies, utilizing machine learning (ML) and artificial intelligence (AI) approaches for the multimodal data aggregation, multifactor examination, development of knowledgebase of clinical predictors for decision support, and best strategies for dealing with relevant ethical issues.
Keywords: Precision medicine, Clinics, Genomics, Metabolomics, Integrative analysis, Artificial intelligence, Machine learning
Background
Since the beginning of scientific discoveries, it has been central to understand the cause of disease and senescence [1]. Pain is one of the key triggers for patients to seek diagnosis and treatment. However, when dealing with some of the life-threatening diseases, patients may not feel pain. To identify and help patients with known diseases and symptoms, and those heading toward late stages of novel infectious (e.g., COVID-19), chronic (e.g., diabetes, heart disease), acute (e.g., flu, stroke, heart attack), and complex (e.g., cancer) diseases, it is essential to provide timely personalized treatment [2–10]. Our evolving understanding of the complex nature has led us to realize that to effectively diagnose and treat patients with these conditions, it is essential to provide personalized utilize a precision medicine approach [10]. Progress in the molecular technology developments have led to vast amounts of human health-related data that are expected to greatly expand our understanding of human biology and health, and to drive personalized medicine. We hypothesize that on-demand access and analysis of clinical, genetic, and metabolic data will align biomarker identification with treatment windows necessary for real-time personalized care and enhance prediction of potential disease risks [11]. Despite current advancements, there is still no platform available that can efficiently integrate clinical, multi-omics, and epidemiological data acquisition, and enable effective management of data analytics with a user-friendly physician-oriented clinical interface [12, 13]. Platforms like The Cancer Genome Atlas (TCGA) [14] provide a great resource for scientific data (i.e., genomics or epigenetics sequence data) but offer limited capacity for clinical information, because they are not directly integrated to clinical health systems like Epic, NextGen, and Cerner etc. The inability of disparate platforms to effectively integrate is largely due to the high volume and heterogenous nature of the different types of data they contain, which is acquired from variable sources, each with unique data structures. It is essential to address a major gap in developing precision diagnostics and therapeutic agents in healthcare by establishing a digital solution for practicing precision medicine. Intelligent big data platforms are necessary to improve the quality of care-delivery process by increasing permeation of electronic health record (EHR) systems into clinical environments, focusing on predictive diagnosis, enabling real-time telemedicine, and precise treatment resulting in lower spending on life-threatening complex and chronic diseases [15].
Practicing precision medicine and AI
Precision medicine has the potential to improve the traditional symptom-driven practice of medicine, and allow earlier interventions using predictive diagnostics and tailoring better-personalized treatments [2–9, 16, 17]. However, practicing precision medicine is not straightforward, as significant efforts are required from the experts in multidisciplinary sciences. This necessitates development of progressive healthcare environment that will enable clinicians and researchers to gain a complete picture of the patient to deepen their understanding, using additional details from healthcare and multi-omics data. We hypothesize that clinical information will enrich genomics and metabolomics data such that combined predictors will perform better than the individual classifiers only based on either genomics, metabolomics, or clinical data. Practically supporting the hypothesis, we need to design research methodology, which includes modeling of patient-specific (healthcare, genomics, metabolomics, proteomics, and lifestyle) and publicly available annotation (genes, variants, diseases, drugs, biomarkers) data storage, management, integration, knowledgebase creation, and analysis using different artificial intelligence (AI) and machine learning (ML) approaches (Fig. 1) [18, 19].
Fig. 1.
Design modeling of heterogenous patient-specific healthcare, genomics, metabolomics, phenotypic, and lifestyle data, and publicly available annotation data including genes, variants, diseases, drugs, and biomarkers. Analysis using AI and ML approaches (Support Vector Machine, Deep Learning, Logistic Regression, Discrimination Analysis, Decision tree, Random Forest, Linear Regression, Naïve Bayes, K-Nearest Neighbor, Hidden Markov Model, and Genetic Algorithm), multifactor examination, knowledgebase and decision support system for data classification, cluster, and regression analysis. Furthermore, resource allocation for data storage and computational analysis
Precision medicine is moving forward but with many challenges that require addition of useful analytic tools, technologies, databases, and approaches to efficiently manage massive heterogeneous data, augment networking and interoperability of clinical, laboratory, and public health systems. A major barrier to implementation of precision medicine is the data analysis requirement. Most of the precision medicine efforts today are manual or semi-automated, time-consuming, and unable to facilitate on-demand analysis of diverse human datasets to impact critical treatment windows and predict potential disease risks [19–27]. The traditional way of computational analysis is based on running a series of command-line applications, which require good programming skills and ability to work in the UNIX environment. It hinders linking information generated at different stages of treatments and experiments conducted at levels of sampling, sequencing, and analysis. While precision medicine analyses require complex coordinated efforts between disparate groups with non-aligned data formats and massive amounts of computing time that is essential in many cases to positively impact treatment outcomes. Furthermore, it is difficult but mandatory to address ethical and social issues related to healthcare data collection, privacy, and protection with effective balance [18, 28]. Further, current potential pitfalls are given in attached Table 1.
Table 1.
Current potential pitfalls
| Number | Potential pitfalls |
|---|---|
| 1 | Uneven distribution of informatics resources. |
| 2 | Integration of biomedical data located among heterogeneous sources. |
| 3 | Hazards in dehumanization of healthcare data. |
| 4 | Handling of extensively available irrelevant, error prone, and missing data. |
| 5 | Intelligent and user-friendly interface development. |
| 6 | Applying regulations and policies for data collection, usage and sharing. |
| 7 | Harmonizing big data with the definitions of clinical phenotypes and diagnosis. |
| 8 | Inflexible EHR database schemas not geared for precision medicine. |
| 9 | Lack of data availability on social determinants of health. |
| 10 | Unstandardized genomics tools and modifications in their versions and outcome format. |
| 11 | Overloaded Data generated during unnecessary follow-up diagnoses and treatments. |
| 12 | Augmented computational complexity with increasing number of attributes. |
| 13 | Slow SQL based high volume data processing speed. |
| 14 | Determining optimal parameters and understanding structures of AI and ML algorithms. |
| 15 | Handling continuous explanatory variables with more than two levels and understanding odds and probabilities in AI and ML algorithms. |
| 16 | Possibility of too many overfitting attributes in AI and ML algorithms. |
| 17 | Handling redundant attributes, distribution of statistically independent attributes, and management of class frequencies affecting accuracy. |
| 18 | Reduced evidence and reproducibility. |
| 19 | Correct predictor variables selection, and evidence-based observational data analysis and screening. |
| 20 | Gaining confidence of clinicians at AI produced results. |
| 21 | Ethical and social issues related to healthcare data collection, privacy and protection. |
The efficient use of information technology, data science, and AI has the potential to enhance public health surveillance and tracking, with systematic collection, management, analysis, and interpretation of data within accelerated timelines [6, 19, 29–31]. We need detailed bioinformatics and AI platforms for supporting real-time processes involved in multisource heterogeneous raw data generation, mathematical modeling, computational analysis, data fusion, integration, management, and visualization (Fig. 1). Platforms need to be user-friendly, multi-functional, and multi-roles-based to address complex and big data-oriented problems in clinical settings. It will support categorizing interaction patterns among variables, learning from experiences, and strategizing and predicting better orientations. Multiple AI and ML algorithms (e.g., Support Vector Machine, Deep Learning, Logistic Regression, Discrimination Analysis, Decision tree, Random Forest, Linear Regression, Naïve Bayes, K-Nearest Neighbor, Hidden Markov Model, and Genetic Algorithm, etc.) are available for multifactor examination, scientific knowledge extraction, and decision support system (Fig. 1) [32–36]. However, determining which AI and ML approaches to use for which task is a challenge in itself [37]. We suggest classifying tasks based on the available predictor variables, as a key to correctly address this problem. Best fitting use of ML and AI algorithms have the potential to predict the existence of life-threatening diseases risk susceptibility, starting from the most common to rare among the population data [19].
AI has the ability to improve identification of relevant variables for patient data stratification with timely detection of statistical patterns across millions of features to identify conditions that are likely to manifest later and discover modifiable risk factors that support the best utilization of known therapies [38]. Impactful and automated implementation of AI and ML can elevate investigating correlation and overlapping of reported diagnoses of a patient in clinical data, and assess genotype and phenotype associations among various diseases to find potential indistinct results for patient care from highly expressed genes and disease-causing variants [9, 39]. Understanding how genetic variations contribute to health is one important aspect of precision medicine, where additional approaches involve measuring levels of proteins and metabolic products. By harnessing the power of metabolomics, we need to profile a patient’s metabolome and correlate it with their body mass index (BMI). Further, AI can assist in finding metabolite penetrance using listed features and abnormalities, and analyzing biochemical pathways in metabolites [40–42] with patterns of multimodal distributions for candidate genes [10, 43].
Conclusions
The scientific approach would be to perform analysis of individual genomes giving rise to a new form of preventive and personalized medicine in healthcare. Availability of gene-based designer drugs, precise targeting of molecular fingerprints for tumor, appropriate drug therapy, predicting individual susceptibility to disease, diagnosis, and treatment of mental illness are all a few of the many transformations expected in the decade to come. Precision medicine will timely enable clinicians to integrate healthcare data with targeted assays and tests to identify and assess disease biomarkers and risks, determine actionable genetic variants in patients, obtain the entire picture of the metabolome, and map metabolites to disease pathways.
Acknowledgements
We appreciate great support by the Institute for Health, Health Care Policy and Aging Research (IFH), and Rutgers Robert Wood Johnson Medical School, Rutgers Biomedical and Health Sciences at the Rutgers, The State University of New Jersey.
Abbreviations
- AI
Artificial intelligence
- BMI
Body mass index
- COVID-19
Coronavirus disease 2019
- EHR
Electronic health record
- HIPAA
Health Insurance Portability and Accountability Act
- ML
Machine learning
- TCGA
The Cancer Genome Atlas
Author’s contributions
ZA conceived of the presented idea and wrote the manuscript. The author read and approved the final manuscript.
Authors’ information
ZA is Assistant Professor of Medicine and Core Faculty Member at the Institute for Health, Health Care Policy and Aging Research (IFH), and the Department of Medicine, Rutgers Robert Wood Johnson Medical School (RWJMS), Rutgers Biomedical and Health Sciences (RBHS), at Rutgers, The State University of New Jersey.
Funding
This work was supported by the Institute for Health, Health Care Policy and Aging Research, and Robert Wood Johnson Medical School, at Rutgers, The State University of New Jersey.
Availability of data and materials
Not applicable.
Ethics approval and consent to participate
Not applicable
Consent for publication
Not applicable
Competing interests
The author declares that they have no competing interests.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Zeeshan S, Xiong R, Liang BT, Ahmed Z. 100 years of evolving gene–disease complexities and scientific debutants. Brief Bioinform. 2019;21(3):885–905. doi: 10.1093/bib/bbz038. [DOI] [PubMed] [Google Scholar]
- 2.Collins FS, Varmus H. A new initiative on precision medicine. N Engl J Med. 2015;372:793–795. doi: 10.1056/NEJMp1500523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Venter JC, Smith HO, Adams MD. The sequence of the human genome. Clin Chem. 2015;61:1207–1208. doi: 10.1373/clinchem.2014.237016. [DOI] [PubMed] [Google Scholar]
- 4.Long T, et al. Whole-genome sequencing identifies common-to-rare variants associated with human blood metabolites. Nat Genet. 2017;49:568–578. doi: 10.1038/ng.3809. [DOI] [PubMed] [Google Scholar]
- 5.Guo L, et al. Plasma metabolomic profiles enhance precision medicine for volunteers of normal health. Proc Natl Acad Sci USA. 2015;112(35):E4901–E4910. doi: 10.1073/pnas.1508425112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Khoury MJ, Iademarco MF, Riley WT. Precision public health for the era of precision medicine. American journal of preventive medicine. 2016;50(3):398–401. doi: 10.1016/j.amepre.2015.08.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Lazaridis KN, et al. Implementing individualized medicine into the medical practice. Am J Med Genet C Semin Med Genet. 2014;166C:15–23. doi: 10.1002/ajmg.c.31387. [DOI] [PubMed] [Google Scholar]
- 8.Bakris G, Sorrentino M. Redefining hypertension — assessing the new blood pressure guidelines. N Engl J Med. 2018;378:497–499. doi: 10.1056/NEJMp1716193. [DOI] [PubMed] [Google Scholar]
- 9.Hou YC, et al. Precision medicine integrating whole-genome sequencing, comprehensive metabolomics, and advanced imaging. Proceed Natl Acad Sci U S A. 2020;117(6):3053–3062. doi: 10.1073/pnas.1909378117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Ahmed Z, Zeeshan S, Foran DJ, Kleinman LC, Wondisford FE, Dong X. Integrative Clinical, Genomics and Metabolomics Data Analysis for Mainstream Precision Medicine to Investigate COVID-19. BMJ Innov. 2020. Published Online First: 04 September 2020. 10.1136/bmjinnov-2020-000444.
- 11.Ahmed Z, Kim M, Liang BT. MAV-clic: management, analysis, and visualization of clinical data. JAMIA Open. 2019;2:23–28. doi: 10.1093/jamiaopen/ooy052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Sboner A, Elemento O. A primer on precision medicine informatics. Brief Bioinform. 2016;17:145–153. doi: 10.1093/bib/bbv032. [DOI] [PubMed] [Google Scholar]
- 13.Karczewski KJ, Snyder MP. Integrative omics for health and disease. Nat Rev Genet. 2018;19(5):299–310. doi: 10.1038/nrg.2018.4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Tomczak K, Czerwińska P, Wiznerowicz M. The Cancer Genome Atlas (TCGA): an immeasurable source of knowledge. Contemp Oncol (Poznan, Poland), 2015;19(1A):A68–A77. 10.5114/wo.2014.47136. [DOI] [PMC free article] [PubMed]
- 15.Ahmed Z, et al. Human gene-disease associations for clinical-genomics and precision medicine research. Clin Transl Med. 2020;2020:1–22. doi: 10.1002/ctm2.28. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Schüssler-Fiorenza RSM, Contrepois K, Moneghetti KJ, et al. A longitudinal big data approach for precision health. Nat Med. 2019;25:792–804. doi: 10.1038/s41591-019-0414-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Perkins BA, Caskey CT, Brar P, et al. Precision medicine screening using whole-genome sequencing and advanced imaging to identify disease risk in adults. Proc Natl Acad Sci U S A. 2018;115(14):3686–3691. doi: 10.1073/pnas.1706096114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Bradford W, Hurdle JF, LaSalle B, Facelli JC. Development of a HIPAA-compliant environment for translational research data and analytics. Journal of the American Medical Informatics Association: JAMIA. 2014;21(1):185–189. doi: 10.1136/amiajnl-2013-001769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ahmed Z, Mohamed K, Zeeshan S, Dong X. Artificial intelligence with multi-functional machine learning platform development for better healthcare and precision medicine. Database. 2020;2020:baaa010. doi: 10.1093/database/baaa010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lee SI, et al. A machine learning approach to integrate big data for precision medicine in acute myeloid leukemia. Nature communications. 2018;9(1):42. doi: 10.1038/s41467-017-02465-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Van Panhuis WG, et al. A systematic review of barriers to data sharing in public health. BMC Public Health. 2014;14:1144. doi: 10.1186/1471-2458-14-1144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Beltran H, et al. Whole-exome sequencing of metastatic cancer and biomarkers of treatment response. JAMA oncology. 2015;1(4):466–474. doi: 10.1001/jamaoncol.2015.1313. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Luo Y, Ahmad FS, Shah SJ. Tensor factorization for precision medicine in heart failure with preserved ejection fraction. J Cardiovasc Transl Res. 2017;10:305–312. doi: 10.1007/s12265-016-9727-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Katsanis N. The continuum of causality in human genetic disorders. Genome Biol. 2016;17:233. doi: 10.1186/s13059-016-1107-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Manrai AK, Ioannidis JP, Kohane IS. Clinical genomics: from pathogenicity claims to quantitative risk estimates. JAMA. 2016;315:1233–1234. doi: 10.1001/jama.2016.1519. [DOI] [PubMed] [Google Scholar]
- 26.Boyle EA, Li YI, Pritchard JK. An expanded view of complex traits: from polygenic to omnigenic. Cell. 2017;169:1177–1186. doi: 10.1016/j.cell.2017.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Shieh Y, et al. Breast cancer screening in the precision medicine era: risk-based screening in a population-based trial. J Natl Cancer Inst. 2017;109:djw290. doi: 10.1093/jnci/djw290. [DOI] [PubMed] [Google Scholar]
- 28.Feero WG. Clinical application of whole-genome sequencing: proceed with care. JAMA. 2014;311:1017–1019. doi: 10.1001/jama.2014.1718. [DOI] [PubMed] [Google Scholar]
- 29.Khoury MJ, Ioannidis JP. Medicine. Big data meets public health. Science. 2014;346(6213):1054–1055. doi: 10.1126/science.aaa2709. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Bali J, Garg R, Bali RT. Artificial intelligence (AI) in healthcare and biomedical research: why a strong computational/AI bioethics framework is required? Indian journal of ophthalmology. 2019;67:3–6. doi: 10.4103/ijo.IJO_1292_18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Buch VH, Ahmed I, Maruthappu M. Artificial intelligence in medicine: current trends and future possibilities. The British journal of general practice: the journal of the Royal College of General Practitioners. 2018;68:143–144. doi: 10.3399/bjgp18X695213. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Shah P, et al. Artificial intelligence and machine learning in clinical development: a translational perspective. NPJ Digit Med. 2019;2:69. doi: 10.1038/s41746-019-0148-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Rajkomar A, et al. Scalable and accurate deep learning with electronic health records. NPJ Digit Med. 2018;1:18. doi: 10.1038/s41746-018-0029-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Alipanahi B, Delong A, Weirauch MT, Frey BJ. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat. Biotechnol. 2015;33:831–838. doi: 10.1038/nbt.3300. [DOI] [PubMed] [Google Scholar]
- 35.Esteva A, et al. A guide to deep learning in healthcare. Nat. Med. 2019;25:24–29. doi: 10.1038/s41591-018-0316-z. [DOI] [PubMed] [Google Scholar]
- 36.Norgeot B, Glicksberg BS, Butte AJ. A call for deep-learning healthcare. Nat. Med. 2019;25:14–5. doi: 10.1038/s41591-018-0320-3. [DOI] [PubMed] [Google Scholar]
- 37.Kelly CJ, Karthikesalingam A, Suleyman M, Carrado G, King D. Key challenges for delivering clinical impact with artificial intelligence. BMC Med. 2019;17:195. doi: 10.1186/s12916-019-1426-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Ahmed Z, Zeeshan S, Xiong R, Liang BT. Debutant iOS app and gene-disease complexities in clinical genomics and precision medicine. Clin Trans Med. 2019;8:26. doi: 10.1186/s40169-019-0243-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Marouli E, et al. Rare and low-frequency coding variants alter human adult height. Nature. 2017;542(7640):186–190. doi: 10.1038/nature21039. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Dandekar T, Fieselmann A, Majeed S, Ahmed Z. Software applications toward quantitative metabolic flux analysis and modeling. Brief Bioinform. 2014;15:91–107. doi: 10.1093/bib/bbs065. [DOI] [PubMed] [Google Scholar]
- 41.Ahmed Z, et al. Software LS-MIDA for efficient mass isotopomer distribution analysis in metabolic modelling. BMC Bioinformatics. 2013;14:218. doi: 10.1186/1471-2105-14-218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Ahmed Z, et al. “Isotopo” a database application for facile analysis and management of mass isotopomer data. Database. 2014;2014:bau077. doi: 10.1093/database/bau077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Tonn MK, Thomas P, Barahona M, Oyarzún DA. Stochastic modelling reveals mechanisms of metabolic heterogeneity. Communications biology. 2019;2:108. doi: 10.1038/s42003-019-0347-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Not applicable.

