Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2025 Apr 1.
Published in final edited form as: Arterioscler Thromb Vasc Biol. 2024 Mar 27;44(4):759–762. doi: 10.1161/ATVBAHA.124.320686

Multi-omics and Single-cell Omics: New Tools in Drug Target Discovery

Joseph Loscalzo 1
PMCID: PMC10977648  NIHMSID: NIHMS1969733  PMID: 38536899

Biological systems are inherently noisy, with noise caused by the measurement process (i.e., technical noise) and by inherent biological variability (i.e., biological noise). Biological noise accounts for phenotypic differences within a population of individuals, as well as differences between two different cells of the same lineage within an organ of a single individual. Biomedical investigators typically strive to eliminate technical noise as much as possible to optimize the signal-to-noise ratio of a measurement, and to mitigate the impact of biological noise by repeated measurements, thereby improving the statistical confidence in the mean signal.

With the advent of the genomic era, the biomedical research paradigm has moved from pure reductionism to a more holistic, integrated approach to system analysis. In this setting, biological noise plays a key role, accounting for heterogeneity in response to perturbations between cells and between individuals, and offering a mechanistic explanation for incomplete genetic penetrance (divergent pathophenotypes), variable functional phenotypes in subjects with identical disease-causing genetic variants, as well as variability in drug response. Clearly, understanding the bases for this type of biological heterogeneity is essential for defining individual disease risk, prognosis, and precision therapeutics.

Since the beginning of the Human Genome Project, we have moved from DNA sequence determination to a detailed assessment of other omic components. Technical approaches to measuring each omic level have evolved rapidly. More efficient, rapid, lower-cost DNA sequencing is now complemented by bulk transcriptomics, proteomics, epigenomics, and metabolomics, which, in turn, have been followed by single-cell transcriptomics and spatial transcriptomics, as well as single-cell proteomics, epigenomics, and metabolomics. This extraordinary growth in the detailed omic characterization of a biological system has led to analytical challenges that have yet to be resolved satisfactorily. These multi-omic systems are of high dimensionality and are over-determined from a dynamical systems perspective, which leads to the following questions: How can one assess the interactions among multi-omic layers? How can one reduce the dimensionality of the system to make it optimally tractable and biomedically useful? How can one exploit the biological noise inherent in these multi-omic systems to define disease expression within an individual and consequent precision therapeutics tailored to that individual’s multi-ome?

We and others have approached this problem through the lens of molecular interaction networks (Figure 1). Molecular networks (e.g., protein-protein interaction networks, metabolic networks, gene regulatory networks, Bayesian coexpression networks) provide graphical depictions of relationships between the elements within a given network. For example, in the typical protein-protein interaction network, each node represents a protein and its physical interaction with another protein in the proteome is depicted by a link or edge. We have shown that subnetworks are contained within the comprehensive protein-protein interaction network that are associated with specific diseases (disease modules) (1,2). By reducing the overall dimensionality of the network through a disease-focused perspective, these constructs offer a more detailed, causal view of the pathways and proteins that govern disease pathobiology. Disease modules that overlap provide insights into pathways and proteins that are common to different diseases. In addition, disease modules offer a mechanistic path toward drug target identification, making the target discovery process more efficient and rational. One can also use this same approach to repurpose approved drugs as we have done for coronary artery disease (3) and SARS-CoV-2 infection (4) by applying a variety of network-based statistical approaches including network-based AI-strategies (35).

Figure 1. Network integration of multi-omics data from the patient to the single cell level.

Figure 1.

Analysis of integrated multi-omic networks provides a basis for resolving phenotypic heterogeneity in human pathobiology and in defining precise drug therapies.

(Reproduced with permission: Wang R et al., Arterioscler Thromb Vase Biol 2023;43:493-503)

Expanding this analysis of the protein-protein interaction network to include the transcriptome yields additional useful information, including which pathways within a disease module are present in a given cell type or tissue (6). Only tissues that express key components of pathways governing disease phenotype manifest disease, regardless of whether or not genetic variants associated with the disease are themselves expressed in that tissue. Furthermore, using a transcriptome-based differential analysis of gene expression pairs (pairwise correlation analysis) in diseased vs. normal tissue, one can generate patient-specific disease modules, or reticulotypes (after the Latin for network), that can provide unique information on patient-specific disease mechanisms and patient-specific drug targets, as we showed for hypertrophic cardiomyopathy (7).

It is important to point out that biological variability of complex multi-omic systems can be a reflection of deterministic genetic variability (variants, mutants) as well as (bio)chemical stochasticity (epigenetics, post-translational modification of the proteome or the transcriptome). This latter class of causes of omic noise is more challenging to assess owing to its randomness and incompleteness. Theoretically, these stochastic modifications of the proteome can serve as the basis for a multitude of instantiations of the protein-protein interaction network with possibly differing effects on the phenotypes of interest. In the case of exposome-based epigenomic modification, it is not always clear as to what the time-activity relationship is to the (patho)phenotypes of interest. Furthermore, natural, time-dependent variation in the epigenome as a result of stage of development coupled with trans-generational effects on epigenetic marks make this level of omic analysis even more complicated, with no widely accepted approach available as yet.

Are there evolutionary advantages to multi-omic networks that can inform our analysis of them? One answer to this question lies in an analysis of the dynamical behavior of such networks. Coupling among omic layers appears to increase the stable operating range of the system, mitigating the consequences of (adverse) perturbations, rendering the system more resilient that it would be were one omic layer to operate in isolation (8). It is likely that variance in network structure, protein expression, or protein function (heritable or stochastic) yields biological noise that influences this resilience. Variation in the concentration of metabolites in metabolic networks coupled with variation in the catalytic constants of their coupled reactions provide a stabler operating range than limited or no variation, providing the system with the ability to utilize, for example, energy sources effectively in conditions of dearth or abundance (9). Allosteric modulation of enzyme systems provides a similar functional advantage (10).

In addition to molecular-level omics, cell-based immunophenotyping has been utilized in conjunction with single-cell RNA sequencing to characterize in more detail subsets of cells of ostensibly similar lineage. This approach, coupled with multi-organ and single-cell analyses of gene expression, has provided insight into patient-specific regulation of the immune response and patient-specific drug target identification in allergic (11) and autoimmune disorders (12).

Detailed immunophenotyping is one method for characterizing relevant phenotypic features that govern disease expression (13). Robust phenotyping in general, including orthogonal phenotyping (i.e., characterizing phenotypic features that are not believed or known to play a role in the disease of interest), is essential for identifying disease mechanisms and potential drug targets. Coupled with multi-omic data, in-depth phenotyping can be utilized to ascertain key mechanisms of disease in a patient-specific manner, as well as potential therapeutic targets. Furthermore, similar approaches can be used to identify network-based, multi-omic-derived biomarkers that can be used to predict disease course and response to therapy.

To achieve these goals effectively requires a specific computational workflow linked to newer network-based analytical algorithms (14) (Figure 2). Typically, the analytical exercise requires first a feature selection step (i.e., which elements in the phenotypic and multi-omic datasets may be important for outcome prediction), followed by data integration and representation (i.e., dimensionality reduction and newly transformed data rendering), and a data clustering methodology. This approach is designed to yield information on outcome associations, biomarker generation, and risk or response prediction. There are numerous methods that have been developed for use in this workflow that include supervised vs. unsupervised feature selection, linear vs. nonlinear feature extraction and data representation, linear vs. network-based multi-omic data integration, and partitioning vs. hierarchical clustering algorithms. There are many examples of specific algorithms under each of these feature selection or model types, the choice of which depends upon the nature of the dataset and the goal of the analysis.

Figure 2. Computational strategy for disease subtyping.

Figure 2.

Upper figure: progression from one-size-fits-all medicine to individualized medicine.

Lower figure: Computational pipeline for analysis of multi-omic datasets

(Reproduced with permission: Maiorino E, Loscalzo J. Arterioscler Thromb Vase Biol 2023;43:1111-1123)

Recent advances in network-based machine learning and artificial intelligence utilize a combination of physicochemical features of small-molecule drugs, estimated binding constants to protein targets, and network features of disease modules and their multi-omic context to predict drug-target interactions (5,15). This type of analysis coupled with AI-based functional assessment of the disease pathway or module within which the drug target is localized can then be used to minimize the number of drug-target interactions that require experimental testing in vitro and in vivo, decreasing development time and offering the promise of accelerating implementation in human clinical trials.

Biomedical investigators have spent the past 200 years attempting to reduce the complexity of the biological systems they study. In the current era of large datasets of multi-omic systems, we can no longer afford to avoid their intrinsic complexity. In fact, that complexity and the natural biological variation (noise) that further complicates it should be viewed as a means to move the field of precision medicine forward. Recognizing that many, if not most, pathophenotypes are convergent, that deeper phenotyping will unveil the subtle distinctions that discriminate between individuals, and that a robust analysis of the molecular complexity underlying these phenotypes will provide a deeper individualized understanding of disease mechanism and a path to more precises and effective therapies.

Acknowledgements:

This work was supported in part by NIH grants HL155107, HL166137, and HG007691; and by AHA grants 957729 and 24MERIT185447.

References

  • 1.Barabasi A-L, Gulbahce N, Loscalzo J. Network medicine: a network-based approach to human disease. Nature Rev Genet 2011;12:56–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Menche J, Sharma A, Kitsak M, Ghiassian S, Vidal M, Loscalzo J, Barabasi A-L. Disease Networks: Uncovering disease-disease relationships through the incomplete human interactome. Science 2015;347:841. 10.1126/science.1257601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cheng F, Desai RJ, Handy DE, Wang R, Schneeweiss S, Barabasi A-L, Loscalzo J. Network-based approach to prediction and population-based validation of in silico drug repurposing. Nature Comm 2018;9:2691. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Patten JJ, Keiser P, Morselli-Gysi D, Menichetti G, Mori H, Donahue CJ, Gan X, do Valle I, Geoghegan-Barek K, Anantpadma M, Boytz R, Berrigan JL, Stubbs SH, Ayazika T, O’Leary C, Jalloh S, Wagner F, Ayehunie S, Elledge SJ, Anderson D, Loscalzo J, Zitnik M, Gummuluru S, Namchuk MN, Barabasi A-L, Davey RA. Identification of potent inhibitors of SARS-CoV-2 infection by combined pharmacological evaluation and cellular network prioritization. iScience 2022;25:104925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Menichetti G. An AI pipeline to investigate the binding properties of poorly annotated molecules. Nature Rev Phys 2022;4:359. [Google Scholar]
  • 6.Kitsak M, Sharma A, Menche J, Guney E, Ghiassian SD, Loscalzo J, Barabasi A-L. Tissue specificity of human disease module. Sci Rep 2016;6:35241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Maron BA, Wang R-S, Shevtsov S, Drakos SG, Arons E, Pinson OW, Huggins GS, Samokhin AO, Oldham WM, Aguib Y, Yacoub MH, Rowin EJ, Maron BJ, Maron MS, Loscalzo J. Individualized interactomes for network-based precision medicine in hypertrophic cardiomyopathy with implications for other clinical pathophenotypes. Nature Comm 2021;12:873. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Liu X, Maiorino E, Halu A, Glass K, Prasad RB, Loscalzo J, Gao J, Sharma A. Robustness and lethality in multilayer biological molecular networks. Nat Commun 2020;11:6043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Menichetti G, Barabasi A-L. Nutrient concentrations in food display universal behaviour. Nature Food 2022;3:375–382. [DOI] [PubMed] [Google Scholar]
  • 10.Grimbs S, Selbig J, Bulik S, Holzhutter H-G, Steur R. The stability and robustness of metabolic states: identifying stabilizing sites in metabolic networks. Mol Syst Biol 2007;3:146. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Li X, Lee EJ, Lilja S, Loscalzo J, Schafer S, Smelik M, Strobl MR, Sysoev O, Wang H, Zhang H, Zhao Y, Gawel DR, Bohle B, Benson M. A dynamic single cell-based framework for digital twins to prioritize disease genes and drug targets. Genome Med 2022;14:48. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Lilja S, Li X, Smelik M, Lee EJ, Loscalzo J, Marthanda PB, Hu L, Magnuysson M, Sysoev O, Zhang H, Zhao Y, Sjowall C, Gawel D, Wang H, Benson M. Multi-organ single-cell analysis reveals an on/off switch system with potential for personalized treatment of immunological diseases. Cell Rep Med 2023;4:100956. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Mueller AA, Sasaki T, Keegan JW, Nguyen JP, Griffith A, Horisberger AM, Licata T, Fieg E, Cao Y, Elahee M, Marks KE, Simmons DP, Briere LC, Cobban LA, Pallais JC, High FA, Walker MA, Linnoila JJ, Sparks JA, Holers VM, Costenbader KH, Undiagnosed Diseases Network, Sweetser DA, Krier JB, Loscalzo J, Lederer JA, Rao DA. High-dimensional immunophenotyping reveals immune cell aberrations in patients with undiagnosed inflammatory and autoimmune diseases. J Clin Invest 2023; Oct 24:e169619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Maiorino E, Loscalzo J. Phenomics and robust multiomics data for cardiovascular disease subtyping. Arterioscler Thromb Vasc Biol 2023;43:1111–1123. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Späth J, SWang R-S, Humphrey M, Baumbach J, Loscalzo J. Machine learning-based integration of network features and chemical structures of compounds for SARS-CoV-2 drug effect analysis. CPT: Pharmacomet Syst Pharmacol 2023, in press. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES