Abstract
Understanding and manipulating cell fate determination is pivotal in biology. Cell fate is determined by intricate and nonlinear interactions among molecules, making mathematical model-based quantitative analysis indispensable for its elucidation. Nevertheless, obtaining the essential dynamic experimental data for model development has been a significant obstacle. However, recent advancements in large-scale omics data technology are providing the necessary foundation for developing such models. Based on accumulated experimental evidence, we can postulate that cell fate is governed by a limited number of core regulatory circuits. Following this concept, we present a conceptual control framework that leverages single-cell RNA-seq data for dynamic molecular regulatory network modeling, aiming to identify and manipulate core regulatory circuits and their master regulators to drive desired cellular state transitions. We illustrate the proposed framework by applying it to the reversion of lung cancer cell states, although it is more broadly applicable to understanding and controlling a wide range of cell-fate determination processes.
Subject terms: Systems biology, Computational biology and bioinformatics
Introduction
Cell-fate determination is an evolutionarily well-conserved process through which cells make critical decisions regarding their ultimate roles within a multicellular organism1. This foundational process underpins functions that are indispensable for all multicellular organisms, including the precise orchestration of normal developmental pathways, the maintenance of internal equilibrium (homeostasis), and the facilitation of adult tissue regeneration. Due to its vital importance, cell-fate determination has evolved to be highly resilient to various perturbations. Nevertheless, seminal experimental findings have revealed that the predetermined destiny of a cell can be dramatically reshaped by a few molecular modifications. For instance, the overexpression of the Yamanaka factors (OCT4, SOX2, KLF4, and c-MYC; OSKM) can reprogram differentiated fibroblasts into induced pluripotent stem cells2. Moreover, ectopic activation of Yes-associated protein can transdifferentiate terminally differentiated hepatocytes into biliary epithelial-like cells3. In addition, restoration of adenomatous polyposis coli is able to revert colon carcinoma cells to a functionally normal state despite the presence of potent oncogenic mutations4. These findings collectively highlight that predisposed cell fates can, in principle, be changed by manipulating a few specific molecules, called master regulators, while remaining highly robust against most other molecular perturbations. This perspective raises the following questions: how can we identify the master regulators, and through what molecular regulatory mechanisms do they induce cell-fate changes?
Cells are dynamic systems composed of intricate signaling pathways interconnected by various feedbacks and crosstalks, forming a complex network5,6. The regulatory relationships within this network are predominantly nonlinear, adding layers of complexity that defy simple, intuitive predictions about how altering a specific molecule might impact cellular functions7,8. The inherent complexity and nonlinearity present significant challenges in understanding the principles underlying cell-fate determination and its control. To comprehend the intricate networks of intracellular molecular regulations, a systems biology approach that integrates quantitative mathematical modeling with molecular experimentation is indispensable.
In this perspective, we argue that the integration of newly emerging dynamic information with mathematical models enables us to not only decode the fundamental principles embedded in the process of cell-fate determination, but also to exert control over this intricate process to the degree that was previously unachievable. In particular, we propose that although cell-fate determination accompanies genome-wide molecular state changes, it might be underpinned by only specific subnetworks within the network, which we refer to as ‘core regulatory circuits’. These circuits are instrumental in orchestrating the intricate sequence of events that determine cell fate. Based on this, we present a conceptual framework that employs single-cell RNA-sequencing (scRNA-seq) data to identify and manipulate core regulatory circuits determining cell fate. By applying this framework to scRNA-seq data of tumorigenesis, we propose a system-level approach to identify molecular candidates for cancer reversion and their molecular mechanisms through which deregulated gene regulatory dynamics are rewired, and cancer hallmarks can be compromised to reestablish normal phenotypes.
Representing cell-fate changes by complex molecular network dynamics
Cellular functions are orchestrated by a gene regulatory network (GRN) composed of tens of thousands of genes, intertwined through intricate nonlinear interactions. Network dynamics can be conceptualized through Waddington’s landscape, an intuitive model that illustrates how cells navigate through various states within a high-dimensional state space9 (Fig. 1a). In this landscape, valleys correspond to specific cell types, known as ‘attractors’, representing stable states that cells naturally settle into10. Building on this, the concept of ‘attractor landscape’ further illustrates the array of potentially stable states that cells can adopt. The basins surrounding these attractors indicate the probability of cells adopting each phenotype, offering insights into the dynamics and probabilities of cell-fate transitions. In this context, cell fates can be regarded as the most probable states a cell can occupy within the attractor landscape, reflecting their potential transition trajectories. In principle, the predetermined cell fates can be changed by altering the attractor landscapes of cells by rewiring the core regulatory circuits that underlie their complex molecular interactions11,12 (Fig. 1a). Can Waddington’s landscape metaphor be used for a quantitative description of actual cell-fate determination? Moreover, can it be applied to cell-fate inference and control?
Such a metaphorical landscape can be quantitatively described through mathematical models (Fig. 1b). These models formalize the state of a cell at time t by the collective value of each molecular state in the network. For example, a network composed of n genes can be represented at time t by the state vector $x(t) = (x_1(t), x_2(t), \ldots, x_n(t))$. In this vector, $x_i(t)$ for $i = 1, \ldots, n$ denotes the state of the ith gene at time t, and thus, $x(t)$ corresponds to a point in an n-dimensional gene expression state space. The state of each molecule is influenced by the GRN, where the future state of each molecule is determined by nonlinear interactions with its upstream molecules. Consequently, a cellular state evolves over time as a nonlinear function of all molecules in the network, $x(t+1) = F(x(t))$, converging towards a particular state (or a set of states) in the state space, which is called the attractor state. Empirical estimation of the network structure and nonlinear function parameters from experimental data is essential for a reasonably accurate model of cell-fate determination. However, obtaining the dynamic data needed for the estimation has been experimentally challenging. Fortunately, recent advances in single-cell omics and related analytical technologies can now provide the necessary temporal data required for the construction of comprehensive mathematical models.
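To make the notions of state vector, update function, and attractor concrete, the following is a minimal sketch for a Boolean network under synchronous updating. The three-gene wiring (A and B repress each other, C follows A) and the names `update` and `find_attractors` are hypothetical choices made for illustration and are not taken from any model in this article. Exhaustive enumeration is only feasible for very small networks.

```python
from itertools import product

# Hypothetical 3-gene network (illustrative wiring only):
# A and B repress each other; C is activated by A.
def update(state):
    a, b, c = state
    return (int(not b), int(not a), a)

def find_attractors(step, n):
    """Follow every initial state until a state repeats; collect the cycles."""
    attractors = set()
    for start in product((0, 1), repeat=n):
        seen = []
        state = start
        while state not in seen:
            seen.append(state)
            state = step(state)
        cycle = seen[seen.index(state):]
        # Rotate each cycle to a canonical form so it is stored only once.
        k = cycle.index(min(cycle))
        attractors.add(tuple(cycle[k:] + cycle[:k]))
    return attractors

attractors = find_attractors(update, 3)
for attractor in sorted(attractors):
    print(attractor)
```

In this toy network the mutual repression yields two fixed points, (1, 0, 1) and (0, 1, 0), which play the role of two alternative cell fates, plus one oscillation that arises from the synchronous update scheme.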
Methodologies available to construct mathematical models using single-cell omics data
Single-cell sequencing technologies are advancing rapidly and are now capable of analyzing datasets, including transcriptomes, epigenomes, and proteomes from hundreds of thousands of cells. In particular, RNA sequencing has emerged as the leading technique in single-cell omics13. Sequencing RNA at the single-cell level allows for a detailed examination of gene transcription, providing a high-dimensional fingerprint that identifies unique cellular characteristics. Consequently, scRNA-seq has become an invaluable tool for investigating cell identity and state transitions at the level of individual cells. In this perspective, we focus on scRNA-seq. We illustrate current methodologies for utilizing scRNA-seq data in studying cell-fate determination, which encompass several critical stages as summarized below.
The initial stage involves preprocessing raw count data (Fig. 2a). This crucial first step, using tools such as Seurat14, Scanpy15, and Bioconductor-based SingleCellExperiment16, involves rigorous quality control, normalization, and feature selection, to establish a robust foundation for subsequent analyses. Following preprocessing, the complexity of data is dealt with through dimensionality reduction techniques like PCA17, t-SNE18, and UMAP19, effectively simplifying the data while preserving its essential characteristics (Fig. 2b). Following dimensionality reduction, clustering algorithms such as Louvain20, Leiden20, DBSCAN21, and SINCERA22 are used to group cells with shared identities. This step is followed by the annotation of each clustered cell type using established biological knowledge, and the identification of specific cell states that correlate with the cell fates of interest.
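The normalization and feature-selection steps described above can be sketched without any single-cell library. The example below is a simplified pure-Python illustration on a made-up count matrix. The function names `normalize_log` and `top_variable_genes` and the `target_sum` value are assumptions for this sketch, and plain variance ranking is only a stand-in for the more elaborate feature-selection procedures implemented in the tools cited above.

```python
import math

def normalize_log(counts, target_sum=10_000):
    """Scale each cell to the same total count, then apply log(1 + x)."""
    out = []
    for cell in counts:
        total = sum(cell)
        scale = target_sum / total if total > 0 else 0.0
        out.append([math.log1p(x * scale) for x in cell])
    return out

def top_variable_genes(matrix, k):
    """Return the indices of the k genes with the highest variance across cells."""
    n_cells = len(matrix)
    variances = []
    for g in range(len(matrix[0])):
        col = [row[g] for row in matrix]
        mean = sum(col) / n_cells
        variances.append(sum((v - mean) ** 2 for v in col) / n_cells)
    ranked = sorted(range(len(variances)), key=lambda g: variances[g], reverse=True)
    return ranked[:k]

# Toy raw counts: 4 cells x 3 genes (made-up numbers for illustration).
raw = [[10, 0, 5], [20, 0, 10], [0, 30, 5], [0, 60, 10]]
norm = normalize_log(raw)
hvg = top_variable_genes(norm, 2)
print(hvg)
```

After normalization, cells that differ only in sequencing depth (the first two rows, and the last two rows) become nearly identical, so the remaining variance reflects differences in composition rather than depth.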
To re-order cellular trajectories according to gene expression changes, pseudotime analysis tools like Monocle323 are employed to map cell trajectories from scRNA-seq snapshots (Fig. 2c). RNA velocity24,25, based on vectors derived from mRNA splicing dynamics, also indicates the direction and likelihood of cell state transitions. Additional methods like Slingshot26, PAGA27, and FateID28 are also useful for inferring cellular dynamic trajectories and quantifying cell-fate probabilities.
The next phase involves constructing a molecular regulatory network (Fig. 2d, top) by inferring molecular interactions from the previously processed data29. Tools such as Scribe30 and CLR31 use mutual information to capture nonlinear relationships within transcriptomic data and are instrumental in building the molecular regulatory network. GENIE332, GRNBoost233, PIDC34, LEAP35, and SCENIC36 are also widely used for this purpose. Furthermore, by integrating different single-cell data modalities (e.g., gene expression and chromatin accessibility), SCENIC+37, Pando38, Dictys39, and CellOracle40 can estimate the regulatory effect of each transcription factor (TF) on each gene mediated by specific regions of DNA, and then infer a more specific GRN structure.
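As an illustration of why mutual information is attractive for network inference, the sketch below computes it for binarized expression profiles. The gene vectors are invented for the example and the function name `mutual_information` is an assumption. The cited tools use more refined estimators and corrections, so this shows only the underlying quantity.

```python
import math
from collections import Counter

def mutual_information(x, y):
    """Mutual information (in bits) between two equally long discrete sequences."""
    n = len(x)
    px, py, pxy = Counter(x), Counter(y), Counter(zip(x, y))
    mi = 0.0
    for (a, b), c in pxy.items():
        # p(a, b) * log2( p(a, b) / (p(a) * p(b)) ), written with raw counts
        mi += (c / n) * math.log2(c * n / (px[a] * py[b]))
    return mi

# Binarized expression of three hypothetical genes across eight cells.
tf = [0, 0, 0, 0, 1, 1, 1, 1]
target = [1, 1, 1, 1, 0, 0, 0, 0]  # perfectly anti-correlated with tf
other = [0, 1, 0, 1, 0, 1, 0, 1]   # statistically independent of tf

mi_dependent = mutual_information(tf, target)
mi_independent = mutual_information(tf, other)
print(mi_dependent, mi_independent)
```

The repressed target carries one full bit of information about the TF even though the relationship is inhibitory, whereas the independent gene carries none. Mutual information detects dependence but not its sign, which is one reason signed networks require additional analysis.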
In addition to the constructed GRN structure, mathematical models are constructed by inferring and encapsulating critical regulatory dynamics from scRNA-seq data (Fig. 2d, bottom). To construct a logical dynamic model, methods such as BTR (BoolTraineR)41 and SCNS (single cell network synthesis)42 can be employed. Building a Boolean network model requires optimizing Boolean regulation logic. This process is tailored to each algorithm used, such as the Z3 solver and the Quine-McCluskey (QM) algorithm, and is performed on binarized data to ensure optimal precision. Parallel to this, continuous dynamic modeling adopts a different approach. This method often involves deriving ordinary differential equations (ODEs) or stochastic differential equations, with a particular emphasis on pseudotime ordering of the data43–45. For example, SCODE44 employs linear regression techniques, whereas SCOUP45 utilizes a continuous diffusion process and is designed to analyze single-cell expression data during differentiation processes. This latter model, based on the Ornstein–Uhlenbeck process, is especially adept at determining the interdependencies between gene expression levels at distinct temporal points. This method offers a more detailed understanding of cellular dynamics across various time points.
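For readers unfamiliar with the Ornstein–Uhlenbeck process mentioned above, the following sketch simulates a generic one-dimensional version with the Euler–Maruyama scheme. It is not the SCOUP model itself. The parameter names (`mu`, `theta`, `sigma`) and values are assumptions chosen for illustration.

```python
import math
import random

def simulate_ou(x0, mu, theta, sigma, dt, n_steps, seed=0):
    """Euler-Maruyama simulation of dX = theta * (mu - X) dt + sigma dW."""
    rng = random.Random(seed)
    x = x0
    path = [x]
    for _ in range(n_steps):
        noise = rng.gauss(0.0, 1.0) * math.sqrt(dt)
        x = x + theta * (mu - x) * dt + sigma * noise
        path.append(x)
    return path

# With sigma = 0 the expression level relaxes deterministically toward mu.
deterministic = simulate_ou(x0=0.0, mu=2.0, theta=1.0, sigma=0.0, dt=0.01, n_steps=1000)
# With sigma > 0 the level fluctuates around mu after the initial relaxation.
noisy = simulate_ou(x0=0.0, mu=2.0, theta=1.0, sigma=0.5, dt=0.01, n_steps=1000)
print(round(deterministic[-1], 3))
```

Here `mu` plays the role of an attractor expression level and `theta` sets how quickly a gene relaxes toward it, which is the intuition behind using such a process to describe expression changes during differentiation.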
To summarize, we provide a brief overview of useful methods for constructing mathematical models (Table 1). These models are invaluable for identifying molecular targets, particularly master regulators, through extensive systematic perturbation analysis using tools like CellOracle40 and scTenifoldKnk46, which predict gene perturbation effects and identify key cell-fate regulators (Fig. 2e). However, despite their advanced capabilities, these methods require laborious processes that involve individually controlling molecules and analyzing complex systems. Furthermore, these tools do not inherently incorporate an understanding of system dynamics, such as the attractor landscape, into their analyses. This poses significant limitations on revealing the specific molecular regulatory mechanisms that dictate cell-fate decisions. To address these challenges, control theory may offer a new path forward.
Table 1.
Methods | Input: scRNA-seq | Input: scATAC-seq | Network interactions: signed | Network interactions: weighted | Analysis of regulation dynamics | Default motif database | Implementation | URL
---|---|---|---|---|---|---|---|---
GENIE332/GRNBoost233 | O | X | X | O | X | X | Python and R | https://github.com/aertslab
SINCERITIES79 | O | X | O | X | X | X | R and MATLAB | https://github.com/CABSEL/SINCERITIES
PIDC34 | O | X | X | X | X | X | Julia | https://github.com/Tchanders/NetworkInference.jl
LEAP35 | O | X | O | O | X | X | R | R package LEAP available on CRAN
SCENIC36 | O | X | O | O | X | cisTarget | Python and R | https://scenic.aertslab.org/
SCENIC+37 | O | O | O | O | X | cisTarget | Python and R | https://github.com/aertslab/scenicplus
scMTNI80 | O | O | X | O | X | CIS-BP | C++ | https://github.com/Roy-lab/scMTNI
Pando38 | O | O | O | X | X | CIS-BP | R | https://github.com/quadbio/Pando
CellOracle40 | O | O | O | O | X | CIS-BP | Python | https://github.com/morris-lab/CellOracle
FigR81 | O | O | O | X | X | CIS-BP | R | https://github.com/buenrostrolab/FigR
Dictys39 | O | O | O | O | X | HOCOMOCO | Python | https://github.com/pinellolab/dictys
scTenifoldKnk46 | O | X | O | O | X | X | R and MATLAB | https://github.com/cailab-tamu/scTenifoldKnk
BTR41 | O | X | O | X | O | X | R | https://github.com/cheeyeelim/btr
SCNS42 | O | X | O | X | O | X | F# and Javascript | https://github.com/swoodhouse/SCNS-GUI
SCODE44 | O | X | O | O | O | X | R and Julia | https://github.com/hmatsu1226/SCODE
SCOUP45 | O | X | O | X | O | X | C++ | https://github.com/hmatsu1226/SCOUP
Emerging significance of applying control theory to explore core regulatory circuits and their master regulators
Control theory is a field of study that focuses on system characterization and manipulation. It involves understanding how systems behave and devising methods to achieve desired outcomes through control actions. This discipline has evolved over time to address vital challenges in complex systems. In the early 20th century, Black’s development of the negative feedback amplifier laid the groundwork for feedback control of simple systems47. In the 21st century, Wolkenhauer, Kitano, and Cho introduced the interdisciplinary principles of systems biology, merging control theory with high-throughput technologies for cellular research48. Aligned with this evolution, in 2011, Barabási and Slotine made a significant advancement in complex network control by integrating network science with control theory49. Their works highlighted the future focus on addressing the intricate interplay of elements in a nonlinear, networked system.
Current developments in complex network control theories have evolved in two principal directions: one that focuses on the structural characteristics of networks and another that considers the inherent nonlinear dynamics within these networks. Structure-based approaches rank the most influential regulators in network interactions using centrality metrics such as degree centrality, betweenness centrality, and eigenvector centrality50. This line of work often aims to identify a minimal subset of driver nodes required to direct a network from any given state to a specific desired state51. In contrast, strategies for controlling network dynamics, such as the logical domain of influence (LDOI)52, feedback vertex set (FVS)53, stable motif54, the search for differentially expressed positive circuits (DEPCs)55, and global stabilization analysis56, have been proposed to manage state transitions caused by the network dynamics. These methods not only identify ‘control targets’, defined as a specific set of nodes capable of steering the system towards a set of desired states, but also offer tools for analyzing molecular regulatory mechanisms. For instance, stable motif analysis has been instrumental in uncovering crucial feedback loops that govern processes like the epithelial-to-mesenchymal transition57, leukemia cell-fate decisions54, and the differentiation of helper T cells54. In addition, the LDOI approach has identified key control targets and their influences, and the DEPC method has discovered related positive circuits that stabilize a certain attractor55. While these methods provide solutions for controlling theoretical model systems, some yield suboptimal solutions with unnecessary targets, and others face scalability issues that limit them to very small-scale models.
Furthermore, to our knowledge, there has been no attempt to systematically identify both control targets and the resulting fate-determining paths (or circuits) revealing specific molecular regulatory mechanisms. Thus, identifying optimal control targets (i.e., master regulators) and the resulting core regulatory circuits for real-world cell-fate control remains a significant challenge.
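Among the dynamics-oriented methods listed above, the feedback vertex set is the simplest to state: it is a set of nodes whose removal leaves the network without directed cycles. The sketch below finds a minimum FVS by brute force on a hypothetical four-node graph. The function names and the example wiring are assumptions for illustration, and the exhaustive search also shows why scalability becomes a concern on larger networks.

```python
from itertools import combinations

def has_cycle(nodes, edges):
    """Detect a directed cycle by repeatedly removing nodes with no incoming edge."""
    nodes = set(nodes)
    edges = {(u, v) for u, v in edges if u in nodes and v in nodes}
    while nodes:
        targets = {v for _, v in edges}
        sources = nodes - targets
        if not sources:
            return True  # every remaining node has an incoming edge
        nodes -= sources
        edges = {(u, v) for u, v in edges if u in nodes and v in nodes}
    return False

def minimum_fvs(nodes, edges):
    """Smallest node set whose removal makes the graph acyclic (exhaustive search)."""
    for size in range(len(nodes) + 1):
        for candidate in combinations(sorted(nodes), size):
            if not has_cycle(set(nodes) - set(candidate), edges):
                return set(candidate)

# Hypothetical network with two feedback loops that share node B.
nodes = {"A", "B", "C", "D"}
edges = [("A", "B"), ("B", "A"), ("B", "C"), ("C", "D"), ("D", "B")]
fvs = minimum_fvs(nodes, edges)
print(fvs)
```

In this example the single node B intersects both loops, so it alone forms the minimum FVS. The search space grows combinatorially with network size, so practical tools rely on heuristics or network reduction.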
To address the aforementioned challenge, leveraging the key biological features that play a role in cell-fate determination could be an essential strategy. Many previous experimental studies have already shown that while cell systems are comprised of a complex large network, a core regulatory circuit, composed of only a few key molecules, plays a crucial role in determining cell fates58–61. In particular, such core regulatory circuits are often composed of multiple nested feedback loops, with at least one being a positive feedback loop. For example, in quorum sensing, molecules like N-acyl homoserine lactone (AHL) in bacteria facilitate population-wide communication62; the regulatory feedback between TFs OCT4, SOX2, and NANOG maintains pluripotency in embryonic stem cells63; the double negative feedback loop between PU.1 and GATA1 controls the erythroid versus myeloid lineage commitment64; and the interconnected SNAIL/miR-34 and ZEB/miR-200 feedback loops are key regulatory components of the epithelial–mesenchymal transition (EMT) process in cancer metastasis65 (Fig. 3a). Furthermore, combinations of feedback and feedforward controls can generate biological transitions reliably in noisy environments where the activity of individual components can vary over a range of parameters66. Those previous studies indicate the significance of specific feedback mechanisms within core regulatory circuits in governing crucial cellular processes. This leads to the following question: how can we identify these core regulatory circuits and their master regulators?
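The double-negative feedback motif mentioned above can be illustrated with a generic two-gene mutual-repression model integrated by the forward Euler method. The equations, the Hill coefficient `n`, the maximal production rate `alpha`, and the initial conditions are illustrative assumptions and are not fitted to PU.1/GATA1 or to any other system discussed here.

```python
def simulate_toggle(x0, y0, alpha=4.0, n=2, dt=0.01, steps=5000):
    """Forward-Euler integration of a mutual-repression circuit:
         dx/dt = alpha / (1 + y**n) - x
         dy/dt = alpha / (1 + x**n) - y
    """
    x, y = x0, y0
    for _ in range(steps):
        dx = alpha / (1.0 + y ** n) - x
        dy = alpha / (1.0 + x ** n) - y
        x, y = x + dt * dx, y + dt * dy
    return x, y

# Two initial conditions that differ only in which gene starts ahead.
x_high = simulate_toggle(2.0, 0.1)
y_high = simulate_toggle(0.1, 2.0)
print(x_high, y_high)
```

With these assumed parameters, the same circuit settles into one of two stable states depending on the initial condition. This is the bistability that lets a positive (here, double-negative) feedback loop lock in a lineage choice.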
To computationally resolve this problem, the concept of a network kernel has been introduced, encompassing the ‘kernel’ method for simplifying networks67 and the ‘control kernel’ approach for altering attractor landscapes with minimal regulators68,69. These kernel-based approaches are crucial for focusing on and controlling the core regulatory circuits that govern vital biological phenomena. Similarly, since cellular circuits with positive feedback loops induce multistationary behavior, identifying and analyzing the properties of feedback loops can offer critical clues to determine core regulatory circuits within large networks69–71. Another algorithm, ‘OptiCon’, can identify combination targets using a subgraph based on structural controllability theory, and then describe specific core downstream subnetworks and their crosstalk links that contribute to therapy resistance72. More recently, Rukhlenko et al.73 developed a novel framework called ‘cSTAR’. This method uses omics data to classify cell states and transforms them into mechanistic models. These models consist of a key molecular regulatory network that is instrumental in controlling the attractor landscape that governs cell-fate determination. Moreover, systematic perturbation of the model helps to identify control targets for the desired change in cell fate.
Based on previous studies, we postulate that core regulatory circuits consist of subnetworks interconnected by feedback loops, and that these circuits are the primary drivers of cell fate. Further advancing this concept, we can propose a detailed procedure as follows. First, we can investigate all feedback loops associated with different cellular states of interest (Fig. 3b). Subsequently, through an analysis of network dynamics influenced by feedback loop states, we can prioritize the positive feedback loops that are instrumental in controlling the attractor landscape governing cell-fate determination (Fig. 3c). For this, we can employ previous control methods such as LDOI52, stable motif54, and DEPC55, or improve these methods for further exploration of key subnetworks. As a result, the prioritized feedback loops can be refined to form a core regulatory circuit. Lastly, through systematic perturbation to the model (or core regulatory circuit) or by employing the improved kernel-based methods, we can identify (minimal) master regulators for the desired cell-fate change (Fig. 3d). Together, this control theory-based approach would be a highly promising way to understand and manipulate cell-fate processes, centered around core regulatory circuits, of a dynamic system.
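The first step of the procedure above, listing feedback loops and separating positive from negative ones, can be sketched as a depth-first search on a signed directed graph. A loop is positive when the product of its edge signs is +1. The edge list and the function name `find_feedback_loops` are hypothetical, and enumerating all simple cycles can be very expensive on large networks, which is one motivation for the prioritization step.

```python
def find_feedback_loops(edges):
    """Enumerate simple cycles in a signed directed graph.

    `edges` maps (source, target) -> +1 (activation) or -1 (inhibition).
    Returns a list of (cycle_nodes, sign), where sign is the product of edge signs.
    """
    graph = {}
    for (u, v) in edges:
        graph.setdefault(u, []).append(v)
    loops = []

    def dfs(start, node, path, sign):
        for nxt in graph.get(node, []):
            s = sign * edges[(node, nxt)]
            if nxt == start:
                loops.append((tuple(path), s))
            elif nxt not in path and nxt > start:
                # Only extend through nodes ordered after `start`,
                # so that each cycle is reported exactly once.
                dfs(start, nxt, path + [nxt], s)

    for start in sorted(graph):
        dfs(start, start, [start], 1)
    return loops

# Hypothetical signed network used only for illustration.
edges = {
    ("A", "B"): -1, ("B", "A"): -1,  # double-negative loop (net positive)
    ("B", "C"): +1, ("C", "B"): -1,  # negative loop
    ("C", "C"): +1,                  # positive self-loop
}
loops = find_feedback_loops(edges)
positive = [nodes for nodes, sign in loops if sign > 0]
negative = [nodes for nodes, sign in loops if sign < 0]
print(positive, negative)
```

In a full analysis, the positive loops found this way would then be scored by their dynamical influence, for example with the LDOI or stable motif methods cited above, before being assembled into a candidate core regulatory circuit.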
From this, we integrate the aforementioned advances and propose a comprehensive framework for systematically identifying core regulatory circuits and their master regulators for cell-fate change (Fig. 4). This framework consists of three interconnected components: capturing dynamic information from single-cell omics data, constructing and analyzing a mathematical model of molecular mechanisms in cell-fate determination, and identifying candidate master regulators for a desired cell-fate control (Fig. 4a–c). To introduce our framework concretely, we provide an illustrative example as follows.
Illustrative example: identifying control targets for cancer reversion by using single-cell RNA-sequencing data from lung cancer samples
Traditional anticancer therapies have focused on removing cancer cells, but their effectiveness is limited due to the inevitable emergence of resistance, which arises as a consequence of cancer cell plasticity. Cancer plasticity is an emerging hallmark of cancer cells, and it plays a crucial role in cancer initiation and progression, as well as adaptation to therapy and intra-tumoral heterogeneity74–76. Hence, recent research has shifted focus towards targeting highly plastic cancer cells emerging during tumorigenesis, aiming to inhibit their development.
This illustrative example shows the emerging concept of ‘cancer reversion’, which seeks to transform highly plastic lung cancer cells back into normal cells, instead of merely eliminating cancer cells. Recent studies have indicated that cells with high plasticity emerge during mouse lung tumorigenesis upon introduction of KRAS mutations into AT2 cells, which are typically considered the origin of lung cancer. These cells lose their original characteristics and transition through AT1/AT2-like states, eventually diversifying into various cancer cell types such as EMT-type, embryonic liver-like, GI-like, and high-cycling cells, thereby increasing tumor heterogeneity (Fig. 5a).
In this illustrative example, we begin with obtaining time-series scRNA-seq data collected during lung tumorigenesis from AT2 cell state to high-plasticity cell state (HPCS) from a public repository77. This dataset provides single-cell transcriptomic information for ~2200 cells. The proposed framework entails preprocessing, dimensionality reduction, and clustering to annotate cell identities at different stages during tumorigenesis. Then, it conducts pseudotime ordering to delineate the trajectory from AT2 cells, through intermediate states, to HPCS cells, providing a comprehensive map of cellular evolution during lung tumorigenesis (Fig. 5b).
The proposed framework identifies critical TFs that can regulate gene expression changes along the lung tumorigenesis trajectory by comparing the activities of TFs among the distinct cell clusters. Then, the interactions among these TFs can be inferred, resulting in a GRN structure consisting of 105 nodes and 304 links (Fig. 5c). Next, to explore essential regulatory mechanisms associated with specific phenotypes, the framework can investigate all positive feedback loops showing differential activation between AT2 and HPCS states by evaluating their TF activities. By aggregating those feedback loops, the framework generates a reduced GRN, consisting of 23 nodes and 74 links, which contains key dynamical information (Fig. 5d). Intriguingly, TFs within the same state (e.g., AT2 or HPCS) are primarily interconnected through mutually activating positive feedback, whereas TFs across distinct states (e.g., AT2 versus HPCS) engage predominantly through mutually inhibitory interactions.
With the GRN structure and trajectory information, a Boolean logic model can be constructed to simulate the tumorigenesis process. This process involves discretizing the continuous expression values of each TF along the pseudotime trajectory (employing clustering methods like k-means). Subsequently, the influence of each TF in the GRN structure can be determined through Boolean logic functions, utilizing the QM algorithm (Fig. 5e). Using the Boolean network model, the framework can pinpoint the feedback loops of the most dynamic significance (in this example, stable motifs). These feedback loops and their corresponding LDOIs have the potential to stabilize either phenotype when modulated. Then the framework can assess the influence of the LDOIs on AT2 and HPCS modules, and prioritize key feedback loops based on their relevance to either phenotype (Fig. 5f). The most influential feedback loops can form a core regulatory circuit, consisting of Rel, Irf1, Irf7, Fosl1, Myc, and Relb. Once the core regulatory circuit is stabilized, it can fix the values of most TFs within the respective AT2 and HPCS modules.
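The discretization step described above can be sketched with a one-dimensional k-means using two clusters, which assigns each expression value to a low (0) or high (1) state. The expression values below are made up, and the function name `binarize_two_means` and the initialization at the minimum and maximum are assumptions of this sketch rather than details reported for the framework.

```python
def binarize_two_means(values, n_iter=100):
    """Split 1-D values into low (0) and high (1) groups with k-means, k = 2."""
    lo, hi = min(values), max(values)
    if lo == hi:
        return [0] * len(values)
    labels = []
    for _ in range(n_iter):
        # Assign each value to the nearer of the two centroids.
        labels = [int(abs(v - hi) < abs(v - lo)) for v in values]
        low_group = [v for v, lab in zip(values, labels) if lab == 0]
        high_group = [v for v, lab in zip(values, labels) if lab == 1]
        new_lo = sum(low_group) / len(low_group)
        new_hi = sum(high_group) / len(high_group)
        if (new_lo, new_hi) == (lo, hi):
            break  # centroids stopped moving
        lo, hi = new_lo, new_hi
    return labels

# Hypothetical TF activity along pseudotime (made-up values).
expression = [0.1, 0.2, 0.15, 0.9, 1.1, 1.0]
states = binarize_two_means(expression)
print(states)
```

The resulting 0/1 sequences along pseudotime are the kind of input from which Boolean update rules can then be fitted and minimized, for instance with the QM algorithm.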
The proposed framework then leverages information on core regulatory circuits to find control targets that can inhibit the HPCS module and activate the AT2 module. The LDOIs for individual nodes and pairs of nodes are calculated, and each LDOI is then checked to see whether it can fix the influential feedback loops in the desired state, thereby identifying control targets. For example, a combination of Fosl1 (or Myc) and Nfkb2 can be identified as master regulators that ensure the stable activation of the AT2 module and the deactivation of the HPCS module (Fig. 5g). Inhibition of Fosl1 can deactivate HPCS TF activities while having no impact on AT2 TFs. On the other hand, disruption of Nfkb2 can de-repress AT2 TF activities in the HPCS state, activating downstream regulators and leading to the re-emergence of the AT2 state. Combined inhibition of Fosl1 and Nfkb2 can destabilize the HPCS state and enable a stable activation of the AT2 module. This combinatorial inhibition strategy can, therefore, effectively disrupt the positive feedback within the HPCS module while also neutralizing its negative impact on the AT2 module. The result is a stable shift from a highly plastic cancerous state back to a normal-like AT2 state. Moreover, the role of key molecules in plastic cancer cells can be illustrated using the landscape concept (Fig. 5h). In these cells, Fosl1 and Nfkb2 are highly active, stabilizing the HPCS while repressing the normal AT2 module. This ensures the persistence of the HPCS. However, to revert HPCS back to normal, merely inhibiting either Fosl1 or Nfkb2 is insufficient, as each alone does not fully shift the landscape towards the AT2 phenotype: either the positive feedback of Fosl1 maintains the HPCS state, or the remaining activity of Nfkb2 suppresses the AT2 module. Only the simultaneous inhibition of both Fosl1 and Nfkb2 can significantly alter the landscape, favoring a shift towards the AT2 phenotype.
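The logic of combinatorial control can be illustrated with a toy Boolean model in which candidate nodes are clamped and the network is updated until it settles. The five-node wiring below (a self-sustaining module H1/H2, a repressor R, and a target module N1/N2) is entirely hypothetical. It is not the model inferred from the lung tumorigenesis data, and it was constructed only to show how single and paired perturbations can be compared.

```python
# Hypothetical update rules (illustrative wiring only).
RULES = {
    "H1": lambda s: s["H2"],                          # H1 and H2 sustain each other
    "H2": lambda s: s["H1"],
    "R": lambda s: s["R"],                            # self-sustaining repressor
    "N1": lambda s: int(not s["R"] and not s["H1"]),  # repressed by R and by H1
    "N2": lambda s: s["N1"],
}

def settle(state, fixed, max_steps=50):
    """Synchronously update until a fixed point is reached, clamping `fixed` nodes."""
    state = {**state, **fixed}
    for _ in range(max_steps):
        nxt = {node: int(rule(state)) for node, rule in RULES.items()}
        nxt.update(fixed)
        if nxt == state:
            return state
        state = nxt
    return None  # no fixed point found within max_steps (e.g., an oscillation)

start = {"H1": 1, "H2": 1, "R": 1, "N1": 0, "N2": 0}
only_h = settle(start, {"H1": 0})
only_r = settle(start, {"R": 0})
both = settle(start, {"H1": 0, "R": 0})
print(only_h, only_r, both)
```

In this toy model, clamping H1 alone switches off the H module but leaves N repressed by R, and clamping R alone leaves the H module active. Only the paired perturbation switches on N1 and N2, which mirrors the kind of screening over single nodes and node pairs described above.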
In summary, this example illustrates how systems biology can be applied to understand and manipulate cell-fate transitions in cancer. By integrating mathematical modeling with experimental data, we can identify crucial regulatory mechanisms that reverse cell states, offering new insights and potential therapeutic strategies in cancer treatment. Although this example illustrates the whole process proposed for inducing cancer reversion from public single-cell data, limitations are noted. Cell-fate determination is a complex phenomenon that encompasses changes not just at the transcriptomic level but also across various molecular levels. The continuous development of technologies that utilize single-cell multiomics data, which integrate transcriptomic, proteomic, and epigenetic data, has been pivotal. The use of such data can significantly enhance the proposed framework. In particular, the recent increase in single-cell multiomics data, combining scRNA-seq and scATAC-seq, has been noteworthy. scATAC-seq allows for the pruning of indirect regulation information from functional regulatory relationships obtained from transcriptomic data by providing information on the open chromatin regions at promoter sites. This enables the construction of more accurate GRN structures and, consequently, the development of more precise mathematical models. Furthermore, factors like the physiological state of a cell, not directly included in omics data, can significantly influence cell fate78. Although our framework does not directly incorporate such state information, it can be indirectly reflected through TF activity within the GRN structure and its mathematical model. The future availability of technologies for simultaneous measurement of the physiological and molecular states of a cell promises the development of models that integrate these dimensions for a comprehensive understanding.
Conclusions
Among the various molecules within a cell, only a few key molecules have a significant impact on cell fate. What distinguishes these master regulators from other molecules? How can we identify those master regulators, and through what molecular regulatory mechanisms do they induce cell-fate changes? To answer these questions, it is necessary to analyze and understand the behavior of a huge molecular regulatory network within a cell. In the past, such attempts were limited due to experimental constraints. Yet, recent advances in single-cell omics technologies, along with a decade of progress in network control technologies, enable us to answer these fundamental questions and usher in a renaissance for systems biology. The conceptual framework we introduce can offer an unprecedented opportunity for cell fate control by integrating the latest technological innovations into a comprehensive, novel strategy.
Acknowledgements
The authors thank Corbin Hopper for his critical reading and comments. This work was supported by the National Research Foundation of Korea (NRF) grants funded by the Korean government, the Ministry of Science and ICT (2023R1A2C3002619 and 2021M3A9I4024447 (Bio & Medical Technology Development Program)).
Author contributions
All authors contributed to the discussion of the content, writing, reviewing, and editing the manuscript. K.-H.C. conceived the idea, designed the project, and supervised the study.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Jonghoon Lee, Namhee Kim.
References
- 1. Moris N, Pina C, Arias AM. Transition states and cell fate decisions in epigenetic landscapes. Nat. Rev. Genet. 2016;17:693–703. doi: 10.1038/nrg.2016.98.
- 2. Takahashi K, Yamanaka S. Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell. 2006;126:663–676. doi: 10.1016/j.cell.2006.07.024.
- 3. Panciera T, et al. Induction of expandable tissue-specific stem/progenitor cells through transient expression of YAP/TAZ. Cell Stem Cell. 2016;19:725–737. doi: 10.1016/j.stem.2016.08.009.
- 4. Dow LE, et al. Apc restoration promotes cellular differentiation and reestablishes crypt homeostasis in colorectal cancer. Cell. 2015;161:1539–1552. doi: 10.1016/j.cell.2015.05.033.
- 5. Shin SY, et al. Functional roles of multiple feedback loops in extracellular signal-regulated kinase and Wnt signaling pathways that regulate epithelial-mesenchymal transition. Cancer Res. 2010;70:6715–6724. doi: 10.1158/0008-5472.CAN-10-1377.
- 6. Hong JY, et al. Computational modeling of apoptotic signaling pathways induced by cisplatin. BMC Syst. Biol. 2012;6:122. doi: 10.1186/1752-0509-6-122.
- 7. Park SG, et al. The influence of the signal dynamics of activated form of IKK on NF-kappaB and anti-apoptotic gene expressions: a systems biology approach. FEBS Lett. 2006;580:822–830. doi: 10.1016/j.febslet.2006.01.004.
- 8. Lee HS, Hwang CY, Shin SY, Kwon KS, Cho KH. MLK3 is part of a feedback mechanism that regulates different cellular responses to reactive oxygen species. Sci. Signal. 2014;7:ra52. doi: 10.1126/scisignal.2005260.
- 9. Waddington CH. The strategy of the genes. Routledge; 2014.
- 10. Wang J, Zhang K, Xu L, Wang E. Quantifying the Waddington landscape and biological paths for development and differentiation. Proc. Natl. Acad. Sci. USA. 2011;108:8257–8262. doi: 10.1073/pnas.1017017108.
- 11. Zhang J, Nie Q, Zhou T. Revealing dynamic mechanisms of cell fate decisions from single-cell transcriptomic data. Front. Genet. 2019;10:1280. doi: 10.3389/fgene.2019.01280.
- 12. Joo JI, Zhou JX, Huang S, Cho K-H. Determining relative dynamic stability of cell states using boolean network model. Sci. Rep. 2018;8:12077. doi: 10.1038/s41598-018-30544-0.
- 13. Hwang B, Lee JH, Bang D. Single-cell RNA sequencing technologies and bioinformatics pipelines. Exp. Mol. Med. 2018;50:1–14. doi: 10.1038/s12276-018-0071-8.
- 14. Satija R, Farrell JA, Gennert D, Schier AF, Regev A. Spatial reconstruction of single-cell gene expression data. Nat. Biotechnol. 2015;33:495–502. doi: 10.1038/nbt.3192.
- 15. Wolf FA, Angerer P, Theis FJ. SCANPY: large-scale single-cell gene expression data analysis. Genome Biol. 2018;19:15. doi: 10.1186/s13059-017-1382-0.
- 16. Amezquita RA, et al. Orchestrating single-cell analysis with Bioconductor. Nat. Methods. 2020;17:137–145. doi: 10.1038/s41592-019-0654-x.
- 17. Jolliffe IT, Cadima J. Principal component analysis: a review and recent developments. Philos. Trans. A Math. Phys. Eng. Sci. 2016;374:20150202. doi: 10.1098/rsta.2015.0202.
- 18. Van der Maaten L, Hinton G. Visualizing data using t-SNE. J. Mach. Learn. Res. 2008;9:2579–2605.
- 19. McInnes L, Healy J, Melville J. UMAP: uniform manifold approximation and projection for dimension reduction. arXiv. 2018. https://arxiv.org/abs/1802.03426.
- 20. Traag VA, Waltman L, Van Eck NJ. From Louvain to Leiden: guaranteeing well-connected communities. Sci. Rep. 2019;9:5233. doi: 10.1038/s41598-019-41695-z.
- 21. Ester M, Kriegel H-P, Sander J, Xu X. A density-based algorithm for discovering clusters in large spatial databases with noise. In KDD. 1996;96:226–231.
- 22. Guo M, Wang H, Potter SS, Whitsett JA, Xu Y. SINCERA: a pipeline for single-cell RNA-seq profiling analysis. PLoS Comput. Biol. 2015;11:e1004575. doi: 10.1371/journal.pcbi.1004575.
- 23. Trapnell C, et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 2014;32:381–386. doi: 10.1038/nbt.2859.
- 24. La Manno G, et al. RNA velocity of single cells. Nature. 2018;560:494–498. doi: 10.1038/s41586-018-0414-6.
- 25. Gayoso A, et al. Deep generative modeling of transcriptional dynamics for RNA velocity analysis in single cells. Nat. Methods. 2024;21:50–59. doi: 10.1038/s41592-023-01994-w.
- 26. Street K, et al. Slingshot: cell lineage and pseudotime inference for single-cell transcriptomics. BMC Genomics. 2018;19:1–16. doi: 10.1186/s12864-018-4772-0.
- 27. Wolf FA, et al. PAGA: graph abstraction reconciles clustering with trajectory inference through a topology preserving map of single cells. Genome Biol. 2019;20:9. doi: 10.1186/s13059-019-1663-x.
- 28. Herman JS, Sagar N, Gruen D. FateID infers cell fate bias in multipotent progenitors from single-cell RNA-seq data. Nat. Methods. 2018;15:379–386. doi: 10.1038/nmeth.4662.
- 29. Badia IMP, et al. Gene regulatory network inference in the era of single-cell multi-omics. Nat. Rev. Genet. 2023;24:739–754. doi: 10.1038/s41576-023-00618-5.
- 30. Qiu X, et al. Inferring causal gene regulatory networks from coupled single-cell expression dynamics using scribe. Cell Syst. 2020;10:265–274.e211. doi: 10.1016/j.cels.2020.02.003.
- 31. Greenfield A, Madar A, Ostrer H, Bonneau R. DREAM4: combining genetic and dynamic information to identify biological networks and dynamical models. PLoS One. 2010;5:e13397. doi: 10.1371/journal.pone.0013397.
- 32. Huynh-Thu VA, Irrthum A, Wehenkel L, Geurts P. Inferring regulatory networks from expression data using tree-based methods. PLoS One. 2010;5:e12776. doi: 10.1371/journal.pone.0012776.
- 33. Moerman T, et al. GRNBoost2 and Arboreto: efficient and scalable inference of gene regulatory networks. Bioinformatics. 2019;35:2159–2161. doi: 10.1093/bioinformatics/bty916.
- 34. Chan TE, Stumpf MPH, Babtie AC. Gene regulatory network inference from single-cell data using multivariate information measures. Cell Syst. 2017;5:251–267.e253. doi: 10.1016/j.cels.2017.08.014.
- 35. Specht AT, Li J. LEAP: constructing gene co-expression networks for single-cell RNA-sequencing data using pseudotime ordering. Bioinformatics. 2017;33:764–766. doi: 10.1093/bioinformatics/btw729.
- 36. Aibar S, et al. SCENIC: single-cell regulatory network inference and clustering. Nat. Methods. 2017;14:1083–1086. doi: 10.1038/nmeth.4463.
- 37. Bravo Gonzalez-Blas C, et al. SCENIC+: single-cell multiomic inference of enhancers and gene regulatory networks. Nat. Methods. 2023;20:1355–1367. doi: 10.1038/s41592-023-01938-4.
- 38. Fleck JS, et al. Inferring and perturbing cell fate regulomes in human brain organoids. Nature. 2023;621:365–372. doi: 10.1038/s41586-022-05279-8.
- 39. Wang L, et al. Dictys: dynamic gene regulatory network dissects developmental continuum with single-cell multiomics. Nat. Methods. 2023;20:1368–1378. doi: 10.1038/s41592-023-01971-3.
- 40. Kamimoto K, et al. Dissecting cell identity via network inference and in silico gene perturbation. Nature. 2023;614:742–751. doi: 10.1038/s41586-022-05688-9.
- 41. Lim CY, et al. BTR: training asynchronous Boolean models using single-cell expression data. BMC Bioinformatics. 2016;17:355. doi: 10.1186/s12859-016-1235-y.
- 42. Woodhouse S, Piterman N, Wintersteiger CM, Gottgens B, Fisher J. SCNS: a graphical tool for reconstructing executable regulatory networks from single-cell genomic data. BMC Syst. Biol. 2018;12:59. doi: 10.1186/s12918-018-0581-y.
- 43. Luo S, Wang Z, Zhang Z, Zhou T, Zhang J. Genome-wide inference reveals that feedback regulations constrain promoter-dependent transcriptional burst kinetics. Nucleic Acids Res. 2023;51:68–83. doi: 10.1093/nar/gkac1204.
- 44. Matsumoto H, et al. SCODE: an efficient regulatory network inference algorithm from single-cell RNA-Seq during differentiation. Bioinformatics. 2017;33:2314–2321. doi: 10.1093/bioinformatics/btx194.
- 45. Matsumoto H, Kiryu H. SCOUP: a probabilistic model based on the Ornstein-Uhlenbeck process to analyze single-cell expression data during differentiation. BMC Bioinformatics. 2016;17:232. doi: 10.1186/s12859-016-1109-3.
- 46. Osorio D, et al. scTenifoldKnk: an efficient virtual knockout tool for gene function predictions via single-cell gene regulatory network perturbation. Patterns. 2022;3:100434.
- 47. Black HS. Stabilized feedback amplifiers. Bell Syst. Tech. J. 1934;13:1–18. doi: 10.1002/j.1538-7305.1934.tb00652.x.
- 48. Wolkenhauer O, Kitano H, Cho K-H. Systems biology. IEEE Control Syst. Mag. 2003;23:38–48. doi: 10.1109/MCS.2003.1213602.
- 49. Liu Y-Y, Slotine J-J, Barabási A-L. Controllability of complex networks. Nature. 2011;473:167–173. doi: 10.1038/nature10011.
- 50. Valente TW, Coronges K, Lakon C, Costenbader E. How correlated are network centrality measures? Connect (Tor.) 2008;28:16–26.
- 51. Liu YY, Slotine JJ, Barabasi AL. Control centrality and hierarchical structure in complex networks. PLoS One. 2012;7:e44459. doi: 10.1371/journal.pone.0044459.
- 52. Yang G, Gomez Tejeda Zanudo J, Albert R. Target control in logical models using the domain of influence of nodes. Front. Physiol. 2018;9:454. doi: 10.3389/fphys.2018.00454.
- 53. Mochizuki A, Fiedler B, Kurosawa G, Saito D. Dynamics and control at feedback vertex sets. II: a faithful monitor to determine the diversity of molecular activities in regulatory networks. J. Theor. Biol. 2013;335:130–146. doi: 10.1016/j.jtbi.2013.06.009.
- 54. Zanudo JG, Albert R. Cell fate reprogramming by control of intracellular network dynamics. PLoS Comput. Biol. 2015;11:e1004193. doi: 10.1371/journal.pcbi.1004193.
- 55. Crespo I, Perumal TM, Jurkowski W, del Sol A. Detecting cellular reprogramming determinants by differential stability analysis of gene regulatory networks. BMC Syst. Biol. 2013;7:140. doi: 10.1186/1752-0509-7-140.
- 56. Yang J-M, Lee C-K, Cho K-H. Robust stabilizing control of perturbed biological networks via coordinate transformation and algebraic analysis. IEEE Trans. Neural Netw. Learn. Syst. 2022.
- 57. Steinway SN, et al. Network modeling of TGFbeta signaling in hepatocellular carcinoma epithelial-to-mesenchymal transition reveals joint sonic hedgehog and Wnt pathway activation. Cancer Res. 2014;74:5963–5977. doi: 10.1158/0008-5472.CAN-14-0225.
- 58. Choi SR, Hwang CY, Lee J, Cho KH. Network analysis identifies regulators of basal-like breast cancer reprogramming and endocrine therapy vulnerability. Cancer Res. 2022;82:320–333. doi: 10.1158/0008-5472.CAN-21-0621.
- 59. Kim N, Hwang CY, Kim T, Kim H, Cho KH. A cell-fate reprogramming strategy reverses epithelial-to-mesenchymal transition of lung cancer cells while avoiding hybrid states. Cancer Res. 2023;83:956–970. doi: 10.1158/0008-5472.CAN-22-1559.
- 60. An S, et al. Inhibition of 3-phosphoinositide-dependent protein kinase 1 (PDK1) can revert cellular senescence in human dermal fibroblasts. Proc. Natl. Acad. Sci. USA. 2020;117:31535–31546. doi: 10.1073/pnas.1920338117.
- 61. Seo CH, Kim JR, Kim MS, Cho KH. Hub genes with positive feedbacks function as master switches in developmental gene regulatory networks. Bioinformatics. 2009;25:1898–1904. doi: 10.1093/bioinformatics/btp316.
- 62. Kumar L, et al. Molecular mechanisms and applications of N-acyl homoserine lactone-mediated quorum sensing in bacteria. Molecules. 2022;27. doi: 10.3390/molecules27217584.
- 63. Chickarmane V, Troein C, Nuber UA, Sauro HM, Peterson C. Transcriptional dynamics of the embryonic stem cell switch. PLoS Comput. Biol. 2006;2:e123. doi: 10.1371/journal.pcbi.0020123.
- 64. Xu J, Orkin SH. The erythroid/myeloid lineage fate paradigm takes a new player. EMBO J. 2011;30:983–985. doi: 10.1038/emboj.2011.45.
- 65. Lu M, Jolly MK, Levine H, Onuchic JN, Ben-Jacob E. MicroRNA-based regulation of epithelial-hybrid-mesenchymal fate determination. Proc. Natl. Acad. Sci. USA. 2013;110:18144–18149. doi: 10.1073/pnas.1318192110.
- 66. Hornung G, Barkai N. Noise propagation and signaling sensitivity in biological networks: a role for positive feedback. PLoS Comput. Biol. 2008;4:e8. doi: 10.1371/journal.pcbi.0040008.
- 67. Kim JR, et al. Reduction of complex signaling networks to a representative kernel. Sci. Signal. 2011;4:ra35. doi: 10.1126/scisignal.2001390.
- 68. Kim J, Park SM, Cho KH. Discovery of a kernel for controlling biomolecular regulatory networks. Sci. Rep. 2013;3:2223. doi: 10.1038/srep02223.
- 69. An S, et al. Global stabilizing control of large-scale biomolecular regulatory networks. Bioinformatics. 2023;39:btad045.
- 70. Kwon YK, Cho KH. Boolean dynamics of biological networks with multiple coupled feedback loops. Biophys. J. 2007;92:2975–2981. doi: 10.1529/biophysj.106.097097.
- 71. Deritei D, Rozum J, Ravasz Regan E, Albert R. A feedback loop of conditionally stable circuits drives the cell cycle from checkpoint to checkpoint. Sci. Rep. 2019;9:16430. doi: 10.1038/s41598-019-52725-1.
- 72. Hu Y, et al. Optimal control nodes in disease-perturbed networks as targets for combination therapy. Nat. Commun. 2019;10:2180. doi: 10.1038/s41467-019-10215-y.
- 73. Rukhlenko OS, et al. Control of cell state transitions. Nature. 2022;609:975–985. doi: 10.1038/s41586-022-05194-y.
- 74. Torborg SR, Li ZX, Chan JE, Tammela T. Cellular and molecular mechanisms of plasticity in cancer. Trends Cancer. 2022;8:735–746. doi: 10.1016/j.trecan.2022.04.007.
- 75. Barkley D, Rao A, Pour M, Franca GS, Yanai I. Cancer cell states and emergent properties of the dynamic tumor system. Genome Res. 2021;31:1719–1727. doi: 10.1101/gr.275308.121.
- 76. Hanahan D. Hallmarks of cancer: new dimensions. Cancer Discov. 2022;12:31–46. doi: 10.1158/2159-8290.CD-21-1059.
- 77. Marjanovic ND, et al. Emergence of a high-plasticity cell state during lung cancer evolution. Cancer Cell. 2020;38:229–246.e213. doi: 10.1016/j.ccell.2020.06.012.
- 78. Zhu J, Chu P, Fu X. Unbalanced response to growth variations reshapes the cell fate decision landscape. Nat. Chem. Biol. 2023;19:1097–1104. doi: 10.1038/s41589-023-01302-9.
- 79. Papili Gao N, Ud-Dean SMM, Gandrillon O, Gunawan R. SINCERITIES: inferring gene regulatory networks from time-stamped single cell transcriptional expression profiles. Bioinformatics. 2018;34:258–266. doi: 10.1093/bioinformatics/btx575.
- 80. Zhang S, et al. Inference of cell type-specific gene regulatory networks on cell lineages from single cell omic datasets. Nat. Commun. 2023;14:3064. doi: 10.1038/s41467-023-38637-9.
- 81. Kartha VK, et al. Functional inference of gene regulation using single-cell multi-omics. Cell Genom. 2022;2:100166.