Abstract
Elucidation of the genomic mechanism that guides the cell-fate change is one of the fundamental issues of biology. We previously demonstrated that whole genome expression is coordinated by the emergence of a critical point at both the cell-population and single-cell levels through the physical principle of self-organized criticality. In this paper, we further examine the genomic mechanism that determines the cell-fate changes from embryo to cancer development. The state of the critical point, acting as the organizing center of the cell fate, determines whether the genome resides in a super- or sub-critical state. In the super-critical state, a specific stochastic perturbation can spread over the entire system through the “genome engine”, an autonomous critical-control genomic system, whereas in the sub-critical state, the perturbation remains at a local level. The cell-fate changes when the genome becomes super-critical. We provide a consistent framework to develop a time-evolutional transition theory for the biological regulation of the cell-fate change.
Keywords: genome expression, cell fate decision, self-organized criticality (SOC), critical point, genome engine, genome attractor, biological regulation, transition theory
1. Introduction
A mature mammalian somatic cell can reprogram its state and consequently acquire a very different gene-expression profile through exposure to a few reprogramming stimuli [1]. Such a drastic state change involves the coherent on/off switching of thousands of functionally heterogeneous genes [2]. However, there are fundamental physical difficulties in achieving such large-scale coordinated control on a gene-by-gene basis. These difficulties become more evident in a situation where there is (1) a lack of a sufficient number of molecules to reach a stable thermodynamic state (i.e., breakdown of the central limit theorem) and (2) a consequent stochastic noise due to the low copy numbers of specific gene mRNAs, which induces a substantial instability in genetic product concentrations [3,4].
In our previous studies [5,6,7,8,9,10], we demonstrated that the self-organization of whole-genome expression constitutes a “physically motivated” alternative to gene-specific regulation at both the population and single-cell levels. The mechanism of self-organization, through global genome reprogramming, eliminates the need for physically unfeasible gene-by-gene control of expression.
The core of the self-organization mechanism is the presence of massive system changes elicited by “apparently minor” external causes. To address this problem, Bak and colleagues [11] proposed self-organized criticality (SOC; the Bak–Tang–Wiesenfeld sandpile model). SOC is a general theory of complexity that describes self-organization and emergent order in non-equilibrium systems (thermodynamically open systems). Self-organization occurs at the edge between order and chaos, where the system lies on a “marginally stable” attractor oscillating around an equilibrium position in response to continuous (small) microenvironment perturbations and eventually an “effective” perturbation can spread over the entire system to cause a phase transition [12,13,14,15,16].
SOC builds upon the fact that stochastic perturbations in the great majority of cases propagate locally (i.e., sub-critical state, small avalanches in the sandpile metaphor [11]); however, due to the particularity of the disturbance, the perturbation can spread over the entire system in a highly cooperative manner (i.e., super-critical state, domino effect). As the system approaches its critical point, global behavior emerges in a self-organized manner. The coordinated character of the process stems from the so-called “domino effect” present in all biological signaling processes (e.g., for an allosteric effect, see [17]) where microscopic local effects generalize to the entire system, spreading along “preferential pathways”.
The above-depicted classical concept of SOC has been extended to the cell-fate decision (critical-like self-organization or rapid SOC) through the extension of minimalist models of cellular behavior [18]. The cell-fate decision-making model considers gene regulatory networks to adopt an exploratory process, where diverse the cell-fate options are generated by the priming of various transcriptional programs. In this manuscript, “fate” refers to any stable phenotypic condition of the system. As a result, a cell-fate gene module is selectively amplified as the network system approaches a critical state. Such amplification corresponds to the emergence of the long-range activation/deactivation of genes across the entire genome.
We have adapted the SOC paradigm for the cell-fate decision and investigated whole-genome expression and its dynamics to address the following fundamental questions:
-
-
Is there any underlying principle that self-regulates the time evolution of whole-genome expression?
-
-
Can we identify a peculiar genome region that guides the super-critical genome and determines the cell-fate change?
-
-
Can we delineate a universal mechanism to understand the processes of the cell-fate change, and to further comprehend when and how it occurs?
Our previous studies (see details in [7]) from embryo to cancer development have demonstrated that whole-genome expression is dynamically self-organized through the emergence of a critical point, with the co-existence of three distinct response domains (critical states). Furthermore, dynamic pictures of between-state flux provide a potential universal mechanism of self-organization interpreted in terms of a “genome engine”. An autonomous critical-control genomic system is developed through the highly coherent behavior of low-variance genes (sub-critical state), which in turn, generates a dominant cyclic expression flux with high-variance genes (super-critical state) through the nuclear environment of the cell. This is evident given that gene expression is sorted and grouped according to the temporal variance of expression (nrmsf: normalized root mean square fluctuation), which acts as an order parameter for the self-organization of whole-genome expression [6,7]. On the contrary, randomly shuffled gene expression exhibits no evidence of cooperative behavior (no emergence of coordinated activity).
In this report, we further investigate time-series of whole-genome expression data for the cell-fate change: cell differentiation (heregulin (HRG)-stimulated MCF-7 human breast cancer cells compared with non-differentiated epidermal growth factor (EGF)-stimulated MCF-7 cells [19,20]; all-trans retinoic acid (atRA)- and dimethyl sulfoxide (DMSO)-stimulated HL-60 human leukemia cells differentiated to neutrophil cells [21]; differentiation of T helper 17 (Th17) cells from naïve T helper (Th0) cells [22]) and reprogramming (mouse and human early embryo development, [23] and [24], respectively). It is worth noting that these cases must be considered examples of a general mechanism and have been selected based on the availability of a sufficient number of experimental time points for the nrmsf value.
We update our previous findings by elucidating both the specific genome region (i.e., critical point, CP) for critical transition and an underlying mechanism that controls the cell-fate change. Our report is organized as follows (Figure 1):
(1) The CP corresponds to the center of mass (CM(tj): the average value of whole-genome expression at t = tj) (Section 2.1). (2) The dynamics of the center of mass of any stochastic expression converge to that of whole-genome expression (i.e., CM(tj)). This shows that the CP is the genome attractor (Section 2.2). (3) The switching singular behaviors of the CP transform the genome into a super-critical state (i.e., super-critical genome). This in turn induces a “global expression avalanche”, which is revealed by (4) the probability density function (PDF) of whole-genome expression. (5) Lastly, a cell-fate change occurs after the genome becomes a “super-critical genome” (Section 2.3, Section 2.4, Section 2.5 and Section 2.6) and after the genome passes over a stable point (non-equilibrium fixed point) of the thermodynamically open system. This passing indicates symmetry-breaking which induces coherent perturbation of the genome engine (Section 3.1).
These five points constitute a framework for developing a time-evolutional transition theory of biological regulation.
2. Results
2.1. Fixed Critical Point: A Specific Group of Genes Corresponding to the Center of Mass of Whole-Genome Expression
The existence of a critical point (CP) is essential for determining distinct response domains (critical states) [7]. To generate a unified model of biological regulation, we must go in depth into specific features of the CP in terms of “sandpile criticality” (Figure 2A). Sandpile criticality emerges when whole-genome expression is sorted and grouped according to the fold-change in expression between two different time points (e.g., between t = 0 and t = 10 min). For the same groupings, the nrmsf value (Section 4.2.1) of the CP in the HRG-stimulated MCF-7 cancer cells (population level) can be estimated (ln<nrmsf> ~ −2.5: Figure 2B).
Our study of HRG-stimulated MCF-7 cancer cells demonstrated that the temporal group correlation (between-group correlation) along the order parameter (nrmsf) reveals a focal point (FP) (see Figure 5B in [6]) when we consider the center of mass (CM) of whole-genome expression (changing in time) as a reference expression point. The grouping (baseline as CM(tj)) according to the degree of nrmsf is called CM grouping, (kth group; k = 1,2,…, K; see Figure 1), where grouping from the CM is distinguished from that of non-reference, .
Notably, as shown in Figure 2C, the CP corresponds to the zero-expression point in the CM grouping, which explains why the CP is a specific set of genes corresponding to the CM(tj) and the CP exists as an almost-fixed point in terms of the order parameter nrmsf. This feature holds for both single-cell and population data (Figure 2D–I: refer to the natural log of nrmsf value for different biological regulations). Therefore, we can develop correlation metrics based on the CM grouping (called CM correlation: Section 4.2.2) to grasp how whole-genome expression can be self-organized through the critical/singular behavior of the CP.
Zimatore et al. [10] demonstrated that CP gene expression (Table S1) corresponds to attractor-specific values with almost no influence of fluctuation modes, consistent with their zero-expression point in CM grouping.
2.2. Critical Point Acting as the Genome Attractor: Mechanism for Genome-Wide Avalanche
Genome expression exhibits coherent-stochastic behavior (CSB) in which coherent behavior emerges from an ensemble of stochastic expression [5,6,7,8]. In CSB mode, gene expression is inherently stochastic but its dynamics follow the center of mass (CM) of expression. Interestingly, CSB is evident at both the whole-genome expression level (Figure 3) and at a specific critical-state level (Figure 3b, Figure 3D in [8]).
To give a proof-of-concept of CSB, we performed a bootstrap simulation to capture two basic signatures of CSB:
-
(1)
The stochastic behavior of gene expression shows (relatively low) correlation convergence for randomly selected gene ensembles as the number of elements (n) is increased (Figure 3A).
-
(2)
The CM of randomly selected gene ensembles (with hundreds of repetitions for the gene ensembles) must dynamically converge to that of whole-genome expression as the number of elements (n) is increased (Figure 3B).
This happens at both the population and single-cell levels (Figure 3C,D). The existence of a threshold n at around 50 randomly picked genes [6,25] allows us to reproduce CSB with a random choice of N genes with N > n, which is further proof of the reliability of CSB. The constancy of the minimum number of randomly selected genes to reach convergence (Figure 3) is remarkable and suggests the presence of a sort of “percolation threshold” reminiscent of the size of the genetic networks operating in the system.
This convergence clearly reveals that the dynamics of the CM of genome expression describe an attractor of the dynamics of stochastic expression (see also Section 2.6). Therefore, as shown in Figure 3, the CP, the CM according to the degree of nrmsf, acts as the genome attractor. Therefore, a change in the CP provokes a genome-wide avalanche over the entire genome expression. Whole-genome expression follows the change in the CP, and this is the origin of coherent gene expression behaviors [26].
Here, it is important to note that CSB does not stem from the law of large numbers under an equilibrium condition, but rather is deduced from the general specific property of an emergent property of an open (non-equilibrium) system.
2.3. ON/OFF State of the CP: Cell Fate-Guiding and Global Avalanches
2.3.1. MCF-7 Breast Cancer Development (Cell-Population Level): ON–OFF State of the CP Revealed
We consider the cell-fate change equivalent to a genome-state change (refer to biological discussions in [7]). The genome-state change occurs in such a way that the initial-state SOC control of whole-genome expression, which is in charge of maintaining a largely invariant global gene-expression profile, is destroyed through the erasure of initial-state sandpile criticality [7,9]. Given that the CP acts as a genome attractor, it is essential to understand how changes in the CP lead to the super-critical genome. To elucidate the timing of the change in the CP, we focus on CM correlation dynamics (see Section 4.2.2), i.e., the CM correlation between the initial and other time points: (k = 1,2, …, K) over the experimental point, tj, where is the unit vector (unit length) of the kth group vector, . CM grouping is used to examine singular behavior at the CP.
In MCF-7 breast cancer cells, activation of the ErbB receptor by HRG and EGF elicits two different biological responses (Figure 4). HRG stimulation induces cell differentiation, whereas EGF stimulation provokes cell proliferation [19,20]. The temporal CM correlation in HRG-stimulated MCF-7 cells reveals a divergent behavior at tj = 15 min (Figure 4A, left panel), whereas EGF-stimulated MCF-7 cells (right panel) do not show any divergent behavior. Both responses exhibit sandpile CPs (Figure 4D). These results suggest that the CP possesses both activated and inactivated states, i.e., ON/OFF expression states for a set of genes (critical gene set) corresponding to the CP. In EGF-stimulated MCF-7 cells, the CP is in the inactivated (OFF mode) state, whereas in HRG-stimulated MCF-7 cells, the CP is ON at 10–15 min and thereafter turns OFF.
Direct evidence of the ON/OFF state of the CP is as follows:
-
(1)
In Figure 4B, for HRG stimulation, the switching of singular behaviors at 15–20 min occurs at the CP. At the boundary of the CP (ln<nrmsf> ~ −2.5), the singular behavior exhibits bimodal behavior in the fold-change on the CM grouping. At 15 min, a dominant positive (i.e., fold-change greater than one) singular behavior (ln<nrmsf> > −2.5) suggests that the swollen coil state of DNA occurs at the CP, while a negative singular behavior (ln<nrmsf> < −2.5) suggests the compact coil state of DNA. At 20 min, this bimodal behavior switches to a dominant negative singular behavior (> −2.5) with a positive singular behavior (< −2.5). With regard to EGF stimulation, such a switching transition does not occur during the early time points. Note: A negative fold-change occurs given that the reference point is the CM of whole-genome expression. Subtraction of the CM(tj) from each gene expression gives negative expression.
-
(2)
In Figure 4C, the probability density function (PDF) of whole- genome expression [27] shows that at 15 min, around the CP region, the maximum probability density occurs in a positive area, whereas it becomes negative at 15–20 min. This validates that the CP is in the ON state at 15 min and in the OFF state at 20 min. The PDF clearly shows that ON–OFF switching of the CP induces a global avalanche in whole- genome expression.
The fold-changes within expression groups <ck(tj+1)>/<ck(tj)> exhibit a clear first-order phase transition involving genome-sized DNA molecules (see more in Section 3.2). Through our studies, it became evident that the transition occurs as coherent behavior (at a mega-bp scale on the chromosome) and emerges from stochastic expression (coherent-stochastic behavior [8]). In other words, the averaging behavior (mean-field) of group expression corresponds to a coherent transition. In contrast, the ensemble average of the fold-change in individual expressions between two temporal groups, <ck(tj+1)/ck(tj)>, does not reveal such characteristics in the transition. Interestingly, the ensemble average of the time difference in the expression group <ck(tj+1) - ck(tj)> supports the coherent scenario. This indicates that fluctuation (noise) of coherent dynamics is eliminated (see attached Supplementary Figure S1).
Regarding the cell-fate change in HRG-stimulated MCF-7 cells, the erasure of initial sandpile criticality occurs after 2 h (Figure 4D). While the cell fate-guiding critical transition occurs at an earlier time point (15–20 min), the cell-fate change happens after 2 h. This time lag is needed to develop a new cell-fate attractor through coordinated local chromatin interaction (see details in [10]).
2.3.2. HL-60 Breast Cancer Cell Development (Cell-Population Level): Timing of Cell-Fate Change
Cell development in HL-60 human leukemia cells further supports the following scenario: (1) the state of the CP changes, such as from an inactivated to an activated state (OFF–ON) or vice versa (ON–OFF), while (2) the switching of singular behaviors occurs and (3) induces a cell fate-guiding global avalanche.
As for the switching behaviors at the CP,
-
(1)
Under atRA stimulation (Figure 5B), the dominant negative fold-change (compact globule state) indicates the OFF state of the CP at 18–24 h, while the dominant positive fold-change (swollen coil state) indicates the ON state at 24–48 h: the change from OFF to ON occurs at the CP. The global avalanche is shown by swelling of the probability density from 18–24 h to 24–48 h and contraction at 48–72 h (Figure 5C).
-
(2)
Under DMSO stimulation (Figure 6B), the dominant positive-to-negative fold-change (the opposite of the case with atRA stimulation) occurs from 8–12 h to 12–18 h, i.e., the change from OFF to ON occurs at the CP. At 18–24 h, the CP is neither ON nor OFF. These points are confirmed by the temporal change in the PDF profile (Figure 6C) around the CP: the fold-change (ln<nrmsf> ~ −2.5) goes from positive to negative to neutral.
Regarding the cell-fate change, the timing of the state-change of the CP through the switching of singular behaviors at the CP coincides with the timing of the erasure of the initial-state sandpile: at 24–48 h for atRA stimulation (Figure 5D) and at 12–18 h for DMSO stimulation (Figure 6D), suggesting that the cell-fate change occurs at 24–48 h for atRA and at 12–18 h for DMSO. The divergent behavior of the temporal CM correlation points to the occurrence of the cell-fate change: the change after the divergent behavior for atRA (Figure 5A) and the change before it for DMSO (Figure 6A).
As for the DMSO response, it is interesting to observe a multi-step process of erasure of initial-state sandpile criticality (Figure 6D): erasure at 8–12 h; recovery of criticality at 12–18 h; and then erasure again at 18–24 h. Multiple erasures suggest that the cell-population response passes over two SOC landscapes [7] at 8–12 h and 18–24 h.
The results obtained in cancer cells suggest that activation-deactivation of the critical gene set (CP) as the genome attractor plays an important role in the cell-fate change.
2.4. Cell-Fate Change in Single-Cell Dynamics
2.4.1. Human Embryo Development: Genome Avalanche Along Low-Expressed Genes
Embryo development starts from a single cell (zygote) and thus corresponds to a completely different situation than in the cases discussed above, where the data refer to averages over millions of cells in a plate (i.e., cell-population level). The CP acts as a genome attractor at the single-cell level and in cell populations (Figure 3). In temporal CM correlation, the CP is a point with no differential (Figure 7A, Figure 8A and Figure 9A), while it appears as a divergent point in cell populations. This feature reveals distinct response domains (critical states) in single-cell genome expression.
In a reprogramming event, the temporal CM correlation for the CP traverses a value of zero (corresponding to random-like behavior) at the 8-cell state for a human embryo cell (Figure 7A). This implies that, after the 8-cell state, the human embryo cell completely erases the initial zygote criticality. Furthermore, this coincides with the erasure of the sandpile-type zygote CP (Figure 7D). In biological terms, this corresponds to erasure of the initial stage of embryogenesis (driven by maternal heredity; see “SOC Control in Human and Mouse Related to the Developmental Oocyte-to-Embryo Transition” in the Discussion [7]).
It is important to note that groups of low-nrmsf presenting flattened CM correlations in time (ln<nrmsf> < −8.0) do not point to a no-response situation; on the contrary, they behave in a highly coherent manner to generate the autonomous SOC mechanism (see, e.g., Figure 6 in [6]).
The switching transition of the CP (Figure 7B) occurs at every cell state change from zygote to blastocyst (only results from the 4-cell state to blastocyst are shown). This suggests that in early embryogenesis, changes in the cell state involve a global change (genome avalanche) in whole-genome expression. Notably, the temporal change in the PDF profile (Figure 7C) shows that a genome avalanche occurs along low-expressed genes (around zero on the y-axis), shown as a traveling density wave (higher to lower nrmsf) from the 8-cell state to the morula state. The reverse traveling wave (lower to higher nrmsf) occurs after the morula state. This traveling wave points to the important role of collective behaviors that emerge in low-expressed genes (refer to local sub-critical state as the generator of the genome engine in Section 3.1). It is intriguing to observe that scaling behaviors (in a log–log plot) emerge, and the most vivid scaling behaviors develop at the 8-cell–morula state.
Therefore, with the results regarding temporal CM correlation, we conclude that a major the cell-fate change (embryonic reprogramming) occurs after the 8-cell state. This coincides with the timing of inverse coherent perturbation of the genome engine (cyclic flux flow), where the genome system passes over the non-equilibrium fixed point after the 8-cell state (the onset of symmetry-breaking occurs; see Section 3.1).
2.4.2. Mouse Embryo Development: Switching Scaling Behaviors in the Cell-Fate Change
In mouse embryo development, the scenario of the cell-fate change described above is confirmed:
-
(1)
Complete erasure of the memory of the zygote CP occurs after the late 2-cell state (Figure 8A), where the switching singular transition occurs in the cell-state change from the middle 2-cell state (2-cell M) to the 8-cell state (Figure 8B), showing a major change in the genome.
-
(2)
This is confirmed by a temporal change in the PDF (Figure 8C): during cell development from the middle 2-cell to 4-cell states through the late 2-cell state (2-cell L), the whole density profile reflects along the axis of zero-fold change. This clearly manifests as the ON state of the CP at the middle 2-cell – late 2-cell state, and as the OFF state at the late 2-cell – 8-cell state. There are also three distinct scaling behaviors, as in human embryo development. Notably, during the middle 2-cell to 4-cell state through the late 2-cell state, whole-genome expression shifts from up- to downregulation, i.e., marking the occurrence of a global avalanche, and furthermore, there is a reflection of the scaling behavior in the local sub-critical state (ln(nrmsf) < −11) along the zero-fold change axis (y = 0). This reflection illustrates that a coherent switching of folding and unfolding chromatin dynamics occurs, which supports the important role in the local sub-critical state in early embryo development [8].
-
(3)
The timing of the cell-fate change is further confirmed by the erasure of zygote sandpile criticality after the late 2-cell state.
2.4.3. Differentiation of Th17 Immune Cells from Th0 Cells: Partial Avalanche Guides the Cell-Fate Change
The cell-fate change in Th17 immune single-cell development again follows the same general scenario. Compared with embryo development, there are several distinct differences worth noting:
-
(1)
The CP does not pass over zero temporal correlation with the initial state (t = 0 h; Figure 9A), although the correlation response increases in time, which indicates that cell differentiation induces a partial-scale (specific set) change in whole-genome expression (vs. whole genome-scale change in embryonic reprogramming).
-
(2)
The switching singular transition at the CP (genome attractor) occurs sequentially from 3 h to 12 h (Figure 9B). This manifests as the OFF state of the CP at 6–9 h and as the ON state at 9–12 h. Figure 9C illustrates that the change in expression for switching is still on a large scale, which is confirmed by the temporal change in the PDF of whole-genome expression. However, there is no distinct scaling behavior as in embryo development.
The timing of the cell-fate change from Th0 to Th17 cells, as in other biological regulations, is determined by the erasure of initial-state sandpile criticality (Figure 9D). This appears at 6 h, where inversion of the singular behaviors of the CP takes place, which is further confirmed by the timing of the inversion of perturbation of the genome engine (see Section 3.1). The OFF–ON switching at the CP guides the cell-fate change.
2.5. Systematic Determination of Local Critical States in Genome Expression
We demonstrate that the CP is a fixed point relative to a given biological regulation. Next, based on this fact, we show that critical states in genome expression can be determined systematically for both single-cell and cell population genome expression:
-
(1)
Single-cell level: Temporal CM correlation (Figure 7A, Figure 8A and Figure 9A) manifests distinct response domains according to nrmsf: low-variance expression (sub-critical state) for the region of flattened correlation, intermediate variance for the near-critical state from the edge of flattened correlation to the CP, and high-variance expression for the super-critical state above the CP (Table 1).
-
(2)
Cell-population level: the Euclidean distance (from the highest nrmsf group) between the temporal responses of CM grouping (Figure 2: t = 0 vs. other experimental time points) reveals critical states (Figure 10; summary: Table 2), where the CP exists at the boundary between near-and sub-critical states (vs. between super-and sub-critical states in a single cell). This occurs for both MCF-7 and HL-60 cancer cell populations. In our previous studies on microarray data (cell-population level), critical states were determined by a change in expression profile by means of Sarle’s bimodality coefficient (Figure 10A; see Figure 9 for MCF-7 and HL-60 cells in [7]). This was accomplished by adding evidence of distinct response domains (super-, near- and sub-critical domains according to nrmsf).
Table 1.
Mouse Embryo Development | Human Embryo Development | Th17 Cell Differentiation |
---|---|---|
Critical States: | Critical States: | Critical States: |
Super-critical: 3583 RNAs −5.6 * <ln<nrmsf> |
Super-critical: 2607 RNAs −5.9 * <ln<nrmsf> |
Super-critical: 3064 RNAs −4.4 * <ln<nrmsf> |
Near-critical: 5922 RNAs −7.5< ln<nrmsf> < −5.6 |
Near-critical: 6707 RNAs −8.0 < ln<nrmsf> < −5.9 |
Near-critical: 4702 RNAs −6.0< ln<nrmsf> < −4.4 |
Sub-critical: 7636 RNAs ln<nrmsf> < −7.5 |
Sub-critical: 7345 RNAs ln<nrmsf> < −8.0 |
Sub-critical: 5368 RNAs ln<nrmsf> < −6.0 |
Table 2.
HRG-MCF-7 Cells | EGF-MCF-7 Cells | atRA-HL-60 Cells | DMSO-HL-60 Cells |
---|---|---|---|
Critical States: | Critical States: | Critical States: | Critical States: |
Super-critical: 4756 RNAs −2.0 <ln<nrmsf> |
Super-critical: 4210 RNAs −2.1 <ln<nrmsf> |
Super-critical: 2187 RNAs −2.1 <ln<nrmsf> |
Super-critical: 1721 RNAs −2.0 <ln<nrmsf> |
Near-critical: 7649 RNAs −2.5 * < ln<nrmsf> <−2.0 |
Near-critical: 8667 RNAs −2.6 * < ln<nrmsf> < −2.1 |
Near-critical: 2935 RNAs −2.6 * < ln<nrmsf> < −2.1 |
Near-critical: 2663 RNAs −2.5 * < ln<nrmsf> <−2.0 |
Sub-critical: 9872 RNAs ln<nrmsf> < −2.5 |
Sub-critical: 9400 RNAs ln<nrmsf> < −2.6 |
Sub-critical: 7503 RNAs ln<nrmsf> < −2.6 |
Sub-critical: 8241 RNAs ln<nrmsf> < −2.5 |
2.6. Genome Engine Mechanism for SOC-Control of Genome Expression: Heteroclinic Critical-State Attractor System
The existence of distinct critical states at both the single-cell and cell-population levels (Figure 7, Figure 8, Figure 9 and Figure 10) is further demonstrated through the different convergence behaviors of stochastic expression, where the center of mass (CM) of a randomly selected expression of a critical state converges to the CM of the critical state (Figure 11). This further explains the co-existence of distinct local attractors within the genome attractor (Figure 3), i.e., the formation of a heteroclinic critical-state attractor system. The co-existence of distinct local critical states points to the existence of an autonomous critical-control genomic system, i.e., the genome engine. Distinct coherent dynamics in a local critical state emerge from stochastic expression (refer to Figure 10 in [7]) due to the critical-state attractor self-organizing the dynamics of stochastic expression within the local critical state.
Thus, dynamic expression flux analysis (Section 4.2.4) for the CM of critical states can be applied to reveal the genome engine mechanism for describing how autonomous SOC control of genome expression arises. Figure 12 shows that the sub-critical state acts as an internal source of expression flux and the super-critical state acts as a sink. The cyclic flux forms a dominant flux flow that generates a strong coupling between the super- and sub-critical states accompanied by their anti-phase expression dynamics (Figure 13). This results in a change in oscillatory feedback to sustain autonomous SOC control of overall gene expression. The average between-state flux (Figure 12) represents a stable manifold for the thermodynamically open system. Formation of the dominant cyclic flux provides a genome engine metaphor for SOC control mechanisms pointing to a universal mechanism in the gene-expression regulation of mammalian cells for both the population and single-cell levels. As demonstrated in Section 3.1, global perturbation stems from the change in CP, which either enhances or suppresses the genome engine, and thus transforms the cell-fate change.
3. Discussion and Conclusion
3.1. Genomic Dynamics Determining the Cell-Fate Change from Embryo to Cancer: Cell-Fate Change Passing through a Non-Equilibrium Fixed Point
The different examples of cell-fate determination (cell differentiation and reprogramming in embryo development) have highlighted two crucial common elements: (1) the critical point (CP) acts as the organizing center of the cell fate, and (2) a sort of genome engine (Figure 12) fuels the transition and constitutes the basic mechanism of the cell-fate change. This genome engine works by canalizing the stochastic expression of local critical states into coherent behavior, provoking the motion of the center of mass (CM) of genome expression (i.e., change in the genome attractor). This is exactly what happens when an artificial engine transforms combustion (or any other energy source) into motion.
The CM acts as an attractor at both the whole-genome (Figure 2) and critical-state levels (Figure 11). Based on this fact, the expression flux (effective force) approach (Section 4.2.4) was developed to reveal dynamic interaction flux between critical-state attractors. Figure 13 demonstrates that the expression flux approach is instrumental for revealing a heteroclinic critical transition at the occurrence of a global avalanche that guides the coherent behaviors that emerge in the critical states. Interaction flux of between-state attractors is the underlying basic mechanism of epigenetic self-regulation, incorporating a rich variety of transcriptional factors and non-coding RNA regulation to determine the coherent oscillatory behaviors of critical states.
The flux dynamics approach has been further developed to analyze the quantitative evaluation of the degree of non-harmonicity and time-reversal symmetry-breaking in nonlinear coupled oscillator systems [28].
Thus, we summarize how and when the cell-fate change occurs as follows:
(1) How the cell-fate change occurs:
The intersection of interaction fluxes (Figure 14 for cell population and Figure 15 for single cell; see the dashed ovals in interaction expression flux) occurs just before a cell-fate change. This intersection means that, thermodynamically, the genomic system lies at a non-equilibrium fixed point (e.g., see Figure 14C: atRA stimulation at 24 h with small interaction fluxes; see also Section 4.2.4), which suggests that before the cell-fate change, time-reversal symmetry is broken through passing over a non-equilibrium fixed condition. In terms of the genome engine, around the cell-fate change, a global perturbation induces either enhancement or suppression of the genome engine, where there is a dominant cyclic flux flow between the super- and sub-critical states (Table 3: single cell; Table 4: cell population). In HL-60 cells (cell population), the genome engine is enhanced before the cell-fate change and suppressed (enhancement-suppression) thereafter. On the contrary, a reverse process of suppression–enhancement takes place in MCF-7 cancer cells (Figure 14A; see more in [7]). In single-cell cases (embryo development and Th17 immune cells), a suppression–enhancement of the genome engine occurs (Figure 15). The varying sequences of perturbation of the genome engine may stem from different stages of the suppressive pressure on cell differentiation against cell proliferation.
Table 3.
Biological Regulation | Location of the CP | State Change * in the CP | Coherent Perturbation of Genome Engine | Reprogramming **/Cell-Fate Cange |
---|---|---|---|---|
Mouse Embryo Development | ln<nrmsf>CP ~ −5.6 | ON → OFF: At late 2-cell - 4-cell | Suppression- Enhancement | After late 2-cell ** |
Human Embryo Development | ln<nrmsf>CP ~ −5.9 | Singular Switching: At 8-cell -Morula | Suppression- Enhancement | After 8-cell ** |
Th17 Cell Immune
Differentiation |
ln<nrmsf>CP ~ −4.4 | OFF → ON: At 6–9 h | Suppression- Enhancement | After 6 h |
Table 4.
Biological Regulation | Location of the CP | State Change * in the CP | Coherent Perturbation of Genome Engine | Cell-Fate Change |
---|---|---|---|---|
Cell Proliferation: EFG-Stimulated MCF-7 Cells | ln<nrmsf>CP ~ −2.6 | OFF State | Local Perturbation | NO |
Cell Differentiation: HRG-Stimulated MCF-7 Cells | ln<nrmsf>CP ~ −2.5 | OFF → ON: At 15–20 min | Suppression- Enhancement | After 2 h ** |
Cell Differentiation: atRA-Stimulated HL-60 Cells | ln<nrmsf>CP ~ −2.6 | OFF → ON: At 24–48 h | Enhancement- Suppression | After 24 h |
Cell Differentiation: DMSO-Stimulated HL-60 Cells | ln<nrmsf>CP ~ −2.5 | ON → OFF: At 12–18 h | Enhancement- Suppression | After 18 h |
(2) When the cell-fate change occurs:
Cancer cell differentiation is accompanied by a change from a compact globule (OFF) to a swollen coil (ON) or vice versa at the CP (see Figure 4, Figure 5 and Figure 6). As shown in Figure 5C and Figure 6C (HL-60 cells), a global genome avalanche occurs at the timing of the cell-fate change. For MCF-7 cells (HRG stimulation), the cell-fate change (after 2 h: Figure 4D) occurs after the genome avalanche (see details of the attractor mechanism in [10]). The global impact provoked by the CP is further supported by the fact that (1) EGF-stimulated MCF-7 cells, where the CP is OFF with no cell differentiation, induce only local perturbation (Figure 16C) and (2) HL-60 cells coincide with the timing of global perturbation (Figure 16D).
Regarding the cell-fate change in single-cell dynamics, a global perturbation (Figure 16A,B) occurs for the genome engine (after the late 2-cell and 8-cell stages and after 6 h for mouse, human embryo, and Th17 cells, respectively), where the timing coincides with that of the cell-fate change (i.e., erasure of the initial-state CP memory). This suggests that the cell fate-guiding change in the CP corresponds to the erasure of initial-state sandpile criticality. For embryo reprogramming, this picture is supported by the fact that temporal CM correlation of the CP from the initial state (zygote) passes zero-correlation (i.e., erasure of the initial-state CP memory: Figure 7A and Figure 8A). Due to the genome attractor, the change in the CP (the edge of criticality) has a global impact on the entire genome expression through perturbation of the genome engine: global perturbation leads to a genome avalanche at the timing of the cell-fate change (Figure 7C, Figure 8C and Figure 9C; also refer to Figure 13 and Biological Discussion in [7]).
Therefore, elucidation of the activation-deactivation mechanism for the CP (involving more than hundreds of genes) through a coordinating change in chromatin structure [10] uncovers how the genome engine is either enhanced or suppressed around the timing of the cell-fate change. Furthermore, our SOC hypothesis is expected to predict when and how the cell-fate change occurs in different situations (e.g., cancer, induced pluripotent stem cells (iPS cells), etc.). The universality of the proposed model does not stem from our experimental results, but rather from the existence of very basic physical constraints (e.g., chromatin dimension and general organization, and phase transition phenomenology) independent of biological specificities.
3.2. Synergetic Behaviors of Mega Base Pair-Size DNAs Through a Phase Transition
Up to this point, we have focused on the genomic dynamics that convert the signatures of the cell-fate change into self-organized complexity. Here, we should recall that our first objective was to recognize the unfeasibility of precise gene-by-gene regulation in the presence of the extremely compact folding of a 2-meter-long molecule that is compressed within a few microns of space. This calls for a search for a structural counterpart of the above-discussed model. In this respect, it is worth noting that the CP shows a clear bimodal expression distribution reminiscent of the swollen coil (ON) and compact globule (OFF) DNA transition, corresponding to inversion of the intra-chain phase segregation of the coil and globule states. This behavior closely resembles the intrinsic characteristics of the first-order phase transition inherent in genome-sized DNA molecules. The activation-deactivation of the CP is essential for the occurrence of the cell-fate change.
As for the phase transition of mega base pair-size DNA, until the late 20th century, it was believed that single polymer chains, including DNA molecules, always exhibited a cooperative but mild transition between the elongated coil and compact globule states, which was considered neither a first-order nor a second-order phase transition [30,31]. More recently, it has become evident that long DNA molecules above the size of several tens of kilo base pairs exhibit characteristics that help them undergo a large discrete transition, i.e., first-order properties related to the coil–globule transition [32,33,34]. Such first-order characteristics are rather general for semi-flexible polymers, especially polyelectrolyte chains such as giant DNA molecules. Additionally, it is important to note that insufficient charge neutralization in the globule state causes instability and leads to the generation of intra-chain segregation of individual single genomic DNA [35,36] (see, e.g., Figure 8B). When such instability is enhanced with long-range Coulombic interaction, the characteristic correlation length tends to become shorter, corresponding to generation of the critical state in the transition. Such transitional behavior that accompanies the folding transition of DNA is also observed for reconstructed chromatin [37,38,39].
Thus, the state change in the CP is suggested to guide the synergistic effect on mega base pair-size DNAs through the phase transition in the cell-fate change. Our recent study based on non-linear signal analysis techniques (recurrence quantification analysis and principal component analysis) revealed that chromatin remodeling played a role as the material basis of SOC-controlled genome expression [10]. A genome-wide expression avalanche stems from the coordinated activity of positional-based (not biological-functional) local chromatin interaction. The cell fate-guiding critical transition occurs based on the non-linear resonance of two major expression fluctuation modes, which establish an autonomous genome engine through activation of the CP. Elucidation of the link between the spatial position on the chromosome and co-regulation together with the identification of specific locations on the genome devoted to the generalization of perturbation stimuli provides a molecular basis for the self-organization dynamics of genome expression and the cell-fate decision.
3.3. CP Potentially Acting as the Center of Genome Computing
The singular behavior at the CP exhibits bimodal folding and unfolding chromatin dynamics through synergistic or cooperative behavior in higher-order structural transition of genomic DNA, which suggests the existence of a complex network of mega-sized ON/OFF DNA phase transitions in the cell-fate change. This interpretation was recently confirmed by the evidence of a massive re-arrangement of peri-centromeric nuclear regions correspondent to gene expression transitions [40]. The necessary link between chromatin remodeling and gene expression regulation was experimentally assessed [41]. These further imply that the CP may act as the center of genome computing, where chromatin remodeling is the material basis of such computation.
The time-dependent behavior of the genomic DNA transition follows a kinetic equation to exhibit cubic nonlinearity. A simultaneous change in the translational and conformational entropy of giant DNA together with surrounding counter ions causes bimodality in the free energy, which in turn speeds the transition under cubic nonlinearity in the kinetic equation [42,43]. The incorporation of negative feedback in cubic nonlinearity leads to fundamental characteristic nerve firings. Therefore, the occurrence of SOC in the brain’s complex neural networks suggests that the state change in the CP is a computing process through coordinated DNA structural transitions to guide the cell-fate change. Elucidation of the computing process at the CP could reveal the control mechanism for the cell-fate change in a desirable manner, i.e., the existence of “genome intelligence”. Here, note that cubic nonlinearity in the time-differentiation equation corresponds to a kinetic representation of the bimodal free energy, and criticality is generally interpreted based on the symmetry argument under this type of energy landscape [44].
3.4. Biological Meaning of the CP
As noted above, we investigated a new interpretation of the specificity of biological regulation without relying on an improbable “Maxwell’s demon” ability to select the “right genes” interspersed in a two-meter long molecule constrained within the few cubic microns of the cell nucleus. The genes exert specific biological functions and an analysis of such functions could give us further hints regarding the mechanism of the cell-fate change. The strict link between chromatin remodeling and the phase transition is in fact mirrored by the gene composition of the super-critical (genome) state. Table S1 based on principal component analysis shows the 200 genes that are most strongly affected by the second principal component (PC2) of gene expression variance in HRG-treated MCF7 cells, as explained in [10]. PC2 corresponds to the super-critical set referring to the component with genes that show higher temporal variation. The continuous fluctuation in time of this component (orthogonal by construction to the first principal component (PC1), which corresponds to the equilibrium position, i.e., the attractor) allows for both continuous adaptation to the changing micro-environment (small avalanches) and fueling of the state change (phase transition). The biological role of these genes (endowed with statistically significant z-scores on PC2 and thus being the “top players” in the supercritical state, see Table S1) confirms that chromatin remodeling is the basic structural driver of the genome engine. The bolded genes in Table S1 are all consistent with structural chromatin remodeling obtained by both histone modifications and DNA breaks that allow for global nuclear re-organization. This nuclear re-organization requires a cross-talk with the perinuclear environment (note the prevalence of actin, myosin, cadherin and integrin genes) and activates known signaling hubs such as proto-oncogenes c-FOS and c-Myc, Mitogen-activated protein (MAP) kinase, and transformation-related protein 53 (TP53) (Table S1). This is a stringent, albeit indirect, proof of the proposed model in terms of mainstream biology.
3.5. Final Remarks
We highlighted the basic invariance of the model in systems from embryo development to cancer. The different model cases are considered as examples of a general mechanism, where the need for a relevant time course to be analyzed is the only inclusion criterion, together with the presence of a relevant perturbation of the studied system. We can anticipate that other kinds of cells will behave according to the SOC model with differences linked to the presence (or absence) of a stable phenotype change during the time course considered. As aptly stated in [45], criticality is a general feature of biological regulation.
Finally, we summarize our findings in the following points:
-
(1)
The critical point (CP) acts as the center of the cell fate.
-
(2)
How the cell-fate change occurs: Before the cell-fate change, the genomic system passes over the stable point (non-equilibrium fixed point) of the thermodynamically open system. The genome engine, an emergent dynamic property of an autonomous interaction flux between local critical states (distinct expression domains in whole-genome expression), is enhanced or suppressed to induce coherent perturbation of the dominant cyclic flux between local super- and sub-critical states.
-
(3)
When the cell-fate change occurs: The change occurs at the timing of the erasure of the initial-state sandpile CP. At this time, a global genome avalanche occurs, except in HRG-stimulated MCF-7 cell differentiation, with a time lag between the genome avalanche and the cell-fate change (for further details of this mechanism, see [10]).
The genome engine suggests that the activation-deactivation mechanism of the CP elucidates how global perturbation occurs on self-organization through a change in signaling due to external or internal stimuli. Coordinated chromatin dynamics emerge to guide reprogramming through SOC control. A recent study found that the dynamics of the high-order structure of chromatin exhibit liquid-like behavior [46], which could be a crucial characteristic that enables the genome to exhibit SOC gene expression control for determination of the cell fate.
Further studies on these matters are needed to clarify the underlying fundamental molecular mechanism(s). The development of a theoretical foundation for the autonomous critical control mechanism in genome expression as revealed in our findings is expected to lead to new insights regarding for a general control mechanism for determination of the cell fate and genome computing.
For now, we can safely affirm that the strong interactions among genes with very different expression variance and physiological roles push for a complete reshaping of the current molecular-reductionist view of biological regulation focused on a single “significantly affected” gene to explain these regulation processes. The view that the genome acts as an integrated dynamic system is here to stay.
4. Materials and Methods
4.1. Biological Data Sets
We analyzed mammalian transcriptome experimental data for seven distinct the cell-fates in different tissues.
4.1.1. Cell population:
1. Microarray data of the activation of ErbB receptor ligands in human breast cancer MCF-7 cells by EGF and HRG; Gene Expression Omnibus (GEO) ID: GSE13009 (N = 22,277 mRNAs; experimental details in [29]) at 18 time points: t1 = 0, t2 = 10, 15, 20, 30, 45, 60, 90 min, 2, 3, 4, 6, 8, 12, 24, 36, 48 h, tT = 18 = 72 h.
2. Microarray data of the induction of terminal differentiation in human leukemia HL-60 cells by DMSO and atRA; GEO ID: GSE14500 (N = 12,625 mRNAs; details in [21]) at 13 time points: t1 = 0, t2 = 2, 4, 8, 12, 18, 24, 48, 72, 96, 120, 144 h, tT=13 = 168 h.
4.1.2. Single-cell:
3. RNA-Seq data of early embryonic development in human and mouse developmental stages in RPKM (reads per kilobase of transcript, per million mapped reads) values; GEO ID: GSE36552 (human: N = 20,286 RNAs) and GSE45719 (mouse: N = 22,957 RNAs) with experimental details in [24] and [23], respectively.
We analyzed 7 human and 10 mouse embryonic developmental stages as listed below:
Human: oocyte (m = 3), zygote (m = 3), 2-cell (m = 6), 4-cell (m = 12), 8-cell (m = 20), morula (m = 16), and blastocyst (m = 30);
Mouse: zygote (m = 4), early 2-cell (m = 8), middle 2-cell (m = 12), late 2-cell (m = 10), 4-cell (m = 14), 8-cell (m = 28), morula (m = 50), early blastocyst (m = 43), middle blastocyst (m = 60), and late blastocyst (m = 30), where m is the total number of single cells.
4. RNA-Seq data of T helper 17 (Th17) cell differentiation from mouse naive CD4+ T cells in FPKM (fragments per kilobase of transcript, per million mapped reads) values, where Th17 cells were cultured with anti-IL-4, anti-IFNγ, IL-6 and TGF-β, (details in [22]; GEO ID: GSE40918 (mouse: N = 22,281 RNAs) at 9 time points: t1 = 0, t2 = 1, 3, 6, 9, 12, 16, 24 h, tT=9 = 48 h. For each time point, the reference sample numbers are listed: GSM1004869-SL2653 (t = 0 h); GSM1004941-SL1851 (t = 1 h); GSM1004943-SL1852 (t = 3 h); GSM1005002-SL1853 (t = 6 h); GSM1005003-SL1854 (t = 9 h); GSM1004934-SL1855 (t = 12 h); GSM1004935,6,7-SL1856, SL8353, SL8355 (t = 16 h; average of three data points); GSM1004942-SL1857 (t = 24 h); GSM1004960-SL1858 (t = 48 h).
The colors used in the various plots throughout this report are based on the experimental events and were assigned as follows: black as the initial event, purple as the second event, and subsequent events as blue, dark cyan, dark green, dark yellow, brown, orange, red, dark pink, and pink.
For microarray data, the robust multichip average (RMA) was used to normalize expression data for further background adjustment and to reduce false positives [47,48,49].
For RNA-Seq data, RNAs with RPKM and FPKM values of zero over all of the cell states and experimental time points were excluded. In the analysis of sandpile criticality, random real numbers in the interval [0, a] generated from a uniform distribution were added to all expression values (only in Figure 7D, Figure 8D and Figure 9D). This procedure avoids the divergence of zero values in the logarithm. The robust sandpile-type criticality through the grouping of expression was checked by changing a positive constant: a (0 < a < 10); we set a = 0.001. Note: The addition of large random noise (a >> 10) destroys the sandpile CP.
4.2. Methods
4.2.1. Normalized Root Mean Square Fluctuation (nrmsf)
Nrmsf (see also Methods in [6]) is defined by dividing rmsf (root mean square fluctuation) by the overall maximum {rmsfi} (Equation (1)):
(1) |
where rmsfi is the rmsf value of the ith RNA expression, which is expressed as εi(sj) at a specific cell state sj or experimental time (e.g., in mouse embryo development, S = 10: s1 = zygote, early 2-cell, middle 2-cell, late 2-cell, 4-cell, 8-cell, morula, early blastocyst, middle blastocyst, and s10 = late blastocyst), and is its expression average over the number of cell states. Note: Nrmsf is a time-independent variable, where nrmsf acts as an order parameter of self-organization [6,7].
4.2.2. CM Correlation Analysis
To investigate transition dynamics, the correlation metrics based on the center of mass (CM) grouping, the CM correlation, was built upon the following basic statistical formalization:
(1) CM grouping: genome expression is considered as an N-dimensional vector, where the average value (CM) of whole-genome expression at t = tj is subtracted from each expression (refer to Figure 1). Next, the whole-genome expression is sorted and grouped according to the degree of nrmsf, where CM grouping has K groups and within each group there are n number of expressions (N = K.n): N-dimensional CM grouping vector,; and are the highest and lowest group vectors of nrmsf, respectively. Here, the unit vector of the kth vector is defined as . Note that the elements less than n in the last group (the lowest nrmsf) have been removed from the analysis.
(2) If we keep in mind that correlation corresponds to the cosine of the angle between unit vectors, i.e., the inner product of unit vectors (, ; : dot product (scalar) of unit vectors:,), two different CM correlations can be considered:
(i) Spatial CM correlation: for a given time point (), the development of CM correlation between the first group (highest nrmsf group) and other vectors: ; (k = 2,3,…K).
(ii) Temporal CM correlation: for a given group (k), the development of CM correlation between the initial and other experimental points: (k = 1,2,3,…K) over experimental time points, tj (see Biological Data Sets).
4.2.3. Probability Density Function (PDF)
The robustness of gene expression clustering by means of the density analysis of noisy gene-expression profiles was demonstrated by Shu et al. [24]. The probability density function (PDF) based on the Gaussian kernel is examined here. We consider an N-dimensional whole-genome expression vector for the natural logarithm (natural log) of expression and its fold-change vector, where the natural log of the whole expression vector is defined as ln(ε(tj)) = (ln(ε1(tj)), ln(ε2(tj)),…, ln(εN(tj))) with the ith expression, εi(tj) at t = tj (Figure 1) and its fold-change vector is defined as ln(ε(tj)/ε(tk)) = (ln(ε1(tj)/ε1(tk)), ln(ε2(tj)/ε2(tk)),…, ln(εN(tj)/εN(tk)). This satisfies the logarithm of a quotient: ln(ε(tj)/ε(tk)) = ln(ε(tj)) − ln(ε(tk)).
Next, we consider these vectors with regard to their CM (called the CM expression vector), the CM natural log of the whole-genome expression vector and its fold-change vector. The CM natural log of the whole-genome expression vector is defined as ln(c(tj)) = ln(ε(tj)) − CM(ln(ε(tj))) and CM(ln(ε(tj))) = and its fold-change vector is ln(c(tj)/c(tk)) = ln(ε(tj)/ε(tk)) − CM(ln(ε(tj)/ε(tk))). This vector representation again satisfies the logarithm of a quotient: ln(c(tj)/c(tk)) = ln(c(tj)) − ln(c(tk)).
From embryo to cancer development, global avalanches in changes in the CP are captured, and furthermore, this presents scaling behaviors in addition to the traveling expression wave in human and mouse embryo development (RNA-Seq data) (see Figure 7C and Figure 8C), which suggests coordinated DNA folding-unfolding dynamics.
4.2.4. Expression Flux Analysis: Expression Flux Between Critical-State Attractors
We developed an expression flux analysis to describe the genome engine mechanism for both single-cell and population genome expression (see Figure 12). The key fact is that the dynamics of coherent behavior emerging from stochastic expression in distinct critical states (coherent-stochastic behavior: CSB) follow the dynamics of the CM of a critical state. This convergence shows that the CM of a critical state acts as a critical-state attractor for stochastic expression within the critical state (Figure 11). We developed the expression flux approach to reveal the dynamic interaction flux between critical-state attractors [6,7,8].
The CSB in a critical state corresponds to the scalar dynamics of its CM. The numerical value of a specific critical state (i.e., super-, near- or sub-critical state) is represented by X(sj) at a specific experimental event (sj), where an experimental event (sj) corresponds to a cell state or an experimental time point. The expression flux between critical states is interpreted as a non-equilibrium system and evaluated in terms of a dynamic network of effective forces, where interaction flux is driven by effective forces between different critical states and can be described by a second-order time difference. It is important to note that the oscillatory phenomenon is interpreted using a second-order difference equation with a single variable. This is equivalent to the inhibitor-activator dynamics given by a couple of first-order difference equations with two variables.
The flux dynamics approach has been further developed to analyze the quantitative evaluation of the degree of non-harmonicity and time-reversal symmetry-breaking in nonlinear coupled oscillator systems [28].
The basic formulas of expression flux dynamics are given as follows:
Net self-flux of a critical state: The net self-flux, the difference between IN flux and OUT flux, describes the effective force on a critical state. This net self-flux represents the difference between the positive sign for incoming force (net IN self-flux) and the negative sign for outgoing force (net OUT self-flux); the CM from its average over all cell states represents up- (or down-) regulated expression for the corresponding net IN (or OUT) flux.
The effective force is a combination of incoming flux from the past to the present and outgoing flux from the present to the future cell states (Equation (2)):
(2) |
where ΔP is the change in momentum with a unit mass (i.e., the impulse: FΔs = ΔP) and the natural log of the average (<…>) of a critical state, with the ith expression at the jth experimental event, s = sj (NC = the number of RNAs in a critical state; refer to Table 1 and Table 2); the average of net self-flux over the number of critical states, <f(X)> = <INflux> − <OUTflux>.
Here, scaling and critical behaviors occur in log–log plots of group expression, where the natural log of an average value associated with group expression such as ln<nrmsf> or ln<expression> is taken. Thus, in defining expression flux, the natural log of the average expression (CM) of a critical state is considered.
It is important to note that each embryo cell state is considered as a statistical event (a statistical event does not necessarily coincide with a biological event) and its development is considered as a time arrow (time-development) when evaluating the average of group expression: the fold-change in expression and temporal expression variance (nrmsf). This implies that an interval in the dynamic system (Equation (2)) is evaluated as the difference in events, i.e., Δsj = sj − sj−1= 1 and Δs = sj+1 − sj−1 = 2 in embryo development, as well as the difference in experimental times such as in cell differentiation (the actual time difference can be considered as scaling in time). We then evaluate a force-like action in expression flux.
The interaction flux of a critical state: The interaction flux, representing the flux of a critical state X(sj) with respect to another critical state (super-, near-, sub-) or the environment (E: milieu), Yj, can be defined as (Equation (3)):
(3) |
where, again, the first and second terms represent IN flux and OUT flux, respectively, and the net value (i.e., IN flux − OUT flux) represents incoming (IN) interaction flux from Y for a positive sign and outgoing (OUT) interaction flux to Y for a negative sign. Y represents the numerical value of either a specific critical state or the environment, where a state represented by Y is different from one represented by X.
With regard to the global perturbation event, the net kinetic energy flux [7] clearly reveals the timing of the global perturbation event (Figure 16) (Equation (4)):
(4) |
where the kinetic energy of the CM for the critical state with unit mass at s = sj is defined as 1/2.2 with average velocity: .
Net self-flux as summation of interaction fluxes: Due to the law of force, the net self-flux of a critical state is the sum of the interaction fluxes with other critical states and the environment (Equation (5)):
(5) |
where state with X, and M is the number of internal interactions (M = 2), i.e., for a given critical state, there are two internal interactions with other critical states. Equation (5) tells us that the sign of the difference between the net self-flux and the overall contribution from internal critical states, reveals incoming flux (positive) from the environment to a critical state or outgoing flux (negative) from a critical state to the environment. When the difference in all critical states is zero, the genome system itself lies in a stable point (non-equilibrium fixed point) of a thermodynamically open system (no average flux flow from the environment). Average between-state flux (Figure 12) represents a stable manifold of the thermodynamically open system.
Order of averaging: vs.
Here, we need to address the previous result of expression flux dynamics in mouse single-cell genome expression [8], where the expression of a critical state was taken as , which has a different order of operations: first take the natural log of the expression and then calculate the average. Hence, in flux dynamics, we examine whether or not the mathematical operations of averaging and the natural log, i.e., between and , can be exchanged (mathematically commuted). In microarray data, flux behaviors do not change much between different orders of actions, whereas in RNA-Seq data, they are not commuted because its data structure has many zero values. While the addition of small random noise to the log of expression, (used for the previous result [8], a noise-sensitive case) has a good effect, the addition of such noise to ), used for noise-insensitive cases (such as in this report), does not have a good effect. Therefore, due to this sensitivity, micro-array data may be better for expression flux analysis (see Figure 13). Although the detailed dynamics of interaction flux changes by ordering actions differently for RNA-Seq data (e.g., Figure 6 in [8]), two important characteristics in the genome engine, the formation of dominant cyclic flux between super- and sub-critical states and the generator role of the sub-critical state, do not change (invariant features). Thus, we conclude that the concept of the genome engine is quite robust.
Acknowledgments
M.T. sincerely thanks the following institution and individuals who helped complete this research project: the SEIKO Life Science Laboratory, Osaka, Japan, his family (particularly, his daughters, Kimiko and Kazumi Tsuchiya, and Harry Taylor with any editing), Andrzej Kasperski for fruitful discussions through the proof-reading and Jekaterina Erenpreisa for biological discussions.
Abbreviations
atRA CM |
All-trans retinoic acid Center of mass |
CP | Critical point |
CSB | Coherent-stochastic behavior |
EGF | Epidermal growth factor |
KEF | Kinetic energy flux |
DMSO | Dimethyl sulfoxide |
HRG | Heregulin |
iPS cell | Induced pluripotent stem cell |
Nrmsf | Normalized root mean square fluctuation |
PC1, PC2 | The first principal component, the second principal component |
Probability density function | |
SOC | Self-organized criticality |
Th0 | Naïve T helper |
Th17 | T helper 17 |
Supplementary Materials
Supplementary Materials can be found at https://www.mdpi.com/1422-0067/21/13/4581/s1.
Author Contributions
M.T. initiated the project; M.T., A.G. and K.Y. designed the study; M.T. developed the study and analyzed data; A.G. and K.Y. provided theoretical support; M.T., A.G. and K.Y. wrote the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by JSPS KAKENHI Grant Number JP20H01877.
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
References
- 1.Takahashi K., Yamanaka S. A decade of transcription factor-mediated reprogramming to pluripotency. Nat. Rev. Mol. Cell Biol. 2016;17:183–193. doi: 10.1038/nrm.2016.8. [DOI] [PubMed] [Google Scholar]
- 2.MacArthur B.D., Ma’ayan A., Lemischka I.R. Systems biology of stem the cell-fate and cellular reprogramming. Nat. Rev. Mol. Cell Biol. 2009;10:672–681. doi: 10.1038/nrm2766. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Raser J.M., O’Shea E.K. Noise in gene expression: Origins, consequences, and control. Science. 2005;309:2010–2013. doi: 10.1126/science.1105891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Yoshikawa K. Field hypothesis on the self-regulation of gene expression. J. Biol. Phys. 2002;28:701–712. doi: 10.1023/A:1021251125101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Tsuchiya M., Hashimoto M., Takenaka Y., Motoike I.N., Yoshikawa K. Global genetic response in a cancer cell: Self-organized coherent expression dynamics. PLoS ONE. 2014;9:e97411. doi: 10.1371/journal.pone.0097411. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Tsuchiya M., Giuliani A., Hashimoto M., Erenpreisa J., Yoshikawa K. Emergent Self-Organized Criticality in gene expression dynamics: Temporal development of global phase transition revealed in a cancer cell line. PLoS ONE. 2015;11:e0128565. doi: 10.1371/journal.pone.0128565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tsuchiya M., Giuliani A., Hashimoto M., Erenpreisa J., Yoshikawa K. Self-organizing global gene expression regulated through criticality: Mechanism of the cell-fate change. PLoS ONE. 2016;11:e0167912. doi: 10.1371/journal.pone.0167912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Tsuchiya M., Giuliani A., Yoshikawa K. Single-Cell Reprogramming in Mouse Embryo Development through a Critical Transition State. Entropy. 2017;19:584. doi: 10.3390/e19110584. [DOI] [Google Scholar]
- 9.Giuliani A., Tsuchiya M., Yoshikawa K. Self-Organization of Genome Expression from Embryo to Terminal The cell-fate: Single-Cell Statistical Mechanics of Biological Regulation. Entropy. 2018;20:13. doi: 10.3390/e20010013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zimatore G., Tsuchiya M., Hashimoto M., Kasperski A., Giuliani A. Self-Organization of Whole Gene Expression through Coordinated Chromatin Structural Transition: Validation of Self-Organized Critical Control of Genome Expression. bioRxiv. 2019 doi: 10.1101/852681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Bak P., Tang C., Wiesenfeld K. Self-organized criticality: An explanation of the 1/f noise. Phys. Rev. Lett. 1987;59:381–384. doi: 10.1103/PhysRevLett.59.381. [DOI] [PubMed] [Google Scholar]
- 12.Langton C.G. Computation at the edge of chaos-phase transitions and emergent computation. Physica D. 1990;42:12–37. doi: 10.1016/0167-2789(90)90064-V. [DOI] [Google Scholar]
- 13.Kauffman S.A. The Origins of Order: Self-Organization and Selection in Evolution. Oxford University Press; New York, NY, USA: 1993. [DOI] [Google Scholar]
- 14.Jensen H.J. Self-Organized Criticality. Cambridge University Press; Cambridge, UK: 1998. [Google Scholar]
- 15.Marković D., Gros C. Power laws and self-organized criticality in theory and nature. Phys. Rep. 2014;536:41–74. doi: 10.1016/j.physrep.2013.11.002. [DOI] [Google Scholar]
- 16.Muñoz M.A. Colloquium: Criticality and dynamical scaling in living systems. Rev. Mod. Phys. 2018;90:31001–31030. doi: 10.1103/RevModPhys.90.031001. [DOI] [Google Scholar]
- 17.Wagner J.R., Lee C.T., Durrant J.D., Malmstrom R.D., Feher V.A., Amaro R.E. Emerging Computational Methods for the Rational Discovery of Allosteric Drugs. Chem. Rev. 2016;116:6370–6390. doi: 10.1021/acs.chemrev.5b00631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Halley J.D., Burden F.R., Winkler D.A. Summary of stem cell decision making and critical-like exploratory networks. Stem Cell Res. 2009;2:165–177. doi: 10.1016/j.scr.2009.03.001. [DOI] [PubMed] [Google Scholar]
- 19.Nagashima T., Shimodaira H., Ide K., Nakakuki T., Tani Y., Takahashi K., Yumoto N., Hatakeyama M. Quantitative transcriptional control of ErbB receptor signaling undergoes graded to biphasic response for cell differentiation. J. Biol. Chem. 2007;282:4045–4056. doi: 10.1074/jbc.M608653200. [DOI] [PubMed] [Google Scholar]
- 20.Nakakuki T., Birtwistle M.R., Saeki Y., Yumoto N., Ide K., Nagashima T., Brusch L., Ogunnaike B.A., Okada-Hatakeyama M., Kholodenko B.N. Ligand-specific c-Fos expression emerges from the spatiotemporal control of ErbB network dynamics. Cell. 2010;141:884–896. doi: 10.1016/j.cell.2010.03.054. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Huang S., Eichier G., Bar-Yam Y., Ingber D.E. The cell-fates as high-dimensional attractor states of a complex gene regulatory network. Phys. Rev. Lett. 2005;94:128701–128705. doi: 10.1103/PhysRevLett.94.128701. [DOI] [PubMed] [Google Scholar]
- 22.Ciofani M., Madar A., Galan C., Sellars M., Mace K., Pauli F., Agarwal A., Huang W., Parkhurst C.N., Muratet M., et al. A validated regulatory network for Th17 cell specification. Cell. 2012;151:289–303. doi: 10.1016/j.cell.2012.09.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Deng Q., Ramsköld D., Reinius B., Sandberg R. Single-cell RNA-seq reveals dynamic, random monoallelic gene expression in mammalian cells. Science. 2014;343:193–196. doi: 10.1126/science.1245316. [DOI] [PubMed] [Google Scholar]
- 24.Yan L., Yang M., Guo H., Yang L., Wu J., Li R., Liu P., Lian Y., Zheng X., Yan J., et al. Single-cell RNA-Seq profiling of human preimplantation embryos and embryonic stem cells. Nat. Struct. Mol. Biol. 2013;20:1131–1139. doi: 10.1038/nsmb.2660. [DOI] [PubMed] [Google Scholar]
- 25.Censi F., Giuliani A., Bartolini P., Calcagnini G. A Multiscale Graph Theoretical Approach to Gene Regulation Networks: A Case Study in Atrial Fibrillation. IEEE Trans. Biomed. Eng. 2011;99:1–5. doi: 10.1109/TBME.2011.2150747. [DOI] [PubMed] [Google Scholar]
- 26.Tsuchiya M., Wong S.T., Yeo Z.X., Colosimo A., Palumbo M.C., Crescenzi M., Mazzola A., Negri R., Bianchi M.M., Selvarajoo K., et al. Gene expression waves: Cell cycle independent collective dynamics in cultured cells. FEBS J. 2007;274:2874–2886. doi: 10.1111/j.1742-4658.2007.05822.x. [DOI] [PubMed] [Google Scholar]
- 27.Shu G., Zeng B., Chen Y.P., Smith O.H. Performance assessment of kernel density clustering for gene expression profile data. Comp. Funct. Genomics. 2003;4:287–299. doi: 10.1002/cfg.290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tsuchiya M., Giuliani A., Yoshikawa K. A Quantitative Evaluation of Symmetry Breaking in Nonlinear-Oscillatory System—Based on Flux Dynamics (Effective force) View Point. [(accessed on 31 March 2020)];2018 Available online: https://www.researchgate.net/publication/323365534_A_Quantitative_Evaluation_of_Symmetry_Breaking_In_Nonlinear-Oscillatory_System_-_Based_on_Flux_Dynamics_Effective_force_View_Point.
- 29.Saeki Y., Endo T., Ide K., Nagashima T., Yumoto N., Toyoda T., Suzuki H., Hayashizaki Y., Sakaki Y., Okada-Hatakeyama M. Ligand-specific sequential regulation of transcription factors for differentiation of MCF-7 cells. BMC Genom. 2009;10:545. doi: 10.1186/1471-2164-10-545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Flory P. Principles of Polymer Chemistry. Cornell University Press; Ithaca, NY, USA: 1953. [Google Scholar]
- 31.De Gennes P.G. Scaling Concepts in Polymer Physics. Cornell University Press; Ithaca, NY, USA: 1979. [Google Scholar]
- 32.Yoshikawa K., Takahashi M., Vasilevskaya V.V., Khokhlov A.R. Large Discrete Transition in a Single DNA Molecule Appears Continuous in the Ensemble. Phys. Rev. Lett. 1996;76:3029–3031. doi: 10.1103/PhysRevLett.76.3029. [DOI] [PubMed] [Google Scholar]
- 33.Yoshikawa K., Yoshikawa Y. Compaction and Condensation of DNA, in Pharmaceutical Perspective of Nucleic Acid-Base Therapy. Taylor & Francis; Abingdon, UK: 2002. pp. 137–163. [Google Scholar]
- 34.Zinchenko A., Pyshkina O., Lezov A., Sergeyev V., Yoshikawa K. DNA interactions with Polymers and Surfactants. John Wiley & Sons, Inc.; Hoboken, NJ, USA: 2008. Single DNA molecules: Compaction and Decompaction (chapter 3) pp. 59–88. [DOI] [Google Scholar]
- 35.Sakaue T., Yoshikawa K. On the formation of rings-on-a-string conformations in a single polyelectrolyte chain: A possible scenario. J. Chem. Phys. 2006;125:074904. doi: 10.1063/1.2244555. [DOI] [PubMed] [Google Scholar]
- 36.Shew C.Y., Yoshikawa K. Mean field theory for the intermolecular and intramolecular conformational transitions of a single flexible polyelectrolyte chain. J. Chem. Phys. 2007;126:144913. doi: 10.1063/1.2714552. [DOI] [PubMed] [Google Scholar]
- 37.Schiessel H. The physics of chromatin. J. Phys. Condes. Matter. 2003;15:R699–R774. doi: 10.1088/0953-8984/15/19/203. [DOI] [Google Scholar]
- 38.Nakai T., Hizume K., Yoshimura S.H., Takeyasu K., Yoshikawa K. Phase transition in reconstituted chromatin. Europhys. Lett. 2005;69:1024–1030. doi: 10.1209/epl/i2004-10444-6. [DOI] [Google Scholar]
- 39.Suzuki Y., Yoshikawa Y., Yoshimura S.H., Yoshikawa K., Takeyasu K. Unraveling DNA dynamics using atomic force microscopy. Wiley Interdiscip. Rev. Nanomed. Nanobiotechnol. 2011;3:574–588. doi: 10.1002/wnan.150. [DOI] [PubMed] [Google Scholar]
- 40.Krigerts J., Salmina K., Freivalds T., Rumnieks F., Inashkina I., Zayakin P., Hausmann M., Erenpreisa J. Early Critical Phase Transitions of Pericentromere-Associated Domains in MCF-7 Breast Cancer Cells Committed to Differentiation by Heregulin. Preprints. 2020 doi: 10.20944/preprints202005.0248.v1. [DOI] [Google Scholar]
- 41.Wachsmuth M., Knoch T.A., Rippe K. Dynamic properties of independent chromatin domains measured by correlation spectroscopy in living cells. Epigenetics Chromatin. 2016;9:57. doi: 10.1186/s13072-016-0093-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Takagi S., Yoshikawa K. Stepwise collapse of polyelectrolyte chains entrapped in a finite space as predicted by theoretical considerations. Langmuir. 1999;15:4143–4146. doi: 10.1021/la981158o. [DOI] [Google Scholar]
- 43.Yoshikawa K., Noguchi H. A working hypothesis on the mechanism of molecular machinery. Chem. Phys. Lett. 1999;303:10–14. doi: 10.1016/S0009-2614(99)00203-1. [DOI] [Google Scholar]
- 44.Ginzburg V.L., Landau L.D. On the theory of superconductors. Zh. Eksp. Teor. Fiz. 1965;20:1064–1082. [Google Scholar]
- 45.Daniels B.C., Kim H., Moore D., Zhou S., Smith H.B., Karas B., Kauffman S.A., Walker S.I. Criticality distinguishes the ensemble of biological regulatory networks. Phys. Rev. Lett. 2018;121:138102. doi: 10.1103/PhysRevLett.121.138102. [DOI] [PubMed] [Google Scholar]
- 46.Maeshima K., Ide S., Hibino K., Sasai M. Liquid-like behavior of chromatin. Curr. Opin. Genet. Dev. 2016;37:36–45. doi: 10.1016/j.gde.2015.11.006. [DOI] [PubMed] [Google Scholar]
- 47.Bolstad B.M., Irizarry R.A., Astrand M., Speed T.P. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003;19:185–193. doi: 10.1093/bioinformatics/19.2.185. [DOI] [PubMed] [Google Scholar]
- 48.Irizarry R.A., Hobbs B., Collin F., Beazer-Barclay Y.D., Antonellis K.J., Scherf U., Speed T.P. Exploration, normalization, and summaries of high density oligonucleotide array probe level data. Biostatistics. 2003;4:249–264. doi: 10.1093/biostatistics/4.2.249. [DOI] [PubMed] [Google Scholar]
- 49.McClintick J.N., Edenberg H.J. Effects of filtering by present call on analysis of microarray experiments. BMC Bioinform. 2006;7:49. doi: 10.1186/1471-2105-7-49. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.