Information-theoretic analysis of the directional influence between cellular processes

Sourabh Lahiri; Philippe Nghe; Sander J Tans; Martin Luc Rosinberg; David Lacoste

doi:10.1371/journal.pone.0187431

. 2017 Nov 9;12(11):e0187431. doi: 10.1371/journal.pone.0187431

Information-theoretic analysis of the directional influence between cellular processes

Sourabh Lahiri ¹, Philippe Nghe ², Sander J Tans ³, Martin Luc Rosinberg ⁴, David Lacoste ^1,^*

Editor: Ramon Grima⁵

PMCID: PMC5679622 PMID: 29121044

Abstract

Inferring the directionality of interactions between cellular processes is a major challenge in systems biology. Time-lagged correlations allow to discriminate between alternative models, but they still rely on assumed underlying interactions. Here, we use the transfer entropy (TE), an information-theoretic quantity that quantifies the directional influence between fluctuating variables in a model-free way. We present a theoretical approach to compute the transfer entropy, even when the noise has an extrinsic component or in the presence of feedback. We re-analyze the experimental data from Kiviet et al. (2014) where fluctuations in gene expression of metabolic enzymes and growth rate have been measured in single cells of E. coli. We confirm the formerly detected modes between growth and gene expression, while prescribing more stringent conditions on the structure of noise sources. We furthermore point out practical requirements in terms of length of time series and sampling time which must be satisfied in order to infer optimally transfer entropy from times series of fluctuations.

Introduction

Quantifying information exchange between variables is a general goal in many studies of biological systems because the complexity of such systems prohibits mechanistic bottom-up approaches. Several statistical methods have been proposed to exploit either the specific dependence of the covariances between input and output variables with respect to a perturbation applied to the network [1], or the information contained in 3-point correlations [2]. These methods are potentially well suited for datasets obtained from destructive measurements, such as RNA sequencing or immunohistochemistry.

However, none of these methods exploits the information contained in time-lagged statistics, which is provided for instance by non-destructive measurements obtained from time-lapse microscopy of single cells. Such experimental data should be quite relevant to understand functional relationships since they merely reflect the time delays present in the dynamics of the system. Time-delayed cross-correlations between gene expression fluctuations have indeed been shown to discriminate between several mechanistic models of well characterized genetic networks [3]. However, such methods become difficult to interpret in the presence of feedback.

This situation is illustrated in reference [4] where the fluctuations in the growth rate and in the expression level of metabolic enzymes have been measured as a function of time by tracking single cells of E. coli with time-lapse microscopy. The interplay between these variables has been characterized using cross-correlations as proposed in [3]. To circumvent the difficulty of discriminating between many complex and poorly parametrized metabolic models, the authors reduced functional relations to effective linear responses with a postulated form of effective couplings.

In the present work, we instead use a time-lagged and information-based method to analyze the interplay between the two fluctuating variables. A crucial feature in this method is that it is model-free and it is able to disentangle the two directions of influence between the two variables, unlike the cross-correlations discussed above. This type of approach was first proposed by Granger [5] in the field of econometrics and found applications in a broader area. More recently, transfer entropy [6], which is a non-linear extension of Granger causality, has become a popular information-theoretic measure to infer directional relationships between jointly dependent processes [7]. It has been successfully applied to various biomedical time series (see for instance [8]) and used extensively in the field of neurobiology, as shown in Ref. [9] and in references therein. This is the tool that will be used in this work.

The plan of this paper is as follows. We first introduce two measures of information dynamics, transfer entropy (TE) and information flow (IF). We then illustrate our numerical method on a well controlled case, namely a simple linear Langevin model, and show that we can properly estimate these quantities from the generated time series. We then analyze experimental data on the fluctuations of metabolism of E. coli taken from Ref. [4]. We provide analytical expressions for the transfer entropy and information flow rates for the model proposed in that reference. After identifying a divergence in one TE rate as the sampling time goes to zero, we introduce a simplified model which is free of divergences while still being compatible with the experimental data. We conclude that the inference of information-theoretic dynamical quantities can be helpful to build physically sound models of the various noise components present in chemical networks.

Information theoretic measures

Unlike the mutual information I(X : Y) that only quantifies the amount of information exchanged between two random variables X and Y as defined in the section on Methods, the transfer entropy (TE) is an asymmetric measure that can discriminate between a source and a target [6]. Consider two sampled time series {‥x_i−1, x_i, x_i+1‥} and {‥y_i−1, y_i, y_i+1‥}, where i is the discrete time index, generated by a source process X and a target process Y. The transfer entropy T_X→Y from X to Y is a conditional, history-dependent mutual information defined as

\begin{matrix} T_{X \to Y} & = & \sum P (y_{i + 1}, y_{i}^{(k)}, x_{i}^{(l)}) ln \frac{P (y_{i + 1} | y_{i}^{(k)}, x_{i}^{(l)})}{P (y_{i + 1} | y_{i}^{(k)})}, \\ = & \sum_{i} [H (y_{i + 1} | y_{i}^{(k)}) - H (y_{i + 1} | y_{i}^{(k)}, x_{i}^{(l)})] \end{matrix}

(1)

where $y_{i}^{(k)} = {y_{i - k + 1}, \dots, y_{i}}$ and $x_{i}^{(l)} = {x_{i - l + 1}, \dots, x_{i}}$ denote two blocks of past values of Y and X of length k and l respectively, $P (y_{i + 1}, y_{i}^{(k)}, x_{i}^{(l)})$ is the joint probability of observing $y_{i + 1}, y_{i}^{(k)}, x_{i}^{(l)}$ , and $P (y_{i + 1} | y_{i}^{(k)}, x_{i}^{(l)}), P (y_{i + 1} | y_{i}^{(k)})$ are conditional probabilities. In the second line, H(.|.) denotes the conditional Shannon entropy (see Section on Methods for definition). In the first equation, the summation is taken over all possible values of the random variables $y_{i + 1}, y_{i}^{(k)}, x_{i}^{(l)}$ and over all values of the time index i.

To put it in simple terms, T_X→Y quantifies the information contained from the past of X about the future of Y, which the past of Y did not already provide [7, 8]. Therefore, it should be regarded as a measure of predictability rather than a measure of causality between two time-series [10]. For instance, when $x_{i}^{(l)}$ does not bring new information on y_i+1, then $P (y_{i + 1} | y_{i}^{(k)}, x_{i}^{(l)}) = P (y_{i + 1} | y_{i}^{(k)})$ and the transfer entropy vanishes because the prediction on y_i+1 is not improved. With a similar definition for T_Y→X, one can define the net variation of transfer entropy from X to Y as ΔT_X→Y ≡ T_X→Y − T_Y→X. The sign of ΔT_X→Y informs on the directionality of the information transfer.

The statistics required for properly evaluating the transfer entropy rapidly increases with k and l, which in practice prohibits the use of large values of k and l. The most accessible case thus corresponds to k = l = 1, which we denote hereafter as ${\bar{T}}_{X \to Y}$ . This quantity is then simply defined as

\begin{matrix} {\bar{T}}_{X \to Y} = \sum_{i} [H (y_{i + 1} | y_{i}) - H (y_{i + 1} | y_{i}, x_{i})], \end{matrix}

(2)

When the dynamics of the joint process {X, Y} is Markovian, one has $P (y_{i + 1} | y_{i}^{(k)}, x_{i}^{(l)}) = P (y_{i + 1} | y_{i}, x_{i})$ and since $H (y_{i + 1} | y_{i}^{(k)}) \leq H (y_{i + 1} | y_{i})$ one has ${\bar{T}}_{X \to Y} \geq T_{X \to Y}$ (see Ref. [11]). Therefore, ${\bar{T}}_{X \to Y}$ represents an upper bound on the transfer entropy. In the case of stationary time series, which is the regime we consider in this work, it is natural to also introduce the TE rate

\begin{matrix} {\bar{T}}_{X \to Y} & = & lim_{τ \to 0} \frac{H (y_{t + τ} | y_{t}) - H (y_{t + τ} | x_{t}, y_{t})}{τ} \\ = & lim_{τ \to 0} \frac{I (y_{t + τ} : y_{t}, x_{t}) - I (y_{t + τ} : y_{t})}{τ}, \end{matrix}

(3)

where the continuous time variable t replaces the discrete index i. In practice ${\bar{T}}_{X \to Y} ≃ {\bar{T}}_{X \to Y} / τ$ , but only for sufficiently small time step τ.

The most direct strategy to evaluate Eq (1) would be to construct empirical estimators of the probabilities from histograms of the data. Although this procedure works well for evaluating other quantities, for instance the entropy production in small stochastic systems [12], it completely fails in the case of transfer entropy. Indeed, such a method leads to a non-zero TE even between uncorrelated signals, due to strong biases in standard estimators based on data binning. In order to overcome this problem, we used the Kraskov-Stögbauer-Grassberger (KSG) estimator which does not rely on binning, as implemented in the software package JIDT (Java Information Dynamics Toolkit) [13]. Using estimators of this kind is particularly important for variables that take continuous values.

In the following, the inference method will be applied to time series generated by diffusion processes. It will then be interesting to compare the TE rate ${\bar{T}}_{X \to Y}$ to another measure of information dynamics, the so-called information flow [14–16] (also dubbed learning rate in the context of sensory systems [11, 17]), which is defined as the time-shifted mutual information [18]

\begin{matrix} I_{X \to Y}^{f l o w} = lim_{τ \to 0} \frac{I (y_{t} : x_{t}) - I (y_{t} : x_{t + τ})}{τ} . \end{matrix}

(4)

In the special case where the two processes X and Y experience independent noises (the system is then called bipartite) [15], one has the inequality $I_{X \to Y}^{f l o w} \leq T_{X \to Y}$ [17], which in turn implies that

\begin{matrix} I_{X \to Y}^{f l o w} \leq {\bar{T}}_{X \to Y} \end{matrix}

(5)

when the joint process is Markovian. Observing a violation of this inequality is thus a strong indication that the noises on X and Y are correlated. As will be seen later, this is indeed the situation in biochemical networks, due the presence of the so-called extrinsic noise generated by the stochasticity in the cell and in the cell environment [19] which acts on all chemical reactions within the cell, and thus induces correlations.

Results

Test of the inference method on a Langevin model

In order to benchmark our inference method and perform a rigorous test in a controlled setting, we first applied it on times series generated by a simple model for which the transfer entropy and the information flow can be computed analytically. The data were obtained by simulating the two coupled Langevin equations

\begin{matrix} m \dot{v} & = & - γ v - a y + ξ, \\ τ_{r} \dot{y} & = & v - y + η \end{matrix}

(6)

that describe the dynamics of a particle of mass m subjected to a velocity-dependent feedback that damps thermal fluctuation [16, 20, 21] (in these equations, the dependence of the variables on the time t is implicit). Here, ξ(t) is the noise generated by the thermal environment with viscous damping γ and temperature T, while η(t) is the noise associated with the measurement of the particle’s velocity v(t). The two noises are independent and Gaussian with zero-mean and variances ⟨ξ(t)ξ(t′)⟩ = 2γk_BTδ(t − t′) and ⟨η(t)η(t′)⟩ = σ² δ(t − t′). a is the feedback gain and τ_r is a time constant.

The two Langevin equations were numerically integrated with the standard Heun’s method [22] using a time step Δt = 10⁻³, and the transfer entropy in the steady state was estimated from 100 time series of duration t = 2000 with a sampling time (i.e., the time between two consecutive data points) τ = Δt. We first checked that the TE in the direction Y → V does vanish in the absence of feedback, i.e. for a = 0, whereas it is non-zero as soon as a > 0. We then tested the influence of the measurement error σ² for a fixed value of the gain a. As can be seen in Fig 1, T_V→Y diverges as σ² → 0, a feature that will play an important role in our discussion of the model for the metabolic network. In the figure, the color of the symbols correspond to three different values of the parameter k which represents the history length in the definition of the transfer entropy (see Eq (1)). One can see that the estimates of T_V→Y for k = 1 are in very good agreement with the theoretical prediction for ${\bar{T}}_{V \to Y}$ (upper solid line). Moreover, the estimates decrease as k is increased from 1 to 5, and one can reasonably expect that the theoretical value of T_V→Y (lower solid line) computed in Ref. [16] and given by Eq (21) in the section on Methods would be reached in the limit k → ∞.

Fig 1 — The parameter l present in the definition of Eq (1) is fixed to 1. The lower red (resp. upper blue) solid line represents the value of T_Y→V (resp. ${\bar{T}}_{Y \to V}$ ) obtained by multiplying the theoretical rate $T_{Y \to V}$ (resp. ${\bar{T}}_{Y \to V}$ ) given by Eq (21) (resp. Eq (23) by the sampling time τ = 10⁻³. The parameters of the model are T = 5, γ = m = 1, τ_r = 0.1, and a = 8.

Finally, by estimating the information flow and the transfer entropy, we checked that inequality (5) holds, as a result of the independence of the two noises ξ and η (see section on Methods).

Analysis of stochasticity in a metabolic network

Experimental time series

We are now in position to analyze the fluctuations in the metabolism of E. coli at the single cell level obtained in Ref. [4] using the information-theoretic notions introduced and tested in the previous section. Since there are a multitude of reactions and interactions involved in the metabolism of E. coli, a complete mechanistic description is not feasible, and our model-free inference method has a crucial advantage. In Ref. [4], the length of the cells was recorded as a function of time using image analysis, and the growth rate was then obtained by fitting this data over subparts of the cell cycle. In the same experiment, the fluorescence level of GFP, which is co-expressed with growth enzymes LacY and LacZ was recorded. Three set of experiments were carried out corresponding to three levels of an inducer IPTG: low, intermediate and high.

The two time series have a branching structure due to the various lineages, which all start from a single mother cell as shown in Fig 2. The experimental data thus come in the form of a large ensemble of short times series which represent a record of all the cell cycles. There are about ∼3000 time series, with 2 to 8 measurement points in each of them which are represented as colored points in Fig 2. In order to correctly estimate the transfer entropy from such data, we have analyzed the multiple time series as independent realizations of the same underlying stochastic process. For the present analysis, we fix the history length parameters k and l to the value k = l = 1, which means that we focus on $\bar{T}$ rather than T. We infer the values of $\bar{T}$ in the two directions, from growth (denoted μ) to gene expression (denoted E) and vice versa. The results obtained for the three concentrations of IPTG are represented in Table 1. The negative value of ${\bar{T}}_{μ \to E}$ which is found in the intermediate case is due to the numerical inference method and should be regarded as a value which cannot be distinguished from zero.

Table 1. Inferred values of the transfer entropies in the directions E → μ and μ → E, and the difference $Δ {\bar{T}}_{E \to μ} = {\bar{T}}_{E \to μ} - {\bar{T}}_{μ \to E}$ for low, medium and high concentrations of IPTG based on the data of ref. [4].

The TE are given in nats.

Conc. of IPTG	Low	Intermediate	High
${\bar{T}}_{E \to μ}$	2.35 ⋅ 10⁻²	1.37 ⋅ 10⁻²	1.06 ⋅ 10⁻³
${\bar{T}}_{μ \to E}$	2.16 ⋅ 10⁻²	−4.08 ⋅ 10⁻³	9.94 ⋅ 10⁻³
$Δ {\bar{T}}_{E \to μ}$	1.84 ⋅ 10⁻⁴	1.78 ⋅ 10⁻²	−8.88 ⋅ 10⁻³

Open in a new tab

Based on this analysis, we conclude that the influence between the variables is directed primarily from enzyme expression to growth in the low and intermediate IPTG experiments, while it mainly proceeds in the reverse direction in the high IPTG experiment. Such results are in line with the conclusions of Ref. [4] based on the measured asymmetry of the time-lagged cross-correlations. Moreover, the present analysis provides an estimate of the influence between the two variables separately in the two directions from E to μ and from μ to E. In particular, we observe for the low experiment that the values of TE in the two directions are of same order of magnitude, whereas in the intermediate experiment the TE from E to μ is larger, a feature which could not have been guessed from measured time delays.

Theoretical models

We now turn to the analysis of the model proposed in Ref. [4] to account for the experimental data. The question we ask is whether the model correctly reproduces the above results for the transfer entropies, in particular the change in the sign of $Δ {\bar{T}}_{E \to μ}$ for the high concentration of IPTG.

The central equation of the model describes the production of the enzyme as

\begin{matrix} \dot{E} = p - μ \cdot E, \end{matrix}

(7)

where E is the enzyme concentration, p its production rate, and μ the rate of increase in cell volume. Although the function p is typically non-linear, its precise expression is irrelevant because (7) is linearized around the stationary point defined by the mean values E = E₀ and μ = μ₀. This linearization then yields

\begin{matrix} δ \dot{E} = δ p - δ μ E_{0} - μ_{0} δ E, \end{matrix}

(8)

in terms of perturbed variables δX(t) = X(t) − X₀, where X₀ denotes the mean of X.

The model of Ref. [4] is essentially phenomenological in nature because it approximates the noises as Gaussian processes. Although this approximation is often done in this field, it may not always hold since fluctuations due to low copy numbers are generally not Gaussian [23]. In any case, the model contains three Gaussian noises: N_G is a common component while N_E and N_μ are component specific to E and μ. These noises are assumed to be independent Ornstein-Uhlenbeck noises with zero mean and autocorrelation functions $〈 N_{i} (t) N_{i} (t^{'}) 〉 = η_{i}^{2} e^{- β_{i} | t - t^{'} |}$ (i = E, μ, G). As commonly done, the three Ornstein-Uhlenbeck noises are generated by the auxiliary equations

\begin{matrix} {\dot{N}}_{i} = - β_{i} N_{i} + ξ_{i}, \end{matrix}

(9)

where the $ξ_{i}^{'} s$ are zero-mean Gaussian white noises satisfying $〈 ξ_{i} (t) ξ_{j} (t^{'}) 〉 = θ_{i}^{2} δ (t - t^{'}) δ_{i j}$ with $θ_{i} = η_{i} \sqrt{2 β_{i}}$ . Introducing the constant logarithmic gains T_XY that represent how a variable X responds to the fluctuations of a source Y, the equations of the model read [4]

\begin{matrix} \frac{δ p}{E_{0} μ_{0}} & = & T_{E E} \frac{δ E}{E_{0}} + T_{E G} N_{G} + N_{E}, \\ \frac{δ μ}{μ_{0}} & = & T_{μ E} \frac{δ E}{E_{0}} + T_{μ G} N_{G} + N_{μ}, \end{matrix}

(10)

where specifically T_Eμ = −1 and T_μG = 1. Then, eliminating δp from Eqs (8) and (10), one obtains the coupled equations

\begin{matrix} \dot{x} & = & μ_{0} [(T_{E E} - 1) x + T_{E μ} y + T_{E G} N_{G} + N_{E}] \\ y & = & T_{μ E} x + T_{μ G} N_{G} + N_{μ}, \end{matrix}

(11)

where we have defined the reduced variables x = δE/E₀, y = δμ/μ₀. We stress that N_G is an extrinsic noise that affects both the enzyme concentration and the growth rate, whereas N_E (resp. N_μ) is an intrinsic noise that only affects E (resp. μ). Note that the two effective noises T_EGN_G + N_E and T_μGN_G + N_μ acting on $\dot{x}$ and y are colored and correlated, which makes the present model more complicated than most stochastic models studied in the current literature. In fact, since we are mainly interested in the information exchanged between x and y, it is convenient to replace one of the noises, say N_G, by the dynamical variable y. Differentiating the second equation in Eq (11), using Eq (9) and performing some simple manipulations, one then obtains a new set of equations for the four random variables x, y, u ≡ N_E, v ≡ N_μ:

\begin{matrix} \dot{x} & = & a_{1} x + a_{2} u + a_{3} v + a_{4} y \\ \dot{y} & = & b_{1} x + b_{2} u + b_{3} v + b_{4} y + ξ_{y} \\ \dot{u} & = & - β_{E} u + ξ_{E} \\ \dot{v} & = & - β_{μ} v + ξ_{μ}, \end{matrix}

(12)

where the coefficients a_j and b_j (j = 1…4) are defined by Eq (24) in the section on Methods and ξ_y = ξ_μ + ξ_G is a new white noise satisfying $〈 ξ_{y} (t) ξ_{y} (t^{'}) 〉 = (θ_{μ}^{2} + θ_{G}^{2}) δ (t - t^{'})$ and $〈 ξ_{y} (t) ξ_{μ} (t^{'}) 〉 = θ_{μ}^{2} δ (t - t^{'})$ .

The calculation of the transfer entropy rate ${\bar{T}}_{X \to Y}$ (which coincides with ${\bar{T}}_{E \to μ}$ since the TE is invariant under the change of variables from E to x and μ to y) is detailed in the section on Methods, together with the calculation of the information flows. The final expression reads

\begin{matrix} {\bar{T}}_{X \to Y} & = & \frac{1}{4 (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2})} \int d x d y p (x, y) [{\bar{g}}_{y}^{2} (x, y) - {\bar{\bar{g}}}_{y}^{2} (y)] \end{matrix}

(13)

where p(x, y) is the steady state probability distribution and the functions ${\bar{g}}_{y}$ and ${\bar{\bar{g}}}_{y}$ are defined in Eqs (40) and (43), respectively. This result agrees with that obtained in Refs. [11, 18] and in [24] in special cases.

In Table 2, we show the results of the analysis of the time series generated by Eq (12) using our numerical inference method with a sampling time τ = 1min (equal to the time step Δt used to numerically integrate the model). One can see that the estimates of ${\bar{T}}_{E \to μ}$ are in good agreement with the predictions of Eq (13), with the values of the model parameters taken from Table S1 in Ref. [4]. Note that the negative number given by the inference method in the high IPTG experiment signals that the actual value of ${\bar{T}}_{E \to μ}$ cannot be distinguished from zero, which is indeed the theoretical prediction. In contrast, the estimated and theoretical results for ${\bar{T}}_{μ \to E}$ do not agree, as the inference method yields finite values in all cases whereas the theoretical values diverge.

Table 2. Comparison between the theoretical values of the transfer entropy rates ${\bar{T}}_{E \to μ}$ and ${\bar{T}}_{μ \to E}$ for the model of Ref. [4] and the values inferred from simulation data.

Averages are taken over 100 times series of duration 10⁶ min, sampled every 1 min.

Conc. of IPTG	Low	Intermediate	High
${\bar{T}}_{E \to μ}$ (in h⁻¹) (theo.)	0.033	0.034	0
${\bar{T}}_{E \to μ}$ (simul.)	0.031	0.034	−0.011
${\bar{T}}_{μ \to E}$ (theo.)	∞	∞	∞
${\bar{T}}_{μ \to E}$ (simul.)	0.202	0.123	0.347

Open in a new tab

This behavior is due to the absence of a white noise source directly affecting the dynamical evolution of x in the set of Eq (12). Indeed, as pointed out in Ref. [6] and also observed above in Fig 1, a TE rate diverges when the coupling between the variables is deterministic. In the model of Ref. [4], this feature can be traced back to the fact that the noise N_E affecting the enzyme concentration is colored with a finite relaxation time $β_{E}^{- 1}$ . Therefore, when taking the limit τ → 0 in Eq (3), one explores a time interval $τ < β_{E}^{- 1}$ where N_E is not really random. This is illustrated in Fig 3a that corresponds to the low IPTG experiment: we see that the estimate of ${\bar{T}}_{μ \to E}$ with the inference method is indeed diverging when the sampling time τ approaches zero. On the other hand, as expected, ${\bar{T}}_{E \to μ}$ remains finite and the points nicely lie on the plateau determined by Eq (13).

Fig 3 — (a) Original model of Ref. [4] (b) Modified model where N_E is a white noise. The symbols are the estimates from the inference method when varying the sampling time τ, and the solid lines are the theoretical predictions from Eq (13) in (a) and from Eq (60) in (b). Note that ${\bar{T}}_{μ \to E}$ diverges as τ goes to zero in (a) but not (b).

The obvious and simplest way to cure this undesirable feature of the original model is to treat N_E as a purely white noise, which amounts to taking the limit $β_{E}^{- 1} \to 0$ . In fact, it is noticeable that the values of $β_{E}^{- 1}$ extracted from the fit of the correlation functions in Ref. [4] (resp. $β_{E}^{- 1} = 10.7, 9.9$ and 8.15 min for the low, intermediate, and high IPTG concentrations) are significantly smaller than the time steps τ_exp used for collecting the data (resp. τ_exp = 28, 20 and 15.8 min). Therefore, it is clear that the experimental data are not precise enough to decide whether N_E is colored or not. This issue does not arise for the other relaxation times in the model, $β_{μ}^{- 1} = β_{G}^{- 1}$ and $μ_{0}^{- 1}$ , which are much longer (at least for the low and intermediate IPTG concentrations), and can be correctly extracted from the experimental data.

We thus propose to modify the model of Ref. [4] by describing N_E as a Gaussian white noise with variance 〈N_E(t)N_E(t′)〉 = 2D_Eδ(t − t′) and the same intensity as the colored noise in the original model, i.e. $D_{E} = η_{E}^{2} / β_{E}$ (which yields D_E ≈ 0.188h, 0.100h, 0.031h for the three IPTG concentrations). Unsurprisingly, this modification does not affect the auto and cross-correlation functions used to fit the data, as shown in Fig 4 (see also section on Methods for a detailed calculation). On the other hand, the values of ${\bar{T}}_{E \to μ}$ are changed (compare Tables 2 and 3) and, more importantly, ${\bar{T}}_{μ \to E}$ , given by Eq (60) is now finite. As a result, the model predicts that the difference $Δ {\bar{T}}_{E \to μ} = {\bar{T}}_{E \to μ} - {\bar{T}}_{μ \to E}$ is positive at low and intermediate IPTG concentrations and becomes negative at high concentration, which is in agreement with the direct analysis of the experimental data in Table 1. In contrast, $Δ {\bar{T}}_{E \to μ}$ was always negative in the original model as ${\bar{T}}_{μ \to E}$ is infinite.

Fig 4 — (a) Autocorrelation function R_μμ(τ) for the three IPTG concentrations. Black lines: original model of Ref. [4], red circles: simplified model where N_E is a white noise. (b) Same as (a) for R_EE(τ). (c) Same as (a) for R_Eμ(τ).

Table 3. Theoretical values of the transfer entropy rates ${\bar{T}}_{E \to μ}$ and ${\bar{T}}_{μ \to E}$ and their difference in the modified model.

Conc. of IPTG	Low	Intermediate	High
${\bar{T}}_{E \to μ}$ (h⁻¹)	1.23 ⋅ 10⁻²	8.2 ⋅ 10⁻³	0
${\bar{T}}_{μ \to E}$ (h⁻¹)	1.9 ⋅ 10⁻³	5 ⋅ 10⁻⁴	2.97 ⋅ 10⁻²
$Δ {\bar{T}}_{E \to μ}$ (h⁻¹)	1.04 ⋅ 10⁻²	7.7 ⋅ 10⁻³	−2.97 ⋅ 10⁻²

Open in a new tab

This new behavior of the TE rates is also manifest when the inference method is applied to the time series generated by the model and the sampling time τ is varied. As observed in Fig 3b, the inferred value of ${\bar{T}}_{μ \to E}$ no longer diverges as τ → 0 (compare the vertical scale with that in Fig 3a). The estimates of ${\bar{T}}_{E \to μ}$ and ${\bar{T}}_{μ \to E}$ are also in good agreement with the theoretical predictions, except for the shortest value of τ which is equal to the time step Δt = 1 min used to numerically integrate the equations. It worth mentioning, however, that the error bars increase as τ is decreased.

While the change in the sign of $Δ {\bar{T}}_{E \to μ}$ is now confirmed by the model, which is the main outcome of our analysis, one may also wonder whether the numerical values in Table 1 are recovered. This requires to multiply the rates in Table 3 by the experimental sampling times τ_exp which are different in each experiment, as indicated above. One then observes significant discrepancies for the low and intermediate IPTG experiments. We believe that the problem arises from the presence of many short time series in the set of experimental data. This is a important issue that needs to be examined in more detail since it may be difficult to obtain long time series in practice.

To this aim, we have studied the convergence of the estimates of $Δ {\bar{T}}_{E \to μ}$ to the exact asymptotic value as a function of N, the length of the time series generated by the model in the stationary regime. As shown in Fig 5, the convergence with N is slow, which means that one can make significant errors in the estimation of $Δ {\bar{T}}_{E \to μ}$ if N is small. On the other hand, the convergence can be greatly facilitated by choosing a value of the sampling time which is not too short (but of course shorter than the equilibration time of the system), for instance τ = 6min instead of 1 min in the case considered in Fig 5. The important observation is that the sign of $Δ {\bar{T}}_{E \to μ}$ is then correctly inferred even with N ≈ 1000. In contrast, with τ = 1min, this is only possible for much longer series, typically N ≈ 50000. This is an encouraging indication for experimental studies, as the overall acquisition time of the data can be significantly reduced.

Fig 5 — Panels (a) and (b) correspond to sampling times τ = 6 min and τ = 1 min, respectively. $Δ {\bar{T}}_{E \to μ} (\infty)$ is the exact asymptotic value.

Finally, we briefly comment on the results for the information flows $I_{E \to μ}^{f l o w}$ and $I_{μ \to E}^{f l o w}$ . As already pointed out, the fact that the noises acting on the two random variables are correlated invalidates inequality (5). This is indeed what is observed in Table 4. It is also noticeable that $I_{E \to μ}^{f l o w} \neq - I_{μ \to E}^{f l o w}$ , except in the high IPTG experiment where T_μE = 0.

Table 4. Comparison between the theoretical values of the TE rates and the information flows for the modified model and the values inferred from simulation data (all quantities are expressed in h⁻¹).

The analysis was performed with a sampling τ = 6 min and 100 time series of 10⁶ points.

Conc. of IPTG	Low	Intermediate	High
${\bar{T}}^{E \to μ}$ , analytical	0.0123	0.0082	0
${\bar{T}}^{E \to μ}$ , simulation	0.0128 ± 6 ⋅ 10⁻⁴	0.0064 ± 6 ⋅ 10⁻⁴	−0.0002 ± 5 ⋅ 10⁻⁴
${\bar{T}}^{μ \to E}$ , analytical	0.0019	0.0005	0.0297
${\bar{T}}^{μ \to E}$ , simulation	0.0023 ± 6 ⋅ 10⁻⁴	0.0012 ± 6 ⋅ 10⁻⁴	0.0215 ± 7 ⋅ 10⁻⁴
$I_{E \to μ}^{f l o w}$ , analytical	0.0751	0.092	−0.0214
$I_{E \to μ}^{f l o w}$ , simulation	0.076 ± 10⁻³	0.09 ± 8 ⋅ 10⁻⁴	−0.018 ± 8 ⋅ 10⁻⁴
$I_{μ \to E}^{f l o w}$ , analytical	0.0455	0.0743	0.0214
$I_{μ \to E}^{f l o w}$ , simulation	0.047 ± 10⁻³	0.072 ± 10⁻³	0.015 ± 10⁻³

Open in a new tab

Discussion and conclusion

A challenge when studying any biochemical network is to properly identify the direction of information. In this work, using the notion of transfer entropy, we have characterized the directed flow of information between the single cell growth rate and the gene expression, using a method that goes beyond what could be obtained from correlation functions, or from other inference techniques which do not exploit dynamical information.

Another crucial challenge in the field is to properly model the various noise components. It turns out that biological systems are generally non-bipartite due the presence of an extrinsic component in the noise. The present work provides on the one hand analytical expressions for the magnitude of the transfer entropy (or at least an upper bound on it) and of the information flow when the system is not bipartite, and, on the other hand a numerical method to infer the TE in all cases. Furthermore, we have shown that one can correctly infer the sign of the TE difference even with short time series by properly choosing the sampling time (see Ref. [25] for more details on the dependence of TE on the sampling time).

To conclude, we would like to emphasize that the transfer entropy is a general tool to identify variables which are relevant for time series prediction [26]. As such, the method has a lot of potential beyond the particular application covered in this paper: Predicting the current or future state of the environment by sensing it is an adaptation strategy followed by biological systems which can be understood using information-theoretic concepts [11, 27]. Similarly, during evolution, biological systems accumulate information from their environment, process it and use it quasi-optimally to increase their own fitness [28, 29]. In this context, transfer entropy-based methods have the potential to identify the directional interactions in co-evolution processes, which could be for instance the genomic evolution of a virus compared to that of its antigenes [30]. With the recent advances in high-throughput techniques and experimental evolution, we might soon be able to predict reliably the evolution of biological systems [31], and without doubt tools of information theory will play a key role in these advances.

Methods

In this section, we provide a detailed analysis of the information-theoretic quantities for the various models considered in this paper. The section is organized as follows:

Basic information-theoretic measures
Transfer entropy and information flow in the feedback cooling model
Transfer entropy rates and information flows in the model of Ref. [4] for a metabolic network
Transfer entropy rates and information flows in the modified model for the metabolic network

Basic information-theoretic measures

Below we briefly recall some definitions and properties of the information-theoretic measures. A fundamental quantity is the Shannon entropy which quantifies the uncertainty associated with the measurement x of a random variable X:

\begin{matrix} H (X) = - \sum_{x} P (x) ln P (x), \end{matrix}

(14)

where P(x) is the probability that event x is realized, given an ensemble of possible outcomes. With this convention, the entropy is measured in nats. Similarly, for two random variables X and Y, one defines the joint Shannon entropy

\begin{matrix} H (X, Y) = - \sum_{x, y} P (x, y) ln P (x, y), \end{matrix}

(15)

and the conditional Shannon entropy

\begin{matrix} H (X | Y) = - \sum_{x, y} P (x, y) ln P (x | y), \end{matrix}

(16)

where P(x, y) and P(x|y) are joint and conditional probability distribution functions, respectively. The mutual information I(X : Y) is then a symmetric measure defined as

\begin{matrix} I (X : Y) & = \sum_{x, y} P (x, y) ln \frac{P (x, y)}{P (x) P (y)}, \\ = H (X) - H (X | Y) \\ = H (Y) - H (Y | X), \end{matrix}

(17)

which quantifies the reduction of the uncertainty about X (resp. Y) resulting from the knowledge of the value of Y (respX). The more strongly X and Y are correlated, the larger I(X : Y) is.

These notions can be readily extended to random processes X = {X_i} and Y = {Y_i} viewed as collections of individual random variables sorted by an integer time index i. The mutual information between the ordered time series {x_i} and {y_i}, realizations of X and Y, is then defined as

\begin{matrix} I (X : Y) = I (Y : X) \equiv \sum_{{x_{i}, y_{i}}} P (x_{i}, y_{i}) ln \frac{P (x_{i}, y_{i})}{P (x_{i}) P (y_{i})}, \end{matrix}

(18)

and characterizes the undirected information exchanged between the two processes. The conditional mutual information is defined similarly.

In contrast, the transfer entropy T_X→Y is a information-theoretic measure that is both asymmetric and dynamic as it captures the amount of information that a source process X provides about the next state of a target process Y. More precisely, as defined by Eq (1) in the introduction,

\begin{matrix} T_{X \to Y} = \sum_{i} [I (Y_{i + 1} : X_{i}^{(l)}, Y_{i}^{(k)}) - I (Y_{i + 1} : Y_{i}^{(k)})], \end{matrix}

(19)

where k and l define the lengths of the process histories, i.e., $Y_{i}^{(k)} = {Y_{i - k + 1}, \dots, Y_{i}}$ and $X_{i}^{(l)} = {X_{i - l + 1}, \dots, X_{i}}$ . In this work, we have focused on a history length of 1 (i.e. k = l = 1) and denoted the corresponding TE by ${\bar{T}}_{X \to Y}$ . Hence, ${\bar{T}}_{X \to Y} = \sum_{i} [H (Y_{i + 1} | Y_{i}) - H (Y_{i + 1} | X_{i}, Y_{i})]$ , which is an upper bound to T_X→Y(k, l) for l = 1 when the joint process {X, Y} obeys a Markovian dynamics [11].

On the other hand, the information flow from X to Y is defined as the time-shifted mutual information

\begin{matrix} I_{X \to Y}^{f l o w} = \sum_{i} [I (Y_{i} : X_{i}) - I (Y_{i} : X_{i + 1})], \end{matrix}

(20)

and informs on the reduction of uncertainty in Y_i when knowing about X_i+1 as compared to what we had with X_i only. In practice, $I_{X \to Y}^{f l o w}$ can be obtained by shifting in time one time series with respect to the other one. Contrary to the transfer entropy which is always a positive quantity, the information flow $I_{X \to Y}^{f l o w}$ may be negative or positive, depending on whether X sends information to Y (or X gains control of Y), or Y sends information to X (or X looses control over Y). In a bipartite system one has $I_{X \to Y}^{f l o w} = - I_{Y \to X}^{f l o w}$ in the stationary regime. This is no longer true when the system is non-bipartite.

Transfer entropy and information flow in the feedback cooling model

We first recall the theoretical expressions of the transfer entropy rates and the information flows for the feedback-cooling model described by Eq (6). These quantities were computed in Ref. [16]. The transfer entropy rates in the stationary state are given by

\begin{matrix} T_{V \to Y} & = & \frac{γ}{2 m} (\sqrt{1 + \frac{2 T}{γ σ^{2}}} - 1) \\ T_{Y \to V} & = & \frac{1}{2 τ_{r}} (\sqrt{1 + \frac{a^{2} σ^{2}}{2 γ T}} - 1) . \end{matrix}

(21)

Note that 2T/(γσ²) is the signal-to-noise ratio that quantifies the relative size of the measurement accuracy to the thermal diffusion of the velocity. Accordingly, the TE rate $T_{V \to Y}$ diverges when the control is deterministic. The information flow $I_{V \to Y}^{f l o w}$ is given by

\begin{matrix} I_{V \to Y}^{f l o w} = \frac{γ}{m} (\frac{T ⟨ y^{2} ⟩}{m | Σ |} - 1) \end{matrix}

(22)

where $| Σ |$ is the determinant of the covariance matrix. The analytical expressions of the elements of the matrix, 〈v²〉, 〈y²〉 and 〈vy〉, are given by Eqs (A2) in Ref. [16]. In contrast with $T_{V \to Y}$ , the information flow $I_{V \to Y}^{f l o w}$ remains finite as the noise intensity vanishes.

The upper bounds to the transfer entropies (see Eq (2)) were computed in Ref. [24] in the general case of coupled linear Langevin equations. For the feedback cooling model, one obtains

\begin{matrix} {\bar{T}}_{V \to Y} & = & \frac{1}{2 σ^{2} ⟨ y^{2} ⟩} | Σ | \\ {\bar{T}}_{Y \to V} & = & \frac{a^{2}}{4 γ k_{B} T ⟨ v^{2} ⟩} | Σ | . \end{matrix}

(23)

As shown in Fig 1, the estimate of the transfer entropy obtained by the inference method is in good agreement with the theoretical value (we stress that the figure shows the rates multiplied by the sampling time τ = 10⁻³). In Fig 6, we also obtain satisfactory agreement between inferred value of the information flow $I_{V \to Y}^{f l o w}$ and theoretical value, when representing these quantities against the noise intensity σ². These results of this figure confirm the inequalities $I_{V \to Y}^{f l o w} \leq T_{V \to Y} \leq {\bar{T}}_{V \to Y}$ .

Fig 6 — The parameters of the model are T = 5, γ = m = 1, τ_r = 0.1 and a = −0.7.

Transfer entropy rates and information flows in the model of Ref. [4] for a metabolic network

Stationary distributions and correlation functions

We first compute the stationary probability distributions (pdfs) associated with Eq (12) were the coefficients a_j and b_j are given by

\begin{matrix} a_{1} & = & - [μ_{E} + μ_{0} T_{μ E} (T_{E G} - 1)] \\ a_{2} & = & μ_{0} \\ a_{3} & = & - μ_{0} T_{E G} \\ a_{4} & = & μ_{0} (T_{E G} - 1) \\ b_{1} & = & T_{μ E} [β_{G} - μ_{E} - μ_{0} T_{μ E} (T_{E G} - 1)] \\ b_{2} & = & μ_{0} T_{μ E} \\ b_{3} & = & β_{G} - β_{μ} - μ_{0} T_{μ E} T_{E G} \\ b_{4} & = & μ_{0} T_{μ E} (T_{E G} - 1) - β_{G} . \end{matrix}

(24)

We recall that μ_E = μ₀(1 + T_μE − T_EE) sets the timescale of E-fluctuations [4]. Since Eq (12) describe a set of coupled Markovian Ornstein-Uhlenbeck processes, the stationary pdf p_xuvy(x, u, v, y) is Gaussian and given by

\begin{matrix} p_{x u v y} (x, u, v, y) = \frac{1}{{(2 π)}^{2} \sqrt{| Σ |}} e^{- \frac{1}{2} (x, u, v, y) . Σ^{- 1} . {(x, u, v, y)}^{T}}, \end{matrix}

(25)

where $Σ$ is the covariance matrix which obeys the Lyapunov equation [32]

\begin{matrix} A Σ + Σ A^{T} = 2 D, \end{matrix}

(26)

where

\begin{matrix} A = (\begin{matrix} - a_{1} & - a_{2} & - a_{3} & - a_{4} \\ 0 & β_{E} & 0 & 0 \\ 0 & 0 & β_{μ} & 0 \\ - b_{1} & - b_{2} & - b_{3} & - b_{4} \end{matrix}), and D = (\begin{matrix} 0 & 0 & 0 & 0 \\ 0 & β_{E} η_{E}^{2} & 0 & 0 \\ 0 & 0 & β_{μ} η_{μ}^{2} & β_{μ} η_{μ}^{2} \\ 0 & 0 & β_{μ} η_{μ}^{2} & β_{G} η_{G}^{2} + β_{μ} η_{μ}^{2} \end{matrix}) . \end{matrix}

The solution of Eq (26) reads

\begin{matrix} σ_{11} & = & \frac{μ_{0}^{2}}{μ_{E}} [\frac{η_{E}^{2}}{μ_{E} + β_{E}} + \frac{η_{μ}^{2}}{μ_{E} + β_{μ}} + \frac{{(T_{E G} - 1)}^{2}}{μ_{E} + β_{G}} η_{G}^{2}] \\ σ_{12} & = & σ_{21} = \frac{μ_{0}}{μ_{E} + β_{E}} η_{E}^{2} \\ σ_{13} & = & σ_{31} = \frac{- μ_{0}}{μ_{E} + β_{μ}} η_{μ}^{2} \\ σ_{14} & = & σ_{41} = \frac{μ_{0}}{μ_{E}} [\frac{μ_{0} T_{μ E}}{μ_{E} + β_{E}} η_{E}^{2} + \frac{(μ_{0} T_{μ E} - μ_{E})}{μ_{E} + β_{μ}} η_{μ}^{2} \\ + \frac{(T_{E G} - 1) [μ_{0} T_{μ E} (T_{E G} - 1) + μ_{E}]}{μ_{E} + β_{G}} η_{G}^{2}] \\ σ_{22} & = & η_{E}^{2} \\ σ_{23} & = & 0 \\ σ_{24} & = & σ_{42} = \frac{μ_{0} T_{μ E}}{μ_{E} + β_{E}} η_{E}^{2} \\ σ_{33} & = & η_{μ}^{2} \\ σ_{34} & = & σ_{43} = \frac{μ_{E} + β_{μ} - μ_{0} T_{μ E}}{μ_{E} + β_{μ}} η_{μ}^{2} \\ σ_{44} & = & \frac{μ_{0}^{2} T_{μ E}^{2}}{μ_{E} (μ_{E} + β_{E})} η_{E}^{2} + \frac{[{(μ_{0} T_{μ E} - μ_{E})}^{2} + μ_{E} β_{μ}]}{μ_{E} (μ_{E} + β_{μ})} η_{μ}^{2} \\ + \frac{μ_{0}^{2} T_{μ E}^{2} {(T_{E G} - 1)}^{2} + μ_{E} [μ_{E} + β_{G}]}{μ_{E} (μ_{E} + β_{G})} η_{G}^{2} \\ + \frac{2 μ_{0} T_{μ E} (T_{E G} - 1)]}{μ_{E} (μ_{E} + β_{G})} η_{G}^{2} \end{matrix}

(27)

From this we can compute all marginal pdfs, in particular

\begin{matrix} p_{x y} (x, y) & = & \frac{1}{2 π \sqrt{σ_{11} σ_{44} - σ_{14}^{2}}} e^{- \frac{1}{2} \frac{σ_{44} x^{2} - 2 σ_{14} x y + σ_{11} y^{2}}{σ_{11} σ_{44} - σ_{14}^{2}}}, \end{matrix}

(28)

and

\begin{matrix} p_{x} (x) & = & \frac{1}{\sqrt{2 π σ_{11}}} e^{- \frac{x^{2}}{2 σ_{11}}} \\ p_{y} (y) & = & \frac{1}{\sqrt{2 π σ_{44}}} e^{- \frac{y^{2}}{2 σ_{44}}} . \end{matrix}

(29)

As an illustration, the steady-state pdf $p (μ) = \frac{1}{μ_{0}} p_{y} (y = \frac{μ - μ_{0}}{μ_{0}})$ is plotted in Fig 7 for the three different IPTG concentrations (low, intermediate, and high). The agreement with the experimental curves displayed in Fig 1d of Ref. [4] is satisfactory.

For completeness, we also quote the expressions of R_pp(0) and R_pμ(0) (properly normalized) obtained from the definition $δ p / (μ_{0} E_{0}) = δ \dot{E} / (μ_{0} E_{0}) + δ μ / μ_{0} + δ E / E_{0} = (T_{E E} - T_{E G} T_{μ E}) x + u - T_{E G} (v - y)$ :

\begin{matrix} R_{p p} (0) & = & {(T_{E E} - T_{E G} T_{μ E})}^{2} σ_{11} + σ_{22} + T_{E G}^{2} (σ_{33} + σ_{44}) \\ + 2 (T_{E E} - T_{E G} T_{μ E}) [σ_{12} + T_{E G} (σ_{14} - σ_{13})] \\ + 2 T_{E G} σ_{24} - 2 T_{E G}^{2} σ_{34} \end{matrix}

(30)

\begin{matrix} R_{p μ} (0) & = & \frac{(T_{E E} - T_{E G} T_{μ E}) σ_{14} + σ_{24} + T_{E G} (σ_{44} - σ_{34})}{\sqrt{R_{p p} (0) R_{μ μ} (0)}} \end{matrix}

(31)

with R_μμ(0) = σ₄₄.

The correlation functions R_μμ(τ), R_EE(τ), and R_Eμ(τ), obtained by taking the inverse Fourier transform of Eqs (6) in the Supplementary Information of [4] are plotted in Fig 4. In passing, we correct a few misprints in these equations: i) The correct expression of R_μμ(τ) is obtained by replacing A_E(τ) by R_EE(τ) in the first term of Eq (12) in the Supplementary Information of [4]. ii) Eq (10) corresponds to R_Eμ(τ) and not to R_μE(τ) = R_Eμ(−τ). Eq (8) then gives the correct expression of R_Eμ(τ) (and not of R_μE(τ)) provided the function A_X(τ) defined in Eq (10) is altered. For τ ≥ 0, one should have

\begin{matrix} A_{X} (τ) = θ_{X}^{2} \frac{μ_{0}}{2 β_{X} (β_{X} + μ_{E})} e^{- β_{X} t} . \end{matrix}

(32)

Transfer entropy rates

We now address the computation of the conditional probabilities $p_{x^{'} y^{'}}^{y} (y, t + τ | x^{'}, y^{'}, t)$ and $p_{y^{'}}^{y} (y, t + τ | y^{'}, t)$ at first order in τ. This will allow us to obtain the expressions of the upper bounds to the transfer entropy rates defined by

\begin{matrix} {\bar{T}}_{X \to Y} & = & lim_{τ \to 0} \frac{I [y_{t + τ} : x_{t}, y_{t}] - I [y_{t + τ} : y_{t}]}{τ} \\ {\bar{T}}_{Y \to X} & = & lim_{τ \to 0} \frac{I [x_{t + τ} : x_{t}, y_{t}] - I [x_{t + τ} : x_{t}]}{τ}, \end{matrix}

(33)

where I is the mutual information, for instance $I [y_{t + τ} : x_{t}, y_{t}] = \int d y d x^{'} d y^{'} p_{x^{'} y^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t) ln [p_{x^{'} y^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t) / [p_{y} (y) p_{x y} (x^{'}, y^{'})]$ in the steady state (where p_xy(x′, y′) and p_y(y) become time independent pdfs). Therefore,

\begin{matrix} {\bar{T}}_{X \to Y} & = & lim_{τ \to 0} \frac{1}{τ} \int d y d x^{'} d y^{'} p_{x^{'} y^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t) \\ \times ln \frac{p_{x^{'} y^{'}}^{y} (y, t + τ | x^{'}, y^{'}, t)}{p_{y^{'}}^{y} (y, t + τ | y^{'}, t)} \\ {\bar{T}}_{Y \to X} & = & lim_{τ \to 0} \frac{1}{τ} \int d y d x^{'} d y^{'} p_{x^{'} y^{'}}^{x} (x, t + τ; x^{'}, y^{'}, t) \\ \times ln \frac{p_{x^{'} y^{'}}^{x} (x, t + τ | x^{'}, y^{'}, t)}{p_{x^{'}}^{x} (x, t + τ | x^{'}, t)} . \end{matrix}

(34)

Note that the actual transfer entropy rates are defined as

\begin{matrix} T_{X \to Y} & = & lim_{τ \to 0} \frac{I [y_{t + τ} : x_{t}, {y_{t^{'}}}_{t^{'} \leq t}] - I [y_{t + τ} : {y_{t^{'}}}_{t^{'} \leq t}]}{τ} \\ T_{Y \to X} & = & lim_{τ \to 0} \frac{I [x_{t + τ} : {x_{t^{'}}}_{t^{'} \leq t}, y_{t}] - I [x_{t + τ} : {x_{t^{'}}}_{t^{'} \leq t}]}{τ} . \end{matrix}

(35)

where {x_t′}_t′≤t and {y_t′}_t′≤t denote the full trajectories of x_t and y_t in the time interval [0, t]. Since the present model is not bipartite, the calculation of these quantities is a nontrivial task that is left aside.

The two-time distributions $p_{x^{'} y^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t)$ and $p_{x^{'} y^{'}}^{x} (x, t + τ; x^{'}, y^{'}, t)$ are given by

\begin{matrix} p_{x^{'} y^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t) & = & \int d x d u d v d u^{'} d v^{'} p_{z^{'}}^{z} (z, t + τ | z^{'}, t) p_{x u v y} (z^{'}) \\ p_{x^{'} y^{'}}^{x} (x, t + τ; x^{'}, y^{'}, t) & = & \int d y d u d v d u^{'} d v^{'} p_{z^{'}}^{z} (z, t + τ | z^{'}, t) p_{x u v y} (z^{'}) \end{matrix}

(36)

where $p_{z^{'}}^{z} (z, t + τ | z^{'}, t)$ is the transition probability from the state $z^{'} = (x^{'}, u^{'}, v^{'}, y^{'})$ at time t to the state $z = (x, u, v, y)$ at time t + τ. From the definition of the Fokker-Planck operator $L_{F P}$ associated with the 4-dimensional diffusion process described by Eq (12), the transition probability for small times is given by [32]

\begin{matrix} p_{z^{'}}^{z} (z, t + τ | z^{'}, t) & = & [1 + τ L_{F P} (z, t) + O (τ^{2})] δ (z - z^{'}) \\ = & δ (z - z^{'}) - τ \sum_{i = 1}^{4} \partial_{z_{i}} [g_{i} (z^{'}) - \sum_{j} \frac{θ_{i, j}^{2}}{2} \partial_{z_{j}}] δ (z - z^{'}) \end{matrix}

(37)

where $g_{i} (z)$ is the drift coefficient in the equation for z_i (with z₁ = x, z₂ = u, z₃ = v, z₄ = y), $θ_{2, 2} = θ_{E}, θ_{3, 3} = θ_{3, 4} = θ_{μ}, θ_{4, 4} = \sqrt{θ_{μ}^{2} + θ_{G}^{2}}$ and all other θ_{i, j} being equal to 0.

Let us first consider the calculation of ${\bar{T}}_{X \to Y}$ . By integrating $p_{z^{'}}^{z} (z, t + τ | z^{'}, t)$ over x, u, and v, we readily obtain

\begin{matrix} p_{z^{'}}^{y} (y, t + τ | z^{'}, t) = δ (y - y^{'}) - τ \partial_{y} [g_{y} (z^{'}) - β_{μ} η_{μ}^{2} \partial_{v} - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] δ (y - y^{'}) + O (τ^{2}) \end{matrix}

where the terms involving ∂_x, ∂_u, ∂_v cancel due to natural boundary conditions. Hence,

\begin{matrix} p_{z^{'}}^{y} (y, t + τ; z^{'}, t) & = & p_{z^{'}}^{y} (y, t + τ | z^{'}, t) p_{x u v y} (z^{'}) \\ = & δ (y - y^{'}) p (z^{'}) - τ p_{x u v y} (z^{'}) \times \\ \partial_{y} [g_{y} (z^{'}) - β_{μ} η_{μ}^{2} \partial_{v} - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] δ (y - y^{'}), \end{matrix}

(38)

which yields

\begin{matrix} p_{x^{'} y^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t) & = & δ (y - y^{'}) p_{x y} (x^{'}, y^{'}) - τ p_{x y} (x^{'}, y^{'}) \partial_{y} [{\bar{g}}_{y} (x^{'}, y^{'}) \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] δ (y - y^{'}) . \end{matrix}

(39)

after integration over u′ and v′, where we have defined the averaged drift coefficient

\begin{matrix} {\bar{g}}_{y} (x, y) = \frac{1}{p_{x y} (x, y)} \int d u d v g_{y} (z) p_{x u v y} (z) . \end{matrix}

(40)

We thus finally obtain

\begin{matrix} p_{x^{'} y^{'}}^{y} (y, t + τ | x^{'}, y^{'}, t) & = & δ (y - y^{'}) - τ \partial_{y} [{\bar{g}}_{y} (x^{'}, y^{'}) \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] δ (y - y^{'}) + O (τ^{2}) . \end{matrix}

(41)

Similarly, by also integrating $p_{z^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t)$ over x′, we obtain

\begin{matrix} p_{y^{'}}^{y} (y, t + τ | y^{'}, t) & = & δ (y - y^{'}) - τ \partial_{y} [{\bar{\bar{g}}}_{y} (y^{'}) - (β_{μ} η_{μ}^{2} \\ + β_{G} η_{G}^{2}) \partial_{y}] δ (y - y^{'}) + O (τ^{2}) . \end{matrix}

(42)

where

\begin{matrix} {\bar{\bar{g}}}_{y} (y) & = & \frac{1}{p_{y} (y)} \int d x d u d v g_{y} (z) p_{x u v y} (z) \\ = & \frac{1}{p_{y} (y)} \int d x {\bar{g}}_{y} (x, y) p_{x y} (x, y) . \end{matrix}

(43)

Due to the linearity of Eq (12) and the Gaussian character of the pfds, one simply has ${\bar{g}}_{y} (x, y) = a x + b y$ and ${\bar{\bar{g}}}_{y} (y) = c y$ , where a, b, c are complicated functions of the model parameters which we do not display here.

Eq (41) (resp. Eq (42)) merely shows that $p_{x^{'} y^{'}}^{y} (y, t + τ | x^{'}, y^{'}, t)$ (resp. $p_{y^{'}}^{y} (y, t + τ | y^{'}, t)$ ) at the lowest order in τ is identical to the transition probability associated with an Ornstein-Uhlenbeck process with drift coefficient ${\bar{g}}_{y} (x, y)$ (resp. ${\bar{\bar{g}}}_{y} (y)$ ) and diffusion coefficient $β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}$ . To proceed further, it is then convenient to use to the Fourier integral representation of the δ function and re-express $p_{x^{'} y^{'}}^{y} (y, t + τ | x^{'}, y^{'}, t)$ and $p_{y^{'}}^{y} (y, t + τ | y^{'}, t)$ for small times as

\begin{matrix} p_{x^{'} y^{'}}^{y} (y, t + τ | x^{'}, y^{'}, t) = \frac{1}{2 \sqrt{π (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) τ}} e^{- \frac{1}{4 (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) τ} {[y - y^{'} - τ {\bar{g}}_{y} (x^{'}, y^{'})]}^{2}} \end{matrix}

(44)

and

\begin{matrix} p_{y^{'}}^{y} (y, t + τ | y^{'}, t) = \frac{1}{2 \sqrt{π (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) τ}} e^{- \frac{1}{4 (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) τ} {[y - y^{'} - τ {\bar{\bar{g}}}_{y} (y^{'})]}^{2}} . \end{matrix}

(45)

up to corrections of the order τ² [32]. This leads to

\begin{matrix} ln \frac{p_{x^{'} y^{'}}^{y} (y, t + τ | x^{'}, y^{'}, t)}{p_{y^{'}}^{y} (y, t + τ | y^{'}, t)} & = & \frac{1}{4 (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2})} [2 (y - y^{'}) - τ [{\bar{g}}_{y} (x^{'}, y^{'}) + {\bar{\bar{g}}}_{y} (y^{'})]] \\ \times [{\bar{g}}_{y} (x^{'}, y^{'}) - {\bar{\bar{g}}}_{y} (y^{'})], \end{matrix}

(46)

and from Eq (39) and the definition of the transfer entropy rate [Eq (34)],

\begin{matrix} 4 (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) {\bar{T}}_{X \to Y} & = & lim_{τ \to 0} \frac{1}{τ} \int d y d x^{'} d y^{'} p_{x^{'} y^{'}}^{y} (y, t + τ; x^{'}, y^{'}, t) [2 (y - y^{'}) \\ - τ [{\bar{g}}_{y} (x^{'}, y^{'}) + {\bar{\bar{g}}}_{y} (y^{'})]] [{\bar{g}}_{y} (x^{'}, y^{'}) - {\bar{\bar{g}}}_{y} (y^{'})] \\ = & lim_{τ \to 0} \frac{1}{τ} \int d y d x^{'} d y^{'} p_{x y} (x^{'}, y^{'}) [δ (y - y^{'}) - τ \partial_{y} [{\bar{g}}_{y} (x^{'}, y^{'}) \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] δ (y - y^{'})] \\ \times [2 (y - y^{'}) - τ [{\bar{g}}_{y} (x^{'}, y^{'}) + {\bar{\bar{g}}}_{y} (y^{'})]] [{\bar{g}}_{y} (x^{'}, y^{'}) - {\bar{\bar{g}}}_{y} (y^{'})] \end{matrix}

(47)

We then use

\begin{matrix} \int d y (y - y^{'}) [δ (y - y^{'}) - τ \partial_{y} [{\bar{g}}_{y} (x^{'}, y^{'}) - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] δ (y - y^{'})] = τ {\bar{g}}_{y} (x^{'}, y^{'}), \end{matrix}

(48)

and

\begin{matrix} \int d x^{'} p_{x y} (x^{'}, y^{'}) {\bar{g}}_{y} (x^{'}, y^{'}) & = & p_{y} (y^{'}) {\bar{\bar{g}}}_{y} (y^{'}) = \int d x^{'} p_{x y} (x^{'}, y^{'}) {\bar{\bar{g}}}_{y} (y^{'}), \end{matrix}

(49)

to finally arrive at Eq (13), namely

\begin{matrix} {\bar{T}}_{X \to Y} & = & \frac{1}{4 (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2})} \int d x d y p_{x y} (x, y) [{\bar{g}}_{y}^{2} (x, y) - {\bar{\bar{g}}}_{y}^{2} (y)] . \end{matrix}

(50)

A similar expression can be found in Ref. [11] (see Eq (A.31) in that reference). Note also that the result given in Ref. [24] is obtained as a special case.

Inserting into Eq (13) the values of the parameters given in Table S1 of Ref. [4], we obtain the values given in Table 2. Note that ${\bar{T}}_{E \to μ} = 0$ for the high IPTG concentration because T_μE = 0, and therefore μ(t) no longer depends on E(t) as can be seen from Eq (10).

There is no need to detail the calculation of ${\bar{T}}_{μ \to E}$ (i.e. ${\bar{T}}_{Y \to X}$ ) because it goes along the same line, with y replaced by x. The crucial difference is that there is no white noise acting on $\dot{x}$ . Therefore, the denominator in Eq (13), which is the variance of the noise ξ_y, is replaced by 0. This implies that ${\bar{T}}_{μ \to E}$ is infinite.

Information flows

The information flows $I_{X \to Y}^{f l o w}$ and $I_{Y \to X}^{f l o w}$ are derived from the time-shifted mutual informations I[x_t+τ : y_t] and I[y_t+τ : x_t]. Specifically,

\begin{matrix} I_{X \to Y}^{f l o w} & = & lim_{τ \to 0} \frac{I [x_{t} : y_{t}] - I [x_{t + τ} : y_{t}]}{τ} \\ I_{Y \to X}^{f l o w} & = & lim_{τ \to 0} \frac{I [y_{t} : x_{t}] - I [y_{t + τ} : x_{t}]}{τ} . \end{matrix}

(51)

Let us first consider the second flow $I_{Y \to X}^{f l o w}$ which requires the knowledge of $p_{x^{'}}^{y} (y, t + τ; x^{'}, t)$ whose expression is obtained by integrating Eq (39) over x′. This yields

\begin{matrix} p_{x^{'}}^{y} (y, t + τ; x^{'}, t) & = & p_{x y} (x^{'}, y) - τ \partial_{y} [{\bar{g}}_{y} (x^{'}, y) \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] p_{x y} (x^{'}, y) + O (τ^{2}) . \end{matrix}

(52)

Hence

\begin{matrix} I [y_{t + τ} : x_{t}] & = & \int d x^{'} d y p_{x^{'}}^{y} (y, t + τ; x^{'}, t) \\ \times ln \frac{p_{x^{'}}^{y} (y, t + τ; x^{'}, t)}{p_{y} (y) p_{x} (x^{'})} \\ = & I [y_{t} : x_{t}] - τ \int d x d y \partial_{y} [{\bar{g}}_{y} (x, y) \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y}] p_{x y} (x, y) ln \frac{p_{x y} (x, y)}{p_{y} (y) p_{x} (x)} . \end{matrix}

(53)

We finally obtain

\begin{matrix} I_{Y \to X}^{f l o w} & = & \int d x d y \partial_{y} [{\bar{g}}_{y} (x, y) p_{x y} (x, y) \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \partial_{y} p_{x y} (x, y)] ln \frac{p_{x y} (x, y)}{p_{y} (y) p_{x} (x)} . \end{matrix}

(54)

A similar calculation yields

\begin{matrix} I_{X \to Y}^{f l o w} & = & \int d x d y \partial_{x} [{\bar{g}}_{x} (x, y) p_{x y} (x, y)] ln \frac{p_{x y} (x, y)}{p_{y} (y) p_{x} (x)}, \end{matrix}

(55)

where

\begin{matrix} {\bar{g}}_{x} (x, y) = \frac{1}{p_{x y} (x, y)} \int d u d v g_{x} (z) p_{x u v y} (z) \end{matrix}

(56)

is an averaged drift coefficient. Contrary to the case of the transfer entropy rate ${\bar{T}}_{Y \to X}$ , the absence of a white noise acting on $\dot{x}$ does not lead to an infinite result for $I_{Y \to X}^{f l o w}$ . In fact, one has the symmetry relation

\begin{matrix} I_{X \to Y}^{f l o w} = - I_{Y \to X}^{f l o w}, \end{matrix}

(57)

which is readily obtained by noting that p_xy(x, y), the stationary solution of the Fokker-Planck equation, satisfies the equation

\begin{matrix} \partial_{x} [{\bar{g}}_{x} (x, y) p_{x y} (x, y)] + \partial_{y} [{\bar{g}}_{y} (x, y) p_{x y} (x, y)] \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2}) \frac{\partial^{2}}{\partial y^{2}} p_{x y} (x, y) = 0 . \end{matrix}

(58)

Inserting the numerical values of the parameters given in Table S1 of Ref. [4], we obtain the values given in Table 5 below. Interestingly, $I_{E \to μ}^{f l o w}$ decreases as the IPTG concentration increases and that it becomes negative at high concentration.

Table 5. Theoretical values of $I_{X \to Y}^{f l o w} = - I_{Y \to X}^{f l o w}$ in the original model of Ref. [4].

Conc. of IPTG	Low	Intermediate	High
$I_{E \to μ}^{f l o w}$ (in h⁻¹)	0.0148	0.0088	-0.0243

Open in a new tab

Transfer entropy rates and information flows in the modified model for the metabolic network

We now repeat the above calculations for the modified model where N_E is treated as a white noise. Eliminating again the variable w (i.e. N_G) in favor of y, the new set of equations that describe the stochastic dynamics and replace Eq (12) reads

\begin{matrix} \dot{x} & = & - [μ_{E} + μ_{0} T_{μ E} (T_{E G} - 1)] x - μ_{0} T_{E G} v \\ + μ_{0} (T_{E G} - 1) y + ξ_{x} \\ \dot{v} & = & - β_{μ} v + ξ_{μ} \\ \dot{y} & = & T_{μ E} [β_{G} - μ_{E} - μ_{0} T_{μ E} (T_{E G} - 1)] x + [β_{G} - β_{μ} \\ - μ_{0} T_{μ E} T_{E G}] v + [μ_{0} T_{μ E} (T_{E G} - 1) - β_{G}] y + {\tilde{ξ}}_{y}, \end{matrix}

(59)

where we have defined the white noises ξ_x = μ₀N_E and ${\tilde{ξ}}_{y} = ξ_{y} + T_{μ E} ξ_{x}$ satisfying $〈 ξ_{x} (t) ξ_{x} (t^{'}) 〉 = 2 D_{E} μ_{0}^{2} δ (t - t^{'})$ and $〈 {\tilde{ξ}}_{y} (t) {\tilde{ξ}}_{y} (t^{'}) 〉 = (θ_{μ}^{2} + θ_{G}^{2} + 2 D_{E} μ_{0}^{2} T_{μ E}^{2}) δ (t - t^{'})$ , respectively. These two noises are correlated, with $〈 ξ_{x} (t) {\tilde{ξ}}_{y} (t^{'}) 〉 = 2 D_{E} μ_{0}^{2} T_{μ E} δ (t - t^{'})$ .

The pdfs and the correlation functions can be computed as before. In fact, it is clear that this simply amounts to taking the limit β_E → ∞ with $D_{E} = η_{E}^{2} / β_{E}$ finite in the previous equations (for instance in Eq (27) for the covariances). The new correlation functions are plotted in Fig 4. As expected, they are almost indistinguishable from those obtained with the original model and they fit the experimental data just as well (this of course is also true for the pdfs).

Much more interesting are the results for the transfer entropy rates and the information flows. Again, there is no need to repeat the calculations as they follow the same lines as before. We now obtain

\begin{matrix} {\bar{T}}_{X \to Y} & = & \frac{1}{4 (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2} + D_{E} μ_{0}^{2} T_{μ E}^{2})} \times \\ \int d x d y p_{x y} (x, y) [{\bar{g}}_{y}^{2} (x, y) - {\bar{\bar{g}}}_{y}^{2} (y)] \end{matrix}

(60)

\begin{matrix} {\bar{T}}_{Y \to X} & = & \frac{1}{4 D_{E} μ_{0}^{2}} \int d x d y p_{x y} (x, y) [{\bar{g}}_{x}^{2} (x, y) - {\bar{\bar{g}}}_{x}^{2} (x)], \end{matrix}

(61)

where

\begin{matrix} {\bar{g}}_{x} (x, y) & = & \frac{1}{p_{x y} (x, y)} \int d u g_{x} (x, v, y) p_{x v y} (x, v, y) \end{matrix}

(62)

\begin{matrix} {\bar{g}}_{y} (x, y) & = & \frac{1}{p_{x y} (x, y)} \int d u g_{y} (x, v, y) p_{x v y} (x, v, y), \end{matrix}

(63)

and

\begin{matrix} {\bar{\bar{g}}}_{x} (x) & = & \frac{1}{p_{x} (x)} \int d y {\bar{g}}_{x} (x, y) p_{x y} (x, y) \end{matrix}

(64)

\begin{matrix} {\bar{\bar{g}}}_{y} (y) & = & \frac{1}{p_{y} (y)} \int d x {\bar{g}}_{y} (x, y) p_{x y} (x, y) . \end{matrix}

(65)

(Again, g_x(x, v, y) and g_y(x, v, y) denote the drift coefficients in Eq (59)). The crucial difference with the results for the original model is that ${\bar{T}}_{Y \to X}$ is now finite. Similarly, we have

\begin{matrix} {\dot{I}}_{X \to Y}^{f l o w} & = & \int d x d y \partial_{x} [{\bar{g}}_{x} (x, y) p_{x y} (x, y) \\ - D_{E} μ_{0}^{2} \partial_{x} p_{x y} (x, y)] ln \frac{p_{x y} (x, y)}{p_{y} (y) p_{x} (x)} \end{matrix}

(66)

\begin{matrix} {\dot{I}}_{Y \to X}^{f l o w} & = & \int d x d y \partial_{y} [{\bar{g}}_{y} (x, y) p_{x y} (x, y) \\ - (β_{μ} η_{μ}^{2} + β_{G} η_{G}^{2} + D_{E} μ_{0}^{2} T_{μ E}^{2}) \partial_{y} p_{x y} (x, y)] \\ \times ln \frac{p_{x y} (x, y)}{p_{y} (y) p_{x} (x)} . \end{matrix}

(67)

The numerical values of ${\bar{T}}_{E \to μ}$ and ${\bar{T}}_{μ \to E}$ are given in Table 3. For completeness, we also compare these values with the estimates obtained by the inference method in Table 4. We see that satisfactory results are obtained by properly choosing the sampling time τ. This is also true for the information flows $I_{E \to μ}^{f l o w}$ and $I_{μ \to E}^{f l o w}$ . It is worth noting that the symmetry relation ${\dot{I}}_{E \to μ}^{f l o w} = - {\dot{I}}_{μ \to E}^{f l o w}$ no longer holds, except for the high IPTG concentration (as T_μE = 0). This contrasts with the preceding case where N_E was modeled by an Ornstein-Uhlenbeck noise. We also observe that the information flows are not always smaller than the transfer entropy rates, contrary to what occurs in bipartite systems. Therefore, the concept of a “sensory capacity” as introduced in Ref. [11] is here ineffective.

Acknowledgments

We acknowledge J. Lizier for many insightful comments regarding the numerical evaluation of transfer entropies, and L. Peliti for stimulating discussions. S.L. thanks the Institute of Complex Systems (ISC-PIF), the Region Ile-de-France, and the Labex CelTisPhyBio (No. ANR-10- LBX-0038) part of the IDEX PSL (No. ANR-10-IDEX-0001-02 PSL) for financial support.

Data Availability

All relevant data are within the paper.

Funding Statement

S.L. thanks the Institute of Complex Systems (ISC-PIF), the Region Ile-de-France, and the Labex CelTisPhyBio (No. ANR-10- LBX-0038) part of the IDEX PSL (No. ANR-10-IDEX-0001-02 PSL) for financial support (to DL). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Prill RJ, Vogel R, Cecchi GA, Altan-Bonnet G, Stolovitzky G. Noise-Driven Causal Inference in Biomolecular Networks. PLoS ONE. 2015;10(6):e0125777 doi: 10.1371/journal.pone.0125777 [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Affeldt S, Verny L, Isambert H. 3off2: A network reconstruction algorithm based on 2-point and 3-point information statistics. BMC Bioinformatics. 2016;17(S2). doi: 10.1186/s12859-015-0856-x [DOI] [PMC free article] [PubMed] [Google Scholar]
3. Dunlop MJ, Cox RS, Levine JH, Murray RM, Elowitz MB. Regulatory activity revealed by dynamic correlations in gene expression noise. Nat Genet. 2008;40(12):1493–1498. doi: 10.1038/ng.281 [DOI] [PMC free article] [PubMed] [Google Scholar]
4. Kiviet DJ, Nghe P, Walker N, Boulineau S, Sunderlikova V, Tans SJ. Stochasticity of metabolism and growth at the single-cell level. Nature. 2014;514:376 doi: 10.1038/nature13582 [DOI] [PubMed] [Google Scholar]
5. Granger CWJ. Investigating Causal Relations by Econometric Models and Cross-spectral Methods. Econometrica. 1969;37(3):424–438. doi: 10.2307/1912791 [Google Scholar]
6. Schreiber T. Measuring information transfer. Phys Rev Lett. 2000;85:461 doi: 10.1103/PhysRevLett.85.461 [DOI] [PubMed] [Google Scholar]
7. Wibral M, Pampu N, Priesemann V, Siebenhühner F, Seiwert H, Lindner M, et al. Measuring Information-Transfer Delays. PLoS ONE. 2013;8:e55809 doi: 10.1371/journal.pone.0055809 [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Pahle J, Green AK, Dixon CJ, Kummer U. Information transfer in signaling pathways: A study using coupled simulated and experimental data. BMC Bioinformatics. 2008;9(1):139 doi: 10.1186/1471-2105-9-139 [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Vicente R, Wibral M, Lindner M, Pipa G. Transfer entropy—a model-free measure of effective connectivity for the neurosciences. Journal of Computational Neuroscience. 2011;30(1):45–67. doi: 10.1007/s10827-010-0262-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
10. Lizier TJ, Prokopenko M. Differentiating information transfer and causal effect. Eur Phys J B. 2010;73(4):605–615. doi: 10.1140/epjb/e2010-00034-5 [Google Scholar]
11. Hartich D, Barato AC, Seifert U. Sensory capacity: An information theoretical measure of the performance of a sensor. Phys Rev E. 2016;93:022116 doi: 10.1103/PhysRevE.93.022116 [DOI] [PubMed] [Google Scholar]
12. Tusch S, Kundu A, Verley G, Blondel T, Miralles V, Démoulin D, et al. Energy versus Information Based Estimations of Dissipation Using a Pair of Magnetic Colloidal Particles. Phys Rev Lett. 2014;112:180604 doi: 10.1103/PhysRevLett.112.180604 [DOI] [PubMed] [Google Scholar]
13.Lizier JT. JIDT: An information-theoretic toolkit for studying the dynamics of complex systems. Frontiers in Robotics and AI. 2014;1:11(11).
14. Parrondo JMR, Horowitz JM, Sagawa T. Thermodynamics of Information. Nature Physics. 2015;11:131 doi: 10.1038/nphys3230 [Google Scholar]
15. Horowitz JM, Esposito M. Thermodynamics with Continuous Information Flow. Phys Rev X. 2014;4:031015. [Google Scholar]
16. Horowitz JM, Sandberg H. Second-law-like inequalities with information and their interpretations. New J Phys. 2014;16:125007 doi: 10.1088/1367-2630/16/12/125007 [Google Scholar]
17. Hartich D, Barato AC, Seifert U. Stochastic thermodynamics of bipartite systems: transfer entropy inequalities and a Maxwell’s demon interpretation. J Stat Mech. 2014;2014(2):P02016 doi: 10.1088/1742-5468/2014/02/P02016 [Google Scholar]
18. Allahverdyan AE, Janzing D, Mahler G. Thermodynamic efficiency of information and heat flow. J Stat Mech. 2009;2009(09):P09011 doi: 10.1088/1742-5468/2009/09/P09011 [Google Scholar]
19. Bowsher CG, Swain PS. Identifying sources of variation and the flow of information in biochemical networks. Proc Natl Acad Sci USA. 2012;109(20):E1320–E1328. doi: 10.1073/pnas.1119407109 [DOI] [PMC free article] [PubMed] [Google Scholar]
20. Kim KH, Qian H. Entropy Production of Brownian Macromolecules with Inertia. Phys Rev Lett. 2004;93:120602 doi: 10.1103/PhysRevLett.93.120602 [DOI] [PubMed] [Google Scholar]
21. Munakata T, Rosinberg ML. Feedback cooling, measurement errors, and entropy production. J Stat Mech. 2013;2013(06):P06014 doi: 10.1088/1742-5468/2013/06/P06014 [Google Scholar]
22. Sauer T. Computational solution of stochastic differential equations. Wiley Interdisciplinary Reviews: Computational Statistics. 2013;5(5):362–371. doi: 10.1002/wics.1272 [Google Scholar]
23. Monteoliva D, Diambra L. Information propagation in a noisy gene cascade. Phys Rev E. 2017;96:012403 doi: 10.1103/PhysRevE.96.012403 [DOI] [PubMed] [Google Scholar]
24. Ito S, Sagawa T. Maxwell’s demon in biochemical signal transduction with feedback loop. Nat Commun. 2014;6:7498 doi: 10.1038/ncomms8498 [DOI] [PMC free article] [PubMed] [Google Scholar]
25. Barnett L, Seth AK. Detectability of Granger causality for subsampled continuous-time neurophysiological processes. Journal of Neuroscience Methods. 2017;275:93–121. doi: 10.1016/j.jneumeth.2016.10.016 [DOI] [PubMed] [Google Scholar]
26.Tishby N, Pereira FC, Bialek W. The Information bottleneck. arXiv preprint physics/0004057. 2000;.
27. Tostevin F, ten Wolde PR. Mutual Information between Input and Output Trajectories of Biochemical Networks. Phys Rev Lett. 2009;102:218101 doi: 10.1103/PhysRevLett.102.218101 [DOI] [PubMed] [Google Scholar]
28. Kobayashi T, Sughiyama Y. Fluctuation Relations of Fitness and Information in Population Dynamics. Phys Rev Lett. 2015;115:238102 doi: 10.1103/PhysRevLett.115.238102 [DOI] [PubMed] [Google Scholar]
29. Halabi N, Rivoire O, Leibler S, Ranganathan R. Protein Sectors: Evolutionary Units of Three-Dimensional Structure. Cell. 2009;138(4):774–786. doi: 10.1016/j.cell.2009.07.038 [DOI] [PMC free article] [PubMed] [Google Scholar]
30. Smith DJ, Lapedes AS, de Jong JC, Bestebroer TM, Rimmelzwaan GF, Osterhaus ADME, et al. Mapping the Antigenic and Genetic Evolution of Influenza Virus. Science. 2004;305(5682):371–376. doi: 10.1126/science.1097211 [DOI] [PubMed] [Google Scholar]
31. Lässig M, Mustonen V, Walczak AM. Predicting evolution. Nat Ecol Evol. 2017;1(0077):1–9. [DOI] [PubMed] [Google Scholar]
32. Risken H. The Fokker-Planck equation. Springer; 1989. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All relevant data are within the paper.

[pone.0187431.ref001] 1. Prill RJ, Vogel R, Cecchi GA, Altan-Bonnet G, Stolovitzky G. Noise-Driven Causal Inference in Biomolecular Networks. PLoS ONE. 2015;10(6):e0125777 doi: 10.1371/journal.pone.0125777 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref002] 2. Affeldt S, Verny L, Isambert H. 3off2: A network reconstruction algorithm based on 2-point and 3-point information statistics. BMC Bioinformatics. 2016;17(S2). doi: 10.1186/s12859-015-0856-x [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref003] 3. Dunlop MJ, Cox RS, Levine JH, Murray RM, Elowitz MB. Regulatory activity revealed by dynamic correlations in gene expression noise. Nat Genet. 2008;40(12):1493–1498. doi: 10.1038/ng.281 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref004] 4. Kiviet DJ, Nghe P, Walker N, Boulineau S, Sunderlikova V, Tans SJ. Stochasticity of metabolism and growth at the single-cell level. Nature. 2014;514:376 doi: 10.1038/nature13582 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref005] 5. Granger CWJ. Investigating Causal Relations by Econometric Models and Cross-spectral Methods. Econometrica. 1969;37(3):424–438. doi: 10.2307/1912791 [Google Scholar]

[pone.0187431.ref006] 6. Schreiber T. Measuring information transfer. Phys Rev Lett. 2000;85:461 doi: 10.1103/PhysRevLett.85.461 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref007] 7. Wibral M, Pampu N, Priesemann V, Siebenhühner F, Seiwert H, Lindner M, et al. Measuring Information-Transfer Delays. PLoS ONE. 2013;8:e55809 doi: 10.1371/journal.pone.0055809 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref008] 8. Pahle J, Green AK, Dixon CJ, Kummer U. Information transfer in signaling pathways: A study using coupled simulated and experimental data. BMC Bioinformatics. 2008;9(1):139 doi: 10.1186/1471-2105-9-139 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref009] 9. Vicente R, Wibral M, Lindner M, Pipa G. Transfer entropy—a model-free measure of effective connectivity for the neurosciences. Journal of Computational Neuroscience. 2011;30(1):45–67. doi: 10.1007/s10827-010-0262-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref010] 10. Lizier TJ, Prokopenko M. Differentiating information transfer and causal effect. Eur Phys J B. 2010;73(4):605–615. doi: 10.1140/epjb/e2010-00034-5 [Google Scholar]

[pone.0187431.ref011] 11. Hartich D, Barato AC, Seifert U. Sensory capacity: An information theoretical measure of the performance of a sensor. Phys Rev E. 2016;93:022116 doi: 10.1103/PhysRevE.93.022116 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref012] 12. Tusch S, Kundu A, Verley G, Blondel T, Miralles V, Démoulin D, et al. Energy versus Information Based Estimations of Dissipation Using a Pair of Magnetic Colloidal Particles. Phys Rev Lett. 2014;112:180604 doi: 10.1103/PhysRevLett.112.180604 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref013] 13.Lizier JT. JIDT: An information-theoretic toolkit for studying the dynamics of complex systems. Frontiers in Robotics and AI. 2014;1:11(11).

[pone.0187431.ref014] 14. Parrondo JMR, Horowitz JM, Sagawa T. Thermodynamics of Information. Nature Physics. 2015;11:131 doi: 10.1038/nphys3230 [Google Scholar]

[pone.0187431.ref015] 15. Horowitz JM, Esposito M. Thermodynamics with Continuous Information Flow. Phys Rev X. 2014;4:031015. [Google Scholar]

[pone.0187431.ref016] 16. Horowitz JM, Sandberg H. Second-law-like inequalities with information and their interpretations. New J Phys. 2014;16:125007 doi: 10.1088/1367-2630/16/12/125007 [Google Scholar]

[pone.0187431.ref017] 17. Hartich D, Barato AC, Seifert U. Stochastic thermodynamics of bipartite systems: transfer entropy inequalities and a Maxwell’s demon interpretation. J Stat Mech. 2014;2014(2):P02016 doi: 10.1088/1742-5468/2014/02/P02016 [Google Scholar]

[pone.0187431.ref018] 18. Allahverdyan AE, Janzing D, Mahler G. Thermodynamic efficiency of information and heat flow. J Stat Mech. 2009;2009(09):P09011 doi: 10.1088/1742-5468/2009/09/P09011 [Google Scholar]

[pone.0187431.ref019] 19. Bowsher CG, Swain PS. Identifying sources of variation and the flow of information in biochemical networks. Proc Natl Acad Sci USA. 2012;109(20):E1320–E1328. doi: 10.1073/pnas.1119407109 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref020] 20. Kim KH, Qian H. Entropy Production of Brownian Macromolecules with Inertia. Phys Rev Lett. 2004;93:120602 doi: 10.1103/PhysRevLett.93.120602 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref021] 21. Munakata T, Rosinberg ML. Feedback cooling, measurement errors, and entropy production. J Stat Mech. 2013;2013(06):P06014 doi: 10.1088/1742-5468/2013/06/P06014 [Google Scholar]

[pone.0187431.ref022] 22. Sauer T. Computational solution of stochastic differential equations. Wiley Interdisciplinary Reviews: Computational Statistics. 2013;5(5):362–371. doi: 10.1002/wics.1272 [Google Scholar]

[pone.0187431.ref023] 23. Monteoliva D, Diambra L. Information propagation in a noisy gene cascade. Phys Rev E. 2017;96:012403 doi: 10.1103/PhysRevE.96.012403 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref024] 24. Ito S, Sagawa T. Maxwell’s demon in biochemical signal transduction with feedback loop. Nat Commun. 2014;6:7498 doi: 10.1038/ncomms8498 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref025] 25. Barnett L, Seth AK. Detectability of Granger causality for subsampled continuous-time neurophysiological processes. Journal of Neuroscience Methods. 2017;275:93–121. doi: 10.1016/j.jneumeth.2016.10.016 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref026] 26.Tishby N, Pereira FC, Bialek W. The Information bottleneck. arXiv preprint physics/0004057. 2000;.

[pone.0187431.ref027] 27. Tostevin F, ten Wolde PR. Mutual Information between Input and Output Trajectories of Biochemical Networks. Phys Rev Lett. 2009;102:218101 doi: 10.1103/PhysRevLett.102.218101 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref028] 28. Kobayashi T, Sughiyama Y. Fluctuation Relations of Fitness and Information in Population Dynamics. Phys Rev Lett. 2015;115:238102 doi: 10.1103/PhysRevLett.115.238102 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref029] 29. Halabi N, Rivoire O, Leibler S, Ranganathan R. Protein Sectors: Evolutionary Units of Three-Dimensional Structure. Cell. 2009;138(4):774–786. doi: 10.1016/j.cell.2009.07.038 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0187431.ref030] 30. Smith DJ, Lapedes AS, de Jong JC, Bestebroer TM, Rimmelzwaan GF, Osterhaus ADME, et al. Mapping the Antigenic and Genetic Evolution of Influenza Virus. Science. 2004;305(5682):371–376. doi: 10.1126/science.1097211 [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref031] 31. Lässig M, Mustonen V, Walczak AM. Predicting evolution. Nat Ecol Evol. 2017;1(0077):1–9. [DOI] [PubMed] [Google Scholar]

[pone.0187431.ref032] 32. Risken H. The Fokker-Planck equation. Springer; 1989. [Google Scholar]

PERMALINK

Information-theoretic analysis of the directional influence between cellular processes

Sourabh Lahiri

Philippe Nghe

Sander J Tans

Martin Luc Rosinberg

David Lacoste

Roles

Abstract

Introduction

Information theoretic measures

Results

Test of the inference method on a Langevin model

Fig 1. Transfer entropy TY→V for the feedback model governed by Eq (6) as a function of the noise intensity σ2 for k = 1 (blue circles), k = 3 (green circles) and k = 5 (red circles).

Analysis of stochasticity in a metabolic network

Experimental time series

Fig 2. Pedigree tree representing the evolution of the colony of E. coli. studied in Ref. [4].

Table 1. Inferred values of the transfer entropies in the directions E → μ and μ → E, and the difference ΔT¯E→μ=T¯E→μ-T¯μ→E for low, medium and high concentrations of IPTG based on the data of ref. [4].

Theoretical models

Table 2. Comparison between the theoretical values of the transfer entropy rates T¯E→μ and T¯μ→E for the model of Ref. [4] and the values inferred from simulation data.

Fig 3. Transfer entropy rates T¯E→μ and T¯μ→E in the low IPTG experiment.

Fig 4.

Table 3. Theoretical values of the transfer entropy rates T¯E→μ and T¯μ→E and their difference in the modified model.

Fig 5. Inferred values of ΔT¯E→μ for the low IPTG experiment as a function of the length N of the time series generated by the modified model.

Table 4. Comparison between the theoretical values of the TE rates and the information flows for the modified model and the values inferred from simulation data (all quantities are expressed in h−1).

Discussion and conclusion

Methods

Basic information-theoretic measures

Transfer entropy and information flow in the feedback cooling model

Fig 6. TV→Y,T¯V→Y and IV→Yflow as a function of the noise intensity σ2.

Transfer entropy rates and information flows in the model of Ref. [4] for a metabolic network

Stationary distributions and correlation functions

Fig 7. Steady-state probability distribution of the growth rate for the three IPTG concentrations: Low (black), intermediate (red), high (blue).

Transfer entropy rates

Information flows

Table 5. Theoretical values of IX→Yflow=-IY→Xflow in the original model of Ref. [4].

Transfer entropy rates and information flows in the modified model for the metabolic network

Acknowledgments

Data Availability

Funding Statement

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Fig 1. Transfer entropy T_Y→V for the feedback model governed by Eq (6) as a function of the noise intensity σ² for k = 1 (blue circles), k = 3 (green circles) and k = 5 (red circles).

Table 1. Inferred values of the transfer entropies in the directions E → μ and μ → E, and the difference $Δ {\bar{T}}_{E \to μ} = {\bar{T}}_{E \to μ} - {\bar{T}}_{μ \to E}$ for low, medium and high concentrations of IPTG based on the data of ref. [4].

Table 2. Comparison between the theoretical values of the transfer entropy rates ${\bar{T}}_{E \to μ}$ and ${\bar{T}}_{μ \to E}$ for the model of Ref. [4] and the values inferred from simulation data.

Fig 3. Transfer entropy rates ${\bar{T}}_{E \to μ}$ and ${\bar{T}}_{μ \to E}$ in the low IPTG experiment.

Table 3. Theoretical values of the transfer entropy rates ${\bar{T}}_{E \to μ}$ and ${\bar{T}}_{μ \to E}$ and their difference in the modified model.

Fig 5. Inferred values of $Δ {\bar{T}}_{E \to μ}$ for the low IPTG experiment as a function of the length N of the time series generated by the modified model.

Table 4. Comparison between the theoretical values of the TE rates and the information flows for the modified model and the values inferred from simulation data (all quantities are expressed in h⁻¹).

Fig 6. $T_{V \to Y}, {\bar{T}}_{V \to Y}$ and $I_{V \to Y}^{f l o w}$ as a function of the noise intensity σ².

Table 5. Theoretical values of $I_{X \to Y}^{f l o w} = - I_{Y \to X}^{f l o w}$ in the original model of Ref. [4].