Skip to main content
BMC Systems Biology logoLink to BMC Systems Biology
. 2014 May 21;8:60. doi: 10.1186/1752-0509-8-60

Gene perturbation and intervention in context-sensitive stochastic Boolean networks

Peican Zhu 1, Jinghang Liang 1, Jie Han 1,
PMCID: PMC4062525  PMID: 24886608

Abstract

Background

In a gene regulatory network (GRN), gene expressions are affected by noise, and stochastic fluctuations exist in the interactions among genes. These stochastic interactions are context dependent, thus it becomes important to consider noise in a context-sensitive manner in a network model. As a logical model, context-sensitive probabilistic Boolean networks (CSPBNs) account for molecular and genetic noise in the temporal context of gene functions. In a CSPBN with n genes and k contexts, however, a computational complexity of O(nk222n ) (or O(nk2 n )) is required for an accurate (or approximate) computation of the state transition matrix (STM) of the size (2 n k) × (2 n k) (or 2 n × 2 n ). The evaluation of a steady state distribution (SSD) is more challenging. Recently, stochastic Boolean networks (SBNs) have been proposed as an efficient implementation of an instantaneous PBN.

Results

The notion of stochastic Boolean networks (SBNs) is extended for the general model of PBNs, i.e., CSPBNs. This yields a novel structure of context-sensitive SBNs (CSSBNs) for modeling the stochasticity in a GRN. A CSSBN enables an efficient simulation of a CSPBN with a complexity of O(nLk2 n ) for computing the state transition matrix, where L is a factor related to the required sequence length in CSSBN for achieving a desired accuracy. A time-frame expanded CSSBN can further efficiently simulate the stationary behavior of a CSPBN and allow for a tunable tradeoff between accuracy and efficiency. The CSSBN approach is more efficient than an analytical method and more accurate than an approximate analysis.

Conclusions

Context-sensitive stochastic Boolean networks (CSSBNs) are proposed as an efficient approach to modeling the effects of gene perturbation and intervention in gene regulatory networks. A CSSBN analysis provides biologically meaningful insights into the oscillatory dynamics of the p53-Mdm2 network in a context-switching environment. It is shown that random gene perturbation has a greater effect on the final distribution of the steady state of a network compared to context switching activities. The CSSBN approach can further predict the steady state distribution of a glioma network under gene intervention. Ultimately, this will help drug discovery and develop effective drug intervention strategies.

Keywords: Gene regulatory networks, Boolean networks, Stochastic Boolean networks, Context dependent, Gene perturbation, Intervention, Context switch, Steady state distribution, p53 network, glioma network

Background

Diverse biological functions are regulated through the interactions among genes, proteins and other molecules in a cell. Gene expressions are however affected by the intrinsic and extrinsic noise in a gene network [1]. A major source of the noise is the stochastic fluctuations in gene regulatory interactions [2]. The genetic interactions are also context dependent, that is, certain regulatory functions are active in some cellular states, but inactive in others [3]. This indicates the necessity to consider noise in a context-sensitive manner in the study of gene regulatory networks (GRNs).

Various methods have been proposed to model GRNs; these include logical models [4], continuous models using differential equations [5,6] and stochastic models at the single-molecule level [7,8]. As a classic logical model, Boolean networks (BNs) have been widely used to qualitatively model the interactions among genes [4,9-11]. Probabilistic Boolean networks (PBNs) have been proposed to consider noise in a BN model [12-14]. In a PBN, the next state of a gene is determined by its current state and a Boolean function. If the Boolean function is randomly selected, a PBN is referred to as an instantaneous PBN [12]. As a general model, a context-sensitive PBN (CSPBN) considers the feature of context dependence in a biological network [15]. In a CSPBN, a context is a combination of Boolean functions and each function determines the next state of a gene. A context remains unchanged until a switching occurs. This switching of contexts, possibly caused by external stimuli, is considered to occur randomly in a network.

The study of PBNs has focused on the analysis of steady state distributions (SSDs) under gene perturbation and intervention. A Markov Chain Monte Carlo method is used in [16] to analyze the long run behavior of a PBN; however, this method generally requires a large number of simulations to reach a steady state, due to the slow convergence typically encountered in a Monte Carlo method [17]. An analysis is performed in [18] for finding the SSD of a PBN through the computation of the state transition matrix (STM). However, the application of an analytical approach is generally limited to small networks due to the exponential increase in the size of an STM with gene numbers. The analysis of CSPBNs presents even a greater challenge due to its significantly increased computational complexity. Analytical expressions have been derived for analyzing CSPBNs [19]; however, this method is only applicable to the steady state analysis of a network with small perturbation and switching probabilities. The method in [20] ignores some BNs with very small probabilities for reducing the size of the STM and thus provides a more efficient but approximate solution for computing the SSD of a CSPBN. In [15,21,22], gene intervention is investigated for avoiding undesirable states associated with certain diseases (such as cancer). Due to external stimuli, the STM is changed by external control variables, so desirable states can be obtained with larger probabilities in the SSD. In a context-sensitive network with n genes and k contexts, however, a (2 n k) × (2 n k) [23] (or 2 n  × 2 n [20]) matrix is required for an accurate (or approximate) analysis of the SSD; this results in a computational complexity of O(nk222n) (or O(nk2 n )) for an accurate (or approximate) computation of the STM. Hence, the application of current gene network analysis is limited to those of less than a dozen of genes.

Recently, stochastic Boolean networks (SBNs) have been proposed for an efficient computation of the STM and SSD of an instantaneous PBN [24]. The SBN approach can recover biologically-proven regulatory behaviors, such as the oscillatory dynamics of a simplified p53-Mdm2 network [25] and the dynamic attractors in a T cell immune response network [26]. To further exploit the simplicity of logical models, stochastic multiple-valued networks (SMNs) have been utilized to investigate the dynamics of gene networks with multiple-valued gene states [27]. Asynchronous SBNs have also been developed to investigate the asynchronous state-update behavior of genes [28]. In this paper, the notion of SBNs is extended for the general model of PBNs, i.e., the CSPBNs. This presents a novel structure of context-sensitive SBNs (CSSBNs) for modeling the stochasticity in a context-dependent GRN. In particular, gene perturbation and intervention are considered in CSSBNs. Through an efficient simulation of a CSSBN, the computational complexity in the evaluation of a CSPBN is reduced from O(nk222n ) to O(nLk2 n ) for computing the STM, where L is a factor related to the required sequence length in CSSBN for achieving a desired accuracy. The use of non-Bernoulli sequences of random permutations of fixed number of 1’s and 0’s further increases computational efficiency and allows for a tunable tradeoff between accuracy and efficiency. A time-frame expanded CSSBN can further simulate the stationary behavior of a CSPBN and produce results that are more accurate than an approximate analysis, thus making the CSSBN useful in modeling complex context-sensitive GRNs. The CSSBN models are applied to the study of a simplified p53-Mdm2 network and the results using an SSD control policy are reported on the effect of external gene intervention in a glioma network [18,20].

Methods

Context-Sensitive Probabilistic Boolean Networks (CSPBNs)

For a network of n genes, a probabilistic Boolean network (PBN) is defined by G(V, F), where V = {x1, x2, …, x n }, a set of binary-valued nodes, F = (F1, F2, …, F n ), a list of sets of Boolean functions: Fi=f1i,f2i,,flii and l(i) is the number of possible functions for gene i, i = 1, 2, …, n[12-14]. A node x i represents the state of gene i; x i  = 1 (or 0) indicates that gene i is (or not) expressed. The set F i contains the rules that determine the next state of gene i. Each fjii:0,1n0,1 for 1 ≤ j(i) ≤ l(i) is a mapping or predictor function determining the state of gene i. In a context-sensitive PBN (CSPBN) with k contexts, a network function is defined as fj=fj1,fj2,,fjn, where fji:0,1n0,1 is the predictor function for gene i, i = 1, 2, …, n, in context j (j = 1, 2, …, k) [15,29].

Due to the stochasticity in genetic networks, the next state of gene i is determined by all the l(i) functions in F i , i.e., by f1i,f2i,,flii, with probabilities c1i,c2i,,clii. For an instantaneous PBN of n genes, there are a total number of i=1nli possible Boolean networks (BNs), each of which is a possible realization of the genetic network. Each BN can be considered as the network function for a context [20]. In the jth context, assume the network function is given by fj=fj11,fj22,,fjnn, where each fjii:0,1n0,1 for 1 ≤ j(i) ≤ l(i) is a mapping or predictor function determining the state of gene i. The probability that the jth context is selected, is obtained as Cj=i=1ncjii for j = 1, 2, …, k, where k is the number of contexts in a context-sensitive network [20]. The state of a gene is updated in the selected context; thus the next state depends on both the present state and the selected context.

In a context-sensitive network, a context may remain for certain time until a random event occurs. A context switching usually occurs with probability q. For q = 1, the CSPBN becomes an instantaneous PBN. For q < 1, if a new context is to be selected, it is randomly chosen from the set of network functions: {f1, f2, …, f k }, with a set of context selection probabilities: {C1, C2, …, C k } [29]. If noise is considered in a CSPBN, it is often referred to as a perturbation, by which a gene flips its state with a probability p (p ≠ 0). It has been shown that a PBN with perturbation is an ergodic Markov chain in that all the states are connected in the PBN [13]. The transition probability for any two states is determined by the values of p and q pairs. Following [19], one of four mutually exclusive events occurs at time t in a CSPBN with perturbation:

1: The predictor functions in the currently selected context are applied to update the gene expressions and this context remains for the next transition.

2: The predictor functions in the currently selected context are applied to update the states of the genes and then a new context is selected for the next transition.

3: A random perturbation occurs and the currently selected context remains for the next transition.

4: A random perturbation occurs and a new context is selected for the next transition.

The effect of a switching order, i.e., whether a network switches its context before or after its state transition, is considered in [23], whereas in this paper, we focus on the transition rules, i.e., the network function is applied first and then the context switches.

A gene activity profile (GAP) is defined as a vector for describing the state of a network at time t, x (t) = (x1(t), x2(t), …, x n (t)), where x i (t) ∈ {0, 1} for i = 1, 2, …, n. The state of a CSPBN can be represented as a combination of a context and a GAP, i.e., S  = (context j, GAP i). By this definition, the number of states in a CSPBN increases from 2 n to 2 n k for a network with n genes and k contexts. A GAP can also be represented by its decimal index, i.e., d=j=1n2n-jxjt+1. A state of a CSPBN is then given by S  = ( f i , d) for i ∈ {1, 2, …, k} and d ∈ {1, 2, …, 2 n }, where d is the decimal index of a GAP. For convenience, a state ( f i , d) is referred to as (i, d) in the following analysis.

In a CSPBN with a perturbation probability p and a switching probability q, the transition probability for any two states a and b is given by [23]:

pSt+1=b|St=a=1-pnfr1,x1,x2+1-pn-hphsh1-q+qcr1ga,b+qcr21-ga,b (1)

where

fr1,x1,x2=1,0,ifx1directlytransitionstox2incontextr1otherwise (2)
ga,b=1,0,ifr1=r2=rotherwise (3)
sh=0,1,ifh=0otherwise (4)

and h is the Hamming distance between two GAPs with decimal indices x1 and x2, x1, x2 ∈ {1, 2, …, 2 n }. This distance indicates the number of genes with different expressions in the two GAPs x1 and x2.

The state transition matrix (STM) of a CSPBN without perturbation is of the size 2 n k × 2 n k, given by [23]:

A=1-q+qc1P1qc1Pkqc2P1qc2PkqckP11-q+qckPk, (5)

and the STM of a CSPBN with perturbation is given by:

A=1-q+qc1P˜1qc1P˜kqc2P˜1qc2P˜kqckP˜11-q+qckP˜k, (6)

where P i and P˜i denote the STMs for the Boolean network i, i ∈ {1, 2, …, k}, without and with perturbation respectively.

Context-Sensitive Stochastic Boolean Networks (CSSBNs)

A CSSBN without perturbation

Because of the similarities between logic circuits and biological networks, digital circuits have been used to simulate genetic networks [30], as well as to determine the node vulnerability in cellular networks [31]. Stochastic logic has been demonstrated in several biological applications [32,33]. Stochastic computation implements probabilistic analysis using Boolean logic by encoding a real number or probability into a random binary bit sequence [34]. A probability is usually represented by a proportional number of 1’s in a stochastic sequence. The complement of a probability can be computed by an inverter and the multiplication of probabilities can be implemented by an AND gate for independent inputs. A multiplexer computes a weighted sum of its input probabilities, with the weights given by the selection inputs. Figure 1 shows an inverter (NOT), an AND gate, a buffer, an OR gate, an XOR gate and a multiplexer [24,35]. Due to stochastic fluctuations, the computational results by stochastic logic are not deterministic but probabilistic. However, this stochastic fluctuation can be reduced through the use of non-Bernoulli sequences of random permutations of fixed numbers of 1’s and 0’s as initial inputs, thus producing more accurate results than using Bernoulli sequences [35]. It has been shown that a probabilistic distribution of genes’ states can be accurately determined in a stochastic Boolean network (SBN) with a reasonably long stochastic sequence length [24].

Figure 1.

Figure 1

Stochastic logic. (a) A buffer (or an identity gate). (b) An inverter. (c) An AND gate. (d) An OR gate. (e) An XOR gate. (f) A multiplexer. Stochastic logic performs a probabilistic analysis by encoding probabilities into random binary bit streams as a proportional number of bits, for example, the mean number of 1’s in a binary sequence.

In a CSPBN, the selection of a context is determined by the context switching probability. This probability indicates the likelihood to maintain the current context i or to select a new context from the k contexts (including the currently selected context i). Based on the present state and the selected context, a gene’s state is updated. If no perturbation occurs in an n-gene CSPBN with a switching probability of q, the transition probability from state (s, y) to (r, x) is given by [23]:

pr,x|s,y=fs,y,x1-q+qCsfs,y,xqCrifr=sifrs (7)

where

fs,y,x=1,0,ifydirectlytransitionstoxincontextsotherwise (8)

s and r denote the sth and rth contexts, y and x represent two gene activity profiles (GAPs, in decimal indices), and C s and C r indicate the probability of selecting the sth and rth contexts respectively.

An SBN structure is proposed in [24] to implement an instantaneous PBN. For a CSPBN, a context-sensitive SBN (CSSBN) is constructed to consider the switching of contexts, as shown in Figure 2. In this CSSBN model, the probabilistic switching is implemented using a multiplexer in stochastic computation and the switching probability q is encoded as a random binary sequence Q that serves as the control sequence of a 2-to-1 multiplexer (MUX). If the jth bit in the sequence Q is 1, a new context will be selected for the next transition. Otherwise, the current context will remain. The selection probability of the new context is determined by the original and current context selection probabilities. As shown in the lower section of Figure 2, this process is implemented by 2-to-1 multiplexers with the original and current context selection sequences as inputs and Q as the control sequence. The selection probability of the new context is then obtained as encoded in the output stochastic sequences of the multiplexers. If q = 0, the CSSBN functions as a fixed Boolean Network (BN). If q = 1, the CSSBN is simplified to an instantaneous SBN.

Figure 2.

Figure 2

A context-sensitive stochastic Boolean network (CSSBN) without perturbation (at time t). The multiplexer (MUX) with control sequences S1 ~ Sm probabilistically determine the selection of a network function for context i, while the multiplexer with control sequence Q determines whether a switch of contexts occurs. The selection probabilities of the new context are computed by the 2-to-1 multiplexer (MUX) with the original and current context selection sequences as inputs and Q as the control sequence.

If a switch does occur, the context selection process is implemented by another multiplexer for choosing one from the k contexts, according to predefined selection probabilities. As a network function is a combination of each gene’s predictor function, a combination of the m control sequences of S1 ~ S m is used to encode the predefined selection probabilities; this in turn determines the selection probability of each new context. For a CSPBN with n genes, the CSSBN needs to be run for each of the 2 n input states and sequences need to be generated for the m control signals of the multiplexer.

For a switching probability of q in the proposed CSSBN, the currently selected context i remains at time t with a probability of q and switches to one of the k contexts with a probability of 1 - q. If the network transitions from a GAP y to x in the sth context (i.e., f s,y,x  = 1), then context s will remain with probability 1 - q + qC s at the next time step. Otherwise, the network moves into a new context r with probability qC r . From this analysis, it can be seen that the CSSBN in Figure 2 computes the transition probability from state (s, y) to (r, x) as given by (7). This indicates that the proposed CSSBN model accurately implements the function of a CSPBN.

A CSSBN with perturbation

In a PBN with random perturbation, a gene may change its state with a probability p during a state transition. Following [12], the effect of perturbation is considered to flip a gene’s state. Assume in an n-gene CSPBN, the current GAP at time t is given by x  = (x1, x2, …, x n ) and γ is the perturbation vector, the GAP at t + 1, x′, is given by:

x'=xγfjxwithaprobabilityof1-1-pnwithaprobabilityof1-pn (9)

where ⊕ is the addition modulo 2 and f j (⋅) is the network function for the jth context at time t. To account for the effect of perturbation, a CSSBN with perturbation is constructed, as shown in Figure 3. In this CSSBN, XOR gates are used to implement the addition modulo 2 of the perturbation vector and the present state, while an n-input OR gate is used to compute the probability that a perturbation occurs. The output of the OR gate is then used as the control sequence of a bus (or multiple-bit) multiplexer to decide the selection of sequences with or without perturbation. If the output sequence of the OR gate contains all 0’s, which means that there is no perturbation, then the next state is given by the predictor functions in the currently selected context in the original CSSBN without perturbation; otherwise, the next state is determined by the perturbation effect. A stochastic analysis of the function of the CSSBN with perturbation shows that the next state of the network is given by:

Figure 3.

Figure 3

A context-sensitive stochastic Boolean network (CSSBN) with perturbation. A perturbation network is implemented by the XOR logic of the perturbation vector and the present state. The probability that either a new context works or a perturbation occurs is given by the output sequence of an n-input OR gate, which in turn determines the selection of a new context (without perturbation) or a perturbed network by a bus (or multiple-bit) multiplexer (MUX).

x'=xγ1-1-pn+fjx1-pn (10)

which is equivalent to (9). This indicates that a CSPBN with perturbation can be accurately implemented by a CSSBN with perturbation.

Intervention in a CSSBN

In contrast to perturbation, gene intervention refers to the process of deliberately changing the states of some genes to guide a network into a desired state [12]. External stimuli are applied to a network to avoid undesirable states that might be associated with certain diseases. For an effective intervention, control policies are developed for different intervention strategies; as several genes may affect the state of the target gene, a single gene that is most influential on the network state is usually identified as the control gene, due to its simplistic biological implications [15]. The STM is then changed by an external intervention [15,21,22].

In a CSPBN, the context information is usually hidden and thus difficult to obtain in practice. However, gene expressions can readily be observed, so a GAP within all possible contexts is considered to be undesirable if the GAP is an undesirable state of the network [36]. Hence, gene intervention refers to changing the state of a network as represented by a GAP.

For an n-gene network, a vector s  = (s1, s2, s3, …, s n ) with s i ∈ {0, 1} for any i ∈ {1, 2, …, n}, is defined as the control gene vector, that is, if s i  = 1, then gene i is selected as a control gene [37]. An intervention vector is defined as u=u1,u2,u3,,u2n, u j ∈ {0, 1} for any j ∈ {1, 2, …, 2 n }, where u j  = 1 (or 0) indicates a flipping (or remaining) of the state of a control gene at GAP j. If gene i is selected to be a control gene, for example, then s i  = 1. The expression level of the control gene i is then determined by its current state and the status of u . If u j  = 1, i.e., s i u j  = 1, the state of control gene i is flipped by the external intervention when GAP j emerges; otherwise, the state of control gene i remains unchanged and thus the current state is preserved in GAP j[38,39]. An intervention vector can be obtained by various methods; in this paper, a control policy using the steady state distribution (SSD) [40] is used to obtain u .

The state of a network under intervention, i.e., a GAP, is determined by the control signal, the state prior to the intervention and the intervention vector. Let x^t and x t be the GAPs before and after intervention at time t; if the gth gene is the control gene (i.e., s g  = 1), the state of the network under intervention is given by [29]:

xt=x^tsg1ux^t=1+x^t1ux^t=0, (11)

where s g is a vector of n bits with 0 at every bit except the gth bit. 1(∙) is an indicator function: 1ux^t=1=1 if ux^t=1 and 1ux^t=1=0 otherwise. When ux^t=0, the GAP x^t is maintained; otherwise, the GAP x^t is an undesirable state, for which the control bit is to be toggled, thereby improving the occurring probability of a desirable state.

The CSSBN model under intervention is given in Figure 4. If gene i is selected as the control gene, i.e., s i  = 1, the output of the AND gate is determined by the state of u . If state j is an undesirable state, i.e., u j  = 1, the state of the control gene i is toggled. This is implemented by an XOR gate: if s i u j  = 1, the state of gene i, given by the output of an XOR gate, is flipped at the state, or GAP, j; otherwise, the state is not changed by the intervention. From this analysis, the state of a network after intervention is given by:

Figure 4.

Figure 4

A context-sensitive stochastic Boolean network (CSSBN) for external gene intervention. p: perturbation rate; q: context switching probability. si indicates whether gene i is selected as a control gene and u is the derived intervention vector for all states or gene activity profiles (GAPs) of a network.

xt=x^tsi,x^t,ifux^t=1otherwise (12)

which is equivalent to (11). This indicates that the CSSBN model in Figure 4 accurately implements the process of external gene intervention.

State transition matrix (STM) and steady state analysis

As discussed previously, the state of a CSPBN can be represented as S  = (context, x) by considering the selected context and GAP information. Using a CSSBN, the STM can be obtained through the statistics encoded in the output sequences. Given an input state, each simulation of a CSSBN produces the transition probabilities from this input to all output states, i.e., the row in the STM for this input. For a CSPBN with n genes and k contexts, the CSSBN needs to be run for each of the 2 n k input states and an O(n) number of sequences need to be generated for the control signals of the multiplexers. As in [24], a factor, L, is used to account for the computational overhead required by using a longer stochastic sequence. Therefore, a complexity of O(nLk2 n ) results for computing the STM of a context-sensitive network. In a CSSBN, the required minimum sequence length is typically on a polynomial order of the numbers of genes and contexts, as shown in the simulation results in Table 1. Since the number of possible BNs, k, generally increases exponentially with the number of predictor functions for each gene, the complexity of using a CSSBN to compute the STM, i.e., O(nLk2 n ), is smaller than O(nk222n ) of an accurate analysis for a CSPBN [23]. As indicated by the shorter average run time in Table 1, this difference becomes significant for a network with a large number of gene states. In Table 1, the simulation results are also provided for CSSBNs with a single context, i.e., k = 1. In this case, the CSSBN degenerates into a deterministic BN, for which an analytical approach is more efficient, due to the use of random binary bit sequences in the stochastic approach. However, the CSSBN approach is more efficient in computing the STM for a gene network with a large number of states. See Additional file 1: The pseudocode for computing the STM of a CSSBN.

Table 1.

Minimum sequence length and average run time in computing the state transition matrix (STM) for context-sensitive stochastic Boolean networks (CSSBNs)

n
k
Num
CSSBN (Norm 2 = 0.05)
CSSBN (Norm 2 = 0.03)
CSPBN [23]
      L (bits) Average time(s) Standard deviation L (bits) Average time(s) Standard deviation Average time(s) Standard deviation
2
1
4
40
0.00148
0.00048
100
0.00181
0.00048
0.00049
0.00046
2
4
16
900
0.00553
0.00167
2000
0.01116
0.00036
0.01248
0.00251
2
16
64
1000
0.64383
0.41941
6000
1.02666
0.10779
0.09593
0.00901
3
1
8
140
0.00446
0.00132
340
0.00812
0.00094
0.00157
0.00045
4
1
16
250
0.01438
0.00287
750
0.03685
0.00268
0.00443
0.00027
4
16
256
8000
1.43115
0.01844
20000
3.29188
0.03830
1.66014
0.10349
5
1
32
420
0.04602
0.00261
1200
0.12341
0.00314
0.01862
0.00214
5
32
1024
8000
10.7359
0.01350
25000
35.5412
0.04335
24.0659
0.08390
6
1
64
600
0.09993
0.00458
1700
0.33797
0.00197
0.06662
0.00705
6
64
4096
24000
74.5058
1.20649
60000
195.793
4.86945
531.068
6.36186
7
1
32
1000
0.52519
0.02762
2800
1.34503
0.06377
0.25873
0.00719
7
16
2048
8000
18.8938
0.42939
25000
50.0857
0.27419
221.922
1.65582
8
1
256
1400
1.45711
0.02798
3500
3.67661
0.16084
1.05147
0.01302
8
4
1024
4000
5.50577
0.05167
12500
15.6728
0.11539
37.4102
2.20907
9
1
512
2600
5.36480
0.07636
8000
16.2693
0.14106
4.06749
0.06899
9
4
2048
8000
23.1989
0.40195
18000
45.9247
0.78522
72.4459
2.62472
10
1
1024
4000
18.4761
0.12540
11000
49.2457
0.36526
17.5134
0.12540
10 4 4096 10000 59.9416 0.41164 30000 162.686 1.13288 619.311 13.4098

n: the number of genes; k: the number of contexts. Perturbation and context switch probabilities: p = 0.1 and q = 0.8 (for k > 1). L: the required minimum sequence length for a given accuracy measured by Norm 2. Num: the number of states for the CSSBN. The results that at least one case of the CSSBN study is more efficient than the CSPBN, are highlighted in bold.

In general, a (2 n k) × (2 n k) STM is required for an accurate CSPBN analysis [23] while a 2 n  × 2 n STM is needed by an approximate method [20]. As the number of genes increases in a network, a matrix-based analysis becomes infeasible due to a significant increase in the required computational resources. The analysis of the SSD is even more challenging for a CSPBN. For a CSSBN, however, the STM can be accurately and efficiently computed. Furthermore, the SSD can be evaluated through an iterative simulation in the temporal domain (or the so-called time-frame expansion technique [24]). By this technique, an iterative structure of the CSSBN is used to simulate the temporal evolution of a GRN. The required number of iterations is determined by the number of state transitions before reaching a steady state.

Given an initial state distribution x (0), let x (m) and x (m + 1) be the state distributions after m and m + 1 transitions respectively. Assume that the STM of the network is given by A , then the state transitions from x (m) to x (m + 1) are described by:

xm+1=xmA (13)

If || x (m + 1) -  x (m)|| is used to compute the maximum absolute value of the summed difference of each row in x (m) and x (m + 1), the condition for reaching a steady state is given by [18]:

xm+1-xm<ϵ (14)

where ϵ indicates a threshold for determining whether the steady state has been reached or not. If (14) is met, a network is considered to have reached a steady state, i.e., x (∞) =  x (m); thus x (m) is considered as the stationary distribution. Alternatively, a reasonable number of network transitions (e.g. a few hundreds) can be run to obtain the SSD in practice. As shown in the Results and Discussion section, a time-frame extended simulation of CSSBNs is more efficient than an analytical approach while producing more accurate results than an approximate analysis. See Additional file 2: The pseudocode for computing the SSD of a CSSBN.

In a time-frame expanded CSSBN, random binary bit streams are generated for predictor function selection, gene perturbation and context switching probabilities. As in [24], non-Bernoulli sequences of random permutations of fixed numbers of 1’s and 0’s are used for encoding initial probabilities. These sequences then propagate through the CSSBNs and the statistics encoded in the output sequences are used to obtain the SSD. Compared to the use of Bernoulli sequences, as shown in [35], the use of the non-Bernoulli sequences produces more accurate results for a given sequence length or requires a shorter sequence length for a desired evaluation accuracy. As an efficient alternative to an STM-based analysis, the stochastic simulation of CSSBNs provides flexibility in achieving a tunable accuracy-efficiency tradeoff by using stochastic sequences of different lengths. Hence, the proposed CSSBN approach is potentially useful in the analysis of large GRNs.

Results and discussion

Simulation with a p53-Mdm2 Network

As a tumour suppressor gene, p53 plays an important role in preventing the development and progression of tumour cells [41,42]. In a p53 network, signaling pathways are triggered by external stimuli. For example, DNA damages activate pathways that involve the genes p53 and Mdm2. It has been shown that the expression level of the p53 protein is reversely related with that of the Mdm2 gene, which leads to an oscillatory behavior of the p53-Mdm2 network [25,43]. Various Boolean models have been developed to simulate the dynamics of a p53 network [44-46]. In this section, the two-gene probabilistic Boolean network (PBN) model developed in [24] for the p53-Mdm2 network is used to illustrate the applicability of context-sensitive stochastic Boolean networks (CSSBNs). This (instantaneous) PBN of the two genes p53 and Mdm2 consists of V = {x1, x2} and the predictor function sets F1=f11,f21,f31,f41 and F2=f12,f22,f32,f42. The truth table for the state transitions of this PBN is given as Table 2[24].

Table 2.

Truth table of a PBN for the p53-Mdm2 network: x 1 , x 2 are the present states of p53 and Mdm2

x 1 x 2 f11 f21 f31 f41 f12 f22 f32 f42
00
1
1
1
0
0
0
0
1
01
1
1
0
0
0
0
1
1
10
0
0
1
1
1
1
0
0
11
0
1
1
1
1
0
0
0
cji 0.5 0.4 0.09 0.01 0.5 0.4 0.09 0.01

An internal entry indicates whether a logical 1 or 0 would result at the next state of a gene from a selected Boolean function. cji indicates the transition probability by a function fji[24].

An internal entry indicates whether a logical 1 or 0 would result at the next state of a gene from a selected Boolean function. cji indicates the transition probability by a function fji[24].

In [24], a stochastic Boolean network (SBN) of the p53-Mdm2 network is proposed. Based on the general CSSBN model in Figure 2, a CSSBN for the p53-Mdm2 network is shown in Figure 5. In this model, a network function (or a Boolean network (BN)) is selected at time t - 1 for context i and at time t, the network remains at context i with a probability of q or switches to one of the k contexts (k = 16 for this network) with a probability of 1 - q. This context switching behavior is modeled by a 2-to-1 multiplexer (MUX) for each gene and the switching probability q is encoded in the control bit sequence Q. At the same time, the selection probabilities of the new context are determined by the 2-to-1 multiplexers with the original and current context selection sequences as inputs and Q as the control sequence, as shown in the upper section of Figure 5.

Figure 5.

Figure 5

A context-sensitive stochastic Boolean network (CSSBN) for the p53-Mdm2 network.

The context-sensitive p53-Mdm2 network has 2 genes and 16 contexts. If both context and gene activity profile (GAP) are considered, there are 22 × 16 = 64 states for the p53-Mdm2 CSSBN. Given the transition probability for each predictor function (in Table 2), the probability for selecting each context can readily be computed for independent functions, as shown in Table 3.With random perturbation to the genes, a CSSBN with perturbation is constructed as shown in Figure 6. For this two-gene network, a two-input multiplexer is used for each gene to probabilistically select a perturbed state or the original CSSBN state without perturbation.

Table 3.

The network function and selection probability for each context in the p53-Mdm2 network

Context S2S3, S0S1 Combination Selection probability Context S2S3, S0S1 Combination Selection probability
1
00,00
f12f11
0.25
9
10,00
f32f11
0.045
2
00,01
f12f21
0.2
10
10,01
f32f21
0.036
3
00,10
f12f31
0.045
11
10,10
f32f31
0.0081
4
00,11
f12f41
0.005
12
10,11
f32f41
0.0009
5
01,00
f22f11
0.2
13
11,00
f42f11
0.005
6
01,01
f22f21
0.16
14
11,01
f42f21
0.004
7
01,10
f22f31
0.036
15
11,10
f42f31
0.0009
8 01,11 f22f41 0.004 16 11,11 f42f41 0.0001

The control bits to select a predictor function in Figure 5 are also listed. S0S1: the control bits for p53; S2S3: the control bits for Mdm2.

Figure 6.

Figure 6

A context-sensitive stochastic Boolean network (CSSBN) with perturbation for the p53-Mdm2 network.

The CSSBN is then used to obtain the state transition matrix (STM) for the p53-Mdm2 network. See Additional file 3: The Matlab program that describes the structure of the CSSBN in Figure 6 and computes its STM for the p53-Mdm2 network with perturbation.

The norms || ⋅ ||1, || ⋅ ||2, and || ⋅ || are used to measure the differences of the STMs obtained for the CSSBN and CSPBN. || ⋅ ||1 and || ⋅ || indicate the maximum absolute values of the summed differences of the columns and rows respectively, while || ⋅ ||2 measures the average difference of all entries. Assume A CSSBN and A CSPBN are the obtained STMs for the CSSBN and CSPBN respectively. Let Δ A  =  A CSSBN  -  A CSPBN ; the norms of the differences of the computed matrices (||Δ A ||1, ||Δ A ||2 and ||Δ A ||) are then shown in Table 4 for different values of the switching probability q and perturbation rate p. The average run time is also shown for using the CSSBN.

Table 4.

Differences in the state transition matrices (STMs) obtained using the context-sensitive stochastic Boolean network (CSSBN) with perturbation, compared to the results by using the analytical context-sensitive probabilistic Boolean network (CSPBN) approach in[23]

  L (bits) q = 1 p = 0 q = 0.99 p = 0 q = 0.5 p = 0 q = 0.8 p = 0.01
||Δ A ||1
1k
0.1820
0.2067
0.2320
0.3469
10k
0.0754
0.0754
0.0605
0.0890
100k
0.0291
0.0239
0.0306
0.0210
||Δ A ||2
1k
0.0642
0.0754
0.0706
0.1012
10k
0.0258
0.0259
0.0207
0.0297
100k
0.0101
0.0091
0.0097
0.0080
||Δ A ||
1k
0.0382
0.0413
0.0530
0.0888
10k
0.0144
0.0145
0.0164
0.0249
100k
0.0050
0.0055
0.0064
0.0074
Average time (s)
1k
0.0601
0.0547
0.0534
0.0623
10k
0.3172
0.3180
0.3863
0.3186
  100k 2.8781 2.8531 2.8467 2.9226

The average run time of 20 simulations using the CSSBN is also shown for different perturbation rates, p, and context switching probabilities, q. L: sequence length.

As revealed in Table 4, the difference in the STMs computed using the CSSBN and an analytical CSPBN approach is significantly reduced by increasing the sequence length L. However, the inaccuracies, due to the inherent stochastic fluctuations in stochastic computation, are generally small and thus negligible. Hence, the proposed CSSBN model can be used to accurately and efficiently compute the STM of a CSPBN.

A steady state analysis is further performed on the proposed CSSBN. For the p53-Mdm2 network, there are four states or GAPs. The probability of each GAP is given by the sum of the probabilities for all contexts. The simulation results for the four GAPs with respect to different p and q values are shown in Figure 7 (using a sequence length of 500k bits).As can be seen in Figure 7, while the SSD is determined by both the perturbation and switching probabilities, the perturbation rate has a greater effect on the final distribution of the steady state compared to the switching probability. The SSD changes drastically with the increase of the perturbation rate, whereas the effect of context switching is rather limited for a given perturbation rate.

Figure 7.

Figure 7

The steady state distribution (SSD) of the p53-Mdm2 network for different perturbation rate, p, and context switching probability, q. The steady-state probabilities are shown for (a) state 00, (b) state 01, (c) state 10, and (d) state 11. (L = 500k bits).

As p, q ∈ [0, 1], several p and q pairs are chosen for further analysis. The difference in the SSDs obtained by using the time-frame expanded CSSBN technique, the approximate [20] and accurate analysis [23] are shown in Table 5.

Table 5.

Differences in the steady state distributions (SSDs) computed using the context-sensitive stochastic Boolean network (CSSBN) model, compared to the results by using the approximate [20]and accurate analysis[23]

 
 
p = 0.01
p = 0.1
p = 0.3
      q = 0.9 q = 0.8 q = 0.9
||Δ SSD AP - AC ||1
 
0.0228
0.0214
0.0041
||Δ SSD CSSBN - AC ||1
L (bits)
10k
0.0094
0.0099
0.0073
50k
0.0061
0.0060
0.0057
100k
0.0047
0.0056
0.0040
||Δ SSD AP - AC ||2
 
0.0118
0.0118
0.0025
||Δ SSD CSSBN - AC ||2
L (bits)
10k
0.0055
0.0057
0.0041
50k
0.0036
0.0036
0.0030
100k
0.0028
0.0031
0.0025
||Δ SSD AP - AC ||
 
0.0074
0.0081
0.0020
||Δ SSD CSSBN - AC ||
L (bits)
10k
0.0047
0.0045
0.0027
50k
0.0030
0.0029
0.0021
    100k 0.0020 0.0022 0.0017

Δ SSD AP - AC : the difference between the results obtained by using the approximate and accurate analysis. Δ SSD CSSBN - AC : the difference between the results obtained by using the CSSBN approach and the accurate analysis. p: perturbation rate, q: context switching probability, L: sequence length.

As revealed in Table 5, the CSSBN approach can compute the SSD more accurately than the approximate analysis. In fact, the difference in the results between the CSSBN and the accurate analysis is negligible when reasonably long stochastic sequences are used. With the STM obtained for a CSSBN, an SSD is evaluated and the results are very accurate compared to those obtained using the accurate approach, as shown in Figure 8. A further analysis shows that the relative error is limited to less than approximately 0.2% for the CSSBN approach.The SSDs obtained using different methods, as well as the corresponding values of norms, are shown in Figure 9. It can be seen that the SSD can be accurately evaluated by the CSSBN model compared to the accurate analytical approach.

Figure 8.

Figure 8

Accuracy comparison of the proposed context-sensitive stochastic Boolean network (CSSBN) approach and the accurate analytical approach [23]forp = 0.01,q = 0.9 andL = 10k bits. The relative error is generally less than 0.2% for the CSSBN approach.

Figure 9.

Figure 9

Steady state distributions (SSDs) of the four states, or gene activity profiles (GAPs), obtained using the context-sensitive stochastic Boolean network (CSSBN) approach for different perturbation and switching probabilities.L = 10k bits.

It has been shown that a PBN with perturbation evolves as an ergodic Markov chain [12]. For a larger perturbation probability p, there is an increased randomness in the network activities, thus the steady states of the network are more evenly distributed [23]. This can be seen in the simulation results in Figure 9. Figures 7 and 9 further show that context switching has little impact on the SSD of the network with perturbation. This is due to the fact that the context switching activity only affects the selection probability of a context, but not the predictor functions. Also shown in Figure 9 is that an oscillation of the states of p53 and Mdm2 exists, indicated by the higher probabilities of the network states (or GAPs) 01 and 10 compared with the other states. However, this two-state oscillation is less evident when the perturbation rate is significantly higher. On the other hand, a modest perturbation to the network only has a minor effect on the oscillatory behavior, confirming the stability and robustness of the p53-Mdm2 network in a dynamic environment [47].

Experiments on a glioma network

A network in [16] obtained from a human glioma gene expression data set [48] is further used to illustrate the efficiency of the CSSBN model and the time-frame expansion technique. Based on this data set, a PBN was inferred by a method using the coefficients of determination (CODs) and an SSD analysis was performed in [18]. An approximate method was developed in [20] as an extension to estimate the SSD of a CSPBN with perturbation. The gene TOP2A was not considered in either study as it is an input gene with an in-degree of zero. In our study, the network setting is considered the same as in [18,20] with the gene TOP2A removed; this leads to a total of 214 GAPs. Figure 10 shows a detailed structure of the considered glioma network with double (or single)-headed arrows indicating the bi (or uni)-directional relationships of gene pairs. The selection probabilities for the predictor functions are shown in Table 6[20] and the Boolean functions for each gene are obtained through an analysis of the data in [18].

Figure 10.

Figure 10

A glioma network (adapted from [18,20]).

Table 6.

Selection probabilities of the Boolean functions, F i , i ∈ {1, 2, …, 14}, for each gene in the glioma network in Figure10[18,20]

F 1
F 2
F 3
F 4
F 5
F 6
F 7
0.8560
0.2768
0.6759
1.0000
1.0000
0.0263
1.0000
0.1440
0.7232
0.3241
 
 
0.4983
 
 
 
 
 
 
0.4754
 
F 8
F 9
F 10
F 11
F 12
F 13
F 14
0.0857
1.0000
1.0000
0.8508
1.0000
0.8697
0.6004
0.9143     0.1492   0.1303 0.3996

In Table 6, each column for F i , i ∈ {1, 2, …, 14}, contains the selection probabilities for the Boolean functions of gene i, with the value in the jth row as the probability for fji, thus the probabilities sum to 1 in each column. Table 6 also shows that six genes have only a single predictor function; only the state of gene 6 is determined by three functions, while the states of the other genes are determined by two predictor functions.

Steady state analysis

For the 14-gene glioma network, there are a total of 384 contexts, as can be determined from Table 6. It requires an STM with 214 × 384 = 6291456 columns and rows for an accurate analysis. This makes it infeasible to estimate the steady states of a CSPBN using a matrix-based analysis. The approximate analysis in [20] would require the computation of an STM of the size 214 × 214. Thus, it is difficult in general to use an analytical approach to evaluate a large network, due to the demanding computational resources required. However, a CSSBN model can be constructed for the glioma network; this CSSBN is based on the constituent SBN, as shown in Figure 11. With the CSSBN, the SSD can be estimated using the aforementioned time-frame expansion technique with a greatly reduced complexity. The obtained SSD is then compared with that obtained from the approximate analysis in [20]. In this paper, a network is considered to have reached a steady state if the discrepancy between two adjacent iterations is smaller than a required threshold ϵ (by (14)) or the number of simulation iterations has reached a maximum constant value. See Additional file 4: The Matlab program that evaluates the steady state distribution using the time-frame expanded CSSBN technique for the glioma network with context switching and perturbation.

Figure 11.

Figure 11

A stochastic Boolean network (SBN) for the glioma network as a basis for the context-sensitive SBN (CSSBN) model with 384 contexts.

The state or GAP of the glioma network can be represented by a binary vector as (x1, x2,, x14), x i ∈ {0, 1} for i ∈ {1, 2, …, 14}, or its decimal index. For example, the state (01001100011011) can be represented as state 4892. The SSDs of the context-sensitive glioma network with all 16384 states, obtained using the CSSBN approach and the approximate analysis [20], are shown in Figure 12.

Figure 12.

Figure 12

The steady state distributions (SSDs) of the glioma network obtained using the CSSBN time-frame expansion technique and the approximate analysis [20].L = 800k bits.

The norms of the differences of SSDs, obtained using the CSSBN with different sequence lengths and the approximate method in [20], are shown in Table 7.

Table 7.

Norms of the differences in the computed steady state distributions (SSDs) and average run time for the glioma network

 
p = 0.01 q = 0.9
p = 0.1 q = 0.9
L (bits) 50k 200k 1M 50k 200k 1M
||Δ SSD AP - CSSBN ||1
0.2314
0.1323
0.0861
0.4251
0.2104
0.0939
||Δ SSD AP - CSSBN ||2
0.0055
0.0032
0.0023
0.0045
0.0023
0.0010
||Δ SSD AP - CSSBN ||
0.0013
8.01 × 10- 4
4.13 × 10- 4
3.05 × 10- 4
2.16 × 10- 4
7.85 × 10- 5
Average time (s)
CSSBN
10.106
311.55
1377.5
10.069
315.65
1380.7
  Approximate 21421 21417

A maximum of 300 iterations are used for obtaining the SSD. p: perturbation rate; q: context switching probability; L: sequence length. Δ SSD AP - CSSBN : the difference between the SSDs obtained by the approximate analysis [20] and the context-sensitive stochastic Boolean network (CSSBN) approach.

As shown in Table 7, the CSSBN time-frame expansion technique efficiently evaluates the SSD of the glioma network and produces results comparable to those obtained by the approximate analysis [20]. Evaluation accuracy is further improved by using longer stochastic sequences with yet a shorter runtime compared to the approximate method. Since it is difficult, if not impossible, to compute the STM or SSD of a large GRN by using an accurate or approximate analysis, the CSSBN time-frame expansion approach provides an alternative means to accurately and efficiently estimate the SSD of a large network with a tunable accuracy by using stochastic sequences of different lengths.

Intervention analysis

In an n-gene network, if gene X i is the target gene, the states of all genes can be divided into a set of desirable states, D, and a set of undesirable states, U, by the expression level of the target gene [40]. As can be seen in Figure 10, gene X14, the (BCL2A1); BFL1 protein; GRS protein is one of the most influential genes that interacts closely with others, thus it is chosen as the target gene. The desirable and undesirable states are then separated by the expression level of X14. Assume that the inactivation of X14 is preferred; the cumulative probabilities of the desirable and undesirable states are given by x14=0πx and x14=1πy, respectively, where π x and π y are elements in the desirable and undesirable SSDs respectively.

As discussed previously, a GRN can be intervened by applying external stimuli to minimize the likelihood of being in an undesirable state. Various methods have been proposed for deriving control vectors for an effective intervention [29,40]; in this study the SSD algorithm proposed in [40] is used to determine an intervention vector. For simplicity, 16 most significant contexts that account for a total selection probability of 57.27% are chosen from the total 384 contexts of the glioma network for an intervention analysis. For these 16 contexts, the expression levels of gene X5 and X7 have no effect on the states of other genes, so these two genes are removed; this yields a simplified 12-gene glioma network. The cumulative distributions of the desirable steady states of the context-sensitive 12-gene glioma network are shown in Table 8 for using a different gene as the control gene.

Table 8.

Cumulative distributions of the desirable states with a different gene selected as the control gene for the simplified 12-gene context-sensitive glioma network, with perturbation rate p = 0.001 and context switching probability q = 0.99

Gene X 1 X 2 X 3 X 4 X 6 X 8 X 9 X 10 X 11 X 12 X 13
x14=0πx 70.91% 62.93% 52.61% 45.90% 48.98% 54.78% 53.76% 61.61% 57.23% 56.99% 61.78%

The cumulative probability of the desirable states with no intervention is 45.97%. Sequence length L = 800k bits.

When a different gene is selected as the control gene, the effect of intervention varies with respect to improving the probabilities of desirable states in the SSD. For the glioma network, as revealed in Table 8, gene X1, Tie-2, is the most effective for maximizing the percentage of the desirable states. The cumulative distribution of the desirable states is increased from 45.97% to 70.91% by an intervention using Tie-2 as the control gene. Also revealed in the table is that an intervention via gene X4, (HSP40); (HDJ1; DNAJ1), has nearly no impact on the distribution of the desirable states, as the percentage of the desirable states has not been changed much by the intervention (i.e., 45.90% vs. 45.97%). A modest improvement in the desirable state distribution is obtained by an intervention with another gene as the control gene. For any target gene, this process can be applied to find the most significant gene that maximizes the probabilities of desirable states.

The effects of intervention can also be seen in Figure 13, where three different scenarios are considered: (a) no intervention, (b) an indirect intervention via gene X1, i.e., Tie-2, and (c) a direct intervention via the target gene X14, the (BCL2A1); BFL1 protein; GRS protein. The cumulative probabilities of the desirable states are 45.97% for no intervention, 70.91% and 99.99% for the two intervention strategies. As revealed in these results, a direct intervention of the target gene is perhaps optimal for avoiding the undesirable states and almost certainly taking the network into a desirable state. However, when an intervention on the target gene is not possible, an intervention through Tie-2 is the most effective means to maximize the probability of the down-regulation of the target gene X14.

Figure 13.

Figure 13

Steady state distributions (SSDs) of the 12-gene context sensitive glioma network (p= 0.001,q= 0.99) obtained by: (a) no intervention, (b) an indirect intervention via geneX1, Tie-2, and (c) a direct intervention via the target geneX14, the (BCL2A1); BFL1 protein; GRS protein.

Conclusion

Context-sensitive stochastic Boolean networks (CSSBNs) are proposed as an efficient approach to modeling the effects of gene perturbation and intervention in gene regulatory networks (GRNs). In a CSSBN, the state transition matrix can be accurately and efficiently computed with a complexity of O(nLk2 n ), where n is the number of genes in a context-sensitive probabilistic Boolean network (CSPBN), k is the number of contexts and L is a factor determined by the stochastic sequence length. This result is an improvement compared to the previous result of O(nk222n ) for an accurate analysis. The use of non-Bernoulli stochastic sequences further increases computational efficiency and allows for a tunable tradeoff between accuracy and efficiency. A steady state analysis using a time-frame expansion technique has shown a significant speedup and produced more accurate results than an approximate analysis in the computation of the steady state distribution.

CSSBNs are constructed for the analysis of gene perturbation in a p53-Mdm2 network and gene intervention in a glioma network. Biologically meaningful insights are gained into the oscillatory dynamics of the p53-Mdm2 network in a context-switching environment with random gene perturbation. It has been shown that the steady state distribution (SSD) changes drastically with the increase of the perturbation rate, whereas the effect of context switching is rather limited for a given perturbation rate. Therefore, random gene perturbation may have a greater effect on the final distribution of the steady state compared to context switching activities. By predicting the SSD, the CSSBN approach can further evaluate the effectiveness of external gene intervention. A case study of the glioma network shows that the CSSBNs are useful in modeling the effects of gene perturbation and intervention in a complex context-sensitive GRN. This will eventually help drug discovery for an effective drug intervention therapy.

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

PZ carried out the GRN studies, performed the statistical analysis and drafted the manuscript. JL developed the initial models and participated in discussion. JH participated in the GRN studies, revised the manuscript and supervised the research. All authors read and approved the final manuscript.

Supplementary Material

Additional file 1

The pseudocode for computing the state transition matrix (STM) of a context-sensitive stochastic Boolean network (CSSBN).

Click here for file (126.1KB, pdf)
Additional file 2

The pseudocode for computing the steady-state distribution (SSD) of a context-sensitive stochastic Boolean network (CSSBN).

Click here for file (205.8KB, pdf)
Additional file 3

The Matlab program that describes the structure of the context-sensitive stochastic Boolean network (CSSBN) in Figure 6and computes its state transition matrix for the p53-Mdm2 network with perturbation.

Click here for file (1.9KB, zip)
Additional file 4

The Matlab program that evaluates the steady state distribution using the time-frame expanded context-sensitive stochastic Boolean network (CSSBN) technique for the glioma network with context switching and perturbation.

Click here for file (3.9KB, zip)

Contributor Information

Peican Zhu, Email: peican@ualberta.ca.

Jinghang Liang, Email: jinghang@ualberta.ca.

Jie Han, Email: jhan8@ualberta.ca.

Acknowledgements

This work was supported in part by the Natural Sciences and Engineering Research Council (NSERC) of Canada in a Discovery Grant and a Startup fund of the University of Alberta.

References

  1. Elowitz MB, Levine AJ, Siggia ED, Swain PS. Stochastic gene expression in a single cell. Sci. 2002;297:1183–1186. doi: 10.1126/science.1070919. [DOI] [PubMed] [Google Scholar]
  2. Pedraza JM, van Oudenaarden A. Noise propagation in genetic networks. Sci. 2005;307:1965–1969. doi: 10.1126/science.1109090. [DOI] [PubMed] [Google Scholar]
  3. Dunlop M, Cox RIII, Levine J, Murray R, Elowitz M. Regulatory activity revealed by dynamic correlations in gene expression noise. Nat Genet. 2008;40:1493–1498. doi: 10.1038/ng.281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Kauffman SA. Metabolic stability and epigenesis in randomly constructed genetic nets. Theor Biol. 1969;22:437–467. doi: 10.1016/0022-5193(69)90015-0. [DOI] [PubMed] [Google Scholar]
  5. Klipp E. Systems Biology in Practice: Concepts, Implementation and Application. Weinheim, Germany: Wiley-VCH; 2005. [Google Scholar]
  6. Qian L, Wang H, Dougherty E. Inference of noisy nonlinear differential equation models for gene regulatory networks using genetic programming and Kalman filtering. IEEE Trans Signal Process. 2008;56(7):3327–3339. [Google Scholar]
  7. Gillespie DT. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976;22:403. doi: 10.1016/0021-9991(76)90041-3. [DOI] [Google Scholar]
  8. Gillespie DT. Exact stochastic simulation of coupled chemical reactions. J Phys Chem. 1977;81:2340–2361. doi: 10.1021/j100540a008. [DOI] [Google Scholar]
  9. Davidich M, Bornholt S. Boolean network model predicts cell cycle sequence of Fission yeast. PLoS One. 2008;3(2):e1672–e2008. doi: 10.1371/journal.pone.0001672. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Giacomantonio CE, Goodhill GJ. A Boolean model of the gene regulatory network underlying Mammalian cortical area development. PLoS Comput Biol. 2010;6(9):e1000936. doi: 10.1371/journal.pcbi.1000936. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Huang S. Gene expression profiling, genetic networks, and cellular states: an integrating concept for tumorigenesis and drug discovery. J Mol Med. 1999;77:469–480. doi: 10.1007/s001099900023. [DOI] [PubMed] [Google Scholar]
  12. Shmulevich I, Dougherty ER, Zhang W. From Boolean to probabilistic Boolean networks as models of genetic regulatory networks. Proc IEEE. 2002;90:1778–1792. doi: 10.1109/JPROC.2002.804686. [DOI] [Google Scholar]
  13. Dougherty ERSI, Zhang W. Gene perturbation and intervention in probabilistic Boolean networks. Bioinform. 2002;18(10):1319–1331. doi: 10.1093/bioinformatics/18.10.1319. [DOI] [PubMed] [Google Scholar]
  14. Shmulevich I, Dougherty E. Probabilistic Boolean Networks: The Modeling and Control of Gene Regulatory Networks. Philadelphia: SIAM; 2009. [Google Scholar]
  15. Pal R, Datta A, Bittner ML, Dougherty ER. Intervention in context-sensitive probabilistic Boolean networks. Bioinform. 2005;21:1211–1218. doi: 10.1093/bioinformatics/bti131. [DOI] [PubMed] [Google Scholar]
  16. Shmulevich I, Gluhovsky I, Hashimoto RF, Dougherty ER, Zhang W. Steady-state analysis of genetic regulatory networks modeled by probabilistic Boolean networks. Comp Funct Genomics. 2003;4:601–608. doi: 10.1002/cfg.342. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Rosenthal JS. Minorization conditions and convergence rates for Markov chain Monte Carlo. J Am Stat Assoc. 1995;90:558–566. doi: 10.1080/01621459.1995.10476548. [DOI] [Google Scholar]
  18. Zhang S, Ching W, Ng M, Akutsu T. Simulation study in probabilistic Boolean network models for genetic regulatory networks. Int J Data Min Bioinform. 2007;1(3):217–240. doi: 10.1504/IJDMB.2007.011610. [DOI] [PubMed] [Google Scholar]
  19. Brun M, Dougherty ER, Shmulevich I. Steady-state probabilities for attractors in probabilistic Boolean networks. Signal Process. 2005;85:1993–2013. doi: 10.1016/j.sigpro.2005.02.016. [DOI] [Google Scholar]
  20. Ching W, Zhang S, Ng M, Akutsu T. An approximation method for solving the steady-state probability distribution of probabilistic Boolean networks. Bioinform. 2007;23:1511–1518. doi: 10.1093/bioinformatics/btm142. [DOI] [PubMed] [Google Scholar]
  21. Datta A, Choudhary A, Bittner ML, Dougherty ER. External control in markovian genetic regulatory networks. Mach Learn. 2003;52:169–191. doi: 10.1023/A:1023909812213. [DOI] [PubMed] [Google Scholar]
  22. Datta A, Choudhary A, Bittner ML, Dougherty ER. External control in markovian genetic reegulatory networks: the imperfect information case. Bioinform. 2004;20:924–993. doi: 10.1093/bioinformatics/bth008. [DOI] [PubMed] [Google Scholar]
  23. Pal R. Context-sensitive probabilistic Boolean networks: steady-state properties, reduction, and steady-state approximation. IEEE Trans Signal Process. 2010;58(2):879–890. [Google Scholar]
  24. Liang J, Han J. Stochastic Boolean networks: an efficient approach to modeling gene regulatory networks. BMC Syst Biol. 2012;6:113. doi: 10.1186/1752-0509-6-113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Lahav G, Rosenfeld N, Sigal A, Geva-Zatorsky N, Levine AJ, Elowitz MB, Alon U. Dynamics of the p53-Mdm2 feedback loop in individual cells. Nat Genet. 2004;36:147–150. doi: 10.1038/ng1293. [DOI] [PubMed] [Google Scholar]
  26. Martin S, Zhang Z, Martino A, Faulon J-L. Boolean dynamics of genetic regulatory networks infer red from microarray time series data. Bioinform. 2007;23(7):866–874. doi: 10.1093/bioinformatics/btm021. [DOI] [PubMed] [Google Scholar]
  27. Zhu P, Han J. Stochastic multiple-valued gene networks. IEEE Transac Biomed Circ Syst. 2014;8(1):42–53. doi: 10.1109/TBCAS.2013.2291398. [DOI] [PubMed] [Google Scholar]
  28. Zhu P, Han J. Asynchronous stochastic Boolean networks as gene network models. J Comput Biol. 2014. in press. [DOI] [PubMed]
  29. Faryabi B, Vahedi G, Chamberland JF, Datta A, Dougherty ER. Intervention in context-sensitive probabilistic Boolean networks revisited. EURASIP J Bioinform Syst Biol. 2009;2009:5. doi: 10.1155/2009/360864. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. McAdams HH, Shapiro L. Circuit simulation of genetic networks. Sci. 1995;269(5224):650. doi: 10.1126/science.7624793. [DOI] [PubMed] [Google Scholar]
  31. Abdi A, Tahoori MB, Emamian ES. Fault diagnosis engineering of digital circuits can identify vulnerable molecules in complex cellular pathways. Sci Signal. 2008;1(42):ra10. doi: 10.1126/scisignal.2000008. [DOI] [PubMed] [Google Scholar]
  32. Adar R, Benenson Y, Linshiz G, Rosner A, Tishby N, Shapiro E. Stochastic computing with biomolecular automata. PNAS. 2004;101(27):9960–9965. doi: 10.1073/pnas.0400731101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Benenson Y, Gil B, Ben-Dor U, Adar R, Shapiro E. An autonomous molecular computer for logical control of gene expression. Nat. 2004;429:423–429. doi: 10.1038/nature02551. [DOI] [PubMed] [Google Scholar]
  34. Gaines BR. Stochastic computing systems. Adv Inf Syst Sci. 1969;2:37–172. [Google Scholar]
  35. Han J, Chen H, Liang J, Zhu P, Yang Z, Lombardi F. A Stochastic Computational Approach for Accurate and Efficient Reliability Evaluation. IEEE Transactions on Computers. 2013. in press. Advance access in IEEE Xplore.
  36. Pal R, Datta A, Bittner ML, Dougherty ER. Optimal infinite horizon control for probabilistic Boolean networks. IEEE Trans Signal Process. 2006;54:2375–2387. [Google Scholar]
  37. Faryabi B, Datta A, Dougherty ER. On approximate stochastic control in genetic regulatory networks. IET Syst Biol. 2007;1(6):361–368. doi: 10.1049/iet-syb:20070015. [DOI] [PubMed] [Google Scholar]
  38. Layek R, Datta A, Pal R, Dougherty ER. Adaptive intervention in probabilistic Boolean networks. Bioinform. 2009;25(16):2042–2048. doi: 10.1093/bioinformatics/btp349. Doi: 10.1093/bioinformatics/btp34. [DOI] [PubMed] [Google Scholar]
  39. Qian X, Dougherty ER. Effect of function perturbation on the steady-state distribution of genetic regulatory networks: optimal structural intervention. IEEE Trans Signal Process. 2008;56:4966–4976. [Google Scholar]
  40. Qian X, Ivanov I, Ghaffari N, Dougherty ER. Intervention in gene regulatory networks via greedy control policies based on long-run behavior. BMC Syst Biol. 2009;3:16. doi: 10.1186/1752-0509-3-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Weinberg RA. The Biology of Cancer. 1. New York: Garland Science; 2006. [Google Scholar]
  42. Vogelstein B, Lane D, Levine AJ. Surfing the p53 network. Nat. 2000;408:307–310. doi: 10.1038/35042675. [DOI] [PubMed] [Google Scholar]
  43. Batchelor E, Loewer A, Lahav G. The ups and downs of p53: understanding protein dynamics in single cells. Nat Rev Cancer. 2009;9(5):371–377. doi: 10.1038/nrc2604. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Ciliberto A, Novak B, Tyson JJ. Steady states and oscillations in the p53-Mdm2 network. Cell Cycle. 2005;4:486–493. doi: 10.4161/cc.4.3.1548. [DOI] [PubMed] [Google Scholar]
  45. Abou-Jaoude W, Ouattara D, Kaufman M. From structure to dynamics: frequency tuning in the p53-mdm2 network: I: logical approach. J Theor Biol. 2009;258(4):561–577. doi: 10.1016/j.jtbi.2009.02.005. [DOI] [PubMed] [Google Scholar]
  46. Murrugarra D, Veliz-Cuba A, Aguilar B, Arat S, Laubenbacher R. Modeling stochasticity and variability in gene regulatory networks. EURASIP J Bioinform Syst Biol. 2012;1:5. doi: 10.1186/1687-4153-2012-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Fuller GN, Rhee CH, Hess KR, Caskey LS, Wang R, Bruner JM, Yung WK, Zhang W. Reactivation of insulin-like growth factor binding protein 2 expression in glioblastoma multiforme: a revelation by parallel gene expression profiling. Cancer Res. 1999;59:4228–4232. [PubMed] [Google Scholar]
  48. Ge H, Qian M. Boolean network approach to negative feedback loops of the p53 pathways: synchronized dynamics and stochastic limit cycles. J Comput Biol. 2009;16:119–132. doi: 10.1089/cmb.2007.0181. doi:10.1089/cmb.2007.0181. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional file 1

The pseudocode for computing the state transition matrix (STM) of a context-sensitive stochastic Boolean network (CSSBN).

Click here for file (126.1KB, pdf)
Additional file 2

The pseudocode for computing the steady-state distribution (SSD) of a context-sensitive stochastic Boolean network (CSSBN).

Click here for file (205.8KB, pdf)
Additional file 3

The Matlab program that describes the structure of the context-sensitive stochastic Boolean network (CSSBN) in Figure 6and computes its state transition matrix for the p53-Mdm2 network with perturbation.

Click here for file (1.9KB, zip)
Additional file 4

The Matlab program that evaluates the steady state distribution using the time-frame expanded context-sensitive stochastic Boolean network (CSSBN) technique for the glioma network with context switching and perturbation.

Click here for file (3.9KB, zip)

Articles from BMC Systems Biology are provided here courtesy of BMC

RESOURCES