Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

ArXiv logoLink to ArXiv
[Preprint]. 2024 Jan 23:arXiv:2401.12477v1. [Version 1]

Modular Control of Biological Networks

David Murrugarra 1, Alan Veliz-Cuba 2, Elena Dimitrova 3, Claus Kadelka 4, Matthew Wheeler 5, Reinhard Laubenbacher 5
PMCID: PMC10854280  PMID: 38344220

Abstract

The concept of control is central to understanding and applications of biological network models. Some of their key structural features relate to control functions, through gene regulation, signaling, or metabolic mechanisms, and computational models need to encode these. Applications of models often focus on model-based control, such as in biomedicine or metabolic engineering. This paper presents an approach to model-based control that exploits two common features of biological networks, namely their modular structure and canalizing features of their regulatory mechanisms. The paper focuses on intracellular regulatory networks, represented by Boolean network models. A main result of this paper is that control strategies can be identified by focusing on one module at a time. This paper also presents a criterion based on canalizing features of the regulatory rules to identify modules that do not contribute to network control and can be excluded. For even moderately sized networks, finding global control inputs is computationally very challenging. The modular approach presented here leads to a highly efficient approach to solving this problem. This approach is applied to a published Boolean network model of blood cancer large granular lymphocyte (T-LGL) leukemia to identify a minimal control set that achieves a desired control objective.

Keywords: Boolean networks, modularity, control, canalization, gene regulatory networks

1. Introduction

With the availability of more experimental data and information about the structure of biological networks, computational modeling can capture increasingly complex features of biological networks [1, 2]. However, the increased size and complexity of dynamic network models also poses challenges in understanding and applying their structure as a tool for model-based control, important for a range of applications [3, 4]. This is our focus here. To narrow the scope of the problems we address we limit ourselves to intracellular networks represented by Boolean network (BN) models. BNs are widely used in molecular systems biology to capture the coarse-grained dynamics of a variety of regulatory networks [5]. They have been shown to provide a good approximation of the dynamics of continuous processes [6].

For the commonly-used modeling framework of ordinary differential equations, there is a well-developed theory of optimal control, which is largely absent for other modeling frameworks, such as Boolean networks or agent-based models, both frequently used in systems biology and biomedicine. Furthermore, control inputs, in many cases, are of a binary nature, such as gene knockouts or the blocking of mechanisms. For BNs, there is no readily available mathematical theory that could be used for control, leaving sampling and simulation. As networks get larger, with hundreds [7] or even thousands of nodes [8], this leaves few computational tools to identify control inputs for achieving preselected objectives, such as moving a network from one phenotype (e.g., cancer) to another (e.g., normal). One approach is to reduce the system in a way that the reduced system maintains relevant dynamical properties such as its attractors [9, 10]. This allows the control methods to be applied to the reduced system, and the same controls can then be used for the original system.

An approach that has not been used so far, to our knowledge, is to exploit the modular structure of many biological systems to identify control strategies by focusing on one module at a time. Modularity refers to the division of the system into separate units, or modules, that each have a specific function [11, 12]. Modularity is a fundamental property of biological systems that is essential for the evolution of new functions and the development of robustness [13, 14]. In [15], we developed a mathematical theory of modularity for Boolean network models and showed that one can identify network-level control inputs at the modular level. That is, we obtain global control inputs by identifying them at the local, modular level and assembling them to global control. This enables network control for much larger networks than would otherwise be computationally feasible. In this paper, we develop this approach into a mathematical theory of biological network control.

We further propose to use another property of biological networks, represented through Boolean network models. Almost all Boolean rules that describe the dynamics of over 120 published, expert-curated biological Boolean network models have the property that they exhibit some degree of canalization [16]. A Boolean function is canalizing if it has one or more variables that, when they take on a particular value, they determine the value of the function, irrespective of the values of all the other variables. As an example, any variable in a conjunctive rule (e.g., xyz) determines the value of the entire rule, when it takes on the value 0. We derive a criterion for Boolean network models whose Boolean functions are all canalizing, that can be used to exclude certain modules from needing to be considered for the identification of controls.

Our approach to control via modularity is summarized in Figure 1. We decompose the network into its constituent modules, then apply control methods to each module to identify a control target for the entire network. We show that by combining the controls of the modules, we can control the entire network. In the last part of the paper, we present theoretical results that exploit the canalizing properties of the regulatory functions to exclude certain modules from the control search. Finally, we demonstrate our approach by applying it to a published model of the blood cancer large granular lymphocyte (T-LGL) leukemia [17].

Figure 1:

Figure 1:

Control via modularity. First, the network is decomposed into its constituent modules: F1,,Fn. Then, controls μ1,,μn are identified for each module. Combining the controls of the modules μ=μ1,,μn yields a set of controls for the whole network.

2. Background

We first describe Boolean networks and how to decompose a network into modules. In a BN, each gene is represented by a node that can be in one of two states: ON or OFF. Time is discretized as well, and the state of a gene at the next time step is determined by a Boolean function that takes as input the current states of a subset of the nodes in the BN. The dependence of a gene on the state of another gene can be graphically represented by a directed edge, and the wiring diagram contains all such dependencies.

2.1. Boolean Networks

Boolean networks can be seen as discrete dynamical systems. Specifically, consider n variables x1,,xn each of which can take values in F2:={0,1}, where F2 is the field with two elements, 0 and 1, where arithmetic is performed modulo 2. Then, a synchronously updated Boolean network is a function F=f1,,fn:F2nF2n, where each coordinate function fi describes how the future value of variable xi depends on the present values of all variables. All variables are updated at the same time (synchronously).

The wiring diagram of a Boolean network F=f1,,fn:F2nF2n is the directed graph with vertices x1,,xn and an edge from xi to xj if fj depends on xi. That is, if there exists xF2n such that fj(x)fx+ei, where ei is the i th unit vector.

Example 2.1.

Figure 2a shows the wiring diagram of the Boolean network F:F23F23 given by

Fx1,x2,x3=x2¬x3,x3,¬x1x2.
Figure 2:

Figure 2:

Wiring diagram and state space of the Boolean network in Example 2.12.8. (a) The wiring diagram encodes the dependency between variables. (b) The state space is a directed graph with edges between all states and their images. This graph therefore encodes all possible trajectories.

2.2. Dynamics of Boolean networks

Another directed graph associated with a BN is the state space. It describes all possible transition of the BN from one time step to another. The attractors of a BN are sets of states from which there is no escape as the system evolves. An attractor with a single state is also called a steady state (or fixed point). In mathematical models of intracellular regulatory networks, the attractors of the model are often associated with the possible phenotypes of the cell. This idea can be traced back to Waddington 18 and Kauffman 19. For example, in a model of cancer cells, the steady states of the model correspond to proliferative, apoptotic, or growth-arrest phenotypes 20. Mathematically, a phenotype is associated with a group of attractors where a subset of the system’s variables have the same states. These shared states are then used as biomarkers that indicate diverse hallmarks of the system.

There are two ways to describe the dynamics of a Boolean network F:F2nF2n, (i) as trajectories for all 2n possible initial conditions, or (ii) as a directed graph with nodes in F2n. Although the first description is less compact, it will allow us to formalize the dynamics of coupled networks.

Definition 2.2.

A trajectory of a Boolean network F:F2nF2n is a sequence (x(t))t=0 of elements of F2n such that x(t+1)=F(x(t)) for all t0.

Example 2.3.

For the network in the example above, Fx1,x2,x3=x2¬x3,x3,¬x1x2, there are 23=8 possible initial states giving rise to the following trajectories (commas and parenthesis for states are omitted for brevity).

T1=(000,000,000,000,)
T2=001,010,101,010,
T3=(010,101,010,101,)
T4=(011,011,011,011,)
T5=(100,000,000,000,)
T6=(101,010,101,010,)
T7=(110,100,000,000,)
T8=(111,010,101,010,)

We can see that T3 and T6 are periodic trajectories with period 2. Similarly, T1 and T4 are periodic with period 1. All other trajectories eventually reach one of these 4 states.

When seen as trajectories, T3 and T6 are different, but they can both be encoded by the fact that F(0,1,0)=(1,0,1) and F(1,0,1)=(0,1,0). Similarly, T1 and T4 can be encoded by the equalities F(0,1,1)=(0,1,1) and F(0,0,0)=(0,0,0). This alternative, more compact way of encoding the dynamics of a Boolean network is the standard approach, which we formalize next.

Definition 2.4.

The state space of a (synchronously updated) Boolean network F:F2nF2n is a directed graph with vertices in F2n and an edge from x to y if F(x)=y.

Example 2.5.

Figure 2b shows the state space of the (synchronously updated) Boolean network from Example 2.1.

From the state space, one can easily obtain all periodic points, which form the attractors of the network.

Definition 2.6.

The space of attractors for a Boolean network is the collection 𝒟(F) of all minimal subsets 𝒞F2n satisfying F(𝒞)=𝒞.

  1. The subset 𝒟1(F)𝒟(F) of sets of exact size 1 consists of all steady states (also known as fixed points) of F.

  2. The subset 𝒟r(F)𝒟(F) of sets of exact size r consists of all cycles of length r of F.

    Equivalently, an attractor of length r is an ordered set with r elements, 𝒞=c1,,cr, such that Fc1=c2,Fc2=c3,,Fcr1=cr,Fcr=c1.

Remark 2.7.

In the case of steady states, the attractor 𝒞={c} may be denoted simply by c.

Example 2.8.

The Boolean network from Example 2.1 has 2 steady states (i.e., attractors of length 1) and one cycle of length 2, which can be easily derived from its state space representation (Figure 2b).

2.3. Modules

Before defining modules we will describe the restriction of a Boolean network to a subset of its variables using the following example. For a formal definition of restrictions, please see [15].

Example 2.9.

Consider the Boolean network

F(x)=x2x1,¬x1,x1¬x4,x1¬x2x3x4

with wiring diagram in Figure 3a. The restriction of this network to Y=x1,x2 is the 2-variable network FYx1,x2=x2x1,¬x1, which forms the first module (indicated by the amber box in Figure 3a), while the restriction of F to Y=x3,x4 is the 2-variable network FYx3,x4=¬x4,x3x4, which forms the second module (indicated by the green module in Figure 3a). Note that the wiring diagram of FY is always a subgraph of the wiring diagram of F, irrespective of the choice of Y.

Figure 3:

Figure 3:

Boolean network decomposition into modules. (a) Wiring diagram of a non-strongly connected Boolean network where the non-trivial modules are highlighted by amber and green boxes. (b) Directed acyclic graph describing the corresponding connections between the nontrivial modules.

Definition 2.10.

The wiring diagram of a Boolean network is strongly connected if every pair of nodes is connected by a directed path. That is, for each pair of nodes xi, xj in the wiring diagram with xixj there exists a directed path from xi to xj (and vice versa). In particular, a one-node wiring diagram is strongly connected by definition.

Remark 2.11.

The wiring diagram of any Boolean network is either strongly connected or it is composed of a collection of strongly connected components where connections between different component move in only one direction.

Let F be a Boolean network and let W1,,Wm be the strongly connected components of its wiring diagram, with Xi denoting the set of variables in strongly connected component Wi. Then, the modules of F are defined by FX1,,FXn.

Definition 2.12.

Let W1,,Wm be the strongly connected components of the wiring diagram of a Boolean network F. By setting WiWj if there exists at least one edge from a vertex in Wi to a vertex in Wj, we obtain a (directed) acyclic graph

Q=(i,j)WiWj,

which describes the connections between the strongly connected components of F.

Example 2.13.

For the Boolean network F from Example 2.9, the wiring diagram has two strongly connected components W1 and W2 with variables X1=x1,x2 and X2=x3,x4 (Figure 3a), connected according to the directed acyclic graph Q={(1,2)}. The two modules of F are given by the restriction of F to X1 and X2, that is, FX1x1,x2=x2x1,¬x1 and FX2x3,x4=¬x4,x3x4 (Figure 3b). Note that the module FX1, i.e., the restriction of F to X1, is simply the projection of F onto the variables X1 because W1 does not receive feedback from the other component (i.e., because (2,1)Q).

3. Control via Modularity

In this section, we apply the modular decomposition theory described in the previous section and in [15] to make the control problem of Boolean networks more tractable. We show how the decomposition into modules can be used to obtain controls for each module, which can then be combined to obtain a control for the entire network. In this context, two types of control actions are generally considered: edge controls and node controls. For each type of control, one can consider deletions or constant expressions as defined below. The motivation for considering these control actions is that they represent the common interventions that can be implemented in practice. For instance, edge deletions can be achieved by the use of therapeutic drugs that target specific gene interactions, whereas node deletions represent the blocking of effects of products of genes associated to these nodes; see [21, 22].

Once the modules have been identified, different methods for phenotype control (that is, control of the attractor space) can be used to identify controls in these networks. Some of these methods employ stable motifs 23, feedback vertex sets 24, as well as algebraic approaches 25, 26, 27. For our examples below, we will use the methods defined in 23, 25, 24 to find controls for the simple networks.

A Boolean network F=f1,,fn:F2nF2n with control is a Boolean network :F2n×UF2n, where U is a set that denotes all possible controls, defined below. The case of no control coincides with the original Boolean network, that is, (x,0)=F(x). Given a control uU, the dynamics are given by x(t+1)=(x(t),u). See [25] for additional details and examples of how to encode control edges and nodes in a Boolean network.

Definition 3.1 (Edge Control).

Consider the edge xixj in the wiring diagram W. The function

jx,ui,j:=fjx1,,ui,j+1xi+ui,jai,,xn, (1)

where ai is a constant in F2, encodes the control of the edge xixj, since for each possible value of ui,jF2 we have the following control settings:

  • If ui,j=0, j(x,0)=fjx1,,xi,,xn. That is, the control is not active.

  • If ui,j=1, j(x,1)=fjx1,,xi=ai,,xn. In this case, the control is active, and the action represents the removal of the edge xixj when ai=0, and the constant expression of the edge if ai=1. We use xiaixj to denote that the control is active.

This definition can be easily extended for the control of many edges, so that we obtain :F2n×F2eF2n, where e is the number of edges in the wiring diagram. Each coordinate, ui,j, of u in (x,u) encodes the control of an edge xixj.

Definition 3.2 (Node Control).

Consider the node xi in the wiring diagram W. The function

jx,ui,ui+:=ui+ui++1fj(x)+ui+ (2)

encodes the control (knock-out or constant expression) of the node xi, since for each possible value of ui,ui+F22 we have the following control settings:

  • For ui=0, ui+=0, j(x,0,0)=fj(x). That is, the control is not active.

  • For ui=1, ui+=0, j(x,1,0)=0. This action represents the knock-out of the node xj.

  • For ui=0, ui+=1, j(x,0,1)=1. This action represents the constant expression of the node xj.

  • For ui=1, ui+=1, j(x,1,1)=fjxt1,,xtm+1. This action changes the Boolean function to its negative value.

Definition 3.3.

A control μ stabilizes a network F at an attractor 𝒞 when the resulting network after applying μ to F (denoted here as Fμ) has 𝒞 as its only attractor.

For a Boolean network F, we let 𝒟(F) denote the set of its attractors. Whenever the Boolean network F is decomposable into multiple constituent modules F1,F2,,Fn(n2), we write F=F1P1F2P2Pn1Fn where the semi-product operation Pi indicates the coupling of the subnetworks, as described in 15. Furthermore, from the decomposition theory described in [15, the attractors of F are of the form 𝒞=𝒞1𝒞2𝒞n where 𝒞i𝒟Fi is an attractor of the subnetwork, for i=1,,n. The following theorem takes advantage of the modular structure of the network to find controls one module at a time.

Theorem 3.4.

Given a decomposable network F=F1PF2, if μ1 is a control that stabilizes F1 in 𝒞1 (whether 𝒞1 is an existing attractor or a new one) and μ2 is a control that stabilizes F2𝒞1 in 𝒞2 (whether 𝒞2 is an existing attractor or a new one), then μ=μ1,μ2 is a control that stabilizes F in 𝒞=𝒞1𝒞2 provided that either 𝒞1 or 𝒞2 is a steady state.

Proof.

Let F1μ1 be the resulting network after applying the control μ1. Thus, the dynamics of F1μ1 is 𝒞1, that is 𝒟(F1μ1)=𝒞1. Similarly, the dynamics of F2𝒞1,μ2 is 𝒞2. That is, 𝒟(F2𝒞1,μ2)=𝒞2. Then,

Fμ=F1PF2μ=F1μPF2μ=F1μ1PF2μ2.

Thus,

𝒟Fμ=𝒟F1μ1PF2μ2=𝒞'𝒟(F1μ1)𝒞'𝒟(F2𝒞',μ2)=𝒞1𝒟(F2𝒞1,μ2)=𝒞1𝒞2.

For the last equality we used the fact that the product of a steady state and a cycle (or vice versa) will result in only one attractor for the combined network. The former is not always true in general because multiplying two attractors (of length greater than 1) might result in several attractors for the composed network due to the attractors starting at different states.

It follows that there is only one attractor of Fμ and that attractor is 𝒞1𝒞2. Thus, F is stabilized by μ=μ1,μ2 and we have 𝒟Fμ=𝒞.▫

Theorem 3.4 shows how the modular structure can be used to identify controls that stabilize the network in any desired state. In particular, we can use the modular structure of a network to find controls that stabilize a network at an existing attractor, which is often the case in biological control applications. We state this fact in the following corollary.

Corollary 3.5.

Given a decomposable network F=F1PF2, let 𝒞=𝒞1𝒞2 be an attractor of F, where 𝒞1𝒟F1 and 𝒞2𝒟F2𝒞1 and at least 𝒞1 or 𝒞2 is a steady state. If μ1 is a control that stabilizes F1 in 𝒞1 and μ2 is a control that stabilizes F2𝒞1 in 𝒞2, then μ=μ1,μ2 is a control that stabilizes F in 𝒞.

Remark 3.6.

In Theorem 3.4 we required one of the stabilized attractors to be a steady state in order to be able to combine the controls from the modules. We can remove this requirement from Theorem 3.4 by using the following definition of stabilization for non-autonomous networks, which will guarantee that 𝒞1 and 𝒞2 can be combined in a unique way, resulting in a unique attractor of the whole network.

Definition 3.7.

A non-autonomous Boolean network is defined by

yt+1=Hgt,yt,

where H:F2m+nF2n and (g(t))t=0 is a sequence with elements in F2m. We call this type of network non-autonomous because its dynamics will depend on g(t). We use Hg to denote this non-autonomous network.

A state cF2n is a steady state of Hg if H(g(t),c)=c for all t. Similarly, an ordered set with r elements, 𝒞=c1,,cr, is an attractor of length r of Hg if c2=Hg(1),c1, c3=Hg(2),c2,,cr=Hg(r1),cr1, c1=Hg(r),cr, c2=Hg(r+1),c1, Note that in general g(t) is not necessarily of period r and may even not be periodic.

If H(g(t),y)=G(y) for some network G (that is, it does not depend on g(t)) for all t, then y(t+1)=H(g(t), y(t))=G(y(t)) and this definition of attractors coincides with the classical definition of attractors for (autonomous) Boolean networks (Definition 2.6).

Definition 3.8.

Consider a controlled non-autonomous network given by y(t+1)=F2(g(t),y(t),u), where g(t) is a trajectory representation of an attractor 𝒞1 of an upstream network. We say that a control μ2 stabilizes this network, F2𝒞1 (defined as in Definition 3.7), at an attractor 𝒞2 when the resulting network after applying μ2 (denoted here as F2𝒞1,μ2) has 𝒞2 as its unique attractor. For non-autonomous networks the definition of unique attractor requires that (g(t),y(t))t=0 has a unique periodic trajectory up to shifting of t (which is automatically satisfied if 𝒞1 or 𝒞2 is a steady state).

Example 3.9.

Consider, again, the network Fx1,x2,x3,x4=x2,x1,x2x4,x3, which can be decomposed into F=F1F2, with F1x1,x2=x2,x1 and F2x3,x4=x4,x3. Suppose we want to stabilize F in 0110 (which is not an attractor of F). Note that the non-autonomous network F2x1,x2,x3,x4=x2x4,x3 and 𝒟F1={00,11,{01,10}}.

  • Consider the control μ1:(x11x2,x20x1). That is, the control is the combined action of setting the input from x1 to x2 to 1 and the input from x2 to x1 to 0. The control μ1 stabilizes F1 at 01, which is not an original attractor of F1. Let 𝒞1={01}𝒟F1μ1. Note that the space of attractors for F2𝒞1 is 𝒟(F2𝒞1)={00,11,{01,10}}.

  • Now consider the control μ2:(x41x3,x30x4). That is, the control is the combined action of setting the input from x4 to x3 to 1 and the input from x3 to x4 to 0. This control stabilizes F2𝒞1 at 𝒞2={10}𝒟F2𝒞1, which is not an original attractor of F2𝒞1.

  • Finally, the control μ=μ1,μ2 stabilizes F at 𝒞=𝒞1𝒞2={0110}. Note that 𝒞 is a new attractor for F.

Theorem 3.4 uses the modular structure of a Boolean network to identify controls that stabilize the network in any desired attractor. In biological applications, the attractors typically correspond to distinct biological phenotypes (defined more rigorously in the next section) and a common question is how to force a network to always transition to only one of these phenotypes. For example, cancer biologists may use an appropriate Boolean network model with the two phenotypes proliferation and apoptosis to identify drug targets (i.e., edge or node controls), which force the system to always undergo apoptosis. The following example illustrates this specific control aspect, described in Corollary 3.5

Example 3.10.

Consider again the network Fx1,x2,x3,x4=x2,x1,x2x4,x3=F1F2 from Example 3.9 with F1x1,x2=x2,x1 and F2x3,x4=x4,x3. Suppose we want to stabilize F in 1111, which is an attractor of F (but not the only one). Note that the non-autonomous network F2x1,x2,x3,x4=x2x4,x3 and 𝒟F1={00,11,{01,10}}. Let 𝒞1={11}𝒟F1.

  • The edge control μ1:x11x2 (that is, the control that constantly expresses the edge from x1 to x2) stabilizes F1 at 𝒞1={11}. The space of attractors for F2𝒞1 is then 𝒟(F2𝒞1)={00,11,{01,10}}. Note that x21x1 would be an alternative control.

  • The edge control μ2:x41x3 (that is, the control that constantly expresses the edge from x4 to x3) stabilizes F2𝒞1 at 𝒞2={11}𝒟(F2𝒞1). Again, note that x31x4 would be an alternative control.

  • Now, the control μ=μ1,μ2=(x11x2,x41x3) stabilizes F at 𝒞=𝒞1𝒞2={1111}.

4. Control via Modularity and Canalization

In addition to using the modular structure of the network, we can take advantage of the canalizing structure of the regulatory functions to identify contol targets.

We first review some concepts and definitions, and introduce the concept of canalization.

Definition 4.1.

A Boolean function fx1,,xn is essential in the variable xi if there exists an x{0,1}n such that

fxfxei,

where ei is the ith unit vector. In that case, we also say f depends on xi.

Definition 4.2.

A Boolean function fx1,,xn is canalizing if there exists a variable xi, a Boolean function gx1,,xi1,xi+1,,xn and a,b{0,1} such that

fx1,x2,,xn=b,ifxi=agx1,x2,,xi1,xi+1,,xn,ifxia

In that case, we say that xi canalizes f (to b) and call a the canalizing input (of xi) and b the canalized output.

Definition 4.3.

A Boolean function fx1,,xn is nested canalizing with respect to the permutation σ𝒮n, inputs a1,,an and outputs b1,,bn, if

fx1,,xn=b1xσ(1)a1,b2xσ(1)a1,xσ2=a2,b3xσ(1)a1,xσ2a2,xσ3=a3,bnxσ(1)a1,,xσn1an1,xσn=an,1bnxσ(1)a1,,xσn1an1,xσnan.

The last line ensures that f actually depends on all n variables.

We restate the following stratification theorem for reference.

Theorem 4.4([28]).

Every Boolean function fx1,,xn0 can be uniquely written as

fx1,,xn=M1M2Mr1MrpC+1+1+1+q, (3)

where each Mi=j=1ki(xij+aij) is a nonconstant extended monomial, pC is the core polynomial of f, and k=i=1rki is the canalizing depth. Each xi appears in exactly one of M1,,Mr,pC, and the only restrictions are the following “exceptional cases”:

  1. 1. If pC1 and r1, then kr2;

  2. 2. If pC1 and r=1 and k1=1, then q=0.

When f is not canalizing (i.e., when k=0), we simply have pC=f.

From Equation 3, we can directly derive an important summary statistic of canalizing functions.

Definition 4.5.

Given a Boolean function fx1,,xn represented as in Equation 3, we call the extended monomials Mi the layers of f and define, as in [29], the layer structure as the vector k1,,kr, which describes the number of variables in each layer. Note that f is nested canalizing if and only if k1++kr=n.

Example 4.6.

The Boolean functions fx1,x2,x3,x4=x1¬x2x3x4 and gx1,x2,x3,x4=x1¬x2x3x4 are nested canalizing. f consists of three layers with layer structure (1, 1, 2), while g possesses only two layers and layer structure (1, 3).

While finding the layer structure of a Boolean function is an NP-hard problem, there exist several algorithmic implementations [30].

A phenotype is associated with a group of attractors where a subset of the system’s variables have a shared state. The states of the shared attractors will be called markers of the phenotype.

Suppose F=F1PF2 is a decomposable network, and that there is a phenotype that depends on variables in F2 only (that is, all markers of the phenotype are part of F2 ), and that we wish to control the phenotype through F2. The most straightforward approach is to set the variables that the phenotype depends on to the appropriate values that result in the desired phenotype. However, such intervention may not be experimentally possible. Instead, we can exploit the canalizing properties of the functions corresponding to the nodes connecting the modules F1 and F2 to identify control targets.

Lemma 4.7.

Suppose F=F1PF2 is a decomposable network. Suppose further that only one node xF2 with update function fx is regulated by nodes in F1. If fx is canalizing with r layers, let {1,,r} be the lowest (i.e., most important) layer of fx, which contains nodes from F1. If all regulators of x from F1 appear in the core polynomial, we set =r+1. Then, setting yF1 to its canalizing value decouples the systems F1 and F2, as long as y appears in a layer <.

Proof.

The lemma is a direct consequence of Theorem 4.4. If y receives its canalizing input and is in a more important layer of fx than all variables in F1, then none of these variables can affect fx anymore. Thus, controlling y to receive its canalizing input eliminates the link between F1 and F2. □

Theorem 4.8.

Suppose F=F1P1F2P2Pn1Fn is a decomposable network. If for some i<j,

  1. only one node xFj with update function fx is regulated by nodes in Fi, and

  2. fx is a canalizing function, which possesses none of the variables from Fi in its most important layer, and

  3. the phenotype of interest depends only on variables in Fj and modules that are not “downstream” of Fi in the directed acyclic graph of F (see Defintion 2.12), then the module Fi can be excluded from the control search by setting any node yFi to its canalizing input, as long as this node appears in a more dominant layer of fx than all variables of Fi.

Proof.

By Lemma 4.7, setting y to its canalizing value results in decoupling Fi and Fj. Fi will no longer have any effect on Fj, and thus, due to condition (iii), on the phenotype of interest. Fi can therefore be removed from the control search.▫

Theorem 4.8 is illustrated in Fig. 4 Note that node y can be in Fj or some other module as in the figure.

Figure 4:

Figure 4:

Control via modularity and canalization. Once the network is decomposed into modules F1,,Fn, we could override the effect of module Fi by the using another module (Fk in this case) whose variables are inputs of fx that are located in a higher canalizing hierarchy than the layers containing the variables of Fi.

Remark 4.9.

The method in Theorem 4.8 can be extended to the case when Fi and Fj are connected via multiple nodes. In that case decoupling is achieved through the same procedure presented above, applied to each node in Fj that is regulated by nodes in Fi.

In Theorem 4.8, we assumed that none of the variables of Fi are in the most dominant layer in the update rules of variables in Fj. If some of the variables of Fi are in the most dominant layer, we can still remove module Fi from the control search using an edge control, as shown in the following theorem.

Theorem 4.10.

Suppose F=F1P1F2P2Pn1Fn is a decomposable network. If for some i<j,

  1. only one node xFj with update function fx is regulated by nodes in Fi, and

  2. fx is a canalizing function with some variables from Fi in its most important layer, and

  3. the phenotype of interest depends only on variables in Fj and modules that are not “downstream” of Fi in the directed acyclic graph of F (see Defintion 2.12), then the module Fi can be excluded from the control search by applying an edge control to any input in the most dominant layer of fx.

Proof.

Let yFi such that ysuppfx, and that y is located in the most dominant layer fx. Then, setting y to its canalizing value results in decoupling the subnetworks Fi and Fj. Thus, Fi will no longer have any effect on Fj and thus it can be removed from the control search. □

Remark 4.11.

The method can be extended to the case when Fi and Fj are connected via multiple nodes. In that case decoupling is achieved through the same procedure presented above applied to each node in Fj with regulators from Fi.

To showcase these methods, we will now decompose a published Boolean network model into its modules, and then identify the minimal set of controls for the entire network by exploiting the canalizing structure of the regulatory functions within the modules. The identified set of controls will force the entire system into a desired attractor.

Example 4.12.

We consider a Boolean network model for the blood cancer large granular lymphocyte (T-LGL) leukemia, which was published in [17]. T-LGL leukemia is a clonal hematological disorder characterized by persistent increases of large granular lymphocytes in the absence of reactive cause [31]. The wiring diagram of this model is depicted in Figure 5a. This network has 16 nodes and three nontrivial modules (highlighted by the amber, green, and gray boxes in Figure 5a). The control objective here is to identify control targets that lead the system to programmed cell death. In other words, we aim to direct the system into an attractor that has the marker apoptosis ON.

Figure 5:

Figure 5:

(a) Wiring diagram of the T-LGL model, published in [17], which describes the mechanisms that regulate apoptosis. The non-trivial modules are highlighted by amber, green, and gray boxes. (b) The regulatory inputs of the node DISC. (c) Writing the regulatory function corresponding to node DISC in its standard monomial form (Theorem 4.4) reveals its canalizing structure.

Since the model has three nontrivial modules, the approach described in Section 3 would require us to identify control targets for three modules. However, an exploitation of the canalizing structure and common sense reveals that we do not need to control every module to ensure apoptosis, the desired control objective. First, irrespective of canalization, the module highlighted in gray in Figure 5a does not affect the phenotype apoptosis. Therefore, we can focus on the modules “upstream” of apoptosis (i.e., the amber and green modules in Figure 5a).

In this case, we will apply Theorem 4.10 to identify control targets for this model. We note that the edges from the upstream module (amber box in Figure 5a) to the downstream module (green box in Figure 5a) all end in the node DISC. Therefore, we will investigate the canalizing properties of the regulatory function of DISC (see Figure 5b),

fDISC=Ceramide(FasFLIP¯).

Using the approach described in [30], we find that fDISC has two canalizing layers, L1={Ceramide} and L2={Fas,FLIP} and its layering structure is given by (see Figure 5c)

fDISC=(Ceramide+1)[(Fas)(FLIP+1)+1]+1

We note that the only variable in the most important canalizing layer, Ceramide, is in the upstream module. Thus, we can decouple the modules via an edge control on the connection between the upstream and downstream modules. That is, the constant expression of the edge from Ceramide to DISC will decouple the two modules and will lead to constant expression of DISC. We can check that this control is effective at stabilizing the system in the desired attractor and the control set obviously has minimal size.

In summary, in this example we used an edge control to decouple the upstream and downstream modules and then identified a control target in the downstream module which contains the markers of the phenotype of interest.

5. Conclusion

Model-based control is a mainstay of industrial engineering, and there is a well-developed mathematical theory of optimal control that can be applied to models consisting of systems of ordinary differential equations. While this model type is also commonly used in biology, for instance in biochemical network modeling or epidemiology and ecology, there are many biological systems that are more suitably modeled in other ways. Boolean network models provide a way to encode regulatory rules in networks that can be used to capture qualitative properties of biological networks, when it is unfeasible or unnecessary to determine kinetic information. While they are intuitive to build, they have the drawback that there is very little mathematical theory available that can be used for model analysis, beyond simulation approaches. And for large networks, simulation quickly becomes ineffective.

The results in this paper, building on those in [15], can be considered as a contribution to a mathematical control theory for Boolean networks, incorporating key features of biological networks. There are many open problems that remain, and we hope that this work will inspire additional developments.

Our concrete contributions here are as follows. The modularization method makes the control search far more efficient and allows us to combine controls at the module level obtained with different control methods. For example, methods based on computational algebra [25, 27] can identify controllers that can create new (desired) steady states, which other methods cannot. Feedback vertex set [32, 24] is a structure-based method that identifies a subset of nodes whose removal makes the graph acyclic. Stable motifs [23] are based on identifying strongly connected subgraphs in the extended graph representation of the Boolean network. Other control methods include [33, 34, 35]. We can use any combination of these methods to identify the controls in each module.

6. Acknowledgments

Author Matthew Wheeler was supported by The American Association of Immunologists through an Intersect Fellowship for Computational Scientists and Immunologists. This work was further supported by the Simons foundation [grant numbers 712537 (to C.K.), 850896 (to D.M.), 516088 (to A.V.)]; the National Institute of Health [grant number 1 R01 HL169974-01 (to R.L.)]; and the Defense Advanced Research Projects Agency [grant number HR00112220038 (to R.L.)]. The authors also thank the Banff International Research Station for support through its Focused Research Group program during the week of May 29, 2022 (22frg001), which was of great help in framing initial ideas of this paper.

References

  • [1].Aguilar Boris, Gibbs David L, Reiss David J, McConnell Mark, Danziger Samuel A, Dervan Andrew, Trotter Matthew, Bassett Douglas, Hershberg Robert, Ratushny Alexander V, et al. A generalizable data-driven multicellular model of pancreatic ductal adenocarcinoma. GigaScience, 9(7):giaa075, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [2].Plaugher Daniel and Murrugarra David. Modeling the pancreatic cancer microenvironment in search of control targets. Bulletin of Mathematical Biology, 83(11):1–26, 2021. [DOI] [PubMed] [Google Scholar]
  • [3].Rozum Jordan and Albert Réka. Leveraging network structure in nonlinear control. NPJ systems biology and applications, 8(1):36, 2022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Plaugher Daniel and Murrugarra David. Phenotype control techniques for boolean gene regulatory networks. Bulletin of Mathematical Biology, 85(10):1–36, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [5].Schwab Julian D, Kühlwein Silke D, Ikonomi Nensi, Kühl Michael, and Kestler Hans A. Concepts in boolean network modeling: What do they all mean? Computational and structural biotechnology journal, 18:571–582, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [6].Veliz-Cuba Alan, Arthur Joseph, Hochstetler Laura, Klomps Victoria, and Korpi Erikka. On the relationship of steady states of continuous and discrete models arising from biology. Bulletin of mathematical biology, 74:2779–2792, 2012. [DOI] [PubMed] [Google Scholar]
  • [7].Singh Vidisha, Naldi Aurelien, Soliman Sylvain, and Niarakis Anna. A large-scale boolean model of the rheumatoid arthritis fibroblast-like synoviocytes predicts drug synergies in the arthritic joint. NPJ systems biology and applications, 9, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Aghamiri Sara Sadat, Singh Vidisha, Naldi Aurélien, Helikar Tomáš, Soliman Sylvain, and Niarakis Anna. Automated inference of boolean models from molecular interaction maps using casq. Bioinformatics, 36(16):4473–4482, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Veliz-Cuba A.. Reduction of Boolean network models. Journal of Theoretical Biology, 289:167–172, 2011. [DOI] [PubMed] [Google Scholar]
  • [10].Saadatpour Assieh, Albert Réka, and Reluga Timothy. A reduction method for boolean network models proven to conserve attractors. SIAM Journal on Applied Dynamical Systems, 12:1997–2011, 01 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [11].Kashtan Nadav and Alon Uri. Spontaneous evolution of modularity and network motifs. Proceedings of the National Academy of Sciences, 102(39):13773–13778, 2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Hartwell Leland H, Hopfield John J, Leibler Stanislas, and Murray Andrew W. From molecular to modular cell biology. Nature, 402(Suppl 6761):C47–C52, 1999. [DOI] [PubMed] [Google Scholar]
  • [13].Kitano Hiroaki. Biological robustness. Nature Reviews Genetics, 5(11):826–837, 2004. [DOI] [PubMed] [Google Scholar]
  • [14].Lorenz Dirk M, Jeng Alice, and Deem Michael W. The emergence of modularity in biological systems. Physics of life reviews, 8(2):129–160, 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [15].Kadelka Claus, Wheeler Matthew, Alan Veliz-Cuba David Murrugarra, and Laubenbacher Reinhard. Modularity of biological systems: a link between structure and function. Journal of the Royal Society Interface, 20(207):20230505, 2023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Kadelka Claus, Butrie Taras-Michael, Hilton Evan, Kinseth Jack, Schmidt Addison, and Serdarevic Haris. A meta-analysis of boolean network models reveals design principles of gene regulatory networks. Science Advances, 10(2):eadj0822, 2024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [17].Saadatpour Assieh, Wang Rui-Sheng, Liao Aijun, Liu Xin, Loughran Thomas P, Albert István, and Albert Réka. Dynamical and structural analysis of a t cell survival network identifies novel candidate therapeutic targets for large granular lymphocyte leukemia. PLoS Comput Biol, 7(11):e1002267, Nov 2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Conrad Hal Waddington. The strategy of the genes. Routledge, 2014. [Google Scholar]
  • [19].Kauffman S A. Metabolic stability and epigenesis in randomly constructed genetic nets. J Theor Biol, 22(3):437–67, Mar 1969. [DOI] [PubMed] [Google Scholar]
  • [20].Plaugher Daniel, Aguilar Boris, and Murrugarra David. Uncovering potential interventions for pancreatic cancer patients via mathematical modeling. Journal of theoretical biology, 548:111197, 2022. [DOI] [PubMed] [Google Scholar]
  • [21].Choi Minsoo, Shi Jue, Jung Sung Hoon, Chen Xi, and Cho Kwang-Hyun. Attractor landscape analysis reveals feedback loops in the p53 network that control the cellular response to dna damage. Science signaling, 5(251):ra83–ra83, 2012. [DOI] [PubMed] [Google Scholar]
  • [22].Wooten David J, Zañudo Jorge Gómez Tejeda, Murrugarra David, Perry Austin M, Dongari-Bagtzoglou Anna, Laubenbacher Reinhard, Nobile Clarissa J, and Albert Réka. Mathematical modeling of the candida albicans yeast to hyphal transition reveals novel control strategies. PLoS computational biology, 17(3):e1008690, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [23].Zanudo Jorge GT and Albert Réka. Cell fate reprogramming by control of intracellular network dynamics. PLoS computational biology, 11(4):e1004193, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [24].Zañudo Jorge Gomez Tejeda, Yang Gang, and Albert Réka. Structure-based control of complex networks with nonlinear dynamics. Proceedings of the National Academy of Sciences, 114(28):7234–7239, 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [25].Murrugarra David, Veliz-Cuba Alan, Aguilar Boris, and Laubenbacher Reinhard. Identification of control targets in boolean molecular network models via computational algebra. BMC systems biology, 10:1–11, 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [26].Murrugarra D and Dimitrova ES. Molecular network control through boolean canalization. eurasip j. Bioinformatics Syst. Biol, 9, 2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Vieira Luis Sordo, Laubenbacher Reinhard C, and Murrugarra David. Control of intracellular molecular networks using algebraic methods. Bulletin of mathematical biology, 82(1):2, 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].He Qijun and Macauley Matthew. Stratification and enumeration of boolean functions by canalizing depth. Physica D: Nonlinear Phenomena, 314:1–8, 2016. [Google Scholar]
  • [29].Kadelka Claus, Kuipers Jack, and Laubenbacher Reinhard. The influence of canalization on the robustness of boolean networks. Physica D: Nonlinear Phenomena, 353:39–47, 2017. [Google Scholar]
  • [30].Dimitrova Elena, Stigler Brandilyn, Kadelka Claus, and Murrugarra David. Revealing the canalizing structure of boolean functions: Algorithms and applications. Automatica, 146:110630, 2022. [Google Scholar]
  • [31].Rashid Anila, Khurshid Mohammad, and Ahmed Arsalan. T-cell large granular lymphocytic leukemia: 4 cases. Blood research, 49(3):203–205, 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Mochizuki Atsushi, Fiedler Bernold, Kurosawa Gen, and Saito Daisuke. Dynamics and control at feedback vertex sets. ii: A faithful monitor to determine the diversity of molecular activities in regulatory networks. Journal of theoretical biology, 335:130–146, 2013. [DOI] [PubMed] [Google Scholar]
  • [33].Choo Sang-Mok, Ban Byunghyun, Joo Jae Il, and Cho Kwang-Hyun. The phenotype control kernel of a biomolecular regulatory network. BMC systems biology, 12(1):1–15, 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Cifuentes-Fontanals Laura, Tonello Elisa, and Siebert Heike. Control in boolean networks with model checking. Frontiers in Applied Mathematics and Statistics, 8, 2022. [Google Scholar]
  • [35].Borriello Enrico and Daniels Bryan C. The basis of easy controllability in boolean networks. Nature communications, 12(1):1–15, 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from ArXiv are provided here courtesy of arXiv

RESOURCES