Double and multiple knockout simulations for genome-scale metabolic network reconstructions

Yaron AB Goldstein; Alexander Bockmayr

doi:10.1186/s13015-014-0028-y

. 2015 Jan 9;10:1. doi: 10.1186/s13015-014-0028-y

Double and multiple knockout simulations for genome-scale metabolic network reconstructions

Yaron AB Goldstein ^1,^✉, Alexander Bockmayr ¹

PMCID: PMC4302510 PMID: 25649004

Abstract

Background

Constraint-based modeling of genome-scale metabolic network reconstructions has become a widely used approach in computational biology. Flux coupling analysis is a constraint-based method that analyses the impact of single reaction knockouts on other reactions in the network.

Results

We present an extension of flux coupling analysis for double and multiple gene or reaction knockouts, and develop corresponding algorithms for an in silico simulation. To evaluate our method, we perform a full single and double knockout analysis on a selection of genome-scale metabolic network reconstructions and compare the results.

Software

A prototype implementation of double knockout simulation is available at http://hoverboard.io/L4FC.

Keywords: Constraint-based modeling, Metabolic network, Flux coupling analysis, Reaction knockout, Gene knockout

Background

Constraint-based modeling has become a widely used approach for the analysis of genome-scale reconstructions of metabolic networks [1]. Given a set of metabolites Inline graphic and a set of reactions , the metabolic network is represented by its stoichiometric matrix $S \in R^{M \times R}$ , and a subset of irreversible reactions $Irr \subseteq R$ . The flux cone $C = {v \in R^{R} ∣ Sv = 0, v_{r} \geq 0, r \in Irr}$ contains all steady-state flux vectors satisfying the stoichiometric and thermodynamic irreversibility constraints. Based on this flux cone, many analysis methods have been proposed over the years (see e.g. [2] for an overview). Flux Balance Analysis (FBA) [3,4] solves a linear program (LP) max{z(v)∣Sv=0,l≤v≤u} over the (truncated) flux cone in order to predict how efficiently an organism can realize a certain biological objective, represented by the linear objective function z(v). For example, one may compute the maximal biomass production rate under some specific growth conditions. Flux Coupling Analysis (FCA) [5,6] studies dependencies between reactions. Here the question is whether or not for all steady-state flux vectors v∈C, zero flux v_r=0 through some reaction r implies zero flux v_s=0, for some other reaction s.

Knockout analysis has become an important technique for the study of metabolic networks and in metabolic engineering. Starting from flux balance analysis (FBA), various in silico screening methods for genetic modifications have been developed, see [7,8] for an overview. On the one hand, complete methods have been proposed, which systematically explore all possible knockout sets up to a given size, e.g. [9,10]. On the other hand, there exist heuristic algorithms such as [11-14], which may be considerably faster, but in general are not complete. Klamt et al. [15-17] developed the related concept of minimal cut sets, which are (inclusion-wise) minimal sets of reactions whose knockout will block certain undesired flux distributions while maintaining others.

Recent progress in the development of algorithms for flux coupling analysis (FCA) [6,18] may lead to a different approach. FCA [5] describes the impact of each possible single reaction knockout in a metabolic network. It analyzes which other reactions become blocked after removing one reaction (“directional coupling”), and which reactions are always active together (“partial coupling”). As we will see, using flux coupling information inside a double or multiple knockout simulation may significantly reduce the search space, without loosing any information.

In this paper, we present an algorithmic framework for double and multiple knockouts in qualitative models of metabolic networks. We will use a lattice-theoretic approach [18], which includes classical constraint-based models at steady-state as a special case, but which is much more general. We illustrate and evaluate our method by computing full double knockout simulations on a selection of genome-scale metabolic network reconstructions. In particular, we compare the impact of single vs. double reaction knockouts on the other reactions in the network. We also show how our method can be extended to gene (in contrast to reaction) knockouts, and provide computational results for both cases.

Our algorithms are based on an efficient search for the maximal element in suitably defined lattices [18]. To simulate all double or multiple reaction knockouts, we describe a method to select a subset of the reactions as representatives for the whole system. More precisely, we partition the reaction set in equivalence classes of partially coupled reactions. This enables us to obtain the information about all possible double or multiple reaction knockouts much faster and to store the results in a compact format.

The approach developed in this paper is a qualitative method. We do not measure the quantitative impact of knockout sets on the cellular growth rate (or other metabolic fluxes) as this would be done in an FBA approach. Instead, we count how many reactions become blocked by a knockout, similar to the flux balance impact degree introduced in [19]. However, even though we do not apply FBA to evaluate the impact of a knockout, the idea of working with representatives for reaction classes via partial coupling could also be applied in an FBA context. Thus, studies like [20] and even MILP-based approaches like [21] might benefit from this method.

Methods

Reaction coupling in the context of knockout analysis

We start from a metabolic network $N = (M, R, S, Irr)$ given by a set of metabolites Inline graphic , a set of reactions , a stoichiometric matrix $S \in R^{M \times R}$ , and a set of irreversible reactions $Irr \subseteq R$ , see Figure 1 for an example. The set $C = {v \in R^{R} ∣ Sv = 0, v_{r} \geq 0, r \in Irr}$ of all flux vectors $v \in R^{R}$ satisfying the steady-state (mass balance) constraints Sv=0 and the thermodynamic irreversibility constraints v_r≥0, for all r∈Irr, is called the steady-state flux cone. A reaction $s \in R$ is called blocked if v_s=0, for all v∈C, otherwise s is unblocked. Two unblocked reactions r,s are called directionally coupled [5], written $r \overset{= 0}{\to} s$ , if for all v∈C, v_r=0 implies v_s=0. A possible biological interpretation is that the reactions directionally coupled to r are those reactions that will become blocked by knocking out the reaction r.

**Example network with corresponding lattice and coupling relations.** The network contains the set of metabolites $R = {A, B, C, D}$ and the set of reactions $R = {1, 2, 3, 4, 5, 6}$ . We assume that all coefficients s _mr of the stoichiometric matrix S belong to {0,+1,−1}. Thus, reaction 2 has the stoichiometry s _A2=−1,s _B2=s _C2=1 and s _D2=0. The set of irreversible reactions is $Irr = R ∖ {1}$ . A possible flux vector satisfying the steady-state condition S v=0 is v=(0,1,1,2,1,1) with supp v={2,3,4,5,6}. The corresponding lattice contains the trivial element ∅ representing the vector v=0 and the minimal (non-trivial) elements {1,2,3,4},{1,4,5,6} and {2,3,4,5,6}. The maximal element is {1,2,3,4,5,6}, i.e., there is no blocked reaction. a) There are two pairs of partially coupled reactions, namely 2⇔3 and 5⇔6. Therefore, no knockout sets containing reaction 3 or 5 need to be analysed. The impact of a double knockout of {3,r} will be the same as for {2,r}. b) Reaction 1 is coupled to reaction 4. Thus, a double knockout of {1,4} will have the same effect as the simple knockout of 4. In both cases, all reactions {1,2,3,4,5,6} get blocked.

To determine which reactions are coupled, a simple approach would be to solve for each pair of reactions (r,s) two linear programs (LPs) and to check whether max {v_s | v ∈C,v_r=0}= min {v_s | v ∈C,v_r=0}=0. During the last years, efficient flux coupling algorithms have been developed [6,18] that drastically reduce the number of LPs to be solved, so that that genome-wide metabolic network reconstructions can now be analyzed in a few minutes on a desktop computer (compared to a couple of days of running time before).

Whether reactions are blocked or coupled does not depend on the specific flux values. It only matters whether or not v_r=0 resp. v_s=0. In this sense, flux coupling is a qualitative property that can be analysed by studying the set L^C={supp v | v ∈C} of all supports of flux vectors v∈C, where $supp v = {r \in R | v_{r} \neq 0}$ . Each element a∈L^C is the set of active reactions of some flux vector v∈C. Therefore, we can interpret L^C as the set of all possible reaction sets or pathways in the flux cone C. Since L^C does not contain any information about specific flux values, we also speak of a qualitative model of the metabolic network Inline graphic .

In [18,22], we have shown that flux coupling analysis can be extended to much more general qualitative models, where the space of possible pathways $L \subseteq 2^{R}$ can be any non-empty subset of the power set $2^{R}$ , e.g. L={supp v∣v∈C,v satisfies thermodynamic loop law constraints}. The definition of flux coupling needs only be slightly modified in order to be applicable to these qualitative models. A reaction $t \in R$ is called blocked in L if and only if for all a∈L, we have t∉a. For reactions $r, s \in R$ that are unblocked in L, we define $r \overset{= 0}{\to} s$ in L, if for all a∈L, r∉a implies s∉a. To distinguish between the original flux coupling and its qualitative extension, we will call the latter reaction coupling from now on.

The goal of this paper is to study more general dependencies between reactions, where the flux through some reaction has to be zero, if the flux through two or more other reactions is zero.

Definition1 (Joint reaction coupling).

Given a qualitative model $L \subseteq 2^{R}$ of a metabolic network Inline graphic , let $r, s, t \in R$ be unblocked reactions in L such that neither $r \overset{= 0}{\to} t$ in L nor $s \overset{= 0}{\to} t$ in L holds. We say t is jointly coupled to the pair {r,s} in L, written $\{r, s\} \overset{= 0}{\to} t$ in L, if for all a∈L, r∉a and s∉a implies t∉a.

More generally, given a set $K \subseteq R$ of unblocked reactions in L, we say that t is jointly coupled to Inline graphic in L, written $K \overset{= 0}{\to} t$ in L, if for all a∈L, $a \cap K = \emptyset$ implies t∉a, and $K^{'} \overset{= 0}{\to} t$ in L does not hold for any $\emptyset \neq K^{'} ⫋ K$ .

Note that in the definition of the joint coupling $\{r, s\} \overset{= 0}{\to} t$ in L, we require that the simple couplings $r \overset{= 0}{\to} t$ in L and $s \overset{= 0}{\to} t$ in L both do not hold. Thus, joint coupling is about the synergistic effect of a pair of reactions r,s on some other reaction t, which cannot be obtained by either r or s alone. Similarly, $K \overset{= 0}{\to} t$ in L can only hold if $K^{'} \overset{= 0}{\to} t$ in L does not hold, for any smaller knockout set $\emptyset \neq K^{'} ⫋ K$ .

Lattices and maximal elements

In [18], we presented a generic algorithm for flux coupling analysis in qualitative models. This algorithm determines the pairs of coupled reactions by computing the maximal element in suitably defined lattices.

A family of reaction sets $L \subseteq 2^{R}$ is a (finite) lattice if ∅∈L and for all a₁,a₂∈L, we have a₁∪a₂∈L. The biological interpretation of this property is that the combination of two metabolic pathways is again a pathway. In [18] we showed that L^C is a lattice. Any finite lattice L has a unique maximal element 1_L (w.r.t. set inclusion), which is simply the union of all lattice elements, i.e., $1_{L} = ⋃_{a \in L} a$ . For any subset of reactions $K \subseteq R$ , we may define the family

L_{⊥ K} = \{a \in L | a \cap K = \emptyset\}

called Lwithout Inline graphic of those reaction sets a∈L that do not contain any reaction in . If L is a lattice, then $L_{⊥ K}$ is a lattice again, and thus it has a maximal element

1_{L_{⊥ K}} = ⋃_{a \in L, a \cap K = \emptyset} a.

Given any lattice $L \subseteq 2^{R}$ , we have shown in [18] that a reaction $r \in R$ is unblocked in L if and only if r∈1_L. For two unblocked reactions r,s∈1_L, the coupling relation $r \overset{= 0}{\to} s$ in L holds if and only if $s \notin 1_{L_{⊥ {r}}}$ . In [18], we also presented an efficient algorithm to compute 1_L and $1_{L_{⊥ {r}}}$ . Once these maximal elements have been found, one can immediately determine the blocked and coupled reactions.

In this paper, we generalize these results to joint couplings. We present a method to compute the effects of double (resp. multiple) reaction knockouts based on the maximal element $1_{L_{⊥ {r, s}}}$ (resp. $1_{L_{⊥ K}}$ ).

Proposition1.

If $L \subseteq 2^{R}$ is a lattice, then for any unblocked reactions r,s,t∈1_L we have

{r, s} \overset{= 0}{\to} t in L if and only if t \in (1_{L_{⊥ {r}}} \cap 1_{L_{⊥ {s}}}) ∖ 1_{L_{⊥ {r, s}}} .

More generally, for a set of unblocked reactions $K \subseteq 1_{L}$ , we have

K \overset{= 0}{\to} t in L if and only if t \in (⋂_{k \in K} 1_{L_{⊥ K ∖ \{k\}}}) ∖ 1_{L_{⊥ K}} .

Proof.

We prove only the first part. The second part follows by induction.

Assume ${r, s} \overset{= 0}{\to} t$ in L. By definition, we know t∉a for all $a \in L_{⊥_{\{r, s\}}}$ , and therefore $t \notin 1_{L_{⊥ {r, s}}}$ . If ${r, s} \overset{= 0}{\to} t$ in L, we also know that neither $r \overset{= 0}{\to} t$ in L nor $s \overset{= 0}{\to} t$ in L and that all three reactions are unblocked, i.e., r,s,t∈1_L. As discussed in [18], we have $r \overset{= 0}{\to} t$ in L if and only if $t \in 1_{L} ∖ 1_{L_{⊥ {r}}}$ . Since t∈1_L, we conclude $t \in 1_{L_{⊥ {r}}}$ , and by the same argument $t \in 1_{L_{⊥ {s}}}$ . Hence, $t \in (1_{L_{⊥ {r}}} \cap 1_{L_{⊥ {s}}}) ∖ 1_{L_{⊥ {r, s}}}$ .

If $t \in (1_{L_{⊥ {r}}} \cap 1_{L_{⊥ {s}}}) ∖ 1_{L_{⊥ {r, s}}}$ holds, then $t \notin 1_{L_{⊥ {r, s}}}$ , which implies t∉a for all a∈L_⊥{r,s}. Since $t \in 1_{L_{⊥ {r}}} \cap 1_{L_{⊥ {s}}}$ , we can again apply [18] to see that $r \overset{= 0}{\to} t$ in L and $s \overset{= 0}{\to} t$ in L do not hold. Finally, since r,s,t∈1_L are unblocked, we get $\{r, s\} \overset{= 0}{\to} t$ in L.

In [22], we considered even more general qualitative models $\emptyset \neq P \subseteq 2^{R}$ , where P needs not be a lattice. We showed there that qualitative flux coupling analysis can be done in the lattice L^P=〈P〉 that is generated by P. The results we will present in this paper would be applicable to those qualitative models P as well, but for simplicity we will continue to work with models L that are lattices.

Classes of partially coupled reactions

To determine joint coupling relations $K \overset{= 0}{\to} t$ in L, we will use as much as possible the information that can be obtained from standard couplings $r \overset{= 0}{\to} s$ in L, i.e., with normal FCA. If $r \overset{= 0}{\to} s$ in L, any pathway a∈L that does not use reaction r will also not use reaction s. Thus, knocking out s in addition to r will not affect the system, i.e., {a∈L | r,s∉a}={a∈L | r∉a}.

Additional improvements can be obtained by looking at partially coupled reactions. Two unblocked reactions r,s∈1_L are called partially coupled in the lattice L, written r⇔s, if both $r \overset{= 0}{\to} s in L$ and $s \overset{= 0}{\to} r in L$ . The relation ⇔ is reflexive, transitive and symmetric, and thus an equivalence relation. Any equivalence relation defines a partition of its ground set into equivalence classes. In our case, $1_{L} = ⋃_{r \in 1_{L}} {[r]}_{\Leftrightarrow}$ , where [r]_⇔={s∈1_L | r⇔s}. An equivalence class can be represented by any of its elements, i.e., ${[r]}_{\Leftrightarrow} = {[\tilde{r}]}_{\Leftrightarrow}$ if $r \Leftrightarrow \tilde{r}$ . By selecting one element from each equivalence class, we get a set of representatives Rep⊆1_L that covers all unblocked reactions, i.e., $1_{L} = ⋃_{r \in Rep} {[r]}_{\Leftrightarrow}$ . We will call [ r]_⇔ the coupling class or reaction class of reaction r. Biologically, coupling classes can be interpreted as subsets of reactions that are always active together, similarly to the notion of enzyme subsets in [23].

For $r, \tilde{r} \in {[r]}_{\Leftrightarrow}$ and a∈L, we have r∈a if and only if $\tilde{r} \in a$ . Thus, a knockout of r has the same impact as a knockout of $\tilde{r}$ . Furthermore, r can only be blocked by another knockout k∉[r]_⇔ if the same holds for $\tilde{r}$ , i.e., $k \overset{= 0}{\to} r$ in L if and only if $k \overset{= 0}{\to} \tilde{r}$ in L. It follows that to analyse the effect of a knockout pair $\{\tilde{r}, \tilde{s}\}$ , one can instead knockout the corresponding representatives {r,s} with $\tilde{r} \in {[r]}_{\Leftrightarrow}$ in L and $\tilde{s} \in {[s]}_{\Leftrightarrow}$ . To simulate all double knockouts, one does not have to check all pairs $\{\{\tilde{r}, \tilde{s}\} | \tilde{r}, \tilde{s} \in 1_{L}\}$ , but it is enough to iterate over a fixed set of representatives: {{r,s}|r,s∈Rep}, see Figure 1a) for illustration. As we will see, for many genome-scale network reconstructions, there are only about half as many different equivalence classes as there are unblocked reactions (Table 1). Thus, only about 1/4 of all original pairs need to be checked. As mentioned before, although we apply this compression to reaction coupling analysis, it could also be combined with FBA-based methods.

Table 1.

Knockout impact on different networks

Model			Single KOs	Double KOs
	ub	classes	Impact	Impact	ratio
E. coli iJO1366	1718	1078	4.51 (16.6)	4.41 (10.1)	1.0%
E. coli iAF1260	1543	975	4.12 (13.7)	4.04 (9.24)	0.8%
S. cerevisiae iND750	631	371	5.42 (14.6)	5.52 (10.3)	2.7%
M. tuberculosis iNJ661	744	370	4.74 (35.6)	1.99 (5.78)	5.1%
S. aureus iSB619	465	207	11.7 (44.9)	7.31 (17.2)	9.2%
H. pylori iIT341	436	150	6.65 (58.6)	4.71 (15.5)	9.7%
E. coli textbook	87	55	1.96 (3.58)	15.7 (24.5)	12%

Open in a new tab

ub: Number of unblocked reactions in the original network.

classes: Number of different reaction classes, i.e., equivalence classes w.r.t. partial coupling ⇔.

Single KOs impact: Average impact of single reaction knockouts, i.e., average number of reactions classes that become blocked by a single knockout. In brackets: Average number of reactions that become blocked (belonging to different reaction classes).

Double KOs impact: Average additional impact of double reaction knockouts, i.e., average number of reactions classes that become blocked by a double knockout {r, s}, but are not blocked by a single knockout of either r or s. In brackets: Average number of additional reactions that become blocked.

Double KOs ratio: Percentage of pairs of (uncoupled) reaction classes that have joint coupling effects. The average numbers are determined by $\frac{1}{| K |} \sum_{κ \in K} impact (κ)$ with K =Rep for the single, and K = {{r, s} | r, s ∈Rep with neither r $\overset{= 0}{\to}$ s in L nor s $\overset{= 0}{\to}$ r for the double knockouts.

Algorithms

In [18], we introduced an algorithm that performs flux coupling analysis by computing maximal elements of suitably defined finite lattices $\tilde{L}$ (see also the section above on lattices and maximal elements). The basic ingredient of this algorithm is a method that checks if a given reaction $r \in R$ is blocked in $\tilde{L}$ , and if not returns a pathway $a \in \tilde{L}$ with r∈a. The maximal element $1_{\tilde{L}}$ of $\tilde{L}$ is computed by improving lower and upper bounds $lb, ub \in \tilde{L}$ with $lb \subseteq 1_{\tilde{L}} \subseteq ub$ . In each step of the algorithm, either lb is increased or ub is decreased, until finally $lb = ub = 1_{\tilde{L}}$ . The following Algorithm 1 is an extension of this method. It allows finding all the reactions in Inline graphic that are unblocked after a multiple knockout $K \subseteq 1_{L}$ .

graphic file with name 13015_2014_28_Fige_HTML.gif

As discussed in [18], the flexibility of the lattice-based approach comes from hiding the search for specific pathways in a separate function FindPath. For traditional steady-state based models, FindPath can be realized by solving the linear programs $max {\pm v_{t} | Sv = 0, v_{Irr} \geq 0, v_{k} = 0, k \in K}$ . But, one can also use other modeling hypotheses and corresponding algorithmic methods (see [22] for the example of thermodynamic loop law constraints). The skeleton of Algorithm 1 will remain the same, only the auxiliary function FindPath has to be changed.

In Algorithm 1, we perform a multiple knockout analysis with a fixed knockout set Inline graphic . For a full d-dimensional knockout analysis, we would have to iterate over all $K \subseteq 1_{L}$ with $| K | = d$ , i.e., we would have to run the algorithm $O ((\binom{| R |}{d}))$ times. In each iteration, we have to solve $O (| R |)$ linear programs. Since linear programming can be done in polynomial time, full d-dimensional knockout analysis is still polynomial (for fixed d), but computationally very expensive as soon as d>2. However, we can still use the partition of 1_L into equivalence classes of partially coupled reactions. Thus, our next Algorithm 2 calculates representatives of all jointly coupled reactions in the case of double knockouts.

graphic file with name 13015_2014_28_Figf_HTML.gif

In Algorithm 2, we iterate over a subset of all possible double knockouts without loosing any information. For this, we filter redundant knockout pairs such as $r \overset{= 0}{\to} s$ in L (by checking $s \in 1_{L_{⊥ {r}}}$ ). It is unnecessary to test such a pair, because a knockout of {r,s} is equivalent to the single knockout of r, see Figure 1b) for illustration. For higher-dimensional knockout sets one can proceed in a similar fashion:

Let $K = \{k_{1}, \dots, k_{d}\} \subseteq Rep$ be a d-dimensional knockout set. Then we do not need to test Inline graphic , if any of the following conditions is fulfilled:

$k_{i} \overset{= 0}{\to} k_{j} in L$ for two reactions $k_{i}, k_{j} \in K$ ,
$\{k_{i_{1}}, k_{i_{2}}\} \overset{= 0}{\to} k_{j} in L$ for three reactions $k_{i_{1}}, k_{i_{2}}, k_{j} \in K$ ,
$\{k_{i_{1}}, k_{i_{2}}, k_{i_{3}}\} \overset{= 0}{\to} k_{j} in L$ for four reactions $k_{i_{1}}, k_{i_{2}}, k_{i_{3}}, k_{j} \in K$ ,
etc.

Standard FCA finds all pairs of reactions that are directionally coupled. This allows us to iterate in Algorithm 2 over all ${r, s} \in K_{2, 1}$ with

K_{2, 1} = \{\{k_{1}, k_{2}\} \subseteq Rep | not k_{1} \overset{= 0}{\to} k_{2} in L and not k_{2} \overset{= 0}{\to} k_{1} in L\} .

$K_{2, 1}$ contains all 2-tuples of coupling class representatives that are not coupled with respect to knockouts up to cardinality 1.

If one is interested to perform a full triple knockout analysis and joint coupling information is available, one can adapt the filtering technique and iterate over all ${r_{1}, r_{2}, r_{3}} \in K_{3, 1}$ (or K_3,2) with

\begin{array}{lcr} K_{3, 1} & = & \{\{k_{1}, k_{2}, k_{3}\} \subseteq Rep | not k_{i} \overset{= 0}{\to} k_{j} in L, \\ for all i \neq j \in {1, 2, 3}\}, \\ K_{3, 2} & = & \{\{k_{1}, k_{2}, k_{3}\} \subseteq Rep | not k_{i_{1}} \overset{= 0}{\to} k_{j} in L \\ and not \{k_{i_{1}}, k_{i_{2}}\} \overset{= 0}{\to} k_{j} in L, for all pairwise \\ different i_{1}, i_{2}, j \in {1, 2, 3}\} . \end{array}

$K_{3, 1}$ contains all 3-tuples of coupling class representatives that are not directionally coupled, and $K_{3, 2}$ all triples that do not contain reactions that are coupled with respect to knockouts up to cardinality 2. Similarly one could define $K_{d, m}$ .

While these techniques are applied here only to reaction coupling analysis, they could also be combined with FBA-based methods. Thus, if one is interested to measure the impact of all possible triple knockouts on FBA, it would be sufficient to solve $max \{v_{biomass} | Sv = 0, v_{Irr} \geq 0, v_{K} = 0\}$ for all $K \in K_{3, 1}$ (if only FCA data is available) or all $K \in K_{3, 2}$ (if FCA and joint coupling data is available).

The case of gene knockouts

Often metabolic networks contain regulatory rules for the gene products that catalyze the reactions, e.g. reaction r₁ is catalyzed by the product of a gene g₁ and reaction r₂ is catalyzed by the gene product of g₁ or g₂. Here r₁ is only possible if g₁ is active, and r₂ can only be blocked by a simultaneous knockout of the two genes g₁ and g₂. Typically, there is no 1-1 relationship between the set of genes Inline graphic and the set of reactions . On the one hand, there are reactions that only get blocked by a combination of two or more gene knockouts, as indicated above in r₂≡g₁∨g₂. On the other hand, the knockout of a single gene $g \in G$ may block more than one reaction. For example, reactions r₁ and r₃ may both depend on the gene g₁. Then one immediately gets that a knockout of g₁ implies v₁=v₃=0. Let us further assume that FCA and double reaction knockout analysis have been performed, leading to $3 \overset{= 0}{\to} 4$ in L and $\{1, 3\} \overset{= 0}{\to} 6$ in L. Based on this information, we can extend the reactions that are blocked by the knockout of gene g₁ to v₁=v₃=v₄=v₆=0. Thus, in this example we have 2 reactions (r₁,r₃) that are associated to the geneg₁ based on information that is directly available in the network reconstruction, but in total 4 reactions (r₁,r₃,r₄,r₆) that are coupled to the geneg₁. We formalize these notions in the following definition.

Definition2 (Gene coupling).

Consider a qualitative model $L \subseteq 2^{R}$ of a metabolic network Inline graphic with reaction set and gene set . Let $α : 2^{G} \to 2^{R}, Γ \mapsto K_{Γ}$ be a function defining a set of reactions $K_{Γ}$ associated to the knockout of all genes in the set Γ. For an unblocked reaction r∈1_L and $Γ \subseteq G$ we define:

Γ \overset{= 0}{\to} r in L if and only if r \notin 1_{L_{⊥ K_{Γ}}} .

We say that the reaction r is coupled to the gene knockout Γ. If Γ={g} is a single gene, we simply write $g \overset{= 0}{\to} r$ in L.

Given the function $α : 2^{G} \to 2^{R}$ , we can determine the reactions coupled to the gene set Γ by applying Algorithm 1 to the set of associated reactions $K_{Γ}$ . Note that the definition of gene coupling slightly differs from the one of joint reaction coupling. Here, we do not exclude reactions that are already knocked out by single (or smaller set of) gene knockouts. This is to account for the possibility that, for example, a reaction r may be associated to a single gene knockout g₁, but not to the double knockout {g₁,g₂} (assume r≡g₁∨¬g₂).

To simulate the impact of all single gene knockouts, one can perform an iteration over all genes $g \in G$ . Similarly, one can determine all double gene knockout effects by an iteration over all pairs of genes ${g_{1}, g_{2}} \subseteq G$ . However, in contrast to Algorithm 2, we cannot use gene class representatives to decrease the number of pairs that have to be analyzed.

Results and discussion

To evaluate our method, we simulated all single and double reaction knockouts for a number of genome-scale metabolic network reconstructions from the BiGG-database [24]. The computations were done on a MacBook Air (2012), with 1.8 GHz Intel Core i5, 4GB RAM, and running Java Oracle JDK 1.7.45 under Mac OS X 10.9. To solve linear programs (LPs), we used CPLEX Version 12.6.

Impact of double knockouts

Table 1 shows the impact of single and double reaction knockouts for the different networks. In most cases, the knockout of a single reaction class (due to the knockout of one or more of its reactions) blocks the reactions in 4 to 5 other reaction classes in average. The least robust system is S. aureus iSB619, where a single knockout has an average impact of almost 12 coupled reaction classes. In S. aureus iSB619, about 9.2% of all possible double knockouts {r,s} have joint coupling effects, i.e., there exist reactions $t \in R$ that are blocked by the double knockout {r,s}, but not by a single knockout of r or s alone. This is a comparatively large number. For the bigger E. coli models iAF1260 and iJO1366, only around 1% of all double knockouts of two uncoupled reaction classes {r,s} have an impact that exceeds the effects of the corresponding two single knockouts. In S. aureus iSB619, double knockouts also have very strong combined effects. In addition to the reaction classes that would be knocked out by r or s alone, in average more than 7 reaction classes are coupled to a double knockout corresponding to a joint coupling ${r, s} \overset{= 0}{\to} t$ in L. But, even for the most robust system, M. tuberculosis iNJ661, a double knockout (if its impact is different from the two single knockouts) in average has a combined effect of 2 additional knocked out classes resp. 5.8 reactions.

Knockout options

In our next experiment, we take the opposite perspective (Table 2). We analyse how robust an average reaction is to single or double knockouts. More precisely, we ask the following question: Given a reaction t, what are the possible choices for a single reaction r resp. a pair of reactions {r,s} such that $r \overset{= 0}{\to} t$ in L resp. ${r, s} \overset{= 0}{\to} t$ in L holds. This perspective corresponds to a lab experiment for finding knockout targets for the reaction t. Here, we consider single reactions instead of reaction classes. This means that for ${r, s} \overset{= 0}{\to} t$ in L with r,s,t∈Rep, we get |[r]|·|[s]| knockout options for all the |[t]| reactions that belong to the same reaction class as t.

Table 2.

Average number of knockout options

Model	Single KOs	Double KOs
	Options	Options
E. coli iJO1366	35.1	143
E. coli iAF1260	26.4	78.0
S. cerevisiae iND750	25.6	106
M. tuberculosis iNJ661	82.7	120
S. aureus iSB619	65.9	245
H. pylori iIT341	143	126
E. coli textbook	6.92	132

Open in a new tab

Single KOs options: Average number of reactions r that lead as single knockouts to inactivity of a target reaction t: $\frac{1}{| 1_{L} |} \sum_{t \in 1_{L}} \sum_{r \overset{= 0}{\to} t in L}$ 1.

Double KOs options: Average number of uncoupled reaction pairs {r, s} that lead as double knockouts to inactivity of a target reaction t: $\frac{1}{| 1_{L} |} \sum_{t \in 1_{L}} \sum_{{r, s} \overset{= 0}{\to} t in L}$ 1.

For most of the studied networks, the average number of knockout options for a given target reaction is in the range of 25-85 single reactions and 100-150 reaction pairs. With all double knockout information at hand, one can reduce the set of all possible knockout candidates for a wet lab experiment to a small number, and additionally decide beforehand which of them have the smallest side effects.

Impact on biomass production

To finish our discussion, we study the impact of knockouts on biomass production. To measure this, we counted the number of single and double knockouts that block the biomass reaction. Table 3 presents the results for the largest available models of the respective organisms. For two of them, more than one biomass reaction was available. In the case of E. coli iJO1366, we present the results for the two biomass reactions, for S. aureus, we selected 2 out of the 14 available reactions.

Table 3.

Number of knockouts for the biomass reaction in selected networks

Model		Single knockouts		Double knockouts
reaction id	cl. size	classes	reactions	cl. pairs	reac. pairs
E. coli iJO1366
Ec_biomass_iJO1366_WT_53p95M	20	101	343	130	339
Ec_biomass_iJO1366_core_53p95M	1	80	288	90	268
S. cerevisiae iND750
biomass_SC4_bal	26	54	156	60	142
M. tuberculosis iNJ661
biomass_Mtb_9_60atp	160	64	154	48	83
S. aureus iSB619
SA_biomass_1a	8	25	63	59	157
SA_biomass_5a	1	58	215	54	100
H. pylori iIT341
BiomassHP_published	189	36	76	41	81

Open in a new tab

class size: Number of reactions in the same coupling class as the biomass reaction, i.e., number of reactions that carry flux if and only if the biomass reaction carries flux.

Single Knockouts: Number of different single knockouts (classes and reactions) that block the biomass reaction. Only reactions that are not partially coupled to the biomass (from a different reaction class) are counted.

Double Knockouts: Number of different double knockouts (class pairs and reaction pairs) that block the biomass reaction when combined. Only reactions that are not directionally coupled to the biomass are counted.

We observe that for most of the organisms, the number of single knockouts that block biomass production is very similar to the number of different double knockouts (corresponding to joint couplings) having this property, although the number of double knockout candidates is much larger (quadratic in |1_L|).

Algorithmic considerations

To perform a double knockout analysis, we first run standard flux coupling analysis (FCA) using the L4FC routine from [18]. Then we calculate the unblocked reactions for each double knockout of a pair of reaction class representatives. Table 4 presents the running times for six genome-scale network reconstructions and the central metabolism of E. coli. Even for our largest network, E. coli iJO1366 with its 2583 reactions, the complete simulation of all double reaction knockouts took less than 1h 10 min.

Table 4.

Runtime and number of solved LPs for double reaction knockouts (Algorithm 2)

Model		Step			Total
		Blocked	Couples	dko
E. coli iJO1366	LPs	1718	9943	133225	144886
	time	2.0	42.2	4016.4	1h 8min
E. coli iAF1260	LPs	1679	10780	52112	64571
	time	1.7	31.5	2688.2	45min 21s
S. cerevisiae iND750	LPs	597	3987	90664	95248
	time	0.33	6.8	397.8	6min 45s
M. tuberculosis iNJ661	LPs	327	3416	20647	24390
	time	0.33	5.6	177.7	3min 4s
S. aureus iSB619	LPs	144	3638	19477	23259
	time	0.09	2.8	43.2	46.0s
H. pylori iIT341	LPs	106	1812	6753	8671
	time	0.06	1.9	18.0	20.0s
E. coli textbook	LPs	26	341	1739	2106
	time	0.004	0.06	0.62	0.68s

Open in a new tab

The computation was done in three steps: Calculation of the blocked reactions, flux coupling analysis to determine the coupled reactions, and finally the double knockout simulations.

Times are given in seconds if not specified otherwise (numbers may not add up due to rounding errors).

Next we discuss the number of LPs we have to solve in order to obtain this additional information. For all our networks, double knockout analysis required solving 5 to 20 times as many LPs than single knockouts, i.e., classical FCA. While this seems to be a large number, it is relatively small compared to the complexity of the problem. A full double knockout simulation is comparable to iterating over all reactions r∈Rep, removing the reaction r and performing a single knockout simulation for each of the resulting subnetworks. Reusing known pathways as witnesses and including reaction coupling information as proposed in [18] allows performing |Rep| simulations with only 5 to 20 times the effort in LP solving. Table 1 shows that the median value for |Rep| is 370 for our networks.

In order to evaluate the runtime effect of our algorithmic improvements, we considered two variants of Algorithm 2: Variant A (no representatives) In the main loop of Algorithm 2, we do not iterate over all representatives r,s∈Rep,r<s, but over all pairs of uncoupled reactions r,s∈1_L,r<s, with not $r \overset{= 0}{\to} s in L$ and not $s \overset{= 0}{\to} r$ in L. Variant B (no witnesses) Same as Variant A. Additionally we do not save witnesses, thus $W = \emptyset$ .

These two experiments allow determining time savings due to representatives (comparing Algorithm 2 and Variant A) and time savings due to warm starts based on knowledge of existing reaction sets (comparing Variant A and B). We should emphasize here that Variant B is is still more efficient than a naive brute force algorithm. The runtime results are given in Table 5, where we stopped computations after a timeout of 6h. Table 5 shows that the efficiency of Algorithm 2 is mostly due to the re-use of (up to 10000) pathways as witnesses (factor 10 in the case of E. coli textbook and factor 100 for S. aureus). Nevertheless, iterating over the set of representatives adds another improvement of up to 50% (S. aureus). Since it takes a very small effort to calculate a set of representatives to profit from this additional speed-up, we highly recommend to iterate over representatives whenever possible.

Table 5.

Runtime of variants of Algorithm 2 for computing double reaction knockouts

Model	Algor. 2	Variant A	Variant B
		(no representatives)	(no witnesses)
E. coli iJO1366	1h 8min	1h 59min	> 6h
E. coli iAF1260	45min 21s	1h 30min	>6h
S. cerevisiae iND750	6min 45s	8min 39s	> 6h
M. tuberculosis iNJ661	3min 4s	7min 42s	> 6h
S. aureus iSB619	46s	1min 59s	2h 32min
H. pylori iIT341	20s	58.1s	52min
E. coli textbook	0.68s	2.2s	23.0s

Open in a new tab

Gene knockouts

Table 6 gives the runtimes and the number of LPs for single and double gene knockouts. To determine the reactions associated to a (double) gene knockout, we used the library JEval that allows fast evaluation of logical formulas given as Java strings. As expected we are confronted with longer runtimes up to almost 4h for double gene knockouts compared to < 70 min for double reaction knockouts. This is due to the fact that we need to check every single pair of genes instead of a representative selection like the one we could apply in double reaction knockout analysis. In spite of this, with the methods proposed here, a full simulation of double reaction or double gene knockouts on a genome-scale metabolic network reconstruction can still be performed in a reasonable time.

Table 6.

Runtime and number of solved LPs for single and double gene knockouts

Model		Step
		gko	dgko
E. coli iJO1366	LPs	719	263844
	time	1.2	3h 49min
E. coli iAF1260	LPs	516	229498
	time	8.6	2h 55min
S. cerevisiae iND750	LPs	1323	308145
	time	6.4	37min 36s
M. tuberculosis iNJ661	LPs	175	77346
	time	1.2	15min 59s
S. aureus iSB619	LPs	49	38689
	time	0.68	9min 42s
H. pylori iIT341	LPs	27	19348
	time	0.24	1min 52s
E. coli textbook	LPs	2	2023
	time	0.04	4.4s

Open in a new tab

Times are given in seconds if not specified otherwise.

Conclusions

On the algorithmic side, this study presented the following main results:

Algorithm 2 is an effective method for a complete double knockout analysis in genome-scale metabolic networks.
Using Algorithm 1, it is possible to compute the impact of specific multiple knockout sets containing 3 or more reactions.
By exploiting the information present in reaction coupling data (obtained by FCA), one can significantly decrease the number of candidates that need to be tested in double and multiple knockout simulations.

Regarding the biological data, we can make the following observations based on our computational experiments:

In the genome-scale metabolic network reconstructions that were considered in this study, 1-10% of the possible double knockout sets have joint coupling effects. Thus, given a randomly chosen reaction pair, the probability is high that the combined effect of the double knockout (in terms of other blocked reactions) will be the same as for the two corresponding single knockouts.
However, in all these networks, there exists a small number of double knockouts showing synergistic effects, blocking 5 to 20 additional reactions in average. These double knockouts cannot be predicted from the single knockout/reaction coupling data alone.

Due to the algorithmic improvements, we are now able to perform full double gene or reaction knockout simulations in a few hours of computation time. Thus, whenever one is interested in understanding the robustness of a network to knockouts, one should take the opportunity and run such an in silico simulation, before starting other more time consuming and expensive experiments.

A prototype implementation of double knockout simulation is available at http://hoverboard.io/L4FC.

Acknowledgements

The PhD work of Yaron Goldstein was supported by the Berlin Mathematical School and the Gerhard C. Starck Stiftung.

Footnotes

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

The paper is based on the PhD thesis of YG, which was supervised by AB. YG implemented the algorithms and performed the computational experiments. YG and AB together wrote the manuscript and approved the final version.

Contributor Information

Yaron AB Goldstein, Email: goldsteiny.de@gmail.com.

Alexander Bockmayr, Email: alexander.bockmayr@fu-berlin.de.

References

1.Bordbar A, Monk JM, King ZA, Palsson B. Constraint-based models predict metabolic and associated cellular functions. Nat Rev Genet. 2014;15(2):107–120. doi: 10.1038/nrg3643. [DOI] [PubMed] [Google Scholar]
2.Lewis NE, Nagarajan H, Palsson B. Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat Rev Microbiol. 2012;10(4):291–305. doi: 10.1038/nrmicro2737. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Varma A, Palsson BO. Predictions for oxygen supply control to enhance population stability of engineered production strains. Biotechnol Bioeng. 1994;43(4):275–285. doi: 10.1002/bit.260430403. [DOI] [PubMed] [Google Scholar]
4.Orth JD, Thiele I, Palsson BO. What is flux balance analysis? Nat Biotechnol. 2010;28(3):245–248. doi: 10.1038/nbt.1614. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Burgard AP, Nikolaev EV, Schilling CH, Maranas CD. Flux coupling analysis of genome-scale metabolic network reconstructions. Genome Res. 2004;14(2):301–312. doi: 10.1101/gr.1926504. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Larhlimi A, David L, Selbig J, Bockmayr A. F2C2: a fast tool for the computation of flux coupling in genome-scale metabolic networks. BMC Bioinformatics. 2012;13(1):57. doi: 10.1186/1471-2105-13-57. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Tomar N, De RK. Comparing methods for metabolic network analysis and an application to metabolic engineering. Gene. 2013;521(1):1–14. doi: 10.1016/j.gene.2013.03.017. [DOI] [PubMed] [Google Scholar]
8.Zomorrodi AR, Suthers PF, Ranganathan S, Maranas CD. Mathematical optimization applications in metabolic networks. Metabolic Eng. 2012;14(6):672–686. doi: 10.1016/j.ymben.2012.09.005. [DOI] [PubMed] [Google Scholar]
9.Burgard AP, Pharkya P, Maranas CD. Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization. Biotechnol Bioeng. 2003;84(6):647–657. doi: 10.1002/bit.10803. [DOI] [PubMed] [Google Scholar]
10.Tepper N, Shlomi T. Predicting metabolic engineering knockout strategies for chemical production: accounting for competing pathways. Bioinformatics. 2010;26(4):536–543. doi: 10.1093/bioinformatics/btp704. [DOI] [PubMed] [Google Scholar]
11.Patil KR, Rocha I, Förster J, Nielsen J. Evolutionary programming as a platform for in silico metabolic engineering. BMC Bioinformatics. 2005;6(1):308. doi: 10.1186/1471-2105-6-308. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Lun DS, Rockwell G, Guido NJ, Baym M, Kelner JA, Berger B, Galagan JE, Church GM. Large-scale identification of genetic design strategies using local search. Mol Syst Biol. 2009;5(1):296. doi: 10.1038/msb.2009.57. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Rocha I, Maia P, Evangelista P, Vilaça P, Soares S, Pinto JP, Nielsen J, Patil KR, Ferreira EC, Rocha M. Optflux: an open-source software platform for in silico metabolic engineering. BMC Syst Biol. 2010;4(1):45. doi: 10.1186/1752-0509-4-45. [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Ohno S, Shimizu H, Furusawa C. FastPros screening of reaction knockout strategies for metabolic engineering. Bioinformatics. 2014;30(7):981–987. doi: 10.1093/bioinformatics/btt672. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Klamt S, Gilles ED. Minimal cut sets in biochemical reaction networks. Bioinformatics. 2004;20(2):226–234. doi: 10.1093/bioinformatics/btg395. [DOI] [PubMed] [Google Scholar]
16.Jungreuthmayer C, Nair G, Klamt S, Zanghellini J. Comparison and improvement of algorithms for computing minimal cut sets. BMC Bioinformatics. 2013;14(1):318. doi: 10.1186/1471-2105-14-318. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.von Kamp A, Klamt S. Enumeration of smallest intervention strategies in genome-scale metabolic networks. PLOS Comput Biol. 2014;10(1):1003378. doi: 10.1371/journal.pcbi.1003378. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Goldstein YAB, Bockmayr A. A lattice-theoretic framework for metabolic pathway analysis. In: Gupta A, Henzinger T, editors. Computational Methods in Systems Biology. Lecture Notes in Computer Science. Vol. 8130, Berlin: Springer; 2013. [Google Scholar]
19.Zhao Y, Tamura T, Akutsu T, Vert J-P. Flux balance impact degree: a new definition of impact degree to properly treat reversible reactions in metabolic networks. Bioinformatics. 2013;29(17):2178–2185. doi: 10.1093/bioinformatics/btt364. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Nogales J, Gudmundsson S, Thiele I. An in silico re-design of the metabolism in thermotoga maritima for increased biohydrogen production. Int J Hydrogen Energy. 2012;37(17):12205–12218. doi: 10.1016/j.ijhydene.2012.06.032. [DOI] [Google Scholar]
21.Suthers PF, Zomorrodi A, Maranas CD. Genome-scale gene/reaction essentiality and synthetic lethality analysis. Mol Syst Biol. 2009;5(1):301. doi: 10.1038/msb.2009.56. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Reimers AC, Goldstein YAB, Bockmayr A. Qualitative and thermodynamic flux coupling analysis. Technical Report #1054, Matheon (March 2014). http://nbn-resolving.de/urn:nbn:de:0296-matheon-12801
23.Pfeiffer T, Sánchez-Valdenebro I, Nuño JC, Montero F, Schuster S. METATOOL: for studying metabolic networks. Bioinformatics. 1999;15:251–257. doi: 10.1093/bioinformatics/15.3.251. [DOI] [PubMed] [Google Scholar]
24.Schellenberger J, Park JO, Conrad TM, Palsson BO. BiGG: a biochemical genetic and genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics. 2010;11(213):213. doi: 10.1186/1471-2105-11-213. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR1] 1.Bordbar A, Monk JM, King ZA, Palsson B. Constraint-based models predict metabolic and associated cellular functions. Nat Rev Genet. 2014;15(2):107–120. doi: 10.1038/nrg3643. [DOI] [PubMed] [Google Scholar]

[CR2] 2.Lewis NE, Nagarajan H, Palsson B. Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat Rev Microbiol. 2012;10(4):291–305. doi: 10.1038/nrmicro2737. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR3] 3.Varma A, Palsson BO. Predictions for oxygen supply control to enhance population stability of engineered production strains. Biotechnol Bioeng. 1994;43(4):275–285. doi: 10.1002/bit.260430403. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Orth JD, Thiele I, Palsson BO. What is flux balance analysis? Nat Biotechnol. 2010;28(3):245–248. doi: 10.1038/nbt.1614. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR5] 5.Burgard AP, Nikolaev EV, Schilling CH, Maranas CD. Flux coupling analysis of genome-scale metabolic network reconstructions. Genome Res. 2004;14(2):301–312. doi: 10.1101/gr.1926504. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR6] 6.Larhlimi A, David L, Selbig J, Bockmayr A. F2C2: a fast tool for the computation of flux coupling in genome-scale metabolic networks. BMC Bioinformatics. 2012;13(1):57. doi: 10.1186/1471-2105-13-57. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Tomar N, De RK. Comparing methods for metabolic network analysis and an application to metabolic engineering. Gene. 2013;521(1):1–14. doi: 10.1016/j.gene.2013.03.017. [DOI] [PubMed] [Google Scholar]

[CR8] 8.Zomorrodi AR, Suthers PF, Ranganathan S, Maranas CD. Mathematical optimization applications in metabolic networks. Metabolic Eng. 2012;14(6):672–686. doi: 10.1016/j.ymben.2012.09.005. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Burgard AP, Pharkya P, Maranas CD. Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization. Biotechnol Bioeng. 2003;84(6):647–657. doi: 10.1002/bit.10803. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Tepper N, Shlomi T. Predicting metabolic engineering knockout strategies for chemical production: accounting for competing pathways. Bioinformatics. 2010;26(4):536–543. doi: 10.1093/bioinformatics/btp704. [DOI] [PubMed] [Google Scholar]

[CR11] 11.Patil KR, Rocha I, Förster J, Nielsen J. Evolutionary programming as a platform for in silico metabolic engineering. BMC Bioinformatics. 2005;6(1):308. doi: 10.1186/1471-2105-6-308. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Lun DS, Rockwell G, Guido NJ, Baym M, Kelner JA, Berger B, Galagan JE, Church GM. Large-scale identification of genetic design strategies using local search. Mol Syst Biol. 2009;5(1):296. doi: 10.1038/msb.2009.57. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR13] 13.Rocha I, Maia P, Evangelista P, Vilaça P, Soares S, Pinto JP, Nielsen J, Patil KR, Ferreira EC, Rocha M. Optflux: an open-source software platform for in silico metabolic engineering. BMC Syst Biol. 2010;4(1):45. doi: 10.1186/1752-0509-4-45. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR14] 14.Ohno S, Shimizu H, Furusawa C. FastPros screening of reaction knockout strategies for metabolic engineering. Bioinformatics. 2014;30(7):981–987. doi: 10.1093/bioinformatics/btt672. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Klamt S, Gilles ED. Minimal cut sets in biochemical reaction networks. Bioinformatics. 2004;20(2):226–234. doi: 10.1093/bioinformatics/btg395. [DOI] [PubMed] [Google Scholar]

[CR16] 16.Jungreuthmayer C, Nair G, Klamt S, Zanghellini J. Comparison and improvement of algorithms for computing minimal cut sets. BMC Bioinformatics. 2013;14(1):318. doi: 10.1186/1471-2105-14-318. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.von Kamp A, Klamt S. Enumeration of smallest intervention strategies in genome-scale metabolic networks. PLOS Comput Biol. 2014;10(1):1003378. doi: 10.1371/journal.pcbi.1003378. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Goldstein YAB, Bockmayr A. A lattice-theoretic framework for metabolic pathway analysis. In: Gupta A, Henzinger T, editors. Computational Methods in Systems Biology. Lecture Notes in Computer Science. Vol. 8130, Berlin: Springer; 2013. [Google Scholar]

[CR19] 19.Zhao Y, Tamura T, Akutsu T, Vert J-P. Flux balance impact degree: a new definition of impact degree to properly treat reversible reactions in metabolic networks. Bioinformatics. 2013;29(17):2178–2185. doi: 10.1093/bioinformatics/btt364. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR20] 20.Nogales J, Gudmundsson S, Thiele I. An in silico re-design of the metabolism in thermotoga maritima for increased biohydrogen production. Int J Hydrogen Energy. 2012;37(17):12205–12218. doi: 10.1016/j.ijhydene.2012.06.032. [DOI] [Google Scholar]

[CR21] 21.Suthers PF, Zomorrodi A, Maranas CD. Genome-scale gene/reaction essentiality and synthetic lethality analysis. Mol Syst Biol. 2009;5(1):301. doi: 10.1038/msb.2009.56. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Reimers AC, Goldstein YAB, Bockmayr A. Qualitative and thermodynamic flux coupling analysis. Technical Report #1054, Matheon (March 2014). http://nbn-resolving.de/urn:nbn:de:0296-matheon-12801

[CR23] 23.Pfeiffer T, Sánchez-Valdenebro I, Nuño JC, Montero F, Schuster S. METATOOL: for studying metabolic networks. Bioinformatics. 1999;15:251–257. doi: 10.1093/bioinformatics/15.3.251. [DOI] [PubMed] [Google Scholar]

[CR24] 24.Schellenberger J, Park JO, Conrad TM, Palsson BO. BiGG: a biochemical genetic and genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics. 2010;11(213):213. doi: 10.1186/1471-2105-11-213. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Double and multiple knockout simulations for genome-scale metabolic network reconstructions

Yaron AB Goldstein

Alexander Bockmayr

Abstract

Background

Results

Software

Background

Methods

Reaction coupling in the context of knockout analysis

Figure 1.

Definition1 (Joint reaction coupling).

Lattices and maximal elements

Proposition1.

Proof.

Classes of partially coupled reactions

Table 1.

Algorithms

The case of gene knockouts

Definition2 (Gene coupling).

Results and discussion

Impact of double knockouts

Knockout options

Table 2.

Impact on biomass production

Table 3.

Algorithmic considerations

Table 4.

Table 5.

Gene knockouts

Table 6.

Conclusions

Acknowledgements

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases