A mathematical framework for yield (vs. rate) optimization in constraint-based modeling and applications in metabolic engineering

Steffen Klamt; Stefan Müller; Georg Regensburger; Jürgen Zanghellini

doi:10.1016/j.ymben.2018.02.001

. 2018 May;47:153–169. doi: 10.1016/j.ymben.2018.02.001

A mathematical framework for yield (vs. rate) optimization in constraint-based modeling and applications in metabolic engineering

Steffen Klamt ^a,¹, Stefan Müller ^b,¹, Georg Regensburger ^c,¹, Jürgen Zanghellini ^d,^e,^⁎,¹

PMCID: PMC5992331 PMID: 29427605

Abstract

Background: The optimization of metabolic rates (as linear objective functions) represents the methodical core of flux-balance analysis techniques which have become a standard tool for the study of genome-scale metabolic models. Besides (growth and synthesis) rates, metabolic yields are key parameters for the characterization of biochemical transformation processes, especially in the context of biotechnological applications. However, yields are ratios of rates, and hence the optimization of yields (as nonlinear objective functions) under arbitrary linear constraints is not possible with current flux-balance analysis techniques. Despite the fundamental importance of yields in constraint-based modeling, a comprehensive mathematical framework for yield optimization is still missing.

Results: We present a mathematical theory that allows one to systematically compute and analyze yield-optimal solutions of metabolic models under arbitrary linear constraints. In particular, we formulate yield optimization as a linear-fractional program. For practical computations, we transform the linear-fractional yield optimization problem to a (higher-dimensional) linear problem. Its solutions determine the solutions of the original problem and can be used to predict yield-optimal flux distributions in genome-scale metabolic models. For the theoretical analysis, we consider the linear-fractional problem directly. Most importantly, we show that the yield-optimal solution set (like the rate-optimal solution set) is determined by (yield-optimal) elementary flux vectors of the underlying metabolic model. However, yield- and rate-optimal solutions may differ from each other, and hence optimal (biomass or product) yields are not necessarily obtained at solutions with optimal (growth or synthesis) rates. Moreover, we discuss phase planes/production envelopes and yield spaces, in particular, we prove that yield spaces are convex and provide algorithms for their computation. We illustrate our findings by a small example and demonstrate their relevance for metabolic engineering with realistic models of E. coli.

Conclusions: We develop a comprehensive mathematical framework for yield optimization in metabolic models. Our theory is particularly useful for the study and rational modification of cell factories designed under given yield and/or rate requirements.

Abbreviations: ATP, adenosine triphosphate; ECC2, EColiCore2; EFM, elementary flux mode; EFV, elementary flux vector; FBA, flux-balance analysis; gDW, gram dry weight; glc, glucose; GSMM, genome-scale metabolic model; LFP, linear-fractional program; LP, linear program; MCS, minimal cut set; PE, production envelope; PP, phase plane; YS, yield space

Keywords: Constraint-based modeling, Elementary flux mode, Elementary flux vector, Flux-balance analysis, Linear-fractional program, Metabolic pathway analysis, Production envelope, Productivity, Strain design, Yield space

Highlights

•
Comprehensive theory of yield analysis and optimization in constraint-based metabolic models is developed.
•
Calculation of optimal yields and yield spaces in (genome-scale) metabolic models.
•
Differences between rate-optimal and yield-optimal flux distributions are highlighted.
•
Design strategies for of yield vs. rate optimal cell factories are discussed and illustrated.

1. Introduction

Productivity and yield are crucial characteristics of biotechnological production processes based on microbial cell factories (Nielsen and Keasling, 2016, Sanford et al., 2016). Yield is a relative measure of the efficiency of (bio)chemical conversions. In particular, it is the amount of product or biomass formed per amount of substrate consumed. In contrast, productivity measures the speed of product formation, i.e., the amount of product or biomass formed per unit of time. Thereby, one is mainly concerned with productivity quantified by specific production rate (e.g., mmol product per gram dry weight and hour) or specific growth rate (with unit per hour).

Yield and productivity are not independent of each other. Moreover, in the case of biomass as the (natural) product, trade-offs between (growth) yield and (growth) rate are believed to shape the evolutionary trajectories of microorganism (Goel et al., 2012). For instance, higher growth yields allow an organism to produce more progenies for the same amount of nutrients, while higher growth rates support faster proliferation, but are often accompanied by reduced biomass yields. The latter growth strategy may be better suited under nutrient excess in order to overgrow any competitors, while the former provides a fitness advantage under nutrient scarcity (Schuster et al., 2008). Another example where the difference between growth yield and growth rate maximization has been discussed intensely in the literature is in the context of respiration vs. fermentation (Schuster et al., 2015, Schuster et al., 2011, Simeonidis et al., 2010).

Optimal yields and rates and their trade-offs are frequently studied with the help of mathematical models. Here, evolutionary pressure is often modeled in terms of some kind of optimality. Constraint-based modeling represents one approach to investigate optimality principles based on the underlying metabolic network structure (Bordbar et al., 2014). Flux-balance analysis (FBA) is a particularly prominent constraint-based modeling approach that predicts steady-state flux distributions in genome-scale metabolic models (GSMMs) by assuming an objective function that the cell aims to optimize (Fell and Small, 1986, Orth et al., 2010, Varma and Palsson, 1994, Watson, 1984). Mathematically, FBA is formulated as a linear program (LP) maximizing a single reaction rate or a linear combination of rates. Typically maximizing growth rate is used as a proxy for evolutionary pressure that improves fitness. Other objective functions have been proposed as well (Schuetz et al., 2012, Schuetz et al., 2007, Gianchandani et al., 2008). In this work, we neither address whether (microbial) cells perform optimally at all nor do we claim what rates or yields could be true biological objectives. Instead, we provide computational means to identify yield-optimal flux distributions for any constraint-based metabolic model.

In its mathematical formulation, FBA clearly maximizes rates, however, it has also been used for maximizing yield. In some applications, limiting substrate uptake rates $r_{S}$ are fixed to experimentally measured values. Since biomass yield $Y^{B / S}$ is the ratio of growth rate $μ$ and uptake rate $r_{S}$ , maximizing yield is then equivalent to maximizing growth rate (Teusink and Smid, 2006). In other cases, only steady-state and irreversibility constraints are known. Then the optimization of rates via FBA leads to infinite values, and only the optimization of product (or biomass) yields $Y^{P / S} = r_{P} / r_{S}$ is meaningful. In a normalization step, the substrate uptake is fixed, e.g. $r_{S} = 1$ , and the (normalized) product formation rate $r_{P}$ is maximized via FBA (Schuster et al., 2008, Schuster et al., 2000). For medium-scale networks, this case can also be addressed in the framework of elementary flux mode (EFM) analysis (Schuster and Hilgetag, 1994, Schuster et al., 1999, Schuster et al., 2000, Zanghellini et al., 2013).

For unknown substrate uptake rate and in the presence of other constraints (e.g., lower or upper flux bounds), the situation is different, and the methods described above cannot be used. Here, the nonlinear yield $Y^{P / S} = r_{P} / r_{S}$ has to be maximized explicitly since the substrate uptake rate at the maximum product yield is not known. In particular, rate optimization $(r_{P} \to \max)$ and yield optimization $(r_{P} / r_{S} \to \max)$ may then lead to different solutions.

Although (maximal) yields are of central interest in metabolic modeling and engineering, a comprehensive mathematical framework for yield analysis and optimization in the context of constraint-based modeling is still missing. In the present work, we study yield optimization under arbitrary linear constraints as a linear-fractional program (LFP), as opposed to rate optimization in FBA which is studied as an LP. For practical computations, we use that a linear-fractional yield optimization problem can be transformed to an equivalent, but higher-dimensional linear problem (which cannot be interpreted as an FBA problem in a straightforward way). By solving this LP and transforming the solutions back to the LFP, we can compute yield-optimal flux distributions in GSMMs.

In our theoretical analysis, we furthermore show that yield-optimal solution sets (like rate-optimal solution sets) can be characterized in terms of EFMs and elementary flux vectors (EFVs) (Klamt et al., 2017). Throughout our study, we emphasize similarities and differences of rate and yield optimization, in particular, in the context of metabolic engineering. We discuss the concepts of phase planes (PPs) and yield spaces (YSs) as important tools for computer-aided strain design and, as another theoretical result, we prove the convexity of YSs.

2. Rate and yield optimization

2.1. Basic terminology and examples

A metabolic network is represented by its stoichiometric matrix $N \in R^{m \times n}$ containing the net stoichiometric coefficients of m internal metabolites in n reactions. The vector $r \in R^{n}$ denotes a flux distribution (flux vector) through the network, and its components $r_{i}$ with $i \in {1, ..., n}$ are the respective reaction rates or fluxes.

Flux-balance analysis (FBA) identifies particular flux distributions through a metabolic network by optimizing a linear objective function $c^{T} r$ subject to steady-state (2a), capacity and irreversibility (2b), as well as possible additional linear constraints (2c), e.g., for resource allocation (Mori et al., 2016):

\max c^{T} r

(1)

subject to

Nr = 0,

(2a)

r^{lb} \leq r \leq r^{ub},

(2b)

Gr \leq h .

(2c)

The constraints (2b) include irreversibility constraints, where $r_{i}^{lb} = 0$ for irreversible reactions $i \in I_{irr}$ . The additional constraints in (2c) are expressed by a set of q linear inequalities represented by a matrix $G \in R^{q \times n}$ and a vector $h \in R^{q}$ . The objective function together with the linear constraints form a linear program (LP), which can be solved with standard LP solvers.

A related, but mathematically different type of optimization is the identification of yield-optimal flux vectors. A yield is the ratio of two fluxes or, more generally, the ratio of linear combinations of fluxes,

Y (r) = \frac{c^{T} r}{d^{T} r} .

(3)

Usually, the numerator contains a sum of (weighted) product fluxes, whereas the denominator contains a sum of (weighted) substrate uptake fluxes and is assumed to be positive. Thereby, the directions of substrate uptake reactions are fixed, and the corresponding signs of fluxes and coefficients match. That is, for uptake reaction i, either $r_{i} \geq 0$ and $d_{i} \geq 0$ or $r_{i} \leq 0$ and $d_{i} \leq 0$ .

Yield optimization poses the following problem:

\max Y (r)

(4)

subject to the same constraints (2a), (2b), (2c) as for FBA and the additional assumption $d^{T} r > 0$ . Since the objective function is a fraction of two linear functions, this kind of optimization problem is called a linear-fractional program (LFP) (Boyd and Vandenberghe, 2004, Frenk and Schaible, 2005, Maranas and Zomorrodi, 2016). We note that, in a general LFP, the objective function has the form $(c^{T} r + p) / (d^{T} r + q)$ . In the case of yield optimization, we have $p = q = 0$ which simplifies the mathematical treatment.

In the following, we will use the example network in Fig. 1 to illustrate rate- and yield-optimal flux distributions in three different scenarios. With reaction R1 as S(ubstrate) uptake, we are particularly interested in the synthesis of B(iomass or a biomass component) via reaction R4. Thereby, P and Q (with excretion reactions R3 and R2) represent two byproducts. In each of the following three scenarios, we are interested in the maximal biomass synthesis rate $r_{4}$ and the maximal biomass yield $Y^{B / S} = r_{4} / r_{1}$ . The main characteristics of these scenarios are summarized in Fig. 2(a).

Fig. 2 — Rate- and yield-optimal flux distributions for scenarios S1–S3 of the example network in Fig. 1. Panel (a) lists the main characteristics of the three scenarios together with the respective rate- and yield-optimal flux distributions (depending on the input flux $r_{1}$ ). The contributing reactions are illustrated as blue lines in panels (b) and (c). Full lines indicate fluxes depending only on $r_{1}$ , while dashed lines indicate fluxes depending also on $r_{11}$ .

Scenario S1. First, we only consider the steady-state and irreversibility constraints. Clearly, maximization of the biomass synthesis rate $r_{4}$ is then meaningless since the resulting LP is unbounded, leading to an infinite rate $r_{4}$ . In contrast, the optimal B(iomass) yield is one, irrespective of the amount of substrate taken up. See Fig. 2(a) and (b).

Scenario S2. Next, we use the (standard) condition of Scenario S1 and restrict the substrate uptake rate by $r_{1} \leq 10$ . Now, the maximum rate for biomass synthesis is ten, whereas the maximum biomass yield is still one. In this case, the maximum biomass rate is just given by the substrate uptake rate multiplied by the maximum biomass yield. Note that for $r_{1}$ infinitely many solutions exist that result in yield-optimality $(0 < r_{1} \leq 10)$ , whereas only one solution results in rate-optimality $(r_{1} = 10)$ . See Fig. 2(a) and (b).

Scenario S3. Finally, we also set a capacity constraint for reaction R5: $r_{5} \leq 5$ . Apparently, the maximum rate for biomass synthesis is now 7.5, whereas the maximum biomass yield is still one. However, the maximum yield can only be reached for substrate uptake rates $r_{1} \leq 5$ , whereas the maximum rate occurs at $r_{1} = 10$ . As a consequence, the rate-optimal solution is not yield-optimal. See Figs. 2(a)–(c).

Remark. Note that, in some special cases, the maximum yield is reached only for infinitely large fluxes (one says “it is not attained”). If we take scenario S1 and add a nonzero lower bound for reaction R2, e.g. $r_{2} \geq 1$ , then the maximum B(iomass) yield of one is approached for very large substrate uptake flux $r_{1}$ (but will never be attained).

Numerous LPs, given by Eq. (1) subject to the constraints (2a), (2b), (2c), have been solved in the context of FBA to find rate-optimal solutions, e.g., growth-rate-optimal phenotypes or flux distributions with optimal product synthesis rates. Optimal yields are obviously also of high interest in the context of metabolic engineering. However, surprisingly, with only one exception (Burgard et al., 2004) (in the context of flux coupling analysis, see Section 2.3), we are not aware of a single constraint-based modeling study that solved the LFP given by (4) subject to (2a), (2b), (2c) to optimize yield in a metabolic network. Instead, yield-optimal solutions have often been identified either via elementary flux modes (EFMs) (applicable only in smaller networks, see below) or by fixing the substrate uptake rate and then maximizing product synthesis rate via a standard LP (Schuster et al., 2008, Teusink and Smid, 2006). However, these methods cannot be used in the general case of arbitrary constraints. For example, fixing the substrate uptake rate in scenario S3 to its maximum value $r_{1} = 10$ and then maximizing the biomass synthesis rate will deliver a flux vector that is not biomass-yield optimal for scenario S3.

In the following, we give a mathematical definition of yield optimization problems in the context of constraint-based modeling and discuss important properties of the resulting LFPs. Further, we show how they can be written as LPs, which allows the use of efficient algorithms even for genome-scale metabolic models (GSMMs). In a subsequent section, we characterize rate- as well as yield-optimal solution sets in terms of rates and yields of (optimal) generators of the flux polyhedron.

2.2. Mathematical treatment

We define rate optimization (as an LP) and yield optimization (as an LFP). Most importantly, we rewrite yield optimization as an LP.

The constraints (2a), (2b), (2c) define the flux polyhedron

P = {x \in R^{n} ∣ Ax \leq b},

(5)

where

A = (\begin{matrix} N \\ - N \\ I \\ - I \\ G \end{matrix}), b = (\begin{matrix} 0 \\ 0 \\ r^{ub} \\ - r^{lb} \\ h \end{matrix}),

(6)

and $I \in R^{n \times n}$ is the identity matrix. As standard in convex optimization, we use the variable $x \in R^{n}$ for the (flux) vector $r \in R^{n}$ .

2.2.1. Definitions

Given a vector $c \in R^{n}$ , we define the linear objective function $l : R^{n} \to R$ ,

l (x) = c^{T} x

(7)

and study rate optimization as the LP

\max_{x \in P} l (x) .

(8)

Given $c, d \in R^{n}$ , we define the yield $Y : D \to R$ as the rational function

Y (x) = \frac{c^{T} x}{d^{T} x}

(9)

on the set

D = {x \in R^{n} ∣ d^{T} x > 0} .

(10)

That is, we require a positive denominator. We study yield optimization as the LFP

\max_{x \in P_{>}} Y (x)

(11)

with

P_{>} = P \cap D = {x \in P ∣ d^{T} x > 0} .

(12)

We note that, given rate- and yield-optimal solutions, $x^{* r}$ and $x^{* y}$ , respectively, the inequalities $c^{T} x^{* r} \geq c^{T} x^{* y} > 0$ and $Y (x^{* y}) \geq Y (x^{* r}) > 0$ imply $d^{T} x^{* r} \geq d^{T} x^{* y}$ . In biological terms, the substrate uptake in yield-optimal states is never larger than the substrate uptake in rate-optimal states.

2.2.2. Yield optimization as an LP

In general, an LFP is equivalent to an LP (Boyd and Vandenberghe, 2004, Frenk and Schaible, 2005, Maranas and Zomorrodi, 2016). Let $P = {x ∣ Ax \leq b}$ be a polyhedron (with recession cone ${x \in R^{n} ∣ Ax \leq 0}$ ). The above LFP

\max_{x \in P_{>}} \frac{c^{T} x}{d^{T} x}

with

P_{>} = {x \in P ∣ d^{T} x > 0}

is equivalent to the LP

\max_{(x', t) \in P'} c^{T} x'

(13)

with the auxiliary polyhedron

P' = {(x', t) ∣ Ax' \leq tb, d^{T} x' = 1, t \geq 0}

(14)

in the following sense: The LFP is feasible if and only if the LP is feasible, and both optimization problems have the same optimal value. In detail, if $x \in P_{>}$ is feasible in the LFP, then $(x', t) \in P'$ with

x' = \frac{x}{d^{T} x} and t = \frac{1}{d^{T} x} > 0

is feasible in the LP with the same objective value.

Conversely, if $(x', t) \in P'$ with $t > 0$ is feasible in the LP, then

x = \frac{x'}{t} \in P_{>}

is feasible in the LFP with the same objective value.

Finally, if $t = 0$ , then $(x', 0) \in P'$ corresponds to an element of the recession cone of $P$ since $Ax' \leq 0$ (and $d^{T} x' = 1$ ). In those cases, the objective value of the LFP approaches the objective value of the LP in the limit: for arbitrary, but fixed $x_{0} \in P_{>}$ , we have

{x_{0} + λ x' ∣ λ \geq 0} \subseteq P_{>}

and

\lim_{λ \to \infty} \frac{c^{T} (x_{0} + λ x')}{d^{T} (x_{0} + λ x')} = \frac{c^{T} x'}{d^{T} x'} = c^{T} x' .

If the LP has an optimal solution $(x^{*}, 0)$ , two cases can occur. If there exists another optimal solution $(\tilde{x}, \tilde{t})$ with $\tilde{t} > 0$ , then the optimal yield is attained at the corresponding element of $P_{>}$ . Otherwise, the optimal yield is reached only in the limit (as already illustrated in the remark in Section 2.1). In practice, one is interested in optimal yields attained at finite fluxes and one can proceed as follows. If the LP solver returns an optimal solution $(x^{*}, 0)$ , one first determines the feasible range of t, by maximizing t in the auxiliary polyhedron (14). Then one chooses a feasible $\tilde{t} > 0$ close to zero (but numerically unproblematic) and determines an optimal solution $(\tilde{x}, \tilde{t})$ of (13). If $c^{T} \tilde{x} = c^{T} x^{*}$ , then the optimal yield is attained at finite fluxes.

We note that the equivalence of the LP and the LFP also implies that the LP is unbounded if and only if the original LFP is unbounded. Such cases arise, for example, if alternative substrates can be used to synthesize the product, but are not accounted for in the denominator of the yield function (9).

2.3. Flux coupling analysis

Flux coupling analysis is a method to detect functionally related reactions (Burgard et al., 2004, Maranas and Zomorrodi, 2016). The fluxes $r_{i}$ and $r_{j}$ of any two reactions i and j can be either fully, partially, directionally, or not at all coupled. These features can be detected by maximizing and minimizing the flux ratio $r_{i} / r_{j}$ over the set of feasible flux vectors. If $\max (r_{i} / r_{j}) = \min (r_{i} / r_{j})$ , then the two fluxes are fully coupled, i.e., one flux is a multiple of the other. If both ratios are not equal, but nonzero and finite, then the reactions are partially coupled. This means that, if any of the two fluxes is nonzero, then the other one is nonzero, too. Finally, if the activity of one reaction implies the activity of the other but not vice versa, then the reactions are directionally coupled. This is the case when one of the ratios is finite and the other is zero. For example, in the network in Fig. 1, the fluxes $r_{2}$ and $r_{8}$ are fully coupled, whereas $r_{6}$ and $r_{1}$ are directionally coupled, and no pair of reactions is partially coupled.

Maximizing and minimizing a flux ratio is a special case of optimizing a rational function as in (4). In fact, Burgard et al. (2004) used an LFP for flux coupling analysis, but they did not extend their work to the analysis of yields.

3. Rate-optimal and yield-optimal solution sets

3.1. Basic terminology and examples

In this section we show how optimal rates and yields and the corresponding rate- and yield-optimal solution sets can be analyzed by means of generating vectors.

If the constraints (2a), (2b), (2c) for the objective functions (1), (4) contain only the steady-state and irreversibility constraints, then the set of feasible flux vectors forms a polyhedral cone, the flux cone. A well-known and particularly useful generating set of the flux cone is given by the set of elementary flux modes (EFMs), which are non-decomposable (support-minimal) flux vectors (Schuster and Hilgetag, 1994, Schuster et al., 1999, Schuster et al., 2000, Zanghellini et al., 2013). Every feasible steady-state flux distribution is a conical (nonnegative linear) combination of EFMs, and it is well-known that EFMs can be used to identify yield-optimal pathways. (For examples, see below.)

Rate maximization on the (unbounded) flux cone usually yields an unbounded maximum and is thus meaningless. Therefore, FBA usually involves constraints in (2b), (2c) that go beyond steady state and irreversibility (e.g., maximal substrate uptake rates or resource allocation constraints). This usually bounds the feasible solutions, or at least the value of the objective function, see scenarios S2 and S3 in Fig. 2(a). The resulting flux polyhedron cannot be analyzed with EFMs, but with the more general approach of elementary flux vectors (EFVs) (Urbanczik, 2007, Klamt et al., 2017, Müller and Regensburger, 2016). EFVs are a particularly useful generating set of a flux polyhedron and generalize EFMs, since EFVs and EFMs coincide in the case of a flux cone. Only recently it has been realized that EFVs can indeed be used to characterize rate- and yield-optimal solution sets (of linear programs (LPs) and linear-fractional programs (LFPs), respectively (Klamt et al., 2017)).

For the three scenarios S1–S3 of the example network in Fig. 1, the corresponding EFMs and EFVs are listed in Table 1. Here, we illustrate how they can be used to characterize the sets of rate- and yield-optimal flux vectors.

Table 1.

List of generators (EFMs and EFVs) for scenarios S1 to S6 in the example network in Fig. 1. Scenarios are characterized by an increasing number of constraints (indicated by “ $*$ ”). Membership of generators in different scenarios is indicated by “+”. Generators are characterized by their B(iomass) yield $Y^{B / S}$ and P(roduct) yield $Y^{P / S}$ . Some generators are not bounded (indicated in the column “Bounded”), and normalized rates are then listed.

Open in a new tab

Scenario S1. The standard condition involves only the steady-state and irreversibility constraints of the network. These constraints form the flux cone which is generated by the six EFMs listed in Table 1. Note that the first five EFMs represent pathways from S(ubstrate) to products (B, P, Q), whereas EFM6 is an internal cycle involving the reactions R7, R10, and R11. (Such a cycle is thermodynamically infeasible, but we keep it for illustrating the concept.) Except for the cycle, all EFMs have a well-defined B(iomass) yield $Y^{B / S} = r_{4} / r_{1}$ , and the maximum yield of $Y^{B / S} = 1$ is reached by EFM4 and EFM5. As will be detailed below, the set of yield-optimal flux distributions is given by all possible conical sums of EFM4, EFM5 (having maximum yield) and the cycle EFM6 (with undefined yield). Thereby, EFM4 and/or EFM5 must contribute to the sum. All these flux distributions, indicated in Fig. 2(b), have maximum yield. Again, note that rate-optimal fluxes are unbounded in flux cones.

Scenario S2. The substrate uptake flux is constrained by an upper bound, and the set of feasible solutions changes from a flux cone to a flux polyhedron. Since EFMs are not defined for general (flux) polyhedra, we determine the EFVs. Indeed, the flux polyhedron is generated by seven EFVs (Table 1). Six of them correspond to the EFMs of S1: EFV7-EFV11 correspond to EFM1-EFM5 but are now scaled to the maximal substrate uptake rate, whereas EFV22 is identical to the cycle EFM6 and remains unscaled. Finally, EFV23 is the zero vector (which is always an EFV if it is contained in the flux polyhedron). Now we can characterize solutions with maximum biomass production rate and maximum biomass yield, respectively. Similarly to scenario S1, the yield-optimal solution set is given by all possible convex sums of EFV10 and EFV11 (with a maximum yield of $Y^{B / S} = 1$ ), and the zero vector EFV23 plus some nonnegative multiple of the cycle EFV22. Although EFV22 and EFV23 have undefined yield, they do not affect the overall (maximum) yield as long as at least one EFV with maximum yield (EFV10 and/or EFV11) contributes to the sum. These flux distributions are again illustrated in Fig. 2(b). In contrast, the set of rate-optimal flux vectors (with a maximum rate of $r_{4} = 10$ ) is given by all convex sums of (only) EFV10 and EFV11 plus some nonnegative multiple of the cycle EFV22.

Scenario S3. Finally, with the additional flux bound $r_{5} \leq 5$ , we find that the B(iomass) yield-optimal solutions are now described by EFV18 and EFV19, which correspond to EFV10 and EFV11 in scenario S2, but are now scaled to the additional flux bound. The yield-optimal solution set is given by all convex sums of EFV18 and EFV19 (having maximum yield $Y^{B / S} = 1$ ) and the zero vector EFV23 plus some nonnegative multiple of the cycle EFV22 (again, with some minimum contribution of EFV18 and/or EFV19). In contrast, the set of rate-optimal solutions (with a maximum rate of $r_{4} = 7.5$ ) is given by all convex sums of the (newly arising) EFV16 and EFV17 plus some nonnegative multiple of the cycle EFV22, see Fig. 2(c).

3.2. Rate-optimal solution sets

Recently, a general guideline has been given how to use EFVs (or EFMs in the special case of flux cones) to describe rate- and yield-optimal solution sets (Klamt et al., 2017). For the case of yield optimization, a detailed mathematical treatment will be given in 3.3, 3.4. Before that, we summarize the results for the simpler case of rate optimization,

\max_{x \in P} c^{T} x .

By Minkowski's Theorem, the flux polyhedron $P = {x \in R^{n} ∣ Ax \leq b}$ is the sum of a polytope (a bounded polyhedron) and a finitely generated cone, the recession cone ${x \in R^{n} ∣ Ax \leq 0}$ . The linear objective $c^{T} x$ is bounded on $P$ if and only if it vanishes on the recession cone. In this case, we call the LP bounded.

Most importantly, if the LP is bounded, then every optimal solution is a convex sum of optimal generators of the polytope plus a conical sum of generators of the recession cone – and vice versa. Hence, the optimal solution set is a (sub)polyhedron. It is the sum of the optimal (sub)polytope and the recession cone.

Any set of generators (of $P$ ) can be used to characterize the rate-optimal solution set, for example, the set of minimal generators (as used in (Kelk et al., 2012)) or the set of EFVs. In fact, EFVs have several valuable properties for characterizing (optimal) solution sets (Klamt et al., 2017), and we suggest to use EFVs as in the scenarios S2 and S3 above.

3.3. Mathematical properties of yield optimization

In order to characterize optimal solution sets of the yield optimization problem, we require a number of auxiliary results. For detailed statements and proofs, see the Appendix.

Property 1

In the definition of the yield Y, the numerator $c^{T} x$ and the denominator $d^{T} x$ do not contain constant terms and hence

$Y (λ x) = Y (x)$ (15)

for $x \in D$ and $λ > 0$ . That is, Y is constant on the ray ${λ x ∣ λ > 0}$ .

Property 2

The yield Y is not a linear function, not even convex (or concave). Still, for $x, y \in D$ and $λ \in [0, 1]$ , the yield of the convex combination

$z = (1 - λ) x + λ y$

is given by the convex combination of yields

$Y (z) = (1 - λ') Y (x) + λ' Y (y)$

with a unique $λ' \in [0, 1]$ . See Proposition 1 in the Appendix.

Property 3

The domain $P_{>}$ is contained in the polyhedron

$P_{\geq} = {x \in P ∣ d^{T} x \geq 0} .$ (16)

Clearly,

$P_{\geq} = P_{>} \cup P_{=},$

where

$P_{=} = {x \in P ∣ d^{T} x = 0} .$ (17)

In fact, $P_{\geq}$ is the smallest polyhedron containing $P_{>}$ .

Again, by Minkowski's Theorem, the polyhedron $P_{\geq}$ is the sum of a polytope and a finitely generated cone (the recession cone of $P_{\geq}$ ). Explicitly, let $v^{i} \in P_{\geq}$ $(i \in I)$ and $u^{j} \in rec (P_{\geq})$ $(j \in J)$ be generators of the polytope and the recession cone, respectively (I and J are finite index sets of these generators). Then, every $x \in P_{\geq}$ (in particular, every $x \in P_{>}$ ) can be written as

x = \sum_{i \in I} α_{i} v^{i} + \sum_{j \in J} β_{j} u^{j}

(18)

with $α_{i}, β_{j} \geq 0$ and $\sum_{i \in I} α_{i} = 1$ . Again, any set of generators (of $P_{\geq}$ ) can be used and we suggest to deploy the set of EFVs (Urbanczik and Wagner, 2005, Urbanczik, 2007, Klamt et al., 2017, Müller and Regensburger, 2016).

In our main result below, we characterize optimal solutions of the yield optimization problem in terms of generators (EFVs) of $P_{\geq}$ .

Property 4

A vector $x \in P_{=}$ (with $d^{T} x = 0$ ) may have unbounded yield (if $c^{T} x \neq 0$ ) or undefined yield (if $c^{T} x = 0$ ). That is,

$P_{=} = P_{\pm / 0} \cup P_{0 / 0},$

where

$\begin{matrix} P_{\pm / 0} & = & {x \in P_{=} ∣ c^{T} x \neq 0}, \\ P_{0 / 0} & = & {x \in P_{=} ∣ c^{T} x = 0} . \end{matrix}$ (19)

The definition of the yield directly implies the following two results:

On the one hand, if there is a vector in $P_{\geq}$ with unbounded yield (that is, a vector in $P_{\pm / 0}$ ), then Y is unbounded on $P_{>}$ . See Lemma 1 in the Appendix.

On the other hand, the addition of a vector in $P_{\geq}$ with undefined yield (that is, a vector in $P_{0 / 0}$ ) to a vector in $P_{>}$ does not change the yield. See Lemma 2 in the Appendix.

As a consequence, if Y is bounded on $P_{>}$ , then generators of $P_{\geq}$ cannot have unbounded yield, but may have undefined yield.

Property 5

Most importantly, if Y is bounded, then the yield of a vector in $P_{>}$ is a convex sum of the yields of generators of $P_{\geq}$ with defined yield. See Lemma 3 in the Appendix. The corresponding index sets are given by

$\begin{matrix} I^{d} & = & {i \in I ∣ v^{i} \in P_{>}}, \\ J^{d} & = & {j \in J ∣ u^{j} \in P_{>}} . \end{matrix}$ (20)

3.4. Yield-optimal solution sets

Now we are in a position to characterize optimal solutions of the yield optimization problem in terms of generators of $P_{\geq}$ .

First, we note that the maximum yield need not be attained, that is, the maximum is only approached in the limit (as already illustrated in the remark in Section 2.1). The following theorem determines when this is the case.

Theorem 1

Let the yield Y be bounded on the nonempty domain $P_{>}$ . Then the maximum is not attained if and only if

$I^{d} = I and \max_{i \in I^{d}} Y (v^{i}) < \max_{j \in J^{d}} Y (u^{j}) .$

If the maximum is not attained, let $Y^{*}$ be the supremum. Then $Y (x) \to Y^{*}$ for $x = v + β_{j} u^{j} \in P_{>}$ with $v \in P_{>}$ , $Y (u^{j}) = Y^{*}$ , and $β_{j} \to \infty$ .

Proof

See the Appendix. ▫

If the maximum yield is attained, then the following theorem states that every optimal solution is a convex/conical sum of generators with maximum or undefined yield – and vice versa. Thereby, at least one generator with maximum yield contributes to the sum. Hence, the closure of the optimal solution set is a (sub)polyhedron.

Theorem 2

Let the yield Y be bounded on the domain $P_{>}$ and the maximum $Y^{*}$ be attained. Then $x^{*} \in P_{>}$ is an optimal solution if and only if

$x^{*} = \sum_{i \in I^{*}} α_{i}^{*} v^{i} + \sum_{i \in I^{u}} α_{i} v^{i} + \sum_{j \in J^{*}} β_{j}^{*} u^{j} + \sum_{j \in J^{u}} β_{j} u^{j},$

where

$\begin{matrix} I^{*} & = & {i \in I ∣ Y (v^{i}) = Y^{*}}, \\ I^{u} & = & {i \in I ∣ v^{i} \in P_{0 / 0}}, \\ J^{*} & = & {j \in J ∣ Y (u^{j}) = Y^{*}}, \\ J^{u} & = & {j \in J ∣ u^{j} \in P_{0 / 0}}, \end{matrix}$

and $α_{i}^{*}, α_{i}, β_{j}^{*}, β_{j} \geq 0$ with

$\sum_{i \in I^{*}} α_{i}^{*} + \sum_{i \in I^{u}} α_{i} = 1 and \sum_{i \in I^{*}} α_{i}^{*} + \sum_{j \in J^{*}} β_{j}^{*} > 0 .$

Proof

See the Appendix. ▫

With the theoretical results obtained above, we can characterize the yield-optimal solution set in terms of generators of $P_{\geq}$ , in particular, in terms of EFVs (see Fig. S1). For realistic applications, we assume that the denominator of the yield function is nonnegative (that is, the flux polyhedron $P$ coincides with $P_{\geq}$ ) and that the yield is bounded (on the nonempty domain $P_{>}$ ). The set of EFVs of $P$ consists of “bounded” EFVs v of the polytope associated to $P$ and “unbounded” EFVs u of the recession cone of $P$ .

Case 1: If the flux polyhedron is the flux cone (given by steady-state and irreversibility constraints), then no “bounded” EFV v exists, and the (sub)cone of yield-optimal solutions is the set of conical sums of the EFVs u (here coinciding with the set of EFMs) with maximum or undefined yield. Thereby, at least one EFM with maximum yield contributes to the sum.

Case 2: If the flux polyhedron is a polytope (the recession cone is empty), then no EFV u exists, and the (sub)polytope of yield-optimal solutions is the set of convex sums of EFVs v with maximum or undefined yield. (Note that the zero vector is an EFV with undefined yield if it is contained in the flux polyhedron). Again, at least one EFV with maximum yield contributes to the sum.

Case 3a: If the flux polyhedron is a general polyhedron (generated by “bounded” and “unbounded” EFVs) and at least one of the EFVs v has maximum or undefined yield, then the maximum yield is attained, and the (sub)polyhedron of yield-optimal solutions is the set of convex sums of EFVs v plus conical sums of EFVs u with maximum or undefined yield. Again, at least one EFV (u or v) with maximum yield contributes to the sum.

Case 3b: If the flux polyhedron is a general polyhedron (as in case 3a) and all EFVs v have defined, but not maximum yield, then the maximum yield is not attained and only approached in the limit, in particular, for an “infinite” contribution of EFVs u with maximum yield.

In Section 3.1, we already applied our theoretical results to characterize yield-optimal solution sets by EFMs (for the flux cone in scenario S1) and EFVs (for the flux polyhedra in scenarios S2 and S3).

4. Phase planes and yield spaces

4.1. Basic terminology and examples

So far we have analyzed the optimization of a single rate or yield. Now we study (phenotypic) phase planes (PPs) and yield spaces (YSs) which have become an important tool in constraint-based modeling for metabolic networks. A PP is a projection of the flux polyhedron on two (or three) selected fluxes, that is, on a two-dimensional plane or on a three-dimensional space. Similarly, a YS is a map from the flux polyhedron to two (or three) selected yields. Thereby, we assume that the denominator $d^{T} x$ in Eq. (3) is identical for all yields, while the numerators $c^{T} x$ differ, i.e., we consider the same substrate(s), but different products. Note that a YS is not a projection, since yields are nonlinear functions of the flux vectors, see Eq. (3). PPs and YSs allow to analyze dependencies between selected fluxes and yields, respectively. This is particularly useful in the context of metabolic engineering and biotechnological applications (see 5, 6). Growth rate and synthesis rate of a target product are frequently chosen for projection; in this case, the resulting PP is often called production envelope (PE) or trade-off plot (Maranas and Zomorrodi, 2016, Machado and Herrgard, 2015). Likewise, biomass yield and product yield are often chosen for YS analysis.

Again, we use the example network in Fig. 1 to illustrate PPs and YSs in the three scenarios S1–S3. As before, R1 represents S(ubstrate) uptake and R4 (B)iomass synthesis. In addition, we now consider the product P (excreted by R3) with synthesis rate $r_{3}$ and yield $Y^{P / S} = r_{3} / r_{1}$ . The product Q is still considered undesired.

Scenario S1. Since the flux cone is unbounded, the PP is unbounded as well, see Fig. 3(a). In contrast, the YS is bounded, since biomass and product synthesis rates are normalized by the substrate uptake rate, see Fig. 3(b). Both, maximal product yield and maximal biomass yield are one. The triangle shape of the YS indicates the trade-off between biomass and product yields due to mass conservation: the more product is formed, the less biomass can be made from the substrate. However, other shapes of the YS do exist, as will be shown below. Note that every flux distribution of the example network is mapped to exactly one point in the YS. Conversely, every point in the YS corresponds to (possibly infinitely many) flux vectors exhibiting the respective biomass and product yields. For example, all flux distributions indicated in Fig. 2(b) map to the point (1,0) in the YS.

Scenario S2. Limiting the substrate uptake rate by $r_{1} \leq 10$ also bounds $r_{4}$ and $r_{3}$ and hence the PP, see Fig. 3(c). However, the YS remains the same as in scenario S1, see Fig. 3(d). Note that, up to scaling, the PP and the YS are identical. Still, points (and line segments) of the PP and the YS are not in one-to-one correspondence. For example, the point (5,0) in the PP is the projection of all flux distributions having a biomass synthesis rate $r_{4} = 5$ and a product formation rate $r_{3} = 0$ . Out of those, all flux vectors of the form $r = {(5, 0, 0, 5, 5, 0, λ, 0, 0, λ, 5 - λ)}^{T}$ with $λ \geq 0$ have a substrate uptake rate $r_{1} = 5$ and hence convert all substrate to biomass. That is, their biomass yield is one, and they are mapped to the point (1,0) in the YS, see Fig. 4. However, there are other flux vectors with the same projection (5,0) in the PP: in those solutions, the substrate is taken up with $r_{1} = 5 + δ$ $(0 \leq δ \leq 5)$ and B is synthesized with $r_{4} = 5$ and Q with $r_{2} = δ$ . Therefore, all flux distributions projected to (5,0) in the PP are mapped to the (closed) line segment between (0.5,0) and (1,0) in the YS, see Fig. 4 (full lines). Conversely, flux vectors mapped to (0.5,0) in the YS are projected on the (half-open) line segment between (0,0) and (10,0) in the PP, see Fig. 4 (dashed lines).

Fig. 4 — Relationships between PP (blue) and YS (red) of scenario S2. Cyan lines indicate the mapping of the point (5,0) from the PP to the YS. Magenta lines indicate the mapping of the point (0.5,0) from the YS to the PP. Brackets indicate (half-)open and closed intervals, respectively.

Note that flux vectors with undefined yield (e.g., the zero vector or the internal cycle) have no corresponding point in the YS.

Scenario S3. The additional capacity constraint $r_{5} \leq 5$ leads to different shapes of the PP and the YS, see Fig. 3(e) and (f). While the YS still forms a triangle (with a maximum biomass yield of 1), the PP becomes a quadrangle (with the maximal biomass synthesis rate dropping from 10 to 7.5). The decrease is caused by the limitation of reaction R5 and the resulting use of reactions R6 and R3 (product formation and excretion). As discussed earlier, rate- and yield-optimal flux distributions differ in this scenario. In particular, optimal biomass yields occur at suboptimal biomass synthesis rates, since all yield-optimal solutions have substrate uptake rate $r_{4} = r_{1}$ with $0 < r_{1} \leq 5$ . These flux distributions are projected to the line segment between (0,0) and (0,5) in the PP. However, note that every point on the x-axis of the PP corresponds to many flux vectors, including those with suboptimal yield.

In the following we give a precise mathematical formulation of YSs and PPs and discuss their properties.

4.2. Mathematical treatment

In constraint-based modeling, one often considers two fluxes $x_{i}$ and $x_{j}$ (that is, two components of the flux vector $x \in R^{n}$ ) and the resulting PP

{{(x_{i}, x_{j})}^{T} \in R^{2} ∣ x \in P},

that is, the projection of the flux polyhedron $P$ on the $(x_{i}, x_{j})$ -plane. If $x_{i}$ and $x_{j}$ are bounded on $P$ , then the PP is a polytope.

More generally, one may consider $ℓ$ linear objective functions $l_{1}, \dots, l_{ℓ} : R^{n} \to R$ ,

l_{1} (x) = c_{1}^{T} x, \dots, l_{ℓ} (x) = c_{ℓ}^{T} x,

and the resulting objective space

{{(l_{1} (x), \dots, l_{ℓ} (x))}^{T} \in R^{ℓ} ∣ x \in P} .

(In case $ℓ = 2$ and $l_{1} (x) = x_{i}$ , $l_{2} (x) = x_{j}$ , the objective space equals the PP defined above). If all objectives are bounded on $P$ , then they vanish on the recession cone of $P$ , and the objective space is a polytope.

Analogously, one may consider $ℓ$ yields $Y_{1}, \dots, Y_{ℓ} : D \to R$ ,

Y_{1} (x) = \frac{c_{1}^{T} x}{d^{T} x}, \dots, Y_{ℓ} (x) = \frac{c_{ℓ}^{T} x}{d^{T} x},

having different numerators, but the same denominator. They define the yield vector

Y (x) = {(Y_{1} (x), \dots, Y_{ℓ} (x))}^{T}

and the resulting YS

{Y (x) \in R^{ℓ} ∣ x \in P_{>}} .

First of all, the YS is a convex set. Indeed, let $x, y \in P_{>}$ and $λ \in [0, 1]$ . By Property 2 (or Proposition 1 in the Appendix),

Y ((1 - λ) x + λ y) = (1 - λ') Y (x) + λ' Y (y),

with $λ' \in [0, 1]$ , since $λ'$ depends on d (which is identical for all yields), but not on $c_{1}, \dots, c_{ℓ}$ , and the map $λ \mapsto λ'$ is bijective.

By Lemma 3 in the Appendix (extended to yield vectors), if all yields are bounded on $P_{>}$ , then the YS is contained in the polytope generated by the yield vectors of generators of $P_{\geq}$ (with defined yield). It can be shown that the closure of the YS equals this polytope. If moreover all yields are attained on $P_{>}$ , as in most realistic applications, then the YS equals this polytope.

It remains to specify generators of PPs and YSs. Clearly, a PP, that is, the projection of a flux polyhedron $P$ , is given by all convex combinations of projections of generators of $P$ . Similarly, a YS, that is, the set of yield vectors on the domain $P_{>}$ , is given by all convex combinations of yield vectors of generators of $P_{\geq}$ .

4.3. EFMs and EFVs in phase planes and yield spaces

As just discussed, generators of the flux polyhedron $P$ (or the polyhedron $P_{\geq}$ ) can be used to generate the PP and the YS. Due to their special properties, we suggest to use EFVs (or EFMs in the special case of a flux cone) as generators. The PP then results as the convex hull of the projected EFVs. The PP can be bounded (if the projected rates are bounded), see Fig. 3(c) and (e), or unbounded, see Fig. 3(a). The YS arises as the convex hull of the mapped EFVs (or EFMs) with defined yield. Yields of interest typically refer to ratios of product(s) excreted vs. substrates(s) taken up. In realistic applications, the YS is finite even if fluxes are unbounded, cf. Figs. 3(a) and (b). If a metabolic model is not formulated properly, the YS may become unbounded.

Since PPs and YSs are convex hulls of projected/mapped EFVs (or EFMs in flux cones), all vertices of a PP or a YS correspond to EFVs (or EFMs, respectively), see Fig. 3(a)–(f). In the YS of scenario S1, all EFMs lie on the boundary of the YS, cf. Fig. 3(b), while EFMs of more complex and realistic networks may also lie in the interior of the YS. The point (0,0) represents EFM1, a mode that converts S straight into Q. Again, note that the cycle EFM6 does not have a corresponding point in the YS since it has zero substrate uptake $(r_{1})$ and product synthesis $(r_{3}, r_{4})$ rates and thus undefined yields.

Generally, flux vectors with undefined yield (with zero numerator and denominator in the yield function), including the zero flux vector, cannot be mapped to the YS. The YS becomes unbounded (and thus indicates either ill-posed models or ill-posed yields) only if one yield in the YS is infinite due to the existence of flux vectors with product synthesis (nonzero numerator), but without substrate uptake (zero denominator).

For scenario S2, the EFVs are shown in the PP and the YS in Fig. 3(c) and (d), respectively. In contrast to S1, the PP is now bounded in $r_{4}$ and $r_{3}$ . It would be unbounded if one of the rates $r_{7}$ , $r_{10}$ , or $r_{11}$ was used for projection. While the zero vector (EFV23) and the cycle (EFV22) are not represented in the YS, they are contained in the PP: in fact, they are projected to the point (0,0), as EFM7. In the YS, the point (0,0) corresponds only to EFV7 involving reactions R1, R2, and R8.

4.4. Computation of phase planes and yield spaces

As explained above, PPs and YSs can be computed as convex hulls of projected/mapped EFVs (or EFMs). In genome-scale metabolic models (GSMMs), however, an enumeration of all these generating vectors is usually not feasible. Nevertheless, in two or three dimensions, PPs and YSs of GSMMs can readily be obtained. PPs can either be approximated by sampling their boundary, using flux variability analysis, or even be exactly computed, e.g., by the convex hull method (Huynh et al., 1992, Lassez and Lassez, 1990). A pseudo-code for sampling the boundary of a PP is given by Algorithm 1 in Table 2.

Table 2.

Pseudo codes of sampling algorithms for a two-dimensional PP and a two-dimensional YS. Note that in Algorithm 2 optimization is carried out over the auxiliary polyhedron $P'$ rather than the flux polyhedron $P$ . Otherwise, the two algorithms are structurally identical.

Open in a new tab

To the best of our knowledge YSs of GSMMs have never been determined or studied in the literature. In fact, a YS can be computed similarly to a PP, thereby using the LP equivalent to the linear-fractional program (LFP), in particular, the auxiliary polyhedron $P'$ , cf. Section 2.2. A pseudo-code for sampling the boundary of a YS is given by Algorithm 2 in Table 2.

A discretization parameter of $n = 20$ is often sufficient to approximate PPs and YSs of GSMMs, usually requiring less than one minute computation time.

4.5. Implementation in CellNetAnalyzer

Our MATLAB^® toolbox CellNetAnalyzer (von Kamp et al., 2017, Klamt et al., 2007) supports the maximization of both linear (rate) functions (1) and linear-fractional (yield) functions (4) in GSMMs. Moreover, CellNetAnalyzer allows one to study rate- and yield-optimal solution sets by means of EFM and EFV analysis if the computation of these generating vectors is feasible. Finally, CellNetAnalyzer supports the computation and visualization of PPs and YSs either exactly via EFVs (or EFMs) or, as a recent extension for GSMMs, approximately via the sampling algorithms described above.

5. Production envelopes and yield spaces in strain design

Phase plane (PP) and yield space (YS) analysis are valuable tools in metabolic engineering and (computational) strain design to study the trade-off between biomass and product synthesis in production organisms. Accordingly, growth rate and product synthesis rate are usually chosen to span the PP. In the context of metabolic engineering, PPs are often called production envelopes (PEs) or trade-off plots (Machado and Herrgard, 2015, Maranas and Zomorrodi, 2016). In the following two sections, we will exclusively use the term PEs for PPs. Likewise, biomass and product yields typically span YSs of microbial cell factories.

Various constraint-based modeling methods have been developed to predict rational metabolic intervention strategies that improve the performance of production strains, see Machado and Herrgard, 2015, Maia et al., 2016 for recent reviews. Most of them rely on the concept of (stoichiometric) coupling of growth with product synthesis (Burgard et al., 2003, Klamt and Mahadevan, 2015, von Kamp and Klamt, 2017). Thereby, constraint-based modeling methods are roughly divided into two groups: biased and unbiased approaches (Lewis et al., 2012, Machado and Herrgard, 2015, Maia et al., 2016).

5.1. Biased strain design

Biased methods, such as OptKnock (Burgard et al., 2003), OptReg (Pharkya and Maranas, 2006), OptORF (Kim and Reed, 2010), RobustKnock (Tepper and Shlomi, 2010), and others, rely on the assumption that wild type cells as well as constructed mutant strains optimize their metabolism with respect to a fitness function, usually some kind of growth (rate) optimization. Thus, these methods usually operate on PEs, which they try to (re)shape, e.g., by gene/reaction knockouts, such that a sufficiently high product synthesis rate is achieved at optimal growth. Suboptimal growth states are considered less relevant. For instance, the (wild type) strain under the constraints of scenario S3 depicted in Fig. 3(e) already has such a desired shape. At maximal growth rate $r_{4} = 7.5$ we can expect a product yield of 0.25 and a production rate of $r_{3} = 2.5$ . If this performance is sufficient, no intervention will be required. However, if – for whatever reasons – the strain grows suboptimally with less than the maximal growth rate, then little or even no product may be synthesized.

Scenario S4. Ideally, the PE of designer strains should contain no solutions on the x-axis because then net product synthesis for any nonzero growth rate and thus (strong) growth coupling would be guaranteed. Fig. 5(a) illustrates such a PE for the example network in Fig. 1 (with bounded substrate uptake as in scenario S2) where reaction R5 has been knocked-out. However, flux distributions with low or even zero yields for P and B still exist [cf. Fig. 5(b)], namely if the substrate is converted to Q.

Fig. 5 — PEs (blue) and YSs (red) for the engineering scenarios S4–S6 of the example network in Fig. 1. The circles correspond to (possibly multiple) EFVs (*cf.*Table 1).

5.2. Unbiased strain design

Unbiased strain design algorithms, such as minimal metabolic functionalities (Trinh et al., 2008), FluxDesign (Melzer et al., 2009), or minimal cut sets (Klamt and Gilles, 2004, Hädicke and Klamt, 2011, Jungreuthmayer and Zanghellini, 2012), were originally introduced in the context of elementary flux mode (EFM) analysis (for flux cones), restricting their applicability to medium-scale metabolic models without inhomogeneous constraints (flux bounds etc.). While PE analysis is meaningless in unbounded flux cones [see Fig. 3(a)], yields of the respective EFMs are well defined and YS analysis therefore played a central role for unbiased strain design. We illustrate the use of YSs to identify metabolic intervention strategies based on constrained minimal cut sets (MCSs). A constrained MCS is a set of (reaction) knockouts blocking undesired while maintaining desired phenotypes (Hädicke and Klamt, 2011).

Scenario S5. In unbiased strain designs, it is often demanded that all EFMs/elementary flux vectors (EFVs) with low product yield are removed from the network. A prototypical YS of this type is illustrated in Fig. 5(d). Such a design guarantees a minimum product yield for every flux vector consuming substrate, in particular, for every flux vector with nonzero growth rate. A suitable MCS that achieves this design is knocking out R5 and R8. Note that although the PEs in Fig. 5(a) and (c) are identical, they represent different states, best illustrated by looking at the solutions projected to (0,0). In both cases the zero vector (EFV23; Table 1) as well as the cycle EFV22 (R7, R10, and R11) are projected to (0,0). However, in the design of Fig. 5(a) it also includes vectors that convert all available S to Q (EFV7 in Table 1). Such fluxes are infeasible in Fig. 5(c). In that way the second design guarantees strong coupling, i.e., high product yields for every flux distribution in the mutant. In fact, it was recently shown that such a growth-coupled overproduction is, in principle, feasible for almost all metabolites in five major host organisms (von Kamp and Klamt, 2017). Hence, the design principle aiming at strong yield coupling of growth and product synthesis has wide applicability.

The computational difficulties associated with MCS analysis (i.e., the restriction to flux cones without inhomogeneous constraints and to medium-scale metabolic models) were resolved in recent years. First of all, by generalizing EFMs to EFVs (Klamt et al., 2017), MCSs can now be computed from the set of EFVs of general flux polyhedra with the same established algorithms developed for the computation of MCSs from EFMs of flux cones (Jungreuthmayer et al., 2013, Hädicke and Klamt, 2011). Furthermore, by transforming a metabolic network to its so-called dual network, a preceding computation of EFMs or EFVs is not needed anymore, and it is now possible to directly compute the smallest MCSs subject to arbitrary linear constraints even in genome-scale networks (von Kamp and Klamt, 2014). Thus, current MCS methods allow to specify both yield and rate constraints in order to obtain the favored shape of the PE and YS of the desired strains. Due to its generality, the method of MCSs can also be employed to find biased intervention strategies as in scenario S4 [Fig. 5(a) and (b)].

Although strong growth coupling can often be achieved in the YS, neither biased nor unbiased strain design methods can be used to find knockout strategies that guarantee high synthesis rates of the target product. In fact, if the zero flux vector is part of the wild-type solution set, it cannot be removed by just knocking out reactions since deleting reactions cannot enforce a minimum substrate uptake (and thus not a nonzero production rate). However, a highly desirable PE as in Fig. 5(e) would be possible if we were able to upregulate certain fluxes, for example, by overexpressing the associated enzymes. Mathematically, this translates to introducing positive lower bounds for the absolute magnitude of these fluxes.

Scenario S6. Enforcing an additional lower bound of $r_{6} \geq 3$ for the strain of scenario S5 will assure a minimum flux of $r_{3} \geq 3$ for P(roduct) formation and thus lead to a PE as in Fig. 5(e).

Computational strain design methods that allow one to predict targeted upregulation of certain fluxes have already been proposed (Mahadevan et al., 2015, Jungreuthmayer and Zanghellini, 2012, Ranganathan et al., 2010, Pharkya and Maranas, 2006). However, experimental implementation of such strains with guaranteed upregulated fluxes is usually much more difficult (if not impossible) than “just” deleting genes or reactions.

6. Examples of production envelopes and yield spaces in E. coli and their use for strain design

In the following we use a core and a genome-scale metabolic model (GSMM) of E. coli to illustrate the value of production envelopes (PEs) and yield spaces (YSs) in analyzing and designing metabolic networks.

6.1. Acetate production in E. coli

We use EColiCore2 (ECC2) (Hädicke and Klamt, 2017), a recently published core network of the central metabolism in E. coli, which reproduces key properties of its genome-scale parent model iJO1366 (Orth et al., 2011).

As an example, we analyze the trade-off between biomass and acetate production in ECC2 for growth on glucose (glc) (standard scenario). Initially, we consider the biomass-acetate YS for the flux cone, (without any inhomogeneous constraints). ECC2 is small enough, and we can compute the 558,647 elementary flux modes (EFMs) of its flux cone and map their specific biomass and acetate yields onto the YS as shown in Fig. 6(a). The maximal acetate yield is 2 mmol/(mmol glc) and the maximal biomass yield is close to 0.1 gDW/(mmol glc). Clearly, maximal acetate or maximal biomass yields imply zero production of the other. A PE analysis is not possible as the flux cone is unbounded.

Fig. 6 — YSs and PE for the production of acetate in *E. coli*, computed for the metabolic core model EColiCore2. (a) YS for acetate and biomass for the flux cone (*i.e.*, without flux bounds), computed by mapping the EFMs of the flux cone (shown as blue dots). (b) YS and (c) PE for acetate and biomass for the flux polyhedron with flux bounds for substrate uptake $(r_{GlcUp})$ , non-growth associated maintenance ATP demand $(r_{ATPmaint})$ , and oxygen uptake $(r_{O 2 Up})$ . YS (b) and PE (c) were computed by projecting the EFVs of the flux polyhedron (shown as blue dots). In (b) and (c), colors indicate the location of optimal flux vectors; red: maximal acetate yield; yellow: maximal biomass yield; green: maximal acetate synthesis rate; gray: maximal growth rate.

For a more realistic scenario, we introduce a maximal glucose uptake rate of 10 mmol/gDW/h and a non-growth associated, maintenance demand of adenosine triphosphate (ATP) of at least 3.15 mmol/gDW/h. In addition to these standard flux bounds, we assume an oxygen-limited culture with a maximal oxygen uptake rate of 5 mmol/gDW/h. With these three bounds, the flux cone turns into a flux polyhedron, and growth and acetate synthesis rates are now bounded. The resulting flux polyhedron is characterized by 904,599 elementary flux vectors (EFVs), and we can now analyze the YS [Fig. 6(b)] as well as the PE [Fig. 6(c)].

The YS of the flux polyhedron is very similar to that of the flux cone. However, we notice that the biomass yield (reached by 2 EFVs) gets slightly reduced to 0.0945 gDW/(mmol glc) due to the non-growth associated maintenance demand of ATP, which must be produced from glucose, thus reducing the amount of substrate available for biomass synthesis. In contrast, the maximum yield of acetate (exhibited by 676 EFVs) remains constant at 2 mmol/(mmol glc) because ATP can be produced as side product of acetate synthesis.

Comparing YS and PE of the flux polyhedron [Figs. 6(b) and (c)], we see that their shape is quite different. Growth with maximal rate (2 EFVs) is coupled to the production of some acetate while maximal acetate synthesis (748 EFVs) is only achievable without growth. Mapping the EFVs with maximal yields (biomass: yellow; acetate: red) to the PE and, in the other direction, the EFVs with optimal rates (growth: gray; acetate synthesis: green) to the YS reveals non-intuitive relationships. The maximal acetate yield of 2 mmol/(mmol glc) can be reached for acetate production rates between 0.573 and 10 mmol/gDW/h. The lower bound describes the minimum amount of acetate to be produced to reach the maximum acetate yield while simultaneously forming sufficient amounts of ATP for non-growth associated maintenance. The upper bound is a consequence of the limited availability of oxygen as electron sink. At an acetate production rate of 10 mmol/gDW/h, the maximum amount of oxygen has been utilized, and higher acetate production rates require the simultaneous production of fermentation products in order to balance redox, which reduces the acetate yield by 50%. For this reason, flux vectors with rate-optimal acetate synthesis (15 mmol/gDW/h, bounded by the maximum substrate uptake rate) have a reduced yield of 1.5 mmol/(mmol glc) and are thus not yield-optimal.

The situation is similar, but not fully analogous for biomass as product. There are two biomass-yield-optimal EFVs. They use the maximum amount of oxygen available, but only a fraction (i.e., 2.605 mmol/gDW/h) of the possible maximum substrate uptake rate. For larger substrate uptake rates, fermentative pathways with lower biomass yields would have to be used. In contrast to acetate, there is exactly one growth rate (0.246 h⁻¹) at which this maximal biomass yield can be reached. For lower growth rates, the relative proportion of the substrate to be used for ATP synthesis for non-growth associated maintenance is larger, thus resulting in lower biomass yields. Finally, the two EFVs with maximal growth rate (0.530 h⁻¹) are located in interior of the YS; they are not lying on the boundary, i.e., there are flux vectors with the same biomass yield and higher acetate yield (but the respective rates are lower in these flux vectors).

6.2. Ethanol production in a E. coli genome-scale model

As mentioned before, the analysis of PEs and YSs is also feasible in GSMMs, where an enumeration of EFMs and EFVs is usually computationally impracticable. In the Supplementary material, Text S1, an example is presented, where we study the trade-off between biomass and ethanol production in the E. coli GSMM iJO1366 (Orth et al., 2011). We use CellNetAnalyzer to compute the biomass-ethanol YS and PE (see Fig. S3) via the approximative algorithms given in Section 4.4. In Text S1, we also show a scenario where the maximal specific product synthesis rate can be reached only with maximal product yield and maximal growth rate only with maximal biomass yield (which was not the case in the acetate example discussed above).

6.3. Designing E. coli acetate producer strains

If we now aim to design an E. coli strain for acetate production (with the same core model and flux bounds as used in Fig. 6(b) and (c)) we may apply (biased and unbiased) strain design strategies which differ in the specifications of undesired and protected regions in the PE and/or YS.

Design D1. In the sense of a biased strain design, we may demand a minimum acetate production rate of 10 mmol/gDW/h if the cell grows with maximal growth rate. In fact, the PE (Fig. 6(c)) shows that, given the environmental constraints of low oxygen availability, this is already fulfilled in the wild type.

Design D2. However, we might fear that the maximum growth rate is not reached by the strain which could then lead to lower or even zero production rates of acetate. We could therefore demand a stronger coupling in the sense that the cell must produce acetate whenever it grows, hence, we search for interventions enforcing a PE similar as in Fig. 5(a). To achieve this we draw a line in the PE that starts at (0,0) and has a certain slope, i.e., a certain ratio (yield) of acetate excretion vs. growth rate, see Fig. 7(a) and (b). We may chose a slope of 20 mmol acetate/gDW; the solutions with maximum growth rate are then close to, but still above this line. We now specify all solutions below this line as undesired metabolic behaviors (that must be eliminated); further, all solutions that are above this line and have a minimum growth rate of 0.2 h⁻¹ as desired phenotypes (of which at least one must be kept). See Fig. 7(a) and (b); individual constraints are illustrated in Fig. S2. Based on this partitioning, we can compute the minimal cut set (MCS), Fig. 7(c) and (d), and observe that the minimum number of reaction knockouts is three. However, with these MCS, a low or even zero acetate yield would still be possible if the cell does not grow, and a low nonzero acetate yield for small growth rates, see example MCS in Fig. 7(c) and (d).

Fig. 7 — Designing PE and YS for the production of acetate in *E. coli*, according to design strategy D2. Green dots represent desired EFVs (at least one of which must be kept), red dots represent target EFVs (all of which must deleted), and blue dots represent neutral EFVs (which do not interfere with the design objective and therefore may or may not be present in the final design). (a) and (b) specify the desired phenotype, whereas (c) and (d) show the actual structure of the PE and the YS of an exemplary quintuple mutant. In all panels, thin black lines represent the boundaries of the PEs and YSs.

Design D3. In an unbiased design approach, we therefore demand a minimum acetate yield of 1.5 mmol/(mmol glc) and again a minimum growth rate of 0.2 h⁻¹. See Fig. 8(a) and (b); individual constraints are illustrated in Fig. S2. Thus, for separating desired and undesired phenotypes, we set a horizontal line in the YS and a vertical line in the PE. This combination of design constraints leads to MCSs with a minimum number of seven reaction knockouts, now guaranteeing a high acetate yield, see Fig. 8(c) and (d). Generally, coupling growth with high product yields usually requires more interventions than demanding only coupling growth and product synthesis rates in the PE.

Fig. 8 — Designing PE and YS for the production of acetate in *E. coli*, according to design strategy D3. Green dots represent desired EFVs (at least one of which must be kept), red dots represent target EFVs (all of which must deleted), and blue dots represent neutral EFVs (which do not interfere with the design objective and therefore may or may not be present in the final design). (a) and (b) specify the desired phenotype, whereas (c) and (d) show the actual structure of the PE and the YS of an exemplary quintuple mutant. In all panels, thin black lines represent the boundaries of the PEs and YSs.

7. Conclusions

Rates and yields of biomass or/and product synthesis are fundamental performance indicators of biotransformation processes. While flux-balance analysis (FBA) provides an established theoretical tool to analyze and predict (optimal) metabolic rates (Lewis et al., 2012) and to design microbial cell factories for optimal (specific) productivity (Maia et al., 2016), similar methods and a rigorous mathematical framework were so far missing for the analysis of optimal metabolic yields. In fact, in the context of FBA, rate and yield optimization were often considered equivalent and FBA, i.e. the maximization of rates, was frequently used to also compute yield-optimal solutions. In flux cones, that is, in models where all fluxes are unbounded, setting the substrate uptake rate to a fixed (non-zero) value and then maximizing the product synthesis rate indeed leads to a yield-optimal solution (Schuster et al., 2008, Santos et al., 2011). Equivalent to this FBA-based approach, elementary flux modes (EFMs) have often been used to identify metabolic pathways with optimal yields in flux cones (Prauße et al., 2016). However, in general, the situation is more complex. In applications, constraint-based models typically do contain inhomogeneous (non-zero) flux bounds (Oberhardt et al., 2009), for example, substrate (and/or oxygen) uptake rates within a certain (bounded) range, a minimum ATP maintenance demand rate, etc., which change the solution set from a flux cone to a flux polyhedron. As we unambiguously showed (for our example network, scenario S3, and for acetate synthesis in E. coli under oxygen-limited conditions), FBA cannot be used for finding yield-optimal solutions, in general. FBA always identifies rate-optimal solutions, which sometimes (e.g., when fixing the substrate uptake rate to a specific value), but not always, coincide with yield-optimal solutions.

For the general case, we derived several theoretical results that establish a framework for yield analysis and yield optimization in constraint-based metabolic models:

1.
Rather than an ordinary linear program (LP), yield optimization in metabolic networks requires the solution of a linear-fractional program (LFP) for a correct mathematical treatment. Since an LFP can be converted into an LP, yield optimization can efficiently be performed even for genome-scale metabolic models (GSMMs).
2.
Production envelopes (PEs) and yield spaces (YSs) are invaluable tools for the rational design of optimal cell factories in metabolic engineering, although they have been confused at times. Indeed, PEs and YSs sometimes, but not always, have similar shapes although they carry different information. Moreover, we demonstrated that also YSs can readily be computed in GSMMs.
3.
For characterizing yield-optimal solution sets and yield spaces in metabolic networks, elementary flux vectors (EFVs) (or EFMs in case of a flux cone) are extremely useful. It was already known that the set of rate-optimal solutions is spanned by the rate-optimal EFVs (Kelk et al., 2012, Klamt et al., 2017). Similarly, we showed that yield-optimal solutions are convex/conical sums of the yield-optimal EFVs and of EFVs that neither take up substrate nor excrete the product (EFVs with undefined yield). These observations reinforce the fundamental importance of EFVs (or EFMs) as the “coordinates of metabolism” in constraint-based modeling (Zanghellini et al., 2013). Despite the fact that EFMs/EFVs cannot be computed in GSMMs, it is important to understand how they shape yield-optimal and rate-optimal solution sets in metabolic networks.

The methods and algorithms developed are available in our MATLAB toolbox CellNetAnalyzer and add an essential building block for constraint-based metabolic modeling and computational strain design.

Acknowledgments

We are grateful to Stefan Schuster (University of Jena) for inspiring discussions on yield- vs. rate-optimal solutions in constraint-based modeling.

This work was supported by the European Research Council (ERC Consolidator Grant 721176) and the German Federal Ministry of Education and Research (FKZ: 031L104B and 031A180B) [S.K.]; the Austrian Science Fund (FWF), project P28406 [S.M.]; the FWF, project P27229 [G.R.]; the Austrian Federal Ministry of Science, Research and Economy (BMWFW), the Austrian Federal Ministry of Traffic, Innovation and Technology (bmvit), the Styrian Business Promotion Agency (SFG), the Standortagentur Tirol, the Government of Lower Austria, and the Technology Agency of the City of Vienna (ZIT) through the COMET-Funding Program managed by the Austrian Research Promotion Agency (FFG), project 23071 [J.Z.].

Footnotes

Supplementary data associated with this article can be found in the online version at doi:10.1016/j.ymben.2018.02.001.

Contributor Information

Steffen Klamt, Email: klamt@mpi-magdeburg.mpg.de.

Stefan Müller, Email: st.mueller@univie.ac.at.

Georg Regensburger, Email: georg.regensburger@jku.at.

Jürgen Zanghellini, Email: juergen.zanghellini@boku.ac.at.

Appendix: Mathematical results and proofs

The yield function

Y (x) = \frac{c^{T} x}{d^{T} x}

is not linear, not even convex (or concave). Still, it is convex in the following sense:

Proposition 1

Let $x, y \in D$ and $λ \in [0, 1]$ . Then for

$z = (1 - λ) x + λ y \in D$

on the line segment between x and y, the yield amounts to

$Y (z) = (1 - λ') Y (x) + λ' Y (y)$

with

$λ' = \frac{λ d^{T} y}{(1 - λ) d^{T} x + λ d^{T} y} \in [0, 1] .$

Moreover, the map $λ \mapsto λ'$ is bijective. In particular,

$λ' = 0 if and only if λ = 0 .$

Proof

$\begin{matrix} Y (z) & = & \frac{c^{T} z}{d^{T} z} \\ = & \frac{(1 - λ) c^{T} x + λ c^{T} y}{(1 - λ) d^{T} x + λ d^{T} y} \\ = & \frac{(1 - λ) d^{T} x}{(1 - λ) d^{T} x + λ d^{T} y} \frac{c^{T} x}{d^{T} x} \\ + \frac{λ d^{T} y}{(1 - λ) d^{T} x + λ d^{T} y} \frac{c^{T} y}{d^{T} y} \\ = & (1 - λ') Y (x) + λ' Y (y) \end{matrix}$

and

$\frac{d λ'}{d λ} = \frac{d^{T} x d^{T} y}{({(1 - λ) d^{T} x + λ d^{T} y)}^{2}} > 0 .$

▫

The definition of the yield directly implies the following two results. On the one hand, if there is a vector in $P_{\geq}$ with unbounded yield, then Y is unbounded on $P_{>}$ .

Lemma 1

Let $x \in P_{>}$ , $y \in P_{\pm / 0}$ , and $λ \in [0, 1)$ . Then for

$z = (1 - λ) x + λ y \in P_{>}$

on the half-open line segment between x and y, the yield amounts to

$Y (z) = Y (x) + \frac{λ}{1 - λ} \frac{c^{T} y}{d^{T} x} .$

In particular,

$\lim_{λ \to 1} Y (z) = \pm \infty .$

On the other hand, the addition of a vector in $P_{\geq}$ with undefined yield to a vector in $P_{>}$ does not change the yield.

Lemma 2

Let $x \in P_{>}$ and $y \in P_{0 / 0}$ . Then,

$Y (x + y) = Y (x) .$

As a consequence, if Y is bounded on $P_{>}$ , then generators of $P_{\geq}$ cannot have unbounded yield, but may have undefined yield.

In fact, if Y is bounded, then the yield of a vector in $P_{>}$ is a convex sum of the yields of generators of $P_{\geq}$ with defined yield.

Lemma 3

Let $x \in P_{>}$ be written as in (18), that is,

$x = \sum_{i \in I} α_{i} v^{i} + \sum_{j \in J} β_{j} u^{j}$

with $\sum_{i \in I} α_{i} = 1$ . If Y is bounded on $P_{>}$ , then

$Y (x) = \sum_{i \in I^{d}} {α'}_{i} Y (v^{i}) + \sum_{j \in J^{d}} {β'}_{j} Y (u^{j})$

with $I^{d}$ and $J^{d}$ as in (20),

${α'}_{i}, {β'}_{j} \geq 0, and \sum_{i \in I^{d}} {α'}_{i} + \sum_{j \in J^{d}} {β'}_{j} = 1 .$

In particular,

$\begin{matrix} {α'}_{i} = 0 if and only if α_{i} = 0, \\ {β'}_{j} = 0 if and only if β_{j} = 0, \end{matrix}$

and

$\sum_{i \in I^{d}} {α'}_{i} > 0 if I^{d} = I .$

Proof

By Lemma 1, if Y is bounded on $P_{>}$ , then generators of $P_{\geq}$ cannot have unbounded yield. By Lemma 2, the addition of vectors with undefined yield does not change the yield. Hence

$Y (x) = Y (x^{d})$

with

$x^{d} = \sum_{i \in I^{d}} α_{i} v^{i} + \sum_{j \in J^{d}} β_{j} u^{j} \in P_{>} .$

Now, consider

$x' = λ x^{d} = \sum_{i \in I^{d}} (λ α_{i}) v^{i} + \sum_{j \in J^{d}} (λ β_{j}) u^{j} \in D$

with

$λ = \frac{1}{\sum_{i \in I^{d}} α_{i} + \sum_{j \in J^{d}} β_{j}} > 0$

and hence

$\sum_{i \in I^{d}} λ α_{i} + \sum_{j \in J^{d}} λ β_{j} = 1 .$

Using Eq. (15),

$Y (x^{d}) = Y (x^{'}),$

and using Proposition 1 (inductively),

$Y (x') = \sum_{i \in I^{d}} {α'}_{i} Y (v^{i}) + \sum_{j \in J^{d}} {β'}_{j} Y (u^{j})$

with

${α'}_{i}, {β'}_{j} \in [0, 1], \sum_{i \in I^{d}} {α'}_{i} + \sum_{j \in J^{d}} {β'}_{j} = 1,$

and

$\begin{matrix} {α'}_{i} = 0 if and only if α_{i} = 0, \\ {β'}_{j} = 0 if and only if β_{j} = 0 . \end{matrix}$

If $I^{d} = I$ , then

$\sum_{i \in I^{d}} α_{i} = \sum_{i \in I} α_{i} = 1$

and hence

$\sum_{i \in I^{d}} {α'}_{i} > 0 .$

▫

Proof of Theorem 1

Let $x \in P_{>}$ be written as in (18), that is,

$x = \sum_{i \in I} α_{i} v^{i} + \sum_{j \in J} β_{j} u^{j}$

with $\sum_{i \in I} α_{i} = 1$ . By Lemma 3, the yield of $x \in P_{>}$ is a convex sum of the yields of generators with defined yield, that is,

$Y (x) = \sum_{i \in I^{d}} {α'}_{i} Y (v^{i}) + \sum_{j \in J^{d}} {β'}_{j} Y (u^{j})$

with

${α'}_{i}, {β'}_{j} \in [0, 1] a n d \sum_{i \in I^{d}} {α'}_{i} + \sum_{j \in J^{d}} {β'}_{j} = 1 .$

If

$I^{d} = I and \max_{i \in I^{d}} Y (v^{i}) < \max_{j \in J^{d}} Y (u^{j}) =: Y^{*},$

then

$\sum_{i \in I^{d}} {α'}_{i} > 0 and hence Y (x) < Y^{*}$

for all $x \in P_{>}$ .

Now, consider

$x = v + β_{j} u^{j} \in P_{>}$

with $v \in P_{>}$ and $Y (u^{j}) = Y^{*}$ and

$x' = \frac{x}{1 + β_{j}} = (1 - λ) v + λ u^{j} \in D$

with

$λ = \frac{β_{j}}{1 + β_{j}} \in [0, 1) .$

Using Eq. (15),

$Y (x') = Y (x),$

and using Proposition 1,

$Y (x') = (1 - λ') Y (v) + λ' Y (u^{j}),$

where $λ' = 1$ if and only if $λ = 1$ . Hence, $β_{j} \to \infty$ implies $λ \to 1$ , $λ' \to 1$ , and

$Y (x) = Y (x') \to Y (u^{j}) = Y^{*} .$

That is, $Y^{*}$ is the supremum.

Conversely, if $I^{d} \neq I$ or $\max_{i \in I^{d}} Y (v^{i}) \geq \max_{j \in J^{d}} Y (u^{j})$ , then the maximum is attained. ▫

Proof of Theorem 2

By Lemma 3, the yield of $x \in P_{>}$ is a convex sum of the yields of generators of $P_{\geq}$ with defined yield. Since the maximum yield is attained, generators with defined, but not maximum yield do not contribute to the optimal solution. Hence, every optimal solution is a convex/conical sum of generators with maximum or undefined yield – and vice versa. Thereby, at least one generator with maximum yield contributes to the sum. ▫

Supplementary material

Application 1

mmc1.pdf^{(1.1MB, pdf)}

References

Bordbar A., Monk J.M., King Z.A., Palsson B.O. Constraint-based models predict metabolic and associated cellular functions. Nat. Rev. Genet. 2014;15:107–120. doi: 10.1038/nrg3643. [DOI] [PubMed] [Google Scholar]
Boyd S., Vandenberghe L. Cambridge University Press; Cambridge: 2004. Convex Optimization. [Google Scholar]
Burgard A.P., Pharkya P., Maranas C.D. Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization. Biotechnol. Bioeng. 2003;84:647–657. doi: 10.1002/bit.10803. [DOI] [PubMed] [Google Scholar]
Burgard A.P., Nikolaev E.V., Schilling C.H., Maranas C.D. Flux coupling analysis of genome-scale metabolic network reconstructions. Genome Res. 2004;14:301–312. doi: 10.1101/gr.1926504. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fell D.A., Small J.R. Fat synthesis in adipose tissue. An examination of stoichiometric constraints. Biochem. J. 1986;238:781–786. doi: 10.1042/bj2380781. [DOI] [PMC free article] [PubMed] [Google Scholar]
Frenk J.B.G., Schaible S. Fractional programming. In: Hadjisavvas Nicolas, Komlósi Sándor, Schaible Siegfried., editors. Handbook of Generalized Convexity and Generalized Monotonicity, Volume 76 of Nonconvex Optim. Appl. Springer; New York,: 2005. pp. 335–386. [Google Scholar]
Gianchandani E.P., Oberhardt M.A., Burgard A.P., Maranas C.D., Papin J.A. Predicting biological system objectives de novo from internal state measurements. BMC Bioinforma. 2008;9:43. doi: 10.1186/1471-2105-9-43. [DOI] [PMC free article] [PubMed] [Google Scholar]
Goel A., Wortel M.T., Molenaar D., Teusink B. Metabolic shifts: a fitness perspective for microbial cell factories. Biotechnol. Lett. 2012;34:2147–2160. doi: 10.1007/s10529-012-1038-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
Hädicke O., Klamt S. Computing complex metabolic intervention strategies using constrained minimal cut sets. Metab. Eng. 2011;13:204–213. doi: 10.1016/j.ymben.2010.12.004. [DOI] [PubMed] [Google Scholar]
Hädicke O., Klamt S. EColiCore2: a reference network model of the central metabolism of Escherichia coli and relationships to its genome-scale parent model. Sci. Rep. 2017;7:39647. doi: 10.1038/srep39647. [DOI] [PMC free article] [PubMed] [Google Scholar]
Huynh T., Lassez C., Lassez J.-L. Practical issues on the projection of polyhedral sets. Ann. Math. Artif. Intell. 1992;6:295–315. [Google Scholar]
Jungreuthmayer C., Zanghellini J. Designing optimal cell factories: integer programming couples elementary mode analysis with regulation, BMC. Syst. Biol. 2012;6:103. doi: 10.1186/1752-0509-6-103. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jungreuthmayer C., Nair G., Klamt S., Zanghellini J. Comparison and improvement of algorithms for computing minimal cut sets. BMC Bioinforma. 2013;14:318. doi: 10.1186/1471-2105-14-318. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kelk S.M., Olivier B.G., Stougie L., Bruggeman F.J. Optimal flux spaces of genome-scale stoichiometric models are determined by a few subnetworks. Sci. Rep. 2012;2:580. doi: 10.1038/srep00580. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim J., Reed J. OptORF: optimal metabolic and regulatory perturbations for metabolic engineering of microbial strains. BMC Syst. Biol. 2010;4:53. doi: 10.1186/1752-0509-4-53. [DOI] [PMC free article] [PubMed] [Google Scholar]
Klamt S., Gilles E.D. Minimal cut sets in biochemical reaction networks. Bioinformatics. 2004;20:226–234. doi: 10.1093/bioinformatics/btg395. [DOI] [PubMed] [Google Scholar]
Klamt S., Mahadevan R. On the feasibility of growth-coupled product synthesis in microbial strains. Metab. Eng. 2015;30:166–178. doi: 10.1016/j.ymben.2015.05.006. [DOI] [PubMed] [Google Scholar]
Klamt S., Saez-Rodriguez J., Gilles E. Structural and functional analysis of cellular networks with CellNetAnalyzer. BMC Syst. Biol. 2007;1:2. doi: 10.1186/1752-0509-1-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
Klamt S., Regensburger G., Gerstl M.P., Jungreuthmayer C., Schuster S., Mahadevan R., Zanghellini J., Müller S. From elementary flux modes to elementary flux vectors: metabolic pathway analysis with arbitrary linear flux constraints. PLOS Comput. Biol. 2017;13:e1005409. doi: 10.1371/journal.pcbi.1005409. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lassez C., Lassez J.-L. Quantifier Elimination for Conjunctions of Linear Constraints via a Convex Hull Algorithm. In: Donald Bruce Randall, Kapur Deepak, Mundy Joseph L., editors. Symbolic and Numerical Computation for Artificial Intelligence. Academic Press Limited; Oval Road London NW1: 1990. pp. 24–28. [Google Scholar]
Lewis N.E., Nagarajan H., Palsson B.O. Constraining the metabolic genotypephenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 2012;10:291–305. doi: 10.1038/nrmicro2737. [DOI] [PMC free article] [PubMed] [Google Scholar]
Müller S., Regensburger G. Elementary vectors and conformal sums in polyhedral geometry and their relevance for metabolic pathway analysis. Front. Genet. 2016;7:90. doi: 10.3389/fgene.2016.00090. [DOI] [PMC free article] [PubMed] [Google Scholar]
Machado D., Herrgard M.J. Co-evolution of strain design methods based on flux balance and elementary mode analysis. Metab. Eng. Commun. 2015;2:85–92. doi: 10.1016/j.meteno.2015.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mahadevan R., Kamp A.v., Klamt S. Genome-scale strain designs based on regulatory minimal cut sets. Bioinformatics. 2015;31:2844–2851. doi: 10.1093/bioinformatics/btv217. [DOI] [PubMed] [Google Scholar]
Maia P., Rocha M., Rocha I. In silico constraint-based strain optimization methods: the quest for optimal cell factories. Microbiol. Mol. Biol. Rev. 2016;80:45–67. doi: 10.1128/MMBR.00014-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
Maranas C.D., Zomorrodi A.R. 1 edition. John Wiley & Sons; Hoboken, New Jersey: 2016. Optimization Methods in Metabolic Networks. [Google Scholar]
Melzer G., Esfandabadi M.E., Franco-Lara E., Wittmann C. Flux design: in silico design of cell factories based on correlation of pathway fluxes to desired properties. BMC Syst. Biol. 2009;3:120. doi: 10.1186/1752-0509-3-120. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mori M., Hwa T., Martin O.C., De Martino A., Marinari E. Constrained allocation flux balance analysis. PLoS Comput. Biol. 2016;12:e1004913. doi: 10.1371/journal.pcbi.1004913. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nielsen J., Keasling J.D. Engineering cellular metabolism. Cell. 2016;164:1185–1197. doi: 10.1016/j.cell.2016.02.004. [DOI] [PubMed] [Google Scholar]
Oberhardt M.A., Palsson B.Ø., Papin J.A. Applications of genomescale metabolic reconstructions. Mol. Syst. Biol. 2009;5:320. doi: 10.1038/msb.2009.77. [DOI] [PMC free article] [PubMed] [Google Scholar]
Orth J.D., Thiele I., Palsson B.Ø. What is flux balance analysis? Nat. Biotechnol. 2010;28:245–248. doi: 10.1038/nbt.1614. [DOI] [PMC free article] [PubMed] [Google Scholar]
Orth J.D., Conrad T.M., Na J., Lerman J.A., Nam H., Feist A.M., Palsson B.O. A comprehensive genome-scale reconstruction of Escherichia coli metabolism – 2011. Mol. Syst. Biol. 2011;7 doi: 10.1038/msb.2011.65. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pharkya P., Maranas C.D. An optimization framework for identifying reaction activation/inhibition or elimination candidates for overproduction in microbial systems. Metab. Eng. 2006;8:1–13. doi: 10.1016/j.ymben.2005.08.003. [DOI] [PubMed] [Google Scholar]
Prauße M.T.E., Schäuble S., Guthke R., Schuster S. Computing the various pathways of penicillin synthesis and their molar yields. Biotechnol. Bioeng. 2016:173–181. doi: 10.1002/bit.25694. [DOI] [PubMed] [Google Scholar]
Ranganathan S., Suthers P.F., Maranas C.D. OptForce: an optimization procedure for identifying all genetic manipulations leading to targeted overproductions. PLoS Comput. Biol. 2010;6:e1000744. doi: 10.1371/journal.pcbi.1000744. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sanford K., Chotani G., Danielson N., Zahn J.A. Scaling up of renewable chemicals. Curr. Opin. Biotechnol. 2016;38:112–122. doi: 10.1016/j.copbio.2016.01.008. [DOI] [PubMed] [Google Scholar]
Santos F., Boele J., Teusink B. Chapter twenty-four – a practical guide to genome-scale metabolic models and their analysis. In: Jameson D., Verma M., Westerhoff H.V., editors. Methods in Enzymology, Volume 500 of Methods in Systems Biology. Academic Press; San Diego, CA, USA: 2011. pp. 509–532. [DOI] [PubMed] [Google Scholar]
Schuetz R., Kuepfer L., Sauer U. Systematic evaluation of objective functions for predicting intracellular fluxes in Escherichia coli. Mol. Syst. Biol. 2007;3 doi: 10.1038/msb4100162. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schuetz R., Zamboni N., Zampieri M., Heinemann M., Sauer U. Multidimensional optimality of microbial metabolism. Science. 2012;336:601–604. doi: 10.1126/science.1216882. [DOI] [PubMed] [Google Scholar]
Schuster S., Hilgetag C. On elementary flux modes in biochemical reaction systems at steady state. J. Biol. Syst. 1994;2:165–182. [Google Scholar]
Schuster S., Dandekar T., Fell D.A. Detection of elementary flux modes in biochemical networks: a promising tool for pathway analysis and metabolic engineering. Trends Biotechnol. 1999;17:53–60. doi: 10.1016/s0167-7799(98)01290-6. [DOI] [PubMed] [Google Scholar]
Schuster S., Fell D.A., Dandekar T. A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks. Nat. Biotech. 2000;18:326–332. doi: 10.1038/73786. [DOI] [PubMed] [Google Scholar]
Schuster S., Dandekar T., Mauch K., Reuss M., Fell D. Technological and Medical Implications of Metabolic Control Analysis, NATO Science Series. Springer; Dordrecht: 2000. Recent developments in metabolic pathway analysis and their potential implications for biotechnology and medicine; pp. 57–66. [Google Scholar]
Schuster S., Pfeiffer T., Fell D.A. Is maximization of molar yield in metabolic networks favoured by evolution? J. Theor. Biol. 2008;252:497–504. doi: 10.1016/j.jtbi.2007.12.008. [DOI] [PubMed] [Google Scholar]
Schuster S., de Figueiredo L.F., Schroeter A., Kaleta C. Combining metabolic pathway analysis with evolutionary game theory. Explaining the occurrence of low-yield pathways by an analytic optimization approach. Biosystems. 2011;105:147–153. doi: 10.1016/j.biosystems.2011.05.007. [DOI] [PubMed] [Google Scholar]
Schuster S., Boley D., Möller P., Stark H., Kaleta C. Mathematical models for explaining the Warburg effect: a review focussed on ATP and biomass production. Biochem. Soc. Trans. 2015;43:1187–1194. doi: 10.1042/BST20150153. [DOI] [PubMed] [Google Scholar]
Simeonidis E., Murabito E., Smallbone K., Westerhoff H.V. Why does yeast ferment? A flux balance analysis study. Biochem. Soc. Trans. 2010;38:1225. doi: 10.1042/BST0381225. [DOI] [PubMed] [Google Scholar]
Tepper N., Shlomi T. Predicting metabolic engineering knockout strategies for chemical production: accounting for competing pathways. Bioinformatics. 2010;26:536–543. doi: 10.1093/bioinformatics/btp704. [DOI] [PubMed] [Google Scholar]
Teusink B., Smid E.J. Modelling strategies for the industrial exploitation of lactic acid bacteria. Nat. Rev. Microbiol. 2006;4:46–56. doi: 10.1038/nrmicro1319. [DOI] [PubMed] [Google Scholar]
Trinh C.T., Unrean P., Srienc F. Minimal Escherichia coli cell for the most efficient production of ethanol from hexoses and pentoses. Appl. Environ. Microbiol. 2008;74:3634–3643. doi: 10.1128/AEM.02708-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
Urbanczik R., Wagner C. Functional stoichiometric analysis of metabolic networks. Bioinformatics. 2005;21:4176–4180. doi: 10.1093/bioinformatics/bti674. [DOI] [PubMed] [Google Scholar]
Urbanczik R. Enumerating constrained elementary flux vectors of metabolic networks. IET Syst. Biol. 2007;1:274–279. doi: 10.1049/iet-syb:20060073. [DOI] [PubMed] [Google Scholar]
Varma A., Palsson B.O. Metabolic flux balancing: basic concepts, scientific and practical use. Nat. Biotechnol. 1994;12:994. [Google Scholar]
von Kamp A., Klamt S. Enumeration of smallest intervention strategies in genome-scale metabolic networks. PLoS Comput. Biol. 2014;10:e1003378. doi: 10.1371/journal.pcbi.1003378. [DOI] [PMC free article] [PubMed] [Google Scholar]
von Kamp A., Klamt S. Growth-coupled overproduction is feasible for almost all metabolites in five major production organisms. Nat. Commun. 2017;8:15956. doi: 10.1038/ncomms15956. [DOI] [PMC free article] [PubMed] [Google Scholar]
von Kamp A., Thiele S., Hädicke O., Klamt S. Use of CellNetAnalyzer in biotechnology and metabolic engineering. J. Biotechnol. 2017;261:221–228. doi: 10.1016/j.jbiotec.2017.05.001. [DOI] [PubMed] [Google Scholar]
Watson M.R. Metabolic maps for the Apple II. Biochem. Soc. Trans. 1984;12:1093–1094. [Google Scholar]
Zanghellini J., Ruckerbauer D.E., Hanscho M., Jungreuthmayer C. Elementary flux modes in a nutshell: properties, calculation and applications. Biotechnol. J. 2013;8:1009–1016. doi: 10.1002/biot.201200269. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Application 1

mmc1.pdf^{(1.1MB, pdf)}

[bib1] Bordbar A., Monk J.M., King Z.A., Palsson B.O. Constraint-based models predict metabolic and associated cellular functions. Nat. Rev. Genet. 2014;15:107–120. doi: 10.1038/nrg3643. [DOI] [PubMed] [Google Scholar]

[bib2] Boyd S., Vandenberghe L. Cambridge University Press; Cambridge: 2004. Convex Optimization. [Google Scholar]

[bib3] Burgard A.P., Pharkya P., Maranas C.D. Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization. Biotechnol. Bioeng. 2003;84:647–657. doi: 10.1002/bit.10803. [DOI] [PubMed] [Google Scholar]

[bib4] Burgard A.P., Nikolaev E.V., Schilling C.H., Maranas C.D. Flux coupling analysis of genome-scale metabolic network reconstructions. Genome Res. 2004;14:301–312. doi: 10.1101/gr.1926504. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] Fell D.A., Small J.R. Fat synthesis in adipose tissue. An examination of stoichiometric constraints. Biochem. J. 1986;238:781–786. doi: 10.1042/bj2380781. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Frenk J.B.G., Schaible S. Fractional programming. In: Hadjisavvas Nicolas, Komlósi Sándor, Schaible Siegfried., editors. Handbook of Generalized Convexity and Generalized Monotonicity, Volume 76 of Nonconvex Optim. Appl. Springer; New York,: 2005. pp. 335–386. [Google Scholar]

[bib7] Gianchandani E.P., Oberhardt M.A., Burgard A.P., Maranas C.D., Papin J.A. Predicting biological system objectives de novo from internal state measurements. BMC Bioinforma. 2008;9:43. doi: 10.1186/1471-2105-9-43. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Goel A., Wortel M.T., Molenaar D., Teusink B. Metabolic shifts: a fitness perspective for microbial cell factories. Biotechnol. Lett. 2012;34:2147–2160. doi: 10.1007/s10529-012-1038-9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] Hädicke O., Klamt S. Computing complex metabolic intervention strategies using constrained minimal cut sets. Metab. Eng. 2011;13:204–213. doi: 10.1016/j.ymben.2010.12.004. [DOI] [PubMed] [Google Scholar]

[bib10] Hädicke O., Klamt S. EColiCore2: a reference network model of the central metabolism of Escherichia coli and relationships to its genome-scale parent model. Sci. Rep. 2017;7:39647. doi: 10.1038/srep39647. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] Huynh T., Lassez C., Lassez J.-L. Practical issues on the projection of polyhedral sets. Ann. Math. Artif. Intell. 1992;6:295–315. [Google Scholar]

[bib12] Jungreuthmayer C., Zanghellini J. Designing optimal cell factories: integer programming couples elementary mode analysis with regulation, BMC. Syst. Biol. 2012;6:103. doi: 10.1186/1752-0509-6-103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] Jungreuthmayer C., Nair G., Klamt S., Zanghellini J. Comparison and improvement of algorithms for computing minimal cut sets. BMC Bioinforma. 2013;14:318. doi: 10.1186/1471-2105-14-318. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] Kelk S.M., Olivier B.G., Stougie L., Bruggeman F.J. Optimal flux spaces of genome-scale stoichiometric models are determined by a few subnetworks. Sci. Rep. 2012;2:580. doi: 10.1038/srep00580. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] Kim J., Reed J. OptORF: optimal metabolic and regulatory perturbations for metabolic engineering of microbial strains. BMC Syst. Biol. 2010;4:53. doi: 10.1186/1752-0509-4-53. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Klamt S., Gilles E.D. Minimal cut sets in biochemical reaction networks. Bioinformatics. 2004;20:226–234. doi: 10.1093/bioinformatics/btg395. [DOI] [PubMed] [Google Scholar]

[bib17] Klamt S., Mahadevan R. On the feasibility of growth-coupled product synthesis in microbial strains. Metab. Eng. 2015;30:166–178. doi: 10.1016/j.ymben.2015.05.006. [DOI] [PubMed] [Google Scholar]

[bib18] Klamt S., Saez-Rodriguez J., Gilles E. Structural and functional analysis of cellular networks with CellNetAnalyzer. BMC Syst. Biol. 2007;1:2. doi: 10.1186/1752-0509-1-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] Klamt S., Regensburger G., Gerstl M.P., Jungreuthmayer C., Schuster S., Mahadevan R., Zanghellini J., Müller S. From elementary flux modes to elementary flux vectors: metabolic pathway analysis with arbitrary linear flux constraints. PLOS Comput. Biol. 2017;13:e1005409. doi: 10.1371/journal.pcbi.1005409. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] Lassez C., Lassez J.-L. Quantifier Elimination for Conjunctions of Linear Constraints via a Convex Hull Algorithm. In: Donald Bruce Randall, Kapur Deepak, Mundy Joseph L., editors. Symbolic and Numerical Computation for Artificial Intelligence. Academic Press Limited; Oval Road London NW1: 1990. pp. 24–28. [Google Scholar]

[bib21] Lewis N.E., Nagarajan H., Palsson B.O. Constraining the metabolic genotypephenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 2012;10:291–305. doi: 10.1038/nrmicro2737. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib22] Müller S., Regensburger G. Elementary vectors and conformal sums in polyhedral geometry and their relevance for metabolic pathway analysis. Front. Genet. 2016;7:90. doi: 10.3389/fgene.2016.00090. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] Machado D., Herrgard M.J. Co-evolution of strain design methods based on flux balance and elementary mode analysis. Metab. Eng. Commun. 2015;2:85–92. doi: 10.1016/j.meteno.2015.04.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] Mahadevan R., Kamp A.v., Klamt S. Genome-scale strain designs based on regulatory minimal cut sets. Bioinformatics. 2015;31:2844–2851. doi: 10.1093/bioinformatics/btv217. [DOI] [PubMed] [Google Scholar]

[bib25] Maia P., Rocha M., Rocha I. In silico constraint-based strain optimization methods: the quest for optimal cell factories. Microbiol. Mol. Biol. Rev. 2016;80:45–67. doi: 10.1128/MMBR.00014-15. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Maranas C.D., Zomorrodi A.R. 1 edition. John Wiley & Sons; Hoboken, New Jersey: 2016. Optimization Methods in Metabolic Networks. [Google Scholar]

[bib27] Melzer G., Esfandabadi M.E., Franco-Lara E., Wittmann C. Flux design: in silico design of cell factories based on correlation of pathway fluxes to desired properties. BMC Syst. Biol. 2009;3:120. doi: 10.1186/1752-0509-3-120. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Mori M., Hwa T., Martin O.C., De Martino A., Marinari E. Constrained allocation flux balance analysis. PLoS Comput. Biol. 2016;12:e1004913. doi: 10.1371/journal.pcbi.1004913. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib29] Nielsen J., Keasling J.D. Engineering cellular metabolism. Cell. 2016;164:1185–1197. doi: 10.1016/j.cell.2016.02.004. [DOI] [PubMed] [Google Scholar]

[bib30] Oberhardt M.A., Palsson B.Ø., Papin J.A. Applications of genomescale metabolic reconstructions. Mol. Syst. Biol. 2009;5:320. doi: 10.1038/msb.2009.77. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib31] Orth J.D., Thiele I., Palsson B.Ø. What is flux balance analysis? Nat. Biotechnol. 2010;28:245–248. doi: 10.1038/nbt.1614. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] Orth J.D., Conrad T.M., Na J., Lerman J.A., Nam H., Feist A.M., Palsson B.O. A comprehensive genome-scale reconstruction of Escherichia coli metabolism – 2011. Mol. Syst. Biol. 2011;7 doi: 10.1038/msb.2011.65. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] Pharkya P., Maranas C.D. An optimization framework for identifying reaction activation/inhibition or elimination candidates for overproduction in microbial systems. Metab. Eng. 2006;8:1–13. doi: 10.1016/j.ymben.2005.08.003. [DOI] [PubMed] [Google Scholar]

[bib34] Prauße M.T.E., Schäuble S., Guthke R., Schuster S. Computing the various pathways of penicillin synthesis and their molar yields. Biotechnol. Bioeng. 2016:173–181. doi: 10.1002/bit.25694. [DOI] [PubMed] [Google Scholar]

[bib35] Ranganathan S., Suthers P.F., Maranas C.D. OptForce: an optimization procedure for identifying all genetic manipulations leading to targeted overproductions. PLoS Comput. Biol. 2010;6:e1000744. doi: 10.1371/journal.pcbi.1000744. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] Sanford K., Chotani G., Danielson N., Zahn J.A. Scaling up of renewable chemicals. Curr. Opin. Biotechnol. 2016;38:112–122. doi: 10.1016/j.copbio.2016.01.008. [DOI] [PubMed] [Google Scholar]

[bib37] Santos F., Boele J., Teusink B. Chapter twenty-four – a practical guide to genome-scale metabolic models and their analysis. In: Jameson D., Verma M., Westerhoff H.V., editors. Methods in Enzymology, Volume 500 of Methods in Systems Biology. Academic Press; San Diego, CA, USA: 2011. pp. 509–532. [DOI] [PubMed] [Google Scholar]

[bib38] Schuetz R., Kuepfer L., Sauer U. Systematic evaluation of objective functions for predicting intracellular fluxes in Escherichia coli. Mol. Syst. Biol. 2007;3 doi: 10.1038/msb4100162. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] Schuetz R., Zamboni N., Zampieri M., Heinemann M., Sauer U. Multidimensional optimality of microbial metabolism. Science. 2012;336:601–604. doi: 10.1126/science.1216882. [DOI] [PubMed] [Google Scholar]

[bib40] Schuster S., Hilgetag C. On elementary flux modes in biochemical reaction systems at steady state. J. Biol. Syst. 1994;2:165–182. [Google Scholar]

[bib41] Schuster S., Dandekar T., Fell D.A. Detection of elementary flux modes in biochemical networks: a promising tool for pathway analysis and metabolic engineering. Trends Biotechnol. 1999;17:53–60. doi: 10.1016/s0167-7799(98)01290-6. [DOI] [PubMed] [Google Scholar]

[bib42] Schuster S., Fell D.A., Dandekar T. A general definition of metabolic pathways useful for systematic organization and analysis of complex metabolic networks. Nat. Biotech. 2000;18:326–332. doi: 10.1038/73786. [DOI] [PubMed] [Google Scholar]

[bib43] Schuster S., Dandekar T., Mauch K., Reuss M., Fell D. Technological and Medical Implications of Metabolic Control Analysis, NATO Science Series. Springer; Dordrecht: 2000. Recent developments in metabolic pathway analysis and their potential implications for biotechnology and medicine; pp. 57–66. [Google Scholar]

[bib44] Schuster S., Pfeiffer T., Fell D.A. Is maximization of molar yield in metabolic networks favoured by evolution? J. Theor. Biol. 2008;252:497–504. doi: 10.1016/j.jtbi.2007.12.008. [DOI] [PubMed] [Google Scholar]

[bib45] Schuster S., de Figueiredo L.F., Schroeter A., Kaleta C. Combining metabolic pathway analysis with evolutionary game theory. Explaining the occurrence of low-yield pathways by an analytic optimization approach. Biosystems. 2011;105:147–153. doi: 10.1016/j.biosystems.2011.05.007. [DOI] [PubMed] [Google Scholar]

[bib46] Schuster S., Boley D., Möller P., Stark H., Kaleta C. Mathematical models for explaining the Warburg effect: a review focussed on ATP and biomass production. Biochem. Soc. Trans. 2015;43:1187–1194. doi: 10.1042/BST20150153. [DOI] [PubMed] [Google Scholar]

[bib47] Simeonidis E., Murabito E., Smallbone K., Westerhoff H.V. Why does yeast ferment? A flux balance analysis study. Biochem. Soc. Trans. 2010;38:1225. doi: 10.1042/BST0381225. [DOI] [PubMed] [Google Scholar]

[bib48] Tepper N., Shlomi T. Predicting metabolic engineering knockout strategies for chemical production: accounting for competing pathways. Bioinformatics. 2010;26:536–543. doi: 10.1093/bioinformatics/btp704. [DOI] [PubMed] [Google Scholar]

[bib49] Teusink B., Smid E.J. Modelling strategies for the industrial exploitation of lactic acid bacteria. Nat. Rev. Microbiol. 2006;4:46–56. doi: 10.1038/nrmicro1319. [DOI] [PubMed] [Google Scholar]

[bib50] Trinh C.T., Unrean P., Srienc F. Minimal Escherichia coli cell for the most efficient production of ethanol from hexoses and pentoses. Appl. Environ. Microbiol. 2008;74:3634–3643. doi: 10.1128/AEM.02708-07. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib51] Urbanczik R., Wagner C. Functional stoichiometric analysis of metabolic networks. Bioinformatics. 2005;21:4176–4180. doi: 10.1093/bioinformatics/bti674. [DOI] [PubMed] [Google Scholar]

[bib52] Urbanczik R. Enumerating constrained elementary flux vectors of metabolic networks. IET Syst. Biol. 2007;1:274–279. doi: 10.1049/iet-syb:20060073. [DOI] [PubMed] [Google Scholar]

[bib53] Varma A., Palsson B.O. Metabolic flux balancing: basic concepts, scientific and practical use. Nat. Biotechnol. 1994;12:994. [Google Scholar]

[bib54] von Kamp A., Klamt S. Enumeration of smallest intervention strategies in genome-scale metabolic networks. PLoS Comput. Biol. 2014;10:e1003378. doi: 10.1371/journal.pcbi.1003378. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib55] von Kamp A., Klamt S. Growth-coupled overproduction is feasible for almost all metabolites in five major production organisms. Nat. Commun. 2017;8:15956. doi: 10.1038/ncomms15956. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] von Kamp A., Thiele S., Hädicke O., Klamt S. Use of CellNetAnalyzer in biotechnology and metabolic engineering. J. Biotechnol. 2017;261:221–228. doi: 10.1016/j.jbiotec.2017.05.001. [DOI] [PubMed] [Google Scholar]

[bib57] Watson M.R. Metabolic maps for the Apple II. Biochem. Soc. Trans. 1984;12:1093–1094. [Google Scholar]

[bib58] Zanghellini J., Ruckerbauer D.E., Hanscho M., Jungreuthmayer C. Elementary flux modes in a nutshell: properties, calculation and applications. Biotechnol. J. 2013;8:1009–1016. doi: 10.1002/biot.201200269. [DOI] [PubMed] [Google Scholar]

PERMALINK

A mathematical framework for yield (vs. rate) optimization in constraint-based modeling and applications in metabolic engineering

Steffen Klamt

Stefan Müller

Georg Regensburger

Jürgen Zanghellini

Abstract

Highlights

1. Introduction

2. Rate and yield optimization

2.1. Basic terminology and examples

Fig. 1.

Fig. 2.

2.2. Mathematical treatment

2.2.1. Definitions

2.2.2. Yield optimization as an LP

2.3. Flux coupling analysis

3. Rate-optimal and yield-optimal solution sets

3.1. Basic terminology and examples

Table 1.

3.2. Rate-optimal solution sets

3.3. Mathematical properties of yield optimization

Property 1

Property 2

Property 3

Property 4

Property 5

3.4. Yield-optimal solution sets

Theorem 1

Proof

Theorem 2

Proof

4. Phase planes and yield spaces

4.1. Basic terminology and examples

Fig. 3.

Fig. 4.

4.2. Mathematical treatment

4.3. EFMs and EFVs in phase planes and yield spaces

4.4. Computation of phase planes and yield spaces

Table 2.

4.5. Implementation in CellNetAnalyzer

5. Production envelopes and yield spaces in strain design

5.1. Biased strain design

Fig. 5.

5.2. Unbiased strain design

6. Examples of production envelopes and yield spaces in E. coli and their use for strain design

6.1. Acetate production in E. coli

Fig. 6.

6.2. Ethanol production in a E. coli genome-scale model

6.3. Designing E. coli acetate producer strains

Fig. 7.

Fig. 8.

7. Conclusions

Acknowledgments

Footnotes

Contributor Information

Appendix: Mathematical results and proofs

Proposition 1

Proof

Lemma 1

Lemma 2

Lemma 3

Proof

Proof of Theorem 1

Proof of Theorem 2

Supplementary material

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases