Abstract
Microbial communities have become a major research focus due to their importance for biogeochemical cycles, biomedicine and biotechnological applications. While some biotechnological applications, such as anaerobic digestion, make use of naturally arising microbial communities, the rational design of microbial consortia for bio-based production processes has recently gained much interest. One class of synthetic microbial consortia is based on specifically designed strains of one species. A common design principle for these consortia is based on division of labor, where the entire production pathway is divided between the different strains to reduce the metabolic burden caused by product synthesis. We first show that classical division of labor does not automatically reduce the metabolic burden when metabolic flux per biomass is analyzed. We then present ASTHERISC (Algorithmic Search of THERmodynamic advantages in Single-species Communities), a new computational approach for designing multi-strain communities of a single-species with the aim to divide a production pathway between different strains such that the thermodynamic driving force for product synthesis is maximized. ASTHERISC exploits the fact that compartmentalization of segments of a product pathway in different strains can circumvent thermodynamic bottlenecks arising when operation of one reaction requires a metabolite with high and operation of another reaction the same metabolite with low concentration. We implemented the ASTHERISC algorithm in a dedicated program package and applied it on E. coli core and genome-scale models with different settings, for example, regarding number of strains or demanded product yield. These calculations showed that, for each scenario, many target metabolites (products) exist where a multi-strain community can provide a thermodynamic advantage compared to a single strain solution. In some cases, a production with sufficiently high yield is thermodynamically only feasible with a community. In summary, the developed ASTHERISC approach provides a promising new principle for designing microbial communities for the bio-based production of chemicals.
Author summary
Communities of microbes are ubiquitous in nature and also of high relevance for industrial applications, e.g. for the production of biogas. The development and use of non-natural communities for biotechnological applications has become an important subject of research. In this work, we present a new computational method to design synthetic communities with improved capabilities for the synthesis of desired target metabolites. Our method takes a constraint-based metabolic model of an organism as input and searches for a suitable partitioning of the product pathway via different strains of the organism such that the thermodynamic driving force for product synthesis is maximized. Essentially, this approach exploits the fact that having multiple strains allows adjustment of different metabolite concentrations in the different strains by which the thermodynamic driving force for product synthesis can often be increased. We tested this approach with a core and with a genome-scale metabolic network model of Escherichia coli. We found that, for dozens of metabolites, there exist communities with specifically designed strains of E. coli where the maximal thermodynamic driving force can be increased compared to a single E. coli strain. In summary, our presented method provides a new approach, together with a new design principle, for the computational design of microbial communities.
Introduction
In nature, organisms rarely occur as isolated populations of single species. They rather form communities (or consortia) with different types of complex interactions between the participating species which often drive evolution [1]. In particular, communities of microorganisms are ubiquitous in nature and play a fundamental role for ecology, bioremediation, geochemical cycles and human health. Microbial communities typically consist of different species, however, especially under controlled laboratory conditions, they may also comprise coexisting strains of a single species with certain physiological differences between the strains. One way to establish such single-species (multi-strain) communities is to construct and co-culture different, genetically modified strains with obligate mutual metabolic dependencies [2].
Mathematical modeling has become a valuable tool to formally describe and simulate complex microbial communities and to gain a deeper understanding of their behavior and properties. A range of different methods has been developed for community modeling [3–5]. As for modeling single species, two major approaches are kinetic modeling based on differential equations [6,7] and steady-state modeling utilizing constraint-based (flux balance analysis (FBA) [8]) techniques [9–13]. FBA-related methods are limited to predictions on stationary metabolic fluxes, but they require as input only the metabolic reaction network of the participating species and no data on kinetic mechanisms and parameters and can therefore be used to analyze large (genome-scale) metabolic models. Applying (linear) FBA-techniques to study communities models requires partially adaptations, e.g. due to arising bilinearities. Two particular examples of such formalizations are SteadyCom [14] and RedCom [15]. Both methods are based on the concept of balanced growth of a community [16] and allow predictions of growth rates, metabolic exchange rates and feasible community compositions. An example of an FBA-based method for the design of single-species (multi-strain) communities with certain constraints is DOLMN (Division Of Labor In Metabolic Networks) [17].
Microbial consortia are extensively used in biotechnological applications. One example is anaerobic digestion in biogas plants where methane is produced by naturally occurring communities. In recent years, artificial co-cultures have been constructed for synthesizing biofuels and value-added products in bioreactors [18]. While some of these realizations use multi-species communities [19,20], more and more realizations focus on the use of communities with different, purposefully constructed strains of a single species. Successful examples of single-species communities for product synthesis include (a) a two-strain E. coli community for the conversion of xylan to ethanol with high ethanol yields [21], (b) another two-strain E. coli community for the synthesis of flavonoids [22] with much higher synthesis rates than without communities, and (c) the synthesis of anthocyanins with a community of four E. coli strains [23], the first successful production of this chemical by microbes at all.
The high potential of dedicated single-species communities for optimizing biotechnological production of a certain compound has been intensively discussed in recent literature. Division of labor (DoL) is seen as one major design principle of such artificial co-cultures [24–28]. The basic idea of DoL is illustrated with the toy example shown in Fig 1A, where we consider a linear metabolic pathway converting a substrate S into a desired target product P. Concretely, S is taken up, then, in a first reaction catalyzed by enzyme E1, converted to the intermediate metabolite M, which is in turn further converted to the product P in a second reaction catalyzed by enzyme E2, after which P is finally excreted. In a biotechnological application, the goal is to achieve a high flux through this pathway which will require a high abundance of the enzymes E1 and E2. Due to limited intracellular resources, the synthesis of each of the two enzymes competes with each other and with other cellular processes, which may result in a reduced production performance of the strain. Here, DoL may help to reduce the “metabolic burden” of the single strain by dividing the respective metabolic task and thus the metabolic costs on several strains of the same species [24,29]. In our example, a DoL strategy could be to split the pathway into two parts (or tasks) each being implemented in one dedicated strain. With an exchange of the intermediate metabolite M, each of the two strains may now concentrate on its dedicated task which could circumvent physiological bottlenecks. Clearly, the effectiveness of a DoL strategy will depend on the relative burden on each pathway step. However, even in the ideal case that the metabolic burden is equally distributed over the two reactions and thus over the two different strains, this DoL perspective neglects one major constraint: if we assume that we are given a certain amount of biomass that can be used for product synthesis (higher amounts of biomass will inevitably reduce product yield), DoL will not necessarily lead to increased product flux (Fig 1B). In the single strain approach, the specific rates (in [mmol/(gDW∙h)]) for reaction R1 are given by
(1) |
where [E1]SS is the concentration of enzyme E1 in the single strain, f1 the saturation function characterizing the kinetic rate law of reaction R1, and c the vector of the metabolite concentrations. Analogously, for the specific rate of reaction R2 we have:
(2) |
For simplicity, in the following we assume that the same amount of enzyme E1 and E2 is needed to induce a certain flux in the reactions R1 and R2 (implying kcat,1∙f1(c) = kcat,2∙fs(c)), and that both enzyme costs are identical (S1 Text shows that the same conclusions can be drawn in the more general case with arbitrary pathway length and enzyme costs under assumption of constant metabolite concentrations in the strains). Therefore, since the specific rates in the single-strain solution must fulfill in steady state, the enzyme concentrations must be identical: [E1]SS = [E2]SS. For the total product synthesis flux [mmol/h] we need to multiply the specific flux with the amount of biomass BSS (unit [gDW]): . Now, in the DoL approach with the assumed simplified case of identical enzyme requirements and costs, the biomass of the single strain would be divided into two equal parts (strains 1 and 2, each having 50% of the original total biomass: ). For the steady-state product synthesis flux in the DoL approach, we obtain . Therefore, in order to obtain the same total product synthesis flux as for the single-strain solution, the specific rates and must be doubled due to the halved biomass available for each strain. This requires doubled enzyme concentrations in the two strains, [E1]DoL = 2[E1]SS = [E2]DoL = 2[E2]SS, implying that the total concentration of enzymes required for the product pathway is identical in all three strains considered: . Thus, the total enzyme costs per biomass in the DoL scenario are the same as with the single-strain solution and the metabolic burden is not reduced. Furthermore, the exchange of the intermediate metabolite M between the strains under DoL is also associated with additional metabolic costs (transporters must be produced, energetic costs of transport etc.). In fact, as also shown for the more general case (S1 Text), a DoL strategy cannot be advantageous (or is even unfavorable) with respect to metabolic burden as long as a potential kinetic advantage due to different metabolite concentrations in the different strains does not outweigh the added costs of metabolite transport. This point has rarely been considered when discussing DoL strategies.
As indicated in the last statement, one degree of freedom in a community that could be used to truly enhance the overall production rate, is the possibility of having different metabolite concentrations in the strains. In this study we present a new approach for designing and optimizing communities with multiple strains of a single species, which exploits this degree of freedom. Specifically, our concept seeks to identify a suitable division of the metabolic network, such that the thermodynamic driving force of the entire product synthesis pathway is maximized. The basic concept is shown with the toy model in Fig 2: The pathway from substrate S to product P includes a thermodynamic bottleneck at the reactions R2 and R5 because of their relatively low thermodynamic driving forces (positive standard Gibbs free energies). The driving force of reaction R2, where X is a reactant, could be increased by adjusting a lower concentration of metabolite X, however, this would simultaneously further reduce the driving force of reaction R5, where X is a product. In fact, with the chosen set of constraints for the metabolite concentrations in Fig 2, the pathway is thermodynamically infeasible indicated by the negative optimal MDF (OptMDF) value. For a given pathway, the MDF (max-min-driving force [30]) maximizes the minimum of the driving force (i.e., the negative value of the Gibbs free energy change) of all participating reactions in a pathway and OptMDF finds the pathway in a metabolic network with maximal MDF [31]. Here, splitting the pathway in two sub-pathways and implementing them in two different strains with an exchange of intermediate B (Fig 2) circumvents these bottlenecks as the concentration of metabolite X can now be adjusted individually in each strain (low in strain 1, high in strain 2) through which product synthesis becomes thermodynamically feasible. Following this idea, we developed a new framework, ASTHERISC, for an automated design of single-species (multi-strain) communities that enable higher thermodynamic driving forces for product synthesis compared to a single-strain approach. Focusing on pure production (without growth), we demonstrate the power of the method by identifying multi-strain communities of E. coli that maximize the thermodynamic driving force of product synthesis for a wide range of compounds.
Methods
Structure of community models
We first summarize the representation of stoichiometric models of microbial communities under balanced growth (for a detailed description can be found in [15]). Herein, we will focus on communities with different strains of one species, which can be seen as a special case of communities with multiple species. As common for constraint-based metabolic models we represent the metabolic network (with m metabolites and q reactions) of the given species with a m×q stoichiometric matrix N, a flux vector r containing the net reaction rates, and upper (βi) and lower (αi) flux bounds for each reaction. The steady-state assumption for the internal metabolites implies
(3) |
and the flux bounds restrict the range of possible reaction rates:
(4) |
While the stoichiometric matrix N comprises internal metabolites only, this matrix can be extended to , which also contains the stoichiometries of the external metabolites (substrates, products etc.) which need not fulfill the steady state constraint (3). We also allow the definition of additional linear flux constraints (such as demanded minimal product-to-substrate yields) with a suitable matrix D and vector d:
(5) |
In the description of microbial communities, the net reaction rates r are normalized with respect to the total community biomass [mmol/(gDWtotal∙ h)]. Furthermore, the biomass fractions Fi = BMi/BMtot (0≤Fi≤1) of each strain i at the total biomass are variables of the model. To fulfill the constraint of balanced growth (i.e., constant composition and identical specific growth rate for each strain), the ratio of biomass production of strain i to total biomass synthesis must also equal Fi. Importantly, as the original flux bounds (4) for each strain are given in mmol/(gDWi∙h), they must be multiplied with the fraction of the respective strain in the community to obtain flux bounds with units normalized to total biomass
(6) |
In communities with (balanced) growth, the formulation leads to bilinearities [15], which can be resolved by fixing either the strain fractions or the community growth rate in order to obtain a linear optimization problem. However, in this study we focus on optimal partitioning of the product synthesis pathway (without growth) and the only flux bound used will be the (community) uptake rate of the carbon source. With that we can disregard the production of biomass and assume identical fractions for all involved strain (e.g. 50% in two-strain communities and 33% in three-strain communities; see S2 Text and Results section). However, in general, growth or/and different biomass fractions may be considered if this is desired.
In the community model, each of the n strains represent a compartment with an associated metabolic network, captured by stoichiometric matrices N1 to Nn. In a multi-strain model of one species, the n stoichiometric matrices are, at least initially, identical. In addition, an exchange compartment is added from/to which the strains can uptake/release compounds (Fig 3). In particular, this enables the exchange of metabolites between strains. The quasi-steady-state condition (3) must also be fulfilled for the compounds within the exchange compartment, but a defined selection of those can themselves be exported/imported to/from the environment. Accordingly, the stoichiometric matrix Nc of the community model reads as follows:
(7) |
E1,ex … En,ex represent the stoichiometries of the exported/imported metabolites from strains 1 … n
to/from the exchange compartment and Ec describes the exchange of metabolites between the exchange compartment and the environment. The steady-state constraint (3) for the community now reads:
(8) |
Inclusion of thermodynamics
Several FBA-based modeling techniques incorporate thermodynamic constraints, which reduce the solution space by removing thermodynamically infeasible or unrealistic pathways [32–36]. An example for the usage of thermodynamics-including FBA-based methods in a community model is the analysis of a methanogenic consortium of S. fumaroxidans and M. hungatei [37]. Thermodynamics-based methods are usually based on the Gibbs free energy ΔrG′ of (bio)chemical reactions and the fact that the net rate of a reaction can only be positive if its ΔrG′ is negative. Generally, the of reaction i depends on the concentrations of the participating reactants and the reaction-specific standard change in Gibbs free energy ΔrG′0 [38]:
(9) |
R is the gas constant, T the temperature and x the logarithmized metabolite concentration vector and the transposed i-th column (reaction) of the extended stoichiometric matrix with explicitly included external metabolites. In the following, we do not distinguish between community and single-strain models, hence, the stoichiometric matrices and reaction rate vectors may also refer to the community versions Nc and rc. As a quantitative measure of how thermodynamically feasible a specific reaction i is under given metabolite concentrations, the thermodynamic driving force fi can be defined, which simply is the negative value of the Gibbs free energy of that reaction:
(10) |
This reaction-centric concept was extended to quantify the thermodynamically feasibility of an entire pathway by introducing the concept of the max-min driving force (MDF) [30]. With a given range of feasible metabolite concentrations, the MDF is the maximal value B such that all reactions of the pathway have a driving force of at least B. The MDF is thus the maximal driving force an entire pathway can reach. The original MDF formulation only allows the computation of the MDF for a given pathway. In order to find, within a genome-scale network, a pathway with maximal MDF that fulfills certain constraints (e.g. with a given minimal yield of a certain target product) the OptMDFpathway method was developed [31]. As it is fundamental for the ASTHERISC algorithm presented herein we summarize its basic features.
Binary variables zi∈{0,1} are introduced for each reaction i of the model and we constrain it to be 1 if the associated reaction flux ri is non-zero (βi is the upper bound for ri):
(11) |
The logarithmized metabolite concentration values in x are constrained by physiologically meaningful lower (cmin) and upper bounds (cmax) for the respective concentrations:
(12) |
Sometimes constraints on certain ratios of metabolite concentrations are included (e.g. for cofactors such as NADH and NAD). For a fixed ratio of two metabolites i and j we can write
(13) |
and a feasible range of a metabolite concentration ratio can be expressed by the following two constraints (hmin and hmax representing the minimal and maximal ratio, respectively):
(14) |
(15) |
In a preprocessing step, the minimal and maximal possible driving forces fi,min and fi,max under the given concentration ranges (12)-(15) are determined for each reaction i. With K we denote the maximal possible driving force over all reactions
(16) |
Next, we introduce B as a lower bound for the driving force of all active reactions:
(17) |
Finally, maximization of B then yields the optimal (maximal) MDF, abbreviated with OptMDF, for the given constraints:
(18) |
The defined OptMDF optimization problem combines binary and continuous variables in linear constraints and thus represents a mixed-integer linear program (MILP).
ASTHERISC algorithm
ASTHERISC (Algorithmic Search of THERmodynamic advantages in Single-species Communities) searches for multi-strain communities that compared to solutions with a single strain, improve the thermodynamic driving force of product synthesis. The algorithm starts with building a community model with n (initially identical) strains of a given species. Standard exchange metabolites (e.g. substrates, nutrients, oxygen, carbon dioxide, common fermentation products such as acetate, ethanol etc.) can be specified, which can be exchanged by the strains with the exchange compartment via standard exchange reactions and exchange reactions between the exchange compartment and the environment are added as well to allow uptake/release of these metabolites from/to the environment. In addition, to allow the exchange of (intermediate) metabolites between the strains in the community, we add, to each internal metabolite deemed to be eligible for exchange between the strains, a reversible transport reaction which can import or export the metabolite from/to the exchange compartment. Accordingly, we distinguish standard exchanges as introduced above, from extra exchanges which allow metabolic interactions between the strains but, in contrast to standard exchanges, no exchange with the environment. The standard as well as the extra exchanges can be specifically defined for every application. Furthermore, when designing a community for the production of a given target metabolite, an export reaction from the exchange compartment to the environment is temporarily added to allow its net production. ASTHERISC searches then for optimal pathways that traverse between the strains to maximize the MDF for synthesizing the target product with a given minimum product yield. The optimal pathway will involve individual (but possibly overlapping) sets of reactions and metabolite concentrations in the different strains.
With this model setup, ASTHERISC employs the OptMDFpathway MILP and slightly modified variants thereof with additional constraints or/and an altered objective function. These variants are described in the following.
OptMDF: the original OptMDFpathway algorithm as described above
OptMDF2: limited number of (extra) exchanges
If EX contains the indices of the extra exchange reactions, an upper bound c for the number of active extra exchanges can be demanded by adding the following constraint to the OptMDF MILP:
(19) |
-
3
OptMDF3: adding a constraint demanding a lower bound for the MDF
In this variant, we take OptMDF2 and drop the objective function given in Eq (18) and include instead a constraint demanding a minimum value γ for the MDF B:
(20) |
Note that OptMDF3 just searches for a feasible solution. It returns infeasible if no solution with the demanded minimal MDF exists.
-
4
OptMDF4: minimize the overall flux
OptMDF4 uses OptMDF3 together with the objective of minimizing the sum of absolute fluxes:
(21) |
Note that the OptMDF algorithm splits reversible reactions internally into two irreversible ones ensuring that all rates are non-negative and that (21) really maximizes the sum of absolute fluxes.
-
5
OptMDF5: searching for minimal/maximal metabolite concentrations
Based on OptMDF3, this variant first maximizes and then minimizes the (logarithmized) concentration of a given metabolite i:
(22) |
(23) |
This concentration variability analysis is done for all metabolites and determines the feasible metabolite concentrations under the given constraints (including Eq (20)).
The pseudo code of the ASTHERISC algorithm is given in Fig 4. For each (target) metabolite to be tested by ASTHERISC as target product, the ASTHERISC algorithm seeks to find a flux vector and associated metabolite concentrations in the community model such that (a) the community reaches a higher OptMDF than the single-strain solution under (b) the constraint of a minimal defined target product yield. Accordingly, in the first step of the ASTHERISC algorithm, the maximal target product yield, achievable without thermodynamic constraints, is computed for the single-strain model in mol/mol (technically, all single-strain calculations are done via the community model in which only one single strain has unblocked default and target metabolite exchanges) and a specified minimal fraction of this maximum yield must be reached in all subsequent community model calculations. Next, the OptMDF is computed in the single strain and then for multiple strains in the community; in the latter case, an upper bound for the number of active extra exchange reactions can be specified (step 2). If the OptMDF in the multi-strain community model is higher than in the single-strain model, the next steps are performed to characterize the solution (otherwise the algorithm continues with the next target metabolite). Steps 3 and 4 determine the flux vector in the community model that has the minimal flux (sum) under the maximal product yield possible with the found OptMDF. This step removes unnecessary and thermodynamically irrelevant reactions and the obtained minimal flux vector thus indicates the truly required reactions in the MDF-optimal community solution. This flux vector can be optionally further analyzed to determine the feasible metabolite concentration ranges for the optimal community solution (step 5) and to identify thermodynamic bottleneck reactions (step 6) in a single-strain representation of the community solution. As in [31], thermodynamic bottlenecks are defined as those reactions where a change of the standard Gibbs free energy alone is enough to further increase the OptMDF value.
Our implementation of the algorithm uses, for some MILPs, indirect approximation steps as they turned out to be advantageous and more robust than direct MILP optimizations in larger models. For example, direct optimization of MDF with the standard OptMDFpathway MILP algorithm may require several hours to days in a genome-scale community model, even when using a high-performance computer cluster. However, approximating the OptMDF with iterative application of OptMDF3 (testing of feasibility with a fixed MDF level) may run in several minutes. This can be used to approximate the OptMDF value by iterative refinement of this value for which a feasible solution can be found (the approximate algorithms stops when a demanded precision level has been reached). The same is done for maximization of product yields under a given optimal MDF constraint.
Implementation
The generation of community models in the described forms as well as the application of the introduced ASTHERISC algorithm are implemented in two separate packages, the CommModelPy and the ASTHERISC package (Fig 5).
CommModelPy
The CommModelPy package can be used to compile multi-strain community models as described in this study, but it also supports construction of multi-species community models under balanced growth. While other Python-based program packages for the generation of communities have been published (including CarveMe [39] and micom [40]), CommModelPy can create community models with a format required for the ASTHERISC package (e.g. fixed species fractions and simulations without growth are allowed). CommModelPy is written in Python and based on the module cobrapy [41]. Among others it supports the SBML format [42]. Herein we used it with Python in version 3.7 and cobrapy 0.17.1. CommModelPy is free and open-source and can be retrieved via its GitHub repository at:
https://github.com/klamt-lab/CommModelPy
The repository also includes a manual and, as a usage example, the CommModelPy-based scripts for the generation of all multi-strain community models used in this study.
ASTHERISC package
The actual ASTHERISC algorithm with all its subroutines has been implemented in the form of the ASTHERISC package. As input, it takes a CommModelPy-generated multi-strain community model from an SBML file and a list of reaction-associated ΔrG′0 values from a JSON file. As output, it generates detailed text reports for which target metabolites OptMDF advantages could be found in a community. The package is written in MATLAB and uses API functions from CellNetAnalyzer [43] and the IBM CPLEX solver. Herein, it was used with MATLAB 2018a, CellNetAnalyzer 2019.2 and CPLEX 12.9. The ASTHERISC package is free and open-source and can be found under its GitHub repository:
https://github.com/klamt-lab/astheriscPackage
This repository includes all scripts and detailed report files of the ASTHERISC-based analyses performed in this study.
Results
Application of ASTHERISC to a small example model
We first illustrate our developed ASTHERISC framework with the toy model given in Fig 2. CommModelPy was used to build a community model. As feasible metabolite ranges, we assumed 1 M-10 M for all metabolites, except for the external product (Pex) where we demand a narrower range of 5M-10M so that the product can be synthesized also under higher external product concentrations (see legend of Fig 2). The application of ASTHERISC successfully identified the shown community solution in Fig 2, which improves the maximal thermodynamic driving force as quantified by the OptMDF value. With the single strain, the calculated OptMDF of the pathway from external substrate to external product had a negative value of -0.19 kJ/mol and would thus be infeasible. With the found two-strain community solution, the OptMDF had a positive value of around 0.46 kJ/mol. Hence, in this scenario, the OptMDF is not only increased; the sign change indicates that the community makes product synthesis feasible at all.
The detailed report generated by ASTHERISC for this example is shown in S3 Text. One important information provided in this report is which metabolite concentrations differ in the two strains in the found community solution indicating where a separation in two strains helps to overcome thermodynamic bottlenecks. To obtain the OptMDF of 0.46 kJ/mol in the community solution, X must have a lower concentration between 1 M and 1.37 M in strain 1 and a higher concentration range between 3.65 M and 10 M in strain 2. This explains why the two reactions R2 and R5 had to be separated into strains. Together with the P-exporting reaction R6, which has a low maximal driving force due to the constraint of a high external product (Pex) concentration, these two reactions were also identified by ASTHERISC as the sole bottleneck reactions of the single-strain case.
Applying ASTHERISC to multi-strain community models of E. coli
As a practically relevant case, we apply ASTHERISC to identify multi-strain communities of on E. coli that could potentially improve the thermodynamic driving force of the production of metabolites. In total, we considered three different community models. Two of them, ecolicore2double and ecolicore2triple, are based on the EColiCore2 model [44] (a reduced version of the genome-scale model iJO1366 [45] containing 499 reactions and 486 metabolites). ecolicore2double contains two and ecolicore2triple three duplicates of the EColiCore2 network as strains in the community model. The third community model, iML1515double, contains two copies of the recently published genome-scale network iML1515 [46]. A detailed description of the construction of the community models, the assignment/computation of ΔrG′0, the used standard exchange reactions and other constraints can be found in S2 Text. In short, as the main source of ΔrG′0, we employed the Python-based eQuilibrator API, which uses the component contribution method [47]. Reactions, for which no ΔrG′0 could be found or calculated (93 reactions (18%) in EColiCore2 and 646 (24%) in iML1515), were blocked in the model (rate fixed to zero) to avoid the calculation of thermodynamically infeasible solutions when leaving these ΔrG′0 unconstrained. Glucose was used as substrate and its uptake rate was fixed to 1 mmol/(gDW∙h) (this is only made for technical reasons; it does not change the results since all solutions are scalable). No other flux bounds were set and since growth was not considered this allows use of identical biomass fractions for all involved strain (50% in two-strain communities and 33% in three-strain communities). The three community models were built with CommModelPy and can be found as SBML files in the mentioned GitHub repositories.
With the reduced set of active reactions (due to some missing ΔrG′0) and with a demanded minimal product yield of 10−6 (to prevent potential numeric problems), there were 161 producible target metabolites in the EColiCore2-related community models and 254 in iML1515double. For each of the producible target metabolites, we considered in each of the three community models 8 different scenarios resulting from four different yield thresholds (40%, 60%, 80% and 98% of the maximal yield of the target metabolite) and two different bounds for the maximal number of (extra) exchange reactions (an infinite or a maximal number of 9 extra exchanges). Hence, 24 scenarios are considered in total. To exclude solutions with just marginal MDF improvements, we demanded a minimal community MDF advantage of 0.2 kJ/mol and a positive OptMDF value was demanded for the community (not for an associated single-strain) solution in order to get thermodynamically feasible results.
For all 24 scenarios, ASTHERISC could find a wide range of target metabolites where a community was able to provide a pathway with a higher OptMDF than a single strain (for a complete listing of all results see the report files in the ASTHERISC package’s repository). The percentage of producible metabolites for which an advantage could be found in the community ranged from 15.75% up to 40.55% (Table 1). In most cases, as expected, limiting the maximal extra exchanges to 9 reduces the number of found improvements compared to the case with unlimited exchanges. Since the computation was stopped for several metabolites due to MILP timeouts (especially in the genome-scale community model), the percentages given for the different scenarios should be seen as lower bound of possible improvement percentages.
Table 1. Percentage (and absolute number) of all producible target metabolites for which ASTHERISC found a higher optimal MDF in a multi-strain community compared to a single-strain model.
Model (number of allowed extra exchanges) | Demanded minimum product yield (% of maximum yield) | |||
---|---|---|---|---|
40% | 60% | 80% | 98% | |
ecolicore2double (9) | 22.36% (36) | 21.74% (35) | 21.12% (34) | 21.12% (34) |
ecolicore2double (infinite) | 23.6% (38) | 35.4% (57) | 27.95% (45) | 31.06% (50) |
ecolicore2triple (9) | 21.12% (34) | 19.88% (32) | 17.39% (28) | 19.25% (31) |
ecolicore2triple (infinite) | 19.25% (31) | 31.06% (50) | 27.33% (44) | 29.81% (48) |
iML1515double (9) | 17.72% (45) | 15.75% (40) | 21.26% (54) | 25.59% (65) |
iML1515double (infinite) | 18.5% (47) | 20.08% (51) | 25.59% (65) | 40.55% (103) |
The OptMDF advantages of community solutions compared to single-strain solutions ranged from 0.22 kJ/mol (just over the demanded minimal MDF advantage of 0.2 kJ/mol) up to 7.26 kJ/mol, while the mean MDF advantages range from 0.64 kJ/mol to 2.0 kJ/mol for ecolicore2double and ecolicore2triple and from 0.91 kJ/mol to 1.94 kJ/mol for iML1515double (Table 2). Interestingly, the highest MDF improvements occur when high product yields are demanded. This indicates that low-yield pathways in the single strain might have high MDFs which are more difficult to improve by a community solution. We also found some solutions in which the MDF of the single strain is negative and in the community positive, hence, where product synthesis becomes thermodynamically feasible at all in the community (similar as in the toy model scenario in Fig 2). This was the case for 7 metabolites in all runs with a yield threshold of 98% with ecolicore2double and ecolicore2triple, and for 3 metabolites in all runs with iML1515double.
Table 2. Statistics about OptMDF advantages (in kJ/mol) with communities for each of the 24 scenarios.
Model (number of allowed extra exchanges) | Demanded minimum product yield (% of maximum yield) | |||
---|---|---|---|---|
40% | 60% | 80% | 98% | |
ecolicore2double (9) | 0.32/0.94/2.76 | 0.2/0.64/3.13 | 0.29/0.67/3.24 | 0.27/2.0/5.49 |
ecolicore2double (infinite) | 0.32/1.29/3.42 | 0.26/1.02/5.07 | 0.22/1.04/5.07 | 0.23/1.77/6.28 |
ecolicore2triple (9) | 0.32/0.83/2.76 | 0.22/0.63/3.03 | 0.29/0.7/3.24 | 0.5/1.85/5.49 |
ecolicore2triple (infinite) | 0.29/1.42/3.42 | 0.26/1.09/4.71 | 0.22/0.93/4.71 | 0.23/1.81/6.28 |
iML1515double (9) | 0.26/0.92/2.95 | 0.33/1.0/2.79 | 0.23/0.93/2.66 | 0.22/1.58/5.55 |
iML1515double (infinite) | 0.26/0.97/3.04 | 0.26/0.91/2.79 | 0.3/0.91/2.64 | 0.22/1.94/6.04 |
Next, we analyzed how many metabolite exchanges were used in the found community solutions (Table 3). For the scenario with a limited number of (extra) exchanges, it should be noted that a maximum of 9 active exchange reactions in a community with two strains correspond to an exchange of maximal 4 metabolites between the two strains: 4 exchange reactions from strain 1 for export/import of metabolites and 4 reactions of strain 2 for import/export of the same 4 metabolites plus the export of the target product by one of the two strains. The number of used metabolite exchanges per target metabolite solution ranged from 1 (in those cases, default metabolite exchanges were typically used between the strains which are not counted as extra exchanges) up to 51.
Table 3. Statistics about the number of used extra exchanges in solutions with a community OptMDF advantage for each of the 24 scenarios analyzed.
Model (number of allowed extra exchanges) | Demanded minimum product yield (% of maximum yield) | |||
---|---|---|---|---|
40% | 60% | 80% | 98% | |
ecolicore2double (9) | 3/5.39/8 | 4/6.34/7 | 4/6.41/9 | 3/6.03/9 |
ecolicore2double (infinite) | 3/16.66/29 | 7/15.88/31 | 6/13.76/29 | 8/15.88/41 |
ecolicore2triple (9) | 3/4.47/7 | 3/5.88/7 | 4/5.89/7 | 3/5.29/9 |
ecolicore2triple (infinite) | 3/18.13/38 | 3/14.8/36 | 5/13.36/34 | 4/16.83/51 |
iML1515double (9) | 1/5.18/7 | 3/5.05/9 | 1/5.11/9 | 2/5.14/9 |
iML1515double (infinite) | 3/15.28/33 | 3/14.16/27 | 3/14.11/37 | 3/14.34/28 |
Some metabolites occur with high frequency as exchange metabolites in the found solutions. For example, glycerol3-phosphate, (glyc3p), fructose-6-phosphate (f6p) and oxaloacetate (oaa) are top-ranked exchange metabolites in ecolicore2double and ecolicore2triple while dihydroxyacetone (dha), L-aspartate (asp__L) and acetaldehyde (acald) are often contained in solutions for iML1515double. Most of the top-ranked exchange metabolites are involved in the glycolysis or in pathways directly connected to glycolysis indicating central thermodynamic bottlenecks relevant for the production of many target metabolites.
Coinciding with the dominant extra exchange reactions, a few discrete MDF values are often dominating for each scenario. In particular, if target metabolites are closely located in the metabolic network, then their corresponding community solutions have often the same associated OptMDF value as they share the same metabolic bottleneck for their production. Some of these dominant MDF values occurred in all analyzed models. This might indicate that a small set of central bottleneck reactions is thermodynamically constraining a large set of possible solutions.
Community solution example
To discuss one representative solution as an example, we selected the production of 3-Deoxy-D-manno-octulosonate 8-phosphate (kdo8p) in ecolicore2double with a demanded minimal product yield of 98% of the maximum yield (0.74775 mol kdo8p per mol glucose) and with an allowed maximum of 9 extra exchanges. The last step of kdo8p synthesis is catalyzed by the kdo8p synthase (KDOPS), which has been evaluated as a potential antibiotics target [48]. In the following, the two designed strains within the ecolicore2double community model will be denoted by “ecoli1” and “ecoli2”, respectively. The report delivered by ASTHERISC for kdo8p can be found in S4 Text.
With the given constraints, an OptMDF of 2.27 kJ/mol can be reached for the production of kdo8p from glucose with a single strain. In the community solution found by ASTHERISC, the MDF value can be increased by 26% up to 2.87 kJ/mol reaching a yield of 0.73311 mol/mol, which is just 0.04% higher than the demanded minimal yield. While the single-strain solution uses 42 reactions (thereof 14 standard exchange reactions for glucose, water, etc.), the community requires 66 reactions, thereof 22 exchange reactions, 28 internal reactions in ecoli1 and 16 reactions in ecoli2. 17 of the 22 exchange reactions are standard exchanges and four extra exchange reactions are used for metabolite exchanges between ecoli1 and ecoli2 (Fig 6): two for the exchange of dihydroxyacetone (dha) from ecoli1 to ecoli2, two for the exchange of 6-Phospho-D-gluconate (6pgc) from ecoli2 to ecoli1 and one for the secretion of kdo8p to the environment.
To understand why this community solution leads to an MDF advantage, it is helpful to look at the three thermodynamic bottleneck reactions for kdo8p synthesis in the MDF-optimal single-strain solution (Fig 6). These three connected reactions (shown in red in Fig 6), which all occur in the upper glycolysis, are (1) the fructose-bisphosphate aldolase reaction (FBA; ΔrG′0 = 22.4 kJ/mol) producing glyceraldehyde 3-phosphate (g3p) and dihydroxyacetone phosphate (dhap) from D-fructose 1,6-bisphosphate (fdp), (2) the triose-phosphate isomerase reaction (TPI; ΔrG′0 = 5.6 kJ/mol) converting dhap to g3p, and (3) the glyceraldehyde-3-phosphate dehydrogenase reaction (GAPD; ΔrG′0 = 4.6 kJ/mol) where g3p is converted to 3-phospho-D-glycerol phosphate (13dpg). As already pointed out in [30] and [31], these three reactions form a distributed thermodynamic bottleneck as they all have unfavorable thermodynamics to operate in forward direction (positive ΔrG′0) and conflicting needs regarding the g3p concentration since the FBA and TPI reactions thermodynamically benefit from a lower and the GAPD reaction from a higher g3p concentration. There is also a fourth associated bottleneck reaction, the symporter-assisted uptake of phosphate into the cytosol (PIt2rpp) with a ΔrG′0 of 0 kJ/mol, which delivers the phosphate needed in the GAPD reaction. For the sake of simplicity, it was not included in Fig 6. The optimal adjustment of metabolite concentrations in the single strain allows a maximum MDF value of 2.27 kJ/mol and changing the concentration, e.g. of g3p, will inevitably lead to an increase of the ΔrG′ of at least one reaction and thus to a decrease of the MDF.
In the community solution, the bottleneck is resolved as follows (Fig 6): In order to produce kdo8p through the reaction KDOPS (kdo8p synthase), both phosphoenolpyruvate (pep) and D-arabinose 5-phosphate (ara5p) are needed as substrate. In the single strain solution, pep is produced using the glycolysis, thus requiring the bottleneck reactions FBA, TPI and GAPD. ara5p is produced through reactions which do not pose a bottleneck, using a pathway via 6-phospho-D-gluconate (6pgc). In the community solution, the bottleneck triangle of the FBA, TPI and GAPD reactions is resolved by separating the occurrence of the g3p-using GAPD reaction into ecoli1 and the g3p-producing FBA and TPI reactions into ecoli2, thus omitting the conflicting needs of g3p. This is achieved in the following way: (1) ecoli1 concentrates on pep synthesis, thereby omitting parts of the upper glycolysis by using the fructose 6-phosphate aldolase reaction (F6PA), which has f6p as a substrate and dihydroxyacetone (dha) and g3p as products. g3p is further processed using GAPD, and dha is secreted and taken up by ecoli2 for further processing (see below). With a higher concentration of f6p, a higher and thermodynamically more favorable concentration (compared to the single-strain solution) of g3p is achieved in ecoli1, so that the ΔrG′ value of the GAPD reaction can be lowered (and thus the MDF be increased) in the community solution. Furthermore, a higher f6p concentration also requires an increased D-glucose 6-phosphate (g6p) concentration since f6p is produced from it using the glucose-6-phosphate isomerase reaction (PGI). (2) ecoli2 concentrates on the production of 6pgc from dha. In a first step, the reverse F6PA reaction is used to produce f6p from dha and g3p. While dha is provided from ecoli1, the required g3p is recycled from a portion of the produced f6p via the g3p-producing reactions FBA and TPI. Due to the very negative ΔrG′0 of the reverse F6PA reaction, low concentrations of dha and g3p can be used in ecoli2 resulting in a more negative ΔrG′ of the FBA and TPI reactions. The second half of f6p is used to synthesize 6pgc via the (reverse) PGI, G6PDH2r and PGL reactions. In order to make all these reactions work with low ΔrG′ (resulting in high MDF), the concentrations of dha, f6p, g6p and, as mentioned before, of g3p are reduced compared to ecoli1. The three reactions on the pathway from f6p to 6pgc still reach sufficient thermodynamic driving force because of the very negative ΔrG′0 of these reactions. (3) The 6pgc produced by ecoli2 is then secreted and taken up by ecoli1and further processed via two pathways to synthesize pep and D-arabinose 5-phosphate (ara5p), the two precursors of kdo8p. pep is produced via g3p and pyruvate (pyr) using Entner-Doudoroff pathway reactions, including the 6-phosphogluconate dehydratase reaction (EDD) with a very low ΔrG′0 of -42.8, which allows the concentrations of the affected metabolites to be more variable in order to obtain a high MDF. Note that, in ecoli1, some additional amount of pep is produced via glycolytic reactions from g3p using, for example, the GAPD reaction (Fig 6). The other precursor ara5p is synthesized via the two reactions GND and A5PISO. Finally, ara5p and pep are used by KDOPS to produce kdo8p, which is then secreted into the environment. It has to be mentioned that more reactions are involved in the single-strain as well as in the multi-strain solution to obtain a stoichiometrically balanced flux (not displayed in Fig 6). These reactions are required, for example, to balance cofactors (ATP, NAD(P)H) produced or consumed along the sketched pathways, however, they are not critical for the respective MDF.
Overall, the community solution follows the basic principle of the toy model (Fig 2): through a suitable partitioning (and partially complementation) of the pathway from glucose to the target product kdo8p in the two strains, critical metabolite concentrations (in this example, dha, f6p, g3p and gp6 as indicated in Fig 6) can be maintained with different levels in ecoli1 (high concentrations) and ecoli2 (low concentrations), thereby reaching more negative ΔrG′ for the bottleneck reactions and thus a higher overall MDF. This can also be seen by the feasible concentrations ranges of the critical metabolites in ecoli1 and ecoli2 which maintain the OptMDF of 2.87 KJ/mol (Table 4). These metabolites, which occur in the bottleneck reactions of the single-strain solution, have no overlapping concentration ranges between ecoli1 and ecoli2 showing that the given community solution indeed cannot be reached in a single strain. Interestingly, the community solution has in turn new thermodynamic bottlenecks, again involving the reactions FBA, TPI and GAPD. Those might be further resolved by using a community with more than two strains. However, the corresponding kdo8p solution with ecolicore2triple does not provide a higher OptMDF in the community, indicating that more than three species might be needed in order to achieve a further MDF advantage.
Table 4. Concentration ranges of critical metabolites in the ecolicore2double community solution for kdo8p synthesis with optimal MDF.
ecoli1 | ecoli2 | |||
---|---|---|---|---|
Minimum [mmol/l] | Maximum [mmol/l] | Minimum [mmol/l] | Maximum [mmol/l] | |
dha_c | 0.59826 | 0.59905 | 0.058969 | 0.059048 |
f6p_c | 0.37078 | 0.37128 | 0.001 | 0.0010027 |
g3p_c | 0.32482 | 0.32525 | 0.090047 | 0.090286 |
g6p_c | 19.973 | 20 | 0.0019629 | 0.053238 |
Run time
For all calculations we used a high-performance computer cluster comprising, for each of its nodes, 192 GB working memory and two processors each including eight cores with a standard clock rate of 2.1 GHz. Each of the mentioned 24 scenarios was running on its own node. The resulting run times for the scenarios (cumulated for all producible target metabolites) ranged from 1.63 days (ecolicore2double with a minimal yield factor of 98% and 9 maximal extra exchanges) up to 4.54 days (iML1515double with a minimal yield factor of 80% and 9 maximal extra exchanges).
Discussion
In this study we presented ASTHERISC, a new algorithm for designing multi-strain microbial communities that maximize the thermodynamic driving force of product synthesis with a given production host. The key methodological approach behind ASTHERISC is to compartmentalize certain segments of the product pathway via different strains and to allow an exchange of intermediates between these strains such that the overall product synthesis pathway through the involved strains may reach a higher thermodynamic driving force than in the single strain. While the set of active reactions in the different strains can overlap, using more than one strain adds degrees of freedom with respect to the achievable MDF because the metabolite concentrations may differ between the strains. In this way, thermodynamic bottlenecks, which may arise when operation of one reaction requires a metabolite with high and operation of another reaction the same metabolite with a low concentration, can be circumvented.
In recent years, microbial consortia and their potential for biotechnological production processes have attracted much attention. However, although several single-species (multi-strain) communities with essential metabolic dependencies between the strains have been constructed and analyzed, there are still only few methods available for an automated design of communities with superior properties [5], with FLYCOP [50] and DOLMN [17] as two particular examples. Here, ASTHERISC adds a new class of such algorithms which is driven by a thermodynamic principle. The recently presented DOLMN approach [17], which searches for an optimal partitioning of a species’ metabolism when limiting the number of reactions per strain, shares some similarities with ASTHERISC. However, to the best of our knowledge, a design of microbial communities with the goal to optimize the thermodynamic feasibility of production of a target metabolite has not been addressed before.
The application of our algorithm to a metabolic core and a genome-scale model of E. coli revealed that there is indeed a significant number of metabolic products where a multi-strain community of E. coli may provide higher thermodynamic driving forces. These communities may be an effective way to enhance or even enable the synthesis of certain metabolic products. ASTHERISC, together with the developed software packages, provides an automatable way to identify such beneficial communities consisting of multiple strains of one species. However, while the found solutions demonstrate that higher thermodynamic driving forces may be generated with specifically designed communities, significant effort will be needed for their practical realization. First of all, designing those strains will require the deactivation of certain metabolic reactions (e.g. through gene knockouts) according to the found community solution. Finding suitable knockout strategies that leave only the relevant pathway segments active in the respective strains can be computed by dedicated strain design algorithms, e.g. based on minimal cut sets [51]. Implementing the respective gene knockout strategies may lead to unstable strains or even prevent growth. While the (stoichiometric) feasibility of growth may be demanded when computing the respective intervention strategies, it might be more practical to use two-stage processes, where the strains can first grow before a pure production phase is initiated (e.g. via dynamic metabolic switches [52,53]). This also circumvents possible limitations in the MDF of the production pathway when growth and production are coupled (in our calculations we focused on optimal pathway partitioning for pure product synthesis and did not consider feasibility of growth).
Next, transport of exchange metabolites between the strains must be faciitated (e.g. via overexpression of dedicated transport systems). Transport systems are only available/known for a subset of metabolites and solutions involving only few exchanges may thus be preferred. However, many of the found community solutions for different products use similar sets of exchange metabolites. Hence, once suitable transporter mechanisms have been discovered or implemented for these metabolites, a larger number of strain designs may become feasible. As another aspect for metabolite transport, it was assumed for all exchange reactions that a concentration gradient must exist between the cellular and the external compartment, however, transport costs, e.g. energy consumption to actively transport a metabolite over the membrane, were not explicitly considered (except for transporters already included in the base model). These overhead costs may have adverse effects on product yields and also abolish the MDF advantage of a community solution. To test the sensitivity of the latter when introducing energy costs for metabolite transport, we reran all calculations in a variant of the ecolicore2double model, where each introduced (extra) metabolite exchange consumes 1/3 ATP per transported metabolite molecule. Although there are few specific changes in the set of metabolites where an MDF advantage can be found, we did not find larger changes in the overall percentage of metabolites with MDF advantage in a community. In fact, export of the (end) product does then also require ATP in the single-strain solution, which may lead to a change in the structure of the active network possibly favoring a solution in the community. The associated scripts and result files can be found in the already mentioned GitHub repositories of CommModelPy and the ASTHERISC package.
Another issue for realization of a calculated community design is the long-term stability of the community, which may require genetic methods and general principles as developed in [54] and, more recently, [9]. Finally, reaching the computed maximal thermodynamic driving forces for the communities also demands that the metabolite concentrations are within the respective ranges required for the optimal MDF and it is not per se clear whether this will be the case or can be adjusted by suitable selection mechanisms. It thus remains to be shown that the computed communities may constitute an economically relevant enhancement. However, it should be noted that many of the discussed difficulties are relevant for the implementation of any (computed) synthetic community.
The ASTHERISC runs in the example calculations of this study were stable. Potential numerical problems arising due to precision tolerances of the MILP are partially directly prevented or detected and handled by our implementation. For example, “fake community solutions” delivered by the solver, where effectively only one strain is used, are detected and disregarded. It should also be noted that the set computational time limits may affect the results. This can be seen in the reduced MDF advantage percentages with ecolicore2triple compared to ecolicore2double (Table 1), where the MDF advantage in ecolicore2triple should be at least as high as in ecolicore2double as the former should implicitly contain all two-strain solutions. This effect arose due to MILP time limits which were more likely to be reached in the larger model. Nevertheless, the runs of the ecolicore2triple community model clearly showed that a larger number of strains may increase the overall community advantage. Finally, the calculations with the iML1515double model demonstrated that our implementation of the ASTHERISC algorithm can be applied to genome-scale models with a reasonable runtime. Indeed, comparing the found solutions for ecolicore2double and iML1515double indicates that, for some metabolites, other MDF-optimizing pathways may be found in a genome-scale model, although they may involve more uncommon pathways (with possibly lower capacity) than in the solutions found in the core model.
The results obtained with ASTHERISC also depend on the used settings (e.g. demanded minimum MDF advantage and minimum product yield) and the calculations involve partially uncertain parameters or assumptions (e.g. the metabolite concentration ranges or the ΔrG′0 calculated with the eQuilibrator method [47]). However, the calculations can be easily repeated with alternative values if these are considered to be more relevant or evident. Furthermore, the developed CommModelPy and ASTHERISC packages are programmatically independent from the used models and can thus be readily applied for the generation and study of other single-species. Although not yet tested, ASTHERISC could also be used with multi-species communities, for example, to test which product synthesis pathway (with associated metabolite exchanges) in a multi-species community leads to the highest MDF and whether this MDF is superior over a potentially existing single-species pathway. Finally, we also anticipate useful applications of the ASTHERISC method for finding suitable compartmentalization strategies maximizing thermodynamic driving forces in cell-free production systems.
Community design by ASTHERISC exploits the fact that having multiple strains allows adjustment of different metabolite concentrations in the different strains. While ASTHERISC uses this degree of freedom for maximizing the thermodynamic driving force, future work could seek to directly maximize the product synthesis rate in the community by finding a pathway partitioning (again with specific metabolite concentrations in each strain) such that, under consideration of kinetic rate laws, the overall product flux is optimized. Ideally, if a kinetic model of the (central) metabolism would be available one could directly search for such an optimal solution. For example, if we would have a simple kinetic model of the toy network in Fig 2, it would most likely suggest to split the pathway in the same way as was done via the thermodynamic principles to maximize the pathway flux. However, a kinetic approach would also allow the consideration of saturation effects, allosteric regulations (e.g. feedback inhibition of the first reaction by intermediate or end metabolites of the pathway) or of resource (enzyme) allocation constraints. Since predictive kinetic models are often not available, it needs to be investigated whether other approaches requiring less information (e.g. based on kcat or Km values [55,56]) may already suggest useful solutions.
Supporting information
Acknowledgments
We are grateful to Axel von Kamp for his assistance and discussions on using OptMDFpathway in the context of ASTHERISC.
Data Availability
Data are available within the manuscript and Supporting Information and in parts from a public repository: https://github.com/klamt-lab/CommModelPy https://github.com/klamt-lab/astheriscPackage.
Funding Statement
This work was supported by the EU program ERDF (European Regional Development Fund) of the German Federal State of Saxony-Anhalt within the Research Center for Dynamic Systems (CDS; given to SK), and by the German Federal Ministry of Education and Research (FKZ: 031B0524B; given to SK). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Smith JM, Szathmary E. The major transitions in evolution: OUP Oxford; 1997. 1997-10-30. 361 p. [Google Scholar]
- 2.Pande S, Merker H, Bohl K, Reichelt M, Schuster S, de Figueiredo LF, et al. Fitness and stability of obligate cross-feeding interactions that emerge upon gene loss in bacteria. The ISME Journal. 2014;8(5):953–62. doi: 10.1038/ismej.2013.211 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Song H-S, Cannon WR, Beliaev AS, Konopka A. Mathematical modeling of microbial community dynamics: a methodological review. Processes. 2014;2(4):711–52. doi: 10.3390/pr2040711 [DOI] [Google Scholar]
- 4.Succurro A, Ebenhöh O. Review and perspective on mathematical modeling of microbial ecosystems. Biochemical Society Transactions. 2018;46(2):403–12. doi: 10.1042/BST20170265 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.García-Jiménez B, Torres-Bacete J, Nogales J. Metabolic modelling approaches for describing and engineering microbial communities. Computational and Structural Biotechnology Journal. 2021;19:226–46. doi: 10.1016/j.csbj.2020.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Mauri M, Gouzé J-L, Jong Hd, Cinquemani E. Enhanced production of heterologous proteins by a synthetic microbial community: Conditions and trade-offs. PLOS Computational Biology. 2020;16(4):e1007795. doi: 10.1371/journal.pcbi.1007795 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Anane E, López C DC, Neubauer P, Cruz Bournazou MN. Modelling overflow metabolism in Escherichia coli by acetate cycling. Biochemical Engineering Journal. 2017;125:23–30. doi: 10.1016/j.bej.2017.05.013 [DOI] [Google Scholar]
- 8.Orth JD, Thiele I, Palsson BØ. What is flux balance analysis? Nature biotechnology. 2010;28(3):245–8. doi: 10.1038/nbt.1614 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Losoi PS, Santala VP, Santala SM. Enhanced population control in a synthetic bacterial consortium by interconnected carbon cross-feeding. ACS Synthetic Biology. 2019;8(12):2642–50. doi: 10.1021/acssynbio.9b00316 [DOI] [PubMed] [Google Scholar]
- 10.Stolyar S, Van Dien S, Hillesland KL, Pinel N, Lie TJ, Leigh JA, et al. Metabolic modeling of a mutualistic microbial community. Molecular Systems Biology. 2007;3(1):92. doi: 10.1038/msb4100131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Chaganti SR, Kim D-H, Lalman JA. Flux balance analysis of mixed anaerobic microbial communities: Effects of linoleic acid (LA) and pH on biohydrogen production. International Journal of Hydrogen Energy. 2011;36(21):14141–52. doi: 10.1016/j.ijhydene.2011.04.161 [DOI] [Google Scholar]
- 12.Benito-Vaquerizo S, Diender M, Olm IP, Santos VMd, Schaap PJ, Sousa DZ, et al. Modeling a co-culture of Clostridium autoethanogenum and Clostridium kluyveri to increase syngas conversion to medium-chain fatty-acids. bioRxiv. 2020:2020.06.23.167189. doi: 10.1016/j.csbj.2020.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Schwalm ND, Mojadedi W, Gerlach ES, Benyamin M, Perisin MA, Akingbade KL. Developing a microbial consortium for enhanced metabolite production from simulated food waste. Fermentation. 2019;5(4):98. doi: 10.3390/fermentation5040098 [DOI] [Google Scholar]
- 14.Chan SHJ, Simons MN, Maranas CD. SteadyCom: Predicting microbial abundances while ensuring community stability. PLOS Computational Biology. 2017;13(5):e1005539. doi: 10.1371/journal.pcbi.1005539 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Koch S, Kohrs F, Lahmann P, Bissinger T, Wendschuh S, Benndorf D, et al. RedCom: A strategy for reduced metabolic modeling of complex microbial communities and its application for analyzing experimental datasets from anaerobic digestion. PLOS Computational Biology. 2019;15(2):e1006759. doi: 10.1371/journal.pcbi.1006759 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Khandelwal RA, Olivier BG, Röling WFM, Teusink B, Bruggeman FJ. Community flux balance analysis for microbial consortia at balanced growth. PLOS ONE. 2013;8(5):e64567. doi: 10.1371/journal.pone.0064567 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Thommes M, Wang T, Zhao Q, Paschalidis IC, Segrè D. Designing metabolic division of labor in microbial communities. mSystems. 2019;4(2). doi: 10.1128/mSystems.00263-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zhang H, Wang X. Modular co-culture engineering, a new approach for metabolic engineering. Metabolic Engineering. 2016;37:114–21. doi: 10.1016/j.ymben.2016.05.007 [DOI] [PubMed] [Google Scholar]
- 19.Zhou K, Qiao K, Edgar S, Stephanopoulos G. Distributing a metabolic pathway among a microbial consortium enhances production of natural products. Nature Biotechnology. 2015;33(4):377–83. doi: 10.1038/nbt.3095 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Lee CR, Kim C, Song YE, Im H, Oh Y-K, Park S, et al. Co-culture-based biological carbon monoxide conversion by Citrobacter amalonaticus Y19 and Sporomusa ovata via a reducing-equivalent transfer mediator. Bioresource Technology. 2018;259:128–35. doi: 10.1016/j.biortech.2018.02.129 [DOI] [PubMed] [Google Scholar]
- 21.Shin H-D, McClendon S, Vo T, Chen RR. Escherichia coli binary culture engineered for direct fermentation of hemicellulose to a biofuel. Applied and Environmental Microbiology. 2010;76(24):8150–9. doi: 10.1128/AEM.00908-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Jones JA, Vernacchio VR, Sinkoe AL, Collins SM, Ibrahim MHA, Lachance DM, et al. Experimental and computational optimization of an Escherichia coli co-culture for the efficient production of flavonoids. Metabolic Engineering. 2016;35:55–63. doi: 10.1016/j.ymben.2016.01.006 [DOI] [PubMed] [Google Scholar]
- 23.Jones JA, Vernacchio VR, Collins SM, Shirke AN, Xiu Y, Englaender JA, et al. Complete biosynthesis of anthocyanins using E. coli polycultures. mBio. 2017;8(3). doi: 10.1128/mBio.00621-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lindemann SR, Bernstein HC, Song H-S, Fredrickson JK, Fields MW, Shou W, et al. Engineering microbial consortia for controllable outputs. The ISME Journal. 2016;10(9):2077–84. doi: 10.1038/ismej.2016.26 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Roell GW, Zha J, Carr RR, Koffas MA, Fong SS, Tang YJ. Engineering microbial consortia by division of labor. Microbial Cell Factories. 2019;18(1):35. doi: 10.1186/s12934-019-1083-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Tsoi R, Wu F, Zhang C, Bewick S, Karig D, You L. Metabolic division of labor in microbial systems. Proceedings of the National Academy of Sciences. 2018;115(10):2526–31. doi: 10.1073/pnas.1716888115 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Kreft J-U, Griffin BM, González-Cabaleiro R. Evolutionary causes and consequences of metabolic division of labour: why anaerobes do and aerobes don’t. Current Opinion in Biotechnology. 2020;62:80–7. doi: 10.1016/j.copbio.2019.08.008 [DOI] [PubMed] [Google Scholar]
- 28.Harvey E, Heys J, Gedeon T. Quantifying the effects of the division of labor in metabolic pathways. Journal of theoretical biology. 2014;360:222–42. doi: 10.1016/j.jtbi.2014.07.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Wu G, Yan Q, Jones JA, Tang YJ, Fong SS, Koffas MAG. Metabolic Burden: Cornerstones in Synthetic Biology and Metabolic Engineering Applications. Trends in Biotechnology. 2016;34(8):652–64. doi: 10.1016/j.tibtech.2016.02.010 [DOI] [PubMed] [Google Scholar]
- 30.Noor E, Bar-Even A, Flamholz A, Reznik E, Liebermeister W, Milo R. Pathway thermodynamics highlights kinetic obstacles in central metabolism. PLOS Computational Biology. 2014;10(2):e1003483. doi: 10.1371/journal.pcbi.1003483 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Hädicke O, Kamp Av, Aydogan T, Klamt S. OptMDFpathway: identification of metabolic pathways with maximal thermodynamic driving force and its application for analyzing the endogenous CO2 fixation potential of Escherichia coli. PLOS Computational Biology. 2018;14(9):e1006492. doi: 10.1371/journal.pcbi.1006492 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Henry CS, Broadbelt LJ, Hatzimanikatis V. Thermodynamics-based metabolic flux analysis. Biophysical Journal. 2007;92(5):1792–805. doi: 10.1529/biophysj.106.093138 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Zhu Y, Song J, Xu Z, Sun J, Zhang Y, Li Y, et al. Development of thermodynamic optimum searching (TOS) to improve the prediction accuracy of flux balance analysis. Biotechnology and Bioengineering. 2013;110(3):914–23. doi: 10.1002/bit.24739 [DOI] [PubMed] [Google Scholar]
- 34.Hoppe A, Hoffmann S, Holzhütter H-G. Including metabolite concentrations into flux balance analysis: thermodynamic realizability as a constraint on flux distributions in metabolic networks. BMC Systems Biology. 2007;1(1):23. doi: 10.1186/1752-0509-1-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Schellenberger J, Lewis NE, Palsson BØ. Elimination of thermodynamically infeasible loops in steady-state metabolic models. Biophysical Journal. 2011;100(3):544–53. doi: 10.1016/j.bpj.2010.12.3707 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Müller AC, Bockmayr A. Fast thermodynamically constrained flux variability analysis. Bioinformatics. 2013;29(7):903–9. doi: 10.1093/bioinformatics/btt059 [DOI] [PubMed] [Google Scholar]
- 37.Hamilton JJ, Contreras MC, Reed JL. Thermodynamics and H2 transfer in a methanogenic, syntrophic community. PLOS Computational Biology. 2015;11(7):e1004364. doi: 10.1371/journal.pcbi.1004364 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Noor E. Removing both internal and unrealistic energy-generating cycles in flux balance analysis. arXiv:180304999 [q-bio]. 2018. [Google Scholar]
- 39.Machado D, Andrejev S, Tramontano M, Patil KR. Fast automated reconstruction of genome-scale metabolic models for microbial species and communities. Nucleic Acids Research. 2018;46(15):7542–53. doi: 10.1093/nar/gky537 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Diener C, Gibbons SM, Resendis-Antonio O. MICOM: Metagenome-scale modeling to infer metabolic interactions in the gut microbiota. mSystems. 2020;5(1). doi: 10.1128/mSystems.00606-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Systems Biology. 2013;7(1):74. doi: 10.1186/1752-0509-7-74 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19(4):524–31. doi: 10.1093/bioinformatics/btg015 [DOI] [PubMed] [Google Scholar]
- 43.Klamt S, Saez-Rodriguez J, Gilles ED. Structural and functional analysis of cellular networks with CellNetAnalyzer. BMC Systems Biology. 2007;1(1):2. doi: 10.1186/1752-0509-1-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Hädicke O, Klamt S. EColiCore2: a reference network model of the central metabolism of Escherichia coli and relationships to its genome-scale parent model. Scientific Reports. 2017;7:39647. doi: 10.1038/srep39647 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Orth JD, Conrad TM, Na J, Lerman JA, Nam H, Feist AM, et al. A comprehensive genome-scale reconstruction of Escherichia coli metabolism—2011. Molecular Systems Biology. 2011;7(1):535. doi: 10.1038/msb.2011.65 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Monk JM, Lloyd CJ, Brunk E, Mih N, Sastry A, King Z, et al. i ML1515, a knowledgebase that computes Escherichia coli traits. Nature Biotechnology. 2017;35(10):904–8. doi: 10.1038/nbt.3956 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Flamholz A, Noor E, Bar-Even A, Milo R. eQuilibrator—the biochemical thermodynamics calculator. Nucleic Acids Research. 2012;40(D1):D770–D5. doi: 10.1093/nar/gkr874 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Du S, Faiger H, Belakhov V, Baasov T. Towards the development of novel antibiotics: synthesis and evaluation of a mechanism-based inhibitor of Kdo8P synthase. Bioorganic & Medicinal Chemistry. 1999;7(12):2671–82. doi: 10.1016/s0968-0896(99)00233-3 [DOI] [PubMed] [Google Scholar]
- 49.King ZA, Lu J, Dräger A, Miller P, Federowicz S, Lerman JA, et al. BiGG Models: A platform for integrating, standardizing and sharing genome-scale models. Nucleic Acids Research. 2016;44(D1):D515–D22. doi: 10.1093/nar/gkv1049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.García-Jiménez B, García JL, Nogales J. FLYCOP: metabolic modeling-based analysis and engineering microbial communities. Bioinformatics. 2018;34(17):i954–i63. doi: 10.1093/bioinformatics/bty561 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Schneider P, Kamp Av, Klamt S. An extended and generalized framework for the calculation of metabolic intervention strategies based on minimal cut sets. PLOS Computational Biology. 2020;16(7):e1008110. doi: 10.1371/journal.pcbi.1008110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Venayak N, von Kamp A, Klamt S, Mahadevan R. MoVE identifies metabolic valves to switch between phenotypic states. Nature Communications. 2018;9(1):5332. doi: 10.1038/s41467-018-07719-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Hartline CJ, Schmitz AC, Han Y, Zhang F. Dynamic control in metabolic engineering: Theories, tools, and applications. Metabolic Engineering. 2021;63:126–40. doi: 10.1016/j.ymben.2020.08.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Johns NI, Blazejewski T, Gomes AL, Wang HH. Principles for designing synthetic microbial communities. Current Opinion in Microbiology. 2016;31:146–53. doi: 10.1016/j.mib.2016.03.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Noor E, Flamholz A, Bar-Even A, Davidi D, Milo R, Liebermeister W. The Protein Cost of Metabolic Fluxes: Prediction from Enzymatic Rate Laws and Cost Minimization. PLOS Computational Biology. 2016;12(11):e1005167. doi: 10.1371/journal.pcbi.1005167 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Chen Y, Nielsen J. Mathematical modeling of proteome constraints within metabolism. Current Opinion in Systems Biology. 2021;25:50–6. doi: 10.1016/j.coisb.2021.03.003 [DOI] [Google Scholar]