Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2021 Jun 15;17(6):e1009093. doi: 10.1371/journal.pcbi.1009093

Designing microbial communities to maximize the thermodynamic driving force for the production of chemicals

Pavlos Stephanos Bekiaris 1, Steffen Klamt 1,*
Editor: Christoph Kaleta2
PMCID: PMC8232427  PMID: 34129600

Abstract

Microbial communities have become a major research focus due to their importance for biogeochemical cycles, biomedicine and biotechnological applications. While some biotechnological applications, such as anaerobic digestion, make use of naturally arising microbial communities, the rational design of microbial consortia for bio-based production processes has recently gained much interest. One class of synthetic microbial consortia is based on specifically designed strains of one species. A common design principle for these consortia is based on division of labor, where the entire production pathway is divided between the different strains to reduce the metabolic burden caused by product synthesis. We first show that classical division of labor does not automatically reduce the metabolic burden when metabolic flux per biomass is analyzed. We then present ASTHERISC (Algorithmic Search of THERmodynamic advantages in Single-species Communities), a new computational approach for designing multi-strain communities of a single-species with the aim to divide a production pathway between different strains such that the thermodynamic driving force for product synthesis is maximized. ASTHERISC exploits the fact that compartmentalization of segments of a product pathway in different strains can circumvent thermodynamic bottlenecks arising when operation of one reaction requires a metabolite with high and operation of another reaction the same metabolite with low concentration. We implemented the ASTHERISC algorithm in a dedicated program package and applied it on E. coli core and genome-scale models with different settings, for example, regarding number of strains or demanded product yield. These calculations showed that, for each scenario, many target metabolites (products) exist where a multi-strain community can provide a thermodynamic advantage compared to a single strain solution. In some cases, a production with sufficiently high yield is thermodynamically only feasible with a community. In summary, the developed ASTHERISC approach provides a promising new principle for designing microbial communities for the bio-based production of chemicals.

Author summary

Communities of microbes are ubiquitous in nature and also of high relevance for industrial applications, e.g. for the production of biogas. The development and use of non-natural communities for biotechnological applications has become an important subject of research. In this work, we present a new computational method to design synthetic communities with improved capabilities for the synthesis of desired target metabolites. Our method takes a constraint-based metabolic model of an organism as input and searches for a suitable partitioning of the product pathway via different strains of the organism such that the thermodynamic driving force for product synthesis is maximized. Essentially, this approach exploits the fact that having multiple strains allows adjustment of different metabolite concentrations in the different strains by which the thermodynamic driving force for product synthesis can often be increased. We tested this approach with a core and with a genome-scale metabolic network model of Escherichia coli. We found that, for dozens of metabolites, there exist communities with specifically designed strains of E. coli where the maximal thermodynamic driving force can be increased compared to a single E. coli strain. In summary, our presented method provides a new approach, together with a new design principle, for the computational design of microbial communities.

Introduction

In nature, organisms rarely occur as isolated populations of single species. They rather form communities (or consortia) with different types of complex interactions between the participating species which often drive evolution [1]. In particular, communities of microorganisms are ubiquitous in nature and play a fundamental role for ecology, bioremediation, geochemical cycles and human health. Microbial communities typically consist of different species, however, especially under controlled laboratory conditions, they may also comprise coexisting strains of a single species with certain physiological differences between the strains. One way to establish such single-species (multi-strain) communities is to construct and co-culture different, genetically modified strains with obligate mutual metabolic dependencies [2].

Mathematical modeling has become a valuable tool to formally describe and simulate complex microbial communities and to gain a deeper understanding of their behavior and properties. A range of different methods has been developed for community modeling [35]. As for modeling single species, two major approaches are kinetic modeling based on differential equations [6,7] and steady-state modeling utilizing constraint-based (flux balance analysis (FBA) [8]) techniques [913]. FBA-related methods are limited to predictions on stationary metabolic fluxes, but they require as input only the metabolic reaction network of the participating species and no data on kinetic mechanisms and parameters and can therefore be used to analyze large (genome-scale) metabolic models. Applying (linear) FBA-techniques to study communities models requires partially adaptations, e.g. due to arising bilinearities. Two particular examples of such formalizations are SteadyCom [14] and RedCom [15]. Both methods are based on the concept of balanced growth of a community [16] and allow predictions of growth rates, metabolic exchange rates and feasible community compositions. An example of an FBA-based method for the design of single-species (multi-strain) communities with certain constraints is DOLMN (Division Of Labor In Metabolic Networks) [17].

Microbial consortia are extensively used in biotechnological applications. One example is anaerobic digestion in biogas plants where methane is produced by naturally occurring communities. In recent years, artificial co-cultures have been constructed for synthesizing biofuels and value-added products in bioreactors [18]. While some of these realizations use multi-species communities [19,20], more and more realizations focus on the use of communities with different, purposefully constructed strains of a single species. Successful examples of single-species communities for product synthesis include (a) a two-strain E. coli community for the conversion of xylan to ethanol with high ethanol yields [21], (b) another two-strain E. coli community for the synthesis of flavonoids [22] with much higher synthesis rates than without communities, and (c) the synthesis of anthocyanins with a community of four E. coli strains [23], the first successful production of this chemical by microbes at all.

The high potential of dedicated single-species communities for optimizing biotechnological production of a certain compound has been intensively discussed in recent literature. Division of labor (DoL) is seen as one major design principle of such artificial co-cultures [2428]. The basic idea of DoL is illustrated with the toy example shown in Fig 1A, where we consider a linear metabolic pathway converting a substrate S into a desired target product P. Concretely, S is taken up, then, in a first reaction catalyzed by enzyme E1, converted to the intermediate metabolite M, which is in turn further converted to the product P in a second reaction catalyzed by enzyme E2, after which P is finally excreted. In a biotechnological application, the goal is to achieve a high flux through this pathway which will require a high abundance of the enzymes E1 and E2. Due to limited intracellular resources, the synthesis of each of the two enzymes competes with each other and with other cellular processes, which may result in a reduced production performance of the strain. Here, DoL may help to reduce the “metabolic burden” of the single strain by dividing the respective metabolic task and thus the metabolic costs on several strains of the same species [24,29]. In our example, a DoL strategy could be to split the pathway into two parts (or tasks) each being implemented in one dedicated strain. With an exchange of the intermediate metabolite M, each of the two strains may now concentrate on its dedicated task which could circumvent physiological bottlenecks. Clearly, the effectiveness of a DoL strategy will depend on the relative burden on each pathway step. However, even in the ideal case that the metabolic burden is equally distributed over the two reactions and thus over the two different strains, this DoL perspective neglects one major constraint: if we assume that we are given a certain amount of biomass that can be used for product synthesis (higher amounts of biomass will inevitably reduce product yield), DoL will not necessarily lead to increased product flux (Fig 1B). In the single strain approach, the specific rates (in [mmol/(gDW∙h)]) for reaction R1 are given by

r1SS=[E1]SSkcat,1f1(c) (1)

where [E1]SS is the concentration of enzyme E1 in the single strain, f1 the saturation function characterizing the kinetic rate law of reaction R1, and c the vector of the metabolite concentrations. Analogously, for the specific rate of reaction R2 we have:

r2SS=[E2]SSkcat,2f2(c) (2)

Fig 1. An example illustrating that DoL does not reduce the overall metabolic burden.

Fig 1

(A) Basic idea of DoL: a two-step pathway converting a substrate S to a product P via the two reactions R1 (catalyzed by enzyme E1) and R2 (catalyzed by enzyme E2) in a single strain is split into two parts each being performed by one dedicated strain. (B) Assuming an identical total amount of biomass, the metabolic burden (enzyme cost per biomass) is identical for the single strain and the DoL solution.

For simplicity, in the following we assume that the same amount of enzyme E1 and E2 is needed to induce a certain flux in the reactions R1 and R2 (implying kcat,1f1(c) = kcat,2fs(c)), and that both enzyme costs are identical (S1 Text shows that the same conclusions can be drawn in the more general case with arbitrary pathway length and enzyme costs under assumption of constant metabolite concentrations in the strains). Therefore, since the specific rates in the single-strain solution must fulfill rSSS=r1SS=r2SS=rPSS in steady state, the enzyme concentrations must be identical: [E1]SS = [E2]SS. For the total product synthesis flux JPSS [mmol/h] we need to multiply the specific flux with the amount of biomass BSS (unit [gDW]): JPSS=BSSr1SS(=BSSr2SS). Now, in the DoL approach with the assumed simplified case of identical enzyme requirements and costs, the biomass of the single strain would be divided into two equal parts (strains 1 and 2, each having 50% of the original total biomass: B1DoL=B2DoL=0.5BSS). For the steady-state product synthesis flux in the DoL approach, we obtain JPDoL=B1DoLr1DoL=B2DoLr2DoL. Therefore, in order to obtain the same total product synthesis flux as for the single-strain solution, the specific rates r1DoL and r2DoL must be doubled due to the halved biomass available for each strain. This requires doubled enzyme concentrations in the two strains, [E1]DoL = 2[E1]SS = [E2]DoL = 2[E2]SS, implying that the total concentration of enzymes required for the product pathway is identical in all three strains considered: [E]totSS=[E1]SS+[E2]SS=[E1]DoL=[E2]DoL. Thus, the total enzyme costs per biomass in the DoL scenario are the same as with the single-strain solution and the metabolic burden is not reduced. Furthermore, the exchange of the intermediate metabolite M between the strains under DoL is also associated with additional metabolic costs (transporters must be produced, energetic costs of transport etc.). In fact, as also shown for the more general case (S1 Text), a DoL strategy cannot be advantageous (or is even unfavorable) with respect to metabolic burden as long as a potential kinetic advantage due to different metabolite concentrations in the different strains does not outweigh the added costs of metabolite transport. This point has rarely been considered when discussing DoL strategies.

As indicated in the last statement, one degree of freedom in a community that could be used to truly enhance the overall production rate, is the possibility of having different metabolite concentrations in the strains. In this study we present a new approach for designing and optimizing communities with multiple strains of a single species, which exploits this degree of freedom. Specifically, our concept seeks to identify a suitable division of the metabolic network, such that the thermodynamic driving force of the entire product synthesis pathway is maximized. The basic concept is shown with the toy model in Fig 2: The pathway from substrate S to product P includes a thermodynamic bottleneck at the reactions R2 and R5 because of their relatively low thermodynamic driving forces (positive standard Gibbs free energies). The driving force of reaction R2, where X is a reactant, could be increased by adjusting a lower concentration of metabolite X, however, this would simultaneously further reduce the driving force of reaction R5, where X is a product. In fact, with the chosen set of constraints for the metabolite concentrations in Fig 2, the pathway is thermodynamically infeasible indicated by the negative optimal MDF (OptMDF) value. For a given pathway, the MDF (max-min-driving force [30]) maximizes the minimum of the driving force (i.e., the negative value of the Gibbs free energy change) of all participating reactions in a pathway and OptMDF finds the pathway in a metabolic network with maximal MDF [31]. Here, splitting the pathway in two sub-pathways and implementing them in two different strains with an exchange of intermediate B (Fig 2) circumvents these bottlenecks as the concentration of metabolite X can now be adjusted individually in each strain (low in strain 1, high in strain 2) through which product synthesis becomes thermodynamically feasible. Following this idea, we developed a new framework, ASTHERISC, for an automated design of single-species (multi-strain) communities that enable higher thermodynamic driving forces for product synthesis compared to a single-strain approach. Focusing on pure production (without growth), we demonstrate the power of the method by identifying multi-strain communities of E. coli that maximize the thermodynamic driving force of product synthesis for a wide range of compounds.

Fig 2. Example illustrating how division of labor may lead to a thermodynamic advantage in the production of a target metabolite.

Fig 2

In the left, a metabolic pathway in a cell is considered that synthesizes the target product P. The red values indicate positive values for the standard Gibbs free energy change (ΔrG0 [in kJ/mol]) and thus potential thermodynamic bottlenecks. With an allowed concentration range from 1 M to 10 M for all metabolites except for Pex, where a minimum concentration of 5 M was assumed to consider product synthesis under high external product concentrations, a negative optimal MDF (OptMDF) value would follow, indicating thermodynamic infeasibility of product synthesis in the single strain. In the two-strain community (right), the pathway is divided and an exchange of metabolite B introduced. With this, individual concentrations of metabolite X can be adjusted in the two strains by which thermodynamic feasibility (a positive OptMDF) of the overall transformation is achieved (the blue triangles indicate the direction of the concentrations of X (high/low) when maximizing the driving force). Black arrows in the two-strain solution indicate active and grey arrows inactive reactions.

Methods

Structure of community models

We first summarize the representation of stoichiometric models of microbial communities under balanced growth (for a detailed description can be found in [15]). Herein, we will focus on communities with different strains of one species, which can be seen as a special case of communities with multiple species. As common for constraint-based metabolic models we represent the metabolic network (with m metabolites and q reactions) of the given species with a m×q stoichiometric matrix N, a flux vector r containing the net reaction rates, and upper (βi) and lower (αi) flux bounds for each reaction. The steady-state assumption for the internal metabolites implies

Nr=0 (3)

and the flux bounds restrict the range of possible reaction rates:

αjrjβj (4)

While the stoichiometric matrix N comprises internal metabolites only, this matrix can be extended to N˜, which also contains the stoichiometries of the external metabolites (substrates, products etc.) which need not fulfill the steady state constraint (3). We also allow the definition of additional linear flux constraints (such as demanded minimal product-to-substrate yields) with a suitable matrix D and vector d:

Drd (5)

In the description of microbial communities, the net reaction rates r are normalized with respect to the total community biomass [mmol/(gDWtotal∙ h)]. Furthermore, the biomass fractions Fi = BMi/BMtot (0≤Fi≤1) of each strain i at the total biomass are variables of the model. To fulfill the constraint of balanced growth (i.e., constant composition and identical specific growth rate for each strain), the ratio of biomass production of strain i to total biomass synthesis must also equal Fi. Importantly, as the original flux bounds (4) for each strain are given in mmol/(gDWi∙h), they must be multiplied with the fraction of the respective strain in the community to obtain flux bounds with units normalized to total biomass

FiαjirjiFiβji (6)

In communities with (balanced) growth, the formulation leads to bilinearities [15], which can be resolved by fixing either the strain fractions or the community growth rate in order to obtain a linear optimization problem. However, in this study we focus on optimal partitioning of the product synthesis pathway (without growth) and the only flux bound used will be the (community) uptake rate of the carbon source. With that we can disregard the production of biomass and assume identical fractions for all involved strain (e.g. 50% in two-strain communities and 33% in three-strain communities; see S2 Text and Results section). However, in general, growth or/and different biomass fractions may be considered if this is desired.

In the community model, each of the n strains represent a compartment with an associated metabolic network, captured by stoichiometric matrices N1 to Nn. In a multi-strain model of one species, the n stoichiometric matrices are, at least initially, identical. In addition, an exchange compartment is added from/to which the strains can uptake/release compounds (Fig 3). In particular, this enables the exchange of metabolites between strains. The quasi-steady-state condition (3) must also be fulfilled for the compounds within the exchange compartment, but a defined selection of those can themselves be exported/imported to/from the environment. Accordingly, the stoichiometric matrix Nc of the community model reads as follows:

Nc=[N10000Nn0E1,exEn,exEc] (7)

Fig 3. Community model structure used in this study and possible exchange directions of metabolites.

Fig 3

Dashed arrows indicate exchange reactions, all other arrows biochemical conversions.

E1,exEn,ex represent the stoichiometries of the exported/imported metabolites from strains 1 … n

to/from the exchange compartment and Ec describes the exchange of metabolites between the exchange compartment and the environment. The steady-state constraint (3) for the community now reads:

Ncrc=0 (8)

Inclusion of thermodynamics

Several FBA-based modeling techniques incorporate thermodynamic constraints, which reduce the solution space by removing thermodynamically infeasible or unrealistic pathways [3236]. An example for the usage of thermodynamics-including FBA-based methods in a community model is the analysis of a methanogenic consortium of S. fumaroxidans and M. hungatei [37]. Thermodynamics-based methods are usually based on the Gibbs free energy ΔrG′ of (bio)chemical reactions and the fact that the net rate of a reaction can only be positive if its ΔrG′ is negative. Generally, the ΔrGi of reaction i depends on the concentrations of the participating reactants and the reaction-specific standard change in Gibbs free energy ΔrG0 [38]:

ΔrGi=ΔrGi0+RT(N˜,i)Tx (9)

R is the gas constant, T the temperature and x the logarithmized metabolite concentration vector and (N˜,i)T the transposed i-th column (reaction) of the extended stoichiometric matrix N˜ with explicitly included external metabolites. In the following, we do not distinguish between community and single-strain models, hence, the stoichiometric matrices and reaction rate vectors may also refer to the community versions Nc and rc. As a quantitative measure of how thermodynamically feasible a specific reaction i is under given metabolite concentrations, the thermodynamic driving force fi can be defined, which simply is the negative value of the Gibbs free energy of that reaction:

fi=ΔrGi (10)

This reaction-centric concept was extended to quantify the thermodynamically feasibility of an entire pathway by introducing the concept of the max-min driving force (MDF) [30]. With a given range of feasible metabolite concentrations, the MDF is the maximal value B such that all reactions of the pathway have a driving force of at least B. The MDF is thus the maximal driving force an entire pathway can reach. The original MDF formulation only allows the computation of the MDF for a given pathway. In order to find, within a genome-scale network, a pathway with maximal MDF that fulfills certain constraints (e.g. with a given minimal yield of a certain target product) the OptMDFpathway method was developed [31]. As it is fundamental for the ASTHERISC algorithm presented herein we summarize its basic features.

Binary variables zi∈{0,1} are introduced for each reaction i of the model and we constrain it to be 1 if the associated reaction flux ri is non-zero (βi is the upper bound for ri):

riziβi (11)

The logarithmized metabolite concentration values in x are constrained by physiologically meaningful lower (cmin) and upper bounds (cmax) for the respective concentrations:

ln(cmin)xln(cmax) (12)

Sometimes constraints on certain ratios of metabolite concentrations are included (e.g. for cofactors such as NADH and NAD). For a fixed ratio h=CiCj of two metabolites i and j we can write

xixj=ln(h) (13)

and a feasible range of a metabolite concentration ratio can be expressed by the following two constraints (hmin and hmax representing the minimal and maximal ratio, respectively):

xixjln(hmin) (14)
xixjln(hmax) (15)

In a preprocessing step, the minimal and maximal possible driving forces fi,min and fi,max under the given concentration ranges (12)-(15) are determined for each reaction i. With K we denote the maximal possible driving force over all reactions

K=max(fi,max) (16)

Next, we introduce B as a lower bound for the driving force of all active reactions:

fi+(1zi)KB (17)

Finally, maximization of B then yields the optimal (maximal) MDF, abbreviated with OptMDF, for the given constraints:

Maximizex,r,BB (18)

                                                                   s.t. Eqs (6)–(17)

The defined OptMDF optimization problem combines binary and continuous variables in linear constraints and thus represents a mixed-integer linear program (MILP).

ASTHERISC algorithm

ASTHERISC (Algorithmic Search of THERmodynamic advantages in Single-species Communities) searches for multi-strain communities that compared to solutions with a single strain, improve the thermodynamic driving force of product synthesis. The algorithm starts with building a community model with n (initially identical) strains of a given species. Standard exchange metabolites (e.g. substrates, nutrients, oxygen, carbon dioxide, common fermentation products such as acetate, ethanol etc.) can be specified, which can be exchanged by the strains with the exchange compartment via standard exchange reactions and exchange reactions between the exchange compartment and the environment are added as well to allow uptake/release of these metabolites from/to the environment. In addition, to allow the exchange of (intermediate) metabolites between the strains in the community, we add, to each internal metabolite deemed to be eligible for exchange between the strains, a reversible transport reaction which can import or export the metabolite from/to the exchange compartment. Accordingly, we distinguish standard exchanges as introduced above, from extra exchanges which allow metabolic interactions between the strains but, in contrast to standard exchanges, no exchange with the environment. The standard as well as the extra exchanges can be specifically defined for every application. Furthermore, when designing a community for the production of a given target metabolite, an export reaction from the exchange compartment to the environment is temporarily added to allow its net production. ASTHERISC searches then for optimal pathways that traverse between the strains to maximize the MDF for synthesizing the target product with a given minimum product yield. The optimal pathway will involve individual (but possibly overlapping) sets of reactions and metabolite concentrations in the different strains.

With this model setup, ASTHERISC employs the OptMDFpathway MILP and slightly modified variants thereof with additional constraints or/and an altered objective function. These variants are described in the following.

  1. OptMDF: the original OptMDFpathway algorithm as described above

  2. OptMDF2: limited number of (extra) exchanges

If EX contains the indices of the extra exchange reactions, an upper bound c for the number of active extra exchanges can be demanded by adding the following constraint to the OptMDF MILP:

iEXzic. (19)
  • 3

    OptMDF3: adding a constraint demanding a lower bound for the MDF

In this variant, we take OptMDF2 and drop the objective function given in Eq (18) and include instead a constraint demanding a minimum value γ for the MDF B:

Bγ. (20)

Note that OptMDF3 just searches for a feasible solution. It returns infeasible if no solution with the demanded minimal MDF exists.

  • 4

    OptMDF4: minimize the overall flux

OptMDF4 uses OptMDF3 together with the objective of minimizing the sum of absolute fluxes:

Minimizei=1qri (21)

Note that the OptMDF algorithm splits reversible reactions internally into two irreversible ones ensuring that all rates are non-negative and that (21) really maximizes the sum of absolute fluxes.

  • 5

    OptMDF5: searching for minimal/maximal metabolite concentrations

Based on OptMDF3, this variant first maximizes and then minimizes the (logarithmized) concentration of a given metabolite i:

Maximizexi (22)
Minimizexi (23)

This concentration variability analysis is done for all metabolites and determines the feasible metabolite concentrations under the given constraints (including Eq (20)).

The pseudo code of the ASTHERISC algorithm is given in Fig 4. For each (target) metabolite to be tested by ASTHERISC as target product, the ASTHERISC algorithm seeks to find a flux vector and associated metabolite concentrations in the community model such that (a) the community reaches a higher OptMDF than the single-strain solution under (b) the constraint of a minimal defined target product yield. Accordingly, in the first step of the ASTHERISC algorithm, the maximal target product yield, achievable without thermodynamic constraints, is computed for the single-strain model in mol/mol (technically, all single-strain calculations are done via the community model in which only one single strain has unblocked default and target metabolite exchanges) and a specified minimal fraction of this maximum yield must be reached in all subsequent community model calculations. Next, the OptMDF is computed in the single strain and then for multiple strains in the community; in the latter case, an upper bound for the number of active extra exchange reactions can be specified (step 2). If the OptMDF in the multi-strain community model is higher than in the single-strain model, the next steps are performed to characterize the solution (otherwise the algorithm continues with the next target metabolite). Steps 3 and 4 determine the flux vector in the community model that has the minimal flux (sum) under the maximal product yield possible with the found OptMDF. This step removes unnecessary and thermodynamically irrelevant reactions and the obtained minimal flux vector thus indicates the truly required reactions in the MDF-optimal community solution. This flux vector can be optionally further analyzed to determine the feasible metabolite concentration ranges for the optimal community solution (step 5) and to identify thermodynamic bottleneck reactions (step 6) in a single-strain representation of the community solution. As in [31], thermodynamic bottlenecks are defined as those reactions where a change of the standard Gibbs free energy alone is enough to further increase the OptMDF value.

Fig 4. Pseudo-code of the ASTHERISC algorithm.

Fig 4

For detailed explanations see text.

Our implementation of the algorithm uses, for some MILPs, indirect approximation steps as they turned out to be advantageous and more robust than direct MILP optimizations in larger models. For example, direct optimization of MDF with the standard OptMDFpathway MILP algorithm may require several hours to days in a genome-scale community model, even when using a high-performance computer cluster. However, approximating the OptMDF with iterative application of OptMDF3 (testing of feasibility with a fixed MDF level) may run in several minutes. This can be used to approximate the OptMDF value by iterative refinement of this value for which a feasible solution can be found (the approximate algorithms stops when a demanded precision level has been reached). The same is done for maximization of product yields under a given optimal MDF constraint.

Implementation

The generation of community models in the described forms as well as the application of the introduced ASTHERISC algorithm are implemented in two separate packages, the CommModelPy and the ASTHERISC package (Fig 5).

Fig 5. Schematic overview of the combined usage of CommModelPy and the ASTHERISC package.

Fig 5

Orange boxes stand for user settings, green boxes for generated or given data files, red boxes for primary program package dependencies, and blue boxes for the programs themselves.

CommModelPy

The CommModelPy package can be used to compile multi-strain community models as described in this study, but it also supports construction of multi-species community models under balanced growth. While other Python-based program packages for the generation of communities have been published (including CarveMe [39] and micom [40]), CommModelPy can create community models with a format required for the ASTHERISC package (e.g. fixed species fractions and simulations without growth are allowed). CommModelPy is written in Python and based on the module cobrapy [41]. Among others it supports the SBML format [42]. Herein we used it with Python in version 3.7 and cobrapy 0.17.1. CommModelPy is free and open-source and can be retrieved via its GitHub repository at:

https://github.com/klamt-lab/CommModelPy

The repository also includes a manual and, as a usage example, the CommModelPy-based scripts for the generation of all multi-strain community models used in this study.

ASTHERISC package

The actual ASTHERISC algorithm with all its subroutines has been implemented in the form of the ASTHERISC package. As input, it takes a CommModelPy-generated multi-strain community model from an SBML file and a list of reaction-associated ΔrG0 values from a JSON file. As output, it generates detailed text reports for which target metabolites OptMDF advantages could be found in a community. The package is written in MATLAB and uses API functions from CellNetAnalyzer [43] and the IBM CPLEX solver. Herein, it was used with MATLAB 2018a, CellNetAnalyzer 2019.2 and CPLEX 12.9. The ASTHERISC package is free and open-source and can be found under its GitHub repository:

https://github.com/klamt-lab/astheriscPackage

This repository includes all scripts and detailed report files of the ASTHERISC-based analyses performed in this study.

Results

Application of ASTHERISC to a small example model

We first illustrate our developed ASTHERISC framework with the toy model given in Fig 2. CommModelPy was used to build a community model. As feasible metabolite ranges, we assumed 1 M-10 M for all metabolites, except for the external product (Pex) where we demand a narrower range of 5M-10M so that the product can be synthesized also under higher external product concentrations (see legend of Fig 2). The application of ASTHERISC successfully identified the shown community solution in Fig 2, which improves the maximal thermodynamic driving force as quantified by the OptMDF value. With the single strain, the calculated OptMDF of the pathway from external substrate to external product had a negative value of -0.19 kJ/mol and would thus be infeasible. With the found two-strain community solution, the OptMDF had a positive value of around 0.46 kJ/mol. Hence, in this scenario, the OptMDF is not only increased; the sign change indicates that the community makes product synthesis feasible at all.

The detailed report generated by ASTHERISC for this example is shown in S3 Text. One important information provided in this report is which metabolite concentrations differ in the two strains in the found community solution indicating where a separation in two strains helps to overcome thermodynamic bottlenecks. To obtain the OptMDF of 0.46 kJ/mol in the community solution, X must have a lower concentration between 1 M and 1.37 M in strain 1 and a higher concentration range between 3.65 M and 10 M in strain 2. This explains why the two reactions R2 and R5 had to be separated into strains. Together with the P-exporting reaction R6, which has a low maximal driving force due to the constraint of a high external product (Pex) concentration, these two reactions were also identified by ASTHERISC as the sole bottleneck reactions of the single-strain case.

Applying ASTHERISC to multi-strain community models of E. coli

As a practically relevant case, we apply ASTHERISC to identify multi-strain communities of on E. coli that could potentially improve the thermodynamic driving force of the production of metabolites. In total, we considered three different community models. Two of them, ecolicore2double and ecolicore2triple, are based on the EColiCore2 model [44] (a reduced version of the genome-scale model iJO1366 [45] containing 499 reactions and 486 metabolites). ecolicore2double contains two and ecolicore2triple three duplicates of the EColiCore2 network as strains in the community model. The third community model, iML1515double, contains two copies of the recently published genome-scale network iML1515 [46]. A detailed description of the construction of the community models, the assignment/computation of ΔrG0, the used standard exchange reactions and other constraints can be found in S2 Text. In short, as the main source of ΔrG0, we employed the Python-based eQuilibrator API, which uses the component contribution method [47]. Reactions, for which no ΔrG0 could be found or calculated (93 reactions (18%) in EColiCore2 and 646 (24%) in iML1515), were blocked in the model (rate fixed to zero) to avoid the calculation of thermodynamically infeasible solutions when leaving these ΔrG0 unconstrained. Glucose was used as substrate and its uptake rate was fixed to 1 mmol/(gDW∙h) (this is only made for technical reasons; it does not change the results since all solutions are scalable). No other flux bounds were set and since growth was not considered this allows use of identical biomass fractions for all involved strain (50% in two-strain communities and 33% in three-strain communities). The three community models were built with CommModelPy and can be found as SBML files in the mentioned GitHub repositories.

With the reduced set of active reactions (due to some missing ΔrG0) and with a demanded minimal product yield of 10−6 (to prevent potential numeric problems), there were 161 producible target metabolites in the EColiCore2-related community models and 254 in iML1515double. For each of the producible target metabolites, we considered in each of the three community models 8 different scenarios resulting from four different yield thresholds (40%, 60%, 80% and 98% of the maximal yield of the target metabolite) and two different bounds for the maximal number of (extra) exchange reactions (an infinite or a maximal number of 9 extra exchanges). Hence, 24 scenarios are considered in total. To exclude solutions with just marginal MDF improvements, we demanded a minimal community MDF advantage of 0.2 kJ/mol and a positive OptMDF value was demanded for the community (not for an associated single-strain) solution in order to get thermodynamically feasible results.

For all 24 scenarios, ASTHERISC could find a wide range of target metabolites where a community was able to provide a pathway with a higher OptMDF than a single strain (for a complete listing of all results see the report files in the ASTHERISC package’s repository). The percentage of producible metabolites for which an advantage could be found in the community ranged from 15.75% up to 40.55% (Table 1). In most cases, as expected, limiting the maximal extra exchanges to 9 reduces the number of found improvements compared to the case with unlimited exchanges. Since the computation was stopped for several metabolites due to MILP timeouts (especially in the genome-scale community model), the percentages given for the different scenarios should be seen as lower bound of possible improvement percentages.

Table 1. Percentage (and absolute number) of all producible target metabolites for which ASTHERISC found a higher optimal MDF in a multi-strain community compared to a single-strain model.

Note that only those communities were considered, where the MDF was at least 0.2 kJ/mol higher than in the single-strain model.

Model (number of allowed extra exchanges) Demanded minimum product yield (% of maximum yield)
40% 60% 80% 98%
ecolicore2double (9) 22.36% (36) 21.74% (35) 21.12% (34) 21.12% (34)
ecolicore2double (infinite) 23.6% (38) 35.4% (57) 27.95% (45) 31.06% (50)
ecolicore2triple (9) 21.12% (34) 19.88% (32) 17.39% (28) 19.25% (31)
ecolicore2triple (infinite) 19.25% (31) 31.06% (50) 27.33% (44) 29.81% (48)
iML1515double (9) 17.72% (45) 15.75% (40) 21.26% (54) 25.59% (65)
iML1515double (infinite) 18.5% (47) 20.08% (51) 25.59% (65) 40.55% (103)

The OptMDF advantages of community solutions compared to single-strain solutions ranged from 0.22 kJ/mol (just over the demanded minimal MDF advantage of 0.2 kJ/mol) up to 7.26 kJ/mol, while the mean MDF advantages range from 0.64 kJ/mol to 2.0 kJ/mol for ecolicore2double and ecolicore2triple and from 0.91 kJ/mol to 1.94 kJ/mol for iML1515double (Table 2). Interestingly, the highest MDF improvements occur when high product yields are demanded. This indicates that low-yield pathways in the single strain might have high MDFs which are more difficult to improve by a community solution. We also found some solutions in which the MDF of the single strain is negative and in the community positive, hence, where product synthesis becomes thermodynamically feasible at all in the community (similar as in the toy model scenario in Fig 2). This was the case for 7 metabolites in all runs with a yield threshold of 98% with ecolicore2double and ecolicore2triple, and for 3 metabolites in all runs with iML1515double.

Table 2. Statistics about OptMDF advantages (in kJ/mol) with communities for each of the 24 scenarios.

The given numbers stand for the minimal/mean/maximal OptMDF advantage.

Model (number of allowed extra exchanges) Demanded minimum product yield (% of maximum yield)
40% 60% 80% 98%
ecolicore2double (9) 0.32/0.94/2.76 0.2/0.64/3.13 0.29/0.67/3.24 0.27/2.0/5.49
ecolicore2double (infinite) 0.32/1.29/3.42 0.26/1.02/5.07 0.22/1.04/5.07 0.23/1.77/6.28
ecolicore2triple (9) 0.32/0.83/2.76 0.22/0.63/3.03 0.29/0.7/3.24 0.5/1.85/5.49
ecolicore2triple (infinite) 0.29/1.42/3.42 0.26/1.09/4.71 0.22/0.93/4.71 0.23/1.81/6.28
iML1515double (9) 0.26/0.92/2.95 0.33/1.0/2.79 0.23/0.93/2.66 0.22/1.58/5.55
iML1515double (infinite) 0.26/0.97/3.04 0.26/0.91/2.79 0.3/0.91/2.64 0.22/1.94/6.04

Next, we analyzed how many metabolite exchanges were used in the found community solutions (Table 3). For the scenario with a limited number of (extra) exchanges, it should be noted that a maximum of 9 active exchange reactions in a community with two strains correspond to an exchange of maximal 4 metabolites between the two strains: 4 exchange reactions from strain 1 for export/import of metabolites and 4 reactions of strain 2 for import/export of the same 4 metabolites plus the export of the target product by one of the two strains. The number of used metabolite exchanges per target metabolite solution ranged from 1 (in those cases, default metabolite exchanges were typically used between the strains which are not counted as extra exchanges) up to 51.

Table 3. Statistics about the number of used extra exchanges in solutions with a community OptMDF advantage for each of the 24 scenarios analyzed.

The given numbers stand for the minimal/mean/maximal number of used extra exchanges.

Model (number of allowed extra exchanges) Demanded minimum product yield (% of maximum yield)
40% 60% 80% 98%
ecolicore2double (9) 3/5.39/8 4/6.34/7 4/6.41/9 3/6.03/9
ecolicore2double (infinite) 3/16.66/29 7/15.88/31 6/13.76/29 8/15.88/41
ecolicore2triple (9) 3/4.47/7 3/5.88/7 4/5.89/7 3/5.29/9
ecolicore2triple (infinite) 3/18.13/38 3/14.8/36 5/13.36/34 4/16.83/51
iML1515double (9) 1/5.18/7 3/5.05/9 1/5.11/9 2/5.14/9
iML1515double (infinite) 3/15.28/33 3/14.16/27 3/14.11/37 3/14.34/28

Some metabolites occur with high frequency as exchange metabolites in the found solutions. For example, glycerol3-phosphate, (glyc3p), fructose-6-phosphate (f6p) and oxaloacetate (oaa) are top-ranked exchange metabolites in ecolicore2double and ecolicore2triple while dihydroxyacetone (dha), L-aspartate (asp__L) and acetaldehyde (acald) are often contained in solutions for iML1515double. Most of the top-ranked exchange metabolites are involved in the glycolysis or in pathways directly connected to glycolysis indicating central thermodynamic bottlenecks relevant for the production of many target metabolites.

Coinciding with the dominant extra exchange reactions, a few discrete MDF values are often dominating for each scenario. In particular, if target metabolites are closely located in the metabolic network, then their corresponding community solutions have often the same associated OptMDF value as they share the same metabolic bottleneck for their production. Some of these dominant MDF values occurred in all analyzed models. This might indicate that a small set of central bottleneck reactions is thermodynamically constraining a large set of possible solutions.

Community solution example

To discuss one representative solution as an example, we selected the production of 3-Deoxy-D-manno-octulosonate 8-phosphate (kdo8p) in ecolicore2double with a demanded minimal product yield of 98% of the maximum yield (0.74775 mol kdo8p per mol glucose) and with an allowed maximum of 9 extra exchanges. The last step of kdo8p synthesis is catalyzed by the kdo8p synthase (KDOPS), which has been evaluated as a potential antibiotics target [48]. In the following, the two designed strains within the ecolicore2double community model will be denoted by “ecoli1” and “ecoli2”, respectively. The report delivered by ASTHERISC for kdo8p can be found in S4 Text.

With the given constraints, an OptMDF of 2.27 kJ/mol can be reached for the production of kdo8p from glucose with a single strain. In the community solution found by ASTHERISC, the MDF value can be increased by 26% up to 2.87 kJ/mol reaching a yield of 0.73311 mol/mol, which is just 0.04% higher than the demanded minimal yield. While the single-strain solution uses 42 reactions (thereof 14 standard exchange reactions for glucose, water, etc.), the community requires 66 reactions, thereof 22 exchange reactions, 28 internal reactions in ecoli1 and 16 reactions in ecoli2. 17 of the 22 exchange reactions are standard exchanges and four extra exchange reactions are used for metabolite exchanges between ecoli1 and ecoli2 (Fig 6): two for the exchange of dihydroxyacetone (dha) from ecoli1 to ecoli2, two for the exchange of 6-Phospho-D-gluconate (6pgc) from ecoli2 to ecoli1 and one for the secretion of kdo8p to the environment.

Fig 6. Excerpt of selected central reactions of the MDF-optimal single-species and community solution for kdo8p synthesis.

Fig 6

Black arrows indicate active reactions, dashed arrows indicate a sequence of active reactions, and the blue triangles indicate increased or decreased metabolite concentrations of a strain in the community relative to the other strain. The shown ΔrG′ and ΔrG0 values have all unit kJ/mol. The ΔrG′ values are taken from the specific MDF-optimal solution delivered by ASTHERISC. The red ΔrG′ values indicate thermodynamic bottlenecks, i.e., reactions whose ΔrG′ is fixed under the optimal MDF (and corresponds to the negative value of the OptMDF) All black ΔrG′ values are variable under the given OptMDF. All reaction and metabolite identifiers are based on the definitions in the BiGG database [49].

To understand why this community solution leads to an MDF advantage, it is helpful to look at the three thermodynamic bottleneck reactions for kdo8p synthesis in the MDF-optimal single-strain solution (Fig 6). These three connected reactions (shown in red in Fig 6), which all occur in the upper glycolysis, are (1) the fructose-bisphosphate aldolase reaction (FBA; ΔrG0 = 22.4 kJ/mol) producing glyceraldehyde 3-phosphate (g3p) and dihydroxyacetone phosphate (dhap) from D-fructose 1,6-bisphosphate (fdp), (2) the triose-phosphate isomerase reaction (TPI; ΔrG0 = 5.6 kJ/mol) converting dhap to g3p, and (3) the glyceraldehyde-3-phosphate dehydrogenase reaction (GAPD; ΔrG0 = 4.6 kJ/mol) where g3p is converted to 3-phospho-D-glycerol phosphate (13dpg). As already pointed out in [30] and [31], these three reactions form a distributed thermodynamic bottleneck as they all have unfavorable thermodynamics to operate in forward direction (positive ΔrG0) and conflicting needs regarding the g3p concentration since the FBA and TPI reactions thermodynamically benefit from a lower and the GAPD reaction from a higher g3p concentration. There is also a fourth associated bottleneck reaction, the symporter-assisted uptake of phosphate into the cytosol (PIt2rpp) with a ΔrG0 of 0 kJ/mol, which delivers the phosphate needed in the GAPD reaction. For the sake of simplicity, it was not included in Fig 6. The optimal adjustment of metabolite concentrations in the single strain allows a maximum MDF value of 2.27 kJ/mol and changing the concentration, e.g. of g3p, will inevitably lead to an increase of the ΔrG′ of at least one reaction and thus to a decrease of the MDF.

In the community solution, the bottleneck is resolved as follows (Fig 6): In order to produce kdo8p through the reaction KDOPS (kdo8p synthase), both phosphoenolpyruvate (pep) and D-arabinose 5-phosphate (ara5p) are needed as substrate. In the single strain solution, pep is produced using the glycolysis, thus requiring the bottleneck reactions FBA, TPI and GAPD. ara5p is produced through reactions which do not pose a bottleneck, using a pathway via 6-phospho-D-gluconate (6pgc). In the community solution, the bottleneck triangle of the FBA, TPI and GAPD reactions is resolved by separating the occurrence of the g3p-using GAPD reaction into ecoli1 and the g3p-producing FBA and TPI reactions into ecoli2, thus omitting the conflicting needs of g3p. This is achieved in the following way: (1) ecoli1 concentrates on pep synthesis, thereby omitting parts of the upper glycolysis by using the fructose 6-phosphate aldolase reaction (F6PA), which has f6p as a substrate and dihydroxyacetone (dha) and g3p as products. g3p is further processed using GAPD, and dha is secreted and taken up by ecoli2 for further processing (see below). With a higher concentration of f6p, a higher and thermodynamically more favorable concentration (compared to the single-strain solution) of g3p is achieved in ecoli1, so that the ΔrG′ value of the GAPD reaction can be lowered (and thus the MDF be increased) in the community solution. Furthermore, a higher f6p concentration also requires an increased D-glucose 6-phosphate (g6p) concentration since f6p is produced from it using the glucose-6-phosphate isomerase reaction (PGI). (2) ecoli2 concentrates on the production of 6pgc from dha. In a first step, the reverse F6PA reaction is used to produce f6p from dha and g3p. While dha is provided from ecoli1, the required g3p is recycled from a portion of the produced f6p via the g3p-producing reactions FBA and TPI. Due to the very negative ΔrG0 of the reverse F6PA reaction, low concentrations of dha and g3p can be used in ecoli2 resulting in a more negative ΔrG′ of the FBA and TPI reactions. The second half of f6p is used to synthesize 6pgc via the (reverse) PGI, G6PDH2r and PGL reactions. In order to make all these reactions work with low ΔrG′ (resulting in high MDF), the concentrations of dha, f6p, g6p and, as mentioned before, of g3p are reduced compared to ecoli1. The three reactions on the pathway from f6p to 6pgc still reach sufficient thermodynamic driving force because of the very negative ΔrG0 of these reactions. (3) The 6pgc produced by ecoli2 is then secreted and taken up by ecoli1and further processed via two pathways to synthesize pep and D-arabinose 5-phosphate (ara5p), the two precursors of kdo8p. pep is produced via g3p and pyruvate (pyr) using Entner-Doudoroff pathway reactions, including the 6-phosphogluconate dehydratase reaction (EDD) with a very low ΔrG0 of -42.8, which allows the concentrations of the affected metabolites to be more variable in order to obtain a high MDF. Note that, in ecoli1, some additional amount of pep is produced via glycolytic reactions from g3p using, for example, the GAPD reaction (Fig 6). The other precursor ara5p is synthesized via the two reactions GND and A5PISO. Finally, ara5p and pep are used by KDOPS to produce kdo8p, which is then secreted into the environment. It has to be mentioned that more reactions are involved in the single-strain as well as in the multi-strain solution to obtain a stoichiometrically balanced flux (not displayed in Fig 6). These reactions are required, for example, to balance cofactors (ATP, NAD(P)H) produced or consumed along the sketched pathways, however, they are not critical for the respective MDF.

Overall, the community solution follows the basic principle of the toy model (Fig 2): through a suitable partitioning (and partially complementation) of the pathway from glucose to the target product kdo8p in the two strains, critical metabolite concentrations (in this example, dha, f6p, g3p and gp6 as indicated in Fig 6) can be maintained with different levels in ecoli1 (high concentrations) and ecoli2 (low concentrations), thereby reaching more negative ΔrG′ for the bottleneck reactions and thus a higher overall MDF. This can also be seen by the feasible concentrations ranges of the critical metabolites in ecoli1 and ecoli2 which maintain the OptMDF of 2.87 KJ/mol (Table 4). These metabolites, which occur in the bottleneck reactions of the single-strain solution, have no overlapping concentration ranges between ecoli1 and ecoli2 showing that the given community solution indeed cannot be reached in a single strain. Interestingly, the community solution has in turn new thermodynamic bottlenecks, again involving the reactions FBA, TPI and GAPD. Those might be further resolved by using a community with more than two strains. However, the corresponding kdo8p solution with ecolicore2triple does not provide a higher OptMDF in the community, indicating that more than three species might be needed in order to achieve a further MDF advantage.

Table 4. Concentration ranges of critical metabolites in the ecolicore2double community solution for kdo8p synthesis with optimal MDF.

The concentration ranges do not overlap between the two strains.

ecoli1 ecoli2
Minimum [mmol/l] Maximum [mmol/l] Minimum [mmol/l] Maximum [mmol/l]
dha_c 0.59826 0.59905 0.058969 0.059048
f6p_c 0.37078 0.37128 0.001 0.0010027
g3p_c 0.32482 0.32525 0.090047 0.090286
g6p_c 19.973 20 0.0019629 0.053238

Run time

For all calculations we used a high-performance computer cluster comprising, for each of its nodes, 192 GB working memory and two processors each including eight cores with a standard clock rate of 2.1 GHz. Each of the mentioned 24 scenarios was running on its own node. The resulting run times for the scenarios (cumulated for all producible target metabolites) ranged from 1.63 days (ecolicore2double with a minimal yield factor of 98% and 9 maximal extra exchanges) up to 4.54 days (iML1515double with a minimal yield factor of 80% and 9 maximal extra exchanges).

Discussion

In this study we presented ASTHERISC, a new algorithm for designing multi-strain microbial communities that maximize the thermodynamic driving force of product synthesis with a given production host. The key methodological approach behind ASTHERISC is to compartmentalize certain segments of the product pathway via different strains and to allow an exchange of intermediates between these strains such that the overall product synthesis pathway through the involved strains may reach a higher thermodynamic driving force than in the single strain. While the set of active reactions in the different strains can overlap, using more than one strain adds degrees of freedom with respect to the achievable MDF because the metabolite concentrations may differ between the strains. In this way, thermodynamic bottlenecks, which may arise when operation of one reaction requires a metabolite with high and operation of another reaction the same metabolite with a low concentration, can be circumvented.

In recent years, microbial consortia and their potential for biotechnological production processes have attracted much attention. However, although several single-species (multi-strain) communities with essential metabolic dependencies between the strains have been constructed and analyzed, there are still only few methods available for an automated design of communities with superior properties [5], with FLYCOP [50] and DOLMN [17] as two particular examples. Here, ASTHERISC adds a new class of such algorithms which is driven by a thermodynamic principle. The recently presented DOLMN approach [17], which searches for an optimal partitioning of a species’ metabolism when limiting the number of reactions per strain, shares some similarities with ASTHERISC. However, to the best of our knowledge, a design of microbial communities with the goal to optimize the thermodynamic feasibility of production of a target metabolite has not been addressed before.

The application of our algorithm to a metabolic core and a genome-scale model of E. coli revealed that there is indeed a significant number of metabolic products where a multi-strain community of E. coli may provide higher thermodynamic driving forces. These communities may be an effective way to enhance or even enable the synthesis of certain metabolic products. ASTHERISC, together with the developed software packages, provides an automatable way to identify such beneficial communities consisting of multiple strains of one species. However, while the found solutions demonstrate that higher thermodynamic driving forces may be generated with specifically designed communities, significant effort will be needed for their practical realization. First of all, designing those strains will require the deactivation of certain metabolic reactions (e.g. through gene knockouts) according to the found community solution. Finding suitable knockout strategies that leave only the relevant pathway segments active in the respective strains can be computed by dedicated strain design algorithms, e.g. based on minimal cut sets [51]. Implementing the respective gene knockout strategies may lead to unstable strains or even prevent growth. While the (stoichiometric) feasibility of growth may be demanded when computing the respective intervention strategies, it might be more practical to use two-stage processes, where the strains can first grow before a pure production phase is initiated (e.g. via dynamic metabolic switches [52,53]). This also circumvents possible limitations in the MDF of the production pathway when growth and production are coupled (in our calculations we focused on optimal pathway partitioning for pure product synthesis and did not consider feasibility of growth).

Next, transport of exchange metabolites between the strains must be faciitated (e.g. via overexpression of dedicated transport systems). Transport systems are only available/known for a subset of metabolites and solutions involving only few exchanges may thus be preferred. However, many of the found community solutions for different products use similar sets of exchange metabolites. Hence, once suitable transporter mechanisms have been discovered or implemented for these metabolites, a larger number of strain designs may become feasible. As another aspect for metabolite transport, it was assumed for all exchange reactions that a concentration gradient must exist between the cellular and the external compartment, however, transport costs, e.g. energy consumption to actively transport a metabolite over the membrane, were not explicitly considered (except for transporters already included in the base model). These overhead costs may have adverse effects on product yields and also abolish the MDF advantage of a community solution. To test the sensitivity of the latter when introducing energy costs for metabolite transport, we reran all calculations in a variant of the ecolicore2double model, where each introduced (extra) metabolite exchange consumes 1/3 ATP per transported metabolite molecule. Although there are few specific changes in the set of metabolites where an MDF advantage can be found, we did not find larger changes in the overall percentage of metabolites with MDF advantage in a community. In fact, export of the (end) product does then also require ATP in the single-strain solution, which may lead to a change in the structure of the active network possibly favoring a solution in the community. The associated scripts and result files can be found in the already mentioned GitHub repositories of CommModelPy and the ASTHERISC package.

Another issue for realization of a calculated community design is the long-term stability of the community, which may require genetic methods and general principles as developed in [54] and, more recently, [9]. Finally, reaching the computed maximal thermodynamic driving forces for the communities also demands that the metabolite concentrations are within the respective ranges required for the optimal MDF and it is not per se clear whether this will be the case or can be adjusted by suitable selection mechanisms. It thus remains to be shown that the computed communities may constitute an economically relevant enhancement. However, it should be noted that many of the discussed difficulties are relevant for the implementation of any (computed) synthetic community.

The ASTHERISC runs in the example calculations of this study were stable. Potential numerical problems arising due to precision tolerances of the MILP are partially directly prevented or detected and handled by our implementation. For example, “fake community solutions” delivered by the solver, where effectively only one strain is used, are detected and disregarded. It should also be noted that the set computational time limits may affect the results. This can be seen in the reduced MDF advantage percentages with ecolicore2triple compared to ecolicore2double (Table 1), where the MDF advantage in ecolicore2triple should be at least as high as in ecolicore2double as the former should implicitly contain all two-strain solutions. This effect arose due to MILP time limits which were more likely to be reached in the larger model. Nevertheless, the runs of the ecolicore2triple community model clearly showed that a larger number of strains may increase the overall community advantage. Finally, the calculations with the iML1515double model demonstrated that our implementation of the ASTHERISC algorithm can be applied to genome-scale models with a reasonable runtime. Indeed, comparing the found solutions for ecolicore2double and iML1515double indicates that, for some metabolites, other MDF-optimizing pathways may be found in a genome-scale model, although they may involve more uncommon pathways (with possibly lower capacity) than in the solutions found in the core model.

The results obtained with ASTHERISC also depend on the used settings (e.g. demanded minimum MDF advantage and minimum product yield) and the calculations involve partially uncertain parameters or assumptions (e.g. the metabolite concentration ranges or the ΔrG0 calculated with the eQuilibrator method [47]). However, the calculations can be easily repeated with alternative values if these are considered to be more relevant or evident. Furthermore, the developed CommModelPy and ASTHERISC packages are programmatically independent from the used models and can thus be readily applied for the generation and study of other single-species. Although not yet tested, ASTHERISC could also be used with multi-species communities, for example, to test which product synthesis pathway (with associated metabolite exchanges) in a multi-species community leads to the highest MDF and whether this MDF is superior over a potentially existing single-species pathway. Finally, we also anticipate useful applications of the ASTHERISC method for finding suitable compartmentalization strategies maximizing thermodynamic driving forces in cell-free production systems.

Community design by ASTHERISC exploits the fact that having multiple strains allows adjustment of different metabolite concentrations in the different strains. While ASTHERISC uses this degree of freedom for maximizing the thermodynamic driving force, future work could seek to directly maximize the product synthesis rate in the community by finding a pathway partitioning (again with specific metabolite concentrations in each strain) such that, under consideration of kinetic rate laws, the overall product flux is optimized. Ideally, if a kinetic model of the (central) metabolism would be available one could directly search for such an optimal solution. For example, if we would have a simple kinetic model of the toy network in Fig 2, it would most likely suggest to split the pathway in the same way as was done via the thermodynamic principles to maximize the pathway flux. However, a kinetic approach would also allow the consideration of saturation effects, allosteric regulations (e.g. feedback inhibition of the first reaction by intermediate or end metabolites of the pathway) or of resource (enzyme) allocation constraints. Since predictive kinetic models are often not available, it needs to be investigated whether other approaches requiring less information (e.g. based on kcat or Km values [55,56]) may already suggest useful solutions.

Supporting information

S1 Text. Proof showing that division of labor cannot increase the product synthesis flux compared to a single-strain solution for arbitrary pathways and kinetics (under assumption of constant metabolite concentrations in the strains).

(PDF)

S2 Text. Detailed description of configuration of the single-strain models and the derived multi-strain community models of E. coli.

(PDF)

S3 Text. The ASTHERISC package text report generated for the toy model.

(PDF)

S4 Text. The ASTHERISC package text report generated for the discussed community solution example for kdo8p synthesis with ecolicore2double.

(PDF)

Acknowledgments

We are grateful to Axel von Kamp for his assistance and discussions on using OptMDFpathway in the context of ASTHERISC.

Data Availability

Data are available within the manuscript and Supporting Information and in parts from a public repository: https://github.com/klamt-lab/CommModelPy https://github.com/klamt-lab/astheriscPackage.

Funding Statement

This work was supported by the EU program ERDF (European Regional Development Fund) of the German Federal State of Saxony-Anhalt within the Research Center for Dynamic Systems (CDS; given to SK), and by the German Federal Ministry of Education and Research (FKZ: 031B0524B; given to SK). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Smith JM, Szathmary E. The major transitions in evolution: OUP Oxford; 1997. 1997-10-30. 361 p. [Google Scholar]
  • 2.Pande S, Merker H, Bohl K, Reichelt M, Schuster S, de Figueiredo LF, et al. Fitness and stability of obligate cross-feeding interactions that emerge upon gene loss in bacteria. The ISME Journal. 2014;8(5):953–62. doi: 10.1038/ismej.2013.211 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Song H-S, Cannon WR, Beliaev AS, Konopka A. Mathematical modeling of microbial community dynamics: a methodological review. Processes. 2014;2(4):711–52. doi: 10.3390/pr2040711 [DOI] [Google Scholar]
  • 4.Succurro A, Ebenhöh O. Review and perspective on mathematical modeling of microbial ecosystems. Biochemical Society Transactions. 2018;46(2):403–12. doi: 10.1042/BST20170265 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.García-Jiménez B, Torres-Bacete J, Nogales J. Metabolic modelling approaches for describing and engineering microbial communities. Computational and Structural Biotechnology Journal. 2021;19:226–46. doi: 10.1016/j.csbj.2020.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Mauri M, Gouzé J-L, Jong Hd, Cinquemani E. Enhanced production of heterologous proteins by a synthetic microbial community: Conditions and trade-offs. PLOS Computational Biology. 2020;16(4):e1007795. doi: 10.1371/journal.pcbi.1007795 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Anane E, López C DC, Neubauer P, Cruz Bournazou MN. Modelling overflow metabolism in Escherichia coli by acetate cycling. Biochemical Engineering Journal. 2017;125:23–30. doi: 10.1016/j.bej.2017.05.013 [DOI] [Google Scholar]
  • 8.Orth JD, Thiele I, Palsson BØ. What is flux balance analysis? Nature biotechnology. 2010;28(3):245–8. doi: 10.1038/nbt.1614 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Losoi PS, Santala VP, Santala SM. Enhanced population control in a synthetic bacterial consortium by interconnected carbon cross-feeding. ACS Synthetic Biology. 2019;8(12):2642–50. doi: 10.1021/acssynbio.9b00316 [DOI] [PubMed] [Google Scholar]
  • 10.Stolyar S, Van Dien S, Hillesland KL, Pinel N, Lie TJ, Leigh JA, et al. Metabolic modeling of a mutualistic microbial community. Molecular Systems Biology. 2007;3(1):92. doi: 10.1038/msb4100131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chaganti SR, Kim D-H, Lalman JA. Flux balance analysis of mixed anaerobic microbial communities: Effects of linoleic acid (LA) and pH on biohydrogen production. International Journal of Hydrogen Energy. 2011;36(21):14141–52. doi: 10.1016/j.ijhydene.2011.04.161 [DOI] [Google Scholar]
  • 12.Benito-Vaquerizo S, Diender M, Olm IP, Santos VMd, Schaap PJ, Sousa DZ, et al. Modeling a co-culture of Clostridium autoethanogenum and Clostridium kluyveri to increase syngas conversion to medium-chain fatty-acids. bioRxiv. 2020:2020.06.23.167189. doi: 10.1016/j.csbj.2020.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Schwalm ND, Mojadedi W, Gerlach ES, Benyamin M, Perisin MA, Akingbade KL. Developing a microbial consortium for enhanced metabolite production from simulated food waste. Fermentation. 2019;5(4):98. doi: 10.3390/fermentation5040098 [DOI] [Google Scholar]
  • 14.Chan SHJ, Simons MN, Maranas CD. SteadyCom: Predicting microbial abundances while ensuring community stability. PLOS Computational Biology. 2017;13(5):e1005539. doi: 10.1371/journal.pcbi.1005539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Koch S, Kohrs F, Lahmann P, Bissinger T, Wendschuh S, Benndorf D, et al. RedCom: A strategy for reduced metabolic modeling of complex microbial communities and its application for analyzing experimental datasets from anaerobic digestion. PLOS Computational Biology. 2019;15(2):e1006759. doi: 10.1371/journal.pcbi.1006759 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Khandelwal RA, Olivier BG, Röling WFM, Teusink B, Bruggeman FJ. Community flux balance analysis for microbial consortia at balanced growth. PLOS ONE. 2013;8(5):e64567. doi: 10.1371/journal.pone.0064567 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Thommes M, Wang T, Zhao Q, Paschalidis IC, Segrè D. Designing metabolic division of labor in microbial communities. mSystems. 2019;4(2). doi: 10.1128/mSystems.00263-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhang H, Wang X. Modular co-culture engineering, a new approach for metabolic engineering. Metabolic Engineering. 2016;37:114–21. doi: 10.1016/j.ymben.2016.05.007 [DOI] [PubMed] [Google Scholar]
  • 19.Zhou K, Qiao K, Edgar S, Stephanopoulos G. Distributing a metabolic pathway among a microbial consortium enhances production of natural products. Nature Biotechnology. 2015;33(4):377–83. doi: 10.1038/nbt.3095 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Lee CR, Kim C, Song YE, Im H, Oh Y-K, Park S, et al. Co-culture-based biological carbon monoxide conversion by Citrobacter amalonaticus Y19 and Sporomusa ovata via a reducing-equivalent transfer mediator. Bioresource Technology. 2018;259:128–35. doi: 10.1016/j.biortech.2018.02.129 [DOI] [PubMed] [Google Scholar]
  • 21.Shin H-D, McClendon S, Vo T, Chen RR. Escherichia coli binary culture engineered for direct fermentation of hemicellulose to a biofuel. Applied and Environmental Microbiology. 2010;76(24):8150–9. doi: 10.1128/AEM.00908-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Jones JA, Vernacchio VR, Sinkoe AL, Collins SM, Ibrahim MHA, Lachance DM, et al. Experimental and computational optimization of an Escherichia coli co-culture for the efficient production of flavonoids. Metabolic Engineering. 2016;35:55–63. doi: 10.1016/j.ymben.2016.01.006 [DOI] [PubMed] [Google Scholar]
  • 23.Jones JA, Vernacchio VR, Collins SM, Shirke AN, Xiu Y, Englaender JA, et al. Complete biosynthesis of anthocyanins using E. coli polycultures. mBio. 2017;8(3). doi: 10.1128/mBio.00621-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lindemann SR, Bernstein HC, Song H-S, Fredrickson JK, Fields MW, Shou W, et al. Engineering microbial consortia for controllable outputs. The ISME Journal. 2016;10(9):2077–84. doi: 10.1038/ismej.2016.26 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Roell GW, Zha J, Carr RR, Koffas MA, Fong SS, Tang YJ. Engineering microbial consortia by division of labor. Microbial Cell Factories. 2019;18(1):35. doi: 10.1186/s12934-019-1083-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tsoi R, Wu F, Zhang C, Bewick S, Karig D, You L. Metabolic division of labor in microbial systems. Proceedings of the National Academy of Sciences. 2018;115(10):2526–31. doi: 10.1073/pnas.1716888115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Kreft J-U, Griffin BM, González-Cabaleiro R. Evolutionary causes and consequences of metabolic division of labour: why anaerobes do and aerobes don’t. Current Opinion in Biotechnology. 2020;62:80–7. doi: 10.1016/j.copbio.2019.08.008 [DOI] [PubMed] [Google Scholar]
  • 28.Harvey E, Heys J, Gedeon T. Quantifying the effects of the division of labor in metabolic pathways. Journal of theoretical biology. 2014;360:222–42. doi: 10.1016/j.jtbi.2014.07.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Wu G, Yan Q, Jones JA, Tang YJ, Fong SS, Koffas MAG. Metabolic Burden: Cornerstones in Synthetic Biology and Metabolic Engineering Applications. Trends in Biotechnology. 2016;34(8):652–64. doi: 10.1016/j.tibtech.2016.02.010 [DOI] [PubMed] [Google Scholar]
  • 30.Noor E, Bar-Even A, Flamholz A, Reznik E, Liebermeister W, Milo R. Pathway thermodynamics highlights kinetic obstacles in central metabolism. PLOS Computational Biology. 2014;10(2):e1003483. doi: 10.1371/journal.pcbi.1003483 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Hädicke O, Kamp Av, Aydogan T, Klamt S. OptMDFpathway: identification of metabolic pathways with maximal thermodynamic driving force and its application for analyzing the endogenous CO2 fixation potential of Escherichia coli. PLOS Computational Biology. 2018;14(9):e1006492. doi: 10.1371/journal.pcbi.1006492 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Henry CS, Broadbelt LJ, Hatzimanikatis V. Thermodynamics-based metabolic flux analysis. Biophysical Journal. 2007;92(5):1792–805. doi: 10.1529/biophysj.106.093138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Zhu Y, Song J, Xu Z, Sun J, Zhang Y, Li Y, et al. Development of thermodynamic optimum searching (TOS) to improve the prediction accuracy of flux balance analysis. Biotechnology and Bioengineering. 2013;110(3):914–23. doi: 10.1002/bit.24739 [DOI] [PubMed] [Google Scholar]
  • 34.Hoppe A, Hoffmann S, Holzhütter H-G. Including metabolite concentrations into flux balance analysis: thermodynamic realizability as a constraint on flux distributions in metabolic networks. BMC Systems Biology. 2007;1(1):23. doi: 10.1186/1752-0509-1-23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Schellenberger J, Lewis NE, Palsson BØ. Elimination of thermodynamically infeasible loops in steady-state metabolic models. Biophysical Journal. 2011;100(3):544–53. doi: 10.1016/j.bpj.2010.12.3707 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Müller AC, Bockmayr A. Fast thermodynamically constrained flux variability analysis. Bioinformatics. 2013;29(7):903–9. doi: 10.1093/bioinformatics/btt059 [DOI] [PubMed] [Google Scholar]
  • 37.Hamilton JJ, Contreras MC, Reed JL. Thermodynamics and H2 transfer in a methanogenic, syntrophic community. PLOS Computational Biology. 2015;11(7):e1004364. doi: 10.1371/journal.pcbi.1004364 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Noor E. Removing both internal and unrealistic energy-generating cycles in flux balance analysis. arXiv:180304999 [q-bio]. 2018. [Google Scholar]
  • 39.Machado D, Andrejev S, Tramontano M, Patil KR. Fast automated reconstruction of genome-scale metabolic models for microbial species and communities. Nucleic Acids Research. 2018;46(15):7542–53. doi: 10.1093/nar/gky537 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Diener C, Gibbons SM, Resendis-Antonio O. MICOM: Metagenome-scale modeling to infer metabolic interactions in the gut microbiota. mSystems. 2020;5(1). doi: 10.1128/mSystems.00606-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Ebrahim A, Lerman JA, Palsson BO, Hyduke DR. COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Systems Biology. 2013;7(1):74. doi: 10.1186/1752-0509-7-74 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Hucka M, Finney A, Sauro HM, Bolouri H, Doyle JC, Kitano H, et al. The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics. 2003;19(4):524–31. doi: 10.1093/bioinformatics/btg015 [DOI] [PubMed] [Google Scholar]
  • 43.Klamt S, Saez-Rodriguez J, Gilles ED. Structural and functional analysis of cellular networks with CellNetAnalyzer. BMC Systems Biology. 2007;1(1):2. doi: 10.1186/1752-0509-1-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Hädicke O, Klamt S. EColiCore2: a reference network model of the central metabolism of Escherichia coli and relationships to its genome-scale parent model. Scientific Reports. 2017;7:39647. doi: 10.1038/srep39647 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Orth JD, Conrad TM, Na J, Lerman JA, Nam H, Feist AM, et al. A comprehensive genome-scale reconstruction of Escherichia coli metabolism—2011. Molecular Systems Biology. 2011;7(1):535. doi: 10.1038/msb.2011.65 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Monk JM, Lloyd CJ, Brunk E, Mih N, Sastry A, King Z, et al. i ML1515, a knowledgebase that computes Escherichia coli traits. Nature Biotechnology. 2017;35(10):904–8. doi: 10.1038/nbt.3956 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Flamholz A, Noor E, Bar-Even A, Milo R. eQuilibrator—the biochemical thermodynamics calculator. Nucleic Acids Research. 2012;40(D1):D770–D5. doi: 10.1093/nar/gkr874 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Du S, Faiger H, Belakhov V, Baasov T. Towards the development of novel antibiotics: synthesis and evaluation of a mechanism-based inhibitor of Kdo8P synthase. Bioorganic & Medicinal Chemistry. 1999;7(12):2671–82. doi: 10.1016/s0968-0896(99)00233-3 [DOI] [PubMed] [Google Scholar]
  • 49.King ZA, Lu J, Dräger A, Miller P, Federowicz S, Lerman JA, et al. BiGG Models: A platform for integrating, standardizing and sharing genome-scale models. Nucleic Acids Research. 2016;44(D1):D515–D22. doi: 10.1093/nar/gkv1049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.García-Jiménez B, García JL, Nogales J. FLYCOP: metabolic modeling-based analysis and engineering microbial communities. Bioinformatics. 2018;34(17):i954–i63. doi: 10.1093/bioinformatics/bty561 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Schneider P, Kamp Av, Klamt S. An extended and generalized framework for the calculation of metabolic intervention strategies based on minimal cut sets. PLOS Computational Biology. 2020;16(7):e1008110. doi: 10.1371/journal.pcbi.1008110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Venayak N, von Kamp A, Klamt S, Mahadevan R. MoVE identifies metabolic valves to switch between phenotypic states. Nature Communications. 2018;9(1):5332. doi: 10.1038/s41467-018-07719-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Hartline CJ, Schmitz AC, Han Y, Zhang F. Dynamic control in metabolic engineering: Theories, tools, and applications. Metabolic Engineering. 2021;63:126–40. doi: 10.1016/j.ymben.2020.08.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Johns NI, Blazejewski T, Gomes AL, Wang HH. Principles for designing synthetic microbial communities. Current Opinion in Microbiology. 2016;31:146–53. doi: 10.1016/j.mib.2016.03.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Noor E, Flamholz A, Bar-Even A, Davidi D, Milo R, Liebermeister W. The Protein Cost of Metabolic Fluxes: Prediction from Enzymatic Rate Laws and Cost Minimization. PLOS Computational Biology. 2016;12(11):e1005167. doi: 10.1371/journal.pcbi.1005167 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Chen Y, Nielsen J. Mathematical modeling of proteome constraints within metabolism. Current Opinion in Systems Biology. 2021;25:50–6. doi: 10.1016/j.coisb.2021.03.003 [DOI] [Google Scholar]
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009093.r001

Decision Letter 0

Christoph Kaleta, Daniel A Beard

17 Mar 2021

Dear Dr. Klamt,

Thank you very much for submitting your manuscript "Designing Microbial Communities to Maximize the Thermodynamic Driving Force for the Production of Chemicals" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments. Please pay particular attention to the points raised with regard of the justification of assumptions that your algorithm makes.

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Christoph Kaleta

Associate Editor

PLOS Computational Biology

Daniel Beard

Deputy Editor

PLOS Computational Biology

***********************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors present a new method, ASTHERISC, for designing synthetic microbial communities with optimal thermodynamic driving force for the production of a target compound. The method is novel with regard to being the first community design method that accounts for thermodynamics, and I believe it is quite relevant to other people working in this field. However, I think the authors should discuss some limitations in a bit more detail. Please see my comments bellow.

- The theoretical model of division of labor (DoL) assumes that there is no cost associated with the transport of the exchanged metabolic intermediates. For instance, in Figure 1, if one adds a cost term for the transport of M, the DoL model actually has an increased metabolic burden. This issue is only very briefly mentioned in the discussion, but given that it can make the whole concept unfeasible in a real scenario, I think it should be better discussed. For instance, the authors could have tested the sensitivity of the solutions with respect to the addition of an exchange cost (not just a limit on the total number of exchanges).

- Also, the authors add artificial exchange reactions to transport the metabolites between the cytosol and the growth medium, which leads to the exchange of metabolites, like phosphorylated sugars, that in principle cannot be transported outside the cell (glyc3p and f6p are reported to be among the most frequently exchanged). Why not simply use the transporters that are already present in the models ?

- Another aspect which is not discussed is what would happen during the scale-up to very large (i.e. not well-mixed bioreactors). Could the rate of diffusion of the metabolic intermediates become a bottleneck?

- The results presented in Table 1 show the fraction of cases where the optimal MDF is increased. But it is not mentioned how marginal this increase can be. I think the application of ASTHERISC is mainly relevant to the case illustrated in Fig 2, where a pathway with negative OptMDF can be split between species to obtain a positive MDF. How many of the cases presented in table 1 are actually of this kind?

- Why are the metabolite concentrations specified in M (i.e. mol/L)? What is the volume referring to in this case? Is it the cell volume? Shouldn't these values be 1000-fold lower, i.e., in the mM range ?

- I don't understand why the authors opted to block the reactions with unknown dG0's, which is essentially the same as removing them from the models. I think it would be more realistic, and biologically meaningful, to leave them unconstrained, since in reality these reactions would be able to carry flux, potentially deviating flux from the target pathway.

- I didn't see any mention about uncertainty in the dG0 calculations, which is one of the outputs returned by eQuilibrator. Were the uncertainties ignored? Considering that some can be quite large, shouldn't they be included in equation 9 (i.e. by replacing the equality with lower/upper bounds)?

- The algorithm doesn't actually propose any strain design targets to implement the mutant strains. It is only briefly mentioned during the discussion that this is something that must be done after computing the optimal pathway. However, considering that the title of the paper is Designing Microbial Communities (...), I think the authors should elaborate a bit more

on this aspect.

- Small suggestion: all the four tables presented could be more intuitively represented as plots.

Reviewer #2: The comments are also attached as a pdf. It might be easier to read the equations there.

### Peer review of *Designing Microbial Communities to Maximize the Thermodynamic Driving Force for the Production of Chemicals*

The authors present an interesting algorithm for the computational design of single-species communities that maximise the thermodynamic driving force of producing a certain target metabolite. The paper is written in a clear and educational style, which makes it pleasant to read. The algorithm is potentially important, and has novelty for both the modelling of microbial communities and the design of bio-based production. Although I encountered some issues that I think should be resolved, I think the manuscript is fit for publication in Plos Computational Biology after some major revisions.

My main criticism is related to the strong and non-negligible assumptions that the ASTHERISC algorithm is based on. I understand that making these assumptions is necessary: the modelling of microbial communities is difficult, and some assumptions are needed to make computational problems feasible. However, I think the reader should be guided much more in interpreting the consequences of these assumptions. My remarks, ordered per section, can be found below.

#### Remarks Introduction

##### The example described in Figure 1

Although I appreciate the attempt to introduce the concept of DoL in a very basic way, the example model is a bit too basic for my taste. At a first glance, I am left with the idea that the assumptions that are made are necessary to draw the conclusions that the authors want to make. I think this is not the impression that the authors want to leave, because their argument is much more general. Also, the statement “Hence, it is unlikely that a DoL strategy will be advantageous with respect to the metabolic burden …” is repeated a couple of times in the introduction, and I think it is not entirely correct. See my explanation below on how this could be nuanced.

The two assumptions that I find too specific are:

- the saturation functions only depend on the substrate of the reaction, rather than also on the product.

- the same amount of enzymes E1 and E2 is needed for a specific flux

I know that the authors already state that the second assumption does not affect the conclusions, but the reader is still left wondering what the effect of this assumption might be.

I would suggest two edits:

1. I think it is fairly straightforward to write a small mathematical proof that DoL does not give an advantage when the metabolite concentrations are unchanged, i.e., the saturation functions remain the same. This proof can be added to the SI, and can be referred to in this Introduction. I have added an outline for the proof below.

Adding this proof would also allow for a more exact statement. The authors now state: *“Hence, it is unlikely that a DoL strategy will be advantageous with respect to the metabolic burden … ”* But this should actually be something like: *“Hence, it is unlikely that a DoL strategy will be advantageous with respect to the metabolic burden, as long as the kinetic advantage of having different metabolite concentrations in the different strains do not outweigh the added costs of metabolite transport.”* The statement is a bit less strong, but definitely more correct. Also, adjusting this statement would allow the authors to give a preview on the rest of their work, because adding a growth advantage by having different metabolite concentrations is exactly what they are going to do, albeit focusing on thermodynamics and not on kinetics.

2. In this Introduction I would at least remove the first assumption, and just make the saturation functions depend on all metabolite concentrations. This does not change the reasoning at all. Personally, I would also skip the second assumption, but I realize that this is a matter of taste. Generalising this here might complicate matters. It is also not really necessary as long as you can refer to the general proof in the SI.

###### An idea for the proof:

*I wanted to write a quick outline for the proof, but it got a bit more involved than I expected. I hope the proof below is useful for the authors. Please check everything carefully, because I did not.*

One could start by taking a general reaction vector, $v$, that leads to some production of $P$. We can multiply $v$ by a constant and still have the production pathway, and the production rate of $P$ is proportional to the reaction rates in $v$. Let $\\tilde v$ now be rescaled such that $\\tilde v_P = 1$. The enzyme costs of this production pathway can be expressed in terms of some (arbitrary) saturation function $f_i(S,M,P,…)=:f_i(x)$ for each reaction $i$. The enzyme costs for having $v_P=1$ are then

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><munder><mo>∑</mo><mi>i</mi></munder><msub><mi>e</mi><mi>i</mi></msub><mo>=</mo><munder><mo>∑</mo><mi>i</mi></munder><msub><mover><mi>v</mi><mo>~</mo></mover><mi>i</mi></msub><mo>/</mo><mo>(</mo><msub><mi>k</mi><mrow><mtext>cat</mtext><mo>,</mo><mi>i</mi></mrow></msub><msub><mi>f</mi><mi>i</mi></msub><mo>(</mo><mi>x</mi><mo>)</mo><mo>)</mo><mo>.</mo></mrow><annotation encoding="LaTeX">\\sum_i e_i = \\sum_i \\tilde v_i/(k_{\\text{cat},i}f_i(x)).</annotation></semantics></math>

So, given a limited amount of resources ($\\sum_i e_i \\leq C$) we can get a maximal production flux of

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><msub><mi>v</mi><mi>P</mi></msub><mo>=</mo><mfrac><mi>C</mi><mrow><msub><mo>∑</mo><mi>i</mi></msub><msub><mover><mi>v</mi><mo>~</mo></mover><mi>i</mi></msub><mo>/</mo><mo>(</mo><msub><mi>k</mi><mrow><mtext>cat</mtext><mo>,</mo><mi>i</mi></mrow></msub><msub><mi>f</mi><mi>i</mi></msub><mo>(</mo><mi>x</mi><mo>)</mo><mo>)</mo></mrow></mfrac></mrow><annotation encoding="LaTeX">v_P = \\frac{C}{\\sum_i \\tilde v_i/(k_{\\text{cat},i}f_i(x))}</annotation></semantics></math>.

Note however that this is the production flux per gram biomass. The total biomass may also be constrained by $B\\leq C_2$. The absolute production flux for a single strain is thus

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><msub><mi>J</mi><mi>P</mi></msub><mo>=</mo><msub><mi>C</mi><mn>2</mn></msub><mfrac><mi>C</mi><mrow><msub><mo>∑</mo><mi>i</mi></msub><msub><mover><mi>v</mi><mo>~</mo></mover><mi>i</mi></msub><mo>/</mo><mo>(</mo><msub><mi>k</mi><mrow><mtext>cat</mtext><mo>,</mo><mi>i</mi></mrow></msub><msub><mi>f</mi><mi>i</mi></msub><mo>(</mo><mi>x</mi><mo>)</mo><mo>)</mo></mrow></mfrac></mrow><annotation encoding="LaTeX">J_P= C_2 \\frac{C}{\\sum_i \\tilde v_i/(k_{\\text{cat},i}f_i(x))}</annotation></semantics></math>.

Now, we can divide the reaction pathway over two strains. One could for example take a $v^1$ and $v^2$, such that $v^1+v^2 = v$, where one should note that the metabolites should be transported in between, and this should be written down more carefully than I did here. Again, let $\\tilde v_1$ and $\\tilde v_2$ be normalised such that $\\tilde v_1 + \\tilde v_2$ produces $v_P=1$. Again, we have an enzyme constraint per strain, so we get maximal fluxes per gram biomass of

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><msup><mi>λ</mi><mn>1</mn></msup><mo>=</mo><mfrac><mi>C</mi><mrow><msub><mo>∑</mo><mi>i</mi></msub><msubsup><mover><mi>v</mi><mo>~</mo></mover><mi>i</mi><mn>1</mn></msubsup><mo>/</mo><mo>(</mo><msub><mi>k</mi><mrow><mtext>cat</mtext><mo>,</mo><mi>i</mi></mrow></msub><msub><mi>f</mi><mi>i</mi></msub><mo>(</mo><mi>x</mi><mo>)</mo><mo>)</mo></mrow></mfrac></mrow><annotation encoding="LaTeX">\\lambda^1 = \\frac{C}{\\sum_i \\tilde v^1_i/(k_{\\text{cat},i}f_i(x))}</annotation></semantics></math>,

and

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><msup><mi>λ</mi><mn>2</mn></msup><mo>=</mo><mfrac><mi>C</mi><mrow><msub><mo>∑</mo><mi>i</mi></msub><msubsup><mover><mi>v</mi><mo>~</mo></mover><mi>i</mi><mn>2</mn></msubsup><mo>/</mo><mo>(</mo><msub><mi>k</mi><mrow><mtext>cat</mtext><mo>,</mo><mi>i</mi></mrow></msub><msub><mi>f</mi><mi>i</mi></msub><mo>(</mo><mi>x</mi><mo>)</mo><mo>)</mo></mrow></mfrac></mrow><annotation encoding="LaTeX">\\lambda^2=\\frac{C}{\\sum_i \\tilde v^2_i/(k_{\\text{cat},i}f_i(x))}</annotation></semantics></math>,

where $\\lambda^j$ denotes the number of times that $\\tilde v_1$ can run per gram biomass of strain 1. Since we need to add $\\tilde v^1$ and $\\tilde v^2$ in equal amounts to get the production of $v_P=1$, one needs

$B^1 \\lambda^1 = B^2 \\lambda^2$,

with $B^1 + B^2 = C_2$. This means that $B^1/B^2 = \\lambda^2/\\lambda^1$, and inserting this in the total biomass constraint gives

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><msub><mi>C</mi><mn>2</mn></msub><mo>=</mo><msup><mi>B</mi><mn>1</mn></msup><mo>+</mo><msup><mi>B</mi><mn>2</mn></msup><mo>=</mo><msup><mi>B</mi><mn>2</mn></msup><mfrac><msup><mi>λ</mi><mn>2</mn></msup><msup><mi>λ</mi><mn>1</mn></msup></mfrac><mo>+</mo><msup><mi>B</mi><mn>2</mn></msup><mo>=</mo><msup><mi>B</mi><mn>2</mn></msup><mo>(</mo><mn>1</mn><mo>+</mo><mfrac><msup><mi>λ</mi><mn>2</mn></msup><msup><mi>λ</mi><mn>1</mn></msup></mfrac><mo>)</mo></mrow><annotation encoding="LaTeX">C_2 = B^1 + B^2 = B^2\\frac{\\lambda^2}{\\lambda^1} + B^2 = B^2 (1+ \\frac{\\lambda^2}{\\lambda^1})</annotation></semantics></math>,

so that

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><msup><mi>B</mi><mn>2</mn></msup><mo>=</mo><mfrac><msub><mi>C</mi><mn>2</mn></msub><mrow><mo>(</mo><mn>1</mn><mo>+</mo><mfrac><msup><mi>λ</mi><mn>2</mn></msup><msup><mi>λ</mi><mn>1</mn></msup></mfrac><mo>)</mo></mrow></mfrac></mrow><annotation encoding="LaTeX">B^2 = \\frac{C_2}{(1+ \\frac{\\lambda^2}{\\lambda^1})}</annotation></semantics></math>.

The total production is given by

<math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><msub><mi>J</mi><mi>P</mi></msub><mo>=</mo><msup><mi>B</mi><mn>1</mn></msup><msup><mi>λ</mi><mn>1</mn></msup><mo>=</mo><msup><mi>B</mi><mn>2</mn></msup><msup><mi>λ</mi><mn>2</mn></msup><mo>=</mo><mfenced><mfrac><msub><mi>C</mi><mn>2</mn></msub><mrow><mo>(</mo><mn>1</mn><mo>+</mo><mfrac><msup><mi>λ</mi><mn>2</mn></msup><mrow><mi>l</mi><mi>a</mi><mi>m</mi><mi>b</mi><mi>d</mi><msup><mi>a</mi><mn>1</mn></msup></mrow></mfrac><mo>)</mo></mrow></mfrac></mfenced><mfenced><msup><mi>λ</mi><mn>2</mn></msup></mfenced><mo>=</mo><mfrac><msub><mi>C</mi><mn>2</mn></msub><mrow><mn>1</mn><mo>/</mo><msup><mi>λ</mi><mn>1</mn></msup><mo>+</mo><mn>1</mn><mo>/</mo><msup><mi>λ</mi><mn>2</mn></msup></mrow></mfrac></mrow><annotation encoding="LaTeX">J_P = B^1 \\lambda^1 = B^2 \\lambda^2 = \\left(\\frac{C_2}{(1+ \\frac{\\lambda^2}{lambda^1})}\\right)\\left(\\lambda^2\\right)=\\frac{C_2}{1/\\lambda^1+1/\\lambda^2}</annotation></semantics></math>,

where we can insert the expression for the maximal possible $\\lambda^1$ and $\\lambda^2$ to get:

$J_P = \\frac{C_2C}{\\sum_i \\tilde v^1_i/(k_{\\text{cat},i}f_i(x)) + \\sum_i \\tilde v^2_i/(k_{\\text{cat},i}f_i(x))} = \\frac{C_2C}{\\sum_i \\tilde v_i/(k_{\\text{cat},i}f_i(x))}$,

which is exactly equal to the production that could be achieved with DoL.

#### Remarks Methods

The Methods-section is clear and covers all prerequisite knowledge without becoming excessively long or boring. The pseudocode is written very clearly, and will definitely help potential users of ASTHERISC. Several strong assumptions are made in the Methods section, and I feel the following should at least be justified a lot more:

##### Equation (3): <math xmlns="http://www.w3.org/1998/Math/MathML"><semantics><mrow><mi>N</mi><mi>r</mi><mo>=</mo><mn>0</mn></mrow><annotation encoding="LaTeX">Nr=0</annotation></semantics></math>

Of course, this seems like a standard assumption that is always made in stoichiometric modelling. However, in this paper the authors are considering the thermodynamic effects that in part depend on metabolite concentrations. The correct formula to use is $Nr=\\mu c$, where $\\mu$ is the growth rate, and $c$ is the vector of metabolite concentrations. This dilution term implies that all metabolites should have some net production when the growth rate is positive. However, is this small net production still thermodynamically feasible in the solutions that ASTHERISC finds? ASTHERISC might just select a really high metabolite concentration, and it will detect no problem because there does not have to be any net production of this metabolite. I wonder if the authors foresee any problems, or if this issue is negligible. This might be important to add in the text. It is true that this issue is resolved by the following assumption that the growth rate is 0, but this might just be the issue: the dilution of metabolites is one of the reasons that I would say it is unreasonable to set the growth rates to 0.

##### The growth rate of all strains is 0

Now this is a rather strong assumption to make, and, as a reader, I really need to be convinced that this is justified. For example, if ASTHERISC assumes that biomass production is zero, how do I know that the algorithm doesn't choose a division of reactions that makes biomass production impossible for one of the strains? For example, it might be thermodynamically ideal to have a high concentration of ATP in one strain, and a low concentration in another. But can the strain with low ATP still grow? Do the biomass-producing reactions not become thermodynamically infeasible? When an organism is unable to grow, is the chosen division of labor still useful for biotechnological applications?

My suggestion is that the authors explicitly state these type of problems. Finding all possible Divisons of Labors is still useful, if they can afterwards be tested for their feasibility. For example, testing if growth is still possible, and if the suggested metabolite exchanges are possible (as the authors do point out in the Dicussion). I would like to see for which multi-strain solutions presented in the paper, growth at these metabolite concentrations is still possible.

##### All strains have identical fractions

Just like in the example in the Introduction, this assumption feels very limiting, but it might not be. Maybe the authors can justify this assumption, but this is not done at the moment. Now, the reader has to think about the consequences of this assumption, and I think that this is actually a task for the authors. My suggestion is that the authors could point out that they focus on the thermodynamic feasibility of a pathway, and not on resource-allocation. Maybe, the multiplication of all reaction rates of one strain is therefore allowed, and then the biomass fraction of the strain does not matter.

To get a feel for the consequences of this assumption, I suggest that the authors write down the full community model without assuming identical fractions (possibly in the SI). Then, the authors may look at the consequences of multiplying the biomass fraction of strain $i$ with a scalar: $F_i \\rightarrow \\lambda_i F_i$. It will turn out that to keep statisfying the steady-state constraint, all reaction rates of strain $i$ must then be divided by $\\lambda_i$. It can then be seen where this will have consequences. I suspect the most important influence in on *satisfying the inhomogeneous flux bounds*. The authors may then explain why this is acceptable.

Probably, the authors can argue that leaving this flexibility of choosing the biomass fractions out only takes away flexibility from the model. So, something that is thermodynamically feasible in this constrained case, will also be feasible in the general case. This seems to be true, as long as the authors do not consider kinetics and limited cellular resources.

#### Remarks Results

The results of applying ASTHERISC to example models are presented. The models are well-chosen to illustrate the workings of the model, and the results are exposed in a clear and un-assuming fashion. The example of the community solution serves its purpose: it illustrates the principle underlying ASTHERISC, and its complexity shows why these results cannot be obtained without a computational toolbox.

##### Page 15

"Reactions for which no $\\Delta_rG'^0$ could be found or calculated (93 reactions (18%) in EColiCore2 and 646 (24%) in iML1515) were blocked in the model (rate fixed to zero)" There are several things I'm wondering about after reading this comment. Why is the chosen solution of blocking these reactions the most reasonable? What are the consequences? Is there any particular property of these reactions which explains why the standard Gibbs free energy cannot be found or calculated?

##### Bottom page 15

"Interestingly, the highest MDF improvements occur when high product yields are demanded. This indicates that low-yield pathways in the single strain might have high MDFs which are more difficult to improve by a community solution." I agree that this is an interesting note.

##### Examplary community solution

The authors state that "kdo8p is of interest as its synthase (KDOPS) has been evaluated as a potential antibiotics target". This does not sufficiently explain to me why kdo8p is of interest. How can the over-production of kdo8p be used to inhibit its synthase? It is not that I do not believe that the authors are correct, I just need a little more explanation.

I don't understand why the concentration of dha is decreased in ecoli2. The authors state that it is used to facilitate the (reverse) PGI, G6PDH2r and PGL reactions, but I don't see this. dha seems to only be used as a substrate in ecoli2, so why would it not be favourable to maximise its concentration, so that g3p-concentration can be decreased even more. Similarly, one would expect dha to be decreased in ecoli1.

#### Remarks Discussion

The discussion is clear and addresses some important points. Especially the paragraph starting with "The application of our algorithm to a metabolic core and a genome-scale model" is very important. Some additional questions that I had with the eventual application of the algorithm:

- I agree that reactions can be knocked-out, but will the optimisation of metabolite concentrations happen automatically? The genetically altered organism has not been evolved to use its new reaction set efficiently, so how do we know that the metabolite concentrations that make the production pathway thermodynamically feasible will be selected?

- Is there a way to add the energy expenditure needed for exchanging metabolites into the algorithm? I guess, maybe naively, that when a metabolite is exported from a low concentration, and imported to a high concentration, at least one of the transport steps needs to use energy. This means that it should at least be thermodynamically feasible for this strain to produce energy.

- Is there a way in ASTHERISC to select the metabolites that may be exchanged by the algorithm? This does not seem hard to implement.

In addition, I would like a discussion of the kinetic analog of this thermodynamic problem. Except for making pathways thermodynamically feasible, Divison of Labor could also help with the efficient usage of cellular resources. Indeed, as the authors show in the Introduction, not when the metabolite concentrations are unchanged, but what if these concentrations are changed. Can this problem also be addressed? If not, why not? Or is it just less important according to the authors?

#### Minor comments

**Figure 1** I think Figure 1 is rather messy. A reader that did not understand the explanation in the text, will probably not be aided by this figure a lot. This is a matter of taste though, not of right and wrong.

**Caption Figure 2** Add comma: "would follow, indicating"

**After Equation 5** Sentence starting with "Furthermore, the biomass fractions" is pretty hard to read.

**Equation 9** It is not immediately clear whether the $i$-th column is taken before or after the transpose

**Equation 10** I fail to see why the notation is different between Equations (9) and (10): $\\Delta_r G_i' \\rightarrow \\Delta G_i$ and $\\hat N \\rightarrow N$. These might be typos. If not, more explanation might be needed.

**Page 11: "A specified minimal fraction of this maximum yield must be reached in all subsequent community model calculations"** Can this fraction also be chosen larger than 1? Or is the maximum possible yield determined without taking into account the thermodynamic feasibility? If so, please indicate this in the text.

**Page 14** Typo in: "had to be separated in to strains"

**Page 15** Typo: "For each of the producible target metabolite"

**Page 17** "Some of these dominant MDF values occurred in all analyzed models." So, there are MDF values that appear often in the core- and in the genome-scale model? If so, it might be interesting to explain a bit more why this is.

**The use of "exemplary"** It might be that I'm wrong here, but it seems that the word 'exemplary' is used meaning that something is an example, but my interpretation of this word is that it is "a perfect example". It therefore seems a bit like bragging to me when this word is used.

**Exemplary community solution** "In the community solution found by ASTHERISC, the MDF value can be increased by 26 % up to 2.87 kJ/mol reaching a yield of 0.733 mol/mol" It confuses me that this yield is here repeated, it seems like this yield is different than the yield of the single-strain model. But this was exactly the yield that was requested, right? So, the reported MDF value for the single-strain model also attains this yield?

Reviewer #3: Review on „Designing microbial communities to maximize the thermodynamic driving force for the production of chemicals by Bekiaris and Klamt

Currently, investigations of mixed synthetic cultures is in the focus of experimental and theoretical studies in biotechnology. Mixed cultures have certain advantages over single species cultivation (for example division of labour) and therefore, the potential of this approach becomes important for the design of processes in various applications. Synthetic mixed cultures could be seen as a mixture of two different species or a mixture of the same species with defined mutations. The manuscript is based on the second variant and analysis the steady-state behaviour of two or three different cultures form a thermodynamic point of view; while standard tools for analysis of metabolic networks like FBA (and its extensions) usually determine intracellular fluxes, more sophisticated tools take into account thermodynamic limitations only for single species. Bekiaris and Klamt ask on a meaningful division of the metabolic network such, that the production of selected compounds of interest is improved (or even first possible) by using more than one strain variant. The criterion used is the MDF, that is, the Max-min-driving force (the maximization of the minimal negative value of the Gibbs energy of each reaction of the entire network/ pathway). The study presented by Bekiaris and Klamt is accurate and comprehensive, and I only have a minor number of comments to improve the manuscript.

(i) How can the minimal OptMDF advantage (0,2 kJ/mol) be interpreted? Are there any reasons for taking just this value?

(ii) Are there any constraints on the uptake rate of glucose and other main components from the environment?

(iii) The authors have chosen models with a higher number of reactions and components. For many applications core models are applied? How would the results look like if a standard core model (for example from the COBRA toolbox or the compressed version of the Ecolicor2 with only 82 reactions and 54 metabolites) is used? This is of special interest in light of the result that most exchange metabolites are found in glycolysis or in pathways closely related to glycolysis.

(iv) A basic assumption is that the growth rate for all species is the same. I miss a discussion of the influence of the growth rate on the results. What is the value of the growth rate for the example presented in detail (kdo8p)? Strain ecoli1 takes up glucose and I expect a high growth rate, and I wonder why - in practice - a substrate like dha should be taken up by ecoli2 even in the case a transport system is available?

Is it implemented that the growth rate must be equal for both strains (maybe I missed this constrain in the Method section)?

___

Minor comment: while I could reproduce the value of Figure 2 right, I found a different value for Figure 2 left (-0,245 instead of -0,19).

**********

Have all data underlying the figures and results presented in the manuscript been provided?

Large-scale datasets should be made available via a public repository as described in the PLOS Computational Biology data availability policy, and numerical data that underlies graphs or summary statistics should be provided in spreadsheet form as supporting information.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Daniel Machado

Reviewer #2: Yes: Daan de Groot

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, PLOS recommends that you deposit laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. For instructions, please see http://journals.plos.org/compbiol/s/submission-guidelines#loc-materials-and-methods

Attachment

Submitted filename: peer review comments PCOMPBIOL-D-21-00273.pdf

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009093.r003

Decision Letter 1

Christoph Kaleta, Daniel A Beard

18 May 2021

Dear Dr. Klamt,

We are pleased to inform you that your manuscript 'Designing Microbial Communities to Maximize the Thermodynamic Driving Force for the Production of Chemicals' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests. Moreover, there is one final comment by reviewer 3 which I expect you to address along with these formatting changes.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Christoph Kaleta

Associate Editor

PLOS Computational Biology

Daniel Beard

Deputy Editor

PLOS Computational Biology

***********************************************************

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have properly addressed all my comments.

Reviewer #2: The reviewers addressed all my raised concerns, so I advise publication.

Reviewer #3: Second review

The authors responded satisfactorily to my questions. However, based also on the comments of other reviewers, I recommend to make a clear statement already in the abstract and in the introduction that a scenario is considered where the growth rate is set to zero. This is only mentioned incidental but has a tremendous effect on the outcome of the simulation results. Since a basic requirement is that the growth rates are equal, a number of additional constraints appear and I would like to see a deeper discussion on this issue in the manuscript. Currently, the already added paragraph in the discussion section is not enough.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: None

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: Yes: Daniel Machado

Reviewer #2: Yes: Daan de Groot

Reviewer #3: No

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009093.r004

Acceptance letter

Christoph Kaleta, Daniel A Beard

9 Jun 2021

PCOMPBIOL-D-21-00273R1

Designing Microbial Communities to Maximize the Thermodynamic Driving Force for the Production of Chemicals

Dear Dr Klamt,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Katalin Szabo

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Proof showing that division of labor cannot increase the product synthesis flux compared to a single-strain solution for arbitrary pathways and kinetics (under assumption of constant metabolite concentrations in the strains).

    (PDF)

    S2 Text. Detailed description of configuration of the single-strain models and the derived multi-strain community models of E. coli.

    (PDF)

    S3 Text. The ASTHERISC package text report generated for the toy model.

    (PDF)

    S4 Text. The ASTHERISC package text report generated for the discussed community solution example for kdo8p synthesis with ecolicore2double.

    (PDF)

    Attachment

    Submitted filename: peer review comments PCOMPBIOL-D-21-00273.pdf

    Attachment

    Submitted filename: Response_Letter_final.docx

    Data Availability Statement

    Data are available within the manuscript and Supporting Information and in parts from a public repository: https://github.com/klamt-lab/CommModelPy https://github.com/klamt-lab/astheriscPackage.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES