Nature Communications. 2019 Mar 5;10:1045. doi: 10.1038/s41467-019-08890-y

A theoretical framework for controlling complex microbial communities

Marco Tulio Angulo 1, Claude H Moog 2, Yang-Yu Liu 3,4
PMCID: PMC6401173  PMID: 30837457

Abstract

Microbes form complex communities that perform critical roles for the integrity of their environment or the well-being of their hosts. Controlling these microbial communities can help us restore natural ecosystems and maintain healthy human microbiota. However, the lack of an efficient and systematic control framework has limited our ability to manipulate these microbial communities. Here we fill this gap by developing a control framework based on the new notion of structural accessibility. Our framework uses the ecological network of the community to identify minimum sets of its driver species, manipulation of which allows controlling the whole community. We numerically validate our control framework on large communities, and then we demonstrate its application for controlling the gut microbiota of gnotobiotic mice infected with Clostridium difficile and the core microbiota of the sea sponge Ircinia oros. Our results provide a systematic pipeline to efficiently drive complex microbial communities towards desired states.


Controlling microbial communities could help restore ecosystems and maintain healthy microbiota. Here, the authors introduce the notion of structural accessibility and develop a framework to identify minimal sets of driver species, manipulation of which could allow control of a microbial community.

Introduction

Microorganisms form complex communities that play critical roles in maintaining the well-being of their hosts or the integrity of their environment1–4. Disrupting these microbial communities can have severe consequences. In humans, for example, a disruption to the gut microbiota (the aggregate of microorganisms residing in our intestine) is associated with several disorders including irritable bowel syndrome, Clostridium difficile infection (CDI), autism, obesity, and cavernous cerebral malformations5–7. For agricultural crops, a disruption of the rhizosphere microbiota can reduce their disease resistance and hence decrease the overall crop yield8,9. In the oceans, a disruption to their microbiota can impact global climate by altering carbon sequestration rates3,4,10. Driving disrupted microbial communities back to their healthy states could offer novel solutions to prevent and treat complex human diseases, enhance sustainable agriculture, and regulate global warming11,12. For instance, inoculating soil microbes can restore terrestrial ecosystems13, and fecal microbiota transplantation (FMT) is so far the most successful therapy for treating recurrent CDI14. Despite the success of these empirical strategies, a broad application of microbial-manipulation strategies will be possible only if we can efficiently control large complex microbial communities15.

There are two big challenges down the road. First, an efficient control method should only manipulate a minimum set of species in the community. However, we still lack a systematic method to identify minimum sets of those “driver species” whose control can help us drive a whole community to desired states. Here, we use the term “species” without necessarily representing the lowest major taxonomic rank. One could also organize microbes by strains, genera, or operational taxonomical units. Second, even when those driver species have been identified, designing the control strategy that should be applied to them (e.g., how their abundance needs to be manipulated) for driving the community towards the desired state remains difficult. This difficulty arises because of the inherent complexity of microbial dynamics and our limited knowledge of them.

To address those two challenges, here we develop a control framework using the ecological network underlying the microbial community. First, we introduce the new notion of “structural accessibility”, which generalizes the notion of structural linear controllability16,17 to systems with nonlinear dynamics. Then, we derive a complete graph-theoretical  characterization of structural accessibility. This result enables  us to efficiently identify minimum sets of driver species of any microbial community purely from the topology of its underlying ecological network, even if some microbial interactions are missing and its population dynamics is unknown. Once the driver species are identified, we systematically design feedback control strategies to drive a microbial community towards the desired state, even if its dynamics is not precisely known. We numerically validated our control framework in large microbial communities, analyzing its performance for different parameters of the community (e.g., the connectivity of its underlying ecological network), and for errors in the ecological network used to identify the driver species. Finally, we demonstrate our framework by controlling the core microbiota of the sea sponge Ircinia oros, and restoring the gut microbiota of gnotobiotic mice infected by Clostridium difficile. Our results provide a rational and systematic framework to control microbial communities and other complex ecosystems.

Results

Modeling controlled microbial communities

Our framework focuses on the impact that manipulating a subset of species has on the abundances of other species. We thus consider a microbial community whose state at time $t$ is determined by the abundance profile $x(t) \in \mathbb{R}^N$ of its $N$ species, where the $i$-th entry $x_i(t)$ represents the abundance of the $i$-th species at time $t$. The state evolves according to some population dynamics

$$\dot{x}(t) = f(x(t)), \tag{1}$$

where the function $f:\mathbb{R}^N \to \mathbb{R}^N$ models the species' intrinsic growth and the inter/intra-species interactions of the community (see Supplementary Note 1 for details). For most microbial communities $f$ is unknown and difficult to infer given the many interaction mechanisms between microbes18. Thus, we assume that $f(x)$ is some unknown meromorphic function of $x$ (i.e., the quotient of analytic functions). This assumption is very mild, as it is satisfied by most population dynamics models19.

Instead of knowing the population dynamics of the microbial community, we assume we know its underlying ecological network G=(X,E). This network is a directed graph where nodes X = {x1, …, xN} represent species, and edges (xj → xi) ∈ E denote that the j-th species has a direct ecological impact (i.e., direct promotion or inhibition of growth) on the i-th species (Fig. 1a). Mapping ecological networks requires performing mono- and co-culture experiments20,21, using system identification techniques with time-resolved abundance data22,23, or using steady-state abundance data via a recently developed inference method24. In general, ecological networks are different from correlation networks20,25 because correlation does not imply causation26,27.

Fig. 1.

Controlling a microbial community. a Ecological network $G$ for a toy microbial community of $N = 3$ species (green, yellow, blue). The controlled ecological network $G_c$ contains $M = 1$ control input actuating the third species. b Initial and desired abundance profiles (bars). Controlling the community consists in driving its state from the initial state $x_0$ to the desired state $x_d$, represented by two points in the state space of the community. c In the continuous control scheme, the control inputs $u(t)$ are continuous signals modifying the growth of the actuated species. The controlled population dynamics of this community is given by $\dot{x}_1 = 0.1 + x_1(1 - x_1/5)(x_1/3 - 1) - 0.1 x_1 x_3/(1 + x_3)$, $\dot{x}_2 = 0.1 + x_2(1 - x_2/4)(x_2 - 1) + x_2 x_3/(1 + x_3)$, $\dot{x}_3 = x_3(1 - x_3/2)(x_3 - 1) + u$. In the absence of control, this community has two equilibria $x_0 = (3.14, 4.58, 1)$ and $x_d = (4.57, 4.73, 2)$, chosen as the initial and desired states, respectively. d In the impulsive control scheme, the control inputs $u(t)$ are impulses applied at the intervention instants $\mathbb{T} = \{t_1, t_2, \ldots\}$, instantaneously changing the abundance of the actuated species. The controlled population dynamics is the same as in panel (c), except that $\dot{x}_3 = x_3(1 - x_3/2)(x_3 - 1)$ and $x_3(t^+) = x_3(t) + u(t)$ if $t \in \mathbb{T} = \{5, 10, 15\}$. Under this controlled population dynamics, our mathematical formalism identifies $x_3$ as the solo driver species needed to drive this microbial community (Example 1 in Supplementary Note 2).

Controlling the community consists in driving its state from an initial value $x_0 = x(0) \in \mathbb{R}^N$ at $t = 0$ (e.g., a “diseased” state) towards the desired value $x_d \in \mathbb{R}^N$ (e.g., the “healthy” state, Fig. 1b). We assume that the community will not evolve by itself to $x_d$. To drive the community, we use $M$ control inputs $u(t) \in \mathbb{R}^M$ directly affecting certain species that we call “actuated species” (Fig. 1a). Control inputs encode a combination of $M$ control actions applied at time $t$. We consider four possible control actions. If $u_j(t) < 0$, the $j$-th control action at time $t$ can be a bacteriostatic agent or bactericide, decreasing the abundance28 of the species it actuates. If $u_j(t) > 0$, the $j$-th control action at time $t$ can be a prebiotic29 or transplantation, stimulating the growth or engrafting a consortium of the species it actuates, respectively. Probiotics administration30 and FMTs14 are examples of transplantations. To specify the species actuated by each control input we introduce the controlled ecological network $G_c = (X \cup U, E \cup B)$. Here, $U = \{u_1, \ldots, u_M\}$ are the control input nodes and $(u_j \to x_i) \in B$ denotes that the $j$-th control input actuates the $i$-th species (Fig. 1a).

We introduce two control schemes describing how the control inputs change the species abundance (see Supplementary Note 1 for details). The first control scheme models a combination of prebiotics (if uj(t) > 0) and bacteriostatic agents (if uj(t) < 0) as continuous control inputs modifying the growth of the actuated species (Fig. 1c):

$$\dot{x}(t) = f(x(t)) + g(x(t))\,u(t), \quad t \in \mathbb{R}. \tag{2}$$

The second control scheme models a combination of transplantations (if $u_j(t) > 0$) and bactericides (if $u_j(t) < 0$) applied at discrete intervention instants $\mathbb{T} = \{t_1, t_2, \ldots\}$, rendering impulsive control inputs that instantaneously modify the abundance of the actuated species (Fig. 1d):

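$$\begin{aligned} \dot{x}(t) &= f(x(t)) && \text{if } t \notin \mathbb{T}, \\ x(t^+) &= x(t) + g(x(t))\,u(t) && \text{if } t \in \mathbb{T}. \end{aligned} \tag{3}$$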

Above, $x(t^+)$ denotes the state “right after time $t$”, so $x(t)$ “jumps” at $t \in \mathbb{T}$ if $u(t) \neq 0$. The pair $\{f, g\}$ characterizes both control schemes, describing the controlled population dynamics of the microbial community. The function $g:\mathbb{R}^N \to \mathbb{R}^{N \times M}$ models the direct susceptibility of the species to the control actions. The $j$-th control input actuates the $i$-th species if $g_{ij} \not\equiv 0$. Because $g$ is typically unknown, we just assume that $g(x)$ is some unknown meromorphic function of $x$ such that $g_{ij} \not\equiv 0$ iff $(u_j \to x_i) \in B$.

Notice that when all species are directly controlled (i.e., an independent control input actuates each species), the whole microbial community can easily be driven to the desired state. Fortunately, as we show next, actuating all the species is far from being necessary. Thanks to the inter-species interactions encoded in the ecological network G, we can identify minimum sets of species that we need to actuate in order to drive the whole community. We call those species “driver species”.

Identifying driver species

To understand when a set of actuated species is a set of driver species, consider the three-species community with Generalized Lotka–Volterra (GLV) population dynamics of Fig. 2a. This toy community has one control input actuating $x_3$. Actuating only this species creates an autonomous element, namely a constraint between some species abundances that the control input cannot break, confining the state of the community to a low-dimensional manifold (Fig. 2a, right). More precisely, our mathematical formalism reveals that $\xi = x_1 x_2$ is the autonomous element (Example 2 in Supplementary Note 2). Indeed, differentiating $\xi$ with respect to time yields $\dot{\xi} = x_1 x_2 (1 - x_3) + x_1 x_2 (-1 + x_3) \equiv 0$, confining the community to $\{x \in \mathbb{R}^3 \mid x_1 x_2 = x_1(0)\,x_2(0)\}$. Intuitively, the autonomous element exists because the control input cannot change $x_1$ without changing $x_2$ in a predefined way, making it impossible to drive the community in the three-dimensional state space. This observation indicates that $x_3$ alone cannot be a driver species for this community. Introducing a second control input actuating $x_1$ helps the community jump out of the low-dimensional manifold, eliminating the autonomous element and allowing us to drive this community to any desired state with positive abundance (Fig. 2b, and Example 6 in Supplementary Note 5). Therefore, $\{x_1, x_3\}$ is a minimum set of driver species for this community.
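The invariance of the product x1·x2 can be checked numerically. The following is a minimal sketch, assuming the GLV dynamics of Fig. 2a and impulses that act on x3 only; the integrator, the impulse spacing `gap`, the range of post-impulse abundances, and all function names are illustrative choices, not taken from the paper.

```python
import random

def glv(x):
    """GLV dynamics of the toy community in Fig. 2a."""
    x1, x2, x3 = x
    return (x1 * (-1 + x3), x2 * (1 - x3), x3 * (-0.5 + 1.5 * x3))

def rk4_step(x, dt):
    """One classical Runge-Kutta step of the uncontrolled dynamics."""
    def shift(a, k, h):
        return tuple(ai + h * ki for ai, ki in zip(a, k))
    k1 = glv(x)
    k2 = glv(shift(x, k1, dt / 2))
    k3 = glv(shift(x, k2, dt / 2))
    k4 = glv(shift(x, k3, dt))
    return tuple(xi + dt / 6 * (a + 2 * b + 2 * c + d)
                 for xi, a, b, c, d in zip(x, k1, k2, k3, k4))

def simulate(x0, n_impulses=10, gap=0.2, dt=1e-3, seed=0):
    """Apply random impulses to x3 and record x1*x2 after each interval."""
    rng = random.Random(seed)
    x = tuple(x0)
    products = [x[0] * x[1]]
    for _ in range(n_impulses):
        # Random impulse: x3 jumps to a random value in [0.1, 1.0]
        # (an arbitrary range that keeps the short simulation bounded).
        u = rng.uniform(0.1, 1.0) - x[2]
        x = (x[0], x[1], x[2] + u)
        for _ in range(round(gap / dt)):
            x = rk4_step(x, dt)
        products.append(x[0] * x[1])
    return x, products

final_state, products = simulate((1.0, 2.0, 0.5))
# Largest deviation of x1*x2 from its initial value along the trajectory.
drift = max(abs(p - products[0]) for p in products)
print(drift)
```

Whatever impulses are applied to x3, the recorded product stays at its initial value up to integration error, which is the numerical counterpart of the low-dimensional manifold described above.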

Fig. 2.

Autonomous elements constrain the state of microbial communities, characterizing their driver species. a A three-species community with GLV dynamics $\dot{x}_1 = x_1(-1 + x_3)$, $\dot{x}_2 = x_2(1 - x_3)$, $\dot{x}_3 = x_3(-0.5 + 1.5 x_3)$. For actuating $x_3$, we consider the impulsive control scheme with $x_3(t^+) = x_3(t) + u_1(t)$ for $t \in \mathbb{T}$. With this controlled population dynamics, our mathematical formalism reveals the autonomous element $x_1 x_2$ that constrains the state of this microbial community to the low-dimensional manifold $\{x \in \mathbb{R}^3 \mid x_1 x_2 = x_1(0)\,x_2(0)\}$ (gray) for all control inputs. Five state trajectories (in colors) with random control inputs illustrate this fact. Hence, $\{x_3\}$ alone cannot be a set of driver species for this controlled population dynamics. b Including a second control input $u_2(t)$ actuating $x_1$ (i.e., $x_1(t^+) = x_1(t) + u_2(t)$ for $t \in \mathbb{T}$) eliminates the autonomous element, since the state of the microbial community (colors) can explore a three-dimensional space (gray). Hence $\{x_1, x_3\}$ is a minimum set of driver species for this community with GLV dynamics. c We proved that, generically, increasing the complexity of the controlled population dynamics cannot create autonomous elements. In this example, increasing the deformation size $C$ from the GLV in panel (a) (with $C = 0$) to the controlled population dynamics in Fig. 1 (with $C > 0$) eliminates the autonomous element that was present by actuating $x_3$ alone (Example 1 in Supplementary Note 2). Therefore, increasing the complexity of the population dynamics makes $\{x_3\}$ a solo driver species.

In the general case of $N$ species and $M$ control inputs, we define a set of actuated species as a set of driver species if the corresponding controlled population dynamics $\{f, g\}$ lacks autonomous elements. For linear dynamics $\{f(x), g(x)\} = \{Ax, B\}$, $A \in \mathbb{R}^{N \times N}$, $B \in \mathbb{R}^{N \times M}$, the absence of autonomous elements is equivalent to their controllability31, that is, the ability to drive the system between any two states, easily verified using Kalman’s condition $\operatorname{rank}[B, AB, \ldots, A^{N-1}B] = N$. In the case of nonlinear dynamics, the absence of autonomous elements can be characterized using a mathematical formalism based on differential one-forms (see Methods and Supplementary Note 2). For the continuous control scheme of Eq. (2), the conditions for the absence of autonomous elements are well understood as they define when a system is accessible31, a cornerstone concept in nonlinear control theory. Because it is more natural to control microbial communities with impulsive control actions, in this paper we extended the study of autonomous elements to the impulsive control systems of Eq. (3). We first introduced a definition of autonomous elements for impulsive control systems (Definition 3 in Supplementary Note 2). We then characterized necessary and sufficient conditions for the absence of autonomous elements in a controlled population dynamics (Theorem 2 in Supplementary Note 2). To our surprise, the conditions for the absence of autonomous elements for the continuous and the impulsive control schemes are identical (Remark 2 in Supplementary Note 2). This result means that transplantations and bactericides (impulsive control actions) can be as effective as prebiotics and bacteriostatic agents (continuous control actions).
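For the linear case, Kalman’s condition is straightforward to evaluate numerically. The sketch below uses NumPy and a hypothetical three-state chain (x3 drives x2, which drives x1) that is not taken from the paper; it only illustrates how the rank test distinguishes a good choice of actuated state from a poor one.

```python
import numpy as np

def kalman_rank(A, B):
    """Rank of the controllability matrix [B, AB, ..., A^(N-1) B]."""
    n = A.shape[0]
    blocks = [B]
    for _ in range(n - 1):
        blocks.append(A @ blocks[-1])
    return np.linalg.matrix_rank(np.hstack(blocks))

# Hypothetical linear chain: dx1/dt = x2, dx2/dt = x3, dx3/dt = 0.
A = np.array([[0.0, 1.0, 0.0],
              [0.0, 0.0, 1.0],
              [0.0, 0.0, 0.0]])
B_last = np.array([[0.0], [0.0], [1.0]])   # single input actuating x3
B_first = np.array([[1.0], [0.0], [0.0]])  # single input actuating x1

rank_last = kalman_rank(A, B_last)    # full rank: the chain is controllable
rank_first = kalman_rank(A, B_first)  # rank deficient: x2 and x3 are out of reach
print(rank_last, rank_first)
```

Actuating the head of the chain gives full rank, whereas actuating its tail leaves the upstream states unaffected.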

Structural accessibility characterizes the generic absence of autonomous elements

In general, it remains extremely difficult finding a pair {f, g} that models the controlled population dynamics of a microbial community. This fact might suggest it is impossible to predict if the controlled community has autonomous elements or not, making it impossible to identify its driver species. We now show that this seemingly unavoidable limitation can be solved using the topology of the controlled ecological network of the community.

Define the network $G_{f,g} = (X \cup U, E_{f,g} \cup B_{f,g})$ associated with $\{f, g\}$ as follows: $(x_j \to x_i) \in E_{f,g}$ if $x_j$ appears in the right-hand side of $\dot{x}_i$ in Eq. (2) or $x_i(t^+)$ in Eq. (3). Similarly, $(u_j \to x_i) \in B_{f,g}$ if $g_{ij} \not\equiv 0$. Using this definition, we next describe the class $D$ of all possible controlled population dynamics that a controlled microbial community can have, given that we know its $G_c$. Mathematically, $D$ contains all base models $\{f^*, g^*\}$ such that $G_{f^*,g^*} = G_c$, together with all deformations $\{f, g\}$ of each of those base models. The base models characterize the simplest controlled population dynamics that the community can have, leading us to choose them as controlled GLV models with constant susceptibilities:

$$f_i^*(x) = r_i x_i + \sum_{j=1}^{N} a_{ij} x_i x_j, \qquad g_{ij}^*(x) = b_{ij}, \tag{4}$$

for $i = 1, \ldots, N$. The parameters $A = (a_{ij}) \in \mathbb{R}^{N \times N}$, $r = (r_i) \in \mathbb{R}^N$, and $B = (b_{ij}) \in \mathbb{R}^{N \times M}$ represent the interaction matrix, the intrinsic growth rate vector, and the susceptibility matrix of the community, respectively. As the simplest population dynamics, the GLV model has been applied to microbial communities in lakes, soils, and human bodies14,15,20,32–38.

A deformation of $\{f^*, g^*\}$ is any meromorphic pair $\{f, g\}$ such that: (i) $G_{f,g} = G_{f^*,g^*}$; (ii) there exists a finite set of parameters $\theta \in \mathbb{R}^C$ such that $\{f(x), g(x)\} = \{\tilde{f}(x;\theta), \tilde{g}(x;\theta)\}$; and (iii) the identity $\{\tilde{f}(x;0), \tilde{g}(x;0)\} = \{f^*(x), g^*(x)\}$ holds. The smallest integer $C \geq 0$ satisfying these three conditions is called the size of the deformation. A general class of controlled population dynamics are deformations of Eq. (4), including

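$$f_i(x;\theta) = \theta_{i,1} + x_i\left(-r_i - \theta_{i,2} x_i\right)\left(\theta_{i,3} x_i - 1\right) + \sum_{j=1}^{N} \frac{a_{ij} x_i x_j}{1 + \theta_{ij,4} + \theta_{ij,5} x_i + \theta_{ij,6} x_i x_j + \theta_{ij,7} x_j}, \tag{5}$$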

for $i = 1, \ldots, N$. Above, $\theta_{i,1}$ are migration rates from/to neighboring habitats, $\theta_{i,2}^{-1}$ are the carrying capacities of the environment, $\theta_{i,3}^{-1}$ are the Allee constants, and $\{\theta_{ij,k}\}_{k=4}^{7}$ characterize the functional responses39. $\theta_{i,1} > 0$ also models species like C. difficile that sporulate into “inactive” forms and then recover. “Higher-order interactions” (e.g., $\theta_i x_i x_j x_k$) and susceptibilities mediated by species abundance (e.g., $g_{ij}(x;\theta) = b_{ij} + \theta_{ijk} x_k$) are deformations as well.

We call D structurally accessible if almost all of its base models and almost all of their deformations lack autonomous elements. This definition means that except for a zero-measure set of “singularities,” all the controlled population dynamics that the community may take have to lack autonomous elements. The conditions under which D is structurally accessible are fully characterized using our mathematical formalism and they depend only on Gc (see Methods and Supplementary Note 3). Hence, if D is structurally accessible, hereafter we also call Gc structurally accessible. We first proved that, generically, increasing the size of a deformation cannot create autonomous elements (Proposition 1 in Supplementary Note 3). See also Fig. 2c for an illustration. This result reduces the search for autonomous elements to the deformations in D with minimum size C = 0 (i.e., all base models whose graph matches Gc). Finally, we proved that D is structurally accessible if and only if Gc satisfies the following two graph–theoretical conditions: (i) each species is the end-node of a path that starts at a control input node; and (ii) there is a disjoint union of cycles (excluding self-loops) and paths that cover all species nodes (Theorem 3 of Supplementary Note 3). Note that the conditions for structural accessibility depend on the chosen base model.
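The two graph-theoretical conditions can be checked with elementary graph algorithms. The sketch below is one possible reading, not the authors’ implementation: it tests condition (i) by a graph search from the control input nodes and condition (ii) by a maximum bipartite matching that ignores self-loops, assuming (as in the linear case) that the covering paths start at control input nodes. The function names and the edge-list encoding are our own; the example network is read off from the GLV equations of Fig. 2a.

```python
def reachable_from_inputs(species, edges, inputs):
    """Condition (i): every species is the end-node of a path from an input."""
    succ = {}
    for src, dst in edges:
        succ.setdefault(src, set()).add(dst)
    seen, stack = set(), list(inputs)
    while stack:
        node = stack.pop()
        for nxt in succ.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                stack.append(nxt)
    return set(species) <= seen

def max_matching(species, edges):
    """Augmenting-path bipartite matching; returns {target: source}."""
    preds = {s: [] for s in species}
    for src, dst in edges:
        if src != dst and dst in preds:  # self-loops are excluded
            preds[dst].append(src)
    owner = {}  # source node -> species currently matched to it

    def assign(target, visited):
        for src in preds[target]:
            if src in visited:
                continue
            visited.add(src)
            if src not in owner or assign(owner[src], visited):
                owner[src] = target
                return True
        return False

    for target in species:
        assign(target, set())
    return {t: s for s, t in owner.items()}

def is_structurally_accessible(species, edges, inputs):
    """Both conditions: reachability, and every species has a matched in-edge."""
    covered = len(max_matching(species, edges)) == len(species)
    return reachable_from_inputs(species, edges, inputs) and covered

# Network of the Fig. 2a community: x3 affects x1 and x2; each species has a self-loop.
species = ["x1", "x2", "x3"]
interactions = [("x3", "x1"), ("x3", "x2"),
                ("x1", "x1"), ("x2", "x2"), ("x3", "x3")]
one_input = interactions + [("u1", "x3")]
two_inputs = one_input + [("u2", "x1")]

only_x3 = is_structurally_accessible(species, one_input, ["u1"])
x1_and_x3 = is_structurally_accessible(species, two_inputs, ["u1", "u2"])
print(only_x3, x1_and_x3)
```

When every species has a matched incoming edge, the matched edges decompose into disjoint cycles and paths, and each path necessarily starts at a control input node. Actuating x3 alone fails this test, while actuating x1 and x3 passes it, in agreement with the discussion of Fig. 2.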

Structural accessibility is a nonlinear generalization of “structural controllability” for linear systems16. The latter notion has received increasing attention in Network Science17. Interestingly, the two graph–theoretical conditions for structural accessibility are almost the same as those for structural linear controllability16. The key difference is that for structural linear controllability self-loops (corresponding to intrinsic nodal dynamics) can be used to satisfy condition (ii). See Remark 4 in Supplementary Note 3 for more details.

Identifying minimum sets of driver species in microbial communities

The above result provides a complete graph-characterization of driver species: a set of actuated species is a set of driver species (for all but a zero-measure set of controlled population dynamics that the community may have) if and only if its corresponding Gc satisfies the two graph–theoretical conditions. See Fig. 3 for an illustration. With this characterization, one can apply the maximum matching algorithm directly to G to calculate the minimum number of control inputs needed to ensure the structural accessibility of Gc, as was done in the structural linear controllability case17,40. However, this may not provide a minimum set of driver species because one control input may actuate multiple species. Fortunately, we can dedicate one control input to one species. Therefore, we adapted the notion of a “feasible dedicated input configuration”41 and a polynomial-time algorithm (combining maximum matching with a strongly connected component decomposition of G) to identify one minimum set of driver species (Methods and Supplementary Note 4). Note that once Gc is structurally accessible, it cannot lose its structural accessibility when new edges are added to it. This observation implies that the driver species can be identified from an “incomplete” ecological network (e.g., containing only high-confidence interactions).
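As a rough illustration of the matching-based count, the sketch below computes the size of a maximum matching of G (ignoring self-loops) and returns max(N − |matching|, 1). This formula is the one known from the structural linear controllability literature, and using it here is an assumption on our part; the sketch does not implement the dedicated-input algorithm of Supplementary Note 4, and the function names are hypothetical.

```python
def max_matching_size(nodes, edges):
    """Size of a maximum matching of the directed graph, ignoring self-loops."""
    preds = {v: [] for v in nodes}
    for src, dst in edges:
        if src != dst:
            preds[dst].append(src)
    owner = {}  # source node -> node whose incoming edge it is matched to

    def assign(target, visited):
        for src in preds[target]:
            if src in visited:
                continue
            visited.add(src)
            if src not in owner or assign(owner[src], visited):
                owner[src] = target
                return True
        return False

    return sum(assign(v, set()) for v in nodes)

def min_control_inputs(nodes, edges):
    """Number of unmatched nodes, with at least one input always required."""
    return max(len(nodes) - max_matching_size(nodes, edges), 1)

nodes = ["x1", "x2", "x3"]
fig2_edges = [("x3", "x1"), ("x3", "x2")]                 # network of Fig. 2a
cycle_edges = [("x1", "x2"), ("x2", "x3"), ("x3", "x1")]  # hypothetical 3-cycle

n_fig2 = min_control_inputs(nodes, fig2_edges)
n_cycle = min_control_inputs(nodes, cycle_edges)
print(n_fig2, n_cycle)
```

For the Fig. 2a network the count is two, matching the size of the minimum driver set {x1, x3} found above; for a directed cycle a single input suffices.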

Fig. 3.

Identifying driver species. For each network, a minimum set of driver species is shown providing a disjoint union of paths (purple) and cycles (green) covering all species nodes (see Supplementary Table 1 for the species names). Thus, the resulting controlled ecological network is structurally accessible. Self-loops are omitted from these networks to improve readability. a Inferred ecological network of the gut microbiota of germ-free mice pre-colonized with a mixture of human commensal bacterial type strains and then infected with C. difficile (species 7), as in ref. 22. b Inferred ecological network of the core microbiota of the sea sponge Ircinia oros, as in ref. 23.

Driving the driver species

We next calculate the control inputs to be applied to a set of driver species for driving the whole community towards the desired state $x_d$. We will show that it is more efficient to calculate impulsive control inputs. To calculate these impulsive control inputs $\{u(t_k), t_k \in \mathbb{T}\}$ we adopt a model predictive control (MPC) approach42. Based on the current state of the community $x(t_k)$ at $t_k \in \mathbb{T}$, we use knowledge of its controlled population dynamics $\{f, g\}$ to predict the sequence of states $\hat{X}_{k,L} = \{\hat{x}(t_{k+1}), \ldots, \hat{x}(t_{k+L+1})\}$ that the community will take in response to a sequence of $L$ impulsive control inputs $U_{k,L} = \{u(t_k), \ldots, u(t_{k+L-1})\}$. The prediction horizon $L > 0$ determines how far into the future we predict. Then, we choose $u(t_k) = u_1^*(t_k)$, where $u_1^*(t_k)$ is the first element of the optimal control sequence $U_{k,L}^*$ calculated as:

$$U_{k,L}^* = \arg\min_{U_{k,L} \in \mathbb{R}^{M \times L}} J_{x_d}(\hat{X}_{k,L}, U_{k,L}) \quad \text{subject to } U_{k,L} \in \Omega. \tag{6}$$

Here, $\Omega \subseteq \mathbb{R}^{M \times L}$ specifies constraints on the control inputs, and $J_{x_d}$ is some cost function penalizing deviations of the predicted trajectory $\hat{X}_{k,L}$ from $x_d$. For example, the cost function $J_{x_d}(\hat{X}_{k,L}, U_{k,L}) = \|\hat{x}(t_{k+L+1}) - x_d\|$ penalizes the deviations of the predicted final state. By recalculating $U_{k,L}^*$ at each $t_k$ using the actual state of the community, the MPC creates a feedback loop enhancing its robustness against prediction errors42. The prediction horizon can be chosen based on the controlled population dynamics of the community (Methods). For $L = 1$, this methodology is similar to ref. 43. Equation (6) is a finite-dimensional optimization problem that can be solved using algorithms like DIRECT44. Solving the analogous optimization problem for continuous control inputs is more challenging because the optimization is over the infinite-dimensional space of continuous functions.
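The receding-horizon logic of Eq. (6) can be sketched on a deliberately simple example. The code below is not the paper’s implementation: it replaces DIRECT with an exhaustive search over a coarse grid of admissible inputs, uses a hypothetical scalar community whose abundance halves between intervention instants, and sums the deviations of all predicted states rather than penalizing the final state only. All names and numbers are illustrative assumptions.

```python
from itertools import product

def predict(x, inputs, decay=0.5):
    """Hypothetical scalar model: each impulse u is added to the abundance,
    which then relaxes by the factor `decay` before the next instant."""
    states = []
    for u in inputs:
        x = decay * (x + u)
        states.append(x)
    return states

def cost(states, x_d):
    """Sum of absolute deviations of the predicted states from x_d."""
    return sum(abs(s - x_d) for s in states)

def mpc_step(x, x_d, horizon, grid):
    """Search all input sequences of length `horizon` on the grid and
    return only the first input of the best sequence."""
    best = min(product(grid, repeat=horizon),
               key=lambda seq: cost(predict(x, seq), x_d))
    return best[0]

grid = [i / 10 for i in range(-50, 51)]  # admissible inputs (the set Omega)
x, x_d = 3.0, 1.0
applied = []
for _ in range(5):
    u = mpc_step(x, x_d, horizon=2, grid=grid)  # re-optimize from the measured state
    applied.append(u)
    x = predict(x, [u])[0]                      # the "real" system responds
print(applied, x)
```

Only the first element of each optimal sequence is applied before the optimization is repeated from the newly measured state, which is what creates the feedback loop. Exhaustive search scales exponentially with the horizon and the number of inputs, so it is only a stand-in for a proper global optimizer.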

We illustrate the above MPC strategy by driving the microbial community of Fig. 1 with its solo driver species. According to its dynamics, $L = 3$ impulsive control inputs are sufficient (see caption in Fig. 1, and Example 4 in Supplementary Note 5). We chose $J_{x_d}(\hat{X}_{k,L}, U_{k,L}) = \|\hat{x}(t_{k,L}) - x_d\|^2$. Solving Eq. (6) using DIRECT yields the nonlinear MPC strategy $u^*(t_1) = -0.8815$, $u^*(t_2) = 2.0089$ and $u^*(t_3) = -10^{-4}$ (pink in Fig. 4a). We compared its performance with that of two other control strategies. The first strategy uses one transplantation to increase the abundance of the driver species to its desired value, reminiscent of one probiotic administration restoring its “healthy” abundance (purple in Fig. 4a). The second control strategy ignores the driver species, setting the abundance of the two non-driver species to their desired values (blue in Fig. 4a).

Fig. 4.

Success and failure of different control strategies. a Three control strategies for driving the microbial community of Fig. 1a toward the desired state. First, MPC applied to the identified driver species $\{x_3\}$ (pink dots). The second control strategy increases the abundance of the driver species to match its value at the desired state, $x_3(t_1) = x_{3,d}$ (purple dots). The third control strategy does not actuate the driver species, but actuates the other two species $\{x_1, x_2\}$ by setting their abundance to their desired values (i.e., $x_1(t_k) = x_{1,d}$ and $x_2(t_k) = x_{2,d}$, solid and hollow blue dots, respectively). b Response of the microbial community to these three control strategies. Here and in panel (d), the “jumps” produced by the control inputs are depicted by dashed lines. The equilibria of the population dynamics are shown as gray dots. Only the MPC applied to the driver species succeeds in driving the community to $x_d$. c Control strategies obtained by using the linear MPC with parameters $Q = \operatorname{diag}(20, 1, 10)$ and different values for $R$: $10^{-4}$ (pink), $10^{-3}$ (green), $10^{-2}$ (blue). d Trajectories of the controlled community using the linear MPC strategies described in panel (c). Colors correspond to the different values of $R$.

Among the above three control strategies, only the nonlinear MPC applied to the driver species succeeds (Fig. 4b). This strategy succeeds in a somewhat unconventional way: although the driver species is more abundant in the desired state than in the initial state, the first control action decreases its abundance further. Such control action lets the non-driver species reach their desired abundances and, once that happens, the abundance of the driver species is finally increased to its desired value (pink in Fig. 4b). Just restoring the abundance of the driver species succeeds in driving x2 and x3, but it fails to drive x1 to the desired abundance (purple in Fig. 4b). Ignoring the driver species is the worst control strategy, failing to drive any of the three species to their desired values (blue in Fig. 4b). This toy example demonstrates the advantage of identifying and actuating driver species.

Driving large communities with uncertain dynamics

Solving the non-convex optimization problem of Eq. (6) is challenging as N or L increase, and it also requires knowing {f, g}, which may be impossible for large communities. We next circumvent these two drawbacks leveraging the network underlying the controlled microbial community.

Suppose we can obtain a weighted adjacency matrix $\hat{A} \in \mathbb{R}^{N \times N}$ from $G$, providing a proxy for its interaction matrix. Without additional knowledge of the community, we just assume that we can increase or decrease the abundance of each driver species. We thus use $\hat{B} \in \{0,1\}^{N \times M}$ as a proxy for the susceptibility matrix, with $\hat{b}_{ij} = 1$ if the $j$-th control input actuates the $i$-th driver species. By rewriting $\{f(x), g(x)\} = \{\hat{A}x + w_x, \hat{B} + w_u\}$, we use $\{\hat{A}x, \hat{B}\}$ to provide a linear prediction for the response of the community to the control inputs. Here, $w_x = f - \hat{A}x$ and $w_u = g - \hat{B}$ are considered as “perturbations”. Using $\{\hat{A}x, \hat{B}\}$, we design a linear MPC by solving Eq. (6) with the quadratic cost function

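$$J_{x_d}(\hat{X}_{k,\infty}, U_{k,\infty}) = \sum_{i=k}^{\infty} \left[\hat{x}(t_i) - x_d\right]^\top Q \left[\hat{x}(t_i) - x_d\right] + u(t_i)^\top R\, u(t_i).$$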

Above, the positive definite matrices $Q = Q^\top \in \mathbb{R}^{N \times N}$ and $R = R^\top \in \mathbb{R}^{M \times M}$ are design parameters. $Q$ penalizes the deviations of the predicted trajectory from the desired state, and $R$ penalizes the magnitude of the control inputs. Under this scenario, Eq. (6) can be solved in closed form45, yielding the linear MPC $u(t_k) = Kx(t_k)$, where $K \in \mathbb{R}^{M \times N}$ is obtained from the solution of a Riccati equation (Supplementary Note 6). Since the Riccati equation can be efficiently solved for large $N$, the linear MPC can be calculated for large communities. This linear MPC is robust against $(w_x, w_u)$ and it allows calculating the control inputs for the continuous control scheme (Supplementary Note 6). However, its performance strongly depends on the chosen $(\hat{A}, \hat{B})$ and the distance to the desired state (Supplementary Note 6).
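To make the Riccati step concrete, the sketch below computes a feedback gain for a generic discrete-time linear model $z_{k+1} = A z_k + B u_k$, where $z$ is assumed to be the deviation from the desired state. How the paper discretizes the impulsive dynamics is described in Supplementary Note 6 and is not reproduced here; the matrices, the fixed-point iteration, and the function name are illustrative assumptions, and NumPy is used for the linear algebra.

```python
import numpy as np

def lqr_gain(A, B, Q, R, iterations=500):
    """Iterate the discrete-time Riccati recursion and return the gain K
    such that u = K z minimizes the sum of z'Qz + u'Ru."""
    P = Q.copy()
    for _ in range(iterations):
        G = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ G)
    return -np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

# Hypothetical two-state model: the first state is unstable and is only
# reachable through the second state, which is the actuated one.
A = np.array([[1.1, 0.2],
              [0.0, 0.9]])
B = np.array([[0.0],
              [1.0]])
Q = np.eye(2)           # penalty on deviations from the desired state
R = 1e-2 * np.eye(1)    # penalty on the control magnitude

K = lqr_gain(A, B, Q, R)
closed_loop = A + B @ K
spectral_radius = max(abs(np.linalg.eigvals(closed_loop)))
print(K, spectral_radius)
```

A spectral radius below one means the closed loop drives the deviation to zero for this linear model. Each Riccati iteration costs a few matrix products, which is why the gain remains computable for large N.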

We applied the linear MPC for driving the toy three-species community of Fig. 1, assuming its dynamics is uncertain. Considering the ecological network of this community and its nonlinear population dynamics, we chose $\hat{A} = (-0.5, 0, -0.1;\ 0, -5, 1;\ 0, 0, -1)$ as a proxy for its interaction matrix. Here $\hat{A}$ is a rough approximation of the linearization of the population dynamics at the desired state, which is given by $(-0.37, 0, -0.05;\ 0, -5.31, 0.52;\ 0, 0, -1)$. Choosing $Q = \operatorname{diag}(20, 1, 10)$, we compared the performance of three different linear MPCs obtained with $R = 10^{-4}, 10^{-3}, 10^{-2}$ (Fig. 4c). For $R = 10^{-4}$, without using knowledge of the population dynamics, the performance of the linear MPC (pink in Fig. 4d) is very similar to the performance of the nonlinear MPC that uses full knowledge of the nonlinear population dynamics (pink in Fig. 4b). This success illustrates the robustness of the linear MPC against the perturbations. As $R$ increases, the performance of the linear MPC deteriorates (green and blue in Fig. 4d).
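The reported linearization can be reproduced approximately with a finite-difference Jacobian. In the sketch below, the right-hand sides in `f` are our transcription of the Fig. 1 dynamics (reading the rational terms as x1/5, x1/3, x2/4, x3/2 and a denominator 1 + x3), and the desired state is the rounded value quoted in the caption, so small discrepancies with the reported entries are expected. The function names and the step size are illustrative.

```python
def f(x):
    """Uncontrolled population dynamics of the Fig. 1 toy community."""
    x1, x2, x3 = x
    return [
        0.1 + x1 * (1 - x1 / 5) * (x1 / 3 - 1) - 0.1 * x1 * x3 / (1 + x3),
        0.1 + x2 * (1 - x2 / 4) * (x2 - 1) + x2 * x3 / (1 + x3),
        x3 * (1 - x3 / 2) * (x3 - 1),
    ]

def jacobian(func, x, h=1e-6):
    """Central finite-difference Jacobian of func at x."""
    n = len(x)
    J = [[0.0] * n for _ in range(n)]
    for j in range(n):
        up, down = list(x), list(x)
        up[j] += h
        down[j] -= h
        f_up, f_down = func(up), func(down)
        for i in range(n):
            J[i][j] = (f_up[i] - f_down[i]) / (2 * h)
    return J

x_d = [4.57, 4.73, 2.0]  # desired state, rounded as in the Fig. 1 caption
J = jacobian(f, x_d)
reported = [[-0.37, 0.0, -0.05],
            [0.0, -5.31, 0.52],
            [0.0, 0.0, -1.0]]
# Largest entry-wise gap between the numerical and the reported linearization.
max_gap = max(abs(J[i][j] - reported[i][j]) for i in range(3) for j in range(3))
# Residual of the dynamics at x_d; it should be close to zero at an equilibrium.
residual = max(abs(v) for v in f(x_d))
print(max_gap, residual)
```

Both the entry-wise gap and the equilibrium residual come out at the level of a few hundredths, consistent with the two-decimal rounding of the desired state.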

Numerical validation on large uncertain microbial communities

To validate our control framework for large communities, we built communities of $N = 100$ species having random directed Erdős–Rényi ecological networks with connectivity $c \in [0, 1]$ (see Fig. 5a). The network edge-weights were chosen from a normal distribution with zero mean and standard deviation $\sigma > 0$, where $\sigma$ characterizes the typical interspecies interaction strength. Negative self-loops with weights $-1$ were added to each species. We used this ecological network to identify the driver species of the community, and its corresponding weighted adjacency matrix as the interaction matrix to construct the linear MPC. We simulated the population dynamics of these communities using Eq. (5), ensuring all share $x_d \in \mathbb{R}^N$ as an equilibrium. The resulting communities have nonlinear population dynamics, and their linearization at the desired state is different from the interaction matrix used for the linear MPC (Supplementary Note 8).
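A random community of this kind can be generated in a few lines. The sketch below assumes that each ordered pair of distinct species is connected independently with probability c; the function name, the seed, and the convention that entry (i, j) encodes the edge from species j to species i are our own choices.

```python
import random

def random_interaction_matrix(n, c, sigma, seed=None):
    """Weighted adjacency matrix of a directed Erdos-Renyi network:
    each off-diagonal entry is nonzero with probability c and drawn
    from N(0, sigma); every diagonal entry (self-loop) is -1."""
    rng = random.Random(seed)
    A = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i == j:
                A[i][j] = -1.0
            elif rng.random() < c:
                A[i][j] = rng.gauss(0.0, sigma)  # effect of species j on species i
    return A

A = random_interaction_matrix(n=100, c=0.03, sigma=0.8, seed=1)
n_edges = sum(1 for i in range(100) for j in range(100)
              if i != j and A[i][j] != 0.0)
print(n_edges)  # fluctuates around c * n * (n - 1) = 297
```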

Fig. 5.

Numerical validation on large microbial communities. a Example of the ecological network for a random microbial community with $N = 100$ species ($c = 0.03$). A minimum set of $M = 6$ driver species is shown in purple. The desired state is chosen as $x_d = (1, \ldots, 1)$. b With a random initial abundance $x_0$ at distance $d = 0.4$ from the desired state, the uncontrolled microbial community does not reach $x_d$. c, d For the same community and initial abundance as in panel (b), we apply the control input generated by the linear MPC (panel c) to the six identified driver species. This control strategy drives the community to $x_d$ (panel d). e, f, h Mean success rate as a function of $d$. Error bars denote the standard error of the mean. Parameters are: $c = 0.025$, $\theta_{\max} = 0.05$ for panel (e), $\sigma = 0.8$, $\theta_{\max} = 0.05$ for panel (f), and $c = 0.025$, $\sigma = 0.8$, $\theta_{\max} = 0.05$ for panel (h). g Success rate as a function of the proportion $M/N$ of driver species. Black dots show the success rate of 7700 random communities. Pink shows the mean success rate.

To quantify the success of our control framework on a given community, we generated 300 initial species abundances that are uniformly distributed at a distance d > 0 from $x_d$. The success rate at distance d is defined as the proportion of those initial conditions that are driven to $x_d$ only when the linear MPC is applied to a minimum set of driver species of the community (Fig. 5b–d). Namely, we discard all initial conditions that naturally evolve to $x_d$. Finally, we calculated the mean success rate by averaging the success rate over 100 random communities (see Supplementary Note 8 for details).
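The sampling and scoring steps above can be sketched as follows. This is an illustrative reading of the text, not the authors' procedure: the Euclidean distance, the helper names `sample_at_distance` and `success_rate`, and the choice to compute the proportion over the initial conditions that remain after discarding the naturally converging ones are all assumptions. The simulations that decide whether a trajectory reaches the desired state are represented only by Boolean flags.

```python
import math
import random


def sample_at_distance(x_d, d, rng):
    """Draw a point uniformly on the sphere of radius d centred at x_d.

    A vector of independent standard normals, once normalised, is
    uniformly distributed on the unit sphere.
    """
    g = [rng.gauss(0.0, 1.0) for _ in x_d]
    norm = math.sqrt(sum(v * v for v in g))
    return [xi + d * v / norm for xi, v in zip(x_d, g)]


def success_rate(reached_with_control, reached_without_control):
    """Proportion of initial conditions driven to x_d only under control.

    Initial conditions that reach x_d without control are discarded first
    (assumed interpretation). Returns None if nothing remains.
    """
    kept = [with_c for with_c, without_c
            in zip(reached_with_control, reached_without_control)
            if not without_c]
    if not kept:
        return None
    return sum(kept) / len(kept)


rng = random.Random(0)
x_d = [1.0] * 100                      # desired state used in Fig. 5
initial_conditions = [sample_at_distance(x_d, 0.4, rng) for _ in range(300)]

# Toy flags for four initial conditions (hypothetical simulation outcomes).
print(success_rate([True, False, True, True], [False, False, True, False]))
```

In the toy call, the third initial condition is discarded because it converges without control, and two of the remaining three are driven to the desired state.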

The mean success rate is close to 1 for small d regardless of the community’s parameters (Fig. 5e, f), confirming the theoretical guarantee that the linear MPC succeeds if d is small enough. The mean success rate decreases as σ increases, especially for large distances (Fig. 5e). Since increasing σ damages the stability of the population dynamics46, this result suggests that microbial communities become “harder” to control as they lose stability. The mean success rate is higher in communities with low connectivity (Fig. 5f). In general, the size of a minimum set of driver species increases as c decreases, indicating that the success rate increases as the number of driver species increases. Indeed, regardless of d, our control framework attains a mean success rate >0.8 provided that at least 6 of the 100 species are driver species (Fig. 5g). This result suggests that the success rate can be enhanced by actuating a few additional species. Finally, to investigate the robustness of our control framework to errors in the ecological network, we randomly rewired each of its edges with probability p ∈ [0,1] (e.g., p = 0.05 corresponds to a 5% error). The success rate deteriorates but remains larger than zero despite large errors (Fig. 5h), showing the robustness of our control framework. However, a 5% error decreases the mean success rate by about 30%, emphasizing the importance of accurately mapping ecological networks for controlling microbial communities.
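One possible way to implement the random rewiring used in the robustness test is sketched below. The text does not specify the rewiring rule, so the rule used here is an assumption: each edge is, with probability p, replaced by an edge between a uniformly chosen ordered pair of distinct species that is not currently connected. The function name `rewire_edges` is hypothetical, and the sketch assumes a sparse network so that a free pair always exists.

```python
import random


def rewire_edges(edges, n, p, seed=None):
    """Rewire each directed edge with probability p (assumed rule).

    edges: iterable of (source, target) pairs without self-loops.
    Returns a sorted list with the same number of edges.
    """
    rng = random.Random(seed)
    current = set(edges)
    for edge in sorted(current):           # iterate over the original edges
        if rng.random() < p:
            while True:                    # draw a free ordered pair
                new = (rng.randrange(n), rng.randrange(n))
                if new[0] != new[1] and new not in current:
                    break
            current.remove(edge)
            current.add(new)
    return sorted(current)


original = [(0, 1), (1, 2), (2, 3), (3, 4), (4, 0)]
perturbed = rewire_edges(original, n=20, p=0.05, seed=3)
print(len(perturbed))  # the number of edges is preserved
```

The perturbed edge list would then be used to identify driver species and build the controller, while the simulated community keeps the original network.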

Application

We analyzed the ecological network of the gut microbiota of germ-free mice that were pre-colonized with a mixture of human commensal bacterial type strains and then infected with C. difficile spores22. In Fig. 3a, we identified a minimum set of five driver species in this 14-species community: Ruminococcus obeum ($x_1$), Proteus mirabilis ($x_{12}$), Bacteroides ovatus ($x_2$), Clostridium ramosum ($x_6$), and Akkermansia muciniphila ($x_{10}$). We also used the ecological network underlying the core microbiota of the sea sponge I. oros23, finding ten driver species in this 20-species community (Fig. 3b).

We studied by simulation the efficacy of the identified driver species and the linear MPC for driving these two microbial communities, assuming that their dynamics are uncertain (see Supplementary Note 7 for details of the simulation). For the mouse gut microbiota, our framework succeeds in driving the community from an initial state where C. difficile is overabundant towards the desired state with a better balance of species (Fig. 6a, c). Similar results were obtained for controlling the core microbiota of I. oros (Fig. 6b, d). These results show again that the linear MPC method is robust enough to drive nonlinear microbial communities.

Fig. 6.


Controlling host-associated microbial communities. The controlled population dynamics of both microbial communities were simulated using the controlled GLV equations (see Supplementary Note 7 for details). The intrinsic growth rates were adjusted such that the community has an initial “diseased” equilibrium state $x_0$ in which one species (C. difficile for the mouse gut microbiota) is overabundant compared to the rest of the species. We chose the desired state $x_d$ as another equilibrium with a more balanced abundance profile. For each microbial community, we used the minimum set of driver species identified in Fig. 3. a, b Control inputs obtained using the linear MPC for the impulsive and continuous control schemes. c, d Projection of the high-dimensional abundance profiles (states of the microbial communities) into their first three principal components (PCs). See Supplementary Figure 1 for the temporal response of each species. The calculated control strategies applied to the driver species succeed in driving the community to the desired state, using either continuous or impulsive control.

Discussion

Our theoretical framework allows us to systematically and efficiently drive microbial communities towards desired states by identifying their driver species. Identifying the driver species of a microbial community only requires knowledge of its underlying ecological network. Note that there could be multiple different minimum sets of driver species for the same community. If the cost of choosing any species as a driver species is known, a combinatorial optimization scheme will allow selecting the best minimum driver species set. We emphasize that the driver species discussed here may not coincide with other notions in ecology such as keystone47,48 or core49 species. For example, the selection of driver species does not directly depend on their abundances, while the selection of keystone species does47.

For large uncertain communities, the linear model predictive controller gives a robust and efficient way to calculate the control inputs. The performance of this controller could be further improved by modeling the susceptibility of species to the control actions (e.g., pharmacokinetics). In such a case, different control actions could be modeled by different pairs {f, g}, making the conditions for the absence of autonomous elements different for continuous and impulsive control actions. Control algorithms based on reinforcement learning50 (RL) could provide even better performance. Our characterization of minimum sets of driver species will help to efficiently apply those control algorithms to microbial communities, as RL algorithms require specifying the “driver variables” they can actuate51. Here, controlling small synthetic communities could provide valuable insights for designing such controllers. We also note that altering the ecological network or obtaining a “simplified” network, in the spirit of refs. 52,53, could be complementary control approaches (e.g., for reducing the minimum number of driver species).

It has been suggested that the success of ecosystem management strategies could be predicted using the notion of controllability54. However, this notion is somewhat inadequate for microbial communities and other biological systems. By their nature, biological systems cannot be fully controllable because there are states they cannot reach (e.g., states with negative abundances). Furthermore, since dynamic models for microbial communities are nonlinear and uncertain, it is impossible even to test if those systems are controllable. Structural accessibility overcomes these two limitations, generalizing the notion of accessibility31 to systems with uncertain dynamics. Counterintuitively, our mathematical formalism suggests that communities with more complicated population dynamics (i.e., deformations with larger size) require fewer driver species. However, using fewer driver species could complicate the design of control strategies (Remark 9 in Supplementary Note 5). Indeed, by choosing an adequate base model55 and making mild assumptions on the dynamics (i.e., f and g are meromorphic functions), our framework can identify minimum sets of “driver variables” for general nonlinear systems when their underlying networks are known (see Supplementary Note 9 for an example of a small gene regulatory network).

There are two limitations in our current framework for controlling microbial communities. First, stochastic effects are considered negligible. Incorporating stochastic effects yields stochastic differential equations for which the notion of autonomous elements still needs to be mathematically formulated. We anticipate that this is quite challenging, but definitely merits further studies. Second, our current framework does not explicitly model the dynamics of resources provided to and/or chemicals produced by the microbial species56–63. Our characterization of driver species only applies to some instances of resource-based models, e.g., the classical MacArthur’s consumer-resource model64 when the resource dynamics is much faster than the species dynamics65. For general resource-based models, identifying their driver species requires analyzing a new kind of “output accessibility” that characterizes the absence of autonomous elements in the species abundances and ignores autonomous elements in the resource abundances. Then, the notion of “structural output accessibility” (i.e., generic output accessibility given an adequate base model) would provide a nonlinear counterpart of linear target controllability66. Structural output accessibility could allow us to identify driver species and/or “driver resources” of a community from knowing the bipartite interaction network of species and resources. This is beyond the scope of this work and deserves dedicated efforts.

To fully harvest the benefits of controlling microbial communities, a stronger synergy between microbial ecology and control theory is necessary. We hope that this work will catalyze new interdisciplinary approaches that enhance our ability to control complex microbial communities inside and around us.

Methods

Detecting autonomous elements in the continuous control scheme

For the continuous control systems of Eq. (2), the notion of autonomous elements and the conditions for their absence are well understood, since they define when a system is accessible (see Supplementary Note 2.2 and ref. 31 for details). An autonomous element for Eq. (2) is a non-constant function $\xi(x)$ such that there exist an integer ν ≥ 0 and a meromorphic function F such that $F(\xi, \dot{\xi}, \ldots, \xi^{(\nu)}) = 0$. In words, an autonomous element ξ is an “internal variable” of the system that evolves completely unaffected by the control inputs. System (2) is said to be accessible if it has no autonomous element31.

The absence of autonomous elements can be characterized by using a mathematical formalism based on differential one-forms31. Consider the set $\mathcal{K}$ of meromorphic functions in the variables $\{x, u, \dot{u}, \ddot{u}, \ldots\}$, and the sets of differential symbols $\mathrm{d}x = (\mathrm{d}x_1, \ldots, \mathrm{d}x_N)$ and $\mathrm{d}u = (\mathrm{d}u_1, \ldots, \mathrm{d}u_M)$. Let $\mathcal{X} = \mathrm{span}_{\mathcal{K}}\{\mathrm{d}x\}$ be the vector space spanned over $\mathcal{K}$ by the elements of $\mathrm{d}x$, intuitively playing the role of “all functions of state variables”. Any $\omega \in \mathcal{X}$ is a “one-form”31 (see Supplementary Note 2.1 for details). In this setting, the chain rule provides a way to formally operate with one-forms, such as taking time derivatives: if $\omega = \beta\,\mathrm{d}x$ then $\dot{\omega} := \dot{\beta}\,\mathrm{d}x + \beta\,\mathrm{d}\dot{x}$. To identify the presence of autonomous elements in the dynamics with continuous control of Eq. (2), one calculates the sequence of subspaces $\mathcal{H}_k \subseteq \mathcal{X}$ defined recursively by

$$\mathcal{H}_{k+1} = \{\omega \in \mathcal{H}_k \mid \dot{\omega} \in \mathcal{H}_k\}, \quad k \geq 1, \tag{7}$$

starting with $\mathcal{H}_1 = \mathcal{X}$. Then, one can prove that Eq. (2) lacks autonomous elements if and only if there exists an integer $k^*$ such that $\mathcal{H}_{k^*} = \{0\}$; see ref. 31 (page 49, Thm. 3.17).
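To build intuition for this recursion, it helps to look at a linear single-input system $\dot{x} = Ax + bu$, which is a simplification and not the GLV dynamics studied in this paper. For a constant row vector $w$, the one-form $\omega = w\,\mathrm{d}x$ has derivative $\dot{\omega} = wA\,\mathrm{d}x + wb\,\mathrm{d}u$, so $\omega$ stays in the next subspace only if $wb = 0$, then $wAb = 0$, and so on. The dimension of the k-th subspace is therefore $N$ minus the rank of $[b, Ab, \ldots, A^{k-2}b]$, and the sequence reaches $\{0\}$ exactly when the Kalman rank condition holds. The sketch below computes these dimensions with exact arithmetic; the function names are hypothetical.

```python
from fractions import Fraction


def rank(rows):
    """Rank of a matrix (list of rows) by exact Gaussian elimination."""
    m = [[Fraction(v) for v in row] for row in rows]
    rk = 0
    for col in range(len(m[0]) if m else 0):
        pivot = next((r for r in range(rk, len(m)) if m[r][col] != 0), None)
        if pivot is None:
            continue
        m[rk], m[pivot] = m[pivot], m[rk]
        for r in range(len(m)):
            if r != rk and m[r][col] != 0:
                factor = m[r][col] / m[rk][col]
                m[r] = [a - factor * b for a, b in zip(m[r], m[rk])]
        rk += 1
    return rk


def h_dims(A, b):
    """Dimensions of H_1, H_2, ... for the linear system x' = A x + b u.

    dim H_k = N - rank([b, Ab, ..., A^(k-2) b]) for k >= 2.
    Stops when the sequence reaches zero or stabilises.
    """
    n = len(A)
    dims = [n]                 # H_1 is spanned by dx_1, ..., dx_N
    columns, v = [], list(b)
    while True:
        columns.append(v)
        dims.append(n - rank(columns))
        if dims[-1] == 0 or dims[-1] == dims[-2]:
            return dims
        v = [sum(a * x for a, x in zip(row, v)) for row in A]  # next column A v


# Chain x1 -> x2 -> x3 with the input acting on x1.
A = [[0, 0, 0],
     [1, 0, 0],
     [0, 1, 0]]
b = [1, 0, 0]
print(h_dims(A, b))  # [3, 2, 1, 0]
```

In this chain the dimension drops by one at each step until the sequence reaches $\{0\}$, so the linear system has no autonomous element. If the last two dimensions were instead equal and nonzero, the remaining one-forms would correspond to autonomous elements.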

Detecting autonomous elements in the impulsive control scheme

For the impulsive control systems of Eq. (3), the notions of autonomous elements and accessibility are rather unexplored. Recall that an autonomous element is an internal variable of the system that is completely unaffected by the control actions. To introduce a suitable definition of autonomous element for the impulsive control systems, note that the control inputs cause “jumps” in the actuated variables (i.e., discontinuities). These jumps are propagated to other state variables by the continuous dynamics. Thus, we define an autonomous element of Eq. (3) as a non-constant function $\xi(x)$ such that $\xi(x(t))$, $t \in \mathbb{R}$, is a $C^\infty$ function (i.e., infinitely differentiable function) under any impulsive input (see Supplementary Note 2.3 for details). By analogy to the case of continuous control, we say that system (3) is accessible if it has no autonomous element according to the above definition.

To characterize the accessibility of impulsive control systems, we built the sequence of subspaces $\mathcal{H}_k$ of all functions of the state variables that can be differentiated at least (k − 1) times (see details in Supplementary Note 2.3). The functions belonging to the limit $\mathcal{H}_\infty$ are the autonomous elements of the system, since they are completely unaffected by the control inputs. Consequently, because the limit subspace $\mathcal{H}_\infty$ is also “integrable” (informally, it does not contain “fictitious” autonomous elements), accessibility is equivalent to the condition $\mathcal{H}_\infty = \{0\}$ (see Theorem 2 in Supplementary Note 2). We further prove that the limit $\mathcal{H}_\infty$ is attained in a finite step (i.e., there exists a finite $k^*$ such that $\mathcal{H}_{k^*} = \mathcal{H}_{k^*+1} = \cdots = \mathcal{H}_\infty$).

We illustrate the above formalism using the three-species microbial community of Fig. 1 where $x_3$ is the actuated species (see caption for its population dynamics). To compute the sequence $\mathcal{H}_k$, one starts by definition with $\mathcal{H}_1 = \mathrm{span}_{\mathcal{K}}\{\mathrm{d}x_1, \mathrm{d}x_2, \mathrm{d}x_3\}$. Next, $\mathcal{H}_2$ consists of all one-forms in $\mathcal{H}_1$ that can be differentiated once (i.e., they are continuous, so they are not directly affected by u). Because u actuates $x_3$, we get $\mathcal{H}_2 = \mathrm{span}_{\mathcal{K}}\{\mathrm{d}x_2, \mathrm{d}x_1\}$. Similarly, $\mathcal{H}_3$ consists of all those one-forms in $\mathcal{H}_2$ that can be differentiated twice (i.e., their first derivative is continuous), yielding $\mathcal{H}_3 = \mathrm{span}_{\mathcal{K}}\{x_2\,\mathrm{d}x_1 + x_1\,\mathrm{d}x_2\}$. Finally, $\mathcal{H}_4 = \{0\}$ (see details in Example 1 in Supplementary Note 2). This implies that the controlled population dynamics is free of autonomous elements and hence it is accessible. See also Example 2 in Supplementary Note 2 for a community with autonomous elements.

Detecting autonomous elements without knowledge of the population dynamics

When the controlled population dynamics of the microbial community is unknown, we consider the class $\mathcal{D}$ of all controlled dynamics that the community may have, given that we know its controlled ecological network. Identifying the presence of autonomous elements in the full class $\mathcal{D}$ becomes possible thanks to so-called “generic properties” of meromorphic functions31. This is a mathematical property implying that a meromorphic function will satisfy a certain condition in almost all points of its domain (that is, everywhere except for a zero-measure set of “singularities”), provided that such a condition holds at a single point. We exploited this property to prove that, generically, increasing the size C of a deformation cannot create new autonomous elements (Proposition 1 in Supplementary Note 3). See Fig. 2c for an illustration. This result allows us to only search for autonomous elements on the subset $\mathcal{D}_0 \subseteq \mathcal{D}$ of all $\{f, g\} \in \mathcal{D}$ with size C = 0, corresponding to all base controlled GLV models of Eq. (4). Finally, we proved that the generic absence of autonomous elements in $\mathcal{D}_0$ can be determined only from the topology of the controlled ecological network $\mathcal{G}_c$ (Theorem 3 in Supplementary Note 3).

Identifying a minimum set of driver species

Let $\tilde{\mathcal{G}}(X)$ be the subgraph obtained by removing all self-loops from the ecological network $\mathcal{G}(X)$ of the (uncontrolled) community. Let $\tilde{\mathcal{B}}(X^-, X^+)$ be the bipartite representation of $\tilde{\mathcal{G}}(X)$, built by placing the edge $(x_j^+, x_i^-)$ in $\tilde{\mathcal{B}}$ if the directed edge $(x_j \to x_i)$ is in $\tilde{\mathcal{G}}$. Then, to identify a minimum set of driver species, we applied the notion of a “dedicated input configuration” introduced in ref. 41 (see details in Supplementary Note 4).

A strongly connected component (SCC) is said to be “non-top linked” if it has no incoming edges from other SCCs. Let $M^*$ be a maximum matching in $\tilde{\mathcal{B}}$. Then, a non-top linked SCC is said to be “top assignable” with respect to $M^*$ if it contains at least one right-unmatched node in $M^*$. Let Z ⊆ X be the set of right-unmatched nodes of some maximum matching of $\tilde{\mathcal{B}}$ with maximum top assignability. Let W ⊆ X be a set consisting of one state node from each non-top linked SCC of $\tilde{\mathcal{G}}$ not already present in Z. Then, we prove that $X_D$ ⊆ X is a minimum set of driver species if and only if there exist two disjoint subsets Z and W as defined above, such that $X_D$ = Z ∪ W (Proposition 3 in Supplementary Note 4). Using this result, we applied Algorithm 1 of ref. 41 to $\tilde{\mathcal{G}}$ to obtain a minimum set of driver species. This algorithm is implemented in Julia as the DriverSpecies function in the DriverSpeciesModule package. This algorithm is illustrated for communities of N = 100 species in Fig. 5a and Supplementary Fig. 2.
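The two ingredients of this construction, a maximum matching of the bipartite representation and the non-top linked SCCs, can be sketched in a few lines of Python. This is a simplified illustration and not the Julia DriverSpecies function or Algorithm 1 of ref. 41: it uses an arbitrary maximum matching rather than one with maximum top assignability, so the returned set satisfies the Z ∪ W construction but may be larger than a minimum set. All function names are hypothetical.

```python
def maximum_matching(n, edges):
    """Augmenting-path matching; match_in[i] = j means edge j -> i is matched."""
    adj = [[] for _ in range(n)]
    for j, i in edges:
        adj[j].append(i)
    match_in = [None] * n

    def augment(j, seen):
        for i in adj[j]:
            if i not in seen:
                seen.add(i)
                if match_in[i] is None or augment(match_in[i], seen):
                    match_in[i] = j
                    return True
        return False

    for j in range(n):
        augment(j, set())
    return match_in


def strongly_connected_components(n, edges):
    """Kosaraju's algorithm; returns (component index per node, count)."""
    adj = [[] for _ in range(n)]
    radj = [[] for _ in range(n)]
    for j, i in edges:
        adj[j].append(i)
        radj[i].append(j)
    seen, order = [False] * n, []

    def dfs_forward(v):
        seen[v] = True
        for w in adj[v]:
            if not seen[w]:
                dfs_forward(w)
        order.append(v)

    def dfs_backward(v, c):
        comp[v] = c
        for w in radj[v]:
            if comp[w] == -1:
                dfs_backward(w, c)

    for v in range(n):
        if not seen[v]:
            dfs_forward(v)
    comp, count = [-1] * n, 0
    for v in reversed(order):
        if comp[v] == -1:
            dfs_backward(v, count)
            count += 1
    return comp, count


def driver_species_sketch(n, edges):
    """Driver set Z ∪ W from an arbitrary maximum matching (not guaranteed minimum)."""
    edges = [(j, i) for j, i in edges if j != i]           # remove self-loops
    match_in = maximum_matching(n, edges)
    Z = {i for i in range(n) if match_in[i] is None}       # right-unmatched nodes
    comp, count = strongly_connected_components(n, edges)
    has_incoming = [False] * count
    for j, i in edges:
        if comp[j] != comp[i]:
            has_incoming[comp[i]] = True
    covered = {comp[i] for i in Z}
    W = set()
    for v in range(n):                                     # one node per uncovered
        if not has_incoming[comp[v]] and comp[v] not in covered:  # non-top linked SCC
            W.add(v)
            covered.add(comp[v])
    return Z | W


print(sorted(driver_species_sketch(3, [(0, 1), (1, 2)])))  # chain 0 -> 1 -> 2: [0]
```

For a directed cycle, the matching is perfect and Z is empty, but the single SCC is non-top linked, so one species is added through W. The recursive searches are adequate for networks of about a hundred species; larger networks would call for iterative versions.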

Choosing the prediction horizon

To choose the prediction horizon L for the nonlinear MPC, we proved that there are two possible cases (Theorem 4 in Supplementary Note 5). In the first case, the community can be driven to $x_d$ using L < ∞ impulsive control inputs. In the second case, the community can only be asymptotically driven to $x_d$, meaning that $L \in \mathbb{N}$ should be chosen sufficiently large. This second case could be circumvented by increasing the number of actuated species (Remark 8 in Supplementary Note 5).

Reporting summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this article.

Code availability

A Julia implementation of the algorithm for identifying a minimum set of driver species, as well as all other functions necessary to reproduce the results of the paper, is provided at the GitHub repository: https://github.com/mtangulo/DriverSpecies.

Supplementary information

Peer Review File (704.1KB, pdf)
Reporting Summary (67.1KB, pdf)

Acknowledgements

M.T.A. acknowledges the financial support from CONACyT, Mexico, and L2SN, France. The authors thank Jorge X. Velasco, Jorge Zañudo, Jean-Jacques Slotine, Chuliang Song, and Yandong Xiao for valuable comments and discussions.

Author contributions

Y.-Y.L. initiated the project. M.T.A. and Y.-Y.L. conceived and designed the project together. M.T.A. and C.H.M. did the theoretical analysis. M.T.A. did the numerical analysis. M.T.A. and Y.-Y.L. wrote the manuscript. C.H.M. edited the manuscript.

Data availability

All the experimental datasets analyzed in this study are publicly available.

Competing interests

The authors declare no competing interests.

Footnotes

Journal peer review information: Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Marco Tulio Angulo, Email: mangulo@im.unam.mx.

Yang-Yu Liu, Email: yyl@channing.harvard.edu.

Supplementary information

Supplementary Information accompanies this paper at 10.1038/s41467-019-08890-y.

References

  • 1. Pepper JW, Rosenfeld S. The emerging medical ecology of the human gut microbiome. Trends Ecol. Evol. 2012;27:381–384. doi: 10.1016/j.tree.2012.03.002.
  • 2. DeLeon-Rodriguez N, et al. Microbiome of the upper troposphere: species composition and prevalence, effects of tropical storms, and atmospheric implications. Proc. Natl. Acad. Sci. U.S.A. 2013;110:2575–2580. doi: 10.1073/pnas.1212089110.
  • 3. Buchan A, LeCleir GR, Gulvik CA, González JM. Master recyclers: features and functions of bacteria associated with phytoplankton blooms. Nat. Rev. Microbiol. 2014;12:686–698. doi: 10.1038/nrmicro3326.
  • 4. Hultman J, et al. Multi-omics of permafrost, active layer and thermokarst bog soil microbiomes. Nature. 2015;521:208–212. doi: 10.1038/nature14238.
  • 5. Karczewski J, Poniedziałek B, Adamski Z, Rzymski P. The effects of the microbiota on the host immune system. Autoimmunity. 2014;47:494–504. doi: 10.3109/08916934.2014.938322.
  • 6. Cox LM, Blaser MJ. Antibiotics in early life and obesity. Nat. Rev. Endocrinol. 2015;11:182–190. doi: 10.1038/nrendo.2014.210.
  • 7. Tang AT, et al. Endothelial TLR4 and the microbiome drive cerebral cavernous malformations. Nature. 2017;545:305–310. doi: 10.1038/nature22075.
  • 8. East R. Microbiome: soil science comes to life. Nature. 2013;501:S18–S19. doi: 10.1038/501S18a.
  • 9. Mueller UG, Sachs JL. Engineering microbiomes to improve plant and animal health. Trends Microbiol. 2015;23:606–617. doi: 10.1016/j.tim.2015.07.009.
  • 10. Guidi L, et al. Plankton networks driving carbon export in the oligotrophic ocean. Nature. 2016;532:465–470. doi: 10.1038/nature16942.
  • 11. Alivisatos AP, et al. A unified initiative to harness earth’s microbiomes. Science. 2015;350:507–508. doi: 10.1126/science.aac8480.
  • 12. Dubilier N, McFall-Ngai M, Zhao L. Create a global microbiome effort. Nature. 2015;526:631–634.
  • 13. Wubs EJ, van der Putten WH, Bosch M, Bezemer TM. Soil inoculation steers restoration of terrestrial ecosystems. Nat. Plants. 2016;2:16107. doi: 10.1038/nplants.2016.107.
  • 14. Buffie CG, et al. Precision microbiome reconstitution restores bile acid mediated resistance to Clostridium difficile. Nature. 2015;517:205–208. doi: 10.1038/nature13828.
  • 15. Gibson TE, Bashan A, Cao HT, Weiss ST, Liu YY. On the origins and control of community types in the human microbiome. PLoS Comput. Biol. 2016;12:e1004688. doi: 10.1371/journal.pcbi.1004688.
  • 16. Lin CT. Structural controllability. IEEE Trans. Autom. Control. 1974;19:201–208.
  • 17. Liu YY, Barabási AL. Control principles of complex systems. Rev. Mod. Phys. 2016;88:035006. doi: 10.1103/RevModPhys.88.035006.
  • 18. Phelan VV, Liu WT, Pogliano K, Dorrestein PC. Microbial metabolic exchange—the chemotype-to-phenotype link. Nat. Chem. Biol. 2012;8:26–35. doi: 10.1038/nchembio.739.
  • 19. Turchin P. Complex Population Dynamics: A Theoretical/Empirical Synthesis, Vol. 35. Princeton, New Jersey: Princeton University Press; 2003.
  • 20. Faust K, Raes J. Microbial interactions: from networks to models. Nat. Rev. Microbiol. 2012;10:538–550. doi: 10.1038/nrmicro2832.
  • 21. Friedman J, Higgins LM, Gore J. Community structure follows simple assembly rules in microbial microcosms. Nat. Ecol. Evol. 2017;1:0109.
  • 22. Bucci V, et al. MDSINE: microbial dynamical systems inference engine for microbiome timeseries analyses. Genome Biol. 2016;17:121. doi: 10.1186/s13059-016-0980-6.
  • 23. Thomas T, et al. Diversity, structure and convergent evolution of the global sponge microbiome. Nat. Commun. 2016;7:11870.
  • 24. Xiao Y, et al. Mapping the ecological networks of microbial communities. Nat. Commun. 2017;8:2042. doi: 10.1038/s41467-017-02090-2.
  • 25. Friedman J, Alm EJ. Inferring correlation networks from genomic survey data. PLoS Comput. Biol. 2012;8:e1002687. doi: 10.1371/journal.pcbi.1002687.
  • 26. Sugihara G, et al. Detecting causality in complex ecosystems. Science. 2012;338:496–500.
  • 27. Berry D, Widder S. Deciphering microbial interactions and detecting keystone species with co-occurrence networks. Front. Microbiol. 2014;5:219. doi: 10.3389/fmicb.2014.00219.
  • 28. Waksman SA. What is an antibiotic or an antibiotic substance? Mycologia. 1947;39:565–569. doi: 10.1080/00275514.1947.12017635.
  • 29. Oremland RS, Capone DG. Use of specific inhibitors in biogeochemistry and microbial ecology. In: Advances in Microbial Ecology. New York: Springer, Plenum Press; 1988. pp. 285–383.
  • 30. Schrezenmeir J, de Vrese M. Probiotics, prebiotics, and synbiotics approaching a definition. Am. J. Clin. Nutr. 2001;73:361S–364S. doi: 10.1093/ajcn/73.2.361s.
  • 31. Conte G, Moog CH, Perdon AM. Algebraic Methods for Nonlinear Control Systems. London: Springer Science & Business Media; 2007.
  • 32. Moore JC, de Ruiter PC, Hunt HW, Coleman DC, Freckman DW. Microcosms and soil ecology: critical linkages between fields studies and modelling food webs. Ecology. 1996;77:694–705. doi: 10.2307/2265494.
  • 33. Mounier J, et al. Microbial interactions within a cheese microbial community. Appl. Environ. Microbiol. 2008;74:172–181. doi: 10.1128/AEM.01338-07.
  • 34. Stein RR, et al. Ecological modeling from time-series inference: insight into dynamics and stability of intestinal microbiota. PLoS Comput. Biol. 2013;9:e1003388. doi: 10.1371/journal.pcbi.1003388.
  • 35. Gerber GK. The dynamic microbiome. FEBS Lett. 2014;588:4131–4139. doi: 10.1016/j.febslet.2014.02.037.
  • 36. Coyte KZ, Schluter J, Foster KR. The ecology of the microbiome: networks, competition, and stability. Science. 2015;350:663–666. doi: 10.1126/science.aad2602.
  • 37. Bashan A, et al. Universality of human microbial dynamics. Nature. 2016;534:259–262. doi: 10.1038/nature18301.
  • 38. Dam P, Fonseca LL, Konstantinidis KT, Voit EO. Dynamic models of the complex microbial metapopulation of Lake Mendota. npj Syst. Biol. Appl. 2016;2:16007. doi: 10.1038/npjsba.2016.7.
  • 39. Jost C, Ellner SP. Testing for predator dependence in predator–prey dynamics: a nonparametric approach. Proc. R. Soc. Lond. B Biol. Sci. 2000;267:1611–1620. doi: 10.1098/rspb.2000.1186.
  • 40. Liu YY, Slotine JJ, Barabási AL. Controllability of complex networks. Nature. 2011;473:167–173. doi: 10.1038/nature10011.
  • 41. Pequito SD, et al. A framework for structural input/output and control configuration selection in large-scale systems. IEEE Trans. Autom. Control. 2016;61:303–318. doi: 10.1109/TAC.2015.2437525.
  • 42. Camacho EF, Alba CB. Model Predictive Control. London: Springer Science & Business Media; 2013.
  • 43. Cornelius SP, Kath WL, Motter AE. Realistic control of network dynamics. Nat. Commun. 2013;4:1942. doi: 10.1038/ncomms2939.
  • 44. Jones DR, Perttunen CD, Stuckman BE. Lipschitzian optimization without the Lipschitz constant. J. Optim. Theory Appl. 1993;79:157–181. doi: 10.1007/BF00941892.
  • 45. Åström KJ, Murray RM. Feedback Systems: An Introduction for Scientists and Engineers. Princeton, New Jersey: Princeton University Press; 2010.
  • 46. May RM. Stability and Complexity in Model Ecosystems, Vol. 6. Princeton, New Jersey: Princeton University Press; 2001.
  • 47. Power ME, et al. Challenges in the quest for keystones: identifying keystone species is difficult but essential to understanding how loss of species will affect ecosystems. Bioscience. 1996;46:609–620. doi: 10.2307/1312990.
  • 48. Ortiz M, et al. Quantifying keystone species complexes: ecosystem-based conservation management in the King George Island (Antarctic Peninsula). Ecol. Indic. 2017;81:453–460. doi: 10.1016/j.ecolind.2017.06.016.
  • 49. Jain S, Krishna S. Crashes, recoveries, and core shifts in a model of evolving networks. Phys. Rev. E. 2002;65:026103. doi: 10.1103/PhysRevE.65.026103.
  • 50. Sutton RS, Barto AG. Introduction to Reinforcement Learning. Cambridge: MIT Press; 1998.
  • 51. Mnih V, et al. Human-level control through deep reinforcement learning. Nature. 2015;518:529–533. doi: 10.1038/nature14236.
  • 52. Campbell C, Albert R. Stabilization of perturbed Boolean network attractors through compensatory interactions. BMC Syst. Biol. 2014;8:53. doi: 10.1186/1752-0509-8-53.
  • 53. Zañudo JG, Albert R. An effective network reduction approach to find the dynamical repertoire of discrete dynamic networks. Chaos: Interdiscip. J. Nonlinear Sci. 2013;23:025111. doi: 10.1063/1.4809777.
  • 54. Loehle C. Control theory and the management of ecosystems. J. Appl. Ecol. 2006;43:957–966. doi: 10.1111/j.1365-2664.2006.01208.x.
  • 55. Barzel B, Barabási AL. Universality in network dynamics. Nat. Phys. 2013;9:673. doi: 10.1038/nphys2741.
  • 56. Posfai A, Taillefumier T, Wingreen NS. Metabolic trade-offs promote diversity in a model ecosystem. Phys. Rev. Lett. 2017;118:028103. doi: 10.1103/PhysRevLett.118.028103.
  • 57. Taillefumier T, Posfai A, Meir Y, Wingreen NS. Microbial consortia at steady supply. eLife. 2017;6:e22644. doi: 10.7554/eLife.22644.
  • 58. Butler S, O’Dwyer JP. Stability criteria for complex microbial communities. Nat. Commun. 2018;9:2970. doi: 10.1038/s41467-018-05308-z.
  • 59. Good BH, Martis S, Hallatschek O. Adaptation limits ecological diversification and promotes ecological tinkering during the competition for substitutable resources. Proc. Natl. Acad. Sci. U.S.A. 2018;115:E10407–E10416. doi: 10.1073/pnas.1807530115.
  • 60. Goyal A, Maslov S. Diversity, stability, and reproducibility in stochastically assembled microbial ecosystems. Phys. Rev. Lett. 2018;120:158102. doi: 10.1103/PhysRevLett.120.158102.
  • 61. Tikhonov M, Monasson R. Innovation rather than improvement: a solvable high-dimensional model highlights the limitations of scalar fitness. J. Stat. Phys. 2018;172:74–104.
  • 62. Marsland R III, et al. Available energy fluxes drive a phase transition in the diversity, stability, and functional structure of microbial communities. PLoS Comput. Biol. 2018;15:e1006793.
  • 63. Niehaus L, et al. Microbial coexistence through chemical-mediated interactions. bioRxiv. 2018;358481.
  • 64. MacArthur R. Species packing and competitive equilibrium for many species. Theor. Popul. Biol. 1970;1:1–11. doi: 10.1016/0040-5809(70)90039-0.
  • 65. Chesson P. MacArthur’s consumer-resource model. Theor. Popul. Biol. 1990;37:26–38. doi: 10.1016/0040-5809(90)90025-Q.
  • 66. Gao J, Liu YY, D’Souza RM, Barabási AL. Target control of complex networks. Nat. Commun. 2014;5:5415. doi: 10.1038/ncomms6415.


Articles from Nature Communications are provided here courtesy of Nature Publishing Group
