Significance
Explaining and predicting qualitative changes in the taxon membership (community) of microbial systems is synonymous for understanding qualitative changes in the environment. Yet, it remains unclear the extent to which the capacity of going back and forth among few different communities is partially conditioned by the current taxon membership or is fully an outcome of random environmental changes. Here, we introduce a conceptual, mathematical, and practical framework to demonstrate that the relatively high switching capacity of microbial systems can be explained by structured community transitions that increase the dependency on the current community. We corroborate our theory using temporal data of human and ocean microbiota, opening the opportunity to enhance our understanding of a wide array of microbial systems.
Keywords: microbiota, information theory, community dynamics, feasibility, structuralism
Abstract
Microbial systems appear to exhibit a relatively high switching capacity of moving back and forth among few dominant communities (taxon memberships). While this switching behavior has been mainly attributed to random environmental factors, it remains unclear the extent to which internal community dynamics affect the switching capacity of microbial systems. Here, we integrate ecological theory and empirical data to demonstrate that structured community transitions increase the dependency of future communities on the current taxon membership, enhancing the switching capacity of microbial systems. Following a structuralist approach, we propose that each community is feasible within a unique domain in environmental parameter space. Then, structured transitions between any two communities can happen with probability proportional to the size of their feasibility domains and inversely proportional to their distance in environmental parameter space—which can be treated as a special case of the gravity model. We detect two broad classes of systems with structured transitions: one class where switching capacity is high across a wide range of community sizes and another class where switching capacity is high only inside a narrow size range. We corroborate our theory using temporal data of gut and oral microbiota (belonging to class 1) as well as vaginal and ocean microbiota (belonging to class 2). These results reveal that the topology of feasibility domains in environmental parameter space is a relevant property to understand the changing behavior of microbial systems. This knowledge can be potentially used to understand the relevant community size at which internal dynamics can be operating in microbial systems.
The behavior and function of ecological systems are defined by the specific membership of interacting taxa present in a given place and time—known as community (1). In microbial systems, the communities can vary quickly, especially under fast-changing environments due to the short generation times of organisms, and their strong context dependencies (2–4). For example, considering a pool of n taxa, the membership (presence–absence) combinations that are possible to be observed grow exponentially with the pool size as . Yet, it has been reported (5–7) that these systems tend to switch back and forth among few dominant communities. While changes in microbial systems have been mainly attributed to random environmental factors (8, 9), it remains unclear the extent to which internal community dynamics affect the switching capacity of microbial systems (5, 10). This knowledge is important, for instance, to accurately distinguish risky factors within a host or ecosystem that depend strongly on community dynamics across time (11). This has motivated a solid body of work documenting and studying microbial abundance and composition fluctuations over time (4–7, 9, 12–19).
From a dynamical systems perspective (20), the switching behavior of microbial systems can be described by their capacity to move back and forth among alternative stable states (21). Typically, alternative stable states are defined by situations involving multiple equilibria at a fixed set of environmental conditions (7, 22). Similarly, alternative stable states are also often related to bifurcations leading to sudden changes in the number of states with changing environmental conditions (20, 23). This implies that changes in environmental conditions can combine with multiple equilibria to produce complex switching behaviors in dynamical systems (24). Unfortunately, the virtual impossibility of measuring exact initial conditions of state variables and environmental parameters limits our capacity to separate the potential confounding factors acting on the emergence of alternative stable states (25). This has highlighted the possibility to provide a coarse-grained description of a system without an elaborate examination of the underlying quantitative dynamics (25–27). These limitations and opportunities involved in measuring and understanding nonlinear dynamical systems have motivated the statistical analysis of qualitative behaviors as a potential simplification (28, 29).
The structuralist approach has been instrumental in our qualitative understanding of complex living systems (30–33). The structuralist approach takes as valid that any quantitative change in the state of a living system is causally connected to a change in external conditions. This conceptual simplification enables the construction of a one-to-one mapping between environmental conditions in environmental parameter space and the quantitative states of a system. Additionally, the structuralist approach assumes that the qualitative effects of these external changes (and consequently quantitative changes) are constrained by an invariant structure of interactions among the components of the system. Thus, contrary to quantitative states, the diversity of qualitative states in a system does not form a continuum but is limited by a set of descriptive laws that restrict the possibilities of qualitative change. These restrictions introduce a discrete partition of feasibility domains in environmental parameter space associated with a finite collection of qualitative states. Therefore, one can interpret the probability of observing a particular qualitative state in a system proportional to the size of its feasibility domain in environmental parameter space. This suggests that systems subject to randomly changing environments may spend most of their time in a qualitative state with a large feasibility domain precisely because such a state is more probable (34–36). The structuralist approach does not replace the definition of alternative stable states but simply provides a parsimonious framework to overcome some of the limitations of studying the complex nature of living system by moving the analysis to the probability distribution of qualitative behaviors (25, 33). However, this approach does not enable the possibility to differentiate the probability distribution of quantitative stable states, nor if alternative stable states owe their origin to different initial conditions operating under the exact same environmental conditions.
Following the structuralist approach (Materials and Methods), we investigate whether the switching capacity of qualitative states in microbial systems is explained only by the size of feasibility domains in environmental parameter space (what we call unstructured transitions) or it is also necessary to include the topology of such feasibility domains in environmental parameter space (what we call structured transitions). Unstructured transitions, contrary to structured transitions, would be equivalent to cases where changes in qualitative states are not conditioned by the current taxon membership, but rather, only by environmental stochasticity. Formally, we introduce six basic assumptions to establish a statistical analysis of our structural approach on microbial dynamics: i) Any abiotic and biotic changes external to a focal system are considered part of the environmental parameter space. ii) There is a one-to-one mapping between parameters in environmental parameter space and a focal system’s abundance (quantitative state). iii) Many combinations of parameters in environmental parameter space (and consequently abundance compositions) will result in the same community (taxon membership: qualitative state). iv) The probability of observing a particular community is directly related to the size of its feasibility domain in environmental parameter space. v) Unstructured transitions between any two communities happen proportionally to the size of their feasibility domains and independent of their distance in environmental parameter space. vi) Structured transitions between any two communities happen proportionally to the size of their feasibility domains and inversely proportional to their distance in environmental parameter space—a special case of the gravity model (Materials and Methods) (37–39).
Put simply, picturing the finite partition of the environmental parameter space as a multicolored board, unstructured community transitions are equivalent to throwing a dart randomly over the entire board, where each color represents a different community and the colored area relative to the board size specifies the probability of observing such transitions. By definition, unstructured transitions do not depend on the current community of the system, external perturbations move the system to a possible community in proportion to the size of its feasibility domain in environmental parameter space. Alternatively, structured community transitions are equivalent to throwing a dart randomly over a restricted section of the multicolored board, where the section is relatively close to the current community’s feasibility domain. By definition, structured transitions depend on the current community of the system, external perturbations move the system to a possible community in proportion to the size of its feasibility domain and its location in environmental parameter space. These two processes generate distinct transition matrices under a Markov chain analysis (Materials and Methods), from which we can measure and compare the expected switching capacity generated in a system across different community sizes. We showcase the predictions of our theory by generating partitions of feasibility domains in environmental parameter space following a general (model-free) framework. Then, we corroborate our general predictions using temporal data of human gut, vaginal, and oral microbiota as well as ocean microbiota (5, 6, 12, 40). We discuss how this theory can be applied under ecological models to describe specific partitions of feasibility domains in environmental parameter space.
Theoretical Analysis
Constructing the Topology of Feasibility Domains.
The structuralist approach (31, 41) is rooted in the topology of feasibility domains (partitions) in environmental parameter space. These domains are associated with the realization of possible qualitative states of a system (Fig. 1A). To describe the feasibility domain assigned to each qualitative state , it is necessary to introduce two parameters: and . The former is a -element size vector denoting the volume (or size) of each domain . The latter is a matrix denoting the distance between centroids and of domains and . To establish general predictions from this theory, we study the topology of feasibility domains from random distributions (more details below) and from ecological models (more details in Materials and Methods).
Fig. 1.
Illustration of concepts and measures to estimate the switching capacity of microbial systems. (A) Following a structuralist approach (30, 31), the black lines denote a hypothetical environmental parameter space: set of external parameters compatible with the realization of a given community formed from a given taxon (species) pool. The broken arrows depict the distance in environmental parameter space between two different communities. (B) The transition force between communities (i and j) is modeled as a special case of the gravity model (37–39) written as a function of the size of feasibility domains [() and ()] and inversely related to the distance . Note that unstructured (resp. structured) community transitions correspond to the structural parameter (resp. ). The transition probability () is the transition force normalized by the sum of all the forces going out from community i. (C) The calculation of switching capacity is a function of the transition matrix () and the stationary distribution (). Note that corresponds to the probability of remaining in the community i. (D) Transition matrix (T) summarizing all the transition probabilities between each pair of possible communities. This transition matrix gives rise to the stationary distribution () specifying the long-term frequency of each community. The darker the element, the higher the probability and frequency . For each type of transition, we show an illustrative example of a transition matrix (Left) and the corresponding evolution of probability distribution of states/communities (Right). Starting from an initial distribution (), unstructured transitions converge immediately to the long-term probability distribution (equilibrium: ) shown in Panel A. Instead, starting from the same initial distribution (), structured transitions gradually converge (i.e., require many steps) to their unique long-term probability distribution . The larger the structural parameter , the larger the distance between and -pagination (SI Appendix, section S1).
To study the topology of feasibility domains from random distributions, we let be independently drawn from a uniform distribution and normalized such that all elements sum to . Then, we make a list of centroid vectors , where each has coordinates jointly sampled from a uniform distribution on the S-dimensional unit sphere (36). Finally, we calculate the pairwise distances as . This implies that all the subsets ( communities) of the pool have a given feasibility domain in environmental parameter space. This random partition of feasibility domains corresponds to a mutually exclusive and exhaustive partition of environmental parameter space. The randomly generated distances assume that each feasibility domain can be located freely at any region within the environmental parameter space with (Fig. 1).
Probability of Community Transition.
Formally, a community is a subset of interacting taxa from a constant taxa pool of dimension (i.e., ). These definitions assume that while there is a relatively constant but rare flux of taxa to a given place, the current context defined in the environmental parameter space only allows the sustained growth of the taxa . Thus, community transitions happen when the parameters shift from being compatible with to being compatible with . In turn, we define the current context in environmental parameter space as an effective vector of growth rates (, Fig. 1A). We call this an effective value to denote the phenomenological net effect from all the abiotic and biotic factors not explicitly considered within the focal system. Thus, there could be cases when the effective vector (the current context) is inside the feasibility domain () of , allowing the realization of such a community.
Next, to account for transition dynamics, we define the transition force of moving from to , , as the phenomenological interaction strength. In general, is an arbitrary functional form, similar to the possibility of using or not higher-order interaction terms in Lotka–Volterra models (42, 43). Here, we write as a special case of the gravity model (37–39) given by
[1] |
where is a nonnegative structural parameter that penalizes distant community transitions described in the topology of the environmental parameter space. Specifically, we assume that transition forces from to are proportional to the size of each feasibility domain and decay monotonically with distance in environmental parameter space (Fig. 1B).
Then, the probability of observing a transition from community to community is a function of the topology of feasibility domains, formally written as
[2] |
This definition intuitively implies that the probability of such a transition is the relative ratio of the transition force normalized by the total forces acting on given that . Different from the transition force, the transition probability does not necessarily preserve the symmetry to exchange and . Then, if , we recover unstructured transitions
[3] |
where the probability is independent of both the current community and the topology of feasibility domains in the environmental parameter space. Likewise, if , we recover structured transitions
[4] |
where the probability does depend on both the current community and the topology of feasibility domains in the environmental parameter space.
It is important to note that unstructured () and structured () transitions have different stationary probability distributions (Eqs. 3–4). Furthermore, unstructured transitions need just one step to converge to their stationary probability distribution, that is, , while structured transitions gradually converge to their unique distributions (Fig. 1D). The larger the structural parameter , the larger the distance between and (SI Appendix, section S1). Therefore, it is expected that unstructured and structured transitions exhibit different dynamics characterized by their transition matrices, which can then translate into differences in switching capacities.
Measuring Switching Capacity.
Our definition of transition probability enables us to measure the switching capacity: the ability of a system to move back and forth between dominant communities. Formally, defines a Markov process on a taxon pool (44) and specifies the -th element of a row-normalized Markov transition matrix T (Materials and Methods) (Fig. 1D). Therefore, is the probability that the system will stay at the current community , and its complement is the probability that the system will move to any other destination. Then, the switching capacity of the system can be expressed as
[5] |
which represents the uncertainty of transition of each community i (row i of T) weighted by its frequency of occurrence (Fig. 1D). This frequency vector () can be theoretically derived from the stationary distribution of Markov chains (44). Specifically, , where corresponds to the eigenvector with eigenvalue from the transposed transition matrix . Regardless of the pool size, the base 2 of the logarithm establishes that the maximum switching capacity is attained when the system has equal probability of staying or leaving the most frequent (dominant) communities (i.e., close to 0.5 and close to uniform).
Theoretical Results.
Following our theory, the switching capacity of a system is a function of the corresponding partitions of feasibility domains in environmental parameter space. These partitions determine the forces of community transitions, which are themselves a function of the structural parameter and the size of taxon pool S Eq. 1. To illustrate this dependence, we start with 200 random systems (following the methods above) formed by (32 possible communities). For each random system, we compute its corresponding switching capacity Eq. 5 across different values of the structural parameter .
Fig. 2A shows the distribution of conditioned on . Switching capacity is generally stronger in structured transitions (, blue symbols) than in unstructured ones (, orange symbols). This is confirmed with two-sample Kolmogorov–Smirnov tests (P value ). Specifically, as becomes larger, the switching capacity initially increases and then decreases (as well as the variance), with both the median and extreme values of reaching their maximum at (broken vertical line). This implies that H becomes uninformative for large values. This is mainly because large values make even regions with small distances highly penalized, and becomes extremely small—allowing to take most of the probability due to the normalization and consequently the switching level for each row of T decreases together with . This pattern is robust across a wide range of pool sizes (SI Appendix, section S2). Importantly, the value of the structural parameter that maximizes the switching capacity changes as a linear function of pool size approximately as (Fig. 2B). This indicates that the larger the community size, the larger the transition constraints needed to distinguish it from random (unstructured transitions).
Fig. 2.
Theoretical predictions. (A) Each point corresponds to one out of 200 randomly generated transition matrices (T) of a pool with 5 taxa (see main text). The x-axis corresponds to different values of the structural parameter modulating the distance effect of transition probabilities according to Eq. 1. Note that (resp. ) corresponds to unstructured (resp. structured) community transitions. The y-axis corresponds to the computed switching capacity (Eq. 5). For , the median of is with an IQR (interquartile range) of . As a reference, for (closest to for , broken vertical line), the median of is 0.954 with an IQR of . (B) changes monotonically as a function of community size S approximately as . (C) Each point corresponds to one out of 200 randomly generated transition matrices (T) of a pool from 4 to 12 taxa (see main text). The Top gray points correspond to structured community transitions with . The Middle blue points correspond to structured community transitions with . The Bottom orange points correspond to unstructured community transitions with . In general, structured community transitions yield a stronger switching capacity (y-axis), yet for fixed values, the maximum values of decrease with pool size (x-axis).
Comparing the switching capacity across different pool sizes, Fig. 2C clearly distinguishes between structured and unstructured transitions. Structured transitions (, bottom orange symbols) are represented by either a fixed (as a reference, we use , middle blue symbols) or optimized (top gray symbols) as a function of pool size. Overlaps between these two structured transition cases may occur only when S is small. Qualitative results are not sensitive to the choice of distributions when generating partitions of feasibility domains (SI Appendix, section S3). These results reveal that there are two broad classes of systems following structured transitions: one class where switching capacity is high across a wide range of community sizes structured transitions are a function of pool size, , and another class where switching capacity is high only inside a narrow size range (structured transitions are not a function of pool size, ).
Empirical Analysis
Measuring Switching Capacity from Time Series Data.
To corroborate our theory with data from natural microbial communities, we measured switching capacity from publicly available times series data of microbial systems from the human gut, vaginal, and oral microbiota as well as ocean microbiota (5, 6, 12, 40). In each dataset, we have 16S sequencing measurements of relative abundance for each detected operational taxonomic unit (OTU) from multiple individual donors (human microbiota) or locations (ocean microbiota) over time. The gut microbiota dataset contains time series from 80 donors, the longest time series (donor am) comprising 317 samples over effectively 204 d (12) and recording 43 unique microbial families. The vaginal microbiota dataset comprises time series from 32 donors, with samples collected twice a week for 16 wk (5). These data track more than 86 families for each donor. The oral microbiota dataset samples a single subject on a daily basis (6), recording in total 211 microbial families over effectively 273 d. The ocean microbiota dataset focuses on a consecutive 93-d sampling of coastal plankton in three different sampling sites (40), recording a maximum of 228 microbial families on each site. In SI Appendix, section S4, we illustrate the presence–absence data using a principal coordinate analysis, supporting previous observations that microbial communities tend to be characterized by few dominant communities.
To increase the detection level and reduce the dimensionality of the data, we aggregated OTUs by family. We confirmed that aggregating OTUs at different taxonomic levels yielded the same qualitative results (SI Appendix, section S5). To convert the original time series of taxon abundance into a binarized time series, first, we calculate the daily relative abundance of each taxon (in relation to the total abundance of taxa reported in each day), and then, we translated into daily presence–absence observations using of relative abundance as a threshold. In SI Appendix, sections S6–S9, we use different sensitivity analyses to validate this procedure. For each donor and location, we compiled all the unique taxa detected across the donor’s entire time series. Because it is virtually impossible to track the exact taxon membership of microbial systems due to technical limitations, and practical interventions are typically targeted to a specific set of taxa, it is then useful to focus on a focal system and take everything external to this system as the environment (i.e., biotic and abiotic conditions). This approach is well aligned with our basic assumptions. Specifically, many combinations of parameters in environmental parameter space (and consequently abundance compositions) will result in the same community (taxon membership: qualitative state). Therefore, while the environment (external set of taxa not explicitly considered as part of a system) can be different, the observed communities from the focal system can be qualitatively the same. Thus, we sub-sampled from each donor’s unique taxon list to generate pools with sizes ranging from 3 to 20 (Fig. 3A). Then, we used the time series of the set and binarized it to denote presence (one) and absence (zero) (Fig. 3B). For each day, these binarized data specify a given community formed by the daily presence of taxa from the studied pool . Therefore, the binarized data provides an empirical time series of community changes, which we used to quantify community transitions and their switching capacities. We assumed that each observed community is feasible, but not necessarily at equilibrium (e.g., can be under transient dynamics) (41).
Fig. 3.
Workflow of empirical analysis. (A) For a given taxon (species) pool of size (S), we randomly choose the studied (focal) taxon using compiled time series data (see main text). Then we compute the relative abundance of the In SI Appendix, sections S6–S9, we use different sensitivity analyses. (B) We binarize the time series data based on presence and absence of taxa, yielding a time series of community transitions. Additionally, as a null model, we randomized the data per day by randomly shuffling the presence of taxa while preserving the number of observed taxa per day. This null model is the equivalent of unstructured transitions established theoretically (Fig. 1). (C) We estimate the observed transition probability () and stationary distribution () from the frequency of occurrences. Note that the observed (structured) and randomized (unstructured) binarized time series generate different and . We calculate the observed (structured: )) and randomized (unstructured: )) switching capacities using the corresponding transition matrices and stationary distributions. This analysis is performed separately for each empirical time series.
To quantify empirical transitions, we calculated the frequency () a composition is followed by another composition . Then, the observed transition matrix () and observed stationary distribution () are, respectively, given by
[6] |
where and m is the length of the time series in the data. The stationary distribution can be derived directly from the transition matrix as we do in our theoretical analysis, giving similar results because and are not independent. The observed switching capacity () can be estimated by combining Eq. 5 and Eq. 6. Last, to compare the observed (structured) switching capacity to a randomized (unstructured) case , we randomized the original time series by randomly shuffling the presence of taxa while preserving the number of taxa per day. This process assumes that environmental conditions can affect equally the presence of each taxon independently of the community. We then estimate the randomized (unstructured) switching capacity following the same methodology established for as in Eq. 6. It is worth mentioning that empirical results are a function of the length and quality of the observed time series. However, our analysis is not focused on the specific values of statistics, rather it is focused on the general patterns in the distribution of switching capacities. Sensitivity analyses corroborate the robustness of our empirical results (SI Appendix, sections S6–S9).
Empirical Results.
Corroborating our theoretical predictions (shown in Fig. 2), Fig. 4 shows that the studied microbial systems display a stronger switching capacity (for pools with more than 4 taxa, i.e., ) than the randomized (unstructured) counterparts (i.e., ). In Fig. 4, each boxplot corresponds to the distribution of switching capacity (y-axis) generated by 200 randomly chosen family pools with given size (x-axis). This pattern is consistent across a wide range of pool sizes () and statistically confirmed with two-sample Kolmogorov–Smirnov tests (P value ). In particular, Fig. 4 shows the analysis for donor am (human gut microbiota), subject 5 (human vaginal microbiota), subject A (human oral microbiota), and location 37 (ocean microbiota). All the other donors and locations display similar patterns across a wide range of community sizes (SI Appendix, sections S10 and S11). In SI Appendix, sections S12 and S13, we present evidence that the inferred Markov process from time series data as well as the sampling procedure does not affect our conclusions.
Fig. 4.
Empirical analysis of switching capacity in microbial systems. The analysis is performed using empirical time series of sequenced human gut, vaginal, and oral microbiota as well as ocean microbiota. This analysis is similar to its theoretical counterpart shown in Fig. 2, corroborating our theory. Data were collected and made available in previous publications (5, 6, 12, 40). In each panel, each boxplot consists of 200 data points in total and corresponds to a different pool of size varied between 3 and 20 taxa. These pools are randomly chosen from the time series of 43 families observed in donor am (human gut microbiota, Panel A), from 86 families observed in subject 5 (human vaginal microbiota, Panel B), from 211 families observed in subject A (human oral microbiota, Panel C), and 228 families in location 37 (human ocean microbiota, Panel D). Results are qualitatively the same for different time series, inference, and sampling procedures (SI Appendix, sections S10–S13). The randomized (unstructured) analysis is generated using the surrogate time series such that we randomly shuffle the presence of taxa while keeping the number of observed taxa per day (see main text).
In agreement with our theory, empirical results reveal two broad classes of microbial systems. One class formed by the human gut and oral microbiota (Panels A and C) displays high values of switching capacity across a wide range of community sizes (more aligned to the scenario in Fig. 2). This first class reveals that strong and structured community dynamics can operate at different levels of living matter organization. The second class formed by the human vaginal microbiota and ocean microbiota (Panels B and D) displays high values of switching capacity only inside a narrow range of pool sizes (more aligned to a fixed scenario in Fig. 2). This second class is subject to stronger seasonal or periodical environmental changes than the first class, and community transitions become more predictable (lower switching capacity) the higher the pool size. Additionally, we found that the relationship between switching capacities and the diagonal probabilities generated by transition matrices (i.e., the probabilities of remaining in a current state i) follow a nontrivial nonmonotonic pattern that is observed in both theoretical and empirical data (SI Appendix, section S14). These results suggest that different external pressures can give rise to different switching patterns and limit the scope of internal community dynamics in microbial systems.
Conclusions
Living systems are constantly evolving and adapting as a function of environmental changes (4, 32). Microbial systems tend to exhibit relatively stable communities, especially over short timescales (5, 6, 8, 40). This stability appears to be linked with a switching behavior among few dominant communities (5, 6, 8). Following a structuralist approach (30, 31), we have introduced a simplified framework to distinguish the interplay between internal community dynamics and external forces in shaping the switching capacity of microbial systems. We assumed that it is possible to construct an environmental parameter space, where such a space can be partitioned into feasibility domains associated with the realization of each possible community. Then, the probability of observing a given community is proportional to the size of its feasibility domain in environmental parameter space. We established that structured transitions between communities are inversely related to their distance in environmental parameter space. In other words, structured transitions limit the possible changes to nearby communities in environmental parameter space, but changes do not need to happen between similar taxon memberships (it all depends on the topology of feasibility domains in environmental parameter space). Likewise, unstructured transitions are independent of their distance in environmental parameter space, they are only dependent on the size of their feasibility domains. That is, unstructured transitions are not initially restricted, it all depends on environmental stochasticity. Thus, unstructured transitions can be considered as processes that are already at equilibrium (i.e., the long-term probability that the system will be in each state), whereas structured transitions are processes that are gradually moving toward an equilibrium (Materials and Methods). We calculated the switching capacity as the weighted capacity of going back and forth between the system’s dominant communities using measures anchored in information theory (45).
Theoretically, we predicted that structured transitions display a stronger switching capacity than unstructured transitions. Our theoretical results also revealed two broad classes of systems under structured transitions: one class where a high switching capacity happens across a wide range of community sizes and another class where high switching capacity happens only inside a narrow size range. Although we presented our theoretical predictions using partitions of feasibility domains in environmental parameter space following a general (model-free) framework, these partitions can be described by specific models in order to increase our understanding of the topology of feasibility domains in environmental parameter space. For example, leveraging on the scalability and tractability of the generalized Lotka–Volterra (gLV) model, one can construct well-defined partitions of feasibility domains (Materials and Methods). Additionally, from a mechanistic perspective, partitions of feasibility domains can incorporate the functional landscape of microbial consortia, not only their taxon membership (43, 46). Future work may benefit from delving deeper into such broader schemes of partitioning the environmental parameter space and their implications.
We corroborated all our theoretical predictions using time series data on human gut, vaginal, and oral microbiota as well as ocean microbiota. Community transitions and switching capacities were calculated directly from the observed time series. The unstructured community transitions were then generated by randomizing the presence of families in the time series while keeping the number of observed families per day, mimicking a daily effect of unrestricted environmental forces. Our results revealed that the estimated switching capacity was generally higher than the expected capacity under the null model (unstructured case). This implies that we cannot rule out the possibility that the switching capacity of microbial systems is affected by internal community dynamics, and not an outcome derived simply from environmental stochasticity. Specifically, we found that structured community transitions increase the dependency of future communities on the current taxon membership, enhancing the switching capacity of microbial systems. While unstructured transitions converge immediately to a long-term probability distribution of states, structured transitions converge gradually. Importantly, this differentiation of community transitions does not happen across all community sizes. Our empirical results revealed two broad classes of systems following structured transitions. Systems subject to strong periodicity, such as the human vaginal microbiota and ocean microbiota (5, 40) displayed a high switching capacity inside a narrow range of community sizes. Instead, systems subject to less periodicity, such as the human gut and oral microbiota (6, 12) displayed a high switching capacity across a wide range of community sizes. We speculate that these differences may arise as potential trade-offs between adaptability (size of feasibility domain in environmental parameter space) and evolvability (location of the feasibility domain in environmental parameter space) as a response to external perturbations acting on the size and membership of microbial systems (31)—revealing the importance of understanding the topology of feasibility domains in environmental parameter space. This knowledge can be potentially used to understand the relevant community size at which internal dynamics can be operating in microbial systems. Our work introduces a conceptual, mathematical, and practical framework that future work can potentially use and expand to detect anomalous community behaviors within hosts or ecosystems.
Materials and Methods
Structuralist Approach.
To guide the reader into the structuralist perspective, we provide a formal application of this perspective in ecology (36). Let us suppose a generalized gLV system of S populations, where the interaction coefficients are represented by the matrix and the per capita growth rates are represented by the vector . Then, the dynamical system can be described as . In the structuralist perspective, the interaction matrix can be considered the invariant internal structure, while the growth rate vector can be considered as the effective factors (not explicitly considered in the system) acting on the intrinsic growth rate of each population. Then, the environmental parameter space is thus the S-dimensional space of the parameter . The linearity of the gLV model implies a one-to-one mapping between directions of and the quantitative outcome of species abundances at equilibrium, i.e., . The community is feasible () as long as the effective vector is inside the feasibility domain: . The size of the feasibility domain can be geometrically calculated (41). Moreover, any subset (community) of has a feasibility domain (41) located in environmental parameter space (SI Appendix, section S15)
Gravity Model.
The gravity model can be seen as a class of models that describe the general idea of attraction forces between a pair of entities. It starts from Newton’s gravity law, where two bodies with mass and attract each other with a force of . In general, the attraction force is proportional to the individual weights/sizes/importance and inversely proportional to their distances. The gravity model has been widely used in areas as diverse as transportation (37), disease transmission (38), and genetic inheritance (39), among others. In our work, we take the gravity model form to write the transition force between communities.
Markov Chain Analysis.
For both unstructured and structured community transitions, the future community at most can only depend on the current community. Thus, any switching process generated by these two types of transitions is a Markov process. Because we care about the possible communities, the Markov process is on finite space. Thus, the transition probability generated from the gravity model has the property that , implying that the Markov process is also irreducible. Having an irreducible and finite Markov chain guarantees the existence of a unique stationary distribution . This reveals that the switching capacity expressed in Eq. 5 as a function of T and is a well-defined measure.
Supplementary Material
Appendix 01 (PDF)
Acknowledgments
We thank two anonymous reviewers for their constructive reports that led to a much improved manuscript. C.L. thanks the financial support from the Takeda Graduate Fellowship. J.D. and S.S. thank the financial support from MathWorks. J.N. thanks the NSF Postdoctoral Research Fellowships in Biology Program for support under Grant No.2109890. R.S.thanks the support of the Generalitat de Catalunya under grant AGAUR 2021 SGR 00751. Y.-Y.L. acknowledges funding support from the National Institutes of Health (under grant R01AI141529) and the Department of Defense (under grant W81XWH22-S-TBIPH2).
Author contributions
C.L., Y.-Y.L., R.S., and S.S. designed research; C.L. performed research; C.L., J.D., and S.S. contributed new reagents/analytic tools; C.L., J.D., J.N., Y.-Y.L., E.J.A., R.S., and S.S. analyzed data; and C.L., J.D., J.N., Y.-Y.L., E.J.A., R.S., and S.S. wrote the paper.
Competing interests
The authors declare no competing interest.
Footnotes
This article is a PNAS Direct Submission.
Data, Materials, and Software Availability
Previously published data were used for this work (5, 6, 12, 40). The source code to produce the results is available on GitHub at https://github.com/MITEcology/Long_PNAS_2024 (47).
Supporting Information
References
- 1.Vellend M., The Theory of Ecological Communities (Princeton University Press, NJ, 2016). [Google Scholar]
- 2.Faust K., Raes J., Microbial interactions: From networks to models. Nat. Rev. Microbiol. 10, 538–550 (2012). [DOI] [PubMed] [Google Scholar]
- 3.Hoek T. A., et al. , Resource availability modulates the cooperative and competitive nature of a microbial cross-feeding mutualism. PLoS Biol. 14, e1002540 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Nguyen J., Lara-Gutiérrez J., Stocker R., Environmental fluctuations and their effects on microbial communities, populations and individuals. FEMS Microbiol. Rev. 45, 1–16 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gajer P., et al. , Temporal dynamics of the human vaginal microbiota. Sci. Transl. Med. 4, 132ra52 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.David L. A., et al. , Host lifestyle affects human microbiota on daily timescales. Genome Biol. 15, R89 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Fujita H., et al. , Alternative stable states, nonlinear behavior, and predictability of microbiome dynamics. Microbiome 11, 63 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Gibbons S. M., Kearney S. M., Smillie C. S., Alm E. J., Two dynamic regimes in the human gut microbiome. PLoS Comput. Biol. 13, e1005364 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wang X. W., Liu Y. Y., Origins of scaling laws in microbial dynamics. Phys. Rev. Res. 5, 013004 (2023). [Google Scholar]
- 10.Gibson T. E., Bashan A., Cao H. T., Weiss S. T., Liu Y. Y., On the origins and control of community types in the human microbiome. PLoS Comput. Biol. 12, e1004688 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Scheffers B. R., et al. , The broad footprint of climate change from genes to biomes to people. Science 354, aff7671 (2016). [DOI] [PubMed] [Google Scholar]
- 12.Poyet M., et al. , A library of human gut bacterial isolates paired with longitudinal multiomics data enables mechanistic microbiome research. Nat. Med. 25, 1442–1452 (2019). [DOI] [PubMed] [Google Scholar]
- 13.Zaoli S., Grilli J., The stochastic logistic model with correlated carrying capacities reproduces beta-diversity metrics of microbial communities. PLoS Comput. Biol. 18, e1010043 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.George A. B., O’Dwyer J., Universal abundance fluctuations across microbial communities, tropical forests, and urban populations. Proc. Natl. Acad. Sci. U.S.A. 120, e2215832120 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Ji B. W., Sheth R. U., Dixit P. D., Tchourine K., Vitkup D., Macroecological dynamics of gut microbiota. Nat. Microbiol. 5, 768–775 (2020). [DOI] [PubMed] [Google Scholar]
- 16.Cao H. T., Gibson T. E., Bashan A., Liu Y. Y., Inferring human microbial dynamics from temporal metagenomics data: Pitfalls and lessons. Bioessays 39, 1600188 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Priya S., Blekhman R., Population dynamics of the human gut microbiome: Change is the only constant. Genome Biol. 20, 150 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Vandeputte D., et al. , Temporal variability in quantitative human gut microbiome profiles and implications for clinical research. Nat. Commun. 12, 6740 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Koliada A., et al. , Seasonal variation in gut microbiota composition: Cross-sectional evidence from Ukrainian population. BMC Microbiol. 20, 100 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Strogatz S. H., Nonlinear Dynamics and Chaos: With Applications to Physics, Biology, Chemistry, and Engineering (Westview Press, 2014). [Google Scholar]
- 21.Hastings A., et al. , Transient phenomena in ecology. Science 361, eaat6412 (2018). [DOI] [PubMed] [Google Scholar]
- 22.Fukami T., Historical contingency in community assembly: Integrating niches, species pools, and priority effects. Annu. Rev. Ecol. Evolu. Syst. 46, 1–23 (2015). [Google Scholar]
- 23.Scheffer M., et al. , Early-warning signals for critical transitions. Nature 461, 53–59 (2009). [DOI] [PubMed] [Google Scholar]
- 24.Saavedra S., Rohr R. P., Fortuna M. A., Selva N., Bascompte J., Seasonal species interactions minimize the impact of species turnover on the likelihood of community persistence. Ecology 97, 865–873 (2016). [DOI] [PubMed] [Google Scholar]
- 25.Parisi G., Complex systems: A physicist’s viewpoint. Phys. A 263, 557–564 (1999). [Google Scholar]
- 26.Flack J. C., Coarse-graining as a downward causation mechanism. Phil. Trans. R. Soc. A 375, 20160338 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hoel E. P., Albantakis L., Tononi G., Quantifying causal emergence shows that macro can beat micro. Proc. Natl. Acad. Sci. U.S.A. 110, 19790–19795 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gibbs J. W., Elementary Principles in Statistical Mechanics (Charles Scribner’s Sons, New York, 1902). [Google Scholar]
- 29.Prigogine I., Stengers I., Order Out of Chaos: Man’s New Dialogue with Nature (Bantam, New York, 1984). [Google Scholar]
- 30.Alberch P., The logic of monsters: Evidence for internal constraint in development and evolution. Geobios 22, 21–57 (1989). [Google Scholar]
- 31.Alberch P., From genes to phenotype: Dynamical systems and evolvability. Genetica 84, 5–11 (1991). [DOI] [PubMed] [Google Scholar]
- 32.Kirschner M. W., Gerhart J. C., The Plausibility of Life: Resolving Darwin’s Dilemma (Yale University Press, Chicago, IL, 2005). [Google Scholar]
- 33.Thom R., Stabilité structurelle et morphogenèse (InterÉditions, Paris, 1972). [Google Scholar]
- 34.Medeiros L. P., Boege K., del Val E., Zaldivar-Riverón A., Saavedra S., Observed ecological communities are formed by species combinations that are among the most likely to persist under changing environments. Am. Nat. 197, E17–E29 (2021). [DOI] [PubMed] [Google Scholar]
- 35.García-Callejas D., et al. , Non-random interactions within and across guilds shape the potential to coexist in multi-trophic ecological communities. Ecol. Lett. 26, 831–842 (2023). [DOI] [PubMed] [Google Scholar]
- 36.Saavedra S., Medeiros L. P., AlAdwani M., Structural forecasting of species persistence under changing environments. Ecol. Lett. 23, 1511–1521 (2020). [DOI] [PubMed] [Google Scholar]
- 37.Erlander S., Stewart N. F., The Gravity Model in Transportation Analysis (VSP, Utrecht, 1990). [Google Scholar]
- 38.Balcan D., et al. , Multiscale mobility networks and the spatial spreading of infectious diseases. Proc. Natl. Acad. Sci. U.S.A. 106, 21484–21489 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Watts A., et al. , How spatio-temporal habitat connectivity affects amphibian genetic structure. Front. Genet. 6, 275 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Martin-Platero A. M., et al. , High resolution time series reveals cohesive but short-lived communities in coastal plankton. Nat. Commun. 9, 266 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Deng J., Taylor W., Saavedra S., Understanding the impact of third-party species on pairwise coexistence. PLoS Comput. Biol. 18, e1010630 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.AlAdwani M., Saavedra S., Is the addition of higher-order interactions in ecological models increasing the understanding of ecological dynamics? Math. Biosci. 315, 108222 (2019). [DOI] [PubMed] [Google Scholar]
- 43.Sanchez-Gorostiaga A., Bajić D., Osborne M. L., Poyatos J. F., Sanchez A., High-order interactions distort the functional landscape of microbial consortia. PLoS Biol. 17, e3000550 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Hill M. F., Witman J. D., Caswell H., Markov chain analysis of succession in a rocky subtidal community. Am. Nat. 164, E46–E61 (2004). [DOI] [PubMed] [Google Scholar]
- 45.Shannon C., Shannon C., A mathematical theory of communication. Bell Syst. Techn. J. 27, 379–423 (1948). [Google Scholar]
- 46.Estrela S., et al. , Functional attractors in microbial community assembly. Cell Syst. 13, 29–42 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Long C., Codes for reproducing results. Github. https://github.com/MITEcology/Long_PNAS_2024. Deposited 10 January 2024.
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Appendix 01 (PDF)
Data Availability Statement
Previously published data were used for this work (5, 6, 12, 40). The source code to produce the results is available on GitHub at https://github.com/MITEcology/Long_PNAS_2024 (47).