Skip to main content
Computational and Structural Biotechnology Journal logoLink to Computational and Structural Biotechnology Journal
. 2025 Mar 12;27:1482–1497. doi: 10.1016/j.csbj.2025.03.012

Characterising functional redundancy in microbiome communities via relative entropy

Daniel Fässler a,, Almut Heinken b, Johannes Hertel a,c
PMCID: PMC12013412  PMID: 40265160

Abstract

Functional redundancy has been hypothesised to be at the core of the well-evidenced relation between high ecological microbiome diversity and human health. Here, we conceptualise and operationalise functional redundancy on a single-trait level for functionally annotated microbial communities, utilising an information-theoretic approach based on relative entropy that also allows for the quantification of functional interdependency across species. Via constraint-based microbiome community modelling of a public faecal metagenomic dataset, we demonstrate that the strength of the relation between species diversity and functional redundancy is dependent on specific attributes of the function under consideration such as the rarity and the occurring functional interdependencies. Moreover, by integrating faecal metabolome data, we highlight that measures of functional redundancy have correlates in the host’s metabolome. We further demonstrate that microbiomes sampled from colorectal cancer patients display higher levels of species-species functional interdependencies than those of healthy controls. By analysing microbiome community models from an inflammatory bowel disease (IBD) study, we show that although species diversity decreased in IBD subjects, functional redundancy increased for certain metabolites, notably hydrogen sulphide. This finding highlights their potential to provide valuable insights beyond species diversity. Here, we formalise the concept of functional redundancy in microbial communities and demonstrate its usefulness in real microbiome data, providing a foundation for a deeper understanding of how microbiome diversity shapes the functional capacities of a microbiome.

Keywords: Functional redundancy, Microbiome function, Microbiome diversity, Relative entropy, Constraint-based community modelling, Information theory

Graphical Abstract

graphic file with name ga1.jpg

1. Introduction

To understand the multitude of functions of the microbiome, ecological diversity represents a key concept. Ecological diversity encompasses various dimensions of diversity and is often used synonymously with species diversity in microbiome research, which describes the richness of species within a community and the evenness of their distribution [4], [5]. Multiple measures, such as Shannon entropy or Simpson’s diversity, have become essential in microbiome research [6], [7], [8], [9], [10]. However, summarising species diversity with a single measure is problematic [11], and the question remains as to how informative diversity measures are and what they capture [12]. Measures of species diversity rely solely on taxonomic information and compositional variation, and do not directly reflect functional diversity or functional redundancy of a microbial community [13], [14], [15]. Hence, two microbial communities may diverge strongly in their composition and their species diversity but be largely equivalent in their metabolic functions [16], [17], [18]. A compositionally highly diverse community can have low functional redundancy, just as a microbial community with lower species diversity can be rich in functional redundancy. Nevertheless, a range of research has asserted that high species diversity leads to more stable host-microbiome interactions through functional redundancy [19], [20], aiming to explain the wealth of evidence for a range of human diseases being associated with reduced microbiome species diversity [21], [22], [23], [24].

Functional diversity and redundancy have been investigated previously by incorporating pairwise functional dissimilarities among co-existing species based on multiple traits [25], [26], [27], [28], [29]. However, selecting relevant functional traits while excluding non-informative ones is challenging, as this directly affects the accuracy and ecological relevance of functional diversity measures [30]. Moreover, in microbiome research, we are often interested in individual functions that are core to certain physiological aspects of host-microbe interactions [31], [32], [33].

Here, we target functional redundancy at the single-trait level by introducing information-theoretic operationalisations that can be realised via detailed functional and quantitative annotations as delivered by constraint-based microbiome community modelling based on genome-scale reconstructions [34], [35], [36].

Genome-scale metabolic models of microorganisms [37], [38], [39], [40] based on constraint-based reconstruction and analysis [34], [36], [41] (COBRA) provide a powerful resource to capture metabolic functions of microbiomes and can be applied to generate microbiome communities by integrating personalised metagenomic data. COBRA-based community modelling delivers a detailed quantitative annotation of metabolic functions across a wide range of biochemical pathways and can be utilised to quantify the maximum metabolite secretion potential of the community as a whole and of each individual microbe [42], [43]. Corresponding predictions in the form of maximal net secretion fluxes have been shown to hold predictive power through the integration of metabolome data in multiple works [37], [44], [45], [3].

Applying our methodology to two public datasets, we will demonstrate that the derived operationalisation provides meaningful insights not only into the theoretical and empirical relations between species diversity and functional redundancy but also into host-microbiome interactions in colorectal cancer (CRC) and inflammatory bowel disease (IBD).

2. Theoretical concepts

To motivate the chosen operationalisation of functional redundancy, we first define functional redundancy for a given function of a microbial community in biological terms. Intuitively, if many species or organisms in a community are able to perform a certain function, such that the loss of some of these species or organisms does not lead to the loss of the function, we would call the community functionally redundant with respect to that function. Therefore, we define functional redundancy as follows:

Definition 1

Functional redundancy of a specific function in microbial communities

Functional redundancy of a certain function describes the potential of a microbial community to retain that function under the loss of microbial biomass.

Thus, we understand functional redundancy as the projected capacity of a microbial community to perform a certain function (e.g., butyrate production) if the community would experience the depletion or the entire loss of microbial species, in contrast to previous conceptualisations that understand functional redundancy as a multi-trait attribute [28], [46], [47].

An important aspect of our definition is that it can be operationalised in different ways. First, we can characterise functional redundancy on the level of taxonomic units, meaning, for example, that multiple species capable of performing the same function are present in a given community (taxon-based functional redundancy). Second, we can characterise functional redundancy by the abundance of organisms that can perform a function in a given community (abundance-based functional redundancy). Both characterisations have their merits. Which one is more appropriate depends on the biological phenomenon under investigation and the available data. For example, consider a community where only a single but highly abundant species can perform a certain function. In terms of taxon-based functional redundancy, functional redundancy would be minimal, as a single species is responsible for upholding the function, and thus the loss of this species would result in the entire loss of the function. In terms of abundance-based functional redundancy, in contrast, the functional redundancy may be considerable, as one may lose a lot of biomass without losing the function. Depending on the type of biomass loss we expect to be relevant for a certain community, we may choose one operationalisation over the other. Thus, we provide a formal framework for both operationalisations.

2.1. Formalisation

We formalise functional redundancy for a microbial community containing n-species, denoted by S={S1,,Sn}, with relative abundances a=(a1,,an)(0,1]n. For each species in the community, we consider a quantitative output of a specific function F, denoted as f=(f1,,fn)[0,γ]n, where γ is the maximum of the function of interest. Let f~=f~1,,f~n, where f~i=fiftotal0,1, be the vector of shares of each species Si for the function F relative to the total community output ftotal=i=1nfi>0. Furthermore, let Sref be a reference set containing all m-species that can perform the function in theory. We analogously define fref and correspondingly f~ref as the quantitative output and corresponding shares based on the reference Sref. Applied to a concrete microbial community, f and fref will only differ in the number of zeros, as each species in the microbial community capable of performing the function must be present in the reference set and vice versa.

2.1.1. Taxon-based functional redundancy

We search for measures that reach their maximum, when each species of the system performs the same quantitative amount of the function of interest, and their minimum if and only if a single species can perform the function. We search for a mapping R:(f1,,fn) that is maximised, when 0<f1=f2==fn and minimised, when fj>0, for one j1,..,n and fi=0, for all ij. Here, we propose to utilise an information-theoretic operationalisation that satisfies the above mentioned criteria and uses the functional shares f~.

The taxon-based functional redundancy in respect to the function F is then given by:

RTaxon=DKL(f~Un)=H(f~)Hmax(f~)=i:f~i>0f~ilog(f~i)log(n) (1)

Here, DKL(f||Un) denotes the relative entropy or Kullback-Leibler divergence between f~ and a discrete uniform distribution Un, Hmaxf~ the maximal possible entropy of f~, Hf~ the actual entropy of f~, and n the number of species in the microbial community (Supplementary Material). Note that we use the negative relative entropy to ensure that a higher functional redundancy measure corresponds to higher functional redundancy. The functional redundancy measure RTaxon is negative, or zero in case of maximum functional redundancy. Hence, we define functional redundancy analogously to absolute redundancy in information theory, where it is defined as the difference between maximum entropy and actual entropy [48].

The above-defined measure of taxon-based functional redundancy is difficult to compare across communities of different species richness, as the measure depends heavily on the species richness of the community under consideration. This problem can be circumvented by utilising f~ref as defined above:

RTaxon=DKL(fref||Um)=i:fi>0filog(fi)log(m), (2)

where m is the number of species in the reference, fixed for each function, across all microbiome communities. We refer to Eqn 1 as sample taxon-based functional redundancy and to Eqn 2 as reference taxon-based functional redundancy.

2.1.2. Abundance-based functional redundancy

Abundance-based functional redundancy is characterised by the distribution of individual organisms capable of performing the function within a given microbial community. Maximum functional redundancy would be attained if each organism would contribute equally to the total community output, while minimal functional redundancy would be attained if only one organism would be responsible for the entire community output. The abundance-based functional redundancy is then given by:

RAbundance=DKL(f||a)=i:f˜i>0filog(fiai). (3)

Thus, Eqn 3 represents the relative entropy between the relative shares of species in terms of total community output and the abundance vector. This measure is maximal if each fi is a linear function of its corresponding abundance ai, i.e., it must hold that fi=bai+d, where b and d are constants across all i{1,,n}.

2.1.3. The functional interdependency index

Going one step further, we can ask how DKL(f||a) relates to the total abundance of k species capable of performing a certain function. Indeed, one can show that DKLfa is a direct function of i=1kai, if all fi are only dependent on ai (Supplementary Methods, Attribute 1). This scenario represents a case of missing species-species interdependencies with respect to the shares f, conceptualising interdependency as the dependency of a function on the abundances of more than one species. Thus, this formalisation enables a quantification of the intensity of functional interdependencies in a microbial community. To this end, we define the functional interdependency index I by considering the sub-system of all species with shares greater zero: (fj)jJ=(fj1,,fjk), where J={j1,,jk}={i:f˜i>0}. The functional interdependency index is defined as the relative entropy between the shares vector of functions greater zero and its associated vector of restricted abundances. Let fJ~=f~jjJ and a~J=a~jjJ, where a~j=ajjJaj. The interdependency index I is calculated as:

I=DKL(fJ||aJ)=jJfjlog(fjaj). (4)

The functional interdependency index delivers a measure of the intensity of interdependencies that may also occur between more than two microbes, quantifying an aspect which has been hard to measure so far.

2.1.4. The global functional interdependency index

On functional interdependency indices for multiple functions of a sample, we define a global index of functional interdependency to measure functional interdependency at a community-level across all functions. Let I1,,In represent multiple functional interdependency indices across n functions of the same sample. We define the global functional interdependency index as the median of {I1,,In}:

Ig=med{I1,,In}, (5)

where the median is used for robustness against outliers and potentially skewed distributions. However, other metrics of central tendency could also be employed.

Fig. 1 provides a graphical overview of the introduced concepts and operationalisations using a toy example. A more detailed discussion of this toy example, including further illustrations and example calculations, can be found in the Supplementary Material along with fundamental mathematical attributes of the proposed operationalisations.

Fig. 1.

Fig. 1

Overview of a toy example, illustrating scenarios of maximal functional redundancy and minimal functional interdependency. A, Introduction of the toy example which consists of five species in the sample, four species that can perform the function in the sample, and three additional species in the reference that can perform the function but are not present in the sample (non-optimal functional redundancy). B, All species in the sample share an equal amount of the function (maximal sample taxon-based functional redundancy, upper part), and all species capable of performing the function are present in the sample and contribute equally to the total function share (maximal reference taxon-based redundancy, lower part). C, All organisms in the sample share the same function equally. The amount of function shared corresponds to the abundance of each species (maximal abundance-based functional redundancy). D, The abundance of each species present is normalised based on the total abundance of species capable of performing the function. With minimal functional interdependency, each species in the sample shares the function in accordance with the normalised species abundance, and the species’ abundance completely explains the species’ functional share (minimal functional interdependency).

3. Materials & methods

3.1. Study samples

3.1.1. Colorectal cancer study

For an empirical application of our approach, we utilised a Japanese colorectal cancer (CRC) cohort dataset from Yachida et al. [1]. This study provided publicly available shotgun faecal metagenomic sequencing data for n = 616 individuals (365 CRC cases and 251 healthy controls), as well as mass spectrometric metabolome data, including quantifications for 450 metabolites in stool. Sequencing reads and taxonomic assignments were processed using the MetaPhlAn2 [49] pipeline. The dataset included several metadata of which age, sex, and BMI were included in our analysis. Details on metagenomic and metabolomics measurements, along with the raw metabolome and microbiome abundance table can be found in the supplementary material of the stem publication.

3.1.2. Pediatric Crohn’s disease cohort

We utilised data from a prospective cohort of pediatric Crohn’s disease patients reported by Lewis et al. [2]. Specifically, we utilised individual maximum secretion fluxes of the species in each of 108 microbiome community models previously generated in Heinken et al. [3]. The community models correspond to 20 samples characterized as inflammatory bowel disease (IBD) microbiomes with dysbiosis, 63 non-dysbiotic IBD microbiomes, and 25 control samples, with no additional metadata provided.

3.2. Generation of sample-specific COBRA community models for the CRC study

Functional annotations were derived through COBRA community modelling. For the CRC study, the relative abundances obtained from Yachida et al. [1] were mapped onto the reference set of 818 microbial metabolic reconstructions (AGORA [38]). A more detailed explanation of generating COBRA community models is noted in Hertel et al. [44]. Briefly, the created (pan-)models of AGORA reconstructions of microbes present in a sample were connected through a shared lumen compartment to create a microbiome community model of each sample. Each community model was parameterised using the biomass reaction of each genome-scale reconstruction present, weighted by their relative abundances, to derive a community biomass reaction. Diet constraints were applied by constraining the lower bound of the corresponding sink reaction of the diet compartment corresponding to an average Japanese Diet as described previously [44]. Each model was built in MATLAB (R2018b, MathWorks, Inc.) using the COBRA toolbox [41] and the Microbiome Modeling Toolbox [42], [50]. Personalised models for the 108 IBD samples were also generated using AGORA reconstructions, following the same methodology applied to the CRC study [3].

3.3. Calculation of individual maximum secretion fluxes of COBRA community models

The maximum secretion fluxes for the CRC study were computed for each metabolite in every COBRA community model where at least one exchange reaction into the lumen compartment was available via the predictMicrobeContributions function of the COBRA toolbox. The function employs Flux Variability Analysis (FVA) to determine the maximum and minimum fluxes of internal exchange reactions into the lumen compartment, with the community biomass serving as the initial objective function, maximised and subsequently held constant. The function provided individual maximal secretion fluxes for 91 metabolites across all communities and microbes in the CRC study. The computed individual secretion fluxes for the finally analysed metabolites can be found in the Table S1. We inverted the sign of the output values, such that a positive value of each flux represents secretion, and a negative value represents uptake, following this convention throughout the text. All simulations were performed in MATLAB (Mathworks, Inc.) version R2021a with IBM CPLEX (IBM) as the linear programming solver. A formal operationalisation of functional redundancy measures based on the computed individual secretion fluxes can be found in the Supplementary Material.

3.4. Calculation of functional redundancies in the IBD and CRC study

All functional redundancy and functional interdependency measures, simulations, and statistical analyses were conducted using R version 4.4.2. We have created an R package, FunRed, to calculate functional redundancy and interdependency measures, which includes instructions and descriptions of each index, demonstrated using the toy example from the manuscript (with corresponding input/output). The package is available at https://github.com/SysPsyHertel/FunRed. For calculating the Kullback-Leibler divergence for our redundancy measures, we used the KL function of the philentropy [51] package with default parameters, except for setting unit = "log" to compute divergence using the natural logarithm. We normalised the abundances for each sample prior to computing functional redundancy and functional interdependency to ensure that the shares summed up to 1. For computing Shannon entropy and Simpson’s diversity we utilised the vegan [52] package. We calculated species evenness by dividing Shannon entropy by the logarithm of species richness. All calculations and measures reliant on abundances were calculated exclusively from the abundance of the species mapped onto AGORA.

We excluded samples with fewer than 25 species in their respective community models, resulting in 612 analysed community models for the CRC study (out of 616) and 102 community models for the IBD study (out of 108). The COBRA community models were constructed at the species level for the CRC study and at the strain level for the IBD study. For each community model, we set measures of functional redundancy and functional interdependency as missing if in a simulation of a sample-metabolite pair, at least one microbe had to take up the metabolite to optimise the community biomass reaction, or if there were no microbes in the community capable of secreting the metabolite. Subsequently, for the analyses of metabolic functional redundancy measures, we exclusively considered metabolites with over 80 % non-missing values of functional redundancy across the community models. This criterion led to the inclusion of 45 metabolites for the CRC study and 161 metabolites for the IBD study.

3.5. Statistical analysis

For descriptive purposes, metric variables were characterised by means and standard deviations and categorical variables by proportions. For all analyses, we used the natural logarithm for any log transformation. P-values are reported as two-sided. For multiple testing correction, we utilised the Bonferroni correction. The significance level was set to α=0.05 or 0.05#tests when adjusting with the Bonferroni method. Bonferroni-adjusted p-values in the supplement were reported using the p.adjust function of R. Linear regressions were performed using the standard lm function in R. For visualisations of regression results, regression fits are displayed with 95 % confidence bands.

3.5.1. Statistical analysis of the CRC study

To test the relation between the three functional redundancy measures with each other, we performed regressions across all three possible combinations without any adjustments. Furthermore, we fitted linear regressions with the intercepts of the regressions between reference and sample taxon-based functional redundancy measures as the dependent variable and the rarity of a function (e.g., the number of producing species) as the sole predictor. To explore the statistical relations between functional redundancy and species diversity, we used linear regression models, treating Shannon entropy as response variable and the respective functional redundancy measure as predictor of interest, without adjusting for additional covariates. For testing on associations between functional redundancy measures and faecal concentrations, we log-transformed the faecal concentrations. Of the 45 metabolites for which we calculated functional redundancy measures, 21 metabolites had non-missing faecal measurements in at least 50 % of cases after log transformations and were included in the analysis. We used linear regression models with the log-scaled faecal concentrations as response variable, the corresponding functional redundancy measure as predictor of interest and adjusted for possible confounders: Age, BMI, sex, and health status (CRC: yes/no). Permutation tests were performed by randomly permuting the corresponding redundancy measure 10,000 times, refitting the linear model on each permuted dataset, and calculating the p-value by comparing the observed coefficient’s p-value to the distribution of p-values obtained from the permuted datasets. For the analyses of CRC health status associations with functional redundancy and interdependency, each functional redundancy measure and the log-transformed interdependency index were treated as response variables in a separate linear regression model. Health status was chosen as the predictor of interest for each model while adjusting for age, BMI and sex.

3.5.2. Statistical analysis of the IBD study

We performed an analysis of variance (ANOVA) using the standard aov function in R to assess differences among the three stratification groups. Planned contrasts were applied to compare the average between IBD patients and healthy controls, with the contrast defined using weights (-1, 0.5, 0.5) representing (Healthy, IBD non-dysbiotic, IBD dysbiotic). The contrasts and analyses were implemented using the emmeans [53] package. Linear regressions were conducted, treating the corresponding redundancy-measure as the response variable and the disease group as independent variable while adjusting for Shannon entropy. Wald tests were performed for determining the significance of the three-valued categorical variable using the car package in R.

3.6. Comparing community-level functional redundancy with trait-based functional redundancy

For computing community-level functional redundancy based on Rao’s quadratic entropy, we used the rao.diversity function of the SYNCSA [54] R package, which, by default, calculates the dissimilarity matrix using the square root of the one-complement of Gower’s similarity. Functional redundancy in this sense can be defined as the portion of a community’s alpha taxonomic diversity not explained by its alpha functional diversity. A commonly used approach is to quantify the difference between Simpson’s diversity and functional diversity (represented by Rao’s quadratic entropy): FR=DQ, where D represents Simpson’s diversity (D=1i=1nai2), and Q represents Rao’s quadratic entropy (Q=i=1nj=1ndijaiaj), for a community containing n-species. Thus:

FR=1i=1nai2i=1nj=1ndijaiaj, (6)

with dij, i,j{1,,n} being dissimilarity weights between species Si and Sj, i,j{1,,n} ranging from 0 (no dissimilarity) to 1 (complete dissimilarity), with dii=0 for all i{1,,n}. When species traits are maximally dissimilar (dij=1, for all ij), Q is maximal and equals Simpson’s diversity, D. The value of FR is always scaled from 0 to 1, where 0 indicates minimal redundancy and 1 represents maximal redundancy.

3.7. Simulation for characterising the effect of unclassified species with unknown contributions

For simulating the effect of unclassified species and for comparison with community-level functional redundancy measures, we utilised synthetic communities sampled from Dirichlet distributions using the rdirichlet function of the MCMCpack [55] R package. For simulating unknown species, we first simulated 1000 iterations of functional contributions of 100 species with f~=f~1,,f~100 and abundances a=a1,,a100. The parameters α1,,α100 of the Dirichlet distribution were set randomly with a probability of 50 % either to α=α1==α100=1 (indicating an uneven distribution) or to α=α1==α100=5 (indicating a more even distribution). Next, we randomly reclassified 10 species and their contributions as “unknown”. Afterward, we recalculated the measures of functional redundancy on the normalised and shortened vectors (having now only 90 entries) and repeated this process iteratively until a total of 80 species had been reclassified as unknown.

4. Results

We structure the results in three parts. First, we explore the relation of the introduced functional redundancy measures to previously utilised community-level functional redundancy operationalisations [28], [46], [47], [56], [57]. Second, we provide simulations to investigate the effects of unclassified species with unknown functions on the information-theory based functional redundancy measures. Finally, we present the results of the analysis of microbiome functional redundancies in CRC and IBD.

4.1. Trait-based functional redundancy and community-level functional redundancy

First we demonstrate that the introduced single-trait functional redundancy measures are neither mathematically nor conceptually a special case of equation of the multi-trait community-level functional redundancy measures as formalised in Eq. (6). To this end, we calculate the dissimilarity weights dij in Eq. (6) based on one single function, and compare both approaches in an extreme case of one keystone species being solely responsible for a certain function among a reference of 100 species.

We now calculated the functional redundancies for a series of increasing species richness, from 1 to 100, such that f#1=1=a#1, f#2=1,0 and a#2=12,12, continuing until f#100=1,0,,0 and a#100=1100,,1100. In this scenario, reference taxon-based functionality is always minimal and therefore log(100) (Supplementary Material, Attribute 2), since regardless of the species diversity and species richness, only one species can contribute to the function (Fig. 2A). If this species would be lost, the community would lose the entire function, which refers to a case of minimal functional redundancy with respect to the investigated trait. In contrast, FR as calculated by Eq. (6) increases strongly with the size of the community (Fig. 2B), and thus does not reflect this scenario of minimal functional redundancy.

Fig. 2.

Fig. 2

Functional redundancy illustrated with a single species performing a function while increasing species richness with evenly distributed abundances for both, trait-based functional redundancy and community-level functional redundancy and the effect of unknown species on functional redundancy measures. A, Trait-based functional redundancy as captured by our derived functional redundancy measures. The dashed line represents minimal functional redundancy (log(100)) and maximal functional redundancy (0). The reference-based functional redundancy measure remains minimal and independent of species richness (blue), while sample- and abundance-based redundancy decrease from maximal functional redundancy (one species in the community) to minimal functional redundancy (100 species in the community). B, Community-level functional redundancy based on Rao’s quadratic entropy. Functional redundancy (FR) is minimal with a single species and increases monotonically with species richness, converging to maximal functional redundancy of FR (1). C, Simulations for the three redundancy measures with varying the alpha parameter, where α=5 representing more even distributions and α=1 more uneven ones.

The sample taxon-based and abundance-based measures, being equivalent in this specific scenario (Supplementary Material, Attribute 3), behave very differently to FR as well (Fig. 2A). With increasing species richness, sample taxon-based and abundance-based functional redundancy will decrease, as the trait is dependent on a decreasing share of the total abundance and/or number of species, while FR will increase. In conclusion, multi-trait measures such as Eq. (6) do not capture functional redundancy in the sense of Definition 1 in contrast to the operationalisation introduced in this manuscript.

4.2. Impact of unknown species on measures of functional redundancy

An important problem in microbiome analysis is the presence of unclassified species with unknown functional contributions. Attribute 4 (Supplementary Material) shows that an increasing number of unclassified species will introduce an overestimation of the true functional redundancy if those species do not contribute to the function of interest. Thus, with rare functions, where unclassified species will be unlikely to exhibit the function, higher shares of unclassified species will lead to overestimation of the true functional redundancy. However, for other situations, the bias will be highly dependent on the distributional aspects of f and a, making it challenging to derive insights via closed-form solutions.

To nevertheless derive insights on the effect of unclassified species in a more general setting, we performed 1000 simulations of communities consisting of 100 species with functional contributions f~=f~1,,f~100 and abundances a=a1,,a100 via parameterized Dirichlet distributions (see Methods).

Introducing an increasing share of unclassified species with unknown functional contributions led to a higher distributional spread for all functional redundancy measures, reflecting increased random error and thus a higher uncertainty. The uncertainty was more pronounced with uneven distributions (α=1) than in scenarios with more even distributions (α=5) (Fig. 2C). Importantly, while for the sample taxon-based and abundance-based functional redundancy measure, no systematic error could be detected in this simulation setup, the true reference-based measure was systematically underestimated in the presence of unclassified species, which can be also theoretically explained (Supplementary Material, Attribute 5). As a result, systematic biases will be introduced when comparing reference taxon-based functional redundancies across populations with different shares of unclassified species. Thus, while the abundance-based and sample taxon-based measure suffer from increased random error with high numbers of unclassified species, the reference taxon-based measure will exhibit a systematic error underestimating the true functional redundancy.

4.3. Empirical applications using public metagenomic data

We calculated functional redundancy measures for 612 participants and 45 metabolites via employing COBRA community modelling using the faecal metagenomics data from Yachida et al. [1]. Table 1 and Fig. 3 provide an overview of the study, including characteristics of the utilised public dataset and summary statistics of the COBRA community models.

Table 1.

Sample characteristics of the study.

CRC Patients
(n = 365)
Healthy controls
(n = 247)
P-value
Age, mean (SD) 62.41 (9.90) 60.81 (12.59) 9.34e−02a
BMI, mean (SD) 22.95 (3.56) 22.68 (3.04) 0.30a
Female, % 39.18 % 46.15 % 9.52e−02b
Species richness, mean (SD) 69.75 (18.31) 64.70 (14.80) 1.87e−04a
# Metabolites produced, mean (SD) 209.25 (7.96) 208.48 (8.02) 0.25a
# Reactions in community models, mean (SD) 83310.11 (20725.58) 77674.06 (16850.53) 2.40e−04a

CRC = Colorectal cancer, SD = Standard deviation,

a

Two-sided t-test,

b

Fisher’s exact test

Fig. 3.

Fig. 3

Overview of the utilised dataset. A, Mapping of the measured faecal metabolome onto the resource of genome-scale reconstructions (AGORA). B, Mapping of the detected species onto AGORA.

4.3.1. Taxon-based and abundance-based measures of functional redundancy are complimentary

First, we analysed the empirical relations across the different operationalisations of functional redundancy introduced above (sample taxon-based vs. reference taxon-based vs. abundance-based, Table S4). Generally, R-squared values for sample taxon-based and reference taxon-based functional redundancy were substantial for most metabolites (R-squared > 0.4 for 41 of 45 metabolites). However, for taurine, cholic acid, L-cystine, and xanthosine, only moderate R-squared values (0.19–0.40) were observed, highlighting that reference and sample taxon-based functional redundancy are related but not equivalent across functions. The statistical relation is likely weakened for these four metabolites due to the strong difference between the number of species that could in theory produce a metabolite in the reference and the number of species with the function found in each sample (Table S4), introducing divergence between the reference-based measure and the sample-based measure. The intercept of the generally linear relation between reference- and sample taxon-based functional redundancy (Figure S1A) was logarithmically dependent both on the number of species capable of secreting the metabolite within the sample (Figure S1B) and the reference (Figure S1C), as expected given the mathematical properties of Eqn 1 and Eqn 2. In contrast, abundance-based functional redundancy measures correlated generally only little with taxon-based functional redundancy measures with the maximal R-squared being 0.15 (N-Acetyl-D-glucosamine, Figure S1D). In most cases, a slight negative correlation was observed, providing statistical evidence that the operationalisation of abundance-based and taxon-based functional redundancy capture two biologically distinct concepts. In essence, having many taxa being able to perform a function (high taxon-based functional redundancy) did not statistically correlate with having a high abundance of such taxa (high abundance-based functional redundancy). One explanation could be that having many similar species may lead to competition, limiting thereby each individual species in its abundance.

4.3.2. Functional interdependency indices reveal extent of species-species interactions on metabolite secretions

Recall that the functional interdependency index of a metabolite is zero if and only if the maximum secretion of the metabolite by a microbe is given by the same linear function of its abundance for all producing microbes in a sample. The higher the deviation from this proportionality attribute, the higher the functional interdependency index. In Fig. 4A, we demonstrate this concept using Sutterella parvirubra and its maximal in silico secretion of fumaric acid and trimethylamine N-oxide (TMAO). The maximal secretion of fumaric acid, a TCA cycle intermediate, by Sutterella parvirubra is strictly proportional to its abundance, independent of the community composition and thus the abundances of other species. In contrast, for TMAO, this strict proportionality is lost. The latter is the case if 1) a species does not contain the entire pathway for the production of a metabolite, or 2) metabolite availability in the production pathway is constrained by the presence or absence of other species in the community. Fig. 4B illustrates the union of metabolic pathways in AGORA reconstructions leading to TMAO production from choline. While Sutterella parvirubra encodes for a trimethylamine N-oxide reductase, enabling the conversion of trimethylamine (TMA) to TMAO, it lacks the enzymatic capacity to synthesize TMA from choline or betaine, rendering its TMAO production dependent on the presence of other species capable of TMA synthesis. The above species-interdependency pattern is not unique to Sutterella parvirubra, as fumaric acid exhibits lower functional interdependency indices than TMAO across all species (Fig. 4C), highlighting the broad relevance of metabolic cross-feeding interdependencies in TMAO production in comparison to other metabolites such as fumaric acid or butyric acid.

Fig. 4.

Fig. 4

Results on functional interdependency using the CRC cohort. A, Maximal secretion potential (in mmol per (gram dry weight (gdW) * hour (h)) of Sutterella parvirubra of fumaric acid and trimethylamine N-oxide (TMAO) as a function of its abundance and linear dependency (dashed line). B, Pathway of AGORA reconstructions to produce Trimethylamine N-oxide, pathway in red is present in Sutterella parvirubra, whereas pathways in blue are not present in the reconstruction of Sutterella parvirubra. C, Boxplots of interdependency indices for each of the 45 metabolites, sorted by their interquartile range from lowest to highest. D, Scatterplot of the total abundance of species that can produce the metabolite to the abundance-based redundancy measure for butyric acid (blue), L-lactic acid (red) and succinic acid (yellow).

Importantly, the presence of functional interdependencies leads to lower abundance-based functional redundancy, given the same share of producing species in a community, as Fig. 4D shows for three common fermentation products (succinate, lactate, and butyrate). This is a plausible attribute. Consider, for example, a pathway that needs the cooperation of multiple species in comparison to a pathway that is present in each species alone. Losing one species will cause the loss of the function in the former case, while the function will be retained in the latter case. Thus, the functional interdependency index allows the mapping of the extent of interdependencies within a microbial community with regard to metabolic functions, as the TMAO example highlights.

4.3.3. Rarity and functional interdependencies influence the relation between species diversity and functional redundancy

Given the importance of measures of species diversity in microbiome analysis and their theoretical relation to the proposed operationalisation of functional redundancy, we explored their relations to each other. In fact, in the specific scenario where every species can perform the function and there are no interdependencies between species, the vector of shares of functions, f~, is equal to the vector of abundances, a (Supplementary Material, Attribute 1). In this case, the taxon-based measures are equivalent to Shannon entropy, except for a normalisation term. If this congruence would persist universally across all samples and functions, calculating functional redundancy following our operationalisations would carry no additional information in comparison to calculating the Shannon entropy as a measure of species diversity.

However, in the general case, the Shannon entropy and functional redundancy measures, as defined in this work, are not interchangeable. As rare functions impose a restriction in f~, we expect that the statistical relationship between functional redundancy and species diversity weakens for rare functions, in particular if the total sum of abundances of species that can perform the function is low. As interdependencies introduce further variation in the divergence between a and f~, we anticipate a diminished statistical relation between taxon-based functional redundancy and species diversity with increased interdependency.

Confirming the hypothesis that high species diversity leads to greater taxon-based functional redundancy, most but not all statistical associations between species diversity and functional redundancy measures were positive (Table S5). The strength of association was generally stronger for the reference taxon-based functional redundancy than for the sample taxon-based functional redundancy (Figure S2), which can be explained by varying normalisation terms across samples for the sample-based functional redundancy coefficient.

As supposed, the strength of the relationship was dependent on the commonness of a metabolic function across the sample, coupled to the total abundance of species that were capable of executing the function (Fig. 5A,C). Metabolite secretions characterised by a high number of producing species, such as succinate, with low functional interdependencies (medIsuccinate=1.42e04), demonstrated strong correlations between species diversity and taxon-based functional redundancy. For glycine, having comparable numbers and total abundances of producing species to succinate but higher interdependencies (medIglycine=0.04), a lower R-squared value was observed. For functions that were rare and displayed a high level of functional interdependencies, the relation between species diversity and taxon-based functional redundancy was in tendency weaker. In line with our expectation, we did not observe high R-squared values between species diversity and measures of taxon-based functional redundancy for functions with high interdependencies (Table S5, Fig. 5B,D).

Fig. 5.

Fig. 5

Scatterplots and regression lines for R-squared values between species diversity and measures of functional redundancy (Y-axis) against average number of producing species and the median functional interdependency index (X-axis) across all metabolites. In each scatterplot, the metabolites are coloured based on their average total species abundance that can perform the function. A, R-squared values (Y-axis, variance explained in sample taxon-based functional redundancy through species diversity) against average number of species with the function in a sample, and R-squared values (Y-axis, variance explained in sample taxon-based functional redundancy through species diversity) against median functional interdependency. B, R-Squared values (Y-axis, variance explained in reference taxon-based functional redundancy through species diversity) against average of species with the function in a sample, and R-squared values (Y-axis, variance explained in reference taxon-based functional redundancy through species diversity) against median functional interdependency. C, R-squared values (Y-axis, variance explained in abundance-based functional redundancy through species diversity) against average number of species with the function in a sample, and R-squared values (Y-axis, variance explained in abundance-based functional redundancy through species diversity) against median functional interdependency.

However, these arguments and statistical relations do not generally apply to abundance-based functional redundancy (Fig. 5E,F). While also negative in most cases, the statistical relationship between species diversity and abundance-based functional redundancy was generally weaker than for taxon-based measures and more often significantly reversed (Table S5). The latter effect highlights that high abundance-based functional redundancy can be achieved in ecologically non-diverse communities, where individual microbes expressing a certain function may dominate the community in terms of abundance. We also tested species richness, species evenness, and Simpson’s diversity as predictors, respectively, in analogous analyses, with species evenness and Simpson’s diversity yielding similar results, while the results for species richness differed in their overall structure (Table S6, Figure S3). In particular, R-squared values for species richness were much lower in general compared to Shannon diversity, Simpson diversity, and evenness (Tables S5, S6 and Figure S3), demonstrating that species richness in itself is only a weak predictor of functional redundancy.

4.3.4. Measures of functional redundancy are associated with the host faecal metabolome

Next, we analysed the statistical relations between measures of functional redundancy and host faecal metabolite concentrations for 21 of the 45 analysed metabolites (Fig. 3), utilising data from Yachida et al. [1]. In fully adjusted linear regression models, a total of 20 pairs achieved nominal significance with positive and negative associations. Seven associations remained significant after applying the Bonferroni correction for 63 tests (p < 7.94e-04, Table S7, Fig. 6). The top three hits belonged to the reference-based measure of gamma-Aminobutyric acid (GABA) (b = −0.85, 95 % CI: (-1.16, −0.54), p = 1.45e-07), the abundance-based measure of 1,5-Diaminopentane (cadaverine) (b = 0.32, 95 % CI: (0.22, 0.42), p = 3.94e-09), and the abundance-based measures of butyrate (b = 0.26, 95 % CI: (0.18, 0.34), p = 7.17e-10). Each of the three measures of functional redundancy of cadaverine was significantly positive correlated after multiple testing with its corresponding measured log faecal concentration (Fig. 6E-G). From the 21 tested metabolites, four were exclusively derived from the microbiome. Among these, three demonstrated a Bonferroni-corrected significant association with a redundancy measure. In contrast, only two metabolites that are co-secreted by the host showed similar associations, suggesting an enrichment of microbiome derived metabolites (Fisher’s exact test, p = 0.03, Table S7). As cadaverine and butyrate secretion are functions with close to zero functional interdependencies, higher abundance-based functional redundancies refer to communities with a large proportion of species that can produce the corresponding metabolite. Given that the average number of species able to secrete cadaverine in a community is relatively low (2.76, SD = 1.79), abundance-based functional redundancy measures close to zero, along with a high log faecal concentration of cadaverine, suggests the presence of a few key species that are crucial for cadaverine secretion. In contrast, as GABA and taurine are also produced by human pathways, the respective associations are not easily interpreted. In conclusion, the associations show that functional redundancy measures have correlates in the faecal host metabolome, in particular for microbiome-derived metabolites.

Fig. 6.

Fig. 6

Scatterplots between measures of functional redundancy and log faecal concentrations of metabolites with fitted regression lines (blue) and confidence bands (grey). A, Scatterplot and fitted line between reference taxon-based functional redundancy of gamma-Aminobutyric acid and the log faecal concentration of gamma-Aminobutyric acid. B, Scatterplot and fitted line between reference taxon-based redundancy of isovaleric acid and the log faecal concentration of isovaleric acid. C, Scatterplot and fitted regression between reference taxon-based redundancy of taurine and the log faecal concentration of taurine. D, Scatterplot and fitted regression line between abundance-based redundancy of butyric acid and the log faecal concentration of butyric acid. E, Scatterplot and fitted regression line between sample taxon-based redundancy of 1,5-Diaminopentane and the log faecal concentration of 1,5-Diaminopentane. F, Scatterplot and fitted regression line between reference taxon-based redundancy of 1,5-Diaminopentane and the log faecal concentration of 1,5-Diaminopentane. G, Scatterplot and fitted regression line between abundance-based redundancy of 1,5-Diaminopentane and the log faecal concentration of 1,5-Diaminopentane.

4.3.5. Colorectal cancer affects the relation between functional redundancy and functional interdependency

To highlight the potential analysis routes incorporating clinical meta-data into the analysis of functional redundancy, we analysed the effect of colorectal cancer on measures of functional redundancy and functional interdependency. Notably, we observed a significant difference in species richness (number of species) in CRC patients compared to controls (b = 4.19, 95 % CI: (1.51, 6.86), p = 2.20e-03) and a marginally significant difference in species diversity (Shannon entropy) (b = 0.07, 95 % CI: (-2.62e-03, 0.14), p = 0.06). A total of 24 functional redundancy-measures, most of them belonging to the reference taxon-based measure (16), and eight functional interdependency indices showed nominal significance (Table S8). After adjusting for 180 tests with the Bonferroni method, the reference taxon-based functional redundancy of pyruvate (b = 0.13, 95 % CI: (0.06, 0.20), p = 2.23–04, Table S8) was the only measure achieving statistical significance (p < 2.78e-04). Despite none of the interdependency measures being significant after multiple testing correction, all but one nominally significant associations demonstrated positive coefficients. Indeed, the log-scaled global functional interdependency index revealed a positive association between global functional interdependency and CRC (b = 1.16, 95 % CI: (0.15, 2.17), p = 0.02), indicating that CRC microbiomes display higher levels of functional interdependencies.

4.3.6. Inflammatory bowel disease impacts functional redundancy beyond species diversity

To showcase a situation of a disease with well-evidenced influence on species diversity, we utilised an existing inflammatory bowel disease (IBD) study, from which we analysed 102 microbiome community models (n = 17 dysbiotic IBD, n = 60 non-dysbiotic IBD, n = 25 healthy controls). We computed redundancy measures for 161 metabolites (Table S3). As expected, species richness (ANOVA: F = 14.78, p = 2.43e-06) and Shannon diversity (ANOVA: F = 5.52, p = 5.35e-03) were clearly lower in the IBD conditions (Fig. 7A).

Fig. 7.

Fig. 7

Boxplots illustrating ecological diversity, hydrogen sulphide-specific outcomes, and measures of functional redundancy in healthy, IBD non-dysbiotic, and IBD dysbiotic samples. A, Species richness and species diversity. B, Number and share of hydrogen sulphide producing species. C, Reference taxon-based measures of functional redundancy for metabolites implicated in sulphur metabolism. D, Sample taxon-based measures of functional redundancy for metabolites implicated in sulphur metabolism. P-values are derived from ANOVA analyses without incorporating any contrasts.

After Bonferroni correction across 483 tests (161 metabolites × 3 redundancy measures), 122 metabolites showed significant differences (p < 1.04e-4) in at least one redundancy measure (Table S9). Most of the significant changes were observed in the reference-based measure (87 metabolites), with all but two showing a reduction in redundancy for IBD patients compared to controls (Table S10). Even when adjusting for the loss of species diversity, 121 associations with the disease status (healthy vs. dysbiotic IBD vs. non-dysbiotic IBD) were significant after multiple testing correction, demonstrating that functional redundancy measures carry information beyond species diversity (Table S11). Generally, the sample-based measure showed lower effect sizes than the reference-based measure (Fig. 7D), which could be due to reduced comparability of sample-based measures across communities with strong differences in species richness.

For illustration, we focus on sulphur metabolism, as sulphur metabolism has been implicated in the pathophysiology of IBD [58], [59], [60], [61]. Reference taxon-based measures reveal substantial losses in functional redundancy in IBD patients compared to controls, particularly for methionine (b = −0.90, 95 % CI: (-1.14, −0.66), p = 3.45e-11), and taurine (b = −0.99, 95 % CI: (-1.22, −0.76), p = 1.97e-13). Remarkably, reference taxon-based taurine functional redundancy measures perfectly separated between dysbiotic IBD cases from healthy controls (Fig. 7C), indicating that taurine functional redundancy could be a dysbiosis biomarker in IBD. However, other important sulphur compounds, such as sulphate and hydrogen sulphide, did not show similar declines (Fig. 7C). Interestingly, despite the loss in species richness, no decrease in hydrogen sulphide-producing species was observed, leading to an increased proportion of hydrogen sulphide-producing species in IBD patients (Fig. 7B). As a consequence, the reference-based measure for hydrogen sulphide did not show significant differences of IBD patients compared to controls, while the sample-based measure even increased. This result highlights how the functional changes in IBD microbiomes driven by a strong loss in species diversity can be fine-mapped by examining functional-specific changes that are not captured by species diversity alone.

5. Discussion

Species diversity of the human gut microbiome is thought to contribute to human health and disease through functional redundancy. Here, we introduce and analyse an information-theoretic approach to functional redundancy that enables the function-specific quantification of functional redundancy. The herein introduced operationalisation of functional redundancy is markedly different from previous operationalisations based on community-level multi-trait functional dissimilarity measures [28], [46], [47], [56], [57], which are not easily interpreted and meaningfully applied to the analysis of functional redundancy of individual traits.

To clarify the scope, we defined functional redundancy in three different contexts. With the sample taxon-based functional redundancy, the quantification is coupled to the species prevalent in a concrete sample, whereas reference taxon-based functional redundancy is calculated by taking all species into account that could perform a function in theory based on references such as AGORA [38] or AGORA2 [37]. Abundance-based functional redundancy, on the other hand, is defined on the abundance of organisms that could perform a function. The three operationalisations are statistically interrelated (Fig. S1, Table S4) but neither statistically, nor conceptually redundant to each other, since they capture different distributional aspects of functions across microbiomes and reflect different ways to understand functional redundancy.

Which of the operationalisations should be preferred depends on the research question and study design aspects. For example, in a situation without a suitable reference or in situations with highly variable amounts of unclassified species, the sample-based measure could be superior to the reference-based measure, as the latter suffers from systematic bias from unclassified species. In contrast, the sample-based measure, due to its dependence on the species richness, may suffer from less comparability across populations with equal shares of unclassified species but strong differences in species richness.

Our approach also opened up the possibility to derive measures of functional interdependency across species. The degree of species-species interactions has been challenging to determine [62], [63], [64], [65], even within relatively simplified and controlled modelling environments. Our operationalisation gives a computationally feasible solution to assess the extent of functional interdependencies on metabolite secretions. The analysis shows, for example, that microbial TMAO production is a pathway with strong species-species interdependencies. This result is of potential clinical interest, since TMAO has been implicated in atherosclerosis and chronic kidney disease [66], [67], [68], [69], [70].

From an information-theoretic perspective, the interdependency index reflects the extent of information encoded within species abundances when modelling a function of a microbial community using their abundances. Similar to the concept of conditional entropy H(Y|X) of two random variables that is zero if and only if Y is completely determined by X, the functional interdependency index is zero if and only if the functional output of each species is a linear function of its abundance. Therefore, the functional interdependency index may be used in investigating causal structures of microbiomes [44], [71], [72] as, for example, in structural equation models [73], [74], as conditional entropy is already used for identifying causal relations [75], [76], [77]. However, the interdependency index does not capture all aspects of species-species interdependencies as ecological relationships such as symbiosis or competition cannot be inferred from the measure. It also does not capture other aspects important to species-species relations such as host physiology or dietary behaviour.

Analysing the relation between species diversity and measures of functional redundancy, we observed that for rare functions and functions with high interdependencies, functional redundancy is seemingly determined by the presence and abundance of specific microbes rather than by the species diversity in general. In contrast, with common functions characterised by low functional interdependencies, high species diversity will often directly lead to high functional redundancy. These insights may help understanding the observed relation between high species diversity in the gut microbiome and human health [78], [79], [80], [81]. If functional redundancy is the mediating factor between species diversity and human health, this would point to relatively common functions with low species-species functional interdependencies such as butyrate production, for example. Thus, the concepts allow for the targeted investigation of individual metabolic functions mediating between human health and microbial diversity.

In this vein, our results highlight evidence for increased species-species functional interdependencies in CRC microbiomes. This finding may reflect less functional stability in CRC microbiomes, contributing thereby to CRC specific dysbiosis. We also demonstrated that the introduced measures have correlates in the host metabolome, in particular for microbiome-specific metabolites such as cadaverine and butyrate. In the case of butyrate, an important short chain fatty acid and nutrient for colonocytes [82], [83], [84], [85], we could show that higher functional redundancy is associated with higher faecal butyrate levels. Noteworthy though, functional redundancy in general is not necessarily indicative of “high function”, since functional redundancy describes a potential to retain a function under perturbation rather than the absolute level of a function. In general, as the introduced operationalisation is based on a projected potential to retain a function under biomass loss in contrast to an observed capacity, results from such analyses have hypothetical character and thus are warranting further validations.

In a further example, our results demonstrated that the introduced redundancy measures can fine-map the changes in IBD-associated microbiomes, which are generally associated with a loss of species diversity [86], [87], [88], [89]. While IBD mostly led to reduced functional redundancy across a wide range of functions in our analysis, certain key functions such as hydrogen sulphide production did not suffer from a loss in functional redundancy despite decreased sulphur metabolism diversity in IBD [3]. Hydrogen sulphide is a pro-inflammatory metabolite [90], [91], generated during the breakdown of sulphur-containing amino acids [92]. It plays a pivotal role in modulating inflammatory responses through the activation of immune cells and the production of pro-inflammatory cytokines [93], [94]. Thus, while functional redundancy is overall reduced in IBD, communities stay remarkably functionally redundant with regards to hydrogen sulphide production with possible clinical implications.

Several limitations of our approach have to be kept in mind. The information theoretic approach to functional redundancy can, in principle, be applied to any ecosystem where quantitative data on species-specific contributions to a single function or an aggregated function (e.g., based on pathways) along with their abundances exist. However, species-specific contributions are often lacking. For example, functional profiles derived from 16S rRNA gene sequencing data are commonly utilised in tools such as PICRUSt2 [95], [96], [97]. In contrast to our approach, such methods based on aggregated gene abundances on the community level do not provide a suitable basis for the introduced information theoretic approach. Furthermore, our definitions are not specifically designed to capture aspects of functional redundancy as represented, for example, in time-series data. Future research may show whether suitable and fruitful generalisations to longitudinal settings within the outlined framework of information theoretic measures can be formulated. Using constraint-based reconstruction and analysis (COBRA) to model the maximum secretion fluxes of species that form a microbial community comes with several limitations that should be noted. The applied modelling relies on the simulation of fluxes (not concentrations) under the steady state assumption, meaning that the input for the functional redundancy measures are based on simplifications and may not hold in situations of dynamic challenges (e.g., antibiotics intake, changes in diet). Moreover, the modelling approach neglected the spatial structure of the gut microbiome. Microbiome functions might vary in different biofilms [98], [99], compartments [100] or host characteristics which we did not model. However, ecosystem compartmentalisation can be integrated by calculating functional redundancy in a compartmentalised way if sufficient information about the spatial distribution of species is available. Flux calculations rely on correct representation of the metabolic capacities of microbial species in the knowledge-base. However, certain pathways and species are better researched than others and the thereby introduced biases represent a core limitation of knowledge-based modelling techniques such as COBRA. Furthermore, the introduced measures of functional redundancy are constructed on relative abundances and relative functional shares. While in the case of microbiome data, compositional data is normally the data-type available, it is plausible that important information on functional redundancy is coded in absolute abundances and absolute function values.

6. Conclusions

Formalising measures of metabolic functional redundancy via information theoretical concepts offers a promising avenue for understanding microbial community functions in relation to microbiome diversity. Building on the concept of relative entropy, we introduced information-theoretic operationalisations not only of functional redundancy in various contexts, but also of functional interdependency within microbiomes. Our work shows 1) that the relation between microbiome diversity and patterns of functional redundancy is dependent on the attributes of the function of interest, 2) microbiome functional redundancy has correlates in the host’s metabolome, and 3) can reveal hidden functional differences across medical conditions as exemplified by the analysis of CRC and IBD microbiomes. We believe that the introduced measures can serve as a valuable complement to conventional species diversity measures, providing a nuanced perspective on microbial community functions and their interdependencies with a wide range of applications in microbiome research.

Funding

This work was funded by the Deutsche Forschungsgemeinschaft (DFG, GermanResearch Foundation) Project-ID 499552394 – SFB 1597 and HE9198/1-1 to J.H. A.H. received funding from the Agence Nationale de la Recherche (ANR) under the decree 2021-1710.

CRediT authorship contribution statement

Fässler Daniel: Writing – review & editing, Writing – original draft, Visualization, Software, Methodology, Formal analysis, Data curation, Conceptualization. Hertel Johannes: Writing – review & editing, Writing – original draft, Supervision, Methodology, Formal analysis, Conceptualization. Heinken Almut: Conceptualization, Data curation, Software, Writing – review & editing.

Declaration of Competing Interest

The authors declare that they have no known financial interests or personal relationships that could have influenced the work presented in this paper.

Acknowledgements

We are thankful to Carsten Dormann for valuable discussions, and express gratitude to the entire Project Group Resilience and Diversity in Complex Systems at the Academy of the Sciences and Humanities in Hamburg, namely Anne Dienelt, for inspiring debates on the interrelations of the concepts of resilience, redundancy, and diversity in multiple sciences and humanities. We also wish to thank Eleftheria Charalambous and Antoine Weihs for their assistance with visualisation and R package development.

Footnotes

Appendix A

Supplementary data associated with this article can be found in the online version at doi:10.1016/j.csbj.2025.03.012.

Contributor Information

Daniel Fässler, Email: daniel.faessler@med.uni-greifswald.de.

Almut Heinken, Email: almut-katrin.heinken@univ-lorraine.fr.

Johannes Hertel, Email: johannes.hertel@med.uni-greifswald.de.

Appendix A. Supplementary material

Supplementary material

mmc1.docx (3.3MB, docx)

Supplementary material

mmc2.xlsx (26.4MB, xlsx)

Data Availability Statement

The raw data of the utilised colorectal cancer study can be found in the stem publication and is available in [1]. The raw IBD study can be found in [2]. Utilised AGORA reconstructions can be found on https://github.com/VirtualMetabolicHuman/AGORA/. Scripts to generate microbiome community models for the CRC study and individual maximum secretion fluxes for the CRC study can be found on https://github.com/SysPsyHertel/CodeBase/tree/main/Scripts_Faessler_Functional_Redundancy. The scripts for running microbiome community models and maximum secretion fluxes for the IBD study can be found on https://github.com/ThieleLab/CodeBase/tree/master/Analysis_Heinken_npj_Syst_Biol_Appl_2021 and is explained in [3]. The R package for computing measures of functional redundancy can be found on https://github.com/SysPsyHertel/FunRed

The generated individual maximum secretion fluxes for the CRC study can be found in the Table S1. The computed functional redundancy and functional interdependency measures for the IBD and CRC study are included in this published article (Table S2 (CRC study) and Table S3 (IBD study)). All scripts and code utilised to generate the results in this work based on the generated individual maximal secretion fluxes, including simulations can be found on https://github.com/SysPsyHertel/CodeBase/tree/main/Scripts_Faessler_Functional_Redundancy. The Graphical Abstract (https://BioRender.com/tboy2bh), Figure 1 (https://BioRender.com/d2wx0pg), and Figure 4B (https://BioRender.com/eecp36i) were created in BioRender.

References

  • 1.Yachida S., Mizutani S., Shiroma H., Shiba S., Nakajima T., Sakamoto T., et al. Metagenomic and metabolomic analyses reveal distinct stage-specific phenotypes of the gut microbiota in colorectal cancer. Nat Med. 2019;25:968–976. doi: 10.1038/s41591-019-0458-7. [DOI] [PubMed] [Google Scholar]
  • 2.Lewis J.D., Chen E.Z., Baldassano R.N., Otley A.R., Griffiths A.M., Lee D., et al. Inflammation, antibiotics, and diet as environmental stressors of the gut microbiome in pediatric Crohn’s disease. Cell Host Microbe. 2015;18:489–500. doi: 10.1016/j.chom.2015.09.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Heinken A., Hertel J., Thiele I. Metabolic modelling reveals broad changes in gut microbial metabolism in inflammatory bowel disease patients with dysbiosis. Npj Syst Biol Appl. 2021;7:19. doi: 10.1038/s41540-021-00178-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Magurran A.E. Why Diversity? Ecological Diversity and Its Measurement. Springer Netherlands; Dordrecht: 1988. pp. 1–5. [DOI] [Google Scholar]
  • 5.Okpiliya F. Ecological diversity indices: any hope for one again. J Environ Earth Sci. 2012;2:45–52. https://scholar.google.com/scholar_lookup?hl=en&volume=2&publication_year=2012&pages=45-52&journal=Journal+of+Environment+and+Earth+Science&issue=10&author=F.+I.+Okpiliya&title=Ecological+Diversity+Indices%3A+Any+Hope+for+One+Again%3F [Google Scholar]
  • 6.Le Chatelier E., Nielsen T., Qin J., Prifti E., Hildebrand F., Falony G., et al. Richness of human gut microbiome correlates with metabolic markers. Nature. 2013;500:541–546. doi: 10.1038/nature12506. [DOI] [PubMed] [Google Scholar]
  • 7.Methorst J., Rehdanz K., Mueller T., Hansjürgens B., Bonn A., Böhning-Gaese K. The importance of species diversity for human well-being in Europe. Ecol Econ. 2021;181 doi: 10.1016/j.ecolecon.2020.106917. [DOI] [Google Scholar]
  • 8.Holzhausen E.A., Peppard P.E., Sethi A.K., Safdar N., Malecki K.C., Schultz A.A., et al. Associations of gut microbiome richness and diversity with objective and subjective sleep measures in a population sample. SLEEP. 2024;47 doi: 10.1093/sleep/zsad300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Cox M.J., Cookson W.O.C.M., Moffatt M.F. Sequencing the human microbiome in health and disease. Hum Mol Genet. 2013;22:R88–R94. doi: 10.1093/hmg/ddt398. [DOI] [PubMed] [Google Scholar]
  • 10.Li K., Bihan M., Yooseph S., Methé B.A. Analyses of the microbial diversity across the human microbiome. PLoS ONE. 2012;7 doi: 10.1371/journal.pone.0032118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Daly A., Baetens J., De Baets B. Ecological diversity: measuring the unmeasurable. Mathematics. 2018;6:119. doi: 10.3390/math6070119. [DOI] [Google Scholar]
  • 12.Johnson K.V.-A., Burnet P.W.J. Microbiome: should we diversify from diversity? Gut Microbes. 2016;7:455–458. doi: 10.1080/19490976.2016.1241933. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Louca S., Parfrey L.W., Doebeli M. Decoupling function and taxonomy in the global ocean microbiome. Science. 2016;353:1272–1277. doi: 10.1126/science.aaf4507. [DOI] [PubMed] [Google Scholar]
  • 14.Biswas S.R., Mallik A.U. Species diversity and functional diversity relationship varies with disturbance intensity. Ecosphere. 2011;2:art52. doi: 10.1890/ES10-00206.1. [DOI] [Google Scholar]
  • 15.Martiny J.B.H., Jones S.E., Lennon J.T., Martiny A.C. Microbiomes in light of traits: a phylogenetic perspective. Science. 2015;350 doi: 10.1126/science.aac9323. [DOI] [PubMed] [Google Scholar]
  • 16.Louca S., Jacques S.M.S., Pires A.P.F., Leal J.S., Srivastava D.S., Parfrey L.W., et al. High taxonomic variability despite stable functional structure across microbial communities. Nat Ecol Evol. 2016;1 doi: 10.1038/s41559-016-0015. [DOI] [PubMed] [Google Scholar]
  • 17.Louca S., Polz M.F., Mazel F., Albright M.B.N., Huber J.A., O’Connor M.I., et al. Function and functional redundancy in microbial systems. Nat Ecol Evol. 2018;2:936–943. doi: 10.1038/s41559-018-0519-1. [DOI] [PubMed] [Google Scholar]
  • 18.Chen H., Ma K., Lu C., Fu Q., Qiu Y., Zhao J., et al. Functional redundancy in soil microbial community based on metagenomics across the globe. Front Microbiol. 2022;13 doi: 10.3389/fmicb.2022.878978. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Fassarella M., Blaak E.E., Penders J., et al. Gut microbiome stability and resilience: elucidating the response to perturbations in order to modulate gut health. Gut. 2021;70:595–605. doi: 10.1136/gutjnl-2020-321747. [DOI] [PubMed] [Google Scholar]
  • 20.Pierre Ramond, Pierre E Galand, Ramiro Logares, Microbial functional diversity and redundancy: moving forward, FEMS Microbiology Reviews, Volume 49, 2025, fuae031, 10.1093/femsre/fuae031. [DOI] [PMC free article] [PubMed]
  • 21.Morgan X.C., Tickle T.L., Sokol H., Gevers D., Devaney K.L., Ward D.V., et al. Dysfunction of the intestinal microbiome in inflammatory bowel disease and treatment. Genome Biol. 2012;13 doi: 10.1186/gb-2012-13-9-r79. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Lloyd-Price J., Arze C., Ananthakrishnan A.N., Schirmer M., Avila-Pacheco J., Poon T.W., et al. Multi-omics of the gut microbial ecosystem in inflammatory bowel diseases. Nature. 2019;569:655–662. doi: 10.1038/s41586-019-1237-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Manor O., Dai C.L., Kornilov S.A., Smith B., Price N.D., Lovejoy J.C., et al. Health and disease markers correlate with gut microbiome composition across thousands of people. Nat Commun. 2020;11:5206. doi: 10.1038/s41467-020-18871-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.He J., Li Y., Cao Y., Xue J., Zhou X. The oral microbiome diversity and its relation to human diseases. Folia Microbiol. 2015;60:69–80. doi: 10.1007/s12223-014-0342-2. [DOI] [PubMed] [Google Scholar]
  • 25.Botta-Dukát Z. Rao’s quadratic entropy as a measure of functional diversity based on multiple traits. J Veg Sci. 2005;16:533–540. doi: 10.1111/j.1654-1103.2005.tb02393.x. [DOI] [Google Scholar]
  • 26.Schleuter D., Daufresne M., Massol F., Argillier C. A user’s guide to functional diversity indices. Ecol Monogr. 2010;80:469–484. doi: 10.1890/08-2225.1. [DOI] [Google Scholar]
  • 27.Mammola S., Carmona C.P., Guillerme T., Cardoso P. Concepts and applications in functional diversity. Funct Ecol. 2021;35:1869–1885. doi: 10.1111/1365-2435.13882. [DOI] [Google Scholar]
  • 28.Ricotta C., De Bello F., Moretti M., Caccianiga M., Cerabolini B.E.L., Pavoine S. Measuring the functional redundancy of biological communities: a quantitative guide. Methods Ecol Evol. 2016;7:1386–1395. doi: 10.1111/2041-210X.12604. [DOI] [Google Scholar]
  • 29.de Bello F., Lepš J., Lavorel S., et al. Importance of species abundance for assessment of trait composition: an example based on pollinator communities. COMMUNITY ECOL. 2007;8:163–170. doi: 10.1556/ComEc.8.2007.2.3. [DOI] [Google Scholar]
  • 30.Petchey O.L., Gaston K.J. Functional diversity: back to basics and looking forward. Ecol Lett. 2006;9:741–758. doi: 10.1111/j.1461-0248.2006.00924.x. [DOI] [PubMed] [Google Scholar]
  • 31.Clark R.L., Connors B.M., Stevenson D.M., Hromada S.E., Hamilton J.J., Amador-Noguez D., et al. Design of synthetic human gut microbiome assembly and butyrate production. Nat Commun. 2021;12:3254. doi: 10.1038/s41467-021-22938-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Siptroth J., Moskalenko O., Krumbiegel C., Ackermann J., Koch I., Pospisil H. Variation of butyrate production in the gut microbiome in type 2 diabetes patients. Int Microbiol. 2023;26:601–610. doi: 10.1007/s10123-023-00324-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Braccia D.J., Jiang X., Pop M., Hall A.B. The capacity to produce Hydrogen Sulfide (H2S) via cysteine degradation is ubiquitous in the human gut microbiome. Front Microbiol. 2021;12 doi: 10.3389/fmicb.2021.705583. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bordbar A., Monk J.M., King Z.A., Palsson B.O. Constraint-based models predict metabolic and associated cellular functions. Nat Rev Genet. 2014;15:107–120. doi: 10.1038/nrg3643. [DOI] [PubMed] [Google Scholar]
  • 35.Hertel J., Heinken A., Martinelli F., Thiele I. Integration of constraint-based modeling with fecal metabolomics reveals large deleterious effects of Fusobacterium spp. on community butyrate production. Gut Microbes. 2021;13:1–23. doi: 10.1080/19490976.2021.1915673. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Heinken A., Basile A., Hertel J., Thinnes C., Thiele I. Genome-scale metabolic modeling of the human microbiome in the era of personalized medicine. Annu Rev Microbiol. 2021 doi: 10.1146/annurev-micro-060221-012134. [DOI] [PubMed] [Google Scholar]
  • 37.Heinken A., Hertel J., Acharya G., Ravcheev D.A., Nyga M., Okpala O.E., et al. Genome-scale metabolic reconstruction of 7,302 human microorganisms for personalized medicine. Nat Biotechnol. 2023 doi: 10.1038/s41587-022-01628-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Magnúsdóttir S., Heinken A., Kutt L., Ravcheev D.A., Bauer E., Noronha A., et al. Generation of genome-scale metabolic reconstructions for 773 members of the human gut microbiota. Nat Biotechnol. 2017;35:81–89. doi: 10.1038/nbt.3703. [DOI] [PubMed] [Google Scholar]
  • 39.Tian L., Wang X-W, Wu A-K, Fan Y., Friedman J., Dahlin A., et al. Deciphering functional redundancy in the humanmicrobiome. Nat. Commun. 2020;11:6217. doi: 10.1038/s41467-020-19940-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Fang X., Lloyd C.J., Palsson B.O. Reconstructing organisms in silico: genome-scale models and their emerging applications. Nat Rev Microbiol. 2020;18:731–743. doi: 10.1038/s41579-020-00440-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Heirendt L., Arreckx S., Pfau T., Mendoza S.N., Richelle A., Heinken A., et al. Creation and analysis of biochemical constraint-based models using the COBRA Toolbox v.3.0. Nat Protoc. 2019;14:639–702. doi: 10.1038/s41596-018-0098-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Heinken A., Thiele I. Microbiome Modelling Toolbox 2.0: efficient, tractable modelling of microbiome communities. Bioinformatics. 2022;38:2367–2368. doi: 10.1093/bioinformatics/btac082. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Heinken A., Basile A., Thiele I. Advances in constraint-based modelling of microbial communities. Curr Opin Syst Biol. 2021;27 doi: 10.1016/j.coisb.2021.05.007. [DOI] [Google Scholar]
  • 44.Hertel J., Heinken A., Fässler D., Thiele I. Causal inference on microbiome-metabolome relations in observational host-microbiome data via in silico in vivo association pattern analyses. Cell Rep Methods. 2023;3 doi: 10.1016/j.crmeth.2023.100615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hertel J., Harms A.C., Heinken A., Baldini F., Thinnes C.C., Glaab E., et al. Integrated Analyses of Microbiome and Longitudinal Metabolome Data Reveal Microbial-Host Interactions on Sulfur Metabolism in Parkinson’s Disease. Cell Rep. 2019;29:1767–1777. doi: 10.1016/j.celrep.2019.10.035. e8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Pillar V.D., Blanco C.C., Müller S.C., Sosinski E.E., Joner F., Duarte L.D.S. Functional redundancy and stability in plant communities. J Veg Sci. 2013;24:963–974. doi: 10.1111/jvs.12047. [DOI] [Google Scholar]
  • 47.Galland T., Pérez Carmona C., Götzenberger L., Valencia E., De Bello F. Are redundancy indices redundant? An evaluation based on parameterized simulations. Ecol Indic. 2020;116 doi: 10.1016/j.ecolind.2020.106488. [DOI] [Google Scholar]
  • 48.Borda M. Springer Science & Business Media; 2011. Fundamentals in information theory and coding.https://books.google.nl/books?id=Lyte2yl1SPAC [Google Scholar]
  • 49.Truong D.T., Franzosa E.A., Tickle T.L., Scholz M., Weingart G., Pasolli E., et al. MetaPhlAn2 for enhanced metagenomic taxonomic profiling. Nat Methods. 2015;12:902–903. doi: 10.1038/nmeth.3589. [DOI] [PubMed] [Google Scholar]
  • 50.Baldini F., Heinken A., Heirendt L., Magnusdottir S., Fleming R.M.T., Thiele I. The Microbiome Modeling Toolbox: from microbial interactions to personalized microbial communities. Bioinformatics. 2019;35:2332–2334. doi: 10.1093/bioinformatics/bty941. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.HG D. Philentropy: information theory and distance quantification with R. J Open Source Softw. 2018;3:765. doi: 10.21105/joss.00765. [DOI] [Google Scholar]
  • 52.Oksanen J., Simpson G.L., Blanchet F.G., Kindt R., Legendre P., Minchin P.R., et al. vegan: Community Ecology Package. 2022.
  • 53.Lenth R.V. emmeans: Estimated Marginal Means, aka Least-Squares Means 2017:1.10.7. 10.32614/CRAN.package.emmeans. [DOI]
  • 54.Vanderlei Julio Debastiani. SYNCSA: Analysis of Functional and Phylogenetic Patterns in Metacommunities 2011:1.3.4. 10.32614/CRAN.package.SYNCSA. [DOI]
  • 55.Martin A.D., Quinn K.M., Park J.H. MCMCpack: Markov Chain Monte Carlo (MCMC) Package. 2003;1 doi: 10.32614/CRAN.package.MCMCpack. 7-1. [DOI] [Google Scholar]
  • 56.Eisenhauer N., Hines J., Maestre F.T., Rillig M.C. Reconsidering functional redundancy in biodiversity research. Npj Biodivers. 2023;2:9. doi: 10.1038/s44185-023-00015-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Fischer F.M., De Bello F. On the uniqueness of functional redundancy. Npj Biodivers. 2023;2:23. doi: 10.1038/s44185-023-00029-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Stummer N., Feichtinger R.G., Weghuber D., Kofler B., Schneider A.M. Role of hydrogen sulfide in inflammatory bowel disease. Antioxidants. 2023;12:1570. doi: 10.3390/antiox12081570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Ijssennagger N., Van Der Meer R., Van Mil S.W.C. Sulfide as a mucus barrier-breaker in inflammatory bowel disease? Trends Mol Med. 2016;22:190–199. doi: 10.1016/j.molmed.2016.01.002. [DOI] [PubMed] [Google Scholar]
  • 60.Kushkevych I., Dordević D., Kollar P., Vítězová M., Drago L. Hydrogen sulfide as a toxic product in the small–large intestine axis and its role in IBD development. JCM. 2019;8:1054. doi: 10.3390/jcm8071054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Guo F.-F., Yu T.-C., Hong J., Fang J.-Y. Emerging roles of hydrogen sulfide in inflammatory and neoplastic colonic diseases. Front Physiol. 2016;7 doi: 10.3389/fphys.2016.00156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Chamberlain S.A., Bronstein J.L., Rudgers J.A. How context dependent are species interactions? Ecol Lett. 2014;17:881–890. doi: 10.1111/ele.12279. [DOI] [PubMed] [Google Scholar]
  • 63.Berlow E.L., Navarrete S.A., Briggs C.J., Power M.E., Menge B.A. Quantifying variation in the strengths of species interactions. Ecology. 1999;80:2206–2224. doi: 10.1890/0012-9658(1999)080[2206:QVITSO]2.0.CO;2. [DOI] [Google Scholar]
  • 64.Delmas E., Besson M., Brice M., Burkle L.A., Dalla Riva G.V., Fortin M., et al. Analysing ecological networks of species interactions. Biol Rev. 2019;94:16–36. doi: 10.1111/brv.12433. [DOI] [PubMed] [Google Scholar]
  • 65.Wootton J.T., Emmerson M. Measurement of interaction strength in nature. Annu Rev Ecol Evol Syst. 2005;36:419–444. doi: 10.1146/annurev.ecolsys.36.091704.175535. [DOI] [Google Scholar]
  • 66.Ding L., Chang M., Guo Y., Zhang L., Xue C., Yanagita T., et al. Trimethylamine-N-oxide (TMAO)-induced atherosclerosis is associated with bile acid metabolism. Lipids Health Dis. 2018;17:286. doi: 10.1186/s12944-018-0939-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Zhu Y., Li Q., Jiang H. Gut microbiota in atherosclerosis: focus on trimethylamine N-oxide. APMIS. 2020;128:353–366. doi: 10.1111/apm.13038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Liu Y., Dai M. Trimethylamine N-Oxide Generated by the Gut Microbiota Is Associated with Vascular Inflammation: New Insights into Atherosclerosis. Mediat Inflamm. 2020;2020:1–15. doi: 10.1155/2020/4634172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Zixin Y., Lulu C., Xiangchang Z., Qing F., Binjie Z., Chunyang L., et al. TMAO as a potential biomarker and therapeutic target for chronic kidney disease: A review. Front Pharm. 2022;13 doi: 10.3389/fphar.2022.929262. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Tang W.H.W., Wang Z., Kennedy D.J., Wu Y., Buffa J.A., Agatisa-Boyle B., et al. Gut Microbiota-Dependent Trimethylamine N -Oxide (TMAO) Pathway Contributes to Both Development of Renal Insufficiency and Mortality Risk in Chronic Kidney Disease. Circ Res. 2015;116:448–455. doi: 10.1161/CIRCRESAHA.116.305360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Sazal M., Stebliankin V., Mathee K., Yoo C., Narasimhan G. Causal effects in microbiomes using interventional calculus. Sci Rep. 2021;11:5724. doi: 10.1038/s41598-021-84905-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Corander J., Hanage W.P., Pensar J. Causal discovery for the microbiome. Lancet Microbe. 2022;3:e881–e887. doi: 10.1016/S2666-5247(22)00186-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Kline R.B. Guilford Publications; 2023. Principles and practice of structural equation modeling.http://scholar.google.com/scholar_lookup?&title=Principles%20and%20practice%20of%20structural%20equation%20modeling&publication_year=2016&author=Kline%2CR [Google Scholar]
  • 74.Hair Jr.J.F., Hult G.T.M., Ringle C.M., Sarstedt M., Danks N.P., Ray S., et al. An introduction to structural equation modeling. Partial Least Sq Struct Equ Model (PLS-SEM) Using R: A Workb. 2021:1–29. doi: 10.1007/978-3-030-80519-7_1. [DOI] [Google Scholar]
  • 75.Weilenmann M., Colbeck R. Analysing causal structures with entropy. Proc R Soc A. 2017;473:20170483. doi: 10.1098/rspa.2017.0483. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Feigenbaum I., Arpit D., Heinecke S., Niebles J.C., Yao W., Xiong C., et al. In: Locatello F., Didelez V., editors. Vol. 236. PMLR; 2024. Causal layering via conditional entropy; pp. 1176–1191.https://proceedings.mlr.press/v236/feigenbaum24a.html (Proceedings of the Third Conference on Causal Learning and Reasoning). [Google Scholar]
  • 77.Kocaoglu M., Shakkottai S., Dimakis A.G., Caramanis C., Vishwanath S. Applications of common entropy for causal inference. Adv Neural Inf Process Syst. 2020;33:17514–17525. https://proceedings.neurips.cc/paper_files/paper/2020/file/cae7115f44837c806c9b23ed00a1a28a-Paper.pdf [Google Scholar]
  • 78.The Human Microbiome Project Consortium Structure, function and diversity of the healthy human microbiome. Nature. 2012;486:207–214. doi: 10.1038/nature11234. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Ogunrinola G.A., Oyewale J.O., Oshamika O.O., Olasehinde G.I. The human microbiome and its impacts on health. Int J Microbiol. 2020;2020:1–7. doi: 10.1155/2020/8045646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Van Hul M., Cani P.D., Petitfils C., De Vos W.M., Tilg H., El-Omar E.M. What defines a healthy gut microbiome? Gut. 2024;73:1893–1908. doi: 10.1136/gutjnl-2024-333378. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Ross F.C., Patangia D., Grimaud G., Lavelle A., Dempsey E.M., Ross R.P., et al. The interplay between diet and the gut microbiome: implications for health and disease. Nat Rev Microbiol. 2024;22:671–686. doi: 10.1038/s41579-024-01068-4. [DOI] [PubMed] [Google Scholar]
  • 82.Hamer H.M., Jonkers D., Venema K., Vanhoutvin S., Troost F.J., Brummer R.-J. Review article: the role of butyrate on colonic function. Aliment Pharm Ther. 2008;27:104–119. doi: 10.1111/j.1365-2036.2007.03562.x. [DOI] [PubMed] [Google Scholar]
  • 83.Pryde S.E., Duncan S.H., Hold G.L., Stewart C.S., Flint H.J. The microbiology of butyrate formation in the human colon. FEMS Microbiol Lett. 2002;217:133–139. doi: 10.1111/j.1574-6968.2002.tb11467.x. [DOI] [PubMed] [Google Scholar]
  • 84.Verma M.S., Fink M.J., Salmon G.L., Fornelos N., Ohara T.E., Ryu S.H., et al. A common mechanism links activities of butyrate in the colon. ACS Chem Biol. 2018;13:1291–1298. doi: 10.1021/acschembio.8b00073. [DOI] [PubMed] [Google Scholar]
  • 85.Bridgeman S.C., Northrop W., Melton P.E., Ellison G.C., Newsholme P., Mamotte C.D.S. Butyrate generated by gut microbiota and its therapeutic role in metabolic syndrome. Pharmacol Res. 2020;160 doi: 10.1016/j.phrs.2020.105174. [DOI] [PubMed] [Google Scholar]
  • 86.Sepehri S., Kotlowski R., Bernstein C.N., Krause D.O. Microbial diversity of inflamed and noninflamed gut biopsy tissues in inflammatory bowel disease. Inflamm Bowel Dis. 2007;13:675–683. doi: 10.1002/ibd.20101. [DOI] [PubMed] [Google Scholar]
  • 87.Sha S., Xu B., Wang X., Zhang Y., Wang H., Kong X., et al. The biodiversity and composition of the dominant fecal microbiota in patients with inflammatory bowel disease. Diagn Microbiol Infect Dis. 2013;75:245–251. doi: 10.1016/j.diagmicrobio.2012.11.022. [DOI] [PubMed] [Google Scholar]
  • 88.Gong D., Gong X., Wang L., Yu X., Dong Q. Involvement of Reduced Microbial Diversity in Inflammatory Bowel Disease. Gastroenterol Res Pract. 2016;2016:1–7. doi: 10.1155/2016/6951091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Alam M.T., Amos G.C.A., Murphy A.R.J., Murch S., Wellington E.M.H., Arasaradnam R.P. Microbial imbalance in inflammatory bowel disease patients at different taxonomic levels. Gut Pathog. 2020;12(1) doi: 10.1186/s13099-019-0341-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Mok Y.-Y.P., Moore P.K. Hydrogen sulphide is pro-inflammatory in haemorrhagic shock. Inflamm Res. 2008;57:512–518. doi: 10.1007/s00011-008-7231-6. [DOI] [PubMed] [Google Scholar]
  • 91.Kloesch B., Liszt M., Krehan D., Broell J., Kiener H., Steiner G. High concentrations of hydrogen sulphide elevate the expression of a series of pro-inflammatory genes in fibroblast-like synoviocytes derived from rheumatoid and osteoarthritis patients. Immunol Lett. 2012;141:197–203. doi: 10.1016/j.imlet.2011.10.004. [DOI] [PubMed] [Google Scholar]
  • 92.Whiteman M., Winyard P.G. Hydrogen sulfide and inflammation: the good, the bad, the ugly and the promising. Expert Rev Clin Pharmacol. 2011;4:13–32. doi: 10.1586/ecp.10.134. [DOI] [PubMed] [Google Scholar]
  • 93.Dilek N., Papapetropoulos A., Toliver-Kinsky T., Szabo C. Hydrogen sulfide: an endogenous regulator of the immune system. Pharmacol Res. 2020;161 doi: 10.1016/j.phrs.2020.105119. [DOI] [PubMed] [Google Scholar]
  • 94.Rose P., Zhu Y.-Z., Moore P.K. Hydrogen Sulfide and the Immune System. In: Zhu Y.-C., editor. Advances in Hydrogen Sulfide Biology, vol. 1315, Singapore: Springer Singapore; 2021, p. 99–128. https://doi.org/10.1007/978-981-16-0991-6_5.. [DOI] [PubMed]
  • 95.Douglas G.M., Maffei V.J., Zaneveld J.R., Yurgel S.N., Brown J.R., Taylor C.M., et al. PICRUSt2 for prediction of metagenome functions. Nat Biotechnol. 2020;38:685–688. doi: 10.1038/s41587-020-0548-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Wang G., Fan Y., Zhang G., Cai S., Ma Y., Yang L., et al. Microbiota-derived indoles alleviate intestinal inflammation and modulate microbiome by microbial cross-feeding. Microbiome. 2024;12:59. doi: 10.1186/s40168-024-01750-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Huang B., Chau S.W.H., Liu Y., Chan J.W.Y., Wang J., Ma S.L., et al. Gut microbiome dysbiosis across early Parkinson’s disease, REM sleep behavior disorder and their first-degree relatives. Nat Commun. 2023;14:2501. doi: 10.1038/s41467-023-38248-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Vestby L.K., Grønseth T., Simm R., Nesse L.L. Bacterial biofilm and its role in the pathogenesis of disease. Antibiotics. 2020;9:59. doi: 10.3390/antibiotics9020059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Flemming H.-C., Van Hullebusch E.D., Neu T.R., Nielsen P.H., Seviour T., Stoodley P., et al. The biofilm matrix: multitasking in a shared space. Nat Rev Microbiol. 2023;21:70–86. doi: 10.1038/s41579-022-00791-0. [DOI] [PubMed] [Google Scholar]
  • 100.Maki K.A., Kazmi N., Barb J.J., Ames N. The oral and gut bacterial microbiomes: similarities, differences, and connections. Biol Res Nurs. 2021;23:7–20. doi: 10.1177/1099800420941606. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.docx (3.3MB, docx)

Supplementary material

mmc2.xlsx (26.4MB, xlsx)

Data Availability Statement

The raw data of the utilised colorectal cancer study can be found in the stem publication and is available in [1]. The raw IBD study can be found in [2]. Utilised AGORA reconstructions can be found on https://github.com/VirtualMetabolicHuman/AGORA/. Scripts to generate microbiome community models for the CRC study and individual maximum secretion fluxes for the CRC study can be found on https://github.com/SysPsyHertel/CodeBase/tree/main/Scripts_Faessler_Functional_Redundancy. The scripts for running microbiome community models and maximum secretion fluxes for the IBD study can be found on https://github.com/ThieleLab/CodeBase/tree/master/Analysis_Heinken_npj_Syst_Biol_Appl_2021 and is explained in [3]. The R package for computing measures of functional redundancy can be found on https://github.com/SysPsyHertel/FunRed

The generated individual maximum secretion fluxes for the CRC study can be found in the Table S1. The computed functional redundancy and functional interdependency measures for the IBD and CRC study are included in this published article (Table S2 (CRC study) and Table S3 (IBD study)). All scripts and code utilised to generate the results in this work based on the generated individual maximal secretion fluxes, including simulations can be found on https://github.com/SysPsyHertel/CodeBase/tree/main/Scripts_Faessler_Functional_Redundancy. The Graphical Abstract (https://BioRender.com/tboy2bh), Figure 1 (https://BioRender.com/d2wx0pg), and Figure 4B (https://BioRender.com/eecp36i) were created in BioRender.


Articles from Computational and Structural Biotechnology Journal are provided here courtesy of Research Network of Computational and Structural Biotechnology

RESOURCES