Skip to main content

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

Research Square logoLink to Research Square
[Preprint]. 2023 Jan 13:rs.3.rs-2463503. [Version 1] doi: 10.21203/rs.3.rs-2463503/v1

A microbial causal mediation analytic tool for health disparity and applications in body mass index

Chan Wang 1, Jiyoung Ahn 2, Thaddeus Tarpey 1, Stella S Yi 3, Richard B Hayes 2, Huilin Li 1,*
PMCID: PMC9882678  PMID: 36712075

Abstract

Background:

Emerging evidence suggests the potential mediating role of microbiome in health disparities. However, no analytic framework is available to analyze microbiome as a mediator between health disparity and clinical outcome, due to the unique structure of microbiome data, including high dimensionality, sparsity, and compositionality.

Methods:

Considering the modifiable and quantitative features of microbiome, we propose a microbial causal mediation model framework, SparseMCMM_HD, to uncover the mediating role of microbiome in health disparities, by depicting a plausible path from a non-manipulable exposure (e.g. race or region) to a continuous outcome through microbiome. The proposed SparseMCMM_HD rigorously defines and quantifies the manipulable disparity measure that would be eliminated by equalizing microbiome profiles between comparison and reference groups. Moreover, two tests checking the impact of microbiome on health disparity are proposed.

Results:

Through three body mass index (BMI) studies selected from the curatedMetagenomicData 3.4.2 package and the American gut project: China vs. USA, China vs. UK, and Asian or Pacific Islander (API) vs. Caucasian, we exhibit the utility of the proposed SparseMCMM_HD framework for investigating microbiome’s contributions in health disparities. Specifically, BMI exhibits disparities and microbial community diversities are significantly distinctive between the reference and comparison groups in all three applications. By employing SparseMCMM_HD, we illustrate that microbiome plays a crucial role in explaining the disparities in BMI between races or regions. 11.99%, 12.90%, and 7.4% of the overall disparity in BMI in China-USA, China-UK, and API-Caucasian comparisons, respectively, would be eliminated if the between-group microbiome profiles were equalized; and 15, 21, and 12 species are identified to play the mediating role respectively.

Conclusions:

The proposed SparseMCMM_HD is an effective and validated tool to elucidate the mediating role of microbiome in health disparity. Three BMI applications shed light on the utility of microbiome in reducing BMI disparity by manipulating microbial profiles.

Keywords: Casual mediation model, Health disparity, Manipulable disparity measure, Microbiome mediator, Non-manipulable exposure

Background

Health disparities refer to the inequalities in the quality of health, health care, and health outcomes experienced by groups that are usually classified by race, ethnicity, and region. Many factors, including genetics, social-economic status, culture, dietary habits, and geographical conditions, contribute to health disparities between groups. Researchers have long been interested in identifying the modifiable environmental determinants of health disparity to pave the way to improve health equity. However, environmental exposures are often numerous, ubiquitous, descriptive, or hard to measure, which makes this task difficult.

Gut microbiome is the aggregate of all genomes harbored by gut microbiota, which is the collection of all microbes that reside in human gut. Benefiting from the advent of high throughput sequencing technologies, a great number of microbiome studies have been conducted to quantitatively characterize the microbiome profiling and understand its role in human health [14]. Gut microbiome has been closely linked with host metabolic, immune, and neuroendocrine functions [512]. On the other hand, many environmental and social factors, such as diet, drugs, lifestyle, psychological state and behavior, aid in shaping gut microbial profiles [1316]. Recently, the mediating role of microbiome between these environmental exposures and various human diseases, including obesity, type 2 diabetes, inflammatory bowel disease, depression, and different cancers, has been investigated and recognized [1722]. Given the modifiable and quantitative features of microbiome, we here aim to disentangle health disparities by exploring the extent of the observed disparity in the outcome of interest that could be reduced if the gut microbial profile was modified. In Figure 1, we propose a mediation framework to answer such questions. Here, the disparity group, e.g., race or region, is the exposure denoted by R; the gut microbial profile is the mediator denoted by M; and the continuous study outcome, e.g., body mass index (BMI), is denoted by Y.

Figure 1.

Figure 1.

Microbiome (M) may play a mediating role in the health disparity of the continuous outcome (Y) between two categories of a non-manipulable exposure group (e.g. race or region) (R). We aim to investigate how much disparity of the outcome Y can be reduced by manipulating microbiome profiles.

There are several existing mediation analysis frameworks tailored for non-manipulable exposures, such as race, region, sex or socioeconomic position [27], however, due to the unique structure of microbiome data, including high dimensionality, sparsity and compositionality, these approaches are not immediately applicable for analyzing microbiome as a mediator for health disparity. Recently, we developed a rigorous Sparse Microbial Causal Mediation Model (SparseMCMM) [12] for interrogating the mediating role of microbiome in a typical three-factor (randomized treatments, microbiome as mediator, and outcome) clinical trial causal study design. SparseMCMM quantifies the overall mediation effect of microbiome community and the component-wise mediation effect for each individual microbe under the counterfactual framework, identifies the signature causal microbes with regularization strategies, and tests the mediation effects while fully acknowledging the unique structure of microbiome data. In this paper, by extending SparseMCMM to a non-manipulable exposure setting, we propose a microbial causal mediation framework for health disparity study and denote it as SparseMCMM_HD (SparseMCMM for Health Disparity). As VanderWeele and Robinson [23] discussed, causal interpretation of a non-manipulable exposure, i.e., ethnicity or region, is not definable in the traditional counterfactual framework, because a hypothetical intervention on a non-manipulable exposure is not possible. Instead, one can interpret the causality of health inequality by the hypothesized intervention effect on the manipulable mediating variable. Thus, in SparseMCMM_HD, we aim to quantify the overall health inequality on the outcome (called overall disparity), the health inequality effect that would be eliminated by equalizing microbiome profiles across racial or regional groups (called manipulable disparity), and the healthy inequality effect that would remain even after microbiome profiles across racial or regional groups were equalized (called residual disparity). In addition, we equip two hypothesis tests to examine the mediating role of microbiome in health disparity and statistically identify which specific microbes contribute to it.

Obesity (defined via BMI) is a global epidemic and a persistent public health problem [24]. It is well documented that the prevalence of adult obesity is distributed unevenly across racial groups and regions. Partial effect of manipulable exposures such as diet, medication, and antibiotics use [1719] on obesity has been shown to be mediated through microbiome. In addition, accumulating evidence indicates that gut microbial profile varies across ethnicities as well as geographically [2527]. Together, these studies suggest that microbiome may play a mediating role in the ethnic or regional disparity of obesity. It is crucial to investigate rigorously how much health inequalities in BMI can be reduced by manipulating microbiome profiles. Utilizing SparseMCMM_HD, we investigate the role of microbiome in the regional and racial disparity of BMI in curated microbiome data from the curatedMetagenomicData 3.4.2 package [28] and the American Gut Project (AGP) (www.americangut.org) respectively. Through these real data analyses, we illustrate a clear and plausible causal path analysis to understand the current racial or regional disparity in BMI and identify a comprehensive set of mediating microbial taxa. The proposed analytic pipeline is available through an interactive web app at https://chanw0.shinyapps.io/sparsemcmm_hd/. We believe that this novel pipeline will be useful for investigating the manipulable disparity through gut microbiome and understanding the causes of health disparity.

Methods

SparseMCMM_HD framework

Casual mediation model.

Suppose there are I subjects from two categories of a non-manipulable exposure group (e.g. race or region), J taxa, and K covariates. Subscripts i, j, k, indicate a subject, a taxon, and a covariate respectively. For the ith subject, let Ri = 1 or 0 indicate the reference or comparison group, let Mi=(Mi1,,MiJ)T be the microbiome relative abundance vector with the constraint j=1JMij=1, and let Xi=(Xi1,,XiK)T represent the covariates, and let Yi be a continuous outcome of interest.

To statistically describe the causal relationships shown in Figure 1, following our previous work [12], we use the linear log-contrast model to regress the continuous outcome on the non-manipulable exposure, microbiome compositions, interactions between the non-manipulable exposure and microbiome compositions, while adjusting the confounding covariates:

Yi=α0+αXTXi+αRRi+αMT[log(Mi)]+αCT[log(Mi)]Ri+ϵi,subject toαMT1=0,andαCT1=0, (1)

where α0 is the intercept, αR is the coefficient of the non-manipulable exposure, αX=(αX1,,αXK)T,αM=(αM1,,αMJ)T, and αC=(αC1,,αCJ)T are the vectors of coefficients of covariates, microbiome compositions, interactions between the non-manipulable exposure and microbiome compositions, respectively. Due to the compositionality of microbiome data as j=1JMij=1,αM and αC are subject to αMT1=0 and αCT1=0.ϵiN(0,σ2) is the error term. On the other hand, the Dirichlet regression [29] is used to model the microbial relative abundance as a function of the non-manipulable exposure and covariates:

E[Mij]=γj(Ri,Xi)m=1Jγm(Ri,Xi),log{γj(Ri,Xi)}=β0j+βRjRi+βXjTXi. (2)

Specifically, we assume that Mi(Ri,Xi)Dirichlet(γ1(Ri,Xi),,γJ(Ri,Xi)), and their microbial relative means are linked with the non-manipulable exposure and covariates (R, Xi) in the generalized linear model fashion with a log link. β0j is the intercept and βRj and βXj are the coefficients of the non-manipulable exposure and covariates for the jth taxon, respectively.

Definition of disparity measures in the counterfactual framework.

As discussed in the Background, we propose to conceptualize an overall disparity measure (ODM) on the outcome that can be decomposed into manipulable disparity measure (MDM) and residual disparity measure (RDM). MDM represents the portion of disparity that would be eliminated by equalizing microbiome profiles between comparison and reference groups, and RDM represents the portion that would remain even after microbiome profiles between comparison and reference groups were equalized. With the counterfactual notation, mathematically we have:

ODM=MDM+RDM,MDM=E[E[YMx(1)R=1,x]]E[E[YMx(0)R=1,x]],andRDM=E[E[YMx(0)R=1,x]E[YMx(0)R=0,x]].

Here, Mx(0) (Mx(1)) is a random value from the microbiome distribution of the reference (comparison) population with given covariates x. Ym denotes an individual’s potential counterfactual outcome if his or her microbial mediators were set to m, where m can be Mx(0)orMx(1).E[YMx(0)R=0,x](E[YMx(1)R=1,x]) denotes the expected outcome for a reference (comparison) individual with given covariates x,E[YMx(0)R=1,x] denotes the expected outcome for a comparison individual with given covariates x if their microbial mediators were set to a random value from that of the reference population with the same covariates x.

MDM, RDM, and ODM expressions.

Two assumptions must be satisfied for the identification of MDM, RDM, and ODM [23, 30]. The effect of non-manipulable exposure R on outcome Y are unconfounded conditional on all covariates X, i.e., YR X and the effects of mediator M on outcome Y are unconfounded conditional on the non-manipulable exposure R and all covariates X, i.e., YM | R, X. With these sufficient identifiability assumptions and the models (1)(2) proposed in the SparseMCMM_HD framework, disparity measures MDM, RDM, and ODM can be further expressed, respectively, as follows (see Section S1 for the detailed derivations):

MDM=j=1J(αMj+αCj){E[log(Mj)R=1,x]E[log(Mj)R=0,x]}
RDM=αR+αCTE[log(M)R=0,x]=αR+j=1JαCjE[log(Mj)R=0,x],

and

ODM=MDM+RDM=αR+j=1J(αMj+αCj)E[log(Mj)R=1,x]j=1JαMjE[log(Mj)R=0,x],

where E[log(Mj)R=r,x]=ψ[γj(R=r,x)]ψ[m=1Jγm(R=r,x)],γj(R=r,x)=exp(β0j+βRjr+βXjTx),r=0or1,andψ()=ddxln(Γ(x)) is the digamma function, with given covariates x.

Note that these mathematical expressions of RDM and MDM are the same as the formulas of causal direct effect of treatment and mediation effect through microbiome correspondingly on the outcome in the typical three-factor causal design based on the traditional causal mediation inference, developed in our SparseMCMM [12]. Analogous to ME in SparseMCMM, MDM is the summation of individual mediation effects from each taxon MDMj: MDM:=j=1JMDMj and MDMj = (αMj + αCj){E[log(Mj)|R = 1, x] − E[log(Mj)|R = 0, x]}. MDMj thus is non-zero only when both the jth microbial effect on the outcome and the exposure effect on the jth taxon are not zero. Therefore, SparseMCMM_HD illuminates the mediating role of microbiome in the health disparity of outcome, and quantifies the manipulable disparity for overall microbiome community and for each specific taxon, respectively.

Parameter estimation.

Note that in [12], we have demonstrated the excellent performance of SparseMCMM in terms of estimation by extensive simulations and real data analysis in various scenarios. Thus for SparseMCMM_HD, we directly employ the same two-step procedure to estimate the regression parameters in models (1)(2) to obtain the estimated RDM, MDM, MDMj for each taxon, and ODM. Furthermore, SparseMCMM_HD has the full capability to perform variable selection to select the signature causal microbes that play mediating roles in the disparity of the continuous outcome with regularization strategies. Specifically, L1 norm and group-lasso penalties are incorporated for variable selection meanwhile addressing the heredity condition.

Hypothesis tests for manipulable disparity.

Similarly, we employ the hypothesis tests for mediation effects in SparseMCMM to examine whether microbiome has any mediation effect on the disparity in an outcome, at both community and taxon levels. Specifically, regarding the null hypothesis of no manipulable disparity H0: MDM = 0, the first test statistic is defined as OMD=MDM^, the estimator of the manipulable disparity. Meanwhile, we consider another null hypothesis, H0: MDMj = 0, ∀ j ∈ {1,⋯,J} and define the second test statistic as CMD=j=1JMDM^J2, the summation of the squared estimators of individual mediation effects across all taxa. Permutation procedure is employed to assess the significance of these two test statistics. This provides a mechanism to check whether microbiome has any impact on health disparity that could be potentially eliminated through microbiome.

Implementation.

The simulation evaluation results regarding the estimation and testing of SparseMCMM [12] are applicable to SparseMCMM_HD framework. Therefore, the proposed SparseMCMM_HD is a validated analytic tool to illuminate the mediating role of microbiome in the disparity of outcome, and quantifies the manipulable disparity for overall microbiome community and for each specific taxon, respectively. In practice, we perform both parameter estimation and hypothesis testing using the analytical procedures in the SparseMCMM package and illustrate the proposed SparseMCMM_HD pipeline through an interactive web app (https://chanw0.shinyapps.io/sparsemcmm_hd/).

Control for confounding covariates

Due to the non-manipulable nature of the exposure in health disparity research, in principle, it is impossible to design a randomized trial on the exposure of interest to eliminate the potential confounding effect on the interested causal pathway. Many studies on health disparity are observational and usually include significant degrees of confounding, due to factors such as lifestyle, health status, and disease history. We want to emphasize that it is a necessary step to control for confounding covariates while utilizing the proposed SparseMCMM_HD to estimate RDM, MDM, and ODM in a typical observational study. Specifically, we propose to perform propensity score matching (PSM) [31], which is a commonly used method in biomedical research to create a balanced covariate distribution between two groups, to control confounding covariates in our applications (see Section S2). Standardized mean difference (SMD) is used to evaluate the balance of the covariate distributions between groups. A SMD that is less than 0.1 indicates a balanced distribution [32]. The matched data will then be used to quantify RDM, MDM, and ODM, and examine whether the microbiome could reduce the health disparity between two non-manipulable exposure groups. The control for confounding covariates procedure has been included as a preprocessing step in the proposed SparseMCMM_HD analytic pipeline.

curatedMetagenomicDataV3.4.2

The curatedMetagenomicData 3.4.2 package [28] provides a curated human microbiome meta dataset aggregated from 86 shotgun sequencing cohorts in 6 body sites. The raw sequencing data were processed using the same bioinformatics protocol and pipelines. Each sample has 6 types of data available including gene family, marker abundance, marker presence, pathway abundance, pathway coverage, and taxonomic (relative) abundance. The taxonomic abundance was calculated with MetaPhlAn3, and metabolic functional potential was calculated with HUMAnN3. The manually curated clinical and phenotypic metadata are available as well. More details can be found in the curatedMetagenomicData package document [28]. Here we focus on healthy subjects to explore the relationship among region, microbiome, and BMI. Specifically, we chose subjects from all cohorts based on the following inclusion criteria: 1) healthy status; 2) no missing values in BMI, gender, and age; 3) age ≥ 18; 4) no pregnant; 5) currently no antibiotic use; 6) currently no alcohol consumption; 7) no smoking; and 8) fecal sample with more than 1,250 sample reads. In addition, when multiple samples available for a subject, we randomly selected one sample. Overall, we identified 4,868 healthy adults from different regions. Here we further focus on three regional groups which have large sample sizes: China (n=570), United States (USA; n=350), and United Kingdom (UK; n=1019) for the analysis in the main text. Specifically, we conducted two comparison studies: China-USA and China-UK comparisons to investigate the regional difference of BMI in the China group compared to the USA and UK groups, respectively.

American Gut Project

The AGP project is a crowd-sourcing citizen science cohort to describe the comprehensive characterization of human gut microbiota and to identify factors being linked to human microbiota. The AGP includes 16S rRNA V4 gene sequences from more than 8,000 fecal samples using standard pipelines, and host metadata. Detailed descriptions can be found in Liu et al. and Hu et al. [1, 33]. Our primary investigation is on the disparity of BMI between Asian or Pacific Islander (API) and non-Hispanic Caucasian adults. We selected a subset of the AGP data based on the following inclusion criteria: 1) USA resident; 2) Asian or Pacific Islander or Caucasian race; 3) no missing values in gender, age, and BMI; 4) age ≥ 18; 5) 80 ≥ BMI; 6) 210cm ≥ height ≥ 80cm; 7) 200kg ≥ weight ≥ 35kg; 8) fecal sample with more than 1,250 sample reads; 9) not duplicate sample; and 10) no self-reported history of inflammatory bowel disease, diabetes, or antibiotic use in the past year. The subjects are filtered out when the reported BMIs are not consistent with the calculated BMI based on the reported heights and weights, i.e. (|BMIreported − BMIcalculated|/BMIcalculated > 5%). A dataset with 130 API and 2,263 Caucasian adults then is used in this paper (Figure S1a).

Statistical Analysis

Data pre-processing and PSM were conducted in three BMI studies. Specifically, for the China-USA and China-UK comparisons, we performed PSM with the parameters described in Section S2 to control for age and gender. For the API-Caucasian comparison, as the AGP includes more than 400 covariates that were collected through self-reported surveys, we first implemented several pre-processing steps to prepare the self-reported covariates for the subsequent analysis, including cleaning up the inconsistent definition of variables, and collapsing the sparse categorical variables into fewer and less sparse categories. Details are provided in Section S3. Forty-four covariates were retained for PSM. We performed univariate linear regressions to identify the potential confounding variables for the relationship among race, microbiome, and BMI. Twenty-three covariates (p-value ≤ 0.05; Figure S1b) were identified as confounders that need to be controlled further based on PSM.

With the matched data, alpha (Observed, Shannon, and Simpson indices) and beta diversities (Bray–Curtis dissimilarity and Jensen–Shannon divergence) were used to estimate microbial community-level diversity. T tests were used for group comparisons of BMI and alpha diversity. Permutational multivariate analysis of variance (PERMANOVA) [34] was used to assess group difference of beta diversity. We performed the proposed SparseMCMM_HD framework at the species rank (Section S4) to quantify RDM, MDM, and ODM, and examine whether the microbiome could explain the health disparity between two non-manipulable exposure groups. The proposed SparseMCMM_HD pipeline was implemented through an interactive web app (https://chanw0.shinyapps.io/sparsemcmm_hd/) for easy exploration.

Results

Results for curatedMetagenomicDataV3.4.2

Matched datasets.

With the healthy adults included in the China-USA and China-UK comparisons, we identified 328 matched Chinese-USA subject pairs, and 559 matched Chinese-UK subject pairs, separately. Figures S2 and S3 show that both matched datasets have comparable propensity scores. The SMDs decrease dramatically on the matched subjects (SMD=0.036 and 0.033), from using all subjects (SMD=0.302 and 0.470) in both China-USA and China-UK datasets. This indicates that PSM has effectively evened the distribution of confounders between two exposure groups in our studies and practically eliminated or controlled the influence of the confounders. In the well-matched datasets, the China group still has significantly lower average BMIs compared to the matched USA (mean [standard deviation]: 22.64 [3.77] vs. 25.77 [4.56]) and the matched UK (22.98 [4.48] vs. 25.77 [4.79]) groups (Figure 2a and 2d).

Figure 2.

Figure 2.

Association analyses in two matched datasets from the curatedMetagenomicData package [28]. a Violin plots of BMI in matched Chinese vs. USA subjects. b Violin plots of alpha diversities (Observed, Shannon, and Simpson indices) in matched Chinese vs. USA samples. c PCoA plots using Bray–Curtis dissimilarity and Jensen–Shannon divergence in matched Chinese and USA samples. d Violin plots of BMI in matched Chinese vs. UK subjects. e Violin plots of alpha diversities (Observed, Shannon, and Simpson indices) in matched Chinese and UK samples. f PCoA plots using Bray–Curtis dissimilarity and Jensen–Shannon divergence in matched Chinese vs. UK samples.

Community level results.

The Chinese group has distinctive microbial community diversities, compared to the matched USA or UK group. For alpha diversity, samples from China have lower Shannon and Simpson diversities and a higher observed diversity than the matched USA or UK samples (Figure 2b and 2e). For beta diversity, Bray-Curtis dissimilarity and Jensen-Shannon divergence both indicate that the Chinese group is significantly different in community structure from the matched USA or UK groups (PERMANOVA [34] all p-values < 1.0 × 10−4. Figure 2c and 2f).

Taxon-level analysis.

After implementing the filtering criteria described in Section S4, 25 species remained in both matched datasets (China vs. USA and China vs. UK). The testing results for OMD and CMD show that the overall and component-wise MDMs through microbiome are significant in both data sets for regional differences in BMI (all p-values<0.001 based on 1,000 permutations). Figure 3a shows that the ODM of BMI are 3.17 and 2.79, respectively, for the matched Chinese and USA subjects, and the matched Chinese and UK subjects; the corresponding MDMs due to microbiome are 0.38 and 0.36. These results suggest that 11.99% and 12.90% of the disparity in BMI between the Chinese and matched USA and UK groups, respectively, would be eliminated if the between-group microbiome profiles were equalized.

Figure 3.

Figure 3.

Health disparity analyses in two matched datasets from the curatedMetagenomicData package [28]. a Manipulable disparity measure (MDM) and residual disparity measure (RDM) of BMI in the China-USA comparison and China-UK comparison, respectively. b Component-wise point and 95% CI estimates of MDMj for the identified species that have mediation effects on the differences of BMI between matched Chinese vs. USA subjects and between matched Chinese vs. UK subjects, respectively. 95% CI estimates of MDMj were calculated by bootstrapping procedure, and the number of bootstrapping is 50. c Venn diagram to show the relationship of the species playing mediation effects in the disparity of BMI among China-USA, China-UK, and API-Caucasian comparisons. API: Asian or Pacific Islander.

Significant CMD testing results show that there is at least one species playing a mediating role in the disparity of BMI between Chinese and USA subjects, and Chinese and UK subjects. Figure 3b reports 15 species and 21 species further identified by SparseMCMM_HD, with the point and 95% confidence interval (CI) estimates for their mediation effects on the regional differences of BMI between China and USA, and between China and UK, respectively. Among the twelve overlapping species identified in both matched datasets (Figure 3b and 3c), five species— Anaerostipes hadrus, Bacteroides plebeius, Bacteroides thetaiotaomicron, Bacteroides uniformis, and Escherichia coli—play consistent positive mediating roles in regional disparity in BMI for Chinese compared to USA subjects, and for Chinese compared to UK subjects. The relative evaluation of these five species in terms of their relative abundances (Figure 4a) and their associations with BMI (Figure 4b) are quite similar between two independent studies: China-USA comparison and China-UK comparison, which validates their mediating roles in the regional disparity on BMI. Confirming with the published studies, B. plebeius, B. thetaiotaomicron, and B. uniformis belong to the same genus Bacteroides, and all play important roles in human metabolism and have been linked with diet-induced obesity, by improving whole-body glucose disposal, promoting lipid digestion and absorption, and degrading host-derived carbohydrates [3538]. B. thetaiotaomicron also possesses glycine lipid biosynthesis pathway (Figure S4). A. hadrus, and E. coli also have been reported by multiple studies that they contribute to or are associated with the BMI or obesity [3941]. On the other hand, 12 species play mediating roles in BMI but with the opposite directions between China-USA comparison and China-UK comparison, that reflects the distinguishing characteristics between USA and UK (Figure S5). This is not surprising considering the microbial profile is inherently dynamic and racially or geographically specific. Moreover, there are three and nine unique species identified in the China-USA and China-UK comparisons respectively (Figures S6 and S7). Most of these study-specific species have been reported being associated with BMI, obesity or metabolic disorders [4150]. Notably, Anaerostipes hadrus, Fusicatenibacter saccharivorans, Lachnospira pectinoschiza, and Roseburia inulinivorans belong to family Lachnospiraceae (Figure 5d), which is related to metabolic syndrome and obesity and whose controversial role has been discussed across different studies [51].

Figure 4.

Figure 4.

Five species who play positive mediation roles in the disparity of BMI in both China-USA and China-UK comparisons. a Violin plots illustrating the relative abundances of these five identified species in the matched Chinese and USA samples, and the matched Chinese and UK samples, respectively. b Scatterplots of BMI and the relative abundances of these five identified species in the matched Chinese and USA subjects, and the matched Chinese and UK subjects, respectively.

Figure 5.

Figure 5.

Health disparity analyses in the matched APIs and Caucasians from the AGP dataset. a Violin plots of BMI in the matched APIs and Caucasians from the AGP dataset. b MDM and RDM of BMI in the API- Caucasian comparison. c Component-wise point and 95% CI estimates of MDMj for the identified species that have mediation effects on the differences of BMI between matched APIs and Caucasians from the AGP dataset. 95% CI estimates of MDMj were calculated by bootstrapping procedure, and the number of bootstrapping is 50. d The taxonomic relationship of the species playing mediation effects in the disparity of BMI among China-USA, China-UK, and API-Caucasian comparisons. The tree figure was generated by Metacoder [65]. From the outer to the center, taxonomic ranks are species, genus, family, order, class, phylum, and kingdom (Bacteria), respectively. For each species, color represents the number of comparisons that identify it among China-USA, China-UK, and API-Caucasian comparisons. APIs: Asian or Pacific Islanders.

Results for AGP

Matched dataset.

After performing PSM, as described in Section S2, 98 Caucasians and 98 APIs are matched. Figures S8 and S9 show that the matched Caucasians and APIs have very similar propensity scores (SMD=0.005 for the matched subjects vs. SMD=1.033 for the raw subjects), indicating that the confounding effects are well controlled. With this well-matched dataset, Figure 5a shows that the Caucasian group has a significantly higher BMI (23.96 [3.92]), compared to the API group (22.38 [3.59]), as observed in the other studies [52, 53].

Community level results.

Caucasians and APIs have distinct microbial profiles in terms of community diversity. For alpha diversity, Caucasians have higher microbial richness and evenness as measured by Observed, Shannon, and Simpson diversities (p-value = 3.1 × 10−5, 1.5 × 10−4, and 3.9 × 10−3, respectively. Figure S10a). For Beta diversity, Bray-Curtis dissimilarity and Jensen-Shannon divergence both show that Caucasian samples have different community structures compared to API samples (PERMANOVA p-value=0.0036 and 0.0012, respectively. Figure S10b).

Taxon-level analysis.

The above community level results indicate that the microbiome may play a mediating role in the racial diversity of BMI. To investigate this assumption, we perform the proposed SparseMCMM_HD on this matched dataset. With the filtering criteria described in Section S4, 28 species are included in the following taxon-level analysis.

We found that the ODM of BMI between Caucasians and APIs is 1.63 (Figure 5b). Microbiome plays a significant role in mediating the racial disparity of BMI indicated by the test results of both OMD (p-value=0.038) and CMD (p-value=0.048). The microbial manipulable disparity measure MDM is 0.12. This suggests that the difference of microbiome profiles contributes to 7.4% of ODM, which would be eliminated if the microbiome profiles between the Caucasians and APIs were identical.

We further identified 12 species playing mediating roles in the racial disparity of BMI between the Caucasians and APIs (Figure 5c). Eight species ([Ruminococcus] gnavus, Faecalibacterium prausnitzii, Bacteroides uniformis, [Eubacterium] biforme, Bacteroides fragilis, Prevotella copri, Bacteroides ovatus, Haemophilus parainfluenzae) mediate positively on the racial disparity of BMI, meanwhile, four species (Bifidobacterium adolescentis, Bacteroides plebeius, Parabacteroides distasonis, Staphylococcus aureus) play negative mediating roles. Remarkably, there are six common species B. ovatus, B. plebeius, B. uniformis, B. adolescentis, F. prausnitzii, P. distasonis, and P. copri identified by both comparisons: China-USA and China-UK illustrated in the previous subsection (Figure 5d). Literature reveals that all identified species are associated with the BMI or obesity [4149].

Collectively, the findings in the matched China vs. USA, China vs. UK, and API vs. Caucasian datasets show that the microbiome is an important mediator in the regional or racial disparity of BMI and they substantially shed light on how to reduce the disparity of BMI. The identified microbial agents can be used as the potential therapeutic target for the treatment based on microbiota modulation in the future.

Discussion

The emerging evidence highlights the potential of microbiome in understanding health disparity. In this paper, we proposed a mediation analytical framework, SparseMCMM_HD, to investigate the microbiome’s role in health disparity. Considering a health disparity framework with three components: non-manipulable exposure (e.g. race or region), microbiome as mediator, and outcome, the proposed SparseMCMM_HD deciphers the overall health disparity of the non-manipulable exposure on the outcome into two components: MDM that would be eliminated by equalizing microbiome profiles and RDM that would remain and could not be explained through microbiome. Remarkably, MDM paves a viable path towards reduction of health disparity with microbial modulation. Similar to SparseMCMM, SparseMCMM_HD can be used to identify the signature causal microbes and examine whether the overall or component-wise MDM is significantly non-zero.

It is vital to control confounding effects beforehand in the real data analysis to satisfy the identifiability assumptions of the proposed SparseMCMM_HD. In three BMI applications, we first employed PSM to remove the confounding effects by selecting matched subsets in which the distributions of confounders were notably comparable between two exposure groups, and then performed the proposed SparseMCMM_HD framework. The utilization of SparseMCMM_HD in two datasets, the curatedMetagenomicData 3.4.2 package and the AGP dataset, depicts an explicit causal path among region or race, microbiome, and BMI. These findings confirm not only that microbiome is differentially distributed across races or regions, but also that the differential microbiome profile contributes to the disparities in BMI across races or regions. The identified microbial signatures potentially aid in developing personalized medication or nutrition to reduce obesity disparity.

It is not surprising that the proportion of disparities in BMI explained by the microbiome profiles is not large (~10%) in all three applications, due to the heritable and polygenic nature of BMI [54, 55]. Further investigations to integrate the microbiome profile and genetic factors are necessary to better understand disparity in BMI. However, we here emphasize that the proposed SparseMCMM_HD is a rigorous and validated causal mediation framework and has preeminent potential to identify the microbiome’s roles in much broader health disparity studies.

Recently, several other microbial mediation methods have been proposed, such as CMM [56], MedTest [57], Zhang, et al. [58], LDM-med [59], and MarZIC [60], in a typical three-factor (manipulable exposure, microbiome as mediator, and outcome) study design. Considering distinct model assumptions and characteristics, a few recent benchmark studies [12, 5660] show that there is no method performing consistently and accurately better than others in all circumstances. However, since the assumptions for model identification in health disparity are weaker than those for the causal mediation effects in the manipulable exposure-mediator-outcome framework [23], it is expected that the idea of how the proposed SparseMCMM_HD framework rigorously defines, quantifies, and tests health disparity measures as an extension of SparseMCMM [12] can provide insight into extending these available mediation models to investigate the microbiome’s role in health disparity. Then, a useful path forward will be to mutually employ these multiple and complimentary methods to better characterize the microbiome’s role in health disparity by capitalizing their distinct assumptions and strengths.

Our study has several limitations. First, similar to discussions in SparseMCMM [12], SparseMCMM_HD takes microbiome data at a fixed time point into the proposed frame and is limited to accommodate the dynamic nature of microbiome. Second, the proposed SparseMCMM_HD currently deals with disparity in a continuous outcome. Given the fact that multiple binary or categorical outcomes are disproportionately prevalent across races or regions [6163], it will be worthwhile to extend the current framework to handle categorical outcomes. Third, microbiome studies typically characterize both taxonomic and functional profiles of microbial communities. Functional profile is generally thought to be more closely linked with human health and disease. Identifying the role of microbiome in terms of gene function in health disparity is of high practical value [64].

Conclusions

This paper elucidates the role of microbiome in health disparity by providing a causal mediation analytic framework for investigating the relationship among race or region, microbiome, and outcome under the counterfactual framework. The proposed SparseMCMM_HD framework is a useful tool to investigate the underlying biological mechanism of health disparity and disentangles the substantial contributions of microbiome to health disparity. The applications of SparseMCMM_HD in the disparity of BMI across races and regions uncover the microbial mediating roles in reducing the disparities of BMI and improving health equality.

Funding

The study was supported in part by grant number U54MD000538 from the National Institutes of Health (NIH) National Institute on Minority Health and Health Disparities, and grant number P20CA252728 from the National Cancer Institute. The contents of this publication are solely the responsibility of the authors and do not necessarily represent the official views of the NIH.

List of abbreviations

AGP

American gut project

API

Asian or Pacific Islander

BMI

body mass index

MDM

manipulable disparity measure

ODM

overall disparity measure

PERMANOVA

permutational multivariate analysis of variance

PSM

propensity score matching

RDM

residual disparity measure

SparseMCMM

sparse microbial causal mediation model

SparseMCMM_HD

SparseMCMM for health disparity

SMD

standardized mean difference

UK

United Kingdom

USA

United States

Footnotes

Ethics approval and consent to participate

All utilized microbiome datasets are publicly available. No ethics approval or consent to participate was required for this study.

Consent for publication

Not applicable: All utilized microbiome datasets are publicly available. No consent for publication was required for this study.

Competing interests

The authors declare that they have no competing interests.

Availability of data and materials

All relevant datasets are publicly available. The data used in investigations of the regional difference of BMI in the China group compared to the United States (USA) and United Kingdom (UK) groups can be downloaded from the curatedMetagenomicData 3.4.2 package [28]. The data used in investigations of the racial difference in BMI between Caucasians and Asian or Pacific Islanders are from the American Gut Project. Their raw data and metadata are publicly available on the FTP website (ftp://ftp.microbio.me/AmericanGut/). Version 07/29/2016 is used in our analyses.

SparseMCMM R package is available at https://github.com/chanw0/SparseMCMM. The interactive web app for the proposed SparseMCMM_HD framework is available at https://chanw0.shinyapps.io/sparsemcmm_hd/.

References

  • 1.Hu J, Koh H, He L, Liu M, Blaser MJ, Li H: A two-stage microbial association mapping framework with advanced FDR control. Microbiome 2018, 6(1):1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Gilbert JA, Quinn RA, Debelius J, Xu ZZ, Morton J, Garg N, Jansson JK, Dorrestein PC, Knight R: Microbiome-wide association studies link dynamic microbial consortia to disease. Nature 2016, 535(7610):94–103. [DOI] [PubMed] [Google Scholar]
  • 3.Koh H, Livanos AE, Blaser MJ, Li H: A highly adaptive microbiome-based association test for survival traits. BMC genomics 2018, 19(1):1–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Koh H, Blaser MJ, Li H: A powerful microbiome-based association test and a microbial taxa discovery framework for comprehensive association mapping. Microbiome 2017, 5(1):1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ahn J, Sinha R, Pei Z, Dominianni C, Wu J, Shi J, Goedert JJ, Hayes RB, Yang L: Human gut microbiome and risk for colorectal cancer. Journal of the National Cancer Institute 2013, 105(24):1907–1911. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kostic AD, Xavier RJ, Gevers D: The microbiome in inflammatory bowel disease: current status and the future ahead. Gastroenterology 2014, 146(6):1489–1499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Hoffmann AR, Proctor L, Surette M, Suchodolski J: The microbiome: the trillions of microorganisms that maintain health and cause disease in humans and companion animals. Veterinary Pathology 2016, 53(1):10–21. [DOI] [PubMed] [Google Scholar]
  • 8.Kelly TN, Bazzano LA, Ajami NJ, He H, Zhao J, Petrosino JF, Correa A, He J: Gut microbiome associates with lifetime cardiovascular disease risk profile among bogalusa heart study participants. Circulation research 2016, 119(8):956–964. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Gilbert JA, Blaser MJ, Caporaso JG, Jansson JK, Lynch SV, Knight R: Current understanding of the human microbiome. Nature medicine 2018, 24(4):392–400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Fattorusso A, Di Genova L, Dell’Isola GB, Mencaroni E, Esposito S: Autism spectrum disorders and the gut microbiota. Nutrients 2019, 11(3):521. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Integrative H, Proctor LM, Creasy HH, Fettweis JM, Lloyd-Price J, Mahurkar A, Zhou W, Buck GA, Snyder MP, Strauss III JF: The integrative human microbiome project. Nature 2019, 569(7758):641–648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wang C, Hu J, Blaser MJ, Li H: Estimating and testing the microbial causal mediation effect with high-dimensional and compositional microbiome data. Bioinformatics (Oxford, England) 2020, 36(2):347–355. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Gupta VK, Paul S, Dutta C: Geography, ethnicity or subsistence-specific variations in human microbiome composition and diversity. Frontiers in microbiology 2017, 8:1162. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Dehingia M, Adak A, Khan MR: Ethnicity-influenced microbiota: a future healthcare perspective. Trends in microbiology 2019, 27(3):191–193. [DOI] [PubMed] [Google Scholar]
  • 15.Findley K, Williams DR, Grice EA, Bonham VL: Health disparities and the microbiome. Trends in microbiology 2016, 24(11):847–850. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Rothschild D, Weissbrod O, Barkan E, Kurilshikov A, Korem T, Zeevi D, Costea PI, Godneva A, Kalka IN, Bar N: Environment dominates over host genetics in shaping human gut microbiota. Nature 2018, 555(7695):210–215. [DOI] [PubMed] [Google Scholar]
  • 17.Schulz MD, Atay Ç, Heringer J, Romrig FK, Schwitalla S, Aydin B, Ziegler PK, Varga J, Reindl W, Pommerenke C: High-fat-diet-mediated dysbiosis promotes intestinal carcinogenesis independently of obesity. Nature 2014, 514(7523):508–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Zhang X, Zhao Y, Zhang M, Pang X, Xu J, Kang C, Li M, Zhang C, Zhang Z, Zhang Y: Structural changes of gut microbiota during berberine-mediated prevention of obesity and insulin resistance in high-fat diet-fed rats. 2012. [DOI] [PMC free article] [PubMed]
  • 19.Cox LM, Blaser MJ: Antibiotics in early life and obesity. Nature Reviews Endocrinology 2015, 11(3):182–190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Taur Y, Pamer EG: Microbiome mediation of infections in the cancer setting. Genome medicine 2016, 8(1):1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Lv B-M, Quan Y, Zhang H-Y: Causal inference in microbiome medicine: Principles and applications. Trends in microbiology 2021, 29(8):736–746. [DOI] [PubMed] [Google Scholar]
  • 22.Ananthakrishnan AN, Bernstein CN, Iliopoulos D, Macpherson A, Neurath MF, Ali RAR, Vavricka SR, Fiocchi C: Environmental triggers in IBD: a review of progress and evidence. Nature Reviews Gastroenterology & Hepatology 2018, 15(1):39–49. [DOI] [PubMed] [Google Scholar]
  • 23.VanderWeele TJ, Robinson WR: On causal interpretation of race in regressions adjusting for confounding and mediating variables. Epidemiology (Cambridge, Mass) 2014, 25(4):473. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Haththotuwa RN, Wijeyaratne CN, Senarath U: Worldwide epidemic of obesity. In: Obesity and obstetrics. Elsevier; 2020: 3–8. [Google Scholar]
  • 25.Gaulke CA, Sharpton TJ: The influence of ethnicity and geography on human gut microbiome composition. Nature medicine 2018, 24(10):1495–1496. [DOI] [PubMed] [Google Scholar]
  • 26.Deschasaux M, Bouter KE, Prodan A, Levin E, Groen AK, Herrema H, Tremaroli V, Bakker GJ, Attaye I, Pinto-Sietsma S-J: Depicting the composition of gut microbiota in a population with varied ethnic origins but shared geography. Nature medicine 2018, 24(10):1526–1531. [DOI] [PubMed] [Google Scholar]
  • 27.He Y, Wu W, Zheng H-M, Li P, McDonald D, Sheng H-F, Chen M-X, Chen Z-H, Ji G-Y, Zheng Z-D-X: Regional variation limits applications of healthy gut microbiome reference ranges and disease models. Nature medicine 2018, 24(10):1532–1535. [DOI] [PubMed] [Google Scholar]
  • 28.Pasolli E, Schiffer L, Manghi P, Renson A, Obenchain V, Truong DT, Beghini F, Malik F, Ramos M, Dowd JB: Accessible, curated metagenomic data through ExperimentHub. Nature methods 2017, 14(11):1023–1024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Hijazi RH, Jernigan RW: Modelling compositional data using Dirichlet regression models. Journal of Applied Probability & Statistics 2009, 4(1):77–91. [Google Scholar]
  • 30.Naimi AI, Schnitzer ME, Moodie EE, Bodnar LM: Mediation analysis for health disparities research. American journal of epidemiology 2016, 184(4):315–324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Rosenbaum PR, Rubin DB: The central role of the propensity score in observational studies for causal effects. Biometrika 1983, 70(1):41–55. [Google Scholar]
  • 32.Austin PC: An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivariate behavioral research 2011, 46(3):399–424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Liu M, Koh H, Kurtz ZD, Battaglia T, PeBenito A, Li H, Nazzal L, Blaser MJ: Oxalobacter formigenes-associated host features and microbial community structures examined using the American Gut Project. Microbiome 2017, 5(1):1–17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Anderson MJ: Permutational multivariate analysis of variance (PERMANOVA). Wiley statsref: statistics reference online 2014:1–15. [Google Scholar]
  • 35.López-Almela I, Romaní-Pérez M, Bullich-Vilarrubias C, Benítez-Páez A, Gómez Del Pulgar EM, Francés R, Liebisch G, Sanz Y: Bacteroides uniformis combined with fiber amplifies metabolic and immune benefits in obese mice. Gut Microbes 2021, 13(1):1–20. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Cho S-H, Cho Y-J, Park J-H: The human symbiont Bacteroides thetaiotaomicron promotes diet-induced obesity by regulating host lipid metabolism. Journal of Microbiology 2022, 60(1):118–127. [DOI] [PubMed] [Google Scholar]
  • 37.Hehemann J-H, Kelly AG, Pudlo NA, Martens EC, Boraston AB: Bacteria of the human gut microbiome catabolize red seaweed glycans with carbohydrate-active enzyme updates from extrinsic microbes. Proceedings of the National Academy of Sciences 2012, 109(48):19786–19791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Thomas F, Hehemann J-H, Rebuffet E, Czjzek M, Michel G: Environmental and gut bacteroidetes: the food connection. Frontiers in microbiology 2011, 2:93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Holmes ZC, Silverman JD, Dressman HK, Wei Z, Dallow EP, Armstrong SC, Seed PC, Rawls JF, David LA: Short-chain fatty acid production by gut microbiota from children with obesity differs according to prebiotic choice and bacterial community composition. MBio 2020, 11(4):e00914–00920. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Million á, Angelakis E, Maraninchi M, Henry M, Giorgi R, Valero R, Vialettes B, Raoult D: Correlation between body mass index and gut concentrations of Lactobacillus reuteri, Bifidobacterium animalis, Methanobrevibacter smithii and Escherichia coli. International journal of obesity 2013, 37(11):1460–1466. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
  • 41.Ignacio A, Fernandes M, Rodrigues V, Groppo F, Cardoso A, Avila-Campos M, Nakano V: Correlation between body mass index and faecal microbiota from children. Clinical microbiology and infection 2016, 22(3):258. e251–258. e258. [DOI] [PubMed] [Google Scholar]
  • 42.Journey EK, Ortega-Santos CP, Bruening M, Whisner CM: Changes in weight status and the intestinal microbiota among college freshman, aged 18 years. Journal of Adolescent Health 2020, 66(2):166–171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Palmas V, Pisanu S, Madau V, Casula E, Deledda A, Cusano R, Uva P, Vascellari S, Loviselli A, Manzin A: Gut microbiota markers associated with obesity and overweight in Italian adults. Scientific reports 2021, 11(1):1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Chatel J-M, Maioli TU, Borras-Nogues E, Barbosa SC, Martins VD, Torres L, Azevedo VADC: Possible benefits of Faecalibacterium prausnitzii for obesity-associated gut disorders. Frontiers in Pharmacology 2021:2869. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Duan M, Wang Y, Zhang Q, Zou R, Guo M, Zheng H: Characteristics of gut microbiota in people with obesity. Plos one 2021, 16(8):e0255446. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Li Y, Yang Y, Wang J, Cai P, Li M, Tang X, Tan Y, Wang Y, Zhang F, Wen X: Bacteroides ovatus-mediated CD27− MAIT cell activation is associated with obesity-related T2D progression. Cellular & Molecular Immunology 2022:1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Assmann TS, Cuevas-Sierra A, Riezu-Boj JI, Milagro FI, Martínez JA: Comprehensive analysis reveals novel interactions between circulating MicroRNAs and gut microbiota composition in human obesity. International journal of molecular sciences 2020, 21(24):9509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Befus M, Lowy FD, Miko BA, Mukherjee DV, Herzig CT, Larson EL: Obesity as a determinant of Staphylococcus aureus colonization among inmates in maximum-security prisons in New York State. American journal of epidemiology 2015, 182(6):494–502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Yan H, Qin Q, Chen J, Yan S, Li T, Gao X, Yang Y, Li A, Ding S: Gut microbiome alterations in patients with visceral obesity based on quantitative computed tomography. Frontiers in Cellular and Infection Microbiology 2022:1451. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Yang M, Bose S, Lim S, Seo J, Shin J, Lee D, Chung W-H, Song E-J, Nam Y-D, Kim H: Beneficial effects of newly isolated Akkermansia muciniphila strains from the human gut on obesity and metabolic dysregulation. Microorganisms 2020, 8(9):1413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Vacca M, Celano G, Calabrese FM, Portincasa P, Gobbetti M, De Angelis M: The controversial role of human gut lachnospiraceae. Microorganisms 2020, 8(4):573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.McDonald D, Hyde E, Debelius JW, Morton JT, Gonzalez A, Ackermann G, Aksenov AA, Behsaz B, Brennan C, Chen Y: American gut: an open platform for citizen science microbiome research. Msystems 2018, 3(3):e00031–00018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Obana KK, Davis J: Racial disparities in the prevalence of arthritis among native Hawaiians and Pacific Islanders, Whites, and Asians. Hawai’i Journal of Medicine & Public Health 2016, 75(6):155. [PMC free article] [PubMed] [Google Scholar]
  • 54.Bouchard C: Genetics of obesity: what we have learned over decades of research. Obesity 2021, 29(5):802–820. [DOI] [PubMed] [Google Scholar]
  • 55.Loos RJ, Yeo GS: The genetics of obesity: from discovery to biology. Nature Reviews Genetics 2022, 23(2):120–133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Sohn MB, Li H: Compositional mediation analysis for microbiome studies. The Annals of Applied Statistics 2019, 13(1):661–681. [Google Scholar]
  • 57.Zhang J, Wei Z, Chen J: A distance-based approach for testing the mediation effect of the human microbiome. Bioinformatics 2018, 34(11):1875–1883. [DOI] [PubMed] [Google Scholar]
  • 58.Zhang H, Chen J, Feng Y, Wang C, Li H, Liu L: Mediation effect selection in high-dimensional and compositional microbiome data. Statistics in medicine 2021, 40(4):885–896. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Yue Y, Hu Y: Testing Mediation Effects in High-Dimensional Microbiome Data with False Discovery Rate Control. 2021.
  • 60.Wu Q, O’malley J, Datta S, Gharaibeh RZ, Jobin C, Karagas MR, Coker MO, Hoen AG, Christensen BC, Madan JC: MarZIC: A Marginal Mediation Model for Zero-Inflated Compositional Mediators with Applications to Microbiome Data. Genes 2022, 13(6):1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Royston KJ, Adedokun B, Olopade OI: Race, the microbiome and colorectal cancer. World Journal of Gastrointestinal Oncology 2019, 11(10):773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Siddharth S, Sharma D: Racial disparity and triple-negative breast cancer in African-American women: a multifaceted affair between obesity, biology, and socioeconomic determinants. Cancers 2018, 10(12):514. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Johnson JR, Kittles RA: Genetic ancestry and racial differences in prostate tumours. Nature Reviews Urology 2022, 19(3):133–134. [DOI] [PubMed] [Google Scholar]
  • 64.Tian L, Wang X-W, Wu A-K, Fan Y, Friedman J, Dahlin A, Waldor MK, Weinstock GM, Weiss ST, Liu Y-Y: Deciphering functional redundancy in the human microbiome. Nature communications 2020, 11(1):1–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Foster ZS, Sharpton TJ, Grünwald NJ: Metacoder: An R package for visualization and manipulation of community taxonomic diversity data. PLoS computational biology 2017, 13(2):e1005404. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All relevant datasets are publicly available. The data used in investigations of the regional difference of BMI in the China group compared to the United States (USA) and United Kingdom (UK) groups can be downloaded from the curatedMetagenomicData 3.4.2 package [28]. The data used in investigations of the racial difference in BMI between Caucasians and Asian or Pacific Islanders are from the American Gut Project. Their raw data and metadata are publicly available on the FTP website (ftp://ftp.microbio.me/AmericanGut/). Version 07/29/2016 is used in our analyses.

SparseMCMM R package is available at https://github.com/chanw0/SparseMCMM. The interactive web app for the proposed SparseMCMM_HD framework is available at https://chanw0.shinyapps.io/sparsemcmm_hd/.


Articles from Research Square are provided here courtesy of American Journal Experts

RESOURCES