A randomization-based causal inference framework for uncovering environmental exposure effects on human gut microbiota

Alice J Sommer; Annette Peters; Martina Rommel; Josef Cyrys; Harald Grallert; Dirk Haller; Christian L Müller; Marie-Abèle C Bind

doi:10.1371/journal.pcbi.1010044

. 2022 May 9;18(5):e1010044. doi: 10.1371/journal.pcbi.1010044

A randomization-based causal inference framework for uncovering environmental exposure effects on human gut microbiota

Alice J Sommer ^1,^2,^3,^*, Annette Peters ^2,^3,^4,^*, Martina Rommel ^3,⁵, Josef Cyrys ³, Harald Grallert ^5,⁶, Dirk Haller ^7,⁸, Christian L Müller ^9,^10,^11,^*, Marie-Abèle C Bind ^1,¹²

Editor: Simon Anders¹³

¹Department of Statistics, Harvard University, Cambridge, Massachusetts, United States of America

²Institute for Medical Information Processing, Biometry, and Epidemiology, Faculty of Medicine, Ludwig-Maximilians-University München, Munich, Germany

³Institute of Epidemiology, Helmholtz Zentrum München, Neuherberg, Germany

⁴Department of Environmental Health, Harvard T. H. Chan School of Public Health, Boston, Massachusetts, United States of America

⁵Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, Neuherberg, Germany

⁶German Center for Diabetes Research (DZD), München-Neuherberg, Germany

⁷ZIEL - Institute for Food & Health, Technical University of Munich, Freising, Germany

⁸Chair of Nutrition and Immunology, Technical University of Munich, Freising, Germany

⁹Institute of Computational Biology, Helmholtz Zentrum München, Neuherberg, Germany

¹⁰Department of Statistics, Ludwig-Maximilians-University München, Munich, Germany

¹¹Center for Computational Mathematics, Flatiron Institute, New York City, New York, United States of America

¹²Biostatistics Center, Massachusetts General Hospital and Harvard Medical School, Boston, Massachusetts, United States of America

¹³Ruprecht Karls Universitat Heidelberg, GERMANY

The authors have declared that no competing interests exist.

^✉

* E-mail: alice.j.sommer@gmail.com (AJS); peters@helmholtz-muenchen.de (AP); cmueller@flatironinstitute.org (CLM)

Roles

Alice J Sommer: Conceptualization, Methodology, Visualization, Writing – original draft, Writing – review & editing

Annette Peters: Conceptualization, Funding acquisition, Supervision, Writing – review & editing

Martina Rommel: Writing – review & editing

Josef Cyrys: Writing – review & editing

Harald Grallert: Writing – review & editing

Dirk Haller: Writing – review & editing

Christian L Müller: Conceptualization, Methodology, Supervision, Writing – review & editing

Marie-Abèle C Bind: Conceptualization, Funding acquisition, Methodology, Supervision, Writing – review & editing

Simon Anders: Editor

PMCID: PMC9129050 PMID: 35533202

Abstract

Statistical analysis of microbial genomic data within epidemiological cohort studies holds the promise to assess the influence of environmental exposures on both the host and the host-associated microbiome. However, the observational character of prospective cohort data and the intricate characteristics of microbiome data make it challenging to discover causal associations between environment and microbiome. Here, we introduce a causal inference framework based on the Rubin Causal Model that can help scientists to investigate such environment-host microbiome relationships, to capitalize on existing, possibly powerful, test statistics, and test plausible sharp null hypotheses. Using data from the German KORA cohort study, we illustrate our framework by designing two hypothetical randomized experiments with interventions of (i) air pollution reduction and (ii) smoking prevention. We study the effects of these interventions on the human gut microbiome by testing shifts in microbial diversity, changes in individual microbial abundances, and microbial network wiring between groups of matched subjects via randomization-based inference. In the smoking prevention scenario, we identify a small interconnected group of taxa worth further scrutiny, including Christensenellaceae and Ruminococcaceae genera, that have been previously associated with blood metabolite changes. These findings demonstrate that our framework may uncover potentially causal links between environmental exposure and the gut microbiome from observational data. We anticipate the present statistical framework to be a good starting point for further discoveries on the role of the gut microbiome in environmental health.

Author summary

Environmental influences on the human gut microbiome are still to be discovered or better understood. In this paper, we contribute to the field of microbiome research and environmental epidemiology by suggesting a stage-based causal inference framework relying on the foundations of the Rubin Causal Model. A particularity of the framework is the use of randomization-based inference, which we value to be a necessary exploratory inference method when tackling untapped research questions. To illustrate the framework, we explore the effects of two inhaled environmental exposures previously hypothesized to be linked with gastrointestinal diseases and the gut microbiome: air pollution exposure and cigarette smoking.

This is a PLOS Computational Biology Methods paper.

1 Introduction

The human microbiome plays a pivotal role in maintaining a healthy physiology via multiple interactions with the host. The gut microbiome, for instance, provides important metabolic capabilities for food digestion [1, 2] and regulates immune homeostasis [3]. Although dietary interventions [4], pathogen infections [5], and antibiotics use [6] can trigger rapid changes of gut microbial compositions and can lead to dysbiotic disruptions of host-microbiome interactions, the long-term impact of environmental exposures on the human gut microbiome remains poorly understood. In this paper, we provide a causal inference framework for assessing such epidemiological questions and analyze a prospective cohort with collected microbiome data. Recent technological advances, through culture-independent analyses, have facilitated a surge in observational studies of the human microbiome [7–9]. A common method to catalog microbial constituents is high-throughput amplicon sequencing [10], allowing the acquisition of gut microbiome survey data for large prospective cohort studies. Important examples include the Human Microbiome Project [11], the British TwinsUK study [12], the Dutch LifeLines-DEEP [13] and Rotterdam Studies [14], the Chinese Guangdong Gut Microbiome Project [15], the American Gut Project [16], and the German KORA study [17].

Thus far, these and other studies have linked alterations in gut microbial compositions to several common diseases, including rheumatoid arthritis, colorectal cancer, obesity, inflammatory bowel disease (IBD), and diabetes [18]. Although environmental exposures such as particulate matter (PM) [19] and smoking [20] are also related to these diseases, an understanding of environment-gut microbiome relationships and their implications for disease mechanisms has remained elusive. Here, we examine such environment-gut microbiome relationships within a causal inference framework [21] combined with state-of-the-art statistical methods for amplicon sequence variant (ASV) data [22]. We illustrate our analysis framework using data from the German KORA study [17] and focus on two inhaled environmental exposures previously hypothesized to be linked with gastrointestinal diseases and the gut microbiome: (i) particulate matter (PM) with diameter smaller or equal to 2.5 μm (PM_2.5) and (ii) cigarette smoking.

Air pollution exposure has been found to be associated with gastrointestinal diseases, such as appendicitis [23], inflammatory bowel disease [24], abdominal pain [25], and metabolic disorders [26]. Current research suggests that air pollution may impact the gut microbiome which, in turn, acts as a “mediator” of the association between air pollution and metabolic disorders such as obesity and type 2 diabetes [27–29]. These studies found associations between nitric oxide, nitrogen dioxide [27], PM [28], and ozone [30] exposures and the gut microbiome. Several potential pathways explain how particles affect human health. The gut is exposed to PM through: (i) mucociliary clearance, i.e., the self-cleaning mechanism of the bronchi, inducing inhaled PM to be cleared from the lungs to the gut, and (ii) oral route exposure, when food and water are contaminated by PM prior to being ingested or in the alimentary canal via inhalation [31, 32]. Results from murine studies of the effect of PM on the gut [33–37] suggest that exposure to PM changes the microbial composition and increases gut permeability, leading to higher systemic inflammation due to the unrestrained influx of microbial products from the gut into the systemic circulation [38].

The chemical mixture of cigarette smoke inhaled into the lungs has an effect on blood markers that, in turn, interact with the gut. Another pathway is that the toxicants of cigarette smoke swallowed into the gastrointestinal tract induce gastrointestinal microbiota dysbiosis via antimicrobial activity and regulation of the intestinal microenvironment [39]. Cigarette smoking is an inhaled exposure that has been shown to influence the susceptibility of diseases such as IBD, colorectal cancer, and systemic diseases [20, 40, 41]. Animal studies suggest that cigarette smoke may mediate its effects through alterations of intestinal microbiota [42]. In humans, shifts in the gut microbiome composition and diversity were observed after smoking cessation. These shifts were similar to previously observed shifts in obese vs. lean patients, suggesting a potential microbial link between the metabolic function of the gut and smoking cessation [43]. Comparison of the gut microbiome composition of smokers and never-smokers led to similar observations [44]. So far, the underlying mechanisms of the effect of smoking on not only gut-related, but also autoimmune diseases have not been established. It has been hypothesizes that the gut microbiome may be the missing link between smoking and autoimmune diseases [20].

Central to the present study is the investigation of the causal question: Does reducing inhaled environmental exposures alter the human gut microbiome? As summarized in Fig 1, we answer this question using the following four-stage analysis framework: (i) conceptualize hypothetical environmental interventions that could have resulted in the observed data at hand, (ii) design our non-randomized data, so that the unconfoundedness assumption can be assumed, (iii) choose powerful, state-of-the-art test statistics from the literature to compare human gut microbiome at different levels of taxonomic granularity between subjects assigned to the interventions vs. not, and (iv) interpret the implications of the results for recommending further studies or the studied hypothetical intervention. The reason for using this four-stage approach is for the transparency of its assumptions when interpreting results. The Methods section elaborates on each of these steps. An essential ingredient in stage (iii) of our framework is the use of a randomization-based hypothesis testing with powerful test statistics comparing subjects under an intervention vs. not [45, 46]. We do not attempt to provide an estimate of (and uncertainty around) an estimand to avoid relying on assumptions such as the additivity of the treatment effects, asymptotic arguments, or an imputation model, which may be the case when drawing Neymanian (i.e., distribution-based) or Bayesian inferences. This Fisherian approach is a non-asymptotic first step to start shedding light on merely-touched research questions dependent on complex data structures, such as human gut microbiome data.

Fig 1 — Stage 1: Formulation of a plausible hypothetical intervention (e.g., decreasing inhaled environmental exposures) to examine its impacts on the gut microbiome. Stage 2: Construct a hypothetical paired-randomized experiment in which the environmental intervention been implemented randomly. Stage 3: Choose powerful test statistics comparing the gut microbiome had the subjects been hypothetically randomized to the environmental intervention vs. not and test the sharp null hypotheses of no effect of the intervention at different aggregation levels of the data. Stage 4: Interpretation of the statistical analyses and recommendations for future studies or implementation of the intervention.

The present causal inference framework relies on ideas developed in the 70s [47–50] and the Rubin Causal Model [51, 52] to analyze observational data by reconstructing the ideal conditions of randomized experiments, the “gold standard” to draw objective causal inferences on the effects of an intervention [53]. A formidable statistical challenge is, however, to define and test these intervention effects for high-dimensional taxonomically-structured microbiome relative abundance data. Here, we adapted and advanced several state-of-the-art approaches from the statistical literature tailored to amplicon data, ranging from tests for α-diversity in networked communities [54, 55], Microbiome Regression-based Kernel Association Tests (MiRKAT) for β-diversity to randomization-based differential compositional mean tests [56]. We also applied and analyzed individual taxon differential abundance tests with taxonomic rank-dependent reference selection [57] and sparse compositionally robust taxon-taxon network estimation schemes [58] with novel differential edge tests [59], thus covering a comprehensive list of archetypical microbiome data analysis tasks.

Our framework complements recent causal inference approaches for microbiome data such as mediation methods [60, 61], graphical models [62], and Mendelian randomization [63, 64] to analyze observational gut microbiome data. In these studies, the target for interventions is the microbiome and the understanding of its effects on diseases, i.e., the microbiome is treated as the exposure and diseases as outcomes. Here, we are interested in examining the effects of environmental exposures (interventions) on the gut microbiome (“the” outcome), when only non-randomized data are available. To the best of our knowledge, no other observational study interested in environmental effects on the gut microbiome addressed their research question using causal inference methods.

In the following, we detail the characteristics of the KORA FF4 study population and highlight potential effects of the hypothetical interventions, air pollution reduction and smoking prevention, on the gut microbiome. In particular, we characterize potential effects in terms of changes in overall microbial diversity, taxon-level abundances, and microbial associations. In the smoking prevention analysis, we identified taxa, including Ruminococcaceae (UCG-005, UCG-003, UCG-002) and Christensenellaceae R-7-group, that are part of a stable sub-community in the microbial association networks and have been found to contribute to circulating blood metabolites in the LifeLines-Deep cohort [65].

2 Methods

2.1 The German KORA FF4 cohort study

The data come from the German KORA FF4 cohort study, which involves participants aged 25 to 74 years old living in the city of Augsburg [17]. The participants were subject to health questionnaires and follow-up examinations. During the study, stool samples were collected and the gut microbiota data for 2,033 participants were obtained with 16S rRNA gene sequencing. For each participant we have their long-term exposure to air pollution (particulate matter). The long-term exposure variables come from the ULTRA III study, in which air pollutants were monitored several times a year at 20 locations within the Augsburg region. From this data, annual averages of air pollutants were calculated using land-use regression models. The models explain the spatial variation of the pollutants with predictor variables derived from geographic information systems (GIS). To obtain the long-term air pollution values for each participant, land-use regression models were applied to their residential address. Moreover, to elucidate relationships between health outcomes and diet, dietary intake data were collected for 1,469 participants of the KORA FF4 cohort. Dietary intake was derived using a method combining information from a food frequency questionnaire (FFQ) and repeated 24-h food lists [66]. In brief, the usual food intake (in gram/day) was calculated as the product of the probability of consumption of a food on a given day and the average amount of a food consumed on a consumption day.

2.1.1 Gut microbiome data sequencing and preprocessing

DNA Extraction, 16S rRNA Gene Amplification, and Amplicon Sequencing. Fecal DNA extraction was isolated by following the protocol of [67]. The samples were profiled by high-throughput amplicon sequencing with dual-index barcoding using the Illumina MiSeq platform. Based on a study providing guidelines for selecting primer pairs [68], the V3-V4 region of the gene encoding 16S ribosomal RNA was amplified using the primers 341-forward (CCTACGGGNGGCWGCAG; bacterial domain specific) and 785-reverse (GACTACHVGGGTATCTAATCC; bacterial domain specific). Amplification was undertaken using the Phusian High-Fidelity DNA Polymerase Hotstart as per manufacturer’s instructions. The PCR libraries were then barcoded using a dual-index system. Following a round of purification with AMPure XP beads (Beckman Coulter), libraries were quantified and pooled to 2nM. The libraries were sequenced on an Illumina MiSeq (2 x 250 bp), using facilities provided by the Ziel NGS-Core Facility of the Technical University Muenchen (TUM).

Bioinformatics. The demultiplexed, per-sample, primer-free amplicon reads were processed by the DADA2 workflow [22, 69] to infer sequence variants, remove chimeras, and assign taxonomies with the Silva v128 database [70] using the naive Bayesian classifier method [71] until the genus-level assignment and the exact matching method [72] for species-level assignment. We opted for the high-resolution DADA2 method to infer sequence variants without any fixed threshold, thereby resolving variants that differ by as little as one nucleotide. Amplicon sequence variants (ASVs) do not impose the arbitrary dissimilarity thresholds that define OTUs. They provide consistent labels because they represent a biological reality that exists outside the data being analyzed: the DNA sequence of the assayed organism, thus they remain consistent into the indefinite future [22]. The result of the DADA2 pipeline is two datasets: (i) a ASV count dataset, where each row specifies how often an ASV was sequenced and (ii) a taxonomic assignment dataset, where each row specifies the taxonomic names of an ASV. It is common to create a phylogenetic tree of the ASVs to later on calculate microbial diversity measures such as the DivNet [55] and UniFrac [73] (see the Statistical analysis stage of Methods Section 2). The multiple genome alignment for the phylogenetic tree was built with the DECIPHER R package enabling a profile-to-profile method aligns a sequence set by merging profiles along a guide tree until all the input sequences are aligned [74]. The multiple genome alignment was used to construct the de novo phylogenetic tree using phangorn R package. We first construct a neighbor-joining tree [75], and then fit a maximum likelihood tree using the neighbor-joining tree as a starting point. After 16S rRNA sequencing the 2,033 stool samples from the KORA cohort and processing the sequences with the DADA2 pipeline, we observe 15,801 ASVs (see Fig A and Table A in S1 Text).

2.2 Causal inference framework

The four stages of the causal framework [21] that we use to construct hypothetical randomized experiments to study the environment-microbiome relationship are the following:

Conceptual: Formulation of a plausible hypothetical intervention (e.g., decreasing air pollution levels) to examine its impacts on the gut microbiome.
Design: Reconstruct the hypothetical randomized experiment had the environmental intervention been implemented randomly.
Analysis: Choose valid and powerful test statistics comparing the gut microbiome had the subjects been hypothetically randomized to the environmental intervention vs. not and test the sharp null hypotheses of no effect of the intervention at different aggregation levels of the data.
Summary: Interpretation of the statistical analyses and recommendations for future studies and interventions.

2.3 Conceptual stage: Formulation of the hypothetical randomized experiment in terms of potential outcomes

To understand whether environmental interventions have an effect on the human gut microbiome, the objective is to reconstruct a hypothetical experiment that mimics a controlled randomized experiment [53], in which an environmental intervention could be believed to have been randomized. Let W_i be the indicator of the assignment for subject i (i = 1, …, N) to an environmental intervention vs. none, where:

\begin{matrix} W_{i} = {\begin{matrix} 1 & if i is under the intervention, \\ 0 & if i is not . \end{matrix} \end{matrix}

(1)

The composition of a human gut microbiome can be expressed as a B-dimensional vector of the microbial abundance. We define $Y_{i}^{b}$ as the real abundance (count) of the b^th bacterial taxon, b = 1, …, B for subject i. We define the potential outcomes of subject i as $Y_{i}^{b} (1)$ , the b^th taxon abundance (count) had subject i been randomized to the environmental intervention (W_i = 1), and $Y_{i}^{b} (0)$ , had subject i not been randomized to the intervention (W_i = 0). Table 1 shows the potential outcomes for the N subjects.

Table 1. Potential outcomes for the subjects of the hypothetical experiment.

Taxa	1		2		…		B
Subjects	W_i = 0	W_i = 1	W_i = 0	W_i = 1			W_i = 0	W_i = 1
1	$Y_{1}^{1} (0)$	$Y_{1}^{1} (1)$	$Y_{1}^{2} (0)$	$Y_{1}^{2} (1)$	…	…	$Y_{1}^{B} (0)$	$Y_{1}^{B} (1)$
2	$Y_{2}^{1} (0)$	$Y_{2}^{1} (1)$	$Y_{2}^{2} (0)$	$Y_{2}^{2} (1)$	…	…	$Y_{2}^{B} (0)$	$Y_{2}^{B} (1)$
…	…	…	…	…	…	…	…	…
N	$Y_{N}^{1} (0)$	$Y_{N}^{1} (1)$	$Y_{N}^{2} (0)$	$Y_{N}^{2} (1)$	…	…	$Y_{N}^{B} (0)$	$Y_{N}^{B} (1)$

Open in a new tab

Only one of the two potential outcomes can actually be observed for each subject: this is why the Rubin Causal Model characterizes causal inference as a missing data problem [52], where the observed outcome of subject-i and taxa-b can be expressed as a function of both potential outcomes:

\begin{matrix} Y_{i}^{b, o b s} = W_{i} Y_{i}^{b} (1) + (1 - W_{i}) Y_{i}^{b} (0) \end{matrix}

(2)

2.3.1 Observed outcomes measurement

The human gut microbiome can be composed of trillions of bacteria. However, due to technology limitations, the exact abundance and number of all strains present in a human subject cannot be measured. To tackle this limitation, we opted for the processing of Amplicon Sequence Variants (ASVs) from our sequencing data to approximate the true gut microbiome composition of our study population [22, 69]. ASVs refer to individual DNA sequences recovered from a high-throughput marker gene analysis, the 16S rRNA gene in our case. Therefore, in this study the observed outcome under investigation is a N × A matrix, for a = 1, …, A ASVs, an approximation of the N × B matrix described above. This limitation adds another layer of missing data, i.e., we are missing the true gut microbial composition of each subject. We define the ASV counts we measured for each subject-i as $C_{i}^{a, o b s}$ , which corresponds to $Y_{i}^{b \in A, o b s}$ plus some measurement error.

2.4 Design stage: Reconstruction of the conceptualized hypothetical experiment

To assess causality, randomized experiments have long been regarded as the “gold standard”. We are interested in the effect of environmental interventions that are often unpractical or ethical to assign randomly to humans within an experiment [21]. Therefore, we resort to a design stage [76] with a matched-sampling strategy to construct two hypothetical randomized experiments to assess the effects of an intervention on the changes in gut microbiome composition. The aim of our pair-matching strategy is to achieve balance in background covariates distributions as it is expected, on average, in randomized experiments. This approach attempts to create exchangeable groups as if the exposure was randomly assigned to each participant given measured covariates, to guarantee exposure assignment is not confounded by the measured background covariates. The exposure assignment mechanism determines which units receive which exposure; in other words, which potential outcomes are observed and which are missing [52]. The unconfoundedness of the assignment mechanism given covariates is a key assumption of the Rubin Causal Model.

Our pair-matching strategy aims to remove individual-specific confounding (e.g., years of age, sex, unit of BMI). Briefly, subject i under $W_{i}^{o b s} = 1$ with pre-exposure covariates X_i is matched to subject i^⋆, under $W_{i^{⋆}}^{o b s} = 0$ only if X_i^⋆ is “similar” to X_i. For each unit, the vector of covariates is given by $X_{i} = (X_{i}^{(1)}, \dots, X_{i}^{(k)})$ . In order to ensure covariate balance, we only allow a treated unit to be matched with a control unit if the component-wise distances between their covariate vectors are less than some pre-specified thresholds δ₁, …, δ_k. For any pair of covariate vectors X_i and X_i^⋆, we define the difference between them as

\begin{matrix} Δ (X_{i}, X_{i^{⋆}}) = {\begin{matrix} 0 & if | X_{i}^{(k)} - X_{i^{⋆}}^{(k)} | < δ_{k} for k = 1, \dots K, \\ + \infty & otherwise \end{matrix} \end{matrix}

(3)

This constrained pair matching can be achieved using a maximum bipartite matching [77] on a graph such that: (i) there is one node per unit, partitioned into intervention nodes and control nodes, (ii) the edges are pairs of treated and control nodes with covariates X_i and X_i^⋆, and (iii) an edge exists if and only if Δ(X_i, X_i^⋆) < +∞. By construction, using a maximum bipartite matching algorithm on this graph as implemented in the igraph R package produces the largest set of matched pairs that satisfy the unit-specific proximity constraints set by our thresholds. Let $N_{E} = \sum_{i = 1}^{N} W_{i}$ be the number of subjects under the environmental intervention and $N_{C} = \sum_{i = 1}^{N} 1 - W_{i}$ the number of control subjects, after matching.

After excluding the participants of the cohort that take antibiotics and had a cancer of the digestive organ, the pre-matched data set consists of 1,967 participants. At this stage, the objective is to create balanced data subsets for which the plausibility of the “unconfoundedness” assumption is based on a diagnostic of our choice. We choose the thresholds, δ₁, …, δ₇, according to the pre-matching diagnostic plots of the covariate distributions (see Figs B-G in S1 Text). We privilege a large dataset with balance, while assuring that the created pairs, or in other words “twins”, are scientifically plausible, e.g., no male and female could be matched. We assume a covariate to be balanced when its distribution is approximately the same under the exposure vs. not. The thresholds are: the absolute differences between the amount of alcohol consumption is less than δ₁ = 25 g/day, between the body-mass-index is less than δ₂ = 4 kg/m², between age is less than δ₃ = 5 years, the diabetes status (diabetic, non-diabetic) is identical, i.e., δ₄ = 0, and so are sex (male, female), i.e., δ₅ = 0, and physical activity (active, inactive), i.e., δ₆ = 0. Additionally, in the air pollution reduction experiment: the smoking status (smoker, ex-smoker, never-smoker) is identical, i.e., δ₇ = 0, and in the smoking prevention experiment: the absolute difference between years of education is less than δ₇ = 3 years.

After matching, we obtain two subsets of the data that can be analyzed as coming from two pair-randomized experiments: (i) an air pollution (ap) reduction hypothetical experiment (N_ap = 198), and (ii) a smoking prevention hypothetical experiment (N_s = 542); both data sets exhibit no evidence against covariate imbalance (see Table 2 and Figs B-G in S1 Text).

Table 2. Before and after matching number of units.

The thresholds for the air pollution experiment are based on 90^th and 10^th percentiles of the PM_2.5 distribution.

	Air pollution		Smoking
	N _C	N _E	N _C	N _E
Matching	PM_2.5 ≥ 13.0 μg/m³	PM_2.5 ≤ 10.3 μg/m³	Smoker	Never smoker
Before	206	193	302	908
After	99	99	271	271

Open in a new tab

It is well known that diet has an influence on the gut microbiome and future studies on the gut should include dietary intake data in their analysis [78, 79]. In our study, we only have access to dietary intake data for a portion of our samples, therefore we examine balance diagnostics in usual nutrient intake after matching in order to maintain a large data set before matching. Figs H-I in S1 Text show that after matching, our intervention and control units (in both hypothetical experiments) do not exhibit imbalance with respect to the following food items: potatoes/roots, vegetables, legumes, fruits/nuts, dairy products, cereal products, meat, fish, egg products, fat, and sugar. In the same way, we checked for covariate balance after matching for medication intake, also a well-known confounder in human gut microbiome studies. Figs D and G in S1 Text show that after matching, our intervention and control units (in both hypothetical experiments) do not exhibit imbalance with respect to medication intake.

2.5 Statistical analysis stage: Randomization-based inference

To compare the gut microbiome of subjects under the environmental intervention to control subjects, we choose to not rely on asymptotic arguments, but instead take a Fisherian perspective (i.e., randomization-based inference) [45, 80]. We test sharp null hypotheses (H₀) of no effect of the intervention for any unit by choosing test statistics that account for the complex microbiome data structure, including the additional “layer” of missing data. The ASV count data has a challenging structure because: (i) it is high-dimensional, (ii) some ASVs have low prevalence, (iii) the ASVs are strongly correlated, and (iv) it is compositional. ASV-count data is said to be “compositional” because between units comparison of ASV counts might not be informative due to the limited sequencing depth of the machine and the total number of sequenced reads varies from unit to unit (i.e., they have no common denominator) [81].

In randomization-based inference the goal is to construct the null randomization distribution of a test statistic assuming H₀, T, by computing the values of the test statistic for all possible intervention assignments. Because the number of assignments is very large, we calculate an approximating p-value using N_iter iterations, i.e., the proportion of computed test statistics that are as large or larger than the observed test statistic: $\frac{1}{N_{i t e r}} \sum_{l = 1}^{N_{i t e r}} 1_{T_{l} \geq T^{o b s}}$ , where $1_{T_{l} \geq T^{o b s}} = 1$ when T_l ≥ T^obs, and 0 otherwise (for two-sided tests we obtain the p-values by taking absolute value of T_l and T^obs, i.e., |T_l| and |T^obs|). A small p-value shows that the observed test statistic is a rare event when the null hypothesis is true, which indicates the results are worth further scrutiny [82]. In the following subsections, we describe the null hypotheses we test and the test statistics we use to draw randomization-based inferences with N_iter = 10,000 possible intervention assignments following a matched-pair design (see summary Table 3). This means that the permutations of the intervention assignment vectors needed to calculate the Fisher p-values follow the design of our hypothetical experiments. When units have varying probabilities of being treated, the analysis of experiments, even when hypothetical, should reflect their design [53, 76].

Table 3. Data transformation and choice of test statistics.

analysis level	data transformation	test statistic
richness	breakaway [83]	betta regression coefficient [54]
α-diversity	DivNet [55]	betta regression coefficient [54]
β-diversity	pairwise distance matrices	MiRKAT score statistic [84]
high-dimensional means	centered log ratios	mean abundance difference [56]
abundance	normalization by ratio [57]	LogFold mean difference
correlation	association matrices [58]	differential associations [59]

Open in a new tab

2.5.1 Diversity analyses

*Within Subjects Diversity.

One of the challenges of analyzing ASV-count data is working around the low prevalence of some ASVs that are due to the limited sequencing depth of the machine and the fact that some ASVs are not shared in the entire population (see Fig A in S1 Text). Therefore, before directly testing within-subject diversity differences with so called “plug-in” estimates, it has been recently suggested to start with estimating the diversity with statistical models [54]. We will follow this idea by estimating richness with the breakaway method [83] and estimating the Shannon index for α-diversity with the DivNet method [55].

Richness. The sharp null hypothesis of no effect of the intervention on the richness can be written as: $H_{0, R} : \sum_{b = 1}^{B} 1_{Y_{i}^{b} (0) > 0} = \sum_{b = 1}^{B} 1_{Y_{i}^{b} (1) > 0}$ . To estimate the richness of subject i (i.e., the number of bacterial taxa present in subject i), we will estimate the total richness in subject i, observed and unobserved, by B_i with the breakaway model [83]. Let f_i,1, f_i,2, … denote the number of bacterial taxa observed once, twice, and so on, in a subject i, and let f_i,0 denote the number of unobserved bacteria, so that B_i = f_i,0 + f_i,1 + f_i,2 + …. The idea behind the breakaway method is that for each subject i, it predicts the number of unobserved bacteria, f_i,0, with a nonlinear regression model to, in turn, provide an estimate of B_i.

α-diversity. The sharp null hypothesis of no effect of the intervention on α-diversity can be written as: $H_{0, α} : \sum_{b = 1}^{B} Y_{i}^{b} (0) = \sum_{b = 1}^{B} Y_{i}^{b} (1)$ . To have estimates for indices of the α-diversity of subject i (i.e., its total microbial abundance) and their variance, we use the DivNet method, because it accounts for the co-occurrence patterns (i.e., ecological networks) of bacterial taxa in the microbial community [55]. Let $Z_{i}^{b} = Y_{i}^{b} / \sum_{b = 1}^{B} Y_{i}^{b} \in [0, 1]$ denote the unknown relative abundance of taxa b in subject i, noting that $\sum_{b = 1}^{B} Z_{i}^{b} = 1$ . As a reminder, $C_{i}^{a, o b s}$ denotes the number of times taxa a was observed in the stool sample of subject i in our data. One of the most common α-diversity indices is the Shannon entropy [85], which is defined as: $α_{i, S h a n n o n} = - \sum_{b = 1}^{B} Z_{i}^{b} l o g (Z_{i}^{b})$ . This index captures information about both the species richness (i.e., number of species) and relative abundances of the species: as the number of species in the population increases, so does the Shannon index, and as the relative abundances diverge from a uniform distribution and become more unequal, the Shannon index decreases. In the ecological literature, researchers mostly use the following maximum likelihood estimate of α_i,Shannon (often referred to as a “plug-in” estimate): $- \sum_{a = 1}^{A} \frac{C_{i}^{a}}{\sum_{a = 1}^{A} C_{i}^{a}} l o g (\frac{C_{i}^{a}}{\sum_{a = 1}^{A} C_{i}^{a}})$ . It has been proven that this estimate is negatively biased [86]. Therefore, various corrections have been proposed and are detailed in [55]. However, most of the suggested estimates are only functions of the ASV count vectors $C_{i}^{a}$ and do not utilize the full ASV count data matrix C and the co-occurrence pattern, i.e., ecological network, of the ASVs. Willis and Martin [55] showed that these networks can have substantial effects on estimates of diversity and proposed an approach, called DivNet, to estimating diversity in the presence of an ecological network. DivNet estimates are based on log-ratio transformations by fixing a “baseline” taxon for comparison, which are modeled by a multivariate normal distribution to incorporate the co-occurrence structure between the taxa as the covariance matrix. The main advantage of DivNet method is the use of information shared across all samples to obtain more precise and accurate estimates.

Choice of test statistic. The test statistic we use to test H_0,R and H_0,α are the coefficient of the intervention indicator estimated by the regression suggested by Willis et al. [54]. Using the coefficient of a model as the test statistic of a Fisher test was introduced in the 70s [87]. At this stage, to achieve larger bias reductions, frequentist regression models can be used to remove residual confounding that was not accounted for, during the design stage [47, 48].

Willis et al. [54] suggest to test changes in richness (B_i) and α-diversity ( ${\hat{α}}_{i}$ ) with a hierarchical regression model, assuming that richness is a function of: the intervention indicator W_i, random variation that is not attributed to the covariates, and the standard error previously estimated with breakaway or DivNet (because not every bacterial taxon in each subject was observed so we cannot not know the true richness or α-diversity for any i). The regression models are built with the betta function available in the breakaway R package [54, 83].

Between Subjects Diversity.

β-diversity. Distance-based analysis is a popular approach for evaluating the association between an exposure and microbiome diversity. The pairwise distances, d_ii^⋆, for high-dimensional data we consider are the: UniFrac (unweighted) distance [73], Jaccard index, Aitchison distance [88] (i.e, Euclidean distance on centered log-ratio transformed data), and Gower distance [89] (on centered log-ratio transformed data). We choose the unweighted paired UniFrac, because it is a distance metric (i.e., a non-negative real-valued function) as opposed to the generalized UniFrac. In the same way, the Jaccard distance was chosen as opposed to the commonly used Bray-Curtis. The sharp null hypothesis of no effect of the intervention on β-diversity can be written as: H_0,β: d_ii^⋆(0) = d_ii^⋆(1).

Choice of test statistic. Despite the popularity of distance-based approaches, the field of microbiome studies suffers from technical challenges, especially in selecting the best distance. Therefore, we use the suggested microbiome regression-based kernel association test (MiRKAT) [84] that uses a kernel regression and a standard variance-component score test statistic [90]. To consider different distance measures, the optimal MiRKAT: tests H_0,β for each individual kernel, obtains the p-value for each of the tests, and then adjust for multiple comparison with a p-value with an omnibus test. Instead, we use a fully randomization-based multiple comparison adjustment method detailed subsequently.

Multiple comparison adjustments. We follow the fully randomization-based procedure for multiple comparisons adjustments suggested by Lee et al. [91], which is directly motivated by the intervention assignment actually used in the experiment. This procedure has been suggested to have sufficient power to detect causal effects [91]. In our hypothetical experiments, we have matched paired intervention assignments. Both the unadjusted and adjusted p-values in the procedure are randomization-based, so do not require any assumptions about the underlying distribution of the data. The adjusted p-values are calculated following Steps 1–4:

Calculate for each hypothesis h, an unadjusted p-value for the observed test statistic by taking the proportion of computed test statistics that are as large or larger than the observed test statistic. This procedure is detailed in the introduction of the Statistical analysis stage section. Also, for each hypothesis h, h = 1, ‥, H, and intervention assignment iteration iter, iter = 1, …, N_iter, record the vector of calculated test statistics $T_{β}^{h, i t e r} = (T_{β}^{1, 1}, \dots, T_{β}^{H, N_{i t e r}})$ .
For each h and each iteration iter, calculate an unadjusted randomization-based p-value, with $T_{β}^{h, i t e r}$ as the observed test statistic. For each iter, record the minimum p-value of the H p-values.
The repetitions of Step 2 capture the joint randomization distribution of the test statistics and thus, of the unadjusted p-values.
To calculate the adjusted p-values for the observed test statistics, for each h, take the proportion of “minimum p-values” (recorded in Step 2) that are less than or equal to its unadjusted p-value calculated in Step 1.

Step 2–3. essentially represent a translation of the multiple test statistics into p-values sharing a common 0–1 scale.

2.5.2 Composition analyses

Compositional equivalence.

The compositionality problem means that: a change in abundance (i.e., sequenced counts) of a taxon in a sample induces a change in sequenced counts across all taxa. This problem, among others, leads to many false positive discoveries when comparing taxon abundances between groups. Moreover, because the components of a composition must sum to unity, directly applying standard multivariate statistical methods intended for unconstrained data to compositional data may result in inappropriate and misleading inferences [88]. Therefore, we impose a centered log-ratio transformation of the compositions before testing the null hypothesis of no difference in average microbial abundance as suggested by [56].

For the measured microbiome data C, the centered log-ratio matrices L = (L_i, …, L_N) are defined by $L_{i}^{a} = l o g (\frac{C_{i}^{a}}{g (C_{i})})$ , where $g (C_{i}) = {(\prod_{a = 1}^{A} C_{i}^{a})}^{1 / A}$ denotes the geometric mean of the vector $C_{i} = (C_{i}^{1}, \dots, C_{i}^{A})$ . The sharp null hypothesis of no microbiome composition difference between the subjects under the intervention vs. not can be written as H_0,M: for each subject i, L_i(0) = L_i(1).

Choice of test statistic. The scale invariant test statistic suggested by [56] for testing H_0,M is based on the differences ${\bar{L}}_{E}^{a, o b s} - {\bar{L}}_{C}^{a, o b s}$ , where ${\bar{L}}_{E}^{a, o b s} = 1 / N_{E} \sum_{i : W_{i} = 1} L_{i}^{a}$ is the sample mean of the centered log ratios for subjects under the intervention. Because microbiome data are often sparse (i.e., only a small number of taxa may have different mean abundance), the following test statistic is considered: $T_{M} = \frac{N_{E} N_{C}}{N_{E} + N_{C}} \underset{1 \leq a \leq A}{m a x} \frac{{(L_{E}^{a, o b s} - L_{C}^{a, o b s})}^{2}}{{\hat{γ}}_{a a}}$ , where ${\hat{γ}}_{a a}$ are the pooled-sample centered log-ratio variances.

Differential abundance

The compositional nature of the microbiome data requires to choose appropriate reference sets with respect to which testing of changes in individual taxon relative abundances becomes feasible [81]. A recent approach that follows this methodology is the DACOMP (differential abundance testing with compositionality adjustment) method, proposed by [57]. DACOMP is a data-adaptive approach that: 1) identifies a subset of non-differentially abundant (reference) ASVs (R) in a testing dataset, and 2) tests the null of no differential abundance (DA) of the other ASVs (a) “normalized-by-ratio” in a training dataset. First, a taxon enters the set R = (r₁, …, r_F) if it has low variance (< 2) and high prevalence (> 90%) (see Figs L-M in S1 Text). For the analyses at the ASV level, we chose the variance to be < 3 and the prevalence to be > 40% as thresholds in order the have at least one reference per subject. Second, using the suggested “normalization-by-ratio” approach, the null hypothesis to be tested for ASV a is that ASV a is not differentially abundant: $H_{0, DA}^{(a \notin R)} : \frac{C_{i}^{a} (0)}{C_{i}^{a} (0) + \sum_{f = 1}^{R} C_{i}^{r_{f}} (0)} = \frac{C_{i}^{a} (1)}{C_{i}^{a} (1) + \sum_{f = 1}^{R} C_{i}^{r_{f}} (1)}$ ,

Choice of test statistic. To test this sharp null hypothesis, we use the LogFold change available in the dacomp package with the Compute.resample.test function. This function is useful to perform randomization-based inference for differential abundance testing, because it enables to directly incorporate a matrix of hypothetically randomized intervention assignments, which is an appealing feature when researchers work with particular designs. Because we are testing $H_{0, DA}^{(a \notin R)}$ ||A|| − ||R|| times at all taxonomic ranks, we adjust for multiple tests with the method described in the β-diversity analysis section [91].

Partial correlation structure

For our matched intervention and control subjects, we predicted microbial association networks using the Sparse InversE Covariance estimation for Ecological ASsociation Inference (SPIEC-EASI) framework [58] that uses 1) centered log-ratio transformations of the observed ASV counts, $C_{i}^{a, o b s}$ , to perform 2) Sparse Inverse Covariance selection (with the graphical lasso method [92]), and finally 3) pick a model based on edge stability (with the StARS method [93]) to obtain a sparse inverse covariance matrix. The non-zero entries of this matrix are proportional to the negative partial correlations among the taxa and form the edge set in an undirected weighted graph G = (V, E). Here, the vertex (or node) set V = v₁, …, v_p represents the p genera and the edge set E ⊂ V × V the possible associations among them. The null hypotheses of no effect of the environmental intervention on the observed genera network associations can be expressed as: H_0,N: E(0) = E(1).

Choice of test statistic. We compare the intervention and control networks with test statistics for the difference in genera associations individually. To generate sampling distributions of the test statistics under H_0,N, the intervention and control labels are reassigned 10,000 times to the samples while the matched pair structure is maintained, i.e., the assignment to intervention or control is permuted within each pair. The SPIEC-EASI framework is then re-applied to each permuted data set. This procedure is implemented with the Network Construction and Comparison for Microbiome Data, NetCoMi, R package [59]. To adjust for multiple differential association tests, we use the method described in the β-diversity and differential abundance analyses section [91].

2.6 Summary stage: Interpretation of the results

If the null hypothesis of no difference in the gut microbiome between the matched groups of treated and control units is rejected, that difference warrants further scrutiny to assess whether it can be attributed to the different treatments, assuming the assignment “unconfoundness” assumption holds. We can then report that the gut microbiome composition was or was not altered by the introduction of the environmental intervention. It is important to note that interpretation should be restricted to units that remain in the finite sample after matching (see their detailed characteristics in Figs B-I in S1 Text). The data do not provide direct information for “unmatched” units. Caution regarding extrapolation to units with covariate values beyond values observed in the balanced subset of the data is necessary.

3 Results

To illustrate our causal inference framework, we first conceptualize two hypothetical environmental interventions that potentially influence the gut microbiome: (i) an air pollution reduction, and (ii) a smoking prevention intervention. Second, for each intervention, we construct a hypothetical matched-pair randomized experiment, aiming at satisfying the “unconfoundedness” assumption (see Methods section). Third, we analyze the “unconfounded”/“as-if randomized” data subset with randomization-based inference to test sharp null hypotheses of no effect of the interventions for each unit at different taxonomic levels of the microbial ASV data. The results presented subsequently correspond to the third stage of the framework. Fourth, causal conclusions are developed in the Discussion section. Following the American Statistical Association statement [82, 94], we avoid searching for “statistically significant” results with a dichotomous approach. To give structure to our results reporting, we reject the sharp null hypotheses of no effect of an environmental intervention when the p-value is lower or equal to 0.1 or, when computed, when the adjusted p-value is lower or equal to 0.2. We are more tolerant with adjusted p-values because multiple comparison adjustments are conservative and our study is exploring a nearly untapped field. Nonetheless, we highly recommend to the readers interested in our research questions or result replication to examine all reported p-values in Figs and Tables, because higher p-values do not mean that an effect is improbable, absent, false, or unimportant [82].

3.1 Characteristics of study population

Our study is based on data from the KORA FF4 study cohort [17]. Because we performed a design stage before analyzing the data we have two study populations, one per hypothetical experiment, which are subsets of the entire cohort (see Design stage in the Methods section). In the air pollution reduction experiment, we analyze 99 matched pairs of subjects living in highly (PM_2.5 ≥ 13.0 μg/m³) and less (PM_2.5 ≤ 10.3 μg/m³) polluted areas with similar background characteristics distributions (Table 4 and Figs B-D and Fig H in S1 Text). The thresholds for the air pollution experiment intervention are based on 90^th and 10^th percentiles of the PM_2.5 distribution. We focus on the PM_2.5 pollutant, originating mainly from traffic emissions and fossil fuel combustion, for its known penetrating effects into the lung and potential implication for the gut microbiome [27]. In the smoking prevention experiment, we analyze 271 matched pairs of smokers and never-smokers (with background characteristics distributions presented in Table 4 and Figs E-G and Fig I in S1 Text). A total of 45 units are included in the balanced data subset of both hypothetical experiments.

Table 4. Baseline characteristics of the study population in the air pollution reduction (left table) and smoking prevention experiments (right table).

Continuous variables: mean and standard deviation (St. d.). Categorical variables: number of samples per category (N) and proportion of category (%).

		Air pollution (PM_2.5)				Smoking
		≥ 13.0 μg/m³		≤ 10.3 μg/m³		Smoker		Never-Smoker
		Mean	St. d.	Mean	St. d.	Mean	St. d.	Mean	St. d.
Age		60.6	12.4	60.3	12.4	54.2	9.4	54.4	9.6
Body Mass Index		27.0	4.3	27.0	3.8	26.7	4.4	26.7	4.2
Alcohol intake (g/day)		11.3	14.1	11.5	13.9	13.0	15.6	11.6	14.3
Years of education		11.9	2.6	11.7	2.8	11.7	2.3	11.8	2.2
		N	%	N	%	N	%	N	%
Sex	F	41	20.7	41	20.7	130	24.0	130	24.0
Sex	M	58	29.3	58	29.3	141	26.0	141	26.0
Smoking	Ex-S.	27	13.6	27	13.6	-	-	-	-
	Never-S.	62	31.3	62	31.3	-	-	-	-
	Smoker	10	5.1	10	5.1	-	-	-	-
Diabetes	No	95	48.0	95	48.0	264	48.7	264	48.7
Diabetes	Yes	4	2.0	4	2.0	7	1.3	7	1.3
Phys. Activity	No	36	18.2	36	18.2	125	23.1	125	23.1
Phys. Activity	Yes	63	31.8	63	31.8	146	26.9	146	26.9

Open in a new tab

3.2 Microbial diversity analysis

A common first step in microbiome data analysis is estimating and assessing microbial diversity. We begin by investigating the potentially causal effects of the interventions on within-subject diversity (α−diversity) and between-subject variation (β−diversity), respectively.

3.2.1 Within-subject diversity

Gut bacterial richness and Shannon diversity were estimated on the ASV level with the breakaway [83] and DivNet [55] method, respectively. Comparisons of the distributions of these estimated variables between subject under the intervention vs. not in both hypothetical experiments are shown by boxplots in Fig 2. The small approximate Fisherian p-values based on 10,000 permutations of the intervention assignment give us ground for rejecting the null hypotheses of no effect of an air pollution reduction (p-value_ap,richness ≈ 0.0008, p-value_ap,α−div. ≈ 0.0388) and smoking prevention (p-value_s,richness ≈ 0.1518, p-value_s,α−div. ≈ 0.0497) on the diversity of the human gut microbiome. On average, lower diversity was observed in the subjects living in polluted areas or smokers compared to participants living in less polluted areas or non-smokers. This diversity difference motivates the more in-depth analyses of the gut microbiome composition presented subsequently.

Fig 2 — Boxplots (with median), values of the test-statistics from the `betta` regression [54], and one-sided randomization-based p-values for 10,000 permutations of the intervention assignment following a matched-pair design. (A) Boxplots of the richness. (B) Boxplots of the α-diversity.

3.2.2 Between-subject variation

To estimate β-diversity indices, we calculated UniFrac, Aitchison, Jaccard, and Gower dissimilarities between all possible pairs of subjects. The results are shown in Table 5. To alleviate the problem of choosing the best dissimilarity metric for β−diversity estimation, we follow the Microbiome Regression-based Kernel Association Test (MiRKAT) of Zhao et al. [84] suggesting to compute several metrics and then adjust for multiple comparisons. In both experiments, we reject the sharp null hypotheses of no effect of the intervention on between-subject variation.

Table 5. β-diversity.

Microbiome Regression-based Kernel Association Test (MiRKAT), unadjusted and adjusted one-sided randomization-based p-values for 10,000 permutations of the intervention assignment following a matched-pair design.

	Air pollution			Smoking
distance	test-statistic	p-value	p-value_adj	test-statistic	p-value	p-value_adj
UniFrac	12.1	0.0199	0.0506	61.5	0.0024	0.0070
Aitchison	82596.0	0.1096	0.2466	356921.5	0.0001	0.0003
Jaccard	19.4	0.0884	0.2043	84.5	0.0001	0.0003
Gower	0.2	0.0089	0.0250	0.1	0.0485	0.1204

Open in a new tab

3.3 Microbial compositions analysis

We next investigated whether shifts in microbial compositions as a whole or differences in specific microbial taxa were observable in the hypothetical experiments. We illustrate this by designing and analyzing sharp null hypotheses for global compositional means and differential genus abundances.

3.3.1 Compositional mean differences

Testing whether two study groups have the same microbiome composition can be viewed as a two-sample testing problem for high-dimensional compositional mean equivalence. We tested sharp null hypotheses using a test statistic developed particularly for that purpose by Cao et al. [56]. Table 6 summarizes the results for each taxonomic level. We reject the sharp null hypotheses of gut microbiome composition equivalence for the air pollution reduction and smoking prevention experiments. In both experiments, p-values are higher at the ASV level than at higher taxonomy levels.

Table 6. Compositional equivalence test.

Test statistic for high-dimensional data suggested by [56] and one-sided randomization-based p-values for 10,000 permutations of the intervention assignment following a matched-pair design.

		ASV	Species	Genus	Family	Order	Class	Phylum
Air Pollution	nb. of taxa (p)	4,370	414	252	74	44	29	15
	test statistic	12.8	12.9	11.9	8.8	8.4	8.4	8.1
	p-value	0.1451	0.0722	0.0733	0.1521	0.1161	0.1021	0.0591
Smoking	nb. of taxa (p)	7,409	479	271	81	48	31	16
	test statistic	13.0	14.5	13.3	11.6	8.6	9.4	10.4
	p-value	0.1607	0.0302	0.0384	0.0279	0.0859	0.0440	0.0135

Open in a new tab

3.3.2 Differential taxon abundances

For compositional microbiome data, identifying sets of potentially “differentially abundant taxa” relates to testing sharp null hypotheses of no difference in abundance of individual taxa with respect to a reference set. We conducted such an analysis on the genus level for all genera present in at least 5% of the samples. This prevalence threshold was guided by the amount of information preserved when performing filtering, i.e., microbial abundance and the number of taxa observed per sample (see Figs N-Q in S1 Text). We applied the Differential abundance testing for compositional data (DACOMP) approach [57] and used two-sided tests since we lack prior knowledge on the direction of the abundance changes. Fig 3 highlights the key DACOMP results for both experiments. In the air pollution reduction experiment, we reject the sharp null hypothesis of no differential abundance only for the Marvinbryantia genus (p-value_adj. = 0.0120) (see Table B in S1 Text). We also reject the sharp null hypothesis of no effect of smoking prevention for eleven genera (see Fig 3 and Table C in S1 Text). Five belong to the Ruminococcaceae family: Ruminococcaceae-UCG-002, Ruminococcaceae-UCG-003, Ruminococcaceae-UCG-005, Ruminococcus-1, and Ruminococcaceae-NK4A214-group, three to the Lachnospiraceae family: Lachnospira, Lachnospiraceae-NK4A136-group, and Coprococcus-1, one to the Christensenellaceae family: Christensenellaceae-R-7-group, and two to the Mollicutes class, which belong to the NB1-n and Mollicutes-RF9 order.

Fig 3 — For each genus, adjusted two-sided randomization-based p-values for 10,000 permutations of the smoking prevention intervention assignment following a matched-pair design. Genera with no tip point belong to the set of reference taxa. Black circled tip point: differentially abundant genus (*Marvinbryantia*) in the air pollution reduction experiment.

3.4 Microbial network analysis

To gain insights into changes in the organizational structure of the underlying microbial gut ecosystem, we next calculated sparse genus-genus association networks for each exposure level and hypothetical experiment and highlight the results of our randomization-based differential association testing.

3.4.1 Genus-genus association networks

We used the Sparse InversE Covariance estimation for Ecological ASsociation Inference (SPIEC-EASI) framework [58] to infer genus-genus associations in our two hypothetical experiments. We used the glasso mode of SPIEC-EASI with default parameters (see Methods for details). Fig 4A shows the overall structure of the learned sparse association networks for the smoking prevention experiment (smokers (left panel) and non-smokers (right panel), respectively). Each network possesses a single large connected component consisting of 30–40 mostly Firmicutes genera (highlighted area in Fig 4A). These connected components also included the majority of the previously identified potentially differentially abundant genera, including Ruminococcaceae (UCG-005, UCG-002), Ruminococcus-1, and Christensenellaceae-R-7-group (see Fig 4B for a detailed view of the connectivity pattern). The genus-genus associations networks derived from the air pollution reduction experiment showed similar overall topological features containing one large connected component of 60 genera, including Ruminococcaceae (UCG-005, UCG-003, UCG-002) and Christensenellaceae-R-7-group among others (see also Fig R in S1 Text).

Fig 4 — (A) Visualization of the genus-genus partial correlations estimated with the SPIEC-EASI method. Edges thickness is proportional to partial correlation, and color to sign: red: negative partial correlation, green: positive partial correlation. Node size is proportional to the centered log ratio of the genus abundances, and color is according to phyla. Triangle shaped nodes are differentially abundant (see Fig 3). (B) Zoom in largest connected component and differential associations (bold genera).

3.4.2 Differential genus-genus associations

To identify potentially differential network associations in the intervention experiments, we coupled the SPIEC-EASI network estimation procedure with permutations of the intervention assignment, available in the NetCoMi R package [59] (see also Methods for details). For each hypothetical experiment, we list the five genus-genus associations with smallest adjusted two-sided randomization-based p-values in Table 7 and highlight these associations in Fig 4B. In the air pollution reduction experiment, we reject the sharp null hypothesis of no differential association for two edges: the Succinivibrio/Slackia edge (p-value_adj. ≈ 0.0661), and the Ruminiclostridium/Cloacibacillus edge (p-value_adj. ≈ 0.1063) (see Table 7 and Fig R in S1 Text).

Table 7. Differential associations of genera.

Smallest five adjusted two-sided randomization-based p-values for 10,000 permutations of the intervention assignment following a matched-pair design.

Air pollution
Genus-genus associations (-: disappearance after intervention)	p-value_adj
Succinivibrio/Slackia (-)	0.0661
Ruminiclostridium/Cloacibacillus (-)	0.1063
Cloacibacillus/Lachnospiraceae-FCS020-group	0.2795
Megasphaera/Alistipes	0.4147
Bacteroidales (Genus: unknown)/Prevotella-2	0.4753
Smoking
Genus-genus associations (-: disappearance after intervention)	p-value_adj
Christensenellaceae-R-7/Ruminiclostridium-6 (-)	0.1585
Ruminococcaceae-UCG-010/Ruminiclostridium-6 (-)	0.1585
Ruminococcaceae-UCG-014/Flavonifractor	0.2031
Clostridiales-vadinBB60/Ruminiclostridium-6	0.2376
Ruminococcaceae-UCG-013/Faecalibacterium	0.2492

Open in a new tab

In the smoking prevention experiment, we also reject the sharp null hypothesis of no differential association for two edges: the Ruminiclostridium-6/Ruminococcaceae-UCG-010 edge (p-value_adj. ≈ 0.1585), and the Ruminiclostridium-6/Christensenellaceae-R-7-group edge (p-value_adj. ≈ 0.1585) (see Table 7). The genera that participate in these potentially differential associations are also highlighted in Fig 4B.

3.5 Exploring associations between genera and lipid metabolites

The gut microbiome is a substantial driver of circulating lipid levels, and prior work has shown [65, 95, 96] that the relative abundance of several microbial families, including Christensenellaceae, Ruminococcaceae, and the Tenericutes phylum were negatively correlated with triglyceride and positively associated with high-density lipoproteins (HDL) cholesterol. Since our analysis identified a small interconnected group of genera, including Christensenellaceae and Ruminococcaceae, for whom we rejected the no differential abundance hypothesis, we performed an exploratory data analysis to investigate taxa-serum lipid measurements associations. Four lipids were measured in blood serum samples of our study population from the KORA cohort: total, HDL, and LDL, cholesterol, as well as triglyceride levels. Fig 5A shows the correlation between these lipids and the genera we discovered in our hypothetical experiments. Tendencies similar to those reported in previous studies can be observed in our data.

Fig 5 — (A) Lipid metabolites correlation with selected genera from the smoking prevention experiment (green). (B) Scatterplots of high-density lipoprotein (HDL) cholesterol and triglycerides vs. centered log-ratio transformed relative abundances of the genera *Ruminococcaceae-UCG-005* and *Christensenellaceae-R-7-group*.

For instance, in the smoking prevention dataset, we observed a positive correlation of Christensenellaceae R-7-group and Ruminococcaceae (UCG-005) genus abundances (under centered log-ratio transformation) with HDL cholesterol and negative correlation with triglyceride levels, respectively (see Fig 5B). Similar correlation patterns were also found for the other genera for whom we rejected the no differential abundance hypothesis (see second and forth column in Fig 5A). Our findings were also in line with recently reported correlation results in Vojinovic et al. [65] using the Dutch LifeLines-DEEP cohort [13] and the Rotterdam Study [14].

3.6 Sensitivity analysis

To assess whether the pair-matching strategy chosen for the design stage influenced the conclusions of this study, we conducted a sensitivity analysis (see Sensitivity Analysis section in S1 Text). For that, we implemented the more commonly-used propensity score matching algorithm [97] and obtained matched samples of: 1) 158 participants living in low PM_2.5 areas and 158 participants living in higher PM_2.5 areas, and 2) 290 smokers and 290 never smokers (see Table D and Figs T-Y in S1 Text for the balance diagnostics). For both hypothetical randomized experiments, using propensity score matching at the design stage results in analyzing more matched samples. The microbial diversity analyses lead to the same conclusion for both experiments despite different design stages (see Fig Z and Tables E-F in S1 Text). Overall, we also observe small approximate Fisherian p-values after performing the propensity score matching, in the same way we observe small approximate Fisherian p-values with our pair-matching strategy. The test statistics have the same direction and magnitude. For the air pollution reduction experiment, the adjusted p-values are higher when performing propensity score matching when checking for differential abundances, i.e., we cannot reject the sharp null hypothesis of no differential abundance for the Marvinbryantia genus. For the smoking prevention experiment, we can reject the sharp null of no differential abundance for the same taxa and additional ones when performing propensity score matching compared to pair-matching (see Table C and Table G in S1 Text).

4 Discussion

We first discuss the results presented above, then elaborate on the statistical framework we used for our analyses, and suggest statistical and epidemiological extensions of our work.

In the air pollution (PM_2.5) reduction hypothetical experiment, we reject the sharp null hypotheses of no richness, no α-diversity, no β-diversity, and no high-dimensional mean differences. We also reject the no differential abundance hypothesis for the Marvinbryantia genus, and the no differential association hypothesis between: the Succinivibrio and Slackia genera, as well as the Ruminiclostridium and Cloacibacillus genera. Experiments exposing mice to PM_2.5 resulted in mixed findings concerning difference in microbial richness and diversity. This might be due to regional differences in the chemical composition of PM_2.5 as well as differences in the duration of exposure [29]. Thus far, only one human study estimated associations between PM_2.5 exposure and the gut microbiome, and investigated the pathway of diabetes induction associated with PM exposure [28]. One of their key findings was that PM_2.5 exposure reduced α-diversity (measured by Chao1 and Shannon indices), which is consistent with our observations.

In the smoking prevention hypothetical experiment, we rejected the sharp null hypotheses of no richness, no α-diversity, no β-diversity, and no high-dimensional mean differences. We also rejected the no differential abundance hypothesis for eleven genera (five of the Ruminococcaceae family, three of the Lachnospiraceae family, one of the Christensenellaceae family, and two of the Mollicutes class), and the no differential association hypothesis between the Ruminiclostridium-6 and Ruminococcaceae-UCG-010 genera, and between the Ruminiclostridium-6 and Christensenellaceae R-7-group genera. Interestingly, the associations of Ruminococcaceae-UCG-010 and Christensenellaceae R-7-group with Ruminiclostridium-6 were also found to be worth further scrutiny. Their positive associations in the genus-genus network of smokers was absent in the genus-genus network of the never-smokers. The one study comparing the gut microbiome of smokers (n = 203) and never-smokers (n = 288) with similar sample size has a men-only study population [44]. They did not find any differences in α-diversity (measured with the Shannon index), whereas we conclude that α-diversity analyses are worth further scrutiny. Lee et al.’s PERMANOVA analyses for β-diversity differences, measured with Jaccard and weighted UniFrac distances, suggested differences. We reject the sharp null hypothesis at the between-subject differences analysis level. In their analysis of bacterial taxa on the phylum level, smokers had an increased proportion of Bacteroidetes with decreased Firmicutes and Proteobacteria compared with never-smokers. When we compare these phyla, we do not observe the same differences (see Fig S in S1 Text). Also, our compositional difference analyses do not result in the same set of differentially abundant genera that were reported by Lee et al. [44]. These conflicting findings could be due to the fact that their study was done on Korean men only. Nonetheless, it shows that there is a lack of knowledge on the effects of smoking on the human gut microbiome and that additional scientific investigations are necessary to make causal conclusions.

Throughout the extensive statistical analyses presented in this paper, we have tested sharp null hypotheses of no effect of an intervention on a wide range of gut microbiome outcomes, ranging from high-level microbial diversity estimates to differential genus-genus associations. To do so, we have performed randomization-based inference based on 10,000 permutations. This mode of inference has been motivated by two reasons: (i) it is difficult to postulate a joint model for the potential outcomes, and thereby provide an estimate of (and uncertainty around) a causal estimand, and (ii) it has been shown that using the actual randomization procedure that led to the observed data helps to report valid Fisher-exact p-values as opposed to p-values relying on approximating null randomization distributions [46]. As an example, in our mean difference analyses, we found some differences between the null randomization distribution of the test statistic when approximated by permuting the intervention assignment vector and when drawn from the approximating asymptotic distribution (see Figs J-K in S1 Text). A natural extension of this study would be to use a Neymanian or Bayesian mode of inference to tackle the same research questions. There, simulations should support evidence whether the approach can indeed recover the then estimated causal effects. Simulating microbiome data requires effort so that the common properties, such as compositionality and zero-inflation, can be preserved, but re-sampling approaches [98] and generative models [99] have been developed to achieve this end.

An important component of our randomization-based procedure is that the permutations of the intervention assignment vector conserves the matched-pair design of the hypothetical randomized experiment. This strategy has been advocated by Rubin [100] in the context of randomized trials, and more recently by Bind and Rubin [46] in the context of hypothetical randomized experiments, because assumptions on the underlying distribution of the data are not required. Only few R packages were built to perform randomization-based inference while conserving the design of the intervention assignment. Therefore, for every analysis in our study, we imported a matrix of 10,000 unique randomized intervention assignments to calculate our p-values (see https://github.com/AliceSommer/Causal_Microbiome_Tutorial for a reproducible example on the American Gut Data [16, 101]). Nonetheless, the DACOMP and NetCoMi R packages provide flexible functions enabling the calculation of randomization-based p-values for our study design to test sharp null hypotheses of no difference in taxa abundance and associations, respectively. We advocate for more development of such user-friendly software functions permitting flexibility and accountability of the design stage of observational studies. P-value adjustments for multiple comparison also follow a fully randomization-based procedure, while preserving the design of the experiment. The method has proven to be more powerful while maintaining the family-wise error rate [91].

Notice that when presenting our results, we never accepted alternative hypotheses but only rejected sharp nulls when unadjusted and adjusted p-values were small, i.e., indicating the hypotheses warrants further scrutiny [82]. In the field of microbiome data analysis, the terms differential abundance and associations are frequently used. Researchers report “differentially abundant” and “differentially associated” sets of taxa after testing sharp null hypotheses of no effect of an intervention. This terminology implicitly implies acceptance of the alternative hypotheses. However, when testing sharp null hypotheses we assess the amount of evidence against them in the observed data, which does not prove the alternative hypothesis to be true.

During the design stage, the outcome variable was ignored and only pre-exposure covariates were considered. The chosen balanced data is a sub-sample of units that can be used to estimate the effects of an intervention. Omitting the outcome data until the analysis avoids “model cherry-picking”, because the effect of the intervention is estimated once, after a successful design stage. Nonetheless, at the design stage, we can only consider the observed pre-exposure variables but the assignment mechanism could depend on unobserved pre-exposure variables. In gut microbiome studies, diet is often an unobserved confounder. For example, in this study, dietary intake data was collected for only 1,469/2,033 (i.e., 72%) participants. We verified balance in dietary intake for our balanced data subset (see Figs H-I in S1 Text). Even though we made sure that the observed potential confounding covariates are fairly balanced, there could still be imbalances in other unobserved background covariates, which could have an effect on our results. In such cases, Rosenbaum [102] has recommended to consider sensitivity analyses of how the Fisher-exact p-value would change, had the intervention assignment been plausibly different, see also Bind and Rubin [46]. Subject-matter knowledge on the probability of the binary exposure (i.e., smoking or air pollution) given the observed and unobserved background covariates should guide the plausible range of “sensitivity” p-values and the reason why they could deviate from the p-value calculated based on the assumed hypothetical intervention assignment. This idea provides material for an extension of the framework presented in this study.

The framework suggested in this paper facilitates a more transparent interpretation of results than standard approaches directly modeling the observed outcome. First, interpretation is only valid within the range of the background covariates of the study population in the respective hypothetical experiment (see their detailed characteristics in Table 4 and Figs B-I in S1 Text). The data do not provide direct information for the “unmatched” units. In addition to our pair-matching strategy, we conducted a sensitivity analysis using a propensity score matching algorithm at the design stage, which led to more matched pairs, and thus a broader range of background covariates values (see Table D in S1 Text). Both matching algorithms do not lead to conflicting results in the smoking prevention experiments. In the air pollution reduction experiment, only the differential abundance analysis does not lead to the same overall conclusion. At this stage, the researcher can decide between a larger number of units or more similar groups of units to compare. When designing our hypothetical experiment, we chose a pair-matching strategy, because it creates similar pairs of participants based on subject-matter knowledge. For example, the number of females and males in the intervention and control groups is identical after pair-matching, whereas with propensity score matching, these numbers slightly differ (see Table 4 and Table D in S1 Text). Note that the matching algorithm considerations should be a priori specified before any statistical analysis is performed. Ideally, the design stage should be conducted by a statistician who is not involved in the subsequent statistical analysis stage. Second, the assumed assignment mechanism and underlying assumptions have to be clearly stated to obtain meaningful p-values. Standard approaches usually make strong assumptions (e.g., linearity), whose discussions are often neglected. Modeling the observed data and solely adjusting for confounders by including them in a regression, without a design stage, can be unreliable, especially when the pre-exposure covariates distributions of the control and intervention units are not similar. For instance, Cochran and Rubin [47], Heckman et al. [103], and Rubin [104] have shown that regression models can estimate biased treatment effects when the true relationship between the covariates and the outcome is not modeled accurately. Dehejia and Wahba have also shown that standard nonexperimental estimators such as regression are sensitive to the specification used in the regression [105]. This is another reason why we opted for an inference method that does not rely on parametric assumptions.

In contrast to other studies interested in the effect of air pollution exposures on health outcomes, this study does not provide any estimation of an exposure-response curve. Instead, we examine the effect of interventions and provide results that can directly contribute to policy recommendations. Until now, relationships between inhaled environmental exposures and the human gut microbiome were not examined with causal inference methods, so a first step to make advances in the field is to test, whether air pollution and smoking have no effect on the units of our study. If so, a potential next step would be to work with a dataset adequate for balancing covariates along different doses of the exposure such as suggested in [106] and estimate a causal dose-response in order to protect populations at risk.

In the smoking prevention experiment, the subset of genera retained at the differential abundance analysis step was linked to the serum markers triglycerides and high-density lipoprotein in previous studies [65, 95, 96]. In our data, we observe correlations between these genera and metabolites in the same direction than previously found by Vojinovic [65] (see Fig 5). Serum triglycerides and high-density lipoprotein play a role in metabolic syndrome, and associations between smoking and metabolic syndrome have also been found previously [107]. Therefore, we suggest further investigation on the pathway of cigarette smoke impacting the gut, which in turn has effects on circulating metabolites (and metabolic syndrome). A logical next step would be to apply our framework to other cohorts with similar amplicon data preprocessing and available pre-exposure covariates such as the Dutch LifeLines-DEEP [13] and Rotterdam Studies [14], and observe whether our results replicate.

Supporting information

S1 Text

Fig A: Gut microbiome data description. Number of observed ASV per sample (top left), sequencing depth per sample (top right), number of sequences per ASV (bottom left), number of zero count per ASV (bottom right). Fig B: Empirical distributions of the matched covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the air pollution reduction hypothetical experiment. Fig C: Empirical distributions of the disease covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the air pollution reduction hypothetical experiment. Fig D: Empirical distributions of the medication covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the air pollution reduction hypothetical experiment. Fig E: Empirical distributions of the matched covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the smoking prevention hypothetical experiment. Fig F: Empirical distributions of the diseases covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the smoking prevention hypothetical experiment. Fig G: Empirical distributions of the medication covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the smoking prevention hypothetical experiment. Fig H: Empirical distributions of the nutrition covariates among the subjects under the intervention vs. not in the balanced data for the air pollution reduction hypothetical experiment. Fig I: Empirical distributions of the nutrition covariates among the subjects under the intervention vs. not in the balanced data for the smoking prevention hypothetical experiment. Fig J: Permutation-based (grey) and asymptotic (blue) null randomization distributions for the air pollution reduction hypothetical experiment. Fig K: Permutation-based (grey) and asymptotic (blue) null randomization distributions for the smoking prevention hypothetical experiment. Fig L: Reference set selection in the air pollution reduction experiment. A taxa enters the set R = (r₁, …, r_F) if it has low variance (< 2) and high prevalence (> 90%). For the analyses at the ASV level, we chose the variance to be < 3 and the prevalence to be > 40% as thresholds in order the have at least one reference per subject. Fig M: Reference set selection in the smoking prevention experiment. A taxa enters the set R = (r₁, …, r_F) if it has low variance (< 2) and high prevalence (> 90%). For the analyses at the ASV level, we chose the variance to be < 3 and the prevalence to be > 40% as thresholds in order the have at least one reference per subject. Fig N: Distribution of number of ASVs per sample when data is filtered at different ASV prevalence thresholds (0%, 5%, 10%, 15%) in the air pollution reduction experiment. Red value: minimum observed ASVs per sample. Fig O: Distribution of the total ASV counts per sample when data is filtered at different ASV prevalence thresholds (0%, 5%, 10%, 15%) in the air pollution reduction experiment. Red value: minimum ASV counts per sample. Fig P: Distribution of number of ASVs per sample when data is filtered at different ASV prevalence thresholds (0%, 5%, 10%, 15%) in the smoking prevention reduction experiment. Red value: minimum observed ASVs per sample. Fig Q: Distribution of the total ASV counts per sample when data is filtered at different ASV prevalence thresholds (0%, 5%, 10%, 15%) in the smoking prevention experiment. Red value: minimum ASV counts per sample. Fig R: Genus-genus associations for subject under the air pollution reduction experiment vs. not (n = 99, p = 149). (A) Visualization of the between genera partial correlations estimated with the SPIEC-EASI method. Edges thickness is proportional to partial correlation, and color to direction: red: negative partial correlation, green: positive partial correlation. Node size is proportional to the centered log ratio of the genus abundances, and color is according to phyla. Triangle shaped nodes are differentially abundant (see Fig 3). (B) Zoom in largest connected component and differential associations (bold genera). Fig S: Phyla comparison. Fig T: Sensitivity analysis—Empirical distributions of the matched covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the air pollution reduction hypothetical experiment. Fig U: Sensitivity analysis—Empirical distributions of the diseases covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the air pollution reduction hypothetical experiment. Fig V: Sensitivity analysis—Empirical distributions of the medication covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the air pollution reduction hypothetical experiment. Fig W: Sensitivity analysis—Empirical distributions of the matched covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the smoking prevention hypothetical experiment. Fig X: Sensitivity analysis—Empirical distributions of the diseases covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the smoking prevention hypothetical experiment. Fig Y: Sensitivity analysis—Empirical distributions of the medication covariates among the subjects under the intervention vs. not in the original (left panel) and the balanced (right panel) data for the smoking prevention hypothetical experiment. Fig Z: Sensitivity analysis—Richness and α-diversity. Boxplots (with median), values of the test-statistics from the betta regression, and one-sided randomization-based p-values for 10,000 permutations of the intervention assignment following a matched-pair design. Table A: Gut microbiome data description. Number of observed ASV per sample, sequencing depth per sample, number of sequences per ASV, number of zero count per ASV. Table B: Air pollutiion reduction experiment results. Differentially abundant taxa and adjusted Fisher p-values for 10,000 iterations at 5% prevalence filtering. Selected adjusted p-values ≤ 0.2 (sign of abundance difference: y(1)—y(0)). Table C: Smoking prevention experiment results. Differentially abundant taxa and adjusted Fisher p-values for 10,000 iterations at 5% prevalence filtering. Selected adjusted p-values ≤ 0.2 (sign of abundance difference: y(1)—y(0)). Table D: Sensitivity analysis—Baseline characteristics of the study population in the air pollution reduction (left table) and smoking prevention experiments (right table). Continuous variables: mean and standard deviation (St. d.). Categorical variables: number of samples per category (N) and proportion of category (%). Table E: Sensitivity analysis—β-diversity. Microbiome Regression-based Kernel Association Test (MiRKAT), unadjusted and adjusted one-sided randomization-based p-values for 10,000 permutations of the intervention assignment following a matched-pair design. Table F: Sensitivity analysis—Compositional equivalence test. Test statistic for high-dimensional data and one-sided randomization-based p-values for 10,000 permutations of the intervention assignment following a matched-pair design. Table G: Sensitivity analysis—Smoking prevention experiment results. Differentially abundant taxa and adjusted Fisher p-values for 10,000 iterations at 5% prevalence filtering. Selected adjusted p-values ≤ 0.2 (sign of abundance difference: y(1)—y(0)).

(PDF)

Click here for additional data file.^{(17.1MB, pdf)}

Acknowledgments

We thank all KORA participants and technical assistants without whose contributions this study could not have been realized. We also thank Stefanie Peschel and Viet Tran for testing the code for the tutorial with the American Gut Data as well as Barak Brill for his support in the DACOMP implementation. The computations in this paper were run on the FASRC Odyssey cluster supported by the FAS Division of Science Research Computing Group at Harvard University.

Data Availability

The KORA cohort data discussed in the paper is available upon request via the kora.passt portal: https://helmholtz-muenchen.managed-otrs.com. The code for analysis and visualization of the data are accessible on the following GitHub public repository: https://github.com/AliceSommer/Pipeline_Microbiome. A tutorial to get acquainted with the framework and open source data is accessible on the following GitHub public repository: https://github.com/AliceSommer/Causal_Microbiome_Tutorial.

Funding Statement

Research reported in this publication was supported by the Office of the Director, National Institutes of Health under Award Number DP5OD021412 and the John Harvard Distinguished Science Fellows Program within the FAS Division of Science of Harvard University (MACB). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The KORA study was initiated and financed by the Helmholtz Zentrum München—German Research Center for Environmental Health, which is funded by the German Federal Ministry of Education and Research (BMBF) and by the State of Bavaria (AP). Furthermore, KORA research was supported within the Munich Center of Health Sciences (MC-Health), Ludwig-Maximilians-Universität, as part of LMUinnovativ (AP). Microbiota profiling of KORA samples was supported by enable Kompetenzcluster der Ernährungsforschung (No. 01EA1409A) and the European Union Joint Programming Initiative DINAMIC (No. 2815ERA04E, 2815ERA11E) (DH). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1. Wikoff WR, Anfora AT, Liu J, Schultz PG, Lesley SA, Peters EC, et al. Metabolomics analysis reveals large effects of gut microflora on mammalian blood metabolites. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(10):3698–3703. doi: 10.1073/pnas.0812874106 [DOI] [PMC free article] [PubMed] [Google Scholar]
2. Visconti A, Le Roy CI, Rosa F, Rossi N, Martin TC, Mohney RP, et al. Interplay between the human gut microbiome and host metabolism. Nature Communications. 2019;10(1). doi: 10.1038/s41467-019-12476-z [DOI] [PMC free article] [PubMed] [Google Scholar]
3. Belkaid Y, Hand T. Role of the Microbiota in Immunity and inflammation Yasmine. Cell. 2015;157(1):121–141. doi: 10.1016/j.cell.2014.03.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
4. David LA, Maurice CF, Carmody RN, Gootenberg DB, Button JE, Wolfe BE, et al. Diet rapidly and reproducibly alters the human gut microbiome. Nature. 2014;505(7484):559–563. doi: 10.1038/nature12820 [DOI] [PMC free article] [PubMed] [Google Scholar]
5. David La, Materna AC, Friedman J, Campos-Baptista MI, Blackburn MC, Perrotta A, et al. Host lifestyle affects human microbiota on daily timescales. Genome Biology. 2014;15(7):R89. doi: 10.1186/gb-2014-15-7-r89 [DOI] [PMC free article] [PubMed] [Google Scholar]
6. Langdon A, Crook N, Dantas G. The effects of antibiotics on the microbiome throughout development and alternative approaches for therapeutic modulation. Genome Medicine. 2016;8(1). doi: 10.1186/s13073-016-0294-z [DOI] [PMC free article] [PubMed] [Google Scholar]
7. Thursby E, Juge N. Introduction to the human gut microbiota. Biochemical Journal. 2017;474(11):1823–1836. doi: 10.1042/BCJ20160510 [DOI] [PMC free article] [PubMed] [Google Scholar]
8. Marchesi JR, Adams DH, Fava F, Hermes GDA, Hirschfield GM, Hold G, et al. The gut microbiota and host health: a new clinical frontier. Gut. 2016;65(2):330–339. doi: 10.1136/gutjnl-2015-309990 [DOI] [PMC free article] [PubMed] [Google Scholar]
9. Young VB. The role of the microbiome in human health and disease: an introduction for clinicians. BMJ. 2017;356. [DOI] [PubMed] [Google Scholar]
10. Pace NR, Stahl DA, Lane DJ, Olsen GJ. The Analysis of Natural Microbial Populations by Ribosomal RNA Sequences. In: C MK, editor. Advances in Microbial Ecology. vol. 9. Boston, MA: Springer; 1986. p. 1–55. [Google Scholar]
11. Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R, Gordon JI. The Human Microbiome Project. Nature. 2007;449(7164):804–810. doi: 10.1038/nature06244 [DOI] [PMC free article] [PubMed] [Google Scholar]
12. Goodrich JK, Waters JL, Poole AC, Sutter JL, Koren O, Blekhman R, et al. Human genetics shape the gut microbiome. Cell. 2014;159(4):789–799. doi: 10.1016/j.cell.2014.09.053 [DOI] [PMC free article] [PubMed] [Google Scholar]
13. Scholtens S, Smidt N, Swertz MA, Bakker SJ, Dotinga A, Vonk JM, et al. Cohort Profile: LifeLines, a three-generation cohort study and biobank. International Journal of Epidemiology. 2015;44(4):1172–1180. doi: 10.1093/ije/dyu229 [DOI] [PubMed] [Google Scholar]
14. Ikram MA, Brusselle GGO, Murad SD, van Duijn CM, Franco OH, Goedegebure A, et al. The Rotterdam Study: 2018 update on objectives, design and main results. Eur J Epidemiol. 2017;32(9):807–850. doi: 10.1007/s10654-017-0321-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
15. He Y, Wu W, Zheng HM, Li P, McDonald D, Sheng HF, et al. Regional variation limits applications of healthy gut microbiome reference ranges and disease models. Nature Medicine. 2018;24(10):1532–1535. doi: 10.1038/s41591-018-0164-x [DOI] [PubMed] [Google Scholar]
16. McDonald D, Hyde E, Debelius JW, Morton JT, Gonzalez A, Ackermann G, et al. American Gut: an Open Platform for Citizen Science Microbiome Research. mSystems. 2018;3(3). doi: 10.1128/mSystems.00031-18 [DOI] [PMC free article] [PubMed] [Google Scholar]
17. Holle R, Happich M, Löwel H, Wichmann H; MONICA/KORA Study Group. KORA—A Research Platform for Population Based Health Research. Gesundheitswesen (Bundesverband der Ärzte des Öffentlichen Gesundheitsdienstes (Germany)). 2005;67(S 01):19–25. doi: 10.1055/s-2005-858235 [DOI] [PubMed] [Google Scholar]
18. Shreiner AB, Kao JY, Young VB. The gut microbiome in health and in disease. Curr Opin Gastroenterol. 2015;31(1):69–75. doi: 10.1097/MOG.0000000000000139 [DOI] [PMC free article] [PubMed] [Google Scholar]
19. Rückerl R, Schneider A, Breitner S, Cyrys J, Peters A. Health effects of particulate air pollution: A review of epidemiological evidence. Inhalation Toxicology. 2011;23(10):555–592. doi: 10.3109/08958378.2011.593587 [DOI] [PubMed] [Google Scholar]
20. Huang C, Shi G. Smoking and microbiome in oral, airway, gut and some systemic diseases. Journal of translational medicine. 2019;17(1):225–225. doi: 10.1186/s12967-019-1971-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
21. Bind MC, Rubin DB. Bridging observational studies and randomized experiments by embedding the former in the latter. Statistical Methods in Medical Research. 2019;28(7):1958–1978. doi: 10.1177/0962280217740609 [DOI] [PMC free article] [PubMed] [Google Scholar]
22. Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. The ISME Journal. 2017;11(12). doi: 10.1038/ismej.2017.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
23. Kaplan GG, Dixon E, Panaccione R, Fong A, Chen L, Szyszkowicz M, et al. Effect of ambient air pollution on the incidence of appendicitis. Canadian Medical Association Journal. 2009;181(9):591–597. doi: 10.1503/cmaj.082068 [DOI] [PMC free article] [PubMed] [Google Scholar]
24. Ananthakrishnan AN, McGinley EL, Binion DG, Saeian K. Ambient air pollution correlates with hospitalizations for inflammatory bowel disease: an ecologic analysis. Inflamm Bowel Dis. 2011;17(5):1138–45. doi: 10.1002/ibd.21455 [DOI] [PubMed] [Google Scholar]
25. Kaplan GG, Szyszkowicz M, Fichna J, Rowe BH, Porada E, Vincent R, et al. Non-specific abdominal pain and air pollution: a novel association. PLoS One. 2012;7(10):1–8. doi: 10.1371/journal.pone.0047669 [DOI] [PMC free article] [PubMed] [Google Scholar]
26. Peters A. Epidemiology: Air pollution and mortality from diabetes mellitus. Nature Reviews Endocrinology. 2012;8(12):706. doi: 10.1038/nrendo.2012.204 [DOI] [PubMed] [Google Scholar]
27. Alderete TL, Jones RB, Chen Z, Kim JS, Habre R, Lurmann F, et al. Exposure to traffic-related air pollution and the composition of the gut microbiota in overweight and obese adolescents. Environmental Research. 2018;161:472–478. doi: 10.1016/j.envres.2017.11.046 [DOI] [PMC free article] [PubMed] [Google Scholar]
28. Liu T, Chen X, Xu Y, Wu W, Tang W, Chen Z, et al. Gut microbiota partially mediates the effects of fine particulate matter on type 2 diabetes: Evidence from a population-based epidemiological study. Environment International. 2019;130. doi: 10.1016/j.envint.2019.05.076 [DOI] [PubMed] [Google Scholar]
29. Bailey MJ, Naik NN, Wild LE, Patterson WB, Alderete TL. Exposure to air pollutants and the gut microbiota: a potential link between exposure, obesity, and type 2 diabetes. Gut Microbes. 2020;11(5):1188–1202. doi: 10.1080/19490976.2020.1749754 [DOI] [PMC free article] [PubMed] [Google Scholar]
30. Fouladi F, Bailey MJ, Patterson WB, Sioda M, Blakley IC, Fodor AA, et al. Air pollution exposure is associated with the gut microbiome as revealed by shotgun metagenomic sequencing. Environment International. 2020;138:105604. doi: 10.1016/j.envint.2020.105604 [DOI] [PMC free article] [PubMed] [Google Scholar]
31. Möller W, Häußinger K, Winkler-Heil R, Stahlhofen W, Meyer T, Hofmann W, et al. Mucociliary and long-term particle clearance in the airways of healthy nonsmoker subjects. Journal of Applied Physiology. 2004;97(6):2200–2206. doi: 10.1152/japplphysiol.00970.2003 [DOI] [PubMed] [Google Scholar]
32. Beamish LA, Osornio-Vargas AR, Wine E. Air pollution: An environmental factor contributing to intestinal disease. Journal of Crohn’s and Colitis. 2011;5(4):279–286. doi: 10.1016/j.crohns.2011.02.017 [DOI] [PubMed] [Google Scholar]
33. Mutlu EA, Engen PA, Soberanes S, Urich D, Forsyth CB, Nigdelioglu R, et al. Particulate matter air pollution causes oxidant-mediated increase in gut permeability in mice. Particle and Fibre Technology. 2011;8:19. doi: 10.1186/1743-8977-8-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
34. Kish L, Hotte N, Kaplan GG, Vincent R, Tso R, Gänzle M, et al. Environmental particulate matter induces murine intestinal inflammatory responses and alters the gut microbiome. PLoS One. 2013;8(4):1–15. doi: 10.1371/journal.pone.0062220 [DOI] [PMC free article] [PubMed] [Google Scholar]
35. Li R, Navab K, Hough G, Daher N, Zhang M, Mittelstein D, et al. Effect of exposure to atmospheric ultrafine particles on production of free fatty acids and lipid metabolites in the mouse small intestine. Environ Health Perspectives. 2015;123(1):34–41. doi: 10.1289/ehp.1307036 [DOI] [PMC free article] [PubMed] [Google Scholar]
36. Mutlu EA, Comba IY, Cho T, Engen PA, Yazıcı C, Soberanes S, et al. Inhalational exposure to particulate matter air pollution alters the composition of the gut microbiome. Environmental Pollution. 2018;240:817–830. doi: 10.1016/j.envpol.2018.04.130 [DOI] [PMC free article] [PubMed] [Google Scholar]
37. Wang W, Zhou J, Chen M, Huang X, Xie X, Li W, et al. Exposure to concentrated ambient PM2.5 alters the composition of gut microbiota in a murine model. Particle and Fibre Toxicology. 2018;15(1):1–13. doi: 10.1186/s12989-018-0252-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
38. Salim SY, Kaplan GG, Madsen KL. Air pollution effects on the gut microbiota. Gut Microbes. 2014;5(2):215–219. doi: 10.4161/gmic.27251 [DOI] [PMC free article] [PubMed] [Google Scholar]
39. Gui X, Yang Z, Li MD. Effect of Cigarette Smoke on Gut Microbiota: State of Knowledge. Frontiers in Physiology. 2021;12. doi: 10.3389/fphys.2021.673341 [DOI] [PMC free article] [PubMed] [Google Scholar]
40. Calkins BM. A meta-analysis of the role of smoking in inflammatory bowel disease. Digestive Diseases and Sciences. 1989;34(12):1841–1854. doi: 10.1007/BF01536701 [DOI] [PubMed] [Google Scholar]
41. Cosnes J, Beaugerie L, Carbonnel F, Gendre J. Smoking cessation and the course of Crohn’s disease: An intervention study. Gastroenterology. 2001;120(5):1093–1099. doi: 10.1053/gast.2001.23231 [DOI] [PubMed] [Google Scholar]
42. Benjamin JL, Hedin CRH, Koutsoumpas A, Ng SC, McCarthy NE, Prescott NJ, et al. Smokers with Active Crohn’s Disease Have a Clinically Relevant Dysbiosis of the Gastrointestinal Microbiota. Inflammatory Bowel Diseases. 2011;18(6):1092–1100. doi: 10.1002/ibd.21864 [DOI] [PubMed] [Google Scholar]
43. Biedermann L, Zeitz J, Mwinyi J, Sutter-Minder E, Rehman A, Ott SJ, et al. Smoking cessation induces profound changes in the composition of the intestinal microbiota in humans. PloS one. 2013;8(3):e59260–e59260. doi: 10.1371/journal.pone.0059260 [DOI] [PMC free article] [PubMed] [Google Scholar]
44. Lee SH, Yun Y, Kim SJ, Lee EJ, Chang Y, Ryu S, et al. Association between Cigarette Smoking Status and Composition of Gut Microbiota: Population-Based Cross-Sectional Study. Journal of clinical medicine. 2018;7(9):282. doi: 10.3390/jcm7090282 [DOI] [PMC free article] [PubMed] [Google Scholar]
45. Fisher RA. The Design of Experiments. Edinburgh: Oliver and Boyd; 1935. [Google Scholar]
46. Bind MAC, Rubin DB. When possible, report a Fisher-exact P value and display its underlying null randomization distribution. Proceedings of the National Academy of Sciences. 2020;117(32):19151–19158. doi: 10.1073/pnas.1915454117 [DOI] [PMC free article] [PubMed] [Google Scholar]
47. Cochran WG, Rubin DB. Controlling Bias in Observational Studies: A Review. Sankhyā: The Indian Journal of Statistics, Series A (1961-2002). 1973;35(4):417–446. [Google Scholar]
48. Rubin DB. The Use of Matched Sampling and Regression Adjustment to Remove Bias in Observational Studies. Biometrics. 1973;29(1):185–203. [Google Scholar]
49. Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66(5):688–701. doi: 10.1037/h0037350 [DOI] [Google Scholar]
50. Rubin DB. Inference and Missing Data. Biometrika. 1976;63(3):581–592. doi: 10.1093/biomet/63.3.581 [DOI] [Google Scholar]
51. Holland PW. Statistics and Causal Inference. Journal of the American Statistical Association. 1986;81(396):945–960. doi: 10.2307/2289069 [DOI] [PubMed] [Google Scholar]
52. Imbens GW, Rubin DB. Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. New York, NY, USA: Cambridge University Press; 2015. [Google Scholar]
53. Rubin DB. For Objective Causal Inference, Design Trumps Analysis. The Annals of Applied Statistics. 2008;2(3):808–840. doi: 10.1214/08-AOAS187 [DOI] [Google Scholar]
54. Willis A, Bunge J, Whitman T. Improved detection of changes in species richness in high diversity microbial communities. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2017;66(5):963–977. [Google Scholar]
55. Willis AD, Martin BD. Estimating diversity in networked ecological communities. Biostatistics. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
56. Cao Y, Lin W, Li H. Two-sample tests of high-dimensional means for compositional data. Biometrika. 2018;105(1):115–132. doi: 10.1093/biomet/asx060 [DOI] [Google Scholar]
57. Brill B, Amir A, Heller R. Testing for differential abundance in compositional counts data, with application to microbiome studies. The Annals of Applied Statistics. 2022. [Google Scholar]
58. Kurtz ZD, Müller CL, Miraldi ER, Littman DR, Blaser MJ, Bonneau RA. Sparse and Compositionally Robust Inference of Microbial Ecological Networks. PLOS Computational Biology. 2015;11(5):e1004226. doi: 10.1371/journal.pcbi.1004226 [DOI] [PMC free article] [PubMed] [Google Scholar]
59. Peschel S, Müller CL, von Mutius E, Boulesteix AL, Depner M. NetCoMi: network construction and comparison for microbiome data in R. Briefings in Bioinformatics. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
60. Sohn MB, Li H. Compositional mediation analysis for microbiome studies. The Annals of Applied Statistics. 2019;13(1):661–681. doi: 10.1214/18-AOAS1210 [DOI] [Google Scholar]
61. Wang C, Hu J, Blaser MJ, Li H. Estimating and testing the microbial causal mediation effect with high-dimensional and compositional microbiome data. Bioinformatics. 2019;36(2):347–355. doi: 10.1093/bioinformatics/btz565 [DOI] [PMC free article] [PubMed] [Google Scholar]
62. Sazal MR, Stebliankin V, Mathee K, Narasimhan G. Causal Inference in Microbiomes Using Intervention Calculus. bioRxiv. 2020. [Google Scholar]
63. Wade KH, Hall LJ. Improving causality in microbiome research: can human genetic epidemiology help? Wellcome open research. 2020;4:199–199. doi: 10.12688/wellcomeopenres.15628.3 [DOI] [PMC free article] [PubMed] [Google Scholar]
64. Hughes D, Bacigalupe R, Wang J, Rühlemann M, Falony G, Joossens M, et al. Genome-wide associations of human gut microbiome variation and implications for causal inference analyses. Nature Microbiology. 2020;5. doi: 10.1038/s41564-020-0743-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
65. Vojinovic D, Radjabzadeh D, Kurilshikov A, Amin N, Wijmenga C, Franke L, et al. Relationship between gut microbiota and circulating metabolites in population-based cohorts. Nature Communications. 2019;10:Article: 5813. doi: 10.1038/s41467-019-13721-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
66. Breuninger TA, Riedl A, Wawro N, Rathmann W, Strauch K, Quante A, et al. Differential associations between diet and prediabetes or diabetes in the KORA FF4 study. Journal of Nutritional Science. 2018;7:e34. doi: 10.1017/jns.2018.25 [DOI] [PMC free article] [PubMed] [Google Scholar]
67. Godon JJ, Zumstein E, Dabert P, Habouzit F, Moletta R. Molecular microbial diversity of an anaerobic digestor as determined by small-subunit rDNA sequence analysis. Applied and environmental microbiology. 1997;63(7). doi: 10.1128/aem.63.7.2802-2813.1997 [DOI] [PMC free article] [PubMed] [Google Scholar]
68. Klindworth A, Pruesse E, Schweer T, Peplies J, Quast C, Horn M, et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic acids research. 2013;41(1). doi: 10.1093/nar/gks808 [DOI] [PMC free article] [PubMed] [Google Scholar]
69. Callahan BJ, Sankaran K, Fukuyama JA, Mcmurdie PJ, Holmes SP. Bioconductor workflow for microbiome data analysis: from raw reads to community analyses [version 1; referees: 2 approved]. F1000Research. 2016;5. doi: 10.12688/f1000research.8986.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
70. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic acids research. 2013;41(Database issue):D590. doi: 10.1093/nar/gks1219 [DOI] [PMC free article] [PubMed] [Google Scholar]
71. Wang Q, Garrity GM, Tiedje JM, Cole JR. Naive Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy. Applied and Environmental Microbiology. 2007;73(16):5261. doi: 10.1128/AEM.00062-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
72. Edgar RC, Valencia A. Updating the 97% identity threshold for 16S ribosomal RNA OTUs. Bioinformatics. 2018;34(14):2371–2375. doi: 10.1093/bioinformatics/bty113 [DOI] [PubMed] [Google Scholar]
73. Lozupone C, Knight R. UniFrac: a new phylogenetic method for comparing microbial communities. Applied and environmental microbiology. 2005;71(12):8228–8235. doi: 10.1128/AEM.71.12.8228-8235.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
74. Wright ES. Using DECIPHER v2.0 to Analyze Big Biological Sequence Data in R. The R Journal. 2016;8(1):352–359. doi: 10.32614/RJ-2016-025 [DOI] [Google Scholar]
75. Studier JA, Keppler KJ. A note on the neighbor-joining algorithm of Saitou and Nei. Molecular biology and evolution. 1988;5(6):729. [DOI] [PubMed] [Google Scholar]
76. Rubin DB. The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials. Statistics in Medicine. 2007;26(1):20–36. doi: 10.1002/sim.2739 [DOI] [PubMed] [Google Scholar]
77.Micali S, Vazirani VV. An Algoithm for Finding Maximum Matching in General Graphs. In: Proceedings of the 21st Annual Symposium on Foundations of Computer Science. SFCS’80. Washington, DC, USA: IEEE Computer Society; 1980. p. 17–27.
78. Singh RK, Chang HW, Yan D, Lee KM, Ucmak D, Wong K, et al. Influence of diet on the gut microbiome and implications for human health. Journal of Translational Medicine. 2017;15(1):73. doi: 10.1186/s12967-017-1175-y [DOI] [PMC free article] [PubMed] [Google Scholar]
79. Johnson AJ, Zheng JJ, Kang JW, Saboe A, Knights D, Zivkovic AM. A Guide to Diet-Microbiome Study Design. Frontiers in Nutrition. 2020;7:79. doi: 10.3389/fnut.2020.00079 [DOI] [PMC free article] [PubMed] [Google Scholar]
80. Rubin DB. Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment. Journal of the American Statistical Association. 1980;75(371):591–593. doi: 10.2307/2287653 [DOI] [Google Scholar]
81. Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. Microbiome Datasets Are Compositional: And This Is Not Optional. Frontiers in Microbiology. 2017;8. doi: 10.3389/fmicb.2017.02224 [DOI] [PMC free article] [PubMed] [Google Scholar]
82. Wasserstein RL, Schirm AL, Lazar NA. Moving to a World Beyond “p < 0.05”. The American Statistician. 2019;73(sup1):1–19. doi: 10.1080/00031305.2019.1583913 [DOI] [Google Scholar]
83. Willis A, Bunge J. Estimating diversity via frequency ratios. Biometrics. 2015;71(4):1042–1049. doi: 10.1111/biom.12332 [DOI] [PubMed] [Google Scholar]
84. Zhao N, Chen J, Carroll I, Ringel-Kulka T, Epstein M, Zhou H, et al. Testing in Microbiome-Profiling Studies with MiRKAT, the Microbiome Regression-Based Kernel Association Test. The American Journal of Human Genetics. 2015;96(5):797–807. doi: 10.1016/j.ajhg.2015.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
85. Shannon CE. A Mathematical Theory of Communication. Bell System Technical Journal. 1948;27(3):379–423. doi: 10.1002/j.1538-7305.1948.tb01338.x [DOI] [Google Scholar]
86. Basharin GP. On a Statistical Estimate for the Entropy of a Sequence of Independent Random Variables. Theory of Probability and its Applications. 1959;4(3):333. doi: 10.1137/1104033 [DOI] [Google Scholar]
87. Brillinger DR, Jones LV, Tukey JW. The Role of Statistics in Weather Resources Management. In: The Management of Weather Resources. vol. 2. Washington D.C., USA: U.S. Government Printing Office; 1978. p. 25. [Google Scholar]
88. Aitchison JJ. The statistical analysis of compositional data. Caldwell, N.J.: Blackburn Press; 2003. [Google Scholar]
89. Gower JC. A General Coefficient of Similarity and Some of Its Properties. Biometrics. 1971;27(4):857–871. doi: 10.2307/2528823 [DOI] [Google Scholar]
90. Lin X. Variance Component Testing in Generalised Linear Models with Random Effects. Biometrika. 1997;84(2):309–326. doi: 10.1093/biomet/84.2.309 [DOI] [Google Scholar]
91. Lee JJ, Forastiere L, Miratrix L, Pillai NS. More powerful multiple testing in randomized experiments with non-compliance. Statistica Sinica. 2017;27(3):1319–1345. [Google Scholar]
92. Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics. 2008;9(3):432–441. doi: 10.1093/biostatistics/kxm045 [DOI] [PMC free article] [PubMed] [Google Scholar]
93. Liu H, Roeder K, Wasserman L. Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models. Adv Neural Inf Process Syst. 2010;24(2):1432–1440. [PMC free article] [PubMed] [Google Scholar]
94. Wasserstein R, Lazar N. The ASA’s Statement on p-Values: Context, Process, and Purpose. American Statistician. 2016;70(2):129–131. doi: 10.1080/00031305.2016.1154108 [DOI] [Google Scholar]
95. Fu J, Bonder MJ, Cenit MC, Tigchelaar EF, Maatman A, Dekens JAM, et al. The Gut Microbiome Contributes to a Substantial Proportion of the Variation in Blood Lipids. Circulation research. 2015;117(9):817–824. doi: 10.1161/CIRCRESAHA.115.306807 [DOI] [PMC free article] [PubMed] [Google Scholar]
96. He Y, Wu W, Wu S, Zheng HM, Li P, Sheng HF, et al. Linking gut microbiota, metabolic syndrome and economic status based on a population-level analysis. Microbiome. 2018;6(1):172. doi: 10.1186/s40168-018-0557-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
97. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. doi: 10.1093/biomet/70.1.41 [DOI] [Google Scholar]
98. McMurdie PJ, Holmes S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLOS ONE. 2013;8(4):1–11. doi: 10.1371/journal.pone.0061217 [DOI] [PMC free article] [PubMed] [Google Scholar]
99. Ma S, Ren B, Mallick H, Moon YS, Schwager E, Maharjan S, et al. A statistical model for describing and simulating microbial community profiles. PLOS Computational Biology. 2021;17(9):1–27. doi: 10.1371/journal.pcbi.1008913 [DOI] [PMC free article] [PubMed] [Google Scholar]
100. Rubin DB. More powerful randomization-based p-values in double-blind trials with non-compliance. Statistics in Medicine. 1998;17(3):371–385. doi: [DOI] [PubMed] [Google Scholar]
101. Mishra AK, Müller CL. Negative Binomial factor regression with application to microbiome data analysis. Statistics in Medicine, accepted. 2022;. doi: 10.1002/sim.9384 [DOI] [PMC free article] [PubMed] [Google Scholar]
102. Rosenbaum PR. Design of Observational Studies. Springer, New-York; 2010. [Google Scholar]
103. Heckman JJ, Ichimura H, Todd P. Matching as an econometric evaluation estimator. Review of Economic Studies. 1998;65:261–294. doi: 10.1111/1467-937X.00044 [DOI] [Google Scholar]
104. Rubin DB. Using Propensity Scores to Help Design Observational Studies: Application to the Tobacco Litigation. Health Services and Outcomes Research Methodology. 2001;2(3):169–188. doi: 10.1023/A:1020363010465 [DOI] [Google Scholar]
105. Dehejia RH, Wahba S. Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs. Journal of the American Statistical Association. 1999;94(448):1053–1062. doi: 10.1080/01621459.1999.10473858 [DOI] [Google Scholar]
106. Wu X, Braun D, Schwartz J, Kioumourtzoglou MA, Dominici F. Evaluating the impact of long-term exposure to fine particulate matter on mortality among the elderly. Science advances. 2020;6(29):eaba5692. doi: 10.1126/sciadv.aba5692 [DOI] [PMC free article] [PubMed] [Google Scholar]
107. Sun K, Liu J, Ning G. Active Smoking and Risk of Metabolic Syndrome: A Meta-Analysis of Prospective Studies. PLOS ONE. 2012;7(10):e47791. doi: 10.1371/journal.pone.0047791 [DOI] [PMC free article] [PubMed] [Google Scholar]

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010044.r001

Decision Letter 0

Kiran Raosaheb Patil, Simon Anders

25 Oct 2021

Dear Dr. Mueller,

Thank you very much for submitting your manuscript "A randomization-based causal inference framework for uncovering environmental exposure effects on human gut microbiota" for consideration at PLOS Computational Biology.

As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. In light of the reviews (below this email), we would like to invite the resubmission of a significantly-revised version that takes into account the reviewers' comments.

GUEST EDITOR'S COMMENTS:

Reviewer #2, who is overall very positive, wrote that "the strength of the paper lies in applying the Rubin Causal Model to the data sets to go beyond correlational and associational explorations", and this, I suppose, is a fair summary of your paper's main selling point. Nevertheless, the Reviwer considers the "organization of the paper [as] awkward".

In contrast to Reviewer #1, Reviewer #3 seems to doubt that you are really using a causal frame: "the framework is nothing but just matching". I do not know much about causal inference, myself, but I also had this thought when reading your paper: if normal inference of correlation cannot show direction of causality, how can a matching scheme like your overcome that? I suppose that the answer to this question is the core idea of Rubin's model, and hence, maybe you need add a more thorough introduction to Rubin's methodology, accessible to readers not yet familiar with Rubin's work, and make sure that it is well linked to your actually method and explains how your matching-based scheme allows to actually infer causation rather than only correlation.

Reviewer #1 does not seem to share Reviewer #3's and my doubts about how matching can recover causal relations (and this is why I suppose that my doubts are merely due to my lack of understaning of Rubin's work and how you apply it). Nevertheless, that reviewer has several questions on how you justify your selection of a matching scheme, and how it compares to other state-of-the-art approaches. The reviewer also is unconvinced that your method is actually able to recover causal relationships, and suggests to prove this with simulations.

Therefore, I would ask you to carefully address all the reviewers' concerns and especially improve the explanation on how the method actually achieves causal inference. This may require a reorganizing of the manuscript as suggested by Reveiwer #2.

----

We cannot make any decision about publication until we have seen the revised manuscript and your response to the reviewers' comments. Your revised manuscript is also likely to be sent to reviewers for further evaluation.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to the review comments and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out.

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Please prepare and submit your revised manuscript within 60 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email. Please note that revised manuscripts received after the 60-day due date may require evaluation and peer review similar to newly submitted manuscripts.

Thank you again for your submission. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Simon Anders

Guest Editor

PLOS Computational Biology

Kiran Patil

Deputy Editor

PLOS Computational Biology

***********************

GUEST EDITOR'S COMMENTS:

----

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: See attached.

Reviewer #2: see attached

Reviewer #3: The paper introduces a causal inference framework to investigate the treatment effect of environmental factors on the human microbiome. The core idea is to use matched pairs to balance the pre-exposure characteristics of participants and then to use randomization-based inference. The authors illustrated their framework on the German KORA cohort study, specifically, the effects of air pollution reduction and smoking prevention on the human gut microbiome. The paper is easy to follow and includes various analyses in the proposed framework. However, the framework is nothing but just matching, where sensitivity analysis is an essential part. The authors mentioned sensitivity analysis in the Discussion, but the reviewer doesn't think that is enough. Results for sensitivity analysis should be included, as well-known factors that are associated with the microbiome, such as diet and medication, were not used in matching. It is also not clear how their results differ from the analyses with covariate adjustments, which is commonly performed in microbiome studies. The authors mentioned the unreliability of regression with covariate adjustment in the Discussion, but matching is also not reliable if there are unmeasured covariates that have confounding effects. It would be very instructive if the authors could demonstrate that matching is more reliable than covariate adjustment in the microbiome study.

Minor:

Column titles in Table 1 are switched.

It seems rather subjective in determining thresholds for covariates in matching. How was the threshold for each covariate determined?

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Attachment

Submitted filename: PLOS_CB_report.pdf

Click here for additional data file.^{(120KB, pdf)}

Attachment

Submitted filename: review.pdf

Click here for additional data file.^{(68.5KB, pdf)}

PLoS Comput Biol. 2022 May 9;18(5):e1010044. doi: 10.1371/journal.pcbi.1010044.r002

Author response to Decision Letter 0

20 Jan 2022

Attachment

Submitted filename: Response_reviewer_comments_Sommer.pdf

Click here for additional data file.^{(138.1KB, pdf)}

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010044.r003

Decision Letter 1

Kiran Raosaheb Patil, Simon Anders

24 Feb 2022

Dear Dr. Mueller,

---8<---

Guest Editor's comments:

I would like to thank the authors for their revised submission and the reviewers for their detailled and insightful comments on it. I'm enjoying being guest editor for this paper as it prompted me to read up on Rubin causality, and I have learned a lot from working through paper and comments.

Reviewer #2 had much praise for the improved paper, and I also agree with them that the author's reorganization of the paper has improved readability greatly. Reviewers #1 and #3 still have a few major concerns. Overall, I tend to agree with reviewer #2 that the paper is now in a good shape, pending the authors addressing the numerous minor comments made by all three reviewers. I would ask you to go through them carefully and amend and clarify the text acordingly.

The authors have rightfully pointed out that by applying Rubin's causal model to high-dimensional data and complex hypotheses that intermingle the dimensions (such as diversity measures) they are the first to bridge two subjects so far not connected, and that this paper is therefore a first step and cannot be expected to address every aspect of the topic. In this light, I would say that some of the Reviewers' remaining major concern merit a paper in its own right and would not need to be addressed here, provided the necessity for such further research is mentioned in the discussion.

Specifically:

- Reviewer #1 is certainly right that PSM is the only workable solution if one has many covariates, but the present experiment has only few -- so we can leave this issue to whoever is the first to actually face a situation with many covariates. For the present case of few covariates, the authors consider it obvious that individual matching beats PSM, the Reviewer doubts that. Hence, this statement should be either elaborated or dropped.

- I do not quite agree with Reviewer #1 on the need for simulations to ensure that null distributions are as expected. A simulation cannot help us to assess whether matching properly remedies confounding. However, once we assume there to be no confounding, we are in the same situation as in an actual randomized study. The question whether the tests employed by the authors are appropriate for randomized studies or controlled experiments in metabolomics have been discussed thoroughly in the papers that introduced these tests to the field of metabolomics -- and hopefully checked by simulations there. So, citing existing literature seems sufficient to me here.

- Reviewer #3 critizises the lack of a quantitative analysis of sensitivity to confounding. I imagine that here the bigger issue is not residual confounding to the (very few) known covariates but the influence of unknown/unrecorded covariates. Hence, while ideally, the authors would add a quantitative sensitivity anaysis in the Reviewer's sense, a qualitative discussion of the risk of (a) insufficient compensation for the known confounders and (b) unaccounted confounders should be sufficient, too.

- Regarding Reviewer #2's second comment, on comparing matching on a subset with accounting and using all samples: This is a question of gaining optimal power, not of ensuring correctness of inference. As the present paper does not claim to be the final word on the topic, such ways of optimizing inferential power are arguably beyond its scope, so we can leave this discussion to some future work.

---8<--

When you are ready to resubmit, please upload the following:

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Sincerely,

Simon Anders

Guest Editor

PLOS Computational Biology

Kiran Patil

Deputy Editor

PLOS Computational Biology

***********************

Specifically:

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: The authors have done a good job of responding to the previous comments and suggestions. However, two responses need further clarifications.

1. Comparisons to propensity score matching (PSM). The sensitivity analysis suggests that PSM seems to generate results consistent with those from matching on covariates while tends to get more matching pairs and thus lead to smaller approximate p-values. This comparison indicates PSM might be a better approach for matching. However, in the revised Discussion section, the authors claim that the proposed approach is favored over PSM since "unconfoundedness" should be prioritized. I'm not completely clear why matching on covariates directly would achieve better unconfoundedness compared to PSM, especially considering the fact that a propensity score model can incorporate high dimensional potential confounders which to me appears as a more flexible tool to adjust for confounding. The authors should clarify more on this point.

2. I still believe some form of simulations should be done in order to validate the proposed approach. As the authors pointed out, understanding the causal effects of microbiome is nearly untapped and we cannot easily transfer our knowledge of the statistical properties of matching algorithms in regular univariate outcome scenarios to microbiome data. I understand that the goal of the approach is to provide exploratory analysis and hard thresholding of p-values for decision making is somewhat questionable. But we at least need to know whether the p-values from the approach under the null has expected behaviors (e.g., uniform distribution) and whether the approach after multiple correction has sufficient power to detect causal effects when the data generation process is known (e.g. in a simulation study). Even a simple low-dimensional example with several homogeneous treatment effect would make the paper much stronger.

Reviewer #2: See attached.

Reviewer #3: The authors made a great effort to address reviewers’ comments, but the authors' responses are not quite satisfactory.

1. In causal inference for observational studies, "sensitivity analysis" typically refers to a method that assesses the magnitude of violations from unconfoundedness (See Imbens & Rubin, 2015). The propensity score matching is another way of matching and needs an assessment of potentially unmeasured confounding effects, like every method in causal inference for observational studies. An assessment of unconfoundedness should be included, and it would make this paper more appealing.

2. The authors responded that the covariate adjustments method is unreliable, citing several references. However, the covariate adjustments method is another common approach used in causal inference (Pearl, 2000); the combination of this method with an inverse propensity scores weighting (known as the doubly robust estimator) is one of the most popular methods used in causal inference. Could the authors demonstrate its unreliability empirically? The estimates of the covariate adjustment method can also be given a causal interpretation under the same assumption of no unmeasured confounding effect. So, it is important to demonstrate matching, which discards a large portion of samples, is more reliable than the covariate adjustments method, which uses all samples.

3. Response to the authors' question about column titles of Table 1 (now Table 4):

In the Characteristics of Study Population section, a paragraph says that the number of matched pairs is 99 for the air pollution reduction experiment and 271 for the smoking prevention experiment. However, the sum of F and M is 271 in the left column (titled Air Pollution) and 99 in the right one (titled Smoking). I am not sure if only these numbers were switched or the column names were switched. I assumed the latter.

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

Data Requirements:

Reproducibility:

Attachment

Submitted filename: review2.pdf

Click here for additional data file.^{(109.5KB, pdf)}

PLoS Comput Biol. 2022 May 9;18(5):e1010044. doi: 10.1371/journal.pcbi.1010044.r004

Author response to Decision Letter 1

14 Mar 2022

Attachment

Submitted filename: Reviewer_comments_Sommer_roundTWO.pdf

Click here for additional data file.^{(120.7KB, pdf)}

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010044.r005

Decision Letter 2

Kiran Raosaheb Patil, Simon Anders

21 Mar 2022

Dear Dr. Mueller,

We are pleased to inform you that your manuscript 'A randomization-based causal inference framework for uncovering environmental exposure effects on human gut microbiota' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology.

Best regards,

Simon Anders

Guest Editor

PLOS Computational Biology

Kiran Patil

Deputy Editor

PLOS Computational Biology

***********************************************************

Gues Editor's Comment:

The remaining reviewer concerns in the second review round did not concern technical aspects but rather the reviewers' impression that we authors overstated certain points. The authors have chosen one of several possible approaches in the field of causal inference and demonstrated how it can be used for metagenomic data. They could have chosen another. The reviewers were concerned that the authors wanted to claim that their choice is not just a possible choice but a superior one, but the authors made it clear in their response that it was not their intention to make any statement in this regard. i feel the the text in its current form no longer gives that impression, and the reviewers remaining concerns are addressed. Hence, I do not think that another review round is required and consider the paper as ready for publication. I hope the reviewers agree.

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1010044.r006

Acceptance letter

Kiran Raosaheb Patil, Simon Anders

24 Apr 2022

PCOMPBIOL-D-21-01632R2

A randomization-based causal inference framework for uncovering environmental exposure effects on human gut microbiota

Dear Dr Müller,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Anita Estes

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Text

(PDF)

Click here for additional data file.^{(17.1MB, pdf)}

Attachment

Submitted filename: PLOS_CB_report.pdf

Click here for additional data file.^{(120KB, pdf)}

Attachment

Submitted filename: review.pdf

Click here for additional data file.^{(68.5KB, pdf)}

Attachment

Submitted filename: Response_reviewer_comments_Sommer.pdf

Click here for additional data file.^{(138.1KB, pdf)}

Attachment

Submitted filename: review2.pdf

Click here for additional data file.^{(109.5KB, pdf)}

Attachment

Submitted filename: Reviewer_comments_Sommer_roundTWO.pdf

Click here for additional data file.^{(120.7KB, pdf)}

Data Availability Statement

[pcbi.1010044.ref001] 1. Wikoff WR, Anfora AT, Liu J, Schultz PG, Lesley SA, Peters EC, et al. Metabolomics analysis reveals large effects of gut microflora on mammalian blood metabolites. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(10):3698–3703. doi: 10.1073/pnas.0812874106 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref002] 2. Visconti A, Le Roy CI, Rosa F, Rossi N, Martin TC, Mohney RP, et al. Interplay between the human gut microbiome and host metabolism. Nature Communications. 2019;10(1). doi: 10.1038/s41467-019-12476-z [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref003] 3. Belkaid Y, Hand T. Role of the Microbiota in Immunity and inflammation Yasmine. Cell. 2015;157(1):121–141. doi: 10.1016/j.cell.2014.03.011 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref004] 4. David LA, Maurice CF, Carmody RN, Gootenberg DB, Button JE, Wolfe BE, et al. Diet rapidly and reproducibly alters the human gut microbiome. Nature. 2014;505(7484):559–563. doi: 10.1038/nature12820 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref005] 5. David La, Materna AC, Friedman J, Campos-Baptista MI, Blackburn MC, Perrotta A, et al. Host lifestyle affects human microbiota on daily timescales. Genome Biology. 2014;15(7):R89. doi: 10.1186/gb-2014-15-7-r89 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref006] 6. Langdon A, Crook N, Dantas G. The effects of antibiotics on the microbiome throughout development and alternative approaches for therapeutic modulation. Genome Medicine. 2016;8(1). doi: 10.1186/s13073-016-0294-z [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref007] 7. Thursby E, Juge N. Introduction to the human gut microbiota. Biochemical Journal. 2017;474(11):1823–1836. doi: 10.1042/BCJ20160510 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref008] 8. Marchesi JR, Adams DH, Fava F, Hermes GDA, Hirschfield GM, Hold G, et al. The gut microbiota and host health: a new clinical frontier. Gut. 2016;65(2):330–339. doi: 10.1136/gutjnl-2015-309990 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref009] 9. Young VB. The role of the microbiome in human health and disease: an introduction for clinicians. BMJ. 2017;356. [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref010] 10. Pace NR, Stahl DA, Lane DJ, Olsen GJ. The Analysis of Natural Microbial Populations by Ribosomal RNA Sequences. In: C MK, editor. Advances in Microbial Ecology. vol. 9. Boston, MA: Springer; 1986. p. 1–55. [Google Scholar]

[pcbi.1010044.ref011] 11. Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM, Knight R, Gordon JI. The Human Microbiome Project. Nature. 2007;449(7164):804–810. doi: 10.1038/nature06244 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref012] 12. Goodrich JK, Waters JL, Poole AC, Sutter JL, Koren O, Blekhman R, et al. Human genetics shape the gut microbiome. Cell. 2014;159(4):789–799. doi: 10.1016/j.cell.2014.09.053 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref013] 13. Scholtens S, Smidt N, Swertz MA, Bakker SJ, Dotinga A, Vonk JM, et al. Cohort Profile: LifeLines, a three-generation cohort study and biobank. International Journal of Epidemiology. 2015;44(4):1172–1180. doi: 10.1093/ije/dyu229 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref014] 14. Ikram MA, Brusselle GGO, Murad SD, van Duijn CM, Franco OH, Goedegebure A, et al. The Rotterdam Study: 2018 update on objectives, design and main results. Eur J Epidemiol. 2017;32(9):807–850. doi: 10.1007/s10654-017-0321-4 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref015] 15. He Y, Wu W, Zheng HM, Li P, McDonald D, Sheng HF, et al. Regional variation limits applications of healthy gut microbiome reference ranges and disease models. Nature Medicine. 2018;24(10):1532–1535. doi: 10.1038/s41591-018-0164-x [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref016] 16. McDonald D, Hyde E, Debelius JW, Morton JT, Gonzalez A, Ackermann G, et al. American Gut: an Open Platform for Citizen Science Microbiome Research. mSystems. 2018;3(3). doi: 10.1128/mSystems.00031-18 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref017] 17. Holle R, Happich M, Löwel H, Wichmann H; MONICA/KORA Study Group. KORA—A Research Platform for Population Based Health Research. Gesundheitswesen (Bundesverband der Ärzte des Öffentlichen Gesundheitsdienstes (Germany)). 2005;67(S 01):19–25. doi: 10.1055/s-2005-858235 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref018] 18. Shreiner AB, Kao JY, Young VB. The gut microbiome in health and in disease. Curr Opin Gastroenterol. 2015;31(1):69–75. doi: 10.1097/MOG.0000000000000139 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref019] 19. Rückerl R, Schneider A, Breitner S, Cyrys J, Peters A. Health effects of particulate air pollution: A review of epidemiological evidence. Inhalation Toxicology. 2011;23(10):555–592. doi: 10.3109/08958378.2011.593587 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref020] 20. Huang C, Shi G. Smoking and microbiome in oral, airway, gut and some systemic diseases. Journal of translational medicine. 2019;17(1):225–225. doi: 10.1186/s12967-019-1971-7 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref021] 21. Bind MC, Rubin DB. Bridging observational studies and randomized experiments by embedding the former in the latter. Statistical Methods in Medical Research. 2019;28(7):1958–1978. doi: 10.1177/0962280217740609 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref022] 22. Callahan BJ, McMurdie PJ, Holmes SP. Exact sequence variants should replace operational taxonomic units in marker-gene data analysis. The ISME Journal. 2017;11(12). doi: 10.1038/ismej.2017.119 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref023] 23. Kaplan GG, Dixon E, Panaccione R, Fong A, Chen L, Szyszkowicz M, et al. Effect of ambient air pollution on the incidence of appendicitis. Canadian Medical Association Journal. 2009;181(9):591–597. doi: 10.1503/cmaj.082068 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref024] 24. Ananthakrishnan AN, McGinley EL, Binion DG, Saeian K. Ambient air pollution correlates with hospitalizations for inflammatory bowel disease: an ecologic analysis. Inflamm Bowel Dis. 2011;17(5):1138–45. doi: 10.1002/ibd.21455 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref025] 25. Kaplan GG, Szyszkowicz M, Fichna J, Rowe BH, Porada E, Vincent R, et al. Non-specific abdominal pain and air pollution: a novel association. PLoS One. 2012;7(10):1–8. doi: 10.1371/journal.pone.0047669 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref026] 26. Peters A. Epidemiology: Air pollution and mortality from diabetes mellitus. Nature Reviews Endocrinology. 2012;8(12):706. doi: 10.1038/nrendo.2012.204 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref027] 27. Alderete TL, Jones RB, Chen Z, Kim JS, Habre R, Lurmann F, et al. Exposure to traffic-related air pollution and the composition of the gut microbiota in overweight and obese adolescents. Environmental Research. 2018;161:472–478. doi: 10.1016/j.envres.2017.11.046 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref028] 28. Liu T, Chen X, Xu Y, Wu W, Tang W, Chen Z, et al. Gut microbiota partially mediates the effects of fine particulate matter on type 2 diabetes: Evidence from a population-based epidemiological study. Environment International. 2019;130. doi: 10.1016/j.envint.2019.05.076 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref029] 29. Bailey MJ, Naik NN, Wild LE, Patterson WB, Alderete TL. Exposure to air pollutants and the gut microbiota: a potential link between exposure, obesity, and type 2 diabetes. Gut Microbes. 2020;11(5):1188–1202. doi: 10.1080/19490976.2020.1749754 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref030] 30. Fouladi F, Bailey MJ, Patterson WB, Sioda M, Blakley IC, Fodor AA, et al. Air pollution exposure is associated with the gut microbiome as revealed by shotgun metagenomic sequencing. Environment International. 2020;138:105604. doi: 10.1016/j.envint.2020.105604 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref031] 31. Möller W, Häußinger K, Winkler-Heil R, Stahlhofen W, Meyer T, Hofmann W, et al. Mucociliary and long-term particle clearance in the airways of healthy nonsmoker subjects. Journal of Applied Physiology. 2004;97(6):2200–2206. doi: 10.1152/japplphysiol.00970.2003 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref032] 32. Beamish LA, Osornio-Vargas AR, Wine E. Air pollution: An environmental factor contributing to intestinal disease. Journal of Crohn’s and Colitis. 2011;5(4):279–286. doi: 10.1016/j.crohns.2011.02.017 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref033] 33. Mutlu EA, Engen PA, Soberanes S, Urich D, Forsyth CB, Nigdelioglu R, et al. Particulate matter air pollution causes oxidant-mediated increase in gut permeability in mice. Particle and Fibre Technology. 2011;8:19. doi: 10.1186/1743-8977-8-19 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref034] 34. Kish L, Hotte N, Kaplan GG, Vincent R, Tso R, Gänzle M, et al. Environmental particulate matter induces murine intestinal inflammatory responses and alters the gut microbiome. PLoS One. 2013;8(4):1–15. doi: 10.1371/journal.pone.0062220 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref035] 35. Li R, Navab K, Hough G, Daher N, Zhang M, Mittelstein D, et al. Effect of exposure to atmospheric ultrafine particles on production of free fatty acids and lipid metabolites in the mouse small intestine. Environ Health Perspectives. 2015;123(1):34–41. doi: 10.1289/ehp.1307036 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref036] 36. Mutlu EA, Comba IY, Cho T, Engen PA, Yazıcı C, Soberanes S, et al. Inhalational exposure to particulate matter air pollution alters the composition of the gut microbiome. Environmental Pollution. 2018;240:817–830. doi: 10.1016/j.envpol.2018.04.130 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref037] 37. Wang W, Zhou J, Chen M, Huang X, Xie X, Li W, et al. Exposure to concentrated ambient PM2.5 alters the composition of gut microbiota in a murine model. Particle and Fibre Toxicology. 2018;15(1):1–13. doi: 10.1186/s12989-018-0252-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref038] 38. Salim SY, Kaplan GG, Madsen KL. Air pollution effects on the gut microbiota. Gut Microbes. 2014;5(2):215–219. doi: 10.4161/gmic.27251 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref039] 39. Gui X, Yang Z, Li MD. Effect of Cigarette Smoke on Gut Microbiota: State of Knowledge. Frontiers in Physiology. 2021;12. doi: 10.3389/fphys.2021.673341 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref040] 40. Calkins BM. A meta-analysis of the role of smoking in inflammatory bowel disease. Digestive Diseases and Sciences. 1989;34(12):1841–1854. doi: 10.1007/BF01536701 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref041] 41. Cosnes J, Beaugerie L, Carbonnel F, Gendre J. Smoking cessation and the course of Crohn’s disease: An intervention study. Gastroenterology. 2001;120(5):1093–1099. doi: 10.1053/gast.2001.23231 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref042] 42. Benjamin JL, Hedin CRH, Koutsoumpas A, Ng SC, McCarthy NE, Prescott NJ, et al. Smokers with Active Crohn’s Disease Have a Clinically Relevant Dysbiosis of the Gastrointestinal Microbiota. Inflammatory Bowel Diseases. 2011;18(6):1092–1100. doi: 10.1002/ibd.21864 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref043] 43. Biedermann L, Zeitz J, Mwinyi J, Sutter-Minder E, Rehman A, Ott SJ, et al. Smoking cessation induces profound changes in the composition of the intestinal microbiota in humans. PloS one. 2013;8(3):e59260–e59260. doi: 10.1371/journal.pone.0059260 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref044] 44. Lee SH, Yun Y, Kim SJ, Lee EJ, Chang Y, Ryu S, et al. Association between Cigarette Smoking Status and Composition of Gut Microbiota: Population-Based Cross-Sectional Study. Journal of clinical medicine. 2018;7(9):282. doi: 10.3390/jcm7090282 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref045] 45. Fisher RA. The Design of Experiments. Edinburgh: Oliver and Boyd; 1935. [Google Scholar]

[pcbi.1010044.ref046] 46. Bind MAC, Rubin DB. When possible, report a Fisher-exact P value and display its underlying null randomization distribution. Proceedings of the National Academy of Sciences. 2020;117(32):19151–19158. doi: 10.1073/pnas.1915454117 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref047] 47. Cochran WG, Rubin DB. Controlling Bias in Observational Studies: A Review. Sankhyā: The Indian Journal of Statistics, Series A (1961-2002). 1973;35(4):417–446. [Google Scholar]

[pcbi.1010044.ref048] 48. Rubin DB. The Use of Matched Sampling and Regression Adjustment to Remove Bias in Observational Studies. Biometrics. 1973;29(1):185–203. [Google Scholar]

[pcbi.1010044.ref049] 49. Rubin DB. Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology. 1974;66(5):688–701. doi: 10.1037/h0037350 [DOI] [Google Scholar]

[pcbi.1010044.ref050] 50. Rubin DB. Inference and Missing Data. Biometrika. 1976;63(3):581–592. doi: 10.1093/biomet/63.3.581 [DOI] [Google Scholar]

[pcbi.1010044.ref051] 51. Holland PW. Statistics and Causal Inference. Journal of the American Statistical Association. 1986;81(396):945–960. doi: 10.2307/2289069 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref052] 52. Imbens GW, Rubin DB. Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. New York, NY, USA: Cambridge University Press; 2015. [Google Scholar]

[pcbi.1010044.ref053] 53. Rubin DB. For Objective Causal Inference, Design Trumps Analysis. The Annals of Applied Statistics. 2008;2(3):808–840. doi: 10.1214/08-AOAS187 [DOI] [Google Scholar]

[pcbi.1010044.ref054] 54. Willis A, Bunge J, Whitman T. Improved detection of changes in species richness in high diversity microbial communities. Journal of the Royal Statistical Society: Series C (Applied Statistics). 2017;66(5):963–977. [Google Scholar]

[pcbi.1010044.ref055] 55. Willis AD, Martin BD. Estimating diversity in networked ecological communities. Biostatistics. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref056] 56. Cao Y, Lin W, Li H. Two-sample tests of high-dimensional means for compositional data. Biometrika. 2018;105(1):115–132. doi: 10.1093/biomet/asx060 [DOI] [Google Scholar]

[pcbi.1010044.ref057] 57. Brill B, Amir A, Heller R. Testing for differential abundance in compositional counts data, with application to microbiome studies. The Annals of Applied Statistics. 2022. [Google Scholar]

[pcbi.1010044.ref058] 58. Kurtz ZD, Müller CL, Miraldi ER, Littman DR, Blaser MJ, Bonneau RA. Sparse and Compositionally Robust Inference of Microbial Ecological Networks. PLOS Computational Biology. 2015;11(5):e1004226. doi: 10.1371/journal.pcbi.1004226 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref059] 59. Peschel S, Müller CL, von Mutius E, Boulesteix AL, Depner M. NetCoMi: network construction and comparison for microbiome data in R. Briefings in Bioinformatics. 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref060] 60. Sohn MB, Li H. Compositional mediation analysis for microbiome studies. The Annals of Applied Statistics. 2019;13(1):661–681. doi: 10.1214/18-AOAS1210 [DOI] [Google Scholar]

[pcbi.1010044.ref061] 61. Wang C, Hu J, Blaser MJ, Li H. Estimating and testing the microbial causal mediation effect with high-dimensional and compositional microbiome data. Bioinformatics. 2019;36(2):347–355. doi: 10.1093/bioinformatics/btz565 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref062] 62. Sazal MR, Stebliankin V, Mathee K, Narasimhan G. Causal Inference in Microbiomes Using Intervention Calculus. bioRxiv. 2020. [Google Scholar]

[pcbi.1010044.ref063] 63. Wade KH, Hall LJ. Improving causality in microbiome research: can human genetic epidemiology help? Wellcome open research. 2020;4:199–199. doi: 10.12688/wellcomeopenres.15628.3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref064] 64. Hughes D, Bacigalupe R, Wang J, Rühlemann M, Falony G, Joossens M, et al. Genome-wide associations of human gut microbiome variation and implications for causal inference analyses. Nature Microbiology. 2020;5. doi: 10.1038/s41564-020-0743-8 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref065] 65. Vojinovic D, Radjabzadeh D, Kurilshikov A, Amin N, Wijmenga C, Franke L, et al. Relationship between gut microbiota and circulating metabolites in population-based cohorts. Nature Communications. 2019;10:Article: 5813. doi: 10.1038/s41467-019-13721-1 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref066] 66. Breuninger TA, Riedl A, Wawro N, Rathmann W, Strauch K, Quante A, et al. Differential associations between diet and prediabetes or diabetes in the KORA FF4 study. Journal of Nutritional Science. 2018;7:e34. doi: 10.1017/jns.2018.25 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref067] 67. Godon JJ, Zumstein E, Dabert P, Habouzit F, Moletta R. Molecular microbial diversity of an anaerobic digestor as determined by small-subunit rDNA sequence analysis. Applied and environmental microbiology. 1997;63(7). doi: 10.1128/aem.63.7.2802-2813.1997 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref068] 68. Klindworth A, Pruesse E, Schweer T, Peplies J, Quast C, Horn M, et al. Evaluation of general 16S ribosomal RNA gene PCR primers for classical and next-generation sequencing-based diversity studies. Nucleic acids research. 2013;41(1). doi: 10.1093/nar/gks808 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref069] 69. Callahan BJ, Sankaran K, Fukuyama JA, Mcmurdie PJ, Holmes SP. Bioconductor workflow for microbiome data analysis: from raw reads to community analyses [version 1; referees: 2 approved]. F1000Research. 2016;5. doi: 10.12688/f1000research.8986.2 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref070] 70. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, et al. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic acids research. 2013;41(Database issue):D590. doi: 10.1093/nar/gks1219 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref071] 71. Wang Q, Garrity GM, Tiedje JM, Cole JR. Naive Bayesian Classifier for Rapid Assignment of rRNA Sequences into the New Bacterial Taxonomy. Applied and Environmental Microbiology. 2007;73(16):5261. doi: 10.1128/AEM.00062-07 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref072] 72. Edgar RC, Valencia A. Updating the 97% identity threshold for 16S ribosomal RNA OTUs. Bioinformatics. 2018;34(14):2371–2375. doi: 10.1093/bioinformatics/bty113 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref073] 73. Lozupone C, Knight R. UniFrac: a new phylogenetic method for comparing microbial communities. Applied and environmental microbiology. 2005;71(12):8228–8235. doi: 10.1128/AEM.71.12.8228-8235.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref074] 74. Wright ES. Using DECIPHER v2.0 to Analyze Big Biological Sequence Data in R. The R Journal. 2016;8(1):352–359. doi: 10.32614/RJ-2016-025 [DOI] [Google Scholar]

[pcbi.1010044.ref075] 75. Studier JA, Keppler KJ. A note on the neighbor-joining algorithm of Saitou and Nei. Molecular biology and evolution. 1988;5(6):729. [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref076] 76. Rubin DB. The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials. Statistics in Medicine. 2007;26(1):20–36. doi: 10.1002/sim.2739 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref077] 77.Micali S, Vazirani VV. An Algoithm for Finding Maximum Matching in General Graphs. In: Proceedings of the 21st Annual Symposium on Foundations of Computer Science. SFCS’80. Washington, DC, USA: IEEE Computer Society; 1980. p. 17–27.

[pcbi.1010044.ref078] 78. Singh RK, Chang HW, Yan D, Lee KM, Ucmak D, Wong K, et al. Influence of diet on the gut microbiome and implications for human health. Journal of Translational Medicine. 2017;15(1):73. doi: 10.1186/s12967-017-1175-y [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref079] 79. Johnson AJ, Zheng JJ, Kang JW, Saboe A, Knights D, Zivkovic AM. A Guide to Diet-Microbiome Study Design. Frontiers in Nutrition. 2020;7:79. doi: 10.3389/fnut.2020.00079 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref080] 80. Rubin DB. Randomization Analysis of Experimental Data: The Fisher Randomization Test Comment. Journal of the American Statistical Association. 1980;75(371):591–593. doi: 10.2307/2287653 [DOI] [Google Scholar]

[pcbi.1010044.ref081] 81. Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. Microbiome Datasets Are Compositional: And This Is Not Optional. Frontiers in Microbiology. 2017;8. doi: 10.3389/fmicb.2017.02224 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref082] 82. Wasserstein RL, Schirm AL, Lazar NA. Moving to a World Beyond “p < 0.05”. The American Statistician. 2019;73(sup1):1–19. doi: 10.1080/00031305.2019.1583913 [DOI] [Google Scholar]

[pcbi.1010044.ref083] 83. Willis A, Bunge J. Estimating diversity via frequency ratios. Biometrics. 2015;71(4):1042–1049. doi: 10.1111/biom.12332 [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref084] 84. Zhao N, Chen J, Carroll I, Ringel-Kulka T, Epstein M, Zhou H, et al. Testing in Microbiome-Profiling Studies with MiRKAT, the Microbiome Regression-Based Kernel Association Test. The American Journal of Human Genetics. 2015;96(5):797–807. doi: 10.1016/j.ajhg.2015.04.003 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref085] 85. Shannon CE. A Mathematical Theory of Communication. Bell System Technical Journal. 1948;27(3):379–423. doi: 10.1002/j.1538-7305.1948.tb01338.x [DOI] [Google Scholar]

[pcbi.1010044.ref086] 86. Basharin GP. On a Statistical Estimate for the Entropy of a Sequence of Independent Random Variables. Theory of Probability and its Applications. 1959;4(3):333. doi: 10.1137/1104033 [DOI] [Google Scholar]

[pcbi.1010044.ref087] 87. Brillinger DR, Jones LV, Tukey JW. The Role of Statistics in Weather Resources Management. In: The Management of Weather Resources. vol. 2. Washington D.C., USA: U.S. Government Printing Office; 1978. p. 25. [Google Scholar]

[pcbi.1010044.ref088] 88. Aitchison JJ. The statistical analysis of compositional data. Caldwell, N.J.: Blackburn Press; 2003. [Google Scholar]

[pcbi.1010044.ref089] 89. Gower JC. A General Coefficient of Similarity and Some of Its Properties. Biometrics. 1971;27(4):857–871. doi: 10.2307/2528823 [DOI] [Google Scholar]

[pcbi.1010044.ref090] 90. Lin X. Variance Component Testing in Generalised Linear Models with Random Effects. Biometrika. 1997;84(2):309–326. doi: 10.1093/biomet/84.2.309 [DOI] [Google Scholar]

[pcbi.1010044.ref091] 91. Lee JJ, Forastiere L, Miratrix L, Pillai NS. More powerful multiple testing in randomized experiments with non-compliance. Statistica Sinica. 2017;27(3):1319–1345. [Google Scholar]

[pcbi.1010044.ref092] 92. Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics. 2008;9(3):432–441. doi: 10.1093/biostatistics/kxm045 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref093] 93. Liu H, Roeder K, Wasserman L. Stability Approach to Regularization Selection (StARS) for High Dimensional Graphical Models. Adv Neural Inf Process Syst. 2010;24(2):1432–1440. [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref094] 94. Wasserstein R, Lazar N. The ASA’s Statement on p-Values: Context, Process, and Purpose. American Statistician. 2016;70(2):129–131. doi: 10.1080/00031305.2016.1154108 [DOI] [Google Scholar]

[pcbi.1010044.ref095] 95. Fu J, Bonder MJ, Cenit MC, Tigchelaar EF, Maatman A, Dekens JAM, et al. The Gut Microbiome Contributes to a Substantial Proportion of the Variation in Blood Lipids. Circulation research. 2015;117(9):817–824. doi: 10.1161/CIRCRESAHA.115.306807 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref096] 96. He Y, Wu W, Wu S, Zheng HM, Li P, Sheng HF, et al. Linking gut microbiota, metabolic syndrome and economic status based on a population-level analysis. Microbiome. 2018;6(1):172. doi: 10.1186/s40168-018-0557-6 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref097] 97. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. doi: 10.1093/biomet/70.1.41 [DOI] [Google Scholar]

[pcbi.1010044.ref098] 98. McMurdie PJ, Holmes S. phyloseq: An R Package for Reproducible Interactive Analysis and Graphics of Microbiome Census Data. PLOS ONE. 2013;8(4):1–11. doi: 10.1371/journal.pone.0061217 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref099] 99. Ma S, Ren B, Mallick H, Moon YS, Schwager E, Maharjan S, et al. A statistical model for describing and simulating microbial community profiles. PLOS Computational Biology. 2021;17(9):1–27. doi: 10.1371/journal.pcbi.1008913 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref100] 100. Rubin DB. More powerful randomization-based p-values in double-blind trials with non-compliance. Statistics in Medicine. 1998;17(3):371–385. doi: [DOI] [PubMed] [Google Scholar]

[pcbi.1010044.ref101] 101. Mishra AK, Müller CL. Negative Binomial factor regression with application to microbiome data analysis. Statistics in Medicine, accepted. 2022;. doi: 10.1002/sim.9384 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref102] 102. Rosenbaum PR. Design of Observational Studies. Springer, New-York; 2010. [Google Scholar]

[pcbi.1010044.ref103] 103. Heckman JJ, Ichimura H, Todd P. Matching as an econometric evaluation estimator. Review of Economic Studies. 1998;65:261–294. doi: 10.1111/1467-937X.00044 [DOI] [Google Scholar]

[pcbi.1010044.ref104] 104. Rubin DB. Using Propensity Scores to Help Design Observational Studies: Application to the Tobacco Litigation. Health Services and Outcomes Research Methodology. 2001;2(3):169–188. doi: 10.1023/A:1020363010465 [DOI] [Google Scholar]

[pcbi.1010044.ref105] 105. Dehejia RH, Wahba S. Causal Effects in Nonexperimental Studies: Reevaluating the Evaluation of Training Programs. Journal of the American Statistical Association. 1999;94(448):1053–1062. doi: 10.1080/01621459.1999.10473858 [DOI] [Google Scholar]

[pcbi.1010044.ref106] 106. Wu X, Braun D, Schwartz J, Kioumourtzoglou MA, Dominici F. Evaluating the impact of long-term exposure to fine particulate matter on mortality among the elderly. Science advances. 2020;6(29):eaba5692. doi: 10.1126/sciadv.aba5692 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pcbi.1010044.ref107] 107. Sun K, Liu J, Ning G. Active Smoking and Risk of Metabolic Syndrome: A Meta-Analysis of Prospective Studies. PLOS ONE. 2012;7(10):e47791. doi: 10.1371/journal.pone.0047791 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

A randomization-based causal inference framework for uncovering environmental exposure effects on human gut microbiota

Alice J Sommer

Annette Peters

Martina Rommel

Josef Cyrys

Harald Grallert

Dirk Haller

Christian L Müller

Marie-Abèle C Bind

Roles

Abstract

Author summary

1 Introduction

Fig 1. The four stages of the causal inference framework [21] adapted to the exploration of environment-gut microbiome relationships.

2 Methods

2.1 The German KORA FF4 cohort study

2.1.1 Gut microbiome data sequencing and preprocessing

2.2 Causal inference framework

2.3 Conceptual stage: Formulation of the hypothetical randomized experiment in terms of potential outcomes

Table 1. Potential outcomes for the subjects of the hypothetical experiment.

2.3.1 Observed outcomes measurement

2.4 Design stage: Reconstruction of the conceptualized hypothetical experiment

Table 2. Before and after matching number of units.

2.5 Statistical analysis stage: Randomization-based inference

Table 3. Data transformation and choice of test statistics.

2.5.1 Diversity analyses

2.5.2 Composition analyses

2.6 Summary stage: Interpretation of the results

3 Results

3.1 Characteristics of study population

Table 4. Baseline characteristics of the study population in the air pollution reduction (left table) and smoking prevention experiments (right table).

3.2 Microbial diversity analysis

3.2.1 Within-subject diversity

Fig 2. Richness and α-diversity.

3.2.2 Between-subject variation

Table 5. β-diversity.

3.3 Microbial compositions analysis

3.3.1 Compositional mean differences

Table 6. Compositional equivalence test.

3.3.2 Differential taxon abundances

Fig 3. Differential abundance.

3.4 Microbial network analysis

3.4.1 Genus-genus association networks

Fig 4. Genus-genus associations of smokers and never-smokers (n = 271, p = 140).

3.4.2 Differential genus-genus associations

Table 7. Differential associations of genera.

3.5 Exploring associations between genera and lipid metabolites

Fig 5. Lipid metabolites exploration.

3.6 Sensitivity analysis

4 Discussion

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Decision Letter 0

Kiran Raosaheb Patil

Simon Anders

Roles

Author response to Decision Letter 0

Decision Letter 1

Kiran Raosaheb Patil

Simon Anders

Roles

Author response to Decision Letter 1

Decision Letter 2

Kiran Raosaheb Patil

Simon Anders

Roles

Acceptance letter

Kiran Raosaheb Patil

Simon Anders

Roles

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES