Accommodating multiple potential normalizations in microbiome associations studies

Hoseung Song; Wodan Ling; Ni Zhao; Anna M Plantinga; Courtney A Broedlow; Nichole R Klatt; Tiffany Hensley-McBain; Michael C Wu

doi:10.1186/s12859-023-05147-w

. 2023 Jan 19;24:22. doi: 10.1186/s12859-023-05147-w

Accommodating multiple potential normalizations in microbiome associations studies

Hoseung Song ¹, Wodan Ling ¹, Ni Zhao ², Anna M Plantinga ³, Courtney A Broedlow ⁴, Nichole R Klatt ⁴, Tiffany Hensley-McBain ⁵, Michael C Wu ^1,^✉

PMCID: PMC9850542 PMID: 36658484

Abstract

Background

Microbial communities are known to be closely related to many diseases, such as obesity and HIV, and it is of interest to identify differentially abundant microbial species between two or more environments. Since the abundances or counts of microbial species usually have different scales and suffer from zero-inflation or over-dispersion, normalization is a critical step before conducting differential abundance analysis. Several normalization approaches have been proposed, but it is difficult to optimize the characterization of the true relationship between taxa and interesting outcomes.

Results

To avoid the challenge of picking an optimal normalization and accommodate the advantages of several normalization strategies, we propose an omnibus approach. Our approach is based on a Cauchy combination test, which is flexible and powerful by aggregating individual p values. We also consider a truncated test statistic to prevent substantial power loss. We experiment with a basic linear regression model as well as recently proposed powerful association tests for microbiome data and compare the performance of the omnibus approach with individual normalization approaches. Experimental results show that, regardless of simulation settings, the new approach exhibits power that is close to the best normalization strategy, while controling the type I error well.

Conclusions

The proposed omnibus test releases researchers from choosing among various normalization methods and it is an aggregated method that provides the powerful result to the underlying optimal normalization, which requires tedious trial and error. While the power may not exceed the best normalization, it is always much better than using a poor choice of normalization.

Supplementary Information

The online version contains supplementary material available at 10.1186/s12859-023-05147-w.

Keywords: Normalization, Different library size, Omnibus approach, Cauchy combination test

Introduction and motivation

Microbial communities have been revealed to be closely related to many conditions, such as obesity [1–3], diabetes [4–6], and HIV [7–9]. With the development of high-throughput sequencing technologies enabling large-scale microbiome studies, human microbiome profiling studies for health conditions and diseases are gaining more attention. A central objective of human microbiome profiling studies is to identify individual bacterial taxa related to host outcomes, exposures, or other variables of interest among the meta data. This provides an important understanding of the mechanisms underlying different outcomes as well as host responses to exposures and potential contribution to therapeutic interventions.

The most common approach for identifying individual taxa related to variables of interest is to test the association between the variable and the abundance of each taxon, one by one. Many differential abundance analysis approaches have been proposed based on the linear model [10, 11], phylogenetic tree [12, 13], and zero-inflated model [14–16]. Based on these tests, the p value for the association of each taxon is generated and statistical significance is determined after controlling for multiple testing. On the other hand, global association tests have also been proposed to accommodate general dependency patterns between overall microbiome composition and profiles of other types of genomic data, including kernel-based tests [17–20] and distance-based tests [21–23].

Unfortunately, intrinsic challenges of microbiome data often make it difficult to identify differentially abundant taxa. For example, a central challenge of microbiome profiling studies is the issue of differential library size (total counts per sample), which is difficult to control [24] and does not reflect actual differences in microbial communities. Under-sampling in some individuals may also exacerbate zero-inflation and over-dispersion, leading to substantial power loss [25].

Normalization, broadly defined, is a strategy for overcoming differences in library size [26]. Failure to harmonize library sizes can lead to a severe loss of power as the scale of assessed abundances is essentially different for each sample. Common approaches to normalization include scaling the observations to have a unit sum (such that the norm of the abundances for each sample is equal to 1—the original definition of normalization), and scaling by other measures of central tendency, among others (see [27]). Theoretically, as long as read depth is not systematically confounded with the variable of interest, analysis under any normalizations is valid. However, in addition to philosophical differences between different normalization approaches, different normalizations also implicitly specify the expected relationship between each taxon under consideration and the outcome. A normalization that results in a better characterization of the true relationship between taxa and outcomes will lead to better power. Yet, the best normalization (leading to the highest power) is difficult to ascertain, as this depends on the unknown true state of nature and may also be different depending on the taxon under consideration.

To address the challenge of picking a single, optimal normalization, we develop an omnibus approach wherein we consider analyzing the data under several different normalization strategies. We then aggregate the results from different normalization approaches while adjusting for the fact that different normalizations have been used, in order to prevent p-hacking. Our approach is based on a Cauchy Combination Test (CCT) [28] which allows aggregation of correlated p values (calculated under different normalizations). Simulations show that our approach often leads to power similar to, or exceeds, the best individual normalization strategy, while still protecting the false discovery rate.

Methods

We assume a study in which there are n samples on which p taxa are measured. Let the raw vector of abundances for the $i^{th}$ sample be $X_{i}$ . For a given normalization $T (\cdot)$ , we set ${\tilde{X}}_{i} = T (X_{i})$ and $\tilde{X} = T (X) = [T (X_{1}), \dots, T (X_{n})]$ . We further assume that there are J different normalizations that can be considered such that we have $T_{1} (\cdot), T_{2} (\cdot), \dots, T_{J} (\cdot)$ . We focus on the objective of identifying individual bacterial taxa as associated with the outcome of interest.

The fundamental challenge that we hope to address is that it is unclear which normalization to use. Different normalizations correspond to different interpretations and implications of different bacterial taxa. Consequently, we propose a strategy in which we consider multiple potential normalizations. In this section, we first describe different, commonly used normalization strategies before outlining the specific approach for implicating taxa across different normalization approaches. We further describe simulation settings for evaluating the power and type I error of our strategy.

Common normalization approaches

Normalization is an important step for reducing variability in the data due to differential library size. Some easily applicable normalization strategies include:

(i)
None $T (X_{i}) = X_{i}$ .
(ii)
Rarefaction Each sample’s observed counts are sub-sampled such that the total count is the same for all samples.
(iii)
Total sum scaling (TSS) Observed counts are scaled by the sample’s library size (sum of counts).
(iv)
Cumulative sum scaling (CSS) Observed counts are scaled by the sum of counts up to a cutoff quantile1. (see details in [29]).
(v)
Center Log Ratio (CLR) transform Observed counts are divided by the geometric mean of the sample’s counts, then log-transformed.

Even though these normalization strategies work well under many settings, some concerns are discussed in the literature: artificial uncertainty in the sub-sampling step (rarefaction) [30], a bias in differential abundance estimates (TSS) [31], and uncertainty in the selection of quantile (CSS). Hence, it is difficult to find an optimal normalization approach and it depends on the unknown form of relationship between microbiome data and interesting outcomes.

Given the inherent challenge of selecting a single optimal normalization strategy, we propose to simply apply multiple normalization approaches. Specifically, for each choice of normalization, we apply a valid statistical test to get the p value for the association between each taxon and the variable of interest. In the next section, we describe how we combine the p values to get a single omnibus p value for each taxon.

Combining results across normalizations via the cauchy combination test

Assume that after applying each normalization, we apply a valid test to assess the association between each taxon and the variable of interest such that we have a $J \times p$ matrix of individual p values $P$ with $p_{j, k}$ , the p value for the $k^{th}$ taxon after applying the $j^{th}$ normalization strategy to the dataset.

Given $P$ , we apply the Cauchy combination test (CCT) [28] for each taxon to obtain the omnibus p value. It is well-known that the Cauchy combination test is useful for dealing with sparse alternatives, high-dimensional large-scale datasets, and small p values, which are common situations in GWAS. In particular, the analytic p value approximation by the Cauchy distribution is very accurate under arbitrary dependency structures. Hence, in practice, the test only requires the individual p values as input, so this omnibus testing procedure is very fast. Specifically, for $k = 1, \dots, p$ , we set

\begin{matrix} p_{k} = \frac{1}{J} \sum_{j = 1}^{J} tan {(0.5 - p_{j, k}) π} \end{matrix}

to be the final p value for the kth taxon and $p_{k}$ incorporates each normalization strategy.

Though CCT is convenient and exact for any number of p values, it suffers the drawback of sensitivity to p values at or near 1. Specifically, the Cauchy combination p value, $p_{k}$ , converges to 1 as one of $p_{j, k}$ approaches 1 $(j = 1, \dots, J)$ . This can happen for tests of discrete data or when the model to derive p values is mis-specified. To address this, we propose to use a truncated Cauchy combination test proposed by [32]:

\begin{matrix} p_{k} = \frac{1}{J} \sum_{j = 1}^{J} tan {(0.5 - min (p_{j, k}, 1 - ϵ)) π}, \end{matrix}

where $ϵ = 0.01$ . This prevents the overshoot of $p_{k}$ over the threshold $1 - ϵ$ .

Simulation setup

To check the performance of the method, we first follow the simulation setup in [16] using data generated to mimic a real data set. Specifically, we simulate data from the Coronary Artery Risk Development in Young Adults Study (CARDIA) [33] which aimed to investigate microbial taxa related to cardiovascular disease risk factors. The broader parent study [34] was balanced in terms of race (black or white), education (more than high school or not), and age. Between 1987 and 2016, each subject had up to eight follow-up visits. A variety of cardiovascular disease-related parameters, as well as physical measurements and lifestyle factors, were gathered.

We follow the preprocessing of [16] and focus on microbiome count data aggregated at the genus-level. The processed data contains data on 106 genera for 549 subjects. Our goal is to identify differentially abundant taxa between subjects with high blood pressure (HBP) and without HBP and examine how powerful the omnibus approach is over individual normalization strategies. Here, we treat blood pressure as a binary variable (HBP vs. non-HBP). We then consider two scenarios based on unadjustment/adjustment for covariates:

Setting 1 (type I error and power of individual-level analysis on a single taxon): We select ‘Streptococcus’ taxon, which is differentially abundant with strong differences in the mean abundance by HBP status, and test the association between the selected taxon and HBP without adjustment for other covariates. To assess the type I error rate, we generate 600 samples from the empirical distribution functions (edf) of the normalized abundance in subjects without HBP. To assess the power of the test, we generate 300 samples each from the edf of HBP and non-HBP groups. We also examine cases where the two groups are mixed by $δ$ % with each other. We simulate 10,000 datasets and the significance level is set to be 0.01.
Setting 2 (type I error and power of individual-level analysis on an OTU table): We create the starting dataset by rarefying the CARDIA dataset 10 times and averaging the resulting datasets. With this starting data, we fit each of the genera by the two-part quantile regression model (see Additional file 1: Figure S1):
$\begin{matrix} logit {P (D = 1 | X)} & = γ_{0} + γ_{1} HBP + γ_{2} age + γ_{3} physical activity \\ + γ_{4} diet quality score, \\ Q_{Y} (τ | X, Y > 0) & = β_{0} (τ) + β_{1} (τ) HBP + β_{2} (τ) age + β_{3} (τ) physical activity \\ + β_{4} (τ) diet quality score, \end{matrix}$
where $D = I (Y > 0)$ is a binary indicator of the presence of genus, and $γ_{i}$ ’s and $β_{i}$ ’s are estimated by the starting data and non-zero observations of the starting data using $τ = 0.01, \dots, 0.99$ , respectively. To assess the type I error rate, we generate n samples by resampling each of the real covariates with replacement independently and simulate D with the constraint $γ_{1} = β_{1} (τ) = 0$ . If $D = 0$ , we assign 0 as the count. If $D = 1$ , we simulate the count by the inverse CDF method: compute $Y = β_{0} (u) + β_{2} (u) age + β_{3} (u) physical activity + β_{4} (u) diet quality score$ , where $u \sim U (0, 1)$ and round it to the nearest integer. To assess the power of the test, we follow the same procedure without the constraint. We simulate 1000 datasets and the significance level is set to be 0.05.

For each test, normalization strategies (i)–(v) are considered and we compare the omnibus p value with the p values based on each normalization strategy. To obtain the p values, we apply the simple linear regression, a zero-inflated quantile approach (ZINQ) proposed by [16], a quantile regression using a rank score function and ignoring zero inflation (QRank) proposed by [35]. In addition, the mixing rate $(δ)$ and sample size (n) are chosen so that the results can be compared well between normalization strategies.

We also consider the simulation setup in [36] to examine the performance of the omnibus method by a global microbiome association test. Specifically, we generate n/2 genotype data of haplotypes from African and European ancestry by randomly pairing 2 haplotypes, respectively, over a 1 MB chromosome according to coalescent theory using the cosi2 program [37]. We also generate n samples of microbiome OTU counts from the Dirichlet-multinomial distribution. We first estimate the parameters of the Dirichlet-multinomial distribution using a real upper-respiratory-tract microbiome dataset [38]. This dataset is publicly available by an $R$ package $GUniFrac$ . This consists of 856 OTUs.

Setting 3 (type I error and power of community-level analysis on an OTU table): To assess the type I error rate, we use the above setting without introducing any genetic effect on the microbiome. To assess the power of the test, we introduce the association between genetics and microbiome. For each individual $i = 1, \dots, n$ , let $g_{i}$ be the genotype at a chosen common SNP (with MAF $\geq$ 0.05). Then we increase the counts of the $η$ th–20th most common OTUs by a factor $f_{i} = 1 + 1.7 * g_{i}$ .

For each test, we use a kernel RV coefficient (KRV) test to evaluate the overall association between genetic expression and microbiome composition. [39, 40]. We use the Bray-Curtis kernel and choose normalization strategies (i)–(iv) since CLR transformation provides negative results and this does not allow the KRV test. We simulate 1000 datasets and the significance level is set to be 0.05.

Results

Type I error

Table 1, 2, and 3 show the empirical size of the tests under Setting 1, 2, and 3, respectively. We see that the omnibus approach in general controls the type I error rate well, compared to other normalization approaches.

Table 1.

Empirical size of the tests under Setting 1 at 0.01 significance level

	None	Rarefaction	TSS	CSS	CLR	Omnibus
Linear regression	0.009	0.010	0.010	0.010	0.010	0.010
ZINQ	0.010	0.009	0.010	0.010	0.012	0.010
QRank	0.009	0.010	0.009	0.009	0.009	0.009

Open in a new tab

Table 2.

Empirical size of the tests under Setting 2 at 0.05 significance level, where n is the sample size

		None	Rarefaction	TSS	CSS	CLR	Omnibus
$n = 700$	Linear Regression	0.048	0.048	0.049	0.050	0.050	0.051
	ZINQ	0.052	0.051	0.052	0.052	0.055	0.056
	QRank	0.050	0.050	0.050	0.049	0.049	0.052
$n = 600$	Linear Regression	0.049	0.049	0.048	0.050	0.050	0.052
	ZINQ	0.052	0.052	0.055	0.053	0.055	0.056
	QRank	0.052	0.052	0.052	0.050	0.049	0.055
$n = 500$	Linear Regression	0.047	0.047	0.048	0.050	0.049	0.051
	ZINQ	0.052	0.052	0.054	0.053	0.055	0.056
	QRank	0.050	0.050	0.050	0.049	0.049	0.055
$n = 400$	Linear Regression	0.047	0.047	0.047	0.050	0.049	0.050
	ZINQ	0.052	0.053	0.054	0.054	0.055	0.056
	QRank	0.051	0.050	0.050	0.050	0.049	0.052

Open in a new tab

Table 3.

Empirical size of the tests under Setting 3 at 0.05 significance level, where n is the sample size

	None	Rarefaction	TSS	CSS	Omnibus
$n = 500$	0.049	0.049	0.049	0.049	0.046
$n = 400$	0.056	0.056	0.056	0.053	0.056
$n = 300$	0.058	0.058	0.058	0.053	0.056

Open in a new tab

Power

Table 4, 5, and 6 show the estimated power of the tests under Setting 1, 2, and 3, respectively. Figure 1 shows their visualization. Under Setting 1, we see that CLR normalization exhibits the best performance, while rarefaction shows the worst performance. Within this gap, the omnibus approach exhibits power that is almost as high as the performance when using CLR normalization. On the other hand, under Setting 2, when applying the linear regression, CSS normalization exhibits high power, while the rarefaction approach shows the worst performance. Surprisingly, the omnibus approach exhibits the best performance, even over the CSS approach. When using ZINQ, rarefaction approach still shows the worst performance, while ZINQ achieves high power without normalization. The omnibus approach shows the best performance, the same as the linear regression case. When applying QRank, CLR approach exhibits high power, but the omnibus approach achieves the best power. Under Setting 3, the omnibus approach exhibits high power, while raw data and TSS normalization show good performance as well.

Table 4.

Estimated power of the tests under Setting 1, where $δ$ is the mixing proportion

		None	Rarefaction	TSS	CSS	CLR	Omnibus
$δ = 0 %$	Linear Regression	0.943	0.842	0.844	0.991	0.999	0.996
	ZINQ	0.882	0.738	0.902	0.882	0.992	0.974
	QRank	0.780	0.821	0.799	0.809	0.943	0.900
$δ = 10 %$	Linear Regression	0.738	0.613	0.616	0.924	0.976	0.948
	ZINQ	0.658	0.493	0.689	0.651	0.923	0.833
	QRank	0.526	0.576	0.551	0.564	0.761	0.571
$δ = 20 %$	Linear Regression	0.415	0.332	0.336	0.658	0.793	0.690
	ZINQ	0.348	0.241	0.371	0.346	0.650	0.497
	QRank	0.245	0.276	0.261	0.267	0.417	0.328
$δ = 30 %$	Linear Regression	0.148	0.120	0.122	0.268	0.364	0.271
	ZINQ	0.114	0.084	0.125	0.117	0.265	0.164
	QRank	0.086	0.098	0.090	0.092	0.140	0.108

Open in a new tab

The two highest powers are in bold

Table 5.

Estimated power of the tests under Setting 2, where n is the sample size

		None	Rarefaction	TSS	CSS	CLR	Omnibus
$n = 700$	Linear Regression	0.270	0.255	0.258	0.324	0.308	0.348
	ZINQ	0.398	0.338	0.377	0.377	0.388	0.441
	QRank	0.341	0.278	0.304	0.305	0.409	0.411
$n = 600$	Linear Regression	0.243	0.229	0.232	0.295	0.279	0.312
	ZINQ	0.353	0.301	0.336	0.337	0.348	0.392
	QRank	0.300	0.242	0.268	0.270	0.361	0.362
$n = 500$	Linear Regression	0.233	0.219	0.222	0.280	0.265	0.298
	ZINQ	0.330	0.283	0.314	0.315	0.325	0.365
	QRank	0.278	0.230	0.252	0.252	0.331	0.335
$n = 400$	Linear Regression	0.190	0.178	0.181	0.230	0.219	0.240
	ZINQ	0.260	0.223	0.248	0.248	0.259	0.286
	QRank	0.215	0.179	0.196	0.196	0.252	0.257

Open in a new tab

The two highest powers are in bold

Table 6.

Estimated power of the tests under Setting 3, where n is the sample size and $η$ indicates the order of taxon

		None	Rarefaction	TSS	CSS	Omnibus
$n = 500$	$η = 14$	0.975	0.965	0.966	0.719	0.966
	$η = 15$	0.921	0.905	0.910	0.647	0.909
	$η = 16$	0.880	0.864	0.871	0.591	0.871
$n = 400$	$η = 14$	0.947	0.929	0.930	0.655	0.931
	$η = 15$	0.897	0.881	0.886	0.607	0.883
	$η = 16$	0.833	0.824	0.828	0.493	0.830
$n = 300$	$η = 14$	0.894	0.875	0.881	0.563	0.876
	$η = 15$	0.824	0.812	0.819	0.470	0.820
	$η = 16$	0.806	0.797	0.804	0.375	0.804

Open in a new tab

The two highest powers are in bold

Fig. 1 — Estimated power of the tests under Setting 1, 2, and 3

As shown in Table 4, 5, and 6, the best normalization strategy depends on different differential abundance methods and situations, and it is difficult to choose the optimal normalization strategy. However, the omnibus method generally performs well without prior or true relationship knowledge between taxa and outcomes, so this would be more efficient and useful when applying differential abundance tests in microbiome studies.

Real data application

In this section, we illustrate the omnibus approach on the HIV dataset analyzed in [41]. The HIV dataset is obtained by a multicolor flow cytometry-based method that separates neutrophils from other leukocytes in order to get a more precise measurement of neutrophil frequencies in the Gastrointestinal (GI) during HIV infection. This allows the identification of neurophils in blood and fresh GI issues and the calculation of the frequency of neutrophils as a percentage of all live CD45 $+$ cells.

As a result, this dataset consists of colorectal biopsies from a total of 40 HIV-infected, antiretroviral therapy (ART) suppressed individuals and 35 HIV-uninfected individuals with relevant participant demographic information, such as age, sex, sexual orientation, and race/ethnicity. The authors characterized the intestinal microbiome of colorectal biopsies using 16S rRNA sequencing and studied the association between the mucosal microbiome composition of colorectal biopsies and some relevant factors. For example, based on the fact that men who have sex with men (MSM) have an increased abundance of Prevotella, independent of HIV status, and this may result in the dysbiosis previously attributed to HIV infection, the authors observed the significant association between the overall microbial composition at the genus level and sexual orientation (MSM or non-MSM). They also showed that this association remained when adjusted for age, race, and HIV status.

We utilize this dataset to illustrate how the omnibus approach accommodates several normalization strategies. Specifically, we conduct association tests between the mucosal microbiome composition of colorectal biopsies and the sexual orientation. According to the results in [41], we test the association with the sexual orientation without adjustment for other covariates or with adjustment for age, race, and HIV status. Here, we fix the false discovery rate (FDR) at 20%.

Table 7 shows the number of differentially abundant taxa out of 108 taxa based on the sexual orientation at 20% FDR. We see that the best normalization approach is different for each case, but the omnibus approach in general identifies nearly as many significant taxa as the best normalization approach.

Table 7.

Number of differentially abundant taxa based on the sexual orientation

		None	Rarefaction	TSS	CSS	CLR	Omnibus
without covariates	Linear Regression	1	5	7	24	26	21
	ZINQ	13	11	8	16	24	17
	QRank	10	9	13	20	10	15
with covariates	Linear Regression	1	6	6	20	23	19
	ZINQ	0	0	11	1	24	16
	QRank	13	7	6	5	6	8

Open in a new tab

The two highest powers are in bold

We also assess type I error control based on permuted HIV datasets to check the validity of the omnibus test. For each normalization strategy, covariates as well as the sexual orientation are jointly permuted over the whole samples to create a permuted OTU table. This removes the association between the mucosal microbioal abundance and the sexual orientation, and taxa with small p values are considered false positive signals. We evaluate type I error control by the proportion of taxa with p values less than 0.1. We repeat this procedure 50 times and the results are presented via boxplots (Figs. 2 and 3). Compared to other normalization strategies, we see that the omnibus approach consistently controls the type I error rate well.

Fig. 2 — Boxplots of type I error rate under different normalization strategies without adjustment for covariates

Fig. 3 — Boxplots of type I error rate under different normalization strategies with adjustment for covariates

Discussion

We propose the omnibus approach to accommodate multiple potential normalization strategies. Essentially, each choice of normalization inherently assumes a different model for the relationship between taxa and variables of interest, with the optimal choice being unknown (and potentially differing across taxa). By using the omnibus approach, we can avoid the problem of choosing the optimal normalization strategy and instead test across a range of different normalization approaches. Numerical experiments and the real data application demonstrate that the omnibus test not only avoids the possibility of choosing the worst-performing normalization method but also exhibits power nearly as high as the power of the best normalization strategy.

Other factors, such as sequencing methods, amplicon bias, and target gene copy number could impact the differential abundance analysis results. However, they generate bias in microbiome sequencing, making measured relative abundances systematically different from their underlying truth. In this case, bias-resistant modeling or bias-insensitive analytical methods are needed. However, the impact of bias is usually not handled via normalization approaches, and thus goes beyond the scope of this paper. This paper mainly handles the differences caused by differential library sizes that affect all taxa more or less evenly. The focus of this paper is to study whether the omnibus approach helps bypass the normalization issue in microbiome differential abundance testing.

Certain transformations, in principle, can reduce the impact of compositional effects. However, without extrinsic information, there ultimately remains a particular denominator for normalization. Thus, our approach does not seek to overcome the issue, but rather we note that the statistical analysis is “valid” under any choice of normalization and compositionality is an issue of subsequent interpretation. That is, interpretation of significant findings is affected by the compositionality of the data and potentially by the normalization approaches considered, but fully assessing the impact of compositionality on the omnibus test lies outside the scope of the present work.

In this paper, we focus on the linear regression, ZINQ, QRank, and KRV. We acknowledge that a wide range of alternative methods for differential abundance analysis could also be used [42–44]; however, many of these methods are specialized in particular normalizations. For example, a metagenomeSeq method proposed by [43] internally implements CSS normalization and a DESeq2 method proposed by [45] implements relative log expression (RLE) normalization. More importantly, many of these approaches often fail to control the type I error [26, 46, 47] and are, therefore, statistically invalid. In contrast, the simple linear regression, ZINQ, QRank, and KRV have been shown to consistently protect the type I error and do not depend on the particular choice of normalization. We could also combine results across different association testing methods, as well as normalizations, but given their lack of statistical validity, aggregating invalid results just results in further false positives.

In addition, we consider four normalization strategies, including the rarefaction, TSS, CSS, and CLR transformations. Other normalization methods are also commonly used, such as an additive log-ratio transformation (ALR) or an isometric log-ratio transformation (ILR) [48]. However, ALR is computed with respect to the last element of variables, so it heavily depends on this element and its noise. Though ILR has good theoretical properties, it is difficult to interpret the result since transformed variables are intricate mixtures of the original variables. To fairly examine the performance of our approach, we choose four normalization methods that are simpler and more flexible to use.

Supplementary Information

12859_2023_5147_MOESM1_ESM.pdf^{(74.3KB, pdf)}

Additional file 1 Figure on histograms of some simulated abundances vs. true abundances.

Acknowledgements

Not applicable.

Abbreviations

CCT:: Cauchy combination test
TSS:: Total sum scaling
CSS:: Cumulative sum scaling
CLR:: Center log ratio transformation
CARDIA:: The coronary artery risk development in young adults study dataset
HBP:: High blood pressure
EDF:: Empirical distribution functions
ART:: Antiretroviral therapy
MSM:: Men who have sex with men
FDR:: False discovery rate
RLE:: Relative log expressions

Author Contributions

HS developed the method, designed the analysis, and wrote the manuscript. MCW conceived the study. WL, NZ and AP contributed to the conceptation, methodological developments, and presentation. CAB, NRK, and THM prepared the data and contributed to the data analysis. All authors reviewed the manuscript.

Funding

This work is supported in part by the National Institutes of Health grants R01 GM129512, R01 HL155417.

Availability of data and materials

Data used in this article is available from the CARDIA Study Data Coordinating Center at the University of Alabama at Birmingham. The process for obtaining data through CARDIA is outlined at https://www.cardia.dopm.uab.edu/publications-2/publications-documents. The HIV dataset and R scripts used during the current study are available from the corresponding author upon reasonable request. The main function code for the omnibus approach with an example code is available at https://github.com/hoseungs/Omnibus_normalization.

Declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Footnotes

https://www.metagenomics.wiki/tools/16s/norm/css

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Hoseung Song, Email: hsong3@fredhutch.org.

Wodan Ling, Email: wling@fredhutch.org.

Ni Zhao, Email: nzhao10@jhu.edu.

Anna M. Plantinga, Email: amp9@williams.edu

Courtney A. Broedlow, Email: broed015@umn.edu

Nichole R. Klatt, Email: klat0037@umn.edu

Tiffany Hensley-McBain, Email: thmcbain@mclaughlinresearch.org.

Michael C. Wu, Email: mcwu@fredhutch.org

References

1.Turnbaugh PJ, Ley RE, Mahowald MA, Magrini V, Mardis ER, Gordon JI. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature. 2006;444(7122):1027–1031. doi: 10.1038/nature05414. [DOI] [PubMed] [Google Scholar]
2.John GK, Mullin GE. The gut microbiome and obesity. Curr Oncol Rep. 2016;18(7):1–7. doi: 10.1007/s11912-016-0528-7. [DOI] [PubMed] [Google Scholar]
3.Maruvada P, Leone V, Kaplan LM, Chang EB. The human microbiome and obesity: moving beyond associations. Cell Host Microbe. 2017;22(5):589–599. doi: 10.1016/j.chom.2017.10.005. [DOI] [PubMed] [Google Scholar]
4.Hartstra AV, Bouter KE, Bäckhed F, Nieuwdorp M. Insights into the role of the microbiome in obesity and type 2 diabetes. Diabetes Care. 2015;38(1):159–165. doi: 10.2337/dc14-0769. [DOI] [PubMed] [Google Scholar]
5.Komaroff AL. The microbiome and risk for obesity and diabetes. JAMA. 2017;317(4):355–356. doi: 10.1001/jama.2016.20099. [DOI] [PubMed] [Google Scholar]
6.Vallianou NG, Stratigou T, Tsagarakis S. Microbiome and diabetes: Where are we now? Diabetes Res Clin Pract. 2018;146:111–118. doi: 10.1016/j.diabres.2018.10.008. [DOI] [PubMed] [Google Scholar]
7.Saxena D, Li Y, Yang L, Pei Z, Poles M, Abrams WR, Malamud D. Human microbiome and HIV/AIDS. Curr HIV/AIDS Rep. 2012;9(1):44–51. doi: 10.1007/s11904-011-0103-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Bandera A, De Benedetto I, Bozzi G, Gori A. Altered gut microbiome composition in HIV infection: causes, effects and potential intervention. Curr Opin HIV AIDS. 2018;13(1):73–80. doi: 10.1097/COH.0000000000000429. [DOI] [PubMed] [Google Scholar]
9.Desai SN, Landay AL. HIV and aging: role of the microbiome. Curr Opin HIV AIDS. 2018;13(1):22–27. doi: 10.1097/COH.0000000000000433. [DOI] [PubMed] [Google Scholar]
10.Mallick H, Rahnavard A, McIver LJ, Ma S, Zhang Y, Nguyen LH, Tickle TL, Weingart G, Ren B, Schwager EH, et al. Multivariable association discovery in population-scale meta-omics studies. PLoS Comput Biol. 2021;17(11):1009442. doi: 10.1371/journal.pcbi.1009442. [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Zhou H, He K, Chen J, Zhang X. Linda: linear models for differential abundance analysis of microbiome compositional data. Genome Biol. 2022;23(1):1–23. doi: 10.1186/s13059-022-02655-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Kim KJ, Park J, Park S-C, Won S. Phylogenetic tree-based microbiome association test. Bioinformatics. 2020;36(4):1000–1006. doi: 10.1093/bioinformatics/btz686. [DOI] [PubMed] [Google Scholar]
13.Huang C, Callahan BJ, Wu MC, Holloway ST, Brochu H, Lu W, Peng X, Tzeng J-Y. Phylogeny-guided microbiome otu-specific association test (post). 2021. [DOI] [PMC free article] [PubMed]
14.Hu T, Gallins P, Zhou Y-H. A zero-inflated beta-binomial model for microbiome data analysis. Stat. 2018;7(1):185. doi: 10.1002/sta4.185. [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Ai D, Pan H, Li X, Gao Y, Liu G, Xia LC. Identifying gut microbiota associated with colorectal cancer using a zero-inflated lognormal model. Front Microbiol. 2019;10:826. doi: 10.3389/fmicb.2019.00826. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Ling W, Zhao N, Plantinga AM, Launer LJ, Fodor AA, Meyer KA, Wu MC. Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (zinq) Microbiome. 2021;9(1):1–19. doi: 10.1186/s40168-021-01129-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Zhao N, Chen J, Carroll IM, Ringel-Kulka T, Epstein MP, Zhou H, Zhou JJ, Ringel Y, Li H, Wu MC. Testing in microbiome-profiling studies with Mirkat, the microbiome regression-based kernel association test. Am J Human Genet. 2015;96(5):797–807. doi: 10.1016/j.ajhg.2015.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Zhan X, Tong X, Zhao N, Maity A, Wu MC, Chen J. A small-sample multivariate kernel machine test for microbiome association studies. Genet Epidemiol. 2017;41(3):210–220. doi: 10.1002/gepi.22030. [DOI] [PubMed] [Google Scholar]
19.Zhan X, Xue L, Zheng H, Plantinga A, Wu MC, Schaid DJ, Zhao N, Chen J. A small-sample kernel association test for correlated data with application to microbiome association studies. Genet Epidemiol. 2018;42(8):772–782. doi: 10.1002/gepi.22160. [DOI] [PubMed] [Google Scholar]
20.Koh H, Li Y, Zhan X, Chen J, Zhao N. A distance-based kernel association test based on the generalized linear mixed model for correlated microbiome studies. Front Genet. 2019;10:458. doi: 10.3389/fgene.2019.00458. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Chen J, Bittinger K, Charlson ES, Hoffmann C, Lewis J, Wu GD, Collman RG, Bushman FD, Li H. Associating microbiome composition with environmental covariates using generalized unifrac distances. Bioinformatics. 2012;28(16):2106–2113. doi: 10.1093/bioinformatics/bts342. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Zhang Y, Han SW, Cox LM, Li H. A multivariate distance-based analytic framework for microbial interdependence association test in longitudinal study. Genet Epidemiol. 2017;41(8):769–778. doi: 10.1002/gepi.22065. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Zhang J, Wei Z, Chen J. A distance-based approach for testing the mediation effect of the human microbiome. Bioinformatics. 2018;34(11):1875–1883. doi: 10.1093/bioinformatics/bty014. [DOI] [PubMed] [Google Scholar]
24.Pan AY. Statistical analysis of microbiome data: the challenge of sparsity. Curr Opin Endoc Metab Res. 2021;19:35–40. [Google Scholar]
25.Chen L, Reeve J, Zhang L, Huang S, Wang X, Chen J. Gmpr: a robust normalization method for zero-inflated count data with application to microbiome sequencing data. PeerJ. 2018;6:4600. doi: 10.7717/peerj.4600. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Weiss S, Xu ZZ, Peddada S, Amir A, Bittinger K, Gonzalez A, Lozupone C, Zaneveld JR, Vázquez-Baeza Y, Birmingham A, et al. Normalization and microbial differential abundance strategies depend upon data characteristics. Microbiome. 2017;5(1):1–18. doi: 10.1186/s40168-017-0237-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Lin H, Peddada SD. Analysis of microbial compositions: a review of normalization and differential abundance analysis. NPJ Biofilms Microbiomes. 2020;6(1):1–13. doi: 10.1038/s41522-020-00160-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Liu Y, Xie J. Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures. J Am Stat Assoc. 2020;115(529):393–402. doi: 10.1080/01621459.2018.1554485. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Flynn S, Reen FJ, Caparrós-Martín JA, Woods DF, Peplies J, Ranganathan SC, Stick SM, O’Gara F. Bile acid signal molecules associate temporally with respiratory inflammation and microbiome signatures in clinically stable cystic fibrosis patients. Microorganisms. 2020;8(11):1741. doi: 10.3390/microorganisms8111741. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.McMurdie PJ, Holmes S. Waste not, want not: why rarefying microbiome data is inadmissible. PLoS Comput Biol. 2014;10(4):1003531. doi: 10.1371/journal.pcbi.1003531. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Dillies M-A, Rau A, Aubert J, Hennequet-Antier C, Jeanmougin M, Servant N, Keime C, Marot G, Castel D, Estelle J, et al. A comprehensive evaluation of normalization methods for illumina high-throughput rna sequencing data analysis. Brief Bioinform. 2013;14(6):671–683. doi: 10.1093/bib/bbs046. [DOI] [PubMed] [Google Scholar]
32.Fang Y, Tseng GC, Chang C. Heavy-tailed distribution for combining dependent $p$ values with asymptotic robustness. arXiv:2103.12967 (2021).
33.Sun S, Lulla A, Sioda M, Winglee K, Wu MC, Jacobs DR, Jr, Shikany JM, Lloyd-Jones DM, Launer LJ, Fodor AA, et al. Gut microbiota composition and blood pressure: the cardia study. Hypertension. 2019;73(5):998–1006. doi: 10.1161/HYPERTENSIONAHA.118.12109. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Friedman GD, Cutter GR, Donahue RP, Hughes GH, Hulley SB, Jacobs DR, Jr, Liu K, Savage PJ. Cardia: study design, recruitment, and some characteristics of the examined subjects. J Clin Epidemiol. 1988;41(11):1105–1116. doi: 10.1016/0895-4356(88)90080-7. [DOI] [PubMed] [Google Scholar]
35.Song X, Li G, Zhou Z, Wang X, Ionita-Laza I, Wei Y. Qrank: a novel quantile regression tool for eqtl discovery. Bioinformatics. 2017;33(14):2123–2130. doi: 10.1093/bioinformatics/btx119. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Liu H, Ling W, Hua X, Moon J-Y, Williams-Nguyen JS, Zhan X, Plantinga AM, Zhao N, Zhang A, Knight R, et al. Kernel-based genetic association analysis for microbiome phenotypes identifies host genetic drivers of beta-diversity. bioRxiv (2021) [DOI] [PMC free article] [PubMed]
37.Shlyakhter I, Sabeti PC, Schaffner SF. Cosi2: an efficient simulator of exact and approximate coalescent with selection. Bioinformatics. 2014;30(23):3427–3429. doi: 10.1093/bioinformatics/btu562. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Charlson ES, Chen J, Custers-Allen R, Bittinger K, Li H, Sinha R, Hwang J, Bushman FD, Collman RG. Disordered microbial communities in the upper respiratory tract of cigarette smokers. PLoS ONE. 2010;5(12):15216. doi: 10.1371/journal.pone.0015216. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Zhan X, Zhao N, Plantinga A, Thornton TA, Conneely KN, Epstein MP, Wu MC. Powerful genetic association analysis for common or rare variants with high-dimensional structured traits. Genetics. 2017;206(4):1779–1790. doi: 10.1534/genetics.116.199646. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Zhan X, Plantinga A, Zhao N, Wu MC. A fast small-sample kernel independence test for microbiome community-level association analysis. Biometrics. 2017;73(4):1453–1463. doi: 10.1111/biom.12684. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Hensley-McBain T, Wu MC, Manuzak JA, Cheu RK, Gustin A, Driscoll CB, Zevin AS, Miller CJ, Coronado E, Smith E, et al. Increased mucosal neutrophil survival is associated with altered microbiota in hiv infection. PLoS Pathog. 2019;15(4):1007672. doi: 10.1371/journal.ppat.1007672. [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, Huttenhower C. Metagenomic biomarker discovery and explanation. Genome Biol. 2011;12(6):1–18. doi: 10.1186/gb-2011-12-6-r60. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Paulson JN, Stine OC, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat Methods. 2013;10(12):1200–1202. doi: 10.1038/nmeth.2658. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M, Lennon NJ, Livak KJ, Mikkelsen TS, Rinn JL. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32(4):381–386. doi: 10.1038/nbt.2859. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for rna-seq data with deseq2. Genome Biol. 2014;15(12):1–21. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Hawinkel S, Mattiello F, Bijnens L, Thas O. A broken promise: microbiome differential abundance methods do not control the false discovery rate. Brief Bioinform. 2019;20(1):210–221. doi: 10.1093/bib/bbx104. [DOI] [PubMed] [Google Scholar]
47.Ferreira JA, Fuentes S. Some comments on certain statistical aspects of the study of the microbiome. Brief Bioinform. 2020;21(4):1487–1494. doi: 10.1093/bib/bbz077. [DOI] [PubMed] [Google Scholar]
48.Egozcue JJ, Pawlowsky-Glahn V, Mateu-Figueras G, Barcelo-Vidal C. Isometric logratio transformations for compositional data analysis. Math Geol. 2003;35(3):279–300. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

12859_2023_5147_MOESM1_ESM.pdf^{(74.3KB, pdf)}

Additional file 1 Figure on histograms of some simulated abundances vs. true abundances.

Data Availability Statement

[CR1] 1.Turnbaugh PJ, Ley RE, Mahowald MA, Magrini V, Mardis ER, Gordon JI. An obesity-associated gut microbiome with increased capacity for energy harvest. Nature. 2006;444(7122):1027–1031. doi: 10.1038/nature05414. [DOI] [PubMed] [Google Scholar]

[CR2] 2.John GK, Mullin GE. The gut microbiome and obesity. Curr Oncol Rep. 2016;18(7):1–7. doi: 10.1007/s11912-016-0528-7. [DOI] [PubMed] [Google Scholar]

[CR3] 3.Maruvada P, Leone V, Kaplan LM, Chang EB. The human microbiome and obesity: moving beyond associations. Cell Host Microbe. 2017;22(5):589–599. doi: 10.1016/j.chom.2017.10.005. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Hartstra AV, Bouter KE, Bäckhed F, Nieuwdorp M. Insights into the role of the microbiome in obesity and type 2 diabetes. Diabetes Care. 2015;38(1):159–165. doi: 10.2337/dc14-0769. [DOI] [PubMed] [Google Scholar]

[CR5] 5.Komaroff AL. The microbiome and risk for obesity and diabetes. JAMA. 2017;317(4):355–356. doi: 10.1001/jama.2016.20099. [DOI] [PubMed] [Google Scholar]

[CR6] 6.Vallianou NG, Stratigou T, Tsagarakis S. Microbiome and diabetes: Where are we now? Diabetes Res Clin Pract. 2018;146:111–118. doi: 10.1016/j.diabres.2018.10.008. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Saxena D, Li Y, Yang L, Pei Z, Poles M, Abrams WR, Malamud D. Human microbiome and HIV/AIDS. Curr HIV/AIDS Rep. 2012;9(1):44–51. doi: 10.1007/s11904-011-0103-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Bandera A, De Benedetto I, Bozzi G, Gori A. Altered gut microbiome composition in HIV infection: causes, effects and potential intervention. Curr Opin HIV AIDS. 2018;13(1):73–80. doi: 10.1097/COH.0000000000000429. [DOI] [PubMed] [Google Scholar]

[CR9] 9.Desai SN, Landay AL. HIV and aging: role of the microbiome. Curr Opin HIV AIDS. 2018;13(1):22–27. doi: 10.1097/COH.0000000000000433. [DOI] [PubMed] [Google Scholar]

[CR10] 10.Mallick H, Rahnavard A, McIver LJ, Ma S, Zhang Y, Nguyen LH, Tickle TL, Weingart G, Ren B, Schwager EH, et al. Multivariable association discovery in population-scale meta-omics studies. PLoS Comput Biol. 2021;17(11):1009442. doi: 10.1371/journal.pcbi.1009442. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR11] 11.Zhou H, He K, Chen J, Zhang X. Linda: linear models for differential abundance analysis of microbiome compositional data. Genome Biol. 2022;23(1):1–23. doi: 10.1186/s13059-022-02655-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR12] 12.Kim KJ, Park J, Park S-C, Won S. Phylogenetic tree-based microbiome association test. Bioinformatics. 2020;36(4):1000–1006. doi: 10.1093/bioinformatics/btz686. [DOI] [PubMed] [Google Scholar]

[CR13] 13.Huang C, Callahan BJ, Wu MC, Holloway ST, Brochu H, Lu W, Peng X, Tzeng J-Y. Phylogeny-guided microbiome otu-specific association test (post). 2021. [DOI] [PMC free article] [PubMed]

[CR14] 14.Hu T, Gallins P, Zhou Y-H. A zero-inflated beta-binomial model for microbiome data analysis. Stat. 2018;7(1):185. doi: 10.1002/sta4.185. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR15] 15.Ai D, Pan H, Li X, Gao Y, Liu G, Xia LC. Identifying gut microbiota associated with colorectal cancer using a zero-inflated lognormal model. Front Microbiol. 2019;10:826. doi: 10.3389/fmicb.2019.00826. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR16] 16.Ling W, Zhao N, Plantinga AM, Launer LJ, Fodor AA, Meyer KA, Wu MC. Powerful and robust non-parametric association testing for microbiome data via a zero-inflated quantile approach (zinq) Microbiome. 2021;9(1):1–19. doi: 10.1186/s40168-021-01129-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR17] 17.Zhao N, Chen J, Carroll IM, Ringel-Kulka T, Epstein MP, Zhou H, Zhou JJ, Ringel Y, Li H, Wu MC. Testing in microbiome-profiling studies with Mirkat, the microbiome regression-based kernel association test. Am J Human Genet. 2015;96(5):797–807. doi: 10.1016/j.ajhg.2015.04.003. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR18] 18.Zhan X, Tong X, Zhao N, Maity A, Wu MC, Chen J. A small-sample multivariate kernel machine test for microbiome association studies. Genet Epidemiol. 2017;41(3):210–220. doi: 10.1002/gepi.22030. [DOI] [PubMed] [Google Scholar]

[CR19] 19.Zhan X, Xue L, Zheng H, Plantinga A, Wu MC, Schaid DJ, Zhao N, Chen J. A small-sample kernel association test for correlated data with application to microbiome association studies. Genet Epidemiol. 2018;42(8):772–782. doi: 10.1002/gepi.22160. [DOI] [PubMed] [Google Scholar]

[CR20] 20.Koh H, Li Y, Zhan X, Chen J, Zhao N. A distance-based kernel association test based on the generalized linear mixed model for correlated microbiome studies. Front Genet. 2019;10:458. doi: 10.3389/fgene.2019.00458. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR21] 21.Chen J, Bittinger K, Charlson ES, Hoffmann C, Lewis J, Wu GD, Collman RG, Bushman FD, Li H. Associating microbiome composition with environmental covariates using generalized unifrac distances. Bioinformatics. 2012;28(16):2106–2113. doi: 10.1093/bioinformatics/bts342. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Zhang Y, Han SW, Cox LM, Li H. A multivariate distance-based analytic framework for microbial interdependence association test in longitudinal study. Genet Epidemiol. 2017;41(8):769–778. doi: 10.1002/gepi.22065. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Zhang J, Wei Z, Chen J. A distance-based approach for testing the mediation effect of the human microbiome. Bioinformatics. 2018;34(11):1875–1883. doi: 10.1093/bioinformatics/bty014. [DOI] [PubMed] [Google Scholar]

[CR24] 24.Pan AY. Statistical analysis of microbiome data: the challenge of sparsity. Curr Opin Endoc Metab Res. 2021;19:35–40. [Google Scholar]

[CR25] 25.Chen L, Reeve J, Zhang L, Huang S, Wang X, Chen J. Gmpr: a robust normalization method for zero-inflated count data with application to microbiome sequencing data. PeerJ. 2018;6:4600. doi: 10.7717/peerj.4600. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR26] 26.Weiss S, Xu ZZ, Peddada S, Amir A, Bittinger K, Gonzalez A, Lozupone C, Zaneveld JR, Vázquez-Baeza Y, Birmingham A, et al. Normalization and microbial differential abundance strategies depend upon data characteristics. Microbiome. 2017;5(1):1–18. doi: 10.1186/s40168-017-0237-y. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Lin H, Peddada SD. Analysis of microbial compositions: a review of normalization and differential abundance analysis. NPJ Biofilms Microbiomes. 2020;6(1):1–13. doi: 10.1038/s41522-020-00160-w. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR28] 28.Liu Y, Xie J. Cauchy combination test: a powerful test with analytic p-value calculation under arbitrary dependency structures. J Am Stat Assoc. 2020;115(529):393–402. doi: 10.1080/01621459.2018.1554485. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR29] 29.Flynn S, Reen FJ, Caparrós-Martín JA, Woods DF, Peplies J, Ranganathan SC, Stick SM, O’Gara F. Bile acid signal molecules associate temporally with respiratory inflammation and microbiome signatures in clinically stable cystic fibrosis patients. Microorganisms. 2020;8(11):1741. doi: 10.3390/microorganisms8111741. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR30] 30.McMurdie PJ, Holmes S. Waste not, want not: why rarefying microbiome data is inadmissible. PLoS Comput Biol. 2014;10(4):1003531. doi: 10.1371/journal.pcbi.1003531. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR31] 31.Dillies M-A, Rau A, Aubert J, Hennequet-Antier C, Jeanmougin M, Servant N, Keime C, Marot G, Castel D, Estelle J, et al. A comprehensive evaluation of normalization methods for illumina high-throughput rna sequencing data analysis. Brief Bioinform. 2013;14(6):671–683. doi: 10.1093/bib/bbs046. [DOI] [PubMed] [Google Scholar]

[CR32] 32.Fang Y, Tseng GC, Chang C. Heavy-tailed distribution for combining dependent $p$ values with asymptotic robustness. arXiv:2103.12967 (2021).

[CR33] 33.Sun S, Lulla A, Sioda M, Winglee K, Wu MC, Jacobs DR, Jr, Shikany JM, Lloyd-Jones DM, Launer LJ, Fodor AA, et al. Gut microbiota composition and blood pressure: the cardia study. Hypertension. 2019;73(5):998–1006. doi: 10.1161/HYPERTENSIONAHA.118.12109. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Friedman GD, Cutter GR, Donahue RP, Hughes GH, Hulley SB, Jacobs DR, Jr, Liu K, Savage PJ. Cardia: study design, recruitment, and some characteristics of the examined subjects. J Clin Epidemiol. 1988;41(11):1105–1116. doi: 10.1016/0895-4356(88)90080-7. [DOI] [PubMed] [Google Scholar]

[CR35] 35.Song X, Li G, Zhou Z, Wang X, Ionita-Laza I, Wei Y. Qrank: a novel quantile regression tool for eqtl discovery. Bioinformatics. 2017;33(14):2123–2130. doi: 10.1093/bioinformatics/btx119. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR36] 36.Liu H, Ling W, Hua X, Moon J-Y, Williams-Nguyen JS, Zhan X, Plantinga AM, Zhao N, Zhang A, Knight R, et al. Kernel-based genetic association analysis for microbiome phenotypes identifies host genetic drivers of beta-diversity. bioRxiv (2021) [DOI] [PMC free article] [PubMed]

[CR37] 37.Shlyakhter I, Sabeti PC, Schaffner SF. Cosi2: an efficient simulator of exact and approximate coalescent with selection. Bioinformatics. 2014;30(23):3427–3429. doi: 10.1093/bioinformatics/btu562. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.Charlson ES, Chen J, Custers-Allen R, Bittinger K, Li H, Sinha R, Hwang J, Bushman FD, Collman RG. Disordered microbial communities in the upper respiratory tract of cigarette smokers. PLoS ONE. 2010;5(12):15216. doi: 10.1371/journal.pone.0015216. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR39] 39.Zhan X, Zhao N, Plantinga A, Thornton TA, Conneely KN, Epstein MP, Wu MC. Powerful genetic association analysis for common or rare variants with high-dimensional structured traits. Genetics. 2017;206(4):1779–1790. doi: 10.1534/genetics.116.199646. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Zhan X, Plantinga A, Zhao N, Wu MC. A fast small-sample kernel independence test for microbiome community-level association analysis. Biometrics. 2017;73(4):1453–1463. doi: 10.1111/biom.12684. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Hensley-McBain T, Wu MC, Manuzak JA, Cheu RK, Gustin A, Driscoll CB, Zevin AS, Miller CJ, Coronado E, Smith E, et al. Increased mucosal neutrophil survival is associated with altered microbiota in hiv infection. PLoS Pathog. 2019;15(4):1007672. doi: 10.1371/journal.ppat.1007672. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR42] 42.Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, Huttenhower C. Metagenomic biomarker discovery and explanation. Genome Biol. 2011;12(6):1–18. doi: 10.1186/gb-2011-12-6-r60. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Paulson JN, Stine OC, Bravo HC, Pop M. Differential abundance analysis for microbial marker-gene surveys. Nat Methods. 2013;10(12):1200–1202. doi: 10.1038/nmeth.2658. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Trapnell C, Cacchiarelli D, Grimsby J, Pokharel P, Li S, Morse M, Lennon NJ, Livak KJ, Mikkelsen TS, Rinn JL. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat Biotechnol. 2014;32(4):381–386. doi: 10.1038/nbt.2859. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for rna-seq data with deseq2. Genome Biol. 2014;15(12):1–21. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR46] 46.Hawinkel S, Mattiello F, Bijnens L, Thas O. A broken promise: microbiome differential abundance methods do not control the false discovery rate. Brief Bioinform. 2019;20(1):210–221. doi: 10.1093/bib/bbx104. [DOI] [PubMed] [Google Scholar]

[CR47] 47.Ferreira JA, Fuentes S. Some comments on certain statistical aspects of the study of the microbiome. Brief Bioinform. 2020;21(4):1487–1494. doi: 10.1093/bib/bbz077. [DOI] [PubMed] [Google Scholar]

[CR48] 48.Egozcue JJ, Pawlowsky-Glahn V, Mateu-Figueras G, Barcelo-Vidal C. Isometric logratio transformations for compositional data analysis. Math Geol. 2003;35(3):279–300. [Google Scholar]

PERMALINK

Accommodating multiple potential normalizations in microbiome associations studies

Hoseung Song

Wodan Ling

Ni Zhao

Anna M Plantinga

Courtney A Broedlow

Nichole R Klatt

Tiffany Hensley-McBain

Michael C Wu

Abstract

Background

Results

Conclusions

Supplementary Information

Introduction and motivation

Methods

Common normalization approaches

Combining results across normalizations via the cauchy combination test

Simulation setup

Results

Type I error

Table 1.

Table 2.

Table 3.

Power

Table 4.

Table 5.

Table 6.

Fig. 1.

Real data application

Table 7.

Fig. 2.

Fig. 3.

Discussion

Supplementary Information

Acknowledgements

Abbreviations

Author Contributions

Funding

Availability of data and materials

Declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Footnotes

Contributor Information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases