ABSTRACT
The increasing use of metagenomic sequencing (MGS) for microbiome analysis has significantly advanced our understanding of microbial communities and their roles in various biological processes, including human health, environmental cycling, and disease. However, the inherent compositionality of MGS data, where the relative abundance of each taxon depends on the abundance of all other taxa, complicates the measurement of individual taxa and the interpretation of microbiome data. Here, we describe an experimental design that incorporates exogenous internal standards in routine MGS analyses to correct for compositional distortions. A mathematical framework was developed for using the observed internal standard relative abundance to calculate “Scaled Abundances” for native taxa that were (i) independent of sample composition and (ii) directly proportional to actual biological abundances. Through analysis of mock community and human gut microbiome samples, we demonstrate that Scaled Abundances outperformed traditional relative abundance measurements in both precision and accuracy and enabled reliable, quantitative comparisons of individual microbiome taxa across varied sample compositions and across a wide range of taxon abundances. By providing a pathway to accurate taxon quantification, this approach holds significant potential for advancing microbiome research, particularly in clinical and environmental health applications where precise microbial profiling is critical.
IMPORTANCE
Metagenomic sequencing (MGS) analysis has become central to modern characterizations of microbiome samples. However, the inherent compositionality of these analyses, where the relative abundance of each taxon depends on the abundance of all other taxa, often complicates interpretations of results. We present here an experimental design and corresponding mathematical framework that uses internal standards with routine MGS methods to correct for compositional distortions. We validate this approach for both amplicon and shotgun MGS analysis of mock communities and human gut microbiome (fecal) samples. By using internal standards to remove compositionality, we demonstrate significantly improved measurement accuracy and precision for quantification of taxon abundances. This approach is broadly applicable across a wide range of microbiome research applications.
KEYWORDS: compositionality, scaled abundance, internal standard, experimental design, metagenomic sequencing
INTRODUCTION
Over the last several decades, the decreasing cost and increasing throughput of next-generation sequencing (NGS) measurements have made metagenomic sequencing (MGS) characterization a default strategy for microbiome analyses (1–3). Owing to the ubiquity of naturally occurring microbiomes throughout natural and man-made environments, MGS analyses have correlated microbiomes, and their resident microbes, with a variety of important phenomena, including human, animal, and plant health, renewable resources, infrastructure degradation, environmental biogeochemical cycling, and waste remediation, among others (3–12). Particularly in the field of health, many connections have been postulated between the human microbiome and various health and disease conditions, such as obesity, gut health, autism, depression, or autoimmune disease (4, 6, 8, 13–18).
However, despite promising initial indications, many of the earliest correlations between microbiomes and health conditions have failed to bear fruit. For instance, questions have arisen about the relevance of the Firmicutes:Bacteroides ratio to obesity and gut health, about the existence of a “cancer microbiome,” and about the role of microbes in Autism (19–22). Even early in the Human Microbiome project, it was widely recognized that MGS results were much more reproducible (precise) than they were accurate (reflecting the underlying biology) (23, 24). Many comparisons have shown that analysis results vary significantly between samples and methodologies (25–30).
One of the complications with drawing conclusions from MGS results lies in the data’s inherent compositionality (25, 27, 30, 31). In short, the observed relative abundance of each taxon in a sample depends on both its actual abundance as well as the abundances of all other taxa in the sample. While aggregate compositional differences between samples can be accurately assessed, this severely limits the quantification of variations in the individual abundances of constituent taxa. Alternately, rigorous compositional data analysis strategies hold great promise for quantifying individual microbiome taxon abundances, correlating observed and actual abundances of constituent taxa, and allowing direct comparisons between microbiome samples with varied compositions (32).
A variety of ratiometric analysis strategies have been applied in the context of microbiome MGS to try to correct for data compositionality (32–35). One effective strategy has been to use ratios of pairs of taxon relative abundances within each sample to correct for the effects of compositionality (25, 36). Although this accurately accounts for compositionality, ratios of abundances of native taxa can be challenging to interpret and require that both taxa be present in all samples of interest. In wastewater biosurveillance for extracellular antimicrobial resistance genes by MGS, the routine addition of exogenous DNA as an internal standard has been demonstrated to improve MGS quantitation (37, 38).
In the current effort, we describe an experimental design for the routine inclusion of internal standards in microbiome MGS analyses. When genetic material from exogenous microbes is systematically added to samples, it serves as an internal reference standard to assess and mathematically correct for each sample’s composition. A rigorous framework for this analysis is provided along with several demonstrative examples. A previously reported systematic study of mock communities is reanalyzed here using the proposed internal standard framework. Then, a series of human gut microbiome samples are prepared and evaluated for two demonstration cases: (i) constant microbe concentrations with varied sample compositions; and (ii) systematically varied microbe concentrations with constant sample composition. In all cases, the results validate the proposed mathematical model for using internal standards to improve quantitative MGS analyses.
RESULTS
Internal standard and mathematical framework
In the current effort, we propose an experimental design where an exogenous microorganism is systematically added into each sample to be analyzed as an internal standard (Fig. 1).
Fig 1.
Experimental design specifying systematic addition of an exogenous internal standard microbe into every sample ahead of metagenomic sequencing analysis.
Following the inclusion of an internal standard, MGS results were analyzed to remove compositionality by comparing the relative abundance of each taxon to that of the internal standard. A mathematical framework was derived to use analysis of internal standards to improve MGS data (see Materials and Methods). We defined a new ‘Scaled Abundance’ metric that is calculated for each taxon using measured relative abundances and the known actual abundance of the internal standard (equation 4). These Scaled Abundances were mathematically predicted to be directly proportional to each taxon’s actual (biological) abundance, with a constant of proportionality based solely on the biases of that taxon and the internal standard (equation 5). Additionally, when a taxon of interest was present across multiple samples (analyzed using a common protocol), the bias ratio term remained constant, and the fold change in Scaled Abundance (ΔSA) was predicted to directly report the fold change in actual abundance (equation 6). The remainder of this manuscript demonstrates the validity and utility of equations 5 and 6 through a series of new analyses and experimental explorations.
Previously published data
In 2015, Brooks et al. systematically generated a series of 80 combinations of seven specific bacterial strains commonly associated with the vaginal microbiome (26). These diverse ‘mock community’ metagenomic samples were then analyzed using a common measurement pipeline, including DNA extraction, PCR amplification of the V1–V3 region of the rRNA gene, and NGS. We have reanalyzed this data set in the current framework by designating one added strain as an internal standard and using its specified abundance to calculate Scaled Abundances for the remaining taxa (Fig. 2).
Fig 2.
Previously published data from a series of microbial mock communities (26) were reanalyzed using the mathematical framework described herein by treating one strain as an internal standard (IS) for calculating Scaled Abundances (SAs). Strain abundances, as represented by relative abundance (RA) or SA, were plotted against their experimentally specified actual abundances (A). (Group B Strep was treated as the IS, so no SAs were calculated.) Linear regressions and 95% confidence bounds show best fit linear correlation models. The regression goodness-of-fits are summarized (B) as calculated coefficients of variation (CVs) for each regression. The Scaled Abundance for Group B Strep was not available (“NA”) because it was treated here as the internal standard. This procedure was generalized by systematically treating each strain as the IS, and calculating the average goodness-of-fit (and 95% confidence intervals) for remaining strains as measured by each metric (C).
As reported in the analysis from the original manuscript, measured relative abundance of individual taxa exhibited poor correlation with the actual abundances known from sample preparation (Fig. 2A, relative abundance). However, when we designated one of the taxa (e.g., Group B Strep) as an internal standard to correct for compositionality, the calculated Scaled Abundances for the remaining taxa were much better correlated with the known actual abundances (Fig. 2A, Scaled Abundances). Regressions for each taxon’s Scaled Abundance demonstrated direct proportionality with their actual abundances, as predicted by equation 5. The slopes of the Scaled Abundance regressions also varied between taxa, consistent with varying constants of proportionality in each case, as predicted by the bias ratios in equation 5.
For each taxon, the degree of agreement between the observed and actual abundances (goodness-of-fit) was summarized using a calculated coefficient of variation (CV, Fig. 2B). Using Group B Strep as the internal standard, the CVs for relative abundance measurements were quite large and consistently higher than the CVs for Scaled Abundances, presumably because the Scaled Abundance corrected for sample compositionality. Recognizing that any of the mock community taxa could have been treated as the internal standards, these analyses were repeated for each taxon used in the study. In each case, the average of CVs from the remaining taxa showed that Scaled Abundances, calculated using any taxa as the internal standard, consistently produced better correlation (lower CVs) between measured and actual abundances than relative abundances (Fig. 2C).
Testable Hypothesis 1: Scaled Abundance is independent of sample composition
In order to demonstrate that the measured Scaled Abundances were independent of sample composition, a series of samples were prepared wherein constant amounts of microbial DNA from four exogenous taxa were spiked into individual fecal samples exhibiting identical compositions (i.e., technical replicates) or varied compositions (Fig. 3). The concentrations of the spiked-in DNA were systematically varied by taxa across two orders of magnitude, and they were then uniformly added to each sample. For each sample, MGS analysis yielded relative abundance measurements which were used to calculate Scaled Abundances for the spiked-in taxa through comparison to an internal standard as described in equation 4.
Fig 3.
Metagenomic sequencing of diverse stool samples. Stacked barcharts (A) of the most abundant genera (colors) reveal high reproducibility between replicate analyses of a single stool sample, and significant compositional differences between stool samples from different donors. Known concentrations of DNA from four taxa were uniformly spiked-in, and their individual abundances (B) are plotted for the relative abundance (RelAbund) and Scaled Abundance (SA) metrics and grouped for comparisons between replicates and between distinct stools. The precision of these abundance measurements is tabulated (C) as calculated CVs. Data from the SA technical replicates were used to calculate bias ratios (±95% confidence interval), as described in equation 5. Using these bias ratios, the actual abundances for the “between stools” data were calculated from equation 5 and plotted (D) to show good correlation with ground truth spike-in concentrations (dotted lines). Shown here for 16S MGS analysis, the same samples were analyzed using a separate shotgun MGS analysis pipeline with substantially similar results (Fig. S1).
As expected for MGS analysis of fecal samples, good reproducibility was observed between replicate analyses, while significant compositional differences were observed between five different stool samples (Fig. 3A). However, when individual spike-in taxa were uniformly added to all samples, their measured relative abundances (Fig. 3B, RelAbund) were only consistent within a common sample composition. Even though the spiked-in DNA was known to be constant, the observed relative abundances between different fecal samples exhibited significant variation between different sample compositions. In contrast, Scaled Abundances calculated using an internal standard (Fig. 3B, ScaledAbund) exhibited consistency for both technical replicates and across differing sample compositions.
The precision of observed abundances (by relative abundance and Scaled Abundance) was calculated as the CV across each group of samples and for each of the spiked-in taxa (Fig. 3C). Within the measured relative abundances of technical replicates, CVs ranged 1–16%, with lower precision (i.e., CV > 10%) generally observed with lower spike-in concentrations. However, much higher variability in measured relative abundance (90–100% CV) was observed when taxa were spiked into varied sample compositions. In contrast, the calculated Scaled Abundances exhibited generally high precision (CV ≤ 16%) for all spiked-in taxa, both within technical replicates and between different sample compositions.
Accounting for bias
While the calculated Scaled Abundances succeed at accounting for differences between sample compositions, they are not without bias. Indeed, equation 5 predicts that the constant of proportionality between the Scaled Abundances for each taxon and their Actual Abundances arises from the ratio of the analytical biases between the specified taxa and the internal standard. Using the known Actual Abundances of each of the spike-in taxa, bias ratios were calculated from the technical replicate fecal samples (Fig. 3C). While the measured biases are expected to be highly protocol-specific, they were highly reproducible across replicate samples for a specified protocol. These bias ratios were then used to calculate actual abundances from the Scaled Abundance measurements for each distinct stool sample (Fig. 3D), and these calculated absolute abundances exhibited excellent agreement to ground truth spike-in amounts (dotted lines in Fig. 3D).
Alternately, was calculated by comparing Scaled Abundances between pairs of fecal samples, as described by equation 6. For the uniformly added spike-in taxa, the results hovered around unity for all of the uniformly added taxa and samples considered (Fig. 4A). The calculated for the spike-in taxa as well as the most abundant 10 native taxa are also tabulated (Fig. 4B). For native taxa, the fold changes varied significantly between pairs of samples, as expected for diverse fecal samples.
Fig 4.
for each of the spike-in taxa was calculated and plotted (A) for pairs of the stool samples shown in Fig. 3. Since the spike-ins were uniformly added to all samples, was expected to be unity (dotted line). The table (B) provides calculated for each of the spike-in taxa and the top 10 most abundant taxa quantified in all five stool samples. Shown here for 16S MGS analysis, the same samples were analyzed using a shotgun MGS analysis pipeline with substantially similar results (Fig. S2 and S3). An exhaustive list of all taxa and their measured values are also available (Table S1).
When the same fecal samples were analyzed using shotgun MGS (instead of 16S MGS), substantially similar results were attained (Fig. S2). In particular, the quantified values for native taxa between samples were nearly identical (Fig. S3 and Table S1).
Testable Hypothesis 2: Scaled Abundance is proportional to actual abundance
To further demonstrate the utility of an experimental design that includes internal standards, a series of samples were prepared wherein the sample composition was kept constant even while microbial abundances were systematically varied. This was accomplished by diluting a single fecal sample into Tris-EDTA buffer (Fig. 5A). As can be seen from the stacked bar charts of taxon relative abundances, as measured by shotgun MGS, sample composition remained unchanged across all samples, even while the Actual Abundances of native taxa were known to be decreasing based on sample preparation. This dilution series was then treated as a series of independent samples, into which the internal standard was uniformly added. For these samples, the dilution fraction served as a stand-in for taxon actual abundance when comparing between samples, and measured abundances were correlated with dilution fraction as described in equation 2a and equation 2b (relative abundance) and equation 5 (Scaled Abundance).
Fig 5.
Metagenomic sequencing of a human stool dilution series. Stacked bar charts (A) of the most abundant species (colors) for a systematic dilution of a single stool sample with TE buffer show a consistent sample composition. Regression showed that measured Scaled Abundance and actual stool concentration (B), for the nine most-abundance native taxa) were highly correlated. (Relative abundances for the same taxa exhibited poor correlations with stool concentration, as seen in Fig. S4.) Similar regressions were determined for all taxa native to the stool, and their goodness-of-fits were plotted (C) as a function of each taxon’s measured relative abundance. Across all taxa (blue smoothed fit, 95% confidence interval), these regressions revealed high correlation (CV ≤ 25%, horizontal red line) between measured Scaled Abundances and known actual abundances across a wide range of taxon relative abundances.
When individual Scaled Abundances of the most abundant native taxa were compared to each sample’s known fecal concentration, good correlations were observed, as predicted from equation 5 (Fig. 5B). In contrast, analysis of relative abundances for the same taxa produced poor correlations, which were generally indistinguishable from the null hypothesis of no correlation (Fig. S4).
While regressions only from the nine most abundant taxa in the stool sample are shown, linear fits of Scaled Abundance to fecal concentration were calculated for each native taxa, the goodness-of-fit was measured for each regression as a CV, and these CVs were plotted as a function of taxon relative abundance in whole stool (Fig. 5C). Across a wide range of taxon abundances (down to a relative abundance of 0.00003), the calculated Scaled Abundances were generally (blue trace in Fig. 5C) accurate in reporting changes to actual taxon abundances with high precision (CV ≤ 25%).
DISCUSSION
Internal standard and mathematical framework
Because the protocols used in MGS analyses vary substantially between research groups, it can be challenging to address specific measurement biases in ways that are generalizable. However, the strategy of incorporating internal standards into samples of interest (Fig. 1) allows direct correction for the particular biases of the protocol being used. Internal standards have been advantageously employed for other analytical methods such as fluorescence quantitation, gene arrays, and metabolomic analyses to correct for method- and sample-specific biases (39–41). For application in MGS data sets, the internal standard was readily distinguished from other taxa based on its unique genome sequence, and its measured relative abundance was used to correct for compositional distortion.
The analysis described herein focuses on a measurement pipeline that is held constant between samples. The mathematical model (described in Materials and Methods) explicitly assumes that each step of the MGS measurement pipeline (e.g., DNA extraction, library preparation, NGS, read trimming, and taxonomic assignment) contributes biases that vary between microorganisms and accumulate throughout the protocol steps, but remain constant between multiple samples analyzed within a locked-down (i.e., fully specified) method (25, 30). Thus, these individual biases could be aggregated using a single taxon-specific bias term, as shown in equation 1 and equation 2a and equation 2b (25).
The process saturation that leads to the denominator in equation 2a and equation 2b explicitly denotes the inherently compositional nature of MGS data sets, since the measured relative abundance for each taxon is inextricably tied to the relative abundance of all other taxa within the sample. So, for example, a measured increase in one taxon’s relative abundance could arise from either an increase in its actual biological abundance or from a decrease in the actual abundances of other taxa.
These two scenarios are phenomenologically distinct but generally indistinguishable experimentally, particularly as (i) the actual abundances and biases remain unknown for most taxa in naturally occurring samples and (ii) biases vary substantially and unpredictably between individual taxa and various common protocols (25, 30). Thus, while whole-sample comparisons based on measured relative abundances generally reflect aggregate compositional changes with some accuracy, at least within the context of locked-down protocols, the quantitative comparison of individual taxon relative abundances between samples is unreliable. Instead, compositional data analysis approaches developed for or adapted to MGS data sets often take advantage of the constancy of the denominator in equation 2a and equation 2b among all taxa within a sample to remove or diminish compositionality using ratios, as is shown in equation 3 (25).
Using the systematically added internal standard for the taxon comparison improves on equation 3 by removing compositionality without the ambiguity of referencing two native taxa that may change independently of each other between samples. Then, a new metric, Scaled Abundance, is defined by explicitly comparing each native taxon relative abundance to that of the internal standard and accounting for the known internal standard actual abundance. The result of this experimental and mathematical approach is a direct proportionality between each native taxon’s calculated Scaled Abundance and its actual abundance, which is independent of sample compositionality (equation 5).
In the common case where multiple samples are analyzed using a common protocol, the bias ratio from equation 5 is constant across samples, and the fold change in a taxon’s Scaled Abundance between pairs of samples becomes independent of the bias ratio and directly reports the fold change in actual abundance (equation 6). This mathematical framework is further demonstrated through several test cases.
Previously published data
The data from Brooks et al. was systematically collected, annotated, and made readily available, making this data set useful for reanalysis here (25, 26). For the purpose of evaluating the mathematical framework described here (see Materials and Methods section), we arbitrarily selected one of the strains added to various mock communities to treat as an internal standard with a known actual abundance in order to calculate Scaled Abundances for the remaining strains. The results for these Scaled abundances (Fig. 2) were entirely consistent with the predictions from equation 5, including improved correlation between measured and actual abundances, varied constants of proportionality (varied slopes between taxa in Fig. 2A, Scaled Abundance), and reduced variability overall among mock community compositions (Fig. 2B). Importantly for this reanalysis, it did not matter which strain was selected to be treated as an internal standard (Fig. 2C). Overall, this reanalysis demonstrated that using internal standards to calculate Scaled Abundances significantly improved MGS quantitation and the ability to accurately correlate individual taxon measurements with known actual abundances.
Testable Hypothesis 1: Scaled Abundance is independent of sample composition
In the first testable hypothesis, exogenous microbial DNA was systematically added to diverse fecal samples and analyzed to demonstrate that calculated Scaled Abundances were substantially independent of the sample compositionality (Fig. 3). For this evaluation, the precision of technical replicates (“Between Replicates”; CV ≤ 16%) was interpreted as the fundamental precision limit of the method employed. However, for Scaled Abundances, similar reproducibility was achieved even between diverse stool samples for all spiked-in DNA. This demonstrated, as predicted from equation 5 (see Materials and Methods section), that Scaled Abundances were independent of sample compositionality and achieved high precision across at least two orders of magnitude, and even at low relative abundances (RA ~ 0.0005).
An important implication of the presented mathematical framework (see Materials and Methods section) is that the calculation of Scaled Abundances is largely method independent. (While the bias particular bias terms in adata set are unique, the framework overall is generalizable.) This is demonstrated here in the application of Scaled Abundances to improve quantitation for 16S MGS data from the previously published data set (Fig. 2) and for 16S MGS data for Stool samples (Fig. 3). The samples here were also analyzed with a completely distinct protocol for shotgun MGS with substantially similar results (Fig. S1). Under each of these three distinct protocols, the specific bias terms in equation 5 vary, but the mathematical framework remains generalizable to improve the correlation between measured Scaled Abundances and taxon actual abundances.
Accounting for bias
Because taxon-specific biases arise from the particular protocol steps employed, the bias ratio in equation 5 was predicted to be constant within locked-down protocols, and the calculated Scaled Abundance metric was predicted to enable direct comparisons of abundances between samples, independent of differing sample compositions. There are two primary applications for using this Scaled Abundance metric. In the first scenario, taxa of interest may have been previously identified, isolated, and cultured. In this case, the native taxa of interest can be spiked into samples to allow calculation of the bias ratio in equation 5. This, in turn, allows the Scaled Abundance measured in unknown samples to be directly converted into accurately measured actual abundances. Using this approach bias, ratios for spiked-in DNA were calculated from technical replicates (Fig. 3C) and then used to calculate actual abundances across diverse stool samples with good agreement to known concentrations (Fig. 3D). Fig. S1C and S1D show similar analyses using shotgun MGS instead of 16S amplicon MGS.
However, for many native taxa, bias ratios cannot be independently determined for taxa of interest (e.g., unknown or unculturable taxa). Nevertheless, in this scenario, the biases remain constant between samples (analyzed using a common protocol), so fold changes in Scaled Abundance between samples still directly report the fold changes in actual abundance (as predicted in equation 6). This approach is highlighted in Fig. 4 and 5 ;(and Fig. S2, Table S1, and Fig. S3). As expected, the uniformly spiked-in DNA species exhibited no differences between pairs of stool samples (Fig. 4A), while native taxa demonstrated very different fold changes between the biologically distinct stool samples. Importantly, the quantified fold changes for all taxa were largely consistent even when comparing across 16S and shotgun MGS analyses (Table S1 and Fig. S3).
In both cases, this experimental design for the addition of internal standards and the mathematical framework to calculate Scaled Abundances provides a rigorous and quantitative strategy for directly comparing taxon actual abundances between samples in ways that help understand real changes in the underlying sample biology at the resolution of individual taxa.
Testable Hypothesis 2: Scaled Abundance is proportional to actual abundance
In a second testable hypothesis, systematic dilutions of human stool into PBS were treated as a series of distinct microbiome samples. For this series, the composition was held constant (Fig. 5A), while the actual abundances of constituent taxa were systematically varied. While the true biological abundances of native taxa in the samples remain unknown, the overall stool concentration (20–100%) provided a reasonable surrogate for correlating taxon actual abundances with calculated Scaled Abundances.
As predicted from equation 5, (i) strong correlations were observed between individual native taxa and their Scaled Abundances, and (ii) linear regressions exhibited varied slopes, consistent with a taxon-specific bias ratio (Fig. 5B). Additionally, CVs for each regression provided taxon-specific assessments of the predictive value of the Scaled Abundances. These CVs revealed generally good precision (CV ≤ 25%) across more than three orders of magnitude in observed relative abundance and showed a low limit of quantitation (relative abundance ~ 0.00003) in the current study.
Conclusions
This study introduces a new experimental design for improving the reliability of microbiome analyses by incorporating internal standards to account for compositional distortions inherent in MGS data. The mathematical framework developed herein demonstrates that the inclusion of an exogenous internal standard, whose actual abundance is measurable, allows for the calculation of taxon-specific Scaled Abundances” that are (i) independent of sample composition and (ii) directly proportional to actual biological abundances. This approach is consistent with prior ratiometric analysis strategies, with the added simplicity of accounting for compositionality through comparison to a single, well-characterized taxa systematically added at a known actual abundance.
For example, while conventional relative abundances can be used to detect compositional shifts among all taxa in aggregate and to categorize whole microbiome samples (e.g., case-control comparisons), the calculated Scaled Abundance described herein will enable researchers to evaluate individual constituent taxa and quantify changes in taxon actual abundances between samples. This ability to quantitatively interrogate individual taxa is central for hypothesis generation and understanding how microbial composition relates to observed microbiome function. In longitudinal studies with locked-down protocols, Scaled Abundances will enable accurate tracking of individual microbial abundance over time (e.g., following a defined intervention). Finally, by correcting for MGS compositionality, the framework described here opens new opportunities for quantitatively characterizing conserved sub-microbiomes found within larger, variable microbiome compositions (42).
Through rigorous analysis of previously published mock community data and freshly analyzed gut microbiome samples, we show that calculated Scaled Abundances outperform traditional relative abundance measurements in both precision and accuracy and demonstrate their potential for quantitative metagenomic profiling. Specifically, Scaled Abundances calculated for MGS characterization of mock communities agreed much better than raw relative abundances with taxon actual abundances across varied compositions. When exogenous taxa were spiked into fecal samples at known actual abundances spanning several orders of magnitude, the Scaled Abundance metric accurately reflected actual abundances and was wholly independent of sample composition. Furthermore, the use of internal standards allowed accurate comparisons of native taxa between samples, even when taxon-specific analytical biases could not be independently measured. Finally, high precision and accuracy were demonstrated by using Scaled Abundances to track actual abundances of native taxa across a series of stool dilutions, even for low-relative abundance taxa.
The results presented here validate the proposed experimental design and mathematical framework that uses routine, systematic addition of internal standards to microbiome samples to correct for compositionality in MGS analyses. This is particularly crucial in microbiome studies where variability in sample composition often complicates the interpretation of data. This approach is flexible and is shown to be applicable to both shotgun and amplicon-based sequencing methods, which further broadens its utility across a wide range of microbiome research areas. Moreover, by offering a pathway to accurate and reproducible abundance measurements at the level of individual taxa, this methodology could play a pivotal role in advancing microbiome research, particularly in clinical and environmental health contexts where precise taxon quantification is essential for modeling diverse biological phenomena.
MATERIALS AND METHODS
Mathematical framework
The methods of analysis of microbiome samples using MGS vary among researchers but invariably require a series of physical and bioinformatic manipulations, typically including: sample acquisition, DNA extraction, library preparation, and NGS, as well as read trimming and taxonomic assignment (1, 25, 29, 30). Importantly, each step contributes biases that vary between microorganisms and accumulate throughout the measurement pipeline (25, 30). For a locked-down method, where the procedure is tightly specified and highly reproducible, individual biases from multiple steps of an MGS protocol can be combined into a single aggregate bias term for each taxon that describes the general analytical response throughout the entire protocol (25):
| (1) |
However, this analytical response is not directly observable from MGS results due to distortion arising from process saturation. Since NGS analyses arbitrarily limit the total number of DNA molecules that are analyzed, the measured relative abundance for each taxon depends on both the analytical response of that taxon (equation 1) and the aggregate responses of all taxa in the sample:
| (2a) |
| (2b) |
Recognizing that the compositional denominator in equation 2a and equation 2b is identical for each taxon within a sample, the ratio of the relative abundances of two native taxa can be considered independent of the rest of the sample composition (25):
| (3) |
We proposed herein an experimental design where an exogenous microorganism was systematically added into each sample to be analyzed as an internal standard (Fig. 1). Thus, equation 3 was rearranged to calculate a ‘Scaled Abundance’ metric as the product of the internal standard actual abundance and the ratio of each native taxon’s relative abundance to the internal standard relative abundance.
| (4) |
Algebraic manipulation of equations 3 and 4 then predicted that Scaled Abundances for each taxon would then be directly proportional to that taxon’s actual abundance, with a constant of proportionality tied to the ratio of the biases for that taxon and the internal standard:
| (5) |
Finally, since taxon-specific biases were assumed to be constant between samples (analyzed using a common protocol), changes in the Scaled Abundance of taxa of interest were predicted to directly report changes in taxon actual abundance without the need to calculate bias ratios:
| (6) |
Mock community data analysis
Amplicon-based MGS analyses of 80 distinct mock communities of seven microbial taxa were downloaded from the original publication (26). The downloaded results provide relative abundances for each taxon in each mock community. These data were reanalyzed in R (version 4.3.2) here by treating “Group B Strep” as an internal standard to allow calculation of Scaled Abundances for the remaining taxa. (Mock communities that did not include Group B Strep were omitted from further analysis.) The specified dilutions from overnight stock (e.g., 1/2, 1/3, and 1/7) were taken as the known actual abundances for each taxon and correlated to measured relative abundances or Scaled Abundances. The goodness-of-fit of these regressions was summarized as CVs for each taxon. This procedure was repeated, treating alternate strains as the ‘internal standard’, and the average of CVs in each case was used to compare goodness-of-fits with respect to internal standard selection. The raw data and full code used for this analysis have been made publicly available (doi.org/10.18434/mds2-3760).https://data.nist.gov/od/id/mds2-3760
Human microbiome sample prep
The human microbiome samples used in this study were used previously in the Mosaic Standards Challenge and have been described previously (29, 30). In short, five fecal samples were collected from five different anonymized donors. Each sample was prepared by pooling and homogenizing multiple bowel movements from each donor, stabilizing the mixtures with Omnigene Gut Solution, and preparing 1 mL identical aliquots at a final concentration of 100 mg/mL. Fecal aliquots were stored at −80 C until ready for analysis. Dilution series were generated by mixing the thawed samples into the Tris-EDTA buffer.
Internal standard
The experimental design described here specified the systematic addition of an internal standard prior to the traditional MGS measurement pipeline. We selected DNA from Legionella pneumophila as an internal standard due to its absence in previous metagenomic characterizations of the stool samples. Additionally, this material was stable and had been well quantified by ddPCR previously (43). Genomic DNA from L. pneumophila was added uniformly to all samples for a final concentration of (2.16 × 105 ± 3.7 × 103) copies/mcL.
Spike-in DNA
For experiments where specified concentrations of microbial DNA were spiked-in to fecal samples (i.e., Testable Hypothesis #1), pure genomic microbial DNA was added from Acinetobacter baumanii, Vibrio furnissii, Neisseria meningitidis, and Aeromonas hydrophila. These DNA genomes were sourced from NIST RM8376 (43) and were added to fecal samples at the final concentrations of 4.8 × 104, 9.6 × 103, 5.4 × 103, and 6.2 × 102 copies/mcL, respectively.
MGS measurement pipeline
DNA extraction
DNA was extracted using the ZR Fecal DNA miniprep (cat# D6010) following the manufacturer’s protocol. Briefly, each sample was combined with 400 mcL of the lysis buffer and vortexed (MoBio Genie 2) for 20 min at full speed. Lysis tubes were centrifuged at 10,000 × g for 1 min, and the lysate was processed through the spin filter at 7,000 × g for 1 min. The filtrate was added to a 1.2 mL DNA binding buffer, mixed by pipetting, and centrifuged in batches at 10,000 × g through the spin column. The bound DNA was washed and eluted into 150 mcL of elution buffer. The extracted DNA was quantified by fluorescence using the DeNovix DS-11FX fluorometer with the DeNovix dsDNA High Sensitivity kit (Catalog # KIT-DADNA-HIGH-2).
Library preparation
NGS libraries were prepared for both shotgun and 16S amplicon analyses. For shotgun sequencing, extracted DNA was fragmented, amplified, and barcoded using the Nextera XT DNA Library Preparation Kit (Illumina) and Nextera XT Index kit V2 (Illumina, catalog #15052163) as specified by manufacturer protocols. For amplicon sequencing, the 16S rRNA gene was amplified using primers for the V4 variable region sourced from IDT (10 mcM RxnRead Primer Pool). 1mcL of the primer pool was combined with 12.5 mcL of Kapa HiFi HotStart readymix (KapaBiosystems, Catalog # 07958935001) and 12 ng extracted DNA in a final volume of 25 mcL for PCR (initial denaturation (180 s at 95°C), 18 cycles of denaturing (30 s at 98°C), annealing (15 s at 55°C), and elongation (20 s at 72.0°C), and a final extension (300 s at 72.0°C). The 16S amplicons were purified using SPRIselect beads (Beckman Coulter, Catalog # B23318) at a 0.8:1 ratio of Beads:amplicon, washing beads with molecular-grade ethanol and resuspended in pure water. Finally, barcodes were added (Nextera XT Index kit V2) as specified by manufacturer protocols. For both shotgun and amplicon library prep, samples were quantified by fluorescence (DeNovix) for DNA yield, and 10 ng from each sample was pooled for sequencing.
Sequencing
Pooled libraries were quantified by fluorescence, diluted to 4 nM, and denatured following manufacturer protocol (Document # 15039740 v10, Protocol A). Denatured libraries were further diluted to 12 pM, combined with a 5% PhiX (V3 cat# 15017666 from Illumina), and sequenced by paired-end sequencing on an Illumina MiSeq (MiSeq Reagent Kit v3 600-cycle, cat #: MS-102-3003).
Initial bioinformatic analysis
Demultiplexing and adapter trimming was completed as part of the Illumina MiSeq Generate FASTQ workflow. Fastq files were analyzed separately for shotgun and amplicon sequencing, as described subsequently. The final output of both shotgun and amplicon bioinformatic analysis was a table specifying the taxa observed in each sample and their measured relative abundances.
For shotgun sequencing, BBduk (38.90) (44) was used to quality filter the raw data, and paired-end sample data were analyzed using Centrifuge (45) with the Web of Life database (46). The bash script files have been made publicly available (doi.org/10.18434/mds2-3760).
For amplicon sequencing, Cutadapt (2.8) (47) was used to remove primer sequences, and DADA2 (1.20.0) (48) was used to account for sequencing errors, determine absolute sequence variants, measure relative abundances, and assign taxonomy based on the Silva database (version 132) (49). The raw data and code (R, version 4.3.2) (50) used for amplicon sequencing analysis have been made publicly available (doi.org/10.18434/mds2-3760).
Scaled Abundance calculations
The definitions and mathematical framework for calculating the Scaled Abundance metric from tables of identified taxa from each sample and their measured relative abundances is provided in the “Mathematical framework” section of Materials and Methods. These operations were implemented in R (version 4.3.2), and all raw data and the code have been made publicly available (doi.org/10.18434/mds2-3760).
ACKNOWLEDGMENTS
Certain commercial equipment, instruments, or materials are identified in this paper to foster understanding. Such identification does not imply recommendation or endorsement by the National Institute of Standards and Technology, nor does it imply that the materials or equipment identified are necessarily the best available for the purpose.
S.P.F. conceptualized the experiments with input from S.L.S. and J.G.K. Experiments were designed by S.P.F. and S.L.S. and executed by M.E.H. and J.N.D. Initial bioinformatic analysis was done by J.G.K. and S.L.S., and Scaled Abundances calculations and final bioinformatic analyses were done by S.P.F. S.P.F. wrote the manuscript and prepared the figures. In writing this manuscript, ChatGPT was used to generate initial drafts of the Conclusions section and abstract that were then edited extensively by the authors. All authors read and approved the final manuscript.
Contributor Information
Samuel P. Forry, Email: sam.forry@nist.gov.
Isaac Cann, University of Illinois Urbana-Champaign, Urbana, Illinois, USA.
DATA AVAILABILITY
All raw data and analysis code described in this paper have been made publicly available (doi.org/10.18434/mds2-3760).
ETHICS APPROVAL
All work was reviewed and approved by the U.S. National Institute of Standards and Technology (NIST) Research Protections Office. This study (protocol # MML-2019-0135) was determined to be “not human subjects research” as defined in the Common Rule (45 CFR 46, Subpart A).
SUPPLEMENTAL MATERIAL
The following material is available online at https://doi.org/10.1128/aem.01126-25.
Figures S1 to S4; Table S1.
ASM does not own the copyrights to Supplemental Material that may be linked to, or accessed through, an article. The authors have granted ASM a non-exclusive, world-wide license to publish the Supplemental Material files. Please contact the corresponding author directly for reuse.
REFERENCES
- 1. Clooney AG, Fouhy F, Sleator RD, O’ Driscoll A, Stanton C, Cotter PD, Claesson MJ. 2016. Comparing apples and oranges?: next generation sequencing and its impact on microbiome analysis. PLoS One 11:e0148028. doi: 10.1371/journal.pone.0148028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. de la Cuesta-Zuluaga J, Escobar JS. 2016. Considerations for optimizing microbiome analysis using a marker gene. Front Nutr 3:26. doi: 10.3389/fnut.2016.00026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Bharti R, Grimm DG. 2021. Current challenges and best-practice protocols for microbiome analysis. Brief Bioinform 22:178–193. doi: 10.1093/bib/bbz155 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Sorboni SG, Moghaddam HS, Jafarzadeh-Esfehani R, Soleimanpour S. 2022. A comprehensive review on the role of the gut microbiome in human neurological disorders. Clin Microbiol Rev 35:e0033820. doi: 10.1128/CMR.00338-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Zhao H, Yang M, Fan X, Gui Q, Yi H, Tong Y, Xiao W. 2024. A metagenomic investigation of potential health risks and element cycling functions of bacteria and viruses in wastewater treatment plants. Viruses 16:535. doi: 10.3390/v16040535 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Hou K, Wu Z-X, Chen X-Y, Wang J-Q, Zhang D, Xiao C, Zhu D, Koya JB, Wei L, Li J, Chen Z-S. 2022. Microbiota in health and diseases. Signal Transduct Target Ther 7:135. doi: 10.1038/s41392-022-00974-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Nwachukwu BC, Babalola OO. 2022. Metagenomics: a tool for exploring key microbiome with the potentials for improving sustainable agriculture. Front Sustain Food Syst 6. doi: 10.3389/fsufs.2022.886987 [DOI] [Google Scholar]
- 8. Afzaal M, Saeed F, Shah YA, Hussain M, Rabail R, Socol CT, Hassoun A, Pateiro M, Lorenzo JM, Rusu AV, Aadil RM. 2022. Human gut microbiota in health and disease: unveiling the relationship. Front Microbiol 13:999001. doi: 10.3389/fmicb.2022.999001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Lawson CE. 2021. Retooling microbiome engineering for a sustainable future. mSystems 6:101128msystems0092521. doi: 10.1128/msystems.00925-21 [DOI] [Google Scholar]
- 10. Bruno A, Fumagalli S, Ghisleni G, Labra M. 2022. The microbiome of the built environment: the nexus for urban regeneration for the cities of tomorrow. Microorganisms 10:2311. doi: 10.3390/microorganisms10122311 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Bosch TCG, Wigley M, Colomina B, Bohannan B, Meggers F, Amato KR, Azad MB, Blaser MJ, Brown K, Dominguez-Bello MG, et al. 2024. The potential importance of the built-environment microbiome and its impact on human health. Proc Natl Acad Sci U S A 121:e2313971121. doi: 10.1073/pnas.2313971121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Nandy S, Kapley A. 2024. Unraveling the potential of microbial communities for lake bioremediation via the metagenomics tool: a review. AQUA — Water Infrastructure, Ecosystems and Society 73:11–33. doi: 10.2166/aqua.2024.154 [DOI] [Google Scholar]
- 13. Taniya MA, Chung H-J, Al Mamun A, Alam S, Aziz MA, Emon NU, Islam MM, Hong S-TS, Podder BR, Ara Mimi A, Aktar Suchi S, Xiao J. 2022. Role of gut microbiome in autism spectrum disorder and its therapeutic regulation. Front Cell Infect Microbiol 12:915701. doi: 10.3389/fcimb.2022.915701 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Kho ZY, Lal SK. 2018. The human gut microbiome - a potential controller of wellness and disease. Front Microbiol 9:1835. doi: 10.3389/fmicb.2018.01835 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Qiu P, Ishimoto T, Fu L, Zhang J, Zhang Z, Liu Y. 2022. The gut microbiota in inflammatory bowel disease. Front Cell Infect Microbiol 12:733992. doi: 10.3389/fcimb.2022.733992 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Shaikh SD, Sun N, Canakis A, Park WY, Weber HC. 2023. Irritable bowel syndrome and the gut microbiome: a comprehensive review. J Clin Med 12:2558. doi: 10.3390/jcm12072558 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Akinsulie OC, Olowu BI, Adesola RO, Adenaya A, Banwo OG, Ugwu CE, Idris I, Babawale P, Akande QA, Olukogbe OO, Shahzad S, Akinsulie JM. 2024. Complex interaction between gut microbiome and autoimmunity: focus on antiphospholipid syndrome. Bacteria 3:330–343. doi: 10.3390/bacteria3040022 [DOI] [Google Scholar]
- 18. Cheung SG, Goldenthal AR, Uhlemann A-C, Mann JJ, Miller JM, Sublette ME. 2019. Systematic review of gut microbiota and major depression. Front Psychiatry 10:34. doi: 10.3389/fpsyt.2019.00034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Offord C. 2023. “Major” errors may undermine research that identified cancers based on their microbes. AAAS 381:590–591. doi: 10.1126/science.adk2103 [DOI] [Google Scholar]
- 20. Cho KY. 2023. Association of gut microbiota with obesity in children and adolescents. Clin Exp Pediatr 66:148–154. doi: 10.3345/cep.2021.01837 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Houtman TA, Eckermann HA, Smidt H, de Weerth C. 2022. Gut microbiota and BMI throughout childhood: the role of firmicutes, bacteroidetes, and short-chain fatty acid producers. Sci Rep 12:3140. doi: 10.1038/s41598-022-07176-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Vasileva S, Yap CX, Whitehouse AJO, Gratten J, Eyles D. 2024. Absence of association between maternal adverse events and long-term gut microbiome outcomes in the Australian autism biobank. Brain Behav Immun Health 39:100814. doi: 10.1016/j.bbih.2024.100814 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Sinha R., Abnet CC, White O, Knight R, Huttenhower C. 2015. The microbiome quality control project: baseline study design and future directions. Genome Biol 16:276. doi: 10.1186/s13059-015-0841-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Sinha Rashmi, Abu-Ali G, Vogtmann E, Fodor AA, Ren B, Amir A, Schwager E, Crabtree J, Ma S, Microbiome Quality Control Project Consortium, Abnet CC, Knight R, White O, Huttenhower C. 2017. Assessment of variation in microbial community amplicon sequencing by the Microbiome Quality Control (MBQC) project consortium. Nat Biotechnol 35:1077–1086. doi: 10.1038/nbt.3981 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25. McLaren MR, Willis AD, Callahan BJ. 2019. Consistent and correctable bias in metagenomic sequencing experiments. eLife 8:e46923. doi: 10.7554/eLife.46923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Brooks JP, Edwards DJ, Harwich MD Jr, Rivera MC, Fettweis JM, Serrano MG, Reris RA, Sheth NU, Huang B, Girerd P, Vaginal Microbiome Consortium, Strauss JF 3rd, Jefferson KK, Buck GA. 2015. The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies. BMC Microbiol 15:66. doi: 10.1186/s12866-015-0351-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. 2017. Microbiome datasets are compositional: and this is not optional. Front Microbiol 8:2224. doi: 10.3389/fmicb.2017.02224 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Nearing JT, Comeau AM, Langille MGI. 2021. Identifying biases and their potential solutions in human microbiome studies. Microbiome 9:113. doi: 10.1186/s40168-021-01059-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Forry SP, Servetas SL, Kralj JG, Soh K, Hadjithomas M, Cano R, Carlin M, Amorim MG de, Auch B, Bakker MG, et al. 2024. Variability and bias in microbiome metagenomic sequencing: an interlaboratory study comparing experimental protocols. Sci Rep 14:9785. doi: 10.1038/s41598-024-57981-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Forry SP, Servetas SL, Dootz JN, Hunter ME, Kralj JG, Filliben JJ, Jackson SA. 2025. A sensitivity analysis of methodological variables associated with microbiome measurements. Microbiol Spectr 13:e0069624. doi: 10.1128/spectrum.00696-24 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Kumar MS, Slud EV, Okrah K, Hicks SC, Hannenhalli S, Corrada Bravo H. 2018. Analysis and correction of compositional bias in sparse sequencing count data. BMC Genomics 19:799. doi: 10.1186/s12864-018-5160-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Nearing JT, Douglas GM, Hayes MG, MacDonald J, Desai DK, Allward N, Jones CMA, Wright RJ, Dhanani AS, Comeau AM, Langille MGI. 2022. Microbiome differential abundance methods produce different results across 38 datasets. Nat Commun 13:342. doi: 10.1038/s41467-022-28034-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33. Fernandes AD, Reid JN, Macklaim JM, McMurrough TA, Edgell DR, Gloor GB. 2014. Unifying the analysis of high-throughput sequencing datasets: characterizing RNA-seq, 16S rRNA gene sequencing and selective growth experiments by compositional data analysis. Microbiome 2:15. doi: 10.1186/2049-2618-2-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Mandal S, Van Treuren W, White RA, Eggesbø M, Knight R, Peddada SD. 2015. Analysis of composition of microbiomes: a novel method for studying microbial composition. Microb Ecol Health Dis 26:27663. doi: 10.3402/mehd.v26.27663 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Morton JT, Marotz C, Washburne A, Silverman J, Zaramela LS, Edlund A, Zengler K, Knight R. 2019. Establishing microbial composition measurement standards with reference frames. Nat Commun 10:2719. doi: 10.1038/s41467-019-10656-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36. McLaren MR, Nearing JT, Willis AD, Lloyd KG, Callahan BJ. 2022. Implications of taxonomic bias for microbial differential-abundance analysis. Bioinformatics. doi: 10.1101/2022.08.19.504330 [DOI]
- 37. Crossette E, Gumm J, Langenfeld K, Raskin L, Duhaime M, Wigginton K. 2021. Metagenomic quantification of genes with internal standards. mbio 12:e03173-20. doi: 10.1128/mBio.03173-20 [DOI] [Google Scholar]
- 38. Davis BC, Brown C, Gupta S, Calarco J, Liguori K, Milligan E, Harwood VJ, Pruden A, Keenum I. 2023. Recommendations for the use of metagenomics for routine monitoring of antibiotic resistance in wastewater and impacted aquatic environments. Crit Rev Environ Sci Technol 53:1731–1756. doi: 10.1080/10643389.2023.2181620 [DOI] [Google Scholar]
- 39. Vogt RF, Cross GD, Henderson LO, Phillips DL. 1989. Model system evaluating fluorescein-labeled microbeads as internal standards to calibrate fluorescence intensity on flow cytometers. Cytometry 10:294–302. doi: 10.1002/cyto.990100308 [DOI] [PubMed] [Google Scholar]
- 40. Dozmorov I, Lefkovits I. 2009. Internal standard-based analysis of microarray data. Part 1: analysis of differential gene expressions. Nucleic Acids Res 37:6323–6339. doi: 10.1093/nar/gkp706 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Go Y-M, Walker DI, Liang Y, Uppal K, Soltow QA, Tran V, Strobel F, Quyyumi AA, Ziegler TR, Pennell KD, Miller GW, Jones DP. 2015. Reference standardization for mass spectrometry and high-resolution metabolomics applications to exposome research. Toxicol Sci 148:531–543. doi: 10.1093/toxsci/kfv198 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Huws SA, Edwards JE, Lin W, Rubino F, Alston M, Swarbreck D, Caim S, Stevens PR, Pachebat J, Won M-Y, Oyama LB, Creevey CJ, Kingston-Smith AH. 2021. Microbiomes attached to fresh perennial ryegrass are temporally resilient and adapt to changing ecological niches. Microbiome 9:143. doi: 10.1186/s40168-021-01087-w [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43. Kralj J, Tourlousse D, Hunter ME, Romsos E, Toman B, Vallone P, Jackson S. 2022. Reference material 8376: microbial pathogen DNA standards for detection and identification. NIST Special Publication 260-225. 10.6028/NIST.SP.260-225. [DOI]
- 44. Bushnell B, Rood J, Singer E. 2017. BBMerge - accurate paired shotgun read merging via overlap. PLoS One 12:e0185056. doi: 10.1371/journal.pone.0185056 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Kim D, Song L, Breitwieser FP, Salzberg SL. 2016. Centrifuge: rapid and sensitive classification of metagenomic sequences. Genome Res 26:1721–1729. doi: 10.1101/gr.210641.116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Zhu Q, Mai U, Pfeiffer W, Janssen S, Asnicar F, Sanders JG, Belda-Ferre P, Al-Ghalith GA, Kopylova E, McDonald D, et al. 2019. Phylogenomics of 10,575 genomes reveals evolutionary proximity between domains Bacteria and Archaea. Nat Commun 10:5477. doi: 10.1038/s41467-019-13443-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Martin M. 2011. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet j 17:10. doi: 10.14806/ej.17.1.200 [DOI] [Google Scholar]
- 48. Callahan BJ, McMurdie PJ, Rosen MJ, Han AW, Johnson AJA, Holmes SP. 2016. DADA2: High-resolution sample inference from Illumina amplicon data. Nat Methods 13:581–583. doi: 10.1038/nmeth.3869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Peplies J, Glöckner FO. 2013. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res 41:D590–6. doi: 10.1093/nar/gks1219 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. R Core Team . 2023. R: a language and environment for statistical computing. R foundation for statistical computing, Vienna, Austria. Available from: https://www.R-project.org
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Figures S1 to S4; Table S1.
Data Availability Statement
All raw data and analysis code described in this paper have been made publicly available (doi.org/10.18434/mds2-3760).





