Abstract
Metataxonomic studies of ecosystem microbiotas require the simultaneous processing of samples with contrasting physical and biochemical traits. However, there are no published studies of comparisons of different DNA extraction kits to characterize the microbiotas of the main components of terrestrial ecosystems. Here, and to our knowledge for the first time, five DNA extraction kits were used to investigate the composition and diversity of the microbiota of a subset of samples typically studied in terrestrial ecosystems such as bulk soil, rhizosphere soil, invertebrate taxa and mammalian feces. DNA extraction kit was associated with changes in the relative abundance of hundreds of ASVs, in the same samples, resulting in significant differences in alpha and beta diversity estimates of their microbiotas. Importantly, the impact of DNA extraction kit on sample diversity varies according to sample type, with mammalian feces and soil samples showing the most and least consistent diversity estimates across DNA extraction kits, respectively. We show that the MACHEREY–NAGEL NucleoSpin® Soil kit was associated with the highest alpha diversity estimates, providing the highest contribution to the overall sample diversity, as indicated by comparisons with computationally assembled reference communities, and is recommended to be used for any large-scale microbiota study of terrestrial ecosystems.
Keywords: 16S rRNA gene, Amplicon-sequencing, Metataxonomy, Method evaluation, MICCA
Subject terms: Microbial ecology, Microbiome
Introduction
The soil ecosystem is a complex, three-dimensional habitat for many taxa of microorganisms, (bacteria, fungi, archaea and viruses) that provide essential ecosystem services, such as soil fertility, critical to human survival. The transmission of microorganisms from soil to plants and animals, maintains and enhances the microbiotas of these species, which are essential for their development and health1. For example, plants secrete organic substances into soils in the form of root exudates, which aid the recruitment of specialized members of soil microbial communities into plant rhizosphere2; at the same time, bulk soil is one of the most important contributors to plant endophytic microbiota. Soil dwelling invertebrates such as bacterial-feeding roundworms and earthworms feed on soil microorganisms, but they also influence soil microbial community directly (as in the cases of roundworms) or indirectly, by contributing to soil structure and nutrient availability3–5. Furthermore, the deliberate or involuntary consumption of soil (‘geophagy’) contributes commensal microorganisms to the animal gut microbiota; in fact, up to 3% of the rumen microbiome of sheep and cattle can be traced back to the soil1. Additionally, according to the biodiversity hypothesis6,7, contact with natural environments, as well as ingestion of unprocessed and fermented foods, is expected to enrich human microbiota, promote immune balance, and protect against allergies and inflammatory disorders. Characterizing the compositional similarities and co-occurrences across microbial taxa found in soil, plants and animals (both vertebrates or invertebrates, living below and above the soil) is of pivotal importance for understanding the role of microbiota in the One Health framework, as well in the conservation and management of natural and traditional agricultural ecosystems.
Metataxonomics has recently become the most widely adopted method for studying complex microbial communities, using short read sequencing of amplicons generated by targeting specific hypervariable regions of key conserved genes8–10. This analytical strategy has now been applied to the investigation of microbiota in a great variety of terrestrial and aquatic ecosystems on Earth11–15 and even in space16. However, the application of amplicon sequencing-based metataxonomy approaches to the understanding of microbiota interactions across the soil ecosystem requires the simultaneous profiling of microorganisms from various matrices, such as soil, plant roots, invertebrates, and animal faeces (often used as a proxy for gut microbiota). Purification of DNA extracts suitable for amplicon-sequencing from environmental samples could be hampered by the co-extraction of secondary metabolites with enzymatic inhibitory effects such as humic substances or fulvic acids from soil samples17 as well as complex polysaccharides, bile salts, lipids, and urate from fecal samples18. Furthermore, the extraction of sufficient amounts of DNA for subsequent metataxonomic analyses could be challenging in the case of microscopic animals such as soil dwelling roundworms or intestinal helminths19 due to their small size and limited biomass. Progress in this field has been made possible, at least in part, by the availability of a number of commercial kits allowing the extraction of nucleic acids suitable for PCR amplification from a wide variety of samples, regardless of their physical properties (e.g. size, mass, physical state) and chemical composition (e.g. water content, presence and abundance of PCR inhibitors)18. However, each DNA extraction protocol has its own specifications (e.g. buffers composition, lysis conditions), and these methodological choices throughout the workflow of a metataxonomic study can greatly affect the results20–23. In fact, DNA extraction has been shown to be the main contributor to the distortion of bacterial abundance from their original values, by altering, for instance, the abundance of taxa more difficult to lyse (e.g. gram-positive bacteria22,24). Consequently, in all cases where sample types are processed with different DNA extraction methods, the technical variation resulting from DNA extraction has the potential to bias alpha and beta diversity estimates24,25, limiting our ability to compare datasets. As a result, microbial ecology studies focusing on community assemblies in diverse sample types require a workflow minimizing technical variation, starting with the adoption of a DNA extraction kit suitable for the sample types being studied.
Thus far, comparative methodological studies have largely focused on single sample types, such as stool20,24,26,27, soil28–30 and invertebrates31–34. To the best of our knowledge, only a single eDNA-oriented study has been published that compared extraction kits for a range of sample types, including water, litter and soil35. However, this study did not include many sample types that are of pivotal importance for understanding microbial co-occurrences (and potential interactions) between plants, soil, invertebrates, and vertebrates.
Here, with the aim of quantifying differences in microbiota composition between the main actors of terrestrial ecosystems, we tested five commercial DNA extraction kits on two soil components, two taxa of soil invertebrates, and two above-ground vertebrate taxa from the same sample site. Standard metataxonomics methods and bioinformatics software were then applied to: (i) investigate the extent to which kits specifically designed for a given sample type can be used for the processing of other sample types; (ii) estimate the extent to which variation in diversity estimates for each sample type or taxon can be associated with DNA extraction method; (iii) define quantitative variation in taxa abundances resulting from DNA extraction methods; (iv) identify the most suitable kit for terrestrial ecosystem microbiota studies.
Results
DNA extraction efficiency varies across kits and sample types
Genomic DNA was successfully extracted from all samples of each sample type with all DNA extraction kits tested: DNeasy® Blood & Tissue (QIAGEN; hereafter QBT), QIAamp® DNA Micro (QIAGEN; QMC), NucleoSpin® Soil (MACHEREY–NAGEL; MNS), DNeasy® PowerSoil® Pro (QIAGEN; QPS) and QIAamp® Fast DNA Stool Mini (QIAGEN; QST) (Figure S1). However, we did not find a single kit that outperformed all others for the entire set of investigated sample types (Figure S1). QST was the best performing kit for Feces_hare samples in terms of DNA concentration, but not for other sample types, including Feces_cattle; in fact, QST produced significantly lower DNA quantity than other kits. Additionally, despite QMC being commercialized for small sample sizes, it was among those providing the highest DNA concentrations for invertebrate (Invert_beetle, Invert_roundworm), bulk soil (Soil_horizonA) and rhizosphere soil (Soil_rhizosphere) samples, but provided yields similar to QBT, MNS and QPS for Feces_hare, although not for Feces_cattle. Considering the 260/280 ratio, we found significant differences in purity of DNA extracts across kits (Figure S2A); with QST providing the highest values. For the 260/230 ratio, MNS provided the best performances regarding the 260/230 ratio in all samples except for Invert_beetle (Figure S2B). Of note, despite the observed variability in DNA concentration and purity, the V4 region of the 16S rRNA gene was successfully amplified using the eluted DNA from all kits and sample types. Sequencing of the resulting 284 libraries (including extraction and PCR negative controls) generated a median number of 46,283 sequence reads per library, ranging from 15,945 (St_NP4_3_mc) to 216,074 (St_SP4_3_mc).
Differences in the extraction efficiency of gram-positive bacteria
To assess the extraction efficiency of gram-positive bacteria we included a commercial mock community (MC) composed by a known number of cells of two halotolerant bacteria I. halotolerans (gram-negative) and A. halotolerans (gram-positive) with each combination of sample type and kit for aliquots D1mc, D2mc, and D3mc (Fig. 1A and B). The abundance of MC-related ASVs relative to the total number of reads ranged from 1.4% (QMC-BP4-3-mc) to 91.1% (QMC-NP4-3-mc) (Fig. 1A and B). The mean ratio between ASV abundance of the two MC-derived bacteria was 0.71 ± 0.08 for QBT, 1.40 ± 0.15 for QMC, 1.35 ± 0.19 for MNS, 1.31 ± 0.25 for QPS and 1.39 ± 0.19 for QST (Fig. 1B). Although the observed values were higher than the expected 0.43, based on the composition of the MC provided by the supplier, in all combinations of sample type and DNA kit (Fig. 1), QBT always produced the lowest ratio regardless of sample type (mean value: 0.71 ± 0.08), indicating that this kit had the highest extraction efficiency of A. halotolerans (Fig. 1A, B) of these five kits. While we found a clear connection between the ratios of the two taxa and the use of lysozyme during DNA extraction (Fig. 1D), we found no association between this ratio and the adoption of a sample-specific kit (Fig. 1C), a particular lysis temperature (Fig. 1E), or homogenization strategy and lysis time (data not shown).
DNA extraction method affects microbial diversity estimates
Taxonomic classification of assembled ASVs shows that a distinct microbial community characterizes technical and biological replicates of each sample type (Figure S3). Clostridia and Bacteroidia were the most represented Classes found in Feces_cattle. These two classes were highly represented in Feces_hare as well, but the microbial community of these latter samples were also rich in Gammaproteobacteria and unclassified Firmicutes. We found that taxa belonging to the classes Gammaproteobacteria and Bacilli were the most abundant in our samples of Invert_beetle, followed by Clostridia, Bacteroidia and Alphaproteobacteria (Figure S3). The two most represented classes of bacteria in Invert_roundworm were Gammaproteobacteria and Alphaproteobacteria. Soil_horizon A and Soil_rhizosphere also displayed distinct different microbial communities, although the most represented taxa found in both were Alphaproteobacteria, Gammaproteobacteria and Actinobacteria. While Soil_rhizosphere also had a high proportion of Bacteroidia, Soil_horizonA were further enriched in Blastocatellia, Verrucomicrobia, Vicinamibacteria and Thermoleophilia, among others (Figure S3).
Alpha diversity estimates for each sample type and DNA extraction kit are shown in Fig. 2 and S4. Both indices considered in this study highlighted significant differences in diversity estimates across DNA extraction methods (Figs. 2, S4, Table S2). Of note, we found no significant association between alpha diversity estimates and purity of DNA extracts expressed as 260/230 ratio (Table S3). Instead, richness estimates in Invert_beetle were negatively associated with the concentration of DNA eluates (Richness: t-value − 2.89; p-value: 0.0078; Table S3) and purity of DNA extracts expressed as 260/280 ratio (Richness: t-value − 2.75; p-value: 0.0108). At the same time, alpha diversity estimates were positively associated with concentration of DNA eluates in Soil_rhizosphere (Richness: t-value 4.03; p-value: 0.0005; Shannon: t-value 3.90; p-value: 0.0006; Table S3) and 260/280 ratio and Soil_horizonA (Richness: t-value 2.26; p-value: 0.0325; Table S3).
The extraction kits providing significantly higher alpha diversity estimates were QMC for Feces_cattle, QMC and MNS for Feces_hare, QBT and QPS for Invert_beetle, QPS for Soil_rhizosphere and MNS for Soil_horizonA. No significant difference in alpha diversity estimates across kits was detected for Invert_roundworms. While for Feces_cattle, Feces_hare and Invert_beetle, the variability across biological replicates (Fig. 2B) superseded the differences between DNA extraction methods (Fig. 2A), Soil_horizonA and Soil_rhizosphere had significantly different diversity estimates across kits in both technical and biological replicates (Fig. 2A, B). In fact, Euclidean distances of estimated richness (S) were significantly higher in pairwise comparisons Between kits than Within kit in all sample types, except for Invert_roundworm (Fig. 2C). Instead, we found no significant differences in Euclidean distances of estimated richness for pairwise comparisons Between kits and Between individuals in most sample types, except for Feces_cattle and Soil_rhizosphere. Interestingly, for Soil_rhizosphere, Euclidean distances of estimated richness were lower in pairwise comparisons Between sampling sites (reported as Between individuals in the corresponding Figure) than Between kits (Fig. 2C).
Between-sample diversity estimates were computed as weighted UniFrac (Fig. 3) and Bray–Curtis (Figure S5) distances. PERMANOVA statistical tests performed on the entire set of samples (N = 180, replicates without MC) to test the impact of DNA extraction method on beta diversity estimates, after controlling for the variation associated with the different type of samples, showed that a significant fraction of variation in these estimates could be explained by DNA extraction methods alone (Unifrac R2: 0.03, p-value ≤ 0.001; Bray–Curtis R2: 0.02, p-value ≤ 0.001; Table S4) and by the interaction between sample type and DNA extractions method (Unifrac R2: 0.06, p-value ≤ 0.001; Bray–Curtis R2: 0.07, p-value ≤ 0.001; Table S4). At the level of single sample types, PERMANOVA statistical tests on UniFrac diversity estimates showed that a large fraction of variation in these estimates is associated with the extraction kit used (R2 ranging from 0.299 to 0.830, p-values ranging from 0.010 to 0.001; Fig. 3A and B, Table S4). Furthermore, our data showed that extraction kit is the main driver of diversity estimates in Soil_horizonA (R2: 0.830, p-value < 0.001, Fig. 3, Table S4), Soil_rhizosphere (R2: 0.764, p-value < 0.001, Fig. 3, Table S4) and Invert_roundworm (R2: 0.389, p-value < 0.001, Fig. 3, Table S4). Interestingly, although pool ID is the main driver of diversity in Feces_cattle and Feces_hare (R2: 0.427 and 0.555, p-values < 0.001, respectively), a fraction of this diversity (e.g. R2: 0.427 and 0.555, p-values < 0.001, respectively) was associated with DNA extraction methods for these sample types as well (Fig. 3, Table S4).
As an index of closeness of each sample type to the ‘true’ microbial composition, our reference microbial communities that were computationally assembled for each sample type by merging the fastq files from each DNA extraction method, were used to retrospectively measure the contribution of each library to these reference communities (Figs. 4A, S5). The clustering of all samples based on weighted UniFrac (Fig. 4A) and Jaccard (Figure S6) distances placed each reference community in proximity to the samples from which they were generated. In addition, we also found some significant differences in diversity estimates between reference communities and corresponding samples (Figs. 4B, S5), which, together with the observed clustering pattern, further indicates that kits adopted for DNA extraction affect composition (e.g. the presence of a given taxa, Figs. 4, S6) and frequency (Fig. 4) of identified taxa. Referring to the UniFrac diversity estimates, kits with the lowest distances between samples and corresponding and reference communities, and hence providing the highest contribution to their assemblage, were QST and QMC for Feces_cattle, QST and MNS for Feces_hare and MNS for soil (Soil_horizonA and Soil_rhizosphere) (Fig. 4B). In the case of invertebrates (Invert_beetle, Invert_roundworm), we found QBT and most of the kits (including MNS) providing similar contributions to the assembly of corresponding reference communities. In fact, as shown in Fig. 4B, for all sample types except Soil_horizonA and Soil_rhizosphere, we found several DNA extraction kits performing similarly in terms of distances between corresponding and reference communities and providing estimates comparable to that between reference communities and their closest kit (e.g. “QMC” for Bos, “MNS” for Lepus). Inspection of each library's contribution to the corresponding reference communities using Jaccard distance estimates, which differently from weighted UniFrac only account for presence-absence data, further highlighted the significantly better performance of MNS for soil samples (Soil_horizonA, Soil_rhizosphere), invertebrates (Invert_beetle, Invert_roundworm) and Feces_hare (Figure S6).
Significant changes in ASV abundance due to DNA extraction methods
Taxa significantly enriched or depleted by the adopted DNA extraction kit were identified by pairwise comparisons across kits in all sample types except for Invert_beetle (Fig. 5A, Table S5). The number of differential ASVs across kits varied considerably (Table S5). Soil_horizonA and Soil_rhizosphere displayed the highest variation across kits, with an observed number of differential ASVs ranging from about 100 (for the contrasts: QMC vs MNS and QBT vs QST, respectively) to more than 500 (for the contrast QBT vs QPS in both cases). For Feces_cattle and Feces_hare the number of ASVs with significant differences in pairwise comparisons ranged from less than 10 (for the Feces_hare contrasts: MNS vs QST and MNS vs QPS; Table S5) to 334 (for the contrast QBT vs QPS in Feces_hare; Table S5). The clustering of libraries based on the abundance of differential ASVs (Fig. 5A) highlighted again a clear separation between soil and fecal microbial communities and, within each sample type, further emphasized the separation of libraries generated from DNA extracts of different kits (Fig. 5A). Of note, and in agreement with our beta diversity estimates, both soil samples (Soil_horizonA and Soil_rhizosphere) were associated with a clear separation between kits specifically designed for soil (e.g. MNS and QPS) and the other kits (e.g. QBT, QMC, QST) (Fig. 5A).
When differential ASVs were collapsed to the taxonomic level of class, we found several bacterial and archaeal taxa whose abundance was significantly enriched and/or depleted by the DNA extraction method (Fig. 5B, Tables S5, S6). Bacterial classes affected in fecal samples (Feces_cattle and Feces_hare) were mainly Clostridia, Bacteroidia, Bacteria_unclassified and Cyanobacteria. This latter Class was significantly enriched in Feces_hare (but not Feces_cattle) only when extracted with QMC and QBT. Likewise, the adoption of alternative DNA extraction methods significantly modulated the relative abundance of Actinobacteria, Alphaproteobacteria and Gammaproteobacteria in Soil_horizonA and Soil_rhizosphere, Blastocatellia in Soil_horizonA as well as Thermoleophilia, Vicinamibacteria and Nitrososphaeria in Soil_rhizosphere. Soil_horizonA and Soil_rhizosphere extracted with the QPS were associated with a depletion in Blastocatellia, Gammaproteobacteria (along with Bacteroidia and Verrucomicrobiae) and an enrichment in Actinobacteria, Bacilli, Nitrososphaeria and Thermoleophilia for all pairwise comparisons, except when MNS was included in the comparison (Fig. 5, Tables S5, S6).
Discussion
The microbial communities of terrestrial ecosystems represent study areas of great interest for microbial ecology with potential applications to biodiversity conservation36,37. Research projects focusing on complex ecosystems frequently require the simultaneous processing of biological matrices with non-uniform physical and bio-chemical properties, for which the adoption of methodological strategies minimizing technical variability across project samples is of pivotal importance. The present study deepens our understanding of the impact of DNA extraction methods on diversity indices of soil, soil invertebrates and vertebrate fecal sample bacterial microbiota (Table 1). We show that DNA extraction methods provide DNA eluates with different quantitative and qualitative profiles. Although all tested DNAs were successfully amplified and sequenced, we found that kit selection can bias both alpha and beta diversity estimates of resulting microbial communities. Additionally, switching DNA extraction kit is reflected in significant differences in the abundance of hundreds of ASVs. Interestingly, however, the impact of DNA extraction kit on estimated sample diversity varied according to the sample type, with some type of samples, such as Feces_cattle, showing highly consistent results across kits and other samples, such as Soil_horizonA and Soil_rhizosphere, displaying marked and reproducible signs of kit-associated bias.
Table1.
Sample type (abbreviation) | Kits with the highest α diversity (S) estimates | Pattern of variation (S)a | Main β diversity driver | Kit with highest contribution to the reference communityb | Number of DATs (max–min) | DATs taxonomy (Class) |
---|---|---|---|---|---|---|
Feces cattle (Feces_cattle) |
All, except QST | W kit < B kit < B ind | Individual | QST, QMC | 231–22 | Clostridia, Bacteroidia |
Feces hares (Feces_hare) |
QMC, MNS, QST | W kit < B kit ≈ B ind | Individual | QST, MNS | 334–3 | Clostridia, Bacteroidia, Cyanobacteriia |
Beetles (Invert_beetle) |
QBT, QPS, MNS | W kit < B kit ≈ B ind | Individual | All, except QST | nd | nd |
Roundworms (Invert_roundworm) |
All | W kit ≈ B kit ≈ B ind | Kit | All, except QPS | 16 – 1 | Alphaproteobacteria, Thermoleophilia, Bacteroidia, Bacilli |
Rhizosphere Soil (Soil_rhizosphere) |
QPS, QMC, MNS | W kit < B kit > B ind | Kit | MNS | 501 – 90 | Actinobacteria, Alphaproteobacteria, Gammaproteobacteria, Thermoleophilia, Vicinamibacteria, Nitrososphaeria |
HorizonA soil (Soil_horizonA) |
All, except QMC | W kit < B kit ≈ B ind | Kit | MNS | 537 – 108 | Actinobacteria, Alphaproteobacteria, Gammaproteobacteria, Blastocatellia |
For each sample type, the DNA extraction kit resulting in the highest α diversity estimates (S), the comparison of patterns of variation in S between extraction kits and replicates, the main driver of β diversity, and the kit with the highest contribution to the assembly of the corresponding reference communities are reported. The maximum and minimum number of taxa with significant differences in their abundance in all pairwise comparisons (DATs) across the five DNA extraction kits, and the main taxonomic Classes associated with these DATs are also reported. QBT: DNeasy® Blood & Tissue (QIAGEN); QMC: QIAamp® DNA Micro (QIAGEN); MNS: NucleoSpin® Soil (MACHEREY–NAGEL); QPS: DNeasy® PowerSoil® Pro (QIAGEN); QST: QIAamp® Fast DNA Stool Mini (QIAGEN). nd: not detected.
aW kit: within kit; B kit: between kits; B ind.: between individual samples. bReferred to UniFrac distance estimates.
Direct comparisons across DNA extraction methods were initially made using quantity and quality of DNA extracts from the wide range of combinations between sample types and DNA extraction methods considered in this study. The performance of each kit varied between and within sample types. We did not find a single kit clearly outperforming the others in terms of concentration of DNA extracts in all sample types (Figure S1). As we focused on the amplification of a short target region (16S-V4: ~ 250 bp excluding primer binding sites) from both environmental and non-invasive samples, which are notoriously prone to high DNA degradation38, we considered purity (e.g. 260/280 and 260/230 ratios from UV/VIS spectra) rather than DNA integrity24 as key factor for direct assessment of DNA extraction efficiency. Three kits among those tested (namely, MNS, QPS, QST) make use of specific strategies for the removal of secondary inhibitors, such as inhibitor removal columns (MNS), or inhibitor removal buffers (namely IRT and InhibitEX Buffer for QPS and QST, respectively). We found that DNA extracted with MNS were characterized by the highest purity values for the 260/230 ratio across all samples except for Invert_beetle. Higher effectiveness of MNS compared to QPS, QST, and PowerFecal kit from Mo Bio Laboratories Inc. (which was not tested in this study) in extracting high purity DNA from raw feces and lagoon effluents was also reported by39. Although a previous study on fecal samples collected from different animal species failed to find a clear correlation between the DNA purity and success of subsequent amplification and sequencing40, efficient removal of potential inhibitors of enzymatic activity is considered of pivotal importance for metataxonomics, as high contaminant concentration may lead to reduced PCR amplification, and consequently, to skewed composition of microbial communities24,40–42.
The inclusion of a commercial mock community (MC) with each combination of sample type and kit (Fig. 1) provided the opportunity to infer the extraction efficiency of gram-positive bacteria across different commercial kits. The mean ratio between ASV abundance of the two MC-derived bacteria, which based on the MC composition and 16S rRNA gene copy number in the two MC taxa is expected to be ~ 0.43, varied across DNA extraction methods, with QBT displaying higher performances in the extraction of A. halotolerans irrespectively of the sample type and abundance of MC-related ASVs detected in each library. Interestingly, since the extractions with the QBT kit were performed by making an optional pretreatment for gram-positive bacteria that uses lysozyme to promote cell wall lysis of these microorganisms, our data support the idea that lysozyme might be an effective alternative to other widely adopted strategies for cell lysis such as bead beating (adopted by MNS and QPS), prolonged lysis time (as in the case of overnight lysis for QMC) and/or high lysis temperatures (as in the case of QST, in which lysis is performed at 70 °C). Although the fine tuning of MC dose for the different types of sample (and associated microbial communities) was not needed to efficiently estimate the extraction efficiency of gram-positive bacteria, our data pinpoint the importance of calibrating the MC dose in different sample types to avoid overpowering the sample bacterial community (as seen in Invert_roundworm samples), as this would lead to a markedly distorted representation of community data43.
We found compelling evidence that composition and diversity estimate of sample microbial communities are affected by DNA extraction kit (Figs. 2, 3, 4, 5). However, the increased efficiency in the extraction of the gram-positive bacteria A. halotolerans observed for QBT did not translate into higher alpha diversity estimates, a finding that could be explained assuming that A. halotolerans is not a good proxy for the extraction efficiency of gram-positive bacteria. Unfortunately, as previously highlighted by43, since its recent release only a few studies have used this product, and information regarding its correct use and potential practical limitations in non-model sample types is still scarce. Alternatively, we might consider that gram-positive bacteria only represent a small fraction of considered communities. However, despite the strong discrepancies between estimates made with different methodologies44, several studies report that gram-positive bacteria account for a significant fraction of fecal44–46 and soil47,48 microbiota. In fact, kit providing significantly higher alpha diversity estimates across the different sample types considered in our study were QMC for Feces_cattle, QMC and MNS for Feces_hare, QBT and QPS for Invert_beetle, QPS for Soil_rhizosphere and MNS for Soil_horizonA (Table 1, Fig. 2). Importantly, alpha diversity estimates indicated that differences associated with DNA extraction methods can be of the same order of magnitude than differences detected across biological replicates (Fig. 2). Moreover, regardless of the amplitude of estimated diversity, variation in richness (S) across technical replicates processed with the different kits (Between kits; Table 1) was generally comparable or higher than variation detected between biological replicates of the same sample type (Between individuals), and both were higher than variation estimated across technical replicates processed with the same method (Within kit) in all investigated samples except Invert_roundworms. Despite that variation in beta diversity estimates across all samples was driven by the sample types (Unifrac, permanova R2: 0.779, p-value < 0.001; Bray–Curtis, permanova R2: 0.631, p-value < 0.001), a smaller but significant portion of variation in diversity estimates was associated with the kit alone (Unifrac, permanova R2: 0.027, p-value < 0.001; Bray–Curtis, permanova R2: 0.021, p-value < 0.001) and its interaction with sample type (Unifrac, permanova R2: 0.059, p-value < 0.001; Bray–Curtis, permanova R2: 0.070, p-value < 0.001). Soil samples showed the greatest difference in alpha diversity across kits, and consistently, kit was the main driver of beta diversity estimates in Soil_horizonA (Unifrac, permanova R2: 0.830, p-value < 0.001), Soil_ rhizosphere (Unifrac, permanova R2: 0.764, p-value < 0.001) and soil-dwelling animals (Invert_roundworm; Unifrac, permanova R2: 0.389, p-value < 0.001) (Fig. 3, Table S4). However, a significant fraction of diversity was associated with the kit also in our vertebrate fecal samples: Feces_hare (Unifrac, permanova R2: 0.298, p-value < 0.01) and Feces_cattle (Unifrac, permanova R2: 0.378, p-value < 0.001). Our findings for soil are in agreement with a previous study highlighting that variation in diversity estimates due to DNA extraction method in soil might overwhelm biological differences due to contrasting land uses41, and confirm those from a previous investigation23, but are in contrast with those reported by30, who found no differences in diversity estimates associated with three DNA extraction methods (none of which were tested in our study). Regarding fecal samples, previous benchmarking studies on human fecal samples24 have shown the importance of mechanical lysis and bead beating for the efficient extraction of DNA from gram-positive bacteria and highlighted the good performance of MNS in the extraction of fungal taxa for metagenomic studies49.
Although the reference communities adopted in this study can not provide an exhaustive representation of original communities, as they are (by definition) dependent on the subset of samples adopted for their assembly, their use as reference communities allowed us to rank the diverse kits based on their distance (expressed as weighted UniFrac or Jaccard dissimilarity) from the corresponding reference communities, thus providing a valuable indication of the effectiveness of each method to embrace the complexity of each sample and its microbial community. Consistent with the presented diversity estimates, the contribution of each DNA extraction method to the assembly of the reference communities was, again, highly dependent on sample type (Figs. 2, S5). Thus, while technical replicates of invertebrate animals (Invert_beetle and Invert_roundworm) were prone to compositional differences when processed with alternative DNA extraction methods (Fig. 4 and S6), other sample types such as soil (Soil_horizonA and Soil_rhizosphere) as well as Feces_hare were prone to variation in both composition and frequency between observed taxa (as indicated by weighted UniFrac distances; Fig. 4). According to the weighted UniFrac distance, the kit providing the lowest distance between samples and corresponding reference communities were the following: QST and QMC for Feces_cattle, QST and MNS for Feces_hare and MNS for both types of soil samples (e.g. Soil_horizonA and Soil_rhizosphere). Conversely, invertebrate samples were characterized by several kits (including QBT and MNS) providing similar UniFrac distance estimates with the corresponding reference communities. Consistently, switching DNA extraction kit resulted in the identification of hundreds of ASVs with significant differences in their abundance across kits50. This was observed in all sample types, except Invert_beetle, for which the lack of significant differences across methods could reflect the observed high variability in diversity estimates which was observed across replicates. Two clear examples of variation associated with the DNA kit are, for example, the enrichment of Cyanobacteria in Feces_hare extracted with QBT and QMC as well as the enrichment of Actinobacteria, Alphaproteobacteria, Bacilli and Nitrososphaeria observed in Soil_horizonA and Soil_rhizosphere extracted with QPS and MNS. The enrichment of gram-positive bacteria (e.g. Actinobacteria, Bacilli) and other bacterial classes difficult to lyse such as Alphaproteobacteria51 in both soil samples extracted with QPS and MNS was consistent with earlier observations that mechanical lysis was the most efficient method for the extraction of DNA from gram-positive and spore-forming bacteria24,28,52,53. The observation that an increase in gram-positive bacteria was mirrored by an apparent depletion of taxa belonging to Gammaproteobacteria, Bacteroidia and Gemmatimonadetes is also consistent with previous studies24,28,49,54 and can be explained by competition between DNA fragments/amplicons in compositional datasets55. Interestingly, the list of taxa modulated by DNA extraction includes microorganisms involved in fundamental ecosystem processes such as phosphorus cycling (Vicinamibacteria56) as well as nitrification and methane oxidizers (Nitrososphaeria57–59), which emphasizes that different extraction methods might affect the outcomes and conclusions of ecological studies, and why uniformity of extraction methods is desirable.
Despite the constant decrease of amplicon-sequencing costs, resources limitations most frequently imply that only a limited number of highly referenced DNA extraction methods and types of samples could be compared before any study. This study addresses the impact of DNA extraction methods on a wide range of sample types by testing highly referenced DNA extraction methods selected among the many available. Comparisons between selected methods demonstrate the impact of switching DNA extraction kit varies across sample types, with some sample types, such as bovine feces, being resilient to technical variation and others, such as bulk soil, rhizosphere soil and invert roundworms, being particularly vulnerable to technical variation derived from DNA extraction methods. Therefore, the selection of the kit to be used in projects involving different sample types (such as feces and soil or invertebrate animals) should be done by considering the project specific sample susceptibility to methodological variation (see Table 1) and by extracting the DNA from most resilient samples (such as mammalian feces) using methods specifically designed for the set of samples showing higher sensitivity to DNA-derived technical variation (such as soil samples). Using different DNA extraction kits leads to significant differences in the abundance of several bacterial classes and reproducible changes in alpha and beta diversity estimates and thus actively contributes to shape the sample microbial community as addressed by conventional metataxonomic pipelines. From the comparison of the tested kits, we noted that QST was generally associated with reduced alpha diversity estimates and poor contribution to the theoretical reference community, in particularly for soil, rhizosphere and invertebrate animals (Invert_beetle). QBT, QMC and QPS were among kits providing the highest alpha diversity estimates in mammalian feces, invertebrate animals, rhizosphere and bulk soil samples. However, these kits typically performed worse than MNS regarding the relative contribution to the theoretical reference community, even in invertebrate and soil samples. Conversely, eluates generated with MNS were associated with the highest alpha diversity estimates (in all sample types except Invert_beetle) and greater contribution to the corresponding theoretical microbial communities (in all sample types except Feces_cattle). These observations indicate that MNS is very effective in capturing the composition and diversity of most samples considered in this study and for this reason, we recommend its use for any large-scale ecological study on microbial communities, especially those studies comparing soil and rhizosphere with other fundamental actors of terrestrial ecosystems.
Future microbial ecology studies involving the simultaneous processing of different sample types, such as soil, feces and invertebrate animals would benefit from higher methodological harmonization as this would facilitate comparisons of amplicon sequencing data in complex ecological systems. Similarly, greater methodological harmonization would facilitate the study of the functional potential present or expressed by microbial communities found in these heterogeneous contexts through the application of metagenomics and metatranscriptomics, leading to a better understanding of the contribution of the microbiota to the adaptive potential and health of animals, plants, and soil to changing environments, which is of pivotal interest to One Health studies.
Methods
Study area
All samples were collected from dry grassland pastures at the Long-Term (Socio-) Ecological Research (LT(S)ER) area in Val Mazia/Matschertal (Vinschgau Valley, South Tyrol, Italy; site code LTER_EU_IT_097, 46.6928°, 10.6157°; https://deims.org/11696de6-0ab9-4c94-a06b-7ce40f56c964) in June-July 2019. The LT(S)ER in Val Mazia/Matschertal is managed by the Institute for Alpine Environment (EURAC) and comprises different land-uses and various forest types with an elevational ranging from 950 to 3700 m a.sl.
Above-ground vertebrates
Fresh fecal samples from Pezzata Rossa Italiana cattle (hereafter Feces_cattle) were collected from two sites at 2000 m a.s.l. Each of nine samples was collected from a single freshly deposited cowpat using sterile tweezers (three collection points per pat) and placed in sterile 50 ml polypropylene tubes. A total of nine single fresh hare (Lepus spp.) fecal pellets (Feces_hare) at least 20 m apart were collected using sterile tweezers from five sites located between 1000 and 2000 m a.s.l. and placed singly in sterile 15 ml polypropylene tubes. All fecal samples were stored at − 20 °C up to 24 h before being transferred to the laboratories of Fondazione E. Mach where they were archived at − 80 °C until pooling and DNA extraction. Separately for each of the two species, three to five fecal samples (Table S1) were pooled to generate technical and biological replicates as follows: for each pool approximately 0.5 g of frozen feces from each sample were placed in a sterile pestle and mortar and ground to powder in liquid nitrogen. Replicates of 50 mg of powder were then stored at − 80 °C until DNA extraction.
Soil invertebrates
Live Coleoptera from the families Staphilinidae and Carabidae (Invert_beetle) were collected from six sites between 1000 and 2500 m a.s.l. using sterile gloves (Moret et al. 2016), rinsed with a in 0.1 M Tris Saline Buffer to remove soil, pooled by site in sterile 50 ml polypropylene tubes, and stored at − 20 °C, following taxonomic identification using morphological keys (Freude, Harde, and Lohse 2012; Pesarini and Monzini 2010). Three frozen whole Coleoptera from each pool were placed in a sterile mortar, ground to powder with a sterile pestle in liquid nitrogen, then divided into six biological replicates (~ 30 mg of powder each) and stored at − 80 °C until DNA extraction. For technical replicates, the frozen Coleoptera powder from a single pool was split into 30 aliquots of ~ 10 mg each and stored at − 80 °C until DNA extraction (Table S1).
Live bacterivorous Nematoda (Invert_roundworm) were extracted from soil samples (5 cm in diameter, 5 cm deep) taken from 10 sites between 1000 and 2500 m a.s.l. following the Baermann funnel extraction protocol60. Using an optical microscope at 40X magnification, all nematodes from a single soil sample were pooled and pipetted into 0.1 M Tris Saline Buffer and stored at − 20 °C. Frozen Nematoda samples were thawed on ice, pooled in 2 ml DNA/DNase free tubes, mixed by gentle vortexing and divided into six biological or 30 technical replicates (Table S1). Aliquots were stored at -80 °C until DNA extraction.
Soil samples
Soil was collected from the upper mineral horizon (Ah) just beneath the root system of the overlying plant community (Soil_horizonA), from one pasture at 1000 m a.s.l (Table S1). At three sites at least 2 m apart, a site sample with an area of 20 × 15 × 10 cm was taken for subsequent soil sampling at a depth of 12–20 cm; for each of the site samples, approximately 100 g of soil was collected from 5 to 10 such different digging points on the site sample and combined into one composite sample. For rhizosphere soil of the grass Festuca spp. (Soil_rhizosphere), 2–3 cespitoses (including roots) were dug out at the same three sites near the bulk soil (Soil_horizonA) sampling. Care was taken to ensure that individual cespitoses were isolated from others, that there were no foreign roots, and that there was enough soil sticking to the roots to protect the plant-root complex during transport. Soil and plant samples were transported at ambient temperature the same day to the laboratory and stored at 4 °C until further processing (up to two weeks).
To collect the rhizosphere soil samples, plant phyllosphere and roots were separated at the root collar using a sterile blade, and roots underwent a washing-centrifugation procedure as described in37,61. To allow the comparison of DNA extracts of rhizosphere soil and bulk soil (Horizon ‘A’ soil), the latter was sieved to < 1 mm and 100 mg of this fraction was subjected to the same washing-centrifugation protocol as the rhizosphere soils37,61. From each of the three sites, 6 aliquots (biological replicates) containing approximately 100 mg of bulk and rhizosphere soil were frozen at − 80 °C until DNA extraction. To generate technical replicates of bulk and rhizosphere soil respectively, aliquots of the soil suspensions from the three sites were pooled, mixed by vortexing, portioned in 30 tubes of equal amount and frozen at − 80 °C until DNA extraction.
DNA extraction
For each sample type (Feces_cattle, Feces_hare, Invert_beetle, Invert_roundworm, Soil_horizonA, Soil_rhizosphere), DNA extractions were performed for three biological replicates (hereafter referred to as Pools A, B, C) and six technical replicates (Pool D) (Table S1). Three aliquots from pool D (D1mc, D2mc, D3mc) were extracted with a bacterial mock community before the initial lysis step, using the ZymoBIOMICS™ Spike-in Control I (High Microbial Load; EuroClone, Milan, Italy; hereafter MC; after43) and the other three aliquots (called D1, D2, D3) were extracted without the inclusion of MC. A single MC dose (20 μl, as defined by the manufacturers) includes 2 × 107 cells, corresponding to 6.0 × 107 (I. halotolerans) and 1.4 × 108 (A. halotolerans) 16S rRNA gene copies). Therefore, the MC dose adopted for the different sample types were the following: 0.8 dose for Feces_cattle and Feces_hare; 0.33 dose for Invert_beetle; 0.12 dose for Invert_roundworm; and 1.0 dose for Soil_horizonA and Soil_rhizosphere.
DNA extractions were performed using the manufacturer’s protocols for the following kits: DNeasy® Blood & Tissue (QIAGEN; hereafter QBT), QIAamp® DNA Micro (QIAGEN; QMC), NucleoSpin® Soil (MACHEREY–NAGEL; MNS), DNeasy® PowerSoil® Pro (QIAGEN; QPS) and QIAamp® Fast DNA Stool Mini (QIAGEN; QST). These kits were selected because they are widely cited in microbial ecology studies; in addition, they are specifically recommended by manufacturers for one or more of the sample types considered in this study. For QBT, samples were subjected to the pre-treatment for gram-positive bacteria, followed by the protocol ‘Purification of Total DNA from Animal Tissues’. Extractions with QMC were performed using the protocol ‘Isolation of Genomic DNA from Tissues’. For MNS, lysis buffer SL1 and 50 μl of Enhancer SX were used as reported by37. For QST, we followed the protocol ‘Isolation of DNA from stool for pathogen detection’. For each kit, negative DNA extraction controls (lysis buffer only: no sample material) were included for contamination control. The purity and quantity of all DNA extracts were assessed by checking the UV/VIS spectra of each extract with a Spark® multimode microplate reader (Tecan, Switzerland). Estimated DNA concentration, and extracts purity expressed as 260/280 and 260/230 ratios were plotted as Log2 values boxplots generated using the R package ggplot262; negative values were replaced with 0.001. Analysis of variance (ANOVA) was performed with the R function ‘aov’, and Tukey's test for post-hoc analysis was performed with the R package agricolae63.
16S rRNA gene amplification, library preparation and sequencing
Amplification reactions were performed in a volume of 30 μl, containing 9 ng of genomic DNA, 1X FastStart High Fidelity Reaction Buffer (Roche Applied Science), primers to a final concentration of 0.4 μM each and 1.5 U of FastStart High Fidelity Enzyme Blend (Roche Applied Science). The following primer pairs were adopted in all reactions: 515F-mod_ILL8 and 806R-mod_ILL64. All PCR reactions were performed on Veriti™ 96-Well Fast Thermal Cyclers (Applied Biosystems, USA). The experimental conditions for PCR amplification were as follows: 3 min at 95 °C, followed by 31 cycles of 45 s at 95 °C, 60 s at 55 °C, 90 s at 72 °C, and a single final extension step of 7 min at 72 °C. Non-template controls (amplification mix only: PCR-grade water instead of DNA template) were included for contamination control. Amplicons were visualized by high-resolution capillary electrophoresis using the QIAxcel Advanced System (QIAGEN). Quality and quantification of pooled amplicon libraries as well as high throughput sequencing by Illumina technology were performed at the Sequencing and Genotyping Platform, Fondazione Edmund Mach (FEM, San Michele all’Adige, Italy). Samples, DNA extraction negative controls and amplification blank controls were sequenced using Illumina MiSeq Standard Flow Cells targeting a minimum depth of 30,000 reads per sample. In total, we generated 284 amplicon libraries, which were named by specifying the DNA extraction kit (e.g. QBT, QMC, MNS, QPS, QST), followed by the sample identifier and the replicate number. For example, the library ‘MNS-BPD-1’ was generated using the extraction kit MNS from the Feces_cattle Pool D (BPD) and was identified as replicate 1 (see Table S1).
Data analysis
Bioinformatic pre-processing of fastq reads was carried out with the software MICCA65 with default settings. Sequence Variants (SVs) were generated using UNOISE366 implemented in MICCA. Taxonomy was assigned with the ‘classify.seqs’ command in MOTHUR67 and by using silva.nr_v13868 as the template and taxonomy reference files. To generate a phylogenetic tree, ASVs were aligned to the silva.nr_v138 sequences using Nearest Alignment Space Termination (NAST) approach implemented in MICCA. The generation of the phylogenetic tree and midpoint rooting were performed with default parameters in MICCA.
The following statistical analyses were performed with the R packages phyloseq69 and vegan70: ASVs matching the MC sequences (ACC: NR116607.1, NR117181.2) were extracted from the MC-control samples, verified by BLAST (https://blast.ncbi.nlm.nih.gov) and, when confirmed by the sequence alignment (number of ASVs: 17), removed from all datasets before performing subsequent steps. Following the removal of ASVs matching the MC, the dataset consisted of 26,996 ASVs. Separately for each DNA extraction kit, putative contaminant ASVs were detected using decontam v1.22. 0 with the prevalence method71. The number of putative contaminant ASVs was 140 for QBT (median prevalence: 2, range: 0–22), 13 for QMC (median prevalence: 2, range: 1–22), 398 for MNS (median prevalence: 2, range: 0–23), 34 for QPS (median prevalence: 2.5, range: 1–23) and 26 for QST (median prevalence: 1, range: 0–17). Considering that the detected median prevalence was so low that that decontam v1.22 was not effective in identifying cross contaminations71, and to avoid affecting comparisons across kits due to the filtering of sample taxa, we did not filter the potential contaminating ASVs from the dataset. To estimate alpha diversity indices (Richness, S; Shannon diversity, H and Inverse Simpson, D2), libraries were rarefied to 99% of the minimum sample depth in the dataset (21,675 reads per sample) using the R package phyloseq69. Log2 Euclidean distances of observed number of taxa (richness, S) across samples extracted with the same kit (referred as: Within kit), with different kits (Between kits), between biological replicates (Between ind) and across kits and biological replicates (Between kits and ind) were estimated with the R package agricolae63. Significant differences between estimates were tested using the Kruskal–Wallis test in the R package agricolae63 with a significance level of 0.05. Plots were generated with the R package ggplot262.
To measure the effect of the explanatory variables, DNA concentration, 260/280 and 260/230 ratios on estimated alpha diversity (response variables: Richness, Shannon diversity) was estimated with Linear Mixed Model (LMM) using the R lmerTest package72,73.
To compare microbial communities generated from different DNA extraction kits, two reference microbial communities (hereafter, reference communities) were computationally generated for each sample type by merging the fastq files generated from the corresponding DNA extracts. Only technical replicates processed without MC were used to generate these reference communities. Fastq files of single libraries and reference communities were reanalyzed together with the software MICCA65 and R packages phyloseq69 and vegan70, as described above.
Beta diversity indices Bray–Curtis, weighted UniFrac and Jaccard were estimated with the R package phyloseq69 and vegan70. To compare beta-diversity estimates across samples, permutational MANOVA (permanova) statistical tests were performed by using the function ‘adonis’ with 999 permutations (R package vegan). Plots were generated with the R package ggplot2.
Differential abundance testing was carried out for each sample type using non-rarefied data with the package DEseq250 in R72. To exclude biological variation and focus on the effect of DNA extraction methods on ASV differential abundances, only technical replicates (N = 3) were used. p-values were corrected for false discovery rate using the Benjamini–Hochberg correction implemented in DEseq2 (significance cutoff of FDR corrected p-values: 0.05). Taxonomic classification of enriched and deleted taxa and the corresponding barplots were performed with the R package phyloseq.
Supplementary Information
Acknowledgements
The authors wish to thank the Fondazione E. Mach for access to facilities, and the staff of the Sequencing and Genotyping Platform for their outstanding support. The authors wish also to thank the Conservation Genomics Unit for their expertise and excellent laboratory management especially during the SARS-cov19 pandemic.
Author contributions
G.G., H.C.H., J.S., N.P. and P.I. conceived the study. G.G., H.C.H., J.S., N.P., P.I., F.C. and T.R. performed field sampling and/or sample preparation. G.G. and E.S. carried out the experiment. G.G. and N.P. carried out the computational analyses. G.G. and H.C.H. drafted the manuscript. All authors helped to edit the manuscript and read and approved the final manuscript.
Funding
The EUREGIO project: MICROVALU—Evaluating microbiodiversity in alpine pastures (Project ID: IPN94) is funded by the "Euregio Tirolo-Alto Adige-Trentino" Interregional Project Network.
Data availability
The raw sequencing data has been deposited at NCBI Sequence Read Archive (SRA) under the BioProject ID PRJNA949849.
Competing interests
The authors declare no competing interests.
Footnotes
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
The online version contains supplementary material available at 10.1038/s41598-024-59086-4.
References
- 1.Banerjee S, van der Heijden MGA. Soil microbiomes and one health. Nat. Rev. Microbiol. 2022;21(1):6–20. doi: 10.1038/s41579-022-00779-w. [DOI] [PubMed] [Google Scholar]
- 2.Bais HP, Park SW, Weir TL, Callaway RM, Vivanco JM. How plants communicate using the underground information superhighway. Trends Plant Sci. 2004;9:26–32. doi: 10.1016/j.tplants.2003.11.008. [DOI] [PubMed] [Google Scholar]
- 3.Blanc C, et al. Nutrition on bacteria by bacterial-feeding nematodes and consequences on the structure of soil bacterial community. Eur. J. Soil Biol. 2006;42:S70–S78. doi: 10.1016/j.ejsobi.2006.06.003. [DOI] [Google Scholar]
- 4.Curry JP, Schmidt O. The feeding ecology of earthworms—A review. Pedobiologia (Jena) 2007;50:463–477. doi: 10.1016/j.pedobi.2006.09.001. [DOI] [Google Scholar]
- 5.Medina-Sauza RM, et al. Earthworms building up soil microbiota, a review. Front. Environ. Sci. 2019;7:450952. doi: 10.3389/fenvs.2019.00081. [DOI] [Google Scholar]
- 6.Ottman N, et al. Soil exposure modifies the gut microbiota and supports immune tolerance in a mouse model. J. Allergy Clin. Immunol. 2019;143:1198–1206.e12. doi: 10.1016/j.jaci.2018.06.024. [DOI] [PubMed] [Google Scholar]
- 7.Haahtela T. A biodiversity hypothesis. Allergy. 2019;74:1445–1456. doi: 10.1111/all.13763. [DOI] [PubMed] [Google Scholar]
- 8.Caporaso JG, et al. Ultra-high-throughput microbial community analysis on the Illumina HiSeq and MiSeq platforms. ISME J. 2012;6(8):1621–1624. doi: 10.1038/ismej.2012.8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Methé BA, et al. A framework for human microbiome research. Nature. 2012;486:215–221. doi: 10.1038/nature11209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gilbert, J. A., Jansson, J. K. & Knight, R. Earth Microbiome Project and Global Systems Biology. mSystems3 (2018). [DOI] [PMC free article] [PubMed]
- 11.Turnbaugh, P. J., et al. The Human Microbiome Project. Nat. 2007 4497164449, 804–810 (2007). [DOI] [PMC free article] [PubMed]
- 12.Tedersoo, L., et al. Global diversity and geography of soil fungi. Science (80-. ).346, 1256688 (2014). [DOI] [PubMed]
- 13.Delgado-Baquerizo, M., et al. A global atlas of the dominant bacteria found in soil. Science (80-. ).325, 320–325 (2018). [DOI] [PubMed]
- 14.Meadow, J. F., Altrichter, A. E. & Green, J. L. Mobile phones carry the personal microbiome of their owners. PeerJ2014 (2014). [DOI] [PMC free article] [PubMed]
- 15.Blaustein RA, et al. Toothbrush microbiomes feature a meeting ground for human oral and environmental microbiota. Microbiome. 2021;9:1–14. doi: 10.1186/s40168-020-00983-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Checinska Sielaff A, et al. Characterization of the total and viable bacterial and fungal communities associated with the International Space Station surfaces. Microbiome. 2019;7:1–21. doi: 10.1186/s40168-019-0666-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wnuk E, et al. The effects of humic substances on DNA isolation from soils. PeerJ. 2020;8:e9378. doi: 10.7717/peerj.9378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Schrader C, Schielke A, Ellerbroek L, Johne R. PCR inhibitors—occurrence, properties and removal. J. Appl. Microbiol. 2012;113:1014–1026. doi: 10.1111/j.1365-2672.2012.05384.x. [DOI] [PubMed] [Google Scholar]
- 19.Kreisinger, J., Bastien, G., Hauffe, H. C., Marchesi, J. & Perkins, S. E. Interactions between multiple helminths and the gut microbiota in wild rodents. Philos. Trans. R. Soc. B Biol. Sci.370 (2015). [DOI] [PMC free article] [PubMed]
- 20.Ducarmon, Q. R., Hornung, B. V. H., Geelen, A. R., Kuijper, E. J. & Zwittink, R. D. Toward standards in clinical microbiota studies: Comparison of three DNA extraction methods and two bioinformatic pipelines. mSystems5 (2020). [DOI] [PMC free article] [PubMed]
- 21.Pollock, J., Glendinning, L., Wisedchanwet, T. & Watson, M. The madness of microbiome: Attempting to find consensus ‘best practice’ for 16S microbiome studies. Appl. Environ. Microbiol.84 (2018). [DOI] [PMC free article] [PubMed]
- 22.McLaren, M. R., Willis, A. D. & Callahan, B. J. Consistent and correctable bias in metagenomic sequencing experiments. Elife8 (2019). [DOI] [PMC free article] [PubMed]
- 23.Wagner AO, Praeg N, Reitschuler C, Illmer P. Effect of DNA extraction procedure, repeated extraction and ethidium monoazide (EMA)/propidium monoazide (PMA) treatment on overall DNA yield and impact on microbial fingerprints for bacteria, fungi and archaea in a reference soil. Appl. Soil Ecol. 2015;93:56–64. doi: 10.1016/j.apsoil.2015.04.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Costea PI, et al. Towards standards for human fecal sample processing in metagenomic studies. Nat. Biotechnol. 2017;35:1069–1076. doi: 10.1038/nbt.3960. [DOI] [PubMed] [Google Scholar]
- 25.Wagner Mackenzie B, Waite DW, Taylor MW. Evaluating variation in human gut microbiota profiles due to DNA extraction method and inter-subject differences. Front. Microbiol. 2015;0:130. doi: 10.3389/fmicb.2015.00130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kennedy NA, et al. The impact of different DNA extraction kits and laboratories upon the assessment of human gut microbiota composition by 16S rRNA gene sequencing. PLoS One. 2014;9:1–9. doi: 10.1371/journal.pone.0088982. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Fiedorová K, et al. The impact of DNA extraction methods on stool bacterial and fungal microbiota community recovery. Front. Physiol. 2019;10:1–11. doi: 10.3389/fmicb.2019.00821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Terrat S, et al. Molecular biomass and MetaTaxogenomic assessment of soil microbial communities as influenced by soil DNA extraction procedure. Microb. Biotechnol. 2012;5:135–141. doi: 10.1111/j.1751-7915.2011.00307.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Santos SS, Nielsen TK, Hansen LH, Winding A. Comparison of three DNA extraction methods for recovery of soil protist DNA. J. Microbiol. Methods. 2015;115:13–19. doi: 10.1016/j.mimet.2015.05.011. [DOI] [PubMed] [Google Scholar]
- 30.Bollmann-Giolai A, et al. A low-cost pipeline for soil microbiome profiling. Microbiologyopen. 2020;9:e1133. doi: 10.1002/mbo3.1133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Ammazzalorso AD, Zolnik CP, Daniels TJ, Kolokotronis S-O. To beat or not to beat a tick: Comparison of DNA extraction methods for ticks (Ixodes scapularis) PeerJ. 2015;3:e1147. doi: 10.7717/peerj.1147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Schiebelhut LM, Abboud SS, Daglio LEG, Swift HF, Dawson MN. A comparison of DNA extraction methods for high-throughput DNA analyses. Mol. Ecol. Resour. 2017;17:721–729. doi: 10.1111/1755-0998.12620. [DOI] [PubMed] [Google Scholar]
- 33.Ketchum RN, et al. DNA extraction method plays a significant role when defining bacterial community composition in the marine invertebrate Echinometra mathaei. Front. Mar. Sci. 2018;0:255. doi: 10.3389/fmars.2018.00255. [DOI] [Google Scholar]
- 34.Majaneva M, Diserud OH, Eagle SHC, Hajibabaei M, Ekrem T. Choice of DNA extraction method affects DNA metabarcoding of unsorted invertebrate bulk samples. Metabarcoding Metagenom. 2018;2:1–12. doi: 10.3897/mbmg.2.26664. [DOI] [Google Scholar]
- 35.Hermans SM, Buckley HL, Lear G. Optimal extraction methods for the simultaneous analysis of DNA from diverse organisms and sample types. Mol. Ecol. Resour. 2018;18:557–569. doi: 10.1111/1755-0998.12762. [DOI] [PubMed] [Google Scholar]
- 36.Hauffe HC, Barelli C. Conserve the germs: The gut microbiota and adaptive potential. Conserv. Genet. 2019;20:19–27. doi: 10.1007/s10592-019-01150-y. [DOI] [Google Scholar]
- 37.Praeg N, Pauli H, Illmer P. Microbial diversity in bulk and rhizosphere soil of Ranunculus glacialis along a high-alpine altitudinal gradient. Front. Microbiol. 2019;0:1429. doi: 10.3389/fmicb.2019.01429. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Brinkman TJ, Schwartz MK, Person DK, Pilgrim KL, Hundertmark KJ. Effects of time and rainfall on PCR success using DNA extracted from deer fecal pellets. Conserv. Genet. 2010;11:1547–1552. doi: 10.1007/s10592-009-9928-7. [DOI] [Google Scholar]
- 39.Desneux J, Pourcher AM. Comparison of DNA extraction kits and modification of DNA elution procedure for the quantitation of subdominant bacteria from piggery effluents with real-time PCR. Microbiologyopen. 2014;3:437–445. doi: 10.1002/mbo3.178. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Hart ML, Meyer A, Johnson PJ, Ericsson AC. Comparative evaluation of DNA extraction methods from feces of multiple host species for downstream next-generation sequencing. PLoS One. 2015;10:e0143334. doi: 10.1371/journal.pone.0143334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Sáenz JS, Roldan F, Junca H, Arbeli Z. Effect of the extraction and purification of soil DNA and pooling of PCR amplification products on the description of bacterial and archaeal communities. J. Appl. Microbiol. 2019;126:1454–1467. doi: 10.1111/jam.14231. [DOI] [PubMed] [Google Scholar]
- 42.Pollock, J., Glendinning, L., Wisedchanwet, T. & Watson, M. The madness of microbiome: Attempting to find consensus “best practice” for 16S microbiome studies. Appl. Environ. Microbiol.84 (2018). [DOI] [PMC free article] [PubMed]
- 43.Galla G, et al. Mock community as an in situ positive control for amplicon sequencing of microbiotas from the same ecosystem. Sci. Rep. 2023;13:1–13. doi: 10.1038/s41598-023-30916-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Hugon P, et al. Molecular studies neglect apparently gram-negative populations in the human gut microbiota. J. Clin. Microbiol. 2013;51:3286–3293. doi: 10.1128/JCM.00473-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Bluemel S, et al. Intestinal and hepatic microbiota changes associated with chronic ethanol administration in mice. Gut Microbes. 2020;11:265–275. doi: 10.1080/19490976.2019.1595300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.IJssennagger N, et al. Dietary heme alters microbiota and mucosa of mouse colon without functional changes in host-microbe cross-talk. PLoS One. 2012;7:e49868. doi: 10.1371/journal.pone.0049868. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Sekhohola-Dlamini L, Dlamini P, Selvarajan R, Ogola HJO, Tekere M. Influences of geochemical factors and substrate availability on Gram-positive and Gram-negative bacterial distribution and bio-processes in ageing municipal landfills. Int. Microbiol. 2021;24:311–324. doi: 10.1007/s10123-021-00167-z. [DOI] [PubMed] [Google Scholar]
- 48.Wei M, et al. 35 years of manure and chemical fertilizer application alters soil microbial community composition in a Fluvo-aquic soil in Northern China. Eur. J. Soil Biol. 2017;82:27–34. doi: 10.1016/j.ejsobi.2017.08.002. [DOI] [Google Scholar]
- 49.Yang F, et al. Assessment of fecal DNA extraction protocols for metagenomic studies. Gigascience. 2020;9:1–12. doi: 10.1093/gigascience/giaa071. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:1–21. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Guo F, Zhang T. Biases during DNA extraction of activated sludge samples revealed by high throughput sequencing. Appl. Microbiol. Biotechnol. 2013;97:4607–4616. doi: 10.1007/s00253-012-4244-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Maukonen J, Simões C, Saarela M. The currently used commercial DNA-extraction methods give different results of clostridial and actinobacterial populations derived from human fecal samples. FEMS Microbiol. Ecol. 2012;79:697–708. doi: 10.1111/j.1574-6941.2011.01257.x. [DOI] [PubMed] [Google Scholar]
- 53.Salonen A, et al. Comparative analysis of fecal DNA extraction methods with phylogenetic microarray: Effective recovery of bacterial and archaeal DNA using mechanical cell lysis. J. Microbiol. Methods. 2010;81:127–134. doi: 10.1016/j.mimet.2010.02.007. [DOI] [PubMed] [Google Scholar]
- 54.Mackenzie BW, Waite DW, Taylor MW. Evaluating variation in human gut microbiota profiles due to DNA extraction method and inter-subject differences. Front. Microbiol. 2015;6:1–11. doi: 10.3389/fmicb.2015.00130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Gloor GB, Macklaim JM, Pawlowsky-Glahn V, Egozcue JJ. Microbiome datasets are compositional: And this is not optional. Front. Microbiol. 2017;0:2224. doi: 10.3389/fmicb.2017.02224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Wu X, et al. Metagenomic insights into nitrogen and phosphorus cycling at the soil aggregate scale driven by organic material amendments. Sci. Total Environ. 2021;785:147329. doi: 10.1016/j.scitotenv.2021.147329. [DOI] [PubMed] [Google Scholar]
- 57.Wang Z, Cao Y, Wright AL, Shi X, Jiang X. Different ammonia oxidizers are responsible for nitrification in two neutral paddy soils. Soil Tillage Res. 2019;195:104433. doi: 10.1016/j.still.2019.104433. [DOI] [Google Scholar]
- 58.Zhao J, Wang B, Jia Z. Phylogenetically distinct phylotypes modulate nitrification in a paddy soil. Appl. Environ. Microbiol. 2015;81:3218–3227. doi: 10.1128/AEM.00426-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Wang B, et al. Active ammonia oxidizers in an acidic soil are phylogenetically closely related to neutrophilic archaeon. Appl. Environ. Microbiol. 2014;80:1684–1691. doi: 10.1128/AEM.03633-13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Viglierchio DR, Schmitt RV. On the methodology of nematode extraction from field samples: BaermannFunnel modifications. J. Nematol. 1983;15:438. [PMC free article] [PubMed] [Google Scholar]
- 61.Barillot CDC, Sarde C-O, Bert V, Tarnaud E, Cochet N. A standardized method for the sampling of rhizosphere and rhizoplan soil bacteria associated to a herbaceous root system. Ann. Microbiol. 2013;63:471–476. doi: 10.1007/s13213-012-0491-y. [DOI] [Google Scholar]
- 62.Wickham, H. ggplot2 Elegant Graphics for Data Analysis Second Edition (2016).
- 63.De Mendiburu, F. Agricolae: Statistical procedures for agricultural research. R package version, 2014, 1.1.https://scholar.google.com/scholar?hl=it&as_sdt=0%2C5&q=Agricolae%3A+Statistical+Procedures+for+Agricultural+Research&btnG= (2014).
- 64.Apprill A, McNally S, Parsons R, Weber L. Minor revision to V4 region SSU rRNA 806R gene primer greatly increases detection of SAR11 bacterioplankton. Aquat. Microb. Ecol. 2015;75:129–137. doi: 10.3354/ame01753. [DOI] [Google Scholar]
- 65.Albanese D, Fontana P, De Filippo C, Cavalieri D, Donati C. MICCA: A complete and accurate software for taxonomic profiling of metagenomic data. Sci. Rep. 2015;5:1–7. doi: 10.1038/srep09743. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Edgar, R. C. UNOISE2: Improved error-correction for Illumina 16S and ITS amplicon sequencing. bioRxiv 081257 (2016). 10.1101/081257.
- 67.Schloss PD, et al. Introducing mothur: Open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl. Environ. Microbiol. 2009;75:7537–7541. doi: 10.1128/AEM.01541-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Quast C, et al. The SILVA ribosomal RNA gene database project: Improved data processing and web-based tools. Nucleic Acids Res. 2013;41:D590–D596. doi: 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.McMurdie PJ, Holmes S. phyloseq: An R package for reproducible interactive analysis and graphics of microbiome census data. PLoS One. 2013;8:e61217. doi: 10.1371/journal.pone.0061217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Oksanen, J., Kindt, R., O’, B. & Maintainer, H. The vegan Package Title Community Ecology Package (2005).
- 71.Davis NM, Proctor DM, Holmes SP, Relman DA, Callahan BJ. Simple statistical identification and removal of contaminant sequences in marker-gene and metagenomics data. Microbiome. 2018;6:1–14. doi: 10.1186/s40168-018-0605-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.Team, R. C. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2021).
- 73.Kuznetsova A, Brockhoff PB, Christensen RHB. lmerTest package: Tests in linear mixed effects models. J. Stat. Softw. 2017;82:1–26. doi: 10.18637/jss.v082.i13. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The raw sequencing data has been deposited at NCBI Sequence Read Archive (SRA) under the BioProject ID PRJNA949849.