Skip to main content
MicrobiologyOpen logoLink to MicrobiologyOpen
. 2020 Dec 25;10(1):e1144. doi: 10.1002/mbo3.1144

Hidden heterogeneity and co‐occurrence networks of soil prokaryotic communities revealed at the scale of individual soil aggregates

Márton Szoboszlay 1, Christoph C Tebbe 1,
PMCID: PMC7884235  PMID: 33369241

Abstract

Sequencing PCR‐amplified gene fragments from metagenomic DNA is a widely applied method for studying the diversity and dynamics of soil microbial communities. Typically, DNA is extracted from 0.25 to 1 g of soil. These amounts, however, neglect the heterogeneity of soil present at the scale of soil aggregates and thus ignore a crucial scale for understanding the structure and functionality of soil microbial communities. Here, we show with a nitrogen‐depleted agricultural soil the impact of reducing the amount of soil used for DNA extraction from 250 mg to approx. 1 mg to access spatial information on the prokaryotic community structure, as indicated by 16S rRNA gene amplicon analyses. Furthermore, we demonstrate that individual aggregates from the same soil differ in their prokaryotic community compositions. The analysis of 16S rRNA gene amplicon sequences from individual soil aggregates allowed us, in contrast to 250 mg soil samples, to construct a co‐occurrence network that provides insight into the structure of microbial associations in the studied soil. Two dense clusters were apparent in the network, one dominated by Thaumarchaeota, known to be capable of ammonium oxidation at low N concentrations, and the other by Acidobacteria subgroup 6, representing an oligotrophic lifestyle to obtain energy from SOC. Overall this study demonstrates that DNA obtained from individual soil aggregates provides new insights into how microbial communities are assembled.

Keywords: microhabitats, network analysis, soil aggregates, soil DNA extraction, soil microbial diversity, spatial scales


DNA extracted from ≥250‐mg of soil neglects spatial information stored in individual soil aggregates and thus limits our understanding of the structure and functionality of soil microbial communities. Here, we demonstrate that DNA obtained from individual mg‐sized soil aggregates provides new insights into how microbial communities are assembled and increases the likelihood of finding truly interacting microbial taxa by network analysis.

graphic file with name MBO3-10-e1144-g010.jpg

1. INTRODUCTION

One of the major challenges in microbial ecology is to gain a predictive understanding of microbial diversity through elucidating the principles, patterns, and interactions that lead to the assembly of highly diverse microbial communities as found for example in soil (Green et al., 2008). To achieve this, it is essential to consider temporal and spatial variation in microhabitat conditions. In soil microbial ecology, the latter, however, is commonly ignored (Lombard et al., 2011; Vos et al., 2013). Instead, large composite samples are favored to obtain an overview of microbial diversity at the scale of a plot or a field. This neglects the fine‐scale heterogeneity of soil structure, and thus loses much information on patterns of community assembly (Thakur et al., 2020).

Soil structure develops as primary particles of different sizes and mineral composition, that is, clay, silt, and sand interact with each other, and with organic material to build microaggregates and macroaggregates, that are below or above 250 µm in diameter respectively (Six et al., 2000). Most bacterial cells occur inside aggregates rather than on their surfaces (Ranjard, Poly, et al., 2000), and biogeochemical cycles, which are key ecosystem services driven by an interacting microbial community (Smith et al., 2015), are considered to mainly occur within aggregates (Wilpiszeski et al., 2019). Soil aggregates have been regarded as “massively concurrent evolutionary incubators” (Rillig et al., 2017) or as “microbial villages” (Wilpiszeski et al., 2019) that represent small communities separated by distance and physical barriers and connected only periodically, for example, during wetting events.

DNA‐based methods to assess microbial diversity typically start with extracting 250 mg to 1 g of soil material (Young et al., 2014). This strategy has been useful to investigate the overall microbial diversity of soils (Fierer & Jackson, 2006; Roesch et al., 2007), its variation across geographical regions (Griffiths et al., 2011; Karimi et al., 2018), or its response to land‐use change at a continental scale (Szoboszlay et al., 2017). However, to understand the processes and interactions occurring within soil microbial communities, it would be rewarding to increase the spatial resolution of the community analysis to individual aggregates and investigate microbial diversity in these spatial entities. Approaching the “calling distance” of microbial interactions (Nunan, 2017) would increase the likelihood of detecting interacting microbial partners. Correlation networks have increasingly been applied to reveal relationships between microbial community members as detected by PCR amplicon sequence analyses (Banerjee et al., 2016; Barberan et al., 2012; Karimi et al., 2020; Li et al., 2017). However, to interpret a positive correlation as a mutualistic and a negative correlation as an antagonistic relationship is possible only for community members sharing the same microhabitat (Weiss et al., 2016). Without distinction of microhabitats, it is hard to assign the presence of taxa to niche exclusion (Faust & Raes, 2012). Analyzing soil DNA extracted from 250 mg to 1 g represents mixed DNA from many microhabitats. In contrast, working with individual soil aggregates should strongly enhance the ecological significance of soil microbial network analyses.

The potential impact of the heterogeneous soil constituents on structuring microbial communities at a microscale has already been demonstrated with pooled soil primary particles, where the majority of abundant bacterial and fungal taxa exhibited particular preferences for clay, silt, or sand‐sized fractions with particulate organic matter (Hemkemeyer et al., 2018, 2019). Furthermore, comparing pooled samples of micro‐ and macroaggregates revealed that these two size classes also differ in microbial community composition (Constancias et al., 2014; Davinic et al., 2012; Fox et al., 2018), diversity (Bach et al., 2018; Ivanova et al., 2015) and their response to stress (Ranjard, Nazaret, et al., 2000). However, information on the heterogeneity of microbial communities of individual aggregates within specific aggregate fractions is still lacking. A major limitation of analyzing individual aggregates is the difficulty of obtaining a sufficient quantity of nucleic acids from small amounts of soil for molecular methods. Attempts made so far, therefore, either pooled several aggregates for DNA extraction (Bach et al., 2018; Bailey et al., 2013; Ivanova et al., 2015), sampled very large aggregates weighing 20–70 mg (Kravchenko et al., 2014), or applied whole genome amplification (WGA) (Bailey, McCue, et al., 2013). These solutions, however, do not deliver data on individual aggregates, provide coarse spatial resolution, or generate substantial bias in the results (Direito et al., 2014; Wang et al., 2016), respectively. To our knowledge, the only study that reported the bacterial community composition in smaller, that is below 3 mm, individual aggregates without applying WGA utilized taxonomic microarrays; a method of relatively low resolution, and focused solely on linking enzyme activity profiles with community structure (Kim et al., 2015). Furthermore, applying molecular methods to small samples that yield very low amounts of nucleic acids require validation to prove the consistent performance of the methods and rule out the possibility of contamination and stochastic effects influencing the results.

The tremendous scientific potential that individual soil aggregate‐based microbial community analysis should have for characterizing the heterogeneity of soil microbial communities at a biologically and ecologically more meaningful scale motivated us testing the following hypotheses in this study:

  1. Metagenomic DNA of sufficient quantity and quality for PCR‐based analyses can be extracted from soil samples in the mg‐range, thus representing the scale of macroaggregates

  2. Increasing spatial resolution reveals heterogeneity in soil bacterial and archaeal community structure and abundance

  3. A higher heterogeneity seen among small soil samples is not a result of contamination or sub‐optimal performance of molecular methods

  4. Comparing individual aggregates from the same soil unveils patterns of microbial co‐occurrence within the soil microbial community not seen with the commonly used 250 mg soil sample size.

2. MATERIALS AND METHODS

2.1. Overview of the experiments

Three experiments were conducted in this study. In the 1st experiment, samples decreasing in size from 250 mg to 1 mg taken from the same soil were subjected to DNA extraction. To address the first two hypotheses, qPCR and high‐throughput amplicon sequencing targeting the 16S rRNA gene were conducted to characterize the bacterial, archaeal, and fungal abundance and the prokaryotic diversity in these samples. The 2nd experiment addressed the third hypothesis by comparing 250 mg soil samples and aliquots of a homogenized soil slurry. The volumes of the aliquots were chosen to contain the amount of DNA expected from 1, 5, and 25 mg soil samples. Since all aliquots were taken from the same thoroughly homogenized soil slurry, differences in prokaryotic community structure between these soil homogenate samples must be results of contamination, stochastic effects, or sub‐optimal performance of the DNA extraction and PCR. In the 3rd experiment, DNA was extracted from individual aggregates and 250 mg soil samples taken from the same soil to address the fourth hypothesis. All experiments included several control samples to test for the presence of contamination.

2.2. Soil sampling and DNA extraction

The soil used in all experiments was loam topsoil of a Haplic Chernozem (FAO classification) from the Bad Lauchstädt experimental research station of the Helmholtz Centre for Environmental Research in Germany (51°24'N 11°53'E) (Merbach & Schulz, 2013). It originated from the Static Fertilization Experiment, initiated in 1902, and samples were collected in December from a plot without any fertilization since 1903 (treatment NIL) (Ludwig et al., 2011), which was under long‐term sugar beet—potato—winter wheat—barley rotation. Consequently, the soil was compared with its fertilized variants depleted in nitrogen (Blair et al., 2006). The soil samples had a pH value of 7.1 (in 0.01 M CaCl2) and 17.7 mg kg−1 organic C. It was sieved (2 mm mesh size) and stored at 4°C until use.

Before sampling, approximately 100 g of soil was incubated at room T in the dark for 24 h. The soil was then spread out in a sterile Petri dish and samples were taken with sterilized spatulas directly into the bead‐beating tubes of the DNA extraction kit. Control samples were included in all experiments. They were handled together with, and the same way as the soil samples. DNA was extracted with the Quick‐DNA Fecal/Soil Microbe Microprep Kit (Zymo Research, Freiburg, Germany) including two 45 s bead‐beating cycles in an MP FastPrep‐24 5G Instrument (MP Biomedicals, Eschwege, Germany) at 6.5 m/s speed with a 300 s break in between. The DNA extracts were eluted in 30 µl elution buffer. All work was done in a biosafety cabinet decontaminated with UV light to minimize the chance of contamination. Measurement of the DNA yield was attempted with Quant‐iT PicoGreen dsDNA Assay Kit (Molecular Probes, Life Technologies, Eugene, OR) but accurate quantification was not possible for many of the small samples due to results close to the background fluorescence in the blank controls. In preparation for this study, several DNA extractions methods were tested but were not found suitable. This included the Fast DNA Spin kit for soil (MP Biomedicals, LLC, Illkrich, France) and variations of the phenol‐chloroform protocol (Miller et al., 1999).

In the 1st experiment, five size‐groups of soil samples were collected: 250, 125, 25, 5, and 1 mg, respectively. Eight samples were taken from each size‐group along with six control samples. Sample weights from all experiments are listed in Table 1. In the 2nd experiment, ten samples of 250 mg soil and six control samples were taken. Eight of the soil samples were processed normally in the DNA extraction, while for the other two, the DNA extraction was interrupted after the centrifugation following the bead beating. By this point, the samples had been turned into homogenized slurry by the bead beating, and the soil debris had been separated from the supernatant that contained the metagenomic DNA. The supernatant from the two samples was pooled and homogenized by vortexing. The mass of the resulting suspension was 1112 mg and it originated from 497 mg soil in total. Accordingly, a 55.9 mg aliquot of this suspension contained the amount of DNA extractable from 25 mg soil, an 11.2 mg aliquot the amount from 5 mg soil, and a 2.2 mg aliquot the amount from 1 mg soil. Eight aliquots from each of these sizes, hereafter 25, 5, and 1 mg soil homogenate samples, were taken and mixed with 350 µl BashingBead Buffer from the DNA extraction kit to continue the DNA extraction. In the 3rd experiment, 37 individual soil aggregates, weighing 5.3 mg on average and similar in size (ca. 2 mm), were taken for DNA extraction along with 35 samples of 250 mg soil and nine control samples.

TABLE 1.

Number of samples and soil weights within each sample category

Soil weight class or sample type Sample weight (mg) ±SD Number of soil samples
1st experiment
250 mg 251 ± 1 8
125 mg 125 ± 1 8
25 mg 25.1 ± 0.4 8
5 mg 4.9 ± 0.2 8
1 mg 1.1 ± 0.2 8
Control, no soil 6
2nd experiment
250 mg 252 ± 3 8
25 mg soil homogenate 8
5 mg soil homogenate 8
1 mg soil homogenate 8
Control, no soil 6
3rd experiment
250 mg 251 ± 0 35
Soil aggregate 5.3 ± 1.6 37
Control 9

2.3. Abundances of microbial groups assessed by qPCR

The abundance of Bacteria, Archaea, and Fungi was assessed by qPCR targeting the 16S rRNA gene and the ITS region as described previously (Hemkemeyer et al., 2015). Archaeal and fungal abundance were investigated only in the 1st experiment. Reactions were run in a Bio‐Rad CFX96 real‐time PCR system in duplicates from different dilutions of the DNA extracts. In the case of the 250 and 125 mg soil samples, 50‐ and 100‐fold dilutions were taken; from the 25 mg samples 10‐ and 20‐fold dilutions; and from the 5, 1 mg, individual aggregate, and control samples undiluted DNA extracts and twofold dilutions were used. PCR efficiencies were 95.9%–104.6% for the bacterial 16S rRNA gene, 94.2%–96.2% for the archaeal 16S rRNA gene, and 83.3%–84.3% for the fungal ITS with R 2 ≥0.995 in all cases. Results were compared with Tukey's HSD tests or Welch's t‐test in case of the data from the 3rd experiment. The analysis was carried out in R 3.4.4 (www.r‐project.org). One of the 250 mg samples from the 1st experiment yielded a magnitude higher copy number in the fungal ITS qPCR assay than the others. It was treated as an outlier and excluded from the analysis.

2.4. High‐throughput sequencing of 16S rRNA gene amplicons and data processing

To characterize the bacterial and archaeal communities in the samples, DNA extracts were subjected to high‐throughput amplicon sequencing of the V4 region of the 16S rRNA gene following the protocol of Kozich et al. (2013) with primers updated to match the modified 515f and 806r sequences according to Walters et al. (2016). In the case of the small soil samples and control samples, due to the low DNA yield, 10 µl DNA extract was used as a template in the PCRs. Paired‐end sequencing was done on Illumina MiSeq instruments at StarSEQ (Mainz, Germany). Samples from the same experiment were sequenced in the same run. For the availability of all data, see Data Availability Statement.

The sequencing data from the three experiments were analyzed separately. Raw reads were processed with the dada2 (version 1.6.0) pipeline (Callahan et al., 2016) in R 3.4.4. Forward and reverse reads were truncated at positions 240 and 90, respectively. Reads with any ambiguous bases were discarded as well as forward reads with over two and reverse reads with over one expected error. The data from the 2nd experiment had higher quality allowing the reverse reads to be truncated at position 130 and keeping those with two or less expected errors. Error models were constructed from 106 randomly selected reads. Sequence variants (SVs) were inferred using the pool option. Forward and reverse SVs were merged trimming overhangs, and the removeBimeraDenovo function was employed to detect chimeras. The SVs were classified according to the SILVA reference version 132 (Pruesse et al., 2007) accepting only results with ≥70% bootstrap support. SVs shorter than 220 nt or longer than 275 nt, or identified as chimeric, mitochondrial, or chloroplast sequences, or not classified into Bacteria or Archaea were deleted. Good's index was calculated to estimate the coverage of the SVs. SVs with ≥0.1% relative abundance in any of the control samples of an experiment were regarded as a potential contaminant and removed from the dataset of that experiment.

2.5. Analysis of sequencing results

Principal component analysis (PCA) plots were created in R using the rda function of the vegan package version 2.5‐2 (Oksanen et al., 2018). To decrease the sparsity of the data, SVs not reaching 0.1% relative abundance in any of the samples included in the PCA were removed. Zeroes were replaced with the count zero multiplicative (CZM) method using the zCompositions package version 1.1.1 (Palarea‐Albaladejo & Martin‐Fernandez, 2015), and centered log‐ratio (CLR) transformation was applied to the data to correct for compositional effects and differences in sequencing depth (Gloor et al., 2016).

Aitchison distances between the samples were calculated as Euclidean distances in the CLR transformed dataset (Gloor et al., 2017). SVs that did not have at least 0.1% relative abundance in any of the compared samples were removed and zeroes were replaced with the CZM method to allow CLR transformation before calculating Euclidean distances with the “vegdist” function of the vegan package. Results were compared with Tukey's HSD tests.

Plots illustrating the prevalence of abundant SVs among the soil samples were prepared in Cytoscape 3.7.1 (www.cytoscape.org). CoNet 1.1.1 beta (Faust & Raes, 2016) in Cytoscape was used to construct co‐occurrence networks from the data from the 3rd experiment. To limit the number of parallel significance tests and the sparsity of the data, only SVs with ≥0.2% relative abundance in at least one of the samples were included. Separate networks were constructed for the 250 mg soil samples and the soil aggregates. However, the selection of SVs was done on the joint data matrix to ensure that both networks include the same SVs. The data were relativized to the total sequence count of each sample. Pearson and Spearman correlations, mutual information (jsl setting), and Bray‐Curtis and Kullback‐Leibler (with a pseudo count of 10−8) dissimilarities were calculated and the 1 000 highest and 1 000 lowest scoring edges from each of the five metrics were kept. The ReBoot method (Faust et al., 2012), which mitigates compositional effects, was used to assess the significance of the edges based on 1000 permutations with renormalization and 1000 bootstrap iterations. In the network of the soil aggregates, edges with scores below the 2.5th and over the 97.5th percentile of the bootstrap distribution or not supported by at least three of the five metrics were considered unstable and removed. Brown's method of p‐value merging was applied followed by Benjamini‐Hochberg correction. Only edges with q ≤ 0.05 were included in the final network. In the network of the 250 mg samples, unstable edges were not removed and the Benjamini‐Hochberg correction was not applied as otherwise no edges were retained. The networks were visualized in Cytoscape using the compound spring embedder layout. Topological parameters were calculated using NetworkAnalyzer version 2.7 (Assenov et al., 2008).

3. RESULTS

3.1. Microbial DNA can be extracted from soil samples in the mg‐range

DNA could be extracted from soil samples as little as 0.87 mg as well as from intact soil aggregates. In all cases, the extracted DNA was sufficient for 16S rRNA gene amplicon sequencing and the qPCR assays. Accurate quantification of the DNA yield with PicoGreen assays was not possible because the fluorescence readings from many of the small samples were close to the background fluorescence in the blank controls.

To assess whether the DNA extraction could recover microbial DNA with similar efficiency from small quantities of soil as from 250 mg samples, estimates of the abundances of Bacteria, Archaea, and Fungi in 1 g of soil were calculated from the qPCR results (Figure 1; Table S1: https://doi.org/10.5281/zenodo.4282475). Similar estimates were obtained from the 250 and 5 mg soil samples from the 1st experiment. Estimates from the 1 mg samples tended to be lower but were not significantly different. In contrast, the estimates of fungal abundance in a gram of soil were significantly lower from the 1 and 5 mg than from the 250 mg samples. Estimates of bacterial abundance obtained from the 25 and 125 mg samples and archaeal abundance from the 25 mg samples were significantly higher than from the 250 mg samples. Bacterial abundance estimates from the single aggregates and 250 mg soil samples of the 3rd experiment covered the same range (Figure A1), but the mean of the estimates from the single aggregates (2.27 × 109 copies/g soil) was lower (p < 0.001) than from the 250 mg samples (4.69 × 109 copies/g soil).

FIGURE 1.

FIGURE 1

Estimates of (a) bacterial, (b) archaeal, and (c) fungal abundance in a gram of soil based on qPCR from the samples from the 1st experiment (gene copy numbers per g of soil wet weight). One of the 250 mg soil samples was an outlier in the fungal ITS qPCR results and is not included in the plot. Sample groups not labeled with the same letter were significantly different in Tukey's HSD tests. Thick lines indicate the median values, the upper and lower hinges the 75th and 25th percentile, whiskers extend to the data extremes

The control samples were amplified in the qPCR assay targeting the bacterial 16S rRNA gene but yielded only 3–356 copies per µl DNA extract. In comparison, the 1 mg soil samples had 8777–45,534 copies per µl DNA extract (Table S1: https://doi.org/10.5281/zenodo.4282475). The control samples from the 1st experiment had 0 to 13 fungal ITS copies per µl DNA extract and none of them showed amplification in the archaeal 16S rRNA gene qPCR assays.

3.2. Removing potentially contaminant SVs from the sequencing results

It was possible to generate sequencing results from all control samples (the complete dataset with the taxonomic classification of the SVs is in Table S2: https://doi.org/10.5281/zenodo.4282475). Good's coverage index of the SVs was >0.993 in all of them indicating that their complete prokaryotic community was captured by sequencing (Table A1). They were similar in their prokaryotic community structures but very different from the soil samples (Figure A2). Every SV that reached 0.1% relative abundance in any of the control samples of an experiment was considered as a potential contaminant. There were 450, 77, and 591 such SVs in the datasets of the 1st, 2nd, and 3rd experiments, respectively. In the data from the 1st experiment, these SVs together covered 98.9%–99.8% of the sequences obtained from the control samples and 2.5%–6.6% of the sequences from the soil samples. In the 2nd and 3rd experiments, 92.5%–99.5% and 98.6%–99.6% of the sequences from the control samples, and 2.9%–17.3% and 5.9%–9.4% of the sequences from the soil samples, respectively, were covered by the potentially contaminant SVs. To mitigate the effect of contamination on the results, the potentially contaminant SVs were deleted from the data matrices before further analysis.

3.3. Increasing spatial resolution reveals heterogeneity in soil bacterial and archaeal community structure but not in their abundance

The yield of high‐quality 16S rRNA gene amplicon sequences was lower from the 1 mg samples than from the 5 and 25 mg samples in the 1st experiment (Figure 2a). Apart from this, however, the sequencing yield did not differ between the sample groups within any of the experiments. Thus, it is possible to compare the number of SVs detected in the samples without rarefying the data. The number of SVs in the 1st experiment was not significantly different between the 250, 125, and 25 mg samples, but decreased significantly in the 5 mg and even further among the 1 mg samples (Figure 2b). An opposite trend was clear in the Good's coverage index (Table A1). Similarly, significantly lower numbers of SVs were detected in the single aggregates than in the 250 mg soil samples of the 3rd experiment.

FIGURE 2.

FIGURE 2

Number of (a) sequences and (b) sequence variants (SVs) in the samples after the removal of potentially contaminant SVs. Thick lines indicate the median values, the upper and lower hinges the 75th and 25th percentile, whiskers extend to the data extremes. Letters indicate significant differences between sample groups of the 1st experiment according to Tukey's HSD tests. * indicates a significant difference based on Welch's t‐test between the aggregate and the 250 mg soil samples of the 3rd experiment

Principal component analysis from the sequencing results from the 1st experiment arranged all 250 and 125 mg samples, and most 25 mg samples into a single, tight group, indicating high similarity in their prokaryotic community structures (Figure 3a). In contrast, samples from the 5 mg and more so from the 1 mg categories, showed higher heterogeneity in community structure. Similarly, PCA indicated heterogeneity between individual aggregates that was not seen among the 250 mg samples in the 3rd experiment (Figure 3b).

FIGURE 3.

FIGURE 3

Principle component analyses (PCA) plots from the 16S rRNA gene sequencing data from the 1st (a) and 3rd (b) experiments

The abundant SVs (≥0.1% relative abundance in at least one of the samples) in the 250, 125, and 25 mg samples from the 1st experiment were almost all detectable in each sample, showing that the composition of the soil prokaryotic community appears uniform when investigated at such a coarse spatial resolution (Figure 4). In contrast, 172 of the abundant SVs detected in the 1 mg samples were unique to just one or two of these samples. Among these SVs, representatives of Planctomycetes, Proteobacteria, and Acidobacteria were especially numerous, while Thaumarchaeota and Actinobacteria were dominant among the SVs present in all samples. The 5 mg samples represented a level of spatial resolution at which some heterogeneity in the prevalence of the abundant SVs was clear with 22 of them detectable in two or only in a single sample.

FIGURE 4.

FIGURE 4

SVs arranged according to how many of the 1, 5, 25, 125, or 250 mg samples from the 1st experiment they were detected in. Only SVs that reached at least 0.1% relative abundance in any of the samples are included. Each node represents one SV colored based on its phylum‐level classification and sized according to its average relative abundance across all samples excluding those in which it was not detected

In total, 5620 SVs were detected in the eight 1 mg soil samples of the 1st experiment (Table S2: https://doi.org/10.5281/zenodo.4282475). Of these, 4764 (85%) were also present in at least half of the 250 mg samples. The remaining 856 SVs had low relative abundance in the 1 mg samples with only 59 reaching more than 0.1% relative abundance in any of them. The 5 mg samples together contained 8010 SVs, of which 6 443 (80%) were also detectable in at least half of the 250 mg samples. Of the remaining 1567 SVs, only 26 reached more than 0.1% relative abundance in any of the 5 mg samples.

The qPCR results did not confirm our hypothesis that increasing spatial resolution would reveal heterogeneity in microbial abundance. Bacterial and archaeal 16S rRNA gene and fungal ITS copy numbers did not show a larger variation among the 1 and 5 mg samples than between the 250 mg samples from the 1st experiment (Figure 1). Similarly, in the 3rd experiment, bacterial abundance did not vary more in the single aggregates than in the 250 mg samples (Figure A1).

3.4. Impact of stochastic effects and inconsistent performance of the methods

The soil homogenate samples from the 2nd experiment served to test the influence of stochastic effects and sub‐optimal performance of the DNA extraction and PCR when extracting small amounts of soil. The estimated bacterial abundance in a gram of soil based on the qPCR results was in general higher in the samples from the 2nd experiment but followed the same pattern as in the samples from the 1st experiment with no differences between the 1, 5, and 250 mg samples but significantly higher values in the 25 mg samples (Figure A3). The small soil homogenate samples did not show the degree of heterogeneity in the prokaryotic community structure we observed among the small soil samples of the 1st experiment. The abundant SVs (≥0.1% relative abundance in at least one sample) in the 25 mg soil homogenate samples were all detectable in at least five of the eight replicates. Out of the 354 abundant SVs in the 5 mg soil homogenate samples, one was present in only three of the samples but the others were detectable in at least six. The 1 mg soil homogenate samples harbored 446 abundant SVs. None of them was unique to a single sample and 442 were present in five or more of the eight samples. The Aitchison distances of the community structure were much higher among the 5 mg, and especially among the 1 mg soil samples of the 1st experiment compared with the distances between the 250 mg soil samples (Figure 5). In contrast, the distances between 5 or 25 mg soil homogenate samples of the 2nd experiment were similar to the distances among the 250 mg soil samples, indicating no difference in the heterogeneity of prokaryotic community structure between these sample groups. The distances between the 1 mg soil homogenate samples were only slightly increased.

FIGURE 5.

FIGURE 5

Aitchison distances in the bacterial and archaeal community structure (16SrRNA gene amplicons) within sample groups from the 1st and 2nd experiments. Thick lines indicate the median values, the upper and lower hinges the 75th and 25th percentile, whiskers extend to the data extremes. Sample groups from the same experiment not labeled with the same letter were significantly different in Tukey's HSD tests

3.5. Bacterial and archaeal co‐occurrence patterns in 250 mg soil samples and aggregates

Networks of prokaryotic co‐occurrence were constructed using the 272 SVs that reached ≥0.2% relative abundance in at least one of the samples of the 3rd experiment. No network was obtained from the 250 mg samples unless the removal of unstable edges and the correction of the p‐values for multiple testing were skipped. The resulting network has 78 edges between 35 nodes (Figure 6a). Thus, this spatial resolution revealed only a small number of putative associations many of which are false discoveries. In contrast, a network of 137 edges and 67 nodes (with the removal of unstable edges and control of the false discovery rate) was obtained from the individual soil aggregates (Figure 6b). A total of 54 of the nodes are part of a connected component in which there are three nodes with high betweenness centrality: SV15 (Verrucomicrobia, Candidatus Udaeobacter), SV31 (Actinobacteria), and SV36 (Acidobacteria subgroup 6). Their relative abundance in the aggregates was 0.46 ± 0.16%, 0.33 ± 0.13%, and 0.30 ± 0.11%, respectively. These SVs potentially serve a keystone function by connecting two clusters in the network. One of the clusters contains several SVs of Thaumarchaeota, Verrucomicrobia, and Actinobacteria. The other is dominated by Acidobacteria subgroup 6. The hub of the latter cluster is SV58 (Acidobacteria subgroup 6) with a relative abundance of 0.23 ± 0.16% that has the highest degree in the network being connected to 19 nodes. SV399 (Chloroflexi) (0.06 ± 0.05%) and SV123 (Acidobacteria subgroup 6) (0.13 ± 0.06%) are linked with negative associations to several members of this cluster.

FIGURE 6.

FIGURE 6

Co‐occurrence networks from the (a) 250 mg samples and (b) the single aggregates from the 3rd experiment. The frames mark the two clusters discussed in the text. It should be noted that in (a) unstable edges were not removed and the Benjamini‐Hochberg correction for multiple comparisons was not applied

4. DISCUSSION

The large disparity between the scale in which the soil microbiota is usually studied with molecular methods (0.25–1 g of soil) and the distance over which microbial interactions occur, impede the detection of interacting partners (Nunan, 2017). To gain information on the soil microbial diversity at an increased spatial resolution that considers soil structure, in this study, we reduced the amount of soil used for DNA extraction from 250 to 1 mg and also extracted individual soil aggregates. Bacterial and archaeal DNA were recovered with not significantly different efficiencies from the 250 mg and the 5 and 1 mg samples as shown by the qPCR results. This was not true for fungal DNA. Either the DNA extraction kit was not efficient in isolating fungal DNA from samples below 25 mg, or fungi may preferentially colonize larger soil aggregates. Our results show that the DNA extraction kit was the most efficient in recovering bacterial and archaeal DNA from 25 to 125 mg soil, although the variation in the yield of 16S rRNA gene copies was large among these samples. Since this increased variation was apparent among the 25 mg soil homogenate samples of the 2nd experiment as well, it is not an indication of an uneven distribution of bacterial cells at the scale of 25 mg samples but must be due to this particular DNA extraction method not working with consistent efficiency with this amount of soil.

As a consequence of sampling small amounts of soil, the DNA extracts had low template concentrations for the subsequent PCR analyses. Thereby, we had to anticipate a high risk of contamination affecting the results (Weiss et al., 2014). Quantifiable amounts of Bacteria and Fungi, but not Archaea, were detected in the control samples without soil. However, they reached no more than 4% of the number of bacterial rRNA gene copies in the smallest soil samples, and thus, the influence of contamination on our results is negligible. The bacterial community found in the control samples was distinct from the soil communities suggesting that the contamination originated from the reagents of the DNA extraction and sequencing library preparation rather than cross‐contamination between samples (Glassing et al., 2016; Salter et al., 2014). Another concern of working with small samples is that molecular methods applied to such small amounts of a template may perform inconsistently leading to artificial variation in the results. The 2nd experiment showed that DNA extraction, PCR, and sequencing did not artificially generate more variation in the results from 5 mg samples than the variation present among the 250 mg samples and the 1 mg samples showed only a slightly higher variation. Therefore, the large heterogeneity in prokaryotic community composition and structure among the 1 and 5 mg soil samples from the 1st experiment was not caused by stochastic effects or PCR bias.

The samples of 25 up to 250 mg of soil were close to identical in prokaryotic community composition, thus they provide a good representation of the overall prokaryotic diversity of our soil. This is also indicated by the fact that increasing the amount of soil extracted up to 25 mg increased the number of SVs detected in the samples, but larger soil samples did not yield more SVs. Thus, the 25 mg samples had good coverage of the total prokaryotic community. In contrast, the 1 and 5 mg samples and single aggregates were heterogeneous in community structure. We found that while small soil samples could recover some SVs not necessarily detected with the conventionally used 250 mg samples; these SVs were typically low in abundance. Very few exceeded the relative abundance threshold we applied to control the sparsity of the data in our analysis of community structure. Therefore, the large heterogeneity of the prokaryotic community structure we observed among the small samples was not because they would have enabled the detection of more SVs. Instead, it appears that they contained different subsets of the total community present in the 25–250 mg samples. This could be explained by the fact that the smaller samples contain fewer microhabitats, each of which harbors a local community of fewer species (Leibold et al., 2004). Interestingly, the 1 and 5 mg samples did not significantly differ in the abundance of Bacteria, Archaea, and Fungi compared with the 250 mg samples. Similarly, the variation in bacterial abundance found with the individual aggregates was not different from the 250 mg samples. Microbial abundance in soil has a patchy distribution at the scale of a few micrometers (Nunan et al., 2003) but, for the soil of this study, not at the scale of macroaggregates or 1–5 mg samples.

Network analyses based on microbial co‐occurrence have been applied to soil samples as large as 10 grams (Khan et al., 2019) and are typically used with 250 mg–1 g samples (Barberan et al., 2012; Karimi et al., 2020). In this study, however, we could not detect stable and significant associations between SVs from 35 samples of 250 mg soil. These samples were taken from the same well‐mixed batch of soil and were similar in their prokaryotic community composition. This is fundamentally different from the above‐cited studies that compared soil samples taken from different ecosystems or across an entire country, thus soil samples that can greatly differ in microbial community composition. In contrast, our 250 mg samples, coming from the same soil, were similar. It is likely that each of them gave a good representation of the overall prokaryotic diversity in our soil, in which case, the variation of the relative abundance of SVs in these samples was mostly random. It is not surprising if small, random differences do not yield stable and significant associations in network analysis. The value of using much smaller samples is shown by our result that the 1 and 5 mg samples contained subsets of the total soil microbial diversity captured by the 250 mg samples, and with increasing spatial resolution the heterogeneity in the bacterial and archaeal community structure increased among the samples. This results in detectable co‐occurrence patterns. Furthermore, the smaller spatial scale increases the likelihood that the observed co‐occurrences indicate interactions (Cordero & Datta, 2016).

From 37 soil aggregates, we obtained a complex network of bacterial and archaeal co‐occurrence that contained two clusters, one with several Thaumarchaeota, Verrucomicrobia, and Actinobacteria SVs, the other mainly with Acidobacteria subgroup 6 SVs. Three SVs, which could represent keystone taxa, were found to connect these clusters. If these putative keystone SVs are abundant in an aggregate, we can expect that members of both clusters are present there. The two major clusters present in the network may provide complementary functions in the soil ecosystem. There are indications that Acidobacteria subgroup 6, dominating one of the clusters, prefer agricultural soils with low nitrogen input where it could be involved in the slower turnover of soil organic carbon (SOC) originating from microbial necromass or plant material (Hester et al., 2018; Li et al., 2018; Navarrete et al., 2013). The soil of this study originated from a long‐term nitrogen‐depleted agricultural soil, thus supporting the preference for low nitrogen concentrations and SOC turnover. The other cluster included several abundant SVs from phylum Thaumarchaeota, which is known to be a strong contributor to ammonium oxidation in agricultural soils (Leininger et al., 2006). Compared to ammonium‐oxidizing bacteria, Thaumarcheaota are thought to be adapted to lower nitrogen concentrations (Pester et al., 2011); thus, the nitrogen‐depleted soil of this study is likely a favorable environment for them. The two clusters in our aggregate co‐occurrence network could represent two distinct types of metabolism adapted to a nitrogen‐depleted soil: a chemoorganotroph that oxidizes SOC, and a chemolithotroph that oxidizes ammonia produced for example by ammonification from crop residues. The presence of the less abundant Verrucomicrobia and Actinobacteria SVs within the Thaumarchaota dominated cluster is possibly linked to an oligotrophic lifestyle (Bergmann et al., 2011; Fierer et al., 2007), but considering the limited information that 16S rRNA gene analyses can provide for these phyla, this remains yet only a hypothesis. Shotgun sequencing and metagenomic analysis of DNA extracted from individual soil aggregates could shed more light on the nature of the associations we detected in the co‐occurrence network. In general, such aggregate‐level analyses of the soil microbiota, which we call “aggregatomics,” could inspire new ways of linking structure to function in soil microbial communities.

While the spatial scale that we reached in this study is not yet fine enough to reveal most microbial interactions as they may occur in microaggregates (Raynaud & Nunan, 2014), it should be able to support the development of hypotheses and experiments to understand the patterns and processes shaping the assembly of soil microbial communities and modeling their behavior (Faust & Raes, 2012; Tecon & Or, 2017). Developing DNA extraction protocols from even smaller soil samples, approaching the microaggregate level, should be a way forward to fuel soil aggregate‐oriented research (“Aggregatomics”; https://www.thuenen.de/en/bd/fields‐of‐activity/feld‐und‐laborstudien/microbiology‐and‐molecular‐ecology/soil‐aggregatomics) for unveiling hidden patterns of functions and ecological interactions.

ETHICS STATEMENT

None required.

CONFLICT OF INTEREST

None declared.

AUTHOR CONTRIBUTION

Márton Szoboszlay: Conceptualization (equal); Data curation (lead); Formal analysis (equal); Funding acquisition (supporting); Investigation (equal); Methodology (equal); Writing‐original draft (equal); Writing‐review & editing (equal). Christoph C Tebbe: Conceptualization (equal); Data curation (supporting); Formal analysis (equal); Funding acquisition (lead); Investigation (equal); Methodology (equal); Writing‐original draft (equal); Writing‐review & editing (equal).

ACKNOWLEDGMENTS

We thank Ines Merbach, Department Biozönoseforschung, Helmholtz‐Zentrum für Umweltforschung—UFZ, Bad Lauchstädt, Germany, for providing the soil of this study. We thank Naomie Oßwald and Jerome Rischke who supported us at the early stages of this work as bachelor students, and Karin Trescher for excellent technical assistance. This project was carried out in the framework of the Priority Programme “Rhizosphere spatiotemporal organization – a key to rhizosphere functions,” (SPP2089), financially supported by the German Research Foundation (DFG; Project number 403668538). Open access funding enabled and organized by ProjektDEAL.

FIGURE A1.

FIGURE A1

Estimates of bacterial abundance in a gram of soil from the samples of the 3rd experiment based on qPCR. Thick lines indicate the median values, the upper and lower hinges the 75th and 25th percentile, whiskers extend to the data extremes

FIGURE A2.

FIGURE A2

Principle component analysis (PCA plot) from the 16S rRNA gene sequencing data from the 1st experiment including the control samples and without the removal of potentially contaminant SVs

FIGURE A3.

FIGURE A3

Estimates of bacterial abundance in a gram of soil based on qPCR from the soil and soil homogenate samples from the 1st and 2nd experiments. Thick lines indicate the median values, the upper and lower hinges the 75th and 25th percentile, whiskers extend to the data extremes. Sample groups from the same experiment not labeled with the same letter were significantly different in Tukey's HSD tests

TABLE A1.

Good's coverage index of the 16S rRNA gene sequence variants

Soil weight class or sample type Good's coverage (average ±SD)
1st Experiment
250 mg 0.962 ± 0.014
125 mg 0.965 ± 0.008
25 mg 0.970 ± 0.009
5 mg 0.980 ± 0.010
1 mg 0.992 ± 0.010
Control, no soil 0.997 ± 0.001
2nd Experiment
250 mg 0.939 ± 0.033
25 mg soil homogenate 0.939 ± 0.031
5 mg soil homogenate 0.945 ± 0.029
1 mg soil homogenate 0.923 ± 0.028
Control, no soil 0.994 ± 0.005
3rd Experiment
250 mg 0.950 ± 0.017
Soil aggregate 0.952 ± 0.032
Control 0.998 ± 0.001

DATA AVAILABILITY STATEMENT

All data of this study are provided in the results section of this paper apart from (i) the data in Table S1 (Sample weights and qPCR data) and Table S2 (Sequence variants matrices), which are both available in the Zenodo repository at https://doi.org/10.5281/zenodo.4282475, and (ii) DNA sequences, which are available at the European Nucleotide Archive under the accession numbers PRJEB36881 (1st experiment): https://www.ebi.ac.uk/ena/browser/view/PRJEB36881, PRJEB36883 (2nd experiment): https://www.ebi.ac.uk/ena/browser/view/PRJEB36883, and PRJEB36887 (3rd experiment): https://www.ebi.ac.uk/ena/browser/view/PRJEB36887.

REFERENCES

  1. Assenov, Y. , Ramirez, F. , Schelhorn, S. E. , Lengauer, T. , & Albrecht, M. (2008). Computing topological parameters of biological networks. Bioinformatics, 24(2), 282–284. 10.1093/bioinformatics/btm554 [DOI] [PubMed] [Google Scholar]
  2. Bach, E. M. , Williams, R. J. , Hargreaves, S. K. , Yang, F. , & Hofmockel, K. S. (2018). Greatest soil microbial diversity found in micro‐habitats. Soil Biology & Biochemistry, 118, 217–226. 10.1016/j.soilbio.2017.12.018 [DOI] [Google Scholar]
  3. Bailey, V. L. , Fansler, S. J. , Stegen, J. C. , & McCue, L. A. (2013). Linking microbial community structure to beta‐glucosidic function in soil aggregates. ISME Journal, 7(10), 2044–2053. 10.1038/ismej.2013.87 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bailey, V. L. , McCue, L. A. , Fansler, S. J. , Boyanov, M. I. , DeCarlo, F. , Kemner, K. M. , & Konopka, A. (2013). Micrometer‐scale physical structure and microbial composition of soil macroaggregates. Soil Biology & Biochemistry, 65, 60–68. 10.1016/j.soilbio.2013.02.005 [DOI] [Google Scholar]
  5. Banerjee, S. , Kirkby, C. A. , Schmutter, D. , Bissett, A. , Kirkegaard, J. A. , & Richardson, A. E. (2016). Network analysis reveals functional redundancy and keystone taxa amongst bacterial and fungal communities during organic matter decomposition in an arable soil. Soil Biology & Biochemistry, 97, 188–198. 10.1016/j.soilbio.2016.03.017 [DOI] [Google Scholar]
  6. Barberan, A. , Bates, S. T. , Casamayor, E. O. , & Fierer, N. (2012). Using network analysis to explore co‐occurrence patterns in soil microbial communities. ISME Journal, 6(2), 343–351. 10.1038/ismej.2011.119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bergmann, G. T. , Bates, S. T. , Eilers, K. G. , Lauber, C. L. , Caporaso, J. G. , Walters, W. A. , Knight, R. , & Fierer, N. (2011). The under‐recognized dominance of Verrucomicrobia in soil bacterial communities. Soil Biology & Biochemistry, 43(7), 1450–1455. 10.1016/j.soilbio.2011.03.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Blair, N. , Faulkner, R. D. , Till, A. R. , Körschens, M. , & Schulz, E. (2006). Long‐term management impacts on soil C, N and physical fertility ‐ Part II: Bad Lauchstädt static and extreme FYM experiments. Soil & Tillage Research, 91(1–2), 39–47. 10.1016/j.still.2005.11.001 [DOI] [Google Scholar]
  9. Callahan, B. J. , McMurdie, P. J. , Rosen, M. J. , Han, A. W. , Johnson, A. J. A. , & Holmes, S. P. (2016). DADA2: High‐resolution sample inference from Illumina amplicon data. Nature Methods, 13(7), 581–583. 10.1038/nmeth.3869 [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Constancias, F. , Prévost‐Bouré, N. C. , Terrat, S. , Aussems, S. , Nowak, V. , Guillemin, J.‐P. , Bonnotte, A. , Biju‐Duval, L. , Navel, A. , Martins, J. M. F. , Maron, P.‐A. , & Ranjard, L. (2014). Microscale evidence for a high decrease of soil bacterial density and diversity by cropping. Agronomy for Sustainable Development, 34(4), 831–840. 10.1007/s13593-013-0204-3 [DOI] [Google Scholar]
  11. Cordero, O. X. , & Datta, M. S. (2016). Microbial interactions and community assembly at microscales. Current Opinion in Microbiology, 31, 227–234. 10.1016/j.mib.2016.03.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Davinic, M. , Fultz, L. M. , Acosta‐Martinez, V. , Calderón, F. J. , Cox, S. B. , Dowd, S. E. , Allen, V. G. , Zak, J. C. , & Moore‐Kucera, J. (2012). Pyrosequencing and mid‐infrared spectroscopy reveal distinct aggregate stratification of soil bacterial communities and organic matter composition. Soil Biology & Biochemistry, 46, 63–72. 10.1016/j.soilbio.2011.11.012 [DOI] [Google Scholar]
  13. Direito, S. O. L. , Zaura, E. , Little, M. , Ehrenfreund, P. , & Roling, W. F. M. (2014). Systematic evaluation of bias in microbial community profiles induced by whole genome amplification. Environmental Microbiology, 16(3), 643–657. 10.1111/1462-2920.12365 [DOI] [PubMed] [Google Scholar]
  14. Faust, K. , & Raes, J. (2012). Microbial interactions: from networks to models. Nature Reviews Microbiology, 10(8), 538–550. 10.1038/nrmicro2832 [DOI] [PubMed] [Google Scholar]
  15. Faust, K. , & Raes, J. (2016). CoNet app: Inference of biological association networks using Cytoscape. F1000Research, 5, 1519 10.12688/f1000research.9050.2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Faust, K. , Sathirapongsasuti, J. F. , Izard, J. , Segata, N. , Gevers, D. , Raes, J. , & Huttenhower, C. (2012). Microbial co‐occurrence relationships in the human microbiome. PLoS Computational Biology, 8(7), 17 10.1371/journal.pcbi.1002606 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fierer, N. , Bradford, M. A. , & Jackson, R. B. (2007). Toward an ecological classification of soil bacteria. Ecology, 88(6), 1354–1364. 10.1890/05-1839 [DOI] [PubMed] [Google Scholar]
  18. Fierer, N. , & Jackson, R. B. (2006). The diversity and biogeography of soil bacterial communities. Proceedings of the National Academy of Sciences of the United States of America, 103(3), 626–631. 10.1073/pnas.0507535103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Fox, A. , Ikoyi, I. , Torres‐Sallan, G. , Lanigan, G. , Schmalenberger, A. , Wakelin, S. , & Creamer, R. (2018). The influence of aggregate size fraction and horizon position on microbial community composition. Applied Soil Ecology, 127, 19–29. 10.1016/j.apsoil.2018.02.023 [DOI] [Google Scholar]
  20. Glassing, A. , Dowd, S. E. , Galandiuk, S. , Davis, B. , & Chiodini, R. J. (2016). Inherent bacterial DNA contamination of extraction and sequencing reagents may affect interpretation of microbiota in low bacterial biomass samples. Gut Pathogens, 8, 10.1186/s13099-016-0103-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gloor, G. B. , Macklaim, J. M. , Pawlowsky‐Glahn, V. , & Egozcue, J. J. (2017). Microbiome datasets are compositional: And this is not optional. Frontiers in Microbiology, 8, 6 10.3389/fmicb.2017.02224 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gloor, G. B. , Wu, J. R. , Pawlowsky‐Glahn, V. , & Egozcue, J. J. (2016). It's all relative: Analyzing microbiome data as compositions. Annals of Epidemiology, 26(5), 322–329. 10.1016/j.annepidem.2016.03.003 [DOI] [PubMed] [Google Scholar]
  23. Green, J. L. , Bohannan, B. J. M. , & Whitaker, R. J. (2008). Microbial biogeography: From taxonomy to traits. Science, 320(5879), 1039–1043. 10.1126/science.1153475 [DOI] [PubMed] [Google Scholar]
  24. Griffiths, R. I. , Thomson, B. C. , James, P. , Bell, T. , Bailey, M. , & Whiteley, A. S. (2011). The bacterial biogeography of British soils. Environmental Microbiology, 13(6), 1642–1654. 10.1111/j.1462-2920.2011.02480.x [DOI] [PubMed] [Google Scholar]
  25. Hemkemeyer, M. , Christensen, B. T. , Martens, R. , & Tebbe, C. C. (2015). Soil particle size fractions harbour distinct microbial communities and differ in potential for microbial mineralisation of organic pollutants. Soil Biology & Biochemistry, 90, 255–265. 10.1016/j.soilbio.2015.08.018 [DOI] [Google Scholar]
  26. Hemkemeyer, M. , Christensen, B. T. , Tebbe, C. C. , & Hartmann, M. (2019). Taxon‐specific fungal preference for distinct soil particle size fractions. European Journal of Soil Biology, 94, 10.1016/j.ejsobi.2019.103103 [DOI] [Google Scholar]
  27. Hemkemeyer, M. , Dohrmann, A. B. , Christensen, B. T. , & Tebbe, C. C. (2018). Bacterial preferences for specific soil particle size fractions revealed by community analyses. Frontiers in Microbiology, 9, 10.3389/fmicb.2018.00149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hester, E. R. , Harpenslager, S. F. , van Diggelen, J. M. H. , Lamers, L. L. , Jetten, M. S. M. , Lüke, C. , Lücker, S. , & Welte, C. U. (2018). Linking nitrogen load to the structure and function of wetland soil and rhizosphere microbial communities. Msystems, 3(1), 10.1128/mSystems.00214-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ivanova, E. A. , Kutovaya, O. V. , Tkhakakhova, A. K. , Chernov, T. I. , Pershina, E. V. , Markina, L. G. , Andronov, E. E. , Kogut, B. M. (2015). The structure of microbial community in aggregates of a typical chernozem aggregates under contrasting variants of its agricultural use. Eurasian Soil Science, 48(11), 1242–1256. 10.1134/s1064229315110083 [DOI] [Google Scholar]
  30. Karimi, B. , Terrat, S. , Dequiedt, S. , Saby, N. P. A. , Horrigue, W. , Lelièvre, M. , Nowak, V. , Jolivet, C. , Arrouays, D. , Wincker, P. , Cruaud, C. , Bispo, A. , Maron, P.‐A. , Bouré, N. C. P. , & Ranjard, L. (2018). Biogeography of soil bacteria and archaea across France. Science Advances, 4(7), 10.1126/sciadv.aat1808 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Karimi, B. , Villerd, J. , Dequiedt, S. , Terrat, S. , Chemidlin‐Prévost Bouré, N. , Djemiel, C. , Lelièvre, M. , Tripied, J. , Nowak, V. , Saby, N. P. A. , Bispo, A. , Jolivet, C. , Arrouays, D. , Wincker, P. , Cruaud, C. , & Ranjard, L. (2020). Biogeography of soil microbial habitats across France. Global Ecology and Biogeography, 29(8), 1399–1411. 10.1111/geb.13118 [DOI] [Google Scholar]
  32. Khan, M. A. W. , Bohannan, B. J. M. , Nüsslein, K. , Tiedje, J. M. , Tringe, S. G. , Parlade, E. , Barberán, A. , & Rodrigues, J. L. M. (2019). Deforestation impacts network co‐occurrence patterns of microbial communities in Amazon soils. FEMS Microbiology Ecology, 95(2), 10.1093/femsec/fiy230 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Kim, H. , Nunan, N. , Dechesne, A. , Juarez, S. , & Grundmann, G. (2015). The spatial distribution of exoenzyme activities across the soil micro‐landscape, as measured in micro‐ and macro‐aggregates, and ecosystem processes. Soil Biology & Biochemistry, 91, 258–267. 10.1016/j.soilbio.2015.08.042 [DOI] [Google Scholar]
  34. Kozich, J. J. , Westcott, S. L. , Baxter, N. T. , Highlander, S. K. , & Schloss, P. D. (2013). Development of a dual‐index sequencing strategy and curation pipeline for analyzing amplicon sequence data on the MiSeq Illumina sequencing platform. Applied and Environmental Microbiology, 79(17), 5112–5120. 10.1128/aem.01043-13 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kravchenko, A. N. , Negassa, W. C. , Guber, A. K. , Hildebrandt, B. , Marsh, T. L. , & Rivers, M. L. (2014). Intra‐aggregate pore structure influences phylogenetic composition of bacterial community in macroaggregates. Soil Science Society of America Journal, 78(6), 1924–1939. 10.2136/sssaj2014.07.0308 [DOI] [Google Scholar]
  36. Leibold, M. A. , Holyoak, M. , Mouquet, N. , Amarasekare, P. , Chase, J. M. , Hoopes, M. F. , Holt, R. D. , Shurin, J. B. , Law, R. , Tilman, D. , Loreau, M. , & Gonzalez, A. (2004). The metacommunity concept: A framework for multi‐scale community ecology. Ecology Letters, 7(7), 601–613. 10.1111/j.1461-0248.2004.00608.x [DOI] [Google Scholar]
  37. Leininger, S. , Urich, T. , Schloter, M. , Schwark, L. , Qi, J. , Nicol, G. W. , Prosser, J. I. , Schuster, S. C. , & Schleper, C. (2006). Archaea predominate among ammonia‐oxidizing prokaryotes in soils. Nature, 442(7104), 806–809. 10.1038/nature04983 [DOI] [PubMed] [Google Scholar]
  38. Li, D. , Chen, L. , Xu, J. , Ma, L. , Olk, D. C. , Zhao, B. , Zhang, J. , & Xin, X. (2018). Chemical nature of soil organic carbon under different long‐term fertilization regimes is coupled with changes in the bacterial community composition in a Calcaric Fluvisol. Biology and Fertility of Soils, 54(8), 999–1012. 10.1007/s00374-018-1319-0 [DOI] [Google Scholar]
  39. Li, F. , Chen, L. , Zhang, J. B. , Yin, J. , & Huang, S. M. (2017). Bacterial community structure after long‐term organic and inorganic fertilization reveals important associations between soil nutrients and specific taxa involved in nutrient transformations. Frontiers in Microbiology, 8, 10.3389/fmicb.2017.00187 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Lombard, N. , Prestat, E. , van Elsas, J. D. , & Simonet, P. (2011). Soil‐specific limitations for access and analysis of soil microbial communities by metagenomics. FEMS Microbiology Ecology, 78(1), 31–49. 10.1111/j.1574-6941.2011.01140.x [DOI] [PubMed] [Google Scholar]
  41. Ludwig, B. , Geisseler, D. , Michel, K. , Joergensen, R. G. , Schulz, E. , Merbach, I. , Raupp, J. , Rauber, R. , Hu, K. , Niu, L. , & Liu, X. (2011). Effects of fertilization and soil management on crop yields and carbon stabilization in soils. A review. Agronomy for Sustainable Development, 31(2), 361–372. 10.1051/agro/2010030 [DOI] [Google Scholar]
  42. Merbach, I. , & Schulz, E. (2013). Long‐term fertilization effects on crop yields, soil fertility and sustainability in the Static Fertilization Experiment Bad Lauchstadt under climatic conditions 2001–2010. Archives of Agronomy and Soil Science, 59(8), 1041–1058. 10.1080/03650340.2012.702895 [DOI] [Google Scholar]
  43. Miller, D. N. , Bryant, J. E. , Madsen, E. L. , & Ghiorse, W. C. (1999). Evaluation and optimization of DNA extraction and purification procedures for soil and sediment samples. Applied and Environmental Microbiology, 65(11), 4715–4724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Navarrete, A. A. , Kuramae, E. E. , de Hollander, M. , Pijl, A. S. , van Veen, J. A. , & Tsai, S. M. (2013). Acidobacterial community responses to agricultural management of soybean in Amazon forest soils. Fems Microbiology Ecology, 83(3), 607–621. 10.1111/1574-6941.12018 [DOI] [PubMed] [Google Scholar]
  45. Nunan, N. (2017). The microbial habitat in soil: Scale, heterogeneity and functional consequences. Journal of Plant Nutrition and Soil Science, 180(4), 425–429. 10.1002/jpln.201700184 [DOI] [Google Scholar]
  46. Nunan, N. , Wu, K. J. , Young, I. M. , Crawford, J. W. , & Ritz, K. (2003). Spatial distribution of bacterial communities and their relationships with the micro‐architecture of soil. FEMS Microbiology Ecology, 44(2), 203–215. 10.1016/s0168-6496(03)00027-8 [DOI] [PubMed] [Google Scholar]
  47. Oksanen, J. , Blanchet, F. G. , Friendly, M. , Kindt, R. , Legendre, P. , McGlinn, D. , … Wagner, H. (2018). Vegan: Community ecology package. R package version 2.5‐2.
  48. Palarea‐Albaladejo, J. , & Martin‐Fernandez, J. A. (2015). zCompositions ‐ R Package for multivariate imputation of left‐censored data under a compositional approach. Chemometrics and Intelligent Laboratory Systems, 143, 85–96. 10.1016/j.chemolab.2015.02.019 [DOI] [Google Scholar]
  49. Pester, M. , Schleper, C. , & Wagner, M. (2011). The Thaumarchaeota: An emerging view of their phylogeny and ecophysiology. Current Opinion in Microbiology, 14(3), 300–306. 10.1016/j.mib.2011.04.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Pruesse, E. , Quast, C. , Knittel, K. , Fuchs, B. M. , Ludwig, W. G. , Peplies, J. , & Glockner, F. O. (2007). SILVA: A comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Research, 35(21), 7188–7196. 10.1093/nar/gkm864 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ranjard, L. , Nazaret, S. , Gourbiere, F. , Thioulouse, J. , Linet, P. , & Richaume, A. (2000). A soil microscale study to reveal the heterogeneity of Hg(II) impact on indigenous bacteria by quantification of adapted phenotypes and analysis of community DNA fingerprints. FEMS Microbiology Ecology, 31(2), 107–115. 10.1111/j.1574-6941.2000.tb00676.x [DOI] [PubMed] [Google Scholar]
  52. Ranjard, L. , Poly, F. , Combrisson, J. , Richaume, A. , Gourbiere, F. , Thioulouse, J. , & Nazaret, S. (2000). Heterogeneous cell density and genetic structure of bacterial pools associated with various soil microenvironments as determined by enumeration and DNA fingerprinting approach (RISA). Microbial Ecology, 39(4), 263–272. [PubMed] [Google Scholar]
  53. Raynaud, X. , & Nunan, N. (2014). Spatial ecology of bacteria at the microscale in soil. PLoS One, 9(1), 10.1371/journal.pone.0087217 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Rillig, M. C. , Müller, L. A. H. , & Lehmann, A. (2017). Soil aggregates as massively concurrent evolutionary incubators. ISEME Journal, 11(9), 1943–1948. 10.1038/ismej.2017.56 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Roesch, L. F. W. , Fulthorpe, R. R. , Riva, A. , Casella, G. , Hadwin, A. K. M. , Kent, A. D. , Daroub, S. H. , Camargo, F. A. O. , Farmerie, W. G. , & Triplett, E. W. (2007). Pyrosequencing enumerates and contrasts soil microbial diversity. ISME Journal, 1(4), 283–290. 10.1038/ismej.2007.53 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Salter, S. J. , Cox, M. J. , Turek, E. M. , Calus, S. T. , Cookson, W. O. , Moffatt, M. F. , Turner, P. , Parkhill, J. , Loman, N. J. , & Walker, A. W. (2014). Reagent and laboratory contamination can critically impact sequence‐based microbiome analyses. BMC Biology, 12, 10.1186/s12915-014-0087-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Six, J. , Elliott, E. T. , & Paustian, K. (2000). Soil macroaggregate turnover and microaggregate formation: A mechanism for C sequestration under no‐tillage agriculture. Soil Biology & Biochemistry, 32(14), 2099–2103. 10.1016/S0038-0717(00)00179-6 [DOI] [Google Scholar]
  58. Smith, P. , Cotrufo, M. F. , Rumpel, C. , Paustian, K. , Kuikman, P. J. , Elliott, J. A. , McDowell, R. , Griffiths, R. I. , Asakawa, S. , Bustamante, M. , House, J. I. , Sobocká, J. , Harper, R. , Pan, G. , West, P. C. , Gerber, J. S. , Clark, J. M. , Adhya, T. , Scholes, R. J. , & Scholes, M. C. (2015). Biogeochemical cycles and biodiversity as key drivers of ecosystem services provided by soils. Soil, 1(2), 665–685. 10.5194/soil-1-665-2015 [DOI] [Google Scholar]
  59. Szoboszlay, M. , Dohrmann, A. B. , Poeplau, C. , Don, A. , & Tebbe, C. C. (2017). Impact of land‐use change and soil organic carbon quality on microbial diversity in soils across Europe. FEMS Microbiology Ecology, 93(12), 10.1093/femsec/fix146 [DOI] [PubMed] [Google Scholar]
  60. Tecon, R. , & Or, D. (2017). Biophysical processes supporting the diversity of microbial life in soil. FEMS Microbiology Reviews, 41(5), 599–623. 10.1093/femsre/fux039 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Thakur, M. P. , Phillips, H. R. P. , Brose, U. , De Vries, F. T. , Lavelle, P. , Loreau, M. , Mathieu, J. , Mulder, C. , Van der Putten, W. H. , Rillig, M. C. , Wardle, D. A. , Bach, E. M. , Bartz, M. L. C. , Bennett, J. M. , Briones, M. J. I. , Brown, G. , Decaëns, T. , Eisenhauer, N. , Ferlian, O. , … Cameron, E. K. (2020). Towards an integrative understanding of soil biodiversity. Biological Reviews, 95(2), 350–364. 10.1111/brv.12567 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Vos, M. , Wolf, A. B. , Jennings, S. J. , & Kowalchuk, G. A. (2013). Micro‐scale determinants of bacterial diversity in soil. FEMS Microbiology Reviews, 37(6), 936–954. 10.1111/1574-6976.12023 [DOI] [PubMed] [Google Scholar]
  63. Walters, W. , Hyde, E. R. , Berg‐Lyons, D. , Ackermann, G. , Humphrey, G. , Parada, A. , Gilbert, J. A. , Jansson, J. K. , Caporaso, J. G. , Fuhrman, J. A. , Apprill, A. , & Knight, R. (2016). Improved bacterial 16S rRNA gene (V4 and V4–5) and fungal internal transcribed spacer marker gene primers for microbial community surveys. Msystems, 1(1), 10 10.1128/mSystems.00009-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Wang, Y. , Gao, Z. M. , Xu, Y. , Li, G. Y. , He, L. S. , & Qian, P. Y. (2016). An evaluation of multiple annealing and looping based genome amplification using a synthetic bacterial community. Acta Oceanologica Sinica, 35(2), 131–136. 10.1007/s13131-015-0781-x [DOI] [Google Scholar]
  65. Weiss, S. , Amir, A. , Hyde, E. R. , Metcalf, J. L. , Song, S. J. , & Knight, R. (2014). Tracking down the sources of experimental contamination in microbiome studies. Genome Biology, 15(12), 10.1186/s13059-014-0564-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Weiss, S. , Van Treuren, W. , Lozupone, C. , Faust, K. , Friedman, J. , Deng, Y. E. , Xia, L. C. , Xu, Z. Z. , Ursell, L. , Alm, E. J. , Birmingham, A. , Cram, J. A. , Fuhrman, J. A. , Raes, J. , Sun, F. , Zhou, J. , & Knight, R. (2016). Correlation detection strategies in microbial data sets vary widely in sensitivity and precision. ISME Journal, 10(7), 1669–1681. 10.1038/ismej.2015.235 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Wilpiszeski, R. L. , Aufrecht, J. A. , Retterer, S. T. , Sullivan, M. B. , Graham, D. E. , Pierce, E. M. , Zablocki, O. D. , Palumbo, A. V. , & Elias, D. A. (2019). Soil aggregate microbial communities: Towards understanding microbiome interactions at biologically relevant scales. Applied and Environmental Microbiology, 85(14), 10.1128/aem.00324-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Young, J. M. , Rawlence, N. J. , Weyrich, L. S. , & Cooper, A. (2014). Limitations and recommendations for successful DNA extraction from forensic soil samples: A review. Science & Justice, 54(3), 238–244. 10.1016/j.scijus.2014.02.006 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data of this study are provided in the results section of this paper apart from (i) the data in Table S1 (Sample weights and qPCR data) and Table S2 (Sequence variants matrices), which are both available in the Zenodo repository at https://doi.org/10.5281/zenodo.4282475, and (ii) DNA sequences, which are available at the European Nucleotide Archive under the accession numbers PRJEB36881 (1st experiment): https://www.ebi.ac.uk/ena/browser/view/PRJEB36881, PRJEB36883 (2nd experiment): https://www.ebi.ac.uk/ena/browser/view/PRJEB36883, and PRJEB36887 (3rd experiment): https://www.ebi.ac.uk/ena/browser/view/PRJEB36887.


Articles from MicrobiologyOpen are provided here courtesy of Wiley

RESOURCES