Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2021 Nov 18;16(11):e0260121. doi: 10.1371/journal.pone.0260121

The effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks

Hiroki Morita 1, Satoshi Akao 1,*
Editor: Ashwani Kumar2
PMCID: PMC8601499  PMID: 34793564

Abstract

To determine the optimal soil sample size for microbial community structure analysis, DNA extraction, microbial composition analysis, and diversity assessments were performed using soil sample sizes of 0.2, 1, and 5 g. This study focused on the relationship between soil amount and DNA extraction container volume and the alteration in microbial composition at different taxonomic ranks (order, class, and phylum). Horizontal (0.2 and 1 g) and vertical (5 g) shaking were applied during DNA extraction for practical use in a small laboratory. In the case of the 5 g soil sample, DNA extraction efficiency and the value of α-diversity index fluctuated severely, possibly because of vertical shaking. Regarding the 0.2 and 1 g soil samples, the number of taxa, Shannon–Wiener index, and Bray–Curtis dissimilarity were stable and had approximately the same values at each taxonomic rank. However, non-metric multidimensional scaling showed that the microbial compositions of these two sample sizes were different. The higher relative abundance of taxa in the case of the 0.2 g soil sample might indicate that cell wall compositions differentiated the microbial community structures in these two sample sizes due to high shear stress tolerance. The soil sample size and tube volume affected the estimated microbial community structure. A soil sample size of 0.2 g would be preferable to the other sample sizes because of the possible higher shearing force for DNA extraction and lower experimental costs due to smaller amounts of consumables. When the taxonomic rank was changed from order to phylum, some minor taxa identified at the order rank were integrated into major taxa at the phylum rank. The integration affected the value of the β-diversity index; therefore, the microbial community structure analysis, reproducibility of structures, diversity assessment, and detection of minor taxa would be influenced by the taxonomic rank applied.

Introduction

The soil environment possesses abundant microorganisms with 1010–1011 bacteria and 6,000–50,000 species present in 1 g of soil [1]. Microbial composition and diversity in the soil environment interact with various biotic and abiotic factors [2]. Moreover, soil microorganisms are not uniformly distributed because of the physical and chemical characteristics of the environment. Nunan et al. [3] indicated that the occurrence of different bacterial distributions depends on the nutrients in the soil. The distribution of bacteria in soil is known to be affected by pH and water content [4, 5]. When analyzing the microbiome in the soil, it is necessary to apply reliable sampling methods considering the heterogeneity of bacterial distribution. The first possible solution to this problem is to use a large number of mixed soil cores. This allows us to obtain representative and reliable results for the entire soil sampling area [6, 7], although it is time-consuming and costly.

Soil sample size, i.e., sample volume, is another factor that determines the reproducibility of the soil microbial diversity analysis. Several studies have reported the effect of soil sample size on microbial diversity analysis using the polymerase chain reaction-denaturing gradient gel electrophoresis (PCR-DGGE) method. In the case of a large soil sample size (e.g., 10 g of soil), the dominant taxa accounted for a larger bacterial population ratio than that in the case of a smaller soil sample size, although the large soil sample case had larger variances in DNA extraction efficiency and microbial community structure [8]. In the case of a small soil sample size (e.g., 0.25 g), minor species could be detected, and the reproducibility of the bacterial community was high [8]. However, using small soil sample sizes (e.g., 0.1 g), another study reported high variability in bacterial structure among samples, with the detection of minor species [9]. As a result, these reports using the PCR-DGGE method do not provide a standard for the appropriate soil sample size for microbial diversity analysis.

On the other hand, amplicon sequencing analysis provides a different perspective [10]. In the reported analysis, a soil sample size of 10 g, the largest soil volume for a certain DNA extraction kit used in the study with 1, 5, and 10 g of soil sample sizes, presented the lowest variability among the repeated microbial diversity analyses and the highest microbial community reproducibility. Furthermore, minor taxa could be stably detected even in the case of the largest soil volume because of the large number of sequence reads, with an average of over 56,000 reads. Song et al. [11] also investigated the effect of soil sample size (0.25 and 10 g) on fungal amplicon sequence data.

Although the study by Penton et al. [10] was insightful, in our opinion, it can benefit from further clarifications, such as those concerning target taxonomic rank and annotated taxa in the microbial diversity analysis, the relationship between soil volume and tube volume for DNA extraction, and DNA extraction cost and efficiency. First, the focused taxonomic rank and related annotated taxa might affect microbial diversity analysis. Penton et al. [10] did not describe the targeted taxonomic rank nor the annotated taxa used for microbial diversity analysis. Regarding the taxonomic rank, a taxon belonging to a minor category in a lower taxonomic rank (e.g., order) might be integrated into a major category in an upper taxonomic rank (e.g., phylum). In other words, rare taxa in lower taxonomic ranks might not be able to be classified or recognized as minor taxa in the upper taxonomic ranks. Thus, the recognition of rare taxa would depend on the target taxonomic rank. For the annotated taxa, there would usually be numerous operational taxonomic units (OTUs) not belonging to any taxa in an upper taxonomic rank (e.g., phylum) from amplicon sequence data. If these OTUs are counted as many independent pseud-taxa, the microbial diversity analysis would be biased by these OTUs. Therefore, we think that the target taxonomic rank and the annotated taxa used should be clearly defined. Second, the relationship between soil volume, reagent volume, and tube volume may affect the efficiency and quality of DNA extraction. Penton et al. [10] used the PowerMax Soil DNA Isolation Kit (QIAGEN, Venlo, Netherlands) with a 50 mL tube for DNA extraction from 1 to 10 g of soil. They also used the PowerSoil DNA Isolation Kit (QIAGEN) with a 2 mL tube for DNA extraction from 0.25 g of soil. The performance of amplicon sequence analysis using 0.25 g of soil was better than that using 1 g of soil, the lowest ratio of soil volume to container volume (50 mL), based on the variability and reproducibility of the results. Song et al. [11] also used each kit for DNA extraction from 10 g and 0.25 g of soil, respectively. There was no significant difference in OTU richness between the two soil sample sizes, possibly because of the appropriate addition of soil volume into each kit. Finally, using a 50 mL tube for DNA extraction is expensive and inefficient; the number of tubes used for the extraction at one instance is limited if we use the recommended attachment providing horizontal vibration (Vortex Adapter for two tubes (50 mL), QIAGEN) for the extraction kit. This might be an obstacle to the processing of large amounts of samples. One possible alternative to bypass the one-time sample size (number of tubes) limitation is to use a device with vertical shaking, rather than with horizontal shaking.

In this study, we investigated the effects of changes in soil sample volume on microbial diversity using amplicon sequence analysis. This could be considered in accordance with two factors: a change in tube volume preferentially in parallel to the change in soil sample volume, and the application of a device for vertical shaking intended for DNA extraction. Microbial community structure analysis at different taxonomic ranks using the annotated taxa from the amplicon sequence data was also performed to evaluate the effect of the taxonomic rank applied on the analysis.

Materials and methods

Soil sampling and DNA extraction

Soil samples were collected from a local field for vegetable crops (34°41.32N, 135°49.91E). The collected soil was mixed well and passed through a 2 mm sieve. Five samples (n = 5) of three different soil sample sizes (0.2, 1, and 5 g) were prepared for each sample. The number of repetitions per sample (n = 5) was determined based on previous reports [7, 11]. DNA extraction was performed in a centrifugal tube using the phenol/chloroform/isoamyl alcohol (PCI, 25:24:1) reagent (Nippon Gene, Tokyo, Japan). To prevent DNA degradation and adsorption by humic acid, glass beads with a diameter of 1.5–2.5 mm (BZ-02, AS ONE, Osaka, Japan) and skim milk (FUJIFILM Wako Chemicals, Osaka, Japan) were added to the tube. The tube was shaken at high speed for 5 min using a mixer with its attachment (Table 1). Recovery of the supernatant (water phase) containing the extracted DNA was performed twice in each tube with a 2-fold addition of the PCI reagent. The specifications of DNA extraction in each case, such as tube volumes, reagents, and additives, are summarized in Table 1. Extracted DNA was collected from the supernatant after centrifugation at 10,000 × g for 2 min (CF15RXII, Hitachi Koki, Tokyo, Japan), and the DNA was purified by ethanol precipitation. The concentration and quality of the recovered DNA were measured using the fluorometer (Qubit 4, Thermo Fisher Scientific) and the Qubit dsDNA BR Assay Kit (Thermo Fisher Scientific). All reagents used were of analytical or biochemical grade.

Table 1. DNA extraction conditions.

Soil sample size (g) 0.2 1 5
Tube size (mL) 2 15 50
Glass beads (g) 0.02 0.1 0.5
Skim milk (mg) 2 10 50
PCI reagent (mL) 0.2 1 5
Mixer for Beat Beating Voltex Genie 2 (Scientific Industries, New York, USA; rated power output, 0.1 kW) ASCM-1 (AZ ONE; rated power output, 0.05 kW)
Attachment for Mixer Horizontal Plastic Clip Microtube Holder (Scientific Industries; operational number of tubes, 24) Horizontal 15 mL Tube Holder (Scientific Industries; operational number of tubes, 12) ASCM-R50 (AZ ONE; operational number of tubes, 6)

The shaking strength of the mixer was set to approximately 2,700 rpm (Voltex Genie 2) and 2,000 rpm (ASCM-1).

16S rRNA gene amplification and amplicon sequencing

The 16S rRNA gene was amplified using a primer set (V3V4f_MIX and V3V4r_MIX) [12] and a thermal cycler (MyCycler TM 170-9703JB, BIO-RAD, Hercules, USA). Bovine serum albumin (2.5 μL) (Takara Bio, Kusatsu, Japan) was added to prevent PCR inhibition. Amplicon sequencing was performed by an external analytical laboratory (Bioengineering Lab., Sagamihara, Japan, https://gikenbio.com/) according to the manufacturer’s protocol. In brief, high-throughput sequencing of amplicons was performed using the Illumina MiSeq platform (2 × 300 bp). Raw paired-end reads were subjected to quality filtering using the Fastx toolkit. After primer sequence removal, the sequences were analyzed using QIIME 2.0. The EzBioCloud 16S database (version 20200225) was used to detect OTUs with 97% similarity [13].

Microbial community structure analysis

Raw OTUs, which were estimated to be a certain valid name in the order rank, were categorized into relative abundance at the order rank. Five amplicon sequencing results were obtained for each soil sample size. The variance in microbial diversity among the soil sample sizes was compared based on the relative abundance at the order rank. Following the previous studies, the Shannon–Wiener index (H′) was applied as the alpha diversity index, and Bray–Curtis dissimilarity and Sørensen–Dice indices were applied as the beta diversity indices [14, 15]. The same indices were applied at the class and phylum ranks to elucidate the possible variance in diversity assessment.

Statistical analysis

Significant differences in DNA extraction efficiency were examined by multiple comparisons using the Bonferroni method. ANOVA was performed on the average number of non-chimeric sequence reads for each soil sample size (0.2, 1, and 5 g). Parametric t-tests were performed between the average number of OTUs derived from the 0.2 and 1 g samples. Wilcoxon rank-sum test was used to evaluate the difference in each diversity index from the 0.2 and 1 g samples. Non-metric multidimensional scaling (NMDS) based on Bray–Curtis dissimilarity was applied to compare the cohesion of plots across the soil sample sizes. Linear discriminant analysis was performed between the average numbers of OTUs derived from the 0.2 and 1 g samples using LEfSE [16]. All statistical analyses were performed using R (ver 3.6.1) with the “vegan” and “MASS” packages.

Results and discussion

DNA extraction

The DNA extraction efficiency is shown in Fig 1. It was calculated by dividing the measured DNA concentration (μg mL-1) by the soil sample size (g). Ideally, DNA extraction efficiency should not change with a change in soil sample size. However, the extraction efficiency corresponding to the soil sample size of 1 g was significantly higher (α < 0.05) than that of the other two sample sizes, based on the multiple comparisons performed.

Fig 1. DNA extraction efficiency by soil sample size.

Fig 1

Different lowercase letters indicate significant differences (α < 0.05). Error bars represent standard errors.

The low extraction efficiency with a sample size of 0.2 g might be owing to the DNA extraction process. The supernatant was recovered from the DNA extraction tube, and small but unrecoverable portions were always generated, especially with the 0.2 g sample using a small amount of the extraction reagent, to avoid taking an intermediate layer containing proteins. In contrast, the low extraction efficiency with a sample size of 5 g might be because of the bead-beating strength. DNA extraction for the 0.2 and 1 g soil samples was performed with horizontal shaking, whereas vertical shaking was used in the case of the 5 g soil sample. This might have resulted in the low efficiencies of cell disruption and DNA extraction.

Amplicon sequencing

Non-chimera reads, total OTUs, and non-single/double OTU sequences for each sample are shown in Table 2. After quality filtering and chimera removal, we obtained 604,543 reads, with a range of 29,043–45,888 reads per sample. These numbers are adequate for achieving acceptable coverage [17]. The non-chimeric reads were a consequence of differences in DNA extraction conditions for the same soil sample. In addition, there was no significant difference in the average number of non-chimeric reads among the soil sample sizes by ANOVA; therefore, we used the obtained non-chimeric reads directly for the following analyses. The averages of relative abundances identified via homology search at the order rank for each soil sample size are shown in Fig 2. The sequence data were deposited in the DDBJ Sequence Read Archive database under the BioProject ID PRJDB10834 with BioSample IDs SAMD00260206–SAMD00236620.

Table 2. Number of reads, OTUs, and OTUs excluding singleton and doubleton in each sample.

Sample name Non-chimera reads Total OTUs Non-S/D OTUs
soil-02g-1 39220 1959 1909
soil-02g-2 42242 1641 1612
soil-02g-3 45395 1733 1694
soil-02g-4 45888 1910 1895
soil-02g-5 44782 1653 1604
soil-1g-1 39646 1809 1744
soil-1g-2 35511 1709 1651
soil-1g-3 35930 1574 1519
soil-1g-4 41860 2029 1971
soil-1g-5 44150 1911 1852
soil-5g-1 31884 911 893
soil-5g-2 42882 1335 1307
soil-5g-3 45452 1651 1608
soil-5g-4 40658 1858 1798
soil-5g-5 29043 1532 1506

OTU, operational taxonomic units; S/D, non-single/double.

Fig 2. Relative abundance at the order rank in each sample.

Fig 2

The top 10 taxa are shown. “Others” includes all minor taxa.

Alpha diversities of different soil sample sizes

The number of taxa at the order rank determined for different soil sample sizes is shown in Fig 3. The total number of taxa increased with decreasing soil sample size. A small soil sample size might exhibit the so-called hot spot effect; taxa with a low relative abundance have been shown to be accidentally detected owing to small and local sampling [8, 9].

Fig 3. Determined number of taxa at the order rank in different soil sample size.

Fig 3

Error bars represent standard errors.

Box plots of the Shannon–Wiener index across the soil sample sizes are shown in Fig 4. A large variation in the index was observed in the 5 g soil sample. Furthermore, this soil sample size included the minimum and maximum index values among all the soil sample sizes. The variation in the Shannon–Wiener index and the low efficiency of DNA extraction in the case of 5 g soil sample size could be attributed to the difficulty in disrupting cells by vertical shaking. Therefore, in the subsequent sections, we will focus only on the comparison between 0.2 and 1 g soil sample sizes and not that of the 5 g soil sample size. The results for the 5 g soil sample size are only shown in the following figures for information. Comparing the 0.2 and 1 g soil sample sizes, the determined number of taxa did not present a significant difference (t-test, α < 0.05; Fig 3). Moreover, the values of the Shannon–Wiener index did not show a significant difference (non-parametric t-test, α < 0.05; Fig 4).

Fig 4. Boxplots of the Shannon–Wiener indices at the order rank.

Fig 4

Top and bottom of the boxes are quartiles, whiskers are the maximum and minimum values, and the × marks represent the mean values. The lines that divide the box into two parts are medians (n = 5).

Beta diversities of different soil sample sizes

The Bray–Curtis dissimilarity index values for each soil sample size are shown in Fig 5. There was no significant difference between the 0.2 and 1 g soil sample sizes in the non-parametric t-test (α < 0.05). The index values of Sørensen–Dice Dissimilarity for each soil sample size are shown in Fig 6. Sørensen–Dice Dissimilarity exhibits reproducibility of detected microbial flora at every repeated sampling, depending on the presence or absence of taxa. The index value of the 0.2 g soil sample size was higher than that of the 1 g soil sample size, and there was a significant difference between the two sample sizes in the non-parametric t-test (α < 0.05). Ellinsøe and Johnsen [9] indicated that a small soil sample size was highly susceptible to the influence of hot spots, thereby resulting in the determination of microbial compositions that fluctuate considerably. In our study, the 0.2 g soil sample size exhibited fluctuations in microbial composition. Penton et al. [10] demonstrated that a large soil sample size would be sufficient to acquire non-dominant (minor) taxa because of the abundant sequence reads provided by next-generation sequencing. The 0.2 g soil sample size, however, detected a higher number of taxa, as well as more minor taxa (Fig 3), than the 1 g soil sample size. In our study, minor taxa were detected in the smaller soil sample size, similar to the report by Ellinsøe and Johnsen [9].

Fig 5. Index values of Bray–Curtis dissimilarities at the order rank.

Fig 5

Error bars represent standard errors. There was no significant difference between soil samples of 0.2 g and 1 g.

Fig 6. Averages of the Sørensen–Dice indices at the order rank.

Fig 6

Different lowercase letters indicate significant differences (α < 0.05).

Visualizing community structure

The NMDS plots are shown in Fig 7. Significant differences in the plot aggregation sites between the 0.2 and 1 g soil sample sizes indicate different microbial compositions. The LEfSe algorithm for the 10 major taxa, which accounted for a large proportion of the taxa, was used to clarify the effects of taxa on the determination of belonging to the two groups. The taxa with significantly different proportions between 0.2 and 1 g soil sample sizes are shown in Fig 8. Taxa of the 0.2 g soil sample size, whose relative abundance was significantly higher than that of the 1 g soil sample size, were as follows: Acidimicrobiales, Propionibacteriales, and Sphingomonadales. The samples of 1 g size included Vicinamibacter and Rhizobiales.

Fig 7. Non-metric multidimensional scaling (NMDS) plots at the order rank.

Fig 7

The orange (upper) circle shows the aggregation sites with the 1 g soil sample sizes, and the blue (lower) circle shows those of the 0.2 g soil sample sizes. Points outside the circles are those of the 5 g soil sample sizes. 2D stress was 0.19.

Fig 8. Noticeable taxa at the order rank that determine the two groups.

Fig 8

Taxa in the 0.2 g and 1 g soil sample sizes were analyzed by the LEfSe algorithm. The 0.2 g samples presented significantly higher relative abundance of Sphingomonadales, Acidimicrobiales, and Propionibacteriales. 02g, 0.2 g sample size; 1g, 1 g sample size.

One of the common features of taxa with different relative abundances between them is the unique structure of the bacterial cell wall and cell membrane (Table 3). Gram-positive bacteria have a peptidoglycan structure, which makes them highly resistant to physical shear stress [18]. The taxa that were more abundant in the 0.2 g soil sample size (Acidimicrobiales and Propionibacteriales) belonged to gram-positive bacteria. This suggests that the bacteria in the 0.2 g soil sample size had a higher physical shear tolerance than those in the 1 g soil sample size. However, Sphingomonadales, belonging to gram-negative bacteria, were also identified as taxa with a larger population ratio in the case of 0.2 g soil sample size, rather than a 1 g soil sample size. Sphingomonadales lacks lipopolysaccharides, which are major components of the outer membrane of gram-negative bacteria. Instead, the outer membrane contains a large amount of sphingolipids [19, 20]. Sphingolipids are relatively resistant to cell disruption, which is attributed to their large capacity to form hydrogen bonds with other components of the cell membrane, resulting in a highly packed structure [20, 21]. The cell membranes of Sphingomonadales are denser and more tolerant to shear stress than those of other gram-negative bacteria [22]. Thus, microbial composition depends on the physical shear tolerance of the microbial community. The 0.2 g soil samples, which had a higher relative abundance of bacteria with high physical shear stress, are presumed to be exposed to severe stress conditions. This could be attributed to the small amount of soil that received higher shaking strength from the mixer. Although the difference in physical shear stress could also be derived from the type of mixer and the shape of containers, using small amounts of soil for DNA extraction, i.e., larger rated power output per unit soil weight, could provide an improved consequence with intense grinding.

Table 3. List of taxa whose relative abundances were significantly different between the 0.2 and 1 g soil sample sizes and their characteristics.

Kingdom Phylum Class Order Gram-Positive/Negative Bacteria Soil sample size with a significantly large relative abundance
Bacteria Acidobacteria Vicinamibacter Vicinamibacter Negative 1 g
Bacteria Actinobacteria Acidimicrobiia Acidimicrobiales Positive 0.2 g
Bacteria Actinobacteria Actinobacteria Propionibacteriales Positive 0.2 g
Bacteria Proteobacteria Alphaproteobacteria Rhizobiales Negative 1 g
Bacteria Proteobacteria Alphaproteobacteria Sphingomonadales Negative 0.2 g

Changes in taxonomic rank

The effects of changes in the taxonomic rank on the diversity indices were compared at the order, class, and phylum ranks. The variability in microbial composition at each sample size of soil was compared using the number of taxa, Shannon–Wiener index, Bray–Curtis dissimilarity, NMDS, and Sørensen–Dice index. All indices, except for the Sørensen–Dice index, were determined at the three taxonomic ranks for the different sizes of the soil samples, and are shown in S1 and S2 Figs and Fig 9. There was no significant difference in these values (α < 0.05) between the 0.2 and 1 g soil sample sizes at any taxonomic rank. It could be concluded that the results of diversity assessment using these indices were stable at any taxonomic rank. The number of taxa decreased with increasing taxonomic ranks for all ranks as shown in S1 Fig. Taxa, originally classified as different in a lower hierarchy, were integrated into a common taxon by going up a rank. In this study, some minor taxa were allocated to major taxa in a higher rank by integration; therefore, the discussion of microbial composition based on higher taxonomic rank might overlook the existence of minor taxa in the sample.

Fig 9. Bray–Curtis dissimilarity at the three taxonomic ranks.

Fig 9

Error bars represent standard errors.

NMDS plots at the class and phylum ranks are shown in S3 and S4 Figs. Differences in the plot aggregation sites between the 0.2 and 1 g soil sample sizes were observed at all ranks (Fig 7, S3 and S4 Figs). In other words, in this study, the microbial community structure was preserved even when the taxonomic rank was changed.

The Sørensen–Dice indices for the three taxonomic ranks are shown in Fig 10. A non-parametric t-test between the index value of 0.2 and 1 g soil sample sizes revealed that the 0.2 g soil sample size was significantly smaller at the phylum rank (α < 0.05), whereas that of the 1 g soil sample size was significantly smaller at the order rank. Focusing on the detected taxa to trace the taxonomic hierarchy, some rare taxa in the 0.2 g soil sample size, which contributed to the increase in the value of the Sørensen–Dice index at the order rank, were integrated into some of the major taxa at the phylum rank. The integration might contribute to a decrease in the Sørensen–Dice index at the phylum rank. In other words, the dispersion of taxa with lower taxonomic hierarchy would occur if a wide variety of taxa sequences exist in the samples. Given the dispersion, the value of the Sørensen–Dice index of the 1 g soil sample size at order rank should be large, at least larger than that of the 0.2 g soil sample size, because the 1 g level value was larger at the higher hierarchy (phylum) rank. However, the index value was lower than that of the 0.2 g sample size at the order rank, meaning the 1 g soil sample size might not include a wide variety of taxa sequences, especially those of minor taxa. Such an assumption would contradict the current recognition that a large number of amplicon sequence reads can ensure the detection of rare species, as reported by Penton et al. [10]. One reason for this contradiction would be the taxonomic rank used in their analysis. Penton et al. [10] did not describe the taxonomic rank they used; however, if they used a high taxonomic rank, such a higher taxonomic hierarchy, it might not be appropriate for the detection and discussion of rare species because of the characteristic of integration. When considering a microbial community structure at lower taxonomic ranks, such as species, a large number of amplicon sequence reads may not ensure the detection of minor taxa due to a wide variety of branching in the phylogenetic tree. In this case, applying a small soil sample size might effectively detect minor taxa using the hot spot effect. In their review, Gołębiewski and Tretyn [23] summarized that small soil sample sizes (e.g., 0.25 g) are appropriate for studying microbial diversity.

Fig 10. Sørensen–Dice index at the three taxonomic ranks.

Fig 10

Error bars represent standard errors. Asterisks indicate significant differences (α < 0.05).

Determination of soil sample size

In the previous sections, two soil sample sizes, 0.2 and 1 g, were weighed for the metagenomic analysis of soil microbial community structure. Although both sample sizes had advantages and disadvantages, we concluded that a 0.2 g soil sample size is recommended, especially for practical use. Regarding the 0.2 g soil sample size, a wide variety of taxa, including those with resistance to shear stress, were detected (Fig 2), possibly because a vortex mixer gave a higher rated power output per unit soil weight, and the smaller soil sample size was more sensitive to the hot spot effect. The values of the diversity indices, except for the Sørensen–Dice index, were almost the same between the 0.2 and 1 g soil sample sizes at different taxonomic ranks. In addition, the 0.2 g soil sample size requires smaller amounts of reagents and consumable goods for DNA extraction, making the operation economical. In contrast, the 1 g soil sample size presented stable microbial community structures at the lower taxonomic rank (order rank, Fig 10). The DNA extraction efficiency was better than that of the 0.2 g soil sample size (Fig 1). With the continued improvements in the performance of taxonomic databases and analytical techniques, an analysis of microbial community structure at a lower taxonomic rank, such as species rank, would be possible. In such a case, under the possible dispersion of taxa with the lowering of the taxonomic hierarchy, presenting a stable structure would also be required, and a larger soil sample size would be appropriate in that case.

In addition to soil sample size, the relationship between the amount of soil (soil sample size) and extraction container volume (tube volume) would be important for DNA extraction and subsequent analyses. Penton et al. [10] also demonstrated DNA extraction and subsequent analyses using a soil sample size of 0.25 g with a different DNA extraction kit (2 mL tube). The kit was prepared by the same manufacturer as that used in the cases of the 1–10 g soil sample sizes (50 mL tube). When comparing the 0.25 and 1 g soil sample sizes on numbers of bacterial OTUs determined, the 0.25 g soil sample size provided stable analytical results, such as reproducibility of microbial community structure. This does not indicate that a large soil sample size is always better. It only implies that the relationship between the soil sample size and extraction container volume is also considered essential.

Conclusions

In this study, we prepared soil samples 0.2, 1, and 5 g in size, and evaluated the effect of the soil sample size on DNA extraction and microbial community structure analysis using amplicon sequence data. We focused on the relationship between the soil amount and the extraction container volume in DNA extraction, and the differences in microbial community structure between two soil sample sizes (0.2 g and 1 g) at different taxonomic ranks. In the 0.2 and 1 g soil sample sizes, the numbers of taxa and α- and β-diversity indices used were stable, with almost the same values at each taxonomic rank. However, NMDS showed that the microbial compositions of the two cases at the order rank were clearly different, and the difference was also observed at higher taxonomic ranks. The soil sample size and tube volume affected the estimated microbial community structure. A 0.2 g soil sample size would be preferable to larger sample sizes because of the possible higher shearing force for DNA extraction and lower costs for practical use with smaller amounts of consumables. When the taxonomic rank was changed from order to phylum, some minor taxa identified at the order rank were integrated into major taxa at the phylum rank. The integration affected the value of the Sørensen–Dice index. Therefore, the microbial community structure analysis was influenced by the taxonomic rank applied.

Supporting information

S1 Fig. Number of taxa determined at the three taxonomic ranks.

Error bars represent standard errors.

(TIF)

S2 Fig. Shannon–Wiener index at the three taxonomic ranks.

Error bars represent standard errors.

(TIF)

S3 Fig. Non-metric multidimensional scaling (NMDS) plot at the class rank.

Aggregation points of the 0.2 g soil sample size are shown in blue (right) circle and those of the 1 g soil sample size are shown in orange (left) circle. Points outside the circles are for the 5 g soil sample size. 2D stress was 0.14.

(TIF)

S4 Fig. Non-metric multidimensional scaling (NMDS) plot at the phylum rank.

Agglomeration points of the 0.2 g soil sample size are shown in the blue (left) circle, and those of the 1 g soil sample size are shown in the orange (right) circle. Points outside the circles are for the 5 g soil sample size. 2D stress was 0.061.

(TIF)

Acknowledgments

We would like to thank Editage (www.editage.com) for English language editing.

Data Availability

database URL:https://www.ddbj.nig.ac.jp/index.html accession number:DRA011569.

Funding Statement

This study is supported by “The Sanitation Value Chain: Designing Sanitation Systems as Eco-Community Value System” Project (Project Leader: Prof. Taro Yamauchi) at Research Institute for Humanity and Nature (RIHN, Project No. 14200107). The funder had no role in study design, data collection, analysis, decision to publish, nor preparation of the manuscript.

References

  • 1.van der Hejiden MGA, Bardgett RD, Van Straalen NM. The unseen majority: Soil microbes as drivers of plant diversity and productivity in terrestrial ecosystems. Ecology letters. 2008; 11(3): 296–310. doi: 10.1111/j.1461-0248.2007.01139.x [DOI] [PubMed] [Google Scholar]
  • 2.Fierer N, Jackson RB. The diversity and biogeography of soil bacterial communities. Proceedings of the National Academy of Sciences. 2006; 103(3): 626–631. doi: 10.1073/pnas.0507535103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Nunan N, Wu K, Young IM, Crawford JW, Ritz K. Spatial distribution of bacterial communities and their relationships with the micro-architecture of soil. FEMS Microbiology Ecology. 2003; 44(2): 203–215. doi: 10.1016/S0168-6496(03)00027-8 [DOI] [PubMed] [Google Scholar]
  • 4.Baker KL, Kangenheder S, Nicol G. Environmental and spatial characterisation of bacterial community composition in soil to inform sampling strategies. Soil Biology and Biochemistry. 2009; 41(11): 2292–2298. [Google Scholar]
  • 5.Dechesne A, Owsianiak M, Bazire A, Grundmann GL, Binning PJ, et al. Biodegradation in a Partially Saturated Sand Matrix: Compounding Effects of Water Content, Bacterial Spatial Distribution, and Motility. Environmental Science & Technology. 2010; 44(7): 2386–2392. [DOI] [PubMed] [Google Scholar]
  • 6.Lombard N, Prestat E, van Elsas JD, Simonet P. Soil-specific limitations for access and analysis of soil microbial communities by metagenomics. FEMS Microbiology Ecology. 2011; 78(1): 31–49. doi: 10.1111/j.1574-6941.2011.01140.x [DOI] [PubMed] [Google Scholar]
  • 7.Vestergaard G, Schulz S, Schöler A, Schloter M. Making big data smart-how to use metagenomics to understand soil quality, Biology and Fertility of Soils. 2017; 53: 479–484. [Google Scholar]
  • 8.Kang S, Mills AL. The effect of sample size in studies of soil microbial community structure. Journal of Microbiological Methods. 2006; 66(2): 242–250. doi: 10.1016/j.mimet.2005.11.013 [DOI] [PubMed] [Google Scholar]
  • 9.Ellinsøe P, Johnsen K. Influence of soil sample size on the assessment of bacterial community structure. Soil Biology and Biochemistry. 2002; 34(11): 1701–1707. [Google Scholar]
  • 10.Penton R, Christopher G, Vadakattu VSR, Yu J, Tiedie JM. Size Matters: Assessing Optimum Soil Sample Size for Fungal and Bacterial Community Structure Analyses Using High Throughput Sequencing of rRNA Gene Amplicons. Frontiers in Microbiology. 2016; 824(7): 1–11. doi: 10.3389/fmicb.2016.00824 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Song Z, Schlatter D, Kennegy P, Kinkel LL, Kistler CH, et al. Effort versus Reward: Preparing Samples for Fungal Community Characterization in High Throughput Sequencing Surveys of Soils. PLoS ONE. 2014; 10(5). Available from: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0127234 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Hugerth LW, Wefer HA, Lundin A, Jakobsson HE, Lindberg M, et al. DegePrime, a program for degenerate primer design for broad-taxonomic-range PCR in microbial ecology studies. Applied and environmental microbiology. 2014; 80(16): 5116–5123. doi: 10.1128/AEM.01403-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Yoon SH, Ha SM, Kwon S, Lim J, Kim YE, et al. Introducing EzBioCloud: a taxonomically united database of 16S rRNA gene sequences and whole-genome assemblies. International Journal of Systematic and Evolutionary Microbiology. 2017; 67(5): 1613–1617. doi: 10.1099/ijsem.0.001755 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Castle SC, Song Z, Gohl DM, Gutknecht JLM, Rosen CJ, et al. DNA Template Dillution Impacts Amplicon Sequencing-Based Estimates of Soil Fungal Diversity. Phytobiomes Journal. 2018; 2(2): 100–107. [Google Scholar]
  • 15.Wen C, Wu L, Qin Y, Nostrand DVJ, Ning D, et al. Evaluation of the reproducibility of amplicon sequencing with Illumina MiSeq platform. PLoS ONE. 2017; 12(4). Available from: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0176716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, et al. Metagenomic biomarker discovery and explanation. Genome Biology. 2011; 12 (6): 60–70. doi: 10.1186/gb-2011-12-6-r60 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Schöler A, Jacquiod A, Vestergaard G, Schulz S, Schloter M. Analysis of soil microbial communities based on amplicon sequencing of marker genes, Biol Fertil Soils. 2017; 53: 485–489. [Google Scholar]
  • 18.Boone DR, Castenholz RW, Garrity GM. Bergey’s manual of systematic bacteriology. 4th ed. Springer; 2011. 782–783.
  • 19.Kawasaki S, Moriguchi R, Sekiya K, Nakai T, Ono E, et al. The cell envelope structure of the lipopolysaccharide-lacking gram-negative bacterium Sphingomonas paucimobilis. Journal of Bacteriology. 1994; 176(2): 284–290. doi: 10.1128/jb.176.2.284-290.1994 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Pascher I. Molecular arrangements in sphingolipids Conformation and hydrogen bonding of ceramide and their implication on membrane stability and permeability. Biochimica et Biophysica Acta (BBA). 1976; 455(2): 433–451. doi: 10.1016/0005-2736(76)90316-3 [DOI] [PubMed] [Google Scholar]
  • 21.Goñi FM, Alonso A. Biophysics of sphingolipids I. Membrane properties of sphingosine, ceramides and other simple sphingolipids. Biochimica et Biophysica Acta (BBA)–Biomembranes. 2006; 1758(12): 1902–1921. doi: 10.1016/j.bbamem.2006.09.011 [DOI] [PubMed] [Google Scholar]
  • 22.Dickson RC. Thematic Review Series: Sphingolipids. New insights into sphingolipid metabolism and function in budding yeast. Journal of Lipid Research. 2008; 49(5): 909–921. doi: 10.1194/jlr.R800003-JLR200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Gołębiewski M, Tretyn A. Generating amplicon reads for microbial community assessment with next-generation sequencing. Journal of Applied Microbiology. 2019; 128: 330–354. doi: 10.1111/jam.14380 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Ashwani Kumar

4 Aug 2021

PONE-D-21-20731

The effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks

PLOS ONE

Dear Dr. %Satoshi%,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

ACADEMIC EDITOR: Ashwani Kumar

Please submit your revised manuscript by %20-08-2021%. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Ashwani Kumar

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at 

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Thank you for stating the following in your Competing Interests section:  

[NO]. 

Please complete your Competing Interests on the online submission form to state any Competing Interests. If you have no competing interests, please state "The authors have declared that no competing interests exist.", as detailed online in our guide for authors at http://journals.plos.org/plosone/s/submit-now 

 This information should be included in your cover letter; we will change the online submission form on your behalf.

3. PLOS requires an ORCID iD for the corresponding author in Editorial Manager on papers submitted after December 6th, 2016. Please ensure that you have an ORCID iD and that it is validated in Editorial Manager. To do this, go to ‘Update my Information’ (in the upper left-hand corner of the main menu), and click on the Fetch/Validate link next to the ORCID field. This will take you to the ORCID site and allow you to create a new iD or authenticate a pre-existing iD in Editorial Manager. Please see the following video for instructions on linking an ORCID iD to your Editorial Manager account: https://www.youtube.com/watch?v=_xcclfuvtxQ

4. We note that S-Figures 1 and 2 in your submission contain copyrighted images. All PLOS content is published under the Creative Commons Attribution License (CC BY 4.0), which means that the manuscript, images, and Supporting Information files will be freely available online, and any third party is permitted to access, download, copy, distribute, and use these materials in any way, even commercially, with proper attribution. For more information, see our copyright guidelines: http://journals.plos.org/plosone/s/licenses-and-copyright.

We require you to either (1) present written permission from the copyright holder to publish these figures specifically under the CC BY 4.0 license, or (2) remove the figures from your submission:

a) You may seek permission from the original copyright holder of Figures S-Figure 1 and S-Figure 2 to publish the content specifically under the CC BY 4.0 license. 

We recommend that you contact the original copyright holder with the Content Permission Form (http://journals.plos.org/plosone/s/file?id=7c09/content-permission-form.pdf) and the following text:

“I request permission for the open-access journal PLOS ONE to publish XXX under the Creative Commons Attribution License (CCAL) CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). Please be aware that this license allows unrestricted use and distribution, even commercially, by third parties. Please reply and provide explicit written permission to publish XXX under a CC BY license and complete the attached form.”

Please upload the completed Content Permission Form or other proof of granted permissions as an "Other" file with your submission. 

In the figure caption of the copyrighted figure, please include the following text: “Reprinted from [ref] under a CC BY license, with permission from [name of publisher], original copyright [original copyright year].”

b) If you are unable to obtain permission from the original copyright holder to publish these figures under the CC BY 4.0 license or if the copyright holder’s requirements are incompatible with the CC BY 4.0 license, please either i) remove the figure or ii) supply a replacement figure that complies with the CC BY 4.0 license. Please check copyright information on all replacement figures and update the figure caption with source information. If applicable, please specify in the figure caption text when a figure is similar but not identical to the original image and is therefore for illustrative purposes only.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Partly

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: your work "the effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks" is very interesting and useful for cauterising soil and understanding the nature of microorganisms presentation.

Reviewer #2: The manuscript entitled ‘The effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks' tried to determine the optimal soil sample size in regard for microbial community structure. They focused on the relationship between soil amount and DNA extraction container volume and the alteration in microbial composition at different taxonomic ranks (order, class, and phylum). Three soil sample size were tested, and they concluded that microbial diversity and diversity assessment were stable on 0.2 and 1-g samples. It is interesting to found the effect of sample size to microbial diversity. Thus, I think it is valuable to publish the paper, which would provide technical insight of sampling size.

However, I would like to propose a small suggestion to the author to adjustment the conclusion. It would be easier for the reader if the conclusion is brief and concise, and which sample size practice is the best.

Reviewer #3: General comments

This manuscript reports on the optimal soil sample size for microbial community structure analysis. Besides, this study also focused on the relationship between soil amount and DNA extraction container volume and the alteration in the microbial composition at different taxonomic ranks. The results suggest that microbial community structure analysis is influenced by the taxonomic rank applied.

However, I have some concerns about some of the methodology used in this study. For example, the sample sizes used and also the number of genes used in the microbial composition analysis. These indirectly impacts the general results and findings of this study. Moreover, the authors need to clarify the novelty of their study as the same study has been shown in other research, Penton et al., 2016, even though it has been cited in the study. The authors should make this clear in the introduction and make a more comprehensive conclusion.

Hence, I have to reject this manuscript because it does not meet the PLoS ONE criteria

However, these studies still have an impact of results in its own way

* Consider carefully proofreading – there are several grammatical errors and also replicate of a sentence.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Nov 18;16(11):e0260121. doi: 10.1371/journal.pone.0260121.r002

Author response to Decision Letter 0


8 Sep 2021

Reviewer #1: your work "the effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks" is very interesting and useful for cauterising soil and understanding the nature of microorganisms presentation.

Author: Thank you very much for your encouraging words.

Reviewer #2-1: The manuscript entitled ‘The effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks' tried to determine the optimal soil sample size in regard for microbial community structure. They focused on the relationship between soil amount and DNA extraction container volume and the alteration in microbial composition at different taxonomic ranks (order, class, and phylum). Three soil sample size were tested, and they concluded that microbial diversity and diversity assessment were stable on 0.2 and 1-g samples. It is interesting to found the effect of sample size to microbial diversity. Thus, I think it is valuable to publish the paper, which would provide technical insight of sampling size.

Author:We appreciate the reviewer’s high evaluation.

Reviewer #2-2: However, I would like to propose a small suggestion to the author to adjustment the conclusion. It would be easier for the reader if the conclusion is brief and concise, and which sample size practice is the best.

Author: Thank you very much for the useful suggestion. We have concluded that the 0.2-g soil sample level would be preferable due to its higher detection of bacteria with a high tolerance of shearing force and its economic efficiency [L. 31 and 402]. Also, we made the Abstract and the Conclusion briefer and more concise [L. 17, 27, 30, 393, and 401].

Reviewer #3-1: General comments

This manuscript reports on the optimal soil sample size for microbial community structure analysis. Besides, this study also focused on the relationship between soil amount and DNA extraction container volume and the alteration in the microbial composition at different taxonomic ranks. The results suggest that microbial community structure analysis is influenced by the taxonomic rank applied.

However, I have some concerns about some of the methodology used in this study. For example, the sample sizes used and also the number of genes used in the microbial composition analysis. These indirectly impacts the general results and findings of this study.

Author: Thank you very much for your meaningful comments. As the reviewer mentioned, the soil sample size might affect the microbial composition analysis. Also, we have focused on the relationship between the soil sample size and the DNA extraction conditions (e.g., the volume of the container). Regarding the study using amplicon sequence data, Penton et al. and Song et al. have studied the soil sample size effect. However, However, Penton et al. evaluated the effect of the soil sample sizes (1, 5, and 10 g) using the DNA extraction kit for 10 g of soil, with the result that a larger soil sample size would be preferable. In this respect, each DNA extraction kit would have an optimal soil volume, and the smaller soil sample size in the study might deviate from the optimal volume. In other words, the relationship between the soil volume and the container volume for DNA extraction would be important, as well as the soil sample size effect, and we have investigated the points. Song et al. also elucidated the DNA extraction efficiency using the different soil sample sizes (0.25 g and 10 g) with each optimal-size DNA extraction kit, respectively. There was no significant difference in the OUT richness between the two soil sample sizes, and this result would support our idea, meaning the relationship between the soil volume and the container volume is also essential. Therefore, we added the report by Song et al. in the reference list [L. 71].

We would have liked to carry out further soil sample size cases. However, the horizontal shaking adapter for 50 mL tube volume for the vortex mixer can only equip two tubes. It is inconvenient for practical use because we usually treat many samples at one time. Therefore, we have tried a vertical, not a horizontal shaker, which can equip 6 tubes at once, to extract DNA for practical use.

We would think that the sample size indicated by Reviewer#3 means the soil sample size. However, generally, sample size means repetition of one sample. Therefore, to distinguish soil sample volume from the so-called sample size, we added the word “soil” prior to sample size, meaning soil sample volume. Also, we apologize for the confusion. Relating to the so-called sample size, we also added the reason for our repetition number of one sample (n = 5) by referring to Song et al. (2014) and Vestergaard et al. (2017) [L.50 and 117].

The number of genes also would affect microbial composition analyses. We think the reviewer’s indication has two meanings: one is the volume of sequence reads, and the other is the equality of the sequence reads by rarefaction. For the former points, 30,000-40,000 reads were obtained through the experimental protocols in our study. Therefore, the sequence reads would be enough to evaluate microbial composition analysis (Schöler et al., 2017). Also, we rewrote the sequence reads of Penton et al. from its total reads (over 5,000,000 reads) to the reads of each sample (over 56,000 reads) for the comparison of our sequence reads [L. 69]. For the latter points, we used the same soil, not the different soil. So, the number of sequence reads was one of the consequences of comparing the different soil sample sizes for DNA extraction. Therefore, we did not rarefy the read number and conducted the microbial community analysis using the obtained read number. We mentioned the point in the revised manuscript [L. 193]. By the way, the average number of sequence reads in each soil sample size would be confirmed to be almost the same (no significant difference) by applying ANOVA [L. 161].

Reviewer #3-2: Moreover, the authors need to clarify the novelty of their study as the same study has been shown in other research, Penton et al., 2016, even though it has been cited in the study. The authors should make this clear in the introduction and make a more comprehensive conclusion.

Author: Thank you for your advice. Our main discussion points are as follows; 1) effects of the target taxonomic rank and annotated taxa in microbial diversity analysis and 2) the relationship between soil volume and tube volume for DNA extraction. Firstly, we summarized the section of our motivations in an orderly way (firstly, secondly, finally) in the Introduction section [L. 76, 88, and 98]. Then, we noticed that the explanation of the former discussion point, especially using annotated taxa for microbial diversity analysis, had not been adequate in the original manuscript, so we described the point in more detail [L. 73 and 108]. We added concerns about using OTUs not annotated taxa for microbial diversity analysis [L. 76]. Also, we referred to the report by Song et al. (2015) to support our discussion [L. 71, 95]. By the way, we found our misunderstanding about the target taxonomic rank in the study by Penten et al. (2016). They did not determine a specific taxonomic rank. We apologize and corrected the point [L. 76].

We also discussed the importance of setting target taxonomic rank and using annotated taxa in microbial diversity analysis in the Results and Discussion section. We added some ideas for the transition of the Sørensen-Dice index with lowering the taxonomic rank relating to the detection of minor taxa. In addition, we wrote the possible reason for the different conclusions about detecting minor taxa between ours and Penton et al.’s. [L.339 and 348]. We also added the reference that discussed soil sample size for the detection of minor taxa [L. 355].

We re-evaluated our results for a more straightforward conclusion and concluded that a 0.2-g soil sample size would be better than a 1-g soil sample size because the soil sample size could detect taxa with higher physical shear tolerance and was carried out at a lower cost. We rewrote the choice of the two soil sample sizes in the Results and Discussion section (Determination of Sample Size) [L. 364] and the Conclusions section [L. 402]. In the Determination of Sample Size section, we summarized the advantages of each soil sample size (0.2 and 1 g) [L. 363].

We added the following contents to explain the points mentioned (underline shows the added sentences and phrases). In addition, we cleared some information in the Introduction that did not relate to the main discussing points to avoid redundancy [L. 43]. Also, we summarized the Conclusion to make it more concise [L. 393].

Reviewer #3-3: Hence, I have to reject this manuscript because it does not meet the PLoS ONE criteria

However, these studies still have an impact of results in its own way

* Consider carefully proofreading – there are several grammatical errors and also replicate of a sentence.

Author: Thank you for your advice. We have again discussed our novelty and the differences from previous reports and revised the manuscript as we mentioned above. We have sincerely read the revised manuscript and used the English proofreading service again.

Attachment

Submitted filename: 210904_Response to Reviewers.docx

Decision Letter 1

Ashwani Kumar

3 Nov 2021

The effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks

PONE-D-21-20731R1

Dear Dr. Akao,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Ashwani Kumar

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Acceptance letter

Ashwani Kumar

8 Nov 2021

PONE-D-21-20731R1

The effect of soil sample size, for practical DNA extraction, on soil microbial diversity in different taxonomic ranks

Dear Dr. Akao:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. Ashwani Kumar

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Fig. Number of taxa determined at the three taxonomic ranks.

    Error bars represent standard errors.

    (TIF)

    S2 Fig. Shannon–Wiener index at the three taxonomic ranks.

    Error bars represent standard errors.

    (TIF)

    S3 Fig. Non-metric multidimensional scaling (NMDS) plot at the class rank.

    Aggregation points of the 0.2 g soil sample size are shown in blue (right) circle and those of the 1 g soil sample size are shown in orange (left) circle. Points outside the circles are for the 5 g soil sample size. 2D stress was 0.14.

    (TIF)

    S4 Fig. Non-metric multidimensional scaling (NMDS) plot at the phylum rank.

    Agglomeration points of the 0.2 g soil sample size are shown in the blue (left) circle, and those of the 1 g soil sample size are shown in the orange (right) circle. Points outside the circles are for the 5 g soil sample size. 2D stress was 0.061.

    (TIF)

    Attachment

    Submitted filename: 210904_Response to Reviewers.docx

    Data Availability Statement

    database URL:https://www.ddbj.nig.ac.jp/index.html accession number:DRA011569.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES