Skip to main content
PLOS One logoLink to PLOS One
. 2023 Feb 8;18(2):e0280391. doi: 10.1371/journal.pone.0280391

The virtual microbiome: A computational framework to evaluate microbiome analyses

Belén Serrano-Antón 1,¤a,¤b, Francisco Rodríguez-Ventura 1, Pere Colomer-Vidal 1, Riccardo Aiese Cigliano 2,*, Clemente F Arias 1,3,*, Federica Bertocchini 1,*
Editor: Patrizia Falabella4
PMCID: PMC9907852  PMID: 36753469

Abstract

Microbiomes have been the focus of a substantial research effort in the last decades. The composition of microbial populations is normally determined by comparing DNA sequences sampled from those populations with the sequences stored in genomic databases. Therefore, the amount of information available in databanks should be expected to constrain the accuracy of microbiome analyses. Albeit normally ignored in microbiome studies, this constraint could severely compromise the reliability of microbiome data. To test this hypothesis, we generated virtual bacterial populations that exhibit the ecological structure of real-world microbiomes. Confronting the analyses of virtual microbiomes with their original composition revealed critical issues in the current approach to characterizing microbiomes, issues that were empirically confirmed by analyzing the microbiome of Galleria mellonella larvae. To reduce the uncertainty of microbiome data, the effort in the field must be channeled towards significantly increasing the amount of available genomic information and optimizing the use of this information.

Introduction

The characterization of the microorganisms colonizing a particular ambient is becoming a gateway to the analysis of the physiological niche that the environment represents, revealing its potential functions or eventual pathological conditions. Examples in this direction are represented by the deep interest in the human gut microbiome (the compendium of microorganisms colonizing the human gut), due to the growing concern in the relationships between the microbiome and the immune system, and henceforth in the potential development of disease [14]; or the increasing focus on the genomic analyses of water (ocean or river) and soil samples, in search for potentially useful cataloging of the environmental niches we live in [5].

Microbiome studies are receiving increasing attention for their possible implications in the field of bioremediation, embracing issues such as degradation of organic chemicals, conversion of toxic compounds (e.g. pesticides), production of biofuels, or else from various substrates [69]. The strong interest in communities of microorganisms crossed fields and extended to the studies of insect guts [10, 11]. A growing body of research focuses on insect microbiomes in the quest for solutions within the bioremediation field [12]. For example, termites and beetles have raised interest due to their capacity to degrade lignin, an ability potentially dependent in some cases on microorganisms colonizing certain portions of their intestine [1317]. Recently, some coleopteran and lepidopteran species revealed the astonishing capability to degrade fossil fuel-derived plastics, like the sturdy polyethylene and polystyrene [1821], opening up a new niche within the field of bioremediation by insects. Even if the molecular mechanisms responsible for this extraordinary capacity are still unknown, they are normally ascribed to the microorganisms colonizing the digestive tracts of those insects. This line of research has resulted in an ever-increasing list of microorganisms with the potential to biodegrade plastics, although with still un-concluding outcomes [2026].

Studies in this field typically monitor the changes induced by alternative treatments in the relative abundance of the species present in the microbiome. For instance, feeding insects a diet of plastic is a standard procedure to identify the microorganisms that thrive in their gut as potential candidates for plastic metabolization [18, 20, 23, 2628]. Analogously, the microbiome of patients suffering a given clinical condition is often compared to that of healthy control individuals, expecting the shifts in the relative abundance of microbial species to account for the observed effects or to provide effective diagnostic tools [29]. This functional approach to the study of microbiomes relies on several implicit assumptions, whose general validity is far from evident. First, a causal link is presumed between changing conditions and the differential selection (either positive or negative) of particular microbial species. In the case of plastic fed-insects, for instance, possible metabolic changes induced by a diet of plastic in the host insect are normally neglected, implying that the microorganisms colonizing the digestive tracts are considered responsible for any metabolic activity the animal embarks on.

Leaving aside its biological plausibility, this perspective takes for granted other assumptions that have to do with the methods used to study microbiomes. These methodological assumptions are the focus of this work. It is normally presupposed that currently available techniques are accurate enough to provide reliable quantitative measurements of changes in microbiome composition. Consequently, observed variations in the abundance of species (or other relevant taxa) are regarded as valid indicators of the activity of interest. A close inspection of the currently available empirical tools reveals unexpected and critical drawbacks in this approach.

The most used methodologies to study the microbiome of a chosen animal species are amplicon and whole-genome sequencing (WGS). The former is based on the identification of taxonomical markers such as the ribosomal gene 16S for bacteria. Although this relatively short gene (∼1500 bp) is universal among bacteria and archaea [30], some studies have revealed that 16S rRNA gene does not show precise phylogenetic relationships within particular taxa [31, 32]. For this reason, in recent years there has been an increase in the number of analyses carried out with WGS, which is a more expensive and computationally intensive technique than amplicon.

Amplicon and WGS analyses rely on the comparison of DNA sequences obtained from a sample of the microbial population of interest with the sequences contained in genomic databanks. Therefore, they can only identify those sequences that are already present in databases. In addition, whole genome sequencing data can also be used to generate genome assemblies of the microbial populations, the so-called MAGs (Metagenome Assembled Genomes) which can be used to reconstruct the genome of new species. However, also in this case, databases of known species are used to try to classify the taxonomy of the MAGs. As a consequence, the accuracy of these methods should be expected to depend on the number of sequences contained in the databases used for the analysis.

Remarkably, comparative studies have shown that the results of the characterization of bacterial populations by amplicon and WGS do not necessarily overlap even if they use the same reference databanks. For example, applying both techniques on the same DNA from a human fecal sample resulted in a mere 29% overlapping at the phylum level [33], and revealed differences in the metagenomic results for a human salivary sample, both in the relative abundances of species and in the identified genera [34]. Even the same method (WGS) analyzed with different pipelines yielded results that differ by up to three orders of magnitude on the same data set [35].

In addition to the issues related to the methodology used to characterize microbiomes, the ecological structure of bacterial populations (e.g. diversity or richness of species) and the sampling strategy can also bias their characterization, adding further concerns about the accuracy of microbiome analyses [36, 37]. This scenario raises the following issues: To what extent and under what circumstances can we count on the current experimental approaches to understanding the complexity of symbiotic microorganisms living within the gut (or any other tissue) of an animal? Do the available techniques have intrinsic limitations that might influence the analysis of any microbiome, whatever the colonized environment? Or are these limitations restricted to particular habitats?

To get a deep insight into this puzzling landscape, we took an in-silico approach, creating virtual microbiome communities that simulate the bacterial populations that can be found in humans, insects, etc., or soil, water, and any other medium. These virtual communities can be examined by standard genomic techniques, giving the possibility to confront the results of the analysis with the original microbiome composition.

Within the virtual microbiome framework, it is possible to simulate the analysis of microbial populations with partially underrepresented or limited databanks, and hence to explore the impact of the lack of genomic information on the characterization of microbiomes. Using this approach, we have identified the incompleteness of databases as a key constraint to the accuracy of microbiome analyses. Our results suggest that the poor overlap between amplicon and WGS already encountered with human samples does not seem to be an exception. Furthermore, we show in this work that the simultaneous detection of a species by both techniques does not ensure its positive identification, since the overlap between amplicon and WGS can be dominated by false positives, i.e. by species that are not present in the original samples. This counterintuitive situation worsens notably when limited databases are used in the analyses to simulate the lack of genomic information. In this case, the detection of changes in the relative abundance of bacterial groups can be profoundly misleading. The variations in abundance detected by genomic analyses can be mostly spurious and unrelated to the actual changes taking place in the analyzed virtual microbiomes.

To empirically test the predictions derived from our model, we proceeded with the analysis of the bacteria colonizing different tissues of the larvae of Galleria mellonella, a lepidopteran recognized as capable of degrading polyethylene and polystyrene [19, 2325, 3841]. Despite their abundance, the microbiota of lepidopteran species has been little studied [42, 43], so the bacteria associated with this group have only a marginal representation in databanks to date. This makes G. mellonella larvae an ideal subject to examine the limitations of genomic analyses pinpointed by the virtual microbiome model.

To do that, we applied amplicon sequencing (16S) and WGS (using the same DNA samples) to the anterior part, the gut, and the silk glands of plastic-treated and control larvae. The results reflected the incongruences revealed by our model, with an astonishing lack of overlap between the outcomes of the two techniques at the genus level. Both techniques also showed striking discrepancies in the detection of changes in the insect microbiome.

For those animal species and environments that harbor microbial populations [44], microbiome analysis stands as a fundamental tool in the comprehension of their physiology and ecology. However, the shortcomings inherent to the used tools and the paucity in the available database hinder and mislead the outcome, and, therefore, the interpretation of microbiome analyses. This work highlights the need for an increased effort in the collection of genomic information and its eventual availability in public databases. In the meantime, the results of experimental studies about the composition or the dynamics of microbiome populations should be interpreted with caution, especially in the case of insects, where the available genomic information is still scarce.

Materials and methods

Generation of virtual microbiomes

Virtual microbiomes were created with a fixed number of species (300). The process of generating abundance matrices was automated by means of an algorithm that takes as input the relevant parameters of the problem (i.e. the number of populations and species and the shape parameters of the distributions, labelled as params) and outputs an abundance matrix (labelled as abundances) that complies with the macroecological laws followed by real-world microbial populations (as defined in [36]). The full implementation of the algorithm can be found at the following link https://github.com/bserranoanton/Virtual-microbiome. In addition, information on all the parameters used for the generation of the abundance matrices of the virtual microbiomes can also be accessed. A simplified pseudocode of the algorithm is shown below. The resulting distributions were tested using the Kolmogorov-Smirnov test for goodness of fit.

Algorithm 1 Generation of the ecological structure of the virtual microbiomes.

1: nFamiliesnFamilies ⊳Number of families

2: nGenusnGenus ⊳Number of genus

3: nSpeciesnSpecies ⊳Number of species

4: nCommunitiesnCommunities ⊳Number of communities

5: muMADmu ⊳mu parameter value for abundance distribution

6: sigmaMADsigma ⊳sigma parameter value for abundance distribution

7: shapeAFDshape ⊳shape parameter value for abundance distribution

8: scaleAFDscale ⊳scale parameter value for abundance distribution

9: muGenusmu ⊳mu parameter value for grouping species into genera

10: sigmaGenussigma ⊳sigma parameter value for grouping species into genera

11: muFamiliesmu ⊳mu parameter value for grouping genera into families

12: sigmaFamiliessigma ⊳sigma parameter value for grouping genera into families

13: procedure GenerateMAD(mu, sigma, n)

14:  MADvectorrandom.lognormal(mu, sigma, n)

15:  MADvectorMADvector/sum(MADvector)

16:  return MADvector

17: procedure GenerateAFD(shape, scale, n)

18:  AFDvectorrandom.gamma(shape, scale, n)

19:  AFDvectorAFDvector/sum(AFDvector)

20:  return AFDvector

21: procedure GenerateAbundanceDistribution(mu, sigma, shape, scale, nCommunities, nSpecies)

22:  MADvectorgenerateMAD(mu, sigma, nSpecies)

23:  abundancesmatrix(nCommunities, nSpecies)

24:  for speciesi ← 1 to nSpecies do

25:   AFDvectorgenerateAFD(shape, scale, nCommunities)

26:   abundances[:,speciesi] ← MADvector[speciesi] * AFDvector * nCommunities

27:  return [MADvector, abundances]

28: procedure EcologyGeneration(nCommunities, nSpecies, mu, sigma, shape, scale)

29:  [MADvector, abundances] ← generateAbundanceDistribution(muMAD, sigmaMAD, shapeAFD, scaleAFD, nCommunities, nSpecies)

30:  r_valuetestTaylorsLaw(MADvector, abundances) ⊳Continue if correlation coefficient between mean and variance ≥0.95

31:  alphaDiv, betaDivgetMeanDiversities(abundances) ⊳And classify scenario in high or low diversity

32:  genusAbundancesgroupSpeciesInGenera(muGenus, sigmaGenus, nGenus, nSpecies) ⊳Groups nSpecies in nGenus following a lognormal distribution

33:  familyAbundancesgroupGeneraInFamilies(muFamilies, sigmaFamilies, nFamilies, nGenus) ⊳Groups nGenus in nFamilies following a lognormal distribution

Genetic composition of virtual microbiomes

The species of virtual microbiomes were chosen from eight families: Clostridiaceae, Flavobacteriaceae, Enterococcaceae, Pseudomonadaceae, Acetobacteraceae, Lactobacillaceae, Enterobacteriaceae and Streptococcaceae. Species are grouped into genera and genera are grouped into families using lognormal distributions. This means that the number of genera in each family and the number of species in each genus follow lognormal distributions.

To generate the complete databank, 16S sequences from all prokaryotes and genome sequences were downloaded on February 2021 and April 2020 from NCBI Nucleotide, respectively (see S1 Data in S2 Appendix). To simulate the lack of information in genomic databases, we created incomplete databases by removing taxonomic information from this databank. We created three different databanks (see S1 Data in S2 Appendix):

  1. Deleting half of the families (Incomplete database DB1).

  2. Deleting half of the genera (Incomplete database DB2).

  3. Deleting half of the species (Incomplete database DB3).

To simulate the lack of information in the databanks about the species present in the virtual microbiomes, we created three scenarios (S2–S4 Data in S2 Appendix):

  1. 100% scenario: All the genomic information of the virtual microbiomes are present in the incomplete database.

  2. 50% scenario: Only half of the genomic information of the virtual microbiomes are present in the incomplete databases.

  3. 25% scenario: Only a quarter of the genomic information of the virtual microbiome are present in the database.

We remark that all the species in the virtual microbiomes are present in the complete database. For scenarios 2 and 3, we selected 50% and 75% of the taxonomical information from scenario 100% and replaced it with information that was not in the corresponding database, taking into account the restrictions exposed in 2.5. For the scenarios with 50% and 25% of the species present in the databases, there are several ways to remove families. In the case of removing families, the choice was made to remove those families that grouped an abundance of around 50% in the first case and 25% in the second (as close as possible within the restrictions exposed in 2.5). Therefore, the 50% scenario gives rise to two scenarios (A and B) and the 25% scenario to four (A, B, C and D). Full information on the taxonomic composition of each scenario can be found in S2–S4 Data of S2 Appendix.

Bioinformatic analysis of virtual microbiomes

The number of reads corresponding to each species was proportional to its abundance in each virtual microbiome. We considered two scenarios regarding the total number of reads: high number of reads (100,000 for 16S and 60,000 for WGS) and low number of reads (50,000 for 16S and 20,000 for WGS). The generation of reads was done with the program Grinder [45] with the following parameters:

Listing 1. Grinder commands for reads generation.

#16S reads

grinder −reference_file microb_16S seq/$name.fasta

 −forward_reverse primer16_v2. fasta −total_reads

 $abundanceTotal −read_dist 300 −insert_dist 550 normal 55

 −unidirectional 1 −length bias 0 −mutation_dist poly4 -fq 1

  −qual_levels 30 10 −base_name $name −output_dir ./ reads16S

#WGS reads

grinder −reference_file microb_WGS_seq/$name.fasta −total_reads

  $abundanceTotal −read_dist 150 −insert_dist 400 normal 40

  −unidirectional 0 −length_bias 0 −mutation_dist poly4 −fq

 1 −qual_levels 30 10 −base_name $name −output_dir ./

 readsWGS

Reference files (reference_file) store all existing sequences for each species in the complete database. In this way, reads are generated for all the species avoiding any bias in the choice of a particular sequence. total_reads refers to the number of reads needed for each species. These reads were analysed by GAIA [46] using the complete and incomplete databases.

For amplicon, the length of reads (read_dist) was set at 300 base pairs (bp), with an overlap of 50 bp (insert_dist of 550 bp). For WGS the length of the reads was 150 bp and insert_dist of 400 bp. We used unidirectional reads (from one strand only) for amplicon and bidirectionally for WGS, as recommended by Grinder documentation (https://sourceforge.net/projects/biogrinder/).

In addition, we introduced sequencing errors in the reads (mutation_dist), under the form of mutations (substitutions, insertions and deletions) at positions that follow a 4th degree polynomial, simulating Illumina errors [47]. The quality of the reads (qual_levels) varies from good (30) to bad (10), for reads with insertions or substitutions. We tried to avoid length bias (i.e. greater contribution of reads from larger genomes) by setting length_bias to 0.

Primers used for the amplification of the V3V4 region in amplicon (parameter forward_reverse) were:

>V3V4F:34

CCTACGGGNGGCWGCAG

>V3V4R:34

GGACTACHVGGGTATCTAATCC

These primers have been shown to be efficient for 16S analysis [48]. However, these sequences were not able to amplify the desired region in a non-negligible percentage of the species. For this reason, a lower number of reads than expected was obtained in some in silico amplifications. To reach the desired number of reads, the remaining percentage of abundance was distributed according to the abundance distribution. Metagenomic analysis of the samples was done with GAIA [46], which allows to obtain a comprehensive and detailed overview at any taxonomic level of microbiomes in an accurate and easy way.

To confirm the results obtained with GAIA, a second set of virtual microbiomes was analyzed with Kraken version 2.1.1 [49]. The databases were downloaded from the repository created by the Kraken2 developers (https://benlangmead.github.io/aws-indexes/k2). Specifically, the 16S database was the Silva138 whereas the WGS database was the Standard (Date 9/26/2022). The Kraken2 classification analysis was performed with default parameters.

Owing to the low overlap between Kraken2 and GAIA databases, it was not possible to analyze the same virtual microbiomes with both tools. The virtual microbiomes analyzed with Kraken were created as described above for GAIA. For Kraken2, only the scenario of 100% species in the databases was considered. Kraken2 analyses were made at the species and genus level for WGS and only at genus for 16S, as this tool does not distinguish species. The virtual microbiomes analyzed with Kraken2 can be found in S9 Data of S2 Appendix.

Analysis of the microbiome of Galleria mellonella larvae

Galleria mellonella larvae were maintained in an incubator at 28 C in the dark, and fed with beeswax from beehives.

For the experimental samples, G. mellonella larvae from the last larval stage (150–200mg) were kept in the presence of commercial plastic films only, during 4 to 6 days. For tissue extraction, live larvae were washed with 70% ethanol and rinsed with sterile water. Dissections were performed using sterilized microsurgery tools. DNA extraction was performed using the Quick-DNATM Tissue/Insect Miniprep Kit (Zymogen Research, Cat. No. D6016). Control larvae fed with beeswax from beehives were treated in the same way.

Amplicon sequencing was perfomed using the V3V4 primers described above to generate 16S Illumina libraries which were then sequenced with a MiSeq producing 300 bp paired-end reads. The same DNA was also used to generate TruSeq PCR-Free DNA libraries to perform whole metagenome sequencing (WGS) with an Illumina Novaseq 6000 producing 150 bp paired-end reads. Both the 16S amplicon sequencing data and the WGS data was processed with the following approach: first, the quality of the reads was assessed with the software FastQC (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/), then low quality bases and adapters were removed with the software Trimmomatic (http://www.usadellab.org/cms/?page=trimmomatic); minimum quality 25 and minimum length 35 bp). Amplicon data was analyzed with the software GAIA as described above. WGS data was first mapped against the G. mellonella reference genome (ASM258982v1) using the software minimap2 (https://github.com/lh3/minimap2) to remove the contamination from the host. Unmapped reads were extracted with samtools (http://www.htslib.org/) and then processed with GAIA to perform the taxonomic classification. Differential abundance analyses were performed with edgeR (https://bioconductor.org/packages/release/bioc/html/edgeR.html) applying the TMM normalization to the abundance matrix of the OTUs. The results of the amplicon and WGS analysis of the microbiome of G. mellonella can be found in S5 and S6 Data of S2 Appendix respectively. S7 and S8 Data in S2 Appendix show the taxa whose abundance changes significantly in larvae fed with plastic in 16S and WGS analysis respectively. The results of the Kraken 2 analyses of the G. mellonella gut microbiome can be found in S10 Data of S2 Appendix.

Results

Virtual microbiome models

Virtual microbiome are models of bacterial populations conceived to evaluate the performance of genomic analyses. Each virtual microbiome consists of a list of species of bacteria and their respective abundances. Their ecological structure (i.e. the taxonomic relationships among species and their relative abundance) was designed to comply with the macro-ecological rules described for real-world microbial communities (see [36] and S1 Appendix).

To study how the incompleteness of genomic databanks affects microbiome analyses, we created restricted (incomplete) databases in which some families, genera, or species were removed from the complete database (i.e. the one containing all the prokaryotes genetic sequences available to date). This allowed us to simulate different scenarios in which some of the species present in a microbiome have not been previously cataloged. In particular, we generated virtual microbiomes for which 25%, 50% and 100% of the species are represented in the incomplete databases.

Simulated amplicon sequencing and WGS reads were then generated starting from the virtual microbiomes using complete and incomplete databases and different numbers of reads (see Methods). Thus, for each virtual microbiome, we obtained several characterizations that could be compared with the original virtual microbiome composition and with one another (see Fig 1). First, we evaluated how the incompleteness of genomic databanks affects the accuracy of microbiome analyses using the following variables:

Fig 1. Rationale of the virtual microbiome framework.

Fig 1

See explanations in the text. (FP: false positives; TP: true positives; UI: unidentified; US: unsampled; O: overlap.).

  1. Unsampled species and genera: percentage of species and genera in the virtual microbiome whose abundance is below the threshold of detection imposed by the number of reads used in the analysis.

  2. Unidentified species and genera: percentage of the species and genera that were sampled but did not match any of the sequences contained in the database used in the analysis.

  3. False positives: percentage of the total of species and genera detected by the analysis that were not present in the original virtual microbiome.

  4. True positives: percentage of the species and genera that were detected by the analysis.

  5. Abundance of species and genera predicted by the metagenomic analysis.

Second, we studied the ability of microbiome analyses to detect changes in the abundance of species and genera in the virtual microbiomes. To that end, we compared virtual microbiomes containing the same set of species but differing in their relative abundance, a scenario that simulates the temporal changes in a microbiome caused, for instance, by alternative treatments on the host. Finally, we measured the overlap between 16S and WGS analyses (i.e. the species and genera simultaneously detected by both techniques) when performed on the same virtual microbiomes.

Effect of the incompleteness of databases in the characterization of virtual microbiomes by microbiome analyses

The first step of microbiome analyses consists in sampling the microbial populations of interest. The limited size of samples imposes obvious constraints to the subsequent genomic analyses of microbiomes: these analyses are performed only on the fraction of the microorganisms present in the microbiome that are actually sampled, which may hinder the identification of species with lower abundances [36].

To test the effect of sampling on the performance of microbiome analysis, we generated communities that differ in both species richness and α-diversity. Smaller species richness and greater α-diversities resulted in smaller fractions of unsampled species (Fig 2A left). In contrast, low-diversity communities are dominated by a few species or genera. Unevenness in the relative abundance implies that rare species are likely to be missed by the analysis. In these cases, increasing the number of reads only led to a moderate growth in the percentage of species detected (light grey lines, corresponding to this type of communities, do not fall below 80% in Fig 2A). This effect was more pronounced in more species-rich populations, in which greater sampling efforts did not necessarily result in a substantial reduction in the number of unsampled species, even for high values of α-diversity (Fig 2A right). The ecological structure of microbial communities (in terms of richness of species and α-diversity) imposes a major intrinsic limitation to the capacity of microbiome analyses, as shown by the loss of over 80% of the species in virtual microbiomes with low diversity (Fig 2B). In the subsequent evaluation of microbiome analyses, we used communities with low species richness (containing only 300 species) to minimize the fraction of unsampled species.

Fig 2. Evaluation of virtual microbiome characterization by microbiome analyses.

Fig 2

A-B) Effect of ecological parameters (richness of species and α-diversity) in the percentage of unsampled species in microbiome analyses (α-diversity was calculated using the Shannon index). C-D) Effect of the incompleteness of microbiome databases on the fraction of false positives (D) and true positives (E) at the species (left) and genus (right) level. E) Effect of the incompleteness of databases on the difference between detected and actual abundances of species (left) and genera (right). This difference is defined as D=1/Ni=1N(a˜i-ai)/ai, where N is the number of detected species or genera, and ai and a˜i are the actual and estimated abundances of species or genus i respectively. F-G) Ability of microbiome analysis to detect changes in the abundance of species (F) and genera (G) when incomplete (left) or complete (right) databases are used in the analysis. On the abscissa axis, the percentage of actual changes in abundance in the virtual microbiome detected by the analysis. On the ordinate, the fraction of the changes in abundance found by the analysis that does not correspond to actual changes in the virtual microbiome. (Open dots: 100% of the species or genera of the virtual microbiome are in the databases; Red dots: 50% of species or genera in DBs; Black dots: 25% of species or genera in DBs. Dashed lines: x = y.).

Unexpectedly, the incompleteness of genomic databanks led to a sharp increase in the detection of false positives (percentage of detected species or genera detected that are not present in the original virtual microbiome). Fig 2C shows that most of the analyses yielded high fractions of false positives (both at the species and genus levels) even when they were performed using the complete databank. This situation dramatically worsened for virtual microbiomes containing new species (corresponding to analyses with incomplete databases), in which false positives may represent more than half of the identified species and genera (Fig 2C). Remarkably, the incompleteness of the databanks affected the analyses of virtual microbiomes even when the 100% of their species were represented in the incomplete databases (the open dots do not lie in the diagonal in Fig 2C). The detection of false positives is determined by the presence in the reference databank of similar sequences to those sampled from the virtual microbiomes. Greater databanks may contain more sequences exhibiting similarities with virtual microbiome sequences, thus increasing the chances of misidentification. Conversely, if the number of potential candidates decreases, so can do the number of false positives.

We next analyzed the ability of genomic analysis to detect true positives (percentage of the species and genera of the virtual microbiome successfully identified in genomic analyses). This percentage was below 50% in all the virtual microbiomes at the species level (Fig 2D). As should be expected, the detection of true positives increased when larger fractions of the virtual microbiome species were included in the genomic databases used in the analysis. At the genus level, the detection of true positives was highly variable, ranging from around 10% to 100% in the studied virtual microbiomes.

Genomic analyses of virtual microbiomes provided very poor estimates of the abundance of species and genera. The differences between the actual abundance and that observed by the analyses spanned several orders of magnitude (Fig 2E). Remarkably, the results of this analysis were worse at the genus than at the species level, contrary to the characterization of unidentified, true positives, and false positives.

In view of their inability to accurately estimate the abundance of species and genera, it was not surprising that genomic analyses normally failed to detect changes in that abundance. We measured the success of the analyses to identify trends (i.e. increases or decreases in the abundance of species and genera), regardless of their magnitude. When incomplete databases were used in the analysis, most of the observed trends were spurious, i.e. they did not correspond to actual changes in species abundance in the virtual microbiomes (Fig 2F). Using complete databases greatly reduced the detection of false trends. In any case, less than half of the actual changes in abundance taking place in the virtual microbiomes were detected by genomic analyses. The detection of both actual and spurious trends was greater at the genus level (Fig 2F).

Overlap between 16S and WGS analysis of virtual microbiomes

To study the differences and similitudes between 16S and WGS analyses, we began by comparing their results when applied to the characterization of the same virtual microbiomes. As a general rule, WGS outperformed 16S in the identification of true positives but 16S analyses detected fewer false positives (see S1 Fig in S1 Appendix). Owing to the problems derived from the use of 16S primers discussed above, the fractions of unidentified species and genera were greater for amplicon than for WGS analyses (S1 Fig in S1 Appendix).

The overlap between 16S and WGS at the species level was below 50% in all the studied virtual microbiomes. In the analyses with less species present in the incomplete databases, this overlap did not reach 10% (Fig 3A). The coincidence between the results of 16S and WGS greatly increased with the amount of information available in the genomic databases. In contrast, using more reads in the analysis did not lead to better 16S-WGS fits (Fig 3B). Unexpectedly, most of the species simultaneously detected by 16S and WGS using incomplete databases were false positives (Fig 3C). Similar behaviors were observed in the analyses at the genus level (Fig 3D–3F).

Fig 3. Overlap between 16S and WGS analyses of virtual microbiomes.

Fig 3

A) Percentage of the species detected by either 16S or WGS that are simultaneously detected by both techniques. B) Effect of the number of reads on the overlap between 16S and WGS. C) Percentage of the species simultaneously detected by 16S and WGS that are not originally present in the virtual microbiome. D-F) Same as A-C for analyses at the genus level. Remark that all the open dots lie at the origin in F. (Open dots: 100% of the species or genera of the virtual microbiome are in the databases; Red dots: 50% of species or genera in DBs; Black dots: 25% of species or genera in DBs. Dashed lines: x = y.).

These results show that the simultaneous detection of a species or genus by 16S and WGS does not provide additional information about its presence in the virtual microbiome. The list of species and genera found in the intersection between both analyses does not characterize virtual microbiomes more accurately than those found separately by each analysis.

Interestingly, each technique proved capable of identifying species and genera that could not be found by the other (S2A Fig in S1 Appendix). In this regard, the number of true positives detected exclusively by WGS was normally greater than those obtained only by 16S (S2B Fig in S1 Appendix). Still, some species and genera identified by 16S were not detected in WGS analysis. The number of true positives detected only by WGS was greater in analyses with incomplete databases. However, WGS also found more false positives than 16S, an effect that was more pronounced when incomplete databases were used in the analysis (S2B Fig in S1 Appendix).

Altogether, our results point to a very limited ability of amplicon and WGS to accurately characterize virtual microbiomes, a conclusion that can be extrapolated to the study of real-world microbial communities. Since it is obviously impossible to know a priori if a given microbiome is underrepresented in the databanks used in its characterization, there is no way to judge the fraction of false positives found in the analysis, or how many of the taxa present in the microbiome are not identified or even sampled. This entails a high degree of uncertainty in the analysis of microbial communities, an aspect that should be explicitly taken into account in the interpretation of microbiome data. We address this issue in the next section.

Empirical test of the virtual microbiome predictions in the microbiome of Galleria mellonella

To test the predictions of the virtual microbiome models, the microbiome of Galleria mellonella larvae fed with a control diet or with polyethylene was investigated using amplicon and WGS. Sixty-eight 16S samples were produced from three tissues, namely Anterior Part (AP), Gut and Silk Glands (SG), control and experimental (11 samples per tissues, plus two more samples for the AP control) producing a total of 22,548,048 MiSeq paired-end 300 bp reads (S1 Data in S2 Appendix). About 50% of the total amount of reads could be classified at Phylum level, with Proteobacteria (52.16% of the reads), Firmicutes (22.97%), and Actinobacteria (15.82%) being the most abundant Phyla, on average (Fig 4A). On the other hand, about 42% of the reads could be classified at genus level with Pseudomonas (22.92%), Enterococcus (13.70%), Staphylococcus (6.75%), Serratia (6.73%), and Acinetobacter (4.84%) being the most abundant, on average (Fig 4B).

Fig 4. Overlap between 16S and WGS analyses of the microbiome of Galleria mellonella larvae.

Fig 4

A) Barplot showing the percentage of reads classified in the observed Phyla with 16S and WGS respectively in three tissues of G. mellonella (A: anterior part; G: gut; S: silk gland). B) Barplot showing the percentage of reads classified in the observed genera with 16S and WGS respectively in three tissues of G. mellonella (A: anterior part; G: gut; S: silk gland). C) Venn Diagrams showing the overlap between 16S and WGS. Upper: overlap of the identified phyla; Lower: overlap of the identified genera. D) Overlap of the genera showing significant differences in abundance between treated and control samples across the gut, anterior part, and salivary glands. E) Scatterplots showing the correlation between log2 fold changes of genera simultaneously detected by WGS and 16S in the anterior part (left), gut (medium), and salivary glands (left) from treated animals with respect to controls. Pearson correlation coefficient (R) and its significance (p) is reported for each plot. F) Coincidences and discrepancies between changes in abundance detected by 16S and WGS in the anterior part, gut, and silk glands of treated animals with respect to controls.

Thirteen samples were also processed with a WGS approach producing a total of 734,523,154 paired-end 150 bp reads (S2 Data in S2 Appendix). After removing the reads that mapped on the G. mellonella reference genome, only 221,217 (0.06%) pairs could be classified at Phylum level with Proteobacteria (22.84%), Arthropoda (17.05%), Ascomycota (12.63%), Basidiomycota (9.33%), and Firmicutes (8.19%) being the most abundant, on average (Fig 4A). The presence of Arthropoda suggests that some contamination from the host was still present. At the genus level, a total of 146,444 (0.03%) pairs could be classified, with the most abundant being Heterobasidion (Basidiomycota, 12.64%), Afipia (Proteobacteria, 12.38%), Treponema (Spirochaetes, 10.25%), Parastagonospora (Ascomycota, 10.20%), and Enterococcus (Firmicutes, 6.53%), on average (Fig 4B).

As a next step, we compared the results of both sequencing strategies, taking into account that the data was produced from the same samples. For this comparison, we only considered the taxa with a median abundance higher than 1%. At the phylum level, we observed that only 50% of the phyla detected by 16S could be also identified with the WGS strategy. In addition, the WGS approach detected 6 phyla that were not found by the 16S approach. Considering the total amount of 10 detected phyla, only 2 (20%) could be detected by both methods (Fig 4C). At the genus level, about 35% of the genera detected by 16S sequencing could also be identified by WGS. Considering the total amount of 496 detected genera, only 66 (13.3%) were simultaneously detected by both methods (Fig 4C).

We then performed differential abundance analysis using 16S and WGS data to detect genera whose abundance was affected by the diet. Each tissue was analyzed separately (S1 and S2 Data in S2 Appendix) and gut samples always contained the majority of differential genera. Considering all the tissues, 38 and 22 genera were found to show significant (FDR ≤0.05) differences in abundance with 16S and WGS respectively. Strikingly, none of these differential changes was simultaneously detected by both techniques (Fig 4D).

Even if there was no overlap among significant results, we decided to compare the fold changes detected for common genera between WGS and 16S in order to assess whether the magnitude of change was detected in the same way by the different methods (Fig 4E). Pearson Correlation of the log2 fold changes in the three tissues ranged from 0.13 to 0.27 between WGS and 16S. Focusing on the trend of the changes (i.e. increase or decrease) without considering their magnitude gave similar results (Fig 4F), with both techniques detecting opposite trends in about half of the cases. The discrepancy between 16S and WGS is even more pronounced for genera showing significant changes in abundance, since the genera found to be significant are not even the same in both cases. Overall, these results point to the fact that we observed a poor agreement in terms of changes in abundance between WGS and amplicon sequencing methods.

Discussion

In this work, we formulate a computational framework to evaluate the performance of metagenomic analyses based on the generation of virtual microbiomes that simulate real bacterial communities. Using this approach, we identified critical limitations in the capability of currently available technologies to characterize microbiomes. Similar results were obtained using two different tools for taxonomic profiling (S3 Fig in S1 Appendix), which suggests that these limitations may be general and not specific of particular tools. The constrains found within the virtual microbiome framework predicted a poor overlap between WGS and amplicon in the analysis of real-world microbiomes underrepresented in genomic databases. This prediction was confirmed by the discrepancies between both methods in the characterization of the bacteria found in G. mellonella larvae.

Some of the limitations of microbiome analysis result from intrinsic features of microbial populations (e.g. the loss of species due to sampling is greater in highly diverse communities). These limitations, imposed by the ecological structure of microbial populations, have been discussed elsewhere [36]. Other constraints arise from the lack of genomic information in the databanks used in metagenomic studies. It is trivial that 16S and WGS analyses cannot detect species that are not represented in genomic databases. Unexpected, however, is that the incompleteness of the available information also leads to an increase in the detection of false positives, which may severely distort the identification of the species present in a microbiome.

It seems natural to assume that applying 16S and WGS methods to characterize the same microbial community should increase the accuracy of the analysis. Intuitively, the detection of the same species by both techniques could be interpreted as the confirmation of its presence in the microbiome. However, our results suggest that this assumption is misleading, since the observed overlap between the results of 16S and WGS can consist mostly of false positives. This result agrees with the poor overlap between 16S and WGS observed in previous studies [33, 34, 50], which, accordingly, could be explained as emerging from the combined effect of the limitations derived from the ecological structure of the microbiome and the insufficiency of the genomic information used in the analyses.

As could be expected, better results can be obtained working at the genus levels, both in terms of microbiome characterization and overlap between 16S and WGS. However, the accuracy of the analyses is still highly dependent on the amount of information present in the databases. In this regard, it is worth noting that the amount of gathered genomic information has grown exponentially in the last decades and that the number of sequence records available at the NCBI databank ranges in the billions (Fig 5A). In contrast, the size of the databases used by metagenomic tools is much smaller owing to obvious computational constrains that limit the volume of information that can be effectively handled in the analyses. This implies that many species or genera present in a given sample will likely be missed by metagenomic analyses, even if those species and genera have been previously described in the literature. The subsequent bias in the analysis critically depends on the metagenomic tool chosen to perform the analysis (Fig 5B). Therefore, scenarios in which the 100% of the sample is in represented in the databanks can be rare, specially for less studied microbiomes such as those of insects.

Fig 5. Information available for metagenomic analyses.

Fig 5

A) The publication of new genomic information in the NCBI databank continues has grown exponentially (data obtained from https://www.ncbi.nlm.nih.gov/genbank/statistics/). B) Metagenomic tools greatly differ not only in the size of their reference databanks but also in the particular genera that are represented in their reference databases. The overlap between the databases used by GAIA and Kraken2 is small in both WGS and 16S (left). In the case of Kraken2, the datasets used in 16S and WGS analyses only coincide in a small fraction of genera (right).

All those limitations critically constrain the successful identification of the bacteria present in different communities. Importantly, they also hinder the detection of the changes in microbial communities induced by external or internal factors. Our results reveal that many of the trends identified at the species level can be spurious. Again, trend detection improves when working at the genus level, but a significant gap still exists between observed and real changes in abundance. This calls into question the approach of studies claiming to find bacteria that proliferate or decline after certain treatments [18, 20, 23, 2628]. We confirmed this point by studying the microbiome of plastic-fed and control G. mellonella larvae. Strikingly, the bacterial genera showing significant changes in abundance in hosts under a diet of plastic were totally different when observed by WGS or amplicon methods. This result questions the widespread experimental strategy of inducing changes in bacterial abundance to detect microorganisms with specific metabolic potentials.

Our results also confirm the utility of the virtual microbiome as a powerful theoretical framework to scrutinize the performance of different techniques of microbiome analysis. The shortcomings of genomic analysis identified within this framework can be addressed in several ways: 1) by working at higher taxonomic levels whenever this is possible. This solution may be insufficient though if further taxonomic precision is needed; 2) continue to accumulate information in genomic databases. This is a fundamental aspect, as the lack of data leads to worse results in terms of characterization and detection of changes in abundance; 3) one must be cautious when interpreting data from metagenomic studies. As has been shown, owing to the limitations inherent to these techniques, their results should not be taken as totally reliable or conclusive.

The ever-growing interest in microbiomes and their potential applications is very much dependent on the reliability, richness, and completeness of the databanks available for their accurate description. If microbiome data are to be useful in an effective and reproducible manner, the effort in the field must be channeled towards significantly increasing the amount of available genomic information and finding efficient ways to use this information. We believe that the virtual microbiome framework can be useful in the design of new approaches to metagenomic analysis.

Supporting information

S1 Appendix

(PDF)

S2 Appendix. List of supplementary data files.

S1 Data: Complete and incomplete databases used in the analyses of virtual microbiomes with GAIA. S2–S4 Data: Virtual microbiomes analyzed with GAIA. S5 Data: Raw data of the amplicon analysis of G. mellonella microbiome with GAIA. S6 Data: Raw data of the WGS analysis of G. mellonella microbiome with GAIA. S7 Data: Significant changes of abundance in G. mellonella microbiome detected by 16S analysis with GAIA. S8 Data: Significant changes of abundance in G. mellonella microbiome detected by WGS analysis with GAIA. S9 Data: Virtual microbiomes analyzed with Kraken2. S10 Data: Raw data of the analysis of G. mellonella gut microbiome with Kraken2.

(ZIP)

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

FB and CFA gratefully acknowledge support by the Roechling foundation. BS was partially supported by MINECO grant MTM2017-85020-P. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Shreiner A. B., Kao J. Y. & Young V. B. The gut microbiome in health and in disease. Current Opinion in Gastroenterology 31, 69–75 (2015). doi: 10.1097/MOG.0000000000000139 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Vandeputte D. et al. Temporal variability in quantitative human gut microbiome profiles and implications for clinical research. Nature communications 12, 1–13 (2021). doi: 10.1038/s41467-021-27098-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Sepich-Poore G. D. et al. The microbiome and human cancer. Science 371, eabc4552 (2021). doi: 10.1126/science.abc4552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Burki T. K. Gut microbiome and immunotherapy response. The Lancet Oncology 18, e717 (2017). doi: 10.1016/S1470-2045(17)30841-0 [DOI] [PubMed] [Google Scholar]
  • 5. Wei Z. et al. Initial soil microbiome composition and functioning predetermine future plant health. Science advances 5, eaaw0759 (2019). doi: 10.1126/sciadv.aaw0759 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. Antoniewicz M. R. A guide to deciphering microbial interactions and metabolic fluxes in microbiome communities. Current Opinion in Biotechnology 64, 230–237 (2020). doi: 10.1016/j.copbio.2020.07.001 [DOI] [PubMed] [Google Scholar]
  • 7. Hays S. G., Patrick W. G., Ziesack M., Oxman N. & Silver P. A. Better together: engineering and application of microbial symbioses. Current Opinion in Biotechnology 36, 40–49 (2015). doi: 10.1016/j.copbio.2015.08.008 [DOI] [PubMed] [Google Scholar]
  • 8. Harishankar M. K., Sasikala C. & Ramya M. Efficiency of the intestinal bacteria in the degradation of the toxic pesticide, chlorpyrifos. 3 Biotech 3, 137–142 (2012). doi: 10.1007/s13205-012-0078-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Sgobba E. & Wendisch V. F. Synthetic microbial consortia for small molecule production. Current Opinion in Biotechnology 62, 72–79 (2020). doi: 10.1016/j.copbio.2019.09.011 [DOI] [PubMed] [Google Scholar]
  • 10. Govindarajulu S. N. et al. Insect gut microbiome and its applications. Recent Advancements in Microbial Diversity 379–395 (2020). doi: 10.1016/B978-0-12-821265-3.00016-5 [DOI] [Google Scholar]
  • 11. Malacrinò A. Meta-omics tools in the world of insect-microorganism interactions. Biology 7, 50 (2018). doi: 10.3390/biology7040050 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Munoz-Benavent M., Pérez-Cobas A. E., García-Ferris C., Moya A. & Latorre A. Insects’ potential: understanding the functional role of their gut microbiome. Journal of Pharmaceutical and Biomedical Analysis 194, 113787 (2021). doi: 10.1016/j.jpba.2020.113787 [DOI] [PubMed] [Google Scholar]
  • 13. Hongoh Y. et al. Intra-and interspecific comparisons of bacterial diversity and community structure support coevolution of gut microbiota and termite host. Applied and environmental microbiology 71, 6590–6599 (2005). doi: 10.1128/AEM.71.11.6590-6599.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Brune A. & Dietrich C. The gut microbiota of termites: digesting the diversity in the light of ecology and evolution. Annual review of microbiology 69, 145–166 (2015). doi: 10.1146/annurev-micro-092412-155715 [DOI] [PubMed] [Google Scholar]
  • 15. Ni J. & Tokuda G. Lignocellulose-degrading enzymes from termites and their symbiotic microbiota. Biotechnology advances 31, 838–850 (2013). doi: 10.1016/j.biotechadv.2013.04.005 [DOI] [PubMed] [Google Scholar]
  • 16. Jeffries T. W. et al. Genome sequence of the lignocellulose-bioconverting and xylose-fermenting yeast Pichia stipitis. Nature biotechnology 25, 319–326 (2007). doi: 10.1038/nbt1290 [DOI] [PubMed] [Google Scholar]
  • 17. Chellappan M. & Ranjith M. Metagenomic approaches for insect symbionts. In Microbial Approaches for Insect Pest Management, 271–313 (Springer, 2021). [Google Scholar]
  • 18. Yang Y. et al. Biodegradation and mineralization of polystyrene by plastic-eating mealworms: Part 1. chemical and physical characterization and isotopic tests. Environmental science & technology 49, 12080–12086 (2015). doi: 10.1021/acs.est.5b02661 [DOI] [PubMed] [Google Scholar]
  • 19. Bombelli P., Howe C. J. & Bertocchini F. Polyethylene bio-degradation by caterpillars of the wax moth Galleria mellonella. Current Biology 27, R292–R293 (2017). doi: 10.1016/j.cub.2017.02.060 [DOI] [PubMed] [Google Scholar]
  • 20. Yang J., Yang Y., Wu W.-M., Zhao J. & Jiang L. Evidence of polyethylene biodegradation by bacterial strains from the guts of plastic-eating waxworms. Environmental science & technology 48, 13776–13784 (2014). doi: 10.1021/es504038a [DOI] [PubMed] [Google Scholar]
  • 21. Yang Y. et al. Biodegradation and mineralization of polystyrene by plastic-eating mealworms: part 2. role of gut microorganisms. Environmental science & technology 49, 12087–12093 (2015). doi: 10.1021/acs.est.5b02663 [DOI] [PubMed] [Google Scholar]
  • 22. Yang Y., Wang J. & Xia M. Biodegradation and mineralization of polystyrene by plastic-eating superworms Zophobas atratus. Science of the total environment 708, 135233 (2020). doi: 10.1016/j.scitotenv.2019.135233 [DOI] [PubMed] [Google Scholar]
  • 23. Lou Y. et al. Biodegradation of polyethylene and polystyrene by greater wax moth larvae (Galleria mellonella l.) and the effect of co-diet supplementation on the core gut microbiome. Environmental science & technology 54, 2821–2831 (2020). doi: 10.1021/acs.est.9b07044 [DOI] [PubMed] [Google Scholar]
  • 24. Cassone B. J., Grove H. C., Kurchaba N., Geronimo P. & LeMoine C. M. Fat on plastic: Metabolic consequences of an ldpe diet in the fat body of the greater wax moth larvae (Galleria mellonella). Journal of Hazardous Materials 425, 127862 (2022). doi: 10.1016/j.jhazmat.2021.127862 [DOI] [PubMed] [Google Scholar]
  • 25. Zhang J. et al. Biodegradation of polyethylene microplastic particles by the fungus Aspergillus flavus from the guts of wax moth Galleria mellonella. Science of the Total Environment 704, 135931 (2020). doi: 10.1016/j.scitotenv.2019.135931 [DOI] [PubMed] [Google Scholar]
  • 26. Brandon A. M. et al. Biodegradation of polyethylene and plastic mixtures in mealworms (larvae of Tenebrio molitor) and effects on the gut microbiome. Environmental science & technology 52, 6526–6533 (2018). doi: 10.1021/acs.est.8b02301 [DOI] [PubMed] [Google Scholar]
  • 27. Yang S.-S. et al. Impacts of physical-chemical property of polyethylene (pe) on depolymerization and biodegradation in insects yellow mealworms (Tenebrio molitor) and dark mealworms (Tenebrio obscurus) with high purity microplastics. Science of The Total Environment 154458 (2022). doi: 10.1016/j.scitotenv.2022.154458 [DOI] [PubMed] [Google Scholar]
  • 28. Cassone B. J., Grove H. C., Elebute O., Villanueva S. M. & LeMoine C. M. Role of the intestinal microbiome in low-density polyethylene degradation by caterpillar larvae of the greater wax moth, Galleria mellonella. Proceedings of the Royal Society B 287, 20200112 (2020). doi: 10.1098/rspb.2020.0112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29. Gupta V. K. et al. A predictive index for health status using species-level gut microbiome profiling. Nature communications 11, 1–16 (2020). doi: 10.1038/s41467-020-18476-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Khachatryan L. et al. Taxonomic classification and abundance estimation using 16s and wgs—a comparison using controlled reference samples. Forensic Science International: Genetics 46, 102257 (2020). doi: 10.1016/j.fsigen.2020.102257 [DOI] [PubMed] [Google Scholar]
  • 31. Dewhirst F. E. et al. Discordant 16s and 23s rRNA gene phylogenies for the genus Helicobacter: implications for phylogenetic inference and systematics. Journal of bacteriology 187, 6106–6118 (2005). doi: 10.1128/JB.187.17.6106-6118.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Ceuppens S., De Coninck D., Bottledoorn N., Van Nieuwerburgh F. & Uyttendaele M. Microbial community profiling of fresh basil and pitfalls in taxonomic assignment of enterobacterial pathogenic species based upon 16s rRNA amplicon sequencing. International journal of food microbiology 257, 148–156 (2017). doi: 10.1016/j.ijfoodmicro.2017.06.016 [DOI] [PubMed] [Google Scholar]
  • 33. Ranjan R., Rani A., Metwally A., McGee H. S. & Perkins D. L. Analysis of the microbiome: Advantages of whole genome shotgun versus 16S amplicon sequencing. Biochemical and Biophysical Research Communications 469, 967–977 (2016). doi: 10.1016/j.bbrc.2015.12.083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Lazarevic V. et al. Analysis of the salivary microbiome using culture-independent techniques. Journal of Clinical Bioinformatics 2, 4 (2012). doi: 10.1186/2043-9113-2-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. McIntyre A. B. et al. Comprehensive benchmarking and ensemble approaches for metagenomic classifiers. Genome biology 18, 1–19 (2017). doi: 10.1186/s13059-017-1299-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Grilli J. Macroecological laws describe variation and diversity in microbial communities. Nature Communications 11 (2020). doi: 10.1038/s41467-020-18529-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Zhang N. et al. Contribution of sample processing to gut microbiome analysis in the model lepidoptera, silkworm Bombyx mori. Computational and structural biotechnology journal 19, 4658–4668 (2021). doi: 10.1016/j.csbj.2021.08.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Kong H. G. et al. The Galleria mellonella hologenome supports microbiota-independent metabolism of long-chain hydrocarbon beeswax. Cell Reports 26, 2451–2464 (2019). doi: 10.1016/j.celrep.2019.02.018 [DOI] [PubMed] [Google Scholar]
  • 39. Ren L. et al. Biodegradation of polyethylene by Enterobacter sp. d1 from the guts of wax moth Galleria mellonella. International journal of environmental research and public health 16, 1941 (2019). doi: 10.3390/ijerph16111941 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Peydaei A., Bagheri H., Gurevich L., de Jonge N. & Nielsen J. L. Impact of polyethylene on salivary glands proteome in Galleria melonella. Comparative Biochemistry and Physiology Part D: Genomics and Proteomics 34, 100678 (2020). [DOI] [PubMed] [Google Scholar]
  • 41. LeMoine C. M., Grove H. C., Smith C. M. & Cassone B. J. A very hungry caterpillar: polyethylene metabolism and lipid homeostasis in larvae of the greater wax moth (Galleria mellonella). Environmental Science & Technology 54, 14706–14715 (2020). doi: 10.1021/acs.est.0c04386 [DOI] [PubMed] [Google Scholar]
  • 42. Paniagua Voirol L. R., Frago E., Kaltenpoth M., Hilker M. & Fatouros N. E. Bacterial symbionts in lepidoptera: their diversity, transmission, and impact on the host. Frontiers in microbiology 9, 556 (2018). doi: 10.3389/fmicb.2018.00556 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Mereghetti V., Chouaia B. & Montagna M. New insights into the microbiota of moth pests. International Journal of Molecular Sciences 18, 2450 (2017). doi: 10.3390/ijms18112450 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Hammer T. J., Sanders J. G. & Fierer N. Not all animals need a microbiome. FEMS microbiology letters 366, fnz117 (2019). doi: 10.1093/femsle/fnz117 [DOI] [PubMed] [Google Scholar]
  • 45. Angly F. E., Willner D., Rohwer F., Hugenholtz P. & Tyson G. W. Grinder: a versatile amplicon and shotgun sequence simulator. Nucleic Acids Research 40, e94–e94 (2012). doi: 10.1093/nar/gks251 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46. Paytuví A., Battista E., Scippacercola F., Cigliano R. A. & Sanseverino W. GAIA: an integrated metagenomics suite. bioRxiv 804690 (2019). [Google Scholar]
  • 47. Korbel J. O. et al. Pemer: a computational framework with simulation-based error models for inferring genomic structural variants from massive paired-end sequencing data. Genome biology 10, 1–14 (2009). doi: 10.1186/gb-2009-10-2-r23 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Klindworth A. et al. Evaluation of general 16s ribosomal RNA gene pcr primers for classical and next-generation sequencing-based diversity studies. Nucleic acids research 41, e1–e1 (2013). doi: 10.1093/nar/gks808 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49. Wood D.E., Lu J. & Langmead B. Improved metagenomic analysis with Kraken 2. Genome biology 20, 257 (2019). doi: 10.1186/s13059-019-1891-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Hall J. B. et al. Isolation and identification of the follicular microbiome: Implications for acne research. Journal of Investigative Dermatology 138, 2033–2040 (2018). doi: 10.1016/j.jid.2018.02.038 [DOI] [PubMed] [Google Scholar]

Decision Letter 0

Patrizia Falabella

Transfer Alert

This paper was transferred from another journal. As a result, its full editorial history (including decision letters, peer reviews and author responses) may not be present.

8 Oct 2022

PONE-D-22-19792The virtualome: a computational framework to evaluate microbiome analysesPLOS ONE

Dear Dr. Arias,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by 20/11/2022. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Patrizia Falabella

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and 

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

2. Please update your submission to use the PLOS LaTeX template. The template and more information on our requirements for LaTeX submissions can be found at http://journals.plos.org/plosone/s/latex

3. We note that you have included the phrase “data not shown” in your manuscript. Unfortunately, this does not meet our data sharing requirements. PLOS does not permit references to inaccessible data. We require that authors provide all relevant data within the paper, Supporting Information files, or in an acceptable, public repository. Please add a citation to support this phrase or upload the data that corresponds with these findings to a stable repository (such as Figshare or Dryad) and provide and URLs, DOIs, or accession numbers that may be used to access these data. Or, if the data are not a core part of the research being presented in your study, we ask that you remove the phrase that refers to these data.

4. Please amend either the abstract on the online submission form (via Edit Submission) or the abstract in the manuscript so that they are identical

5. Thank you for stating the following in the Acknowledgments Section of your manuscript: 

"FB and CFA gratefully acknowledge support by the Roechling foundation. BS was partially supported by MINECO grant MTM2017-85020-P."

We note that you have provided funding information that is not currently declared in your Funding Statement. However, funding information should not appear in the Acknowledgments section or other areas of your manuscript. We will only publish funding information present in the Funding Statement section of the online submission form. 

Please remove any funding-related text from the manuscript and let us know how you would like to update your Funding Statement. Currently, your Funding Statement reads as follows: 

"FB and CFA gratefully acknowledge support by the Roechling foundation.

BS was partially supported by MINECO grant MTM2017-85020-P."

Please include your amended statements within your cover letter; we will change the online submission form on your behalf.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Partly

Reviewer #2: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In this work, the authors use a statistical framework to generate virtual microbiomes that serve as benchmarks to compare the performance of amplicon and WGS-based taxonomic profiles. They find that the performance of taxonomic profiling is shckingly poor, especially if the reference datbases are incomplete, but also with complete databases. Moreover, the overlap observed between amplicon and WGS-based profiling is often due to false positives.

The implications of this study, if true, are devastating. What I find most concerning is the poor performance of taxonomic profiling even if the reference database contains all the genomes from the microbiome. Given the strong implications of this result, I find it essential to verify that such poor performance is not just a limitation of the particular tool (GAIA) and parameters used for taxonomic profiling. To that purpose, it would possibly be enough to study what happens when using other popular tools for taxonomic profiling and the complete database (which is the best-case scenario). I understand that the authors may choose not to go that longer way, but then the article should explicitly state that its conclussions are valid for GAIA, but maybe not for other taxonomic profiling algorithms.

From a technical perspective, the statistical properties and algorithms used to generate the virtual microbiome should be described in much greater detail. Given that generation of virtual microbiomes is central to (and possibly the main novelty of) the study, these details could even be presented in the Results section. If the authors prefer to focus the Results section on the comparison of taxonomic profiles, full details should still be provided in the Methods section.

I am not fully aware of Plos ONE policies about code availability; however, following the FAIR principles for open and reproducible research, it would be good to share the code to generate virtual microbiomes in a public repository, such as github or zenodo. Personally, I think that the generation of virtual microbiomes that are compatible with the abundance distributions found in real-world microbiomes is the main scientific breakthrough of this paper and not making the method available substantially diminishes its interest. I appreciate that pseudo-code for the algorithm is included in the Methods, but it needs more detail (what does function "generateAbundances" do? what is "params"?).

Related to the previous issues:

- Discussion of Figures 2C-D require further clarification. Why are there unidentified species when using the complete databases? Do the figures include unsampled species? Are some species being classified as "unidentified" because their assignations are ambiguous? How is it possible that the incomplete database sometimes prudices fewer unidentified genera than the complete database? How is it possible that the complete database produces false positives (perhaps due to an inappropriate management of ambiguous hits)?

- The fact that complete databases often lead to a large fraction of unidentified species and false positives suggests that the bioinformatic workflows used to simulate reads and assign taxonomy are intrinsically inaccurate or not properly optimized for the analysis of virtual microbiomes. Has this poor behavior been previously described for real or mock microbiomes (e.g., in the origial articles of Grinder or GAIA)? If not, how do you explain such poor behavior with virtual microbiomes? The fact that taxonomic profiles are fairly poor even when using the complete databse makes me wonder whether the results with incomplete databases would look better if a software different than GAIA had been used. Do you have any evidence that the findings reported here are qualitatively general and also valid for other taxonomy-assignment workflows?

Minor issues and suggestions:

- The portmanteau "virtualome" seems unncessary and not very intuitive. "Virtual microbiome", though longer, would be more informative and less cumbersome.

- In Section 2.2, first paragraph, sentences following cite [36]: These sentences are somewhat confusing and counterintuitive. The problem is that they refer to two measures of diversity that need to be clearly disambiguated. Species richness (discussed in terms of "small" or "large" communities) and alpha-diversity (discussed just as "diversity") are both measures of diversity, but they affect the percentage of unsampled species in opposite ways: the smaller the species richness, the smaller the unsampled fraction, but the smaller the alpha-diversity, the greater the unsampled fraction. I think that the results would be much more intuitive if ambiguous references to diversity were replaced by more informative terms. Specifically the concept of species richness is broadly known and I see no reason to not use it (instead of talking about small/large communities). As for alpha-diversity, trends would be easier to understand if it was discussed in terms of species (un)evenness, because it is quite intuitive that higher unevenness implies more rare species that may be missed by the analysis.

- Figure 2A: add quantitative legend (color bar) for the shades of gray that correspond to alpha-diversity.

- Page 3: "the simultaneous detection of a species by both techniques does not ensure its positive identification, since the overlap between amplicon and WGS can be dominated by false positives". This requires a citation. If it is a new finding of this study, it should be specified.

Reviewer #2: The manuscript of Serrano-Antón and collaborators entitled "The virtualome: a computational framework to evaluate microbiome analyses" reports the effect of database completeness on the metagenomics analysis outcomes, underlying the critical issues of the current bioinformatics approaches. The manuscript is well-presented, technically precise, and scientifically sound, with the experimental methodology used appropriated for the intended objectives. The presented framework remarked the discrepancies between the Target and the Whole Genome Sequencing approaches both in simulated and real data, taking into account the ecological structure of microbial communities. This study highlights the actual limitations as well as the known advantages of metagenomics investigations, underlying the need for an increased amount of genetics/genomics information in public databases to raise the overall accuracy of real-world data.

I have a few questions for the authors:

Page 5: Authors discussed the "ecological structure of microbial communities imposes a major intrinsic limitation to the capacity of microbiome analyses", but this part is not clear to me. Since the composition of the microbial community can widely vary in real-world data, often without a priori knowledge of the microbial composition under study, the small virtualomes approach tends to reduce the loss of species for technical purposes. In my opinion, the authors should highlight the different findings obtained by the use of the different alpha-diversity databases, as in the discussion the presented differences are not detailed as needed.

Page 6: Regarding the sentence "Remarkably, the incompleteness of the databanks affected the analyses of virtualomes even when the 100% of their species were represented in the incomplete databases (the open dots do not lie in the diagonal in Fig. 2D)." how did the authors interpret these findings? The authors should add more details on their evaluation of the reported findings.

Page 7: In the sentence "the number of true positives detected exclusively by WGS was normally greater than those obtained only by 16S" the authors highlight the difference between 16S and WGS results. Did the authors find any statistically significant difference between the number of real positives found by the use of Target compared to the use of WGS? What about the false positive?

Page 9: "The discrepancy between 16S and WGS is pronounced for genera", so even by the use of Pearson Correlation, readers cannot properly get which method could be the best in the study of an unknown microbial community. In my opinion, authors should highlight the best and the worst of each technique, even by adding a summary table reporting the main findings in both real and simulated data, to help readers in understanding of the presented results.

Page 13: is not clear to me if all reads that did not map to the G. mellonella reference genome were successfully classified by the use of the GAIA software, or if it was necessary to assemble and classify any Metagenome Assembled Genomes (MAGs). Can the authors clarify this point?

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Feb 8;18(2):e0280391. doi: 10.1371/journal.pone.0280391.r002

Author response to Decision Letter 0


7 Dec 2022

Please find below a point-by-point response (in blue) to the questions posed by the Reviewers (in black).

Reviewer #1: In this work, the authors use a statistical framework to generate virtual microbiomes that serve as benchmarks to compare the performance of amplicon and WGS-based taxonomic profiles. They find that the performance of taxonomic profiling is shckingly poor, especially if the reference datbases are incomplete, but also with complete databases. Moreover, the overlap observed between amplicon and WGS-based profiling is often due to false positives.

The implications of this study, if true, are devastating. What I find most concerning is the poor performance of taxonomic profiling even if the reference database contains all the genomes from the microbiome. Given the strong implications of this result, I find it essential to verify that such poor performance is not just a limitation of the particular tool (GAIA) and parameters used for taxonomic profiling. To that purpose, it would possibly be enough to study what happens when using other popular tools for taxonomic profiling and the complete database (which is the best-case scenario). I understand that the authors may choose not to go that longer way, but then the article should explicitly state that its conclussions are valid for GAIA, but maybe not for other taxonomic profiling algorithms.


We have repeated the analysis of the best-case scenario (100% of the species and genera in the database) using Kraken2, which is one of the most used algorithms in the field and one of the few that can process both amplicon and WGS-based metagenomics data as GAIA. To do that, we have generated new virtualomes, analogous to those considered in the manuscript, using species and genera taken from the databases used by Kraken2 (we will return to this point below). The performance of Kraken2 in terms of false positives and poor overlap between 16S and WGS analyses is similar to that of GAIA, which suggests that the limitations identified in our work are not specific of GAIA (these new results are shown in a new supplementary figure, Supp. Fig. 1.A-C). In particular, it is remarkable that an important fraction of the genera simultaneously found in 16S and WGS analyses using Kraken2 are actually false positives.

We have also analyzed the biological samples taken from Galleria with Kraken2. In this case, the overlap between 16S and WGS analyses is around 10% at the genus level, which is also similar to that found with GAIA. We have included this new result in Supp. Fig. 1.D). We have changed the Methods section to add the details of the Kraken analyses.

It is worth noting that many of the species and genera of the original virtualomes are not present in the databanks used by Kraken2 (this is why we created new ad hoc virtualomes for the analysis with Kraken2). This would obviously constrain the ability of Kraken2 to characterize the original virtual communities. In fact, it is difficult to find enough species and genera simultaneously present in databases used by GAIA and Kraken to simulate the 100% scenario with both tools at the same time. This remark led us to take a closer look at the information that is actually used in metagenomics analyses. To our surprise, the databanks used by GAIA and Kraken2 are very different from each other, i.e. the number of species and genera in the intersection between both databanks is small. This implies that it is likely that some species or genera will not be present in the databases used in metagenomic analysis, even if those species and genera have been previously described. The choice of a metagenomic tool entails the loss of species and genera from the reference databank. In fact, the number of sequences contained in the NCBI database is in the range of billions. No metagenomic tool can manage this volume of information to date, which means that the databases used in the analyses only contain a small fraction of the genomes that have been described to date. We have included a new figure in the text and a brief discussion on this point (Figure 5):

"In this regard, it is worth noting that the amount of gathered genomic information has grown exponentially in the last decades and that the number of sequence records available at the NCBI databank ranges in the billions (Fig. 5.A). In contrast, the size of the databases used by metagenomic tools is much smaller owing to obvious computational constrains that limit the volume of information that can be effectively handled in the analyses. This implies that many species or genera present in a given sample will likely be missed by metagenomic analyses, even if those species and genera have been previously described in the literature. The subsequent bias in the analysis critically depends on the metagenomic tool chosen to perform the analysis (Fig. 5.B). Therefore, scenarios in which the 100% of the sample is in represented in the databanks can be rare, specially for less studied microbiomes such as those of insects." (lines 485-496)

We have also added a phrase about this in the Discussion:

"If microbiome data are to be useful in an effective and reproducible manner, the effort in the field must be channeled towards significantly increasing the amount of available genomic information and finding efficient ways to use this information. We believe that the virtualome framework can be useful in the design of new approaches to metagenomic analysis." (pages 525-528)


From a technical perspective, the statistical properties and algorithms used to generate the virtual microbiome should be described in much greater detail. Given that generation of virtual microbiomes is central to (and possibly the main novelty of) the study, these details could even be presented in the Results section. If the authors prefer to focus the Results section on the comparison of taxonomic profiles, full details should still be provided in the Methods section.


We have described the algorithms in much more detail in the Method section and in (link). We hope the new presentation will be satisfactory.


I am not fully aware of Plos ONE policies about code availability; however, following the FAIR principles for open and reproducible research, it would be good to share the code to generate virtual microbiomes in a public repository, such as github or zenodo. Personally, I think that the generation of virtual microbiomes that are compatible with the abundance distributions found in real-world microbiomes is the main scientific breakthrough of this paper and not making the method available substantially diminishes its interest. I appreciate that pseudo-code for the algorithm is included in the Methods, but it needs more detail (what does function "generateAbundances" do? what is "params"?).


We agree, see the response to the previous point.


Related to the previous issues:

- Discussion of Figures 2C-D require further clarification. Why are there unidentified species when using the complete databases? Do the figures include unsampled species? Are some species being classified as "unidentified" because their assignations are ambiguous? How is it possible that the incomplete database sometimes prudices fewer unidentified genera than the complete database? How is it possible that the complete database produces false positives (perhaps due to an inappropriate management of ambiguous hits)?


We thank the reviewer for this remark. In fact, the calculation of unidentified species was not accurate. This was an indirect estimation using the expected abundances, and the procedure to do that was not correct. Unfortunately, we did not save the information necessary to measure this variable accurately, so we have decided to remove this result from the text (originally shown in Figure 2C). In any case, this does not affect the rest of the results and the conclusions of our work.


- The fact that complete databases often lead to a large fraction of unidentified species and false positives suggests that the bioinformatic workflows used to simulate reads and assign taxonomy are intrinsically inaccurate or not properly optimized for the analysis of virtual microbiomes. Has this poor behavior been previously described for real or mock microbiomes (e.g., in the origial articles of Grinder or GAIA)? If not, how do you explain such poor behavior with virtual microbiomes? The fact that taxonomic profiles are fairly poor even when using the complete databse makes me wonder whether the results with incomplete databases would look better if a software different than GAIA had been used. Do you have any evidence that the findings reported here are qualitatively general and also valid for other taxonomy-assignment workflows?


As explained above, we have obtained similar results using Kraken2. In the manuscript, we cite some studies that show that analyzing the same samples using different techniques yield contradictory results (poor overlaps between alternative analysis of the same samples), which suggests that doubts about the accuracy of metagenomic analysis have already been raised in the field (see references 33-35 in the manuscript).


Minor issues and suggestions:

- The portmanteau "virtualome" seems unnecessary and not very intuitive. "Virtual microbiome", though longer, would be more informative and less cumbersome.

We do not agree with the Reviewer in this point. In our opinion, the term "virtualome" is intuitive and less cumbersome than "virtual microbiome".


- In Section 2.2, first paragraph, sentences following cite [36]: These sentences are somewhat confusing and counterintuitive. The problem is that they refer to two measures of diversity that need to be clearly disambiguated. Species richness (discussed in terms of "small" or "large" communities) and alpha-diversity (discussed just as "diversity") are both measures of diversity, but they affect the percentage of unsampled species in opposite ways: the smaller the species richness, the smaller the unsampled fraction, but the smaller the alpha-diversity, the greater the unsampled fraction. I think that the results would be much more intuitive if ambiguous references to diversity were replaced by more informative terms. Specifically the concept of species richness is broadly known and I see no reason to not use it (instead of talking about small/large communities). As for alpha-diversity, trends would be easier to understand if it was discussed in terms of species (un)evenness, because it is quite intuitive that higher unevenness implies more rare species that may be missed by the analysis.


We agree with the Reviewer that this paragraph was confusing. We have reformulated it to:

"To test the effect of sampling on the performance of microbiome analysis, we generated communities that differ in both species richness and α-diversity. Smaller species richness and greater α-diversities resulted in smaller fractions of unsampled species (Fig. 2A left). In contrast, low-diversity communities are dominated by a few species or genera. These unevenness in the relative abundance implies that rare species are likely to be missed by the analysis. In these cases, increasing the number of reads only led to a moderate growth in the percentage of species detected (light grey lines, corresponding to this type of communities, do not fall below 80% in 2A). This effect was more pronounced in more species-rich populations, in which greater sampling efforts did not necessarily result in a substantial reduction in the number of unsampled species, even for high values of α-diversity (Fig. 2A right). The ecological structure of microbial communities (in terms of richness of species and α-diversity) imposes a major intrinsic limitation to the capacity of microbiome analyses, as shown by the loss of over 80% of the species in virtualomes with low diversity (Fig.2B). In the subsequent evaluation of microbiome analyses, we used communities with low species richness (containing only 300 species) to minimize the fraction of unsampled species". (lines 309-324)


- Figure 2A: add quantitative legend (color bar) for the shades of gray that correspond to alpha-diversity.


Done.


- Page 3: "the simultaneous detection of a species by both techniques does not ensure its positive identification, since the overlap between amplicon and WGS can be dominated by false positives". This requires a citation. If it is a new finding of this study, it should be specified.

We have reformulated this to make it explicit that this is a result of this study:

"Using this approach, we have identified the incompleteness of databases as a key constraint to the accuracy of microbiome analyses. Our results suggest that the poor overlap between amplicon and WGS already encountered with human samples does not seem to be an exception. Furthermore, we show in this work that the simultaneous detection of a species by both techniques does not ensure its positive identification, since the overlap between amplicon and WGS can be dominated by false positives, i.e. by species that are not present in the original samples". (lines 97-102)

\fReviewer #2:

 The manuscript of Serrano-Antón and collaborators entitled "The virtualome: a computational framework to evaluate microbiome analyses" reports the effect of database completeness on the metagenomics analysis outcomes, underlying the critical issues of the current bioinformatics approaches. The manuscript is well-presented, technically precise, and scientifically sound, with the experimental methodology used appropriated for the intended objectives. The presented framework remarked the discrepancies between the Target and the Whole Genome Sequencing approaches both in simulated and real data, taking into account the ecological structure of microbial communities. This study highlights the actual limitations as well as the known advantages of metagenomics investigations, underlying the need for an increased amount of genetics/genomics information in public databases to raise the overall accuracy of real-world data.


We thank the Reviewer for his/her positive comments.


I have a few questions for the authors:

Page 5: Authors discussed the "ecological structure of microbial communities imposes a major intrinsic limitation to the capacity of microbiome analyses", but this part is not clear to me. Since the composition of the microbial community can widely vary in real-world data, often without a priori knowledge of the microbial composition under study, the small virtualomes approach tends to reduce the loss of species for technical purposes. In my opinion, the authors should highlight the different findings obtained by the use of the different alpha-diversity databases, as in the discussion the presented differences are not detailed as needed.


That the ecological structure of microbial communities can affect metagenomic analyses is not a new result (see for instance reference 36 in the text). In our analyses, the choice of virtualomes with low species richness is not motivated by technical reasons but by the relative lower fraction of unsampled species and genera in this type of communities (as shown in Fig. 2A-B). This choice is intended as defining a lower boundary to the potential inaccuracies of the subsequent metagenomic analyses of the virtualomes. For instance, since the fraction of unsampled species and genera will be greater in more species-rich communities, the fraction of true positives will be smaller.

We do not understand the question of the alpha-diversity of the databases. This ecological feature depends on the relative abundance of species and genera in a community. In the databases, each species and genus can only be present or absent, so there is no space for relative abundance. The databases used in this work were created to contain different fractions of species and genera present in the virtualomes, and the effect of this factor is discussed in depth in the section entitled "Effect of the incompleteness of databases in the characterization of virtualomes by microbiome analyses",


Page 6: Regarding the sentence "Remarkably, the incompleteness of the databanks affected the analyses of virtualomes even when the 100% of their species were represented in the incomplete databases (the open dots do not lie in the diagonal in Fig. 2D)." how did the authors interpret these findings? The authors should add more details on their evaluation of the reported findings.

We have included a more detailed interpretation of this result in the text:

"The detection of false positives is determined by the presence in the reference databank of similar sequences to those sampled from the virtualomes. Greater databanks may contain more sequences exhibiting similarities with virtualome sequences, thus increasing the chances of misidentification. Conversely, if the number of potential candidates decreases, so can do the number of false positives." (lines 335-339)


Page 7: In the sentence "the number of true positives detected exclusively by WGS was normally greater than those obtained only by 16S" the authors highlight the difference between 16S and WGS results. Did the authors find any statistically significant difference between the number of real positives found by the use of Target compared to the use of WGS? What about the false positive?

We did not perform any statistical analysis to compare the results of alternative virtualome analysis. The differences are sufficiently clear and the use of statistics would not add any valuable information. On the other hand, we did not intend to compare the efficiency of 16S vs WGS approaches, only to highlight potential limitations of metagenomic analyses in general.


Page 9: "The discrepancy between 16S and WGS is pronounced for genera", so even by the use of Pearson Correlation, readers cannot properly get which method could be the best in the study of an unknown microbial community. In my opinion, authors should highlight the best and the worst of each technique, even by adding a summary table reporting the main findings in both real and simulated data, to help readers in understanding of the presented results.


As noted in the previous point, we did not intend to compare different metagenomic techniques or tools. In fact, there is probably no single criterium to decide what "better" means in this context. Should we try to reduce the number of false positives? Increase the fraction of true positives? Reduce the cost of the analysis? In fact, we believe that the virtualome provides a useful framework in which these and other interesting issues could be addressed but those specific analyses are beyond the scope of this work. The optimal performance of metagenomic analyses is taken for granted in the field. The virtualomes show that these analyses should be interpreted with care.


Page 13: is not clear to me if all reads that did not map to the G. mellonella reference genome were successfully classified by the use of the GAIA software, or if it was necessary to assemble and classify any Metagenome Assembled Genomes (MAGs). Can the authors clarify this point?

Only a small fraction of the reads that did not map to the G. mellonella reference genome could be classified. As indicated in the text (line 417), on average only 0.06% of the filtered reads could be classified with GAIA. We obtained similar results with Kraken2 (results not showed). We attempted to generated MAGs from the reads however we did not obtain high quality genomes and most of the contigs remained unclassified as well.

Attachment

Submitted filename: Response to reviewers PDF.pdf

Decision Letter 1

Patrizia Falabella

22 Dec 2022

PONE-D-22-19792R1The virtualome: a computational framework to evaluate microbiome analysesPLOS ONE

Dear Dr. Arias,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

Please submit your revised manuscript by Feb 05 2023 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: https://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

Patrizia Falabella

Academic Editor

PLOS ONE

Journal Requirements:

Please review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript. If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: (No Response)

Reviewer #2: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: No

Reviewer #2: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: No

Reviewer #2: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: In the revised version of the manuscript, the authors made a remarkable effort to show the generality of their results by reproducing some parts of the analysis with a different taxonomic profiling tool (Kraken). Some obscure claims regarding unidentified taxa have been removed and the origin of false positives is now clearly explained. Overall, these changes notably improved the quality of the manuscript and the robustness of its conclusions.

I appreciate the authors' effort to better describe the algorithm that produces virtual microbiomes. I still do not understand why they did not release a public, functional implementation of their code, although I leave it to the Editor to decide whether that is a requisite for publication. Anyway, the pseudocode included in the methods section is still incomplete: there are 4 subroutines in EcologyGeneration that are not proprely described. Furthermore, I could not find the parameter values (mu, sigma, shape, scale) used to generate the abundance matrices.

I understand that, from the authors' perspective, it is hard to let go of a fancy word like "virtualome", which is certainly catchy. However, I must insist that it is nonsense. There are already many "omes" in the recent biological literature, perhaps too many, but at least most of them are self-explanatory and semantically coherent. Genomes, proteomes, transcriptomes, interactomes, and microbiomes (to name a few) clearly refer to sets of genes, proteins, transcripts, interactions, and microbes, respectively. Accordingly, virtualomes would be sets of some kind of virtual entities. What does that mean? I could think of something like a set of virtual environments in a computing context, although that would be quite a stretch. However, the authors use the term to refer to a virtual microbiome (I guess that the rationale is to combine VIRTUAL with microbiOME). From a semantic perspective, that is nonsense. Following that rule, it could also mean virtual proteome, virtual interactome, virtual transcriptome, virtual metabolome... Or none of that. It is perhaps a funny word to describe the tool in an informal setting, among colleagues or perhaps as a joke in a conference if you want to get the attention of the audience. But I hardly see how it would be an acceptable (that is, unambiguous and semantically coherent) scientific term.

Some typos:

239: "does to distinguish" -> "does NOT distinguish"

325: "These unevenness in the relative abundance implies". Remove "These"

Reviewer #2: In this last version, the authors addressed all the concerns raised by the reviewers, extending certain sections and clarifying important aspects of the actual limitations of the metagenomics analysis. In particular, the new analysis performed using the Kraken2 well supported the results and conclusions of the manuscripts.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

**********

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2023 Feb 8;18(2):e0280391. doi: 10.1371/journal.pone.0280391.r004

Author response to Decision Letter 1


27 Dec 2022

Please find below a point-by-point response (in blue) to the questions posed by the Reviewers (in black).

Reviewer #1: In the revised version of the manuscript, the authors made a remarkable effort to show the generality of their results by reproducing some parts of the analysis with a different taxonomic profiling tool (Kraken). Some obscure claims regarding unidentified taxa have been removed and the origin of false positives is now clearly explained. Overall, these changes notably improved the quality of the manuscript and the robustness of its conclusions.


We thank the Reviewer for his/her positive comments and for his/her contribution to the improvement of this article.


I appreciate the authors' effort to better describe the algorithm that produces virtual microbiomes. I still do not understand why they did not release a public, functional implementation of their code, although I leave it to the Editor to decide whether that is a requisite for publication. Anyway, the pseudocode included in the methods section is still incomplete: there are 4 subroutines in EcologyGeneration that are not proprely described. Furthermore, I could not find the parameter values (mu, sigma, shape, scale) used to generate the abundance matrices.

We forgot to add the link to the public repository where the code is available (https://github.com/bserranoanton/Virtual-microbiome). I apologize for this error. We have included it in the revised version of the text. In this site it is also possible to access the values of the parameters used to generate each virtual microbiome.

"The full implementation of the algorithm can be found at the following link https://github.com/bserranoanton/Virtual-microbiome. In addition, information on all the parameters used for the generation of the abundance matrices of the virtual microbiomes can also be accessed." (lines 139-142).


I understand that, from the authors' perspective, it is hard to let go of a fancy word like "virtualome", which is certainly catchy. However, I must insist that it is nonsense. There are already many "omes" in the recent biological literature, perhaps too many, but at least most of them are self-explanatory and semantically coherent. Genomes, proteomes, transcriptomes, interactomes, and microbiomes (to name a few) clearly refer to sets of genes, proteins, transcripts, interactions, and microbes, respectively. Accordingly, virtualomes would be sets of some kind of virtual entities. What does that mean? I could think of something like a set of virtual environments in a computing context, although that would be quite a stretch. However, the authors use the term to refer to a virtual microbiome (I guess that the rationale is to combine VIRTUAL with microbiOME). From a semantic perspective, that is nonsense. Following that rule, it could also mean virtual proteome, virtual interactome, virtual transcriptome, virtual metabolome... Or none of that. It is perhaps a funny word to describe the tool in an informal setting, among colleagues or perhaps as a joke in a conference if you want to get the attention of the audience. But I hardly see how it would be an acceptable (that is, unambiguous and semantically coherent) scientific term.

The Reviewer has provided solid arguments against the use of the word "virtualome". We have changed it accordingly throughout the text, including the title, and in the corresponding figures.

Some typos:
239: "does to distinguish" -> "does NOT distinguish"
325: "These unevenness in the relative abundance implies". Remove "These"

Corrected.

Reviewer #2: In this last version, the authors addressed all the concerns raised by the reviewers, extending certain sections and clarifying important aspects of the actual limitations of the metagenomics analysis. In particular, the new analysis performed using the Kraken2 well supported the results and conclusions of the manuscripts.

We thank the Reviewer for his/her positive evaluation of this work.

Attachment

Submitted filename: Response to reviewers review 2.pdf

Decision Letter 2

Patrizia Falabella

28 Dec 2022

The virtual microbiome: a computational framework to evaluate microbiome analyses

PONE-D-22-19792R2

Dear Dr. Arias,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

Patrizia Falabella

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: All my comments have been addressed. Therefore, I support publication of this work in Plos One. I would want to congratulate the authors for this fine piece of research, which will help guide future taxonomic profiling works based on metagenomics.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

**********

Acceptance letter

Patrizia Falabella

4 Jan 2023

PONE-D-22-19792R2

The virtual microbiome: a computational framework to evaluate microbiome analyses

Dear Dr. Arias:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Prof. Patrizia Falabella

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Appendix

    (PDF)

    S2 Appendix. List of supplementary data files.

    S1 Data: Complete and incomplete databases used in the analyses of virtual microbiomes with GAIA. S2–S4 Data: Virtual microbiomes analyzed with GAIA. S5 Data: Raw data of the amplicon analysis of G. mellonella microbiome with GAIA. S6 Data: Raw data of the WGS analysis of G. mellonella microbiome with GAIA. S7 Data: Significant changes of abundance in G. mellonella microbiome detected by 16S analysis with GAIA. S8 Data: Significant changes of abundance in G. mellonella microbiome detected by WGS analysis with GAIA. S9 Data: Virtual microbiomes analyzed with Kraken2. S10 Data: Raw data of the analysis of G. mellonella gut microbiome with Kraken2.

    (ZIP)

    Attachment

    Submitted filename: Response to reviewers PDF.pdf

    Attachment

    Submitted filename: Response to reviewers review 2.pdf

    Data Availability Statement

    All relevant data are within the paper and its Supporting information files.


    Articles from PLOS ONE are provided here courtesy of PLOS

    RESOURCES