Skip to main content
Springer Nature - PMC COVID-19 Collection logoLink to Springer Nature - PMC COVID-19 Collection
. 2023 Apr 5:1–17. Online ahead of print. doi: 10.1007/s41745-023-00370-z

Big Data for a Small World: A Review on Databases and Resources for Studying Microbiomes

Pratyay Sengupta 1,2,3, Shobhan Karthick Muthamilselvi Sivabalan 1, Amrita Mahesh 1,2, Indumathi Palanikumar 1,2,3, Dinesh Kumar Kuppa Baskaran 1,2,3, Karthik Raman 1,2,3,
PMCID: PMC10073628  PMID: 37362854

Abstract

Microorganisms are ubiquitous in nature and form complex community networks to survive in various environments. This community structure depends on numerous factors like nutrient availability, abiotic factors like temperature and pH as well as microbial composition. Categorising accessible biomes according to their habitats would help in understanding the complexity of the environment-specific communities. Owing to the recent improvements in sequencing facilities, researchers have started to explore diverse microbiomes rapidly and attempts have been made to study microbial crosstalk. However, different metagenomics sampling, preprocessing, and annotation methods make it difficult to compare multiple studies and hinder the recycling of data. Huge datasets originating from these experiments demand systematic computational methods to extract biological information beyond microbial compositions. Further exploration of microbial co-occurring patterns across the biomes could help us in designing cross-biome experiments. In this review, we catalogue databases with system-specific microbiomes, discussing publicly available common databases as well as specialised databases for a range of microbiomes. If the new datasets generated in the future could maintain at least biome-specific annotation, then researchers could use those contemporary tools for relevant and bias-free analysis of complex metagenomics data.

Keywords: Microbiome databases, Environmental microbiome, Biome-specific microbiome, Industrial microbiome, Computational analysis of microbiomes, Metagenomics databases

Introduction

Microorganisms carry out numerous functions in essential ecosystem processes and collectively shape biodiversity. Along with their crucial role in human health and diseases114, microorganisms across other diverse environments help us with a range of applications from the agricultural industry88 to food processing94 and generating a sustainable utility of wide environmental resources15. However, microorganisms do not prefer solitary existence and demonstrate utilisation of resources through a complex trade-off between cooperation and competition72. Understanding these co-existing microbial communities is crucial in the post-genomic era for re-engineering communities to serve designated functional deliverables.

With recent advances in high-throughput sequencing techniques in terms of cost reduction, an enormous amount of complex metagenomics data has been generated. Over the last decade, it has helped researchers to investigate a variety of microbial communities, from human gut99, livestock109, plant rhizosphere66 to deserts5, polar ends116, deep-sea hydrothermal vents12,31, earth surface98 and beyond96,103 in a culture-independent way. However, these large quantities of data have not been matched with the comparable computational resources available. The lack of standardised protocols, proper data annotation, data archiving with metadata, common analysis pipelines, and inadequate resource organisation has restricted further discoveries through underutilizing the available microbial sequences and constrained meta-learning and meta-mining. This insufficient data interoperability led to the bulk of microbiome data being “single-use” without being repurposed beyond the original scope of the study55.

In this perspective, we provide an overview of available microbiome resources and a brief overview of the current state of computational microbiome research and its importance. In addition, we highlight some of the advances made in analysing complex microbial sequencing data and understanding their interactions in the communities. We believe it will further stimulate microbiome researchers to initiate large-scale meta-analyses of the available sequencing data and develop robust computational tools to generate more valuable insights from the available individual studies.

Microbiome Databases

To study complex microbial networks, community structures, and host–microbiome interactions, microbiome databases derived from metaomics experiments are essential. Common data repositories like Integrated Microbial Next Generation Sequencing (IMNGS)56, Metagenomic RAST (Rapid Annotation using Subsystems Technology)50, MGnify by EMBL-EBI78, NCBI (https://www.ncbi.nlm.nih.gov), SILVA86 host bulk of the microbial metagenomics and other metaomics data sets. However, it is important to understand the nature of the host environment which directly impacts the composition and conformation of its microbial community. Several biome-specific databases have been developed over the past decades which could be utilised to understand microbiomes across different habitats. Below we classified the available microbiome databases into five broad categories namely environmental, human, animal, plant, and industrial microbiome. Selected databases with prominent and structured datasets belonging to these categories are reported in Table 1 with brief descriptions and their biome content.

Table 1:

Some examples of microbiome databases: The table provides a brief description of some key example microbiome databases, along with web URLs and references

Database name Type of microbiome Description Link References
A Database for Domestic Animal Gut Microbiome Atlas (ADDAGMA) Animal Domestic animal gut microbiome data associated with specific phenotypes such as health and production traits http://addagma.omicsbio.info/ 111
Amadis Human Database for association between diseases and gut microbes http://gift2disease.net/GIFTED/ 64
Animal Microbiome Database (AMDB) Animal 16S rRNA gene profiles for animal gut microbiome across multiple projects http://leb.snu.ac.kr/amdb 113
AnimalMetagenome DB Animal Metagenomic data from domestic and wild animals http://animalmetagenome.com/ 47
Atacama Database Environmental Information on microbes present in Atacama desert https://www.atacamadb.cl/ 25
BASE: the Biomes of Australian Soil Environments soil microbial diversity database Environmental Database of amplicon data, complete with rich contextual information on edaphic, aboveground diversity and climate http://www.bioplatforms.com/soil-biodiversity/ 17
Beebiome Data Portal Animal Catalogue of bee-associated microbes and viruses https://beebiome.org/ 35
BeeExact Animal Facilitate standardized classification of bee-associated microbial communities, improve cross-study reproducibility, and help to highlight novel candidate taxa in need of characterization https://github.com/bdaisley/BEExact 27
biogasmicrobiome Industrial Microbiome data from anaerobic digesters and processes such as lignocellulose degradation https://biogasmicrobiome.env.dtu.dk/ 21
BugSigDB Human Manually curated database that standardises results from previous studies on microbial signatures https://bugsigdb.org/Main_Page 41
Coral Microbiome Database (Coral MD) Animal Compilation of SSU rRNA sequences from corals https://www.bco-dmo.org/dataset/724355 48
Disbiome Human Cataloguing microbiome composition changes in disease conditions—queries can be made based on data such as disease or microorganism name https://disbiome.ugent.be/home 49
Earth Microbiome Project (EMP) Environmental An initiative to catalogue the microbiomes across the globe https://earthmicrobiome.org/ 43
Extreme Microbiome Project (XMP) Environmental Description and characterisation of microbes surviving under extreme environmental conditions http://extrememicrobiome.org/ 100
Food Microbe Tracker Industrial Ribotype, phenotype, gene sequences, PFGE (pulsed-field gel electrophoresis) data for food-associated microbes https://www.foodmicrobetracker.net/login/login.aspx 105
FoodMicroBionet Industrial Data from and correlation network analyses for 17 studies on food-associated microbes http://www.foodmicrobionet.org/ 83
Genomic Features Of Bacterial Adaptation to Plants (GFOBAP) Plant Data on plant-associated microbes and genes that contribute to this association http://labs.bio.unc.edu/Dangl/Resources/gfobap_website/ 62
GMRepo Human Curated metagenomic data from the human gut https://gmrepo.humangut.info/ 26
GreenGenes Common 16S rRNA database that checks for chimeras https://greengenes.secondgenome.com/ 29
Human Microbiome Project Human Microbiome data from human skin and mucosal samples https://hmpdacc.org/hmp/ 68
Human Oral Microbiome Database Human Taxonomic and sequence data from bacteria associated with the mouth and aerodigestive tract https://www.homd.org/ 37
Integrated Microbial Next Generation Sequencing (IMNGS) Common Platform to screen for and process all 16S rRNA sequences available in SRA https://www.imngs.org/ 56
Life Science Data Archive (LSDA) Environmental Life Science Data Archive (LSDA) is a searchable NASA-federated data archive containing information on flight analog data along with the respective human, plant and animal data https://nlsp.nasa.gov/explore/lsdahome 108
MG-RAST Common Metagenomics data repository which includes microbiome data https://www.mg-rast.org/ 50
Mgnify Common Submission and analysis of microbiome data from various biomes (including human, aquatic, plant and wastewater microbiomes) and including multiple analysis types such as amplicon, assembly and metagenome data https://www.ebi.ac.uk/metagenomics/ 78
MicrobiomeDB Human Database and tool for querying microbiome studies (including human and animal) based on metadata queries, including sample details and taxon abundance https://microbiomedb.org/mbio/app 82
MiDAS Industrial 16S rRNA and taxonomic data for wastewater plant-associated microbes https://www.midasfieldguide.org/ 33
NASA Genelab Environmental NASA Genelab is a collection of omics datasets for the biological samples exposed to radiation and ions with high atomic number and energy. Along with the datasets, Genelab goes a step further to provide analytic and visualization tools for host-associated molecular signals https://genelab.nasa.gov/ 16
PAMDB Plant MLST/MLSA data for plant-associated microbes http://genome.ppws.vt.edu/cgi-bin/MLST/home.pl 4
SILVA Common rRNA database for organisms from 3 domains of life—Bacteria, Archaea and Eukaryota https://www.arb-silva.de/ 86
SKIOME Human Curated skin microbiome datasets with metadata https://github.com/giuliaago/SKIOMEMetadataRetrieval 1
SpaceLID Environmental Space Life Investigation Database (SpaceLID) records investigations published in peer reviewed academic journals. In SpaceLID, each publication constitutes an entry, which commonly includes the procedure and outcomes presented in figures and tables of multiple experiments https://bidd.group/spacelid/ 108
The Integrative HMP (iHMP) Research Network Consortium Human Expanded version of HMP—includes IBD, Type-2 diabetes and preterm birth https://hmpdacc.org/ihmp/ 67
UNITE Common Database for eukaryotic ITS sequences—provides representative sequences for a Species Hypothesis (SH) which serves to act as a bridge between OTU and species classification for fungi https://unite.ut.ee/ 81
Vaginal Microbiome Consortium Human Data from the Vaginal microbiome project, including 16S rRNA, whole metagenome shotgun sequencing data, and phenotype and clinical metadata https://www.vmc.vcu.edu/index.html 39

Environmental Microbiome

The complex interplay among microorganisms in a community and their role in utilising environmental resources drive species richness of the biosphere. Understanding this microbial diversity, complex patterns of distribution, interactions, and community assembly across diverse ecosystems is essential to find ecological selections, the effect of geological parameters, and surveilling the acquisition of antimicrobial resistance (AMR). From 2010 onwards, the Earth Microbiome Project (EMP) has assembled 27,751 samples from 97 different studies to reveal the microbial landscape of the earth’s environments ranging from plant surface, rhizosphere to fresh water, salt water, soil, sediment, animal gut, and many more43,98. RefSoil+34 and BASE (abbreviated from the Biomes of Australian Soil Environments)17 are two extensive databases on soil microbiomes. The former contains plasmid information from soil microbiomes with the potential to transfer antibiotic resistance genes (ARGs) from the environment to clinical strains, while the latter directs us to soil metagenomics data from Australia and Antarctica, initiating an improvement of agricultural outcomes and restoring biological functionalities complementing physicochemical properties of the soil.

Apart from ‘hospitable’ environments, microorganisms thriving in extremely harsh environments, ranging from the deep oceans to dry deserts, exhibit growth under limiting conditions77. In 2014, a scientific consortium initiated by the Association of Biomolecular Resource Facilities (ABRF) Metagenomics Research Group (MGRG) called The Extreme Microbiome Project (XMP), attempted to discover and characterise the extremophiles and novel organisms from Lake Hillier in Australia, Darvaza gas crater in Turkmenistan, Permafrost tunnels in Alaska, Ethiopian toxic hot springs, and many more extreme places100. Along with the novel biomolecular techniques for sample processing at extreme locations, XMP projects have benchmarked several computational tools for de novo taxa classification and identification of novel biosynthetic gene clusters (BGCs) from minimally assembled genomes32. Microorganisms that evolved and adapted in these extreme environments possess heterogeneous biological resources with potential biotechnology applications. Atacama Database is a freely available, curated database with 2302 cultured and uncultured microorganisms from one of the aridest and rich in diverse microorganism deserts on the planet, Atacama, in northern Chile25.

Moreover, microorganisms present in human inhabitance strongly influence human health. Confirming biological cleanliness in built environments is essential for protecting human health and avoiding contamination in industry-level assemblies73. Along with inhabiting humans, the microbiome of a built environment depends on many factors, including geographical location, ventilation, usage, and constructional design23. An interesting study led by Jack Gilbert in 2014, Home Microbiome Project, aimed to understand the dynamic microbial interactions between home and their habitats. A total of 1609 samples from a combination of seven houses, 18 people, three dogs, and one cat were collected. This longitudinal study showed that unique microbial patterns in the homes could be predicted by their occupants, and microorganisms are able to colonise rapidly in the new environments as the habitats move house59. Later, the International Metagenomics and Metadesign of Subways and Urban Biomes (MetaSUB) Consortium, established in 2015, gathered over 4500 samples from 60 cities over a period of 3 years and generated a global catalogue of the urban microbiome28. MetaSUB initiative noticed a core set of non-human microorganisms consistently present across the cities. They identified novel bacterial and viral contents in the samples and the global widespread of ARGs. However, other built-in cleanroom environments like operation theatres (OTs) and intensive care units (ICUs) need to maintain a standard operating environment with minimal microbial content87. Recent studies from hospitals investigated a detailed understanding of cleanroom microbiomes and their requirement for the maintenance and regulation of the microbial community structure of the built environment87,90,110.

The study of extraterrestrial environments and spacecrafts equips us with insights into the effect of space on microbiota, their survival, metabolite production, biofilm formation, development of drug resistance, as well as the impact of space travel on astronauts’ health. Through their extensive specialised experiments, Spaceflight Life Investigations (SLIs) generate new perspectives on basic microbiology studies and provide unique opportunities for pharmaceutical industries108. Life Science Data Archive (LSDA) is a searchable National Aeronautics and Space Administration (NASA)-federated data archive containing information on flight analogue data along with the respective human, plant, and animal data93. Further, NASA Genelab is a collection of omics datasets for the biological samples exposed to radiation and ions with high atomic numbers and energy. Along with the datasets, Genelab goes a step further to provide analytic and visualisation tools for host-associated molecular signals16. Another initiative, Space Life Investigation Database (SpaceLID), collected and reported procedures and outcomes of 448 SLIs across 90 species from multiple studies108. A recent study by Kumar et al. revealed the central role of microbial metabolism and metabolite dependencies in the development of the International Space Station (ISS) microbiome through combined graph-theoretical and constraint-based modelling approaches53. Computational approaches such as these can provide significant insights towards unravelling the complexity of various microbiomes.

Human Microbiome

Microorganisms inhabit collectively on multiple human body sites, from the digestive tract and skin to respiratory and reproductive tracts, and possess 100 times the gene content of a human42. The complex microbial ecosystem in different parts of the body is unique in its composition and provides more genetic diversity and flexibility to the host42. Blooming research on the human microbiome provides evidence for their implications in critical host functionalities like innate and adaptive immune system development117, energy homeostasis22,24, and metabolism115. For instance, more abundant gut microbiota can partly explain the immune system response modulation91. Despite the significance of the human microbiome, the microbiota composition can be easily influenced by several intrinsic (host genetics and age)30,70 and extrinsic (lifestyle, geographical location, diet, and antibiotic uptake)54,79,80 factors and leads to higher interpersonal variation. The alteration in microbiota composition at spatial and temporal scales within a person (intrapersonal variation) along with interpersonal variation makes it difficult to untangle the microbiota association with human wellness or with the disease onset and progression and understand the influence of external factors on the microbiota composition.

To unravel the underlying host–microbiota and microbe–microbe interactions, mechanistic studies are done on different populations present across the world. One such interdisciplinary study, Human Microbiome Project (HMP)68, attempted to find new ways of determining disease predisposition and improving health through the monitoring and manipulating human microbiota by analysing the skin and mucosal (mouth and gut) microbiota samples collected from the United States. The project is expanded with the longitudinal microbiome samples of people suffering from Inflammatory Bowel Disease (IBD), Type-2 diabetes, and preterm birth to understand the impact on the dynamics of human health and disease (The Integrative HMP (iHMP) Research Network Consortium)67. Since the human gut microbiota is highly dense and involved in critical host functions, several studies on the European population (MetaHIT63,85), African population2, and Asian population (Asian Microbiome Project52,97, Westlake Gut Project45) have been done to investigate the interaction of intestinal microbiota with the human host and the trajectories of microbiota development.

The compositional analysis of the human microbiota offers only a restricted view of the microbial assembly, and employing ecosystem-specific gene and genome functional information with the composition could help in probing the evolutionary trajectory of the microbiota structure. The Unified Human Gastrointestinal Genome (UHGG) catalogue3 showcased a community effort to build a reference sequence repository comprising 2,04,938 genomes from 4644 microorganisms and provided a glimpse of the structural and functional diversity of the human gut microbiota. Following the gut microbiome, multiple studies on the oral microbiome are collated into Human Oral Microbiome Database (HOMD) to improve the taxonomic assignment quality using standard reference sequence of the microorganism present in the nose and alimentary canal37. Though these databases allow the researcher to access genomic information from multiple studies, raw sequences need to be analysed through the metagenomic pipeline (QIIME219, MOTHUR92, MetaPhLAn418 and HUMAnN14) and visualised using a bar graph to identify the variation in microbiota composition. But, MicrobiomeDB, a data-discovery and analysis tool, aids in collecting samples from 20 human microbiome studies, including HMP, ECAM, BONUS, and MAL-ED cohorts in a biom format, which contains taxonomic information of the microbiome rather than raw sequences82. In-built characterization of microbiome samples for diversity metrics, relative abundance, and differential abundance analysis and interpreting results through visualisation enables the user to understand the data and design the experiments accordingly.

Common microbiome databases such as MGnify and CNGB-MicrobiomeDB, and microbiota-specific databases such as GMRepo (gut)26, Human Microbiota Bioactive Resources (gut), SKIOME (skin)1, Vaginal Microbiome Consortium39 presented a collection of datasets on various human microbiomes from different demographics. The collected data can be harnessed to predict the characteristic microbial signature or biomarker associated with the physiological condition of the host using graph-based61, constraint-based46 or machine-learning-based approaches74. Databases such as BugSigDB41, Disbiome49 and Amadis64 curated the microbial association with various disease conditions based on experimental evidence and differential abundance studies. These comprehensive databases could help the research community to understand the context-specific effects of a microorganism in microbiota. Apart from studying the microbiome-host phenotype association to engineer the microbiota for diagnostic/ therapeutic interventions, human microbiome research can also assist in exploring the evolutionary history of human microbiota and discovering novel bioactive compounds to address the challenges associated with the treatment of multi-drug resistant (MDR) human pathogens.

Animal Microbiome

In recent years, apart from the environmental and human microbiome, researchers have started to explore the microbiome of species such as livestock8,109, bees35, howler monkeys6, and corals106 to understand their impact on pathogen resistance, chemical resistance, and climate change9. Among non-human microbiomes, the animal gastrointestinal microbiome comprises a wide variety of microorganisms that play a crucial role in the physiological processes of the host, including development, immunity, and metabolism38,112 and also induce pathogenesis at times51. For investigating bidirectional host–microbe interactions across cross-host and cross-phenotypes, robust databases have become necessary lately. The Animal Microbiome Database (AMDB) by Yang et al.  is a manually curated, user-friendly database containing bacterial 16S ribosomal RNA (rRNA) gene profiles of 10,478 bacterial taxa across 467 animal species from 34 major projects113. The interactive interface of AMDB is convenient for searching by taxa of interest, taxonomic profiles of the samples, and summary of included projects. Another comprehensive database, ADDAGMA (abbreviated from A Database for Domestic Animal Gut Microbiome Atlas), is a publicly available database for domestic animal gut microbiome equipped with browsing and query-based searching for annotated microbial and metadata from major four domestic animals (cattle, chicken, horse, and pig) along with 48 relevant phenotypes111. Databases of these volumes are quite useful for capturing the variation in the animal gut microbiome and their association with host phenotypes. Despite possessing a large quantity of microbial data, AMDB and ADDAGMA only contain bacterial diversity, though animal guts comprise archaea, fungi, protozoa, and viruses as well. A recent database CRAMdb has facilitated the users to study consistently annotated microbiomes (278 archaea, 9430 bacteria, 2216 fungi, and 458 viruses) across 516 animals and 43 phenotypes60. This exhaustive database provided a microbiome landscape from the gastrointestinal tract and other body sites (e.g. skin) of the host. The animal metagenome database, popularly known as, AnimalMetagenome DB, is another public animal metagenomic database majorly from wild animals47. Other macro databases like IMNGS and MGnify also contain metagenomics data from multiple sources, including animals as hosts.

A specialised gut microbial community in honey bees plays a critical role in their growth and development and, subsequently, in the production of honey from nectar20. Dysbiosis of bee gut microbiome through environmental factors such as stress, diet, and antibiotic exposure increases the risk of pathogenic attack, leading to significant economic loss to apiarists89. Beebiome Data Portal is a curated database with 14,418 biosamples from 292 bioprojects, along with metadata and reference sequences of the bee-associated microorganism35. Moreover, to address under-annotated bee microbiome, exploration of uncultivated microbial dark matter, and inconsistent phylogenetic placement,27 developed a curated, comprehensive and non-redundant reference database, BEExact, based on 16S rRNA sequencing27. Further, away from the land, swallowed in the ocean, marine invertebrate corals provide an abode for a plethora of microbial communities with multiple inter-species interactions. Coral microbiomes, also known as Beneficial Microorganisms for Corals (BMC), predominantly consist of bacteria and archaea, and they play a quintessential role in the health and ecology of corals84. In the era of anthropogenic perturbations, conserving marine ecosystems through coral protection and subsequent restoration, manipulation, and reversing microbial dysbiosis is crucial. Coral Microbiome Database (Coral MD) is a collection of 21,000 small subunit rRNA sequences from nine studies. It identifies a set of 39 bacteria and two archaea specific to coral holobiont48.

However, owing to the large and decentralised nature of the data, it is still difficult for users to meta-analyse, mine, and compare the data sets. Qiita, a web-based, open-source microbiome study management platform, is popularly used for quick re-analysis of data sets using standardised platforms with graphical user interfaces44. Also, MG-RAST is a repository of metagenomics data along with the tools for annotation and analysis of them50. These tools are constantly evolving in terms of pipeline development and data standardisation to complement the pace of sequencing advances over the last decade.

Plant Microbiome

The plant microbiome plays an important role in plant health, nutrient uptake, disease resistance, and tolerance to stressful environmental conditions102. Typically, the plant-associated microbiome can be divided into the phyllosphere, a microbiome associated with the aerial or above-ground regions, and the rhizosphere, which is associated with the root or the regions of the plant that are below-ground. Additionally, endophytes colonise the region within the plant’s stem, roots, leaves, or other tissues71. Root-associated fungi are of two types, ectomycorrhizal and endomycorrhizal fungi, with ectomycorrhizal fungi playing an important role in root defence against pathogens75. The plant microbiome is recruited from various “reservoirs”—the rhizosphere microbiome, for instance, is recruited from bulk soil via root exudates107. One challenge that arises while identifying and cataloguing the microbial diversity associated with plants is the correct identification of different microbial species. In fact, due to the slow advances in the taxonomic assignment of sequences, around 43% of 16S rRNA records in GenBank are classified as “environmental” samples or “unclassified”29. Databases such as SILVA86, GreenGenes29, and UNITE81 are used for the identification of microorganisms. The GreenGenes database checks for chimeras, which are common in 16S rRNA screening, and performs alignments and taxonomic classification of species. The SILVA database includes data on bacteria, eukaryotes, and archaeal microorganisms. It can further be used for 16S rRNA-based taxonomic assignment along with UNITE, a fungal Internal Transcribed Spacer (ITS) sequence database. A review by Lucaciu et al. discusses a number of such tools and databases that can be used in plant microbiome studies71.

While these are commonly used resources and can be used in different kinds of studies, some plant-specific databases can be used to deposit and access microbiome data. The PAMDB database allows Multilocus Sequence Typing (MLST) and Multilocus Sequence Analysis (MLSA) data to be deposited along with metadata such as the region and year of data collection4. The metadata also includes various substrates, such as soil and water, from which microorganisms have been isolated. Later, Trivedi et al. suggested a trait-based or functional approach for characterising plant microbiomes and their association with the plants101. A comprehensive and massive initiative by Levy et al. sequenced 484 genomes from root bacteriomes and compared more than 3500 genomes to identify and characterise candidate genes required at the bacteria-plant interface62 63. Another challenge in plant microbiome studies involves contamination from the host DNA, which is difficult to resolve due to the close phylogenetic relationship between plant organelles such as chloroplasts and bacterial sequences. It is essential that DNA sequences from the host plant, including plastid sequences, are filtered out accurately40. Databases such as the CpGDB database, which maintain a collection of chloroplast sequences from 3823 plant species, can be used to overcome this effect95.

Effectively utilising the microorganism associated with crop species or the crop microbiome has been proposed as a strategy to increase crop productivity and food security. Crop improvement techniques that focus on the rhizosphere and endophytic microorganisms are more likely to succeed compared to techniques that utilise microorganisms from the bulk soil due to evolutionary processes that have shaped interactions between host plants and their microbial communities95. The AgMicrobiomeBase project collected metagenomic data from the soil microbiome (bulk soil and the rhizosphere) for crops such as barley and wheat across nine different types of soil in the UK (https://agmicrobiomebase.org). However, there are some challenges associated with harnessing the crop microbiome, one of the major ones being standardising the use of specific microorganisms under different environmental or soil conditions. Moreover, algae-associated bacteria and fungi play an important role in the functioning of the holobiont. However, most studies on algal microbiomes or the phycosphere have been focused on wastewater treatment studies65. Studies of the forest microbiome involve integrating data from a number of sources, including soil, roots, foliage, bark, rocks, and even dead wood. All these microbiomes together form a complex and dynamic ecosystem that is primarily influenced by the trees present10. However, we noticed a relatively small set of databases that explicitly targeted plant microbiomes. Macro databases such as MGnify, which are used to deposit microbiome data, reveal disparity as it has 99,926 samples for the human digestive system alone, outnumbering the number of plant samples (28,666) by more than three times (Accessed on 18th January 2023). We showed the distribution of the samples present in the MGnify database across the described biomes in Fig. 1 for providing an estimate of experiments conducted across these biomes. Approaches such as MLST/MLSA which are used in general for generating sequencing data could be exploited for getting more information on unexplored plant microbiomes.

Figure 1:

Figure 1:

The pie chart in the centre depicts the segment of samples from MGnify, one of the major microbiome databases, across discussed biomes. The corresponding percentages represent biome-specific samples in MGnify. Other specific databases for each of the biomes are listed adjacently. This illustration was generated using Adobe Illustrator CS6.

Industrial Microbiome

Diverse microbiomes influence complex processes11 like the synthesis of polymeric substances and maintaining geochemical cycles in nature, making them important contributors to industrial applications. Microbial cooperation is extremely important to perform these complex activities, which a single microbial species cannot perform individually. The food industry is one such industry that can make the most out of microbial communities104. Microbiome studies in this field focus on the microbial composition of a healthy gut for designing probiotics76. Assessing the metabolic capability of microbial communities for product production, sequencing microbial genomes in complex microbiomes and characterising gut microbiome is essential for healthcare7. A study by Beck et al. monitored changes in food microbiome in 31 high protein powder (HPP) samples in poultry meals13. A microbiome analysis pipeline was developed to detect the microbe composition in each sample and used it as an indicator for the presence of contaminants in the poultry meal. An interesting initiative, the Food Microbe Tracker database, contains ribotype, phenotype, gene sequences, PFGE (Pulsed-field Gel Electrophoresis), and reference data for 96,792 isolates in total105. Moreover, the FoodMicroBionet database (while currently inaccessible) has collected data from 17 studies on microorganisms associated with foods such as dairy and meat, along with network-based correlation analyses83.

Wastewater management is another concern in industries. Microorganisms can play a major part in treating wastewater owing to their ability to degrade contaminants and pollutants in industrial exudates. Understanding wastewater or sewer microbiomes has implications for human health and for wastewater treatment. These microbiomes vary with time and are influenced by the season and environmental conditions57. The MiDAS database contains 16S rRNA and taxonomic data for wastewater treatment plant microbiomes collected by the MiDAS consortium33. Moreover, anaerobic digester microbiomes are required for the proper degradation of the organic matter, and they are essential for the production of the final product, methane32. The biogas microbiome database (https://biogasmicrobiome.env.dtu.dk) contains data from various anaerobic digesters, including genomes from microorganisms associated with biogas reactors and processes such as lignocellulose degradation, cheese whey degradation, and saccharide degradation.

Conclusion

In the recent past, microbiome research has grown rapidly, with an emphasis on studying microbial interactions and their functions in communities. Being associated with human physiology to industry-level bioprocesses, microorganisms perform a range of functions through their diverse metabolic potential. Identification of novel microorganisms and understanding their co-occurrence patterns, community dynamics, and AMR are some of the important aspects of microbiome research. Despite multiple experimental and computational methods developed we are yet to explore the role of microorganisms in many other contexts due to cost and time constraints. Repurposing the existing data for designing novel strategies to study microbiota at temporal and spatial scale will be beneficial for deciding specific wet lab experiments and speeding up the research. To streamline the data comparison and re-analyses, we reviewed major microbiome databases and resources across environments and hosts, enabling the integration of data types.

In this paper, we argue the importance of establishing common microbiome data repositories for cataloguing the microbial composition of the community of interest and facilitating further novel analyses. We categorise the databases into five major biomes namely environmental, human, animal, plant, and industrial microbiomes, and discuss the benefits and limitations of the existing resources. Environmental microbiome being one of the most important microbiomes determine the sustainability, and safety of the environment we are living in. Temporal microbiome data (e.g. from MetaSUB) have the potential to track the dynamics of the microbiomes, characterise the AMR, and subsequently, find the BGCs. Moreover, microorganisms thriving in extreme environments exhibit auxotrophy and communicate cooperatively in the community. Identified small molecules from these environments show the extreme metabolic capacity of the extremophiles28. Despite having many microbiome databases capturing the meta-omics of the environmental microbiome from different environments, they lack meta-information from the environments, which are very crucial factors in exploring the context-specific variation in altering the composition and functionality of the community69.

Human microbiome research has progressed tremendously compared to other ecosystems such as plant and animal microbiomes. The identification of probiotics/ biomarkers to engineer the microbiota using a computational approach has attained great attention due to their therapeutic/ diagnostic potential and better bioavailability. However, longitudinal studies on extensive sampling of multiple body sites (from each subject) with multi-omic analysis are needed to unveil the microbiota evolution, interactions between the various microbiome within humans (skin, oral, gut microbiome, etc.) and crosstalk between the host metabolism and community function. On the other part, the animal microbiome also becomes an essential domain of microbiomes owing to their contribution to our daily life. The animal gut is the most studied organ for microbiome research for its role in producing several by-products of interest. However, other microbiomes like skin microbiomes, are closely associated with the fitness of their animal host and are poorly studied58. We further suggest that characterising other animal microbiomes will be useful for understanding microbial symbiosis, the immune system of the host, and tracking their exposure to environmental contaminants.

The plant microbiome is perhaps one of the underrated microbiomes. Microbial interactions in plant microbial communities control plant growth, health, nutrient uptake, and subsequently regulate beneficial traits. A recent study showed that data-driven modelling approaches in plant microbiomes can link the data to the phenotype through multiscale computational simulations101. Modelling engineered microbial communities will be effective for the safe and sustainable production of plants on a large scale. Moving forward, industrial microbiomes are very application-oriented as it involves studying microbes that are involved in industrial processes and wastewater management. Despite having several databases, in most cases, the data are generated for only a specific purpose and remain unusable beyond the scope.

Moreover, the recent outbreak of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in 2019 caused a global socio-economic disruption and snatched over 5 million lives to date. Though viruses are often minimally explored in microbiome studies, the COVID-19 pandemic has driven researchers to investigate host-virome interactions. VIROME is a web application with robust pipelines for analysing viral metagenomics sequences and secondary analyses36. Rapid progress in computational tools during these necessary times accelerated the research and marked its capabilities. Nonetheless, with the development of new algorithms and quick analysis of data, storing the structured outcome in a database will increase the scope and the size of the domain. However, even with the many advances in computational methods, different ecosystems demand different data collection and preprocessing steps, restricting robust tool development and comparative studies. Development of tools to predict microbial interactions in different ecosystems could help in understanding the ecosystem-specific community structure and their role in the host–microbiome interactions. In summary, our review will encourage the community to develop a generic database of microbiomes and increase cross-community Microbiome Wide Association Studies (MWAS) and reveal the hidden potential of the tiny organisms that have a massive impact on our environment, health and well-being.

Acknowledgements

P.S. and I.P. are recipients of the Prime Minister’s Research Fellowship (PMRF) from the Ministry of Education, Government of India. A.M. acknowledges the Half-Time Teaching Assistantship (HTTA) from the Ministry of Education, Government of India. K.R. acknowledges support from the Science and Engineering Board (SERB) MATRICS Grant MTR/2020/000490.

Biographies

Pratyay Sengupta

is a Prime Minister’s Research Fellow at the Department of Biotechnology, IIT Madras. He has a Bachelor of Biotechnology from the National Institute of Technology, Durgapur. His research focuses on developing computational tools for the analysis of genomics and metagenomics data, with a particular emphasis on microbial community analysis.graphic file with name 41745_2023_370_Figa_HTML.jpg

Shobhan Karthick Muthamilselvi Sivabalan

is a 4th year undergraduate from IIT Madras pursuing his dual degree in Biological Engineering, with a specialisation in Computational Biology. His love for biology and craze for computers have led him to the field of computational biology. His research interests include metabolic modelling and network-based analysis of microbial communities.graphic file with name 41745_2023_370_Figb_HTML.jpg

Amrita Mahesh

is currently pursuing her B.S.--M.S. in Biological Sciences at IIT Madras. Her research interests include understanding microbial interactions in microbiomes and cancer genomics.graphic file with name 41745_2023_370_Figc_HTML.jpg

Indumathi Palanikumar

is a Prime Minister’s Research Fellow at the Department of Biotechnology, IIT Madras. She completed her master’s degree in Biotechnology at IIT Guwahati, where she conducted research on characterising a recombinant ovine protein in Pichia pastoris. Currently, her doctoral research focuses on studying the transient impact of enteric pathogens on the gut microbiota of infants in low- and middle-income countries.graphic file with name 41745_2023_370_Figd_HTML.jpg

Dinesh Kumar Kuppa Baskaran

has a Master of Science (by research) from IIT Madras. His thesis involved developing methods to predict various features of microbiomes by analysing the interactions among the microbes within them. He utilises a combination of network-based and constraint-based approaches to model microbial communities.graphic file with name 41745_2023_370_Fige_HTML.jpg

Karthik Raman

has a PhD from IISc Bangalore (2009). After a postdoctoral stint at the University of Zurich, he joined IIT Madras in 2011 as a faculty at the Bhupat and Jyoti Mehta School of Biosciences, where he is now a professor. His research group works on developing algorithms and computational tools to understand, predict and manipulate complex biological networks, especially microbiomes. He has 50+ publications and has also authored a textbook on Computational Systems Biology.graphic file with name 41745_2023_370_Figf_HTML.jpg

Declarations

Conflict of interest

There is no conflict of interest between authors in writing this review.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Pratyay Sengupta, Email: pratyaysengupta@smail.iitm.ac.in.

Shobhan Karthick Muthamilselvi Sivabalan, Email: be19b009@smail.iitm.ac.in.

Amrita Mahesh, Email: bs18b002@smail.iitm.ac.in.

Indumathi Palanikumar, Email: indumathi@smail.iitm.ac.in.

Dinesh Kumar Kuppa Baskaran, Email: bt19s011@smail.iitm.ac.in.

Karthik Raman, Email: kraman@iitm.ac.in.

References

  • 1.Agostinetto G, Bozzi D, Porro D, Casiraghi M, Labra M, Bruno A. SKIOME Project: a curated collection of skin microbiome datasets enriched with study-related metadata. Database. 2022;2022:baac033. doi: 10.1093/database/baac033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Allali I, Abotsi RE, Tow LA, Thabane L, Zar HJ, Mulder NM, Nicol MP. Human microbiota research in Africa: a systematic review reveals gaps and priorities for future research. Microbiome. 2021;9(1):241. doi: 10.1186/s40168-021-01195-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Almeida A, Nayfach S, Boland M, Strozzi F, Beracochea M, Shi ZJ, Finn RD. A unified catalog of 204,938 reference genomes from the human gut micro biome. Nat Biotechnol. 2021;39(1):105–114. doi: 10.1038/s41587-020-0603-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Almeida NF, Shuangchun Y, Rongman C, Clarke CR, Morris CE, Schaads NW, Vinatzer BA. PAMDB, a multilocus sequence typing and analysis database and website for plant-associated microbes. Phytopathology. 2010;100(3):208–215. doi: 10.1094/PHYTO-100-3-0208. [DOI] [PubMed] [Google Scholar]
  • 5.Alsharif W, Saad MM, Hirt H (2020) Desert microbes for boosting sustainable agriculture in extreme environments, vol 11. Frontiers Media S.A. 10.3389/fmicb.2020.01666 [DOI] [PMC free article] [PubMed]
  • 6.Amato KR, Yeoman CJ, Kent A, Righini N, Carbonero F, Estrada A, Leigh SR. Habitat degradation impacts black howler monkey (Alouatta pigra) gastrointestinal microbiomes. ISME J. 2013;7(7):1344–1353. doi: 10.1038/ismej.2013.16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Andoh A, Kuzuoka H, Tsujikawa T, Nakamura S, Hirai F, Suzuki Y, Matsumoto T. Multicenter analysis of fecal microbiota profiles in Japanese patients with Crohn’s disease. J Gastroenterol. 2012;47(12):1298–1307. doi: 10.1007/s00535-012-0605-0. [DOI] [PubMed] [Google Scholar]
  • 8.Andrade BG, de Souza MM, Amat S, Regitano LC, de Oliveira PS (2022) Editorial: Microbiome genomics for livestock production, vol 13. Frontiers Media S.A. 10.3389/fgene.2022.1000749 [DOI] [PMC free article] [PubMed]
  • 9.Bahrndorff S, Alemu T, Alemneh T, Lund Nielsen J. The micro-biome of animals: implications for conservation biology. London: Hindawi Limited; 2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Baldrian P. Forest microbiome: diversity, complexity and dynamics. FEMS Microbiol Rev. 2017;41(2):109–130. doi: 10.1093/femsre/fuw040. [DOI] [PubMed] [Google Scholar]
  • 11.Basile A, Campanaro S, Kovalovszki A, Zampieri G, Rossi A, Angelidaki I, Treu L. Revealing metabolic mechanisms of interaction in the anaerobic digestion microbiome by flux balance analysis. Metab Eng. 2020;62:138–149. doi: 10.1016/j.ymben.2020.08.013. [DOI] [PubMed] [Google Scholar]
  • 12.Baskaran DKK, Umale S, Zhou Z, Raman K, Ananthara man K (2022) Metagenome-based metabolic modelling predicts unique microbial interactions in deep-sea hydrothermal plume microbiomes. Retrieved 2023-01-30, from (Pages: 2022.11.06.515352 Section: New Results). 10.1101/2022.11.06.515352
  • 13.Beck KL, Haiminen N, Chambliss D, Edlund S, Kunitomi M, Huang BC, Weimer BC. Monitoring the microbiome for food safety and quality using deep shotgun sequencing. NPJ Sci Food. 2021;5(1):3. doi: 10.1038/s41538-020-00083-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Beghini F, McIver LJ, Blanco-Míguez A, Dubois L, Asnicar F, Maharjan S, Segata N. Integrating taxonomic, functional, and strain-level profiling of diverse microbial communities with bioBakery. eLife. 2021;10:e65088. doi: 10.7554/eLife.65088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Berg G, Rybakova D, Fischer D, Cernava T, Vergès MCC, Charles T, Schloter M (2020) Microbiome definition re-visited: old concepts and new challenges (8) (1). BioMed Central Ltd. 10.1186/s40168-020-00875-0
  • 16.Berrios DC, Galazka J, Grigorev K, Gebre S, Costes SV. NASA GeneLab: interfaces for the exploration of space omics data. Nucleic Acids Res. 2021;49(D1):D1515–D1522. doi: 10.1093/nar/gkaa887. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Bissett A, Fitzgerald A, Meintjes T, Mele PM, Reith F, Dennis PG, Young A (2016) Introducing BASE: the biomes of Australian soil environments soil microbial diversity database, vol 5(1). BioMed Central Ltd. 10.1186/s13742-016-0126-5 [DOI] [PMC free article] [PubMed]
  • 18.Blanco-Miguez A, Beghini F, Cumbo F, McIver LJ, Thompson KN, Zolfo M, Segata N. Extending and improving metagenomic taxonomic profiling with uncharacterized species with MetaPhlAn 4. bioRxiv. 2022 doi: 10.1101/2022.08.22.504593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, Caporaso JG. Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol. 2019;37(8):852–857. doi: 10.1038/s41587-019-0209-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Borum AE. Microbiota and its importance in honey bees. Bee Stud Apicult Res Inst. 2021;13(1):23–30. doi: 10.51458/bstd.2021.14. [DOI] [Google Scholar]
  • 21.Campanaro S, Treu L, Kougias PG, De Francisci D, Valle G, Angelidaki I. Metagenomic analysis and functional characterization of the biogas microbiome using high throughput shotgun sequencing and a novel binning strategy. Biotechnol Biofuels. 2016;9(1):26. doi: 10.1186/s13068-016-0441-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Cani PD, Van Hul M, Lefort C, Depommier C, Rastelli M, Everard A. Microbial regulation of organismal energy homeostasis. Nat Metab. 2019;1(1):34–46. doi: 10.1038/s42255-018-0017-4. [DOI] [PubMed] [Google Scholar]
  • 23.Checinska Sielaff A, Urbaniak C, Mohan GBM, Stepanov VG, Tran Q, Wood JM, Venkateswaran K. Characterization of the total and viable bacterial and fungal communities associated with the International Space Station surfaces. Microbiome. 2019 doi: 10.1186/s40168-019-0666-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Chen D, He J, Li J, Zou Q, Si J, Guo Y, Shi H. Microbiome and metabolome analyses reveal novel inter play between the skin microbiota and plasma metabolites in psoriasis. Front Microbiol. 2021 doi: 10.3389/fmicb.2021.643449. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Contador CA, Veas-Castillo L, Tapia E, Antipán M, Miranda N, Ruiz-Tagle B, Asenjo JA, van Leeuwenhoek A. Atacama database: a platform of the microbiome of the Atacama Desert. Int J Gen Mol Microbiol. 2020;113(2):185–195. doi: 10.1007/s10482-019-01328-x. [DOI] [PubMed] [Google Scholar]
  • 26.Dai D, Zhu J, Sun C, Li M, Liu J, Wu S, Chen W-H. GMrepo v2: a curated human gut microbiome database with special focus on disease markers and cross-dataset comparison. Nucleic Acids Res. 2022;50(D1):D777–D784. doi: 10.1093/nar/gkab1019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Daisley BA, Reid G. BEExact: a metataxonomic database tool for high-resolution inference of bee-associated microbial communities. mSystems. 2021 doi: 10.1128/msystems.00082-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Danko D, Bezdan D, Afshin EE, Ahsanuddin S, Bhattacharya C, Butler DJ, Zubenko S. A global metagenomic map of urban microbiomes and antimicrobial resistance. Cell. 2021;184(13):3376–3393. doi: 10.1016/j.cell.2021.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Andersen GL. Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006;72(7):5069–5072. doi: 10.1128/AEM.03006-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Deshpande NP, Riordan SM, Castaño-Rodríguez N, Wilkins MR, Kaak-oush NO. Signatures within the esophageal micro biome are associated with host genetics, age, and disease. Microbiome. 2018;6(1):227. doi: 10.1186/s40168-018-0611-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Dick GJ. The microbiomes of deep-sea hydrothermal vents: distributed globally, shaped locally. Berlin: Nature Publishing Group; 2019. [DOI] [PubMed] [Google Scholar]
  • 32.Donia MS, Cimermancic P, Schulze CJ, Wieland Brown LC, Martin J, Mitreva M, Fischbach MA. A systematic analysis of biosynthetic gene clusters in the human microbiome reveals a common family of antibiotics. Cell. 2014;158(6):1402–1414. doi: 10.1016/j.cell.2014.08.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Dueholm MKD, Nierychlo M, Andersen KS, Rudkjøbing V, Knutsson S, Albertsen M, Nielsen PH. MiDAS 4: a global catalogue of full-length 16S rRNA gene sequences and taxonomy for studies of bacterial communities in wastewater treatment plants. Nat Commun. 2022;13(1):1908. doi: 10.1038/s41467-022-29438-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Dunivin TK, Choi J, Howe A, Shade A. RefSoil+: a reference database for genes and traits of soil plasmids. mSystems. 2019 doi: 10.1128/msystems.00349-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Engel P, Kwong WK, McFrederick Q, Anderson KE, Barribeau SM, Chandler JA, Dainat B. The bee microbiome: impact on bee health and model for evolution and ecology of host-microbe interactions. mBio. 2016 doi: 10.1128/mBio.02164-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Eric Wommack K, Bhavsar J, Polson SW, Chen J, Dumas M, Srinivasiah S, Nasko DJ. VIROME: a standard operating procedure for analysis of viral metagenome sequences. Stand Genom Sci. 2012;6(3):427–439. doi: 10.4056/sigs.2945050. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Escapa IF, Chen T, Huang Y, Gajare P, Dewhirst FE, Lemon KP. New insights into human nostril microbiome from the expanded human oral microbiome database (eHOMD): a resource for the microbiome of the human aerodigestive tract. mSystems. 2018 doi: 10.1128/mSystems.00187-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Esser D, Lange J, Marinos G, Sieber M, Best L, Prasse D, Sommer F. Functions of the microbiota for the physiology of animal metaorganisms. Basel: S. Karger AG; 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Fettweis JM, Serrano MG, Sheth NU, Mayer CM, Glascock AL, Brooks JP, Vaginal Microbiome Consortium (additional members) Species-level classification of the vaginal microbiome. BMC Genom. 2012;13(8):S17. doi: 10.1186/1471-2164-13-S8-S17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Fitzpatrick CR, Lu-Irving P, Copeland J, Guttman DS, Wang PW, Baltrus DA, Johnson MT. Chloroplast sequence variation and the efficacy of peptide nucleic acids for blocking host amplification in plant microbiome studies. Microbiome. 2018 doi: 10.1186/s40168-018-0534-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Geistlinger L, Mirzayi C, Zohra F, Azhar R, Elsafoury S, Grieve C, Waldron L. BugSigDB: accelerating microbiome research through systematic comparison to published microbial signatures. medRxiv. 2022 doi: 10.1101/2022.10.24.22281483. [DOI] [Google Scholar]
  • 42.Gilbert JA, Blaser MJ, Caporaso JG, Jansson JK, Lynch SV, Knight R. Current understanding of the human microbiome. Nat Med. 2018;24(4):392–400. doi: 10.1038/nm.4517. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Gilbert JA, Jansson JK, Knight R. The earth microbiome project: successes and aspirations. London: BioMed Central Ltd; 2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Gonzalez A, Navas-Molina JA, Kosciolek T, McDonald D, Vázquez-Baeza Y, Ackermann G, Knight R. Qiita: rapid, web-enabled microbiome meta-analysis. Nat Methods. 2018;15(10):796–798. doi: 10.1038/s41592-018-0141-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Gou W, Chen Y, Pan A, Wang H, Zhang K, Pan X-F, Zheng J-S. Westlake gut project: a consortium of microbiome epidemiology for the gut microbiome and health research in China. Med Microecol. 2022;14:100064. doi: 10.1016/j.medmic.2022.100064. [DOI] [Google Scholar]
  • 46.Heinken A, Basile A, Thiele I. Advances in constraint-based modelling of microbial communities. Curr Opin Syst Biol. 2021;27:100346. doi: 10.1016/j.coisb.2021.05.007. [DOI] [Google Scholar]
  • 47.Hu R, Yao R, Li L, Xu Y, Lei B, Tang G, Hu S. A database of animal metagenomes. Sci Data. 2022 doi: 10.1038/s41597-022-01444-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Huggett MJ, Apprill A. Coral microbiome database: integration of sequences reveals high diversity and relatedness of coral-associated microbes. Environ Microbiol Rep. 2019;11(3):372–385. doi: 10.1111/1758-2229.12686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Janssens Y, Nielandt J, Bronselaer A, Debunne N, Verbeke F, Wynendaele E, De Spiegeleer B. Disbiome database: linking the microbiome to disease. BMC Microbiol. 2018;18(1):50. doi: 10.1186/s12866-018-1197-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Keegan KP, Glass EM, Meyer F. MG-RAST, a metagenomics service for analysis of microbial community structure and function. Methods in molecular biology. Totowa: Humana Press Inc; 2016. pp. 207–233. [DOI] [PubMed] [Google Scholar]
  • 51.Khalil A, Batool A, Arif S. Healthy cattle microbiome and dysbiosis in diseased phenotypes. Ruminants. 2022;2(1):134–156. doi: 10.3390/ruminants2010009. [DOI] [Google Scholar]
  • 52.Kisuse J, Laongkham O, Nakphaichit M, Therdtatha P, Momoda R, Tanaka M, Nakayama J. Urban diets linked to gut microbiome and metabolome alterations in children: a comparative cross-sectional study in Thailand. Front Microbiol. 2018 doi: 10.3389/fmicb.2018.01345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Kumar RK, Singh NK, Balakrishnan S, Parker CW, Raman K, Venkateswaran K. Metabolic modeling of the international space station microbiome reveals key microbial interactions. Microbiome. 2022;10(1):102. doi: 10.1186/s40168-022-01279-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Kumbhare SV, Patangia DV, Patil RH, Shouche YS, Patil NP. Factors influencing the gut microbiome in children: from infancy to childhood. J Biosci. 2019;44(2):49. doi: 10.1007/s12038-019-9860-z. [DOI] [PubMed] [Google Scholar]
  • 55.Kyrpides NC, Eloe-Fadrosh EA, Ivanova NN. Microbiome data science: understanding our microbial planet. New York: Elsevier Ltd; 2016. [DOI] [PubMed] [Google Scholar]
  • 56.Lagkouvardos I, Joseph D, Kapfhammer M, Giritli S, Horn M, Haller D, Clavel T. IMNGS: a comprehensive open resource of processed 16S rRNA microbial profiles for ecology and diversity studies. Sci Rep. 2016 doi: 10.1038/srep33721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.LaMartina EL, Mohaimani AA, Newton RJ. Urban wastewater bacterial communities assemble into seasonal steady states. Microbiome. 2021;9(1):116. doi: 10.1186/s40168-021-01038-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Lavrinienko A, Mappes T, Tukalenko E, Mousseau TA, Møller AP, Knight R, Watts PC. Environmental radiation alters the gut microbiome of the bank vole Myodes glareolus. ISME J. 2018;12(11):2801–2806. doi: 10.1038/s41396-018-0214-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Lax S, Smith DP, Hampton-Marcell J, Owens SM, Handley KM, Scott NM, Gilbert JA. Longitudinal analysis of microbial interaction between humans and the indoor environment. Science. 2014;345(6200):1048–1052. doi: 10.1126/science.1254529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Lei B, Xu Y, Lei Y, Li C, Zhou P, Wang L, Hu S. CRAMdb: a comprehensive database for composition and roles of microbiome in animals. Nucl Acids Res. 2022 doi: 10.1093/nar/gkac973. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Lei X, Wang Y. Predicting microbe-disease association by learning graph representations and rule-based inference on the heterogeneous network. Front Microbiol. 2020;11:579. doi: 10.3389/fmicb.2020.00579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Levy A, Salas Gonzalez I, Mittelviefhaus M, Clingenpeel S, Herrera Paredes S, Miao J, Dangl JL. Genomic features of bacterial adaptation to plants. Nat Genet. 2018;50(1):138–150. doi: 10.1038/s41588-017-0012-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Li J, Jia H, Cai X, Zhong H, Feng Q, Sunagawa S, Wang J. An integrated catalog of reference genes in the human gut microbiome. Nat Biotechnol. 2014;32(8):834–841. doi: 10.1038/nbt.2942. [DOI] [PubMed] [Google Scholar]
  • 64.Li L, Jing Q, Yan S, Liu X, Sun Y, Zhu D, Xue D. Amadis: a comprehensive database for association between microbiota and disease. Front Physiol. 2021 doi: 10.3389/fphys.2021.697059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Lian J, Wijffels RH, Smidt H, Sipkema D. The effect of the algal microbiome on industrial production of microalgae. New York: Wiley; 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Ling N, Wang T, Kuzyakov Y. Rhizosphere bacteriome structure and functions. Nat Commun. 2022 doi: 10.1038/s41467-022-28448-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Lloyd-Price J, Arze C, Ananthakrishnan AN, Schirmer M, Avila-Pacheco J, Poon TW, Huttenhower C. Multiomics of the gut microbial ecosystem in inflammatory bowel diseases. Nature. 2019;569(7758):655–662. doi: 10.1038/s41586-019-1237-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Lloyd-Price J, Mahurkar A, Rahnavard G, Crabtree J, Orvis J, Hall AB, Huttenhower C. Strains, functions and dynamics in the expanded Human Microbiome Project. Nature. 2017;550(7674):61–66. doi: 10.1038/nature23889. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Lobanov V, Gobet A, Joyce A. Ecosystem-specific microbiota and microbiome databases in the era of big data. Environ Microbiome. 2022;17(1):37. doi: 10.1186/s40793-022-00433-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Lopera-Maya EA, Kurilshikov A, van der Graaf A, Hu S, Andreu-Sánchez S, Chen L, Sanna S. Effect of host genetics on the gut microbiome in 7738 participants of the Dutch Microbiome Project. Nat Genet. 2022;54(2):143–151. doi: 10.1038/s41588-021-00992-y. [DOI] [PubMed] [Google Scholar]
  • 71.Lucaciu R, Pelikan C, Gerner SM, Zioutis C, Köstlbacher S, Marx H, Rattei T (2019) A bioinformatics guide to plant microbiome analysis, vol 10. Front Media S.A. 10.3389/fpls.2019.01313 [DOI] [PMC free article] [PubMed]
  • 72.Machado D, Maistrenko OM, Andrejev S, Kim Y, Bork P, Patil KR, Patil KR. Polarization of microbial communities between competitive and cooperative metabolism. Nat Ecol Evol. 2021;5(2):195–203. doi: 10.1038/s41559-020-01353-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Mahnert A, Vaishampayan P, Probst AJ, Auerbach A, Moissl-Eichinger C, Venkateswaran K, Berg G. Cleanroom maintenance significantly reduces abundance but not diversity of indoor microbiomes. PLoS One. 2015 doi: 10.1371/journal.pone.0134848. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Marcos-Zambrano LJ, Karaduzovic-Hadziabdic K, Loncar Turukalo T, Przymus P, Trajkovik V, Aasmets O, Truu J. Applications of machine learning in human microbiome studies: a review on feature selection, biomarker identification, disease prediction and treatment. Front Microbiol. 2021 doi: 10.3389/fmicb.2021.634511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Marx DH (1972) ECTOMYCORRHIZAE AS BIOLOGICAL DETERRENTS 3558 TO PATHOGENIC ROOT INFECTIONS (Tech. Rep.). Retrieved from www.annualreviews.org [DOI] [PubMed]
  • 76.McFarland LV, Evans CT, Goldstein EJC. Strain-specificity and disease-specificity of probiotic efficacy: a systematic review and meta-analysis. Front Med. 2018 doi: 10.3389/fmed.2018.00124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Merino N, Aronson HS, Bojanova DP, Feyhl-Buska J, Wong ML, Zhang S, Giovannelli D. Living at the extremes: extremophiles and the limits of life in a planetary context. Front Microbiol. 2019 doi: 10.3389/fmicb.2019.00780. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Mitchell AL, Almeida A, Beracochea M, Boland M, Burgin J, Cochrane G, Finn RD. MGnify: the microbiome analysis resource in 2020. Nucleic Acids Res. 2020;48(D1):D570–D578. doi: 10.1093/nar/gkz1035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Moosa Y, Kwon D, de Oliveira T, Wong EB. Determinants of vaginal microbiota composition. Front Cell Infect Microbiol. 2020 doi: 10.3389/fcimb.2020.00467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Moskovicz V, Gross A, Mizrahi B. Extrinsic factors shaping the skin microbiome. Microorganisms. 2020;8(7):1023. doi: 10.3390/microorganisms8071023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Nilsson RH, Larsson KH, Taylor AF, Bengtsson-Palme J, Jeppesen TS, Schigel D, Abarenkov K. The UNITE database for molecular identification of fungi: handling dark taxa and parallel taxonomic classifications. Nucleic Acids Res. 2019;47(D1):D259–D264. doi: 10.1093/nar/gky1022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Oliveira FS, Brestelli J, Cade S, Zheng J, Iodice J, Fischer S, Beiting DP. MicrobiomeDB: a systems biology platform for integrating, mining and analyzing microbiome experiments. Nucleic Acids Res. 2018;46(D1):D684–D691. doi: 10.1093/nar/gkx1027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Parente E, Zotta T, Ricciardi A. FoodMicrobionet v4: a large, integrated, open and transparent database for food bacterial communities. Int J Food Microbiol. 2022;372:109696. doi: 10.1016/j.ijfoodmicro.2022.109696. [DOI] [PubMed] [Google Scholar]
  • 84.Peixoto RS, Rosado PM, Leite DCdA, Rosado AS, Bourne DG (2017) Beneficial microorganisms for corals (BMC): Proposed mech anisms for coral health and resilience, vol 8 (No. MAR). Frontiers Media S.A. 10.3389/fmicb.2017.00341 [DOI] [PMC free article] [PubMed]
  • 85.Qin J, Li R, Raes J, Arumugam M, Burgdorf KS, Manichanh C, Wang J. A human gut microbial gene catalogue established by metagenomic sequencing. Nature. 2010;464(7285):59–65. doi: 10.1038/nature08821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Quast C, Pruesse E, Yilmaz P, Gerken J, Schweer T, Yarza P, Glöckner FO. The SILVA ribosomal RNA gene database project: improved data processing and web-based tools. Nucleic Acids Res. 2013 doi: 10.1093/nar/gks1219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Rampelotto PH, Sereia AF, De Oliveira LFV, Margis R. Exploring the hospital microbiome by high-resolution 16S rRNA profiling. Int J Mol Sci. 2019 doi: 10.3390/ijms20123099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Ray P, Lakshmanan V, Labbé JL, Craven KD (2020) Microbe to microbiome: a paradigm shift in the application of microorganisms for sustainable agriculture, vol 11. Frontiers Media S.A. 10.3389/fmicb.2020.622926 [DOI] [PMC free article] [PubMed]
  • 89.Raymann K, Moran NA. The role of the gut microbiome in health and disease of adult honey bee workers. New York: Elsevier Inc; 2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Salazar C, Giménez M, Riera N, Parada A, Puig J, Galiana A, Iraola G. Human microbiota drives hospital-associated antimicrobial resistance dissemination in the urban environment and mirrors patient case rates. Microbiome. 2022;10(1):208. doi: 10.1186/s40168-022-01407-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Schirmer M, Smeekens SP, Vlamakis H, Jaeger M, Oosting M, Franzosa EA, Xavier RJ. Linking the human gut microbiome to inflammatory cytokine production capacity. Cell. 2016;167(4):1125–1136. doi: 10.1016/j.cell.2016.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, Weber CF. Introducing mothur: open-source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. 2009;75(23):7537–7541. doi: 10.1128/AEM.01541-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Scott RT, Grigorev K, Mackintosh G, Gebre SG, Mason CE, Del Alto ME, Costes SV. Advancing the integration of biosciences data sharing to further enable space exploration. New York: Elsevier B.V; 2020. [DOI] [PubMed] [Google Scholar]
  • 94.Singh BK, Trivedi P. Microbiome and the future for food and nutrient security. Microb Biotechnol. 2017;10(1):50–53. doi: 10.1111/1751-7915.12592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Singh BP. CpGDB: a comprehensive database of chloroplast genomes. Bioinformation. 2020;16(2):171–175. doi: 10.6026/97320630016171. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Singh NK, Wood JM, Karouia F, Venkateswaran K. Succession and persistence of microbial communities and antimicrobial resistance genes associated with International Space Station environmental surfaces. Microbiome. 2018 doi: 10.1186/s40168-018-0585-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Tanaka M, Korenori Y, Washio M, Kobayashi T, Momoda R, Kiyohara C, Nakayama J. Signatures in the gut micro-biota of Japanese infants who developed food allergies in early childhood. FEMS Microbiol Ecol. 2017;93(8):fix099. doi: 10.1093/femsec/fix099. [DOI] [PubMed] [Google Scholar]
  • 98.Thompson LR, Sanders JG, McDonald D, Amir A, Ladau J, Locey KJ, Zhao H. A communal catalogue reveals Earth’s multiscale microbial diversity. Nature. 2017;551(7681):457–463. doi: 10.1038/nature24621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.Thursby E, Juge N. Introduction to the human gut microbiota. London: Portland Press Ltd.; 2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Tighe S, Afshinnekoo E, Rock TM, McGrath K, Alexander N, McIntyre A, Mason CE. Genomic methods and microbiological technologies for profiling novel and extreme environments for the extreme microbiome project (XMP) J Biomol Tech. 2017;28(1):31–39. doi: 10.7171/jbt.17-2801-004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Trivedi P, Leach JE, Tringe SG, Sa T, Singh BK (2020). Plant–microbiome interactions: from community assembly to plant health, vol 18(11). Nature Research. 10.1038/s41579-020-0412-1 [DOI] [PubMed]
  • 102.Turner TR, James EK, Poole PS. The plant microbiome. Genome Biol. 2013 doi: 10.1186/gb-2013-14-6-209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Urbaniak C, Morrison MD, Thissen JB, Karouia F, Smith DJ, Mehta S, Venkateswaran K. Microbial tracking-2, a metagenomics analysis of bacteria and fungi onboard the International Space Station. Microbiome. 2022 doi: 10.1186/s40168-022-01293-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.van den Bogert B, Boekhorst J, Pirovano W, May A. On the role of bioinformatics and data science in industrial microbiome applications. Front Genet. 2019 doi: 10.3389/fgene.2019.00721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Vangay P, Fugett EB, Sun Q, Wiedmann M. Food microbe tracker: a web-based tool for storage and comparison of food-associated microbes. J Food Prot. 2013;76(2):283–294. doi: 10.4315/0362-028X.JFP-12-276. [DOI] [PubMed] [Google Scholar]
  • 106.van Oppen MJ, Blackall LL. Coral microbiome dynamics, functions and design in a changing world. Berlin: Nature Publishing Group; 2019. [DOI] [PubMed] [Google Scholar]
  • 107.Vives-Peris V, de Ollas C, Gómez-Cadenas A, Pérez-Clemente RM. Root exudates: from plant to rhizosphere and beyond. Berlin: Springer; 2020. [DOI] [PubMed] [Google Scholar]
  • 108.Wang J, Wang T, Zeng X, Wang S, Yu Z, Wei Y, Zhao Y. Database of space life investigations and bioinformatics of microbiology in extreme environments. Front Microbiol. 2022 doi: 10.3389/fmicb.2022.1017773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Wegl G, Grabner N, Köstelbauer A, Klose V, Ghanbari M. Toward best practice in livestock microbiota research: a comprehensive comparison of sample storage and DNA extraction strategies. Front Microbiol. 2021 doi: 10.3389/fmicb.2021.627539. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.Westwood J, Burnett M, Spratt D, Ball M, Wilson DJ, Wellsteed S, Gilbert J (2014) The hospital microbiome project: meeting report for the UK science and innovation network UK-USA workshop ‘beating the superbugs: Hospital microbiome studies for tackling antimicrobial resistance’, October 14th 2013. Standards in Genomic Sciences, 9 (1). 10.1186/1944-3277-9-12
  • 111.Xu Y, Lei B, Zhang Q, Lei Y, Li C, Li X, Hu S. ADDAGMA: a database for domestic animal gut microbiome atlas. Comput Struct Biotechnol J. 2022;20:891–898. doi: 10.1016/j.csbj.2022.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Xue W, Li JJ, Zou Y, Zou B, Wei L. Microbiota and ocular diseases. Front Cell Infect Microbiol. 2021;11:759333. doi: 10.3389/fcimb.2021.759333. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Yang J, Park J, Jung Y, Chun J. AMDB: a database of animal gut microbial communities with manually curated metadata. Nucleic Acids Res. 2022;50(D1):D729–D735. doi: 10.1093/nar/gkab1009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Young VB. The role of the microbiome in human health and disease: an introduction for clinicians. Berlin: BMJ Publishing Group; 2017. [DOI] [PubMed] [Google Scholar]
  • 115.Zhang C, Yin A, Li H, Wang R, Wu G, Shen J, Zhao L. Dietary modulation of gut microbiota contributes to alleviation of both genetic and simple obesity in children. EBioMedicine. 2015;2(8):968–984. doi: 10.1016/j.ebiom.2015.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Zhang W, Cao S, Ding W, Wang M, Fan S, Yang B, Zhang YZ. Structure and function of the Arctic and Antarctic marine microbiota as revealed by metagenomics. Microbiome. 2020 doi: 10.1186/s40168-020-00826-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Zheng D, Liwinski T, Elinav E. Interaction between micro-biota and immunity in health and disease. Cell Res. 2020;30(6):492–506. doi: 10.1038/s41422-020-0332-7. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of the Indian Institute of Science are provided here courtesy of Nature Publishing Group

RESOURCES