Skip to main content
ISME Communications logoLink to ISME Communications
. 2022 Feb 1;2:9. doi: 10.1038/s43705-022-00092-w

MIxS-SA: a MIxS extension defining the minimum information standard for sequence data from symbiont-associated micro-organisms

Fátima Jorge 1,, Jaelle C Brealey 2, Paul J Brindley 3, Marie Buysse 4,5, Cinzia Cantacessi 6, Olivier Duron 4,5, Raina Fichorova 7, Connor R Fitzpatrick 8, Megan Hahn 9, Christopher Hunter 10, Vincent Hervé 11, Laura J Knoll 12, Kevin D Kohl 13, Marco Lalle 14, Julius Lukeš 15, Joaquín Martínez Martínez 16, Susan L Perkins 17, Robert Poulin 1, Karyna Rosario 18, Adam C Schneider 19, Lynn M Schriml 20, Luke R Thompson 21,22, Ramona L Walls 23, Nolwenn M Dheilly 24,25,
PMCID: PMC9723553  PMID: 37938691

Abstract

The symbiont-associated (SA) environmental package is a new extension to the minimum information about any (x) sequence (MIxS) standards, established by the Parasite Microbiome Project (PMP) consortium, in collaboration with the Genomics Standard Consortium. The SA was built upon the host-associated MIxS standard, but reflects the nestedness of symbiont-associated microbiota within and across host-symbiont-microbe interactions. This package is designed to facilitate the collection and reporting of a broad range of metadata information that apply to symbionts such as life history traits, association with one or multiple host organisms, or the nature of host-symbiont interactions along the mutualism-parasitism continuum. To better reflect the inherent nestedness of all biological systems, we present a novel feature that allows users to co-localize samples, to nest a package within another package, and to identify replicates. Adoption of the MIxS-SA and of the new terms will facilitate reports of complex sampling design from a myriad of environments.

Subject terms: Microbiome, Genomics

Introduction

Interspecific interactions are ubiquitous across the Tree of Life. With the realization that eukaryotic organisms can harbor rich microbial communities, came also the view that these smaller partners may in fact play important roles in mediating host-symbiont associations, thus adding a further layer to this complex set of nested interactions, i.e. host-symbiont-microbe [18]. As the number of studies exploring the microorganisms associated with symbiotic organisms increases, likewise does the need for compliant standardized metadata that provides contextual information associated with each study and sample. Standardized metadata allows for the integration of data across organisms, resources, and within data repositories. Here, we present the symbiont-associated (SA) environmental package as a new extension to the minimum information about any (x) sequence (MIxS) standards [9], which will be included in MIxS version 6. Whilst the MIxS-SA expands upon the MIxS host-associated environmental package [9], it reflects the need for a new standard that takes into account the distinct life history traits of symbionts, their association with one or multiple host organisms, the complex nature of host-symbiont interactions along the mutualism-parasitism continuum, and the nestedness of symbiont-associated microbiota. We also propose adding the term ‘relationship to other packages’ to all environmental packages across the domains of life, to allow for integrated analysis of symbiont and host microbiota by linking metadata elements across environmental packages. This will allow users to nest a package within another package, and to identify replicates. This added feature is pivotal for the study of the microbiome of symbionts that are themselves nested within a host, reflects the inherent nestedness of all ecosystems and will facilitate reports of complex sampling design from a myriad of environments.

Collecting relevant metadata (data describing data) is now widely recognized as critical to contextualize samples and increase their reusability and reproducibility [911]. The Genomic Standards Consortium (GSC, https://gensc.org) has developed and maintains a suite of minimal information metadata standards for describing sequence metadata (checklists) for genome (MIGS), metagenome (MIMS), marker gene sequences (MIMARKS), simple amplified genome (MISAG), metagenome-assembled genome (MIMAG), virus genomes (MIUViG) and environmental packages for describing habitat-specific contextual data of the sampling environment [9, 10, 12, 13], collectively referred to as the Minimum Information about any (x) Sequence (MIxS) standard (ref. [9], https://gensc.org/mixs/).

The MIxS standards are used broadly across the microbiome research communities. These standards have been integrated into large scale microbiome projects (e.g. Human Microbiome Project, https://www.hmpdacc.org/), Earth Microbiome Project (https://earthmicrobiome.org/), Microbiology of the Built Environment (MoBE, https://www.microbe.net), microbiome bioinformatics platforms (e.g., QIIME, Qiita, mothur, JGI GOLD, MG-RAST, EBI, NCBI) and are now required upon manuscript submission. A primary advantage of the MIxS standards is the collation of large aggregates of associated metadata that can be harnessed to uncover, and eventually comprehend, patterns of microbial diversity and ecology.

The MIxS-SA package was initially drafted during the 1st Parasite Microbiome project workshop that involved the contribution of members of the GSC in addition to microbial ecologists, parasitologists, pathologists and marine biologists [14]. Participants rapidly identified the need to incorporate information on the nestedness of symbiont-associated systems, and the absence within the MIxS host-associated package of descriptors of complex life histories of mutualistic and parasitic symbionts. Until now, researchers have either omitted this information or added research-specific symbiont-associated annotations, limiting significantly the potential to compare, combine and/or reuse data from different systems and studies. Whereas the MIxS-SA package was initially designed for the study of parasite-microbes interaction, the scope of the package was expanded to include non-parasitic symbionts. This addition is a necessary expansion due to the context-dependent nature of symbiotic interactions and the ability of a given symbiont to interact differently with different organisms. Notably, the resulting MIxS-SA package reduces the need to develop additional highly similar packages for different types of symbionts.

Symbiotic associations are generally classified as mutualistic (mutually beneficial association), commensal (beneficial association to one of the partners, but not harmful to the other), or parasitic (detrimental association to one of the partners) [15]. In the context of the symbiont-associated package, the term symbiont applies to macro and microorganisms that can establish a physical interaction with at least one other organism at some stage of their life cycle regardless of the nature and dependence of the interaction. As such, this definition also covers symbiotic organisms that establish facultative and accidental associations (e.g., dead-end hosts), not requiring evolutionary processes to explain their association, but excludes free-living organisms that establish a symbiotic relationship with another free-living organism (e.g., flowers and bees). The MIxS-SA package presented herein has gone through an open and iterative review process engaging the GSC community and experts studying symbiotic organisms across various symbiont and host taxa.

Here, we present the selected list of metadata descriptors for symbiont-associated microbiota studies, including a subset of mandatory (M) terms that underpin metadata compliance (Table 1; Supplementary Information SI-1 contains all MIxS-SA items). In order to allow comparative studies of the microbiota of, sometimes closely related, free-living and symbiotic organisms, the MIxS-SA includes terms already found in the MIxS host-associated package. Thus, in MixS-SA, the term “host” (when used alone) refers to the host of the biological sample which is the symbiotic organism. New terms were created to characterise the “host of the symbiotic host”. We provide symbiont-associated package specific “Expected values” and “Examples”. Changes to the package (addition of terms, modification etc.) can be proposed by the community by creating a ticket on the MIxS GitHub page (https://github.com/GenomicsStandardsConsortium/mixs).

Table 1.

MIxS symbiont-associated environmental package representative terms, along with requirement status, description and MIXS IDs.

MIxS Package Metadata category Package item Req Definition MIXS ID
Symbiont-associated Symbiont specific descriptors host dependence M Type of host dependence for the symbiotic host organism to its host. 0001315
type of symbiosis C Type of biological interaction established between the symbiotic host organism being sampled and its respective host. 0001307
symbiotic host organism life cycle type M Type of life cycle of the symbiotic host species (the thing being sampled). Simple life cycles occur within a single host, complex ones within multiple different hosts over the course of their normal life cycle. 0001300
host life stage M Description of life stage of host. 0000251
mode of transmission C The process through which the symbiotic host organism entered the host from which it was sampled. 0001312
route of transmission O Description of path taken by the symbiotic host organism being sampled in order to establish a symbiotic relationship with the host (with which it was observed at the time of sampling) via a mode of transmission (specified in mode_transmission). 0001316
host number individual O Number of symbiotic host individuals pooled at the time of collection. 0001305
Symbiont – host relationship descriptors observed host symbionts O The taxonomic name of the organism(s) found living in mutualistic, commensalistic, or parasitic symbiosis with the specific host. For cases when the specific host of the sample is a symbiont this field should refer to other organisms it is associated with: e.g.: hyperparasite species X (parasite of the parasite). 0001309
host specificity C Level of specificity of symbiont-host interaction: e.g. generalist (symbiont able to establish associations with distantly related hosts) or species-specific. 0001308
host of the symbiont role C Role of the host in the life cycle of the symbiotic organism. 0001303
host cellular location C The localization of the symbiotic host organism within the host from which it was sampled: e.g., intracellular if the symbiotic host organism is localized within the cells or extracellular if the symbiotic host organism is localized outside of cells. 0001313
duration of association with the host O Time spent in host of the symbiotic organism at the time of sampling; relevant scale depends on symbiotic organism and study. 0001299
observed coinfecting organisms in host of host O The taxonomic name of any coinfecting organism observed in a symbiotic relationship with the host of the sampled host organism. e.g. where a sample collected from a host trematode species (A) which was collected from a host_of_host fish (B) that was also infected with a nematode (C), the value here would be (C) the nematode {species name} or {common name}. Multiple coinfecting species may be added in a comma-separated list. For listing symbiotic organisms associated with the host (A) use the term Observed host symbiont. 0001310
Host of the symbiont descriptors host of the symbiotic host common name O Common name of the host of the symbiotic host organism. 0001324
host of the symbiotic host local environmental context O For a symbiotic host organism the local anatomical environment within its host may have causal influences. Report the anatomical entity(s) which are in the direct environment of the symbiotic host organism being sampled and which you believe have significant causal influences on your sample or specimen. For example, if the symbiotic host organism being sampled is an intestinal worm, its local environmental context will be the term for intestine from UBERON (http://uberon.github.io/). 0001325
host of the symbiotic host environmental medium O Report the environmental material(s) immediately surrounding the symbiotic host organism at the time of sampling. This usually will be a tissue or substance type from the host, but may be another material if the symbiont is external to the host. We recommend using classes from the UBERON ontology, but subclasses of ‘environmental material’ (http://purl.obolibrary.org/obo/ENVO_00010483) may also be used. EnvO documentation about how to use the field: https://github.com/EnvironmentOntology/envo/wiki/Using-ENVO-with-MIxS. Terms from other OBO ontologies are permissible as long as they reference mass/volume nouns (e.g., air, water, blood) and not discrete, countable entities (e.g., intestines, heart). 0001326
host of the symbiotic host taxon id O NCBI taxon id of the host of the symbiotic host organism. 0001306
host of the symbiotic host subject id O A unique identifier by which each host of the symbiotic host organism subject can be referred to, de-identified, e.g. #H14. 0001327
Core Relationship to other samplesa C indicates the direct relationship with another sample from the same Bioproject. Accepted terms are: ‘technical replicate of’, ‘after’, ‘before’, ‘next to’, ‘within’, and ‘contains’. Can be repeated to reveal relationship to many samples. TBD
Biotic relationship C Description of relationship(s) between the subject organism and other organism(s) it is associated with. E.g., parasite on species X; mutualist with species Y. The target organism is the subject of the relationship, and the other organism(s) is the object. 0000028
Otherb Observed host symbionts O The taxonomic name of the organism(s) found living in mutualistic, commensalistic, or parasitic symbiosis with the specific host. 0001309

Req requirements, O optional, C conditional, M mandatory, TBD to be determined.

aTerm may be slightly amended when included to all packages.

bTerm added to host-associated, human-associated, plant-associated, human-vaginal, human-skin, human-oral, and human-gut packages.

Given the diversity of symbiotic interactions and that the nature and dependence of such interactions can be context-dependent rather than a fixed trait, it was crucial to define terms and provide value syntax that were inclusive for diverse types of symbioses and also across the symbiont life histories and transmission processes. For example, the term “host dependence” (a mandatory item) and “type of symbiosis” (a conditional item) are discrete but complementary items. While “host dependence” aims to provide a general characterization of the known type of host dependence for the symbiotic organism (e.g., facultative), “type of symbiosis” was specifically designed to further characterize the type of biological interaction established between the symbiotic organism and its respective host at the moment the biological sample was taken (e.g., mutualistic). As a result, the MIxS-SA package features mandatory and conditionally mandatory, and optional features that enable flexibility according to the knowledge of the study system at the time of sampling. Two examples of MIxS-SA-compliant metadata are provided in Supplementary information (SI-2), and the respective study designs are presented in Fig. 1. The examples refer to 16 S rRNA gene studies of (a) the bacterial communities of the parasite Coitocaecum parvum, a trematode, across four of its life stages: the sporocyst, the metacercaria and the adult, as well as the free-living cercaria [16], and (b) of the leaves and roots of the parasitic plant Orobanche hederae and its ivy host, Hedera spp. [17].

Fig. 1. Examples of study design for the sampling of microbes of symbiotic organisms, their hosts and environment.

Fig. 1

a Trematode Coitocaecum parvum different life stages (S1, S2, S3, S4) are reported with the MIxS-SA package. The microbiome of infected snail (H1), amphipod (H2) and fish (H3) hosts are reported with the MIxS host-associated package. In addition, the microbiome of environmental sediment (E1) and water (E2) from which these organisms were collected can be reported with MIxS-sediment and MIxS-water, respectively. The following relationships are reported: S1 “within” H1, S2 “within” E2, S3 “within” H2, S4 “within” H3, H1 “next to” E1, H1 “within” E2, H2 “within” E2, H3 “within” E2, E1 “next to” E2. b Angiosperm Orobanche hederae (S5, S6) parasitizing a host plant (P1, P2, P3) is reported using the included MIxS-SA and MIxS-PA (plant-associated) packages respectively. In addition, the MIxS-soil package is used to report corresponding soil samples. The following relationships are reported: S6 ‘within’ P1, S5 ‘next to’ S6, P1 ‘next to’ P2, P1 ‘next to’ P3, P2 ‘next to’ P3, P1 ‘within’ E3, P3 ‘within’ E4, E3 ‘next to’ E4.

While identical terms are often used in several of the 17 environmental packages currently available (https://gensc.org/mixs/), here we introduce three additional new terms: one is shared by several relevant MIxS environmental packages, and the two others will feature within the core MIxS package. The new term “observed host symbionts” provides a more comprehensive descriptor for the subject organism associations with smaller symbionts and it has been added to the host-associated, human-associated, plant-associated, human-vaginal, human-skin, human-oral and human-gut packages. The term “biotic relationship” has been added to the core package as a conditional descriptor of the relationship between the subject organism and other larger host organism(s). Finally, it appears necessary to include in the MIxS core a new term that takes into account the nested feature of most associations found in nature, such as host-symbiont-microorganism, in which multiple packages are necessary to describe the samples of the study (e.g., water, sediment, host-associated, and symbiont-associated). The proposed term “relationship to other samples” indicates the direct relationship between two samples from the same Bioproject, that are described in different environmental package(s). This proposed feature, still under development, will allow for integrated analyses of the microbiota of symbiotic organisms and their direct environment, even in the context of co-infections (e.g., symbiont-associated SA1234 is “within” host-associated sample HA8974, “next to” symbiont-associated sample SA7890). This feature will also benefit other studies by providing ecologically-relevant contextual information (e.g., host-associated HA2567 is “within” environmental water sample W1234, “next to” host-associated sample HA5679, ‘next’ to environmental soil sample S5897). In conclusion, it is our hope that the MIxS-SA, together with the new terms, will enable researchers to better conduct integrated analyses of multi-level biological systems with the ultimate goal of better understanding the role of microbes associated with symbionts.

Supplementary information

Table S1 (23.9KB, xlsx)
Table S3 (23.7KB, xlsx)

Acknowledgements

This contribution is part of the Parasite Microbiome Project. The authors sincerely thank the Gordon and Betty Moore Foundation for fully sponsoring the 1st Parasite Microbiome project workshop, and for supporting Open Access publishing. FJ was funded by a grant from the Marsden Fund (Royal Society of New Zealand) to RP (Principal Investigator) and NMD (Associate Investigator). The National Institutes of Health grants RO1CA164719 (PJB), and R01AI144016-01(LJK). VH was supported by the FEDER grant InFoBioS n°EX011185 (région Val de Loire). JMM acknowledges support from award OCE-1933285, National Science Foundation, USA. ML is part of the PARADISE consortium, supported by funding from the European Union’s Horizon 2020 Research and Innovation programme under grant agreement No 773830: One Health European Joint Programme. JL was supported by the ERD funds (16_019/0000759).

Author contributions

NMD conceptualized the MIxS package. All authors contributed to the writing and design of the MIxS-SA package. FJ and NMD prepared the manuscript. All authors agreed with the final version of the package and manuscript.

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Contributor Information

Fátima Jorge, Email: fatima.esperancajorge@otago.ac.nz.

Nolwenn M. Dheilly, Email: nolwenn.dheilly@anses.fr

Supplementary information

The online version contains supplementary material available at 10.1038/s43705-022-00092-w.

References

  • 1.Dheilly NM, Poulin R, Thomas F. Biological warfare: microorganisms as drivers of host-parasite interactions. Infect Genet Evol. 2015;34:251–9. doi: 10.1016/j.meegid.2015.05.027. [DOI] [PubMed] [Google Scholar]
  • 2.Dheilly NM, Ewald PW, Brindley PJ, Fichorova RN, Thomas F. Parasite-microbe-host interactions and cancer risk. PLoS Pathog. 2019;15:e1007912. doi: 10.1371/journal.ppat.1007912. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Bass D, Stentiford GD, Wang HC, Koskella B, Tyler CR. The pathobiome in animal and plant diseases. Trends Ecol Evol. 2019;34:996–1008. doi: 10.1016/j.tree.2019.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Brinker P, Fontaine MC, Beukeboom LW, Salles JF. Host, symbionts, and the microbiome: the missing tripartite interaction. Trends Microbiol. 2019;27:480–8. doi: 10.1016/j.tim.2019.02.002. [DOI] [PubMed] [Google Scholar]
  • 5.Barrow P, Dujardin JC, Fasel N, Greenwood AD, Osterrieder K, Lomonossoff G, et al. Viruses of protozoan parasites and viral therapy: Is the time now right? Virol J. 2020;17:142. doi: 10.1186/s12985-020-01410-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Husnik F, Tashyreva D, Boscaro V, George EE, Lukeš J, Keeling PJ. Bacterial and archaeal symbioses with protists. Curr. Biol. 2021;31:R862–R877.. doi: 10.1016/j.cub.2021.05.049. [DOI] [PubMed] [Google Scholar]
  • 7.Hahn MA, Dheilly NM. Experimental models to study the role of microbes in host-parasite interactions. Front. Microbiol. 2016;7:1300. doi: 10.3389/fmicb.2016.01300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Robinson AJ, House GL, Morales DP, Kelliher JM, Gallegos-Graves LV, LeBrun ES, et al. Widespread bacterial diversity within the bacteriome of fungi. Commun Biol. 2021;4:1168. doi: 10.1038/s42003-021-02693-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Yilmaz P, Kottmann R, Field D, Knight R, Cole JR, Amaral-Zettler L, et al. Minimum information about a marker gene sequence (MIMARKS) and minimum information about any (x) sequence (MIxS) specifications. Nat Biotechnol. 2011;29:415–20. doi: 10.1038/nbt.1823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bowers R, Kyrpides NC, Stepanauskas R, Harmon-Smith M, Doud D, Reddy TBK, et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat Biotechnol. 2017;35:725–31. doi: 10.1038/nbt.3893. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Schriml LM, Chuvochina M, Davies N, Eloe-Fadrosh EA, Finn RD, Hugenholtz P, et al. COVID-19 pandemic reveals the peril of ignoring metadata standards. Sci Data. 2020;7:188. doi: 10.1038/s41597-020-0524-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Field D, Garrity G, Gray T, Morrison N, Selengut J, Sterk P, Tatusova T, et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–7. doi: 10.1038/nbt1360. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Roux S, Adriaenssens EM, Dutilh BE, Koonin EV, Kropinski AM, Krupovic M, et al. Minimum Information about an Uncultivated Virus Genome (MIUViG) Nat Biotechnol. 2019;37:29–37. doi: 10.1038/nbt.4306. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Dheilly NM, Martínez Martínez J, Rosario K, Brindley PJ, Fichorova RN, Kaye JZ, et al. Parasite microbiome project: grand challenges. PLoS Pathog. 2019;15:e1008028. doi: 10.1371/journal.ppat.1008028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Leung TLF, Poulin R. Parasitism, commensalism, and mutualism: exploring the many shades of symbioses. Vie Milieu. 2008;58:107–15. [Google Scholar]
  • 16.Jorge F, Dheilly NM, Poulin R. Persistence of a core microbiome through the ontogeny of a multi-host parasite. Front Microbiol. 2020;11:954. doi: 10.3389/fmicb.2020.00954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Fitzpatrick CR, Schneider AC. Unique bacterial assembly, composition, and interactions in a parasitic plant and its host. J Exp Bot. 2020;71:2198–209. doi: 10.1093/jxb/erz572. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Table S1 (23.9KB, xlsx)
Table S3 (23.7KB, xlsx)

Articles from ISME Communications are provided here courtesy of Oxford University Press

RESOURCES