Abstract
The International Committee on Systematics of Prokaryotes (ICSP) discussed and rejected in 2020 a proposal to modify the International Code of Nomenclature of Prokaryotes to allow the use of gene sequences as type for naming prokaryotes. An alternative nomenclatural code, the Code of Nomenclature of Prokaryotes Described from Sequence Data (SeqCode), which considers genome sequences as type material for naming species, was published in 2022. Members of the ICSP subcommittee for the taxonomy of the phylum Chlamydiae (Chlamydiota) consider that the use of gene sequences as type would benefit the taxonomy of microorganisms that are difficult to culture such as the chlamydiae and other strictly intracellular bacteria. We recommend the registration of new names of uncultured prokaryotes in the SeqCode registry.
Keywords: SeqCode, Taxogenomics, Chlamydiales, Intracellular bacteria, Chlamydia
1. Main text
The classification and naming of prokaryotes is essential for communication in biological research. While an increasing number of bacterial taxa are described based on genome sequences only, only cultured bacteria can serve as type material to name new species. The International Committee on Systematics of Prokaryotes (ICSP) held a virtual discussion in 2020 on proposals to use gene/genome sequences as type for the taxonomy of prokaryotes [[1], [2], [3], [4], [5], [6]]. Following the discussion, members of the ICSP voted and rejected these proposals [7], which triggered the development of an alternative nomenclatural code for prokaryotes described from sequence data (SeqCode) [8].
2. The requirement for pure culture prevents naming most chlamydiae
Chlamydiologists are facing some difficulties with the current rules of the International Code of Nomenclature of Prokaryotes (ICNP), in particular regarding the deposition of a viable pure culture in two different international collections (rule 30) [9]. We put forth that the proposal to use genome sequences as type [[1], [2], [3]] would in fact be positive for the taxonomy of microorganisms such as members of the chlamydiae that are often very difficult to culture. Only a small proportion of all chlamydial species has been described so far [10,11]. As obligate intracellular bacteria that are resistant to axenic growth in pure culture, chlamydiae require cultured eukaryotic cells as host for their own growth. In recent years, an increasing number of Candidatus species was described mainly based on genomic data. These include Candidatus Chlamydia sanzinia, a snake pathogen [12], Candidatus Similichlamydia epinephelii, a fish pathogen [13], Candidatus Rhabdochlamydia helvetica, a tick symbiont [14], Candidatus Syngnamydia medusae, associated with jellyfish [15] and Candidatus Rhabdochlamydia oedothoracis, associated with the dwarf spider Oedothorax gibbosus [16]. In fact, our understanding of chlamydial diversity gained a new momentum with the application of single-cell and metagenomic approaches; many of the more recently discovered highly diverse chlamydiae are characterized based on single-cell assembled genomes (SAGs) and metagenome-assembled genomes (MAGs) [[17], [18], [19], [20]]. Our subcommittee thus feels strongly that naming bacterial species based on genomics and metagenomics data should be allowed a) when it has not been possible to isolate the corresponding strain in culture, and b) as long as it is clearly stated that the new strain designation is only based on sequence data. The latter distinction could be achieved by the Whitman and SeqCode proposal of using the “Ts” superscript [3,8] or by continuing to use the Candidatus status. If sequence quality standards are applied and appropriate metadata is provided [21], it is possible to establish a consistent taxonomy for the chlamydiae.
3. Gene and genome sequences are at the core of prokaryotic taxonomy
While nothing in the ICNP restrict the freedom of taxonomic thought or action (Principle 1), prokaryotic (and eukaryotic) taxonomy heavily rely on gene and genome sequence to build a coherent classification framework [22]. Molecular sequences are more revealing of evolutionary relationships than are classical phenotypes, in particular for prokaryotes [23]. Species are generally delineated using whole genome similarity measures and grouped into higher taxonomic ranks based on phylogenetic analyses [22]. Using sequences as type would facilitate the unambiguous identification of prokaryotic species or even subspecies and prevent the proliferation of synonymous names referring to the same organisms. The genome sequence of Candidatus Protochlamydia massiliensis, isolated from Vermamoeba vermiformis [24] is for instance nearly identical to that of Candidatus Protochlamydia naegleriophila isolated from Naegleria amoebae (average nucleotide identity >99%) and should logically be part of the same species [25]. Since both species have a Candidatus status, no priority is officially granted by the ICNP [9]. A centralized registry such as SeqCode with type sequences and associated names would prevent this situation to happen and logically grant priority to the earliest name [8], greatly facilitating the work of taxonomists. Classifying and naming species based on genome sequences would instantly provide an accurate representation of chlamydial diversity. The Genome Taxonomy Database (GTDB) already provides a regularly updated taxonomy of publicly available genomes and could serve as reference for updating the taxonomy of the phylum Chlamydiae (Chlamydiota) [26,27].
4. The need for quality standards
An important concern is the quality of genome sequences used as type material [6]. Indeed, binning of assembled contigs can lead to errors whereby MAGs are artifactual hybrids of distantly related organism genomes. Evaluation of the quality and completeness of genome assemblies generally rely on the identification of nearly universal marker genes. Tools such as CheckM allow the evaluation of the completeness and potential contamination of any bacterial genome sequence based on the identification of nearly universal bacterial or lineage-specific marker genes [28,29]. The evaluation of the completeness of the genome should be carried out with care considering the large variations in genome size and gene content observed within the phylum. Hybrid assemblies can also be detected by investigating the taxonomic properties of individual genes and contigs with tools such as CAT and BAT [30]. Genome completeness and contamination estimates are already available on GTDB and on the chlamydiae specific database ChlamDB [31]. Some MAGs from poorly characterized chlamydial clades are estimated to be less than 50% complete; however the genome segments that constitute these MAGs can be unequivocally identified as segments of chlamydial genomes [17]. The level of contamination in chlamydial SAGs and MAGs is generally low, and currently only the MAG of Candidatus Similichlamydia epinephelii exhibits more than 5% contamination. Phylogenetic analysis indicates that this assembly might represent an admixture of two related strains [17]. Overall, the quality of sequences should not be a major problem if DNA sequences are used as type material for taxonomy of the phylum Chlamydiae (Chlamydiota).
5. Recommendations of the ICSP subcommittee for the taxonomy of the phylum Chlamydiae (Chlamydiota) to name uncultured species under the SeqCode
The ICSP subcommittee for the taxonomy of the phylum Chlamydiae (Chlamydiota) consider the SeqCode as a good solution to tackle the problem of naming uncultured prokaryotes. Although the SeqCode is not endorsed by the ICSP [32], we hope that both codes can coexist and be united in the near future. We are in favor of using genomes as type to name uncultured prokaryotes and encourage the use of the “T” and abbreviations to distinguish species with a cultured type (T) from species with a sequence type (Ts). Reference genomes of undescribed species should be registered in the SeqCode registry (https://seqco.de). Data quality standards are discussed in the SeqCode paper [8] and are summarized in Table 1. Raw sequence data and minimal metadata should be associated to the sequence assembly. Reported data should comply with existing standards from the Genomic Standards Consortium [21,33]. The quality of the sequences should be evaluated based on the identification of marker genes and a comprehensive investigation of the taxonomy of individual contigs (e.g. inferred from homology search or phylogenetic analyses [30]). Species should be delineated based on genome average nucleotide identity (ANI) with a cutoff of 95% of identity [2]. The average amino-acid identity (AAI) of orthologous protein sequences can also be used to classify genomes at family (45–65%), genus (65–95%), and species level (95–100%) [2]. Methods based on Percent of Conserved Proteins (POCP) should be used with caution considering the high variability of genome size within the phylum [20]. The classification at higher taxonomic ranks should rely on a phylogeny based on single copy orthologs and the comparison of a set of marker genes such as those used by GTDB [26] or those that have been shown by Pillonel et al. to be taxonomically informative and highly discriminant for the delineation of taxa in the phylum Chlamydiae (Chlamydiota) [34]. Genome assemblies of lower quality or those that are incomplete could be used as provisional taxon with the status Candidatus and replaced by better quality genome assemblies when available. To improve the consistency of the classification within the phylum, the ICSP subcommittee for the taxonomy of chlamydiae recommends the use of centralized genome sequence databases allowing comparison of new sequences to existing chlamydial genomes and the classification of new genomes in a consistent manner. GTDB with its recently introduced standardized calculation of species clusters and the GTDB Toolkit [26,35], and the chlamydiae-specific genome database ChlamDB [31] are recommended to serve as central resources for the curation of genome-based taxonomy of any members of the phylum Chlamydiae (Chlamydiota).
Table 1.
Overview of recommended metadata and quality standards for the use of DNA sequence information to name uncultured taxa. This proposal is partially adapted from Refs. [8,27,37].
Mandatory | Raw sequence data publicly available (e.g. sequencing reads submitted to Sequence Read Archive [SRA]) |
Compliance to standards from the Genomic Standards Consortium: provide the minimum information about a genome sequence (MIGS) [34], single amplified genome (MISAG) or metagenome-assembled genome (MIMAG) [21] | |
Genome contamination <5% (based on the identification of >100 nearly universal marker genes) | |
Genome >90% complete (based on >100 nearly universal marker genes) for species level assignment | |
Average amino acid identity (AAI) or whole-genome average nucleotide identity (ANI) to any of the most similar known chlamydial genomes | |
Phylogenetic analysis using single copy marker genes and representatives of all known chlamydial genomes [27,28,34] | |
For isolates, read coverage≥10x | |
Recommended | Host name/taxid (if known) |
16S rRNA gene(s) sequences >75% complete and passing chimera checks | |
High genome integrity (contig no. <100; N50 >25 kilobases (kb); largest contig >100 kb) | |
For MAG/SAG, read coverage≥10x | |
For MAG/SAG, sequence identified in more than 1 sample (from different places or different time points) |
During our previous subcommittee meetings, we recognized the advantages of using genome sequences as type to name members of the phylum Chlamydiae (Chlamydiota) given its large proportion of yet uncultured representatives [36], and we will continue, within the framework of the ICSP subcommittee and our regular meetings, to work on this important issue. About 40% of all publicly available genome sequences lack a species name [26], indicating that using DNA sequence data as type material will not only facilitate the communication of research on chlamydiae but will be helpful for all Bacteria and Archaea [26]. Thus, the ICSP subcommittee for the taxonomy of the phylum Chlamydiae is in favor of using the genomes to name prokaryotes, but with a specific nomenclature, i.e. using the “Ts” epithet.
Funding information
This work received no specific grant from any funding agency.
CRediT authorship contribution statement
Gilbert Greub: Conceptualization, Writing – review & editing. Trestan Pillonel: Conceptualization, Writing – original draft. Patrik M. Bavoil: Writing – review & editing. Nicole Borel: Writing – review & editing. Lee Ann Campbell: Writing – review & editing. Deborah Dean: Writing – review & editing. Scott Hefty: Writing – review & editing. Matthias Horn: Writing – review & editing. Servaas A. Morré: Writing – review & editing. Scot P. Ouellette: Writing – review & editing. Yvonne Pannekoek: Writing – review & editing. Mirja Puolakkainen: Writing – review & editing. Peter Timms: Writing – review & editing. Raphael Valdivia: Writing – review & editing. Daisy Vanrompay: Writing – review & editing.
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Handling Editor: Patricia Schlagenhauf
References
- 1.Whitman W.B. Modest proposals to expand the type material for naming of prokaryotes. Int J Syst Evol Microbiol. 2016;66:2108–2112. doi: 10.1099/ijsem.0.000980. [DOI] [PubMed] [Google Scholar]
- 2.Konstantinidis K.T., Rosselló-Móra R., Amann R. Uncultivated microbes in need of their own taxonomy. ISME J. 2017;11:2399–2406. doi: 10.1038/ismej.2017.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Whitman W.B., Sutcliffe I.C., Rossello-Mora R. Proposal for changes in the international code of nomenclature of prokaryotes: granting priority to Candidatus names. Int J Syst Evol Microbiol. 2019;69:2174–2175. doi: 10.1099/ijsem.0.003419. [DOI] [PubMed] [Google Scholar]
- 4.Rossello-Mora R., Konstantinidis K.T., Sutcliffe I., Whitman W. Opinion: response to concerns about the use of DNA sequences as types in the nomenclature of prokaryotes. Syst Appl Microbiol. 2020;43 doi: 10.1016/j.syapm.2020.126070. [DOI] [PubMed] [Google Scholar]
- 5.Bisgaard M., Christensen H., Clermont D., Dijkshoorn L., Janda J.M., Moore E.R.B., Nemec A., Nørskov-Lauritsen N., Overmann J., Reubsaet F.A.G. The use of genomic DNA sequences as type material for valid publication of bacterial species names will have severe implications for clinical microbiology and related disciplines. Diagn Microbiol Infect Dis. 2019;95:102–103. doi: 10.1016/j.diagmicrobio.2019.03.007. [DOI] [PubMed] [Google Scholar]
- 6.Overmann J., Huang S., Nübel U., Hahnke R.L., Tindall B.J. Relevance of phenotypic information for the taxonomy of not-yet-cultured microorganisms. Syst Appl Microbiol. 2019;42:22–29. doi: 10.1016/j.syapm.2018.08.009. [DOI] [PubMed] [Google Scholar]
- 7.Sutcliffe I.C., Dijkshoorn L., Whitman W.B., Executive Board, on behalf of the I Minutes of the International Committee on Systematics of Prokaryotes online discussion on the proposed use of gene sequences as type for naming of prokaryotes, and outcome of vote. Int J Syst Evol Microbiol. 2020;70:4416–4417. doi: 10.1099/ijsem.0.004303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hedlund B.P., Chuvochina M., Hugenholtz P., Konstantinidis K.T., Murray A.E., Palmer M., Parks D.H., Probst A.J., Reysenbach A.-L., Rodriguez-R L.M., et al. SeqCode: a nomenclatural code for prokaryotes described from sequence data. Nat. Microbiol. 2022;7:1702–1708. doi: 10.1038/s41564-022-01214-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.International code of nomenclature of prokaryotes Int J Syst Evol Microbiol, 69, S1–S111. [DOI] [PubMed]
- 10.Lagkouvardos I., Weinmaier T., Lauro F.M., Cavicchioli R., Rattei T., Horn M. Integrating metagenomic and amplicon databases to resolve the phylogenetic and ecological diversity of the Chlamydiae. ISME J. 2014;8:115–125. doi: 10.1038/ismej.2013.142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Köstlbacher S., Collingro A., Halter T., Schulz F., Jungbluth S.P., Horn M. Pangenomics reveals alternative environmental lifestyles among chlamydiae. Nat Commun. 2021;12:4021. doi: 10.1038/s41467-021-24294-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Taylor-Brown A., Bachmann N.L., Borel N., Polkinghorne A. Culture-independent genomic characterisation of Candidatus Chlamydia sanzinia, a novel uncultivated bacterium infecting snakes. BMC Genom. 2016;17:710. doi: 10.1186/s12864-016-3055-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Taylor-Brown A., Pillonel T., Bridle A., Qi W., Bachmann N.L., Miller T.L., Greub G., Nowak B., Seth-Smith H.M.B., Vaughan L., et al. Culture-independent genomics of a novel chlamydial pathogen of fish provides new insight into host-specific adaptations utilized by these intracellular bacteria. Environ Microbiol. 2017;19:1899–1913. doi: 10.1111/1462-2920.13694. [DOI] [PubMed] [Google Scholar]
- 14.Pillonel T., Bertelli C., Aeby S., de Barsy M., Jacquier N., Kebbi-Beghdadi C., Mueller L., Vouga M., Greub G. Sequencing the obligate intracellular Rhabdochlamydia helvetica within its tick host ixodes ricinus to investigate their symbiotic relationship. Genome Biol. Evol. 2019;11:1334–1344. doi: 10.1093/gbe/evz072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Viver T., Orellana L.H., Hatt J.K., Urdiain M., Díaz S., Richter M., Antón J., Avian M., Amann R., Konstantinidis K.T., et al. The low diverse gastric microbiome of the jellyfish Cotylorhiza tuberculata is dominated by four novel taxa. Environ Microbiol. 2017;19:3039–3058. doi: 10.1111/1462-2920.13763. [DOI] [PubMed] [Google Scholar]
- 16.Halter T., Köstlbacher S., Collingro A., Sixt B.S., Tönshoff E.R., Hendrickx F., Kostanjšek R., Horn M. Ecology and evolution of chlamydial symbionts of arthropods. ISME Commun. 2022;2:1–11. doi: 10.1038/s43705-022-00124-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Pillonel T., Bertelli C., Greub G. Environmental metagenomic assemblies reveal seven new highly divergent chlamydial lineages and hallmarks of a conserved intracellular lifestyle. Front Microbiol. 2018;9 doi: 10.3389/fmicb.2018.00079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Dharamshi J.E., Tamarit D., Eme L., Stairs C.W., Martijn J., Homa F., Jørgensen S.L., Spang A., Ettema T.J.G. Marine sediments illuminate chlamydiae diversity and evolution. Curr Biol. 2020;30:1032–1048. doi: 10.1016/j.cub.2020.02.016. e7. [DOI] [PubMed] [Google Scholar]
- 19.Collingro A., Köstlbacher S., Mussmann M., Stepanauskas R., Hallam S.J., Horn M. Unexpected genomic features in widespread intracellular bacteria: evidence for motility of marine chlamydiae. ISME J. 2017;11:2334–2344. doi: 10.1038/ismej.2017.95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Dharamshi J.E., Köstlbacher S., Schön M.E., Collingro A., Ettema T.J.G., Horn M. Gene gain facilitated endosymbiotic evolution of Chlamydiae. Nat. Microbiol. 2023;8:40–54. doi: 10.1038/s41564-022-01284-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Bowers R.M., Kyrpides N.C., Stepanauskas R., Harmon-Smith M., Doud D., Reddy T.B.K., Schulz F., Jarett J., Rivers A.R., Eloe-Fadrosh E.A., et al. Minimum information about a single amplified genome (MISAG) and a metagenome-assembled genome (MIMAG) of bacteria and archaea. Nat Biotechnol. 2017;35:725–731. doi: 10.1038/nbt.3893. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Hugenholtz P., Chuvochina M., Oren A., Parks D.H., Soo R.M. Prokaryotic taxonomy and nomenclature in the age of big sequence data. ISME J. 2021;15:1879–1892. doi: 10.1038/s41396-021-00941-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Woese C.R., Kandler O., Wheelis M.L. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci USA. 1990;87:4576–4579. doi: 10.1073/pnas.87.12.4576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Benamar S., Bou Khalil J.Y., Blanc-Tailleur C., Bilen M., Barrassi L., La Scola B. Developmental cycle and genome analysis of Protochlamydia massiliensis sp. nov. A new species in the parachlamydiacae family. Front Cell Infect Microbiol. 2017;7 doi: 10.3389/fcimb.2017.00385. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Bertelli C., Cissé O.H., Rusconi B., Kebbi-Beghdadi C., Croxatto A., Goesmann A., Collyn F., Greub G. CRISPR system acquisition and evolution of an obligate intracellular chlamydia-related bacterium. Genome Biol. Evol. 2016;8:2376–2386. doi: 10.1093/gbe/evw138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Parks D.H., Chuvochina M., Chaumeil P.-A., Rinke C., Mussig A.J., Hugenholtz P. A complete domain-to-species taxonomy for Bacteria and Archaea. Nat Biotechnol. 2020;38:1079–1086. doi: 10.1038/s41587-020-0501-8. [DOI] [PubMed] [Google Scholar]
- 27.Parks D.H., Chuvochina M., Waite D.W., Rinke C., Skarshewski A., Chaumeil P.-A., Hugenholtz P. A standardized bacterial taxonomy based on genome phylogeny substantially revises the tree of life. Nat Biotechnol. 2018;36:996–1004. doi: 10.1038/nbt.4229. [DOI] [PubMed] [Google Scholar]
- 28.Parks D.H., Imelfort M., Skennerton C.T., Hugenholtz P., Tyson G.W. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res. 2015;25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Chklovski A., Parks D.H., Woodcroft B.J., Tyson G.W. 2022. CheckM2: a rapid, scalable and accurate tool for assessing microbial genome quality using machine learning. [DOI] [PubMed] [Google Scholar]
- 30.von Meijenfeldt F.A.B., Arkhipova K., Cambuy D.D., Coutinho F.H., Dutilh B.E. Robust taxonomic classification of uncharted microbial sequences and bins with CAT and BAT. Genome Biol. 2019;20:217. doi: 10.1186/s13059-019-1817-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Pillonel T., Tagini F., Bertelli C., Greub G. ChlamDB: a comparative genomics database of the phylum Chlamydiae and other members of the Planctomycetes-Verrucomicrobiae-Chlamydiae superphylum. Nucleic Acids Res. 2020;48:D526–D534. doi: 10.1093/nar/gkz924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Göker M., Moore E.R.B., Oren A., Trujillo M.E. Status of the SeqCode in the international journal of systematic and evolutionary microbiology. Int J Syst Evol Microbiol. 2023;72 doi: 10.1099/ijsem.0.005754. [DOI] [PubMed] [Google Scholar]
- 33.Field D., Garrity G., Gray T., Morrison N., Selengut J., Sterk P., Tatusova T., Thomson N., Allen M.J., Angiuoli S.V., et al. The minimum information about a genome sequence (MIGS) specification. Nat Biotechnol. 2008;26:541–547. doi: 10.1038/nbt1360. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Pillonel T., Bertelli C., Salamin N., Greub G. Taxogenomics of the order chlamydiales. Int J Syst Evol Microbiol. 2015;65:1381–1393. doi: 10.1099/ijs.0.000090. [DOI] [PubMed] [Google Scholar]
- 35.Chaumeil P.-A., Mussig A.J., Hugenholtz P., Parks D.H. GTDB-Tk: a toolkit to classify genomes with the Genome Taxonomy Database. Bioinformatics. 2020;36:1925–1927. doi: 10.1093/bioinformatics/btz848. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Borel N., Horn M., Greub G. International committee on Systematics of prokaryotes (ICSP) subcommittee on the taxonomy of chlamydiae. Minutes of the closed meeting, 20 march 2019, seattle, WA, USA. Int J Syst Evol Microbiol. 2019;69:3654–3656. doi: 10.1099/ijsem.0.003738. [DOI] [PubMed] [Google Scholar]
- 37.Murray A.E., Freudenstein J., Gribaldo S., Hatzenpichler R., Hugenholtz P., Kämpfer P., Konstantinidis K.T., Lane C.E., Papke R.T., Parks D.H., et al. Roadmap for naming uncultivated archaea and bacteria. Nat. Microbiol. 2020;5:987–994. doi: 10.1038/s41564-020-0733-x. [DOI] [PMC free article] [PubMed] [Google Scholar]