Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
letter
. 2019 May 30;8(22):e00369-19. doi: 10.1128/MRA.00369-19

Beware of False “Type Strain” Genome Sequences

Francisco Salvà-Serra a,b,c,d,e,, Daniel Jaén-Luchoro a,b,c,d, Roger Karlsson a,b,d,f, Antoni Bennasar-Figueras e,g, Hedvig E Jakobsson b,d, Edward R B Moore a,b,c,d,
Editor: Irene L G Newtonh
PMCID: PMC6544187  PMID: 31147430

LETTER

With this letter, we warn users of bacterial DNA sequence data about recent cases misusing the term “type strain” in bacterial genome sequence reports and highlight the importance that the term is used in the correct context.

In recent articles published in the journal Genome Announcements (GA), (now Microbiology Resource Announcements [MRA]) (15), the complete genome sequences of five strains of five species of bacteria were reported. The titles of the articles stated that the strains represent the respective “type strains” of the five species, although the articles do not present details about the processes of defining the type strains. At this point, it is important to point out that the concept and description of “type strain” are not arbitrary; the type strains of bacterial species are defined by rule 18a of the International Code of Nomenclature of Prokaryotes as follows: “The type strain is made up of living cultures of an organism, which are descended from a strain designated as the nomenclatural type” (6). Strains serving as nomenclatural type material are designated precisely, and this may be done in various ways (7); typically, a “holotype strain” is designated at the time of valid publication of a new species name (rule 18b), in which the type strain must conform to defined conditions (rule 30). The five species described in the GA publications had been validly published previously, and the type strains for those species were already defined, preserved, and publicly available in numerous culture collections. Meanwhile, the strains reported in the five GA articles were recently isolated independently. Therefore, since they are not descended from the already defined type strains, they cannot represent the authentic type strains of the species.

To confirm these observations, we performed average nucleotide identity (ANI) analyses using JSpeciesWS (8, 9) between the five genome sequences reported in the GA articles and the genome sequences of the documented type strains of the five species (Table 1). The ANI values range from 83.10 to 98.96% (percentages aligned, 75.56 to 91.18%), confirming that the described strains are different and, furthermore, confirming that one of them, Lelliottia nimipressuralis SGAir0187, is misclassified and is not a strain of this species.

TABLE 1.

Average nucleotide identity values between the genome sequences of the strains erroneously reported as type strains and the genome sequences of the authentic type strains of the corresponding species

Strain erroneously reported as type strain GenBank accession no. Authentic type strain GenBank accession no. ANIba (%) Aligned nucleotides (%)
Acinetobacter indicus SGAir0564 CP024620 A. indicus CIP 110367T AYET00000000 96.02 83.59
Bacillus altitudinis SGAir0031 CP022319 B. altitudinis 41KF2bT ASJC00000000 97.92 91.18
Lelliottia nimipressuralis SGAir0187 CP025034 L. nimipressuralis CCUG 25894T SDDX00000000 83.10 75.56
Pseudomonas stutzeri SGAir0442 CP025149 P. stutzeri CGMCC 1.1803T CP002881 97.04 88.04
Staphylococcus haemolyticus SGAir0252 CP025031 S. haemolyticus NCTC 11042T PPQA00000000 98.96 88.85
a

Average nucleotide identity based on BLAST. Species threshold, ≥95%.

Representing genome sequences from false type strains of given species has the potential to lead to erroneous conclusions in future studies that may rely on the published misinformation, as they may be used as the wrong reference points. We encourage the authors of the five GA publications and the journal to publish corrigenda and remove the words “type strain” from the titles of the publications. We also warn users of publicly available genome sequence data to be cautious in accepting the metadata associated with genome sequences; recent studies have clearly demonstrated the presence of high numbers of misclassified genome sequences in different taxa according to genome sequence identity criteria for species-level identifications (1013). Public databases archiving genome sequence data are implementing new controls for catching sequence deposits that do not adhere to accepted taxonomic standards for identification. However, the public databases may be able to control descriptions of only the sequence data submitted; they cannot control the descriptions of the data that authors may use in publications. We encourage users of reported and archived genome sequence data to perform relevant taxonomic controls, for instance, by comparing housekeeping gene sequences from genomes with independently determined sequences of documented type strains; if possible, comparisons should be made from the data of the original descriptions of the species.

In conclusion, type strains serve as important “reference points” in bacterial taxonomy and systematics and are fundamental and essential resources for microbiology. The presence of sequences erroneously reported as type strains is a real example of “fake news” and is potentially dangerous, as it can easily lead to further errors in studies relying on them. With the presented examples of inaccurate and inappropriate use of the term “type strain,” we emphasize how important it is to confirm the designation. In fact, we would propose that the public databases establish dedicated repositories for the genome sequences of the type strains of validly published species. Essentially, while genome sequence data represent enormous resources, it is imperative to perform basic and relevant controls when reporting and using public genome sequences.

Ed. Note: The authors of the published articles did not respond.

REFERENCES

  • 1.Vettath VK, Junqueira ACM, Uchida A, Purbojati RW, Houghton JNI, Chenard C, Drautz-Moses DI, Wong A, Kolundzija S, Clare ME, Lau KJX, Gaultier NE, Heinle CE, Premkrishnan BNV, Gusareva ES, Acerbi E, Yang L, Schuster SC. 2017. Complete genome sequence of Bacillus altitudinis type strain SGAir0031 isolated from tropical air collected in Singapore. Genome Announc 5:e01260-17. doi: 10.1128/genomeA.01260-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Heinle CE, Junqueira ACM, Uchida A, Purbojati RW, Houghton JNI, Chenard C, Drautz-Moses DI, Wong A, Kolundzija S, Clare ME, Kushwaha KK, Panicker D, Putra A, Gaultier NE, Premkrishnan BNV, Vettath VK, Schuster SC. 2018. Complete genome sequence of Lelliottia nimipressuralis type strain SGAir0187, isolated from tropical air collected in Singapore. Genome Announc 6:e00231-18. doi: 10.1128/genomeA.00231-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Vettath VK, Junqueira ACM, Uchida A, Purbojati RW, Houghton JNI, Chenard C, Drautz-Moses DI, Wong A, Kolundzija S, Clare ME, Kushwaha KK, Panicker D, Putra A, Gaultier NE, Heinle CE, Premkrishnan BNV, Schuster SC. 2018. Complete genome sequence of Acinetobacter indicus type strain SGAir0564 isolated from tropical air collected in Singapore. Genome Announc 6:e00230-18. doi: 10.1128/genomeA.00230-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Premkrishnan BNV, Junqueira ACM, Uchida A, Purbojati RW, Houghton JNI, Chenard C, Wong A, Kolundzija S, Clare ME, Kushwaha KK, Panicker D, Putra A, Gaultier NE, Heinle CE, Vettath VK, Drautz-Moses DI, Schuster SC. 2018. Complete genome sequence of Staphylococcus haemolyticus type strain SGAir0252. Genome Announc 6:e00229-18. doi: 10.1128/genomeA.00229-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Kee C, Junqueira ACM, Uchida A, Purbojati RW, Houghton JNI, Chenard C, Wong A, Kolundzija S, Clare ME, Kushwaha KK, Panicker D, Putra A, Gaultier NE, Premkrishnan BNV, Heinle CE, Vettath VK, Drautz-Moses DI, Schuster SC. 2018. Complete genome sequence of Pseudomonas stutzeri type strain SGAir0442, isolated from Singapore air samples. Genome Announc 6:e00424-18. doi: 10.1128/genomeA.00424-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Parker CT, Tindall BJ, Garrity GM. 2019. International Code of Nomenclature of Prokaryotes. Int J Syst Evol Microbiol 69:S1–S111. doi: 10.1099/ijsem.0.000778. [DOI] [PubMed] [Google Scholar]
  • 7.Tindall BJ, Kämpfer P, Euzéby JP, Oren A. 2006. Valid publication of names of prokaryotes according to the rules of nomenclature: past history and current practice. Int J Syst Evol Microbiol 56:2715–2720. doi: 10.1099/ijs.0.64780-0. [DOI] [PubMed] [Google Scholar]
  • 8.Goris J, Konstantinidis KT, Klappenbach JA, Coenye T, Vandamme P, Tiedje JM. 2007. DNA-DNA hybridization values and their relationship to whole-genome sequence similarities. Int J Syst Evol Microbiol 57:81–91. doi: 10.1099/ijs.0.64483-0. [DOI] [PubMed] [Google Scholar]
  • 9.Richter M, Rosselló-Móra R, Oliver Glöckner F, Peplies J. 2016. JSpeciesWS: a Web server for prokaryotic species circumscription based on pairwise genome comparison. Bioinformatics 32:929–931. doi: 10.1093/bioinformatics/btv681. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Beaz-Hidalgo R, Hossain MJ, Liles MR, Figueras MJ. 2015. Strategies to avoid wrongly labelled genomes using as example the detected wrong taxonomic affiliation for Aeromonas genomes in the GenBank database. PLoS One 10:e0115813. doi: 10.1371/journal.pone.0115813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Gomila M, Peña A, Mulet M, Lalucat J, García-Valdés E. 2015. Phylogenomics and systematics in Pseudomonas. Front Microbiol 6:214. doi: 10.3389/fmicb.2015.00214. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Jensen A, Scholz CF, Kilian M. 2016. Re-evaluation of the taxonomy of the Mitis group of the genus Streptococcus based on whole genome phylogenetic analyses, and proposed reclassification of Streptococcus dentisani as Streptococcus oralis subsp. dentisani comb. nov., Streptococcus tigurinus as Streptococcus oralis subsp. tigurinus comb. nov., and Streptococcus oligofermentans as a later synonym of Streptococcus cristatus. Int J Syst Evol Microbiol 66:4803–4820. doi: 10.1099/ijsem.0.001433. [DOI] [PubMed] [Google Scholar]
  • 13.Gonzales-Siles L, Salvà-Serra F, Degerman A, Nordén R, Lindh M, Skovbjerg S, Moore ER. 2018. Identification and capsular serotype sequetyping of Streptococcus pneumoniae strains. bioRxiv doi: 10.1101/415422. [DOI] [PubMed]

Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES