Abstract
The reported Agrobacterium radiobacter DSM 30174T genome is highly fragmented, hindering robust comparative genomics and genome-based taxonomic analysis. We re-sequenced the Agrobacterium radiobacter type strain, generating a dramatically improved genome with high contiguity. In addition, we sequenced the genome of Agrobacterium tumefaciens B6T, enabling for the first time, a proper comparative genomics of these contentious Agrobacterium species. We provide concrete evidence that the previously reported Agrobacterium radiobacter type strain genome (Accession Number: ASXY01) is contaminated which explains its abnormally large genome size and fragmented assembly. We propose that Agrobacterium tumefaciens be reclassified as Agrobacterium radiobacter subsp. tumefaciens and that Agrobacterium radiobacter retains it species status with the proposed name of Agrobacterium radiobacter subsp. radiobacter. This proposal is based, first on the high pairwise genome-scale average nucleotide identity supporting the amalgamation of both Agrobacterium radiobacter and Agrobacterium tumefaciens into a single species. Second, maximum likelihood tree construction based on the concatenated alignment of shared genes (core genes) among related strains indicates that Agrobacterium radiobacter NCPPB3001 is sufficiently divergent from Agrobacterium tumefaciens to propose two independent sub-clades. Third, Agrobacterium tumefaciens demonstrates the genomic potential to synthesize the L configuration of fucose in its lipid polysaccharide, fostering its ability to colonize plant cells more effectively than Agrobacterium radiobacter.
Keywords: Type strain, Average nucleotide identity, Phylogenomics, Agrobacterium radiobacter, Agrobacterium tumefaciens, Lipopolysaccharide, Agrobacterium, Ti plasmid
Introduction
The taxonomy and phylogeny of the genus Agrobacterium has proven to be complex and controversial. Bacteria of the genus Agrobacterium have been grouped into six species based on the disease phenotype associated, in part, with the resident disease-inducing plasmid. Among those six species are Agrobacterium tumefaciens causing crown gall on dicotyledonous plants, stone fruit and nut trees and Agrobacterium radiobacter that is not known to cause plant diseases of any kind (Bouzar & Jones, 2001; Conn, 1942; Kerr & Panagopoulos, 1977; Panagopoulos, Psallidas & Alivizatos, 1978; Riker et al., 1930; Starr & Weiss, 1943; Süle, 1978). An alternative classification approach grouped Agrobacterium organisms into three biovars based on physiological and biochemical properties without consideration of disease phenotype (Keane, Kerr & New, 1970; Kerr & Panagopoulos, 1977; Panagopoulos, Psallidas & Alivizatos, 1978). The species and biovar classification schemes do not coincide well, in a large part, because of the disease-inducing plasmids, tumor-inducing (pTi) and hairy root-inducing (pRi), are readily transmissible plasmids (Young et al., 2001).
Many widely used approaches for bacterial species definition include composition of peptidoglycan, base composition of DNA, fatty acid and 16S rDNA sequence (Stackebrandt et al., 2002) in addition to newer methods based on the whole-genome analysis (Coutinho et al., 2016; Jain et al., 2018), horizontal gene transfer analysis (Bobay & Ochman, 2017) or the core genome analysis (Moldovan & Gelfand, 2018) which is used in the present study. The genus Agrobacterium is a prime example with many proposals and oppositions regarding the amalgamation of Agrobacterium and Rhizobium over the last three or four decades (Farrand, Van Berkum & Oger, 2003; Gaunt et al., 2001; Young et al., 2001, 2003). However, more recent studies appear to favor the preservation of the genus Agrobacterium backed by strong genetic and genomic evidence (Gan & Savka, 2018; Ramírez-Bahena et al., 2014). Within the genus Agrobacterium, the taxonomic status of Agrobacterium radiobacter and Agrobacterium tumefaciens remains contentious (Sawada et al., 1993; Young, 2008; Young, Pennycook & Watson, 2006). Agrobacterium radiobacter (originally proposed as Bacillus radiobacter) is a non-pathogenic soil bacterium associated with nitrogen utilization isolated more than a century ago in 1902 (Beijerinck & Van Delden, 1902; Conn, 1942). On the other hand, Agrobacterium tumefaciens (previously Bacterium tumefaciens) is a plant pathogen capable of inducing tumorigenesis (Smith & Townsend, 1907). However, the descriptive assignment for Agrobacterium tumefaciens was later found to be contributed by a set of genes located on the large Ti plasmid that can be lost (Gordon & Christie, 2014). In other words, the curing of Ti plasmid in Agrobacterium tumefaciens will change its identity to the non-pathogenic species, Agrobacterium radiobacter. Furthermore, comparative molecular analysis based on single-copy housekeeping genes also supports the close relatedness of Agrobacterium radiobacter and Agrobacterium tumefaciens, blurring the taxonomic boundaries between these species (Mousavi et al., 2015; Shams et al., 2013). As taxa are reclassified into different populations that do not conform to the characteristics of the original description, the given names lose their significant and descriptive importance. Consistent with the Judicial Commission according to the Rules of the International Code of Nomenclature of Bacteria, Tindall (2014) concluded that the combination of Agrobacterium radiobacter has priority over the combination Agrobacterium tumefaciens when the two are treated as members of the same species since Agrobacterium radiobacter was the first proposed and described in 1902 whereas Agrobacterium tumefaciens was first proposed and described in 1907) (Tindall, 2014). However, given that Agrobacterium tumefaciens has been more widely studied than Agrobacterium radiobacter due to its strong relevance to agriculture (Bourras, Rouxel & Meyer, 2015), it remains unclear but interesting to see if the broader scientific community will obey this rule by adopting the recommended species name change in future studies.
To our knowledge, a detailed comparative genomics analysis of Agrobacterium radiobacter and Agrobacterium tumefaciens type strains has not been reported despite their genome availability (Zhang et al., 2014). The high genomic relatedness of both type strains was briefly mentioned by Kim & Gan (2017) through whole genome alignment and pairwise nucleotide identity calculation from homologous regions. However, evidence is now mounting that the Agrobacterium radiobacter DSM 30147T reported by Zhang et al. (2014) is contaminated, warranting immediate investigation (Jeong, Pan & Park, 2016). The assembled genome is nearly 7 megabases, the largest among Agrobacterium currently sequenced at that time with up to 6,853 predicted protein-coding genes contained in over 600 contigs. At sequencing depth of nearly 200×, its genome assembly is unusually fragmented even for a challenging microbial genome (Utturkar et al., 2017). Furthermore, the phylogenomic placement of Agrobacterium radiobacter DSM 30147T based on this genome assembly has been questionable as evidenced by its basal position and substantially longer branch length relative to other members of the species (Gan & Savka, 2018). The overly fragmented nature of this assembly also precludes fruitful comparative genomics focusing on gene synteny analysis. More importantly, analysis done on a contaminated assembly but with the assumption that it is not, will likely lead to incorrect biological interpretations (Allnutt et al., 2018).
In this study, we sequenced the whole genome of Agrobacterium radiobacter using a type strain that was sourced from the National Collection of Plant Pathogenic Bacteria (NCPPB). We produced a contiguous genome assembly exhibiting genomic statistics that are more similar to other assembled Agrobacterium genomes. We show here, through comparative genomics and phylogenetics, that the previously assembled Agrobacterium radiobacter DSM 30147T genome contains substantial genomic representation from another Agrobacterium sp. isolated and sequenced by the same lab, consistent with our initial suspicion of strain contamination. Using the newly assembled genome for subsequent comparative analysis, we provide genomic evidence that Agrobacterium radiobacter DSM 30147T and Agrobacterium tumefaciens B6T are the same species. However, strain DSM 30147T should not be considered as a merely non-tumorigenic strain of Agrobacterium tumefaciens as substantial genomic variation exists between these two type strains notably in the nucleotide sugar metabolism pathway that may contribute to their ecological niche differentiation.
Materials and Methods
DNA extraction and whole genome sequencing
Approximately 10 bacterial colonies were scrapped using a sterile P200 pipette tip from a 3-day-old nutrient agar culture and resuspended in lysis buffer with proteinase K (Sokolov, 2000) followed by incubation at 56 °C for 3 h. DNA purification was performed as previously described. The extracted DNA was normalized to 0.2 ng/μL and prepared using the Nextera XT library preparation kit (Illumina, San Diego, CA, USA) according to the manufacturer’s instructions. The library was sequenced on an Illumina MiSeq desktop sequencer located at the Monash University Malaysia Genomics Facility (2 × 250 bp run configuration) that routinely sequences mostly decapod crustacean mitogenomes (Gan, Tan & Austin, 2016a; Gan et al., 2016b; Tan et al., 2015) and occasionally microbial genomes (Gan et al., 2014, 2015; Wong et al., 2014) without prior history of processing any member from the Agrobacterium genomospecies 4.
De novo assembly and genome completeness assessment
Raw paired-end reads were adapter-trimmed using Trimmomatic v0.36 (Bolger, Lohse & Usadel, 2014) followed by error-correction and de novo assembly using Spades Assembler v3.9 (Bankevich et al., 2012) (See Data S1 for specific trimming and assembly settings). Genome completeness was assessed with BUSCOv3 (Rhizobiales database) (Waterhouse et al., 2017).
Protein clustering
Gene prediction used Prodigal v2.6 (Hyatt et al., 2010). Clustering of the predicted coding sequence was performed with CD-HIT-EST using the settings “-C 0.95, -T 0.8” (Li & Godzik, 2006). Identification of unique and shared clusters were done using basic unix commands, for example, csplit, grep, sort and uniq. The specific commands used and files generated during clustering can be found in the Zenodo database (https://doi.org/10.5281/zenodo.1489356).
Phylogenetic analysis
Reconstruction of the Agrobacterium phylogeny used PhyloPhlAN (Segata et al., 2013). PhyloPhlAN is a bioinformatic pipeline that identifies conserved proteins (400 markers) from microbial genomes and uses them to construct a high-resolution phylogeny using maximum likelihood inference approach (Price, Dehal & Arkin, 2010). For single gene tree construction, protein sequences were aligned with mafft v7.3 (Katoh & Standley, 2013) using the the most accurate setting (–localpair –maxiterate 1000) followed by phylogenetic tree construction via IqTree v1.65 with optimized model (Kalyaanamoorthy et al., 2017; Nguyen et al., 2014). Visualization and annotation of phylogenetic trees was performed with Figtree v1.4.3 (http://tree.bio.ed.ac.uk/software/figtree/).
Pan-genome construction and phylogenomics
Whole genome sequences were reannotated with Prokka v1.1 using the default setting (Seemann, 2014). The Prokka-generated gff files were used as the input for Roary v3.12.0 to calculate the pan-genome (Page et al., 2015). Maximum likelihood tree construction of the core-genome alignment and tree visualization used FastTree2 v2.1.10 (-nt -gtr) (Price, Dehal & Arkin, 2010) and FigTree v 1.4.3, respectively. Input and output files associated with the Roary analysis have been deposited in the Zenodo database (https://doi.org/10.5281/zenodo.1489356).
Detection and visualization of Ti plasmid
Genome sequences of each member of the genomospecies 4 except for the problematic DSM 37014T strain were used as the query for blastN search (e-value 1e−100) against the octopine-type Ti plasmid (Altschul et al., 1990). The result of the similarity search was subsequently visualized in Blast Ring Image Generator v0.95 (Alikhan et al., 2011).
Genome annotation and KEGG pathway reconstruction
Whole genome sequences of Agrobacterium tumefaciens B6T and Agrobacterium radiobacter NCPPB 3001T were submitted to the online server GhostKoala (Kanehisa, Sato & Morishima, 2016b) for annotation and the annotated genomes were subsequently used to reconstruct KEGG pathways (Kanehisa et al., 2016a) in the same webserver. Identification of proteins with TIGRFAM signatures of interest (Haft, Selengut & White, 2003) used HMMsearch v3.1b2 with the option “–cut_tc” activated to filter for only protein hits passing the TIGRFAM trusted cutoff values (Johnson, Eddy & Portugaly, 2010).
Results
An improved Agrobacterium radiobacter type strain genome
Raw sequencing data and whole genome assembly for strains B6 and NCPPB3001 reported in this study are linked to the NCBI Bioproject IDs PRJNA300485 and PRJNA300611, respectively. The newly assembled genome of Agrobacterium radiobacter type strain that was sourced from the NCPPB is approximately 30% smaller than the first reported Agrobacterium radiobacter DSM 30147T genome with 96% less contigs (22 vs 612), 20-fold longer N50 (480 vs 23 kb) and assembled length that is much more similar to other Agrobacterium spp. (Table 1). In addition, it is near-complete with 685 out of 686 BUSCO Rhizobiale single-copy genes detected as either partial or complete with minimal evidence of contamination as indicated by the near absence of duplicated single-copy gene(<0.1%). On the contrary, the current DSM 30147 genome is missing 25.1% of the single copy gene with up to 34.8% duplication rate. At the time of this manuscript writing, another genome of Agrobacterium radiobacter type strain that was sourced from another culture collection centre, for example, the Belgian Coordinated Collections of Microorganisms has been deposited in the NCBI wgs database (Agrobacterium radiobacter LMG140T; Table 1) with assembly statistics that are highly similar to the type strain genome reported in this study.
Table 1. Genome statistics of publicly available Agrobacterium genomospecies 4 whole genome sequences.
| Assembly accession | Strain | Isolation source | Country | Size | GC% | # Contig |
|---|---|---|---|---|---|---|
| GCF_900045375 | B6 | Apple Gall (Iowa) | USA | 5.8 | 59.07 | 4 |
| GCF_001541315* | B6 | Apple Gall (Iowa) | USA | 5.6 | 59.32 | 52 |
| GCF_001692245 | B140/95 | Peach/Almond Rootstock | USA | 5.7 | 59.23 | 45 |
| GCF_002179795 | LMG 215 | Humulus lupulus gall (USA) | USA | 5.4 | 59.48 | 33 |
| GCF_000233975 | CCNWGS0286 | R. pseudoacacia nodules | China | 5.2 | 59.53 | 49 |
| GCF_900011755 | Kerr 14 = LMG 15 = CFBP 5761 | Soil around Prunus dulcis | Australia | 5.9 | 59.04 | 5 |
| GCF_002591665 | 186 | English Walnut gall | California | 5.7 | 59.42 | 22 |
| GCF_002008215 | LMG 140 = NCPPB 3001 = CFBP 5522= DSM 30147 | Saprobic soil | Germany | 5.5 | 59.34 | 22 |
| GCF_000421945 | LMG 140 = NCPPB 3001 = CFBP 5522 = DSM 30147 | Saprobic soil | Germany | 7.17 | 59.86 | 612 |
| GCF_001541305* | LMG 140 = NCPPB 3001 = CFBP 5522 = DSM 30147 | Saprobic soil | Germany | 5.5 | 59.36 | 22 |
| GCF_900012605 | CFBP 5621 | Lotus corniculata, root tissue commensal | France | 5.4 | 59.32 | 3 |
| GCF_003031125 | LAD9 (CGMCC No. 2962) | Landfill leachate treatment system | China | 5.9 | 59.13 | 49 |
| GCF_000384555 | 224MFTsu31 | Rhizosphere of L. luteus in Hungary, formerly R. lupini H13-3 | USA | 4.8 | 59.73 | 21 |
| GCF_900188475 | 719_389 | Rhizosphere and endosphere of Arabidopsis thaliana. | USA | 4.9 | 59.73 | 18 |
| GCF_000384555 | UNC420CL41Cvi | Plant associated | USA | 5 | 59.69 | 18 |
Note:
Reported in this study.
The inflated genome size of Agrobacterium radiobacter DSM 30147(T) is due to technical errors
Instead of sharing a recent common ancestor as would be expected for a recently duplicated gene, the duplicated single copy genes coding for seryl-tRNA synthetase in Agrobacterium radiobacter DSM 30147T were placed in two distinct clusters with one affiliated to genomospecies 4 and the other affiliated to genomospecies 7 (Fig. 1A). Such an unexpected clustering pattern raises the suspicion of genome assembly from two or more non-clonal bacterial strains. In addition, by performing comparison at the genome-scale based on whole proteome clustering of Agrobacterium radiobacter DSM 30147T/NCPPB 3001T (Previous study, GCF_000421945; This study, GCF_001541305), A. sp. TS43 (unpublished, GCF_001526605) and Agrobacterium tumefaciens B6 (GCF_001541315), we observed a high number of proteins that were exclusively shared between Zhang et al. Agrobacterium radiobacter DSM 30147 and A. sp. TS43 belonging to genomospecies 7 (Fig. 1B). Coincidentally, despite not sharing the same Bioproject ID, the whole genomes of strains DSM 30147T and TS43 were sequenced by the Zhang et al., and submitted to NCBI on the same date, 30 May 2013, hinting strain contamination during sample processing in the lab.
Figure 1. Phylogenetic and genomic evidence indicating contamination in the published A. radiobacter DSM 30147T genome.

(A) Maximum likelihood phylogenetic tree of seryl-tRNA synthetases from Agrobacterium genomospecies 4 and 7. Codes after the tildes are contigs containing the corresponding homologs. Node labels indicate ultra-fast bootstrap support value and branch length indicates number of substitutions per site. Duplicated homologs in the problematic A. radiobacter DSM 30147 genome were colored red. (B) Venn diagram of the core proteome of selected Agrobacterium strains from genomospecies 4. Numbers in the overlapping regions indicate the number of coding sequences (CDS) that shared by two or more groups at 95% nucleotide identity cutoff.
Genome-scale average nucleotide identity calculation supports the amalgamation of Agrobacterium radiobacter and Agrobacterium tumefaciens into a single genomospecies
Single gene tree shows that Agrobacterium radiobacter NCPPB 3001T and Agrobacterium tumefaciens B6T belong to the genomospecies 4 clade (Fig. 1A), corroborating with the PhyloPhlAN phylogenomic tree that was constructed based on the alignment of 400 universal single-copy proteins (Fig. S1). The pairwise average nucleotide identity (ANI) among strains within this clade is consistently more than 95% further supporting their affiliation to the same genomospecies (Fig. 2) (Coutinho et al., 2016; Jain et al., 2018). As expected, pairwise ANI of less than 92% was observed when they were compared with strains from genomospecies 7 (strains RV3 and Zutra 3/1). A 100% pairwise ANI was observed between Agrobacterium radiobacter type strains that were sourced from NCPPB and LMG. In addition, non-type strains B140/95 and CFBP5621 also exhibit a strikingly high pairwise ANI (>99%) to the type strains of Agrobacterium tumefaciens and Agrobacterium radiobacter, respectively, leading to the formation of sub-clusters within genomospecies 4 (Fig. 2).
Figure 2. A heatmap showing the hierarchical clustering of Agrobacterium strains based on genomic distance.
Values in boxes indicate pairwise average nucleotide identity. Horizontal colored bar below the heatmap indicate the genomospecies assigned to each genome (G7, genomospecies 7; G4, genomospecies 4). Boxed labels indicate genomes sequenced in this study.
Is Agrobacterium radiobacter a non-tumorigenic strain of Agrobacterium tumefaciens?
A majority of the currently sequenced strains from genomospecies 4 are non-tumorigenic as evidenced by the near complete lack of genomic region with significant nucleotide similarity to the octopine-type Ti reference plasmid (Fig. 3). Of the 14 genomes analyzed, only strains B6T and B140/95 exhibit a complete coverage of the Ti plasmid with near 100% sequence identity while strain 186 shows hits mainly to the essential gene clusters of a Ti plasmid such as the vir gene cluster (black rings and gene labels in Fig. 3) at a substantially lower sequence identity (50% < x < 90%) (Fig. 3), suggesting that it may be harboring a dissimilar variant of Ti plasmid, for example, different opine type. In addition, although lacking hits to the virulence gene of the Ti plasmid, the tra and trb clusters involved in plasmid conjugal transfer are present in strains Kerr 14, CCNWGS0286 and UNC420CL41Cvi. Despite belonging to the same genomospecies, core genome alignment and phylogenomic analysis indicates that Agrobacterium radiobacter NCPPB3001T is sufficiently divergent from Agrobacterium tumefaciens B6T leading to their separation into two distinct sub-clusters (Fig. 4A). This is also resonated by their different sub-cluster placement in the pairwise ANI heatplot (Fig. 2). Furthermore, strains from both subclades could be broadly differentiated by the set of core accessory genes that they harbor (Fig. 4B). Therefore, even though Agrobacterium radiobacter does not harbor a Ti plasmid, it cannot be considered as a non-tumorigenic strain of Agrobacterium tumefaciens given multiple lines of evidence indicating its substantial genomic divergence from Agrobacterium tumefaciens.
Figure 3. Prevalence and sequence conservation of the octopine-type Ti plasmid among Agrobacterium genomospecies 4.
Each genome (labelled 1–15) is represented by a colored ring shaded based on nucleotide percentage similarity to the reference Ti plasmid (min. 50%; max. 100%). The outermost ring highlights the gene regions involved in tumorigenesis (vir, iaa and ipt) and plasmid conjugation (trb and tra). Asterisks indicate genomes sequenced in this study.
Figure 4. Genomic divergence among genomospecies 4 strains.
(A) Unrooted maximum likelihood tree constructed based on the core genome alignment. Branch length and node labels indicate number of substitutions per site and FastTree2 SH-like support values, respectively. Putative subclades were colored blue, red and purple (B) Distribution of accessory (non-core) gene clusters among strains determined with Roary and plotted with the perl script roary2svg.pl (https://github.com/sanger-pathogens/Roary/blob/master/contrib/roary2svg/roary2svg.pl). A total of 7,906 accessory gene clusters were identified by Roary and the number of accessory genes presence in each genome are shown in the most right column. Vertical gray lines/bars along the plot indicate presence of accessory gene. Asterisks indicate genomes sequenced in this study.
Agrobacterium genomospecies 4 strains differ in their genomic potential for nucleotide sugar metabolism
Individual comparison of the reconstructed KEGG pathways in Agrobacterium tumefaciens (Fig. 5A) and Agrobacterium radiobacter (Fig. 5B) revealed stark contrast in the anabolism of dTDP-L-rhamnose which is commonly found in the O-antigen of lipopolysaccharide (LPS) in gram-negative bacteria. Surprisingly, the entire enzyme set required for the generation of dTDP-L-rhamnose from D-glucose-phosphate (Table 2) is absent in Agrobacterium tumefaciens B6, suggesting that this common nucleotide sugar may be absent from the LPS O-antigen of strain B6. A manual inspection of the accessory genes uniquely shared by Agrobacterium tumefaciens strains B6 and B140/95 identified a homolog cluster containing GDP-L-fucose synthase (EC 1.1.1.271) that is involved in the enzymatic production of GDP-L-fucose from GDP-4-dehydro-6-deoxy-D-mannose and NADH (Table 2; Fig. 5C). As expected, the genes coding for this enzyme and GDP-mannose 4,6-dehydratase involved in the conversion of GDP-alpha-D-mannose to GDP-4-dehydro-6-deoxy-D-mannose, are absent in the Agrobacterium radiobacter NCPPB3001 genome (Fig. 5D). Intriguingly, HMMsearch scan revealed the presence of two protein hits to the TIGR01479 HMM profile in Agrobacterium tumefaciens B6 that corresponds to D-mannose 1,6-phosphomutase (EC 5.4.2.8) required for the synthesis of D-mannose 6-phosphate. In addition to strain B6, its close relative, strain B140/95, and a more distantly related strain Kerr14 also harbor two copies of this gene. However, one of the D-mannose 1,6-phosphomutases in strain Kerr14 is more divergent with a lower TIGRFAM HMM sequence score (Table 2). Furthermore, it exhibits less than 70% protein identity to the Agrobacterium tumefaciens B6 and B140/95 homologs, forming a private protein cluster in the pan-genome (data not shown).
Figure 5. KEGG pathway of nucleotide sugar metabolism associated with Agrobacterium lipopolysaccharide synthesis.
(A & B) genomic potential of A. tumefaciens B6 and A. radiobacter DSM 30147, respectively, in the biosynthesis of dTDP-L-rhamnose. (C & D) genomic potential of A. tumefaciens B6 and A. radiobacter DSM 30147, respectively, in the biosynthesis of GDP-L-Fucose. Numbers in boxes indicate Enzyme Commission numbers. White and green boxes indicate absence and presence of the corresponding enzymes, respectively, based on GhostKoala annotation (Kanehisa, Sato & Morishima, 2016b).
Table 2. Identification of Agrobacterium proteins with TIGRFAM domains involved in the biosynthesis of nucleotide sugar.
| Assembly ID | Strain | TIGR01479 (EC 5.4.2.8) | TIGR01472 (EC 4.2.1.47) | TIGR01207 (EC 2.7.7.24) | TIGR01181 (EC 4.2.1.46) | TIGR01221 (EC 5.1.3.13) | TIGR01214 (EC 1.1.1.133) | |
|---|---|---|---|---|---|---|---|---|
| 1st hit | 2nd hit | |||||||
| GCF_900045375 | B6 | 690.2 | 566.6 | 589.5 | ||||
| GCF_001541315 | B6 | 690.2 | 566.6 | 589.5 | ||||
| GCF_001692245 | B140/95 | 690.2 | 566.6 | 589.5 | ||||
| GCF_900011755 | Kerr14 | 691.3 | 690.2 | 428.6* | ||||
| GCF_001541305 | NCPPB3001 | 690.2 | 494.6 | 488.5 | 215.4 | 331.5 | ||
| GCF_002008215 | LMG140 | 690.2 | 494.6 | 488.5 | 215.4 | 331.5 | ||
| GCF_900012605 | CFBP5621 | 689.3 | 494.6 | 489.5 | 215.4 | 331.5 | ||
| GCF_002591665 | 186 | 689.3 | 494.6 | 488.5 | 215.4 | 331.8 | ||
| GCF_003031125 | LAD9 | 688.5 | 494.4 | 487.9 | 215.4 | 329.9 | ||
| GCF_000233975 | CCNWGS | 644.8 | 494.6 | 487.5 | 215.4 | 331.8 | ||
| GCF_002179795 | LMG215 | 690.2 | ||||||
| GCF_000384555 | 224MFTsu31 | 644.8 | ||||||
| GCF_000482285 | UNC420CL41Cvi | 644.8 | ||||||
| GCF_900188475 | 719_389 | 687.5 | ||||||
Notes:
Numbers indicate bit scores calculated based on protein alignment to the model with higher scores indicating stronger and more significant hits.
Formed a separate protein cluster from the rest of genomospecies 4 GDP-mannose-4,6-dehydratase orthologs (<70% pairwise protein identity).
Discussion
We re-sequenced the genome of Agrobacterium radiobacter type strain using strain directly obtained from NCPPB. The assembled Agrobacterium radiobacter genome reported in this study exhibits assembly statistics that are consistent with a high-quality draft genome such as high genome completeness and contiguity, near-zero contamination/duplication and comparable genome size to other closely related strains (Gan, Lee & Savka, 2018; Parks et al., 2015). Furthermore, given the improved contiguity and dramatic reduction in the number of contigs of this newly assembled draft genome, we recommend using this genome in place of the previously published draft genome for future Agrobacterium comparative studies.
The distinct separation of Agrobacterium genomospecies 4 and 7 at 95% ANI cutoff corroborates with the previously established “genomic yardstick” for species differentiation (Konstantinidis & Tiedje, 2005; Richter & Rosselló-Móra, 2009). Using this percentage cutoff, the ANI approach has been successfully used to provide a near “black-and-white” pattern of species separation in even some of the most diverse bacterial genera such as Pseudomonas, Arcobacter and Stenotrophomonas (Pérez-Cataluña et al., 2018; Tran, Savka & Gan, 2017; Vinuesa, Ochoa-Sánchez & Contreras-Moreira, 2018). Given the increasing evidence highlighting the robustness and reliability of the ANI approach in species delineation, the pairwise ANI between Agrobacterium tumefaciens and Agrobacterium radiobacter type strains that is at least 2.5% higher than the 95% cutoff value is rigorous evidence that they belong to the same genomospecies, effectively serving as the final nail in the coffin for the decade-long debate on their taxonomic status. The amalgamation of Agrobacterium radiobacter and Agrobacterium tumefaciens into a single species have been repeatedly suggested in the past few years but was complicated by the special status of Agrobacterium tumefaciens as the type species of the genus Agrobacterium despite the priority that Agrobacterium radiobacter has over Agrobacterium tumefaciens as it was isolated and described 3 years before Agrobacterium tumefaciens (Young et al., 2001, 2003). Despite sharing numerous morphological and biochemical features, differences in genomic features such as pairwise ANI, phylogenomic clustering and core accessory gene contents do exist among members in Agrobacterium genomospecies 4 that can facilitate the identification of genotypic and phenotypic variants to accurately delimit sub-species relationships in the future (Brenner, Staley & Krieg, 2000; Jezbera et al., 2011; Meier-Kolthoff et al., 2014; Tan et al., 2013).
To date the LPS for both type strains have been determined (De Castro et al., 2002, 2004). In stark contrast to Agrobacterium radiobacter, the Agrobacterium tumefaciens LPS consists of D-arabinose and L-fucose that have yet been reported to date in another members of the genus Agrobacterium (De Castro et al., 2002). The presence of the L configuration of fucose is considered to be rare even among plant pathogenic bacteria but may be associated with the ability of Agrobacterium tumefaciens to colonize or bind to wounded plant cell (Lippincott, Whatley & Lippincott, 1977; Whatley et al., 1976; Whatley & Spiess, 1977). It has been previously shown that the LPS of Agrobacterium tumefaciens but not Agrobacterium radiobacter can bind to the plant cells thus providing protection against subsequent infection by pathogenic strains (Whatley et al., 1976). The presence and absence of nucleotide sugars in the O-chain constituent of LPS in both type strains corroborates with their observed genomic potential in the nucleotide sugar metabolism pathway thus underscoring the utility of comparative genomics in facilitating the prediction of microbial host range and ecological niche (Klosterman et al., 2011). For example, the absence of L-rhamnose and L-fucose in the LPS of Agrobacterium tumefaciens B6 and Agrobacterium radiobacter DSM30147, respectively, is consistent with the lack of genes coding for enzymes involved with the particular nucleotide sugar metabolism. Generation of Agrobacterium tumefaciens B6 LPS mutant via targeted gene deletion (Kaczmarczyk, Vorholt & Francez-Charlot, 2012) or the classical but more laborious transposon mutagenesis approach followed by characterization of the LPS mutant host-range and phytopathogenicity will be instructive (Gan et al., 2011; Reuhs et al., 2005).
Our current genomic sampling indicates that the Ti plasmid appears to be restricted to the Agrobacterium tumefaciens subclade. The maintenance of the Ti plasmid is metabolically taxing given its large size (Barker et al., 1983; Glick, 1995). Even if the Ti plasmid was conjugally transfer, for example, to Agrobacterium radiobacter, the inability of Agrobacterium radiobacter to colonize plant host as evidenced by its LPS incompatibility will not confer an advantage to the new plasmid host in a natural environment (Thomashow et al., 1980). Furthermore, in the absence of high density Acyl-homoserine lactone (AHL) signals which is required to trigger Ti plasmid conjugation (Fuqua & Winans, 1994; Pappas, 2008; Zhang, Wang & Zhang, 2002), the newly acquired Ti plasmid in Agrobacterium radiobacter may be cured in its natural soil habitat after a few generations. Although the spontaneous transfer of the Ti plasmid from tumorigenic Agrobacterium tumefaciens to Agrobacterium radiobacter K84 has been reported previously, strain K84 was re-classified based on a recent core gene analysis to Rhizobium rhizogenes K84 (Velázquez et al., 2010; Vicedo et al., 1996), reiterating the pervasive taxonomic inconsistency within the genus Agrobacterium that may have confound previous biological interpretations (De Ley et al., 1966; Lindström et al., 1995; Young, 2008). Given that a large majority of Agrobacterium genetics was performed during the pre-NGS era (Gan & Savka, 2018), it remains unknown as to how many Agrobacterium tumefaciens and Agrobacterium radiobacter strains have been molecularly misclassified due to their high genomic relatedness.
The inability to accurately identify plasmid and chromosomal-derived contigs among the draft genomes means that some of the core accessory genes among tumorigenic strains may be plasmid-derived and should be treated with caution as the low-copy-number Ti-plasmid is prone to curing in the absence of AHL signals. Despite the value of complete genome assembly in enabling the accurate partitioning of plasmid and chromosomal genomic region (Arredondo-Alonso et al., 2017), the representation of complete Agrobacterium genomes in current database is still very low as a majority of the genomes were assembled from short Illumina reads that cannot effectively span repetitive region (Wibberg et al., 2011; Wood et al., 2001). Furthermore, most Agrobacterium strains harbor multiple large plasmids that further complicate short-read-only assembly graph (Kado & Liu, 1981; Lowe et al., 2009; Shao et al., 2018). Given the currently available genomic resources for Agrobacterium, defining subspecies within the Agrobacterium genomospecies 4 based on the identification of lineage-specific gene set (Moldovan & Gelfand, 2018) will be challenging. However, we anticipate that the advent of high throughput long-read sequencing that can span large repetitive region in recent years is likely going to overcome this limitation allowing a more accurate depiction of microbial pangenome (Gan et al., 2012; Gan, Lee & Austin, 2017; Schmid et al., 2018a, 2018b). Future hybrid genome assemblies (Illumina and Nanopore/PacBio reads) of members from genomospecies 4 with comprehensive metadata and reliable phenotypic information, will be instructive.
Conclusions
Despite belonging to the same genomospecies, Agrobacterium tumefaciens and Agrobacterium radiobacter are by no means clonal at the chromosomal level and instead demonstrate sufficient genomic characters that qualify their separation into two sub-species. In addition, the difference in the LPS profile among two type strains will have implications to host specificity leading to geographical separation. In the spirit of preserving the naming of both species but at the same time respecting the taxonomic jurisdiction for strain priority, we propose Agrobacterium tumefaciens to be reclassified as Agrobacterium radiobacter subsp. tumefaciens and for Agrobacterium radiobacter to retains its species status with the proposed name of Agrobacterium radiobacter subsp. radiobacter.
Supplemental Information
The tree was rooted with members from the species Rhizobium rhizogenes (labeled as Agrobacterium rhizogenes) as the outgroup. Blue and red-colored clades belong to Agrobacterium genomospecies 4 and 7, respectively. Node labels indicate local SH-like support values. Branch lengths indicate the number of substitutions per site.
Funding Statement
Michael A. Savka and Han Ming Gan received support from the College of Science and the Thomas H. Gosnell School of Life Sciences at Rochester Institute of Technology. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Additional Information and Declarations
Competing Interests
The authors declare that they have no competing interests.
Author Contributions
Han Ming Gan conceived and designed the experiments, performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, prepared figures and/or tables, authored or reviewed drafts of the paper, approved the final draft.
Melvin V.L. Lee performed the experiments, analyzed the data, contributed reagents/materials/analysis tools, prepared figures and/or tables, approved the final draft.
Michael A. Savka conceived and designed the experiments, analyzed the data, authored or reviewed drafts of the paper, approved the final draft.
DNA Deposition
The following information was supplied regarding the deposition of DNA sequences:
Raw sequencing data and whole genome assembly for strains B6 and NCPPB3001 reported in this study are linked to the NCBI Bioproject IDs PRJNA300485 and PRJNA300611, respectively.
Data Availability
The following information was supplied regarding data availability:
LMVK00000000.1, ASM154131v1: https://www.ncbi.nlm.nih.gov/assembly/GCF_001541315.1;
LMVJ00000000.1, ASM154130v1: https://www.ncbi.nlm.nih.gov/assembly/GCF_001541305.1;
Code and data are available at Han Ming Gan. (2018). Dataset for “Improved genome of Agrobacterium radiobacter type strain provides new taxonomic insight into Agrobacterium genomospecies 4” [Data set]. Zenodo. DOI 10.5281/zenodo.1489356.
References
- Alikhan et al. (2011).Alikhan N-F, Petty NK, Zakour NLB, Beatson SA. BLAST ring image generator (BRIG): simple prokaryote genome comparisons. BMC Genomics. 2011;12(1):1. doi: 10.1186/1471-2164-12-402. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Allnutt et al. (2018).Allnutt T, Yan CZY, Crowley TM, Gan HM. Commentary: genome sequence of Vibrio parahaemolyticus VP152 strain isolated from Penaeus indicus in malaysia. Frontiers in microbiology. 2018;9:865. doi: 10.3389/fmicb.2018.00865. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Altschul et al. (1990).Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. Journal of Molecular Biology. 1990;215(3):403–410. doi: 10.1016/s0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- Arredondo-Alonso et al. (2017).Arredondo-Alonso S, Willems RJ, Van Schaik W, Schürch AC. On the (im)possibility of reconstructing plasmids from whole-genome short-read sequencing data. Microbial Genomics. 2017;3(10):e000128. doi: 10.1099/mgen.0.000128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bankevich et al. (2012).Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. Journal of Computational Biology. 2012;19(5):455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker et al. (1983).Barker R, Idler K, Thompson D, Kemp J. Nucleotide sequence of the T-DNA region from the Agrobacterium tumefaciens octopine Ti plasmid pTi15955. Plant Molecular Biology. 1983;2(6):335–350. doi: 10.1007/bf01578595. [DOI] [PubMed] [Google Scholar]
- Beijerinck & Van Delden (1902).Beijerinck M, Van Delden A. On a colourless bacterium, whose carbon food comes from the athmosphere. Koninklijke Nederlandse Akademie Van Wetenschappen Proceedings Series B Physical Sciences. 1902;5:398–413. [Google Scholar]
- Bobay & Ochman (2017).Bobay L-M, Ochman H. Biological species are universal across life’s domains. Genome Biology and Evolution. 2017;9(3):491–501. doi: 10.1093/gbe/evx026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bolger, Lohse & Usadel (2014).Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bourras, Rouxel & Meyer (2015).Bourras S, Rouxel T, Meyer M. Agrobacterium tumefaciens gene transfer: how a plant pathogen hacks the nuclei of plant and nonplant organisms. Phytopathology. 2015;105(10):1288–1301. doi: 10.1094/phyto-12-14-0380-rvw. [DOI] [PubMed] [Google Scholar]
- Bouzar & Jones (2001).Bouzar H, Jones JB. Agrobacterium larrymoorei sp. nov., a pathogen isolated from aerial tumours of Ficus benjamina. International Journal of Systematic and Evolutionary Microbiology. 2001;51(3):1023–1026. doi: 10.1099/00207713-51-3-1023. [DOI] [PubMed] [Google Scholar]
- Brenner, Staley & Krieg (2000).Brenner D, Staley J, Krieg N. Classification of procaryotic organisms and the concept of bacterial speciation. In: Boone DR, Castenholz RW, Garrity GM, editors. Bergey’s Manual of Systematic Bacteriology. Second Edition. Vol. 1. New York: Springer Verlag; 2000. pp. 27–38. [DOI] [Google Scholar]
- Conn (1942).Conn HJ. Validity of the genus Alcaligenes. Journal of Bacteriology. 1942;44:353–360. doi: 10.1128/jb.44.3.353-360.1942. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coutinho et al. (2016).Coutinho F, Tschoeke DA, Thompson F, Thompson C. Comparative genomics of Synechococcus and proposal of the new genus Parasynechococcus. PeerJ. 2016;4:e1522. doi: 10.7717/peerj.1522. [DOI] [PMC free article] [PubMed] [Google Scholar]
- De Castro et al. (2004).De Castro C, Bedini E, Garozzo D, Sturiale L, Parrilli M. Structural determination of the O-chain moieties of the lipopolysaccharide fraction from Agrobacterium radiobacter DSM 30147. European Journal of Organic Chemistry. 2004;2004:3842–3849. doi: 10.1002/ejoc.200400238. [DOI] [Google Scholar]
- De Castro et al. (2002).De Castro C, De Castro O, Molinaro A, Parrilli M. Structural determination of the O-chain polysaccharide from Agrobacterium tumefaciens, strain DSM 30205. European Journal of Biochemistry. 2002;269(12):2885–2888. doi: 10.1046/j.1432-1033.2002.02955.x. [DOI] [PubMed] [Google Scholar]
- De Ley et al. (1966).De Ley J, Bernaerts M, Rassel A, Guilmot J. Approach to an improved taxonomy of the genus Agrobacterium. Journal of General Microbiology. 1966;43(1):7–17. doi: 10.1099/00221287-43-1-7. [DOI] [PubMed] [Google Scholar]
- Farrand, Van Berkum & Oger (2003).Farrand SK, Van Berkum PB, Oger P. Agrobacterium is a definable genus of the family Rhizobiaceae. International Journal of Systematic and Evolutionary Microbiology. 2003;53(5):1681–1687. doi: 10.1099/ijs.0.02445-0. [DOI] [PubMed] [Google Scholar]
- Fuqua & Winans (1994).Fuqua WC, Winans SC. A LuxR-LuxI type regulatory system activates Agrobacterium Ti plasmid conjugal transfer in the presence of a plant tumor metabolite. Journal of Bacteriology. 1994;176(10):2796–2806. doi: 10.1128/jb.176.10.2796-2806.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gan et al. (2012).Gan HM, Chew TH, Tay Y-L, Lye SF, Yahya A. Genome sequence of Hydrogenophaga sp. strain PBC, a 4-aminobenzenesulfonate-degrading bacterium. Journal of Bacteriology. 2012;194(17):4759–4760. doi: 10.1128/jb.00990-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gan et al. (2015).Gan HM, Gan HY, Ahmad NH, Aziz NA, Hudson AO, Savka MA. Whole genome sequencing and analysis reveal insights into the genetic structure, diversity and evolutionary relatedness of luxI and luxR homologs in bacteria belonging to the Sphingomonadaceae family. Frontiers in Cellular and Infection Microbiology. 2015;4:188. doi: 10.3389/fcimb.2014.00188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gan et al. (2014).Gan HY, Gan HM, Savka MA, Triassi AJ, Wheatley MS, Smart LB, Fabio ES, Hudson AO. Whole-genome sequences of 13 endophytic bacteria isolated from shrub willow (Salix) grown in geneva, New York. Genome Announcements. 2014;2(3):e00288-14. doi: 10.1128/genomeA.00288-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gan et al. (2011).Gan HM, Ibrahim Z, Shahir S, Yahya A. Identification of genes involved in the 4-aminobenzenesulfonate degradation pathway of Hydrogenophaga sp. PBC via transposon mutagenesis. FEMS Microbiology Letters. 2011;318(2):108–114. doi: 10.1111/j.1574-6968.2011.02245.x. [DOI] [PubMed] [Google Scholar]
- Gan, Lee & Austin (2017).Gan HM, Lee YP, Austin CM. Nanopore long-read guided complete genome assembly of Hydrogenophaga intermedia, and genomic insights into 4-aminobenzenesulfonate, p-aminobenzoic acid and hydrogen metabolism in the genus Hydrogenophaga. Frontiers in Microbiology. 2017;8:1880. doi: 10.3389/fmicb.2017.01880. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gan, Lee & Savka (2018).Gan HM, Lee MVJ, Savka MA. High-quality draft genome sequence of the type strain of Allorhizobium vitis, the primary causal agent of grapevine crown gall. Microbiology Resource Announcements. 2018;7(9):e01045-18. doi: 10.1128/mra.01045-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gan & Savka (2018).Gan HM, Savka MA. One more decade of Agrobacterium Taxonomy. Current Topics in Microbiology and Immunology. 2018;418:1–14. doi: 10.1007/82_2018_81. [DOI] [PubMed] [Google Scholar]
- Gan, Tan & Austin (2016a).Gan HM, Tan MH, Austin CM. The complete mitogenome of the red claw crayfish Cherax quadricarinatus (Von Martens, 1868)(Crustacea: Decapoda: Parastacidae) Mitochondrial DNA Part A. 2016a;27(1):385–386. doi: 10.3109/19401736.2014.895997. [DOI] [PubMed] [Google Scholar]
- Gan et al. (2016b).Gan HM, Tan MH, Eprilurahman R, Austin CM. The complete mitogenome of Cherax monticola (Crustacea: Decapoda: Parastacidae), a large highland crayfish from New Guinea. Mitochondrial DNA Part A. 2016b;27:337–338. doi: 10.3109/19401736.2014.892105. [DOI] [PubMed] [Google Scholar]
- Gaunt et al. (2001).Gaunt M, Turner S, Rigottier-Gois L, Lloyd-Macgilp S, Young J. Phylogenies of atpD and recA support the small subunit rRNA-based classification of rhizobia. International Journal of Systematic and Evolutionary Microbiology. 2001;51(6):2037–2048. doi: 10.1099/00207713-51-6-2037. [DOI] [PubMed] [Google Scholar]
- Glick (1995).Glick BR. Metabolic load and heterologous gene expression. Biotechnology Advances. 1995;13(2):247–261. doi: 10.1016/0734-9750(95)00004-a. [DOI] [PubMed] [Google Scholar]
- Gordon & Christie (2014).Gordon JE, Christie PJ. The Agrobacterium Ti plasmids. Microbiology Spectrum. 2014;2(6):PLAS-0010-2013. doi: 10.1128/microbiolspec.PLAS-0010-2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haft, Selengut & White (2003).Haft DH, Selengut JD, White O. The TIGRFAMs database of protein families. Nucleic Acids Research. 2003;31(1):371–373. doi: 10.1093/nar/gkg128. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hyatt et al. (2010).Hyatt D, Chen G-L, LoCascio PF, Land ML, Larimer FW, Hauser LJ. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 2010;11(1):119. doi: 10.1186/1471-2105-11-119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jain et al. (2018).Jain C, Rodriguez-R LM, Phillippy AM, Konstantinidis KT, Aluru S. High throughput ANI analysis of 90K prokaryotic genomes reveals clear species boundaries. Nature Communications. 2018;9:5114. doi: 10.1038/s41467-018-07641-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeong, Pan & Park (2016).Jeong H, Pan J-G, Park S-H. Contamination as a major factor in poor Illumina assembly of microbial isolate genomes. biorxiv preprint. 2016 doi: 10.1101/081885. [DOI] [Google Scholar]
- Jezbera et al. (2011).Jezbera J, Jezberová J, Brandt U, Hahn MW. Ubiquity of Polynucleobacter necessarius subspecies asymbioticus results from ecological diversification. Environmental Microbiology. 2011;13(4):922–931. doi: 10.1111/j.1462-2920.2010.02396.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson, Eddy & Portugaly (2010).Johnson LS, Eddy SR, Portugaly E. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinformatics. 2010;11(1):431. doi: 10.1186/1471-2105-11-431. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kaczmarczyk, Vorholt & Francez-Charlot (2012).Kaczmarczyk A, Vorholt JA, Francez-Charlot A. Markerless gene deletion system for sphingomonads. Applied and Environmental Microbiology. 2012;78(10):3774–3777. doi: 10.1128/AEM.07347-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kado & Liu (1981).Kado CI, Liu ST. Rapid procedure for detection and isolation of large and small plasmids. Journal of Bacteriology. 1981;145(3):1365–1373. doi: 10.1128/jb.145.3.1365-1373.1981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalyaanamoorthy et al. (2017).Kalyaanamoorthy S, Minh BQ, Wong TK, Von Haeseler A, Jermiin LS. ModelFinder: fast model selection for accurate phylogenetic estimates. Nature Methods. 2017;14(6):587–589. doi: 10.1038/nmeth.4285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kanehisa et al. (2016a).Kanehisa M, Sato Y, Kawashima M, Furumichi M, Tanabe M. KEGG as a reference resource for gene and protein annotation. Nucleic Acids Research. 2016a;44(D1):D457–D462. doi: 10.1093/nar/gkv1070. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kanehisa, Sato & Morishima (2016b).Kanehisa M, Sato Y, Morishima K. BlastKOALA and GhostKOALA: KEGG tools for functional characterization of genome and metagenome sequences. Journal of Molecular Biology. 2016b;428(4):726–731. doi: 10.1016/j.jmb.2015.11.006. [DOI] [PubMed] [Google Scholar]
- Katoh & Standley (2013).Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution. 2013;30(4):772–780. doi: 10.1093/molbev/mst010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keane, Kerr & New (1970).Keane PJ, Kerr A, New PB. Crown gall of stone fruit II. Identification and nomenclature of Agrobacterium isolates. Australian Journal of Biological Sciences. 1970;23(3):585–596. doi: 10.1071/bi9700585. [DOI] [Google Scholar]
- Kerr & Panagopoulos (1977).Kerr A, Panagopoulos CG. Biotypes of Agrobacterium radiobacter var. tumefaciens and their biological control. Journal of Phytopathology. 1977;90(2):172–179. doi: 10.1111/j.1439-0434.1977.tb03233.x. [DOI] [Google Scholar]
- Kim & Gan (2017).Kim K, Gan HM. A glimpse into the genetic basis of symbiosis between Hydrogenophaga and their helper strains in the biodegradation of 4-aminobenzenesulfonate. Journal of Genomics. 2017;5:77–82. doi: 10.7150/jgen.20216. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klosterman et al. (2011).Klosterman SJ, Subbarao KV, Kang S, Veronese P, Gold SE, Thomma BPHJ, Chen Z, Henrissat B, Lee Y-H, Park J, Garcia-Pedrajas MD, Barbara DJ, Anchieta A, De Jonge R, Santhanam P, Maruthachalam K, Atallah Z, Amyotte SG, Paz Z, Inderbitzin P, Hayes RJ, Heiman DI, Young S, Zeng Q, Engels R, Galagan J, Cuomo CA, Dobinson KF, Ma L-J. Comparative genomics yields insights into niche adaptation of plant vascular wilt pathogens. PLOS Pathogens. 2011;7(7):e1002137. doi: 10.1371/journal.ppat.1002137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Konstantinidis & Tiedje (2005).Konstantinidis KT, Tiedje JM. Genomic insights that advance the species definition for prokaryotes. Proceedings of the National Academy of Sciences of the United States of America. 2005;102(7):2567–2572. doi: 10.1073/pnas.0409727102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li & Godzik (2006).Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22(13):1658–1659. doi: 10.1093/bioinformatics/btl158. [DOI] [PubMed] [Google Scholar]
- Lindström et al. (1995).Lindström K, Van Berkum P, Gillis M, Martinez E, Novikova N, Jarvis B. Report from the roundtable on Rhizobium taxonomy. In: Tikhonovich IA, Provorov NA, Romanov VI, Newton WE, editors. Nitrogen Fixation. Fundamentals and Applications. Kluwer, Dordrecht: Springer; 1995. pp. 807–810. [Google Scholar]
- Lippincott, Whatley & Lippincott (1977).Lippincott BB, Whatley MH, Lippincott JA. Tumor induction by Agrobacterium involves attachment of the bacterium to a site on the host plant cell wall. Plant Physiology. 1977;59(3):388–390. doi: 10.1104/pp.59.3.388. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lowe et al. (2009).Lowe N, Gan HM, Chakravartty V, Scott R, Szegedi E, Burr TJ, Savka MA. Quorum-sensing signal production by Agrobacterium vitis strains and their tumor-inducing and tartrate-catabolic plasmids. FEMS Microbiology Letters. 2009;296(1):102–109. doi: 10.1111/j.1574-6968.2009.01627.x. [DOI] [PubMed] [Google Scholar]
- Meier-Kolthoff et al. (2014).Meier-Kolthoff JP, Hahnke RL, Petersen J, Scheuner C, Michael V, Fiebig A, Rohde C, Rohde M, Fartmann B, Goodwin LA, Chertkov O, Reddy TBK, Pati A, Ivanova NN, Markowitz V, Kyrpides NC, Woyke T, Göker M, Klenk H-P. Complete genome sequence of DSM 30083T, the type strain (U5/41T) of Escherichia coli, and a proposal for delineating subspecies in microbial taxonomy. Standards in Genomic Sciences. 2014;9(1):2. doi: 10.1186/1944-3277-9-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moldovan & Gelfand (2018).Moldovan MA, Gelfand MS. Pangenomic definition of prokaryotic species and the phylogenetic structure of Prochlorococcus spp. Frontiers in Microbiology. 2018;9:428. doi: 10.3389/fmicb.2018.00428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mousavi et al. (2015).Mousavi SA, Willems A, Nesme X, De Lajudie P, Lindstrom K. Revised phylogeny of Rhizobiaceae: proposal of the delineation of Pararhizobium gen. nov., and 13 new species combinations. Systematic and Applied Microbiology. 2015;38(2):84–90. doi: 10.1016/j.syapm.2014.12.003. [DOI] [PubMed] [Google Scholar]
- Nguyen et al. (2014).Nguyen L-T, Schmidt HA, Von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Molecular Biology and Evolution. 2014;32(1):268–274. doi: 10.1093/molbev/msu300. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Page et al. (2015).Page AJ, Cummins CA, Hunt M, Wong VK, Reuter S, Holden MT, Fookes M, Falush D, Keane JA, Parkhill J. Roary: rapid large-scale prokaryote pan genome analysis. Bioinformatics. 2015;31(22):3691–3693. doi: 10.1093/bioinformatics/btv421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Panagopoulos, Psallidas & Alivizatos (1978).Panagopoulos C, Psallidas P, Alivizatos A. Studies on biotype 3 of Agrobacterium radiobacter var. tumefaciens. Proceedings of the 4th International Conference on Plant Pathogenic Bacteria Vol 1; Institut National de la Recherche Agronomique, Anger, France: Sta. Path. Veg. Phytobact; 1978. pp. 221–228. [Google Scholar]
- Pappas (2008).Pappas KM. Cell–cell signaling and the Agrobacterium tumefaciens Ti plasmid copy number fluctuations. Plasmid. 2008;60(2):89–107. doi: 10.1016/j.plasmid.2008.05.003. [DOI] [PubMed] [Google Scholar]
- Parks et al. (2015).Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Research. 2015;25(7):1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pérez-Cataluña et al. (2018).Pérez-Cataluña A, Salas-Massó N, Dieguez ABL, Balboa S, Lema A, Romalde JL, Figueras MJ. Revisiting the taxonomy of the genus Arcobacter: getting order from the chaos. Frontiers in Microbiology. 2018;9:2077. doi: 10.3389/fmicb.2018.02077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Price, Dehal & Arkin (2010).Price MN, Dehal PS, Arkin AP. FastTree 2–approximately maximum-likelihood trees for large alignments. PLOS ONE. 2010;5:e9490. doi: 10.1371/journal.pone.0009490. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ramírez-Bahena et al. (2014).Ramírez-Bahena MH, Vial L, Lassalle F, Diel B, Chapulliot D, Daubin V, Nesme X, Muller D. Single acquisition of protelomerase gave rise to speciation of a large and diverse clade within the Agrobacterium/Rhizobium supercluster characterized by the presence of a linear chromid. Molecular Phylogenetics and Evolution. 2014;73:202–207. doi: 10.1016/j.ympev.2014.01.005. [DOI] [PubMed] [Google Scholar]
- Reuhs et al. (2005).Reuhs BL, Relić B, Forsberg LS, Marie C, Ojanen-Reuhs T, Stephens SB, Wong C-H, Jabbouri S, Broughton WJ. Structural characterization of a flavonoid-inducible Pseudomonas aeruginosa A-Band-Like O antigen of Rhizobium sp. strain NGR234, required for the formation of nitrogen-fixing nodules. Journal of Bacteriology. 2005;187(18):6479–6487. doi: 10.1128/jb.187.18.6479-6487.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Richter & Rosselló-Móra (2009).Richter M, Rosselló-Móra R. Shifting the genomic gold standard for the prokaryotic species definition. Proceedings of the National Academy of Sciences of the United States of America. 2009;106(45):19126–19131. doi: 10.1073/pnas.0906412106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Riker et al. (1930).Riker A, Banfield W, Wright W, Keitt G, Sagen HE. Studies on infectious hairy root of nursery apple trees. Journal of Agricultural Research. 1930;41(7):507–540. [Google Scholar]
- Sawada et al. (1993).Sawada H, Ieki H, Oyaizu H, Matsumoto S. Proposal for rejection of Agrobacterium tumefaciens and revised descriptions for the genus Agrobacterium and for Agrobacterium radiobacter and Agrobacterium rhizogenes. International Journal of Systematic Bacteriology. 1993;43(4):694–702. doi: 10.1099/00207713-43-4-694. [DOI] [PubMed] [Google Scholar]
- Schmid et al. (2018a).Schmid M, Frei D, Patrignani A, Schlapbach R, Frey JE, Remus-Emsermann MNP, Ahrens CH. Pushing the limits of de novo genome assembly for complex prokaryotic genomes harboring very long, near identical repeats. Nucleic Acids Research. 2018a;46:8953–8965. doi: 10.1093/nar/gky726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schmid et al. (2018b).Schmid M, Muri J, Melidis D, Varadarajan AR, Somerville V, Wicki A, Moser A, Bourqui M, Wenzel C, Eugster-Meier E, Frey JE, Irmler S, Ahrens CH. Comparative genomics of completely sequenced Lactobacillus helveticus genomes provides insights into strain-specific genes and resolves metagenomics data down to the strain level. Frontiers in Microbiology. 2018b;9:63. doi: 10.3389/fmicb.2018.00063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seemann (2014).Seemann T. Prokka: rapid prokaryotic genome annotation. Bioinformatics. 2014;30(14):2068–2069. doi: 10.1093/bioinformatics/btu153. [DOI] [PubMed] [Google Scholar]
- Segata et al. (2013).Segata N, Börnigen D, Morgan XC, Huttenhower C. PhyloPhlAn is a new method for improved phylogenetic and taxonomic placement of microbes. Nature Communications. 2013;4(1):2304. doi: 10.1038/ncomms3304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shams et al. (2013).Shams M, Vial L, Chapulliot D, Nesme X, Lavire C. Rapid and accurate species and genomic species identification and exhaustive population diversity assessment of Agrobacterium spp. using recA-based PCR. Systematic and Applied Microbiology. 2013;36(5):351–358. doi: 10.1016/j.syapm.2013.03.002. [DOI] [PubMed] [Google Scholar]
- Shao et al. (2018).Shao S, Zhang X, Van Heusden GPH, Hooykaas PJ. Complete sequence of the tumor-inducing plasmid pTiChry5 from the hypervirulent Agrobacterium tumefaciens strain Chry5. Plasmid. 2018;96–97:1–6. doi: 10.1016/j.plasmid.2018.02.001. [DOI] [PubMed] [Google Scholar]
- Smith & Townsend (1907).Smith EF, Townsend CO. A plant-tumor of bacterial origin. Science. 1907;25(643):671–673. doi: 10.1126/science.25.643.671. [DOI] [PubMed] [Google Scholar]
- Sokolov (2000).Sokolov EP. An improved method for DNA isolation from mucopolysaccharide-rich molluscan tissues. Journal of Molluscan Studies. 2000;66(4):573–575. doi: 10.1093/mollus/66.4.573. [DOI] [Google Scholar]
- Stackebrandt et al. (2002).Stackebrandt E, Frederiksen W, Garrity GM, Grimont PA, Kämpfer P, Maiden MC, Nesme X, Rosselló-Mora R, Swings J, Trüper HG. Report of the ad hoc committee for the re-evaluation of the species definition in bacteriology. International Journal of Systematic and Evolutionary Microbiology. 2002;52(3):1043–1047. doi: 10.1099/00207713-52-3-1043. [DOI] [PubMed] [Google Scholar]
- Starr & Weiss (1943).Starr M, Weiss J. Growth of phytopathogenic bacteria in a synthetic asparagin medium. Phytopathology. 1943;33:314–318. [Google Scholar]
- Süle (1978).Süle S. Biotypes of Agrobacterium tumefaciens in Hungary. Journal of Applied Bacteriology. 1978;44(2):207–213. doi: 10.1111/j.1365-2672.1978.tb00792.x. [DOI] [Google Scholar]
- Tan et al. (2015).Tan MH, Gan HM, Schultz MB, Austin CM. MitoPhAST, a new automated mitogenomic phylogeny tool in the post-genomic era with a case study of 89 decapod mitogenomes including eight new freshwater crayfish mitogenomes. Molecular Phylogenetics and Evolution. 2015;85:180–188. doi: 10.1016/j.ympev.2015.02.009. [DOI] [PubMed] [Google Scholar]
- Tan et al. (2013).Tan JL, Khang TF, Ngeow YF, Choo SW. A phylogenomic approach to bacterial subspecies classification: proof of concept in Mycobacterium abscessus. BMC Genomics. 2013;14(1):879. doi: 10.1186/1471-2164-14-879. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thomashow et al. (1980).Thomashow M, Panagopoulos C, Gordon M, Nester E. Host range of Agrobacterium tumefaciens is determined by the Ti plasmid. Nature. 1980;283(5749):794–796. doi: 10.1038/283794a0. [DOI] [Google Scholar]
- Tindall (2014).Tindall B. Agrobacterium radiobacter (Beijerinck and van Delden 1902) Conn 1942 has priority over Agrobacterium tumefaciens (Smith and Townsend 1907) Conn 1942 when the two are treated as members of the same species based on the principle of priority and Rule 23a, Note 1 as applied to the corresponding specific epithets. Opinion 94: judicial commission of the international committee on systematics of prokaryotes. International Journal of Systematic and Evolutionary Microbiology. 2014;64(Pt 10):3590–3592. doi: 10.1099/ijs.0.069203-0. [DOI] [PubMed] [Google Scholar]
- Tran, Savka & Gan (2017).Tran PN, Savka MA, Gan HM. In-silico taxonomic classification of 373 genomes reveals species misidentification and new genospecies within the genus Pseudomonas. Frontiers in Microbiology. 2017;8:1296. doi: 10.3389/fmicb.2017.01296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Utturkar et al. (2017).Utturkar SM, Klingeman DM, Hurt RA, Brown SD. A case study into microbial genome assembly gap sequences and finishing strategies. Frontiers in Microbiology. 2017;8:1272. doi: 10.3389/fmicb.2017.01272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Velázquez et al. (2010).Velázquez E, Palomo JL, Rivas R, Guerra H, Peix A, Trujillo ME, García-Benavides P, Mateos PF, Wabiko H, Martínez-Molina E. Analysis of core genes supports the reclassification of strains Agrobacterium radiobacter K84 and Agrobacterium tumefaciens AKE10 into the species Rhizobium rhizogenes. Systematic and Applied Microbiology. 2010;33(5):247–251. doi: 10.1016/j.syapm.2010.04.004. [DOI] [PubMed] [Google Scholar]
- Vicedo et al. (1996).Vicedo B, López MJ, Asíns MJ, López MM. Spontaneous transfer of the Ti plasmid of Agrobacterium tumefaciens and the nopaline catabolism plasmid of A. radiobacter strain K84 in crown gall tissue. Phytopathology. 1996;86(5):528–534. doi: 10.1094/phyto-86-528. [DOI] [Google Scholar]
- Vinuesa, Ochoa-Sánchez & Contreras-Moreira (2018).Vinuesa P, Ochoa-Sánchez LE, Contreras-Moreira B. GET_PHYLOMARKERS, a software package to select optimal orthologous clusters for phylogenomics and inferring pan-genome phylogenies, used for a critical geno-taxonomic revision of the genus Stenotrophomonas. Frontiers in Microbiology. 2018;9:771. doi: 10.3389/fmicb.2018.00771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Waterhouse et al. (2017).Waterhouse RM, Seppey M, Simão FA, Manni M, Ioannidis P, Klioutchnikov G, Kriventseva EV, Zdobnov EM. BUSCO applications from quality assessments to gene prediction and phylogenomics. Molecular Biology and Evolution. 2017;35(3):543–548. doi: 10.1093/molbev/msx319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Whatley et al. (1976).Whatley M, Bodwin J, Lippincott B, Lippincott J. Role of Agrobacterium cell envelope lipopolysaccharide in infection site attachment. Infection and Immunity. 1976;13:1080–1083. doi: 10.1128/iai.13.4.1080-1083.1976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Whatley & Spiess (1977).Whatley MH, Spiess LD. Role of bacterial lipopolysaccharide in attachment of Agrobacterium to moss. Plant Physiology. 1977;60:765–766. doi: 10.1104/pp.60.5.765. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wibberg et al. (2011).Wibberg D, Blom J, Jaenicke S, Kollin F, Rupp O, Scharf B, Schneiker-Bekel S, Sczcepanowski R, Goesmann A, Setubal JC, Schmitt R, Pühler A, Schlüter A. Complete genome sequencing of Agrobacterium sp. H13-3, the former Rhizobium lupini H13-3, reveals a tripartite genome consisting of a circular and a linear chromosome and an accessory plasmid but lacking a tumor-inducing Ti-plasmid. Journal of Biotechnology. 2011;155(1):50–62. doi: 10.1016/j.jbiotec.2011.01.010. [DOI] [PubMed] [Google Scholar]
- Wong et al. (2014).Wong YM, Juan JC, Gan HM, Austin CM. Draft genome sequence of Clostridium perfringens strain JJC, a highly efficient hydrogen producer isolated from landfill leachate sludge. Genome Announcements. 2014;2(2):e00064-14. doi: 10.1128/genomeA.00064-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wood et al. (2001).Wood DW, Setubal JC, Kaul R, Monks DE, Kitajima JP, Okura VK, Zhou Y, Chen L, Wood GE, Almeida NF. The genome of the natural genetic engineer Agrobacterium tumefaciens C58. Science. 2001;294(5550):2317–2323. doi: 10.1126/science.1066804. [DOI] [PubMed] [Google Scholar]
- Young (2008).Young JM. Agrobacterium: From Biology to Biotechnology. New York: Springer; 2008. Agrobacterium—taxonomy of plant-pathogenic Rhizobium species; pp. 183–220. [Google Scholar]
- Young et al. (2001).Young J, Kuykendall L, Martinez-Romero E, Kerr A, Sawada H. A revision of Rhizobium Frank 1889, with an emended description of the genus, and the inclusion of all species of Agrobacterium Conn 1942 and Allorhizobium undicola de Lajudie et al. 1998 as new combinations: Rhizobium radiobacter, R. rhizogenes, R. rubi, R. undicola and R. vitis. International Journal of Systematic and Evolutionary Microbiology. 2001;51(1):89–103. doi: 10.1099/00207713-51-1-89. [DOI] [PubMed] [Google Scholar]
- Young et al. (2003).Young J, Kuykendall L, Martinez-Romero E, Kerr A, Sawada H. Classification and nomenclature of Agrobacterium and Rhizobium–a reply to Farrand et al. (2003) International Journal of Systematic and Evolutionary Microbiology. 2003;53(5):1689–1695. doi: 10.1099/ijs.0.02762-0. [DOI] [PubMed] [Google Scholar]
- Young, Pennycook & Watson (2006).Young JM, Pennycook SR, Watson DR. Proposal that Agrobacterium radiobacter has priority over Agrobacterium tumefaciens. Request for an opinion. International Journal of Systematic and Evolutionary Microbiology. 2006;56(2):491–493. doi: 10.1099/ijs.0.64030-0. [DOI] [PubMed] [Google Scholar]
- Zhang et al. (2014).Zhang L, Li X, Zhang F, Wang G. Genomic analysis of Agrobacterium radiobacter DSM 30147T and emended description of A. radiobacter (Beijerinck and van Delden 1902) Conn 1942 (Approved Lists 1980) emend. Sawada et al. 1993. Standards in Genomic Sciences. 2014;9(3):574–584. doi: 10.4056/sigs.4688352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang, Wang & Zhang (2002).Zhang H-B, Wang L-H, Zhang L-H. Genetic control of quorum-sensing signal turnover in Agrobacterium tumefaciens. Proceedings of the National Academy of Sciences of the United States of America. 2002;99(7):4638–4643. doi: 10.1073/pnas.022056699. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
The tree was rooted with members from the species Rhizobium rhizogenes (labeled as Agrobacterium rhizogenes) as the outgroup. Blue and red-colored clades belong to Agrobacterium genomospecies 4 and 7, respectively. Node labels indicate local SH-like support values. Branch lengths indicate the number of substitutions per site.
Data Availability Statement
The following information was supplied regarding data availability:
LMVK00000000.1, ASM154131v1: https://www.ncbi.nlm.nih.gov/assembly/GCF_001541315.1;
LMVJ00000000.1, ASM154130v1: https://www.ncbi.nlm.nih.gov/assembly/GCF_001541305.1;
Code and data are available at Han Ming Gan. (2018). Dataset for “Improved genome of Agrobacterium radiobacter type strain provides new taxonomic insight into Agrobacterium genomospecies 4” [Data set]. Zenodo. DOI 10.5281/zenodo.1489356.




