Abstract
Ralstonia solanacearum is a causal agent of plant bacterial wilt with thousands of distinct strains in a heterogeneous species complex. Here we report the genome sequence of a phylotype IB strain, Y45, isolated from tobacco (Nicotiana tabacum) in China. Compared with the published genomes of eight strains which were isolated from other hosts and habitats, 794 specific genes and many rearrangements/inversion events were identified in the tobacco strain, demonstrating that this strain represents an important node within the R. solanacearum complex.
GENOME ANNOUNCEMENT
Ralstonia solanacearum is a phytopathogenic bacterium with an unusually broad host range and a wide geographical distribution (1). It causes a wilt disease with deadly effects in many economically important crops like tomato, potato, tobacco, and banana (1, 5). Understanding the characteristics of R. solanacearum strains is helpful to determine the way to prevent or reduce potential economic losses. So far, eight genomes of R. solanacearum strains from three hosts (tomato, potato, and geranium) with a variety of habitats have been sequenced (6, 8, 14, 16, 19). Owing to the significant genomic variations of R. solanacearum strains derived from different regions and hosts, a strain (Y45) isolated from tobacco in Yunnan Province, China, was sequenced, annotated, and compared to the other eight strains. It is the first R. solanacearum strain isolated in East Asia.
Contigs and scaffolds were assembled and aligned using SOAPdenovo 1.05 (11) and SOAPaligner (10). Protein-coding and other genes were predicted by Glimmer3.0, rRNAmmer, tRNAscan, and Rfam (4, 7, 9, 17). Repetitive elements were identified by RepeatMasker (http://www.repeatmasker.org) and Tandem Repeat Finder (2). Phylogenetic and genomic analyses were carried out using MEGA4 based on egl and hrpB sequences (13, 18) and Mauve (15).
A total of 1,067.5 Mb clean data from two different insert size libraries (500 bp and 2 kb) were generated for strain Y45 using the Illumina HiSeq2000 paired-end strategy and assembled into 46 scaffold sequences (90% of the assembly falls into 10 scaffolds lager than 142 kb) with a length of approximately 5.73 Mb, representing 98.5% of the genome, whose size was estimated to be 5.82 Mb using the 17-mer approach (GSP; http://gsizepred.sourceforge.net/) (3, 12). Compared with the well-documented genome of strain GMI1000 (16), the 46 segments could be divided into three parts, including a chromosome segment (3,726,478 bp), a megaplasmid segment (1,985,910 bp), and sequences which were not assigned to either of the two circular replicons, suggesting their uniqueness to this strain (17,900 bp). In all, 5,496 coding sequences, 53 tRNAs, and 5 rRNAs were predicted in the genome, respectively. Moreover, 14,472-bp transposable elements and 43,648-bp tandem repeats were predicted, comprising 0.25% and 0.76% of the genome, respectively. Of the 5,496 genes, 4,864 could produce a hit in at least one of the public protein databases (BLASTP e value, ≤1e-5); 632 genes, however, did not match any known proteins in the current public databases. In a comparative genome analysis with the eight sequenced R. solanacearum strains, 794 specific genes were identified within the genome of strain Y45, most of which are of unknown function or encode hypothetical proteins. Notably, genomic alignments revealed many rearrangements and inversions between strain Y45 (phylotype IB, according to our phylogenetic analysis) and the other strains, which occurred since Y45 separated from the other strains. Thus, the genome of strain Y45 represents a unique phylogenetic node and provides invaluable information for R. solanacearum genomic research.
Nucleotide sequence accession numbers.
This Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank under accession no. AFWL00000000. The version described in this paper is the first version and is available under accession no. AFWL01000000.
Acknowledgments
We thank Weiliang Chen and Zhengyi Wang (Zhejiang University) for their help in DNA collection.
This project was supported by the China National Tobacco Corporation Yunnan Branch (2010YN01 and 2011YN04).
REFERENCES
- 1. Álvarez B., Biosca E. G., López M. M. 2010. On the life of Ralstonia solanacearum, a destructive bacterial plant pathogen, p. 267–279 In Mendez-Vilas A. (ed.), Current research, technology and education topics in applied microbiology and microbial biotechnology, vol. 1 Formatex, Badajoz, Spain [Google Scholar]
- 2. Benson G. 1999. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27:573–580 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Chor B., Horn D., Goldman N., Levy Y., Massingham T. 2009. Genomic DNA k-mer spectra: models and modalities. Genome Biol. 10:R108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Delcher A. L., Bratke K. A., Powers E. C., Salzberg S. L. 2007. Identifying bacterial genes and endosymbiont DNA with Glimmer. Bioinformatics 23:673–679 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Elphinstone J. G. 2005. The current bacterial wilt situation: a global overview, p. 9–28 In Allen C., Prior P., Haywar A. C. (ed.), Bacterial wilt disease and the Ralstonia solanacearum species complex. APS Press, St. Paul, MN [Google Scholar]
- 6. Gabriel D. W., et al. 2006. Identification of open reading frames unique to a select agent: Ralstonia solanacearum race 3 biovar 2. Mol. Plant Microbe Interact. 19:69–79 [DOI] [PubMed] [Google Scholar]
- 7. Griffiths-Jones S., et al. 2005. Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res. 33:D121–D124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Guidot A., et al. 2009. Specific genes from the potato brown rot strains of Ralstonia solanacearum and their potential use for strain detection. Phytopathology 99:1105–1112 [DOI] [PubMed] [Google Scholar]
- 9. Lagesen K., et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100–3108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Li R., et al. 2009. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 25:1966–1967 [DOI] [PubMed] [Google Scholar]
- 11. Li R., et al. 2010. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20:265–272 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Marçais G., Kingsford C. 2011. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics 27:764–770 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Prior P., Fegan M. 2005. Recent development in the phylogeny and classification of Ralstonia solanacearum. Acta Hortic. 695:127–136 [Google Scholar]
- 14. Remenant B., et al. 2010. Genomes of three tomato pathogens within the Ralstonia solanacearum species complex reveal significant evolutionary divergence. BMC Genomics 11:379. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Rissman A. I., et al. 2009. Reordering contigs of draft genomes using the Mauve aligner. Bioinformatics 25:2071–2073 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Salanoubat M., et al. 2002. Genome sequence of the plant pathogen Ralstonia solanacearum. Nature 415:497–502 [DOI] [PubMed] [Google Scholar]
- 17. Schattner P., Brooks A. N., Lowe T. M. 2005. The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 33:W686–W689 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Tamura K., Dudley J., Nei M., Kumar S. 2007. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol. Biol. Evol. 24:1596–1599 [DOI] [PubMed] [Google Scholar]
- 19. Xu J., et al. 2011. Complete genome sequence of the plant pathogen Ralstonia solanacearum strain Po82. J. Bacteriol. 193:4261–4262 [DOI] [PMC free article] [PubMed] [Google Scholar]
