We present the complete genome sequences of two Xanthomonas euroxanthea strains isolated from buds of a walnut tree. The whole-genome sequences of strains CPBF 367 and CPBF 426 consist of two circular chromosomes of 4,923,218 bp and 4,883,254 bp and two putative plasmids of 45,241 bp and 17,394 bp, respectively. These data may contribute to the understanding of Xanthomonas species-specific adaptations to walnut.
ABSTRACT
We present the complete genome sequences of two Xanthomonas euroxanthea strains isolated from buds of a walnut tree. The whole-genome sequences of strains CPBF 367 and CPBF 426 consist of two circular chromosomes of 4,923,218 bp and 4,883,254 bp and two putative plasmids of 45,241 bp and 17,394 bp, respectively. These data may contribute to the understanding of Xanthomonas species-specific adaptations to walnut.
ANNOUNCEMENT
Xanthomonas is a genetically diverse genus of bacteria comprising etiological agents of several plant diseases affecting crops worldwide and causing large yield losses and negative economic impacts (1–5). Xanthomonas arboricola is a species known to cause pathologies in several fruit trees, including walnut trees (Juglans regia L.). The occurrence of multiple Xanthomonas lineages within the same walnut host has revealed a consortium of walnut-associated Xanthomonas bacteria distinct from Xanthomonas arboricola pv. juglandis (5). More recently, based on phenotypic and genomic analyses, such as average nucleotide identity (ANI) and digital DNA-DNA hybridization (dDDH), it has been proposed that some of these bacteria are members of a new species designated Xanthomonas euroxanthea, which includes both pathogenic and nonpathogenic strains on walnut (6). Here, we sequenced the whole genome of two strains belonging to this new Xanthomonas species, CPBF 367 (LMG 31036 = CCOS 1890) and CPBF 426 (LMG 31038 = CCOS1892). These genomes should contribute to unveiling the ecology, evolution, and virulence of strains belonging to this new Xanthomonas species and disclose niche-specific adaptations.
Both strains were isolated in April 2016 in Loures, Portugal, from asymptomatic dormant buds of a walnut tree known to develop symptoms of walnut bacterial blight during the growing season. Characteristic mucoid yellow colonies were grown on yeast dextrose carbonate (YDC) agar medium, and pure colonies were further cultured in liquid medium from which an aliquot was cryopreserved at −80° in YDC with 30% glycerol as previously detailed (7).
DNA extraction for sequencing was carried out from cryopreserved bacteria recovered on bacterial culture medium M2 (yeast extract, 2 g liter−1; Bacto peptone, 5 g liter−1; NaCl, 5 g liter−1; KH2PO4, 0.45 g liter−1; Na2HPO4 12H2O, 2.39 g liter−1) at 28°C and 100 rpm for 48 h. DNA was extracted using the E.Z.N.A. bacterial DNA purification kit (Omega Bio-tek, Norcross, GA) and sequenced with Illumina and Oxford Nanopore Technologies (ONT) MinION platforms. Illumina sequencing was outsourced to GATC Biotech, AG (Konstanz, Germany), using an Illumina HiSeq instrument with a standard 2 × 150-bp paired-end library protocol. This resulted in 8,413,466 and 6,494,807 reads, with 506× and 394× coverage for CPBF 367 and CPBF 426, respectively. For Nanopore sequencing, libraries were prepared with the SQK-LSK109 kit and multiplexed using the EXP-NBD104 barcoding kit. Sequencing was performed on a MinION sequencer using an R9.4.1 flow cell. Reads were base called and demultiplexed using Guppy v3.4.1 (high accuracy base-calling mode) and produced for CPBF 367 (18,857 reads with a mean length of 6,386 nucleotides [nt] [±8,299] and a maximum read length of 92,631 nt) and CBPF 426 (19,997 reads with a mean length of 5,895 nt [±6,300] and a maximum read length of 74,948 nt). The reads were assembled de novo following a hybrid Nanopore-Illumina approach using Unicycler v0.4.8 (8) and annotated with PGAP v2020-03-30.build4489 (9). Assembly metrics were calculated using QUAST v5.0.2 (10). The assembly quality and completeness were assessed using BUSCO v4.0.6 (parameters: lineage xanthomonadales_odb10) (11), which indicated that the CPBF 367 and CPBF 426 genomes are both >99.8% complete. Default parameters were used for all software unless otherwise noted.
The assembly of CPBF 367 has a total length of 4,968 Mb, comprising 2 circular contigs of 4,923,218 and 45,241 bp with a G+C content of 65.81%. The annotation indicates 4,014 coding genes in a total of 4,174 genes from 4,094 coding DNA sequences (CDSs), with 2 complete sets of rRNAs (5S, 16S, and 23S rRNAs), 56 tRNAs, 18 noncoding RNAs (ncRNAs), and 80 pseudogenes. For CPBF 426, the assembly has a total length of 4,900 Mb, comprising 2 circular contigs of 4,883,254 and 17,394 bp with a G+C content of 65.85%. The annotation revealed 4,010 coding genes in 4,157 genes from 4,080 CDSs, with 2 complete sets of rRNAs, 53 tRNAs, 18 ncRNAs, and 70 pseudogenes. Table 1 contains a summary of the two complete genome sequences.
TABLE 1.
Summary of sequencing data and genome statistics
Characteristic | Data for strain: |
|
---|---|---|
CPBF 367 | CPBF 426 | |
ENA/GenBank accession number | GCA_903989455 | GCA_903989465 |
No. of ONT reads | 18,857 | 19,997 |
ONT read length (nt) (mean ± SD) | 6,386 ± 8,299 | 5,895 ± 6,300 |
No. of Illumina reads | 8,413,466 | 6,494,807 |
Illumina read length (nt) | 2 × 150 | 2 × 150 |
Mean coverage (×) | 532 | 415 |
G+C content (%) | 65.81 | 65.85 |
N50 (bp) | 4,923,218 | 4,883,254 |
No. of sequences | 2 | 2 |
Genome size (bp) | 4,968,459 | 4,900,648 |
Chromosome size (bp) | 4,923,218 | 4,883,254 |
Plasmid size (bp) | 45,241 | 17,394 |
No. of genes (total) | 4,174 | 4,157 |
No. of CDSs (total) | 4,094 | 4,080 |
No. of genes (coding) | 4,014 | 4,010 |
No. of CDSs (with protein) | 4,014 | 4,010 |
No. of genes (RNA) | 80 | 77 |
No. of rRNAs (5S, 16S, 23S) | 2, 2, 2 | 2, 2, 2 |
No. of complete rRNAs (5S, 16S, 23S) | 2, 2, 2 | 2, 2, 2 |
No. of tRNAs | 56 | 53 |
No. of ncRNAs | 18 | 18 |
No. of pseudogenes | 80 | 70 |
No. of CDSs (without protein) | 80 | 70 |
OrthoANI v1.40 (12) indicates an average nucleotide identity value higher than 97%, assigning these strains to X. euroxanthea, together with a previously reported strain, CPBF 424 (13, 14). Interestingly, while X. euroxanthea CPBF 424 holds an apparently functional T3SS and was shown to be pathogenic in walnut plantlets, X. euroxanthea strains CPBF 367 and CPBF 426 were deficient for most T3SS operon genes, suggesting that these strains are nonpathogenic in walnut, which was confirmed in walnut pathogenicity assays for strain CPBF 367 (6). Altogether, the distinct genomic repertoires of these X. euroxanthea strains (CPBF 367 and CPBF 426), in comparison to those of X. arboricola pv. juglandis strains, will provide valuable genomic data for a better understanding of the pathogenicity and virulence of walnut-associated Xanthomonas.
Data availability.
The raw data and assembled/annotated sequences have been deposited in the European Nucleotide Archive (ENA). The study accession number is PRJEB39139. Illumina reads, MinION reads, and the assembled genome are accessible under the accession numbers ERX2780809, ERX4296808, and GCA_903989455, respectively, for CPBF 367 and ERX2780811, ERX4296809, and GCA_903989465, respectively, for CPBF 426. Supplementary data are accessible via the BioStudies database with the accession number S-BSST433.
ACKNOWLEDGMENTS
Funding for this work was provided by the EU Horizon 2020 Research and Innovation program (grant agreement 668981), European Structural & Investment Funds (ESIFs) through COMPETE 2020, National Funds through FCT (project EVOXANT-PTDC/BIA-EVF/3635/2014-POCI-01-0145-FEDER-016600), and the Operational Thematic Program for Competitiveness and Internationalization (POCI) under the PORTUGAL 2020 Partnership Agreement through the European Regional Development Fund (FEDER) project PORBIOTA-POCI-01-0145-FEDER-022127. Leonor Martins was supported by a Ph.D. fellowship from FCT (SFRH/BD/137079/2018).
REFERENCES
- 1.Lamichhane JR. 2014. Xanthomonas arboricola diseases of stone fruit, almond, and walnut trees: progress toward understanding and management. Plant Dis 98:1600–1610. doi: 10.1094/PDIS-08-14-0831-FE. [DOI] [PubMed] [Google Scholar]
- 2.Moragrega C, Özaktan H. 2010. Apical necrosis of Persian (English) walnut (Juglans regia): an update. J Plant Pathol 92:S1.67–S1.71. [Google Scholar]
- 3.Frutos D. 2010. Bacterial diseases of walnut and hazelnut and genetic resources. J Plant Pathol 92:S1.79–S1.85. [Google Scholar]
- 4.Hajri A, Meyer D, Delort F, Guillaumès J, Brin C, Manceau C. 2010. Identification of a genetic lineage within Xanthomonas arboricola pv. juglandis as the causal agent of vertical oozing canker of Persian (English) walnut in France. Plant Pathol 59:1014–1022. doi: 10.1111/j.1365-3059.2010.02362.x. [DOI] [Google Scholar]
- 5.Fernandes C, Albuquerque P, Cruz L, Tavares F. 2018. Genotyping and epidemiological metadata provides new insights into population structure of Xanthomonas isolated from walnut trees. bioRxiv doi: 10.1101/397703. [DOI]
- 6.Martins L, Fernandes C, Blom J, Dia NC, Pothier JF, Tavares F. 2020. Xanthomonas euroxanthea sp. nov., a new xanthomonad species including pathogenic and non-pathogenic strains of walnut. Int J Syst Evol Microbiol In press. doi: 10.1099/ijsem.0.004386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Fernandes C, Albuquerque P, Sousa R, Cruz L, Tavares F. 2017. Multiple DNA markers for identification of Xanthomonas arboricola pv. juglandis isolates and its direct detection in plant samples. Plant Dis 101:858–865. doi: 10.1094/PDIS-10-16-1481-RE. [DOI] [PubMed] [Google Scholar]
- 8.Wick RR, Judd LM, Gorrie CL, Holt KE. 2017. Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput Biol 13:e1005595. doi: 10.1371/journal.pcbi.1005595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- 12.Lee I, Kim YO, Park SC, Chun J. 2016. OrthoANI: an improved algorithm and software for calculating average nucleotide identity. Int J Syst Evol Microbiol 66:1100–1103. doi: 10.1099/ijsem.0.000760. [DOI] [PubMed] [Google Scholar]
- 13.Konstantinidis KT, Tiedje JM. 2005. Genomic insights that advance the species definition for prokaryotes. Proc Natl Acad Sci U S A 102:2567–2572. doi: 10.1073/pnas.0409727102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Fernandes C, Blom J, Pothier JF, Tavares F. 2018. High-quality draft genome sequence of Xanthomonas sp. strain CPBF 424, a walnut-pathogenic strain with atypical features. Microbiol Resour Announc 7:e00921-18. doi: 10.1128/MRA.00921-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The raw data and assembled/annotated sequences have been deposited in the European Nucleotide Archive (ENA). The study accession number is PRJEB39139. Illumina reads, MinION reads, and the assembled genome are accessible under the accession numbers ERX2780809, ERX4296808, and GCA_903989455, respectively, for CPBF 367 and ERX2780811, ERX4296809, and GCA_903989465, respectively, for CPBF 426. Supplementary data are accessible via the BioStudies database with the accession number S-BSST433.