Pantoea agglomerans strain C1 has plant growth-promoting (PGP) traits and exhibits antimicrobial activity. The genome comprises 4.8 Mb, 4,696 protein-coding sequences, and a G+C content of 55.2%.
ABSTRACT
Pantoea agglomerans strain C1 has plant growth-promoting (PGP) traits and exhibits antimicrobial activity. The genome comprises 4.8 Mb, 4,696 protein-coding sequences, and a G+C content of 55.2%.
ANNOUNCEMENT
The genus Pantoea (family Enterobacteriaceae) comprises bacteria that can be isolated from a wide variety of environments and hosts, including soil, water, and plants (1–3). Distinctive strains of Pantoea are recognized as plant growth-promoting rhizobacteria (PGPR) and can potentially be used to develop new biofertilizers or biological control agents for sustainable agriculture (4).
P. agglomerans strain C1 was isolated from the phyllosphere of Lactuca sativa L. (lettuce) plants treated with vegetal-derived protein hydrolysates (5). It has some interesting plant growth-promoting features, such as the ability to produce indole-3-acetic acid (a plant hormone of the auxin class), and showed antagonistic activity against some phytopathogenic fungi and bacteria (6).
The genomic DNA was extracted from cells grown overnight in Luria broth by using a PureLink genomic DNA minikit (Thermo Fisher Scientific, Italy), as reported elsewhere (6). The genomic library was prepared using a Nextera XT DNA library prep kit and Nextera XT index kit (Illumina, USA). The library fragment size distribution was checked using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA), and DNA was quantified using a Qubit 2.0 fluorometer and the Qubit double-stranded DNA (dsDNA) high-sensitivity (HS) assay kit (Thermo Fisher Scientific). The whole-genome sequencing of strain C1 was performed at Bio-Fab Research S.r.l. (Rome, Italy) using the MiSeq platform (paired-end 2 × 300 bp) with the MiSeq reagent kit v3 (Illumina). In total, 7,525,566 reads were obtained, achieving about 150-fold genomic coverage. Read quality was checked using FastQC 0.11.5 (7), whereas adapter and low-quality sequences were trimmed using Trimmomatic 0.39 (8). Default parameters were used for all software unless otherwise noted. The resulting reads were assembled into contigs using the A5-MiSeq assembly pipeline (version 20160826) (9), and the quality of the assembly was assessed using QUAST version 5.0.2 (10). The Mauve tool (11) was used to align and order the assembled contigs using Pantoea agglomerans strain C410P1 (GenBank accession number CP016889) as a reference.
The genome assembly produced 22 contigs, with an N50 value of 645,999 bp, and the largest contig was 2,247,784 bp. The genome of P. agglomerans C1 was 4,846,925 bp, with a G+C content of 55.2%, which falls within the genome size (4.5 to 6.3 Mb) and G+C content (52% to 55%) of other strains belonging to the same genus. The RASTtk (12, 13) and PATRIC (14) programs were used for genome annotation, with default parameters and using the Enterobacteriales data set as a reference. No plasmid was detected in the genome by using Plasmid Finder (15). P. agglomerans C1 has 4,696 protein-encoding genes (PEGs), as predicted using GLIMMER 3 (16), of which 3,784 (81%) have a predicted function assigned using the GenDB annotation pipeline (17), and 1,383 (30%) and 975 (20%) PEGs were assigned to an Enzyme Commission (EC) and Gene Ontology (GO) functional category, respectively. Furthermore, a total of 70 tRNA-coding genes and 12 rRNA genes were predicted in the chromosome sequence. Genes potentially involved in the promotion of plant growth were also carefully mined. This analysis allowed the identification of genes for phosphate solubilization and the production of phytohormones, pest/disease-suppressing γ-aminobutyric acid (GABA), siderophores, organic acids (e.g., gluconic acid), and volatile compounds that influence plant growth and increase plant resistance against pathogens.
Manual analysis revealed that the C1 genome carries cluster operons encoding putative arsenic (As) and copper (Cu) resistance-related proteins, as well as genes for cadmium (Cd) tolerance and oxidative stress response. Furthermore, two distinct large intact prophage regions, named prophage_1 and prophage_2, were examined using the PHAge Search Tool (PHAST) (18). The prophage_1 region has a size of 39.70 kb, with a G+C content of 52.51%, and 62.22% similarity to PHAGE_Erwini_ENT90_NC_019932, encompassing 41 out of 45 open reading frames (ORFs) present in the region that matches the phage protein database. The prophage_2 region was 30.1 kb, with a G+C content of 52.50%, and 55.81% similarity to PHAGE_Salmon_RE_2010_NC_01948, with 32 out of 45 ORFs that match the phage protein database.
Data availability.
The genome sequence of P. agglomerans C1 is available under NCBI BioProject accession number PRJNA523737, with Sequence Read Archive (SRA) accession number SRP212904.
ACKNOWLEDGMENT
This work was partially supported by the University of Tuscia (Viterbo, Italy).
REFERENCES
- 1.Grimont PAD, Grimont F. 2005. Genus Pantoea, p 713–720. In Brenner DJ, Krieg NR, Staley JT (ed), Bergey’s manual of systematic bacteriology. Volume 2: the Proteobacteria, part B, the Gammaproteobacteria , 2nd ed Springer, New York, NY. [Google Scholar]
- 2.Walterson AM, Stavrinides J. 2015. Pantoea: insights into a highly versatile and diverse genus within the Enterobacteriaceae. FEMS Microbiol Rev 39:968–984. doi: 10.1093/femsre/fuv027. [DOI] [PubMed] [Google Scholar]
- 3.Dutkiewicz J, Mackiewicz B, Lemieszek MK, Golec M, Milanowski J. 2016. Pantoea agglomerans: a mysterious bacterium of evil and good. Part IV. Beneficial effects. Ann Agric Environ Med 23:206–222. doi: 10.5604/12321966.1203879. [DOI] [PubMed] [Google Scholar]
- 4.Ferreira CMH, Soares H, Soares EV. 2019. Promising bacterial genera for agricultural practices: an insight on plant growth-promoting properties and microbial safety aspects. Sci Total Environ 682:779–799. doi: 10.1016/j.scitotenv.2019.04.225. [DOI] [PubMed] [Google Scholar]
- 5.Luziatelli F, Ficca AG, Colla G, Švecová E, Ruzzi M. 2016. Effects of a protein hydrolysate-based biostimulant and two micronutrient-based fertilizers on plant growth and epiphytic bacterial population of lettuce. Acta Hortic 1148:43–48. doi: 10.17660/ActaHortic.2016.1148.5. [DOI] [Google Scholar]
- 6.Luziatelli F, Ficca AG, Colla G, Švecová EB, Ruzzi M. 2019. Foliar application of vegetal-derived bioactive compounds stimulates the growth of beneficial bacteria and enhances microbiome biodiversity in lettuce. Front Plant Sci 10:60. doi: 10.3389/fpls.2019.00060. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wingett SW, Andrews S. 2018. FastQ Screen: a tool for multi-genome mapping and quality control. F1000Res 7:1338. doi: 10.12688/f1000research.15931.2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Tritt A, Eisen JA, Facciotti MT, Darling AE. 2012. An integrated pipeline for de novo assembly of microbial genomes. PLoS One 7:e42304. doi: 10.1371/journal.pone.0042304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gurevich A, Saveliev V, Vyahhi N, Tesler G. 2013. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29:1072–1075. doi: 10.1093/bioinformatics/btt086. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Darling AC, Mau B, Blattner FR, Perna NT. 2004. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res 14:1394–1403. doi: 10.1101/gr.2289704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Overbeek R, Olson R, Pusch GD, Olsen GJ, Davis JJ, Disz T, Edwards RA, Gerdes S, Parrello B, Shukla M, Vonstein V, Wattam AR, Xia F, Stevens R. 2014. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res 42:D206–D214. doi: 10.1093/nar/gkt1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Brettin T, Davis JJ, Disz T, Edwards RA, Gerdes S, Olsen GJ, Olson R, Overbeek R, Parrello B, Pusch GD, Shukla M, Thomason JA III, Stevens R, Vonstein V, Wattam AR, Xia F. 2015. RASTtk: a modular and extensible implementation of the RAST algorithm for building custom annotation pipelines and annotating batches of genomes. Sci Rep 5:8365. doi: 10.1038/srep08365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Wattam AR, Davis JJ, Assaf R, Boisvert S, Brettin T, Bun C, Conrad N, Dietrich EM, Disz T, Gabbard JL, Gerdes S, Henry CS, Kenyon RW, Machi D, Mao C, Nordberg EK, Olsen GJ, Murphy-Olson DE, Olson R, Overbeek R, Parrello B, Pusch GD, Shukla M, Vonstein V, Warren A, Xia F, Yoo H, Stevens RL. 2017. Improvements to PATRIC, the all-bacterial Bioinformatics Database and Analysis Resource Center. Nucleic Acids Res 45:D535–D542. doi: 10.1093/nar/gkw1017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Carattoli A, Zankari E, Garcia-Fernandez A, Voldby Larsen M, Lund O, Villa L, Aarestrup FM, Hasman H. 2014. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrob Agents Chemother 58:3895–3903. doi: 10.1128/AAC.02412-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Salzberg SL, Delcher AL, Kasif S, White O. 1998. Microbial gene identification using interpolated Markov models. Nucleic Acids Res 26:544–548. doi: 10.1093/nar/26.2.544. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Meyer F, Goesmann A, McHardy AC, Bartels D, Bekel T, Clausen J, Kalinowski J, Linke B, Rupp O, Giegerich R, Pühler A. 2003. GenDB—an open source genome annotation system for prokaryote genomes. Nucleic Acids Res 31:2187–2195. doi: 10.1093/nar/gkg312. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Zhou Y, Liang Y, Lynch KH, Dennis JJ, Wishart DS. 2011. PHAST: a fast phage search tool. Nucleic Acids Res 39:W347–W352. doi: 10.1093/nar/gkr485. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequence of P. agglomerans C1 is available under NCBI BioProject accession number PRJNA523737, with Sequence Read Archive (SRA) accession number SRP212904.