Abstract
Pantoea ananatis strain MHSD5 is a bacterial endophyte isolated from the surface sterilized leaves of Pellaea calomelanos, which is a medicinal plant obtained in Limpopo province of South Africa. We present here the draft genome sequence and annotation of P. ananatis strain MHSD5. The genome assembly was 4.6 Mb in size with an N50 of 550,557 bp. A total of 4,350 putative protein coding sequence genes were predicted with PGAAP. This is the first draft genome of a bacterial endophyte symbiotically associated with P. calomelanos. This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PUEK00000000. The version described in this paper is version PUEK01000000.
Specifications Table
Subject area | Biology |
More specific subject area | Plant-microbe interaction, Bacteriology, Genomics, Bioinformatics |
Type of data | Table, figure |
How data was acquired | Genome sequencing: Illumina MiSeq at Inqaba Biotechnological Company, Pretoria, South Africa, |
De novo sequence assembly: Web-based Galaxy Unicycler version 0.4.1.1, Bioinformatics approaches: NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP), Rapid Annotation using Subsystem Technology server (RAST). | |
Data format | Analysed |
Experimental factors | Genomic sequencing, assembly and annotation |
Experimental features | The whole genome of Pantoea ananatis strain MHSD5 was sequenced on Illumina MiSeq sequencing platform. Read quality was assessed using Galaxy platform-FastQC version 0.69. De novo assembly of the reads were performed by Galaxy web platform-Unicycler version 0.4.1.1, and quality of assembly was assessed with Quast Genome assembly Quality version 4.6.3 (https://usegalaxy.org). |
Data source location | P. ananatis strain MHSD5 was isolated from surface sterilized leaves of P. calomelanos obtained from Botlokwa, Limpopo Province, South Africa (23°29׳34.8"S 29°42׳11.2"E). |
Data accessibility | Genome assembly,annotation and analysis of data are found in this article and the raw data together with NCBI PGAAP annotation were deposited at the NCBI repository: |
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA434382, | |
Bioproject ID: 434382, BioSample: SAMN08555277 | |
This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession PUEK00000000. (http://www.ncbi.nlm.nih.gov/nuccore/PUEK00000000) | |
The genome annotation performed at RAST server are also given in this article. |
Value of the data
-
•
The first draft genome of bacterial endophyte symbiotically associated with Pellaea calomelanos.
-
•
The whole genomic data provided information on genetic components of Pantoea ananatis strain MHSD5 involved in endophytic life style.
-
•
Pantoea genus comprise species associated with various hosts, thus the genome information will advance genome comparison of various Pantoea species and further provide insights into the biology and evolution of the genus.
1. Data
The genus Pantoea has diverse species, which have been isolated from several environments such as aquatic and terrestrial environments [1]. Pantoea genus currently has twenty described species which have associations with humans, plants, insects, and animals [1]. Association of Pantoea species with various hosts can be parasitic, mutualistic or commensal [1], [2]. The genus consists of yellow-pigmented, gram-negative, rod-shaped bacteria in the Enterobacteriaceae family. Although most species have been reported to have pathogenic associations with humans, animals and plants [3], [4], few have been reported to be symbiotically associated with plants [5]. As plant endophytes, Pantoea bacteria have been reported to promote plant growth via a variety of mechanisms and produce bioactive compounds with antibiotic activities [6], [7], [8], [9].
Since Pantoea species have associations with different hosts in different environments, there is need for understanding genetic factors which allow this group of bacteria to successfully colonize various hosts. In addition, the availability of various genomes of Pantoea genus will promote whole genome comparison within this group and further our understanding of genetic factors that contribute to Pantoea species thriving in different environments and thus delineating their biology and evolution.
We recently isolated Pantoea ananatis strain MHSD5 from surface sterilized leaves of Pellaea calomelanos, a medicinal plant utilized for treatment of chest colds, asthma, headaches, head colds as well as mouth and nasal ulcers [10]. Initial identification of the bacterium was confirmed by sequencing of its 16S rRNA gene, which was deposited in GenBank with accession number MF613651. This is the first study to isolate, identify and report on the draft whole genome sequence of bacterial endophyte associated with P. calomelanos.
DNA sequencing was performed with Illumina MiSeq platform which generated 250 Mb data of reads. The genome assembly of P. ananatis strain MHSD5 produced 39 contigs, with N50 of 550,557 base pairs (bp), the largest contig with 1,441,770 bp. The genome of P. ananatis strain MHSD5 was 4,650,216 bp, with GC% content of 54.16%, which falls within the genome sizes (4.5–6.3 Mb) and GC% content (52–55%) ranges of some of the sequenced Pantoea species. Genome annotation was performed with Prokaryotic Genome Automatic Annotation Pipeline (PGAAP) and Rapid Annotation using Subsystem Technology server (RAST) [11], [12]. P. ananatis strain MHSD5 has 4,437 genes, among the identified genes 4,350 are protein coding sequence genes (CDS) and 119 are pseudogenes. The genome also has 3 rRNAs with five operons (5S, 16S and 23S) and 70 tRNAs genes.
PGAAP and RAST annotation pipelines resulted in minor differences (Table 1). We registered the annotation results from PGAAP on GenBank. The RAST Subsystem Information (Fig. 1), shows the subsystem feature counts, with 92 genes involved in virulence, disease and defence; and carbohydrates having the most number of genes of 561 and 4 genes of secondary metabolism which are the lowest number of genes. Through the RAST sequence based comparison tool [13], we compared the assembled genomes of P. ananatis strain MHSD5 and Pantoea stewartii DC283 [14], using the latter as a reference genome as shown in Fig. 2(a) and (b). Although there were genome gaps on P. ananatis strain MHSD5 (Fig. 2(a)), it showed 70–99% similarity in protein sequences of both reverse and forward (bidirectional hits) to Pantoea stewartii DC283.
Table 1.
Outcome comparison of Pantoea ananatis strain MHSD5 genome annotation using PGAAP and RAST.
Genomic feature | aPGAAP | RAST |
---|---|---|
Total number of genes | 4437 | 4397 |
Protein coding genes | 4350 | 4324 |
Number of RNAs | 87 | 73 |
Contigs | 39 | 39 |
N50 | 550,557 bp | 550,557 bp |
GC% | 54.16% | 54.2% |
Only the PGAAP results were registered with GenBank.
Fig. 1.
The subsystem distribution of Pantoea ananatis strain MHSD5 generated from RAST annotation server.
Fig. 2.
(a) Pantoea ananatis strain MHSD5 genome compared to P. stewartii DC283, with the latter used as reference genome, (b) colour co-ordination similarity of the genome comparison in percentages. Bidirectional best hit refers to both forward and reverse hits.
Genomics is crucial in identifying genes that are important for the bacterial endophyte to successfully penetrate, colonize and have symbiotic association with its plant host. In accordance to the life style of endophytes, we identified genes involved in nitrogen fixation, production of plant hormone indole acetic acid (IAA), production of antibiotics, toxins as well as toxin-antitoxin genes. Similar genes have been previously reported in other endophytic bacteria such as Azoarcus sp. strain BH72 as well as Gluconacetobacter diazotrophicus Pal5 [15], [16]. Some of the identified toxin and antitoxin genes are similar to Enterobacter sp. 638, a plant growth promoting endophyte which was fully sequenced and annotated [17].
2. Experimental design, materials and methods
2.1. Genome extraction and sequencing
P. ananatis strain MHSD5 was isolated from surface sterilized leaves of P. calomelanos obtained from Botlokwa, Limpopo Province, South Africa (23°29׳34.8"S 29°42׳11.2"E). Whole genomic DNA of Pantoea ananatis strain MHSD5 was extracted from nutrient agar pure colonies using Quick-DNA™ ZR fungal/bacterial DNA MiniPrep kit (Zymo Research, catalogue #D6005) according to manufacturers’ protocol. The extracted DNA was cleaned with ZR fungal/bacterial DNA clean and concentrator-5 (catalogue #D4003). The whole genome sequence was performed at a commercial service provider Inqaba Biotechnological Company PTY (LTD.), Pretoria, South Africa. Briefly, genomic DNA sample was fragmented using an ultrasonication approach (Covaris). The DNA library was prepared using the NEBNext Ultra™ II DNA Library Prep Kit for Illumina (New England BioLabs Inc.) according to the manufacturers’ protocol. The resulting fragments were size selected, end repaired and Illumina specific adapter sequences were ligated to each fragment. Following quantification, the samples were individually indexed and a second size selection step was performed (AMPure XP Bead-based), and sequenced on Illumina MiSeq platform, using a MiSeq v3 (600 cycle) kit. 250 Mb of data (2 × 300 bp long paired end reads) was produced.
2.2. Genome quality assessment, de novo assembly and annotation
Quality assessment of raw reads was performed by FastQC version 0.69. The reads were de novo assembled using Unicycler version 0.4.1.1 and the assembly was assessed by Quast version 4.6.3. The assembly generated 39 contigs with an N50 of 550,557 bp and 53× coverage. All the pre-annotation analysis were performed on Galaxy web platform (https://usegalaxy.org) [18]. The NCBI Prokaryotic Genome Automatic Annotation Pipeline (PGAAP) was used to perform initial annotation, which was followed by Rapid Annotation using Subsystem Technology (RAST).
Acknowledgements
This research was financially supported by the University of Johannesburg, FRC/URC and the National Research Foundation of South Africa: Thuthuka Grant no. [TTK170405225920]. Siphiwe G. Mahlangu received the NRF Masters Scarce Skills Scholarship from the National Research Foundation Grant no. [SFH160628174791].
Footnotes
Transparency data associated with this article can be found in the online version at doi:10.1016/j.dib.2018.06.039.
Transparency document. Supplementary material
Supplementary material
References
- 1.Walterson A.M., Stavrinides J. Pantoea: insights into a highly versatile and diverse genus within the Enterobacteriaceae. FEMS Microbiol. Rev. 2015;39(6):968–984. doi: 10.1093/femsre/fuv027. [DOI] [PubMed] [Google Scholar]
- 2.Brady C., Cleenwerck I., Venter S., Vancanneyt M., Swings J., Coutinho T. Phylogeny and identification of Pantoea species associated with plants, humans and the natural environment based on multilocus sequence analysis (MLSA) Syst. Appl. Microbiol. 2008;31:447–460. doi: 10.1016/j.syapm.2008.09.004. [DOI] [PubMed] [Google Scholar]
- 3.De Maayer P., Chan W.Y., Rubagotti E., Venter S.N., Toth I.K., Birch P.R.J., Coutinho T.A. Analysis of the Pantoea ananatis pan-genome reveals factors underlying its ability to colonize and interact with plant, insect and vertebrate hosts. BMC Genom. 2014;15:404. doi: 10.1186/1471-2164-15-404. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Chen C., Xin K., Liu H., Cheng J., Shen X., Wang Y., Zhang L. Pantoea alhagi, a novel endophytic bacterium with ability to improve growth and drought tolerance in wheat. Sci. Rep. 2017;7:41564. doi: 10.1038/srep41564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Feng Y., Shen D., Song W. Rice endophyte Pantoea agglomerans YS19 promotes host plant growth and affects allocations of host photosynthates. J. Appl. Microbiol. 2006;100:938–945. doi: 10.1111/j.1365-2672.2006.02843.x. [DOI] [PubMed] [Google Scholar]
- 6.da Silva J.F., Barbosa R.R., de Souza A.N., da Motta O.V., Teixeira G.n., Carvalho V.S., de Souza A.L.S.R., de Souza Filho G.A. Isolation of Pantoea ananatis from sugarcane and characterization of its potential for plant growth promotion. Genet. Mol. Res. 2015;14(4):15301–15311. doi: 10.4238/2015.November.30.6. [DOI] [PubMed] [Google Scholar]
- 7.Cimmino A., Andolfi A., Marchi G., Surico G., Evidente A. Phytohormone production by strains of Pantoea agglomerans from knots on olive plants caused by Pseudomonas savastanoi pv. Savastanoi. Phytopathol. Mediterr. 2006;45:247–252. [Google Scholar]
- 8.Smits T.H.M., Rezzonico F., Kamber T., Blom J., Goesmann A., Ishimaru C.A., Frey J.E., Stockwell V.O., Duffy B. Metabolic versatility and antibacterial metabolite biosynthesis are distinguishing genomic features of the fire blight antagonist Pantoea vagans C9-1. PLoS One. 2011;6:e22247. doi: 10.1371/journal.pone.0022247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wright S.A., Zumoff C.H., Schneider L., Beer S.V. Pantoea agglomerans strain EH318 produces two antibiotics that inhibit Erwinia amylovora in vitro. Appl. Environ. Microbiol. 2001;67:284–292. doi: 10.1128/AEM.67.1.284-292.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.SANBI, Pellaea calomelanos, 2018. 〈http://pza.sanbi.org/pellaea-calomelanos/〉, (Accessed 30 March 2018).
- 11.Tatusova T., DiCuccio M., Badretdin A., Chetvernin V., Nawrocki E.P., Zasllavsky L., Lomsadze A., Pruitt K.D., Borodovsky M., Ostell J. NCBI prokaryotic genome annotation pipeline. Nucleic Acid. Res. 2016;44(14):6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Aziz R.K., Bartels D., Best A.A. The RAST server: rapid annotations using subsystems technology. BMC Genom. 2008;9:1–15. doi: 10.1186/1471-2164-9-75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Overbeek R., Olson R., Pusch G.D. The SEED and the rapid annotation of microbial genomes using subsystems technology (RAST) Nucleic Acids Res. 2013;42:204–214. doi: 10.1093/nar/gkt1226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Duong D.A., Stevens A.M., Jensen R.V. Complete genome assembly of Pantoea stewartii subsp. Stewartii DC283, a corn pathogen. Genome Announc. 2017;22 doi: 10.1128/genomeA.00435-17. (e00435-17) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Krause A., Ramakumar A., Bartels D., Battistoni F., Bekel T., Boch J., Böhm M., Friedrich F., Hurek T., Krause L., Linke B., McHardy A.C., Sarkar A., Schneiker S., Syed A.A., Thauer R., Vorhölter F.J., Weidner S., Pühler A., Reinhold-Hurek B., Kaiser O., Goesmann A. Complete genome of mutualistic, N2-fixing grass endophyte Azoarcus sp. Strain BH72. Nat. Biotechnol. 2007 doi: 10.1038/nbt1243. [DOI] [PubMed] [Google Scholar]
- 16.Bertalan M., Albano R., de Padua V., Rouws L., Rojas C. Complete genome sequence of the sugarcane nitrogen-fixing endophyte Gluconacetobacter diazotrophicus Pal5. BMC Genom. 2009;10:450. doi: 10.1186/1471-2164-10-450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Taghavi S., van der Lelie D., Hoffman A., Zhang Y.B., Walla M.D., Vangrondveld J., Newman L., Monchy S. Genome sequence of the plant growth promoting endophytic bacterium Enterobacter sp. 638. PLoS Genet. 2010;6(5):e1000943. doi: 10.1371/journal.pgen.1000943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Afgan E., Baker D., van den Beek M., Blankenberg D., Bouvier D., Čech M., Chilton J., Clements D., Coraor N., Eberhard C., Grüning B., Guelrler A., Hillman-Jackson J., Von Kuster G., Rasche E., Soranzo N., Turaga N., Taylor J., Nekrutenko A., Goecks J. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses. Nucleic Acids Res. 2016;44:W3–W10. doi: 10.1093/nar/gkw343. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material