The genus Saccharibacter is currently understudied, with only one described species, Saccharibacter floricola, isolated from a flower. In an effort to better understand the microbes that come in contact with native bee pollinators, we isolated and sequenced four additional strains of Saccharibacter from native bees in the genera Melissodes and Anthophora. These genomes range in size from 2,104,494 to 2,316,791 bp (mean, 2,246,664 bp) and contain between 1,860 and 2,167 (mean, 2,060) protein-coding genes.
ABSTRACT
The genus Saccharibacter is currently understudied, with only one described species, Saccharibacter floricola, isolated from a flower. In an effort to better understand the microbes that come in contact with native bee pollinators, we isolated and sequenced four additional strains of Saccharibacter from native bees in the genera Melissodes and Anthophora. These genomes range in size from 2,104,494 to 2,316,791 bp (mean, 2,246,664 bp) and contain between 1,860 and 2,167 (mean, 2,060) protein-coding genes.
ANNOUNCEMENT
The genus Saccharibacter currently comprises only one described species, isolated from the pollen of Japanese flowers (1). Saccharibacter floricola is an acetic acid bacterium (AAB). AAB have been shown to be associated with diverse insects that rely on sugar-rich diets (2, 3). As the genus Saccharibacter has been isolated from flowers, it may also come in contact with insect pollinators and play a role in their utilization of nectar as a food source. To determine whether this is the case and to bolster the available data for this potentially important genus, we isolated and sequenced the genomes of four Saccharibacter sp. strains isolated from native bee pollinators.
Four strains of Saccharibacter were isolated from native bees. Strains EH60, EH70, and E611 were isolated from bees in the genus Anthophora, and strain 17.LH.SD was isolated from a bee in the genus Melissodes, collected from Yosemite National Park, CA, and The Dalles, OR, respectively. Initial samples were streaked on MRS plus 2% fructose agar plates, after which single colonies were picked and grown in liquid culture. Strains were grown in yeast-peptone-glucose medium at 30°C with aeration. Total DNA was extracted using the DNeasy blood and tissue kit (Qiagen, Hilden, Germany), and sequencing library preparation was performed using the NEBNext Ultra II DNA library preparation kit (NEB, Ipswich, MA), both according to the manufacturers’ protocols. The resulting libraries were subjected to 250-bp paired-end sequencing on the Illumina NextSeq 500 platform (version 2 chemistry) (strain 17.LH.SD was sequenced on the Illumina MiSeq platform [version 2 chemistry] with 300-bp paired-end sequencing) at the Indiana University Center for Genomics and Bioinformatics (Bloomington, IN). These sequencing runs generated 104,267 to 941,404 (mean, 551,094) read pairs.
Initial de novo assembly of these strains was performed using MaSuRCA version 3.2.8 with default settings (4). Reads were not subjected to quality control (QC) prior to assembly, as MaSuRCA performs internal QC during the assembly process. Additionally, reads were randomly subsampled down to approximately 50× coverage prior to assembly (100× for strain EH70). The completeness of the assemblies was assessed using both BUSCO (version 3; alphaproteobacteria lineage data set) (5) and CheckM (version 1.1.1) (6). Both of these tools indicated that the assemblies were >99% complete.
The assemblies resulted in 13 to 30 (mean, 23) contigs comprising 2,104,494 to 2,316,791 bp (mean, 2,246,664 bp), with an N50 contig length of 297,436 to 461,108 bp (mean, 341,556 bp). The GC content of these strains was 48.61 to 51.47% (mean, 50.56%). Annotation was carried out with NCBI’s Prokaryotic Genome Annotation Pipeline (PGAP) (7), which predicted between 1,860 and 2,167 (mean, 2,060) protein-coding genes and 1 to 4 (mean, 2) rRNAs. Assembly statistics are summarized in Table 1.
TABLE 1.
Strain | GenBank accession no. | No. of contigs | Genome size (Mb) | % GC content | N50 (bp) | L50 | Host genus | Isolation locationa |
---|---|---|---|---|---|---|---|---|
EH60 | WVHQ00000000 | 30 | 2.32 | 51.47 | 301,953 | 4 | Anthophora | Yosemite NP, CA |
EH611 | WVHP00000000 | 25 | 2.28 | 51.47 | 303,728 | 4 | Anthophora | Yosemite NP, CA |
EH70 | WVHN00000000 | 26 | 2.29 | 50.70 | 297,436 | 4 | Anthophora | Yosemite NP, CA |
17.LH.SD | WVHO00000000 | 13 | 2.10 | 48.61 | 461,108 | 2 | Melissodes | The Dalles, OR |
NP, National Park.
Data availability.
These whole-genome shotgun projects have been deposited at DDBJ/ENA/GenBank under the accession numbers WVHN00000000, WVHO00000000, WVHP00000000, and WVHQ00000000. The versions described in this paper are versions WVHN01000000, WVHO01000000, WVHP01000000, and WVHQ01000000. Sequencing reads have been deposited in the Sequence Read Archive (SRA) under the accession numbers SRR10728926 to SRR10728929.
REFERENCES
- 1.Jojima Y, Mihara Y, Suzuki S, Yokozeki K, Yamanaka S, Fudou R. 2004. Saccharibacter floricola gen. nov., sp nov., a novel osmophilic acetic acid bacterium isolated from pollen. Int J Syst Evol Microbiol 54:2263–2267. doi: 10.1099/ijs.0.02911-0. [DOI] [PubMed] [Google Scholar]
- 2.Chouaia B, Gaiarsa S, Crotti E, Comandatore F, Degli Esposti M, Ricci I, Alma A, Favia G, Bandi C, Daffonchio D. 2014. Acetic acid bacteria genomes reveal functional traits for adaptation to life in insect guts. Genome Biol Evol 6:912–920. doi: 10.1093/gbe/evu062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Crotti E, Rizzi A, Chouaia B, Ricci I, Favia G, Alma A, Sacchi L, Bourtzis K, Mandrioli M, Cherif A, Bandi C, Daffonchio D. 2010. Acetic acid bacteria, newly emerging symbionts of insects. Appl Environ Microbiol 76:6963–6970. doi: 10.1128/AEM.01336-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Zimin AV, Marcais G, Puiu D, Roberts M, Salzberg SL, Yorke JA. 2013. The MaSuRCA genome assembler. Bioinformatics 29:2669–2677. doi: 10.1093/bioinformatics/btt476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]
- 6.Parks DH, Imelfort M, Skennerton CT, Hugenholtz P, Tyson GW. 2015. CheckM: assessing the quality of microbial genomes recovered from isolates, single cells, and metagenomes. Genome Res 25:1043–1055. doi: 10.1101/gr.186072.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tatusova T, DiCuccio M, Badretdin A, Chetvernin V, Nawrocki EP, Zaslavsky L, Lomsadze A, Pruitt KD, Borodovsky M, Ostell J. 2016. NCBI Prokaryotic Genome Annotation Pipeline. Nucleic Acids Res 44:6614–6624. doi: 10.1093/nar/gkw569. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
These whole-genome shotgun projects have been deposited at DDBJ/ENA/GenBank under the accession numbers WVHN00000000, WVHO00000000, WVHP00000000, and WVHQ00000000. The versions described in this paper are versions WVHN01000000, WVHO01000000, WVHP01000000, and WVHQ01000000. Sequencing reads have been deposited in the Sequence Read Archive (SRA) under the accession numbers SRR10728926 to SRR10728929.