Skip to main content
Microbiology Resource Announcements logoLink to Microbiology Resource Announcements
. 2018 Jul 26;7(3):e00834-18. doi: 10.1128/MRA.00834-18

New Reference Genome Sequences for 17 Bacterial Strains of the Honey Bee Gut Microbiota

Kirsten M Ellegaard a, Philipp Engel a,
Editor: J Cameron Thrashb
PMCID: PMC6211356  PMID: 30533872

We sequenced the genomes of 17 strains isolated from the gut of honey bees, including strains representing the genera Lactobacillus, Bifidobacterium, Gilliamella, Snodgrassella, Frischella, and Commensalibacter. These genome sequences represent an important step forward in the development of a comprehensive reference database to aid future analysis of this emerging gut microbiota model.

ABSTRACT

We sequenced the genomes of 17 strains isolated from the gut of honey bees, including strains representing the genera Lactobacillus, Bifidobacterium, Gilliamella, Snodgrassella, Frischella, and Commensalibacter. These genome sequences represent an important step forward in the development of a comprehensive reference database to aid future analysis of this emerging gut microbiota model.

ANNOUNCEMENT

The honey bee gut is colonized by a remarkably simple community dominated by only 8 to 10 bee-specific phylotypes (1). However, genome-level analyses have shown that several of the phylotypes comprise highly divergent strains (24). As such, the honey bee is a promising future model for studying strain-level evolution and function in gut-associated bacterial communities (5). Here, we present 17 new genome sequences of strains isolated from the gut of honey bees, which were generated to facilitate the development of a reference genome database for this community. All strains were isolated from honey bees collected from our apiary in Lausanne, Switzerland, by culturing gut homogenates on agar plates (6) under microaerophilic or anaerobic conditions (7).

Four strains of the genus Lactobacillus (Table 1) were selected for sequencing with PacBio 20K (Pacific Biosciences) single-molecule real-time (SMRT) technology. The strains were grown overnight in MRS broth supplemented with fructose and cysteine (8) at 35°C under anaerobic conditions, and total genomic DNA was extracted using a cetyltrimethylammonium bromide-based extraction protocol (7). De novo genome assembly was done using the Hierarchical Genome Assembly Process (HGAP) version 2.3. Another 13 strains representing the genera Bifidobacterium, Gilliamella, Snodgrassella, Frischella, and Commensalibacter (Table 1) were selected for sequencing with Illumina technology. The strains were cultured as described previously (7), and total genomic DNA was extracted with the GenElute bacterial genomic DNA kit according to the manufacturer’s instructions (Table 1). Sequencing libraries were prepared with the TruSeq DNA kit and sequenced on the MiSeq platform (Illumina) using the paired-end 2 × 250-bp protocol. All 13 genomes were sequenced to a minimum depth of 50× (Table 1). The resulting FASTQ files were trimmed with Trimmomatic (9) to remove eventual adapter sequences and low-quality reads using the following parameters: LEADING, 20; TRAILING, 20; SLIDINGWINDOW, 4:15; and MINLEN, 50. The reads were assembled with SPAdes version 3.7.1 (10) using the “-careful” flag and multiple k-mer sizes (21, 33, 55, 77, 99, 127). Small contigs (less than 500 bp) and contigs with low k-mer coverage (less than 5×) were removed from the assemblies, resulting in 6 to 40 contigs per assembly, with a median N50 of 529,190 bp. For strains with related complete genome sequences or scaffolds available, the contigs were reordered with Mauve (11).

TABLE 1.

Genome assembly statistics and strain information

Genus Species Phylotypea Sublineage Strain Extracted
DNA (µg)b
No. of
contigs
N50 (bp) Assembly
size (bp)
Coverage
(×)
GC content
(%)
No. of
genesc
Lactobacillus L. apis Firm5 Firm5-1 ESL0185 17.9 1 1,683,102 1,683,102 420 37 1,578
Lactobacillus L. helsingborgensis Firm5 Firm5-2 ESL0183 19.3 2 1,856,015 1,867,232 300 37 1,780
Lactobacillus L. melliventris Firm5 Firm5-3 ESL0184 19.8 4 1,505,590 2,036,181 320 36 2,015
Lactobacillus L. kulllabergensis Firm5 Firm5-4 ESL0186 16.1 1 2,018,944 2,018,944 290 36 1,915
Bifidobacterium B. asteroides Bifido Bifido-1 ESL0170 1.1 7 1,162,986 2,175,262 200 60 1,771
Bifidobacterium B. asteroides Bifido Bifido-1 ESL0198 1.3 12 618,428 2,235,610 280 60 1,820
Bifidobacterium B. asteroides Bifido Bifido-1 ESL0199 5.3 7 558,059 2,167,340 50 59 1,741
Bifidobacterium B. asteroides Bifido Bifido-1 ESL0200 4.7 16 500,320 1,933,421 300 60 1,621
Bifidobacterium B. indicum /
B. coryneforme
Bifido Bifido-2 ESL0197 1.1 6 1,389,647 1,715,238 300 61 1,408
Gillamella G. apicola Gilliamella Gilli-1 ESL0178 0.3 18 364,598 2,885,657 200 34 2,602
Gillamella G. apis Gilliamella Gilli-2 ESL0169 3.9 13 481,163 2,430,778 270 35 2,227
Gillamella G. apis Gilliamella Gilli-2 ESL0172 1.8 17 374,672 2,685,772 200 34 2,468
Gillamella NAd Gilliamella Gilli-3 ESL0177 2.0 19 953,736 3,086,198 50 35 2,868
Gillamella NA Gilliamella Gilli-3 ESL0182 1.2 31 255,373 3,537,173 160 35 3,257
Snodgrassella S. alvi Snodgrasella NA ESL0196 3.7 15 1,281,809 2,446,304 130 41 2,224
Frischella F. perrara Frischella NA ESL0167 3.1 40 277,847 2,558,525 200 34 2,313
Commensali-
bacter
Commensalibacter
sp.
Commensalibacter NA ESL0284 1.3 13 471,180 1,948,862 50 38 1,767
a

Based on 16S rRNA amplicon sequencing.

b

Total amount of extracted DNA.

c

Gene count based on the JGI Microbial Genome Annotation Pipeline.

d

NA, not applicable.

Assembly qualities were checked by remapping reads to assemblies with the Burrows-Wheeler Aligner (12) and by GC-skew visualization with DNAplotter (13). For strain ESL0184, the main chromosome was cut into three contigs due to assembly uncertainty generated by a duplicated prophage sequence. Strains ESL0183, ESL0185, and ESL0186 were submitted as complete genomes, with strain ESL0183 having a small plasmid contig of 11.3 kb.

Core phylogenies were generated for the Lactobacillus, Bifidobacterium, and Gilliamella strains, including previously published isolates derived from honey bees, using OrthoFinder (14) for ortholog prediction and RAxML (15) for phylogenetic inference. Based on the phylogenies, the Lactobacillus and Bifidobacterium strains represent members of previously reported sublineages, whereas two strains of the genus Gilliamella (ESL0177 and ESL0182) represent a new sublineage, with strain ESL182 having the largest genome size reported for this genus to date (3.5 Mbp) (Table 1).

Data availability.

The complete genome sequences for the strains reported here have been deposited in GenBank under the accession numbers CP029476, CP029544/CP029545, and CP029477, and the whole-genome shotgun projects have been deposited under the accession numbers QGLH00000000, QGLJ00000000, QGLK00000000, QGLL00000000, QGLI00000000, QGLG00000000, QGLQ00000000, QGLN00000000, QGLO00000000, QGLP00000000, QGLR00000000, QGLS00000000, QGLM00000000, and QGLT00000000. Additionally, the genomes were annotated using the JGI Microbial Genome Annotation Pipeline, where they have been deposited under the genome identification numbers 2684622912, 2684622914, 2684622911, 2684622916, 2684622918, 2684622919, 2684622920, 2684622917, 2684622913, 2684622925, 2684622922, 2684622923, 2684622924, 2684622926, 2684622927, 2684622921, and 2756170209.

ACKNOWLEDGMENTS

This work was funded by Human Frontier Science Program (HFSP) Young Investigator grant RGY0077/2016, the European Research Council Starting Grant (ERC-StG) “MicroBeeOme,” Swiss National Science Foundation grant 31003A_160345, and the Fondation Herbette at the University of Lausanne.

REFERENCES

  • 1.Kwong WK, Moran NA. 2016. Gut microbial communities of social bees. Nat Rev Microbiol 14:374–384. doi: 10.1038/nrmicro.2016.43. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Ellegaard KM, Tamarit D, Javelind E, Olofsson TC, Andersson SGE, Vasquez A. 2015. Extensive intra-phylotype diversity in lactobacilli and bifidobacteria from the honeybee gut. BMC Genomics 16:284. doi: 10.1186/s12864-015-1476-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Engel P, Stepanauskas R, Moran NA. 2014. Hidden diversity in honey bee gut symbionts detected by single-cell genomics. PLoS Genet 10:e1004596. doi: 10.1371/journal.pgen.1004596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Engel P, Martinson VG, Moran NA. 2012. Functional diversity within the simple gut microbiota of the honey bee. Proc Natl Acad Sci U S A 109:11002–11007. doi: 10.1073/pnas.1202970109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Ellegaard KM, Engel P. 2016. Beyond 16S rRNA community profiling: intra-species diversity in the gut microbiota. Front Microbiol 7:1475. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Engel P, James RR, Koga R, Kwong WK, McFrederick QS, Moran NA. 2013. Standard methods for research on Apis mellifera gut symbionts. J Apicult Res 52:1–24. doi: 10.3896/IBRA.1.52.4.07. [DOI] [Google Scholar]
  • 7.Kesnerova L, Mars RAT, Ellegaard KM, Troilo M, Sauer U, Engel P. 2017. Disentangling metabolic functions of bacteria in the honey bee gut. PLoS Biol 15:e2003467. doi: 10.1371/journal.pbio.2003467. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Olofsson TC, Alsterfjord M, Nilson B, Butler E, Vasquez A. 2014. Lactobacillus apinorum sp. nov., Lactobacillus mellifer sp. nov., Lactobacillus mellis sp. nov., Lactobacillus melliventris sp. nov., Lactobacillus kimbladii sp. nov., Lactobacillus helsingborgensis sp. nov. and Lactobacillus kullabergensis sp. nov., isolated from the honey stomach of the honeybee Apis mellifera. Int J Syst Evol Microbiol 64:3109–3119. doi: 10.1099/ijs.0.059600-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Bolger AM, Lohse M, Usadel B. 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120. doi: 10.1093/bioinformatics/btu170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. 2012. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol 19:455–477. doi: 10.1089/cmb.2012.0021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Darling AE, Mau B, Perna NT. 2010. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5:e11147. doi: 10.1371/journal.pone.0011147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Li H, Durbin R. 2009. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25:1754–1760. doi: 10.1093/bioinformatics/btp324. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Carver T, Thomson N, Bleasby A, Berriman M, Parkhill J. 2009. DNAPlotter: circular and linear interactive genome visualization. Bioinformatics 25:119–120. doi: 10.1093/bioinformatics/btn578. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Emms DM, Kelly S. 2015. OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy. Genome Biol 16:157. doi: 10.1186/s13059-015-0721-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Stamatakis A. 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22:2688–2690. doi: 10.1093/bioinformatics/btl446. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

The complete genome sequences for the strains reported here have been deposited in GenBank under the accession numbers CP029476, CP029544/CP029545, and CP029477, and the whole-genome shotgun projects have been deposited under the accession numbers QGLH00000000, QGLJ00000000, QGLK00000000, QGLL00000000, QGLI00000000, QGLG00000000, QGLQ00000000, QGLN00000000, QGLO00000000, QGLP00000000, QGLR00000000, QGLS00000000, QGLM00000000, and QGLT00000000. Additionally, the genomes were annotated using the JGI Microbial Genome Annotation Pipeline, where they have been deposited under the genome identification numbers 2684622912, 2684622914, 2684622911, 2684622916, 2684622918, 2684622919, 2684622920, 2684622917, 2684622913, 2684622925, 2684622922, 2684622923, 2684622924, 2684622926, 2684622927, 2684622921, and 2756170209.


Articles from Microbiology Resource Announcements are provided here courtesy of American Society for Microbiology (ASM)

RESOURCES