Abstract
Saccharomyces boulardii is the only yeast approved as a probiotic for human consumption. Here, we report the draft genome sequence of the strain ATCC MYA-796, derived from the French Ultra Levure probiotic drug. The genome has a size of 11.6 Mb with 5,305 putative open reading frames predicted.
GENOME ANNOUNCEMENT
Isolated by the French scientist Henri Boulard in 1920 during a cholera outbreak, the yeast Saccharomyces cerevisiae var. boulardii is still the only eukaryotic microorganism used as a probiotic in human health for the treatment of gastrointestinal disorders (1). This probiotic yeast is used worldwide and has been tested for clinical efficacy against several diseases, including traveler’s diarrhea, antibiotic-associated diarrhea (AAD), acute adult diarrhea, HIV-related diarrhea, Helicobacter pylori diseases, Clostridium difficile and Salmonella typhi infections, and Crohn’s disease, among others (2). Recently, a work was published describing the genome of an S. boulardii strain commercialized in India (3).
Here, we report the draft genome sequence of S. cerevisiae var. boulardii strain ATCC MYA-796, derived from the French Ultra Levure probiotic drug. The genomic DNA was sequenced using Illumina HiSeq by Axeq Technologies (http://www.axeq.com). A total of 48.3 million paired-end reads of 101 bp with an estimated 403× coverage were produced. Different k-mer values were tested to obtain the ideal value (k 61). The de novo assembly was performed using SOAPdenovo version 2.04 (4) with parameters -R -u -F -M 2. The contigs were scaffolded using SSPACE (5) and ordered by CONTIGuator (6) using the S. cerevisiae S288c genome as a reference; gaps were closed with GapCloser (4). The resulting assembly has 193 contigs (>400 bp) with a total length of 11,405,855 bp (the largest contig having a length of 459,679 bp), an N50 of 203 kb (in 19 contigs), and a G+C content of 38.1% and 100% of completeness, as estimated by mapping the orthologous genes (KOG databases) using CEGMA software (7). Genome annotation performed by MAKER2 (8) revealed 5,305 putative open reading frames (ORFs). An automatic annotation using the BLASTp algorithm (9) revealed 5,321 ORFs with significant similarity (E-value cutoff ≤10−3) to sequences deposited in the non-redundant (nr) protein database from NCBI. Using the tRNAscan-SE version 1.3 software (9), we found 273 tRNA genes scattered across the contigs.
The draft sequence of the yeast S. cerevisiae var. boulardii strain ATCC MYA-796 complements the genetic information of this probiotic yeast already deposited in GenBank.
Nucleotide sequence accession numbers.
Data related to the whole-genome shotgun project of S. cerevisiae var. boulardii ATCC MYA-796 has been deposited at DDBJ/EMBL/GenBank under the accession number JRHY00000000. The version herein described is under accession number JRHY01000000.
ACKNOWLEDGMENTS
We thank Flaviano S. Martins from the Universidade Federal de Minas Gerais (Belo Horizonte, Brazil) for helpful discussion and suggestions, and EMBRAPA Informática Agropecuária (Campinas, Brazil) for providing access to genome annotation.
This work was partially funded by Fondazione RiMED (Palermo, Italy).
Footnotes
Citation Batista TM, Marques ETA, Jr., Franco GR, Douradinha B. 2014. Draft genome sequence of the probiotic yeast Saccharomyces cerevisiae var. boulardii strain ATCC MYA-796. Genome Announc. 2(6):e01345-14. doi:10.1128/genomeA.01345-14.
REFERENCES
- 1. Czerucka D, Piche T, Rampal P. 2007. Review article: yeast as probiotics—Saccharomyces boulardii. Aliment. Pharmacol. Ther. 26:767–78. 10.1111/j.1365-2036.2007.03442.x. [DOI] [PubMed] [Google Scholar]
- 2. McFarland LV. 2010. Systematic review and meta-analysis of Saccharomyces boulardii in adult patients. World J. Gastroenterol. 16:2202–2222. 10.3748/wjg.v16.i18.2202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Khatri I, Akhtar A, Kaur K, Tomar R, Prasad GS, Ramya TNC, Subramanian S. 2013. Gleaning evolutionary insights from the genome sequence of a probiotic yeast Saccharomyces boulardii. Gut Pathog. 5:30. 10.1186/1757-4749-5-30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Luo R, Liu B, Xie Y, Li Z, Huang W, Yuan J, He G, Chen Y, Pan Q, Liu Y, Tang J, Wu G, Zhang H, Shi Y, Liu Y, Yu C, Wang B, Lu Y, Han C, Cheung DW, Yiu S-M, Peng S, Xiaoqian Z, Liu G, Liao X, Li Y, Yang H, Wang J, Lam T-W, Wang J. 2012. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler. GigaScience. 1:18. 10.1186/2047-217X-1-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W. 2011. Scaffolding pre-assembled contigs using SSPACE. Bioinformatics 27:578–579. 10.1093/bioinformatics/btq683. [DOI] [PubMed] [Google Scholar]
- 6. Galardini M, Biondi EG, Bazzicalupo M, Mengoni A. 2011. CONTIGuator: a bacterial genomes finishing tool for structural insights on draft genomes. Source Code Biol. Med. 6:11. 10.1186/1751-0473-6-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Parra G, Bradnam K, Korf I. 2007. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23:1061–1067. 10.1093/bioinformatics/btm071. [DOI] [PubMed] [Google Scholar]
- 8. Holt C, Yandell M. 2011. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics 12:491. 10.1186/1471-2105-12-491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL. 2009. BLAST+: architecture and applications. BMC Bioinformatics 10:421. 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]