Abstract
The genome of lager brewer's yeast is a hybrid, with Saccharomyces eubayanus and Saccharomyces cerevisiae as sub-genomes. Due to their specific use in the beer industry, relatively little information is available. The genome of brewing yeast was sequenced and annotated in this study. We obtained a genome size of 22.7 Mbp that consisted of 133 scaffolds, with 65 scaffolds larger than 10 kbp. With respect to the annotation, 9939 genes were obtained, and when they were submitted to a local alignment, we found that 53.93% of these genes corresponded to S. cerevisiae, while another 42.86% originated from S. eubayanus. Our results confirm that our strain is a hybrid of at least two different genomes.
Keywords: Saccharomyces pastorianus, Lager beer, Lager yeast, Hybrid
1. Introduction
The brewing process is probably the most ancient type of biotechnology. There is evidence that the production and consumption of beer began in Egypt in the Early Dynastic Period (5500–3100 BCE) [1]. The first fermentation process was termed “high” because yeast floats to the top of the tank at a temperature between 15 °C and 25 °C during production of ale beer with the yeast Saccharomyces cerevisiae. Lager yeast did not emerge until the 15th century. This yeast is capable of fermenting at a temperature lower than 10 °C and flocculates at the bottom of the tank. The fermentation is followed by a maturation process called “lagering” (Masschelein, 1986).
The Saccharomyces sensu stricto complex includes six parented species: S. cerevisiae, Saccharomyces bayanus, Saccharomyces cariocanus, Saccharomyces kudriavzevii, S. mikatae and S. paradoxus [2]; however, it has been observed that lager-brewing yeast is a hybrid species of two combined genomes of S. eubayanus and S. cerevisiae [3], [4], [5], [6], [7], [8], [9], [10]. This provides an important source of chromosomal rearrangements, leading to the gene number and the size of the complete genome [11], [12], [13], [14]. It has been proposed and recently demonstrated that lager yeast is the product of two independent hybridization events that can be divided into two groups: Saaz and Frohberg, or group I and group II, respectively [15], [16], [17], [18], [19].
With the use of next generation sequencing (NGS) technologies, such as the Illumina Platform, 40,175 prokaryote and eukaryotes genomes have been reported, including 210 different strains of the Saccharomyces complex (http://www.ncbi.nlm.nih.gov/genome/browse/ - revised July 22, 2015).
To obtain a higher level of understanding of the sequenced organism, the data obtained from NGS has been assembled and annotated.
The annotation process consists of identifying the biological characteristics from sequences of the assembly. This can be performed through gene prediction and homologous sequence alignment [20], [21], [22].
Here, we present the analysis of the lager yeast genome Saccharomyces sp. strain 790 and its comparison with S. eubayanus and S. cerevisiae S288c. This study provides information about the genome structure of Saccharomyces sp. strain 790.
2. Materials and methods
2.1. Strains and sequences
The brewing yeast Saccharomyces sp. strain 790 and a reference sequence of 76 scaffolds from S. eubayanus were obtained from the yeast collection of Cervecería Cuauhtémoc Moctezuma S.A. de C.V. The S. cerevisiae S288c reference genome sequence was retrieved from the yeast genome database (www.yeastgenome.org).
2.2. Sequencing and genome assembly
The brewing yeast genome was sequenced using the FLX 454 Titanium (Roche) and MiSeq (Illumina) massive sequencing platforms according to the manufacturer's protocols. We obtained 0.8 million reads from FLX 454 Titanium (454 Life Sciences, Branfort, CT) with an average size of 400 bp; 6 million pair-end reads from Illumina (Illumina, San Diego, CA) with an average size of 150 bp; 5 million mate-pair reads from Illumina with an insert size of 350 bp and a size of 101 × 2 bp; and 11.7 million mate-pair reads from Illumina with an insert of 8 kb and a size of 51 × 2 bp. Approximately 454 Illumina pair-end reads were assembled with a Newbler DeNovo Assembler (Roche). Contigs were then processed using SSPACE 1.0 software (Boetzer et al. 2011). This Whole Genome Shotgun project has been deposited at DDBJ/ENA/GenBank under the accession LSMH00000000. The version described in this paper is version LSMH01000000.
The sequencing quality data were analyzed with FastQC 0.10.1 software with a value ≥ Q30 [23].
Likewise, alignments were made against reference sequences (S. cerevisiae S288c and S. eubayanus) with the MUMmer 3.23 software package [24].
2.3. Genome annotation
The bioinformatics analysis for the annotation was performed with MAKER v2.31.8 software (University of Utah). MAKER is an integrative tool that yields a putative position of the genes. Fig. 1 depicts the annotation steps [22].
Fig. 1.
Maker's annotation pipeline. In step 1, the repetitive sequences are identified and masked; proteins and ESTs are aligned; and using this information, a gene prediction model is made and the structural annotation is completed. Those models are then submitted to a local alignment in Blast for functional annotation.
3. Results and discussion
The read assembly yielded 133 scaffolds with a ~ 70 × depth and a N50 of 568,800 bp, suggesting a complete genome size of ~ 22.7 Mbp (Table 1), similar to previous reports of other lager beer yeasts [11], [15], [25], [26]. Approximately 65/133 scaffolds had a size > 10 kbp, which represents 99.667% of the assembled genome (Table 2). Table 3 shows a comparison of the assembly level of the sequenced genomes of the Saccharomyces species (as of August 2015). It also shows a similar genome size compared to other brewing yeasts [11], [26]. The alignments against the reference genome, S. cerevisiae S288C, assigned scaffolds to each of its 16 chromosomes, and some scaffolds covered different portions of more than one chromosome; for example, scaffold01 (SF01) aligns with two chromosomes: a small portion with chromosome 1 and with chromosome 12. Scaffold17 (SF17) aligns with chromosomes 4 and 5 (Fig. 2).
Table 1.
Assembly of Saccharomyces sp. 790.
| Assembled reads | 17,034,361 |
|---|---|
| Depth | ~ 70 × |
| Estimated genome size (bp) | 22,741,276 |
| Number of scaffolds | 133 |
| Average scaffold size (bp) | 170,987 |
| Largest scaffold (bp) | 1,404,408 |
| N50 | 568,800 |
| ‘Ns’ | 399,699 |
Table 2.
Size range among scaffolds.
| Size level | Number of scaffolds | Bp | Whole genome proportion (%) | Cumulative % of the whole genome |
|---|---|---|---|---|
| 0:999 | 47 | 31,168 | 0.137 | 0.137 |
| 1 k:9999 | 21 | 44,610 | 0.196 | 0.333 |
| 10 k: 99,999 | 18 | 1,063,816 | 4.678 | 5.011 |
| 100 k 199, 999 | 11 | 1,608,982 | 7.075 | 12.086 |
| 200 k: 299,999 | 6 | 1,528,906 | 6.723 | 18.809 |
| 300 k: 399,999 | 6 | 2,092,805 | 9.203 | 28.01 |
| 400 k: 499, 999 | 4 | 1,700,637 | 7.478 | 35.490 |
| 500 k: 599, 999 | 7 | 3,759,004 | 16.529 | 52.020 |
| 600 k: 699,999 | 5 | 3,213,208 | 14.129 | 66.149 |
| 700 k: 799,999 | 1 | 712, 480 | 3.133 | 69.282 |
| 800 k: 899,999 | 1 | 806,158 | 3.545 | 72.827 |
| 900 k: 999,999 | 5 | 4,775,094 | 20.997 | 93.824 |
| 1;404,408 | 1 | 1,404,408 | 6.176 | 100 |
| Total | 133 | 22,741,276 |
Table 3.
Sequenced genome assembly level of Saccharomyces species.
| Organism | Finished genome | Number of chromosomes | Number of scaffolds | Number of contigs | Size (Mpb) |
|---|---|---|---|---|---|
| S. cerevisiae | ✓ | 16 | 17 | – | 12.16 |
| S. kudriavzevii | ✗ | 16 | 2054 | – | 11.19 |
| S. pastorianus | ✗ | – | – | 2425 | 24.21 |
| S. paradoxus | ✗ | 16 | – | 832 | 11.87 |
| S. mikatae | ✗ | 16 | – | 1648 | 11.47 |
| S. bayanus | ✗ | 16 | – | 586 | 11.87 |
| S. boulardii | ✗ | 16 | 48 | – | 11.64 |
| S. arboricola | ✓ | 16 | 35 | – | 11.62 |
| S. uvarum | ✗ | – | – | 3985 | 11.60 |
| S. carlsbergensis | ✗ | 29 | 77 | – | 19.37 |
| S. cerevisiae - S. kudriavzevii | ✗ | – | 60 | 419 | 23.37 |
| S. pastorianus—S. weihenstephan 34/70 | ✗ | – | – | 1358 | 22.96 |
| 790 | ✗ | ~ 32 | 133 | – | 22.74 |
Fig. 2.
Alignment dot plot of Saccharomyces sp. 790 versus S. cerevisiae S288C.
The estimated size matches the previous and known information; this is due to the presence of 16 chromosomes of the S. cerevisiae sub-genome and 16 of S. eubayanus. This suggests an overall estimation of 32 chromosomes without considering ploidy. Likewise, its size is close to the sum of the aforementioned genomes (~ 12 Mbp each). This observation is consistent with the previously reported data by Nakao et al. (2009), Borneman et al. (2011) and Walther et al. (2014), who reported the sequence and assembly of the lager brewing yeast genomes Saccharomyces carlsbergensis (78 scaffolds, 29 chromosomes with a 19.5 Mbp length), and Saccharomyces pastorianus Weihenstephan 34/70 (985 scaffolds, ~ 29 chromosomes, and 22.9 Mbp).
The annotation yielded 9939 CDS and a gff file with their locations in the scaffolds of the assembly (Fig. 3). The protein and transcript sequences, were subjected to a local alignment with the Blast tool [27] against a local database using the sequence of S. cerevisiae S288C as a reference. The transcripts were considered to be genes because previous reports showed that only approximately 5% of the yeast genome contains introns [28], [29].
Fig. 3.
Visualization of annotations with IGV Tools v2.3 (http://www.broadinstitute.org/software/igv/home). Scaffold 01 alignments and predictions from the Maker's annotation.
The scaffolds were classified using the results obtained from Blastn according to the mean identity percentage in all of the genes contained in the same scaffold, as follows (Table 4):
%Id > 99.0% and E value < 10− 6 = scaffold belongs to S. cerevisiae.
%Id < 90.0% and E value < 10− 6 = scaffold does not belong to S. cerevisiae.
99.0% > %Id > 90.0% and E value < 10− 6 = hybrid scaffold.
Our identity criterion was validated by subjecting the gene sequences from S. cerevisiae S288C to a local alignment against S. eubayanus, and we found that the %Id between these strains was < 90% (Supplementary Table S1) and the average size of the CDS was 1550 bp. Approximately 96.8% of the genome was annotated; 53.93% corresponded to S. cerevisiae, 42.86% were non-cerevisiae and 3.20% remained un-annotated Fig. 4 (Supplementary Table S2).
Fig. 4.
Schematic representation of Saccharomyces sp. 790 annotations. S. cerevisiae, S. eubayanus and non-annotated proportions are represented in blue, orange and gray, respectively.
4. Conclusions
From the findings in this work, it can be concluded that Saccharomyces sp. 790 is a hybrid between S. cerevisiae and S. eubayanus. Its nuclear genome consists of approximately 32 chromosomes, 16 of which correspond to the S. cerevisiae genome and 16 to the S. eubayanus genome, without considering ploidy. A total of 133 scaffolds were obtained in the last version of the assembly. Nine scaffolds presented continuous translocations (scaffolds 1, 4, 6, 23 for the S. cerevisiae sub-genome and 26, 11, 17, 22 and 32 for the S. eubayanus sub-genome), which indicate homologous recombination events. One scaffold presented a possible recombination event (scaffold 3). Data on the chromosome number and size, as well as the number of scaffolds obtained, are consistent with previous reports on lager yeast [6], [15], [16]. The next step is to improve the assembly with physical mapping techniques.
The following are the supplementary data related to this article.
Comparative analysis of Saccharomyces cerevisiae S288C and Saccharomyces eubayanus.
Saccharomyces sp. 790 and Saccharomyces cerevisiae comparative results.
Acknowledgements
This work was supported, in part, by the Consejo Nacional de Ciencia y Tecnología (CONACYT) of México Grant number: 212773. P. M. De León-Medina and R. Elizondo-González were recipients of a fellowship from CONACYT.
References
- 1.Hornsey I.S. 2003. A History of Beer and Brewing. [Google Scholar]
- 2.Naumova E.S., Bulat S.A., Mironenko N.V., Naumov G.I. Differentiation of six sibling species in the Saccharomyces sensu stricto complex by multilocus enzyme electrophoresis and UP-PCR analysis, Antonie van Leeuwenhoek. Int. J. Gen. Mol. Microbiol. 2003;83:155–166. doi: 10.1023/a:1023328320228. [DOI] [PubMed] [Google Scholar]
- 3.Nilsson-Tillgren T., Gjermansen C., Kielland-Brandt M.C., Petersen J.G.L., Holmberg S. Genetic differences between Saccharomyces carlsbergensis and S. cerevisiae. Analysis of chromosome III by single chromosome transfer. Carlsberg Res. Commun. 1981;46:65–76. [Google Scholar]
- 4.Martini A.V., Kurtzman C.P. Deoxyribonucleic acid relatedness among species of the genus Saccharomyces sensu stricto. Int. J. Syst. Bacteriol. 1985;35:508–511. [Google Scholar]
- 5.Vaughan-Martini A., Martini A. Facts, myths and legends on the prime industrial microorganism. J. Ind. Microbiol. 1995;14:514–522. doi: 10.1007/BF01573967. [DOI] [PubMed] [Google Scholar]
- 6.Casaregola S., Nguyen H.V., Lapathitis G., Kotyk A., Gaillardin C. Analysis of the constitution of the beer yeast genome by PCR, sequencing and subtelomeric sequence hybridization. Int. J. Syst. Evol. Microbiol. 2001;51:1607–1618. doi: 10.1099/00207713-51-4-1607. ( http://www.ncbi.nlm.nih.gov/pubmed/11491364) [DOI] [PubMed] [Google Scholar]
- 7.Naumova E.S., Naumov G.I., Masneuf-Pomarède I., Aigle M., Dubourdieu D. Molecular genetic study of introgression between Saccharomyces bayanus and S. cerevisiae. Yeast. 2005;22:1099–1115. doi: 10.1002/yea.1298. [DOI] [PubMed] [Google Scholar]
- 8.Rainieri S., Zambonelli C., Kaneko Y. Saccharomyces sensu stricto: systematics, genetic diversity and evolution. J. Biosci. Bioeng. 2003;96:1–9. ( http://www.ncbi.nlm.nih.gov/pubmed/16233475) [PubMed] [Google Scholar]
- 9.Rainieri S., Kodama Y., Kaneko Y., Mikata K., Nakao Y., Ashikari T. Pure and mixed genetic lines of Saccharomyces bayanus and Saccharomyces pastorianus and their contribution to the lager brewing strain genome. Appl. Environ. Microbiol. 2006;72:3968–3974. doi: 10.1128/AEM.02769-05. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Nguyen H.-V., Legras J.-L., Neuvéglise C., Gaillardin C. Deciphering the hybridisation history leading to the lager lineage based on the mosaic genomes of saccharomyces bayanus strains NBRC1948 and CBS380T. PLoS ONE. 2011;6 doi: 10.1371/journal.pone.0025821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Nakao Y., Kanamori T., Itoh T., Kodama Y., Rainieri S., Nakamura N. Genome sequence of the lager brewing yeast, an interspecies hybrid. DNA Res. 2009;16:115–129. doi: 10.1093/dnares/dsp003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Querol A., Bond U. The complex and dynamic genomes of industrial yeasts. FEMS Microbiol. Lett. 2009;293:1–10. doi: 10.1111/j.1574-6968.2008.01480.x. [DOI] [PubMed] [Google Scholar]
- 13.Krogerus K., Magalhães F., Vidgren V., Gibson B. New lager yeast strains generated by interspecific hybridization. 2015 doi: 10.1007/s10295-015-1597-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Hebly M., Brickwedde A., Bolat I., Driessen M.R.M., de Hulster E.A.F., van den Broek M. S. cerevisiae × S. eubayanus interspecific hybrid, the best of both worlds and beyond. FEMS Yeast Res. 2015;15:fov005. doi: 10.1093/femsyr/fov005. [DOI] [PubMed] [Google Scholar]
- 15.Dunn B., Sherlock G. Reconstruction of the genome origins and evolution of the hybrid lager yeast Saccharomyces pastorianus. Genome Res. 2008;18:1610–1623. doi: 10.1101/gr.076075.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Usher J., Bond U. Recombination between homoeologous chromosomes of lager yeasts leads to loss of function of the hybrid GPH1 gene. Appl. Environ. Microbiol. 2009;75:4573–4579. doi: 10.1128/AEM.00351-09. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wright J.C., Sugden D., Francis-McIntyre S., Riba-Garcia I., Gaskell S.J., Grigoriev I.V. Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger. BMC Genomics. 2009;10:61. doi: 10.1186/1471-2164-10-61. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Libkind D., Hittinger C.T., Valério E., Gonçalves C., Dover J., Johnston M. Microbe domestication and the identification of the wild genetic stock of lager-brewing yeast. Proc. Natl. Acad. Sci. U. S. A. 2011;108:14539–14544. doi: 10.1073/pnas.1105430108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Gibson B.R., Storgårds E., Krogerus K., Vidgren V. 2013. Comparative Physiology and Fermentation Performance of Saaz and Frohberg Lager Yeast Strains and the Parental Species Saccharomyces eubayanus; pp. 255–266. [DOI] [PubMed] [Google Scholar]
- 20.Cantarel B.L., Korf I., Robb S.M.C., Parra G., Ross E., Moore B. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res. 2008;18:188–196. doi: 10.1101/gr.6743907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Holt C., Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics. 2011;12:491. doi: 10.1186/1471-2105-12-491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Yandell M., Ence D. A beginner's guide to eukaryotic genome annotation. Nat. Rev. Genet. 2012;13:329–342. doi: 10.1038/nrg3174. [DOI] [PubMed] [Google Scholar]
- 23.Andrews S. FastQC: a quality control tool for high throughput sequence data. 2010. http://www.bioinformatics.babraham.ac.uk/projects/fastqc/
- 24.Delcher A.L., Salzberg S.L., Phillippy A.M. Using MUMmer to identify similar regions in large sequence sets. Curr. Protoc. Bioinformatics. Chapter 10. 2003 doi: 10.1002/0471250953.bi1003s00. Unit 10.3. [DOI] [PubMed] [Google Scholar]
- 25.Borneman A.R., Desany B.A., Riches D., Affourtit J.P., Forgan A.H., Pretorius I.S. Whole-genome comparison reveals novel genetic elements that characterize the genome of industrial strains of Saccharomyces cerevisiae. PLoS Genet. 2011;7 doi: 10.1371/journal.pgen.1001287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Walther A., Hesselbart A., Wendland J. Genome sequence of Saccharomyces carlsbergensis, the world's first pure culture lager yeast. G3 (Bethesda) 2014;4:783–793. doi: 10.1534/g3.113.010090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Camacho C., Coulouris G., Avagyan V., Ma N., Papadopoulos J., Bealer K. BLAST +: architecture and applications. BMC Bioinformatics. 2009;10:421. doi: 10.1186/1471-2105-10-421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Spingola M., Grate L., Haussler D., Ares M. Genome-wide bioinformatic and molecular analysis of introns in Saccharomyces cerevisiae. RNA. 1999;5:221–234. doi: 10.1017/s1355838299981682. http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=1369754&tool=pmcentrez&rendertype=abstract [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Parenteau J., Durand M., Veronneau S., Lacombe A.E.-A., Morin G., Guerin V. Deletion of many yeast introns reveals a minority of genes that require splicing for function. Mol. Biol. Cell. 2008;19:1932–1941. doi: 10.1091/mbc.E07-12-1254. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Comparative analysis of Saccharomyces cerevisiae S288C and Saccharomyces eubayanus.
Saccharomyces sp. 790 and Saccharomyces cerevisiae comparative results.




