ABSTRACT
Here, we report the first sequenced genome of an indigenous Australian wine isolate of Torulaspora delbrueckii using the Oxford Nanopore MinION and Illumina HiSeq sequencing platforms. The genome size is 9.4 Mb and contains 4,831 genes.
GENOME ANNOUNCEMENT
Torulaspora delbrueckii occurs saprophytically on wine grape surfaces worldwide (1). Under winemaking conditions, it displays a less vigorous fermentation phenotype than Saccharomyces cerevisiae, differing in flavor and aroma compound production (2). Its favorable oenological traits, such as low acetic acid production and osmotic tolerance, have led to its commercialization and adoption for use in the wine industry (3). Despite this, T. delbrueckii is still not well characterized at a molecular level, with no other genomes reported from wine isolates.
For a better understanding of T. delbrueckii oenological traits, RNA sequencing (RNA-seq) studies under wine-like conditions are required. However, in the absence of a closely related reference sequence, significant challenges remain when it comes to assembling short reads into full-length gene and transcript models. In order to facilitate future wine-related studies, the genome of a wine isolate of T. delbrueckii was sequenced and characterized.
T. delbrueckii strain COFT1 was isolated from a spontaneous wine fermentation at the Yalumba Wine Company (Angaston, South Australia, Australia). High-molecular-weight genomic DNA was extracted according to the phenol-chloroform protocol (4), including a 2-h preincubation with Zymolase. DNA was prepared for sequencing with the MinION device using the SQK-LSK108 library prep kit (protocol GDE_9002_v108_revT_18Oct2016) and R9.4 chemistry. A total of 138,992 reads were obtained, for a total of 1,214 Mbp (130× coverage) and an average length of 8,737 bp. Fast5 files were base called using Albacore version 2.0.2. Passed reads were trimmed for adapters using PoreChop version 0.2.3 and then assembled using SMARTdenovo version 1.0 (https://github.com/ruanjue/smartdenovo). Contigs obtained from the assembly were polished using Racon version 0.5.0 (5) and Nanopolish version 0.8.5 (6). A final polish of the assembly was performed with Pilon version 1.22 using Illumina HiSeq RNA reads extracted from pure culture laboratory ferments with strain COFT1. The final assembly had no gaps, with a total length of 9,356,826 bp arranged in 9 chromosomes (1 mitochondrial chromosome) and an average GC content of 42%. The genome was first annotated with YGAP (7), and 5,231 genes were predicted. An improved annotation was performed with MAKER2 (8), providing STAR RNA alignment information (9) for the prediction of protein-coding genes. A total of 4,831 protein-coding genes were identified, compared to 4,714 and 4,972 protein-coding genes reported in previous genomes (10, 11). Functional annotation of the predicted protein sequences was performed using BLASTP (12) against the Swiss-Prot protein sequence database (E value = 1e-5) (13). BUSCO assessment (14) revealed a genome completeness of 98%.
The functional annotation of the genome, as well as its completeness, allows comparison with the species S. cerevisiae and a better understanding of the oenological traits of this yeast. The genome sequence reported here will assist in delivering clearer transcriptional results in complex wine-like fermentations (e.g., mixed fermentations), providing useful insight into these processes for the wine industry.
Accession number(s).
The final genome sequence has been deposited to NCBI GenBank database under accession numbers CP027647 to CP027655.
ACKNOWLEDGMENTS
We thank the Yalumba Wine Company for providing the wine samples and the Genomics Facility at SAHMRI (Adelaide, South Australia) for the use of the TapeStation 2200.
This work was supported by the Australian Research Council Industrial Transformation Training Centre for Innovative Wine Production (project IC130100005) and Wine Australia (project AGT1502).
Footnotes
Citation Tondini F, Jiranek V, Grbin PR, Onetto CA. 2018. Genome sequence of Australian indigenous wine yeast Torulaspora delbrueckii COFT1 using nanopore sequencing. Genome Announc 6:e00321-18. https://doi.org/10.1128/genomeA.00321-18.
REFERENCES
- 1.Renouf V, Claisse O, Lonvaud-Funel A. 2005. Understanding the microbial ecosystem on the grape berry surface through numeration and identification of yeast and bacteria. Aust J Grape Wine Res 11:316–327. doi: 10.1111/j.1755-0238.2005.tb00031.x. [DOI] [Google Scholar]
- 2.Jolly NP, Varela C, Pretorius IS. 2014. Not your ordinary yeast: non-Saccharomyces yeasts in wine production uncovered. FEMS Yeast Res 14:215–237. doi: 10.1111/1567-1364.12111. [DOI] [PubMed] [Google Scholar]
- 3.Ciani M, Morales P, Comitini F, Tronchoni J, Canonico L, Curiel JA, Oro L, Rodrigues AJ, Gonzalez R. 2016. Non-conventional yeast species for lowering ethanol content of wines. Front Microbiol 7:642. doi: 10.3389/fmicb.2016.00642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Sambrook J, Russell DW. 2006. Purification of nucleic acids by extraction with phenol: chloroform. Cold Spring Harb Protoc 2006:pdb.prot4045. doi: 10.1101/pdb.prot4045. [DOI] [PubMed] [Google Scholar]
- 5.Vaser R, Sović I, Nagarajan N, Šikić M. 2017. Fast and accurate de novo genome assembly from long uncorrected reads. Genome Res 27:737–746. doi: 10.1101/gr.214270.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Loman NJ, Quick J, Simpson JT. 2015. A complete bacterial genome assembled de novo using only nanopore sequencing data. Nat Methods 12:733. doi: 10.1038/nmeth.3444. [DOI] [PubMed] [Google Scholar]
- 7.Proux-Wéra E, Armisén D, Byrne KP, Wolfe KH. 2012. A pipeline for automated annotation of yeast genome sequences by a conserved-synteny approach. BMC Bioinformatics 13:237. doi: 10.1186/1471-2105-13-237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Holt C, Yandell M. 2011. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinformatics 12:491. doi: 10.1186/1471-2105-12-491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Dobin A, Gingeras TR. 2015. Mapping RNA-seq reads with STAR. Curr Protoc Bioinformatics 51:11–19. doi: 10.1002/0471250953.bi1114s51. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Gomez-Angulo J, Vega-Alvarado L, Escalante-García Z, Grande R, Gschaedler-Mathis A, Amaya-Delgado L, Arrizon J, Sanchez-Flores A. 2015. Genome sequence of Torulaspora delbrueckii NRRL Y-50541, isolated from Mezcal fermentation. Genome Announc 3:e00438-15. doi: 10.1128/genomeA.00438-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Gordon JL, Armisén D, Proux-Wéra E, ÓhÉigeartaigh SS, Byrne KP, Wolfe KH. 2011. Evolutionary erosion of yeast sex chromosomes by mating-type switching accidents. Proc Natl Acad Sci U S A 108:20024. doi: 10.1073/pnas.1112808108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. 1990. Basic local alignment search tool. J Mol Biol 215:403. doi: 10.1016/S0022-2836(05)80360-2. [DOI] [PubMed] [Google Scholar]
- 13.Bairoch A, Apweiler R. 2000. The Swiss-Prot protein sequence database and its supplement TrEMBL in 2000. Nucleic Acids Res 28:45–48. doi: 10.1093/nar/28.1.45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Simão FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. 2015. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31:3210–3212. doi: 10.1093/bioinformatics/btv351. [DOI] [PubMed] [Google Scholar]