We report the draft genome sequences of type strains for Dioszegia crocea and its closely related species Dioszegia aurantiaca, which should improve our understanding of the epiphytic phylloplane yeasts. These data will also have implications for the plant microbiome, since Dioszegia is considered a microbial “hub” taxon.
ABSTRACT
We report the draft genome sequences of type strains for Dioszegia crocea and its closely related species Dioszegia aurantiaca, which should improve our understanding of the epiphytic phylloplane yeasts. These data will also have implications for the plant microbiome, since Dioszegia is considered a microbial “hub” taxon.
ANNOUNCEMENT
Members of the genus Dioszegia (Tremellales, Agaricomycotina, Basidiomycota) are known as epiphytic phylloplane yeasts characterized by orange-yellow or orange to deep-orange colonies and the production of ballistoconidia (1). Most of the Dioszegia species have been isolated from plant leaves, although more recently, some have also been found in soil and in water. A study by Renker et al. (2), in which a sequence identical to the internal transcribed spacer (ITS) region of Dioszegia crocea was observed in DNA isolated from the roots and spores of arbuscular mycorrhizal fungi, suggested the possibility of a close relationship between plants and Dioszegia species.
To contribute to the understanding of epiphytic phylloplane yeasts, we determined the draft genome sequence for type strains of D. crocea (JCM 2961, isolated from a strawberry and originally described as Bullera crocea) (3) and its phylogenetically closely related species Dioszegia aurantiaca (JCM 2956, isolated from overwintered nettle stems of Urtica spp. and originally described as Bullera aurantiaca) (4); both are now classified as members of the genus Dioszegia (5). Strains obtained from the Japan Collection of Microorganisms (JCM), RIKEN BioResource Research Center, were cultivated on YM agar (glucose 10.0 g/liter, peptone 5.0 g/liter, yeast extract 3.0 g/liter, malt extract 3.0 g/liter, agar 20.0 g/liter; BD Difco) plates at 25°C, and the cells were harvested. The genomic DNA was prepared from freeze-dried cell pellets, according to the method of Raeder and Broda (6), and purified using a Genomic-tip 100/G kit (Qiagen, Tokyo), following the manufacturer’s instructions. Paired-end and mate pair libraries with approximate insert sizes of 240 bp and 3 kbp, respectively, were prepared from DNA using a TruSeq DNA PCR-free library preparation kit and a Nextera mate pair sample preparation kit (Illumina, CA), respectively, with some modifications (7). Genome sequencing was accomplished on a HiSeq 2500 platform (Illumina). The reads from mate pair libraries were processed with NextClip v0.8 (8) to trim adapter sequences. ALLPATHS-LG v52488 (9) was used with default parameters to assemble the reads into scaffolds. The completeness of the assemblies was assessed using CEGMA (10). Protein-coding genes were predicted using MAKER (2.31.8) (11) in collaboration with AUGUSTUS (3.0.3) (12) and SNAP (2013-02-16) (13), both of which were trained with the Cryptococcus neoformans var. neoformans JEC21 sequence and GeneMark-ES (4.21) (14). The genes were functionally annotated using Sma3s (15) based on the UniProt-TrEMBL and UniProt-sprot (release 2015_11) databases. tRNAs, small noncoding RNAs, and transposons were also annotated using tRNAscan-SE (1.23) (16), Infernal cmscan (1.1.1) (17), RepeatMasker (open-4.0.5) (http://www.repeatmasker.org/), and RepeatRunner (18). The assemblies and annotations are summarized in Table 1.
TABLE 1.
Assembly and annotation summary for genomic data of D. crocea and D. aurantiaca
| Species | Strain | No. of reads | No. of scaffolds/contigs | Total length (Mbp) | Coverage (×) | Scaffold N50 value (kb) | GC content (%) | CEGMA score (%) | No. of protein-coding genes | No. of tRNAs | No. of small noncoding RNAs | No. of transposons |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| D. crocea | JCM 2961 | 35,136,662 | 26/86 | 20.6 | 176 | 1,954 | 53.2 | 95.6 | 8,753 | 35 | 135 | 126 |
| D. aurantiaca | JCM 2956 | 24,122,574 | 52/139 | 19.3 | 112 | 1,282 | 53.6 | 94.8 | 8,106 | 37 | 207 | 206 |
Our data should help elucidate the properties of not only Dioszegia species but also other epiphytic phylloplane yeasts. Recently, Agler et al. (19) reported that Dioszegia is one of the microbial “hub” taxa that play important roles in the plant-microorganism relationship; thus, we believe that our genomic information will also have implications for the plant microbiome.
Data availability.
The genome sequences have been deposited at DDBJ/EMBL/GenBank under BioProject no. PRJDB3718 with accession no. BCKK00000000 (for D. crocea JCM 2961) and PRJDB3721 with accession no. BCKN00000000 (for D. aurantiaca JCM 2956). The annotations are available via the JCM website (https://www.jcm.riken.jp/cgi-bin/nbrp/nbrp_list.cgi).
ACKNOWLEDGMENTS
This work was supported by a grant from the Genome Information Upgrading Program of the National BioResource Project (NBRP) of the Ministry of Education, Culture, Sports, Science and Technology (MEXT), Japan, to M.O. and by a research grant from MEXT to the RIKEN Center for Life Science Technologies and RIKEN Center for Integrative Medical Science.
REFERENCES
- 1.Takashima M, Nakase T. 2011. Dioszegia Zsolt emend. Takashima, Deak & Nakase (2001), p 1747–1757. In Kurtzman CP, Fell JW, Boekhout T (ed), The yeasts: a taxonomic study, 5th ed. Elsevier, San Diego, CA. [Google Scholar]
- 2.Renker C, Blanke V, Börstler B, Heinrichs J, Buscot F. 2004. Diversity of Cryptococcus and Dioszegia yeasts (Basidiomycota) inhabiting arbuscular mycorrhizal roots or spores. FEMS Yeast Res 4:597–603. doi: 10.1016/j.femsyr.2004.01.001. [DOI] [PubMed] [Google Scholar]
- 3.Buhagiar RWM, Yarrow D, Barnett JA. 1983. Bullera crocea and Bullera armeniaca, two new yeasts from fruit and vegetables. Microbiology 129:3149–3155. doi: 10.1099/00221287-129-10-3149. [DOI] [Google Scholar]
- 4.Johri BN, Bandoni RJ. 1984. Taxonomy of the sporobolomycetous genus Bullera, p 536–547. In Proceedings of the International Symposium on Taxonomy of Fungi, Madras, 1973, part 2. University of Madras, Chennai, India. [Google Scholar]
- 5.Takashima M, Deak T, Nakase T. 2001. Emendation of Dioszegia with redescription of Dioszegia hungarica and two new combinations, Dioszegia aurantiaca and Dioszegia crocea. J Gen Appl Microbiol 47:75–84. doi: 10.2323/jgam.47.75. [DOI] [PubMed] [Google Scholar]
- 6.Raeder U, Broda P. 1985. Rapid preparation of DNA from filamentous fungi. Lett Appl Microbiol 1:17–20. doi: 10.1111/j.1472-765X.1985.tb01479.x. [DOI] [Google Scholar]
- 7.Park N, Shirley L, Gu Y, Keane TM, Swerdlow H, Quail MA. 2013. An improved approach to mate-paired library preparation for Illumina sequencing. Methods Next Gen Sequen 1:10–20. doi: 10.2478/mngs-2013-0001. [DOI] [Google Scholar]
- 8.Leggett RM, Clavijo BJ, Clissold L, Clark MD, Caccamo M. 2014. NextClip: an analysis and read preparation tool for Nextera long mate air libraries. Bioinformatics 30:566–568. doi: 10.1093/bioinformatics/btt702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Gnerre S, Maccallum I, Przybylski D, Ribeiro FJ, Burton JN, Walker BJ, Sharpe T, Hall G, Shea TP, Sykes S, Berlin AM, Aird D, Costello M, Daza R, Williams L, Nicol R, Gnirke A, Nusbaum C, Lander ES, Jaffe DB. 2011. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc Natl Acad Sci U S A 108:1513–1518. doi: 10.1073/pnas.1017351108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Parra G, Bradnam K, Korf I. 2007. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23:1061–1067. doi: 10.1093/bioinformatics/btm071. [DOI] [PubMed] [Google Scholar]
- 11.Cantarel BL, Korf I, Robb SM, Parra G, Ross E, Moore B, Holt C, Sánchez Alvarado A, Yandell M. 2007. MAKER: an easy-to-use annotation pipeline designed for emerging model organism genomes. Genome Res 18:188–196. doi: 10.1101/gr.6743907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Stanke M, Schöffmann O, Morgenstern B, Waack S. 2006. Gene prediction in eukaryotes with a generalized hidden Markov model that uses hints from external sources. BMC Bioinformatics 7:62. doi: 10.1186/1471-2105-7-62. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Johnson AD, Handsaker RE, Pulit S, Nizzari MM, O'Donnell CJ, de Bakker P. 2008. SNAP: a Web-based tool for identification and annotation of proxy SNPs using HapMap. Bioinformatics 24:2938–2939. doi: 10.1093/bioinformatics/btn564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ter-Hovhannisyan V, Lomsadze A, Chernoff YO, Borodovsky M. 2008. Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training. Genome Res 18:1979–1990. doi: 10.1101/gr.081612.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Muñoz-Mérida A, Viguera E, Claros MG, Trelles O, Pérez-Pulido AJ. 2014. Sma3s: a three-step modular annotator for large sequence datasets. DNA Res 21:341–353. doi: 10.1093/dnares/dsu001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res 25:955–964. doi: 10.1093/nar/25.5.955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Nawrocki EP, Eddy SR. 2013. Infernal 1.1: 100-fold faster RNA homology searches. Bioinformatics 29:2933–2935. doi: 10.1093/bioinformatics/btt509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Smith CD, Edgar RC, Yandell MD, Smith DR, Celniker SE, Myers EW, Karpen GH. 2007. Improved repeat identification and masking in Dipterans. Gene 389:1–9. doi: 10.1016/j.gene.2006.09.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Agler MT, Ruhe J, Kroll S, Morhenn C, Kim ST, Weigel D, Kemen EM. 2016. Microbial hub taxa link host and abiotic factors to plant microbiome variation. PLoS Biol 14:e1002352. doi: 10.1371/journal.pbio.1002352. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The genome sequences have been deposited at DDBJ/EMBL/GenBank under BioProject no. PRJDB3718 with accession no. BCKK00000000 (for D. crocea JCM 2961) and PRJDB3721 with accession no. BCKN00000000 (for D. aurantiaca JCM 2956). The annotations are available via the JCM website (https://www.jcm.riken.jp/cgi-bin/nbrp/nbrp_list.cgi).
