Abstract
In the present article, we report data on the whole genome sequence of a wild edible and medicinal ectomycorrhizal fungus Russula griseocarnosa. The R. griseocarnosa genome consists of 64.81 Mb with a GC-pair content of 49.41%. The genome assembly consists of 471 scaffolds and 16128 coding protein genes. The coding protein genes was annotated in different databases (GO, KEGG and CAZys), respectively. The whole genome sequence and functional annotation provide important information for ectomycorrhizal fungus, which can be used as a basis for cultivation and breeding of R. griseocarnosa. The Whole Genome project of Russula griseocarnosa has been deposited at DDBJ/ENA/GenBank under the accession RMVF00000000. The version described is RMVF01000000. To further interpretation of the data provided in this article, please refer to the research article ‘Whole genome sequencing and genome annotation of the wild edible mushroom, Russula griseocarnosa’ [1].
Keywords: Russula griseocarnosa, Ectomycorrhizal fungus, Whole genome, Genome annotation
Specifications Table
| Subject area | Biology | 
| More specific subject area | Microbiology, Genomics | 
| Type of data | Table, figures | 
| How data was acquired | PacBio RS and Illumina Hiseq X-Ten sequencing | 
| Data format | Annotated and comparative analyzed | 
| Experimental factors | The fruiting body samples were obtained and quickly frozen in liquid nitrogen before stored in a −80 °C freezer. Total DNA of fruiting body was extracted immediately. | 
| Experimental features | DNA Sequencing was performed by using PacBio RS and Illumina Hiseq X-Ten, genome assembly, annotation and analysis were carried out. | 
| Data source location | The fruiting bodies of Russula griseocarnosa were collected from Linjing Town, Teng County, Guangxi Province, China (2 Jun. 2017) (23.15 N, 110.66 E) | 
| Data accessibility | The whole genome sequence of Russula griseocarnosa has been deposited at DDBJ/ENA/GenBank under the accession RMVF00000000. The version described is RMVF01000000. The BioSample, BioProject and SRA accession number are SAMN09602224, PRJNA479704 and SRP153002, respectively. | 
| Related research article | F. Yu, J. Song, J.F. Liang, S.K. Wang, J.K. Lu, Whole genome sequencing and genome annotation of the wild edible mushroom, Russula griseocarnosa. Genomics. (2019) in press [1]https://doi:10.1016/j.ygeno.2019.04.012. | 
| Value of the data 
 | 
1. Data
Russula griseocarnosa (Fig. 1) is a wild edible and medicinal ectomycorrhizal fungus that is native to southern China. The resulting draft genome of R. griseocarnosa present the 64.81 Mb in size with a G+C content of 49.41%. The genome sequence was assembly with 471 scaffolds and 16128 coding protein genes [1]. The data illustrated in Fig. 2 show the Gene Ontology (GO) distribution of the protein coding genes and Fig. 3 gives a complete overview of the KEGG pathway. According comparative analysis, The GO annotations of Russula griseocarnosa genes were similar with Agaricus bisporus [2] in “Localization”, “Biological regulation”, and “Regulation of biological process”, and fewer numbers than that of Laccaria bicolor [1], [3]. Compared with KEGG metabolic annotations, the most genes of Russula griseocarnosa pathways was not significantly in Laccaria bicolor and Agaricus bisporus, but R. griseocarnosa had less genes in "Tryptophan metabolism" and "Starch and sucrose metabolism" pathways [1].
Fig. 1.
Fruiting bodies of Russula griseocarnosa.
Fig. 2.
The Gene Ontology (GO) function annotation of Russula griseocarnosa.
Fig. 3.
The KEGG function annotation of Russula griseocarnosa.
The CAZymes coding genes of R. griseocarnosa encode enzymes involved in the degradation of plant cell wall polysaccharides, non-plant polysaccharides (for example, animal and bacterial polysaccharides) and fungal cell wall (Fig. 4). The CAZymes coding genes of R. griseocarnosa was similar to the symbiotic fungal species Scleroderma citrinum [4] in non-plant polysaccharides degradation and fungal cell wall degradation, and higer number of plant cell wall polysaccharides degradation. The plant cell wall polysaccharides degradation associated with cellulose degrading enzymes (GH6, GH7, GH44 and GH45), hemicellulose-degrading enzymes (GH10, GH11 and GH115) and pectin-degrading enzymes (GH43, GH51, GH78, GH93, PL1, PL3, and PL4) were absent in Russula griseocarnosa, Laccaria bicolor, and Scleroderma citrinum genomes [1].
Fig. 4.
Comparison of CAZys associated with cell wall degradation.
2. Experimental design, materials and methods
2.1. Fungal material
Fruiting bodies of R. griseocarnosa were collected from Linjing Town, Teng County, Guangxi Province, China in 2017. The fruiting body samples was frozen in liquid nitrogen and stored at −80 °C freezer until DNA extract.
2.2. DNA extraction and sequencing
Genomic DNA was extracted using the Omega Fungal DNA Kit D3390-02. Quality of DNA was determined using TBS-380 fluorometer (Turner BioSystems Inc., Sunnyvale, CA). The concentration of at least 20 mg/L (OD260/280 = 1.8–2.0).
R. griseocarnosa genome was sequenced using Illumina HiSeq X-ten sequencing and PacBio RS sequencing at Shanghai Majorbio Bio-pharm Biotechnology Co., Ltd, China. Paired-end libraries with 300 bp inserts were constructed in Illumina HiSeq X-ten sequencing. 8-10k insert shotgun libraries were generated in Pacific Biosciences RS sequencing.
2.3. Genome assembly and annotation
The genome sequence was assembled as follows: (1) PacBio long reads were corrected and assembled by Canu (v1.7) [5]; (2) Illumina reads corrected and used for scaffolding by SOAPdenovo (v2.04). Fill the gaps using GapCloser (v1.12) package; and (3) PacBio reads were modified based on Illumina reads. The final assembly produced a circular genome sequence without gaps.
Protein coding sequences were predicted using the automated pipeline MAKER2 (v2.31.9) [6]. It combining data for mRNAs, proteins, the ab initio predictions of SNAP [7] and GeneMark-ES (v2.3a) [8].
The predicted protein coding sequences was annotated in Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) database using Blastp (v2.3.0). The Carbohydrate-active enzymes (CAZymes) were performed using blastp (cut off e-value≤1e−5) at http://www.cazy.org/ [9].
Acknowledgements
This study was supported by the Science and Technology Project of Guangdong Province (2017B020205002), and the National Natural Science Foundation of China (No. 31770657 and 31570544). The authors thank Dr Hong Luo from the Kunming Institute of Botany, Chinese Academy of Sciences for research guidance.
Conflict of interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
References
- 1.Yu F., Song J., Liang J.F., Wang S.K., Lu J.K. Whole genome sequencing and genome annotation of the wild edible mushroom, Russula griseocarnosa. Genomics. 2019 doi: 10.1016/j.ygeno.2019.04.012. https://doi:10.1016/j.ygeno.2019.04.012 [DOI] [PubMed] [Google Scholar]
- 2.Morin E., Kohler A., Baker A.R., Foulongne-Oriol M., Lombard V., Nagy L.G., Ohm R.A., Patyshakuliyeva A., Brun A., Aerts A.L., Bailey A.M., Billette C., Coutinho P.M., Deakin G., Doddapaneni H., Floudas D., Grimwood J., Hildén K., Kües U., LaButti K.M., Lapidus A., Lindquist E.A., Lucas S.M., Murat C., Riley R.W., Salamov A.A., Schmutz J., Subramanian V., Wösten H.A.B., Xu J., Eastwood D.C., Foster G.D., Sonnenberg A.S.M., Cullen D., De Vries R.P., Lundell T., Hibbett D.S., Henrissat B., Burton K.S., Kerrigan R.W., Challen M.P., Grigoriev I.V., Martin F. Genome sequence of the button mushroom Agaricus bisporus reveals mechanisms governing adaptation to a humic-rich ecological niche. Proc. Natl. Acad. Sci. U. S. A. 2012;109:17501–17506. doi: 10.1073/pnas.1206847109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Labbé J., Zhang X., Yin T., Schmutz J., Grimwood J., Martin F., Tuskan G.A., Tacon F.L. A genetic linkage map for the ectomycorrhizal fungus Laccaria bicolor and its alignment to the whole-genome sequence assemblies. New Phytol. 2008;180:316–328. doi: 10.1111/j.1469-8137.2008.02614.x. [DOI] [PubMed] [Google Scholar]
- 4.Kohler A., Kuo A., Nagy L.G., Morin E., Barry K.W., Buscot F., Canbäck B., Choi C., Cichocki N., Clum A., Colpaert J., Copeland A., Costa M.D., Doré J., Floudas D., Gay G., Girlanda M., Henrissat B., Herrmann S., Hess J., Högberg N., Johansson T., Khouja H., LaButti K., Lahrmann U., Levasseur A., Lindquist E.A., Lipzen A., Marmeisse R., Martino E., Murat C., Ngan C.Y., Nehls U., Plett J.M., Pringle A., Ohm R.A., Perotto S., Peter M., Riley R., Rineau F., Ruytinx J., Salamov A., Shah F., Sun H., Tarkka M., Tritt A., Veneault-Fourrey C., Zuccaro A., Consortium M.G.I., Tunlid A., Grigoriev I.V., Hibbett D.S., Martin F. Convergent losses of decay mechanisms and rapid turnover of symbiosis genes in mycorrhizal mutualists. Nat. Genet. 2015;47:410–415. doi: 10.1038/ng.3223. [DOI] [PubMed] [Google Scholar]
- 5.Koren S., Walenz B.P., Berlin K., Miller J.R., Bergman N.H., Phillippy A.M. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 2017;27:722. doi: 10.1101/gr.215087.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Holt C., Yandell M. MAKER2: an annotation pipeline and genome-database management tool for second-generation genome projects. BMC Bioinf. 2011;12:491. doi: 10.1186/1471-2105-12-491. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Korf I. Gene finding in novel genomes. BMC Bioinf. 2004;5:59. doi: 10.1186/1471-2105-5-59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Borodovsky M., Lomsadze A. Eukaryotic gene prediction using genemark.hmm-E and genemark-ES. Curr. Protoc. Bioinformatics. 2011;25:1–10. doi: 10.1002/0471250953.bi0406s35. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Cantarel B.L., Coutinho P.M., Rancurel C. The carbohydrate-active EnZymes database (CAZy): an expert resource for glycogenomics. Nucleic Acids Res. 2009;37:D233–D238. doi: 10.1093/nar/gkn663. [DOI] [PMC free article] [PubMed] [Google Scholar]




