Skip to main content
Data in Brief logoLink to Data in Brief
. 2018 Feb 15;17:1108–1111. doi: 10.1016/j.dib.2018.02.027

Transcriptome datasets of oil palm pathogen Ganoderma boninense

Irene Liza Isaac a, AWCY Walter a, Mohd Faizal Abu Bakar b, AS Idris c, Farah Diba Abu Bakar a, Izwan Bharudin a,, Abdul Munir Abdul Murad a
PMCID: PMC5988404  PMID: 29876468

Abstract

Ganoderma boninense is known to be the causal agent for basal stem rot (BSR) affecting the oil palm industry worldwide thus cumulating to high economic losses every year. Several reports have shown that a compatible monokaryon pair needs to mate; producing dikaryotic mycelia to initiate the infection towards the oil palm. However, the molecular events occurs during mating process are not well understood. We performed transcriptome sequencing using Illumina RNA-seq technology and de novo assembly of the transcripts from monokaryon, mating junction and dikaryon mycelia of G. boninense. Raw reads from these three libraries were deposited in the NCBI database with accession number SRR1745787, SRR1745773 and SRR1745777, respectively.


Specifications table

Subject area Biology
More specific subject area Molecular biology of fungal mating
Type of data Transcriptome data
How data was acquired Paired-end transcriptome of G. boninense was sequenced using Illumina HiSeq. 2000 at Malaysia Genome Institute. De novo transcriptome assembly was performed using Trinity V.2.0.2
Data format Raw sequence (FASTQ)
Experimental factors Mycelia from different stages; monokaryon, mating junction and dikaryon
Experimental features G. boninense PER71 was obtained from Malaysian Palm Oil Board (MPOB). All cultures were maintained on Congo Red Yeast Mannitol Agar (CYMA).
Data source location Bangi, Malaysia
Data accessibility Raw FASTQ files were deposited in NCBI SRA database with accession numberSRR1745787, SRR1745773 and SRR1745777(https://www.ncbi.nlm.nih.gov/sra/SRX809922)

Value of the data

  • The data obtained using Illumina sequencer is the first report on RNA-seq of G. boninense from three different stages; monokaryon, mating junction and dikaryon.

  • The data presented here can be used for investigating the mating process in this fungus.

  • This permits the identification of differentially expressed genes that may play a significant role in the pathogenesis of G. boninense towards oil palm.

1. Data

Transcriptome data of Ganoderma boninense were generated from the polyA-enriched cDNA libraries prepared from the total RNA extracted from three different stages as mentioned above. Details of the experimental procedure and sequence analyses were described in the next section.

2. Experimental design, materials and methods

2.1. Fungal strain, culture condition and inoculation

G. boninense PER71 dikaryon culture was obtained from Malaysian Palm Oil Board (MPOB). The fruiting body was grown and single spores were obtained which later are mated to obtain a compatible pair. All cultures were maintained on CYMA [maltose 1% (w/v), glucose 2% (w/v), yeast extract 0.2% (w/v), tryptone 0.2% (w/v), MgSO4.7H2O 0.05% (w/v), KH2PO4 (0.46%) (w/v)]. For transcriptome analysis, mycelia from monokaryon, mating interaction mycelia (mating junction) and dikaryon mycelia were grown on cellulose membrane placed on fresh CYMA plates which later left for 5 days at 30 °C before scrapping it out. As for mating interaction mycelia, a compatible monokaryon pair is used and maintained on CYMA individually. Agar disc of both monokaryon was dissected and placed within 2 cm between each other on a fresh CYMA plate. Plates were incubated at 30 °C for 4–5 days and the interacting mycelia are scrapped out. The obtained mycelia are frozen in liquid nitrogen and stored at −80 °C.

2.2. RNA extraction and sequencing

Total RNA from each of sample was isolated using RNeasy Plant Mini Kit (Qiagen, Germany) according to the manufacturer protocol. The integrity and quantity isolated RNA was quantified using Nanodrop and Bioanalyzer [1]. mRNA from each sample was isolated from 8 μg of total RNA using PolyATtract® mRNA Isolation Systems (Promega, USA) according to manufacturer's protocol. The cDNA library preparation was done using the Illumina mRNA Sequencing Sample Preparation (San Diego, CA, USA) as per the manufacturer's instructions. Libraries were built and quantified using a Qubit® 2.0 Fluorometer (Life Technologies, USA). Ten pM of the prepared library was loaded onto the flowcell for cluster generation as per instructions by cBot User Guide (Illumina, USA). Sequencing was performed with paired-end 2 × 100 bp and 2 × 93 bp nucleotides multiplex procedure on an Illumina HiSeq. 2000.

2.3. Assembly and RNA-seq analysis

The raw RNA-seq data from G. boninense was trimmed and filtered with SolexaQA v.2.2 [2] to acquire high-quality reads. Phred quality value of Q20 and reads longer than 50 bp were used as parameters. Paired-end reads were determined using Perl script select_paired.pl. De novo assembly of high-quality reads was carried out using Trinity V.2.0.2. [3], with default parameters. Table 1 shows the RNA-seq statistics whereas the assembly statistics as in Table 2.

Table 1.

Statistics of the RNA-seq generated from three different libraries.

Monokaryon Mating junction Dikaryon
Raw reads 94,798,526 103,341,138 131,501,350
Clean reads 82,999,421 85,822,868 114,260,189
Read counts for transcriptome assembly (paired-end reads) 76,459,032 76,887,432 104,993,372
Average read length (bp) 90 90 90
Total base pair (bp) 6,881,895,529 6,904,050,780 9,413,554,394

Table 2.

Assembly statistics using Trinity.

Attributes Value
Number of transcripts 35,903
Total residues (bp) 58,592,631
Average length (bp) 1632
N50 transcript 3118
Largest transcript (bp) 19,896
Smallest transcript (bp) 201

Acknowledgements

This work was supported by the Ministry of Higher Education, Malaysia [grant number ERGS/1/2013/STG03/UKM/02/1] and Universiti Kebangsaan Malaysia [grant number GGPM-2017-071]. We are grateful to staffs from Malaysia Genome Institute for technical assistance for transcriptomic sequencing.

Footnotes

Transparency document

Transparency data associated with this article can be found in the online version at doi:10.1016/j.dib.2018.02.027.

Transparency document. Supplementary material

Supplementary material

mmc1.docx (53.1KB, docx)

.

References

  • 1.Schroeder A., Mueller O., Stocker S., Salowsky R., Leiber M., Gassmann M., Lightfoot S., Menzel W., Granzow M., Ragg T. The RIN: an RNA integrity number for assigning integrity values to RNA measurements. BMC Mol. Biol. 2006;7(3) doi: 10.1186/1471-2199-7-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Cox M.P., Peterson D.A., Biggs P.J. SolexaQA: at-a-glance quality assessment of Illumina second-generation sequencing data. BMC Bioinform. 2010;11:485. doi: 10.1186/1471-2105-11-485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Grabherr M.G., Haas B.J., Yassour M., Levin J.Z., Thompson D.A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q., Chen Z., Mauceli E., Hacohen N., Gnirke A., Rhind N., di Palma F., Birren B.W., Nusbaum C., Lindblad-Toh K., Friedman N., Regev A. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary material

mmc1.docx (53.1KB, docx)

Articles from Data in Brief are provided here courtesy of Elsevier

RESOURCES