RNASeq data from Indonesian recalcitrant and non-recalcitrant rice varieties on anther culture

Siti Fatimah; Mohammad Syafii; Siti Zulaeha; Mega Dewi Haristianita; Devit Purwoko

doi:10.1016/j.dib.2022.108760

. 2022 Nov 17;45:108760. doi: 10.1016/j.dib.2022.108760

RNASeq data from Indonesian recalcitrant and non-recalcitrant rice varieties on anther culture

Siti Fatimah ^a, Mohammad Syafii ^a, Siti Zulaeha ^b, Mega Dewi Haristianita ^c, Devit Purwoko ^b,^⁎

PMCID: PMC9747646 PMID: 36533293

Abstract

The assembly of dihaploid rice plants through anther culture was constrained due to the recalcitrant properties. A comprehensive investigation of gene expression patterns among rice varieties with recalcitrant and non-recalcitrant anthers will help to understand the cellular mechanisms and biological processes of recalcitrant properties in rice anther cultures. Therefore, we performed RNA sequencing and analysis on the anthers of three selected Indonesian rice varieties with opposite recalcitrant properties. The varieties are Fatmawati with non-recalcitrant properties, IR64 recalcitrant and Tarabas unknown. The Illumina NextSeq PE150 sequencer was used to generate a total crude nucleotide of approximately 41.21 Gb in size. From 272,239,682 total paired final raw reads, 137,343,391 total net reads were obtained and uploaded to NCBI's Sequence Read Archive (SRA) repository under BioProject accession number PRJNA856048. This dataset allowed us to identify and profile all expressed genes with functions associated with recalcitrant and non-recalcitrant properties. In addition, the transcriptome data obtained will be valuable for the discovery of potential gene markers and functional SNPs associated with functional traits to assist rice breeding programs through the development of Marker Assisted Selection (MAS).

Keywords: Oryza sativa, Anther culture, Dihaploid, Recalcitrant, Transcriptomics

Specifications Table

Subject	Agricultural and Biological Sciences
Specific subject area	Plant Transcriptomics
Type of data	Table, text file
How the data were acquired	Illumina NextSeq PE150 platform.
Data format	Raw (FASTQ)
Description of data collection	The anthers of three rice plants (Fatmawati, IR64 and Tarabas) were taken from young tassels when the microspore stage was in the uninucleate stage [1]. RNA of three rice anther samples was extracted using GeneAll® Ribospin™ Plant (GeneAll Biotechnology Co., Ltd.) and was submitted for RNA sequencing.
Data source location	Rice anther samples were collected at: • Institution: Trunojoyo University • City/Town/Region: Telang, Bangkalan, Madura • Country: Indonesia
Data accessibility	Repository name: NCBI Sequence Read Archive (SRA) Data identification number: PRJNA856048 Direct URL to data: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA856048 The sequencing reads of three rice anther set are available in NCBI SRA accession number SRX16357934 (Tarabas), SRX16357933 (IR64), SRX16357932 (Fatmawati). https://www.ncbi.nlm.nih.gov/sra/SRX16357934 https://www.ncbi.nlm.nih.gov/sra/SRX16357933 https://www.ncbi.nlm.nih.gov/sra/SRX16357932 Mendeley Data Data identification number (permanent identifier, i.e. DOI number): 10.17632/w5nj767fzk.2 Direct link to the dataset: https://data.mendeley.com/datasets/w5nj767fzk [2]

Open in a new tab

Value of the Data

•
These transcriptome data from young tassels at the uninucleate stage of the microspore stage were generated from the selected 3 rice varieties, which represent recalcitrant and non-recalcitrant complete sets of transcriptome data.
•
Using this information, we can identify genes that are useful for understanding the molecular and cellular underpinnings of recalcitrant traits in rice anther cultures.
•
With the help of these data, rice anther culture recalcitrant and non-recalcitrant transcriptomics can be compared. Different gene expression levels between types might aid in understanding the biological and molecular mechanisms behind some important rice features.
•
Future genetic improvement studies on dihaploid rice will use these RNAseq data in conjunction with rice genome data to identify functional markers, such as single nucleotide polymorphisms (SNPs) and microsatellites related with recalcitrant traits.

1. Data Description

Anther culture is one of the in vitro techniques to accelerate the acquisition of pure lines in the form of androgenic dihaploid (DH) plants [3]. The dataset in this article is RNA-seq raw reads for anther taken from young tassels when the microspore stage was in the uninucleate stage of 3 rice varieties (Fatmawati with non-recalcitrant properties (F1), IR64 recalcitrant and Tarabas unknown). The raw data obtained from the Illumina NextSeq PE150 sequencer were deposited as a FASTQ format in NCBI's Sequence Read Archive (SRA) repository under BioProject accession number PRJNA856048. Accession numbers for each rice variety in the Mandeley database are presented in Table 1. Sequencing data analysis of each rice variety e.g. raw and clean reads, raw and clean nucleotides were performed as shown in Table 2. The quality of the net reads was assessed and a high quality percentage of net reads were obtained. The high quality reads were assembled to generate the contigs and mapped to Oryza_sativa.IRGSP-1.0 reference genome, the number of mapped contigs was estimated (Table 3). Oryza_sativa.IRGSP-1.0 reference genome was used for contigsmapping as it is a well-assembled and annotated genome, whereas the genome of indica rice cultivars has not yet been properly annotated [4]. In addition, transcript assembly for the reference genome with an ORF minimum 400 bp predicted the number of transcripts for each rice variety as listed in Table 3.

Table 1.

Shows the summary of sequence information including the rice variety, phenotype, and fastq ID assigned to the metadata.

Rice variety	Phenotype	Fastq ID
IR-3	recalcitrant	IR-3_S7_L001_R1_001.fastq
		IR-3_S7_L001_R2_001.fastq
		IR-3_S7_L002_R1_001.fastq
		IR-3_S7_L002_R2_001.fastq
		IR-3_S7_L003_R1_001.fastq
		IR-3_S7_L003_R2_001.fastq
		IR-3_S7_L004_R1_001.fastq
		IR-3_S7_L004_R2_001.fastq
F1	non-recalcitrant	F1_S8_L001_R1_001.fastq
		F1_S8_L001_R2_001.fastq
		F1_S8_L002_R1_001.fastq
		F1_S8_L002_R2_001.fastq
		F1_S8_L003_R1_001.fastq
		F1_S8_L003_R2_001.fastq
		F1_S8_L004_R1_001.fastq
		F1_S8_L004_R2_001.fastq
RTR3	unknown	RTR-3_S9_L001_R1_001.fastq
		RTR-3_S9_L001_R2_001.fastq
		RTR-3_S9_L002_R1_001.fastq
		RTR-3_S9_L002_R2_001.fastq
		RTR-3_S9_L003_R1_001.fastq
		RTR-3_S9_L003_R2_001.fastq
		RTR-3_S9_L004_R1_001.fastq
		RTR-3_S9_L004_R2_001.fastq

Open in a new tab

Table 2.

Statistics of sequencing data of individual recalcitrant and non-recalcitrant rice variety.

Rice variety	Phenotype	Raw reads (paired-end)	Raw nucleotides (bp)	Clean reads (paired-end)	Clean nucleotides (bp)
IR-3	recalcitrant	52,150,394	14,132,155,236	47,404,708	13,066,022,103
F1	non-recalcitrant	41,127,935	11,220,340,364	37,385,293	10,290,236,364
RTR3	unknown	44,065,061	12,189,584,927	40,055,141	11,232,417,836
Total		137,343,391	37,542,080,527	124,845,142	34,588,676,303

Open in a new tab

Table 3.

Statistics of contigs assembling, contigs mapping and number of ORF for each recalcitrant and non-recalcitrant rice variety.

Rice variety	High-quality reads (paired-end)	Percentage of high quality reads (%)	Contigs Assembled	Mapped contigs	Percentages of mapped contigs (%)	Number of ORF
IR-3	47404708	90.9	98207	22483	22.89	88376
F1	37385293	90.9	92226	17690	19.18	83463
RTR3	40055141	90.9	91010	17659	19.40	84307
Total	124845142		281443	57832		256146

Open in a new tab

2. Experimental Design, Materials, and Methods

2.1. Sample sites taken

Rice was grown in the greenhouse at Trunojoyo University in Madura, East Java, Indonesia, in pots filled with soil from rice fields. Each pot contained four plants. The fertilization procedure involved the use of three different fertilizers: urea at a dose of 200 kg ha-1 (5 g/pot), SP36 at a dose of 100 kg ha-1 (2.5 g/pot), and KCl at a dose of 100 kg ha-1 (2.5 g/pot). Maintenance is carried out based on the lowland rice crop.

2.2. Anther sampling and RNA extraction

The pregnant stage was used to harvest the rice panicles of Tarabas, IR64, and Fatmawati. As part of the cold temperature pre-treatment, panicles were cleaned, wrapped in paper towels wet with water, placed in a zip-lock plastic bag, and kept at low temperatures for seven days [1]. Additionally, panicles were chosen so that the anthers would be in the uninucleate microspore growth stage when they were used as samples. The chosen anthers were then placed in liquid nitrogen to freeze, and they were kept at 80°C for later use. Following the technique, the total RNA was extracted using RibospinTM Plant (GeneAll, 2012). Then, using the RNA Nano 6000 kit in the Bioanalyzer 2100 system, the quantity and quality of RNA were assessed (Agilent Technologies, CA,).

2.3. Library preparation and next-generation sequencing

The mRNA library was purified using the TruSeq RNA sample preparation v2 kit (Illumina Inc, CA, USA) according to the manufacturer's protocol. Literature quantification was carried out using a Qubit Fluorometer and standard real-time PCR. The libraries were then sequenced using the Illumina NovaSeq 6000 PE150 (Novogene, China).

2.4. Data analysis

Initially, FASTQC was used to do quality control on the produced reads [5]. Trimmomatic was used to filter the raw reads to get rid of any low-quality reads (v0.39, Bolger et al 2014). From raw reads, data filtering involves eliminating adaptor sequences, contaminants, and poor-quality reads. The transcriptome was assembled using rnaSPAdes 3.15.3 in Galaxy at type of paired end: default (–pe); orientation of reads: FR (-><-); an additional set of short-reads: disabled; k-mer detection option: auto; Phred quality offset: auto; strand specificity: disabled and a minimum length of 150 bp using high-quality reads [6]. The contigs were mapped onto the reference genome using geneious RNA at medium sensitivity, executing five iterations [7]. The analyses mentioned above used the default parameters. Further downstream analyses, such as calling SNPs, co-expression networks for genes, and differentially expressed gene analysis, will utilise these sequences and information [8].

3. Ethics Statements

The author hereby consciously assures that for the manuscript RNASeq data from Indonesian Recalcitrant and Non-recalcitrant rice varieties on anther culture the following elements are fulfilled:

(1)
It is the authors' original work, which has not been published anywhere else.
(2)
This document is not being published anywhere else.
(3)
The document reflects the author's research and analysis in a truthful and comprehensive way.
(4)
The paper rightly assigns significant contributions from co-authors and co-researchers.
(5)
All sources used are correctly disclosed (good citation).
(6)
All authors have been personally and actively engaged in substantive work leading up to the document, and will take public responsibility for its content.

I agree with the above statements and declare that this submission complies with the Data In Brief policies contained in the Author's Guide and the Statement of Ethics [9].

CRediT authorship contribution statement

Siti Fatimah: Project administration, Supervision, Writing – review & editing. Mohammad Syafii: Conceptualization, Methodology, Writing – original draft. Siti Zulaeha: Data curation, Resources, Investigation. Mega Dewi Haristianita: Supervision, Writing – review & editing. Devit Purwoko: Data curation.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

We would like to thank the management of gene technology lab-BRIN for facility use in RNA isolation and the Research and Community Service Institute-Trunojoyo Madura University for funding this research through the 2021 Trunojoyo Madura University Self Research No: B/3683/UN46.4.1.1/PM.01.02/2021.

Data Availability

RNASeq data profile from Indonesian Recalcitrant and Non-recalcitrant rice varieties on anther culture (Original data) (Mendeley Data).

References

1.Dash B., Bhuyan S.S., Singh S.K., Chandravani M., Swain N., Rout P., Katara J.L., Parameswaran C., Devanna B.N., Samantaray S. Androgenesis in indica rice: A comparative competency in development of doubled haploids. PLOS One. 2022;17(5) doi: 10.1371/journal.pone.0267442. [DOI] [PMC free article] [PubMed] [Google Scholar]
2.Purwoko Devit, Zulaeha Siti, Fatimah Siti, Syafii Mohammad, Haristianita Mega Dewi. “RNASeq data profile from Indonesian Recalcitrant and Non-recalcitrant rice varieties on anther culture”. Mendeley Data. 2022:V1. doi: 10.17632/w5nj767fzk.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Dewi I.S., Putri N.H., Purwoko B.S. Response of Anther Donor Genotypes (F1) from Indica x Indica Crosses to Rice Anther Culture. Jurnal AgroBiogen. 2019;15(1):45–52. doi: 10.21082/jbio.v15n1.2019.p45-52. doi: [DOI] [Google Scholar]
4.Tanaka T., Nishijima R., Teramoto S., Kitomi Y., Hayashi T., Uga Y., Kawakatsu T. De novo Genome Assembly of the indica Rice Variety IR64 Using Linked-Read Sequencing and Nanopore Sequencing. G3 Genes|Genomes|Genetics. 2020;10(5):1495–1501. doi: 10.1534/g3.119.400871. [DOI] [PMC free article] [PubMed] [Google Scholar]
5.S. Andrew. FastQC; a quality control tool for high throughput sequence data (2010) Available online at https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.
6.Bushmanova E., Antipov D., Lapidus A., Prjibelski A.D. (2018). rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data. GigaScience. 2019;8(9):giz100. doi: 10.1093/gigascience/giz100. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Geneious Prime 2022.1.1. (http://www.geneious.com).
8.Zainal-Abidin R.A., Zainal Z., Mohamed-Hussein Z.A., Abu-Bakar N., Ab Razak M.S.F., Simoh S., Sew Y.S. RNA-seq data from whole rice grains of pigmented and non-pigmented Malaysian rice varieties. Data Brief. 2020;30 doi: 10.1016/j.dib.2020.105432. PMID: 32280737PMCID: PMC7138961. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Purwoko D., Safarrida A., Tajuddin T., Rupaedah B., Suyono A., Wahid A., Sugianto M., Suja'i I. Metagenomic data of microbial in natural empty fruit bunches degradation. Data Brief. 2022;41 doi: 10.1016/j.dib.2022.107967. https://www.sciencedirect.com/science/article/pii/S2352340922001780 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

RNASeq data profile from Indonesian Recalcitrant and Non-recalcitrant rice varieties on anther culture (Original data) (Mendeley Data).

[bib0001] 1.Dash B., Bhuyan S.S., Singh S.K., Chandravani M., Swain N., Rout P., Katara J.L., Parameswaran C., Devanna B.N., Samantaray S. Androgenesis in indica rice: A comparative competency in development of doubled haploids. PLOS One. 2022;17(5) doi: 10.1371/journal.pone.0267442. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0002] 2.Purwoko Devit, Zulaeha Siti, Fatimah Siti, Syafii Mohammad, Haristianita Mega Dewi. “RNASeq data profile from Indonesian Recalcitrant and Non-recalcitrant rice varieties on anther culture”. Mendeley Data. 2022:V1. doi: 10.17632/w5nj767fzk.1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0003] 3.Dewi I.S., Putri N.H., Purwoko B.S. Response of Anther Donor Genotypes (F1) from Indica x Indica Crosses to Rice Anther Culture. Jurnal AgroBiogen. 2019;15(1):45–52. doi: 10.21082/jbio.v15n1.2019.p45-52. doi: [DOI] [Google Scholar]

[bib0004] 4.Tanaka T., Nishijima R., Teramoto S., Kitomi Y., Hayashi T., Uga Y., Kawakatsu T. De novo Genome Assembly of the indica Rice Variety IR64 Using Linked-Read Sequencing and Nanopore Sequencing. G3 Genes|Genomes|Genetics. 2020;10(5):1495–1501. doi: 10.1534/g3.119.400871. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0005] 5.S. Andrew. FastQC; a quality control tool for high throughput sequence data (2010) Available online at https://www.bioinformatics.babraham.ac.uk/projects/fastqc/.

[bib0006] 6.Bushmanova E., Antipov D., Lapidus A., Prjibelski A.D. (2018). rnaSPAdes: a de novo transcriptome assembler and its application to RNA-Seq data. GigaScience. 2019;8(9):giz100. doi: 10.1093/gigascience/giz100. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0007] 7.Geneious Prime 2022.1.1. (http://www.geneious.com).

[bib0008] 8.Zainal-Abidin R.A., Zainal Z., Mohamed-Hussein Z.A., Abu-Bakar N., Ab Razak M.S.F., Simoh S., Sew Y.S. RNA-seq data from whole rice grains of pigmented and non-pigmented Malaysian rice varieties. Data Brief. 2020;30 doi: 10.1016/j.dib.2020.105432. PMID: 32280737PMCID: PMC7138961. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib0009] 9.Purwoko D., Safarrida A., Tajuddin T., Rupaedah B., Suyono A., Wahid A., Sugianto M., Suja'i I. Metagenomic data of microbial in natural empty fruit bunches degradation. Data Brief. 2022;41 doi: 10.1016/j.dib.2022.107967. https://www.sciencedirect.com/science/article/pii/S2352340922001780 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

RNASeq data from Indonesian recalcitrant and non-recalcitrant rice varieties on anther culture

Siti Fatimah

Mohammad Syafii

Siti Zulaeha

Mega Dewi Haristianita

Devit Purwoko

Abstract

1. Data Description

Table 1.

Table 2.

Table 3.

2. Experimental Design, Materials, and Methods

2.1. Sample sites taken

2.2. Anther sampling and RNA extraction

2.3. Library preparation and next-generation sequencing

2.4. Data analysis

3. Ethics Statements

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Data Availability

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

RNASeq data from Indonesian recalcitrant and non-recalcitrant rice varieties on anther culture

Siti Fatimah

Mohammad Syafii

Siti Zulaeha

Mega Dewi Haristianita

Devit Purwoko

Abstract

1. Data Description

Table 1.

Table 2.

Table 3.

2. Experimental Design, Materials, and Methods

2.1. Sample sites taken

2.2. Anther sampling and RNA extraction

2.3. Library preparation and next-generation sequencing

2.4. Data analysis

3. Ethics Statements

CRediT authorship contribution statement

Declaration of Competing Interest

Acknowledgments

Data Availability

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases