De novo transcriptome assembly of two Vigna angularis varieties collected from Korea

Yeonhwa Jo; Sen Lian; Jin Kyong Cho; Hoseong Choi; Sang-Min Kim; Sun-Lim Kim; Bong Choon Lee; Won Kyong Cho

doi:10.1016/j.gdata.2016.05.004

. 2016 May 5;8:119–120. doi: 10.1016/j.gdata.2016.05.004

De novo transcriptome assembly of two Vigna angularis varieties collected from Korea

Yeonhwa Jo ^a,¹, Sen Lian ^b,¹, Jin Kyong Cho ^c, Hoseong Choi ^a, Sang-Min Kim ^d, Sun-Lim Kim ^d, Bong Choon Lee ^d, Won Kyong Cho ^a,^c,^⁎

PMCID: PMC4878840 PMID: 27257605

Abstract

The adzuki bean (Vigna angularis), a member of the family Fabaceae, is widely grown in Asia, from East Asia to the Himalayas. The adzuki bean is known as an ingredient that adds sweetness to diverse desserts made in Eastern Asian countries. Libraries prepared from two V. angularis varieties referred to as Taejin Black and Taejin Red were paired-end sequenced using the Illumina HiSeq 2000 system. The raw data in this study can be available in NCBI SRA database with accession numbers of SRR3406660 and SRR3406553. After de novo transcriptome assembly using Trinity, we obtained 324,219 and 280,056 transcripts from Taejin Black and Taejin Red, respectively. We predicted a total of 238,321 proteins and 179,519 proteins for Taejin Black and Taejin Red, respectively, by the TransDecoder program. We carried out BLASTP on the predicted proteins against the Swiss-Prot protein sequence database to predict the putative functions of identified proteins. Taken together, we provide transcriptomes of two adzuki bean varieties by RNA-Seq, which might be usefully applied to generate molecular markers.

Keywords: Adzuki bean, RNA-Seq, Transcriptome, Variety, Vigna angularis

Specifications
Organism/cell line/tissue	Adzuki bean (Vigna angularis)/leaves
Sex	N.A.
Sequencer or array type	HiSeq2000
Data format	Raw and processed
Experimental factors	De novo transcriptome assembly of two adzuki bean varieties
Experimental features	Leaves of ten adzuki bean plants (five of the Taejin Black variety and five of the Taejin Red variety) were harvested for total RNA extraction. Two prepared libraries were paired-end sequenced using the HiSeq 2000 system. The obtained data for each variety were subjected to de novo transcriptome assembly using Trinity, and coding regions were predicted by TransDecoder. We performed BLASTP against the Swiss-Prot protein database to annotate the identified proteins.
Consent	N/A
Sample source location	Hoengseong, South Korea (37°28′49.6″N 127°58′34.3″E)

Open in a new tab

1. Direct link to deposited data

http://www.ncbi.nlm.nih.gov/sra/SRR3406660 for Vigna angularis variety Taejin Black.

http://www.ncbi.nlm.nih.gov/sra/SRR3406553 for V. angularis variety Taejin Red.

2. Introduction

The adzuki bean (V. angularis), a member of the family Fabaceae, is widely grown in Asia, from East Asia to the Himalayas [1]. Particularly, adzuki beans are known as an ingredient that adds sweetness to diverse desserts made in Eastern Asian countries, such as China, Japan, Korea, and Taiwan [2]. Most adzuki bean cultivars in Eastern Asian countries are red in color; however, some cultivars display several different colors, such as white, black, and gray. The adzuki bean is relatively small (about 5 mm). According to a previous study, the adzuki bean originated from northeast Asia [3]. The adzuki bean is a diploid legume, and its draft genome contains approximately 26,857 genes [2], [4]. In this study, we performed de novo transcriptome assembly for two adzuki bean varieties by RNA-Seq.

3. Experimental design, materials, and methods

3.1. Plant materials

Adzuki bean plants for two varieties were grown in a field located in Gadam-ri, Hoengseong-up, South Korea. Leaves from five plants for each variety were harvested and immediately frozen in liquid nitrogen for further experiments.

3.2. RNA isolation, library preparation, and sequencing

Ten leaves collected from five plants were pooled and used for total RNA extraction using the RNeasy Plant Mini Kit (Qiagen, Hilden, Germany). For mRNA library preparation, we used a TruSeq RNA Library Prep Kit v2 according to the manufacturer's instructions (Illumina, San Diego, U.S.A.). In brief, the poly-A-containing mRNAs were isolated using poly-T oligo-attached magnetic beads. The first strand of cDNA, followed by a second strand of cDNA, was synthesized from purified mRNAs. End repair was performed followed by adenylation of 3′ ends. Adapters were ligated, and PCR was conducted to selectively enrich DNA fragments with adapters and to amplify the amount of DNA in the library, respectively. The quality control of generated libraries was conducted using a 2100 Bioanalyzer (Agilent, Santa Clara, U.S.A.). The libraries were paired-end sequenced by Macrogen Co. (Seoul, South Korea) using the HiSeq 2000 platform.

3.3. De novo transcriptome assembly, identification of protein coding regions, and annotation

Libraries prepared from two V. angularis varieties referred to as Taejin Black and Taejin Red, respectively, were paired-end sequenced using the Illumina HiSeq 2000 system. We obtained 9.474 GB and 9.116 GB of raw data from Taejin Black and Taejin Red, respectively. We de novo assembled transcriptomes of two adzuki bean varieties using Trinity program ver. 2.0.6 [5]. Detailed information on the de novo transcriptome assembly of the two V. angularis varieties is summarized in Table 1. We obtained 324,219 and 280,056 transcripts from Taejin Black and Taejin Red, respectively. The numbers of components for Taejin Black and Taejin Red were 94,654 and 93,698, respectively. The N50 values for Taejin Black and Taejin Red were 2634 bp and 2461 bp, respectively. The median contig lengths for Taejin Black and Taejin Red were 1504 bp and 1301 bp, respectively. We analyzed coding regions within the assembled transcripts by the TransDecoder program implemented in the Trinity software distribution. As a result, we predicted a total of 238,321 proteins and 179,519 proteins for Taejin Black and Taejin Red, respectively. We carried out BLASTP on the predicted proteins against the Swiss-Prot protein sequence database to predict the putative functions of the identified proteins. The BLAST results showed that most proteins were matched to known proteins in the Swiss-Prot database except 41,619 proteins and 31,720 proteins for Taejin Black and Taejin Red, respectively. Many proteins were derived from eukaryotes (17,274 and 12,123 proteins for Taejin Black and Taejin Red, respectively) followed by bacteria, viruses, and archaea. The transcriptome data in this study will be usefully applied to generate molecular markers for the adzuki bean.

Table 1.

Summary of de novo assembled transcriptomes for two adzuki bean varieties (Taejin Black and Taejin Red).

Index	Taejin Black	Taejin Red
Total trinity transcripts	324,219	280,056
Total trinity components	94,654	93,698
Percent GC	40.82	40.73
Contig N50	2634	2461
Median contig length	1504	1301
Average contig	1772.39	1621.81
Total assembled bases	574,641,146	454,197,434

Open in a new tab

Conflict of interest

The authors declare that they have no competing interests.

Acknowledgments

This work was carried out with the support of the “Cooperative Research Program for Agriculture Science & Technology Development (project no. PJ01186102)” conducted by the Rural Development Administration, Republic of Korea. This work is dedicated to the memory of my father, Tae Jin Cho (1946–2015).

References

1.Zong X.X., Kaga A., Tomooka N., Wang X.W., Han O.K., Vaughan D. The genetic diversity of the Vigna angularis complex in Asia. Genome. 2003;46:647–658. doi: 10.1139/g03-041. [DOI] [PubMed] [Google Scholar]
2.Kang Y.J., Satyawan D., Shim S., Lee T., Lee J., Hwang W.J., Kim S.K., Lestari P., Laosatit K., Kim K.H. Draft genome sequence of adzuki bean, Vigna angularis. Sci. Rep. 2015;5 doi: 10.1038/srep08069. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Lee G.-A. Archaeological perspectives on the origins of adzuki (Vigna angularis) The Holocene. 2013;23:453–459. [Google Scholar]
4.Parida A., Raina S., Narayan R. Quantitative DNA variation between and within chromosome complements of Vigna species (Fabaceae) Genetica. 1990;82:125–133. [Google Scholar]
5.Grabherr M.G., Haas B.J., Yassour M., Levin J.Z., Thompson D.A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0005] 1.Zong X.X., Kaga A., Tomooka N., Wang X.W., Han O.K., Vaughan D. The genetic diversity of the Vigna angularis complex in Asia. Genome. 2003;46:647–658. doi: 10.1139/g03-041. [DOI] [PubMed] [Google Scholar]

[bb0010] 2.Kang Y.J., Satyawan D., Shim S., Lee T., Lee J., Hwang W.J., Kim S.K., Lestari P., Laosatit K., Kim K.H. Draft genome sequence of adzuki bean, Vigna angularis. Sci. Rep. 2015;5 doi: 10.1038/srep08069. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bb0015] 3.Lee G.-A. Archaeological perspectives on the origins of adzuki (Vigna angularis) The Holocene. 2013;23:453–459. [Google Scholar]

[bb0020] 4.Parida A., Raina S., Narayan R. Quantitative DNA variation between and within chromosome complements of Vigna species (Fabaceae) Genetica. 1990;82:125–133. [Google Scholar]

[bb0025] 5.Grabherr M.G., Haas B.J., Yassour M., Levin J.Z., Thompson D.A., Amit I., Adiconis X., Fan L., Raychowdhury R., Zeng Q. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 2011;29:644–652. doi: 10.1038/nbt.1883. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

De novo transcriptome assembly of two Vigna angularis varieties collected from Korea

Yeonhwa Jo

Sen Lian

Jin Kyong Cho

Hoseong Choi

Sang-Min Kim

Sun-Lim Kim

Bong Choon Lee

Won Kyong Cho

Abstract

1. Direct link to deposited data

2. Introduction

3. Experimental design, materials, and methods

3.1. Plant materials

3.2. RNA isolation, library preparation, and sequencing

3.3. De novo transcriptome assembly, identification of protein coding regions, and annotation

Table 1.

Conflict of interest

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

De novo transcriptome assembly of two Vigna angularis varieties collected from Korea

Yeonhwa Jo

Sen Lian

Jin Kyong Cho

Hoseong Choi

Sang-Min Kim

Sun-Lim Kim

Bong Choon Lee

Won Kyong Cho

Abstract

1. Direct link to deposited data

2. Introduction

3. Experimental design, materials, and methods

3.1. Plant materials

3.2. RNA isolation, library preparation, and sequencing

3.3. De novo transcriptome assembly, identification of protein coding regions, and annotation

Table 1.

Conflict of interest

Acknowledgments

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases