Abstract
Mungbean Yellow Mosaic Virus (MYMIV) is the viral pathogen that causes yellow mosaic disease to a number of legumes including Vigna mungo. VM84 is a recombinant inbred line resistant to MYMIV, developed in our laboratory through introgression of resistance trait from V. mungo line VM-1. Here we present the quality control passed transcriptome data of mock inoculated (control) and MYMIV-infected VM84, those have already been submitted in Sequence Read Archive (SRX1032950, SRX1082731) of NCBI. QC reports of FASTQ files generated by ‘SeqQC V2.2’ bioinformatics tool.
Keywords: Vigna mungo, Transcriptome, Annotation, Recombinant inbred lines
| Resource table: | |
|---|---|
| Name of resource | VM84 |
| Institution | Division of Plant Biology, Bose Institute, Kolkata 700054, India |
| Person who created resource | Sayak Ganguli, Avishek Dey, Rahul Banik, Anirban Kundu and Amita Pal |
| Contact person and email | Amita Pal, amita@jcbose.ac.in |
| Date archived/stock date | 11.06.2015 and 07.07.2015 |
| Type of resource | Recombinant inbred line of Vigna mungo resistant to MYMIV |
| Link to directly related literature that employed/validated this resource | Field Crop Res 135 (2012) 116–125 |
| Information in public databases |
|
1. Resource details
Yellow mosaic disease of blackgram (Vigna mungo) is caused by Mungbean yellow mosaic India virus (MYMIV). Irregular, chlorotic, yellow patches on the leaves indicate successful disease onset — the characteristic phenotype of MYMIV-infected susceptible plants. Cent percent yield loss occurs when MYMIV infects the host at the juvenile stage. MYMIV is transmitted through the whitefly, Bemisia tabaci Genn. [1]. It is one of the most devastating types of biotic stresses that causes up to 100% damage to a large number of leguminous crops. One candidate MYMIV resistance gene, CYR1, has been reported by Maiti et al. [2] and introgressed to develop several recombinant inbred lines (RILs) [3]. Here we report the transcriptome data of mock inoculated control and MYMIV infected resistant RIL, VM84.
1.1. Comparison of control and inoculated datasets based on reads and contigs
The total number of processed reads for the two samples was found to be 77.342016 for the control and 107.47377 million reads for the MYMIV inoculated cultivars, indicating a rise in approximately 30 million reads for the infected genotype; probably as a result of the expression of stress and defense pathway associated genes (Fig. 1). Following assembly of the reads into contigs this difference in expression between the control and the inoculated sets was found to be more evident as depicted in Fig. 2. However, average contig lengths were found to be more or less proportional (Fig. 3), indicating that the difference in the number of contigs can be attributed to the differential expression of a few genes as well as expression of new genes as a result of infection and associated stress.
Fig. 1.
Pie chart showing number of reads in control and MYMIV infected Vigna mungo RIL 84.
Fig. 2.
Pie chart showing number of contigs generated in control and MYMIV infected Vigna mungo RIL 84.
Fig. 3.
Bar graph showing different length of contigs generated in control and MYMIV infected Vigna mungo RIL 84.
1.2. De-novo assembly and transcript generation
De-novo assembly of Illumina HiSeq2000 data was performed using velvet-1.2.102 and Oases_0.2.083 was used for transcript generation for various k-mers and concluded that hash lengths (k-mer) 55 (for control sample) and 57 (for MYMIV-infected sample) were better than others considering various parameters like the total number of transcripts generated, maximum transcript length, total transcript length and less number of N's. De-novo transcript statistics is presented in Table 1.
Table 1.
De-novo V. mungo transcripts statistics.
| Transcript statistics | Control sample | Infected sample |
|---|---|---|
| k-mer | 55 | 57 |
| Transcripts generated | 49,720 | 103,842 |
| Maximum transcript length | 15,357 | 23,005 |
| Minimum transcript length | 200 | 200 |
| Average transcript length | 1688.2 | 1375 |
| Median transcript length | 939 | 3422.5 |
| Total transcripts length | 83,938,205 | 142,778,942 |
| Total number of non-ATGC characters | 536 | 897 |
| Percentage of non-ATGC characters | 0.001 | 0.001 |
| Transcripts > = 200 bp | 49,720 | 103,842 |
| Transcripts > = 500 bp | 42,048 | 76,066 |
| Transcripts > = 1 kbp | 33,281 | 54,945 |
| Transcripts > = 10 kbp | 42 | 77 |
| N50 value | 2254 | 2031 |
| Percentage of reads used | 96.48 | 93.32 |
1.3. Transcripts annotation
In the absence of genomic information, V. mungo transcripts were annotated using the following databases:
-
i.
Medicago Protein (Uniprot)
-
ii.
Soybean Protein (Uniprot)
-
iii.
Cowpea EST (NCBI).
The annotation statistics are shown in Table 2. Maximum transcript annotation was possible using soybean database.
Table 2.
Annotation statistics of V. mungo transcripts.
| Annotation | Control sample | Infected sample |
|---|---|---|
| Total transcripts | 49,720 | 103,842 |
| Transcripts annotated with Medicago database | 30,497 | 53,091 |
| Transcripts annotated with Soybean database | 36,280 | 61,661 |
| Transcripts annotated with Cowpea EST database | 16,884 | 17,188 |
| Total annotated transcripts | 37,723 | 64,154 |
| Percentage of annotated transcripts | 75.87 | 61.78 |
2. Materials and methods
Plants samples (mock inoculated control and MYMIV infected V. mungo, line VM84) were collected and prepared following the method described by Kundu et al. [4].
Total RNA was extracted from control and infected leaves using Trizol reagent (Invitrogen, Carlsbad, CA) following the manufacturer's protocol, followed by Dnase-I treatment (Sigma-Aldrich, USA) and purification in an RNeasy Plant Mini Kit (Qiagen, USA). Qualitative and quantitative assessments of the extracted RNA were done by an Agilent 2100 Bioanalyzer (RNA Nano Chip, Agilent). RNA samples were supplied to Genotypic Technologies Pvt. Ltd. (Bangalore, India) for preparation of transcript library and high throughput sequencing using Illumina HiSeq 2000 platform.
3. Verification and authentication
RNA sequencing has become a common method for analyses of functional plant genomics. Direct sequencing of mRNA provides a cost effective alternative to microarray technology for the analyses of gene expression for the entire transcriptome of a particular species [5]. Cell type specific transcript levels provide important research avenues for assessing the exact range of reads per sample for analyzing differential gene expression [6]. It was claimed that depth of coverage is directly proportional to the identification of new genes [5], [7], [8]. Li et al. [9] has established, using a negative binomial model of variations, that log2 fold change of two or more decreased the number of replicates to a maximum of six for effective identification of differentially expressed genes.
Acknowledgments
This work was supported by the Council of Scientific and Industrial Research, New-Delhi, India for the Emeritus Scientist's Scheme to AP (Sanction No. 21(0884)/12/EMR-II). AD is thankful to CSIR for a Research Associateship. We are thankful to the Director, Bose Institute for providing all infrastructural facilities.
References
- 1.Nariani T.K. Yellow mosaic of mung (Phaseolus aureus L.) Ind. Phytopathol. 1960;13:24–29. [Google Scholar]
- 2.Maiti S., Paul S., Pal A. Isolation, characterization, and structure analysis of a non-TIR-NBS-LRR encoding candidate gene from MYMIV-resistant Vigna mungo. Mol. Biotechnol. 2012;52:217–233. doi: 10.1007/s12033-011-9488-1. [DOI] [PubMed] [Google Scholar]
- 3.Kundu A., Pal A. Identification and characterization of elite inbred lines with MYMIV-resistance in Vigna mungo. Field Crop Res. 2012;135:116–125. [Google Scholar]
- 4.Kundu A., Patel A., Paul S., Pal A. Transcript dynamics at early stages of molecular interactions of MYMIV with resistant and susceptible genotypes of the leguminous host, Vigna mungo. PLoS One. April 17, 2015 doi: 10.1371/journal.pone.0124687. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Marioni J.C., Mason C.E., Mane S.M., Stephens M., Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9):1509–1517. doi: 10.1101/gr.079558.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Blencowe B.J., Ahmad S., Lee L.J. Current-generation high-throughput sequencing: deepening insights into mammalian transcriptomes. Genes Dev. 2009;23:1379–1386. doi: 10.1101/gad.1788009. [DOI] [PubMed] [Google Scholar]
- 7.Griffith M., Griffith L.O., Mwenifumbo J., Goya R., Morrissy S.A. Alternative expression analysis by RNA sequencing. Nat. Methods. 2010;7:843–847. doi: 10.1038/nmeth.1503. [DOI] [PubMed] [Google Scholar]
- 8.Toung J.M., Morley M., Li M., Cheung V.G. RNA-sequence analysis of human B-cells. Genome Res. 2011;21:991–998. doi: 10.1101/gr.116335.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Li J., Tibshirani R. Finding consistent patterns: a nonparametric approach for identifying differential expression in RNA-Seq data. Stat. Methods Med. Res. 2013;22:519–536. doi: 10.1177/0962280211428386. [DOI] [PMC free article] [PubMed] [Google Scholar]



