Skip to main content
Genomics Data logoLink to Genomics Data
. 2016 Jan 7;7:226–228. doi: 10.1016/j.gdata.2016.01.005

Analyses of MYMIV-induced transcriptome in Vigna mungo as revealed by next generation sequencing

Sayak Ganguli b,1, Avishek Dey a,1, Rahul Banik c, Anirban Kundu a,2, Amita Pal a,
PMCID: PMC4778624  PMID: 26981413

Abstract

Mungbean Yellow Mosaic Virus (MYMIV) is the viral pathogen that causes yellow mosaic disease to a number of legumes including Vigna mungo. VM84 is a recombinant inbred line resistant to MYMIV, developed in our laboratory through introgression of resistance trait from V. mungo line VM-1. Here we present the quality control passed transcriptome data of mock inoculated (control) and MYMIV-infected VM84, those have already been submitted in Sequence Read Archive (SRX1032950, SRX1082731) of NCBI. QC reports of FASTQ files generated by ‘SeqQC V2.2’ bioinformatics tool.

Keywords: Vigna mungo, Transcriptome, Annotation, Recombinant inbred lines


Resource table:
Name of resource VM84
Institution Division of Plant Biology, Bose Institute, Kolkata 700054, India
Person who created resource Sayak Ganguli, Avishek Dey, Rahul Banik, Anirban Kundu and Amita Pal
Contact person and email Amita Pal, amita@jcbose.ac.in
Date archived/stock date 11.06.2015 and 07.07.2015
Type of resource Recombinant inbred line of Vigna mungo resistant to MYMIV
Link to directly related literature that employed/validated this resource Field Crop Res 135 (2012) 116–125
Information in public databases

1. Resource details

Yellow mosaic disease of blackgram (Vigna mungo) is caused by Mungbean yellow mosaic India virus (MYMIV). Irregular, chlorotic, yellow patches on the leaves indicate successful disease onset — the characteristic phenotype of MYMIV-infected susceptible plants. Cent percent yield loss occurs when MYMIV infects the host at the juvenile stage. MYMIV is transmitted through the whitefly, Bemisia tabaci Genn. [1]. It is one of the most devastating types of biotic stresses that causes up to 100% damage to a large number of leguminous crops. One candidate MYMIV resistance gene, CYR1, has been reported by Maiti et al. [2] and introgressed to develop several recombinant inbred lines (RILs) [3]. Here we report the transcriptome data of mock inoculated control and MYMIV infected resistant RIL, VM84.

1.1. Comparison of control and inoculated datasets based on reads and contigs

The total number of processed reads for the two samples was found to be 77.342016 for the control and 107.47377 million reads for the MYMIV inoculated cultivars, indicating a rise in approximately 30 million reads for the infected genotype; probably as a result of the expression of stress and defense pathway associated genes (Fig. 1). Following assembly of the reads into contigs this difference in expression between the control and the inoculated sets was found to be more evident as depicted in Fig. 2. However, average contig lengths were found to be more or less proportional (Fig. 3), indicating that the difference in the number of contigs can be attributed to the differential expression of a few genes as well as expression of new genes as a result of infection and associated stress.

Fig. 1.

Fig. 1

Pie chart showing number of reads in control and MYMIV infected Vigna mungo RIL 84.

Fig. 2.

Fig. 2

Pie chart showing number of contigs generated in control and MYMIV infected Vigna mungo RIL 84.

Fig. 3.

Fig. 3

Bar graph showing different length of contigs generated in control and MYMIV infected Vigna mungo RIL 84.

1.2. De-novo assembly and transcript generation

De-novo assembly of Illumina HiSeq2000 data was performed using velvet-1.2.102 and Oases_0.2.083 was used for transcript generation for various k-mers and concluded that hash lengths (k-mer) 55 (for control sample) and 57 (for MYMIV-infected sample) were better than others considering various parameters like the total number of transcripts generated, maximum transcript length, total transcript length and less number of N's. De-novo transcript statistics is presented in Table 1.

Table 1.

De-novo V. mungo transcripts statistics.

Transcript statistics Control sample Infected sample
k-mer 55 57
Transcripts generated 49,720 103,842
Maximum transcript length 15,357 23,005
Minimum transcript length 200 200
Average transcript length 1688.2 1375
Median transcript length 939 3422.5
Total transcripts length 83,938,205 142,778,942
Total number of non-ATGC characters 536 897
Percentage of non-ATGC characters 0.001 0.001
Transcripts > = 200 bp 49,720 103,842
Transcripts > = 500 bp 42,048 76,066
Transcripts > = 1 kbp 33,281 54,945
Transcripts > = 10 kbp 42 77
N50 value 2254 2031
Percentage of reads used 96.48 93.32

1.3. Transcripts annotation

In the absence of genomic information, V. mungo transcripts were annotated using the following databases:

  • i.

    Medicago Protein (Uniprot)

  • ii.

    Soybean Protein (Uniprot)

  • iii.

    Cowpea EST (NCBI).

The annotation statistics are shown in Table 2. Maximum transcript annotation was possible using soybean database.

Table 2.

Annotation statistics of V. mungo transcripts.

Annotation Control sample Infected sample
Total transcripts 49,720 103,842
Transcripts annotated with Medicago database 30,497 53,091
Transcripts annotated with Soybean database 36,280 61,661
Transcripts annotated with Cowpea EST database 16,884 17,188
Total annotated transcripts 37,723 64,154
Percentage of annotated transcripts 75.87 61.78

2. Materials and methods

Plants samples (mock inoculated control and MYMIV infected V. mungo, line VM84) were collected and prepared following the method described by Kundu et al. [4].

Total RNA was extracted from control and infected leaves using Trizol reagent (Invitrogen, Carlsbad, CA) following the manufacturer's protocol, followed by Dnase-I treatment (Sigma-Aldrich, USA) and purification in an RNeasy Plant Mini Kit (Qiagen, USA). Qualitative and quantitative assessments of the extracted RNA were done by an Agilent 2100 Bioanalyzer (RNA Nano Chip, Agilent). RNA samples were supplied to Genotypic Technologies Pvt. Ltd. (Bangalore, India) for preparation of transcript library and high throughput sequencing using Illumina HiSeq 2000 platform.

3. Verification and authentication

RNA sequencing has become a common method for analyses of functional plant genomics. Direct sequencing of mRNA provides a cost effective alternative to microarray technology for the analyses of gene expression for the entire transcriptome of a particular species [5]. Cell type specific transcript levels provide important research avenues for assessing the exact range of reads per sample for analyzing differential gene expression [6]. It was claimed that depth of coverage is directly proportional to the identification of new genes [5], [7], [8]. Li et al. [9] has established, using a negative binomial model of variations, that log2 fold change of two or more decreased the number of replicates to a maximum of six for effective identification of differentially expressed genes.

Acknowledgments

This work was supported by the Council of Scientific and Industrial Research, New-Delhi, India for the Emeritus Scientist's Scheme to AP (Sanction No. 21(0884)/12/EMR-II). AD is thankful to CSIR for a Research Associateship. We are thankful to the Director, Bose Institute for providing all infrastructural facilities.

References

  • 1.Nariani T.K. Yellow mosaic of mung (Phaseolus aureus L.) Ind. Phytopathol. 1960;13:24–29. [Google Scholar]
  • 2.Maiti S., Paul S., Pal A. Isolation, characterization, and structure analysis of a non-TIR-NBS-LRR encoding candidate gene from MYMIV-resistant Vigna mungo. Mol. Biotechnol. 2012;52:217–233. doi: 10.1007/s12033-011-9488-1. [DOI] [PubMed] [Google Scholar]
  • 3.Kundu A., Pal A. Identification and characterization of elite inbred lines with MYMIV-resistance in Vigna mungo. Field Crop Res. 2012;135:116–125. [Google Scholar]
  • 4.Kundu A., Patel A., Paul S., Pal A. Transcript dynamics at early stages of molecular interactions of MYMIV with resistant and susceptible genotypes of the leguminous host, Vigna mungo. PLoS One. April 17, 2015 doi: 10.1371/journal.pone.0124687. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Marioni J.C., Mason C.E., Mane S.M., Stephens M., Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18(9):1509–1517. doi: 10.1101/gr.079558.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Blencowe B.J., Ahmad S., Lee L.J. Current-generation high-throughput sequencing: deepening insights into mammalian transcriptomes. Genes Dev. 2009;23:1379–1386. doi: 10.1101/gad.1788009. [DOI] [PubMed] [Google Scholar]
  • 7.Griffith M., Griffith L.O., Mwenifumbo J., Goya R., Morrissy S.A. Alternative expression analysis by RNA sequencing. Nat. Methods. 2010;7:843–847. doi: 10.1038/nmeth.1503. [DOI] [PubMed] [Google Scholar]
  • 8.Toung J.M., Morley M., Li M., Cheung V.G. RNA-sequence analysis of human B-cells. Genome Res. 2011;21:991–998. doi: 10.1101/gr.116335.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Li J., Tibshirani R. Finding consistent patterns: a nonparametric approach for identifying differential expression in RNA-Seq data. Stat. Methods Med. Res. 2013;22:519–536. doi: 10.1177/0962280211428386. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genomics Data are provided here courtesy of Elsevier

RESOURCES