Abstract
This article contains data on RDM16 and STA1 regulate differential usage of exon/intron in RNA directed DNA Methylation pathway (RdDM) (Sharma et al., 2016) [5]. This data include expression profiles of top 100 genes that has at least one exon or intron differentially expressed in three different contrast, i.e., WT (Wild type) vs RDM16, WT vs STA1, and RDM16 vs STA1. Also we included the alignment of MORC6 protein to the ATPase-C family members that have conserved three ATP binding sites and conserved Mg2+ binding sites in the spliced exon.
Keywords: RNA directed DNA methylation pathway, Alternative splicing, RNA-Seq, Arabidopsis thaliana
Specifications Table
| Subject area | Bioinformatics, Genomics |
| More specific subject area | Alternative splicing, Differential Expression |
| Type of data | Figures, Table, Alignment |
| How data was acquired | Gene Expression Omnibus (GEO) id: GSE44635, URL: https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE44635 from the article [1] |
| Data format | Analyzed (Fig. 1, Fig. 2, Fig. 3, AlignmentFig. 4, Fig. 5, Table 1) |
| Experimental factors | Secondary analysis of published data |
| Experimental features | Computational analysis |
| Data source location | – |
| Data accessibility | Accessible from this article |
Value of the data
-
•
The data from article [5] shows the expression profiles of the genes that contain at least one alternative splicing event in different conditions. This information will be useful for other researchers to understand the regulation of gene expression by alternative splicing.
-
•
Alignment of MORC6 protein to the ATPase-C family simplifies the mechanism by which splicing factor RDM16 regulate the MORC6.
-
•
This data provides the information of the genes that are affected in RdDM pathway by knockdown of RDM16 and STA1 splicing factors. This data will help other researcher to validate the findings of the exon/intron level analysis in RdDM pathway.
1. Data
Fig. 1, Fig. 2, Fig. 3 depict expression profile of top 100 genes that has at least one exon or intron differentially expressed in WT vs RDM16, WT vs STA1, and RDM16 vs STA1 respectively. The color key is given with Fig. 3.
Fig. 1.
This figure depicts expression profile of top 100 genes that has at least one exon or intron differentially expressed in WT vs RDM16. Color key used in expression profiles of genes in different contrasts is given with Fig. 3. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 2.
Depicts expression profile of top 100 genes that has at least one exon or intron differentially expressed in WT vs STA1. Color key used in expression profiles of genes in different contrasts is given with Fig. 3.
Fig. 3.
Depicts expression profile of top 100 genes that has at least one exon or intron differentially expressed in RDM16 vs STA1. The color key is given with figure.
Fig. 4: Figure shows the alignment of MORC6 protein to the ATPase-C family members that have conserved three ATP binding sites at 8, 11 and 14th position of the alignment. There are few more ATP binding sites at 55–65, 104–107, 123–125, 166–169 but may not be contributing in the ATP binding since co-factor binding site is only available in the protein sequence that is coded by exon4 in MORC6 (region highlighted in yellow).
Fig. 4.
Alignment of MORC6 protein to the ATPase-C family members that have conserved three ATP binding sites at 8, 11 and 14th position of the alignment. There are few more ATP binding sites at 55–65, 104–107, 123–125, 166–169 but may not be contributing in the ATP binding since co factor binding site is only available in the protein sequence that is coded by exon4 in MORC6 (region highlighted in yellow).
Fig. 5: Figure shows the alignment of MORC6 protein to the ATPase-C family members that have conserved Mg2+ binding site at 11th position of the alignment. Highlighted (yellow color) query sequence shows the protein sequence that is coded by exon4 in MORC6. ASP (D) and ASN (N) are essential amino acid for Mg2+ binding but do not contribute in it [7].
Fig. 5.
Alignment of MORC6 protein to the ATPase-C family members that have conserved Mg2+ binding site at 11th position of the alignment. Highlighted (yellow color) query sequence shows the protein sequence that is coded by exon4 in MORC6. ASP (D) and ASN (N) are essential amino acid for Mg2+ binding but do not contribute in it (Jorgensen et al. [7]).
2. Experimental design, materials and methods
The experiment contains RNA-Seq samples in three conditions; WT (wild type), mutant RDM16 and STA1. The raw data were downloaded from Gene Expression Omnibus (GEO) with accession number GSE44635. The alignment of the reads were done using TopHat2 pipeline [2] (Table 1) and the reads were counted via featurecount function in Rsubread package [4]. We used edgeR in order to find the differentially expressed exons and introns [6]. Fig. 1, Fig. 2, Fig. 3 were prepared using in-built functions in R. The alignment of the MORC6 protein to ATPase-C family members was done using ClustalX software [3]
Table 1.
Summary of the TopHat2 alignment. (Values are in millions).
| Sample | Pairs | Aligned pairs (%) | Multiple alignments (%) | Discordant alignments (%) | Concordant pairs (%) |
|---|---|---|---|---|---|
| WT | 25.74 | 24.10 (93.6%) | 1.51 (6.2%) | 0.02 (0.1%) | 24.08 (93.5%) |
| RDM-16 | 26.62 | 24.93 (93.7%) | 1.69 (6.7%) | 0.02 (0.1%) | 24.91 (93.6%) |
| STA1 | 26.56 | 24.90 (93.8%) | 2.15 (8.5%) | 0.02 (0.1%) | 24.88 (93.7%) |

Color key used in expression profiles of genes in different contrasts (for fig. 1, 2 and 3).
Acknowledgement
We would like to thank Prof. Vera van Noort, Computational System Biology Group, CMPG, KU Leuven, Belgium, for support and valuable suggestions.
Footnotes
Transparency data associated with this article can be found in the online version at http://dx.doi.org/10.1016/j.dib.2017.03.050.
Transparency document. Supplementary material
Supplementary material
.
References
- 1.Huang C.F., Miki D., Tang K., Zhou H.R., Zheng Z., Chen W., Ma Z.Y., Yang L., Zhang H., Liu R., He X.J., Zhu J.K. A Pre-mRNA-splicing factor is required for RNA-directed DNA methylation in Arabidopsis. PLoS Genet. 2013;9(9):e1003779. doi: 10.1371/journal.pgen.1003779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Kim D., Pertea G., Trapnell C., Pimentel H., Kelley R., Salzberg S.L. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14(4):R36. doi: 10.1186/gb-2013-14-4-r36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.M.A. Larkin, G. Blackshields, N.P. Brown, R. Chenna, P.A. McGettigan, H. McWilliam, F. Valentin, I.M. Wallace, A. Wilm, R. Lopez, J.D. Thompson, T.J. Gibson, D.G. Higgins, Clustal W and Clustal X version 2.0. Bioinformatics, 23(21), 2007, pp. 2947–2948. [DOI] [PubMed]
- 4.Liao Y., Smyth G.K., Shi W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics. 2014;30(7):923–930. doi: 10.1093/bioinformatics/btt656. http://www.ncbi.nlm.nih.gov/pubmed/24227677 [DOI] [PubMed] [Google Scholar]
- 5.Sharma R.D., Bogerts B., Goyal N. RDM16 and STA1 regulate differential usage of exon/intron in RNA directed DNA methylation pathway. Gene. 2017;609:62–67. doi: 10.1016/j.gene.2017.01.027. [DOI] [PubMed] [Google Scholar]
- 6.Robinson M.D., McCarthy D.J., Smyth G.K. edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Jorgensen P.L., Pedersen P.A. Structure–function relationships of Na+, K+, ATP, or Mg2+ binding and energy transduction in Na,K-ATPase. Biochim. Biophys. Acta - Bioenergy. 2001;1505(1):57–74. doi: 10.1016/s0005-2728(00)00277-2. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Supplementary material





