Abstract
Background
Translatomics data, particularly genome-wide ribosome profiling and polysome profiling, provide multiple levels of gene regulatory information that can be used to assess general transcription and translation, as well translational efficiency. The increasing popularity of these techniques has resulted in multiple algorithms to detect translational regulation, typically distributed in the form of command line tools that require a basic level of programming ability. Additionally, due to the static nature of current software, dynamic transcriptional and translational comparative analysis cannot be adequately achieved. In order to streamline hypothesis generation, investigators must have the ability to manipulate and interact with their data in real-time.
Results
To address the lack of integration in current software, we introduce RIVET, Ribosomal Investigation and Visualization to Evaluate Translation, an R shiny based graphical user interface for translatomics data exploration and differential analysis. RIVET can analyze either microarray or RNA sequencing data from polysome profiling and ribosome profiling experiments. RIVET provides multiple choices for statistical analysis as well as integration of transcription, translation, and translational efficiency data analytics and the ability to visualize all results dynamically.
Conclusions
RIVET is a user-friendly tool designed for bench scientists with little to no programming background. RIVET facilitates the data analysis of translatomics data allowing for dynamic generation of results based on user-defined inputs and publication ready visualization. We expect RIVET will allow for scientists to efficiently make more comprehensive data observations that will lead to more robust hypothesis regarding translational regulation.
Electronic supplementary material
The online version of this article (10.1186/s12864-018-5166-z) contains supplementary material, which is available to authorized users.
Keywords: Ribosome profiling, Polysome profiling, Translational regulation, Translational efficiency, Differential expression, R shiny
Background
Regulation of gene expression is a multi-tiered process critical for maintaining cellular homeostasis. Upon disruption of regulatory mechanisms, gene expression can become disrupted at the level of transcription, cytoplasmic export, RNA stability, RNA localization, and at the level of mRNA translation leading to aberrant protein expression. Translational control of mRNA can be achieved globally, via cellular mechanisms that modulate the levels of protein production from all mRNA, or in a targeted process, through mechanisms that promote or inhibit the specific translation of a subset of mRNAs [1]. Aberrant translation of mRNA has been shown to play an important role in many diseases, particularly in cancers and cancers refractory to treatment [2, 3].
Two commonly used approaches to study genome-wide translational control are polysome profiling and ribosome profiling (reviewed in [4]). Polysome profiling captures both transcriptional and translational regulation by isolating cytoplasmic RNA and enriching for actively translating mRNA bound to ribosomes. Enrichment of actively translated mRNA is performed by separation of ribosome-bound mRNA using sucrose-gradient ultracentrifugation followed by fractionation of mRNA pertaining to the quantity of bound ribosomes followed by RNA-seq or microarray analysis for transcriptional quantitation (Fig. 1b). Cytoplasmic mRNA preserved prior to ribosome-enrichment as well as polyribosome enriched mRNA fractions are then used to measure active transcription by performing microarray analysis or sequencing on total RNA (Fig. 1a). Thereby, each mRNA in the cell will have measurements for transcriptional changes as well as translational changes occurring due to varied experimental conditions. And, as polysome profiling allows for an exact enrichment of mRNA corresponding to the number of bound ribosomes, changes that shift the level of translation of a particular mRNA can also be examined through density-based segmentation of the polysome fractions [5] (Fig. 1b). An alternate technique, ribosome profiling, also utilizes a multi-omics approach to capture both transcriptional and translational effects, however, exclusively quantifies mRNA using an RNA sequencing platform (Fig. 1c). Additionally, ribosome profiling differs from polysome profiling in that nuclease digestion is performed with ribosome-enriched mRNA, degrading unbound mRNA and leaving a ribosome protected RNA footprint of ~ 30 nts. Nuclease digestion allows for additional analysis of location of the ribosome on mRNA. For differential expression analysis, only a transcription and translational fraction of mRNA can be captured utilizing ribosome profiling [6]. Both techniques are schematized in Fig. 1.
Datasets derived from translational control experiments have the ability to provide multiple layers of information regarding mRNA regulation including transcriptomic and translatomic data. Further, these experiments allow for examination of translation independent from transcription, also known as translational efficiency. Recently, with the upsurge in the use of ribosome profiling, there has been a corresponding increase in algorithms that detect translational efficiency that can be applied to ribosome profiling and RNA sequencing-based polysome profiling data [7–10]. However, these algorithms model discrete data based on negative binomial distributions, and therefore should not be applied to continuous based microarray data, which is prohibitive to the analysis of publicly available genome-wide polysome profiling microarray datasets. Additionally, these packages rarely implement comprehensive data visualization tools and require the use of R-packages and/or command-line interfaces, restricting use to scientists proficient in programming [10]. The inability to easily perform differential analysis and visualization for both polysome and ribosome profiling highlight the need for a user-friendly tool that can adapt to both polysome profiling and ribosome profiling data sets.
Here we present RIVET, an R shiny graphical user interface to identify mRNA regulation occurring in the transcriptome, translatome, and to identify those genes that are regulated at the level of translation independent of transcription. RIVET is a comprehensive user interface (UI) that takes in a user-defined matrix of RNA-seq counts data or microarray data, normalizes the data and performs differential expression analysis with several options for statistical packages, followed by comprehensive data visualization. RIVET is based on limma [11] and edgeR [12] statistical modeling packages and utilizes ggplot2 [13] graphical environment. To perform differential analysis for translational efficiency, the user can choose to utilize a log2 ratio of polysome to total [6, 14] or heavier polysome to lighter polysomes [15, 16] or a statistical model incorporating an interaction between condition and polysome to total [17, 18]. RIVET differs from other bioinformatics tools that it provides normalization, differential expression detection at the level of transcription and translation including translation occurring within multiple polyribosome fractions, and publication-ready visualizations. RIVET simplifies the process of analyzing translatome-based genome-wide data allowing users to gain critical insights from data quickly and interactively.
Implementation
Rivet UI architecture
The RIVET UI (http://ruggleslab.shinyapps.io/RIVET) is comprised of 5 components split into modules: normalization, transcription, polysome, translational efficiency, and translational regulation (Fig. 2). In order to use RIVET, users begin with the normalization module and input a matrix of counts data from RNA sequencing experiments or a background-corrected normalized matrix of microarray data. Based on input data, a multidimensional scaling (MDS) plot is automatically generated for the user to ascertain sample quality and potential batch effects. The user can then select the number of polysome fractions to analyze and segregate samples into control and experimental groups and transcription and translation groups. Following sample segregation, differential expression analysis is automatically performed, using either limma or edgeR, based on user preference (Fig. 2a). Upon completion of all tasks in the normalization module, the user can toggle between the transcription module, the polysome module, the translational efficiency module, or the translational regulation module.
The transcription module focuses on transcription in isolation of translation, generating a volcano plot visualizing effect size of differential analysis at the transcriptional level. The user can choose thresholds to consider for most differentially expressed mRNAs through which a downloadable spreadsheet of selected mRNAs is generated (Fig. 2b). The translation module focuses on translation alone, generating either a volcano plot, in the case of one polysome fraction, or a heatmap in the case of multiple polysome fractions (Fig. 2c). The translational efficiency module takes both transcription and translation into account, generating a scatter plot visualizing different types of translational regulation and a downloadable spreadsheet corresponding to genes regulated in different ways (e.g. transcription alone, transcription and translation, translation alone). The user can choose to analyze translational efficiency using either the log2 ratio method or the interaction term method [17, 18](Fig. 2d). All modules allow for download of user-selected differential expression and dynamically-created visualizations.
Data input
Typical polysome and ribosome profiling experiments utilize microarrays or RNA sequencing to quantify both transcriptional data as well as ribosome-enriched data. Data input for RIVET, from microarrays or RNA sequencing, must be in matrix format with samples as columns and genes as rows; most RNA sequencing pipelines and microarray pipeline output data in RIVET ready to use format. Data from microarrays must be background corrected prior to input into RIVET. Most publicly deposited microarray data can be easily downloaded as a matrix of data with background correction and data normalization already performed. Examples of appropriately formatted data can be found in Additional file 1: Table S1 If the user needs to perform background correction and normalization for raw microarray CEL files, we refer the user to the GenePattern suite of graphical user interface tools that can be utilized to generate a spreadsheet of expression values as RIVET input. In brief, to use GenePattern to create RIVET input files, the user should provide a zip archive of Affymetirx CEL files from microarray experiments. If microarray files are derived from 3′ biased IVT Affymetrix arrays, the user can use the ExpressionFileCreator module to upload zip file of arrays. Default parameter settings using RMA background correction and quantile normalization will output a GCT formatted file that can be uploaded to RIVET. If microarray files are derived from Affymetrix 1.1, 2.0, 2.1 ST arrays, exon arrays, HTA 2.0 arrays, the module AffySTExpressionFileCreator can be used with a zip archive of Affymetrix CEL files and default user settings. For more information see the GenePattern software website, genepattern.broadinstitute.org [19].
Because RIVET RNA sequencing-based statistical analyses utilize statistics that assume negative binomial distributions or perform normalization that requires discrete counts data, RNA sequencing data must be input as a raw counts matrix; TPM, RPKM, FPKM normalization cannot be handled by RIVET statistical analysis. Appropriate RNA sequencing counts table are generated by commonly used RNA-seq pipelines including STAR [20] or HTSeq [21].
Sample selection and user-defined analysis choices
The normalization module of RIVET is split into a sidebar panel allowing the user to choose multiple parameters for defining statistical analysis and experimental design, as well as a main panel containing information regarding sample categorization and a quality control MDS plot. Matrix-formatted data can be uploaded into RIVET via the sidebar upload feature. When provided with an input counts table, RIVET automatically generates an MDS plot for quality control purposes in the main page under the plot module. Within the main page samples module, RIVET provides the user with an interface to label samples based on experiment and control as well as whether samples were generated from transcriptional or translational data (Additional file 2: Figure S1B). To group samples, the user is presented with drop-down menus containing the names of samples as defined by the column headers in the input matrix and must choose how to categorize each sample according to experiment type. Additionally, when inputting polysome data, the user has the option to define additional translational categories depending on the number of polysome fractions in the experiment. When more than one translational fraction is defined, the software will dynamically generate additional experiment and control input boxes to further categorize polysome samples. For most polysome and ribosome profiling experiments, the default box allowing for input of 1 experiment sample type and 1 control is appropriate. In order to run statistical analysis, the user must complete all prompts in the main panel of the samples module; default values in all other options in the sidebar panel are sufficient to complete the run.
In order to provide an example of user-defined analysis input, we will use a simplified experimental design from Geter et al. [2] (GSE107590). In this study, the authors silenced the expression of eIF4E (experiment) in Tamoxifen-resistant breast cancer cells compared to a non-silencing control (control) using biological duplicates. The authors performed polysome profiling on experiment and control samples where a sample from cytoplasmic RNA (total) was utilized to quantify transcription in each condition. In addition, mRNAs bound to 2–3 ribosomes (light) and 4–6 ribosomes (heavy) in each condition were enriched so that in total there were 12 samples (Additional file 2: Figure S1A). Samples isolated from total mRNA would be entered into the Transcription category with control samples replicate 1 and 2 entered into the control box and experiment samples replicate 1 and 2 entered into the experiment box. The number of polysome fractions can also be changed to automatically allow for additional inputs for associated samples (Additional file 2: Figure S1B).
Pipeline architecture
Upon input of a matrix of expression values and completion of required parameters, RIVET automatically carries out two work flows simultaneously, the first workflow generates normalized data to be used in the statistical model, the second workflow will generate a targets matrix and contrasts matrix defined based on user input labels for experiment and control, as well as, user input sample groupings (Fig. 3). The first workflow will normalize counts data utilizing the dge object for edgeR statistical analysis or voom normalization (limma package) as indicated by the user. The second work flow will produce a targets dataframe and contrasts dataframe in the following manner: 1) a targets object will be generated based on user defined sample groupings from the main page of the normalization module and the column headings of the matrix of reordered sample labels. The targets object will contain information regarding the assignment of samples to a particular label class. In the example above regarding eIF4E silencing vs control tamoxifen resistant cells, the sample classes would be as follows: NS.transcription, E4.transcription, NS.translation1, E4.translation1, NS.translation2, E4.translation.2. Samples would be assigned as follows: NS.transcripion: replicate 1 control transcription, replicate 2 control transcription, E4.transcription: replicate 1 experiment transcription, replicate 2 experiment transcription, NS.translation1: replicate 1 control translation heavy polysome, replicate 2 control translation heavy polysome, 4E.translation1: replicate 1 experiment translation heavy polysome, replicate 2 experiment translation heavy polysome, NS.translation2: replicate 1 control translation light polysome, replicate 2 control translation light polysome, 4E.translation2: replicate 1 experiment translation light polysome, replicate 2 experiment translation light polysome (Table 1) 2) In tandem, a contrasts matrix will be defined based on user-defined labels of control and experiment in the samples submodule of the normalization module. The contrasts matrix for the above example would compare the means for 1) NS vs 4E transcripton, 2) NS vs 4E translation heavy, 3) NS vs 4E translation light (Table 2). Statistical analysis will utilize normalized data, the targets frame, and contrasts frame as input.
Table 1.
NS.transcription | E4.transcription | NS.translation.1 | E4.translation.1 | NS.translation.2 | E4.translation.2 | |
---|---|---|---|---|---|---|
R1_NS_TamR_T | 1 | 0 | 0 | 0 | 0 | 0 |
R2_NS_TamR_T | 1 | 0 | 0 | 0 | 0 | 0 |
R1_4E_TamR_T | 0 | 1 | 0 | 0 | 0 | 0 |
R2_4E_TamR_T | 0 | 1 | 0 | 0 | 0 | 0 |
R1_NS_TamR_H | 0 | 0 | 1 | 0 | 0 | 0 |
R2_NS_TamR_H | 0 | 0 | 1 | 0 | 0 | 0 |
R1_4E_TamR_H | 0 | 0 | 0 | 1 | 0 | 0 |
R2_4E_TamR_H | 0 | 0 | 0 | 1 | 0 | 0 |
R1_NS_TamR_L | 0 | 0 | 0 | 0 | 1 | 0 |
R2_NS_TamR_L | 0 | 0 | 0 | 0 | 1 | 0 |
R1_4E_TamR_L | 0 | 0 | 0 | 0 | 0 | 1 |
R2_4E_TamR_L | 0 | 0 | 0 | 0 | 0 | 1 |
Table 2.
E4.transcription-NS.transcription | E4.translation.1-NS.translation.1 | E4.translation.2-NS.translation.2 | |
---|---|---|---|
NS.transcription | −1 | 0 | 0 |
E4.transcription | 1 | 0 | 0 |
NS.translation.1 | 0 | −1 | 0 |
E4.translation.1 | 0 | 1 | 0 |
NS.translation.2 | 0 | 0 | −1 |
E4.translation.2 | 0 | 0 | 1 |
For generation of translational efficiency data, user input data will be used in a similar manner but contrasts frame along with statistical model will be different. The statistical model will be defined as follows:
~ 0 + type + treatment + type:treatment.
In the above statistical model, type defines polysome or cytosolic RNA and treatment defines the experimental conditions tested. The contrasts matrix will define translational efficiency as the interaction of type and treatment. If the user chooses to utilize log2 ratio, a matrix of ratios will be calculated grouped by [polysome/total]control and [polysome/total]experiment. The matrix will then be used as input to normalization and contrasts matrix will be defined as previous.
Upon completion of statistical analysis, a matrix is generated containing the log2 fold-change for experiment standardized to control of every mRNA, as well as p-values and adjusted p-values for all comparisons. These statistics are generated for both transcription and for each polysome fraction within this matrix and is available for download as a tab-delimited text file at the bottom of the normalization sidebar menu.
Transcription and translational analysis
As the full statistical analysis results are provided in the normalization module, the transcription and translation modules allow users to define a threshold of significance for differential analysis in real time, with complementary dynamic visualization of data based on the threshold criteria chosen. Within the transcription module, users use sliders provided in the sidebar panel to control the threshold of log fold-change, p-value cutoff, or adjusted p-value cutoff. A volcano plot is generated in the main panel automatically with default settings of > 1 log-fold change and p-value < 0.05. Based on user-defined thresholds, the number of significant genes, highlighted in blue on the plot, will automatically adjust (Fig. 2b). Once the user is satisfied with threshold criteria, a tab-delimited text file of selected genes with log-fold change, p-value, and adjusted p-value can be downloaded.
The user interface layout of the translation module is similar to that of transcription with slider options to choose log fold change and p-value / adjusted p-value significance thresholds. If provided with one translational fraction, a volcano plot is also generated displaying user-defined significant genes. However, if more than one polysome fraction is provided, the list of significant genes per each fraction is concatenated and the log-fold change of each mRNA across polysome fractions is displayed as a heatmap (Fig. 2c). In this way, the user can view the dynamics of each mRNA’s log-fold change across polysome fractions to ascertain modulations in the level of translation of each mRNA due to treatment. Both heatmap and text file can be downloaded of this analysis.
Translational efficiency analysis
Within the translational efficiency module, the user can utilize the sidebar menu to choose what method to determine translation efficiency: log2 ratio or interaction (defined above). Thresholds for fold-change as well as p-value or adjusted p-value are included in this panel. The main panel includes either a volcano plot for a single polysome fraction or heatmap for multiple polysome fractions. When the user selects a log-fold change threshold, the log-fold change refers to the ratio of polysome/totalexperiment vs. polysome/totalcontrol. The visualization of multiple polysome fractions for translational efficiency, displayed as a heatmap, is the log fold change of polysomeexperiment and polysomecontrol as translational efficiency is a way to select for mRNAs with greater change in translation than transcription, however, the fold-change in polysomes is more biologically relevant. The genes displayed in the heatmap are those that are defined as having significant differences in translational efficiency due to treatment (Fig. 2d).
Examination of types of translational regulation
A variety of biological mechanisms underlie translational regulation. Comparing mRNA expression levels in transcription and translation can help characterize this regulation. These types of regulation include translational buffering, where changes in mRNA expression at the level of transcription are not reflected in translation; changes in transcription that are directly reflected in translation; changes occurring that are opposite in magnitude in transcription and translation; and lastly, changes occurring at the level of translation and not transcription.
User defined thresholds for significance at the transcription and translation level are used to determine the category of regulation assigned to each mRNA in the translational regulation module. If the user chooses to navigate to the translational regulation module directly following completion of the normalization tab, the module will use the default thresholds (p-value < 0.05) to select mRNAs to highlight on the scatter plot. Our tool allows users to toggle between the different types of translational regulation: transcription alone, translation alone, transcription and translation, and opposite. This provides users the ability to view mRNAs that are regulated in all types of translational regulation, and to receive a quantification of the number of genes regulated at each level as a bar plot (Fig. 2e). All figures as well as a list of mRNAs selected for each type of translational regulation can be downloaded in the sidebar panel.
Results
We used RIVET to re-analyze previously published datasets to demonstrate the it’s utility in analyzing data generated by three commonly used platforms: microarray, polysome profiling, and ribosome profiling.
Microarray data
Data from Silvera et al. was used as a microarray analysis test set [3]. This study compared the effects of treatment with two mTOR inhibitors, PP242 and RAD001, with and without irradiation on the selective translation of mRNAs in an inflammatory breast cancer cell line. To reduce the complexity of the dataset, we used only PP242 treated samples and DMSO control samples subsampled from the spreadsheet of RMA background corrected and filtered data downloaded from GEO accession GSE92598. Data was uploaded into RIVET in the normalization module using the microarray option for downstream analysis. The RIVET-generated MDS plot revealed that the greatest separation in this dataset was between polysome-derived samples and cytoplasmic samples which can be typical of microarray-derived polysome experiments [22] (Fig. 4a). Comparison of the transcription and polysome derived volcano plots (log2 fold-change > 1; p-value < 0.05), PP242 effects are largely at the level of transcription compared to translation (Fig. 4b). Interestingly, when examining all types of regulation in the translational efficiency module, most genes appear to be translationally buffered; i.e. genes are regulated at the level of transcription without corresponding changes occurring in translation (Fig. 4c). Quantification of significant genes in transcription, translation, and all regulation can be handled by examining the number of genes produced in transcription and translation spreadsheets as well as counting the number of items per category in the all regulation spreadsheet.
Polysome profiling data
To demonstrate the utility of RIVET in polysome profiling, we used data generated by Geter et al. [2]. Unlike most studies, the authors of this study combined fractions 4–6 to study heavily translated mRNAs and fractions 2–3 to study poorly translated mRNAs from the polysome gradient. The goal of the study was to identify the effects of eIF4E reduction in tamoxifen-resistant breast cancer cells compared to a normal control. In total, there were two biological replicates per sample type yielding 12 samples in total. Following differential expression analysis by RIVET, a number of interesting biological observations were made. When examining eIF4E-mediated regulation occurring in translation, each mRNA was either upregulated or downregulated across the polysome gradient, which could be easily seen in the translation module with the heatmap visualization (Fig. 5a). Homodirectionality of fold-change across the polysome gradient suggests 1) many genes are regulated at the level of transcription leading to large changes that occur at the level of translation 2) changes occurring in translation are large and independent of effects in transcription.
Using RIVET, these two models can be easily verified using the translational efficiency module. With default thresholds (log2 fold change > 1; p-value< 0.05) for both transcription and translation, the user can discern from the all-regulation scatter plot that most changes are transcriptional and these changes are being reflected in translation. If the fold-change threshold is increased to examine the largest effect size, the scatter plot demonstrates that most genes are both transcriptionally and translationally regulated (Fig. 5c). Though transcriptional changes lead to the largest effects in translation in this dataset, as has been observed in many studies, small effects in translation can affect biology dramatically. And these small changes in translation tend to be in translationally-regulated mRNAs that can be uniquely identified using polysome profiling [5]. To examine the dynamics of this particular subset of mRNAs, the user can use both the translation and the translational efficiency module. The heatmap graphic will highlight the dynamics of each mRNA that is regulated at the level of translational efficiency across the polyribosome gradient. Immediately apparent in the heatmap are the heterodirectionality of fold-changes across the gradient, suggesting shifts in translation from heavier to lighter polyribosome fractions (Fig. 5b). Therefore, the dynamic features of RIVET allow users to gain clearer and more robust biological insight from data than would be possible using static methods.
Ribosome profiling data
Ribosome profiling data (GSE35469) were obtained from Hsieh et al. and examined the comparison of prostate cancer (PC3) cells treated with or without PP242, a drug targeting the mTORC1/2 complex [23]. The original study design implemented 2 biological replicates and we utilized PP242-treated or vehicle treated samples with paired ribosome profiling and RNA sequencing samples to give a total of 6 samples. Using downloaded counts for each sample, we analyzed 5,688 genes as input into the RIVET differential analysis pipeline after filtering for genes with greater than 256 reads in RNA sequencing samples (as per methods described in Hsieh et al.). To ascertain our ability to use RIVET to analyze ribosome footprinting data, we compared the ability of the limma statistical framework within RIVET to select consistently regulated mRNAs to those found in Hsieh et al. using a log fold-change threshold > 1.5 as described. We found that ~ 60% of translational efficient mRNAs in Hsieh et al., were also identified by RIVET (Fig. 6a). As fold-changes in Hsieh et al. overall were larger than those calculated by RIVET, we calculated a compression score for RIVET fold-changes, and transformed the 1.5-fold threshold by this value yielding a new threshold of 1-fold for RIVET data to allow for a more comparable fold-change comparison between studies. Using this fold-change setting in RIVET we were able to recapitulate 90% of translationally efficient mRNAs in Hsieh et al. and identify 84 additional mRNAs. The rank of fold-changes for translational efficiency were similar between Hsieh et al. and RIVET (rho = 0.88 Spearman correlation of rank differences) (Fig. 6b). Differences between data are likely due to differences in selection criteria between analysis as Hsieh et al. utilized a false-discovery rate correction for fold-change thresholds as opposed to p-values for selected mRNAs as described in supplemental Fig. 4 of their paper. Although differences exist between methodologies, RIVET successfully identifies trends described in Hsieh et al. For example, a majority of mRNAs are down-regulated in translational efficiency in agreement with Hsieh et al. (Fig. 6c). Further, when translational efficiency is examined utilizing the translational regulation module in RIVET, most mRNAs show little change at the level of transcription with robust changes in translation as concluded by the Hsieh et al. (Fig. 6d). These data suggest that RIVET can be utilized to distinguish differential translation and translational regulation in ribosome profiling data.
Comparison to other resources
Rivet has several advantages over currently available packages that carry out differential expression analysis in ribosome profiling and polysome profiling experiments, summarized in Table 3. Like Riborex [8] and tRanslatome [24], RIVET uses existing generalized linear model (GLM) based frameworks, either limma or edgeR depending on user selection. These GLMs have been shown to complete more rapid analysis compared with similar GLM methods such as Xtail [10], RiboDiff [9], and Babel [7, 8] . Moreover, the use of limma or edgeR support the analysis of small sample size (less than 3 biological replicates) unlike anota2seq, which require 3 biological replicates unless there are more than 2 experimental conditions [17]. However, unlike all other translatome tools, RIVET makes use of a graphical user interface to make these statistical frameworks accessible to users with limited computer programming abilities. Also, unlike other tools, RIVET offers the advantage of allowing users to include analysis of multiple polysome fractions which may be critical in capturing nuances in translational regulation that have important impacts on biology. Important to the use of legacy microarray polysome profiling data, RIVET, like anota2seq [17], can analyze both continuous microarray data as well as RNA sequencing data in current polysome profiling and ribosome profiling methodologies. As an additional advantage, as a simplification for the user, RIVET explicitly chooses the appropriate normalization method for the data depending on the platform. Finally, RIVET, leveraging multiple visualization packages such as ggplot2 and heatmap.2 function from gplot, has the capability to provide high-level and interactive data visualization to allow users to gain insight from data and produce publication ready graphs. tRanslatome incorporates data visualization with statistical analysis, however, RIVET allows for the ability for users to view visualizations interactively, alleviating the need to run scripts multiple times to gain biological insight. Limitations of this software include an inability to utilize genomic location information provided in ribosome footprinting data. RIVET is intended for differential analysis; for further exploration and visualization of ribosome footprinting data at nucleotide resolution, we refer the user to RiboGalaxy, a tool dedicated to ribosome footprinting analysis with a graphical user interface [25].
Table 3.
Software | Rivet | Xtail | Anota2seq | Babel | RiboDiff | tRanslatome | Riborex |
---|---|---|---|---|---|---|---|
Novel Statistical Model | x | x | x | x | |||
Utilizes existing statistical frameworks | x | x | x | ||||
Visualization | x | x | |||||
GUI | x | ||||||
RNA seq & microarray | x | x | x | ||||
Interactive | x | ||||||
Integration of all levels of analysis | x | x | x | ||||
No Programming ability | x | ||||||
Statistical Choice | x | x | x | ||||
Multiple Polysomes | x |
Conclusion
Translatome experiments including polysome profiling and ribosome profiling have been used to explore changes in gene expression at the level of both transcription and translation. Recently developed computational tools to examine translatome experiment data provide differential analysis or visualization of data but rarely both and lack an easy-to-use graphical user interface. RIVET is comprehensive, performing differential analysis at the level of transcription, translation, and translational efficiency in combination with interactive data visualization. RIVET can handle both microarray and RNA sequencing platforms and provides the user with the ability to analyze multiple translational fractions. Utilizing the R shiny interface, RIVET provides users without programming skills the ability to analyze multi-omics data and visualize and downloadable publication quality results. To demonstrate the utility of RIVET for all types of translational regulation studies, we re-analyzed 3 studies that examined translational regulation using RNA-sequencing based ribosome footprinting and polysome profiling with multiple polysome fractions and polysome profiling with microarrays. In conclusion, RIVET is a comprehensive, adaptable and user-friendly framework for novel hypothesis generation and data exploration which will help to drive forward the field of protein translation.
Availability and requirements
Project name: Ribosomal Investigation and Visualization to Evaluate Translation.
Project home page: https://ruggleslab.shinyapps.io/RIVET/
Operating system: Platform independent
Programming Language: R/Rshiny
Other requirements: None
License: None
Any restrictions to use by non-academics: None
Additional files
Acknowledgements
This work has used computing resources at the NYU High Performance Computing Facility (HPCF).
Funding
This work has been supported by RO1CA178509 and RO1CA207893 from the National Institutes of Health and by BCRF-16-143 from the Breast Cancer Research Foundation.
Availability of data and materials
The Geter et al. polysome profiling dataset was analyzed and has been mounted to the RIVET web page as a sample dataset and were original data were downloaded from Gene Expression Omnibus accession GSE107590. The original data can be found: Geter et al. (2017). Hyperactive mTOR and MNK1 phosphorylation of eIF4E confer tamoxifen resistance and estrogen independence through selective mRNA translation reprogramming. Genes Dev. 31:2235–49. The microarray dataset was downloaded from the Gene Expression Omnibus accession GSE92598 and the original data can be found in the published article Silvera et al. (2017). mTORC1 and − 2 Coordinate Transcriptional and Translational Reprogramming in Resistance to DNA Damage and Replicative Stress in Breast Cancer Cells. Mol Cell Biol. 37:e00577–16. The ribosome profiling dataset was downloaded from the Gene Expression Omnibus accession GSE35469 and the original data can be found in the published article Hsieh et al. (2012), The translational landscape of mTOR signaling steers cancer initiation and metastasis. Nature. 485:55–61.
Authors’ contributions
AE developed the software, ran test analysis and contributed to writing the manuscript. RJS aided in conceptualizing the tool and edited the manuscript. KVR led the project, aided in software development and wrote the manuscript. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Not Applicable.
Consent for publication
Not Applicable.
Competing interests
The authors declare that they have no competing interests.
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Robert J. Schneider, Phone: 212-263-6006, Email: Robert.schneider@nyulangone.org
Kelly V. Ruggles, Phone: 212-263-3642, Email: Kelly.ruggles@nyulangone.org
References
- 1.Silvera D, Formenti SC, Schneider RJ. Translational control in cancer. Nat Rev Cancer. 2010;10:254. doi: 10.1038/nrc2824. [DOI] [PubMed] [Google Scholar]
- 2.Geter PA, Ernlund AW, Bakogianni S, Alard A, Arju R, Giashuddin S, et al. Hyperactive mTOR and MNK1 phosphorylation of eIF4E confer tamoxifen resistance and estrogen independence through selective mRNA translation reprogramming. Genes Dev. 2017;31:2235–2249. doi: 10.1101/gad.305631.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Silvera D, Ernlund A, Arju R, Connolly E, Volta V, Wang J, et al. mTORC1 and −2 Coordinate Transcriptional and Translational Reprogramming in Resistance to DNA Damage and Replicative Stress in Breast Cancer Cells. Mol Cell Biol [Internet]. 2017;37(5):e00577–16 Available from: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5311240/ [cited 2 Dec 2017]. [DOI] [PMC free article] [PubMed]
- 4.Larsson O., Tian B., Sonenberg N. Toward a Genome-Wide Landscape of Translational Control. Cold Spring Harbor Perspectives in Biology. 2012;5(1):a012302–a012302. doi: 10.1101/cshperspect.a012302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Gandin V, Masvidal L, Hulea L, Gravel S-P, Cargnello M, McLaughlan S, et al. nanoCAGE reveals 5′ UTR features that define specific modes of translation of functionally related MTOR-sensitive mRNAs. Genome Res. 2016;26:636–648. doi: 10.1101/gr.197566.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Ingolia NT, Ghaemmaghami S, Newman JRS, Weissman JS. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–223. doi: 10.1126/science.1168978. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Olshen AB, Hsieh AC, Stumpf CR, Olshen RA, Ruggero D, Taylor BS. Assessing gene-level translational control from ribosome profiling. Bioinformatics. 2013;29:2995–3002. doi: 10.1093/bioinformatics/btt533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Li W, Wang W, Uren PJ, Penalva LOF, Smith AD. Riborex: fast and flexible identification of differential translation from Ribo-seq data. Bioinformatics. 2017;33:1735–1737. doi: 10.1093/bioinformatics/btx047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zhong Y, Karaletsos T, Drewe P, Sreedharan VT, Kuo D, Singh K, et al. RiboDiff: detecting changes of mRNA translation efficiency from ribosome footprints. Bioinformatics. 2017;33:139–141. doi: 10.1093/bioinformatics/btw585. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Xiao Zhengtao, Zou Qin, Liu Yu, Yang Xuerui. Genome-wide assessment of differential translations with ribosome profiling data. Nature Communications. 2016;7:11194. doi: 10.1038/ncomms11194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, et al. limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res. 2015;43:e47. doi: 10.1093/nar/gkv007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26:139–140. doi: 10.1093/bioinformatics/btp616. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Wickham H. ggplot2: Elegant Graphics for Data Analysis. Verlag New York: Springer; 2009. [Internet]. Available from: http://ggplot2.org
- 14.Parent R, Beretta L. Translational control plays a prominent role in the hepatocytic differentiation of HepaRG liver progenitor cells. Genome Biol. 2008;9:R19. doi: 10.1186/gb-2008-9-1-r19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Badura M, Braunstein S, Zavadil J, Schneider RJ. DNA damage and eIF4G1 in breast cancer cells reprogram translation for survival and DNA repair mRNAs. PNAS. 2012;109:18767–18772. doi: 10.1073/pnas.1203853109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Schott J, Reitter S, Philipp J, Haneke K, Schäfer H, Stoecklin G. Translational regulation of specific mRNAs controls feedback inhibition and survival during macrophage activation. PLoS Genet. 2014;10:e1004368. doi: 10.1371/journal.pgen.1004368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Oertlin C, Lorent J, Gandin V, Murie C, Masvidal L, Cargnello M, et al. Genome-wide analysis of differential translation and differential translational buffering using anota2seq. bioRxiv. 2017;106922 Available from: 10.1101/106922.
- 18.Cenik C, Cenik ES, Byeon GW, Grubert F, Candille SI, Spacek D, et al. Integrative analysis of RNA, translation, and protein levels reveals distinct regulatory variation across humans. Genome Res. 2015;25:1610–1621. doi: 10.1101/gr.193342.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Reich M, Liefeld T, Gould J, Lerner J, Tamayo P, Mesirov JP. GenePattern 2.0. Nat Genet. 2006;38:500. doi: 10.1038/ng0506-500. [DOI] [PubMed] [Google Scholar]
- 20.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29:15–21. doi: 10.1093/bioinformatics/bts635. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. 2015;31:166–169. doi: 10.1093/bioinformatics/btu638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Larsson Ola, Bitterman Peter B. mTOR Pathway and mTOR Inhibitors in Cancer Therapy. Totowa, NJ: Humana Press; 2009. Genome-Wide Analysis of Translational Control; pp. 217–236. [Google Scholar]
- 23.Hsieh AC, Liu Y, Edlind MP, Ingolia NT, Janes MR, Sher A, et al. The translational landscape of mTOR signalling steers cancer initiation and metastasis. Nature. 2012;485:55–61. doi: 10.1038/nature10912. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Tebaldi T, Dassi E, Kostoska G, Viero G, Quattrone A. tRanslatome: an R/Bioconductor package to portray translational control. Bioinformatics. 2014;30:289–291. doi: 10.1093/bioinformatics/btt634. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Michel AM, Mullan JPA, Velayudhan V, O’Connor PBF, Donohue CA, Baranov PV. RiboGalaxy: a browser based platform for the alignment, analysis and visualization of ribosome profiling data. RNA Biol. 2016;13:316–319. doi: 10.1080/15476286.2016.1141862. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The Geter et al. polysome profiling dataset was analyzed and has been mounted to the RIVET web page as a sample dataset and were original data were downloaded from Gene Expression Omnibus accession GSE107590. The original data can be found: Geter et al. (2017). Hyperactive mTOR and MNK1 phosphorylation of eIF4E confer tamoxifen resistance and estrogen independence through selective mRNA translation reprogramming. Genes Dev. 31:2235–49. The microarray dataset was downloaded from the Gene Expression Omnibus accession GSE92598 and the original data can be found in the published article Silvera et al. (2017). mTORC1 and − 2 Coordinate Transcriptional and Translational Reprogramming in Resistance to DNA Damage and Replicative Stress in Breast Cancer Cells. Mol Cell Biol. 37:e00577–16. The ribosome profiling dataset was downloaded from the Gene Expression Omnibus accession GSE35469 and the original data can be found in the published article Hsieh et al. (2012), The translational landscape of mTOR signaling steers cancer initiation and metastasis. Nature. 485:55–61.