Skip to main content
PLOS Computational Biology logoLink to PLOS Computational Biology
. 2021 Dec 8;17(12):e1009658. doi: 10.1371/journal.pcbi.1009658

Multifunctional RNA-binding proteins influence mRNA abundance and translational efficiency of distinct sets of target genes

Valentin Schneider-Lunitz 1,, Jorge Ruiz-Orera 1,, Norbert Hubner 1,2,3,*, Sebastiaan van Heesch 4,*
Editor: Greg Tucker-Kellogg5
PMCID: PMC8687540  PMID: 34879078

Abstract

RNA-binding proteins (RBPs) can regulate more than a single aspect of RNA metabolism. We searched for such previously undiscovered multifunctionality within a set of 143 RBPs, by defining the predictive value of RBP abundance for the transcription and translation levels of known RBP target genes across 80 human hearts. This led us to newly associate 27 RBPs with cardiac translational regulation in vivo. Of these, 21 impacted both RNA expression and translation, albeit for virtually independent sets of target genes. We highlight a subset of these, including G3BP1, PUM1, UCHL5, and DDX3X, where dual regulation is achieved through differential affinity for target length, by which separate biological processes are controlled. Like the RNA helicase DDX3X, the known splicing factors EFTUD2 and PRPF8—all identified as multifunctional RBPs by our analysis—selectively influence target translation rates depending on 5’ UTR structure. Our analyses identify dozens of RBPs as being multifunctional and pinpoint potential novel regulators of translation, postulating unanticipated complexity of protein-RNA interactions at consecutive stages of gene expression.

Author summary

The lifecycle of an RNA molecule is controlled by hundreds of proteins that can bind RNA, also known as RNA-binding proteins (RBPs). These proteins recognize landing sites within the RNA and guide the RNA’s transcription from DNA, its processing into a mature messenger RNA, its translation into protein, or its degradation once the RNA is no longer needed. Although we now mechanistically understand how certain RBPs regulate these processes, for many RBP-target interactions the consequences imposed by RNA binding are not well understood. For 143 RBPs with known RNA binding positions, the authors of the current study investigated how RNA molecules responded to fluctuations in the expression levels of these RBPs, across each of 80 human hearts. Using statistical approaches, they could show that many RBPs influenced stages of the RNA lifecycle that they were not known to be involved in. Some RBPs turned out to be true "all-rounders" of RNA metabolism: they controlled the RNA transcript levels of some genes, whereas they influenced the translation rates of others. This unexpected multifunctionality unveiled previously hidden aspects of the everyday RNA-binding protein working life.

Introduction

RNA-protein interactions are crucial for a wide range of processes in multiple subcellular compartments, including RNA transcription, splicing, editing, transport, stability, localization, and translation [1]. Using state-of-the-art mass spectrometry-based approaches, recent studies have identified up to thousands of potential RNA-binding proteins (RBPs), although for many their precise roles remain unknown [24]. Whereas RBPs can interact with target RNAs through well-defined protein-RNA binding domains (RBD), uncharacterized RBDs are frequently discovered [5,6] highlighting the complex, diverse, and still largely unknown nature of RNA-protein interactions. The possession of more than a single RBD could theoretically allow RBPs to be multifunctional, for instance through the separate regulation of different sets of targets [7]. Multifunctionality may additionally be established through the condition- or cell type-specific expression of RBPs and their interaction partners, or through dynamic shuttling of RBPs between different subcellular compartments, such as the nucleus and cytosol [811]. Accordingly, out of a large number of RBPs whose subcellular localization was recently evaluated [12], the vast majority could be detected in more than one compartment, albeit of thus far largely unknown biological significance. The potential importance of RBP shuttling was exemplified by the subcellular redistribution of dozens of RBPs upon global induction of mRNA decay [13], and compartment-specific changes in RBP interactomes that could be witnessed upon cellular stress [14]. These observations indicated that RBPs may dynamically relocate to form a mechanistic bridge between normally compartment-restricted consecutive stages of gene expression regulation: some taking place in the nucleus (e.g., transcription, mRNA processing, splicing), others in the cytosol (e.g., translation, RNA decay).

RBPs are crucial regulators of mRNA translation—the synthesis of protein from a messenger RNA template by ribosomes [15]. In the cytosol, RBPs can act in a general capacity (e.g., mRNA translation initiation or elongation), as well as in more specialized functions where select accessory RBPs or heterogeneous ribosomes facilitate the translation of dedicated subsets of target RNAs [1618]. In the nucleus, RBPs can coordinate the process of splicing and translation by modulating different structural and sequence properties of the mRNAs that are exported to the cytoplasm. For instance, highly structured regions in 5’ UTRs can decrease the efficiency of translation initiation at the cost of an overall lower translational output [1922], whereas increased RNA structures in CDS or 3’ UTR regions may also enhance transcript stability and hence mRNA half-life, in turn yielding higher protein output over time [23,24]. Additionally, alternative UTR usage can expose translated upstream ORFs (uORFs) or miRNA binding sites affecting mRNA translation and/or stability. Other alternative splice events might affect intrinsic properties of an mRNA, such as the ‘swiftness’ of nuclear export [25], codon usage [26,27], affinity with the exon junction complex [28], or susceptibility to nonsense-mediated decay [29].

Mechanistic insight into the quantitative effects of RBP expression on mRNA translation–i.e., when translation rates of the endogenous RBP targets respond directly to changes in RBP abundance–was recently provided by several studies [3032]. These studies used this quantitative relationship to assign novel functions to known RBPs and to investigate the kinetics through which RBPs regulate their targets. For instance, Chothani et al. used a computational correlation approach to define the frequency with which RBP levels correlate with target translation rates. This helped to pinpoint key RBP network hubs that were crucial for translational regulation during cardiac fibrosis in vitro and in vivo [30]. In a separate study, Luo et al. used luciferase-based 3’ UTR tethered function assays for 690 RBPs, identifying 50 RBPs whose expression induced significant positive or negative effects on mRNA stability and/or translation. This resulted in the novel characterization of the stress granule RBP UBAP2L as a ribosome-associated RBP [31]. Lastly, Sharma et al. developed a methodology to investigate RNA-protein kinetics in a time-resolved manner for the RBP DAZL [32]. This approach helped establish a quantitative relationship that precisely explained the effect of DAZL on mRNA levels and ribosome association, which correlated with the cumulative probability of DAZL binding within clusters of proximal 3’ UTR binding sites.

In our current study, we describe the computational identification of putatively new, and sometimes dual functions of RBPs in the regulation of mRNA translation. We integrate crosslinking immunoprecipitation (CLIP)-derived mRNA targets for 143 RBPs [12,33] with the transcriptomes and translatomes of 80 human hearts [34]—a tissue where translational control is known to play a central role in gene expression regulation [30,34,35]. We show that the expression levels of many, but not all, investigated RBPs indeed correlate with target mRNA abundance and/or translational efficiency in vivo. For a subset of 21 RBPs–including proteins with diverse previously described roles, such as the endoribonuclease G3BP1, the helicase DDX3X, the protease PUM1 and the deubiquitinase UCHL5 –we could independently assign dose-dependent effects to both mRNA levels and translational efficiencies of largely distinct sets of target genes, each involved in unrelated biological processes. Mechanistically, these target genes also appeared to be regulated independently, driven by differential affinity for protein-coding sequence length or 5’ UTR structure.

Our results show that RBPs with more than a single role in human biology are likely to be more prevalent than currently anticipated. We postulate that multifunctional RBPs may use their functional plasticity in a condition- or compartment-specific manner to regulate gene expression at multiple levels, for separately defined sets of target genes.

Results

RNA-binding protein abundance determines the efficiency of target gene translation

With the aim to define which RBPs can influence more than a single stage of gene expression control, we first determined whether RBP abundance can have predictive value for the extent of target gene regulation in the human heart. To this extent, we compiled protein-RNA interactions for 143 cardiac-expressed RBPs, consisting of the muscle-specific RBM20 [33] and 142 mostly ubiquitously expressed RBPs previously characterized in depth as part of ENCODE [12] (see Methods, S1A Fig and S1 Table). We then correlated the expression levels of these 143 RBPs with mRNA abundance and translational efficiencies (TE) of 11,387 cardiac-expressed genes across 80 human hearts—the largest set of matched human tissue transcriptomes and translatomes that is currently available [34]. mRNA abundance was measured using RNA-seq counts normalized per gene and per sample, whereas TE was defined as the ratio of Ribo-seq and RNA-seq counts normalized for each individual gene and sample (see Methods). This revealed a clear quantitative relationship between RBP expression levels and the extent of target gene expression control (e.g., translation rates: Figs 1A and S1B).

Fig 1. RNA-binding protein abundance predicts target translational regulation.

Fig 1

(A) Schematic of the RBP-target correlation approach. Using the quantified Ribo-seq and RNA-seq data from 80 hearts, pairwise RBP versus target mRNA abundance or translational efficiency correlations were calculated. A heatmap with hierarchically clustered translational efficiency Spearman’s Rho correlations of RBPs and translated mRNAs in the human heart are shown. Six clusters of coregulated RBPs are highlighted (See also S1 Table). (B) Heatmap with Glass’ △ scores that quantify the effect size of the witnessed significance of associations between RBPs and target gene mRNA abundance and TE. Only significant RBPs are shown: 37 TE-RBPs (orange) and 58 mRNA-RBPs (green). For three selected RBPs (one per category), histograms illustrate the significance of the calculated associations. (C) Dot plot displaying the fraction of translational efficiency RBP-target correlations that can be replicated in an independent set of primary cardiac fibroblasts [30]. For each RBP, the significance of the replication was evaluated by comparing the replicated fraction between observed and randomized sets and it is represented as a brown (significant) or red (non-significant) dot. The size of the dots indicates the strength of significance (-log10 (padj)) and grey dots correspond to the fraction of replicated correlations in randomized sets. Error bars indicate mean values with standard deviation (SD).

Next, we calculated the frequency with which mRNA levels and translational efficiencies of the CLIP-derived target genes correlated significantly with the abundance of each RBP. For this, we statistically evaluated associations through the sampling of 100,000 equally sized sets of simulated theoretical targets out of the 11,387 translated genes in the human heart, yielding 58 RBPs with significant associations (empirical padj ≤ 0.05) with target mRNA abundance (hereafter denoted as “mRNA-RBPs” that regulate “mRNA targets”) and 37 RBPs with significant associations with target translational efficiencies (“TE-RBPs” regulating “TE targets”) (Figs 1B and S1C). The effect size of each significant association was quantified using the Glass’ Delta score (△) [36], a measure of the difference between the experimental and simulated groups divided by the standard deviation of the control. These significant targets, for instance, included the candidate tumor suppressor and ubiquitous splicing regulator RBM5 [37], which we identified as a cardiac mRNA-RBP influencing mRNA abundance of at least 138 correlating targets (padj = 2.83 × 10−5; Glass’ Δ = 27.2). Reassuringly, our strategy validated known TE-RBPs such as the eukaryotic translation initiation factor EIF4G2, whose expression dynamics could be associated with target gene translational efficiencies of at least 235 correlating targets expressed in the human heart (padj = 5.26 × 10−5; Glass’ Δ = 6.3) [38,39] (Fig 1B). Importantly, we could replicate our calculations for 25 out of 37 depicted TE-RBPs in an independent, though smaller cohort of primary cardiac fibroblast translatomes (n = 20; [30])—a system previously explored to identify RBPs with key roles in cardiac fibrosis (Fig 1C).

Positive and negative control of translation by known and unknown factors

We detected 27 (out of 37) TE-RBPs without prior evidence of regulating mRNA translation, including 4 RBPs with no function assigned to their RNA binding ability at all (NKRF, FAM120A, SUB1 and UCHL5) (S2A Fig). To define how these RBPs interact with existing RBP networks to regulate target gene translation in a coordinated manner, we hierarchically clustered the correlation coefficients of all 37 TE-RBPs and their CLIP targets. This primarily divided the matrix in two distinct groups of TE-RBPs with marked opposite effects on target TE, indicative of competition and/or cooperation between subsets of RBPs (Fig 2A). Interestingly, the presented method joins known and novel TE-RBPs with opposing or concordant directionality of regulation on shared targets (Fig 2B). For instance, depending on the shared target gene bound by the splicing factor U2AF2 and the protease UCHL5, completely opposite effects on TE could be observed and independently replicated (Figs 2B, S2B and S2C). Moreover, most shared targets were not affected by RBP collinearity (Fig 2B, see Methods). Although these shared modes of target regulation were in part concordant with protein-protein interactions annotated in the STRING database [40], a subset of coregulatory “RBP hubs” contained proteins with previously unknown functional similarities (e.g., UCHL5 and U2AF2).

Fig 2. CLIP analysis identifies coregulated in vivo targets of novel master regulators of translation in the human heart.

Fig 2

(A) Heatmap displaying the hierarchically clustered correlations between the cardiac expression levels of the 37 TE-RBPs (as determined by normalized Ribo-seq expression) and the cardiac TE of 6,153 correlating target genes. Each of the significantly correlating target genes was previously found to be bound by at least one of these 37 TE-RBPs based on CLIP experiments (see Methods). The clustering separates two groups with opposite effects on TE, whose targets are enriched for mRNA metabolism (padj = 6.17 x 10−54) and endoplasmic reticulum (padj = 1.82 x 10−7) GO terms, respectively. (B) Dendrogram with hierarchically clustered TE-RBPs based on pairwise RBP-RBP overlaps. Shared target genes of all paired RBPs were included for clustering. Bottom heatmaps with translational efficiency correlations of selected RBP clusters and shared significant targets. These plots illustrate distinct cooperative and competitive RBP-target regulation modes. Pie charts illustrate the fraction of targets that remain significant after correcting for RBP collinearity per cluster. STRING protein-protein interaction networks [40] from selected RBP clusters reveal functional association of coregulated RBPs. Colours in edges and nodes indicate the sources of STRING evidence and known RBP functions. (C) Heatmap with hierarchically clustered Spearman’s Rho correlation scores of RBM20 and the translational efficiency of the predicted target genes. Significant correlating targets (n = 163, padj ≤ 0.05) and targets involved in muscle process (GO: 0003012) are highlighted in orange and light blue colours respectively. A list of sarcomere gene targets positively correlating with RBM20 is displayed. Selected bottom histograms illustrate the significance of RBM20 with correlating TE targets and the absence of significance with correlating mRNA targets. (D) Scatter plots representing the correlation between RBM20 expression (as measured by normalized Ribo-seq counts; x-axis) and the translational efficiency (TE; y-axis) of two sarcomere genes: TNNI3K and TTN. Score and level of significance of the two Spearman’s correlations are displayed. (E) Left: Scatter plot showing the correlation between normalized RBM20 expression levels (as measured by Ribo-seq) and the percent spliced in (PSI) of TTN exon 156. Right: Box plot comparing average TTN I-Band isoform-specific TEs, showing a marked difference between TTN isoform N2B (ENST00000460472), displaying a significantly higher TE than TTN isoform N2BA (ENST00000591111) (Wilcoxon rank sum test, p-value = 0.034).

Amongst the 27 potentially new TE-RBPs was the muscle-specific and dilated cardiomyopathy-associated splicing regulator RBM20, whose expression correlated particularly well with the TE of 163 experimentally validated target genes (out of 561 total targets; Glass’ Δ = 7.0; Fig 2C). Importantly, RBM20 levels specifically influenced TE and had no impact on overall mRNA abundance or stability. Most RBM20 targets, including the sarcomere genes TTN and TNNI3K, correlated positively (i.e., higher RBM20 expression associates with increased target gene TE; Fig 2D) and especially those positively correlating targets showed strong enrichment for muscle function processes (GO:0003012, padj ≤ 5.97 × 10−16) (Fig 2C). To investigate a possible connection between RBM20-mediated mRNA splicing and the subsequent efficiency of mRNA translation, we evaluated whether splicing rates of known target exons correlated directly with TE. Splicing rates were quantified using calculated percent spliced in (PSI) scores, a metric that evaluates the efficiency of splicing for a specific exon into the transcript population of a gene (see Methods). For 66 out of 163 (± 40%) translationally regulated RBM20 target genes, the extent of alternative splicing indeed correlated with RBM20 abundance. A clear example is the exon inclusion measured across the TTN I-band, whose exons are only included in the longer TTN N2BA isoform. These I-band exons specifically drive the negative correlation of RBM20 expression with overall TTN TE, indicating that their inclusion reduces the efficiency with which TTN can be translated. We had previously observed that TTN translation rates are strongly isoform-dependent [34] and can now mechanistically connect this to splicing control by RBM20 (Fig 2E), a consequence that seems generalizable for more muscle-specific RBM20 targets, including other sarcomere components (Fig 2D). Although the precise mechanism through which RBM20 influences TE remains unknown, this RBP may omit the inclusion of exons with inefficient codon translation rates or exons that impact the stability or structure of the transcript, both of which influence protein synthesis rates [28,41].

Multifunctional RBPs including DDX3X and G3BP1 regulate mRNA abundance and translational efficiency of independent sets of target genes

Among the 74 RBPs that correlated significantly with target gene TE (37 TE-RBPs) or mRNA abundance (58 mRNA-RBPs), a subset of 21 RBPs could be associated independently with both molecular traits (Figs 1B and 3A). To investigate whether the association with mRNA abundance and TE was interrelated (and hence successive, i.e., higher target expression drives increased TE), we examined the sets of genes that correlated significantly with either trait. This revealed a very limited overlap between correlating target genes for all 21 RBPs (16.71 ± 8.19%, Fig 3A and S2 Table). To substantiate this observation, we compared the trait-specific strength of the effect sizes (again as measured by the Glass’ Delta (△) [36]) and found largely no relation (S2 Table) between the correlations as independently witnessed for both traits, confirming the absence of a carryover effect (Figs 3B and S3A). This led us to denote these RBPs as “multifunctional RBPs”—context-specific RBPs whose functional outcome depends on the set of mRNAs it targets. A key example appears to be the multifunctional RBP DDX3X [42,43], whose abundance correlates significantly with the mRNA levels of 339 target genes (padj = 2.83 × 10−5; Glass’ Δ = 6.9) and the translational efficiency of 730 target genes (padj = 5.25 × 10−5; Glass’ Δ = 11.89), of which only 43 targets overlap between both sets (Fig 3A and 3C). The consequences of DDX3X binding for mRNA abundance (positive correlation) or TE (negative correlation) are opposite, though this is not the case for all multifunctional RBPs (S3B Fig). Three other multifunctional RBPs similarly act as repressors of translation whilst having a positive effect on mRNA abundance (DDX6, NKRF, GEMIN5), one RBP shows the exact opposite behavior (FAM120A), and all others have concordant roles at both layers of control (e.g., TRA2A, FASTKD2, SRSF1).

Fig 3. Multifunctional RBPs regulate translation of distinct sets of target genes.

Fig 3

(A) Heatmap with Glass’ △ scores quantifying the effect size of the witness effects for mRNA and TE correlations. Both effect sizes are significant for a highlighted set of 21 multifunctional RBPs. For this set of RBPs, individual Venn Diagrams representing the overlap in the total number of mRNA and TE targets are displayed. (B) Bar plot quantifying the magnitude of mRNA and TE effect size (Glass’ △ scores) for multifunctional RBPs. RBP effect sizes are largely independent of the mode of regulation. (C) Selected histograms and dot plots illustrating the significance of RBP-target correlations and the enrichment of GO terms for the targets bound by 4 multifunctional RBPs: DDX3X, G3BP1, PUM1, and UCHL5. For each RBP, the 12 most significant parental GO terms are displayed. For three of the RBPs, mRNA and TE targets exhibit different enrichment of significant GO terms. (D) Box plots with transcript, 5’ UTR, CDS, and 3’ UTR sequence lengths in nucleotides for mRNA and TE targets corresponding to the four selected multifunctional RBPs in (C). A total of 9 multifunctional RBPs bind targets with significantly different CDS lengths (Wilcoxon rank sum test). See also S3 Fig.

Of note, the TE and mRNA target genes of multifunctional RBPs can be enriched for separate biological processes, indicating that duality can contribute to independent biological outcomes. For instance, correlating DDX3X and UCHL5 TE targets code for proteins involved in RNA splicing (GO:0008380, padj = 7.70 × 10−30), while their mRNA targets did not show any clear functional enrichment (Fig 3C). For G3BP1, mRNA targets code for proteins involved in localization to nuclear body (GO:1903405, padj = 5.13 × 10−12), whereas this is not the case for translationally regulated targets, which are enriched for RNA splicing (GO:0008380, padj = 1.61 × 10−10). Such biological discrepancies are not always present: independent of the mode of regulation, both types of PUM1 targets (TE or mRNA) appear to code for proteins involved in mRNA processing (GO:0006397, TE padj = 2.64 × 10−22, mRNA padj = 6.40 × 10−13).

Differential affinity of multifunctional RBPs for CDS lengths and 5’ UTR structures

Dual- or multiprotein functionality can be achieved through context-specific differences in subcellular localization [44], interaction partners [45,46], or the presence of multiple RNA-binding domains [47]—all of which can fine tune or restrict the subset of recognized target genes. Based on published immunofluorescence imaging-based evidence of subcellular RBP localization [12], 13 out of 21 multifunctional RBPs indeed localized equally well to both nucleus and cytosol, suggesting functionality in both compartments (S3 Table).

We additionally explored the relative position of CLIP binding sites in target genes (i.e., the position of binding within the mRNA: 5’ or 3’ UTR, CDS, or intronic). Most of the RBPs (including e.g., DDX3X) were recruited to target gene regions in different proportions, but we observed no marked difference in binding positioning between mRNA and TE targets (S3B Fig). However, for SRSF7 and GEMIN5, mRNA and TE targets were enriched for intronic and CDS binding sites respectively. For DDX6, we observed an enrichment of TE targets with 3’ UTR binding sites, of which 87% correlated negatively with the RBP. A recent study describing a tethered function assay to assess RBP functionality found a highly concordant role for DDX6 in the repression of translation, which resulted directly from its recruitment to target gene 3’ UTRs [31]. This is in line with previously described repressive functions of DDX6 during mRNA translation [48].

For DDX3X and 10 other RBPs, we noticed a significant change in target transcript length, mostly explained by differences in target CDS length, which slightly increased or decreased between TE and mRNA targets (Figs 3D and S3C). The most significant changes in CDS length were seen for GEMIN5 (decrease for TE targets; 2,226 nt vs. 1,519 nt; padj = 3.66 × 10−9), PRPF8 (decrease for TE targets; 2,243 nt vs. 2,076 nt; padj = 1.03 × 10−8), DDX3X (decrease for TE targets; 1,659 nt vs 1,376 nt; padj = 2.81 × 10−7) and G3BP1 (increase for TE targets; 1,985 vs. 2,798 nt; padj = 6.11 × 10−5).

Our attention was drawn to the DEAD-box helicase DDX3X, which regulates translation initiation by interacting with, and subsequently disentangling, highly structured RNA sequences [4952]. DDX3X binds 5’ UTRs and the small ribosomal unit to facilitate the translation of a subset of mRNAs with long and structured leader sequences [52]. In order to define if additional RBPs may be required for, or involved in, translation initiation at targets with highly structured 5’ UTRs, we looked into the 5’ UTR’s minimum free energy (MFE, length normalized) of TE and mRNA target genes, for each of the multifunctional RBPs. The MFE defines the most thermodynamically probable secondary structure for each RNA sequence, with lower MFE values pointing to more complex and structured predicted conformations. We observed that between the positively and negatively correlating target translational efficiencies of 17 out of 21 multifunctional RBPs, 5’ UTR sequences differed in structural composition (Fig 4A). In contrast, there were poor to almost no differences for the significantly correlating mRNA targets of the same RBPs. Strikingly, three RBPs exhibited by far the strongest MFE differences between positively and negatively correlating targets: next to DDX3X (padj = 9.47 x 10−47), those were the core spliceosome factors PRPF8 (padj = 2.70 x 10−29) and EFTUD2 (padj = 1.69 x 10−30) (Fig 4A and 4B). Positively and negatively correlating targets also displayed minor differences in 5’ UTR lengths (S4A Fig), but the difference in MFE normalized by length was much greater than the small difference observed in UTR length, which was therefore corrected for. Moreover, the effect of MFE was still highly significant after controlling for UTR length (S4B Fig). The targets shared between these RBPs displayed similar directions of correlation with the three RBPs (Fig 4C) suggesting that these three RBPs might rely on a similar mechanism for translational regulation that is dependent on the 5’ UTR structures of shared targets. Interestingly, we found that positively and negatively correlating targets were involved in different functions and cellular compartments (S4C Fig). For instance, positively correlating targets from DDX3X and EFTUD2 were enriched for proteins that constitute ribonucleoprotein complexes, in line with a recent study that showed strong translational downregulation of ribonucleoproteins after knockdown of DDX3X [53]. On the contrary, negatively correlating targets of DDX3X and EFTUD2 were enriched for proteins localizing to membrane-bounded cytoplasmic organelles, such as the mitochondria.

Fig 4. Differential affinity of multifunctional RBPs for 5’ UTR structures often drives opposite quantitative TE effects.

Fig 4

(A) Dot plot displaying the significance of the differences in 5’ UTR minimum free energy (MFE, normalized by length) between target genes that correlate positively or negatively with each multifunctional RBP. Significance values are calculated separately for mRNA (green) and TE (brown) targets. Adjusted p-values are shown on a -log10 scale and calculated using the Wilcoxon Rank Sum test and only 5’ UTR sequences with a minimum length of 20 nucleotides were evaluated. A dashed vertical line indicates the minimum adjusted p-value to consider the differences in MFE as significant (padj < 0.05). (B) Box and violin plots with length normalized MFE scores for positively and negatively correlated TE targets corresponding to the three selected multifunctional RBPs with the highest significance (Wilcoxon rank sum test) in Fig 4A (DDX3X, EFTUD2, PRPF8). For comparison, non-correlating target genes were included in the panel figure. (C) Three-way Venn Diagram representing the overlap in the number of TE targets for the three selected RBPs. The heatmap represents TE correlations of 156 shared target genes for the three cases.

Discussion

Increasing evidence suggests that RBPs can act as multifunctional gene expression regulators [14,54]. Here, we built an in-silico method for the large-scale analysis of RBP-driven regulation using correlation as a proxy for mRNA abundance and translational efficiency (TE) of target genes across 80 human heart samples. Our approach underscores the functional importance of RBP expression fluctuations in the control of gene expression, a mechanistic feature recently highlighted by others in vitro and in vivo to assign new unknown roles for RBPs [3032]. We exploited the quantitative effect of RBPs on known target genes to implicate 74 RBPs in the regulation of mRNA abundance or translation.

We discovered 27 RBPs with previously unknown roles in translation, some of which have well-characterized functions in other biological processes, including mRNA splicing. Previous work revealed a handful of splicing factors that can independently mediate post-splicing activities, such as mRNA translation [5557]. Instead, alternative splicing mediated by splicing factors can cause downstream effects on the translation of spliced target genes, depending on the qualitative decision of which mRNA isoforms are being produced [1929]. Remarkably, the high fraction of splicing factors that we find to influence translation would suggest previously unanticipated roles for many more splicing regulators in this process. A prominent example is the muscle-specific and cardiac disease-relevant splicing regulator RBM20. RBM20 is a splicing repressor that modulates isoform abundances of many sarcomere genes [33,58]. Here we demonstrate that RBM20 expression correlates positively with the TE of TTN and other sarcomere genes, suggesting that nuclear splicing control can impact cytoplasmic protein synthesis. While we had previously reported that TTN isoform-specific changes in TE exist [34], we now for the first time show that RBM20 has a crucial impact on TTN’s translational output. Future functional studies should determine the specific mode of action of the splicing factors involved in translation and whether any isoform-specific characteristics contribute to the observed differences in TE.

Besides the discovery of potential novel functions for a subset of RBPs, we provide evidence that 21 RBPs can modulate both target mRNA abundance and TE—a class of RBPs that we classify as “multifunctional RBPs”. Multifunctional RBPs appear to be involved in the regulation of mRNA abundance and TE of distinct groups of target genes. These target genes can be concordantly or discordantly regulated on either layer of gene expression control. We discovered that the specific affinity of several RBPs to structural properties of mRNAs, such as protein-coding sequence length, UTR length or RNA secondary structure, contribute separately to the observed independent effects on mRNA abundance or translation. Hence, further experimental validation is required to confirm the quantitative effect of each individual multifunctional RBP and identify the precise mechanisms behind this dual functionality. In support of our findings, a recent study [32] has inspected the connection between the binding kinetics of the RBP DAZL and its effect on mRNA abundance and translation of specific sets of target genes, identifying several 3’ UTR features–UTR length, presence of binding clusters, distance to the polyadenylation site–that are linked to the trait-specific regulation of different groups of targets. In addition, the usage of different ribosome binding domains, the recognition of alternative RBP motifs and the presence of binding sites located in different gene regions (i.e., UTRs, CDS, or introns) can also be indicative of RBP multifunctionality [59]. Nevertheless, we found these mRNA characteristics to remain largely unchanged for the targets of most multifunctional RBPs identified in this study.

For a subgroup of multifunctional RBPs, we noticed that the targets regulated on the transcriptional or translational level represent functionally different gene classes. Indeed, the observed biological diversity in our study seems to match the condition-specific regulatory complexity that needs to be achieved by a single RBP. For instance, this appears to be the case for G3BP1—a known multifunctional RBP that can selectively compartmentalize specific sets of mRNAs to stress granules, in order to reprogram mRNA translation under certain global stress conditions [6062], as well as tissue-specific contexts such as atrial fibrillation in heart [63]. Additionally, G3BP1 plays an important role in DNA/RNA unwinding [64] and binds to specific RNA stem-loop structures to trigger mRNA degradation [65], which is essential for maternal mRNA clearance [66]. Another example is DDX3X, a DEAD box helicase which can respond to stressors (e.g., viral infections [67]) by switching subcellular compartments. DDX3X is involved in multiple processes required for RNA metabolism [42,43], for which it uses its capacity to unwind complex and structured 5’ UTRs to promote translation initiation at selected subsets of mRNAs [43,49,52,68]. However, there is ongoing debate as to the precise roles of DDX3X and the mechanisms through which it regulates RNA metabolism [68], as it can act both as a repressor or activator of translation [69].

Our work points to an intricate relation between the direction of translation regulation and target 5’ UTR structure, with the TE of certain targets being positively or negatively influenced by multifunctional RBP binding depending on the complexity of target 5’ UTR sequences. Unexpectedly, our results show that increased levels of DDX3X correlate with a lower TE for targets with highly structured 5’ UTRs. This seems to contradict recent in vitro reports where DDX3X knockdown in human cells [52] resulted in translational repression of mRNAs with structured 5’ UTRs. Next to DDX3X, the strongest impact of the 5’ UTR on translational output is observed for EFTUD2 and PRPF8, which display patterns of regulation highly similar to DDX3X, suggesting an analogous mode of action in the control of target translation rates. Surprisingly, EFTUD2 and PRPF8 are splicing factors which are part of the central component of the U5 snRNP spliceosome [70] and had not been implicated in translation before. However, the conserved GTPase EFTUD2 has sequence similarity to the translation elongation factor EF-2 [71], possibly explaining its capacity to influence translation. Both ancient paralogs may have evolved and diversified to complement each other.

Whereas dual functionality of extensively studied and characterized RBPs such as DDX3X and G3BP1 had (to a certain extent) been described previously, for a selection of other RBPs our results provide initial observations of dual functionality. For instance, UCHL5 (also known as UCH37) is a protease with RNA binding capacity that may be part of the INO80 chromatin remodeling complex [72], though its role within RNA metabolism is yet unknown. We established a quantitative relationship between UCHL5 expression and variability in mRNA abundance of genes involved in chromatin organization, as well as with changes in TE of genes involved in RNA splicing. Although UCHL5 shared target genes with core splicing factors (U2AF2, EFTUD2 and PRPF8), its effect on the TE of targets shared with these three splicing factors is completely opposite, suggesting contrasting regulatory behavior, and possibly competition.

Very little is known about the molecular processes that control RBP multifunctionality, although some possible mechanisms have been recently investigated, including the formation of heterogeneous RBP complexes [73,74], switches from monomers to multimers in a concentration-dependent manner [75], and changes in subcellular localization [55,76]. In our current study, the potential mechanisms behind the observed multifunctionality could not be explained in a uniform way: there appears to be no ‘one size fits all’ scenario. It is very probable that RBP multifunctionality is achieved by specific combinations of individual RBP and target features, whose precise dissection requires experimental follow-up into each individual multifunctional RBP. An RBP may bind distinct sets of RNA within the nucleus, though, for a subset of targets, the consequences of binding may only become apparent at a later stage of gene expression (e.g., a change in transcript isoform production that is accompanied by a downstream effect on TE). Alternatively, multifunctional RBPs may physically take part in multiple stages of gene expression by adapting subcellular localizations. For instance, HNRNPM (one of the core splicing ribonucleoproteins that we found to influence both target gene mRNA abundance and TE) localizes to nucleus [16] but can be exported to the cytosol to induce cap-independent translation upon hypoxia [55]. Another example, DDX3X, shuttles between nucleus and cytosol [67]. It remains to be established if RBPs with shared targets bind these targets simultaneously, or if there is sequential crosstalk of RBPs and other proteins in the control of target expression. RBP abundance may also respond to target availability and not vice versa, possibly explaining why we find many splicing regulators to rank highly within the RBP hierarchy.

There are limitations to the methods we employ to define biological roles for RBPs and assign targets to each. First, we note that significant associations also occur in the randomly sampled target sets, which indicates a certain level of fuzziness in RBP-target assignment. This appears inevitable for RBPs with general functions in expression control, which bind and share large numbers of targets. However, together with the calculated Glass’ delta, the distributions of significant correlations in the 100,000 randomly sampled target sets give us a good indication of our false discovery rate in target assignment. Particularly for investigations into the effects of RBP binding on individual target genes, the presence of potential false-positive significant correlations indicates that independent experimental validation is warranted. A second limitation includes the use of eCLIP experiments from two cell lines unrelated to the heart. Our results would benefit from large-scale human heart-specific CLIP experiments that better complement the tissue-specificity of the cardiac expressed genes. Nevertheless, all discovered multifunctional RBPs are ubiquitously expressed across human tissues (S1A Fig), suggesting that these proteins are major players in RNA transcriptional and translational regulation, contributing to both global and cell-line or tissue-specific manifestations.

In conclusion, our results illustrate unanticipated complexity in RBP-RNA interactions at multiple consecutive levels of gene expression. This warrants future in-depth experimental research into the identified RBPs in human cardiac biology. Understanding how RBPs cooperate, communicate, interact, and compete across subcellular compartments and in response to changing conditions will be essential to fully comprehend the quantitative nature of the regulatory principles that underlie mRNA metabolism.

Methods

Ribosome profiling and RNA sequencing data analysis

We re-analyzed ribosome profiling (Ribo-seq) and matched RNA-seq datasets from 80 human hearts that we generated and published previously (EGA accession code: EGAS00001003263) [34]. In short, Ribo-seq reads were clipped for residual adapters using FASTX toolkit [77]. Reads mapping to the mitochondrial RNA, ribosomal RNA and tRNA sequences were removed from downstream analysis. Full length paired mRNA-seq reads (2 × 101nt) were trimmed to 29-mers (average length of Ribo-seq reads) to establish a comparable analysis of both Ribo-seq and mRNA-seq datasets and avoid any mapping or quantification bias due to different read length or filtering. Next, Ribo-seq and trimmed mRNA-seq reads were mapped to the human reference genome (GRCh38, Ensembl v87) using STAR v2.5.2b [78] with maximum of 2 mismatches and -seedSearchStartLmaxOverLread = 0.5. Quantification of gene expression was performed by counting reads mapping to coding sequence (CDS) regions of annotated protein-coding genes, using HTSeq v0.9.1 [79]. Gene counts were normalized by estimating the size factors simultaneously on Ribo-seq and RNA-seq datasets using DESeq2 v1.12.4 [80]. This joint normalization is required to compare both measures of gene expression [81]. Translational efficiency (TE) was calculated on the Ribo-seq against RNA-seq ratio for each individual gene and sample, as described previously [34].

Identification of RBP targets from published eCLIP and HITS-CLIP data

Processed eCLIP data of 150 RBPs were obtained from ENCODE (30) for HepG2 (n = 103) and K562 (n = 120) cell lines. Datasets consisted of BED files containing eCLIP peaks and BAM files containing reads mapped to the human genome (GRCh38.p10/hg38). The identification of robust eCLIP peaks across replicates and cell lines was performed as suggested by Van Nostrand and colleagues [12]. First, we used BEDTools [82] to quantify the coverage of each predicted peak using input (mock) and immunoprecipitation (IP, antibody against RBP) BAM files. Next, for each peak, the relative information content was defined as pi x log2 (pi/qi), where p and q are the sum of reads mapping to the peak in IP and negative control respectively. The information content was used to calculate the Irreproducible Discovery Rate (IDR) [83], a parameter indicating reproducible peaks across biological replicates. A significant and reproducible peak was defined meeting an IDR cut-off < 0.01, p-value ≤ 10−5 and fold-enrichment (FC) > 8. In case two or more peaks overlapped the same genomic region, the most significant one was included in the peak table. Additionally, non-overlapping peaks were pooled into a single table, in order to get a complete set in both cell lines. While CLIP data was produced in a non-cardiac setting, CLIP signals are usually preserved among similarly expressed genes of the same RBP independent of the cell line, with peak differences instead reflecting cell type-specific expression rather than binding affinity [12]. Additionally, for the muscle-specific splicing repressor RBM20, which was not part of the ENCODE dataset but included for its importance for cardiac splicing and heart disease [33,58], significant rat RBM20 HITS-CLIP targets were obtained from Maatz et al. [33] and converted to GRCh38.p10/hg38 genomic coordinates. Only 143 RBPs with expression in human heart tissue were kept (mean FPKM across samples >1; 142 ENCODE RBPs and RBM20).

Overall, we retrieved an average of 4,300 eCLIP-seq peaks per experiment. Finally, we mapped these peaks to the annotated transcriptome (Ensembl v.87) and, for each RBP experiment, all the genes supported by at least one CLIP-seq peak were defined as putative target genes.

RBP-target correlation and clustering

For RBP-target correlations and clustering we included genes expressed in the human heart (mean FPKM across samples > 1) with at least one Ribo-seq and mRNA-seq read in a minimum of 20 samples (n = 11,387). Next, for pairwise complete observations, we calculated Spearman correlations between the expression level of the RBP (as measured by Ribo-seq) and either target gene mRNA-seq counts or translational efficiency. Only target genes that showed a significant (padj ≤ 0.05) correlation after correction for multiple testing using the Benjamini-Hochberg approach [84] were retained for downstream analyses. The computed RBP-target correlation matrix was used to calculate the Euclidean distance followed by hierarchical clustering, to group RBPs with similar consequences on their target genes. Cluster visualization was done using heatmap.3 (https://github.com/obigriffith/biostar-tutorials/tree/master/Heatmaps). To ensure significant correlation between RBP and its target genes and exclude numerical relationship (collinearity) between two RBPs, we selected clusters of RBPs and target genes with similar expression profiles and calculated partial correlations. Statistical comparison was enabled by Fisher Z-transformation of two correlation coefficients to a normal distribution.

Target gene enrichment

To identify RBPs that are putative modulators of target gene mRNA abundance and/or TE, we calculated the frequency with which target genes supported with CLIP-seq data correlated significantly with each RBP. We leveraged the significance of these correlating associations by generating 100,000 equally sized sets of theoretical targets out of all translated genes in the human heart; an approach that has previously been shown to be highly effective [30]. For each set, we quantified the amount of significantly correlating genes and compared the theoretical distribution against the actual observation applying an empirical test: Empirical p-value = sum(theoretical targets ≥ true RBP targets) /100,000.

Empirical p-values were corrected for multiple testing (Benjamini-Hochberg method). RBPs that showed a significant (padj ≤ 0.05) enrichment of correlating CLIP-derived target genes were considered as putative regulators of mRNA abundance (n = 58) and/or TE (n = 37). It should be noted that, because of the fixed number of generated random sets, the minimum empirical p-value that can be calculated after correction for multiple testing is 5.25 x 10−5. Hence, the empirical test cannot quantify the strength of significance for a specific observation. Instead, we calculated Glass’ Delta (△)[36] as a measure of the effect size, which is defined as the difference between the two target sets divided by the standard deviation of the theoretical group. Effect size = (true RBP targets—mean(theoretical targets)) / sd(theoretical targets)

RBP expression across GTEx tissues

To determine the patterns of expression of each RBP across human tissues, we obtained expression data from the Genotype-Tissue Expression (GTEx) Project [85], a database that comprises a large set of samples corresponding to 54 human tissues. We used these data to determine the number of tissues with detectable (average TPM ≥ 1) or high (average TPM ≥ 10) expression of a given RBP (S1A Fig). An RBP was categorized as ubiquitously expressed if expression was detected in more than 30 tissues with a TPM ≥ 10.

Replication of target regulation using a public fibroblast cohort

We retrieved raw RNA-seq and Ribo-seq data from a cohort of 20 primary cardiac fibroblast cultures stimulated with TGF-beta [30] and used it as a replication cohort. Raw data are available via the gene expression omnibus (GEO submission: GSE131112, GSE123018, GSE131111) repository. Read pre-processing, mapping, gene quantification and correlation analysis were done following the same procedures described above for the heart datasets (see ‘Read mapping and gene quantification’ and ‘RBP-target correlation and clustering’ subsections). To prove that the regulatory effect of RBPs in target translational regulation can be replicated in an independent dataset, we quantified the fraction of RBP-target correlations with similar direction of regulation in both fibroblast and human heart cohorts. Statistical significance of the observed replications was evaluated by running 10,000 permutations of the correlation coefficients in fibroblasts and comparing the fraction of shared directionality between both cohorts in observed and randomized sets.

Analysis of differential exon splicing

To evaluate whether RBM20 could influence the TE of target genes by modulating isoform production ratios (exon in- or exclusion), we estimated exon splicing rates by calculating the percentage spliced in (PSI) for all exons of known and correlating RBM20 target genes, as described previously [86]. For PSI calculation, we re-mapped the 80 paired-end cardiac mRNA-seq (2 × 101nt) datasets to improve splice site coverage using STAR v2.5.2b [78], allowing a maximum of 6 mismatches.

Functional analysis of RBP associations and target genes

Known and predicted RBP-RBP interactions were retrieved from the STRING database [40] with confidence network edges and default settings. Moreover, we assigned biological functions to define gene targets with gProfiler2 v0.1.9 (archive revision fof4439, [87]) and extracted enriched sets of ‘child’ and ‘parent’ GO terms for the individual sets mRNA and TE targets (padj ≤ 0.05). Significant GO terms that involve a minimum of 20 and a maximum of 500 genes were considered, to avoid the inclusion of too general terms that show significance due to beneficial input to term size ratios.

Analysis of minimum free energy in 5’ UTRs

We predicted 5’ UTR secondary structures through energy minimization using RNAfold from the Vienna Package v2.4 [88]. Using the 5’ UTR sequence of each target gene as input, minimum free energies (MFE) were calculated and length-normalized to observe differences in UTR complexity for target genes that are positively or negatively correlating with RBPs.

General remarks on statistical analysis

Statistical analysis and generation of figures was done using R v3.6.2 [89]. A full list of tools and methods used for data analysis is stated in each corresponding Methods section. Statistical parameters such as n, median/mean, standard deviation (SD) and significance are named in the figures and/or the figure legends. The “n” represents the number of RBPs in Figs 4, S3A and S4.

Supporting information

S1 Table. Analysis information for 143 RBPs.

Table with all 143 cardiac expressed RBPs, number of mRNA and TE correlating targets and significance of correlations, clusters of coregulated RBPs, and average RNA expression levels in human left ventricle.

(XLSX)

S2 Table. Multifunctional RBPs.

Table with all 21 multifunctional RBPs, number of target genes per molecular trait, and names and significance of the best 5 GO enrichment results.

(XLSX)

S3 Table. Multifunctional RBPs’ localization.

Table with all 21 multifunctional RBPs and their cellular localization (0: absent; 1: present).

(XLSX)

S1 Data. RData object containing DESeq2 normalized poly(A) RNA-seq and Ribo-seq count matrices together, TE matrix (Ribo-seq/RNA-seq) and PSI count matrix.

To access this file, download R and load the RData object with the R function load: load(RData).

(ZIP)

S2 Data. RData object containing correlation matrices, global result- and effect-size tables for all mRNA-RBPs and TE-RBPs as well as a table with MFE values for all target transcripts for each TE-RBP.

To access this file, download R and load the RData object with the R function load: load(RData).

(ZIP)

S1 Fig. RNA-binding protein abundance predicts target translational regulation.

(A) Bar plot displaying the patterns of expression of the 143 RBPs across tissues. Average expression values in transcript per million (TPM) units were retrieved from the Genotype-Tissue Expression (GTEx) Project. Most of the RBPs are ubiquitously expressed across human tissues. (B) STRING protein-protein association networks from six coregulated RBP clusters (see also Fig 1A). Most of the clustered RBPs are involved in known functional interactions. (C) Heatmaps with Glass’ △ scores for all 37 TE-RBPs quantifying the effect size of the witness effects for significant TE correlations.

(TIF)

S2 Fig. CLIP analysis identifies coregulated in vivo targets of novel master regulators of translation in the human heart.

(A) Described functions by Van Nostrand et al. for the set of TE-RBPs. Functions related to translation (translation regulation and ribosome basic translation) are highlighted with dark red boxes. (B-C) Scatter plots representing the correlation of heart (B) and primary cardiac fibroblasts (C) translational efficiencies between UCHL5 and U2AF2 and two shared targets, KPNA4 and MYL6. UCHL5 and U2AF2 have marked opposite effects on their shared targets, indicative of a competitive effect replicated in two independent datasets. Scores and level of significance of the two Spearman’s correlations are displayed.

(TIF)

S3 Fig. Multifunctional RBPs regulate translation of distinct sets of target genes.

(A) Network representing multifunctional RBP-target interactions for both mRNA-RBPs (green) and TE-RBPs (brown) of strong correlating pairs. Blue lines indicate shared targets in both mRNA abundance and TE regulation of the same RBP. (B) Left: heatmap representing the average mRNA and TE RBP-target correlation values for all 21 multifunctional RBPs. Middle: heatmap representing differences in the relative proportion of feature binding sites (TE-mRNA) for all 21 multifunctional RBPs. Right: bar plot showing the overall proportion of feature binding sites for all 21 multifunctional RBPs. (C) Box plots with 5’ UTR, CDS, and 3’ UTR sequence lengths in nucleotides for mRNA and TE targets corresponding to the set of 21 multifunctional RBPs. For each target gene, the most abundant isoform is represented.

(TIF)

S4 Fig. Differential affinity of multifunctional RBPs for 5’ UTR structures often drives opposite quantitative TE effects.

(A) Box and violin plots with 5’ UTR lengths for positively and negatively correlated TE targets corresponding to DDX3X, EFTUD2, and PRPF8. (B) Box and violin plots with length normalized MFE scores for positively and negatively correlated TE targets. We subsampled sets of 50 genes per group and RBP, so each of the groups had a similar distribution of 5’ UTR lengths. For comparison, non-correlating target genes were included in the panel figure. (C) Enriched GO terms in the sets of positive and negative correlating targets for DDX3X, EFTUD2, and PRPF8. For each RBP, the 5 most significant GO terms are displayed.

(TIF)

Acknowledgments

We would like to thank Prof. Dr. Markus Landthaler for his helpful feedback and discussions.

Data Availability

This study includes no newly generated data deposited in external repositories. Accession numbers for all data used in this study have been reported within the Materials and Methods section with reference to the studies that generated and published the data. All scripts generated for the analyses are available via the GitHub development platform at: https://github.com/vschnei/Dual-function_RBP_manuscript_analysis.

Funding Statement

This work was supported by the European Union’s Horizon 2020 research and innovation programme [European Research Council (ERC) advanced grant, grant agreement n° AdG788970 to N.H.; https://erc.europa.eu]; the Leducq Foundation [11 CVD-01 to N.H.; https://www.fondationleducq.org]; and the "Bundesministerium für Bildung und Forschung" [grant CaRNAtion to N.H; https://www.dfg.de]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Gerstberger S, Hafner M, Tuschl T. A census of human RNA-binding proteins. Nat Rev Genet. 2014;15: 829–845. doi: 10.1038/nrg3813 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Caudron-Herger M, Rusin SF, Adamo ME, Seiler J, Schmid VK, Barreau E, et al. R-DeeP: Proteome-wide and Quantitative Identification of RNA-Dependent Proteins by Density Gradient Ultracentrifugation. Mol Cell. 2019;75: 184—199.e10. doi: 10.1016/j.molcel.2019.04.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Mallam AL, Sae-Lee W, Schaub JM, Tu F, Battenhouse A, Jang YJ, et al. Systematic Discovery of Endogenous Human Ribonucleoprotein Complexes. Cell Rep. 2019;29: 1351—1368.e5. doi: 10.1016/j.celrep.2019.09.060 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Trendel J, Schwarzl T, Horos R, Prakash A, Bateman A, Hentze MW, et al. The Human RNA-Binding Proteome and Its Dynamics during Translational Arrest. Cell. 2019;176: 391—403.e19. doi: 10.1016/j.cell.2018.11.004 [DOI] [PubMed] [Google Scholar]
  • 5.Castello A, Fischer B, Frese CK, Horos R, Alleaume A-M, Foehr S, et al. Comprehensive Identification of RNA-Binding Domains in Human Cells. Mol Cell. 2016;63: 696–710. doi: 10.1016/j.molcel.2016.06.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Beckmann BM, Castello A, Medenbach J. The expanding universe of ribonucleoproteins: of novel RNA-binding proteins and unconventional interactions. Pflugers Arch. 2016;468: 1029–1040. doi: 10.1007/s00424-016-1819-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kong X, Chen Y-Y, Lin J, Flowers E, Van Nostrand E, Blue SM, et al. The cohesin loader NIPBL interacts with pre-ribosomal RNA and treacle to regulate ribosomal RNA synthesis. bioRxiv. 2019; doi: 10.1101/658492 [DOI] [Google Scholar]
  • 8.Dassi E. Handshakes and Fights: The Regulatory Interplay of RNA-Binding Proteins. Front Mol Biosci. 2017;4: 67. doi: 10.3389/fmolb.2017.00067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Díaz-Muñoz MD, Turner M. Uncovering the Role of RNA-Binding Proteins in Gene Expression in the Immune System. Front Immunol. 2018;9: 1094. doi: 10.3389/fimmu.2018.01094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gray NK, Hrabálková L, Scanlon JP, Smith RWP. Poly(A)-binding proteins and mRNA localization: who rules the roost? Biochem Soc Trans. 2015;43: 1277–1284. doi: 10.1042/BST20150171 [DOI] [PubMed] [Google Scholar]
  • 11.Yugami M, Okano H, Nakanishi A, Yano M. Analysis of the nucleocytoplasmic shuttling RNA-binding protein HNRNPU using optimized HITS-CLIP method. PLoS One. 2020;15: e0231450. doi: 10.1371/journal.pone.0231450 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Van Nostrand EL, Freese P, Pratt GA, Wang X, Wei X, Xiao R, et al. A large-scale binding and functional map of human RNA-binding proteins. Nature. Springer US; 2020;583: 711–719. doi: 10.1038/s41586-020-2077-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Gilbertson S, Federspiel JD, Hartenian E, Cristea IM, Glaunsinger B. Changes in mRNA abundance drive shuttling of RNA binding proteins, linking cytoplasmic RNA degradation to transcription. Elife. 2018;7. doi: 10.7554/eLife.37663 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Backlund M, Stein F, Rettel M, Schwarzl T, Perez-Perri JI, Brosig A, et al. Plasticity of nuclear and cytoplasmic stress responses of RNA-binding proteins. Nucleic Acids Res. 2020;48: 4725–4740. doi: 10.1093/nar/gkaa256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Hinnebusch AG. The scanning mechanism of eukaryotic translation initiation. Annu Rev Biochem. 2014;83: 779–812. doi: 10.1146/annurev-biochem-060713-035802 [DOI] [PubMed] [Google Scholar]
  • 16.Harvey SE, Xu Y, Lin X, Gao XD, Qiu Y, Ahn J, et al. Coregulation of alternative splicing by hnRNPM and ESRP1 during EMT. RNA. 2018;24: 1326–1338. doi: 10.1261/rna.066712.118 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Shi Z, Fujii K, Kovary KM, Genuth NR, Röst HL, Teruel MN, et al. Heterogeneous Ribosomes Preferentially Translate Distinct Subpools of mRNAs Genome-wide. Mol Cell. 2017;67: 71—83.e7. doi: 10.1016/j.molcel.2017.05.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Simsek D, Tiu GC, Flynn RA, Byeon GW, Leppek K, Xu AF, et al. The Mammalian Ribo-interactome Reveals Ribosome Functional Diversity and Heterogeneity. Cell. 2017;169: 1051—1065.e18. doi: 10.1016/j.cell.2017.05.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Lim CS, T Wardell SJ, Kleffmann T, Brown CM. The exon-intron gene structure upstream of the initiation codon predicts translation efficiency. Nucleic Acids Res. 2018;46: 4575–4591. doi: 10.1093/nar/gky282 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wang SE, Brooks AES, Poole AM, Simoes-Barbosa A. Determinants of translation efficiency in the evolutionarily-divergent protist Trichomonas vaginalis. BMC Mol cell Biol. 2020;21: 54. doi: 10.1186/s12860-020-00297-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Leppek K, Das R, Barna M. Functional 5’ UTR mRNA structures in eukaryotic translation regulation and how to find them. Nat Rev Mol Cell Biol. 2018;19: 158–174. doi: 10.1038/nrm.2017.103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Pelletier J, Sonenberg N. Insertion mutagenesis to increase secondary structure within the 5’ noncoding region of a eukaryotic mRNA reduces translational efficiency. Cell. 1985;40: 515–526. doi: 10.1016/0092-8674(85)90200-4 [DOI] [PubMed] [Google Scholar]
  • 23.Mauger DM, Cabral BJ, Presnyak V, Su S V, Reid DW, Goodman B, et al. mRNA structure regulates protein expression through changes in functional half-life. Proc Natl Acad Sci U S A. 2019;116: 24075–24083. doi: 10.1073/pnas.1908052116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Cheng S, Fahmi NA, Park M, Chang J-W, Sun J, Thao K, et al. mTOR-driven widespread exon skipping renders multifaceted gene regulation and proteome complexity. bioRxiv. Cold Spring Harbor Laboratory; 2020; 2020.02.27.967737. doi: 10.1101/2020.02.27.967737 [DOI] [Google Scholar]
  • 25.Katahira J. Nuclear export of messenger RNA. Genes (Basel). 2015;6: 163–184. doi: 10.3390/genes6020163 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Drummond DA, Wilke CO. Mistranslation-induced protein misfolding as a dominant constraint on coding-sequence evolution. Cell. 2008;134. doi: 10.1016/j.cell.2008.05.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Qian W, Yang J-R, Pearson NM, Maclean C, Zhang J. Balanced codon usage optimizes eukaryotic translational efficiency. PLoS Genet. 2012;8: e1002603. doi: 10.1371/journal.pgen.1002603 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Nott A, Le Hir H, Moore MJ. Splicing enhances translation in mammalian cells: an additional function of the exon junction complex. Genes Dev. United States; 2004;18: 210–222. doi: 10.1101/gad.1163204 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Lewis BP, Green RE, Brenner SE. Evidence for the widespread coupling of alternative splicing and nonsense-mediated mRNA decay in humans. Proc Natl Acad Sci U S A. National Academy of Sciences; 2003;100: 189–192. doi: 10.1073/pnas.0136770100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Chothani S, Schäfer S, Adami E, Viswanathan S, Widjaja AA, Langley SR, et al. Widespread Translational Control of Fibrosis in the Human Heart by RNA-Binding Proteins. Circulation. 2019; doi: 10.1161/CIRCULATIONAHA.119.039596 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Luo E-C, Nathanson JL, Tan FE, Schwartz JL, Schmok JC, Shankar A, et al. Large-scale tethered function assays identify factors that regulate mRNA stability and translation. Nat Struct Mol Biol. 2020;27: 989–1000. doi: 10.1038/s41594-020-0477-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Sharma D, Zagore LL, Brister MM, Ye X, Crespo-Hernández CE, Licatalosi DD, et al. The kinetic landscape of an RNA-binding protein in cells. Nature. Nature Publishing Group; 2021; 1–5. doi: 10.1038/s41586-021-03222-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Maatz H, Jens M, Liss M, Schafer S, Heinig M, Kirchner M, et al. RNA-binding protein RBM20 represses splicing to orchestrate cardiac pre-mRNA processing. J Clin Invest. 2014;124: 3419–3430. doi: 10.1172/JCI74523 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.van Heesch S, Witte F, Schneider-Lunitz V, Schulz JF, Adami E, Faber AB, et al. The Translational Landscape of the Human Heart. Cell. United States, United States; 2019;178: 242–260.e29. doi: 10.1016/j.cell.2019.05.010 [DOI] [PubMed] [Google Scholar]
  • 35.Schafer S, Adami E, Heinig M, Rodrigues KEC, Kreuchwig F, Silhavy J, et al. Translational regulation shapes the molecular landscape of complex disease phenotypes. Nat Commun. 2015;6: 7200. doi: 10.1038/ncomms8200 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Freeman PR, Hedges L V, Olkin I. Statistical Methods for Meta-Analysis. Biometrics. 1986;42: 454. doi: 10.2307/2531069 [DOI] [Google Scholar]
  • 37.Sutherland LC, Wang K, Robinson AG. RBM5 as a putative tumor suppressor gene for lung cancer [Internet]. Journal of Thoracic Oncology. Lippincott Williams and Wilkins; 2010. pp. 294–298. doi: 10.1097/JTO.0b013e3181c6e330 [DOI] [PubMed] [Google Scholar]
  • 38.Liberman N, Marash L, Kimchi A. The translation initiation factor DAP5 is a regulator of cell survival during mitosis. Cell Cycle. 2009;8: 204–209. doi: 10.4161/cc.8.2.7384 [DOI] [PubMed] [Google Scholar]
  • 39.Weber R, Kleemann L, Hirschberg I, Chung M, Valkov E IC. DAP5 enables translation re-initiation on structured messenger RNAs. bioRxiv. 2021; [Google Scholar]
  • 40.Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, et al. STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 2019;47: D607—D613. doi: 10.1093/nar/gky1131 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Zhou Z, Dang Y, Zhou M, Li L, Yu C-H, Fu J, et al. Codon usage is an important determinant of gene expression levels largely through its effects on transcription. Proc Natl Acad Sci U S A. 2016;113: E6117—E6125. doi: 10.1073/pnas.1606724113 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Mo J, Liang H, Su C, Li P, Chen J, Zhang B. DDX3X: structure, physiologic functions and cancer. Mol Cancer. BioMed Central; 2021;20: 38. doi: 10.1186/s12943-021-01325-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Soto-Rifo R, Ohlmann T. The role of the DEAD-box RNA helicase DDX3 in mRNA metabolism. Wiley Interdiscip Rev RNA. Wiley Interdiscip Rev RNA; 2013;4: 369–385. doi: 10.1002/wrna.1165 [DOI] [PubMed] [Google Scholar]
  • 44.Buchan JR. mRNP granules. Assembly, function, and connections with disease. RNA Biol. 2014;11: 1019–1030. doi: 10.4161/15476286.2014.972208 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Cirillo L, Cieren A, Barbieri S, Khong A, Schwager F, Parker R, et al. UBAP2L Forms Distinct Cores that Act in Nucleating Stress Granules Upstream of G3BP1. Curr Biol. 2020;30: 698—707.e6. doi: 10.1016/j.cub.2019.12.020 [DOI] [PubMed] [Google Scholar]
  • 46.Yang W, Ru Y, Ren J, Bai J, Wei J, Fu S, et al. G3BP1 inhibits RNA virus replication by positively regulating RIG-I-mediated cellular antiviral response. Cell Death Dis. 2019;10: 946. doi: 10.1038/s41419-019-2178-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Müller-McNicoll M, Neugebauer KM. How cells get the message: dynamic assembly and function of mRNA-protein complexes. Nat Rev Genet. 2013;14: 275–287. doi: 10.1038/nrg3434 [DOI] [PubMed] [Google Scholar]
  • 48.Kamenska A, Simpson C, Vindry C, Broomhead H, Bénard M, Ernoult-Lange M, et al. The DDX6-4E-T interaction mediates translational repression and P-body assembly. Nucleic Acids Res. 2016/06/24. Oxford University Press; 2016;44: 6318–6334. doi: 10.1093/nar/gkw565 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Guenther U-P, Weinberg DE, Zubradt MM, Tedeschi FA, Stawicki BN, Zagore LL, et al. The helicase Ded1p controls use of near-cognate translation initiation codons in 5’ UTRs. Nature. 2018;559: 130–134. doi: 10.1038/s41586-018-0258-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Soto-Rifo R, Rubilar PS, Limousin T, de Breyne S, Décimo D, Ohlmann T. DEAD-box protein DDX3 associates with eIF4F to promote translation of selected mRNAs. EMBO J. 2012;31: 3745–3756. doi: 10.1038/emboj.2012.220 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Sen ND, Zhou F, Ingolia NT, Hinnebusch AG. Genome-wide analysis of translational efficiency reveals distinct but overlapping functions of yeast DEAD-box RNA helicases Ded1 and eIF4A. Genome Res. 2015;25: 1196–205. doi: 10.1101/gr.191601.115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Calviello L, Venkataramanan S, Rogowski KJ, Wyler E, Wilkins K, Tejura M, et al. DDX3 depletion represses translation of mRNAs with complex 5′ UTRs. Nucleic Acids Res. 2021;49: 5336–5350. doi: 10.1093/nar/gkab287 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Gong C, Krupka JA, Gao J, Grigoropoulos NF, Giotopoulos G, Asby R, et al. Sequential inverse dysregulation of the RNA helicases DDX3X and DDX3Y facilitates MYC-driven lymphomagenesis. Mol Cell. United States; 2021;81: 4059–4075.e11. doi: 10.1016/j.molcel.2021.07.041 [DOI] [PubMed] [Google Scholar]
  • 54.De Gaudenzi JG, Jäger A V, Izcovich R, Campo VA. Insights into the Regulation of mRNA Processing of Polycistronic Transcripts Mediated by DRBD4/PTB2, a Trypanosome Homolog of the Polypyrimidine Tract-Binding Protein. J Eukaryot Microbiol. 2016;63: 440–452. doi: 10.1111/jeu.12288 [DOI] [PubMed] [Google Scholar]
  • 55.Chen T-M, Lai M-C, Li Y-H, Chan Y-L, Wu C-H, Wang Y-M, et al. hnRNPM induces translation switch under hypoxia to promote colon cancer development. EBioMedicine. 2019;41: 299–309. doi: 10.1016/j.ebiom.2019.02.059 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Maslon MM, Heras SR, Bellora N, Eyras E, Cáceres JF. The translational landscape of the splicing factor SRSF1 and its role in mitosis. Elife. eLife Sciences Publications Ltd; 2014;2014. doi: 10.7554/eLife.02028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Kim J, Park RY, Chen JK, Kim J, Jeong S, Ohn T. Splicing factor SRSF3 represses the translation of programmed cell death 4 mRNA by associating with the 5′-UTR region. Cell Death Differ. Nature Publishing Group; 2014;21: 481–490. doi: 10.1038/cdd.2013.171 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Guo W, Schafer S, Greaser ML, Radke MH, Liss M, Govindarajan T, et al. RBM20, a gene for hereditary cardiomyopathy, regulates titin splicing. Nat Med. 2012;18: 766–773. doi: 10.1038/nm.2693 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Ray D, Kazan H, Cook KB, Weirauch MT, Najafabadi HS, Li X, et al. A compendium of RNA-binding motifs for decoding gene regulation. Nature. 2013;499: 172–177. doi: 10.1038/nature12311 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Matsuki H, Takahashi M, Higuchi M, Makokha GN, Oie M, Fujii M. Both G3BP1 and G3BP2 contribute to stress granule formation. Genes to Cells. John Wiley & Sons, Ltd; 2013;18: 135–146. doi: 10.1111/gtc.12023 [DOI] [PubMed] [Google Scholar]
  • 61.Ying S, Khaperskyy DA. UV damage induces G3BP1-dependent stress granule formation that is not driven by mTOR inhibition-mediated translation arrest. J Cell Sci. Company of Biologists Ltd; 2021;133. doi: 10.1242/jcs.248310 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Sahoo PK, Lee SJ, Jaiswal PB, Alber S, Kar AN, Miller-Randolph S, et al. Axonal G3BP1 stress granule protein limits axonal mRNA translation and nerve regeneration. Nat Commun. 2018;9: 3358. doi: 10.1038/s41467-018-05647-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Dong G, Liang F, Sun B, Wang C, Liu Y, Guan X, et al. Presence and function of stress granules in atrial fibrillation. PLoS One. Public Library of Science; 2019;14. doi: 10.1371/journal.pone.0213769 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Costa M, Ochem A, Staub A, Falaschi A. Human DNA helicase VIII: A DNA and RNA helicase corresponding to the G3BP protein, an element of the Ras transduction pathway. Nucleic Acids Res. 1999; doi: 10.1093/nar/27.3.817 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Fischer JW, Busa VF, Shao Y, Leung AKL. Structure-Mediated RNA Decay by UPF1 and G3BP1. Mol Cell. Cell Press; 2020;78: 70–84.e6. doi: 10.1016/j.molcel.2020.01.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Laver JD, Ly J, Winn AK, Angers S, Smibert CA, Lipshitz HD. The RNA-Binding Protein Rasputin/G3BP Enhances the Stability and Translation of Its Target mRNAs. CellReports. 2020;30: 3353–3367.e7. doi: 10.1016/j.celrep.2020.02.066 [DOI] [PubMed] [Google Scholar]
  • 67.Yedavalli VSRK, Neuveut C, Chi Y-H, Kleiman L, Jeang K-T. Requirement of DDX3 DEAD box RNA helicase for HIV-1 Rev-RRE export function. Cell. United States; 2004;119: 381–392. doi: 10.1016/j.cell.2004.09.029 [DOI] [PubMed] [Google Scholar]
  • 68.Shen L, Pelletier J. General and Target-Specific DExD/H RNA Helicases in Eukaryotic Translation Initiation. Int J Mol Sci. 2020;21. doi: 10.3390/ijms21124402 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Hilliker A, Gao Z, Jankowsky E, Parker R. The DEAD-Box Protein Ded1 Modulates Translation by the Formation and Resolution of an eIF4F-mRNA Complex. Mol Cell. 2011;43: 962–972. doi: 10.1016/j.molcel.2011.08.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Malinová A, Cvačková Z, Matějů D, Hořejší Z, Abéza C, Vandermoere F, et al. Assembly of the U5 snRNP component PRPF8 is controlled by the HSP90/R2TP chaperones. J Cell Biol. 2017;216: 1579–1596. doi: 10.1083/jcb.201701165 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Fabrizio P, Laggerbauer B, Lauber J, Lane WS, Lührmann R. An evolutionarily conserved U5 snRNP-specific protein is a GTP-binding factor closely related to the ribosomal translocase EF-2. EMBO J. 1997;16: 4092–4106. doi: 10.1093/emboj/16.13.4092 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Yao T, Song L, Jin J, Cai Y, Takahashi H, Swanson SK, et al. Distinct Modes of Regulation of the Uch37 Deubiquitinating Enzyme in the Proteasome and in the Ino80 Chromatin-Remodeling Complex. Mol Cell. Mol Cell; 2008;31: 909–917. doi: 10.1016/j.molcel.2008.08.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Copsey AC, Cooper S, Parker R, Lineham E, Lapworth C, Jallad D, et al. The helicase, DDX3X, interacts with poly(A)-binding protein 1 (PABP1) and caprin-1 at the leading edge of migrating fibroblasts and is required for efficient cell spreading. Biochem J. 2017;474: 3109–3120. doi: 10.1042/BCJ20170354 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Damianov A, Ying Y, Lin C-H, Lee J-A, Tran D, Vashisht AA, et al. Rbfox Proteins Regulate Splicing as Part of a Large Multiprotein Complex LASR. Cell. 2016;165: 606–619. doi: 10.1016/j.cell.2016.03.040 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Kim Y, Myong S. RNA Remodeling Activity of DEAD Box Proteins Tuned by Protein Concentration, RNA Length, and ATP. Mol Cell. 2016;63: 865–876. doi: 10.1016/j.molcel.2016.07.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Burgess HM, Richardson WA, Anderson RC, Salaun C, Graham S V, Gray NK. Nuclear relocalisation of cytoplasmic poly(A)-binding proteins PABP1 and PABP4 in response to UV irradiation reveals mRNA-dependent export of metazoan PABPs. J Cell Sci. 2011;124: 3344–3355. doi: 10.1242/jcs.087692 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Hannon GJ. FASTX-Toolkit. [Internet]. 2010. Available: http://hannonlab.cshl.edu/fastx_toolkit
  • 78.Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29: 15–21. doi: 10.1093/bioinformatics/bts635 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Anders S, Pyl PT, Huber W. HTSeq—a Python framework to work with high-throughput sequencing data. Bioinformatics. Oxford University Press; 2015;31: 166–169. doi: 10.1093/bioinformatics/btu638 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15: 550. doi: 10.1186/s13059-014-0550-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Zhong Y, Karaletsos T, Drewe P, Sreedharan VT, Kuo D, Singh K, et al. RiboDiff: Detecting changes of mRNA translation efficiency from ribosome footprints. Bioinformatics. 2017; doi: 10.1093/bioinformatics/btw585 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26: 841–842. doi: 10.1093/bioinformatics/btq033 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Li Q, Brown JB, Huang H, Bickel PJ. Measuring reproducibility of high-throughput experiments. Ann Appl Stat. 2011;5: 1752–1779. doi: 10.1214/11-AOAS466 [DOI] [Google Scholar]
  • 84.Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B. 1995;57. [Google Scholar]
  • 85.GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat Genet. 2013;45: 580–585. doi: 10.1038/ng.2653 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Schafer S, Miao K, Benson CC, Heinig M, Cook SA, Hubner N. Alternative Splicing Signatures in RNA-seq Data: Percent Spliced in (PSI). Curr Protoc Hum Genet. 2015;87: 11.16.1–11.16.14. doi: 10.1002/0471142905.hg1116s87 [DOI] [PubMed] [Google Scholar]
  • 87.Raudvere U, Kolberg L, Kuzmin I, Arak T, Adler P, Peterson H, et al. g:Profiler: a web server for functional enrichment analysis and conversions of gene lists (2019 update). Nucleic Acids Res. 2019;47: W191–W198. doi: 10.1093/nar/gkz369 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Lorenz R, Bernhart SH, Höner zu Siederdissen C, Tafer H, Flamm C, Stadler PF, et al. ViennaRNA Package 2.0. Algorithms Mol Biol. 2011;6: 26. doi: 10.1186/1748-7188-6-26 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.R Development Core Team. R: a language and environment for statistical computing. [Internet]. 2016.
PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009658.r001

Decision Letter 0

Greg Tucker-Kellogg, Daniel A Beard

28 Sep 2021

Dear van Heesch,

Thank you very much for submitting your manuscript "Multifunctional RNA-binding proteins influence mRNA abundance and translational efficiency of distinct sets of target genes" for consideration at PLOS Computational Biology. As with all papers reviewed by the journal, your manuscript was reviewed by members of the editorial board and by several independent reviewers. The reviewers appreciated the attention to an important topic. Based on the reviews, we are likely to accept this manuscript for publication, providing that you modify the manuscript according to the review recommendations.

Please prepare and submit your revised manuscript within 30 days. If you anticipate any delay, please let us know the expected resubmission date by replying to this email.

When you are ready to resubmit, please upload the following:

[1] A letter containing a detailed list of your responses to all review comments, and a description of the changes you have made in the manuscript. Please note while forming your response, if your article is accepted, you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out

[2] Two versions of the revised manuscript: one with either highlights or tracked changes denoting where the text has been changed; the other a clean version (uploaded as the manuscript file).

Important additional instructions are given below your reviewer comments.

Thank you again for your submission to our journal. We hope that our editorial process has been constructive so far, and we welcome your feedback at any time. Please don't hesitate to contact us if you have any questions or comments.

Sincerely,

Greg Tucker-Kellogg, PhD

Associate Editor

PLOS Computational Biology

Daniel Beard

Deputy Editor

PLOS Computational Biology

***********************

A link appears below if there are any accompanying review attachments. If you believe any reviews to be missing, please contact ploscompbiol@plos.org immediately:

[LINK]

Reviewer's Responses to Questions

Comments to the Authors:

Please note here if the review is uploaded as an attachment.

Reviewer #1: Attached

Reviewer #2: Multifunctional RNA-binding proteins influence mRNA abundance and translational efficiency of distinct sets of target genes

by Schneider-Lunitz et al.

Schneider-Lunitz et al. combine the transcriptomes and translatomes of human hearts with the target preferences of human RBPs to deduce potential associations and identify new multi-functional regulators of mRNA stability and translation. Genome-wide measurements for mRNA abundance and translational efficiency (derived from matched Ribo-seq and RNA-seq) for 80 human hearts were taken from the authors’ previous study (Van Heesch et al, Cell, 2019). RBP targets were extracted from publicly available eCLIP data for 142 RBPs, plus RBM20 from an independent dataset. Using correlation analysis (Spearman), the authors identify significant associations between the expression of a given RBP and the mRNA abundance (58 RBP-mRNA associations) or translation efficiency (37 RBP-TE associations) of its targets. They find that many RBPs show shared modes of regulation, having overlapping targets with opposing or concordant directionality of regulation. Moreover, they identify 21 RBPs as multifunctional RBPs, which influence mRNA abundance and translational efficiency for different subsets of target RNAs.

Overall, this is an interesting and very well performed study. The manuscript is well written and the figures are of high quality. The analyses are based on solid data and statistical approaches. Specific points that would enhance the impact of the study are listed below.

Based on the high similarity in the regulatory patterns of some RBPs (Figure 2AB), the question arises whether such results could be influenced by colinearity in the RBPs’ expression and/or a certain fuzziness in the assignment of the target genes for some RBPs. The impact of colinearity could for instance be assessed by partial correlations for pairs of similar RBPs to test whether RBP1 still shows a significant association with its target genes when correlations are corrected for RBP2. Regarding the second factor, the validity of target gene assignment for a given RBP could be assessed by cross-testing target genes, i.e., correlating the target genes of RBP1 against RBP2 and vice versa.

Minor comments:

Figure 2A: How many and which genes went into this analysis? Is this the union of target genes of all RBPs or some selection?

“For instance, depending on the shared target gene bound by the splicing factor U2AF2 and the protease UCHL5, completely opposite effects on TE could be observed and independently replicated (Figure 2C, Figure S2B and S2C).”

Reference to Figure 2C seems wrong.

Figure 2D/E: Extend legends to clarify what units were used for the scatter plots

Figure S3B: This figure requires more explanation. What exactly is shown in the middle panel? It seems that some RBPs show a region-dependent effect. For instance, as expected, SRSF7 seems to tend to regulate mRNA abundance via intronic binding and translational efficiency via CDS binding.

Figure 4B: MFE values have been normalised by length, but it is unclear whether this really works. The authors should check for differences in lengths between categories and use length-matched subsamples if needed. Also, what is the MFE distribution for non-correlated genes for comparison?

Reviewer #3: This manuscript presents a bioinformatics study investigating 143 RNA-binding proteins’ (RBPs) transcriptional and translational regulation roles in human heart. The authors re-analyzed previously published paired Ribo-seq and RNA-seq data from 80 human hearts and interrogated the correlation between expression levels of the 143 cardiac-expressed RBPs and the expression level and translational efficiency (TE) of their corresponding RNA-binding targets (obtained from ENCODE database, CLIP-seq approaches). In total, 58 RBPs were found to have significant associations with target mRNA abundance (denoted as “mRNA-RBPs”) and 37 RBPs with significant associations with target translational efficiencies (denoted as “TE-RBPs”), among which 21 RBPs were found in both categories and denoted as multifunctional RBPs. These includes 27 novel TE-RBPs, previously unknown for translational control process, including a muscle-specific and dilated cardiomyopathy-associated splicing regulator RBM20. The authors showed the multifunctional RBPs are context-specific where their functional outcome depends on the set of mRNAs they target, and their target genes also appear to be regulated independently, facilitated by a differential affinity for coding sequence length or 5’ UTR structure. In summary, this study gives a comprehensive analysis on the RBP regulatory roles in human heart based on current available transcriptome, translational efficiency and RNA-binding data with rigorous experiment design and statistical tests. It provides a list of potential TE-RBPs, multifunctional RBPs and many interesting insights of how these RBPs regulating the expression and translation, which could be further validated and studied in this field.

Major Comments:

1. This study comes up with a very interesting sets of RBPs that potentially participate in the expression and translation control of the mRNAs and provides the directionality information as well. I was wondering how this list overlaps with previously published screen studies that focused on the RBP functions and that also showed directionality, such as the study Luo et al. in Nature Structural & Molecular Biology (2020) and other similar studies.

Minor comments:

1. In the “Multifunctional RBPs including DDX3X and G3BP1 regulate mRNA abundance and translational efficiency of independent sets of target genes” of Results section, I have some problem in understanding the expression “which decreased further for the most strongly correlating targets”( line 11 of the paragraph) and Figure S3 A. I was wondering if the author could explain a bit more, highlight the “most strongly correlating targets” in the corresponding figure or something that can help readers better understand the results.

2. In the discussion, the authors mentioned all discovered multifunctional RBPs are ubiquitously expressed across human tissues, but I did not find the data or plots presenting this result in the manuscript. I suggest it would be great if the authors could show the results in the main figures or supplementary figures (e.g. using heatmap and GTEx data).

**********

Have the authors made all data and (if applicable) computational code underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data and code underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data and code should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data or code —e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Figure Files:

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email us at figures@plos.org.

Data Requirements:

Please note that, as a condition of publication, PLOS' data policy requires that you make available all data used to draw the conclusions outlined in your manuscript. Data must be deposited in an appropriate repository, included within the body of the manuscript, or uploaded as supporting information. This includes all numerical values that were used to generate graphs, histograms etc.. For an example in PLOS Biology see here: http://www.plosbiology.org/article/info%3Adoi%2F10.1371%2Fjournal.pbio.1001908#s5.

Reproducibility:

To enhance the reproducibility of your results, we recommend that you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option to publish peer-reviewed clinical study protocols. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

References:

Review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the rebuttal letter that accompanies your revised manuscript.

If you need to cite a retracted article, indicate the article’s retracted status in the References list and also include a citation and full reference for the retraction notice.

Attachment

Submitted filename: Review.docx

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009658.r003

Decision Letter 1

Greg Tucker-Kellogg, Daniel A Beard

18 Nov 2021

Dear van Heesch,

We are pleased to inform you that your manuscript 'Multifunctional RNA-binding proteins influence mRNA abundance and translational efficiency of distinct sets of target genes' has been provisionally accepted for publication in PLOS Computational Biology.

Before your manuscript can be formally accepted you will need to complete some formatting changes, which you will receive in a follow up email. A member of our team will be in touch with a set of requests.

Please note that your manuscript will not be scheduled for publication until you have made the required changes, so a swift response is appreciated.

IMPORTANT: The editorial review process is now complete. PLOS will only permit corrections to spelling, formatting or significant scientific errors from this point onwards. Requests for major changes, or any which affect the scientific understanding of your work, will cause delays to the publication date of your manuscript.

Should you, your institution's press office or the journal office choose to press release your paper, you will automatically be opted out of early publication. We ask that you notify us now if you or your institution is planning to press release the article. All press must be co-ordinated with PLOS.

Thank you again for supporting Open Access publishing; we are looking forward to publishing your work in PLOS Computational Biology. 

Best regards,

Greg Tucker-Kellogg, PhD

Associate Editor

PLOS Computational Biology

Daniel Beard

Deputy Editor

PLOS Computational Biology

***********************************************************

PLoS Comput Biol. doi: 10.1371/journal.pcbi.1009658.r004

Acceptance letter

Greg Tucker-Kellogg, Daniel A Beard

29 Nov 2021

PCOMPBIOL-D-21-01425R1

Multifunctional RNA-binding proteins influence mRNA abundance and translational efficiency of distinct sets of target genes

Dear Dr van Heesch,

I am pleased to inform you that your manuscript has been formally accepted for publication in PLOS Computational Biology. Your manuscript is now with our production department and you will be notified of the publication date in due course.

The corresponding author will soon be receiving a typeset proof for review, to ensure errors have not been introduced during production. Please review the PDF proof of your manuscript carefully, as this is the last chance to correct any errors. Please note that major changes, or those which affect the scientific understanding of the work, will likely cause delays to the publication date of your manuscript.

Soon after your final files are uploaded, unless you have opted out, the early version of your manuscript will be published online. The date of the early version will be your article's publication date. The final article will be published to the same URL, and all versions of the paper will be accessible to readers.

Thank you again for supporting PLOS Computational Biology and open-access publishing. We are looking forward to publishing your work!

With kind regards,

Agnes Pap

PLOS Computational Biology | Carlyle House, Carlyle Road, Cambridge CB4 3DN | United Kingdom ploscompbiol@plos.org | Phone +44 (0) 1223-442824 | ploscompbiol.org | @PLOSCompBiol

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Table. Analysis information for 143 RBPs.

    Table with all 143 cardiac expressed RBPs, number of mRNA and TE correlating targets and significance of correlations, clusters of coregulated RBPs, and average RNA expression levels in human left ventricle.

    (XLSX)

    S2 Table. Multifunctional RBPs.

    Table with all 21 multifunctional RBPs, number of target genes per molecular trait, and names and significance of the best 5 GO enrichment results.

    (XLSX)

    S3 Table. Multifunctional RBPs’ localization.

    Table with all 21 multifunctional RBPs and their cellular localization (0: absent; 1: present).

    (XLSX)

    S1 Data. RData object containing DESeq2 normalized poly(A) RNA-seq and Ribo-seq count matrices together, TE matrix (Ribo-seq/RNA-seq) and PSI count matrix.

    To access this file, download R and load the RData object with the R function load: load(RData).

    (ZIP)

    S2 Data. RData object containing correlation matrices, global result- and effect-size tables for all mRNA-RBPs and TE-RBPs as well as a table with MFE values for all target transcripts for each TE-RBP.

    To access this file, download R and load the RData object with the R function load: load(RData).

    (ZIP)

    S1 Fig. RNA-binding protein abundance predicts target translational regulation.

    (A) Bar plot displaying the patterns of expression of the 143 RBPs across tissues. Average expression values in transcript per million (TPM) units were retrieved from the Genotype-Tissue Expression (GTEx) Project. Most of the RBPs are ubiquitously expressed across human tissues. (B) STRING protein-protein association networks from six coregulated RBP clusters (see also Fig 1A). Most of the clustered RBPs are involved in known functional interactions. (C) Heatmaps with Glass’ △ scores for all 37 TE-RBPs quantifying the effect size of the witness effects for significant TE correlations.

    (TIF)

    S2 Fig. CLIP analysis identifies coregulated in vivo targets of novel master regulators of translation in the human heart.

    (A) Described functions by Van Nostrand et al. for the set of TE-RBPs. Functions related to translation (translation regulation and ribosome basic translation) are highlighted with dark red boxes. (B-C) Scatter plots representing the correlation of heart (B) and primary cardiac fibroblasts (C) translational efficiencies between UCHL5 and U2AF2 and two shared targets, KPNA4 and MYL6. UCHL5 and U2AF2 have marked opposite effects on their shared targets, indicative of a competitive effect replicated in two independent datasets. Scores and level of significance of the two Spearman’s correlations are displayed.

    (TIF)

    S3 Fig. Multifunctional RBPs regulate translation of distinct sets of target genes.

    (A) Network representing multifunctional RBP-target interactions for both mRNA-RBPs (green) and TE-RBPs (brown) of strong correlating pairs. Blue lines indicate shared targets in both mRNA abundance and TE regulation of the same RBP. (B) Left: heatmap representing the average mRNA and TE RBP-target correlation values for all 21 multifunctional RBPs. Middle: heatmap representing differences in the relative proportion of feature binding sites (TE-mRNA) for all 21 multifunctional RBPs. Right: bar plot showing the overall proportion of feature binding sites for all 21 multifunctional RBPs. (C) Box plots with 5’ UTR, CDS, and 3’ UTR sequence lengths in nucleotides for mRNA and TE targets corresponding to the set of 21 multifunctional RBPs. For each target gene, the most abundant isoform is represented.

    (TIF)

    S4 Fig. Differential affinity of multifunctional RBPs for 5’ UTR structures often drives opposite quantitative TE effects.

    (A) Box and violin plots with 5’ UTR lengths for positively and negatively correlated TE targets corresponding to DDX3X, EFTUD2, and PRPF8. (B) Box and violin plots with length normalized MFE scores for positively and negatively correlated TE targets. We subsampled sets of 50 genes per group and RBP, so each of the groups had a similar distribution of 5’ UTR lengths. For comparison, non-correlating target genes were included in the panel figure. (C) Enriched GO terms in the sets of positive and negative correlating targets for DDX3X, EFTUD2, and PRPF8. For each RBP, the 5 most significant GO terms are displayed.

    (TIF)

    Attachment

    Submitted filename: Review.docx

    Attachment

    Submitted filename: Response_to_reviewers.docx

    Data Availability Statement

    This study includes no newly generated data deposited in external repositories. Accession numbers for all data used in this study have been reported within the Materials and Methods section with reference to the studies that generated and published the data. All scripts generated for the analyses are available via the GitHub development platform at: https://github.com/vschnei/Dual-function_RBP_manuscript_analysis.


    Articles from PLoS Computational Biology are provided here courtesy of PLOS

    RESOURCES