RNA-binding proteins (RBPs) interact with mRNA to form supramolecular complexes called messenger ribonucleoprotein (mRNP) particles. These dynamic assemblies direct and regulate individual steps of gene expression; however, their composition and functional importance remain largely unknown. Here, we develop a total internal reflection fluorescence-based single-molecule imaging assay to investigate stoichiometry and co-occupancy of 15 RBPs within mRNPs from Saccharomyces cerevisiae. We show compositional heterogeneity of single mRNPs and plasticity across different growth conditions, with major co-occupants of mRNPs containing the nuclear cap-binding complex identified as Yra1 (1–10 copies), Nab2 (1–6 copies), and Npl3 (1–6 copies). Multicopy Yra1-bound mRNPs are specifically co-occupied by the THO complex and assembled on mRNAs biased by transcript length and RNA secondary structure. Yra1 depletion results in decreased compaction of nuclear mRNPs demonstrating a packaging function. Together, we provide a quantitative framework for gene- and condition-dependent RBP occupancy and stoichiometry in individual nuclear mRNPs.
In brief
Asada et al. characterize the stoichiometry and co-occupancy of RBPs related to mRNA biogenesis and export in single mRNPs, identifying gene- and condition-dependent plasticity. Of these RBPs, Yra1 demonstrates highly variable stoichiometry and is required for mRNP compaction, highlighting the role of varied RBP composition in mRNP organization.
Regulated gene expression emerges from the summation of transcription, mRNA processing, export, localization, translation, and decay. These events are directed by dynamic interactions involving RNAs, RNA-binding proteins (RBPs), and associated factors in the form of messenger ribonucleoprotein (mRNP) particles.1 RBPs engage pre-mRNAs co-transcriptionally to facilitate transcript maturation (e.g., 5′ capping, splicing, cleavage, and polyadenylation).2–4 Progress along the gene expression pathway from nucleus to cytoplasm is further accompanied by the gain and loss of RBPs from mRNPs to direct steps within the gene expression pathway that includes export, translation, and decay.3–5 In S. cerevisiae, core RBPs engaged in mRNP biogenesis and export have been identified and well characterized (summarized in Figure S1), which are also well conserved in metazoans. Recent proteome-wide analyses of mRNA-bound RBPs have identified thousands of RBPs in both yeast and human cells,6–17 emphasizing the complexity and diversity of post-transcriptional gene regulation organized by RBP interaction networks.
Transcriptome-wide approaches have cataloged the mRNAs bound by RBPs including the binding position of RBPs along transcripts.18–20 These data have provided insights into the in vivo RNA-binding patterns of mRNP biogenesis and export factors (Figure S1); however, they represent collective interactions across mRNPs at all stages of gene expression. This limits the conclusions that can be made about individual mRNPs, since it is not possible to infer from these data if two RBPs mapping to the same transcripts were ever present in the same mRNP. This leaves questions about the functional role of mRNP heterogeneity across transcripts generated from different genes, or the same gene in the context of regulated gene expression, unaddressed. As such, it remains a major challenge to understand how mRNP architecture (i.e., the RBPs present in an mRNP, their stoichiometry, and overall topological organization) facilitates regulated gene expression at the level of the functional unit, which is an individual RNA-protein complex.
One aspect of mRNP architecture to consider is the spatial organization of the mRNA itself. It is known that the positioning of transcript regions in proximity to each other promotes pre-mRNA splicing and translation regulation.21,22 For example, recent single-molecule fluorescence in situ hybridization (smFISH) studies have documented changes in mRNP compaction in relation to transcription, export, and translation status.23–25 Proper packaging and compaction of the mRNA is likely to have benefits that include preventing the mRNA from being targeted for decay, promoting efficient export through a nuclear pore complex (NPC), and preventing intra- or intermolecular interactions that interfere with gene expression.26,27 Electron microscopy studies have shown that the ~35-kb Balbiani ring mRNA in C. tentans is compacted ~200-fold into a nuclear mRNP with a diameter of ~50 nm that changes shape during nuclear export.28,29 S. cerevisiae mRNPs are similarly compacted into a heterogeneous set of particles with dimensions that correlate with transcript length.30,31 In humans, data suggest that the exon junction complex (EJC) and serine and arginine-rich (SR) proteins cooperate to promote mRNA packaging32,33; however, a lack of EJC components in S. cerevisiae indicates that other RBPs must fulfill this function.34 While a molecular understanding of mRNP architecture is lacking, it is expected that one or more RBPs form a structural scaffold that organizes the mRNA.27 The identification of RBPs associated with mRNA biogenesis (described in Figure S1) provides candidates for mRNP packaging.5,26 Notably, two recent studies have defined protein-protein and protein-RNA interaction networks in human cells and S. cerevisiae that would promote mRNA packaging, with both studies identifying Yra1 (Aly/REF) and the TREX complex as key packaging components.30,35 Still, the composition, organization, and heterogeneity among individual nuclear mRNPs remain largely unknown.
Here, a total internal reflection fluorescence (TIRF)-based imaging method termed mRNP single-molecule pull-down (mRNP-SiMPull) was developed to assess RBP stoichiometry and co-occupancy within individual mRNPs isolated from Saccharomyces cerevisiae. Using this approach, the stoichiometries of 15 RBPs within cap-binding complex (CBC) containing mRNPs were measured, the co-occupancy of select RBPs was determined, and plasticity in RBP composition was demonstrated in response to altered growth conditions. Our quantitative measures of individual mRNP compositions demonstrate that nuclear mRNPs are highly heterogeneous with a common set of RBP constituents (i.e., CBC, Npl3, Yra1, and Nab2) with varied occupancy and stoichiometry with other RBPs. Of these, Yra1 is a highly variable copy number component displaying gene feature-dependent occupancy that is required for the formation of export-competent compacted nuclear mRNPs.
Establishment of the mRNP single-molecule pull-down (mRNP-SiMPull) methodology
A TIRF-based imaging method, mRNP-SiMPull, was established for the quantitative analysis of single mRNPs (Figure 1A), which was motivated by published methods aimed at the quantitation of protein complexes (SiMPull) and reconstituted splicing reactions (CoSMoS).36,37 Briefly, yeast cells are broken by cryogrinding, and complexes are isolated from a minimally cross-linked formaldehyde-treated lysate via pull-down of the nuclear CBC (composed of a Cbp80 and Cbp20 heterodimer38) using protein A (PrA)-tagged Cbp80 (Cbp80-PrA). Subsequently, mRNPs are eluted from the beads by proteolytic cleavage and loaded onto a glass slide where individual mRNPs are captured by an antibody directed against a second RBP of interest tagged with mNeonGreen (mNG) or fluorescently labeled SNAP tag (SNAPf). Captured material is imaged using TIRF microscopy. Since fluorescent molecules photobleach stochastically, the counting of photobleaching steps provides RBP stoichiometry information in each detected spot (Figure S2A). Co-localization analysis can be performed using strains that express RBPs differentially tagged with mNG and SNAPf to provide RBP co-occupancy information within individual mRNP complexes. Importantly, each time mRNP-SiMPull is performed, data from a control strain lacking a PrA-tagged subunit (Cbp80 or other target RBP) are collected. These data are used to perform a background subtraction, accounting for non-specific binding of fluorescent RBPs and daily variability in extract and slide preparation (see STAR Methods).
Figure 1. mRNP-SiMPull for characterization of in vivo mRNP composition.
(A) Schematic representation of mRNP-SiMPull procedure for isolating mRNPs from yeast cells and single-molecule imaging of RBP components.
(B) Cartoon schematic of (i) Nab2-mNG imaging in mRNP-SiMPull with IgG-beads targeting nuclear cap-binding complex component Cbp80-PrA followed by (ii) mRNP capture via mNG antibody on the glass surface. Representative TIRF images of Nab2-mNG obtained by mRNP-SiMPull from cell lysates expressing Cbp80-PrA or untagged Cbp80 (no tag) with or without RNase A treatment. Graph shows the number of detected spots in triplicate experiments with mean and standard deviation (error bar).
(C) A PrA-mNG-GST-SNAPf-3HA fusion protein was used to determine fluorescent reporter activity in the mRNP-SiMPull procedure. Images show PrA-mNG-GST-SNAPf-3HA captured on the TIRF slide via hemagglutinin (HA) antibody after labeling with SNAP-surface 549. Co-localization of mNG and SNAPf tag spots was calculated with the mean and standard deviations shown from triplicate experiments.
(D) Nop58-mNG and mNG-Snu13 imaging with Nop56-PrA pull-down in mRNP-SiMPull. Pull-down was performed by (i) IgG-beads targeting Nop56-PrA followed by (ii) mNG antibody capture of snoRNP complexes on the glass surface. Representative TIRF images of Nop58-mNG and mNG-Snu13 analyzed in a Nop56-PrA pull-down and by mRNP-SiMPull.
(E) Stoichiometry distribution of Nop58 and Snu13 in Nop56 pull-down analyzed by photobleaching steps analysis. Blue bars show mean data with standard deviation with dots showing individual data points in triplicate experiments. Orange line displays the expected complex stoichiometry distribution following correction for fluorescent reporter activity using finite mixture modeling. Image scale bars, 5 μm.
To begin to assess RBPs with mRNP-SiMPull, the presence of the nuclear poly(A)-RNA-binding protein (PABP) Nab239 was assayed within Cbp80-containing mRNPs (i.e., isolation of capped and polyadenylated mRNPs). Using a Cbp80-PrA/Nab2-mNG strain, complexes of varied brightness were captured and detected, which were absent from a control strain lacking the PrA tag (Figure 1B). The Nab2-mNG spots were reduced to background levels upon addition of RNase A, indicating that detected signals represent mRNPs (Figure 1B). To confirm physiological relevance, mixing experiments were performed using lysates that individually contained Cbp80-PrA or Nab2-mNG. Upon mixing, detected Nab2-mNG spot numbers were comparable to the untagged control (Figure S2B), which was also the case for another RBP, the export adaptor Yra1 (Figure S2C). These data indicate that spots detected in Cbp80-PrA purifications represent mRNPs generated in the living cell, not RBP-mRNA interactions generated in vitro or non-specific interactions captured by crosslinking.
RBP dissociation from isolated mRNPs during the mRNP-SiMPull procedure is a possibility; hence Nab2 stoichiometry was assessed in a normally processed sample (~50 min) and after an extended incubation after immunoprecipitation (IP) (~100 min total). The extended incubation did not alter the Nab2-mNG stoichiometry distribution pattern (Figure S2D), indicating that mRNP architecture information was maintained. It was also observed that Nab2 stoichiometry values were not impacted by the tag used (e.g., mNG vs. SNAPf, Figure S2E), and measured fluorescent intensities of isolated mRNPs correlated with the number of observed photobleaching steps (Figure S2F). These data support the validity of the step-counting data.
To further ensure the accuracy of collected data, which may be impacted by inactive fluorescent reporters, the activities of mNG and SNAPf were assessed using a yeast strain expressing a PrA-mNG-GST-SNAPf-3HA fusion protein (Figure 1C). Following the mRNP-SiMPull procedure, detected SNAP-labeled spots were 74% ± 7% positive for mNG, suggesting ~25% of mNG molecules were inactive, which closely matches what has been reported for GFP.40 Similarly, detected mNG spots were 78% ± 6% positive for SNAPf. With these estimates, RBP stoichiometry measurements can be corrected to compensate for the probability of mRNPs containing a non-fluorescent molecule, which was accomplished using a finite mixture model of truncated binomials (see STAR Methods).
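The logic of this correction can be illustrated with a minimal sketch. It assumes that each reporter is independently fluorescent with a fixed probability (the reporter activity) and that complexes with no active reporter go undetected, so the observed step count for a complex with a given true copy number follows a zero-truncated binomial. The function names are hypothetical, and the sketch only computes the expected observed distribution for a given mixture; it does not reproduce the fitting procedure described in STAR Methods.

```python
from math import comb

def observed_step_distribution(n_copies, activity):
    """P(k observed steps | n_copies true copies, complex detected), k = 1..n_copies.

    Zero-truncated binomial: complexes with no active reporter are invisible,
    so the k = 0 outcome is removed and the rest is renormalized.
    """
    raw = [comb(n_copies, k) * activity**k * (1 - activity)**(n_copies - k)
           for k in range(1, n_copies + 1)]
    detectable = sum(raw)  # equals 1 - (1 - activity)**n_copies
    return [p / detectable for p in raw]

def mixture_step_distribution(weights, activity):
    """Expected observed step histogram for a mixture of true copy numbers.

    weights[i] is the assumed fraction of detected complexes carrying
    i + 1 true copies; the result has the same length as weights.
    """
    observed = [0.0] * len(weights)
    for i, weight in enumerate(weights):
        for k, p in enumerate(observed_step_distribution(i + 1, activity)):
            observed[k] += weight * p
    return observed

# Illustrative use with the measured mNG activity (74%) and a made-up
# mixture of 50% one-copy and 50% two-copy complexes.
print(mixture_step_distribution([0.5, 0.5], 0.74))
```

Fitting the weights so that the predicted histogram matches the observed photobleaching steps is the inverse problem addressed by the finite mixture model.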
To demonstrate the application of mRNP-SiMPull, Nop56-PrA was used with Nop58-mNG or Snu13-mNG to isolate box C/D snoRNP complexes, within which Nop58 and Snu13 exist in a 1:2 ratio.41 Detected spots of Snu13 were brighter than Nop58 spots (Figure 1D), and uncorrected photobleaching step analysis data indicated a Snu13 dimer, which upon correction for reporter activity clearly reproduces the expected 1:2 ratio of Nop58 and Snu13 in a box C/D snoRNP complex (Figure 1E). Of note, a fraction of Nop58 and Snu13 spots showed higher stoichiometries, which are expected to represent molecular assemblies related to ribosome biogenesis.42
Together, the observed RNase sensitivity of analyzed complexes, requirement that RBPs are co-expressed in the same cell, stability of detected complexes, reproducibility of the data generated by different fluorescent reporters, and measurement of known stoichiometries demonstrate that mRNP-SiMPull provides information on RNPs formed in vivo.
mRNP biogenesis and export factor occupancy in single mRNPs
With the ability to collect data on in vivo mRNP compositions, the occupancy of known mRNA biogenesis and export-related factors was tested by mRNP-SiMPull using Cbp80-PrA (member of the CBC) with mNG-tagged RBPs. It is important to note that the CBC remains bound to an mRNA post export until replacement by translation initiation factors,43,44 so these data will include mRNPs that represent stages of gene expression in both the nucleus and cytoplasm.
Comparing all RBPs (Figures 2 and S3 and Table S1), the SR-like protein Npl3 showed particularly high enrichment with Cbp80-PrA, approximately 4-fold higher than any other RBP. Upon RNase A treatment, most spots were lost for almost all RBPs tested, while ~50% of Npl3 spots were not RNase A sensitive (Figures 1B and S4). Npl3 was reported to directly bind the CBC,45 which likely accounts for the presence of a large fraction of RNase-insensitive Npl3 spots in a Cbp80-PrA pull-down. Still, even considering the RNase-insensitive fraction, Npl3 showed a high level of enrichment compared to other RBPs. This may reflect the existence of Npl3 in cytoplasmic mRNPs and/or non-coding RNPs (e.g., with small nuclear [sn]RNAs, small nucleolar [sno]RNAs, or long non-coding [lnc]RNAs), as Npl3 has functions in translation and binds non-coding RNAs.46,47 Based on the interactions of Npl3 with the CBC complex and its role in pre-mRNA capping quality control,45 it is also possible that Npl3 would be detected with Cbp80 in nascent and abortive transcripts that lack other RBPs found in mature mRNPs. It is expected that these multi-functional features of Npl3 account for the frequent association of Npl3 with Cbp80 within RNase-sensitive complexes.
Figure 2. Occupancy of mRNA biogenesis and export-related RBPs in Cbp80-containing mRNPs.
(A) Cartoon summarizes the procedure to analyze the frequency of target RBPs in the population of CBC-containing mRNPs by mRNP-SiMPull. To perform these assays, RBPs were tagged with mNG in a strain with Cbp20-SNAPf-3HA with/without Cbp80-PrA. IgG pull-downs were performed to enrich Cbp80-bound mRNPs and CBC itself. The eluate was separately diluted and loaded onto mNG antibody-coated slides (to capture RBP-mNG-bound mRNPs) and HA antibody-coated slides (to capture Cbp20-SNAPf-3HA-bound mRNPs and free CBC complexes). The spot number of RBP-mNG counts normalized by Cbp20-SNAPf-3HA counts was used for the comparison between the different RBPs.
(B) Representative images used to determine the frequency of target RBP-containing mRNPs in the population of total Cbp80-bound mRNPs. See Figure S3 for comparison with untagged Cbp80 control strains. Scale bar, 5 μm.
(C) Graph showing the spot number of RBP-mNG normalized by Cbp20-SNAPf-3HA value in the same sample. Mean and standard deviation are shown with individual data points from triplicate experiments. It was noted that mNG tagging of Yra1 and Pab1 caused a growth defect that was tag specific (Figure S3A), but these strains were used in this experiment to maintain consistency.
Other RBPs strongly enriched with Cbp80 were Yra1 and the poly(A)-RNA-binding proteins Nab2 and Pab1 (Figures 2 and S3 and Table S1). In contrast, Yra2, the THO complex component Hpr1, and the SR-like proteins Gbp2 and Hrb1 were less frequently observed. Of the three cleavage- and polyadenylation-related RBPs, Yth1, Pcf11, and Hrp1 (components in CPF, CFIA, and CFIB complexes, respectively), only Hrp1 showed an appreciable level of enrichment. This raises the possibility that after the cleavage and polyadenylation reaction, Hrp1 remains associated with the mRNA and is exported to the cytoplasm with the mRNP. This is consistent with the observation that Hrp1 shuttles between the nucleus and cytoplasm and has a reported function in nonsense-mediated decay.48,49 The essential mRNA export receptor Mex67 was rarely observed. This supports recent works indicating that Mex67/NXF1 does not commonly join an mRNP in the nucleoplasm and is independently recruited to the NPC to mediate mRNP export.50,51 Overall, these data indicate that Npl3, Yra1, Nab2, and Pab1 commonly occupy Cbp80-containing mRNPs and that RBP occupancy is in line with protein expression levels52; i.e., highly expressed nuclear RBPs functioning in mRNA processing are more frequently bound to mRNPs (Table S1).
RBP stoichiometry in single mRNPs
To address RBP stoichiometry in single mRNPs, copy number measures were obtained by photobleaching step analysis (Figure 3A), with the addition of the DEAD-box protein Sub2 and THO complex subunit Mft1 to the RBPs tested. In these assays, SNAP-tagged Yra1 and Pab1 were used due to an observed growth defect caused by mNG tagging (Figure S3A). All resulting stoichiometry data were corrected for reporter activity using a finite mixture model of truncated binomials (see STAR Methods). As expected for Cbp20, which forms a 1:1 complex with Cbp80,53 Cbp20 spots were uniformly dim, and ~80% of spots showed one-step photobleaching (Figures 3B and S5A). The cleavage and polyadenylation factors (Yth1, Pcf11, and Hrp1) also commonly had one molecule per mRNP (Figures S5B–S5D).
Figure 3. mRNA biogenesis and export-related RBP stoichiometry in CBC-containing mRNPs.
(A) Cartoon depicts the pull-down procedure for RBP-mNG- or -SNAPf-containing mRNPs in mRNP-SiMPull. Pull-down was performed by (i) IgG-beads followed by (ii) mRNP capturing via mNG, HA (for SNAPf-3HA), or Yra1 antibody on the glass surface.
(B–L) Representative TIRF images of target RBPs (B: nuclear cap-binding complex component, Cbp20, C: nuclear poly A binding protein, Nab2, D: cytoplasmic poly A binding protein, Pab1, E: SR-like protein, Npl3, F: SR-like protein, Gbp2, G: SR-like protein, Hrb1, H: THO complex component, Hpr1, I: THO complex component, Mft1, J: RNA helicase, Sub2, K: mRNA export receptor, Mex67, L: mRNA export adapter protein, Yra1) obtained by mRNP-SiMPull from cell lysates co-expressing Cbp80-PrA (see Figure S5A for control images with untagged Cbp80 strains). Graphs display stoichiometry distributions determined by photobleaching steps analysis. Blue bars show mean data with standard deviation with dots showing individual data points in replicate experiments (12 and three replicates for L and the others, respectively). Orange line displays the expected stoichiometry distribution following correction for fluorescent reporter activity using finite mixture modeling. For Hpr1 (H) and Mft1 (I), magenta squares represent model estimation assuming a dimer as the base unit. Average number (n) of spots analyzed per replicate experiment is indicated on each graph. Image scale bars, 5 μm.
For the nuclear PABP Nab2, photobleaching data showed that one to six copies were detected within mRNPs (Figures 3C and S5A). This aligns with Nab2 binding ~25 adenines (As) in vitro and the measured length distribution of mRNA poly(A)-tails in yeast that have a mean and median poly(A)-tail length of 40 and 37 As, with lengths up to 140 As.54,55 Nab2 has also been shown to bind within the body of mRNAs and to form a dimer,19,56 which may contribute to these stoichiometries. In contrast, the mostly cytoplasmic PABP, Pab1, was most often present as one molecule per mRNP (Figures 3D and S5A). While Nab2 is the major nuclear PABP, the presence of Pab1 on CBC-containing mRNPs is consistent with Pab1 shuttling between nucleus and cytoplasm and contributing to poly(A)-tail length control and export.57,58 Moreover, the CBC remains bound to the mRNA post export until replacement by translation initiation factors for the pioneer round of translation,43,44 which may reflect the observed CBC-containing Pab1-bound mRNPs. It is reported that cytoplasmic degradation of translated mRNAs by the poly(A)-nuclease Pan complex requires more than two Pab1 molecules on the poly(A)-tail.59 The presence of a single copy of Pab1 in Cbp80-containing mRNPs suggests a mechanism that could distinguish and protect recently exported mRNAs from decay and favor translation.
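The correspondence between Nab2 copy number and poly(A)-tail length can be checked with simple arithmetic. The sketch below is a rough upper-bound estimate only: it assumes that Nab2 footprints of ~25 As tile the tail without overlap or gaps, which is a simplification not stated in the text, and the function name is hypothetical.

```python
NAB2_FOOTPRINT_NT = 25  # ~25 adenines bound per Nab2 in vitro (value from the text)

def max_nab2_sites(tail_length_nt, footprint_nt=NAB2_FOOTPRINT_NT):
    """Upper bound on non-overlapping Nab2 footprints that fit on a poly(A) tail."""
    return tail_length_nt // footprint_nt

# Tail lengths quoted in the text: median 37 As, mean 40 As, up to 140 As.
for label, length in [("median", 37), ("mean", 40), ("longest", 140)]:
    print(label, length, max_nab2_sites(length))
```

Under this assumption the tail alone accommodates between one and five Nab2 molecules, so binding within the mRNA body or Nab2 dimerization, as noted above, could account for the highest observed copy numbers.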
The three SR-like proteins, Npl3, Gbp2, and Hrb1, showed similar stoichiometry distributions and were most often present as one copy in an mRNP; however, ~30%–40% of mRNPs contained two to six copies of these RBPs (Figures 3E–3G and S5A). This stoichiometric variation may reflect gene-specific mRNP compositions linked to SR-like protein functions in splicing (Npl3) and quality control (Gbp2 and Hrb1).60–62 In addition, it is known that Npl3 self-association is modulated by Npl3 methylation, and an Npl3-Npl3 interaction is required for monosome formation to activate translation,63,64 which may contribute to the observed stoichiometry.
THO complex components Hpr1 and Mft1 most frequently showed two-step photobleaching (Figures 3H, 3I, and S5A), with statistical modeling indicating that no observed mRNPs contained a single copy of Hpr1 or Mft1. This corresponds with reported structural data showing that the yeast THO complex forms a homodimer65–67 and confirms the THO complex is present on mRNA as a dimer in vivo. By limiting the statistical model to multiples of two, the data suggest that the THO complex is present as a single homodimer two-thirds of the time, with the remaining mRNPs most frequently having two homodimers present. This range of THO complex stoichiometry corresponds with estimates of the human tetrameric THO complex, which was recently modeled as one to three copies per mRNP.35 In the case of Sub2, a THO complex binding partner,66–68 an approximately equal ratio of mRNPs with one or two copies of Sub2 was indicated by the data (Figures 3J and S5A).
In the case of the major mRNA export factor, Mex67 (NXF1/TAP in humans),69–71 multiple adapter RBPs are known to aid association with the mRNP.62,72–76 Models likewise suggest multiple copies of Mex67 would be required for efficient transport through an NPC5; however, Mex67 stoichiometry was most often one per mRNP (Figures 3K and S5A). These data are in line with recent works that suggest Mex67 is not a stable component of mRNPs and associates transiently late in biogenesis with an mRNP at or near NPCs to mediate export.50,51 The majority of Mex67-bound Cbp80-containing mRNPs also contained Mtr2 (Figure S5F), supporting the model that Mex67 is loaded into mRNPs as a Mex67-Mtr2 heterodimer.70
Finally, Yra1 showed a large variation in stoichiometry with ~50% of spots showing 2–10 copies (Figures 3L and S5A). The observed number of Yra1 molecules cannot be explained by assemblies containing multiple mRNPs, as recently reported in yeast,77 since most spots displayed Cbp20-mNG spot intensities consistent with a single copy of the CBC (Figure S5G). Previous work established that Yra1 does not actively shuttle between nucleus and cytoplasm72,73 and that Yra1 is removed from an mRNP prior to export via ubiquitination by Tom174 and is further regulated by Dbp2.78 When mRNP-SiMPull was performed using a tom1Δ strain, or following depletion of Dbp2, photobleaching step analysis indicated Yra1 stoichiometries remained like the controls (Figure S6). These data indicate that the majority of mRNPs being analyzed are upstream of Tom1 function and that loss of Tom1 or Dbp2 activity does not perturb Yra1 loading into an mRNP. Yra2, a paralog of Yra1,79 showed a stoichiometry of mostly one molecule per mRNP (Figure S5E), consistent with Yra1 and Yra2 having distinct functions in mRNA biogenesis and export. Overall, these RBP stoichiometry and co-occupancy data provide an important quantitative framework to be considered with emerging structural data to inform models of mRNP architecture and mechanisms of nuclear export.
Multicopy Yra1 mRNPs contain Nab2, Npl3, and Hpr1
The observed range of Yra1 and other RBP stoichiometries reveals mRNP heterogeneity across the transcriptome and stages of gene expression. To further define the compositional nature of isolated Cbp80-containing mRNPs, co-occupancy of Yra1 with other RBPs was analyzed by two-color mRNP-SiMPull. For these assays, Cbp80-PrA pull-down material was loaded on glass slides functionalized with a Yra1 antibody to capture SNAPf-Yra1 complexes and determine co-localization with an mNG-tagged RBP (Figures 4A and S7A). Yra1-containing mRNPs were categorized as single or multicopy using measured intensities (Figures 4B, S7B, and S7C, see STAR Methods). Among these two groups, it was determined that single-copy Yra1 spots had a co-localization frequency of ~50% with Cbp20 (Figures 4 and S7A). Assuming Cbp20 is present as a single copy, combined with the measured mNG reporter activity (78%), these data indicate that ~64% of single-copy Yra1 spots represent capped mRNAs. Co-localization between Cbp20 and multicopy Yra1 spots increased to ~70%, indicating that ~90% (based on reporter activity) of these spots are associated with the CBC.
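The reporter-activity correction applied to the Cbp20 co-localization values amounts to dividing the observed frequency by the fraction of active reporters. A minimal sketch follows, assuming a single Cbp20 copy per mRNP as stated above; the function name is hypothetical.

```python
def corrected_cooccupancy(observed_fraction, reporter_activity):
    """Estimate true co-occupancy for a single-copy partner.

    A single-copy partner is only visible when its reporter is active, so
    observed = true * reporter_activity. The result is capped at 1.0.
    """
    if not 0 < reporter_activity <= 1:
        raise ValueError("reporter_activity must be in (0, 1]")
    return min(1.0, observed_fraction / reporter_activity)

# Values from the text: ~50% and ~70% observed, 78% reporter activity.
single_copy = corrected_cooccupancy(0.50, 0.78)
multicopy = corrected_cooccupancy(0.70, 0.78)
print(round(single_copy, 2), round(multicopy, 2))  # 0.64 0.9
```

This simple division is only valid when the partner is present as one copy; for multicopy partners the chance that at least one reporter is active depends on the unknown stoichiometry distribution.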
Figure 4. Co-localization analysis of Yra1 with other RBPs.
(A) Cartoon depicting the pull-down procedure for co-localization analysis of SNAPf-Yra1 and RBP-mNG by two-color mRNP-SiMPull. Pull-down was performed by (i) IgG-beads followed by (ii) mRNP capture via Yra1 antibody on the glass surface. Representative TIRF images used for co-localization analysis between SNAPf-Yra1 and other mNG-tagged RBPs. Scale bar, 5 μm.
(B) Graph shows percent co-localization with one Yra1 vs. multiple Yra1 containing spots for the indicated RBPs. Yra1 spots were separated into two groups (one or multiple Yra1) based on spot intensity (see STAR Methods and Figure S7). The mean and standard deviation of percent co-localization calculated with fluorescent protein activity-uncorrected raw data for three replicate experiments are shown for each. Averaged spot numbers analyzed in each replicate for Cbp20, Npl3, Nab2, Hpr1, Sub2, Gbp2, Hrb1, Mex67, Yra2, and Pab1 images are 266, 202, 330, 359, 446, 281, 271, 309, 257, and 240 for one Yra1 and 126, 86, 195, 171, 233, 146, 165, 267, 181, and 173 for multiple Yra1s, respectively.
Npl3 and Nab2 showed an uncorrected co-localization frequency of ~70% with multiple Yra1 spots, which decreased to ~30% for single Yra1 spots (Figure 4B). Hpr1 (~40%), Sub2 (~25%), Gbp2 (~25%), Hrb1 (~30%), and Mex67 (~20%) also showed biased co-localization to multicopy Yra1 spots with these RBPs showing a co-localization of ~10% or less to single Yra1 spots. Both Yra2 and Pab1 showed low levels of co-localization in all spots. In cases other than Cbp20, the data cannot be corrected for reporter activity due to a lack of knowledge about the populations these mRNPs are isolated from and their associated stoichiometry distributions. Consequently, these numbers represent a lower bound of RBP co-occupancy in mRNPs containing Yra1. The measured differences in RBP co-occupancy between single and multicopy Yra1 spots likely result from gene-specific mRNP heterogeneity, capturing mRNPs at different points in the gene expression pathway, and technical limitations of the approach (e.g., presence of free labeled protein). It is expected that multicopy Yra1 mRNPs with CBC, Nab2, Npl3, and THO complex are representative of a sub-population of mature nuclear mRNPs. Indeed, loss of any of these factors severely disrupts multiple aspects of gene expression,38,39,47,58,60,61,68,72,75,80–82 causing mRNA export defects and lethality in the case of Yra1 and Nab2.39,72
From the two-color mRNP-SiMPull data, spot intensity information was extracted to further investigate if the stoichiometries of different RBPs are correlated with Yra1 stoichiometry. Importantly, there is a strong relationship between intensity and measured stoichiometry within mRNP-SiMPull data (Figure S2F), which supports the validity of using intensity to infer stoichiometry in these analyses. Using this approach, no significant correlations related to copy number were identified among RBPs co-localizing with Yra1 (Figure S8). This suggests that Yra1 does not have a fixed binding partner among the tested RBPs that co-varies with respect to stoichiometry. These data are supported by recent crosslinking mass spectrometry data that showed the most frequent links involving Yra1 occurred between copies of Yra1, as well as a novel mRNP constituent Yhs7,30 which was not included in this study. Of the RBPs analyzed here, Npl3 shows some of the highest stoichiometry values after Yra1, and it was recently noted that Npl3 contains positively charged intrinsically disordered regions (IDRs) like Yra1, with IDRs in Yra1 promoting RNA-RNA interactions.30 These shared features (i.e., variable stoichiometry and IDRs) raise the possibility that Yra1 and Npl3, as multicopy constituents of mRNPs (this work), similarly promote RNA-RNA interactions on different sub-populations of mRNPs and/or act redundantly within the same mRNPs. Future work will employ RNA aptamers within the mRNP-SiMPull methodology to isolate gene-specific mRNPs, which will allow for an evaluation of gene-specific mRNP packaging networks and heterogeneity that exists involving Yra1, Npl3, and other RBPs.
THO complex facilitates generation of multicopy Yra1 mRNPs
Co-localization analysis by two-color mRNP-SiMPull offers strong evidence for distinct types of Yra1-containing mRNPs. Current models suggest a major pathway for Yra1 loading is through the THO complex and Sub2 (as the TREX complex) involving Sub2 ATPase activity.68,83–85 The biased co-localization of Hpr1 (THO complex) and Sub2 with multiple Yra1-containing mRNPs (Figure 4) further suggests that TREX functions to load and/or stabilize Yra1 within mRNPs. To investigate this model, Hpr1-PrA was used in place of Cbp80-PrA to isolate mRNPs (Figure 5A), which resulted in a significant enrichment of multicopy Yra1 containing mRNPs (Figure 5B). Upon RNase A treatment of the Hpr1-PrA-associated material, most bright Yra1 spots were lost, confirming these are mRNPs, but dim spots were still present compared to a no-tag control (Figures S9A and S9B). Photobleaching step analysis of these RNase A-resistant spots showed mostly one-step bleaching (Figures S9C and S9D), which is consistent with Yra1 bound to the THO complex (i.e., part of the TREX complex) independent of RNA.68
Figure 5. THO-dependent formation of multiple Yra1-containing mRNPs.
(A) Comparison of Yra1 stoichiometry distribution in Cbp80-PrA and Hpr1-PrA pull-down samples. Cartoon shows the pull-down procedure of mRNP-SiMPull by (i) IgG-beads to target Cbp80-PrA or Hpr1-PrA followed by mRNP capture via Yra1 antibody on the glass surface. Representative TIRF images of SNAPf-Yra1 obtained by mRNP-SiMPull from Cbp80-PrA and Hpr1-PrA pull-downs.
(B) Line graph showing uncorrected raw mean photobleaching step data from triplicate experiments for SNAPf-Yra1 comparing Cbp80-PrA to Hpr1-PrA. p values were calculated by a non-parametric Kolmogorov-Smirnov (KS) two-sample test.
(C) Comparison of Yra1 stoichiometry in wild-type and tho2Δ strains by mRNP-SiMPull. Cartoon shows the pull-down procedure of mRNP-SiMPull by (i) IgG-beads for targeting Cbp80-PrA followed by (ii) mRNP capturing via Yra1 antibody on the glass surface. Cell lysates were loaded into the Yra1 antibody-coated glass slide for the analysis of input samples. Representative TIRF images of SNAPf-Yra1 obtained by mRNP-SiMPull from cell lysate (Input) and Cbp80-PrA pull-down samples in wild-type and tho2Δ strains.
(D) Line graph shows uncorrected raw mean photobleaching step data from triplicate experiments for SNAPf-Yra1 comparing wild type to tho2Δ in Cbp80-PrA pull-down samples. p value was calculated by a non-parametric Kolmogorov-Smirnov (KS) two-sample test. Average number (n) of spots analyzed per replicate experiment is indicated on each graph. Scale bar, 5 μm.
To investigate the role of TREX in loading/stabilizing Yra1 within mRNPs, Yra1 stoichiometry was assessed with Cbp80-PrA in a tho2Δ strain, with THO2 encoding the largest subunit of the THO complex. In a tho2Δ strain, photobleaching analysis revealed that multicopy Yra1 mRNPs declined from 45% to 28% of the population (~40% decrease) with a near complete loss of complexes with more than four copies of Yra1 (Figures 5C and 5D). These data demonstrate that THO complex supports multicopy binding of Yra1 to an mRNP, but the presence of Yra1 is not solely dependent on THO, as evidenced by total spot number and the persistence of multicopy Yra1 mRNPs. This corresponds with Yra1 being essential,86 while the THO complex is not,68 and the description of THO-independent Yra1 recruitment mechanisms that involve Pcf11, the RNA Pol II CTD, and interactions with other RBPs.30,87,88 It is also possible that Sub2 continues to engage Yra1 in the absence of the THO complex, but at a reduced efficiency, thus altering Yra1 stoichiometry. Strains carrying temperature-sensitive or auxin-induced degradation alleles of Sub2 were not successfully generated in the presence of tagged versions of Yra1 and Cbp80 that are needed for mRNP-SiMPull, leaving this possibility untested. These data demonstrate that the THO complex, likely in the form of TREX, is present in mRNPs and functions to generate and/or stabilize mRNPs with increased Yra1 stoichiometries.
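The comparison of photobleaching step distributions can be sketched as follows. This is a minimal illustration, not the analysis pipeline used here: the step-count histograms are hypothetical values chosen only to reproduce the reported multicopy fractions (45% and 28%), and the function names are assumptions. The sketch computes the multicopy fraction, the relative decrease, and the two-sample KS statistic D (the largest gap between the two empirical cumulative distributions); a p value would additionally require the number of spots analyzed.

```python
def multicopy_fraction(step_counts):
    """Fraction of spots showing more than one photobleaching step.

    step_counts maps number of steps -> number of spots.
    """
    total = sum(step_counts.values())
    if total == 0:
        raise ValueError("no spots supplied")
    multi = sum(n for steps, n in step_counts.items() if steps > 1)
    return multi / total


def relative_decrease(before, after):
    """Decrease expressed as a fraction of the starting value."""
    return (before - after) / before


def ks_statistic(counts_a, counts_b):
    """Two-sample KS statistic D for two step-count histograms."""
    total_a, total_b = sum(counts_a.values()), sum(counts_b.values())
    cum_a = cum_b = 0
    d = 0.0
    for steps in sorted(set(counts_a) | set(counts_b)):
        cum_a += counts_a.get(steps, 0)
        cum_b += counts_b.get(steps, 0)
        d = max(d, abs(cum_a / total_a - cum_b / total_b))
    return d


# Hypothetical histograms (steps -> spots); NOT the measured data.
wild_type = {1: 55, 2: 20, 3: 12, 4: 8, 5: 5}
tho2_delta = {1: 72, 2: 18, 3: 8, 4: 2}

wt_multi = multicopy_fraction(wild_type)     # 0.45
mut_multi = multicopy_fraction(tho2_delta)   # 0.28
print(round(relative_decrease(wt_multi, mut_multi), 3))  # 0.378
print(round(ks_statistic(wild_type, tho2_delta), 2))     # 0.17
```

The relative decrease of (45 − 28)/45 ≈ 38% corresponds to the ~40% decrease quoted above.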
Transcript-specific features are correlated with Yra1 copy number
Previous RNA binding data suggest Yra1 and the THO complex are associated with most gene transcripts, but Yra1 shows a propensity for longer transcripts.20 The THO complex has also been linked to the maintenance of genome stability via prevention of R-loop formation,89 which reportedly increases with gene length.90,91 Combining these findings with the data presented here, a putative model is that Yra1 is loaded on transcripts in a length-biased manner by TREX to support mRNP packaging, which upon disruption increases R-loop formation and genome instability. An expectation of this model is that THO complex-bound mRNPs would be biased to longer transcripts. Thus, RNA-seq analysis was performed on material isolated from two-step pull-downs that targeted Cbp80-PrA vs. Hpr1-PrA in the first step and selected for Yra1 in the second step (i.e., Cbp80/Yra1 vs. Hpr1/Yra1) (Figure 6A). This analysis identified 1,616 (Cbp80/Yra1) and 753 (Hpr1/Yra1) genes that were significantly enriched over a no-tag control sample with 522 genes in common (Figure S10A). The genes in common to both Cbp80/Yra1 and Hpr1/Yra1 two-step pull-downs, which are expected to be enriched for multiple Yra1-containing mRNPs, were significantly longer than the genome average or those only enriched by Cbp80/Yra1 (Figure 6B). Upon performing the same Cbp80/Yra1 pull-down in a tho2Δ strain, a statistically significant loss of longer transcripts was observed (Figure S10B), which is suggestive of a loss of Yra1 from these transcripts and a decreased pull-down efficiency. These data are consistent with transcript length being a feature associated with THO complex-bound mRNPs and increased Yra1 copy number.
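The partition of enriched genes into shared and pull-down-specific groups follows directly from the reported counts. The sketch below uses placeholder gene identifiers sized to match those counts (1,616 and 753 enriched genes with 522 in common); the identifiers and the function name are hypothetical and do not correspond to real genes or to the analysis code.

```python
def partition_enriched(cbp80_yra1, hpr1_yra1):
    """Split two enriched gene sets into shared and pull-down-specific groups."""
    cbp80, hpr1 = set(cbp80_yra1), set(hpr1_yra1)
    return {
        "both": cbp80 & hpr1,
        "cbp80_only": cbp80 - hpr1,
        "hpr1_only": hpr1 - cbp80,
    }


# Placeholder IDs sized to the reported counts; not real gene names.
cbp80_yra1 = {f"gene{i}" for i in range(1616)}
hpr1_yra1 = {f"gene{i}" for i in range(1094, 1094 + 753)}

groups = partition_enriched(cbp80_yra1, hpr1_yra1)
sizes = {name: len(genes) for name, genes in groups.items()}
print(sizes)
```

With these set sizes, 1,616 − 522 = 1,094 genes are enriched only by Cbp80/Yra1, matching the group compared in Figure 6B, and 753 − 522 = 231 genes are enriched only by Hpr1/Yra1.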
Figure 6. Yra1 is required for mRNP compaction.
(A) RNA-seq analysis to define mRNAs that form one and multiple Yra1-bound mRNPs. Cartoon shows the pull-down procedure for RNA-seq sample preparation. First pull-down was performed by (i) IgG-beads to target Cbp80-PrA or Hpr1-PrA. In the second pull-down (ii), Yra1-bound mRNPs were purified by Yra1 antibody-conjugated beads from which RNA was extracted for RNA-seq.
(B and C) Violin plot showing gene length (bp) and preference to form secondary structure (average PARS score within each gene) of all annotated genes in S. cerevisiae vs. significantly enriched in only Cbp80/Yra1 (1,094 genes) or in both Cbp80/Yra1 and Hpr1/Yra1 pull-downs (522 genes). Median and quartile are shown as solid and dotted lines, respectively. p value was calculated by Wilcoxon’s rank-sum test.
(D) Western blot shows Yra1 depletion by auxin-induced degron system after 2 h at the indicated temperatures. Protein size marker position is indicated at right side.
(E–G) Illustrations show Atto647 and Alexa 594 smFISH probes used to target the same (E) or different (F and G) regions of target mRNAs for distance measurements by super-resolution STED imaging. Representative maximum projection images are shown, including four magnified examples from the nuclear volume for each sample. Scale bar, 400 nm. Dot plots display distances measured with probe sets targeting IRA2 (F) and TAO3 (G) mRNAs. Median and SD (standard deviation) are shown in nanometers. Statistical testing was performed using the Kolmogorov-Smirnov test. ns, not significant. ***p < 0.001.
Since Yra1 has robust RNA-RNA annealing activity,86 RNA secondary structure was also investigated as a feature linked to Yra1 stoichiometry utilizing published parallel analysis of RNA structure (PARS) data.92 PARS data in S. cerevisiae provide in vitro data on the propensity of nucleotides within a transcript to be present in a double- or single-stranded conformation, with positive PARS values indicating nucleotides more commonly in a double-stranded RNA conformation. The averaged PARS score in each gene was compared, and it was found that genes enriched in both IPs showed a significantly higher PARS score than the genome average or genes enriched only by Cbp80/Yra1 (Figure 6C). This indicates that transcripts enriched by both Cbp80/Yra1 and Hpr1/Yra1 (i.e., multiple Yra1-containing mRNPs) have sequences with a higher potential to form a secondary structure. In addition, genes enriched in both IPs were generally found to be expressed at higher levels, have increased synthesis and decay rates,93 and lack introns (Figures S10C–S10F). The anti-correlation observed between the THO complex and spliced mRNAs matches reports of intron-containing mRNAs being less sensitive to loss of THO complex function.94 These findings suggest that Yra1 stoichiometry is linked to transcript features important for mRNP packaging, including transcript length and RNA secondary structure.
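The per-gene averaging of PARS scores can be sketched as below. The input layout (a list of per-nucleotide scores with `None` marking positions without data), the gene labels, and the score values are assumptions made for illustration and do not reflect the format of the published dataset.

```python
from statistics import mean


def mean_pars(scores):
    """Average PARS score over the nucleotides of one transcript.

    Positions without a measurement (None) are skipped.
    """
    measured = [s for s in scores if s is not None]
    if not measured:
        raise ValueError("no measured positions for this transcript")
    return mean(measured)


# Hypothetical per-nucleotide profiles; positive values indicate a tendency
# toward a double-stranded conformation.
profiles = {
    "geneA": [1.2, 0.8, -0.3, None, 2.1],
    "geneB": [-0.5, -1.1, 0.2, 0.0, None],
}
per_gene = {gene: mean_pars(scores) for gene, scores in profiles.items()}
print({gene: round(value, 2) for gene, value in per_gene.items()})
```

Gene-level averages of this kind are the values compared between gene groups in Figure 6C.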
Yra1 loading is required for stable compaction of nuclear mRNPs
Nuclear mRNPs form compacted particles.23,25,30,31,35 Given the correlation of Yra1 copy number with transcript length and secondary structure, Yra1 RNA-RNA annealing activity,86 and the extensive protein-protein and protein-RNA network Yra1 engages in,30 it is likely that Yra1 function is related to the maintenance of mRNP compaction. To examine this hypothesis, nuclear mRNP compaction was analyzed by measuring the distance between the 5′ and 3′ region of a target mRNA using smFISH and super-resolution STED imaging. Considering the importance of temperature to RNA folding, these data were collected at 25°C and 37°C with and without auxin-induced depletion95 of Yra1 (Figure 6D). Two mRNAs, IRA2 (9,240 nt) and TAO3 (7,131 nt), were selected as mRNAs that are enriched in the RNA-seq datasets generated from Cbp80/Yra1 and Hpr1/Yra1 pull-downs. To determine the co-localization precision of this approach, a set of differentially labeled alternating smFISH probes targeting an ~2-kb region within a control mRNA (MDN1) were used. STED imaging of the overlapping probe sets showed a high degree of co-localization with a median distance between the two labeled spots of ~18 nm, indicating the co-localization precision of this setup (Figures 6E–6G). At 25°C, depletion of Yra1 caused a robust mRNA export defect for both targets, but 5′–3′ spot distances (40 nm for IRA2 and 38 nm for TAO3) were not significantly changed compared to control (36 nm for IRA2 and 41 nm for TAO3; Figures 6F, 6G, S10G, and S10H). In contrast, spatially separated 5′ and 3′ spots were frequently observed upon depletion of Yra1 at 37°C with a median distance (51 nm for IRA2 and 56 nm for TAO3) that was significantly increased compared to control (37 nm for IRA2 and 40 nm for TAO3; Figures 6F, 6G, S10G, and S10H). 
Notably, in the absence of Yra1 at 37°C, the number of mRNAs per cell was strongly diminished compared to 25°C (Figures 6F and 6G), suggesting changes in mRNP packaging are likely accompanied by increased nuclear decay and/or decreased transcription. These data indicate that with increased temperature, Yra1 becomes essential to maintaining mRNP compaction and gene expression.
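The 5′–3′ distance measurement can be illustrated with a minimal sketch that computes the Euclidean distance between paired spot centroids and reports the median. It assumes that spot detection and pairing of the two channels have already been performed and that coordinates are given in nanometers; the coordinates and function name are hypothetical, and the actual localization procedure is not described in this section.

```python
from math import dist
from statistics import median


def spot_distances_nm(five_prime_spots, three_prime_spots):
    """Distance between each paired 5' and 3' spot centroid (same units as input)."""
    if len(five_prime_spots) != len(three_prime_spots):
        raise ValueError("spots must be supplied as matched pairs")
    return [dist(a, b) for a, b in zip(five_prime_spots, three_prime_spots)]


# Hypothetical centroid coordinates (x, y) in nm for three mRNAs.
five_prime = [(0.0, 0.0), (100.0, 250.0), (400.0, 90.0)]
three_prime = [(30.0, 40.0), (100.0, 290.0), (436.0, 105.0)]

distances = spot_distances_nm(five_prime, three_prime)
print(distances)          # [50.0, 40.0, 39.0]
print(median(distances))  # 40.0
```

Because the measured median for overlapping probe sets was ~18 nm, distances of this order would reflect the co-localization precision of the setup rather than a true separation.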
Temperature is a significant determinant of RNA annealing, and it is reported that the thermal stability of secondary structure within mRNAs is lower than that within non-coding RNAs.96 As such, it is possible that changes in growth temperature need to be buffered by changes in RBP stoichiometry to maintain mRNP packaging and gene expression. A failure to do so may result in an inability to maintain mRNP packaging and proper gene expression, as observed upon depletion of Yra1 at 37°C (Figure 6). To evaluate whether RBP stoichiometries vary with growth temperature, Nab2, Npl3, Yra1, and Hpr1 stoichiometries were compared across yeast cultures grown at 25°C, 30°C, and 37°C (Figures 7 and S11). Of the tested RBPs, photobleaching step analysis indicated the stoichiometry distribution of Yra1 was dramatically altered by temperature, showing a rise in multicopy Yra1 mRNPs as temperature increased, which was accompanied by a ~50% decrease in the population of single-copy Yra1 mRNPs at 37°C. Npl3 and Nab2 copy numbers were also increased at 37°C, which in the case of Nab2 may reflect the reported lengthening of mRNA poly(A)-tails at 37°C.55 In contrast, it was observed that Hpr1 copy number decreased at 37°C, highlighting that not all RBP stoichiometries are increased with temperature. Importantly, material captured on slide directly from lysate had RBP spot intensities that were indistinguishable between temperatures, suggesting increased stoichiometries were not the result of protein aggregation. Transcriptome analyses of yeast grown at 25°C, 30°C, and 37°C showed that no genes were differentially expressed between 25°C and 30°C, and only 40 and 11 genes were differentially expressed after 2 h of growth at 37°C when compared to 25°C or 30°C cultures (Table S2). This shows that mRNP compositions are changing in response to growth temperature, and these changes are not the result of an altered transcriptome.
These data demonstrate that mRNP composition is regulated in response to growth temperature with the same transcripts adopting different RBP compositions to support gene expression.
Figure 7. RBP stoichiometry in mRNPs is altered by cell growth temperature.
Representative TIRF images of SNAPf-Yra1, Nab2-mNG, Npl3-mNG, and Hpr1-mNG obtained by mRNP-SiMPull from cells grown at 25°C or 37°C for 2 h. IP (Cbp80-PrA pull-down) and input (cell lysate sample) images are shown. Scale bar, 5 μm. Line graphs show uncorrected raw mean photobleaching step data from triplicate experiments for 25°C and 37°C. p values were calculated by a non-parametric Kolmogorov-Smirnov (KS) two-sample test. Average number (n) of spots analyzed per replicate experiment is indicated on each graph.
Here, quantitative measures of mRNP composition at the single-molecule level are provided by mRNP-SiMPull. These data highlight the plasticity of mRNPs, with compositions that are gene feature dependent and responsive to cellular growth conditions (Figure S12A). The data reveal that Yra1 (Aly/REF) is present in individual mRNPs from 1 to 10 copies, and it interacts with mRNAs in a manner biased by length and RNA secondary structure. Yra1-containing mRNPs also commonly contain the poly(A)-RBP Nab2 and SR-like protein Npl3, and in association with THO/TREX complex, Yra1 stoichiometry is increased (Figure S12B). Given the expectation that the CBC is rapidly replaced post nuclear export by translation initiation factors44 and that Yra1 does not shuttle between nucleus and cytoplasm,72,73 these data indicate that CBC, Npl3, Nab2, and Yra1 form core components of nuclear mRNPs. Furthermore, at least for IRA2 and TAO3, Yra1 is required for the establishment and/or maintenance of a compacted mRNP structure at 37°C (Figure 6). This role of Yra1 as an organizer of mRNP structure aligns with the original identification of Yra1 as a factor with robust RNA annealing activity.86 Given that the mouse ALY gene can complement the lethality of a YRA1 loss of function mutant in S. cerevisiae,72 it is likely this function is conserved among Yra1 orthologs of the Opisthokonta supergroup.
Recently, crosslinking mass spectrometry analysis of yeast mRNPs purified via the THO complex identified intermolecular interactions between copies of Yra1 and between Yra1 and other RBPs, including Nab2.30 In addition, it was demonstrated by Bonneau et al. that positively charged IDRs in Yra1 and the THO complex subunit Tho2 promote RNA annealing, with IDRs also identified in other RBPs, including Npl3 and Yra2. This work using mRNP-SiMPull has shown that individual capped mRNPs containing the THO complex and Yra1 are also frequently co-occupied by Npl3 and Nab2 (Figures 3, 4, and 5). In humans, recent cryo-EM analyses have similarly suggested a critical role for Aly/REF (Yra1) in organizing mRNPs through multivalent protein-RNA and protein-protein interactions that involve bridging multiple TREX and EJC complexes.35 Although yeast lack an EJC,34 the central role of Yra1 (Aly/REF) in both yeast and humans appears to be conserved. Specifically, both Yra1 and Aly/REF play a critical role in mediating intermolecular RBP interactions to organize compacted mRNPs, which is achieved through forming mRNPs with varying RBP stoichiometries. Combining these findings with data presented here, we propose that Yra1 is a critical conserved mRNP organizer, acting with other RBPs (e.g., CBC, Npl3, Nab2, Sub2, and the THO complex) in a regulated manner to generate compact and export-competent mRNPs.
Limitations of the study
This study highlights mRNP compositions and heterogeneity linked to transcript features but is limited by the fact that the identity of the transcript within each mRNP analyzed is not known. Future work will need to characterize gene-specific mRNP compositions, which will be influenced by differences in mRNA processing (e.g., splicing) and gene expression regulation (e.g., transcription factor identity, cellular state, and stress). These questions may be addressed by altering the mRNP-SiMPull methodology to use mRNA aptamers (e.g., MS2) in one of the isolation steps to capture gene-specific mRNPs. In addition, it will be important to understand changes in mRNP composition across mRNA biogenesis and export, including transient and low-abundance configurations. By using Cbp80 to enrich nuclear mRNPs, this work reflects complexes across gene expression, including mRNPs post nuclear export, and likely represents the most frequent configurations (i.e., slow kinetic steps) within the process. Similarly, measured RBP stoichiometries reflect heterogeneity resulting from gene-specific differences, the capture of mRNPs at different points in the gene expression pathway (which may include nascent transcripts or decay products), and technical limitations of the approach (e.g., presence of free labeled protein). In the future, it may be possible to identify further mRNP intermediates, and extend stoichiometry measurements, across the gene expression pathway by targeting a broader repertoire and combination of RBPs for enrichment. In addition, mutants could be employed to accumulate biogenesis intermediates with the caveat that mutants will also disrupt the gene expression program and generate alternate or aberrant mRNPs due to the induced cellular perturbation.97–100 It is expected that future use, and variations, of mRNP-SiMPull could provide these types of data to advance models of post-transcriptional gene regulation.
