Abstract
Ribosome biogenesis is a complex process orchestrated by a host of ribosome assembly factors. Although it is known that many of the proteins involved in this process have RNA binding activity, the full repertoire of proteins that interact with the precursor ribosomal RNA is currently unknown. To gain a greater understanding of the extent to which RNA-protein interactions have the potential to control ribosome biogenesis, we used RNA affinity isolation coupled with proteomics to measure the changes in RNA-protein interactions that occur when rRNA transcription is blocked. Our analysis identified 211 out of 457 nuclear RNA binding proteins with a >3-fold decrease in RNA-protein interaction after inhibition of RNA polymerase I (RNAPI). We have designated these 211 RNA binding proteins as the RNAPI RNA interactome. As expected, the RNAPI RNA interactome is highly enriched for nucleolar proteins and proteins associated with ribosome biogenesis. Selected proteins from the interactome were shown to be nucleolar in location and to have RNA binding activity that was dependent on RNAPI activity. Furthermore, our data show that two proteins, which are required for rRNA maturation, AATF and NGDN, and which form part of the RNA interactome, both lack canonical RNA binding domains and yet are novel pre-rRNA binding proteins.
INTRODUCTION
The ribosome is a complex macromolecular machine, composed of ribosomal RNA (rRNA) and ribosomal proteins, which translates the genetic code into functional polypeptides. Ribosome biogenesis commences with the synthesis of a large precursor rRNA (pre-rRNA) by RNA polymerase I (RNAPI). Processing of the pre-rRNA through cleavage and trimming events leads to the production of mature 28S, 18S and 5.8S rRNA. Ribosomal proteins are precisely guided on to the folding rRNA to form the pre-40S and pre-60S ribosomal subunits (1). Overall, ribosome biogenesis is a highly coordinated process that requires numerous conserved assembly factors. Thus, in yeast, ∼350 assembly factors have been identified, whereas in mammalian cells, >500 proteins that affect ribosome biogenesis have been described (2,3). Many of these mammalian ribosome biogenesis factors are RNA binding proteins (RBPs). Indeed, recent RNA interactome studies suggest that ∼70% of the 286 proteins that influence rRNA processing may interact with RNA (2). This finding is unsurprising given that the pre-rRNA must be cleaved, digested, modified, and dynamically remodelled to create a functional ribosome. However, it has not been defined which of these potential ribosome biogenesis RBPs contact the rRNA directly and how many have a more indirect role in ribosome biogenesis.
With the advent of RNA interactome capture (RIC) it has emerged that RNA binding proteins are considerably more prevalent than was previously realised (4). One recent estimate suggests that there are nearly 1400 potential RBPs in the human genome (5,6). A substantial number of these RBPs lack classical RNA binding domains and many have no obvious link to RNA biology (5,6). Rather surprisingly, despite the fact that RIC relies on the capture of polyadenylated RNA, a considerable proportion of these RBPs have known roles in ribosome biogenesis or links to nucleolar function. Our data show that RIC also isolates significant quantities of pre-rRNA through RNA-RNA contacts with the mRNA. Therefore, we used this property to characterise those RBPs that directly contact the pre-rRNA or other RNAPI transcripts. We identified 211 potential RBPs that displayed reduced RNA association after RNAPI inhibition, which we designate the RNAPI RNA interactome. Importantly, these RBPs were highly enriched for nucleolar resident proteins and proteins with a known role or proposed role in ribosome biogenesis. Selected proteins from our RNAPI RNA interactome were shown to interact with RNA and these interactions were diminished after RNAPI inhibition. Finally, we extended our study by providing evidence that proteins from the previously identified ANN complex (7), comprised of AATF, neuroguidin (NGDN) and NOL10, and which functions in ribosome biogenesis, are novel pre-rRNA binding proteins.
MATERIALS AND METHODS
Cell culture, constructs and stable cell lines
U-2 OS and MCF10A cells were grown at 37°C with 5% CO2. U-2 OS were cultured in Dulbecco's modified Eagle's medium (DMEM) supplemented with 10% FBS and MCF10A in DMEM/F12 Ham's Mixture supplemented with 5% horse serum, 10 μg/ml insulin (Sigma-Aldrich), 100 ng/ml cholera toxin (Sigma-Aldrich), 20 ng/ml EGF (Peprotech) and 0.5 mg/ml hydrocortisone (Sigma-Aldrich). MCF10A and U-2 OS cells were treated with 10 nM actinomycin D (Sigma-Aldrich) or 2.5 μM CX5461 (Selleckchem).
The U-2 OS Flp-In T-REx cell line was generated using the manufacturer's recommended protocol (Thermo Fisher Scientific). The coding regions of AATF, NGDN and TAP26 were inserted in frame with the N-terminal 3xFLAG tag in pCDNA5/FRT/TO (Thermo Fisher Scientific). Constructs were then transfected into the U-2 OS Flp-In T-REx cell line and stable clones were selected with 100 μg/ml hygromycin B (Invitrogen) and 2.5 μg/ml blasticidin (Corning).
Western analysis, immunoprecipitation and RNA immunoprecipitation (RIP)
For western analysis, proteins were separated on 4–12% SDS PAGE gel (Thermo Fisher Scientific), transferred on to PVDF membrane (Bio-Rad) and detected with the indicated antibodies (Supplementary Table S1).
To prepare nuclei, U-2 OS cells were incubated in CLB buffer (10 mM HEPES pH 7.5, 10 mM NaCl, 3 mM MgCl2, 0.35 M sucrose, 0.5% NP-40) with protease inhibitors (cOmplete, Roche) for 5 min on ice and then centrifuged at 1300g for 5 min at 4°C. The nuclear pellet was washed twice with CLB and incubated for 30 min in SNEB buffer (20 mM HEPES pH 7.5, 150 mM NaCl, 1% NP40, 0.5% deoxycholate, 0.1% SDS) with protease inhibitors, with or without 0.2 U/ml Benzonase (Novagen), as indicated. The extract was then cleared by centrifugation.
For standard immunoprecipitations, protein G Dynabeads (Thermo Fisher Scientific) were incubated for 30 min with 3 μg of specific antibody (Supplementary Table S1) or rabbit/mouse IgG in TBS. After extensive washing, the antibody-bead conjugates were incubated with protein extract for 2 h at 4°C. Proteins were eluted with either 2xSDS PAGE loading buffer or, for FLAG immunoprecipitations, proteins were eluted with 3 ng/μl FLAG peptide (Sigma-Aldrich).
For RNA immunoprecipitations (RIP), U-2 OS cells were cross-linked with 0.2% formaldehyde for 8 min, incubated in neutralisation buffer (2.5 M glycine and 25 mM Tris pH 7.4) for 5 min and washed twice in PBS. Nuclear extraction and immunoprecipitation were conducted as described above with the following modifications. The beads were washed twice in 20 mM HEPES pH 7.5, 350 mM NaCl, 1% NP40, 0.5% deoxycholate, 0.25% SDS, 5 mM DTT for 3 min and twice with 20 mM Tris pH 7.4, 150 mM NaCl, 0.1% NP-40 for 5 min. Finally, the beads were treated with proteinase K (Thermo Fisher Scientific) and Turbo DNase (Ambion). Purified RNA was used for RT-qPCR analysis.
RT-qPCR
Reverse transcription was performed with random primers (Invitrogen) and Superscript™ II (Thermo Fisher Scientific) according to the manufacturer's instructions with 5 μl of purified RIP RNA or 300 ng of total RNA. The qPCR was performed using SensiFAST™ Sybr Lo-Rox Mix kit (Bioline) with specific primers to amplify sequences (Supplementary Table S1).
RNA interactome capture (RIC)
RIC was performed on nuclear extracts of MCF10A and U-2 OS. Briefly, cells were treated with 10 nM actinomycin D, 2.5 μM CX5461 or DMSO. Cells were washed with cold PBS, and cross-linked with UVC (150 mJ/cm2) irradiation. Nuclei were isolated as described above and were resuspended in oligo(dT) binding buffer (20 mM Tris pH7.4, 500 mM LiCl, 0.5% LiDS, 1 mM EDTA, 5 mM DTT). Nuclear lysate was incubated with magnetic oligo(dT) beads (NEB) for one hour at room temperature. Subsequently, the beads were washed and the RNA was eluted from the beads (4). The eluted RNA was supplemented with 2 mM MgCl2, 125U Benzonase (Sigma-Aldrich) and 300U RNase I (Ambion) to digest the RNA.
Quantitative mass spectrometry
LC–MS/MS was used to identify and quantify RNA binding proteins (RBPs) as described previously (8). Briefly, UV-crosslinked RBPs were separated by SDS PAGE and serial gel slices digested in situ with trypsin (9). Extracted tryptic peptides were analysed using data-independent acquisition (DIA) on a nanoAcquity UPLC system coupled to a Waters Synapt G2-S HDMS mass spectrometer. The PLGS ‘TOP 3’ method was used for absolute quantification of proteins (10).
Immunolocalization
U-2 OS cells were seeded on to glass coverslips and after the indicated treatments were fixed in 4% paraformaldehyde. Primary antibody incubations were performed at 4°C overnight with the indicated antibodies (Supplementary Table S1). Secondary fluorescent antibodies were incubated for 1 h at room temperature (Supplementary Table S1). Finally, the nuclei were labelled with Hoechst 33342 (Thermo Fisher Scientific). Images were acquired on Zeiss 510 confocal microscope and analysed with ImageJ.
Bio-informatic analysis
Protein/Gene IDs across different datasets were unified to Uniprot IDs or Gene Symbols using the ‘queryMany’ function in the ‘mygene’ package available through Bioconductor (http://bioconductor.org/packages/release/bioc/html/mygene.html). We assessed our dataset using Gene Ontology, InterPro domain and KEGG pathway annotations all accessed via the ‘mygene’ package (11,12). Analyses were performed using the R statistical framework using the Bioconductor packages goseq, mygene, clusterProfiler and their dependencies (11). ClusterProfiler code was modified to generate easy to interpret visualisation and output tables complete with the genes in each enriched category. The background list used in these analyses was derived from the MCF10A proteome (Dezi, unpublished). The maximum expression of each protein (across replicates) was used as bias during enrichment analyses performed using goseq (13,14). Only those proteins present in the background list are represented in functional enrichment analyses. Hence, there are 457 proteins in our interactome, but only 402 of them are represented in the MCF10A background and have GO terms associated with them. Code used in this study and supplementary data can be accessed via Bitbucket using this link https://bitbucket.org/emm13/2018_pineiro_nucleolarrbp/src.
RESULTS
Identification of RNAPI-dependent RNA binding proteins
RNA interactome analysis is a technique that was originally designed to isolate mRNA binding proteins (15,16). First, cells are irradiated with ultraviolet light to covalently couple proteins and any RNA in contact with these proteins in vivo. Cells are lysed under denaturing conditions, and polyadenylated mRNA and their associated proteins are then isolated using oligo(dT) affinity chromatography. This isolation process allows for the use of highly stringent wash conditions to remove any non-covalently bound proteins. Consequently, after nuclease digestion to remove the RNA, the sample is highly enriched for proteins that interact with mRNA (17). Recent advances in mass spectrometry have allowed the identification of ‘poly(A) RNA binding proteins’ from several different organisms and cell-types and these studies have vastly expanded the repertoire of potential RBPs (6).
In preliminary experiments using RIC, we noted that the RNA that was isolated using this technique contained a significant quantity of both pre-ribosomal RNA and mature rRNA (Supplementary Figure S1 and Figure 1C). Since the buffer conditions in the RIC protocol (0.5% LiDS, 0.5% LiCl) favour nucleic acid hybridization, we reasoned that rRNA was purified as a consequence of interactions between the mRNA and the abundant rRNA (Figure 1A). Disruption of these RNA-RNA interactions using heat coupled with two rounds of oligo(dT) purification, as used in the original RIC protocol from the Dreyfuss lab (18), resulted in almost complete loss of rRNA species from the RIC RNA, but the majority of the mRNA was retained (Supplementary Figure S1B). Moreover, we show that prior depletion of poly(A) RNAs from the lysate results in complete loss of rRNA species in the RIC RNA (Supplementary Figure S1C). Thus, these data suggest that the more recently developed RIC protocol, which uses only one round of oligo(dT) purification and no denaturation of the RNA (17), has the potential to identify not only mRNA binding proteins, but also rRNA binding proteins (Figure 1A).
Since an RNA interactome contains pre-ribosomal RNA (Supplementary Figure S1; Figure 1C), we reasoned that we could identify RBPs involved in ribosome biogenesis by analyzing changes in the RNA interactome that occur after in vivo pre-rRNA depletion through inhibition of RNAPI. We focussed on the nuclear RNA interactome to minimize the potential for RNA-protein interactions in the mature ribosome masking any changes that are due to the loss of pre-rRNA and we used a low concentration of actinomycin D (ActD) to preferentially inhibit rRNA transcription (19). Thus MCF10A cells were exposed to ActD at a concentration that was sufficient to inhibit the activity of RNAPI, but with minimal effect on RNA polymerase II (RNAPII) transcription. Three hours after exposure to 10 nM ActD, we confirmed that there was a substantial decrease in total RNA transcription (Figure 1B). Furthermore, northern analysis of RIC RNA revealed that there was a large decrease in the abundance of the 45S, 32S, 30S and 12S pre-rRNA in the captured RNA, indicating that rRNA transcription was inhibited under these conditions and that most of the existing intermediates are processed during this time (Figure 1C). However, ActD had little or no effect on the synthesis of polyadenylated RNA and therefore at this concentration the drug has little effect on RNAPII transcription (Figure 1B). Subsequently, RBPs were isolated by RIC from control and ActD treated cells. The spectrum of recovered RBPs was altered considerably after RNAPI inhibition, suggesting that many RBPs recovered by RIC interact with RNAPI RNA (Figure 1D). Examination of the recovered RBPs by western analysis revealed that, as we predicted, the association of the known pre-rRNA binding proteins, nucleolin (NCL) and nucleophosmin (NPM) with RNA, was considerably reduced after RNAPI inhibition. However, the recovery of the mRNA binding proteins PTB and HuR was unchanged under these conditions (Figure 1E). These observations strongly suggested that our method selectively depletes RNAPI RNA and its associated RBPs and therefore this approach could be used to characterise the RNAPI RNA interactome.
RBPs were isolated from control and ActD treated cells followed by quantitative label-free mass spectrometry to reveal RNA binding interactions that are dependent on RNAPI activity. We identified 457 potential RNA binding proteins in the nuclear proteome of MCF10A cells (Supplementary Table S2). Of these proteins, 211 showed reduced binding (3-fold or more; ActD/Ctrl ratio ≤ 0.35) following treatment with ActD and we classified these as RNAPI-dependent RBPs (RNAPI-dep; Figure 1F). Those proteins that do not show a significant change after RNAPI inhibition (246/457) were labelled as RNAPI-independent RBPs (RNAPI-indep; Figure 1F, Supplementary Table S2).
Analyses of these data showed that 384 of 402 proteins with Gene Ontology (GO) terms (95%) are functionally annotated as RNA binding (Supplementary Table S3: ‘GO:0003723’). As expected, many of the RBPs identified contain canonical RNA binding domains such as RRM and KH domains (Figure 2A) and interestingly, a significantly higher proportion of RNAPI-independent RBPs containing the RRM domain relative to their RNAPI-dependent counterparts (Figure 2A; P value = 1.562e–15). Furthermore, a comparison of the MCF10A RNA interactome capture data with those from three other cell lines (17,20) showed that 90% (409/457) of these candidate RBPs have been identified previously using RIC (Figure 2B). A substantial number of the remaining 10% (48/457) also have RNA binding functions (red circles, Supplementary Figure S2A). In conjunction, these analyses confirm that our method has delivered a high confidence set of RNA binding proteins.
Functional analysis supports RNAPI-dependent RBPs as nucleolar resident proteins.
We predicted that the majority of nuclear RBPs that have decreased RNA binding upon RNAPI inhibition are located in the nucleolus. Using GO annotations, 193 of the 457 nuclear RBPs are known to be resident in the nucleolus (‘GO:0005730’) of which 75% (144) are RNAPI-dependent (Supplementary Table S2). In addition, 56% (257/457) of the nuclear RBPs identified in this study were present in the nucleolar proteome characterised by Andersen et al. (21) with 79% (166/211) of the RNAPI-dependent and 37% (91/246) of the RNAPI-independent RBPs described as nucleolar (Figure 2C, Supplementary Tables S4 and S5). To gain further support for the nucleolar localisation of the RNAPI-dependent RBPs, we searched for the presence of nucleolar localisation signals (NoLS; (22)) and nucleolar detention signals (NoDS; (23)). Our data showed no difference between NoDs in the RNAPI-dependent versus independent RBPs. However, there was a significant difference in the distribution of NoLS, with more NoLS decorating RNAPI-dependent proteins than RNAPI-independent proteins (Supplementary Figure S2B). Taken together these observations support the nucleolar localisation of the RNAPI-dependent RBPs.
To gain insight into the function of the nuclear RBPs, we performed enrichment analyses using Kyoto Encyclopedia of Gene and Genomes (KEGG) pathways and GO terms (Figure 2D and E; Supplementary Table S3 and S6). KEGG pathway analysis revealed that the RNAPI-dependent RBPs are enriched for proteins involved in ribosome biogenesis, whereas the RNAPI-independent RBPs play a role in splicing, mRNA surveillance and RNA transport (Figure 2D). When GO enrichment was applied to the RNAPI-dependent proteins, the terms identified are rRNA processing (GO:0006364), maturation of SSU-rRNA from tricistronic rRNA transcript (GO:0000462) and ribosomal large subunit biogenesis (GO:0042273) with gene products associated with the nucleolus, the small subunit processome and the large subunit pre-ribosome (Figure 2E; Supplementary Table S6). In addition, 44% (92/211) of the RNAPI-dependent RBPs were also identified as playing a role in ribosomal RNA processing in a large scale siRNA screen (2), whereas, only 10% (27/246) of the RNAPI-independent RBPs scored positive in this screen (Figure 2C). Thus, the RNAPI-dependent RBPs are highly enriched for ribosome biogenesis functions. In contrast, the biological processes associated with the RNAPI-independent RBPs are mRNA splicing (GO:000398), mRNA processing (GO:006397) and the regulation of alternative splicing (GO:000381) (Figure 2E).
We chose to investigate further RNAPI-dependent RBPs that have no canonical RNA binding domains and have a poorly characterised role in ribosome biogenesis (Figure 2F). Thus, AATF, NOL10 and NGDN are part of the recently described ANN complex that may play a role in pre-rRNA cleavage (7). TAP26 and IFI16 were previously identified as potential ribosome biogenesis factors (2). Finally, the Ku proteins (KU70/80) are known to be nucleolar and interact with the telomerase RNA (hTR) (24,25), but they have never been defined as ribosome biogenesis factors. All of these candidate RNAPI-dependent RBPs were identified in previous RNA interactome analyses, supporting our observations that they contact RNA. Nevertheless, there is no direct evidence that any of these proteins interact with pre-rRNA.
RNAPI-dependent RBPs accumulate in the nucleolus
In the first instance, we analysed the subcellular distribution of our chosen proteins to confirm that they accumulate in the nucleolus. For immunolocalisation of AATF, KU70, NOL10 and IFI16, we used antibodies to the endogenous proteins. Commercial antibodies of sufficient quality were not available for immunolocalisation of NGDN or TAP26 and therefore we generated stable inducible cell lines expressing epitope tagged proteins using the Flp-In T-REx system (Supplementary Figure S3). All of our subsequent experiments were performed in U-2 OS cells since we encountered problems establishing a stable Flp-In MCF10A cell line. We felt justified in doing so because ribosome biogenesis is an extremely well conserved process and U-2 OS are a common model cell line to study nucleolar functions. In addition, large-scale protein localisation data are available for these cells as part of the Human Atlas protein project (26). We also investigated the role of RNAPI transcription on the nucleolar retention of the RNAPI-dependent RBPs using the RNAPI inhibitors ActD and CX5461. CX5461 is a selective inhibitor of RNAPI that prevents the polymerase from loading on to the rDNA promoter (27,28). We used ActD and CX5461 at concentrations that substantially inhibited rRNA synthesis, but had no effect on mRNA synthesis. U-2 OS cells were treated with either CX5461 or ActD and rRNA transcription was measured using RT-qPCR analysis of precursor rRNA that contain the 5′ETS. Treatment of cells with CX5461 or 10 nM ActD substantially inhibited rRNA transcription, whereas the abundance of the short-lived c-myc mRNA was unaltered by these treatments (Supplementary Figure S4).
In unperturbed cells, the majority of AATF, TAP26, NOL10, IFI16 and NGDN was found in the nucleolus (Figure 3A–E). In contrast, as reported previously (24), KU70 was found predominantly in the nucleoplasm with some accumulation in the nucleolus (Figure 3F). After inhibition of RNAPI transcription with ActD or CX5461, KU70 staining was no longer found in the nucleolus, suggesting that RNAPI activity facilitates the retention of KU70 in the nucleolus (Figure 3F). A significant quantity of TAP26, IFI16 and NGDN was also released from the nucleolus after RNAPI inhibition (Figure 3B, D, E). However, ActD and CX5461 had little or no effect on the nucleolar localisation of NOL10 or AATF (Figure 3A and C). We speculate that in the case of proteins that remain associated with the nucleolus following RNAPI inhibition such as AATF and NGDN, both RNA-protein and protein-protein interactions are required for tethering in this organelle. However, proteins that are located in the nucleolus solely due to rRNA binding (e.g. IFI16 and KU70) are released following exposure to cell stresses that inhibit RNAPI allowing their relocation into the cytoplasm/nucleoplasm, where they may have alternative functions. Thus, all of our candidate RNAPI-dependent RBPs have a nucleolar presence, but the extent to which nucleolar accumulation depends on RNAPI transcription varies between proteins.
The interaction of AATF, NOL10, NGDN, IFI16, TAP26, KU70 and KU80 with RNA is disrupted following inhibition of RNAPI
To validate the RNAPI RNA interactome, we inhibited RNAPI activity using ActD or CX5461 and performed RNA interactome capture followed by western analysis for our proteins of interest. Nucleolin (NCL) or nucleophosmin (NPM), which are RBPs with well-described roles in ribosome biogenesis, were used as positive controls, and the mRNA binding protein, PTB, was used as a negative control. After inhibition of RNAPI, there was a considerable reduction in the recovery of AATF, NOL10, NGDN, TAP26 and IFI16 in the RIC samples and a more modest reduction in the recovery of the Ku proteins (Figure 4). The magnitude of the changes is in complete agreement with our mass spectrometry experiments (Figure 2F). The pre-rRNA binding proteins NCL and NPM also show decreased recovery in these experiments, whereas the recovery of mRNA binding protein PTB is unchanged. Taken together these data suggest that our candidate proteins interact with RNAPI RNA. For AATF, NOL10, NGDN, IFI16 and TAP26, inhibition of rRNA synthesis has a substantial impact on the recovery of these proteins by RIC, suggesting that the major RNA species that these proteins interact with in this assay is derived from RNAPI (Figure 3A–D). However, the partial inhibition of Ku protein RNA binding after RNAPI inhibition (Figure 3E and F) most likely reflects the interaction of these proteins with other classes of RNA, in addition to RNAPI transcripts (29,30).
AATF and NGDN are novel pre-rRNA binding proteins
To provide direct evidence for the interaction of proteins identified in our RNAPI interactome with ribosomal RNA we chose to study AATF and NGDN in further detail. It has been suggested that these proteins in combination with NOL10 form a complex, the ANN complex, that plays a role in 18S rRNA maturation by facilitating endonucleolytic cleavage within the 5′ETS and ITS1 (7). Our RNA interactome data confirms that all three proteins contact RNA derived from RNAPI, despite the lack of a recognizable RNA binding domain in any of the three proteins. Clearly, the major RNAPI transcript in the cell is pre-rRNA and therefore, we tested the hypothesis that the members of the ANN complex are novel pre-rRNA binding proteins.
Initially, we attempted to confirm the previously published interaction between these three proteins (7). When endogenous AATF was immuno-affinity isolated from U-2 OS cells, we found that significant quantities of NOL10 and NGDN co-purified with this protein, suggesting that the three proteins are present in a complex (Figure 5A). Interestingly, only NOL10 co-purified with AATF when AATF was immuno-affinity purified from nuclease-treated extracts (Figure 5A). The release of NGDN from the complex following nuclease treatment implies that NOL10 and AATF interact through protein-protein interactions, whereas NGDN is co-purified with AATF and NOL10 through RNA–protein interactions.
To probe the interaction of AATF and NGDN with pre-rRNA, we took a cross-linking immunoprecipitation approach. Thus, RNA-protein complexes were cross-linked in vivo using a short exposure to a relatively low concentration of formaldehyde. Subsequently, immuno-affinity isolation was performed with stringent lysis and wash conditions to selectively purify cross-linked RNA-protein complexes. After immunoprecipitation, the association of AATF and NGDN with pre-rRNA was investigated by RT-qPCR amplification of the 5′ETS pre-rRNA sequence. In the absence of cross-linking, pre-rRNA is enriched in the AATF immunoprecipitation by ∼40-fold compared to the IgG control. However, after cross-linking there is a 315-fold enrichment of pre-rRNA in the AATF immunoprecipitation (Figure 5B). These data indicate that we have selectively enriched cross-linked RNA-protein complexes under these conditions and that AATF interacts with the pre-rRNA. In addition, under the same conditions we observed a 92-fold enrichment of pre-rRNA after cross-linking immunoprecipitation of NGDN (Figure 5C). Therefore, NGDN also interacts with the pre-rRNA. We further confirmed the specificity of the interaction of AATF and NGDN with pre-rRNA by inhibiting the synthesis of pre-rRNA with either ActD or CX5461 and performing cross-linking immunoprecipitation RT-qPCR. The abundance of pre-rRNA in both AATF and NGDN cross-linked immunoprecipitations decreased dramatically after inhibition of RNAPI, confirming that AATF and NGDN specifically interact with the pre-rRNA (Figure 5D and E). When combined with our RNA interactome study, these data confirm that AATF and NGDN are novel pre-rRNA binding proteins.
DISCUSSION
RNA interactome capture was originally developed to identify novel mRNA binding proteins (18), but importantly our data show that it can also be used to identify RBPs that associate with other RNA species. Thus, we demonstrate that the same conditions that favour interaction between oligo(dT) and polyadenylated RNA also drive the interaction of an abundant RNA, such as the pre-rRNA, with the captured polyadenylated RNA (Figure 1A, Supplementary Figure S1). Therefore, we exploited this phenomenon to define the RBPs that associate with RNAPI RNA in the nucleus. We identified 211 RBPs with a >3-fold difference in RNA binding following the selective inhibition of RNAPI activity (Figures 1 and 2) and we have designated these RBPs as the RNAPI RNA interactome. Our data have important implications for the interpretation of the previously published RNA interactome capture experiments. We suggest that these data cannot be regarded as mRNA interactomes or even poly(A) RNA interactomes, but that they should be considered to be RNA interactomes. This conclusion is supported by analysis of the composite list of human RNA interactomes (6), which reveals that 87% (183/211) of our RNAPI-dependent RBPs are present in this list (Supplementary Table S6). Furthermore, a comparison of the composite list with the nucleolar protein database (NOPdbv2 (21)) reveals that 32% (442/1393) of these proteins are nucleolar (Supplementary Table S7).
An alternative strategy to identify RBPs associated with a specific transcript is to use oligonucleotides-linked to beads that target a defined RNA. For example, the recently developed method by Rogell et al., whereby proteins bound to the target RNA are captured by hybridization with antisense locked nucleic acid (LNA)/DNA oligonucleotides covalently coupled to a magnetic resin (31). However, although this technique was successfully employed to capture the RBPs bound to a reporter mRNA containing the Sex-lethal (Sxl) binding motifs, when LNAs were directed against 18S or 28S rRNAs very few ribosomal proteins were obtained, with <30 proteins identified. As expected, there is limited overlap between our data and the data of Rogell et al., however, many of the ribosomal RNA binding proteins identified by Rogell et al. were classified by our work as RNAPI independent (e.g. HNRNPs, YBX3, ELAV1), suggesting that some of these interactions may not be rRNA specific. The authors suggested that the dearth of ribosomal proteins identified was likely to be due to inefficient crosslinking of proteins bound to double-stranded RNA, which involves much of the ribosomal RNA. Importantly, our data show that the many interaction sites provided by mRNA-rRNA contacts in the RIC technique are sufficient to overcome such issues and the method that we use provides an accurate data set of the RBPs that interact with rRNA.
RNAPI is known to generate two species of RNA, the 45S pre-rRNA and non-coding RNA derived from the intergenic spacer region of rDNA (IGS rRNA) (32). IGS rRNA are approximately 100-fold less abundant than pre-rRNA (32), suggesting that the majority of the RNA-protein interactions we observe in the RNAPI RNA interactome involve pre-rRNA. Consistent with this interpretation, bioinformatic analysis reveals that terms associated with ribosome biogenesis are significantly enriched in our RNAPI-dependent RBPs (Figure 2) and ∼73% of these proteins have previously been shown to influence ribosome biogenesis (2,3). Thus, characterisation of the RNAPI RNA interactome has provided a comprehensive record of the RBPs that influence ribosome biogenesis, particularly those involved in the processing of pre-rRNA. Furthermore, we have now confirmed that AATF and NGDN, which have been shown to play a role in facilitating cleavage of the pre-rRNA within the 5′ETS and ITS1 (7), are novel pre-rRNA binding proteins. While previous studies (7) suggested that these proteins bind directly each other, our data suggest that they also interact through an RNA intermediate. Further investigation of these candidates using techniques such as UV-crosslinking combined with immunoprecipitation and sequencing could be used to define the sites of pre-rRNA interaction.
Importantly, the RNAPI RNA interactome provides a new tool to understand how RNA-protein interactions control ribosome biogenesis. These data are of considerable interest since there are a number of disorders that are associated with dysregulated ribosome biogenesis. These include leukaemias (e.g. T-ALL, CLL), solids tumours (e.g. colon, gastric) and a group of rare syndromes termed ribosomopathies, many of which are associated with an increased cancer risk ((33,34). Ribosome biogenesis has been identified as a pathway that can be targeted in tumorigenesis and small molecule inhibitors are currently under development (e.g. CX5461 (27,28)). Thus, it is reasonable to predict that the RBPs required for ribosome biogenesis could provide new therapeutic avenues to identify novel chemicals that target this process.
DATA AVAILABILITY
Code used in this study and supplementary data can be accessed via Bitbucket using this link https://bitbucket.org/emm13/2018_pineiro_nucleolarrbp/src.
Supplementary Material
ACKNOWLEDGEMENTS
We thank Professor Martin Bushell for critical reading of the manuscript. We thank Drs Tom S Smith for support with bioinformatic analysis and Rayner Queiroz for mass spectrometry.
SUPPLEMENTARY DATA
Supplementary Data are available at NAR Online.
FUNDING
BBSRC [BB/M006700/1 to D.P., J.A.]; Wellcome Trust (to M.R., K.S.L.); MRC programme funding [MC_UP_A600_1023 to A.E.W., M.S.]. Funding for open access charge: MRC/Wellcome Trust/BBSRC.
Conflict of interest statement. None declared.
REFERENCES
- 1. de la Cruz J., Karbstein K., Woolford J.L. Jr. Functions of ribosomal proteins in assembly of eukaryotic ribosomes in vivo. Annu. Rev. Biochem. 2015; 84:93–129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Tafforeau L., Zorbas C., Langhendries J.L., Mullineux S.T., Stamatopoulou V., Mullier R., Wacheul L., Lafontaine D.L.. The complexity of human ribosome biogenesis revealed by systematic nucleolar screening of pre-rRNA processing factors. Mol. cell. 2013; 51:539–551. [DOI] [PubMed] [Google Scholar]
- 3. Badertscher L., Wild T., Montellese C., Alexander L.T., Bammert L., Sarazova M., Stebler M., Csucs G., Mayer T.U., Zamboni N. et al. . Genome-wide RNAi screening identifies protein modules required for 40S subunit synthesis in human cells. Cell Rep. 2015; 13:2879–2891. [DOI] [PubMed] [Google Scholar]
- 4. Castello A., Fischer B., Eichelbaum K., Horos R., Beckmann B.M., Strein C., Davey N.E., Humphreys D.T., Preiss T., Steinmetz L.M. et al. . Insights into RNA biology from an atlas of mammalian mRNA-binding proteins. Cell. 2012; 149:1393–1406. [DOI] [PubMed] [Google Scholar]
- 5. Ryder S.P. Protein-mRNA interactome capture: cartography of the mRNP landscape [version 1; referees: 3 approved]. F1000Res. 2016; 5:2627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Hentze M.W., Castello A., Schwarzl T., Preiss T.. A brave new world of RNA-binding proteins. Nat. Rev. Mol. Cell. Biol. 2018; 19:327–341. [DOI] [PubMed] [Google Scholar]
- 7. Bammert L., Jonas S., Ungricht R., Kutay U.. Human AATF/Che-1 forms a nucleolar protein complex with NGDN and NOL10 required for 40S ribosomal subunit synthesis. Nucleic Acids Res. 2016; 44:9803–9820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. King H.A., Cobbold L.C., Pichon X., Poyry T., Wilson L.A., Booden H., Jukes-Jones R., Cain K., Lilley K.S., Bushell M. et al. . Remodelling of a polypyrimidine tract-binding protein complex during apoptosis activates cellular IRESs. Cell Death Differ. 2014; 21:161–171. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Dickens L.S., Boyd R.S., Jukes-Jones R., Hughes M.A., Robinson G.L., Fairall L., Schwabe J.W., Cain K., Macfarlane M.. A death effector domain chain DISC model reveals a crucial role for caspase-8 chain assembly in mediating apoptotic cell death. Mol. Cell. 2012; 47:291–305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Silva J.C., Gorenstein M.V., Li G.Z., Vissers J.P., Geromanos S.J.. Absolute quantification of proteins by LCMSE: a virtue of parallel MS acquisition. Mol. Cell. Prot. 2006; 5:144–156. [DOI] [PubMed] [Google Scholar]
- 11. Young M.D., Wakefield M.J., Smyth G.K., Oshlack A.. Gene ontology analysis for RNA-seq: accounting for selection bias. Genome Biol. 2010; 11:R14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Yu G., Wang L.G., Han Y., He Q.. ClusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012; 16:284–287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Wu C., Macleod I., Su A.I.. BioGPS and MyGene.info: organizing online, gene-centric information. Nucleic Acids Res. 2013; 41:D561–D565. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Xin J., Mark A., Afrasiabi C., Tsueng G., Juchler M., Gopal N., Stupp G.S., Putman T.E., Ainscough B.J., Griffith O.L. et al. . High-performance web services for querying gene and variant annotation. Genome Biol. 2016; 17:1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Glisovic T., Bachorik J.L., Yong J., Dreyfuss G.. RNA-binding proteins and post-transcriptional gene regulation. FEBS Lett. 2008; 582:1977–1986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Pinol-Roma S., Swanson M.S., Matunis M.J., Dreyfuss G.. Purification and characterization of proteins of heterogeneous nuclear ribonucleoprotein complexes by affinity chromatography. Methods Enzymol. 1990; 181:326–331. [DOI] [PubMed] [Google Scholar]
- 17. Castello A., Horos R., Strein C., Fischer B., Eichelbaum K., Steinmetz L.M., Krijgsveld J., Hentze M.W.. System-wide identification of RNA-binding proteins by interactome capture. Nat. Protoc. 2013; 8:491–500. [DOI] [PubMed] [Google Scholar]
- 18. Pinol-Roma S., Adam S.A., Choi Y.D., Dreyfuss G.. Ultraviolet-induced cross-linking of RNA to proteins in vivo. Methods Enzymol. 1989; 180:410–418. [DOI] [PubMed] [Google Scholar]
- 19. Bensaude O. Inhibiting eukaryotic transcription: Which compound to choose? How to evaluate its activity. Transcription. 2011; 2:103–108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Beckmann B.M., Horos R., Fischer B., Castello A., Eichelbaum K., Alleaume A.M., Schwarzl T., Curk T., Foehr S., Huber W. et al. . The RNA-binding proteomes from yeast to man harbour conserved enigmRBPs. Nat. Commun. 2015; 6:10127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21. Andersen J.S., Lam Y.W., Leung A.K., Ong S.E., Lyon C.E., Lamond A.I., Mann M.. Nucleolar proteome dynamics. Nature. 2005; 433:77–83. [DOI] [PubMed] [Google Scholar]
- 22. Scott M.S., Boisvert F.M., McDowall M.D., Lamond A.I., Barton G.J.. Characterization and prediction of protein nucleolar localization sequences. Nucleic Acids Res. 2010; 38:7388–7399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Mekhail K., Rivero-Lopez L., Al-Masri A., Brandon C., Khacho M., Lee S.. Identification of a common subnuclear localization signal. Mol. Biol. Cell. 2007; 18:3966–3977. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Li L.L., Yeh N.H.. Cell cycle-dependent migration of the DNA-binding protein Ku80 into nucleoli. Expt. Cell Res. 1992; 199:262–268. [DOI] [PubMed] [Google Scholar]
- 25. Pfingsten J.S., Goodrich K.J., Taabazuing C., Ouenzar F., Chartrand P., Cech T.R.. Mutually exclusive binding of telomerase RNA and DNA by Ku alters telomerase recruitment model. Cell. 2012; 148:922–932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Regev A., Teichmann S.A., Lander E.S., Amit I., Benoist C., Birney E., Bodenmiller B., Campbell P., Carninci P., Clatworthy M. et al. . The human cell atlas. eLife. 2017; 6:e27041. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Drygin D., Lin A., Bliesath J., Ho C.B., O’Brien S.E., Proffitt C., Omori M., Haddach M., Schwaebe M.K., Siddiqui-Jain A. et al. . Targeting RNA polymerase I with an oral small molecule CX-5461 inhibits ribosomal RNA synthesis and solid tumor growth. Cancer Res. 2011; 71:1418–1430. [DOI] [PubMed] [Google Scholar]
- 28. Bywater M.J., Poortinga G., Sanij E., Hein N., Peck A., Cullinane C., Wall M., Cluse L., Drygin D., Anderes K. et al. . Inhibition of RNA polymerase I as a therapeutic strategy to promote cancer-specific activation of p53. Cancer Cell. 2012; 22:51–65. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Britton S., Coates J., Jackson S.P.. A new method for high-resolution imaging of Ku foci to decipher mechanisms of DNA double-strand break repair. J. Cell Biol. 2013; 202:579–595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Zhang Y., He Q., Hu Z., Feng Y., Fan L., Tang Z., Yuan J., Shan W., Li C., Hu X. et al. . Long noncoding RNA LINP1 regulates repair of DNA double-strand breaks in triple-negative breast cancer. Nat. Struct. Mol. Biol. 2016; 23:522–530. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Rogell B., Fischer B., Rettel M., Krijgsveld J., Castello A., Hentze M.W.. Specific RNP capture with antisense LNA/DNA mixmers. RNA. 2017; 23:1290–1302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32. Mayer C., Schmitz K.M., Li J., Grummt I., Santoro R.. Intergenic transcripts regulate the epigenetic state of rRNA genes. Mol. Cell. 2006; 22:351–361. [DOI] [PubMed] [Google Scholar]
- 33. De Keersmaecker K., Sulima S.O., Dinman J.D.. Ribosomopathies and the paradox of cellular hypo- to hyperproliferation. Blood. 2015; 125:1377–1382. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Sulima S.O., Hofman I.J.F., De Keersmaecker K., Dinman J.D.. How ribosomes translate cancer. Cancer Dis. 2017; 7:1069–1087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35. Knight J.R., Bastide A., Peretti D., Roobol A., Roobol J., Mallucci G.R., Smales C.M., Willis A.E.. Cooling-induced SUMOylation of EXOSC10 down-regulates ribosome biogenesis. RNA. 2016; 22:623–635. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Code used in this study and supplementary data can be accessed via Bitbucket using this link https://bitbucket.org/emm13/2018_pineiro_nucleolarrbp/src.