Abstract
U5 snRNP is a complex particle essential for RNA splicing. U5 snRNPs undergo intricate biogenesis that ensures that only a fully mature particle assembles into a splicing competent U4/U6•U5 tri-snRNP and enters the splicing reaction. During splicing, U5 snRNP is substantially rearranged and leaves as a U5/PRPF19 post-splicing particle, which requires re-generation before the next round of splicing. Here, we show that a previously uncharacterized protein TSSC4 is a component of U5 snRNP that promotes tri-snRNP formation. We provide evidence that TSSC4 associates with U5 snRNP chaperones, U5 snRNP and the U5/PRPF19 particle. Specifically, TSSC4 interacts with U5-specific proteins PRPF8, EFTUD2 and SNRNP200. We also identified TSSC4 domains critical for the interaction with U5 snRNP and the PRPF19 complex, as well as for TSSC4 function in tri-snRNP assembly. TSSC4 emerges as a specific chaperone that acts in U5 snRNP de novo biogenesis as well as post-splicing recycling.
Subject terms: RNA quality control, RNA splicing
The correct assembly and recycling of the multicomponent spliceosome remains largely elusive. Here, the authors show that a previously uncharacterized protein TSSC4 associates with de novo formed spliceosomal U5 snRNP as well as with a post-splicing U5-PRPF19 particle, and that TSSC4 is important for assembly of the splicing competent tri-snRNP.
Introduction
Pre-mRNA splicing is a key step in protein expression catalyzed by one of the largest complexes in the cell called the spliceosome. The human spliceosome is a dynamic multicomponent molecular machine that assembles de novo on each intron to be removed and needs to be recycled after each round of splicing. Over 160 proteins and 5 small noncoding nuclear RNAs (snRNAs) are involved in spliceosome formation and rearrangements during RNA splicing1. SnRNAs together with associated proteins form small nuclear ribonucleoprotein particles (snRNPs), which are basic building blocks of the spliceosome. Almost all human introns are removed by the major snRNPs named U1, U2, U4, U5, and U6, while a small fraction of introns (U12 type) is removed by the minor spliceosome consisting of U11, U12, U4atac, and U6atac. U5 snRNP is shared by both major and minor pathways (reviewed in ref. 2).
U5 snRNP contains U5 snRNA tightly associated with a heptameric ring of Sm proteins and eight U5-specific proteins. Sm ring assembly on U5 snRNA is promoted and controlled by SMN and PRMT5 complexes in the cytoplasm and the core snRNP (U5 snRNA+7 Sm proteins) is imported to the nucleus where it interacts with additional U5 proteins3–7. Four of the U5-specific proteins PRPF8/U5-220K, SNRNP200/hBrr2/U5-200K, EFTUD2/hSnu114/U5-116K, and SNRNP40/WDR57/U5-40K form a stable RNA-free Heterotetrameric Complex (RHC)8. Yeast and human PRPF8 associate with the chaperone AAR2, which helps to properly fold PRPF89–11. PRPF8 then interacts with EFTUD2 and SNRNP20012. In yeast, Aar2p is exchanged with Brr2p13,14, while in humans AAR2 co-purifies with all four RHC members12, which indicates that AAR2 stays associated until the whole RHC is formed. Initial phases of RHC formation are further promoted and controlled by the R2TP/HSP90 chaperone, which interacts both directly and indirectly via a bridging factor ZNHIT2 with EFTUD212,15,16. PRPF8 and likely the whole pre-formed RHC then interacts with the core U5 snRNP in nuclear Cajal bodies7 and this step is promoted by another chaperone ECD17. Finally, the last four U5-specific proteins PRPF6/U5-102K, DDX23/U5-100K, CD2BP2/U5-52K, and TXNL4A/hDib1/Dim1/U5-15K join the complex, but little is known about this step of U5 snRNP biogenesis. Three of these proteins, namely PRPF6, CD2BP2, and TXNL4A directly interact with each other18–20 but whether they need the assistance of any chaperone to be incorporated into maturing U5 snRNP is unknown.
Before U5 snRNP can participate in splicing it needs to associate with U4/U6 di-snRNP to create the splicing competent U4/U6•U5 tri-snRNP. During tri-snRNP assembly, CD2BP2 protein is released and at least three additional proteins are added18,21,22. The PRPF6 protein is the key factor that interacts with U5 and U4/U6 proteins and bridges both particles in the tri-snRNP19,23–25. Consistently, downregulation of PRPF6 inhibits the tri-snRNP formation and results in sequestration of individual tri-snRNP components in Cajal bodies24,26.
During splicing reaction, the whole spliceosome and particularly the tri-snRNP undergo significant rearrangements. All di-snRNP proteins and a half of U5-specific proteins leave during spliceosome activation and only RHC and Sm proteins remain bound to U5 snRNA1. When splicing is completed, a post-splicing complex is released by the concerted action of RNA helicases DHX8/PRPF22 and DHX15/PRPF4327–30. The post-splicing U5 snRNP leaves the spliceosome together with the PRPF19 complex as a 35S particle31,32. How U5 snRNP is recycled from this complex into the active tri-snRNP form is currently unknown.
U5 chaperones AAR2 and ZNHIT2 co-precipitate with U5-specific proteins and several poorly characterized factors, including ECD, TSSC4, NCDN12,15. Among these proteins, only ECD has been shown to interact with U5 snRNP and its chaperones and promote U5 snRNP assembly15,17,33,34. Here, we focused on the protein TSSC4, which repeatedly co-purified with U5-specific proteins as well as with AAR2 and ZNHIT2 chaperones12,15,35 and analyzed its potential role in snRNP metabolism. Human TSSC4 has been identified as a gene that escapes imprinting in an otherwise imprinted genomic region36. A single-nucleotide polymorphism in TSSC4 has been associated with reduced body height in humans but the molecular function of this protein is unknown37. In this work, we provide evidence that TSSC4 associates with the U5 snRNP and the U5/PRPF19 complex. We identified TSSC4 domains important for interactions with the U5 snRNP and the PRPF19 complex and show how TSSC4 downregulation affects tri-snRNP formation. Based on our results, we propose that TSSC4 is a key protein involved in both de novo biogenesis as well as post-splicing recycling of U5 snRNP.
Results
TSSC4 associates with mono-U5 and U5/PRPF19 complexes
To get insight into a function of TSSC4 in snRNP biogenesis, we first mapped its interaction partners by an unbiased quantitative proteomic approach based on stable isotope labeling in cell culture in conjunction with immunoprecipitation (SILAC IPs)12,38,39. We tagged TSSC4 with either GFP or FLAG, expressed labeled proteins in HeLa cells, and immunopurified them with anti-GFP or anti-FLAG antibodies, respectively (Fig. 1a and Supplementary Data 1). The proteomic analysis of both GFP and FLAG IPs revealed a strong association of TSSC4 with Sm proteins and all U5-specific proteins including CD2BP2, a marker of mono-U5 snRNP, which is not a component of the U4/U6•U5 tri-snRNP18. We also identified several U5 chaperones such as AAR2, ZNHIT2, RUVBL1, RUVBL2, and ECD and a putative chaperone EAPP that has been shown to interact with U5 proteins PRPF8 and EFTUD212,15,17. In addition, tagged TSSC4 co-purified with low levels of PRPF19, a core component of the PRPF19 complex. All detected TSSC4 interactors are listed in Supplementary Data 1.
To confirm the SILAC-IP results, we immunoprecipitated TSSC4-GFP and TSSC4-FLAG and detected co-precipitated proteins by western blotting (Fig. 1b and Supplementary Fig. 1a). This result confirmed that TSSC4 interacts with U5 snRNP and the PRPF19 complex and co-precipitated all U5-specific proteins, Sm proteins, and core subunits of the PRPF19 complex (PRPF19 and CDC5L). At the same time, TSSC4 did not pull down SART3, a marker of U4/U6 di-snRNP, the component of U4/U6 and U4/U6•U5 particles PRPF31 and U2-specific SNRPA1 (U2A’). Consistently with the set of co-precipitated proteins, TSSC4-GFP also pulled down specifically the U5 snRNA (Fig. 1c). We did not detect any TSSC4 self-interaction because ectopically expressed TSSC4-FLAG did not pull down endogenous TSSC4 (Supplementary Fig. 1b). We noticed that TSSC4 migrates in SDS-polyacrylamide gels around 50 kDa while its predicted molecular weight is 34 kDa (Supplementary Fig. 1b). TSSC4 is an acidic protein with the isoelectric point 4.81. A similar shift in apparent molecular weight has been described for another acidic U5-specific protein CD2BP2 with isoelectric point 4.4918. Finally, we provided evidence that all the detected interactions of TSSC4 with U5 and PRPF19 complex proteins are RNA independent (Fig. 1d, see also control RNA gel at Supplementary Fig. 1c). These data show that TSSC4 associates specifically with U5 snRNP-containing complexes and that this binding is based on protein–protein interactions.
To further characterize TSSC4 containing complexes, we immunoprecipitated TSSC4-FLAG, eluted precipitated complexes with a FLAG peptide, and resolved them by ultracentrifugation in a glycerol gradient (Fig. 2a, bottom panel). We observed two discrete U5 snRNA-containing complexes. One complex sedimented in fractions 14–16, which was a position where the mono-U5 snRNP migrated when the nuclear extract was resolved on an identical gradient run in parallel (Fig. 2a, top panel). Consistent with the prediction that this complex represents the U5 snRNP, we detected U5-specific proteins PRPF8 and EFTUD2 but not a U4/U6 di-snRNP marker PRPF4 in these fractions (Fig. 2b). We also observed endogenous TSSC4 (marked by an asterisk) sedimenting in nuclear extracts in the same fractions as the U5 snRNP. Surprisingly, we detected only traces of TSSC4-FLAG (marked by double asterisks) co-sedimenting with U5 snRNP, which indicates that the FLAG tag might partially destabilize TSSC4 association with U5 snRNP. A second U5 snRNA-containing complex precipitated by TSSC4-FLAG sedimented in lower fractions 19–20 where we detected U5-specific proteins PRPF8 and EFTUD2 and PRPF19. These data indicate that TSSC4 interacts with two distinct U5 snRNA-containing particles, one can be identified as a mono-U5 snRNP and the second complex contains U5 and PRPF19 complex markers, which is consistent with a spectrum of TSSC4 associated proteins detected by immunoprecipitations (Fig. 1).
U5 snRNP and the PRPF19 complex interact with different TSSC4 domains
Next, we decided to determine TSSC4 protein domains important for interactions with U5 snRNP and the PRPF19 complex. Analysis of the human TSSC4 sequence by BLAST and Phyre2 did not reveal any significant homology to known protein domains. We, therefore, decided to perform an unbiased screen and deleted consecutive windows of 50 amino acids (aa) from TSSC4 protein (Fig. 3a). All deletion mutants were tagged with GFP at the C-terminus and transiently expressed in HeLa cells. All TSSC4 constructs localized primarily to the cell nucleus and only the Δ201–250 construct localized weakly also to the cytoplasm (Supplementary Fig. 2a). TSSC4-GFP full-length and deletion mutants were immunoprecipitated by anti-GFP antibodies and co-precipitation of U5 snRNP-specific proteins and all four core PRPF19 complex subunits (PRPF19, CDC5L, PLRG1, and BCAS2) was monitored by western blotting (Fig. 3b). In parallel, we also monitored co-purified RNAs (Supplementary Fig. 2b). We identified two protein regions that were important for TSSC4 interaction with its partners. Deletion of aa 51–100 (Δ51–100) reduced TSSC4 interaction with U5 snRNP components (Fig. 3b, top panel and Supplementary Fig. 2b) and removal of aa 201–250 (Δ201–250) almost completely abolished association with the PRPF19 complex (Fig. 3b, bottom panel). We also noticed a differential effect of the 51–100 aa deletion on the interaction with U5 subunits. While U5 snRNA, SNRPB/SmB, CD2BP2, and PRPF6 binding was completely lost upon 51–100 aa deletion, only partial reduction was observed in the case of RHC proteins PRPF8, SNRNP200, and EFTUD2. Unexpectedly, the interaction of the Δ51–100 mutant with SNRNP40, which is also a component of RHC was also abolished. In addition, a small but reproducible reduction of SNRNP200 and PRPF8 binding was also observed in the case of the Δ201–250 deletion mutant. Deletion of the last 30 aa at the C-terminus destabilized the protein. We, therefore, expressed and immunoprecipitated the last 50 C-terminal aa tagged with GFP but did not detect any interaction with U5 snRNP or PRPF19 complex components.
A protein sequence comparison of TSSC4 done using T-Coffee tool40 across the animal kingdom revealed six conserved domains, which we named Hom1–Hom6 (Supplementary Fig. 2c and Fig. 3c). These homology regions largely correspond to conserved regions identified previously by comparing human and mouse TSSC4 sequences41. We also performed an in silico analysis of TSSC4 ordered/disordered regions (Supplementary Fig. 2d) using MetaDisorderMD242,43. This analysis predicted that most TSSC4 sequence is largely disordered with a few exceptions involving the central part of the molecule spanning Hom3 and 4 domains, which shows a weak tendency for ordered structures.
The 51–100 aa window overlaps with the conserved Hom2 region while aa 201–250 contains the conserved Hom4 sequence. To further narrow down interaction domains of TSSC4 with U5 snRNP and PRPF19 complex, we prepared several additional constructs (Fig. 3c). First, we deleted 24 aa (V77-L100) that overlap Δ51–100 and Hom2 regions (ΔHom2 construct) and 31 aa (S200-R230) that intersect Δ201–250 and Hom4 sequences (ΔHom4 construct). We also mutated eight or six conserved aa in Hom2 and Hom4 domains into alanines. All constructs containing GFP at the C-terminus were expressed transiently in HeLa cells. They localized into the cell nucleus and weakly also in the cytoplasm in the case of the ΔHom4 and Hom4-3_Ala constructs (Supplementary Fig. 2e). All TSSC4-GFP constructs were immunoprecipitated using anti-GFP antibodies and co-precipitated components of U5 and PRPF19 complexes were monitored by western blotting (Fig. 3d). Deletion of the conserved regions Hom2 and Hom4 had the same effect as deletion of aa 51–100 and 201–250, respectively. Removal of Hom2 region reduced the interaction with U5 snRNP, and deletion of Hom4 reduced association with the PRPF19 complex. More detailed mutagenesis revealed 16 conserved aa (M85-L100) essential for interaction with PRPF6 and SNRPB proteins (Hom2-2_Ala and Hom2-3_Ala; Fig. 3d). Substitution of these aa with alanines also partially reduced but not completely abolished association with U5 proteins PRPF8, SNRNP200, and EFTUD2. We further identified six conserved aa in the middle of the Hom4 domain (V213-P218) that are essential for association with the PRPF19 complex (Hom4-2_Ala). We also removed Hom1 region close to the N-terminus but this deletion destabilized the whole protein. We, therefore, substituted Hom1 aa into alanines (Hom1-1_Ala and Hom1-2_Ala). We still observed reduced stability of Hom1-1_Ala but Hom1-2_Ala was expressed at similar levels as other TSSC4 constructs (see GFP signal in inputs at Fig. 3d). We observed a weak phenotype on SNRNP200 binding in the case of Hom1-1_Ala construct, which might reflect a lower expression of this construct. A similarly weak but reproducible reduction of the TSSC4-SNRNP200 association was observed for ΔHom2 and Hom2-2_Ala, Hom2-3_Ala substitution mutants and for ΔHom4 and Hom4-2_Ala constructs.
Deletion of the central part of TSSC4 (aa151–200) did not reveal any binding phenotype despite it contained the most conserved region of TSSC4–Hom3. We, therefore, created a double ΔHom2ΔHom3 mutant, tagged it with GFP and transiently expressed in HeLa cells. The double mutant also localized into the cell nucleus (Supplementary Fig. 2e). Analysis of U5 subunits co-precipitating with the double mutant revealed a significant reduction of all U5 components and only SNRNP200 binding was unchanged (Fig. 3e).
To identify a minimal part of TSSC4 that recapitulates the binding profile of the full-length protein, we created several constructs spanning various regions of the central part of TSSC4 whose deletion reduced interaction with U5 snRNP and PRPF19 complex (Fig. 3f). All constructs were tagged with GFP and after transient expression in HeLa cells immunoprecipitated with anti-GFP antibodies (Fig. 3g). Hom2 alone did not associate with any of the tested proteins but together with Hom3, this construct pulled down all tested U5-specific and Sm proteins. In addition, we also detected a weak association with PRPF19 complex components PRPF19 and CDC5L, which likely reflect immunoprecipitation of the U5/PRPF19 complex. All constructs localize equally to the nucleus and the cytoplasm (Supplementary Fig. 2f) and TSSC4 fragments containing domain 2 (Hom2) and domains 3 and 4 (Hom3-Hom4) were partially destabilized and expressed on a lower level than other constructs (Fig. 3g, see GFP signal).
These results show that three conserved regions bind different partners. Hom2 is essential and together with Hom3 also sufficient for association with U5 snRNP. Both domains make specific contacts with U5 subcomplexes where Hom3 primarily interacts with PRPF8 and EFTUD2 proteins that form a subcomplex early during biogenesis12 and Hom2 mediates interaction with the mature U5 particle. Hom4 domain is essential but not sufficient for binding of the PRPF19 complex. We did not identify a mutant that would completely abolish TSSC4 association with SNRNP200, which indicates multiple contacts between these two proteins. To further narrow TSSC4 interaction partners, we tested TSSC4 interaction with all U5-specific proteins and the PRPF19 protein in yeast two-hybrid assay. However, we did not detect any positive interaction with any of the tested proteins (Supplementary Table 1). Given these negative results and the fact that we found multiple interaction platforms for U5 snRNP, we speculate that TSSC4 makes multiple weak contacts with several U5 snRNP and PRPF19 complex proteins.
TSSC4 can contact U5-specific proteins already in the cytoplasm
The interaction between TSSC4 and AAR2 suggests that TSSC4 can associate with U5 proteins early during biogenesis. To test whether TSSC4 can interact with U5-specific proteins already in the cytoplasm, we performed a tethering assay that we previously applied to monitor interactions of chaperones AAR2 and R2TP with PRPF812. We fused TSSC4 with a fragment of DDX6/p54 that targets the fusion protein to P-bodies, and with dsRed to monitor the fusion protein localization44. We expressed the fusion protein TSSC4-DDX6-dsRed in HeLa cells together with five U5-specific proteins and SNRPD1/SmD1 (Fig. 4). We observed strong P-body localization of PRPF8-GFP and weaker but also clear tethering of EFTUD2-GFP and SNRNP200-GFP. However, these proteins make a preassembled particle already in the cytoplasm12 and we therefore cannot unambiguously determine which of these proteins is the binding partner for TSSC4. U5-specific proteins PRPF6-FLAG and CD2BP2-GFP, and SNRPD1-GFP were not accumulated in P-bodies when co-expressed with the TSSC4-DDX6-dsRed fusion protein. These data suggest that TSSC4 interacts preferentially with RHC components, and this interaction can occur already in the cytoplasm before their import to the nucleus. Based on these results and the fact that TSSC4 associates with known RHC chaperones AAR2, ZNHIT2, and RUVBL1/2 as well as all U5-specific proteins and U5 snRNA (Fig. 1 and ref. 12), we conclude that TSSC4 associates with RHC subunits during their biogenesis in the cytoplasm and stays associated till the complete U5 snRNP is formed.
However, TSSC4 is mainly nuclear protein and this model of action predicts that it is constantly exported to the cytoplasm and reimported to the nucleus. To test whether TSSC4 shuttle between the nucleus and the cytoplasm, we performed a heterokaryon shuttling assay. TSSC4-GFP and PRPF8-GFP as negative control were expressed in HeLa cells together with mCherry, which freely diffuses between the nucleus and the cytoplasm and serves as a marker of successful fusion. After proteosynthesis inhibition with cycloheximide or puromycin, HeLa cells were fused with mouse NIH-3T3 fibroblasts and the presence of GFP and mCherry signals in mouse nuclei monitored after 3 h (Supplementary Fig. 3). While PRPF8-GFP remained in HeLa nuclei, TSSC4-GFP clearly localized to mouse nuclei showing its ability to shuttle between the nucleus and the cytoplasm.
TSSC4 promotes U4/U6•U5 tri-snRNP formation
Based on the TSSC4 interaction profile we speculated that TSSC4 could play a role in U5 snRNP metabolism. We and others have previously shown that the accumulation of U4, U5, and U6 snRNAs in Cajal bodies represents a convenient readout of the U5 snRNP assembly and recycling status. For example, inhibition of U5 snRNP maturation by PRPF8 or R2TP downregulation results in the accumulation of U5, U4, and U6 snRNAs in Cajal bodies7,12. In contrast, inhibition of post-spliceosomal complex recycling caused specific accumulation of U4 and U6 snRNAs but not U5 snRNA45. To test a function of TSSC4 in U5 snRNP biogenesis and recycling, we reduced TSSC4 expression by RNA interference and assayed localization of snRNAs in Cajal bodies (Fig. 5). TSSC4 downregulation increased Cajal body accumulation of U4, U5, and U6 snRNAs but not U2 snRNA, which indicated specific defects in U5 and/or tri-snRNP biogenesis. To learn more about the identity of U5 snRNP sequestered in the Cajal body, we analyzed the presence of several U5-specific proteins in Cajal bodies after TSSC4 knockdown. Surprisingly, the Cajal body localization of all tested U5-specific proteins was also increased after TSSC4 knockdown (Fig. 6). In addition, TSSC4 knockdown induced Cajal body accumulation of mono-U5-specific marker CD2BP2, which indicated that the mono-U5 snRNP localized to Cajal bodies. In contrast, we did not observe increased Cajal body localization of PRPF19, which suggested that the U5/PRPF19 complex did not accumulate in Cajal bodies. These data indicated that in the absence of TSSC4, U5 snRNP accumulates in Cajal bodies but this U5 particle is not able to make a contact with U4/U6 di-snRNP and form the functional tri-snRNP, resulting in the accumulation of U4, U5, and U6 snRNAs in this nuclear structure.
This was a rather unexpected finding because inhibition of tri-snRNP assembly by PRPF31 and PRPF6 knockdown induced Cajal body accumulation of U4 and U6 snRNAs only24. We therefore first wanted to confirm that U5 snRNP proteins and specifically PRPF6, which is the U5-specific protein that interacts with U4/U6 di-snRNP and is essential for tri-snRNP assembly23,46, are present in U5 snRNP after TSSC4 downregulation. TSSC4 was downregulated in HeLa cells stably expressing PRPF8-GFP47, PRPF8-GFP was immunoprecipitated using anti-GFP antibodies and co-precipitation of PRPF6 and other U5-specific proteins assayed by western blotting (Fig. 7a). We did not notice any difference in protein composition of U5 snRNP in control cells and cells treated with anti-TSSC4 siRNA, which showed that the removal of TSSC4 did not significantly affect the composition of the U5 snRNP and in particular the tri-snRNP bridging factor PRPF6 was still present.
To confirm that TSSC4 is important for U4/U6•U5 tri-snRNP formation, we knocked down TSSC4 and analyzed snRNP complexes by ultracentrifugation in glycerol gradients (Fig. 7b). TSSC4 downregulation resulted in redistribution of U5 snRNP proteins from fractions where U4/U6•U5 tri-snRNP migrates into lighter fractions, indicating inhibition of tri-snRNP formation or instability of the tri-snRNP particle. These data are consistent with fluorescence results (Figs. 5–6) and together show that TSSC4 is important for tri-snRNP assembly and/or stability. If TSSC4 depletion reduces tri-snRNP concentration then it should also affect RNA splicing. To test this prediction, we downregulated TSSC4 by RNAi and analyzed the splicing efficiency of seven genes (Fig. 7c). Downregulation of PRPF8, which was used as a positive control, reduced splicing in six cases. TSSC4 reduction exhibited a much milder effect and we observed significantly reduced splicing of two genes (LDHA and RPL19). It should be noted that PRPF8 knockdown also partially destabilized TSSC4 but TSSC4 downregulation did not alter PRPF8 expression (Supplementary Fig. 4).
Finally, we decided to probe whether TSSC4 domains essential for binding of U5 snRNP (Hom2) and the PRPF19 complex (Hom4) are also important for tri-snRNP biogenesis. We downregulated endogenous TSSC4 by RNAi, expressed GFP tagged wild-type TSSC4 or deletion/substitution mutants resistant to anti-TSSC4 siRNA, and monitored U5 snRNA localization in Cajal bodies as a readout of tri-snRNP biogenesis (Fig. 8a, b). Expression of the full-length TSSC4 clearly reduced U5 snRNA accumulation in Cajal bodies, indicating rescue of the tri-snRNP assembly and the specificity of the knockdown. This rescue was mostly independent of the TSSC4-GFP expression level (Supplementary Fig. 5). TSSC4 lacking the Hom2 domain partially rescued the phenotype but was about 2 times less efficient than wild-type TSSC4. Expression of constructs where PRPF19 interacting domain was missing (ΔHom4) or mutated (Hom4-2_Ala) significantly reduced U5 snRNA accumulation in Cajal bodies but the rescue was also less efficient than in the case of the full-length TSSC4. These data indicate that both the U5 snRNP interaction domain Hom2 and the PRPF19 binding domain Hom4 are important for the correct function of TSSC4 in tri-snRNP assembly.
Discussion
In this manuscript, we describe a function for the previously uncharacterized protein TSSC4. We show that TSSC4 interacts (Fig. 1) and co-sediments (Fig. 2) with the U5 snRNP but we did not find any indication that TSSC4 is a stable component of the U4/U6•U5 tri-snRNP. Based on these data, we propose that TSSC4 is a mono-U5 snRNP-specific protein. TSSC4 also co-precipitates known U5 snRNP chaperones AAR2, ZNHIT2, ECD, and R2TP, which specifically function during assembly of RHC9–12,15,33. This finding suggests that TSSC4 is involved in specific steps of U5 snRNP maturation involving RHC. This conclusion is consistent with TSSC4 ability to shuttle between the nucleus and the cytoplasm (Supplementary Fig. 3), and our tethering experiment revealing that TSSC4 is able to interact with RHC components PRPF8, SNRNP200, and EFTUD2 already in the cytoplasm where these proteins form a stable complex (Fig. 4)12. However, TSSC4 downregulation did not reveal any defects in U5 snRNP composition (Fig. 7), and all the tested mono-U5-specific proteins accumulated in Cajal bodies after TSSC4 knockdown (Fig. 6). These results strongly indicate that TSSC4 is not important for the incorporation of individual U5 snRNP components into a maturing particle. However, TSSC4 downregulation caused defects in tri-snRNP formation and/or stability (Figs. 5–7). Based on the fact that TSSC4 is not a stable component of the tri-snRNP and thus it is difficult to imagine how it can stabilize this particle, we propose that TSSC4 promotes tri-snRNP assembly. Because all components are present in the U5 snRNP upon TSSC4 knockdown, we propose that TSSC4 functions as a chaperone that facilitates U5 snRNP association with the U4/U6 di-snRNP.
RHC and Sm proteins remain associated with U5 snRNA after splicing while other U5 and tri-snRNP-specific proteins dissociate48. In addition, PRPF19 and IBC/XAB2 complexes and their associated factors form together with U5 snRNP the post-splicing 35S particle31,32. Because TSSC4 interacts with all core components of the PRPF19 complex (Figs. 1–3) and a complex containing U5 snRNA and PRPF19 co-purifies with TSSC4 (Fig. 2), we conclude that TSSC4 associates with the post-splicing U5/PRPF19 particle. Interestingly, a single-nucleotide polymorphism in the PRPF19 complex-interacting Hom4 region of TSSC4 has been associated with lower human height37, suggesting that this phenotype might be due to a defect in U5 snRNP recycling. Our co-purification assays suggest several modes of interactions between TSSC4 and U5 snRNP subunits: (1) Hom2 and Hom3 interacts with U5 snRNP where Hom2 primarily mediates association with the mature U5 snRNP and Hom3 with PRPF8/EFTUD2. (2) With SNRNP200 where the interaction platform is still unclear but it likely involves multiple regions. The tethering assay (Fig. 4) shows that TSSC4 binding to PRPF8, SNRNP200, and EFTUP2 can occur already in the cytoplasm, which suggests that TSSC4 interaction with RHC components happens primarily during their biogenesis. In addition, TSSC4 nuclear localization and binding to mature U5 snRNP and the U5/PRPF19 complex points to a function of TSSC4 in later stages of U5 maturation and post-splicing recycling.
A particle containing U5 snRNA, Sm, and RHC proteins emerges as an important intermediate in U5 snRNP metabolism where de novo biogenesis and recycling pathways merge. In both cases, additional U5-specific proteins (PRPF6, DDX23, CD2BP2, and TXNL4A) need to be added before U5 snRNP can associate with the U4/U6 di-snRNP. We hypothesize that TSSC4 acts in this step to support an optimal U5 snRNP structure compatible with tri-snRNP formation. In yeast, Aar2p co-purifies with the mature U5 snRNP and it has been suggested that Aar2p functions in U5 snRNP recycling after splicing49. In humans, AAR2 is primarily cytoplasmic and co-precipitates only with a subset of U5 proteins and chaperones involved in RHC biogenesis12. TSSC4 might have evolved to replace the nuclear function of AAR2 in higher eukaryotes. Consistently, we did not identify TSSC4 homologs in unicellular eukaryotes and the first homolog was found in Nematostella vectensis (Supplementary Fig. 2c). Taking our data together, we propose that TSSC4 is a U5-specific chaperone that interacts with freshly assembled U5 snRNPs as well as with U5 particles recycled after splicing and promotes U5 snRNP interaction with the di-snRNP and formation of the splicing competent U4/U6•U5 tri-snRNP (Fig. 8c).
Methods
Cell culture
HeLa cells, stable cell lines derived from HeLa cells, and mouse NIH-3T3 cells were cultured in high-glucose (4.5 g/l) Dulbecco’s modified Eagle’s medium (DMEM, Sigma-Aldrich) supplemented with 10% fetal bovine serum (FBS, Gibco) and 1% penicillin/streptomycin (Gibco). HeLa cells stably expressing PRPF8-GFP and SNRNP200-GFP from bacterial artificial chromosomes were kindly provided by Ina Poser (MPI-CBG, Dresden, Germany)50 and characterized previously in ref. 47.
Plasmids and transfection
TSSC4 sequence was amplified from HeLa total RNA by reverse transcription followed by PCR using specific primers listed in Supplementary Table 2 and inserted into pEGFP-N1 (Clontech) and FLAG-C3 (pEGFP-C3 backbone where the GFP sequence was replaced with Triple FLAG) using BglII and SacII restriction sites. TSSC4 deletion mutants (Δ51–100, Δ101–150, Δ151–200, Δ201–250, Δ251–300, ΔHom2, ΔHom4, ΔHom2ΔHom3) and substitution mutants (Hom1-1_Ala, Hom1-2_Ala, Hom2-1_Ala, Hom2-2_Ala, Hom2-3_Ala, Hom4-1_Ala, Hom4-2_Ala, Hom4-3_Ala) were prepared by site-directed mutagenesis (for primer sequences see Supplementary Table 2). TSSC4 fragments (Hom2, Hom2Hom3, Hom4, Hom3Hom4) were amplified using specific primers (Supplementary Table 2) and inserted into pEGFP-N1 by XhoI/KpnI restriction sites. All fragments were designed with ATG as a first codon. SiRNA-resistant TSSC4 constructs were prepared by mutating siRNA target site CGGTGGTGCTGAAGTGGAA (nt 123–141) to TGGCGGCGCAGAGGTCGAG using site-directed mutagenesis (Supplementary Table 2), the original amino acid sequence was preserved in siRNA-resistant constructs.
EFTUD2 was cloned into pEGFP-C3 vector using BamHI/XhoI restriction sites, CD2BP2 into pEGFP-N1 using BamHI/EcoRI sites, and PRPF6 into pFLAG-CMV-4 using HindIII/EcoRI sites. SNRPD1-pEGFP-C1 plasmid was kindly provided by A. Lamond (University of Dundee, UK). For the P-body tethering assay, TSSC4 was cloned into DDX6-dsRed-pcDNA5 FRT plasmid using the Gateway system (Invitrogen). All constructs were verified by DNA sequencing.
Plasmids were transiently transfected into the cells with Lipofectamine 3000 Transfection Reagent (Thermo Fisher Scientific) according to the manufacturer’s instructions and cells analyzed 24 h post transfection.
RNAi
SiRNAs were transfected using the Oligofectamine Transfection Reagent (Thermo Fisher Scientific) according to the manufacturer’s protocol. A single transfection was performed at a final concentration of 50 nM siRNA, and the cells were then incubated for 72 h. Alternatively, cells were transfected twice with 20 nM siRNA (final concentration) after 48 h. Both approaches resulted in comparable knockdown efficiency. Pre-annealed siRNA duplexes were obtained from Ambion: TSSC4 (s19605) 5′-CGGUGGUGCUGAAGUGGAAtt-3′, PRPF8 5′-CCUGUAUGCCUGACCGUUUtt-3′7, and negative control no. 5 siRNA.
Antibodies
For indirect fluorescent immunostaining, the following primary antibodies were used: anti-coilin (H-300, Santa Cruz Biotechnology; used 1:500), anti-FLAG (M2, Sigma-Aldrich; used 1:500), anti-PRPF19 (G-7, Santa Cruz Biotechnology; used 1:250) and anti-GFP (B-2, Santa Cruz Biotechnology; used 1:500), together with secondary antibodies conjugated with Alexa-488, Alexa-555 or Alexa-647 (Thermo Fisher Scientific; all used 1:750). Immunoprecipitation was performed using antibodies against GFP (provided by D. Drechsel, MPI-CBG, Dresden, Germany; used 6 µg per sample) and FLAG (M2 or F7425, Sigma-Aldrich; used 2 µg per sample). Following primary antibodies were used for western blotting: anti-TSSC4 (Proteintech; used 1:1000), anti-PRPF8 (E-5, Santa Cruz Biotechnology; used 1:250), anti-SNRNP200 (HPA029321, Sigma-Aldrich; used 1:500), anti-EFTUD2 (provided by R. Lührmann, MPI, Göttingen, Germany; used 1:1000), anti-PRPF6 (B-1, Santa Cruz Biotechnology; used 1:500), anti-SNRNP40 (PA5-55331, Thermo Fisher Scientific; used 1:3000), anti-CD2BP2 (PA5-59603, Thermo Fisher Scientific; used 1:6000), anti-DDX23 (PA5-34843, Thermo Fisher Scientific; used 1:1000), anti-TXNL4A (B-11, Santa Cruz Biotechnology; used 1:500), anti-PRPF19 (G-7, Santa Cruz Biotechnology; used 1:500), anti-CDC5L (D-11, Santa Cruz Biotechnology; used 1:500), anti-PLRG1 (E-12, Santa Cruz Biotechnology; used 1:100), anti-BCAS2 (F-5, Santa Cruz Biotechnology; used 1:250), AAR2 (SAB1104418, Sigma-Aldrich; used 1:500), anti-PRPF31 (ab188577, Abcam; used 1:4000), anti-SART3 (ab176822, Abcam; used 1:3000), anti-PRPF4 (provided by R. Lührmann, MPI, Göttingen, Germany; used 1:500), anti-SNRPA1 (ab128937, Abcam; used 1:4000), anti-GFP (B-2, Santa Cruz Biotechnology; used 1:500), anti-FLAG (M2, Sigma-Aldrich; used 1:1000), anti-FLAG (F7425, Sigma-Aldrich; used 1:400), anti-GAPDH (ab9484, Abcam; used 1:1000) and anti-Sm (Y12) produced from a hybridoma cell line (a gift from K. Neugebauer, Yale University, New Haven, USA) at the Antibody facility (Institute of Molecular Genetics of the Czech Academy of Sciences; used 1:100). Secondary antibodies conjugated with horseradish peroxidase (Jackson ImmunoResearch Laboratories) were used (1:10,000).
Immunofluorescence, fluorescence in situ hybridization, and image acquisition
Cells grown on coverslips were fixed with 4% paraformaldehyde/PIPES for 10 min at room temperature (RT), permeabilized with 0.2 or 0.5% Triton X-100 for 10 or 5 min, respectively, at RT and stained with primary antibodies diluted in phosphate buffered saline (PBS) for 1 h. After a PBS wash, the coverslips were stained with secondary antibodies for 1 h and then either mounted or further processed to achieve snRNA visualization. snRNAs were visualized by in situ hybridization using fluorescent DNA probes tagged on their 5′ end by Alexa-488 or Cy3. Probe sequences are listed in Supplementary Table 2. Cells were fixed again in 4% PFA/PIPES for 10 min and treated sequentially with 0.1 M glycine/0.2 M Tris-HCl (pH 7.4) for 10 min and 50% formamide/2× SSC for at least 10 min. The probe at a final concentration of 2 µM was hybridized in 50% formamide/10% dextran sulfate/1% BSA/2× SSC for 1 h at 37 °C. The final washing steps were performed as follows: 50% formamide/2x SSC for 20 min at 37 °C, 2× SSC for 20 min at 37 °C, and 1× SSC for 20 min at RT. The in situ hybridization protocol substantially quenched GFP fluorescence. We, therefore, enhanced GFP visualization by indirect immunostaining with anti-GFP antibodies. All coverslips were mounted in Fluoromount-G (Southern Biotech) containing 4′,6-diamidino-2-phenylindole (DAPI).
Images were acquired using DeltaVision microscope system (Applied Precision Ltd.) coupled to the Olympus IX70 microscope equipped with a 60×/1.42 NA oil immersion objective, a CoolSNAP HQ2 camera (Photometrics; Princeton Instruments), and the acquisition software SoftWoRx (Applied Precision Ltd.). Stacks of 10–20 z sections with 200-nm z steps were taken per sample and subjected to mathematical deconvolution using SoftWoRx software. Maximum intensity projections of deconvolved pictures are presented. Fluorescent signal in Cajal bodies (CBs), P-bodies, and nucleoplasm was analyzed using Fiji software51. snRNA and protein accumulation in CB was calculated as [(mean intensity in CB)/(mean intensity in nucleoplasm)]. Protein accumulation in P-bodies was calculated as [((mean intensity in P-body) – (mean intensity of background))/(mean intensity in nucleoplasm)]. Between 40 and 100 cells were analyzed for CB accumulation (cell numbers per sample are specified in individual figures) and 50 P-bodies were analyzed for P-body tethering assay.
Heterokaryon assay
HeLa cells were co-transfected with TSSC4-GFP and empty pmCherry-C3 plasmids. Alternatively, HeLa stably expressing PRPF8-GFP were transfected with pmCherry-C3 only. All HeLa were passaged 24 h post transfection, and 7 × 105 cells were seeded on coverslips together with 7 × 105 mouse NIH-3T3 cells and grown overnight.
The protocol for heterokaryon preparation was adapted from ref. 52. Cells were treated with 50 µg/ml cycloheximide (CHX) or 0.5 µg/ml puromycin (Puro) for 2 h and then with 100 µg/ml CHX or 1 µg/ml Puro for 30 min to block protein synthesis. After a PBS wash, cells were fused by incubation in 50% PEG 4000 (Thermo Scientific) for 2 min. Fused cells were washed with PBS, incubated in DMEM supplemented with 100 µg/ml CHX or 1 µg/ml Puro for 3 h, and fixed with 4% paraformaldehyde/PIPES. Coverslips were mounted in Fluoromount-G medium containing DAPI and imaged as described above. The presence of mCherry protein in mouse NIH-3T3 nuclei served for the identification of heterokaryons.
Immunoprecipitation
Cells grown on a 10-cm Petri dish were harvested in ice-cold PBS, resuspended in NET2 buffer (50 mM Tris-HCl, pH 7.4, 150 mM NaCl, 0.05% Nonidet P-40) supplemented with Protease Inhibitor Cocktail Set III (Millipore) and RNasin ribonuclease inhibitor (Promega), and sonicated on ice bath by 2 × 45 pulses (0.5 s at 40% of maximum energy). Cell lysates were centrifuged at 20,000×g, 4 °C for 10 min, and then the supernatants were pre-cleaned by incubation with Protein G PLUS agarose beads (Santa Cruz Biotechnology) for 1 h at 4 °C and finally incubated overnight at 4 °C with anti-GFP or anti-FLAG antibodies pre-bound to agarose beads. Three percent of the lysate was put aside before the immunoprecipitation step and used as an input sample.
After the immunoprecipitation, beads were washed with NET2 buffer and proteins and/or RNA isolated. For protein analysis, the beads were directly mixed with SDS sample buffer (0.25 M Tris-HCl pH 6.8, 20% glycerol, 4% SDS, 2% β-mercaptoethanol, 0.02% bromphenol blue), and proteins resolved on a 10% polyacrylamide gel and detected by western blotting. RNA was extracted using phenol/chloroform, dissolved in urea sample buffer (20 mM Tris-HCl pH 8.0, 8 M urea, 0.2% xylene blue), and then resolved on a 7 M urea denaturing polyacrylamide gel and silver stained.
Northern blot
RNA was resolved on 7 M urea denaturing polyacrylamide gel and transferred by capillarity to cationized Zeta-Probe nylon membrane (Bio-Rad). The transfer was performed for 16 h in 20× SSC buffer and RNA was then UV cross-linked to the membrane by 120 mJ/cm2. Hybridization with a radioactively labeled probe was done according to ref. 53. The DNA oligonucleotide probe complementary to U5 snRNA (Supplementary Table 2 and ref. 53) was terminally labeled with 1 μl of [γ-32P]ATP (3000 Ci/mmol, 10 mCi/ml) using T4 polynucleotide kinase, purified on Micro-Spin G25 columns (GE Healthcare) and denatured at 95 °C for 5 min. For pre-hybridization, the membrane was incubated in Church buffer (0.25 M sodium phosphate buffer pH 7.2, 1 mM EDTA, 1% BSA, 7% SDS) for 2 h at 65 °C. The probe was then added to the Church buffer and hybridized on the membrane at 65 °C overnight. After washing with Wash buffer-1 (2× SSC, 0.1% SDS) for 2× 10 min at 50 °C and Wash buffer-2 (0.2× SSC, 0.1% SDS) for 2× 10 min at 50 °C, the membrane was exposed to Storage Phosphor Screen and scanned on Amersham Typhoon 9500 (GE Healthcare).
Glycerol-gradient ultracentrifugation
TSSC4-FLAG co-precipitated complexes were eluted from the beads by incubation with 2 × 25 µg of triple FLAG peptide (Thermo Fisher Scientific) for 8 h and overnight at 4 °C. Nuclear extracts were prepared using NE-PER nuclear and cytoplasmic extraction reagents (Thermo Fisher Scientific) according to the manufacturer’s instructions. Samples were diluted in gradient buffer (20 mM HEPES/KOH pH 8.0, 150 mM KCl, 1.5 mM MgCl2) supplemented with Protease Inhibitor Cocktail Set III (Millipore), 0.5 mM PMSF, and 0.5 mM DTT and loaded on a linear 10–30% glycerol gradient. Complexes were fractionated by centrifugation at 130,000×g (32,000 rpm) using SW-40 rotor (Beckman Coulter) for 17 h at 4 °C. Individual fractions (500 µl each, 24 fractions in total) were collected and from every fraction, RNA and proteins were isolated using TRIzol Reagent (Invitrogen) according to the manufacturer’s protocol.
SILAC labeling and proteomic analysis
For SILAC experiments, HeLa Flp-In clones expressing GFP and FLAG-tagged TSSC4 were grown for 15 days in isotopically labeled media (CIL/Eurisotop), to ensure complete incorporation of semi-heavy label l-lysine-2HCl (2H4, 96–98%)/l-arginine-HCl (13C6, 99%) (percentages represent the isotopic purity of the labeled amino acids). Parental HeLa cells that did not express the tagged protein were used in control condition and were grown in a regular culture medium containing light (non-labeled) l-lysine-HCl/l-arginine-HCl. For each condition, 100 × 106 cells were trypsinized and cryogrinded before being resuspended in the lysis buffer (20 mM HEPES, pH 7.9, 100 mM KCl, 2 mM EDTA, 10% glycerol, 0.1% NP-40, 0.5% Triton X-100, protease inhibitor cocktail (cOmplete, Roche)). Extracts were then incubated for 20 min at 4 °C, clarified by centrifugation for 10 min at 20,000×g, and finally pre-cleared by incubation with Protein G Sepharose beads (GE Healthcare) for 1 h at 4 °C. For SILAC-IP done with GFP tag, extracts were incubated with 50 µL of GFP-Trap beads (Chromotek) while for SILAC-IP with FLAG tag, agarose beads coated with M2 antibodies were used (Sigma-Aldrich). After incubating 1.5 h at 4 °C, beads were washed five times with lysis buffer, and the two isotopic conditions (light for control IP and medium for IP with tagged TSSC4) were finally pooled. Bound proteins were eluted by adding 1% SDS to the beads and boiling for 10 min. The eluates were reduced with DTT (10 mM, 2 min, 95 °C) and alkylated with iodoacetamide (50 mM, 30 min). Proteins were separated by SDS/PAGE and stained with colloidal blue. Each lane was cut in eight bands, and each band was gel-digested with trypsin (Promega V5280) in 100 mM triethylammonium bicarbonate. Digestion peptides were dried and resuspended in 0.1% formic acid/2% acetonitrile solution before being analyzed on a LTQ Velos Pro Orbitrap Elite mass spectrometer coupled to an Ultimate 3000 (Thermo Fisher Scientific) nanoflow chromatography. Desalting and pre-concentration of samples were performed on-line on a Pepmap precolumn (0.3 × 10 mm, Thermo Fisher Scientific) in buffer A (2% acetonitrile, 0.1% formic acid). A gradient consisting of 2–40% buffer B (B = 99.9% acetonitrile with 0.1% formic acid; 3–33 min) and 40–80% buffer B (33–34 min) was used to separate peptides at 300 nL/min from a Pepmap capillary reversed-phase column (0.075 × 150 mm, Thermo Fisher Scientific). Mass spectra were acquired using a top 20 collision-induced dissociation data-dependent acquisition method. The Orbitrap was programmed to perform a FT 400–1400 Th mass scan (60,000 resolution) with the top 20 ions in intensity selected for collision-induced dissociation data-dependent acquisition MS/MS in the LTQ. FT spectra were internally calibrated using a single lock mass (445.1200 Th). Target ion numbers were 500,000 for FT full scan on the Orbitrap and 10,000 MSn on the LTQ. Data were acquired using the Xcalibur software v2.2. Protein identification and quantitation were performed using the MaxQuant software v1.5.2.8 (http://www.maxquant.org/)54. Database for identification was Human Reference Proteome set (reviewed + isoforms) downloaded on May 25, 2017 from Uniprot.org. Proteins were identified with at least 2 matching peptides including at least one unique or Razor peptide (FDR < 1%, decoy approach). Significance calculations were performed with the Perseus v1.4.2 software.
Splicing efficiency analysis
Cells were harvested in ice-cold PBS, resuspended in NET2 buffer supplemented with Protease Inhibitor Cocktail Set III and RNasin ribonuclease inhibitor, and cell lysates were prepared as described above. Eight percent of the lysate was put aside, mixed with SDS sample buffer, and used for knockdown efficiency control by western blot. Total RNA was isolated from the rest of the cell lysate using TRIzol Reagent (Invitrogen) according to the manufacturer’s protocol. The RNA pellet was resuspended in nuclease-free water, treated with Turbo DNase (Thermo Fisher Scientific), and again precipitated. Reverse transcription was performed with SuperScript III (Thermo Fisher Scientific) and random hexamers, and 1 or 5 µg of the total RNA was used per reaction.
cDNA was diluted 1:10 and analyzed by quantitative PCR using LightCycler 480 (Roche) and SYBR Green I reaction mix (Roche). Primer sequences are provided in Supplementary Table 2. Splicing efficiency (ratio pre-mRNA/mRNA) was calculated as [2Ct(mRNA) – Ct(pre-mRNA)]. For every experiment, two samples with different amounts of total RNA used (1 and 5 µg) were analyzed and always the average of both values is presented.
Yeast two-hybrid assay
Plasmids pACTII and pAS2ΔΔ were introduced into haploid Saccharomyces cerevisiae strains (Y187 and CG1945, respectively). Strains were crossed and grown on YEPD and then plated on double- (–Leu–Trp) and triple-selective media (–Leu–Trp–His). Growth was assessed visually after 3 days at 30 °C. The strength of interactions was evaluated by comparing the number of clones growing on –Leu–Trp (selection of diploids) and –Leu–Trp–His plates (selection for interaction). Growth was sometimes performed with a gradual concentration of 3-amino-1,2,4-Triazol (3-AT) which is a competitive inhibitor of the HIS3 gene product.
Supplementary information
Acknowledgements
We thank David Drechsler, Ina Poser, and Angus Lamond for providing us with reagents. This work was supported by the Czech Science Foundation (18-10035S and 21-04132S), the Agence Nationale de la Recherche [ANR-16-CE11-0032-04] and Institut national du Cancer [INCa PLBio 2016-161], the institutional funding (RVO68378050 and RVO68378050-KAV-NPUI), and the Ligue Nationale Contre le Cancer. The microscopy images were acquired at the Light Microscopy Core Facility, Institute of Molecular Genetics in Prague, Czech Republic supported by MEYS (LM2015062, CZ.02.1.01/0.0/0.0/16_013/0001775) and OPPK (CZ.2.16/3.1.00/21547). Mass spectrometry experiments were carried out using facilities of the Functional Proteomic Platform of Montpellier (FPP). Klara Klimesova, Jitka Vojackova, and Nenad Radivojevic were students of Faculty of Science, Charles University in Prague.
Source data
Author contributions
K.K. performed all the experiments except SILAC-IP shown (Fig. 1a), snRNA, and proteins localization after TSSC4 knockdown (Figs. 5–6) and glycerol gradient after TSSC4 knockdown (Fig. 7b). J.V. made the initial observation regarding the TSSC4 function and contributed by snRNA and protein localization after TSSC4 knockdown (Figs. 5–6), glycerol-gradient ultracentrifugation after TSSC4 knockdown (Fig. 7b) and immunoprecipitation of TSSC4 shown at (Supplementary Figs. 1a and 2b). N.R. created selected TSSC4-GFP constructs (Fig. 3f) and performed initial IPs with them. F.V., E.B., and C.V. performed and analyzed TSSC4 SILAC-IP (Fig. 1a) and C.V. performed yeast two-hybrid assay (Supplementary Table 1). D.S. conceived the project and wrote the manuscript.
Data availability
The data that support this study are available from the corresponding author upon reasonable request. The mass spectrometry proteomics data generated in this study have been deposited to the ProteomeXchange Consortium via PRIDE55 and are available with the identifier PXD024929. For TSSC4 sequence alignment, the following data were obtained from Uniprot: TSSC4 Homo sapiens—Q9Y5U2; TSSC4 Bos taurus—Q1LZD3; TSSC4 Mus musculus—Q9JHE7; TSSC4 Gallus gallus—Q5ZJS5; TSSC4 Danio rerio—Q0P4A6; TSSC4 Xenopus laevis—A0A1L8GD85; TSSC4 Drosophila melanogaster—Q9VT74; TSSC4 Nematostella vectensis—A7RVI2. The database used for identification of proteins after SILAC-IP was Human Reference Proteome set (reviewed + isoforms) downloaded from Uniprot.org (proteome ID UP000005640). The original microscopy images can be provided upon request. Source data are provided with this paper.
Competing interests
The authors declare no competing interests.
Footnotes
Peer review information Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
These authors contributed equally: Klára Klimešová, Jitka Vojáčková.
Supplementary information
The online version contains supplementary material available at 10.1038/s41467-021-23934-y.
References
- 1.Kastner, B., Will, C. L., Stark, H. & Luhrmann, R. Structural insights into nuclear pre-mRNA splicing in higher eukaryotes. Cold Spring Harb Perspect. Biol. 11, a032417 (2019). [DOI] [PMC free article] [PubMed]
- 2.Tycowski, K. T., Kolev, N. G., Conrad, N. K., Fok, V. & Steitz, J. A. The ever-growing world of small nuclear ribonucleoproteins. in The RNA world (eds Gesteland, R. F., Cech, T. R. & Atkins, J. F.) 327–368 (Cold Spring Harbor Laboratory Press, 2006).
- 3.Neuenkirchen N, et al. Reconstitution of the human U snRNP assembly machinery reveals stepwise Sm protein organization. EMBO J. 2015;34:1925–1941. doi: 10.15252/embj.201490350. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Chari A, et al. An assembly chaperone collaborates with the SMN complex to generate spliceosomal snRNPs. Cell. 2008;135:497–509. doi: 10.1016/j.cell.2008.09.020. [DOI] [PubMed] [Google Scholar]
- 5.Grimm C, et al. Structural basis of assembly chaperone-mediated snRNP formation. Mol. Cell. 2013;49:692–703. doi: 10.1016/j.molcel.2012.12.009. [DOI] [PubMed] [Google Scholar]
- 6.Pellizzoni L, Yong J, Dreyfuss G. Essential role for the SMN complex in the specificity of snRNP assembly. Science. 2002;298:1775–1779. doi: 10.1126/science.1074962. [DOI] [PubMed] [Google Scholar]
- 7.Novotny I, et al. SART3-dependent accumulation of incomplete spliceosomal snRNPs in Cajal bodies. Cell Rep. 2015;10:429–440. doi: 10.1016/j.celrep.2014.12.030. [DOI] [PubMed] [Google Scholar]
- 8.Achsel T, Ahrens K, Brahms H, Teigelkamp S, Luhrmann R. The human U5-220kD protein (hPrp8) forms a stable RNA-free complex with several U5-specific proteins, including an RNA unwindase, a homologue of ribosomal elongation factor EF-2, and a novel WD-40 protein. Mol. Cell Biol. 1998;18:6756–6766. doi: 10.1128/MCB.18.11.6756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Weber G, et al. Mechanism for Aar2p function as a U5 snRNP assembly factor. Genes Dev. 2011;25:1601–1612. doi: 10.1101/gad.635911. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Weber G, et al. Structural basis for dual roles of Aar2p in U5 snRNP assembly. Genes Dev. 2013;27:525–540. doi: 10.1101/gad.213207.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Santos K, Preussner M, Heroven AC, Weber G. Crystallization and biochemical characterization of the human spliceosomal Aar2-Prp8RNaseH complex. Acta Crystallogr. Section: F Struct. Biol. Commun. 2015;71:1421–1428. doi: 10.1107/S2053230X15019202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Malinova A, et al. Assembly of the U5 snRNP component PRPF8 is controlled by the HSP90/R2TP chaperones. J. Cell Biol. 2017;216:1579–1596. doi: 10.1083/jcb.201701165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Boon K-L, et al. prp8 mutations that cause human retinitis pigmentosa lead to a U5 snRNP maturation defect in yeast. Nat. Struct. Mol. Biol. 2007;14:1077–1083. doi: 10.1038/nsmb1303. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Nguyen TH, et al. Structural basis of Brr2-Prp8 interactions and implications for U5 snRNP biogenesis and the spliceosome active site. Structure. 2013;21:910–919. doi: 10.1016/j.str.2013.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cloutier P, et al. R2TP/Prefoldin-like component RUVBL1/RUVBL2 directly interacts with ZNHIT2 to regulate assembly of U5 small nuclear ribonucleoprotein. Nat. Commun. 2017;8:15615. doi: 10.1038/ncomms15615. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Horejsi Z, et al. Phosphorylation-dependent PIH1D1 interactions define substrate specificity of the R2TP cochaperone complex. Cell Rep. 2014;7:19–26. doi: 10.1016/j.celrep.2014.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Erkelenz, S. et al. Ecd promotes U5 snRNP maturation and Prp8 stability. Nucleic Acids Res.49, 1688–1707 (2021). [DOI] [PMC free article] [PubMed]
- 18.Laggerbauer B, et al. The human U5 snRNP 52K protein (CD2BP2) interacts with U5-102K (hPrp6), a U4/U6.U5 tri-snRNP bridging protein, but dissociates upon tri-snRNP formation. RNA. 2005;11:598–608. doi: 10.1261/rna.2300805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Liu S, Rauhut R, Vornlocher HP, Luhrmann R. The network of protein-protein interactions within the human U4/U6.U5 tri-snRNP. RNA. 2006;12:1418–1430. doi: 10.1261/rna.55406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Nielsen TK, Liu S, Luhrmann R, Ficner R. Structural basis for the bifunctionality of the U5 snRNP 52K protein (CD2BP2) J. Mol. Biol. 2007;369:902–908. doi: 10.1016/j.jmb.2007.03.077. [DOI] [PubMed] [Google Scholar]
- 21.Fetzer S, Lauber J, Will CL, Luhrmann R. The [U4/U6.U5] tri-snRNP-specific 27K protein is a novel SR protein that can be phosphorylated by the snRNP-associated protein kinase. Rna. 1997;3:344–355. [PMC free article] [PubMed] [Google Scholar]
- 22.Makarova OV, Makarov EM, Luhrmann R. The 65 and 110 kDa SR-related proteins of the U4/U6.U5 tri-snRNP are essential for the assembly of mature spliceosomes. EMBO J. 2001;20:2553–2563. doi: 10.1093/emboj/20.10.2553. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Makarov EM, Makarova OV, Achsel T, Luhrmann R. The human homologue of the yeast splicing factor prp6p contains multiple TPR elements and is stably associated with the U5 snRNP via protein-protein interactions. J. Mol. Biol. 2000;298:567–575. doi: 10.1006/jmbi.2000.3685. [DOI] [PubMed] [Google Scholar]
- 24.Schaffert N, Hossbach M, Heintzmann R, Achsel T, Luhrmann R. RNAi knockdown of hPrp31 leads to an accumulation of U4/U6 di-snRNPs in Cajal bodies. EMBO J. 2004;23:3000–3009. doi: 10.1038/sj.emboj.7600296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Agafonov DE, et al. Molecular architecture of the human U4/U6.U5 tri-snRNP. Science. 2016;351:1416–1420. doi: 10.1126/science.aad2085. [DOI] [PubMed] [Google Scholar]
- 26.Novotny I, Blazikova M, Stanek D, Herman P, Malinsky J. In vivo kinetics of U4/U6.U5 tri-snRNP formation in Cajal bodies. Mol. Biol. Cell. 2011;22:513–523. doi: 10.1091/mbc.e10-07-0560. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Company M, Arenas J, Abelson J. Requirement of the RNA helicase-like protein PRP22 for release of messenger RNA from spliceosomes. Nature. 1991;349:487–493. doi: 10.1038/349487a0. [DOI] [PubMed] [Google Scholar]
- 28.Zhang X, et al. Structures of the human spliceosomes before and after release of the ligated exon. Cell Res. 2019;29:274–285. doi: 10.1038/s41422-019-0143-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Arenas JE, Abelson JN. Prp43: An RNA helicase-like factor involved in spliceosome disassembly. Proc. Natl Acad. Sci. USA. 1997;94:11798–11802. doi: 10.1073/pnas.94.22.11798. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Wan R, Bai R, Shi Y. Molecular choreography of pre-mRNA splicing by the spliceosome. Curr. Opin. Struct. Biol. 2019;59:124–133. doi: 10.1016/j.sbi.2019.07.010. [DOI] [PubMed] [Google Scholar]
- 31.Makarov EM, et al. Small nuclear ribonucleoprotein remodeling during catalytic activation of the spliceosome. Science. 2002;298:2205–2208. doi: 10.1126/science.1077783. [DOI] [PubMed] [Google Scholar]
- 32.Makarova OV, Makarov EM. The 35S U5 snRNP is generated from the activated spliceosome during in vitro splicing. PLoS ONE. 2015;10:e0128430. doi: 10.1371/journal.pone.0128430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Claudius AK, Romani P, Lamkemeyer T, Jindra M, Uhlirova M. Unexpected role of the steroid-deficiency protein ecdysoneless in pre-mRNA splicing. PLoS Genet. 2014;10:e1004287. doi: 10.1371/journal.pgen.1004287. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Mir RA, et al. A novel interaction of ecdysoneless (ECD) protein with R2TP complex component RUVBL1 is required for the functional role of ECD in cell cycle progression. Mol. Cell Biol. 2015;36:886–899. doi: 10.1128/MCB.00594-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Huttlin EL, et al. Architecture of the human interactome defines protein communities and disease networks. Nature. 2017;545:505–509. doi: 10.1038/nature22366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Lee MP, et al. Two novel genes in the center of the 11p15 imprinted domain escape genomic imprinting. Hum. Mol. Genet. 1999;8:683–690. doi: 10.1093/hmg/8.4.683. [DOI] [PubMed] [Google Scholar]
- 37.Kim JJ, et al. Exome sequencing and subsequent association studies identify five amino acid-altering variants influencing human height. Hum. Genet. 2012;131:471–478. doi: 10.1007/s00439-011-1096-4. [DOI] [PubMed] [Google Scholar]
- 38.Ong SE, et al. Stable isotope labeling by amino acids in cell culture, SILAC, as a simple and accurate approach to expression proteomics. Mol. Cell. Proteom. 2002;1:376–386. doi: 10.1074/mcp.M200025-MCP200. [DOI] [PubMed] [Google Scholar]
- 39.Trinkle-Mulcahy L, et al. Identifying specific protein interaction partners using quantitative mass spectrometry and bead proteomes. J. Cell Biol. 2008;183:223–239. doi: 10.1083/jcb.200805092. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Notredame C, Higgins DG, Heringa J. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 2000;302:205–217. doi: 10.1006/jmbi.2000.4042. [DOI] [PubMed] [Google Scholar]
- 41.Paulsen M, et al. Sequence conservation and variability of imprinting in the Beckwith-Wiedemann syndrome gene cluster in human and mouse. Hum. Mol. Genet. 2000;9:1829–1841. doi: 10.1093/hmg/9.12.1829. [DOI] [PubMed] [Google Scholar]
- 42.Kozlowski LP, Bujnicki JM. MetaDisorder: a meta-server for the prediction of intrinsic disorder in proteins. BMC Bioinforma. 2012;13:111. doi: 10.1186/1471-2105-13-111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Nielsen JT, Mulder FAA. Quality and bias of protein disorder predictors. Sci. Rep. 2019;9:5137. doi: 10.1038/s41598-019-41644-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Boulon S, et al. HSP90 and its R2TP/Prefoldin-like cochaperone are involved in the cytoplasmic assembly of RNA polymerase II. Mol. Cell. 2010;39:912–924. doi: 10.1016/j.molcel.2010.08.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Stanek D, et al. Spliceosomal small nuclear ribonucleoprotein particles repeatedly cycle through Cajal bodies. Mol. Biol. Cell. 2008;19:2534–2543. doi: 10.1091/mbc.e07-12-1259. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Makarova OV, Makarov EM, Liu S, Vornlocher HP, Luhrmann R. Protein 61K, encoded by a gene (PRPF31) linked to autosomal dominant retinitis pigmentosa, is required for U4/U6center dotU5 tri-snRNP formation and pre-mRNA splicing. EMBO J. 2002;21:1148–1157. doi: 10.1093/emboj/21.5.1148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Huranova M, et al. The differential interaction of snRNPs with pre-mRNA reveals splicing kinetics in living cells. J. Cell Biol. 2010;191:75–86. doi: 10.1083/jcb.201004030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Agafonov DE, et al. Semiquantitative proteomic analysis of the human spliceosome via a novel two-dimensional gel electrophoresis method. Mol. Cell Biol. 2011;31:2667–2682. doi: 10.1128/MCB.05266-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Gottschalk A, Kastner B, Luhrmann R, Fabrizio P. The yeast U5 snRNP coisolated with the U1 snRNP has an unexpected protein composition and includes the splicing factor Aar2p. RNA. 2001;7:1554–1565. [PMC free article] [PubMed] [Google Scholar]
- 50.Poser I, et al. BAC TransgeneOmics: a high-throughput method for exploration of protein function in mammals. Nat. Methods. 2008;5:409–415. doi: 10.1038/nmeth.1199. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Schindelin J, et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods. 2012;9:676–682. doi: 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Botti V, et al. Cellular differentiation state modulates the mRNA export activity of SR proteins. J. Cell Biol. 2017;216:1993–2009. doi: 10.1083/jcb.201610051. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Pandey RR, et al. The mammalian cap-specific m(6)Am RNA methyltransferase PCIF1 regulates transcript levels in mouse tissues. Cell Rep. 2020;32:108038. doi: 10.1016/j.celrep.2020.108038. [DOI] [PubMed] [Google Scholar]
- 54.Maurizy C, et al. The RPAP3-Cterminal domain identifies R2TP-like quaternary chaperones. Nat. Commun. 2018;9:2093. doi: 10.1038/s41467-018-04431-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Perez-Riverol Y, et al. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res. 2019;47:D442–D450. doi: 10.1093/nar/gky1106. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The data that support this study are available from the corresponding author upon reasonable request. The mass spectrometry proteomics data generated in this study have been deposited to the ProteomeXchange Consortium via PRIDE55 and are available with the identifier PXD024929. For TSSC4 sequence alignment, the following data were obtained from Uniprot: TSSC4 Homo sapiens—Q9Y5U2; TSSC4 Bos taurus—Q1LZD3; TSSC4 Mus musculus—Q9JHE7; TSSC4 Gallus gallus—Q5ZJS5; TSSC4 Danio rerio—Q0P4A6; TSSC4 Xenopus laevis—A0A1L8GD85; TSSC4 Drosophila melanogaster—Q9VT74; TSSC4 Nematostella vectensis—A7RVI2. The database used for identification of proteins after SILAC-IP was Human Reference Proteome set (reviewed + isoforms) downloaded from Uniprot.org (proteome ID UP000005640). The original microscopy images can be provided upon request. Source data are provided with this paper.