Skip to main content
eLife logoLink to eLife
. 2016 Nov 29;5:e14707. doi: 10.7554/eLife.14707

Structural basis for the recognition of spliceosomal SmN/B/B’ proteins by the RBM5 OCRE domain in splicing regulation

André Mourão 1,2,, Sophie Bonnal 3,, Komal Soni 1,2,, Lisa Warner 1,2,, Rémy Bordonné 4, Juan Valcárcel 3,5,, Michael Sattler 1,2,
Editor: Douglas L Black6
PMCID: PMC5127646  PMID: 27894420

Abstract

The multi-domain splicing factor RBM5 regulates the balance between antagonistic isoforms of the apoptosis-control genes FAS/CD95, Caspase-2 and AID. An OCRE (OCtamer REpeat of aromatic residues) domain found in RBM5 is important for alternative splicing regulation and mediates interactions with components of the U4/U6.U5 tri-snRNP. We show that the RBM5 OCRE domain adopts a unique β–sheet fold. NMR and biochemical experiments demonstrate that the OCRE domain directly binds to the proline-rich C-terminal tail of the essential snRNP core proteins SmN/B/B’. The NMR structure of an OCRE-SmN peptide complex reveals a specific recognition of poly-proline helical motifs in SmN/B/B’. Mutation of conserved aromatic residues impairs binding to the Sm proteins in vitro and compromises RBM5-mediated alternative splicing regulation of FAS/CD95. Thus, RBM5 OCRE represents a poly-proline recognition domain that mediates critical interactions with the C-terminal tail of the spliceosomal SmN/B/B’ proteins in FAS/CD95 alternative splicing regulation.

Research organism: Human

eLife digest

The information required to produce proteins is encoded within genes. In the first step of creating a protein, its gene is “transcribed” to form a pre-messenger RNA molecule (called pre-mRNA for short). Both the gene and the pre-mRNA contain regions called exons that code for protein, and regions called introns that do not. The pre-mRNA therefore undergoes a process called splicing to remove the introns and join the exons together into a final mRNA molecule that is “translated” to make the protein.

Many pre-mRNAs can be spliced in several different ways to include different combinations of exons in the final mRNA molecule. This process of “alternative splicing” allows different versions of a protein to be produced from the same gene. Changes that alter the pattern of alternative splicing in a cell affect various cellular and developmental processes and have been linked to diseases such as cancer.

The pre-mRNA transcribed from a gene called FAS can be alternatively spliced so that it either does or does not contain an exon that enables the protein to embed itself in the cell membrane. The protein produced from mRNA that includes this exon generates a cell response that leads to cell death. By contrast, protein produced from mRNA that lacks this exon is released from cells and promotes their survival. A splicing factor called RBM5 promotes the removal of this exon from FAS pre-mRNA.

RBM5 binds to some of the proteins that make up the molecular machine that splices pre-mRNA molecules. Mourão, Bonnal, Soni, Warner et al. have now used a technique called nuclear magnetic resonance spectroscopy to solve the three-dimensional structure formed when RBM5 binds to one of these proteins, called SmN. Further experiments introduced specific mutations to the proteins to investigate their effects in human cells. This revealed that mutations that impaired the association between RBM5 and SmN compromised the activity of RBM5 to regulate the alternative splicing of FAS pre-mRNA molecules.

Future research could examine how RBM5 associates with pre-mRNAs and other components of the splicing machinery, and investigate whether proteins that are closely related to RBM5 act in similar ways.

Introduction

An essential step during the regulation of eukaryotic gene expression is the removal of non-coding intron sequences from pre-mRNA transcripts through the process of pre-mRNA splicing. The catalytic steps of pre-mRNA splicing are carried out by the spliceosome, a large and dynamic assembly of five small nuclear ribonucleoprotein (snRNP) complexes and more than 150 additional splicing factor proteins (Wahl et al., 2009). Many splicing factors are involved in early steps of the assembly of the spliceosome through the recognition of short regulatory RNA motifs and/or through protein-protein interactions. Alternative splicing is the mechanism by which particular intronic or exonic regions are included or excluded to produce diverse mRNAs from the same gene (Blencowe, 2006). It is thought that more than 90% of human multi-exon genes undergo alternative splicing (Pan et al., 2008; Wang et al., 2008). The genomic diversity of eukaryotic gene expression is thus greatly expanded by alternative splicing of mRNA transcripts. Often, the protein products of alternative splicing have antagonistic roles in cellular functions and are implicated in human diseases (Cooper et al., 2009). Notably, mutations in splicing factors that modulate alternative splicing decisions have been implicated in cancer (David and Manley, 2010; Bonnal et al., 2012).

A biologically important example of alternative splicing is found in the FAS gene (also known as CD95 or APO-1). The FAS gene encodes a transmembrane signaling protein that stimulates a pro-apoptotic signaling cascade upon binding of the FAS ligand at the cell surface (Krammer, 2000). Alternatively spliced FAS transcripts that exclude exon 6 encode a soluble Fas isoform that lacks the transmembrane domain. This soluble isoform can be secreted outside of the cell where it sequesters the FAS ligand and inhibits downstream activation of apoptosis (Cheng et al., 1994; Cascino et al., 1995). Thus, regulation of the alternative splicing of FAS can either stimulate or inhibit cell survival. The pro-apoptotic Fas protein plays an important role during T-lymphocyte maturation (Liu et al., 1995; Papoff et al., 1996; Van Parijs et al., 1998; Roesler et al., 2005) and additional evidence implicates this isoform in the proliferation of cancer cells (Chen et al., 2010).

A number of splicing factors have been shown to modulate FAS alternative splicing, including RBM5. The multi-domain RNA-binding protein 5 (RBM5) regulates FAS splicing by promoting skipping of exon 6 (Bonnal et al., 2008). RBM5 is a 92 kDa, multi-domain protein with an arginine-serine (RS)-rich region, two RNA Recognition Motifs (RRM1 and RRM2), two Zinc-Finger domains (ZF1 and ZF2), a C-terminal OCtamer REpeat (OCRE) domain (Callebaut and Mornon, 2005) in addition to a glycine patch, and KEKE (lysine/glutamate) repeats (Figure 1). It belongs to the family of RNA Binding Motif (RBM) proteins, including RBM6 and RBM10, which share a similar domain organization with RBM5 and have 30% and 50% amino acid identity, respectively (Sutherland et al., 2005). The RBM5 (also known as H37 and LUCA-15) and RBM6 genes are located in a chromosomal region 3p21.3, which is frequently deleted in heavy smokers and lung cancers (Oh et al., 2002; Zabarovsky et al., 2002). RBM5 is known to regulate the alternative splicing of apoptosis–related genes, such as FAS and Caspase-2 (Bonnal et al., 2008; Fushimi et al., 2008). It has also been reported to suppress metastasis by modulating the expression of Rac1, β-catenin, collagen and laminin (Oh et al., 2010). However, RBM5 has also been found to be up-regulated in some aggressive forms of breast and ovarian cancers (Oh et al., 1999; Rintala-Maki et al., 2007). The possible dual role of RBM5 in cancer progression may be linked to its various activities as splicing regulator, including differential recognition of its pre-mRNA targets.

Figure 1. Structure and functional analysis of the RBM5 OCRE domain.

(A) Domain composition of the related RBM5, RBM6 and RBM10 family proteins. (B) Sequence alignment of OCRE domains of human RBM5, RBM10 and RBM6 proteins. (C,D) Two views in a cartoon presentation of the RBM5 OCRE domain, showing side chains of conserved and exposed tyrosine residues, some of which were probed by mutational analysis. (C) At one side of the β-sheet surface the N-terminal extension shields the tyrosines (orange) from solvent exposure, (D) on the opposite surface of the β-sheet, numerous tyrosine residues (yellow) are solvent accessible. (E) Mutational analysis of conserved residues of the RBM5 OCRE domain. Specific mutations of conserved and accessible residues of RBM5 OCRE domain impair the activity of the protein in FAS alternative splicing regulation ex vivo. HeLa cells were co-transfected with a Fas wild type alternative splicing reporter (harbouring sequences between the 5’ end of exon 5 and the first 47 nucleotides of exon 7) and T7-RBM5 expression plasmids. RNA and proteins were isolated 24 hr after transfection. Patterns of alternative splicing were studied by RT-PCR using specific primers (PT1, PT2) and the percentage of inclusion was calculated and is presented in the histogram for a minimum of 16 replicas of the experiment. T-test (two-tailed distribution, homoscedastic) results are mentioned (**<0,01; ***<0,001). Full quantification and T-test results are provided in Figure 5—figure supplement 2.

Figure 1.

Figure 1—figure supplement 1. NMR analysis of the RBM5 OCRE domain.

Figure 1—figure supplement 1.

(A) Superimposition of 1H,15N-HSQC spectra of two different constructs comprising the RBM5 OCRE domain, residues 451–511 (red) and residues 451–532 (blue). (B) {1H}-15N heteronuclear NOE relaxation analysis of the two constructs (residues 451–511, red) and residues 451–532, blue) indicate that the peptide backbone beyond residue 511 is flexible. (C) Secondary chemical shifts of OCRE 451–511 indicates that OCRE comprises β-strand and extended conformations only.(D) Stereo view of the ensemble of 10 lowest energy NMR structures of the RBM5 OCRE domain and (E) Electrostatic surface charge of the OCRE domain, same view as in Figure 1B.

In the regulation of FAS alternative splicing, RBM5 inhibits mature spliceosome formation by blocking association of the U4/U6.U5 tri-snRNP complex (in complex B) after the splice sites flanking exon 6 have been recognized by U1 and U2 snRNPs (Bonnal et al., 2008). RBM5 therefore appears to influence the pairing between the splice sites after complex A formation and thus promote tri-snRNP assembly at the distal splice sites, leading to exon 6 skipping (Bonnal et al., 2008). Moreover, it was shown that the OCRE domain of RBM5 interacts with components of the tri-snRNP and is essential for the function of RBM5 in regulating FAS splicing (Bonnal et al., 2008).

OCRE domains, which are present in different proteins in various organisms (Xiao et al., 2013), were identified by bioinformatic sequence analysis as a tandem of five imperfect octameric repeat sequences with triplets of aromatic residues (Callebaut and Mornon, 2005), expected to predominantly adopt β-strand secondary conformation. Due to the presence of OCRE domain in RNA binding proteins such as RBM5, they were presumed to facilitate RNA binding (Callebaut and Mornon, 2005). However, at least in the case of RBM5, the OCRE domain appears to play a role in mediating protein-protein interactions (Bonnal et al., 2008).

In order to provide molecular insights into the function of the OCRE domain in splicing regulation, we have combined structural and biochemical experiments with mutational analyses in vitro and in vivo. We show that the RBM5 OCRE domain directly binds to the spliceosomal SmN/B/B’ proteins. Sm proteins are core components of most spliceosomal snRNPs and form a heptameric ring composed of SmD1, SmD2, SmF, SmE, SmG, SmD3 and SmN/B/B’, which jointly recognize the uridine-rich Sm site RNA motif in U1, U2, U4 and U5 snRNAs (Kambach et al., 1999; Will and Lührmann, 2001; Pomeranz Krummel et al., 2009; Weber et al., 2010; Leung et al., 2011; Matera and Wang, 2014; Kondo et al., 2015). SmB and SmB' are encoded by two transcript variants from the SNRPB gene, while a different gene, highly expressed in brain and heart, encodes the homologous protein SmN (Schmauss et al., 1992). SmN/B/B’, SmD1, and SmD3 have C-terminal extensions that include symmetrically dimethylated RG repeats (Brahms et al., 2001; Tripsianes et al., 2011). The SmN/B/B’ C-terminal tails contain additional proline-rich sequences, where these regions in SmN, SmB and SmB’ are 93% homologous (van Dam et al., 1989).

Here, we show that the RBM5 OCRE domain binds to the C-terminal proline-rich motifs present in SmB and SmN. The structure of the RBM5 OCRE domain adopts a unique β-sheet fold that recognizes the proline-rich C-terminal tails of the SmN/B/B' proteins through aromatic-CH interactions. We demonstrate that disruption of these interactions by mutations in the OCRE domain or in the proline-rich motifs of its ligands decreases the affinity between SmN/B/B’ and RBM5 in vitro and affects alternative splicing regulation of the FAS gene. Our results demonstrate that OCRE is a novel protein-protein interaction domain that mediates interactions with the core spliceosome in alternative splicing regulation.

Results

The RBM5 OCRE domain adopts a unique β-sheet fold involved in splicing regulation

To gain insights into the molecular functions of the RBM5 OCRE domain, we determined the three-dimensional structure for the human RBM5 OCRE domain using solution NMR techniques. Previous reports (Callebaut and Mornon, 2005) and a multiple sequence alignment of the related splicing factors RBM5, RBM6, and RBM10 indicated that the OCRE domain spans amino acid residues 451–511 of RBM5 (Figure 1A,B). NMR chemical shift and 15N relaxation analyses of a construct with a C-terminal extension (451–532, including the KEKE region) did not reveal additional structural elements, suggesting that residues 451–511 indeed define the OCRE fold (Figure 1—figure supplement 1A,B). An analysis of 13C secondary chemical shifts shows that the secondary structure of the RBM5 OCRE domain comprises six β-strands between residues 460 and 505 (Figure 1B; Figure 1—figure supplement 1C). Notably, residues 452–459 preceding β1, and the loops connecting the β-strands are well-structured and not flexible (Figure 1—figure supplement 1B,C).

The solution structure of the RBM5 OCRE domain is shown in Figures 1C,D and Figure 1—figure supplement 1D,E; structural statistics are provided in Table 1. Residues 462–465 (β1), 470–473 (β2), 478–481 (β3), 486–489 (β4), 494–498 (β5), and 504–506 (β6) make up six consecutive anti–parallel β-strands. An N-terminal extension (residues 452–459) packs against one side of the β−sheet and is connected to the β1 strand by a 310 helix (residues 459–461) (Figures 1C,D). The six β–strands form a twisted β–sheet where aromatic side chains are exposed on opposite surfaces of the β–sheet (Figure 1C,D). On one side, the aromatic side chains of Tyr464 (in strand β1), Tyr 471 (β2), Tyr480 (β3), Tyr487 (β4) form extended aromatic side chain interactions on the surface. Tyr487 forms an additional hydrophobic cluster with Tyr462 and Pro457 from the N-terminal extension. Trp498 and Tyr505 from the C-terminal extension, and the N-terminal Tyr454 are also involved in this cluster (Figure 1C). This forms a compact fold that is further stabilized by a salt bridge involving Lys453 and Glu503 and thus brings the N- and C-terminal regions of the OCRE domain in close spatial proximity. Due to the interactions with residues from the N-terminal extension this aromatic surface of the OCRE domain is shielded from the solvent. In contrast, on the other side of the β-sheet, the interactions of the aromatic side chains of Tyr470 (β2), Tyr479 (β3), Tyr486, Tyr488 (β4), and Tyr495, Tyr497 (β5) form an aromatic surface with the tyrosine hydroxyl groups exposed to the solvent (Figure 1D). The N-terminal extension has an extended conformation consistent with secondary 13C chemical shifts (Figure 1—figure supplement 1C). Although the topology of the six antiparallel β-strands is rather simple, the twisted β-sheet of the OCRE domain appears unique. Structural similarity searches with Dali (Holm et al., 2008) and SSM (Krissinel and Henrick, 2004) did not reveal any significant structural homologs (Z-scores <2), indicating that the OCRE domain represents a unique fold. While the β-sheet fold is quite simple, the specific arrangement of the N-terminal extension represent the unique features of the OCRE fold. The electrostatic surface potential of the OCRE domain is predominantly negatively charged with some hydrophobic patches (Figure 1—figure supplement 1E), consistent with a potential role in protein–protein interactions and less likely for nucleic acid binding.

Table 1. Structural statistics RBM5 OCRE and OCRE/SmN complex.

OCRE OCRE + SmN
NMR distance and dihedral restraints
Distance restraints
 Total NOE 1171 1127
 Intra-residue 220 419
 Inter-residue
Sequential (|i-j| = 1) 292 118
Medium-range (|i-j| < 4) 156 114
Long-range (|i-j > 5) 503 329
 Hydrogen bonds 20 11
Protein-peptide intermolecular N/A 109
Dihedral angle restraints
 ϕ 47 51
 ψ 51 51
Structure statistics
Violations (mean and s.d.)
 Distance restraints (>0.2 Å) 0.685 ± 0.363 0.324 ± 0.111
 Dihedral angle restraints (>5 °) 0 ± 0 0 ± 0
 Max. distance restraint violation (Å) 0.657 ± 0.470 0.587 ± 0.142
 Max. dihedral angle restraint violation (°) 0 0
Deviations from idealized geometry
 Bond lengths (Å) 0.0038 ± 0.0002 0.0046 ± 0.0001
 Bond angles (°) 0.444 ± 0.026 0.616 ± 0.015
 Impropers (°) 1.230 ± 0.0681 1.737 ± 0.00889
Average pairwise r.m.s.d.* (Å)
 Heavy 0.72 ± 0.05 0.71 ± 0.10
 Backbone 0.38 ± 0.07 0.38 ± 0.09

Pairwise r.m.s.d. was calculated among 10 refined structures for residues 455-508 (RBM5 OCRE) and 221-229 (SmN) after water refinement.

To get initial insights into functionally relevant residues and subdomains, we focused on surface-exposed amino acids conserved between OCRE domains from different RBM proteins. These include Tyr495, Tyr497 on one surface of the fold and Tyr454, Glu501 and Asp458 on the opposite side. To assess the functional relevance of these residues, we generated triple and double point mutations (Y495/Y497/E501(YYE)>AAK) and (Y454/458(YD)>AR), respectively. The mutants were expressed in HeLa cells, co-transfected with a FAS alternative splicing reporter (Forch et al., 2000) and the pattern of Fas exon 6 inclusion/skipping analyzed by RT-PCR. While the YD mutant had similar activity as the WT protein in promoting Fas exon 6 skipping, the YYE mutation failed to induce significant levels of FAS exon 6 skipping, similar to the effect of deleting the entire OCRE domain (Figure 1E). To further investigate the importance of the residues in the YYE cluster, single point mutants were generated (Y495A, Y497A, E501D/K) and tested by co-transfection. Replacement of the aromatic tyrosines that are exposed on one surface of the β-sheet to alanine impaired the function of the protein in FAS alternative splicing regulation, arguing that the tyrosine residues in the surface of the β-sheet are important for the function of the OCRE domain in splicing regulation (Figure 1E).

RBM5 OCRE domain binds to the C-terminal tails of SmN/B/B’

It was previously shown that the RBM5 OCRE domain mediates interactions with protein components of the U5 snRNP complex, but the direct binding partners were not determined (Bonnal et al., 2008). SmN, a core snRNP protein highly homologous to SmB/B', was among proteins, identified by mass-spectrometry, pulled down with GST-OCRE RBM5 from HeLa cell extracts (Bonnal et al., 2008). We also found RBM5 as an interacting partner in a two-hybrid screen using the C-terminal tail of human SmB as bait (unpublished results). SmN/B/B' are components of the Sm core, a heptamer of Sm proteins that associates with all spliceosomal snRNPs except U6. While SmB and SmB’ are ubiquitous, SmN is primarily found in neuronal and cardiac cells but has also been found to be expressed in HeLa cells (Sharpe et al., 1990). The three SmN/B/B’ isoforms have a similar domain architecture, comprising a small globular N-terminal Sm domain (Kambach et al., 1999) and a C-terminal region that is expected to be intrinsically disordered. The C-terminal region starts with an arginine-glycine (RG)-rich region at ~residue 90 that harbors symmetrically dimethylated arginine residues, and additionally comprises a 60–70 residue proline-rich region of unknown function beyond residue 167 (Figure 2A).

Figure 2. Interaction of RBM5 OCRE with SmN/B/B’ proteins.

(A) Domain structure and multiple sequence alignment of the C-terminal tails of human SmN/B/B’ proteins. (B) In vitro pull down assays of 35S-labeled SmB with GST-RBM5 full-length, N- and C-terminal halves of human RBM5. First lane represents 20% of the input used in the pull down. (C) Conserved tyrosine residues in the RBM5 OCRE domain are important for the interaction with SmN protein. GST-pull down assays were carried out with different variants of RBM5 comprising the OCRE and KEKE domains harboring the mutations indicated. The detection of the protein was carried out by western blot with T7 epitope antibody and Ponceau staining was performed as a loading control. Quantification is provided in Figure 2—figure supplement 1. (D) The C-terminal domain of human SmB binds to the OCRE domain of RBM5 in vivo. Yeast two hybrid plasmids encoding the C-terminal domain of human SmB (SmB C-tail) and the indicated RBM5 coding regions were transformed into yeast. Serial dilutions of equivalent amounts of exponentially growing yeast were plated on double and triple dropout media. Growth on -Leu -Trp -His is indicative of an interaction between the tested proteins: RBM5-FL: full length RBM5 protein; RBM5 ∆OCRE: deletion of the RBM5 OCRE domain (amino-acids 452–535); RBM5 OCRE: RBM5 OCRE domain (amino-acids 452–535). (E) Surface-exposed tyrosine residues are important for SmB C-tail binding in vivo. RBM5-FL: full length RBM5 protein; RBM5 Y495F: RBM5-FL carrying a tyrosine to phenylalanine substitution at position 495; RBM5-FL Y495A: RBM5 carrying a tyrosine to alanine substitution at position 495. The two-hybrid assay was performed as described in panel d. (F) NMR titrationof 15N-labeled RMB5 OCRE domain (0.2 mM- black) with SmN residues 219–229 (red) at two-fold molar excess. (G) Mapping of NMR chemical shift perturbations upon titration of the OCRE domain with SmN (residues 219–229) onto a surface representation of the RBM5 OCRE domain structure.

Figure 2.

Figure 2—figure supplement 1. Aromatic residues in RBM5 OCRE are important for interaction with SmN.

Figure 2—figure supplement 1.

(A) GST-pull down assays were carried out with T7 epitope-tagged-SmN (T7-SmN) and wildtype RBM5 fragment of the region comprising the OCRE and KEKE domain and different variants bearing mutations of Tyr495 to Ala, Thr, Phe and Trp (as in Figure 2C).The detection of the protein was carried out by Western blot using anti-T7 and GST-epitope antibodies and by Ponceau staining as loading control. (B) Quantification of the ratio between T7-SmN and GST-fusion proteins for 4 to 7 replicates of the experiment. The ratio was normalized to 1 for wild type GST-OCRE. T-test (two-tailed distribution, homoscedastic) results are indicated (*<0,05; **<0,01 and ***<0,001).
Figure 2—figure supplement 2. Interaction of RBM5 OCRE with C-terminal regions of SmN and SmB.

Figure 2—figure supplement 2.

NMR titration of 0.2 mM 15N-labeled OCRE domain (black) with SmN and SmB peptides (red), (A) SmN 97–196, (B) SmN 167–196, (C) SmN 167–240 (A-C: 2-fold molar excess), (D) SmN 219–229 (6-fold molar excess), (E) SmB 167–231, 4-fold molar excess. (F) ITC analysis of the interaction of OCRE with C-terminal tails of SmN (residues 167 to 240) and SmB (residues 167 to 231). (G) Comparison of the chemical shift perturbations seen for the C-terminal tails of (C) SmN and SmB (E).

As the globular N-terminal Sm domain in SmN/B/B’ proteins participates in the formation of the heptameric Sm core in the U snRNP complexes (Kambach et al., 1999; Pomeranz Krummel et al., 2009) it seems unlikely that it could interact with the RBM5 OCRE domain. We thus considered the possibility that, in particular, the C-terminal domain of SmN/B/B’ could directly bind to the RBM5 OCRE domain. As an initial test, we verified that recombinant purified GST-RBM5, or a derivative containing the C-terminal (OCRE-containing) domain, pulled down in vitro translated SmB, while a GST-fusion of the N-terminal domain of RBM5 did not (Figure 2B). Next we carried out pull-down experiments using recombinant GST-tagged RBM5 OCRE domains (wild type and mutants where surface-exposed tyrosine residues are replaced by alanine) and assessed the ability to pull down purified recombinant N-terminal T7 epitope-tagged-SmN protein produced in mammalian cells (Figure 2C; Figure 2—figure supplement 1A). These experiments demonstrate that recombinant GST-tagged OCRE domain binds SmN in vitro and that the mutations Y479A, Y488A, or Y495A diminish SmN binding (Figure 2C), consistent with the decrease in activity of Y495A in splicing regulation (Figure 1E). The mutations Y486A, Y497A and E501K do not significantly impair the interaction (Figure 2C). Interestingly, mutation of Y495 to tryptophan or phenylalanine did not compromise binding, arguing that an aromatic residue at this position is required for the interaction (Figure 2—figure supplement 1).

To further characterize these interactions by an independent method, we performed yeast-two-hybrid assays and tested whether the C-terminal tail of SmB binds to different variants of RBM5. These experiments demonstrate a robust interaction between RBM5 and the SmB tail, where the OCRE domain is found to be necessary and sufficient for the interaction (Figure 2D). Moreover, mutations of surface-exposed tyrosines to alanine reduce the interaction, while mutation to another aromatic amino acid (phenylalanine) maintains the interaction (Figure 2E) as was observed for the full-length SmN protein (Figure 2C; Figure 2—figure supplement 1).

RBM5 OCRE domain recognizes proline-rich motifs in the SmN/B/B’ tails

We next wished to identify the region in the Sm tails that mediates the OCRE interaction by using NMR and isothermal titration calorimetry (ITC) experiments. We studied binding of different constructs of the C-terminal tail of SmN, which harbors a region comprising Arg-Gly motifs (residues 95–134, not shown) and a proline rich region (residues 167–240, up to the C-terminal end, Figure 2A) (Weber et al., 2010). The interaction of the RBM5 OCRE domain was assessed by monitoring NMR 1H, 15N chemical shifts of 15N-labeled OCRE domain during a titration with different SmN-derived peptides (Figure 2—figure supplement 2). Titrations with two different SmN fragments (residues 97–196 and 167–196) that both comprise parts of the proline-rich region gave rise to significant and comparable chemical shift perturbations (CSPs) (Figure 2—figure supplement 2A,B), suggesting that it is the proline-rich region, which is common to both peptides, that mediates the interaction. A peptide comprising the complete proline-rich region (residues 167–240) shows comparable chemical shift changes. SmN (167–196) and (167–240), both of which comprise about one half and the complete proline-rich region, bind to RBM5 OCRE with dissociation constants of KD = 195 μM and 41 μM, respectively, as determined by ITC (Figure 2—figure supplement 2F; Table 2).

Table 2. Isothermal titration calorimetry data for the OCRE/SmN/B/B’ interaction.

RBM5_OCRE WT Y495A Y495T Y495F Y495W Y488A Y486A Y479A Y454A/D458K E501K
SmN_167-240 WT WT WT WT WT WT WT WT WT
KD (µM) 41 ± 2 172 ± 6 186 ± 12 87 ± 3 35 ± 3 220 ± 11 145 ± 5 111 ± 5 106 ± 15 48 ± 2
KD values were determined from replicate measurements, with standard deviations as indicated.
RBM5_OCRE WT WT WT WT WT WT
SmN_167-240 4P→4A 4P/3P→4A/3A 4P/3P→4G/3G R1→E1 R2→E2 R1/R2→E1/E2
KD (µM) 29 ± 3 103 ± 16 192 ± 66 66.5 ± 6 74 ± 4 149.5 ± 46
RBM5_OCRE WT
SmN_167-196 WT
KD (µM) 195 ± 21
RBM5_OCRE WT WT WT
SmB_167-231 WT 4P→4A R1/R2→E1/E2
KD (µM) 21 ± 3 20 ± 2 159 ± 8

SmB and SmN share a poly-proline-rich region in their C-terminal tails, with 80% sequence identity and 98% sequence similarity (Figure 2A). Upon titration of the SmB C-terminal region (residues 167–231) to 15N-labeled OCRE, significant NMR CSPs are observed (Figure 2—figure supplement 2E). Moreover, the observed CSPs are very similar to those seen with the SmN titration and affect the same residues, indicating that SmB and SmN bind to the same site in the RBM5 OCRE domain (Figure 2—figure supplement 2G). The binding affinity of the SmB and SmN peptides to OCRE, determined by isothermal titration calorimetry (ITC) gave similar KD values of 21 μM and 41 μM, respectively (Table 2). Considering the high sequence similarity between SmB, SmB’ and SmN, it is expected that OCRE will interact with proline-rich sequences present in SmB and SmB’ as well.

It is interesting to note that the NMR CSPs observed for the OCRE domain when titrated with a peptide comprising just one proline-rich motif (PRM, residues 219–229) (Figure 2F) or the larger fragments of SmN are very similar (Figure 2—figure supplement 2A–D). This indicates that all SmN fragments harboring PRMs utilize the same binding site on the OCRE domain. The strongest CSPs are observed for the amides of Y470, Y488, and Y495, which map to the cluster of exposed aromatic tyrosines on β2, β4, and β5 of the OCRE domain, respectively (Figure 2G). However, the short peptide containing four consecutive proline residues induces comparable CSPs only at much higher peptide:OCRE ratio compared to the longer Sm tail peptides, thus indicating higher binding affinity for the longer Sm tail constructs (Figure 2—figure supplement 2). Nearly identical spectral changes are induced by the peptide, suggesting that all PRMs contribute to the overall affinity for the OCRE domain. Thus, the interaction with multiple motifs is enhanced by avidity, e.g., interaction of multiple PRMs within the Sm tails provides increased local concentration of PRM ligand motifs and thereby contributes to the significantly higher affinity of Sm-tails compared to a single PRM.

In order to assess the sequence requirements of PRMs for binding to the OCRE domain, we compared the amino acid sequences of the SmN/B tails (Figure 3A). This analysis reveals the presence of multiple (five to six) PRMs harboring three or four consecutive prolines in SmB or SmN, which are flanked by conserved arginine residues within ±3 residues on either side of the poly-proline motif (Weber et al., 2010). The role of these conserved sequence features was probed by compared binding affinities of wild type and variant Sm tails to the RBM5 OCRE domain using ITC experiments (Figure 3A). Reducing the number or PRMs significantly decreases the binding affinity, consistent with avidity effects and contribution to the overall affinity by the presence of multiple PRMs. Replacing four prolines by four alanines has little effect. This observation is consistent with the fact that consecutive stretches of alanines adopt a poly-proline type II (PPII) helical conformation, as confirmed by CD spectra (Figure 3—figure supplement 1). As the SmN/B/B’ PRMs adopt a PPII helical conformation when bound to the OCRE domain (see below), the replacement by four alanines can thus be tolerated to some extent. Notably, charge reversal of two arginines flanking the poly-proline motif on either side significantly decreases the binding affinity. These data suggest that a PPII helical conformation flanked by positively charged arginine residues is specifically recognized by the OCRE domain.

Figure 3. Proline-rich motif sequence requirements for OCRE binding.

(A) Sequence of the C-terminal tail of human SmN/B/B’ and mutations of proline-rich-motifs (PRMs) tested. Conserved three or four-proline motifs and flanking arginine residues are highlighted. The binding affinities of these tails to the RBM5 OCRE domain determined by ITC (Table 2) are indicated on the right. Note, that association constants are shown here. (B) Binding of various PRM peptides to RBM5 OCRE monitored by NMR titrations. Relative binding affinities of PRM peptides to RBM5 OCRE based on a normalized chemical shift perturbation score are shown.

Figure 3.

Figure 3—figure supplement 1. Circular dichroism spectra of peptides used in the OCRE binding study.

Figure 3—figure supplement 1.

The presence of a strong negative band at 200 nm and a weak positive band at 217 nm are characteristic features of a PPII helix in CD spectra (Drake et al., 1988).In the presence of a higher number of proline residues, the positive CD band shifts towards 229 nm (Petrella et al., 1996). Therefore, wildtype (blue) and 4A mutant peptide (red) show PPII helical conformation, while the APAP mutant (green), which lacks the characteristic positive band at 217/229 nm does not exhibit a PPII helical conformation. The 4G mutant peptide (orange) also shows residual PPII helical conformation, consistent with previous reports (Kelly et al., 2001; Brown and Zondlo, 2012). Of the two peptides with three prolines, the one where the 3P motif is preceded by GIMA (purple) has PPII helical conformation (although the negative band at 199.5 nm is less intense as compared to the wildtype peptide), while the one preceded by VGRA (cyan) has a random coil conformation (lacking a positive band around ~229 nm).
Figure 3—figure supplement 2. Residues used for normalized CSP score calculation.

Figure 3—figure supplement 2.

Overlay of 15N-HSQC spectra of free OCRE (black) vs bound to wild type peptide-GMRPPPPGIRG (red), mutant peptides -GMAPPPPGIRG (blue) and GMEPPPPGIEG (green). Zoomed regions on the right show the residues that were used for calculation of the normalized chemical shift perturbations (CSP) score to allow a comparison of relative binding affinities of the different peptides. Only spectra of three peptide bound states of OCRE are shown to illustrate the affinity range: largest CSPs are observed for GMRPPPPGIRG, intermediate for GMAPPPPGIRG, and lowest for GMEPPPPGIEG.

The contribution of individual residues within the PRM motif was determined by comparing relative binding affinities of wild type and mutant 11-mer peptides that exhibit sequence features observed in the Sm tails (Figure 3B). As the binding affinities of these peptides are beyond the detection limit of ITC, we resorted to a semi-quantitative NMR CSP-based approach. We devised a normalized CSP score for a set of seven amide signals surrounding the binding pocket showing significant CSPs in the OCRE domain to obtain a proxy for the relative binding affinities (see Materials and methods for details) (Figure 3B—figure supplement 2). As the comparison between different peptides is based on the CSP score derived from the same set of residues, it indirectly reflects their relative binding affinities. Several themes emerge from this comparison suggesting that the OCRE domain has a preferred binding motif, but can also accommodate some sequence variants. First, wild type peptides with three prolines induce smaller CSPs than those with four proline motifs and binding saturation is achieved only at higher peptide concentration, indicating a lower affinity. PRMs harboring only three consecutive proline residues show 2–3-fold reduced affinity relative to four proline motifs. Second, an APAP motif, which disrupts PPII conformations, has a strongly reduced CSP score, suggesting that the structure and composition of the PPII are important for interaction. Third, both arginines flanking the PRM motif are important for the interaction, as a double charge reversal (R→E) strongly reduces the binding affinity. Neutralizing the charge with R→A mutations had a smaller effect, presumably because no charge clashes are introduced with the negatively charged surface of the OCRE domain (Figure 1—figure supplement 1E) as is the case for R→E mutations. Taken together, the analysis shown in Figure 3 indicates that a PPII conformation mediated by four consecutive proline residues with flanking positively charged residues is optimal for OCRE binding. Notably, the arginine residue preceding the PRM appears more important for the interaction than the one located C-terminal to the PRM.

Solution structure of an RBM5 OCRE/SmN peptide complex

To determine molecular details of the recognition of SmN/B/B’ by the RBM5 OCRE domain, we determined its structure in complex with a proline-rich peptide derived from the C-terminal region of SmN (GMRPPPPGIRG) corresponding to SmN residues 219–229 (Figure 4; Figure 4—figure supplement 1A). This motif is also found within residues 219–229 in SmB with only one amino acid difference (GMRPPPPGMRG). The structure of the complex is defined by numerous NOEs, and is supported by 109 distance restraints derived from intermolecular NOEs (Figure 4—figure supplement 1B, Table 2). The structure shows that the central proline stretch (SmN Pro222-Pro225) of the bound peptide adopts a PPII helix (Figure 4). The PPII conformation is also indicated by the 13C chemical shifts of the peptide. Considering the large excess of peptide, the chemical shifts largely reflect the unbound state. This indicates that the PPII conformation is already preformed in the absence of the OCRE domain, consistent with our CD data (Figure 3—figure supplement 1). The central part of the proline-rich peptide interacts with conserved tyrosine residues in the OCRE domain in strands β3 (Tyr470, Tyr479), β4 (Tyr486, Tyr488) and β5 (Tyr495) (Figure 4A–C). On one side, Tyr488 stacks with SmN Pro222 and on the other with Tyr479, which itself contacts SmN Pro224 by T-stacking. The Tyr486 side chain stacks with SmN Pro225 and interacts with Tyr495, which in turn contacts Pro223 in the SmN peptide.

Figure 4. Structure of the RBM5 OCRE/SmN peptide complex.

(A,B) Side and top views in a cartoon presentation of the RBM5 OCRE/SmN peptide complex.The secondary structure and loops in the RBM5 OCRE domain are colored in green and grey, respectively, the proline-rich motif (PRM) peptide corresponding to SmN residues 221–229 is shown in yellow. (C–E) Zoomed views of key interaction sites. (C) The SmN PRM adopts a proline-type II helical conformation and is recognized by stacking with key tyrosine residues from the OCRE domain. The side chain of SmN Ile227 packs against the hydrophobic surface of the PPII helix. (D) Recognition of SmN Arg221 by interactions with hydroxyl groups of Tyr472, Tyr479 and Ser490 (E) Arg228 forms electrostatic contacts with the side chains of Asp481 and Ser484.

Figure 4.

Figure 4—figure supplement 1. Structural analysis of the OCRE/SmN complex.

Figure 4—figure supplement 1.

(A) Stereo view of the ensemble of 10 lowest energy NMR structures of the RBM5 OCRE domain in complex with SmN peptide, same view as in Figure 1D. (B) Intermolecular NOEs between the RBM5 OCRE domain and the SmN peptide. A zoomed view of a 2D ω1-filtered NOESY spectrum recorded on a 2 mM 15N,13C-labeled OCRE domain bound to 14 mM unlabelled SmN (residues 219–229). Assignments of the OCRE domain and the unlabelled SmN peptide are indicated on top and on the right, respectively.
Figure 4—figure supplement 2. NMR spectra and ITC data for OCRE domain mutants.

Figure 4—figure supplement 2.

Comparison of NMR spectra of the RBM5 OCRE domain wild type (black) and different mutants (red, as indicated).Spectra in (A) indicate that the overall fold is intact, with some potential local effects near the residue mutated. (B) These mutants suggest local unfolding of the β-strand, which comprises the altered residue, but are also consistent with an overall folded domain.

A unique binding orientation of the peptide is defined by a combination of the stacking interactions with the PPII conformation of the peptide, and interactions with the arginine residues flanking the proline-rich stretch on either side, as well as SmN Ile227. The side chain of SmN Arg221, preceding the poly-proline stretch is poised to interact with the hydroxyl groups of OCRE Tyr472, Tyr479 and Ser490 in loop β1-β2 (Figure 4D). The side chain of SmN Arg228 can form potential hydrogen bonds with the side chains of Asp481 and Ser484 in the OCRE domain (Figure 4E). Both residues are conserved in loop β3-β4 in the OCRE domains of RBM5 and RBM10 (Figure 1B). In addition, the hydrophobic side chain of SmN Ile227, located C-terminal of the proline-rich motif packs against SmN Pro223, consistent with numerous intermolecular NOEs between the two side chains, and thus shields the hydrophobic PPII helix from solvent exposure (Figure 4C). To enable these contacts involving the Ile227 side chain, the presence of a preceding Gly226, enables a specific backbone conformation by allowing unusual backbone torsion angles.

Considering the specific spacing of the arginine residues to the four-proline-stretch in the SmN ligand, these interactions define a unique, unambiguous orientation of the peptide along the OCRE β-sheet surface. The structure of the OCRE domain does not undergo significant conformational changes upon binding to the SmN peptide, with a backbone coordinate r.m.s.d. of 1.7Å between the free and SmN-bound OCRE domain. However, the tyrosine side chains involved in the SmN interaction slightly rearrange to optimize contacts with the proline-rich SmN ligand.

To summarize, the RBM5 OCRE domain uses an array of tyrosine side chains that are exposed from one side of its β-sheet to recognize a proline-rich motif in a PPII conformation. Additional specific interactions are mediated by two positively charged side chains flanking the N- and C-terminal sides of the poly-proline helix in the bound SmN peptide, which in part involve the side chain hydroxyl groups of the exposed tyrosines, thus consistent with the strong conservation of tyrosines in the OCRE domain. These structural insights reveal that the OCRE domain represents a novel proline-rich motif binding domain.

Structure-based mutation analysis in vitro and in vivo

To evaluate the importance of contacts observed in the OCRE-SmN structure we carried out a mutational analysis of the OCRE domain. We compared the binding of wild-type and mutant OCRE domains to SmN/B/B’ in vitro using GST pull-downs, ITC and NMR titrations, and analyzed the functional activity of wild-type and OCRE domain mutants in alternative splicing regulation. We focused on the effects of mutation of tyrosine residues located within the proline-rich motif binding pocket, including Tyr470, Tyr479, Tyr486, Tyr488 and Tyr495 and control mutations of residues that are not involved in the SmN interactions, i.e. Tyr497, Glu501 and Tyr454/Asp458.

To analyze whether the effects of the OCRE mutations could induce (partial) disruption of the OCRE structure, recombinant OCRE proteins were expressed and purified and structural integrity analyzed by NMR (Figure 4—figure supplement 2). The NMR spectra of the Y495F, Y495W and E501K OCRE domains suggest that these proteins are globular folded. Small changes in position and intensities of the NMR signals for the Y495F/W mutations likely reflect that aromatic side chains have significant contributions to the chemical shifts of surrounding residues, but are still consistent with the integrity of the overall fold. NMR spectra of the Y495A, Y495T, Y497A and the double mutant Y454A/D458K suggest partial unfolding of strand β5 and the N-terminal extension, respectively, even though the overall fold appears to remain intact. Larger changes observed for the Y479A, Y486A or Y488A OCRE domains suggest more significant destabilization of the fold (Figure 4—figure supplement 2).

GST-pulldown experiments (Figure 2—figure supplement 1) performed with Tyr495 mutations in the RBM5 OCRE domain confirm the importance of the aromatic side chain of Tyr495 for Sm binding. Next, we compared the binding affinities of wild type OCRE domains with Y495A, Y495F, Y488A and of the double mutant Y454A/D458K using ITC (Table 2). Consistent with the structural analysis and the observed effects in splicing regulation, the Y495A and Y488A mutants show strongly reduced binding to the SmN-derived proline-rich ligands, KD = 172 and 220 μM, respectively, compared to the wild type OCRE domain (KD = 41 μM). The Y495F mutation does not strongly affect the SmN interaction KD = 87 μM. The double mutant Y454A/D458K shows a slightly reduced affinity to SmN KD = 106 μM, which, however, might reflect partial destabilization of the mutant OCRE fold (Figure 4—figure supplement 2). Yeast two-hybrid assays confirmed the importance of an aromatic residue at this position also for the interaction of the OCRE domain with SmB, as the Y495A mutation reduced the interaction between RBM5 OCRE and SmB while the Y495F retained binding competence (Figure 2E).

To probe the functional effects of the OCRE domain mutations, we tested the splicing activity of full-length RBM5 harboring mutations in the ligand binding region of the OCRE domain (Figure 5; Figure 5—figure supplements 1 and 2). As shown in Figure 1E, substitutions of conserved tyrosines that are involved in SmN interactions (Tyr495 and Tyr497) impair the activity of RBM5 in Fas alternative splicing regulation (Figures 5A,C; Figure 5—figure supplement 2). Further mutations of aromatic residues, including tyrosines 470, 479, 486 and 488 by alanine compromise the Fas exon 6 skipping activity of RBM5 to different extents, consistent with the reduced binding of these mutants to SmN-derived proline-rich ligands (see above) and with the location of these residues in the binding pocket for the SmN poly-proline motif (Figures 4, 5A–C).

Figure 5. Mutational analysis of the RBM5 OCRE – SmN/B/B’ interaction.

Specific mutations of conserved residues in the RBM5 OCRE domain impair the activity of the protein in FAS alternative splicing regulation ex vivo. HeLa cells were co-transfected with a FAS alternative splicing reporter and T7-RBM5 expression plasmids (wild type or mutations of tyrosine residues as indicated). RNA and proteins were isolated 24 hr after transfection. Patterns of alternative splicing were studied by RT-PCR using specific primers. (A) Inclusion and skipping products are annotated. (B) Protein expression was detected by western blot with an anti-T7 epitope antibody. (C) Quantification of the activity of RBM5 OCRE domain mutants in FAS alternative splicing regulation of 3 to 10 replicates of the experiment. The percentage of inclusion is presented. T-test (two-tailed distribution, homoscedastic) results are mentioned (**<0,01; ***<0,001).

Figure 5.

Figure 5—figure supplement 1. Effect of mutation of non-aromatic residues in RBM5 OCRE on FAS alternative splicing regulation.

Figure 5—figure supplement 1.

(A) Mutations at position Ser468 of RBM5 OCRE domain impair the activity of the protein in FAS alternative splicing regulation ex vivo. HeLa cells were co-transfected with a FAS alternative splicing reporter and T7-RBM5 expression plasmids. RNA and proteins were isolated 24 hr after transfection. The pattern of alternative splicing was studied by RT-PCR using specific primers (PT1/PT2). Inclusion and skipping products are annotated. The asterisk indicates non-specific amplification. (B) Protein expression corresponding to the samples in (A), detected by western blot with an anti-T7 epitope antibody. (C) Quantification of the activity of RBM5 OCRE domain mutants on FAS alternative splicing regulation for 12 to 19 replicates, as indicated at the bottom of the histogram bars. Average and standard deviation of the percentage of inclusion is represented. T-test (two-tailed distribution, homoscedastic) results are indicated (***<0,001).
Figure 5—figure supplement 2. Effects and statistical analysis of RBM5 mutations on FAS alternative splicing.

Figure 5—figure supplement 2.

(A) Quantification of the effect of RBM5 wild type and mutants on FAS wild type alternative splicing reporter regulation for all the experiments carried out in this study.Ctr: no RBM5 expressed. The number of replicates per mutant is annotated at the bottom of each histogram bar. (B) T-test (two-tailed distribution, homoscedastic) results for the data in (a), corresponding to each pairwise comparison (green<0,05; purple<0,01; orange<0,001).
Figure 5—figure supplement 3. Effects of SmN wild type and mutants expression on FAS alternative splicing.

Figure 5—figure supplement 3.

(A) Silencing of SmB/B’ affects alternative splicing of Fas and hnRNP AB genes.RNA was isolated from siRNA (on target smart pool siRNA, Dharmacon)-transfected HeLa cells. After reverse transcription using oligodT and random primers, PCR were performed using gene specific primers (Saltzman et al., 2011). The inclusion (I) and skipping (S) products of each gene are indicated. (B) SmN overexpression regulates FAS alternative splicing regulation. HeLa cells were co-transfected with a FAS alternative splicing reporter mutated at position -20 (U-20C) upstream of the 3’ splice site associated to exon 6 and with T7-SmN expression plasmids. RNA and proteins were isolated 24 hr after transfection. The pattern of alternative splicing was studied 24 hr later by RT-PCR using vector-specific primers (PT1/PT2). Inclusion and skipping products are indicated. Protein expression was detected by western blot using an anti-T7 epitope antibody. The asterisk indicates non-specific products. (C) Weakness of the 5’ splice site associated to FAS exon 6 is important for alternative splicing regulation by Sm proteins. RNA was isolated from siRNA HeLa transfected cells and the pattern of alternative splicing of FAS WT and FAS U1c mutant reporter (harbouring a mutation in the 5’ splice site of exon 6 that increases the base pairing potential with the 5’ end of U1 snRNA(Izquierdo et al., 2005) (sequence indicated) was detected by RT-PCR using vector-specific primers (PT1/PT2). Inclusion and skipping products are indicated. The asterisk indicates an additional amplification product. Western blot against SmB/B’ proteins (using the antibody 12F5, Santa Cruz, reference sc130670) was performed to evaluate the level of inactivation of the expression of the proteins and U2AF65 western blot analysis was performed as a control using the MC3 antibody. (D) Mutations in the C-terminal tail of SmN impair activity in FAS alternative splicing regulation. HeLa cells were co-transfected as in panel (B) using plasmids expressing T7-SmN variants bearing mutations indicated in the right part of the figure and containing an amino-terminal SV40 Large T antigen NLS (PKKKRKV). Average and standard deviation of the percentage of inclusion is indicated in the histogram. T-test (two-tailed distribution, homoscedastic) results are indicated (**<0,01; ***<0,001) as well as the number of replicates carried out for each mutant (from 5 to 51).

Substitution of the important Tyr495 residue by threonine or glutamate also compromised activity, while replacement by other aromatic residues (phenylalanine or tryptophan) retained full activity in splicing assays (Figure 5A–C; Figure 5—figure supplement 2), consistent with sustained SmN interaction (Figure 2; Figure 2—figure supplement 1). This argues that, in agreement with the structural analysis, the aromatic nature of the tyrosine is a key feature of the splicing regulatory properties of the OCRE domain. In contrast, mutations of residues, Tyr454, Asp458, Glu501 (Figure 1E) and Ser468 and Asn483 (Figure 5—figure supplement 1), which are remote from the SmN binding site, had only moderate effects on the activity of RBM5 on FAS splicing. All the mutant proteins accumulated to levels similar as the wild type (Figure 5B). These data further argue that specific recognition of the proline-rich SmN peptide by a cluster of aromatic residues in the RBM5 OCRE domain is required for the function of RBM5 as a splicing regulator.

We next wished to probe the contributions of different regions in the SmN/B/B’ proteins for FAS alternative splicing. Consistent with previous results (Saltzman et al., 2011), we observed that the knock down of the SmN/B/B’ proteins by siRNA in HeLa cells led to an increase in the level of FAS exon 6 skipping in endogenous transcripts or in transcripts derived from a Fas reporter (Figure 5—figure supplement 3A,C). These effects were attenuated by strengthening FAS exon 6-associated 5’ splice site (Figure 5—figure supplement 3C), as has been previously reported (Saltzman et al., 2011). Co-expression of SmN with a FAS alternative splicing reporter bearing a mutation (Fas U-20C) that increases FAS exon 6 skipping (Izquierdo et al., 2005), led to increased levels of exon inclusion, while neither expression of the amino terminal part nor of the C-terminal part of the protein did (Figure 5—figure supplement 3B). While expression of SmN variants harboring mutations in the proline-rich stretches (either to glycine or to APAP motif) does not significantly compromise exon inclusion, mutation of the arginines flanking stretches of four prolines does (Figure 5—figure supplement 3D). However, the results of SmN/B/B’ overexpression experiments are highly variable and difficult to interpret, perhaps because of additional complex effects of Sm protein overexpression on snRNP biogenesis/activity.

Taken together, the structural and functional analyses show that three tyrosine residues in the RBM5 OCRE domain (Tyr479, Tyr488 and Tyr495) play crucial roles in the recognition of proline-rich regions present in the conserved C-terminal tails of SmN/B/B’ and reveal a tight correlation between this binding and the activity of RBM5 as a regulator of FAS alternative splicing.

We have previously shown that the alternative splicing activity RBM5 depends on the C-terminal region of RBM5 and that recombinant RBM5 inhibits the transition from the pre-spliceosomal complex A to spliceosomal complex B (Bonnal et al., 2008). To assess which region of the RBM5 protein is required, we performed in vitro spliceosome assembly assays with AdML pre-mRNA, using the N- and C-terminal halves of RBM5 as well as with a C-terminal version that lacks the OCRE domain or a fragment that includes only the OCRE and KEKE domains (Figure 6—figure supplement 1). These and additional transient transfection experiments (Figure 6—figure supplement 2) confirm that the inhibition of complex B formation depends on the C-terminal region of RBM5 and reveal that the OCRE domain is necessary but not sufficient for this activity. This is consistent with a key role for the interaction of OCRE with the PRM in the spliceosomal tails in the tri-snRNP, but argues that additional interactions involving the C-terminal region of RBM5 are important, including for example the previously reported interaction with U2AF65 (Bonnal et al., 2008).

Figure 6. Spliceosome assembly assays with different regions of RBM5.

(A) RBM5 inhibits the full spliceosome assembly on AdML and blocks the transition from complexes A to B in an OCRE dependent manner.Splicing complexes assembled on AdML were resolved by electrophoresis on native gel in the presence or absence of 25, 50, 100 and 200 ng/ul of the indicated proteins. The position of the H, A and B complexes are indicated. (B) Quantification of the activity of spliceosome assembly of the C-term, N-term and ∆OCRE proteins. The experiment was performed three times and the results of a T-test (two-tailed distribution, homoscedastic) are indicated (*<0,05).

Figure 6.

Figure 6—figure supplement 1. Effects of RBM5 full-length and fragments on spliceosome assembly.

Figure 6—figure supplement 1.

The OCRE KEKE domains of RBM5 are not sufficient to trigger the block in transition from of A to B complexes. Splicing complexes assembled on AdML were resolved by electrophoresis on native gel in the presence or absence of 80, 400 and 2000 ng/ul of GST and GST OCRE KEKE proteins and 25, 50, 100 and 200 ng/ul of N-term, C-term and C-term ΔOCRE proteins. The presented result was reproduced twice under different conditions.
Figure 6—figure supplement 2. Effects of RBM5 full-length and OCRE on FAS alternative splicing.

Figure 6—figure supplement 2.

The OCRE KEKE domains of RBM5 are not sufficient to trigger Fas exon 6 skipping.HeLa cells were co-transfected using plasmids expressing T7-RBM5 wild type protein and T7-NLS-RBM5 OCRE KEKE (T7 epitope fused to the SV40 Large T antigen SV40 NLS (PKKKRKV) and region from amino acid 452 to amino acid 535 of RBM5 protein). Average and standard deviation of the percentage of inclusion are indicated in the histogram. T-test (two-tailed distribution, homoscedastic) result is 8.4 E-07. The number of replicates carried out for each construct is indicated at the bottom.

Discussion

The Octamer Repeat (OCRE) domain of RBM5 adopts a unique three-dimensional fold with a large number of conserved tyrosine residues. Although the primary sequence signature could suggest a linear sequence of octamer repeats, our structural analysis clearly reveals that the OCRE domain adopts a small globular fold. We show that the twisted β-sheet of the OCRE domain exposes conserved tyrosine residues that provide a platform for the recognition of a proline rich motif (PRM). This interaction is based upon a network of tyrosine residues as well as anchoring residues located in the loops of the β-strands. The SmN PRM is flanked by a positively charged arginine residue preceding the four consecutive prolines, followed by a hydrophobic residue and another arginine. Taking into account the sequence conservation, binding studies (Figure 3), and the details of molecular recognition of the SmN peptide (Figure 4), we propose that the OCRE domain recognizes a RPPP(P)GϕR consensus motif. A key feature of the motif is a central PPII helix, which is recognized by a network of parallel and aromatic T-stacking of Tyr470, Tyr479, Tyr482, Tyr488 and Tyr495 with the proline side chains in the PPII helix. These tyrosines are exposed at one side of the OCRE β-sheet and collectively embed the PPII helix. Specific contacts that define the orientation of the bound peptide are mediated by two flanking arginine residues, with a stronger contribution of the N-terminal arginine (SmN Arg221, Figure 3B). The glycine residue may enable an unusual backbone conformation such that hydrophobic residues (ϕ, Ile/Met/Val flanking different PRMs in the SmN/B/B’ C-terminal tails, Figure 3A) can shield the central hydrophobic PPII motif from solvent exposure. Notably, unbound PRMs harboring 3–4 consecutive prolines already adopt (at least partially) a preformed PPII conformation and thus reduce the entropy loss associated with binding to the OCRE domain. The somewhat reduced affinity for peptides with only three prolines may reflect a smaller propensity of forming PPII conformation in the free ligands and the lack of a preceding arginine, which is consistently present in all four proline PRMs.

The tertiary fold and the mode of PRM recognition by the OCRE domain is distinct from other PRM binding domains, such as GYF, SH3, and WW domains (Figure 7A). Although – as a common feature – aromatic residues are used to interact with proline residues and to recognize the PPII helix (Ball et al., 2005), different secondary structure elements are found in GYF (α-helical), SH3 (α/β fold), WW (β-sheet) and OCRE domains (β-sheet). Interestingly, a somewhat related PRM motif in the CD2 protein (PPPPGHR) was reported to interact with the GYF domain of the CD2BP2 protein (Freund et al., 2002). The CD2BP2 GYF domain and the FBP21 WW domain have been previously implicated in binding to the PRM sequences of SmB, suggesting that interactions between PRMs and these factors may be critical for the function of these proteins in spliceosome regulation (Bedford et al., 1998; Klippel et al., 2011).

Figure 7. OCRE is a novel PRM-binding domain that interacts with snRNPs.

Figure 7.

(A) Comparison of proline-rich motif (PRM) recognition by PRM binding domains.Cartoon representations of various PRM binding domains (green) with side chains of important residues shown as sticks and annotated. PRM ligands are shown by stick representation (yellow). PDB accession codes: GYF (PDB 1L2Z), SH3 (PDB accession 1ABO), WW (PDB 1I5H), and RBM5 OCRE. (B) The C-terminal tail of SmN/B/B’ comprising RG-rich regions and proline-rich motifs (PRM) extends far from the core Sm core fold in the seven-membered Sm ring of assembled U snRNPs. The crystal structure of U4 snRNP core (PDB 4WZJ) (Leung et al., 2011) is shown, the C-terminal tails of SmD1/D3, which harbour RG-rich regions and SmN/B/B’ comprising RG and PRM regions are not visible in the crystal structure and indicated schematically. (C) The seven-membered Sm core rings of U4 and U5 snRNPs are located at the outside of the assembled U4/U6.U5 tri-snRNP (Nguyen et al., 2015). Thus, also in the tri snRNP the SmN/B/B’ tails are accessible for interactions with the RBM5 OCRE domain.

An interesting aspect of the OCRE/Sm interaction is that the presence of multiple PRM motifs greatly enhances the overall affinity ranging from KD ≈ mM for a single PRM peptide to KD = 20–40 µM for complete Sm tails, and thus comparable to other PRM binding domains. The relatively weak affinity of a single PRM found in the Sm tails argues that avidity effects due to the presence of multiple motifs play an important role for high affinity binding by increasing the local concentration of PRM motifs available for OCRE binding. We note that avidity effects provided by multiple binding sites have also been observed for other PRM binding domains (Varani et al., 2000; Klippel et al., 2011), and that this feature is reminiscent of the recognition of dimethyl-arginine residues in the Sm tails by Tudor domains (Tripsianes et al., 2011). On the other hand, the weak binding affinity of a single motif indicates that the interaction between RBM5 and SmN/B/B’ may have been selected to be transient, as expected for a regulatory interaction that needs to promote interactions with the spliceosome but also needs to be disrupted at later steps of the splicing reaction.

Previously, we have shown that the splicing regulation by RBM5 acts at the stage of spliceosomal B complex formation (Bonnal et al., 2008). The role of RBM5 and its OCRE domain in this context could be to attract the U4/U6.U5 tri-snRNP to the splice sites. The fact that the SmN/B/B’ tails are intrinsically disordered and extend very far away from the seven-membered Sm core ring in U snRNPs (Figure 7B,C), is consistent with such an activity. In fact, the recent EM structure of the tri-snRNP (Nguyen et al., 2015) revealed that the Sm cores of U4 and U5, as well as the LSm ring of U6 are located at the outside of this large RNP (Figure 7C). Thus, the C-terminal tails of SmN/B/B’ of the U4 and U5 components are clearly accessible and could scan for possible binding partners of their proline-rich motifs, such as the RBM5 OCRE domain. This could help to localize the tri-snRNP to the splice sites that are regulated by RBM5.

SmB was described to participate in alternative splicing of its own pre-mRNA and many additional genes, including the FAS gene (Saltzman et al., 2011). Recently, mutations in SmB were linked to cerebro-costo-mandibular syndrome (Bacrot et al., 2015). The multi-domain protein RBM5 promotes FAS exon 6 skipping and it has been proposed that for this activity the protein modulates splice site pairing after the competing 5’ and 3’ splice sites have been recognized by U1 and U2 snRNPs, respectively (Bonnal et al., 2008). This model is based upon the observation that RBM5 inhibits the transition from A (U2 snRNP binding) to B (U4/U6.U5 tri-snRNP binding) complex formation in the introns flanking exon 6. It is conceivable that interactions mediated by the C-terminal tails of SmN/B/B’, which are present in U1, U2 and the tri-snRNP, facilitate the transition from A to B complex and that binding of the OCRE domain of RBM5 prevents these interactions and therefore the progression of spliceosome assembly. Direct or indirect association of RBM5 with particular regions of the pre-mRNA – likely involving other regions of the protein including RRM or zinc finger domains – may prevent A to B transition of pre-spliceosomes assembled on the flanking introns of exon 6, but fail to prevent tri-snRNP assembly on the distal splice sites, thus facilitating exon skipping. Alternatively, or in addition, RBM5 interaction with the C-terminal tails of SmN/B/B’ may lead to a general decrease in snRNP function, possibly more acute for U2 snRNP. As decreased activity of U2 snRNP – e.g. by individual knock down of its protein components – is known to result in increased FAS exon skipping (Papasaikas et al., 2015), RBM5-mediated reduction in U2 function could potentially explain at least part of the effects of the protein on FAS splicing regulation.

Taken together, our study provides a molecular understanding of how the OCRE domain of RBM5 interacts with proline-rich sequences of the SmN/B/B' tail and thus identifies a key interaction essential for regulation of alternative splicing.

It is interesting to note that, based on primary sequence alignments, the OCRE domains of RBM5 and RBM10 are expected to adopt highly similar structures and possibly functions. In fact, the recently reported structure of the RBM10 OCRE domain is highly similar to RBM5 OCRE (backbone coordinate r.m.s.d. 1.1Å) (Martin et al., 2016). Interestingly, mutation of a conserved tyrosine residue (corresponding to Tyr470 in RBM5, where it contributes to the recognition of the PRMs by the OCRE domain) is frequently found associated with lung carcinoma (Imielinski et al., 2012). It is thus tempting to speculate that splicing defects that are linked to impaired interactions with PRM in SmN/B/B’ proteins contribute to the pathogenesis.

Materials and methods

Cloning, protein expression and purification

RBM5 (Swiss Prot P42756) OCRE domain (451–511) and all the mutants were subcloned into a pETM11 vector (with a N-terminal His6-tag) from the corresponding GST-RBM5 full-length DNA. The several SmN and SmB constructs were cloned into a pETM30 vector, containing a N-terminal His6-tag followed by glutathione S-transferase protein. In both vectors, a TEV site is present before the corresponding protein. All the proteins were produced by overexpression in E. coli BL21 cells at 20°C for 16 hr after induction with 0.5 mM of IPTG when cells were at approximately O.D. 0.7 in media supplemented with 30 μg/ml kanamycin. For unlabeled proteins, bacteria were grown in Luria broth.

For isotope-labeled proteins, bacteria were grown in M9 minimal media supplemented with 13C-glucose and/or 15NH4Cl. Cell lysates were suspended in buffer containing 20 mM Tris pH 8, 300 mM NaCl, 5% glycerol, 10 mM imidazole, and 2 mM of 2-mercaptoethanol and purified with Ni-NTA Superflow beads (Qiagen, Hilden, Germany) using standard conditions. After overnight cleavage of the fusion protein with tobacco etch virus protease, proteins were purified in a gel filtration column 26/60 sephadex II (GE Healthcare, München, Germany) and buffer was exchanged to NMR buffer; 20 mM sodium phosphate pH 6.5, 50 mM NaCl. For measurements in D2O, the protein was lyophilized and dissolved in D2O. The OCRE domain used for NMR studies thus comprises residues 451–511 preceded by a GAM tripeptide that results from the TEV cleavage. The SmN 219–229 peptide (Peptide Specialty Laboratory, Heidelberg, Germany) was dialyzed against water, lyophilized and then dissolved in NMR buffer.

GST-tagged RBM5 OCRE domain proteins were expressed in BL21 cells. Bacteria were transformed and grown in one liter LB to an absorbance at 600 nm of 0.6 before the induction of the expression with 1 mM IPTG for 3 hr at 37°C. The pellet was resuspended in cold buffer X (20 mM Tris pH 7.5; 1 M NaCl; 0.2 mM EDTA; 1 mM DDT and protease inhibitors cocktail (Roche Diagnostics, reference 11697498001, Mannheim, Germany) and sonicated. The supernatant was collected after centrifugation for 20 min at 10,000 rpm at 4°C and incubated with Glutathione Sepharose 4B beads for 15 min on a rotating wheel at 4°C. The beads were washed with 50 ml buffer X 3 times 10 min and eluted on column with 50 mM glutathione, 100 mM Hepes pH 8,0; 1 mM DTT. The selected fractions were dialyzed against buffer D (20 mM Hepes pH 8.0; 20% glycerol; 0.2 mM EDTA; 0.1 M KCl; 1 mM DTT and 0.01% NP40), frozen in liquid nitrogen and stored at −80°C. T7-tagged SmN protein was expressed and purified from HEK 293T cells (Cazalla et al., 2005).

The N-terminal (aa 1–318) and C-terminal (320–815) regions of RBM5 were produced by PCR amplification using specific primers and cloned into the pET15b vector (Novagen) generating plasmids N-term and C-term, respectively. The plasmid C-term ΔOCRE was obtained by removing the aa 452 to 511 of plasmid C-term using site-directed mutagenesis. Recombinant His6-tagged proteins were expressed in E. coli (BL21-CodonPlus), solubilized under denaturing conditions (6M Guanidine hydrochloride) (Wingfield et al., 2001) and purified by Ni-NTA affinity chromatography. After purification, the proteins were dialysed overnight against buffer D using Slide-A-Lyser devices (Pierce).

NMR spectroscopy

Spectra were recorded at 298 K in DRX500, 600 and 900 spectrometers with cryogenic triple resonance probes using 13C, 15N labeled OCRE sample (1 mM) for the apo structure, and a 13C, 15N labeled OCRE and unlabeled peptide (2 mM and 14 mM, respectively) for the complex. Data were processed with NMRPipe (Delaglio et al., 1995) and analyzed using NMRView (Johnson and Blevins, 1994). For backbone resonance assignments, standard experiments were recorded, including 3D HNCA, HNCACB, HN(CO)CACB experiments. Side chain resonances assignments were made with 3D HCCH-TOCSY and 3D 15N-HSQC TOCSY experiments. Distance restraints were derived from 15N- and 13C-resolved three-dimensional, 1H homonuclear two-dimensional (mixing time 120 ms). In addition, for the complex structure, assignment and distance restraints information for the peptide were obtained by recording a 15N, 13C filtered TOCSY with the mixing times of 15, 30, 60, 90 ms and NOESY experiments with NOE mixing times of 120 ms. H/D exchange experiments were recorded after lyophilization. 15N relaxation experiments (T1, T2 and {1H}–15N heteronuclear NOEs) were measured for both the apo and bound form of OCRE. Chemical shift mapping on OCRE was done by monitoring the 2D 1H,15N HSQC, 2D 15N-labeled OCRE (0.2 mM) with an excess of unlabeled SmN constructs until no further changes in chemical shifts were observed in the 2D 1H,15N HSQC spectra. Combined CSPs were calculated as Δδ (ppm) = ((10*ΔδHN)2 + (ΔδN)2)1/2.

Determination of relative binding affinity for peptides from NMR titrations

A semi-quantitative approach was taken to assess the contribution of the amino acid sequence of PRM motifs in Sm tails to the binding interaction. Due to the relatively weak interaction, NMR titrations were used to compare relative affinities between wild type peptides and peptides with specific mutations. The SmN derived wild type and mutant peptides were titrated into OCRE at a 1:10 ratio of OCRE:peptide. CSPs from 7 OCRE residues (Y470, Y471, Y479, D481, N483, S490, Y495) were added for each of the peptide titration and normalized with that of the wild type peptide (GMRPPPPGIRG). The score is used to compare the relative affinities of the various peptides with the wild type (Figure 3B). An overlay of 1H,15N-HSQC spectra of OCRE bound to wild type (GMRPPPPGIRG) or variant peptides -GMAPPPPGIRG, GMEPPPPGIEG illustrates the different extent of CSPs observed for seven residues in the OCRE domain that are most strongly affected by the binding (Figure 3—figure supplement 2). These residues were selected to define the CSP score.

Structure calculations

For the apo structures, automatic NOE assignments and structure calculations were initially performed by CYANA3 (Güntert, 2009). Subsequently, NOEs were manually checked and applied as distance restraints together with dihedral angle restraints in a simulated annealing protocol using ARIA (Linge et al., 2001) and CNS (Brunger et al., 1998). Dihedral restraints were derived from TALOS+ (Shen et al., 2009), hydrogen bond distance restraints were applied based on secondary structure identified by TALOS+, and added during structure calculations. For the complex structure, manual assigned NOEs were applied as distance restraints together with dihedral angle and measure hydrogen bond restraints in a simulated annealing protocol using ARIA (Linge et al., 2001) and CNS (Brunger et al., 1998). Water refinement was performed on the final ensembles of NMR structures (Linge et al., 2003). The structural quality of the 10 lowest energy structures out of 100 calculated structures was evaluated using ProcheckNMR (Laskowski et al., 1996), and the iCING (Doreleijers et al., 2012) and PSVS (Bhattacharya et al., 2007) servers. Ramachandran statistics for the free RBM5 OCRE domain and the complex structure with SmN (219–229) are 91.5%/8.5%/0%/0% and 88.0%/9.8%/2.0%/0.2% in the most favored/additionally/generously/disallowed regions, respectively. Ribbon representations and the electrostatic surface potential were prepared with PYMOL (DeLano Scientific, San Carlos, CA, USA).

Isothermal titration calorimetry

ITC experiments were performed using an ITC200 instrument (MicroCal, Wolverton Mill, UK) at 24°C. SmN constructs (168–196, 168–240) and an SmN peptide (219–229) at 1 mM, 1 mM and 5 mM, respectively, were titrated into OCRE (100 μM, 100 μM and 1 mM, respectively). For the mutation analysis, purified OCRE mutants (2 mM) were titrated to wild type SmN (29.5 μM) or SmB (40 mM). All purified proteins were in the same buffer, 20 mM sodium phosphate pH 6.5, 50 mM NaCl. The lyophilized peptide was dialyzed against water, lyophilized again, and then dissolved in the same buffer as the protein. The heat of dilution was measured by titrating SmN or OCRE mutants into buffer. The titration protocol consisted of one initial injection of 0.4 μL followed by 38 injections of 1 μl of the ligand into the protein sample with intervals of 120 s, allowing the titration peak to reach the baseline. Data were calculated using the program Origin v7.0 (MicroCal) and duplicates were measured for all the experiments.

Circular Dichroism (CD) spectroscopy

All CD spectra were recorded on a JASCO-J715 spectropolarimeter and analyzed with Spectramanager version 1.53.00 (Jasco Corp.). The temperature was regulated using a Peltier type control system (PTC-348WI). The spectra were recorded at 5°C in 20 mM sodium phosphate,100 mM NaCl, pH 6.5 buffer from 190–260 nm wavelength with a 1.0 nm bandwidth, 0.5 nm pitch at a scan speed of 50 nm/min, in cuvettes with 0.1 cm path length. All spectra are presented as an average of 20 scans, obtained after buffer subtraction and plotted as mean residue ellipticity (deg cm2 dmol−1) vs wavelength (nm). All peptides were measured at 0.3 mM concentration in a buffer containing 20 mM sodium phosphate, 100 mM NaCl, pH 6.5 at 5°C.

Glutathione S-transferase pull-down experiments

GST RBM5 OCRE and T7 SmN or 35S-labeled SmB proteins were incubated for one hour at 4°C degrees on a rotating wheel in 1 ml PBS supplemented with 0.1% Triton X100. 45 µl of packed and equilibrated GSH beads (Glutathione Sepharose 4B, GE Healthcare, reference 17-0756-05) were added and the samples were incubated for one hour more as before. The beads were then washed four times with 1 ml PBS-0.1% Triton X100 and the proteins were directly eluted in SDS loading dye at 95°C for 5 min under shaking, loaded on SDS gels, separated by electrophoresis. Proteins were revealed by autoradiography or analyzed by western blot using the following antibodies: anti-T7 (T7-Tag Antibody HRP conjugate, Novagen, reference 69048) and anti-GST (GST(B14)-HRP mouse monoclonal, Santa Cruz Biotechnology, reference sc-138 HRP).

Ex vivo splicing analysis

Protocol of co-transfection, RT-PCR and western blot analysis were carried out as previously described (Bonnal et al., 2008). siRNA against Sm proteins were carried out as described (Saltzman et al., 2011).

Spliceosome assembly assays

Cy5-CTP labeled AdML RNA bearing exon 1 – intron 1 – exon 2 was in vitro transcribed using T7 Megascript kit (Ambion). The spliceosome assembly reaction was performed as described previously (Mackereth et al., 2011) with 10 ng/ul fluorescently labeled RNA and the indicated recombinant protein. After electrophoresis, the gel was analyzed directly with a PhosphorImager Typhoon. Quantification was carried out using Image Quant.

Yeast two hybrid assays

The C-terminal domain of human SmB (aminoacids 84–231; Accession n°: NM_003091) was PCR-amplified using oligonucleotides carrying EcoRI (Forward) and SalI (Reverse) restriction sites and the pGFP-huSmB plasmid as template (Mouaikel et al., 2003). After agarose gel electrophoresis, the DNA fragment was purified using the GeneClean procedure and transferred into EcoR1-Sal1 cut pGBT9 vector carrying the DNA binding domain of Gal4 (Fields and Song, 1989).

The various pACT2-RBM5 plasmids containing the RBM5 full length and mutated coding sequences in frame with the Gal4AD (activation domain) were constructed using the Gateway system and a pACT2-based vector according to the manufacturer's instructions (Invitrogen). The plasmids used for amplification have been described previously (Bonnal et al., 2008). All pDonor constructs were sequenced prior to proceed to LR recombination. The sequences of cloning junctions and coding sequences of all plasmids were verified to ensure the absence of any unwanted mutations.

Yeast strains were grown using standard procedures and media. For 2YH assays, the pGBT-CterSmB plasmid and the appropriate pACT2 plasmids were transformed into the CG1945 strain (Fromont-Racine et al., 1997). Transformants were selected on double selectable media (-Leu -Trp) and further grown in minimal -Trp -Leu liquid medium. Growth of yeast was measured by spotting serial dilutions of liquid cultures on -Leu -Trp -His plates which enables selection of interacting partners.

Accession codes

The atomic coordinates for the NMR ensembles of the RBM5 OCRE domain and the complex with the SmN (219–229) are deposited in the Protein Data Bank under accession numbers 5MFY and 5MF9, respectively. The chemical shift assignments have been deposited in the Biological Magnetic Resonance Data Bank under accession numbers 34068 and 34067.

Acknowledgements

The authors are grateful to Reinhard Lührmann for discussions. We thank the Bavarian NMR Centre (BNMRZ) for NMR measurement time. LW acknowledges support by an EMBO Long Term Fellowship ALTF 1520-2011 and NIH Grant P20GM103408; KS for support by the IMPRS-LS Graduate School. This work was supported by the Deutsche Forschungsgemeinschaft (DFG SFB1035, GRK1721) to MS, and by the Centre National de la Recherche Scientifique (CNRS) to RB. Work in JV’s lab is supported by Fundación Botín, by Banco de Santander through its Santander Universities Global Division and Consolider RNAREG, MICINN and AGAUR, and by the Spanish Ministry of Economy and Competitiveness ‘Centro de Excelencia Severo Ochoa 2013-2017’, SEV-2012-0208. Work in JV’s lab was also supported by the European Research Council (ERC AdG - GA670146 - MASCP).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Juan Valcárcel, Email: juan.valcarcel@crg.es.

Michael Sattler, Email: sattler@helmholtz-muenchen.de.

Douglas L Black, University of California, Los Angeles, United States.

Funding Information

This paper was supported by the following grants:

  • Centre National de la Recherche Scientifique to Rémy Bordonné.

  • Fundación Botín to Juan Valcárcel.

  • Ministerio de Economía y Competitividad Centro de Excelencia Severo Ochoa 2013-2017, SEV-2012-0208 to Juan Valcárcel.

  • Banco de Santander Consolider RNAREG, MICINN, AGAUR to Juan Valcárcel.

  • European Research Council ERC AdG - GA670146 - MASCP to Juan Valcárcel.

  • Deutsche Forschungsgemeinschaft SFB1035 to Michael Sattler.

  • Deutsche Forschungsgemeinschaft GRK1721 to Michael Sattler.

Additional information

Competing interests

Reviewing editor, eLife.

No competing interests declared.

Author contributions

Analysis and interpretation of data, Drafting or revising the article, Conception and design, Acquisition of data.

Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

Acquisition of data, Analysis and interpretation of data.

Conception and design, Acquisition of data, Analysis and interpretation of data, Drafting or revising the article.

Conception and design, Acquisition of data, Analysis and interpretation of data.

Conception and design, Analysis and interpretation of data, Drafting or revising the article.

Conception and design, Analysis and interpretation of data, Drafting or revising the article.

References

  1. Bacrot S, Doyard M, Huber C, Alibeu O, Feldhahn N, Lehalle D, Lacombe D, Marlin S, Nitschke P, Petit F, Vazquez MP, Munnich A, Cormier-Daire V. Mutations in SNRPB, encoding components of the core splicing machinery, cause cerebro-costo-mandibular syndrome. Human Mutation. 2015;36:187–190. doi: 10.1002/humu.22729. [DOI] [PubMed] [Google Scholar]
  2. Ball LJ, Kühne R, Schneider-Mergener J, Oschkinat H. Recognition of proline-rich motifs by protein-protein-interaction domains. Angewandte Chemie. 2005;44:2852–2869. doi: 10.1002/anie.200400618. [DOI] [PubMed] [Google Scholar]
  3. Bedford MT, Reed R, Leder P. WW domain-mediated interactions reveal a spliceosome-associated protein that binds a third class of proline-rich motif: the proline glycine and methionine-rich motif. PNAS. 1998;95:10602–10607. doi: 10.1073/pnas.95.18.10602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Bhattacharya A, Tejero R, Montelione GT. Evaluating protein structures determined by structural genomics consortia. Proteins. 2007;66:778–795. doi: 10.1002/prot.21165. [DOI] [PubMed] [Google Scholar]
  5. Blencowe BJ. Alternative splicing: new insights from global analyses. Cell. 2006;126:37–47. doi: 10.1016/j.cell.2006.06.023. [DOI] [PubMed] [Google Scholar]
  6. Bonnal S, Martínez C, Förch P, Bachi A, Wilm M, Valcárcel J. RBM5/Luca-15/H37 regulates Fas alternative splice site pairing after exon definition. Molecular Cell. 2008;32:81–95. doi: 10.1016/j.molcel.2008.08.008. [DOI] [PubMed] [Google Scholar]
  7. Bonnal S, Vigevani L, Valcárcel J. The spliceosome as a target of novel antitumour drugs. Nature Reviews Drug Discovery. 2012;11:847–859. doi: 10.1038/nrd3823. [DOI] [PubMed] [Google Scholar]
  8. Brahms H, Meheus L, de Brabandere V, Fischer U, Lührmann R. Symmetrical dimethylation of arginine residues in spliceosomal Sm protein B/B' and the Sm-like protein LSm4, and their interaction with the SMN protein. RNA. 2001;7:1531–1542. doi: 10.1017/S135583820101442X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Brown AM, Zondlo NJ. A propensity scale for type II polyproline helices (PPII): aromatic amino acids in proline-rich sequences strongly disfavor PPII due to proline-aromatic interactions. Biochemistry. 2012;51:5041–5051. doi: 10.1021/bi3002924. [DOI] [PubMed] [Google Scholar]
  10. Brunger AT, Adams PD, Clore GM, DeLano WL, Gros P, Grosse-Kunstleve RW, Jiang JS, Kuszewski J, Nilges M, Pannu NS, Read RJ, Rice LM, Simonson T, Warren GL. Crystallography & NMR system: a new software suite for macromolecular structure determination. Acta Crystallographica Section D. 1998;54:905–921. doi: 10.1107/S0907444998003254. [DOI] [PubMed] [Google Scholar]
  11. Callebaut I, Mornon JP. OCRE: a novel domain made of imperfect, aromatic-rich octamer repeats. Bioinformatics. 2005;21:699–702. doi: 10.1093/bioinformatics/bti065. [DOI] [PubMed] [Google Scholar]
  12. Cascino I, Fiucci G, Papoff G, Ruberti G. Three functional soluble forms of the human apoptosis-inducing Fas molecule are produced by alternative splicing. Journal of Immunology. 1995;154:2706–2713. [PubMed] [Google Scholar]
  13. Cazalla D, Sanford JR, Cáceres JF. A rapid and efficient protocol to purify biologically active recombinant proteins from mammalian cells. Protein Expression and Purification. 2005;42:54–58. doi: 10.1016/j.pep.2005.03.035. [DOI] [PubMed] [Google Scholar]
  14. Chen L, Park SM, Tumanov AV, Hau A, Sawada K, Feig C, Turner JR, Fu YX, Romero IL, Lengyel E, Peter ME. CD95 promotes tumour growth. Nature. 2010;465:492–496. doi: 10.1038/nature09075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Cheng J, Zhou T, Liu C, Shapiro JP, Brauer MJ, Kiefer MC, Barr PJ, Mountz JD. Protection from Fas-mediated apoptosis by a soluble form of the Fas molecule. Science. 1994;263:1759–1762. doi: 10.1126/science.7510905. [DOI] [PubMed] [Google Scholar]
  16. Cooper TA, Wan L, Dreyfuss G. RNA and disease. Cell. 2009;136:777–793. doi: 10.1016/j.cell.2009.02.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. David CJ, Manley JL. Alternative pre-mRNA splicing regulation in cancer: pathways and programs unhinged. Genes & Development. 2010;24:2343–2364. doi: 10.1101/gad.1973010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Delaglio F, Grzesiek S, Vuister GW, Zhu G, Pfeifer J, Bax A. NMRPipe: a multidimensional spectral processing system based on UNIX pipes. Journal of Biomolecular NMR. 1995;6:277–293. doi: 10.1007/bf00197809. [DOI] [PubMed] [Google Scholar]
  19. Doreleijers JF, Sousa da Silva AW, Krieger E, Nabuurs SB, Spronk CA, Stevens TJ, Vranken WF, Vriend G, Vuister GW. CING: an integrated residue-based structure validation program suite. Journal of Biomolecular NMR. 2012;54:267–283. doi: 10.1007/s10858-012-9669-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Drake AF, Siligardi G, Gibbons WA. Reassessment of the electronic circular dichroism criteria for random coil conformations of poly(L-lysine) and the implications for protein folding and denaturation studies. Biophysical Chemistry. 1988;31:143–146. doi: 10.1016/0301-4622(88)80019-X. [DOI] [PubMed] [Google Scholar]
  21. Fields S, Song O. A novel genetic system to detect protein-protein interactions. Nature. 1989;340:245–246. doi: 10.1038/340245a0. [DOI] [PubMed] [Google Scholar]
  22. Freund C, Kühne R, Yang H, Park S, Reinherz EL, Wagner G. Dynamic interaction of CD2 with the GYF and the SH3 domain of compartmentalized effector molecules. The EMBO Journal. 2002;21:5985–5995. doi: 10.1093/emboj/cdf602. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fromont-Racine M, Rain JC, Legrain P. Toward a functional analysis of the yeast genome through exhaustive two-hybrid screens. Nature Genetics. 1997;16:277–282. doi: 10.1038/ng0797-277. [DOI] [PubMed] [Google Scholar]
  24. Fushimi K, Ray P, Kar A, Wang L, Sutherland LC, Wu JY. Up-regulation of the proapoptotic caspase 2 splicing isoform by a candidate tumor suppressor, RBM5. PNAS. 2008;105:15708–15713. doi: 10.1073/pnas.0805569105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Förch P, Puig O, Kedersha N, Martínez C, Granneman S, Séraphin B, Anderson P, Valcárcel J. The apoptosis-promoting factor TIA-1 is a regulator of alternative pre-mRNA splicing. Molecular Cell. 2000;6:1089–1098. doi: 10.1016/s1097-2765(00)00107-6. [DOI] [PubMed] [Google Scholar]
  26. Güntert P. Automated structure determination from NMR spectra. European Biophysics Journal. 2009;38:129–143. doi: 10.1007/s00249-008-0367-z. [DOI] [PubMed] [Google Scholar]
  27. Holm L, Kääriäinen S, Rosenström P, Schenkel A. Searching protein structure databases with DaliLite v.3. Bioinformatics. 2008;24:2780–2781. doi: 10.1093/bioinformatics/btn507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Imielinski M, Berger AH, Hammerman PS, Hernandez B, Pugh TJ, Hodis E, Cho J, Suh J, Capelletti M, Sivachenko A, Sougnez C, Auclair D, Lawrence MS, Stojanov P, Cibulskis K, Choi K, de Waal L, Sharifnia T, Brooks A, Greulich H, Banerji S, Zander T, Seidel D, Leenders F, Ansén S, Ludwig C, Engel-Riedel W, Stoelben E, Wolf J, Goparju C, Thompson K, Winckler W, Kwiatkowski D, Johnson BE, Jänne PA, Miller VA, Pao W, Travis WD, Pass HI, Gabriel SB, Lander ES, Thomas RK, Garraway LA, Getz G, Meyerson M. Mapping the hallmarks of lung adenocarcinoma with massively parallel sequencing. Cell. 2012;150:1107–1120. doi: 10.1016/j.cell.2012.08.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Izquierdo JM, Majós N, Bonnal S, Martínez C, Castelo R, Guigó R, Bilbao D, Valcárcel J. Regulation of Fas alternative splicing by antagonistic effects of TIA-1 and PTB on exon definition. Molecular Cell. 2005;19:475–484. doi: 10.1016/j.molcel.2005.06.015. [DOI] [PubMed] [Google Scholar]
  30. Johnson BA, Blevins RA. NMR View: A computer program for the visualization and analysis of NMR data. Journal of Biomolecular NMR. 1994;4:603–614. doi: 10.1007/BF00404272. [DOI] [PubMed] [Google Scholar]
  31. Kambach C, Walke S, Nagai K. Structure and assembly of the spliceosomal small nuclear ribonucleoprotein particles. Current Opinion in Structural Biology. 1999;9:222–230. doi: 10.1016/S0959-440X(99)80032-3. [DOI] [PubMed] [Google Scholar]
  32. Kelly MA, Chellgren BW, Rucker AL, Troutman JM, Fried MG, Miller AF, Creamer TP. Host-guest study of left-handed polyproline II helix formation. Biochemistry. 2001;40:14376–14383. doi: 10.1002/prot.10477. [DOI] [PubMed] [Google Scholar]
  33. Klippel S, Wieczorek M, Schümann M, Krause E, Marg B, Seidel T, Meyer T, Knapp EW, Freund C. Multivalent binding of formin-binding protein 21 (FBP21)-tandem-WW domains fosters protein recognition in the pre-spliceosome. Journal of Biological Chemistry. 2011;286:38478–38487. doi: 10.1074/jbc.M111.265710. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Kondo Y, Oubridge C, van Roon AM, Nagai K. Crystal structure of human U1 snRNP, a small nuclear ribonucleoprotein particle, reveals the mechanism of 5' splice site recognition. eLife. 2015;4:e04986. doi: 10.7554/eLife.04986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Krammer PH. CD95's deadly mission in the immune system. Nature. 2000;407:789–795. doi: 10.1038/35037728. [DOI] [PubMed] [Google Scholar]
  36. Krissinel E, Henrick K. Secondary-structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallographica Section D Biological Crystallography. 2004;60:2256–2268. doi: 10.1107/S0907444904026460. [DOI] [PubMed] [Google Scholar]
  37. Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM. AQUA and PROCHECK-NMR: programs for checking the quality of protein structures solved by NMR. Journal of Biomolecular NMR. 1996;8:477–486. doi: 10.1007/BF00228148. [DOI] [PubMed] [Google Scholar]
  38. Leung AK, Nagai K, Li J. Structure of the spliceosomal U4 snRNP core domain and its implication for snRNP biogenesis. Nature. 2011;473:536–539. doi: 10.1038/nature09956. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Linge JP, O'Donoghue SI, Nilges M. Automated assignment of ambiguous nuclear overhauser effects with ARIA. Methods in Enzymology. 2001;339:71–90. doi: 10.1016/s0076-6879(01)39310-2. [DOI] [PubMed] [Google Scholar]
  40. Linge JP, Williams MA, Spronk CA, Bonvin AM, Nilges M. Refinement of protein structures in explicit solvent. Proteins. 2003;50:496–506. doi: 10.1002/prot.10299. [DOI] [PubMed] [Google Scholar]
  41. Liu C, Cheng J, Mountz JD. Differential expression of human Fas mRNA species upon peripheral blood mononuclear cell activation. Biochemical Journal. 1995;310:957–963. doi: 10.1042/bj3100957. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Mackereth CD, Madl T, Bonnal S, Simon B, Zanier K, Gasch A, Rybin V, Valcárcel J, Sattler M. Multi-domain conformational selection underlies pre-mRNA splicing regulation by U2AF. Nature. 2011;475:408–U174. doi: 10.1038/nature10171. [DOI] [PubMed] [Google Scholar]
  43. Martin BT, Serrano P, Geralt M, Wüthrich K. Nuclear magnetic resonance structure of a novel globular domain in RBM10 containing OCRE, the octamer repeat sequence motif. Structure. 2016;24:158–164. doi: 10.1016/j.str.2015.10.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Matera AG, Wang Z. A day in the life of the spliceosome. Nature Reviews Molecular Cell Biology. 2014;15:108–121. doi: 10.1038/nrm3742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Mouaikel J, Narayanan U, Verheggen C, Matera AG, Bertrand E, Tazi J, Bordonné R. Interaction between the small-nuclear-RNA cap hypermethylase and the spinal muscular atrophy protein, survival of motor neuron. EMBO Reports. 2003;4:616–622. doi: 10.1038/sj.embor.embor863. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Nguyen TH, Galej WP, Bai XC, Savva CG, Newman AJ, Scheres SH, Nagai K. The architecture of the spliceosomal U4/U6.U5 tri-snRNP. Nature. 2015;523:47–52. doi: 10.1038/nature14548. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Oh JJ, Grosshans DR, Wong SG, Slamon DJ. Identification of differentially expressed genes associated with HER-2/neu overexpression in human breast cancer cells. Nucleic Acids Research. 1999;27:4008–4017. doi: 10.1093/nar/27.20.4008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Oh JJ, Taschereau EO, Koegel AK, Ginther CL, Rotow JK, Isfahani KZ, Slamon DJ. RBM5/H37 tumor suppressor, located at the lung cancer hot spot 3p21.3, alters expression of genes involved in metastasis. Lung Cancer. 2010;70:253–262. doi: 10.1016/j.lungcan.2010.02.012. [DOI] [PubMed] [Google Scholar]
  49. Oh JJ, West AR, Fishbein MC, Slamon DJ. A candidate tumor suppressor gene, H37, from the human lung cancer tumor suppressor locus 3p21.3. Cancer Research. 2002;62:3207–3213. [PubMed] [Google Scholar]
  50. Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nature Genetics. 2008;40:1413–1415. doi: 10.1038/ng.259. [DOI] [PubMed] [Google Scholar]
  51. Papasaikas P, Tejedor JR, Vigevani L, Valcárcel J. Functional splicing network reveals extensive regulatory potential of the core spliceosomal machinery. Molecular Cell. 2015;57:7–22. doi: 10.1016/j.molcel.2014.10.030. [DOI] [PubMed] [Google Scholar]
  52. Papoff G, Cascino I, Eramo A, Starace G, Lynch DH, Ruberti G. An N-terminal domain shared by Fas/Apo-1 (CD95) soluble variants prevents cell death in vitro. Journal of Immunology. 1996;156:4622–4630. [PubMed] [Google Scholar]
  53. Petrella EC, Machesky LM, Kaiser DA, Pollard TD. Structural requirements and thermodynamics of the interaction of proline peptides with profilin. Biochemistry. 1996;35:16535–16543. doi: 10.1021/bi961498d. [DOI] [PubMed] [Google Scholar]
  54. Pomeranz Krummel DA, Oubridge C, Leung AK, Li J, Nagai K. Crystal structure of human spliceosomal U1 snRNP at 5.5 A resolution. Nature. 2009;458:475–480. doi: 10.1038/nature07851. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Rintala-Maki ND, Goard CA, Langdon CE, Wall VE, Traulsen KE, Morin CD, Bonin M, Sutherland LC. Expression of RBM5-related factors in primary breast tissue. Journal of Cellular Biochemistry. 2007;100:1440–1458. doi: 10.1002/jcb.21134. [DOI] [PubMed] [Google Scholar]
  56. Roesler J, Izquierdo JM, Ryser M, Rösen-Wolff A, Gahr M, Valcarcel J, Lenardo MJ, Zheng L. Haploinsufficiency, rather than the effect of an excessive production of soluble CD95 (CD95{Delta}TM), is the basis for ALPS Ia in a family with duplicated 3' splice site AG in CD95 intron 5 on one allele. Blood. 2005;106:1652–1659. doi: 10.1182/blood-2004-08-3104. [DOI] [PubMed] [Google Scholar]
  57. Saltzman AL, Pan Q, Blencowe BJ. Regulation of alternative splicing by the core spliceosomal machinery. Genes & Development. 2011;25:373–384. doi: 10.1101/gad.2004811. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Schmauss C, Brines ML, Lerner MR. The gene encoding the small nuclear ribonucleoprotein-associated protein N is expressed at high levels in neurons. Journal of Biological Chemistry. 1992;267:8521–8529. [PubMed] [Google Scholar]
  59. Sharpe NG, Wall W, Latchman DS. Expression of the tissue specific RNA splicing protein SmN in HeLa cells. Nucleic Acids Research. 1990;18:3425–3426. doi: 10.1093/nar/18.11.3425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Shen Y, Delaglio F, Cornilescu G, Bax A. TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts. Journal of Biomolecular NMR. 2009;44:213–223. doi: 10.1007/s10858-009-9333-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Sutherland LC, Rintala-Maki ND, White RD, Morin CD. RNA binding motif (RBM) proteins: a novel family of apoptosis modulators? Journal of Cellular Biochemistry. 2005;94:5–24. doi: 10.1002/jcb.20204. [DOI] [PubMed] [Google Scholar]
  62. Tripsianes K, Madl T, Machyna M, Fessas D, Englbrecht C, Fischer U, Neugebauer KM, Sattler M. Structural basis for dimethylarginine recognition by the Tudor domains of human SMN and SPF30 proteins. Nature Structural & Molecular Biology. 2011;18:1414–1420. doi: 10.1038/nsmb.2185. [DOI] [PubMed] [Google Scholar]
  63. van Dam A, Winkel I, Zijlstra-Baalbergen J, Smeenk R, Cuypers HT. Cloned human snRNP proteins B and B' differ only in their carboxy-terminal part. The EMBO Journal. 1989;8:3853–3860. doi: 10.1007/978-3-642-46681-6_7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Van Parijs L, Biuckians A, Abbas AK. Functional roles of Fas and Bcl-2-regulated apoptosis of T lymphocytes. Journal of Immunology. 1998;160:2065–2071. [PubMed] [Google Scholar]
  65. Varani L, Gunderson SI, Mattaj IW, Kay LE, Neuhaus D, Varani G. The NMR structure of the 38 kDa U1A protein - PIE RNA complex reveals the basis of cooperativity in regulation of polyadenylation by human U1A protein. Nature Structural Biology. 2000;7:329–335. doi: 10.1038/74101. [DOI] [PubMed] [Google Scholar]
  66. Wahl MC, Will CL, Lührmann R. The spliceosome: design principles of a dynamic RNP machine. Cell. 2009;136:701–718. doi: 10.1016/j.cell.2009.02.009. [DOI] [PubMed] [Google Scholar]
  67. Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF, Schroth GP, Burge CB. Alternative isoform regulation in human tissue transcriptomes. Nature. 2008;456:470–476. doi: 10.1038/nature07509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Weber G, Trowitzsch S, Kastner B, Lührmann R, Wahl MC. Functional organization of the Sm core in the crystal structure of human U1 snRNP. The EMBO Journal. 2010;29:4172–4184. doi: 10.1038/emboj.2010.295. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Will CL, Lührmann R. Spliceosomal UsnRNP biogenesis, structure and function. Current Opinion in Cell Biology. 2001;13:290–301. doi: 10.1016/s0955-0674(00)00211-8. [DOI] [PubMed] [Google Scholar]
  70. Wingfield PT, Palmer I, Liang SM. Folding and purification of insoluble (inclusion body) proteins from Escherichia coli. Current Protocols in Protein Science. 2001;Chapter 6:6–5. doi: 10.1002/0471140864.ps0605s00. [DOI] [PubMed] [Google Scholar]
  71. Xiao SJ, Wang LY, Kimura M, Kojima H, Kunimoto H, Nishiumi F, Yamamoto N, Nishio K, Fujimoto S, Kato T, Kitagawa S, Yamane H, Nakajima K, Inoue A. S1-1/RBM10: multiplicity and cooperativity of nuclear localisation domains. Biology of the Cell. 2013;105:162–174. doi: 10.1111/boc.201200068. [DOI] [PubMed] [Google Scholar]
  72. Zabarovsky ER, Lerman MI, Minna JD. Tumor suppressor genes on chromosome 3p involved in the pathogenesis of lung and other cancers. Oncogene. 2002;21:6915–6935. doi: 10.1038/sj.onc.1205835. [DOI] [PubMed] [Google Scholar]

Decision letter

Editor: Douglas L Black1

In the interests of transparency, eLife includes the editorial decision letter and accompanying author responses. A lightly edited version of the letter sent to the authors after peer review is shown, indicating the most substantive concerns; minor comments are not usually included.

Thank you for submitting your article "Structural basis for the recognition of spliceosomal SmN/B/B' proteins by the RBM5 OCRE domain in splicing regulation" for consideration by eLife. Your article has been favorably evaluated by James Manley as the Senior Editor and three reviewers, one of whom, Douglas Black, is a member of our Board of Reviewing Editors.

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

This paper from the Sattler and Valcárcel groups examines the interaction of splicing regulatory proteins of the Rbm5 family with the SmN/B/B' proteins within the core of the Sm snRNPs. There are many regulatory proteins that bind to the pre-mRNA to alter spliceosome assembly, but remarkably few interactions have been described between these regulators and core components of the spliceosome. Such information is key to understanding the mechanisms of splicing regulation. In previous work, the Valcárcel lab has characterized the Rbm5 family of splicing regulators, showing that they control important splicing events for apoptosis and cancer progression. They also reported that the octamer repeat (OCRE) domain of the Rbm5 protein interacts with the tri-snRNP complex that forms part of the mature spliceosome. OCRE domains containing repeats of aromatic residues are found in multiple proteins but their structure and interactions are unknown. Here the authors solve the structure of the Rbm5 OCRE by NMR, showing that it adopts an antiparallel β sheet structure with each repeat comprising a β strand. The conserved tyrosines project from both sides of the sheet. An N-terminal extension interacts with the residues on one face of the sheet to stabilize the domain, while tyrosines on the opposite face are available for other interactions. The authors define repeated proline-rich motifs within the C-terminal domains of the SmN/B/B' proteins as ligands for the OCRE domain. They show this interaction requires particular aromatic residues on the interaction surface of OCRE. They define a consensus proline rich motif (PRM) on the SmN protein that is required for OCRE interaction, and which includes 3 or 4 continuous proline residues flanked by arginines. Solving the structure of the OCRE bound to this motif, they show that the PRM forms a proline type II helix and the OCRE engages the PRM through a combination of aromatic stacking interactions between the prolines and the tyrosines, and both hydrogen bonding and electrostatic interactions with the flanking arginines. Through coexpression of mutant Rbm5 with a reporter minigene they show that these OCRE residues are indeed needed for Rbm5 regulatory activity. Finally, they compare OCRE to other domains such as SH3 and WW that also use aromatic residues to recognize poly-proline motifs.

The reviewers agreed that this is a well-executed study that uses a range of approaches to illuminate new aspects of splicing, including the structure and interactions of OCRE domain and the role of the OCRE/SmB interaction in splicing regulation. A number of issues were raised that need to be addressed before the paper is suitable for publication.

Essential revisions:

1) It is shown that mutation of the 4 prolines of the polyproline motif to alanines does not impact binding affinity. This is attributed to the alanines' ability to adopt a polyproline type II helical conformation. However, the hydrophobic interactions described should still be weakened upon mutation of prolines to alanines. The authors should test substitutions to other hydrophobic amino acids, such as valine, to prove that the backbone conformation is the major determinant of binding affinity.

2) The results in Figure 3B are surprising. It is well accepted that the extent of chemical shift perturbation does not necessarily correlate with KD in mutant analysis, as the mutant ligand can bind in a slightly different orientation. In addition, mutations have a large influence on the extent of the chemical shift perturbations observed for residues in direct contact. Figure 3B needs to be reinterpreted in light of this, and the authors need to demonstrate that much more clearly that their KD changes are as they describe.

3) The model for how the Rbm5/SmB interaction is altering spliceosome assembly is not at all clear. They propose based on earlier work that Rbm5 is interacting with SmB within the tri-snRNP. Isn't it also possible that the interaction could occur between Rbm5 and SmB in the U1 and U2 snRNP's? In Bonnal et al., 2008, they show that GST-OCRE interacts with the 220 and 200 kDa proteins of the U5 snRNP when incubated in HeLa nuclear extract. However, the data in that paper is not sufficient to propose that it must be a tri-snRNP interaction that is relevant here. The most prominent protein identified in that study was the U1-70K protein, suggesting that the OCRE domain also binds to the U1 snRNP. OCRE could thus indirectly affect the recruitment of the tri-snRNP via the U1 interaction. Additional bands in the OCRE-domain pulldown were also not identified, and no information was provided about which snRNAs were pulled down. Thus, it is not clear if the OCRE domain can bind all or only some snRNPs containing SmB/B'/N. These questions are not mentioned in the present Discussion but dramatically affect any model for how OCRE alters spliceosome assembly. Given the nicely shown interaction of RBM5's OCRE domain with the proline-rich regions of SmB/B'/N, and the fact that these proteins are present in the U1, U2, U4 and U5 snRNPs, a more thorough analysis of the interaction partners of the OCRE domain in nuclear extract is needed. The pulldown analysis should be repeated, analyzing all proteins and RNAs that are bound by OCRE, as well as analyzing how the stringency of the buffers used during pulldown and wash affect the spectrum of interacting species.

4) Also in the previous study (Bonnal et al., 2008), addition of RBM5 to an in vitro splicing reaction led to the accumulation of A complexes, and a block to B complex formation on intron 5 or 6 of Fas pre-mRNA. This indicated that stable recruitment of the tri-snRNP was hindered in the presence of RBM5. What effect does a RBM5 OCRE domain alone, or an RBM5 mutant lacking OCRE or with an inactive OCRE domain have on splicing complex formation in vitro? This experiment would provide additional information about the mechanism whereby RBM5's OCRE domain affects splicing. For example, Rbm5 also interacts with U2AF. It is possible that this U2AF interaction is sufficient to block to tri-snRNP recruitment, and the OCRE/SmB interaction is serving other roles.

eLife. 2016 Nov 29;5:e14707. doi: 10.7554/eLife.14707.025

Author response


[…] The reviewers agreed that this is a well-executed study that uses a range of approaches to illuminate new aspects of splicing, including the structure and interactions of OCRE domain and the role of the OCRE/SmB interaction in splicing regulation. A number of issues were raised that need to be addressed before the paper is suitable for publication.

Essential revisions:

1) It is shown that mutation of the 4 prolines of the polyproline motif to alanines does not impact binding affinity. This is attributed to the alanines' ability to adopt a polyproline type II helical conformation. However, the hydrophobic interactions described should still be weakened upon mutation of prolines to alanines. The authors should test substitutions to other hydrophobic amino acids, such as valine, to prove that the backbone conformation is the major determinant of binding affinity.

Thank you for this suggestion. We have performed further experiments to test the contribution and properties of the proline side chains by substituting proline by valine and comparing binding affinities in vitro. Unfortunately, a 4V version of the peptide (GMRVVVVGIRG) was completely insoluble, and thus no data could be obtained for this variant. Likewise, the SmN protein harboring 4P/3P→4V/3V mutations could not be expressed and purified. Presumably, the bulky and hydrophobic valine side chains cause aggregation of the protein, as observed for the peptide in vitro.

As a perhaps less optimal, but feasible alternative to evaluate the importance of hydrophobic interactions involving the proline side chains for OCRE binding, we tested peptides and SmN variants where prolines were replaced by glycine. CD spectra of the 4G peptide (revised Figure 3—figure supplement 1) indicate that it adopts (at least partially) a PPII conformation, consistent with previous reports1,2.

However, the affinity of the 4P/3P→4G/3G variant SmN protein to the OCRE domain is significantly reduced (KD = 192 ± 66 µM) compared to the wild type SmN protein (KD = 41 ± 2 µM) (revised Table 2). This is consistent with a reduced PPII propensity but also a contribution of hydrophobic interactions involving proline side chains in the wild type SmN, which are not possible with the glycine mutants.

Consistent with the reduced affinity for the OCRE domain observed in vitro, the activity of the 4P/3P→4G/3G SmN protein to promote FAS alternative splicing is significantly impaired (revised Figure 5—figure supplement 3D), comparable to the reduced activity of the APAP mutant.

The results of the 4P/3P→4G/3G mutant are therefore consistent with the concept that adoption of a polyproline type II helical conformation by the C-terminal tail of SmN, and establishment of a defined set of hydrophobic interactions between proline residues and aromatic side chains in the OCRE fold are necessary for OCRE binding and for alternative splicing regulation.

References

1) Brown AM, Zondlo NJ. 2012. A propensity scale for type II polyproline helices (PPII): aromatic amino acids in proline-rich sequences strongly disfavor PPII due to proline-aromatic interactions. Biochemistry 51 (25):5041-5051. doi:10.1021/bi3002924

2) Kelly MA, Chellgren BW, Rucker AL, Troutman JM, Fried MG, Miller AF, Creamer TP. 2001. Host-guest study of left-handed polyproline II helix formation. Biochemistry 40 (48):14376-14383

2) The results in Figure 3B are surprising. It is well accepted that the extent of chemical shift perturbation does not necessarily correlate with KD in mutant analysis, as the mutant ligand can bind in a slightly different orientation. In addition, mutations have a large influence on the extent of the chemical shift perturbations observed for residues in direct contact. Figure 3B needs to be reinterpreted in light of this, and the authors need to demonstrate that much more clearly that their KD changes are as they describe.

We agree with the comment that an interpretation of chemical shift perturbations (CSP) in terms of affinity is approximate and has to be done with caution. We have resorted to this approach as no other experimental technique is available to provide an – at least semi-quantitative – measure of relative affinities. Given that the peptide ligands do not exhibit aromatic side chains themselves and considering that we know the structural details of the interaction, we believe that changes for the amide signals shown in Figure 3B report on the local changes in chemical shift and depend on the population of bound state, i.e. relative affinities.

Note, that the signals shown in Figure 3B were carefully chosen as they report on key interactions with the (wildtype) ligand. However, the affinity score was based on analyzing all amide signals that experience significant chemical shift changes (i.e. above a certain cut-off), thereby providing some averaging of local effects, which thus should correlate in a reasonable way with overall binding affinity. To illustrate the changes in CSPs for different amide signals, we show the NMR spectra for representative peptide ligands in new Figure 3—figure supplement 2. The figure also shows that the CSPs for the different peptides follow a linear vector, which suggests that only a 2-site process is sensed, i.e. free vs. bound states, as otherwise deviations from linearity of the CSP vectors would indicate more local variations depending on the residues analyzed.

We have carefully updated the description of the approach and added a cautionary statement about its interpretation. Please note that the same mutations were introduced into the complete tail peptide. The very good agreement between relative affinities determined for the complete tails by ITC and those determined for the peptides by our NMR analysis, suggests that the NMR data provide useful information, which was then useful to guide our mutational analysis of the recognition of a single peptide motif.

3) The model for how the Rbm5/SmB interaction is altering spliceosome assembly is not at all clear. They propose based on earlier work that Rbm5 is interacting with SmB within the tri-snRNP. Isn't it also possible that the interaction could occur between Rbm5 and SmB in the U1 and U2 snRNP's? In Bonnal et al., 2008, they show that GST-OCRE interacts with the 220 and 200 kDa proteins of the U5 snRNP when incubated in HeLa nuclear extract. However, the data in that paper is not sufficient to propose that it must be a tri-snRNP interaction that is relevant here. The most prominent protein identified in that study was the U1-70K protein, suggesting that the OCRE domain also binds to the U1 snRNP. OCRE could thus indirectly affect the recruitment of the tri-snRNP via the U1 interaction. Additional bands in the OCRE-domain pulldown were also not identified, and no information was provided about which snRNAs were pulled down. Thus, it is not clear if the OCRE domain can bind all or only some snRNPs containing SmB/B'/N. These questions are not mentioned in the present Discussion but dramatically affect any model for how OCRE alters spliceosome assembly. Given the nicely shown interaction of RBM5's OCRE domain with the proline-rich regions of SmB/B'/N, and the fact that these proteins are present in the U1, U2, U4 and U5 snRNPs, a more thorough analysis of the interaction partners of the OCRE domain in nuclear extract is needed. The pulldown analysis should be repeated, analyzing all proteins and RNAs that are bound by OCRE, as well as analyzing how the stringency of the buffers used during pulldown and wash affect the spectrum of interacting species.

We have worked hard to determine the complement of U snRNAs associated to OCRE in our pull downs. Unfortunately, however, the results were inconclusive because a high background of U snRNAs was observed even in the control pull down samples. After many attempts to improve the signal-to-noise ratio using different pull down and stringency conditions, we were forced to accept that these assays do not provide evidence of consistent enrichment of any particular snRNP. One possibility is that the pull down somehow dislocates the snRNP-associated proteins from other RNP components, including the snRNAs. This is also consistent with the apparent absence of other Sm proteins from the precipitates. It seems likely that our pull-down analyses are a gross simplification of the sophisticated interactions that occur in the context of the spliceosome assembly pathway. Nevertheless, our assays revealed a direct interaction between the OCRE domain and the SmB/B'/N C terminal tail that, on the basis of our structure-function studies, is directly relevant for splicing regulation by RBM5.

We agree that an interaction with any U snRNP where the SmB/B’/N tails are accessible should be in principle possible. Looking at the recent high-resolution crystal and cryo-EM structures of spliceosomal complexes, we note that the long and flexible tails are predicted to be available for interactions in the tri-snRNP, in U1 snRNP and in complex B. We have expanded the text to discuss these structural insights and how contacts with either U1 / U2 or with the tri-snRNP could accommodate the activity of RBM5 as a repressor of the transition from complex A to B.

4) Also in the previous study (Bonnal et al., 2008), addition of RBM5 to an in vitro splicing reaction led to the accumulation of A complexes, and a block to B complex formation on intron 5 or 6 of Fas pre-mRNA. This indicated that stable recruitment of the tri-snRNP was hindered in the presence of RBM5. What effect does a RBM5 OCRE domain alone, or an RBM5 mutant lacking OCRE or with an inactive OCRE domain have on splicing complex formation in vitro? This experiment would provide additional information about the mechanism whereby RBM5's OCRE domain affects splicing. For example, Rbm5 also interacts with U2AF. It is possible that this U2AF interaction is sufficient to block to tri-snRNP recruitment, and the OCRE/SmB interaction is serving other roles.

We have tested the activity of the OCRE domain alone, as well as of a RBM5 mutant lacking the OCRE domain, in spliceosome assembly and in in vivo splicing assays. The results of the new Figure 6—figure supplements 1 and 2 show that the OCRE domain alone (in a construct that also includes the flanking KEKE domain) fails to inhibit spliceosome assembly in biochemical assays (Figure 6—figure supplement 1) and to regulate Fas exon 6 in vivo (Figure 6—figure supplement 2). In addition, new Figure 6 shows that both the deletion of the C-terminal region of RBM5 and a deletion of the OCRE domain compromise the ability of RBM5 to inhibit the transition from complex A to complex B in spliceosome assembly assays. Collectively, the results indicate that the OCRE domain is necessary but not sufficient to block B complex formation and that additional contributions from other parts in the C-terminal half of RBM5 contribute to this activity. One possibility, now discussed in the manuscript, is indeed that an interaction with U2AF, which has been previously mapped to a C-terminal region in RBM5, is important for regulation by RBM5. Investigating these additional interactions and their specific contributions to the modulation of spliceosome assembly will be the focus of our future work.


Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

RESOURCES