Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2013 Jul 8;110(30):E2802–E2811. doi: 10.1073/pnas.1303445110

Isolated pseudo–RNA-recognition motifs of SR proteins can regulate splicing using a noncanonical mode of RNA recognition

Antoine Cléry a, Rahul Sinha b, Olga Anczuków b, Anna Corrionero c,d, Ahmed Moursy a, Gerrit M Daubner a, Juan Valcárcel c,d, Adrian R Krainer b, Frédéric H-T Allain a,1
PMCID: PMC3725064  PMID: 23836656

Significance

Serine/arginine (SR) proteins are key regulators of eukaryotic gene expression and have been associated with multiple human diseases including cancers. Several members of this protein family contain a noncanonical RNA recognition motif (RRM), the pseudo-RRM, for which the mode of RNA recognition is unknown. Here, we solved the structure of SRSF1 pseudo-RRM bound to RNA. It reveals the RNA motif recognized and a very unusual mode of interaction, which is conserved for all the SR proteins containing pseudo-RRMs. Finally, we show that the pseudo-RRM in isolation often is sufficient to regulate splicing, and we reveal its mechanism of action.

Keywords: NMR, protein–RNA complex, splicing factor

Abstract

Serine/arginine (SR) proteins, one of the major families of alternative-splicing regulators in Eukarya, have two types of RNA-recognition motifs (RRMs): a canonical RRM and a pseudo-RRM. Although pseudo-RRMs are crucial for activity of SR proteins, their mode of action was unknown. By solving the structure of the human SRSF1 pseudo-RRM bound to RNA, we discovered a very unusual and sequence-specific RNA-binding mode that is centered on one α-helix and does not involve the β-sheet surface, which typically mediates RNA binding by RRMs. Remarkably, this mode of binding is conserved in all pseudo-RRMs tested. Furthermore, the isolated pseudo-RRM is sufficient to regulate splicing of about half of the SRSF1 target genes tested, and the bound α-helix is a pivotal element for this function. Our results strongly suggest that SR proteins with a pseudo-RRM frequently regulate splicing by competing with, rather than recruiting, spliceosome components, using solely this unusual RRM.


Serine/arginine (SR) proteins are highly conserved in Eukarya and are key regulators of gene expression. These proteins are required for pre-mRNA splicing, regulate alternative splicing events, and play crucial roles in genomic stability, mRNA transcription, nuclear export, nonsense-mediated mRNA decay, and translation (15). Several studies have revealed links between these proteins and diseases (6), making the proteins potential therapeutic targets (7). For the last two decades, SR proteins have been studied intensively as regulators of alternative splicing, a mechanism used to modulate the expression of more than 95% of human genes (8, 9). Importantly, it is estimated that 15–50% of human disease-causing mutations affect splicing (10). Alternative splicing consists of the alternative selection of splice sites present within pre-mRNA, leading to different versions of mature mRNAs from a single gene (11). As a result, in some cases, proteins with opposite functions can be generated. In the context of cancer, alternative splicing can generate pro- or antiapoptotic isoforms (12). As an example, alternatively spliced variants of FAS/CD95 can be generated by inclusion or skipping of exon 6, depending on competition between antagonistic splicing factors (1316). In the absence of Fas exon 6, the apoptotic receptor lacks the transmembrane domain and becomes soluble and antiapoptotic (17).

Each SR protein has a modular structure with one or two RNA-recognition motifs (RRMs) at its N-terminal part, followed by a C-terminal arginine-serine–rich (RS) domain containing multiple RS dipeptide repeats. In many cases, SR proteins have been shown to interact with purine-rich exonic splicing enhancers (ESEs) and to promote inclusion of the targeted exon in mRNAs (18). Two main models have been proposed to explain the mode of action of ESE-bound SR proteins in splicing regulation. In the recruitment model SR proteins recruit U1-70K and/or U2AF35 to 5′ and 3′ splice sites, respectively, to promote spliceosome assembly (4). In the inhibitor model SR proteins activate splicing by preventing the binding of splicing repressors, such as hnRNP A1, around the regulated splice site (4). In some cases, SR proteins have been shown not to enhance splicing but rather to promote exon skipping (1926). However, their mode of action in these cases is still unclear.

The RS domain of SR proteins was shown to be important for splicing, because it initiates spliceosome recruitment by interacting with the branchpoint sequence (27) and/or spliceosomal components (2830). However, in some cases, this domain also can be dispensable for constitutive and enhancer-dependent splicing (31), indicating a key function of the RRMs in this process. Although previously it was shown that the RRMs of SRSF1 can contact the spliceosomal component U1-70K directly (32), the main role of the RRMs seems to be to provide RNA-binding specificity and to dictate the position of SR proteins on pre-mRNAs (18).

The oncoprotein SRSF1 (SF2/ASF) (3336) is one of the best-studied SR proteins. Depletion of SRSF1 protein results in death in chicken cells (37) and embryonic lethality in Caenorhabditis elegans (38). Like all SR proteins containing two RRMs, it comprises an N-terminal canonical RRM followed by a pseudo-RRM (Fig. 1A). Although pseudo-RRMs appear to be critical for the activity of SR proteins (3942), their mode of action still needs to be clarified. Unlike canonical RRMs, pseudo-RRMs have a conserved, distinctive, invariant heptapeptide, SWQDLKD, on their α-helix 1 (43). In addition, their β-sheet surface lacks the set of conserved aromatic residues usually involved in RNA binding (44), suggesting that this domain uses an unusual mode of RNA recognition.

Fig. 1.

Fig. 1.

SRSF1 pseudo-RRM interaction with 5′-UGAAGGAC-3′ RNA. (A) The sequence of recombinant SRSF1 protein used in this study is shown. Amino acid numbering is according to the PDB sequence. Amino acids involved in the formation of β-strands and α-helices are underlined. (B) Superimposition of 1H-15N HSQC spectra representing NMR titration of the 15N-labeled SRSF1 pseudo-RRM with increasing amounts of unlabeled 5′-UGAAGGAC-3′ RNA. The titration was performed at 40 °C in NMR buffer. The peaks corresponding to the free and RNA-bound protein states (RNA:protein ratios of 0.3:1 and 1:1) are colored blue, orange, and red, respectively. The negative peaks, corresponding to the amides of arginine side-chains in the free and RNA-bound (1:1 ratio) states are colored green and magenta, respectively. The highest chemical-shift perturbations observed upon RNA binding are indicated by black arrows. (C) Kd determination of SRSF1 pseudo-RRM in complex with 5′-UGAAGGAC-3′ RNA by ITC. The measured Kd value is indicated. (D) Representation of the combined chemical-shift perturbations (Δδ = [(δHN)2 + (δN/6.51)2]1/2) of SRSF1 pseudo-RRM amides upon 5′-UGAAGGAC-3′ RNA binding, as a function of the pseudo-RRM amino acid sequence. Secondary-structure elements of the protein domain are displayed at the bottom of the graph. The highest chemical-shift perturbations are indicated.

In this study, we solved the NMR structure of SRSF1 pseudo-RRM bound to RNA. The structure reveals that the domain specifically binds a GGA motif, primarily using the conserved residues located in α-helix 1. Remarkably, this unusual mode of RNA recognition is conserved for all pseudo-RRMs we tested from human, yeast, and fly. Moreover, we show that the isolated pseudo-RRM can regulate many different alternative splicing events as effectively as the full-length protein. Finally, our data suggest that the pseudo-RRM acts in splicing regulation by competing with binding of other splicing factors rather than by recruiting them to the cassette exon.

Results

SRSF1 Interacts with Purine-Rich Sequences Using only Its Pseudo-RRM.

SRSF1 (SF2/ASF) (34, 35) consists of two RRMs responsible for RNA recognition and one C-terminal RS domain (Fig. 1A) (18). The RNA-binding specificity of SRSF1 is still unclear. The consensus sequence 5′-ACGCGCA-3′ was determined by SELEX in the presence of the N-terminal canonical RRM alone (RRM1), whereas guanine-adenine (GA)–rich RNA sequences were selected with a version of SRSF1 with both RRMs (45). Because purine-rich sequences are found in natural ESEs bound by SRSF1 (46), we investigated the contribution of each RRM in binding to this type of sequence. We used a 5′-AGGAGAAC-3′ RNA, which was selected by SELEX and had the highest affinity for the SRSF1 protein containing the two RRMs (45).

Overlay of 1H-15N heteronuclear single-quantum coherence (HSQC) spectra recorded with the RNA-bound form of each isolated RRM and a protein with both RRMs connected by their natural linker showed that chemical shifts are similar (SI Appendix, Fig. S1A). This analysis reveals that, upon RNA binding, the RRMs behave similarly when they are separated or linked in their natural conformation. In addition, isothermal titration calorimetry (ITC) measurements indicated that only the pseudo-RRM interacts efficiently with the purine-rich sequence tested (SI Appendix, Fig. S1B), demonstrating that SRSF1 RRM2 is the primary domain for recognition of purine-rich ESEs. Therefore, we investigated the RNA-binding mode of this domain using NMR spectroscopy. To obtain NMR spectra of sufficient quality for structure determination, we tested several RNA sequences derived from SELEX (45) and CLIP consensus (46). High-quality NMR spectra were obtained with SRSF1 RRM2 (amino acids 107–203) fused to the solubility tag GB1 and bound to 5′-UGAAGGAC-3′ (Fig. 1B). Importantly, this sequence fits the consensus sequence determined by CLIP with SRSF1 (5′-UGA/GU/AGA/GA/U-3′) (46). We measured a dissociation constant of 0.8 μM for this complex, using ITC (Fig. 1C). NMR titration revealed large chemical shift perturbations in the β2-strand and, more surprisingly in the α-helix 1 and N-terminal extremity of the domain (Fig. 1D). We calculated the structure of this complex using 1,487 nuclear Overhauser effect (NOE)-derived distance restraints, including 38 intermolecular ones (SI Appendix, Table S1 and Fig. S2). We obtained a very precise ensemble of 16 structures (Fig. 2A) with an rmsd of 0.92 Å for all of the heavy atoms (SI Appendix, Table S1).

Fig. 2.

Fig. 2.

Overview of the solution structure of SRSF1 pseudo-RRM in complex with 5′-UGAAGGAC-3′ RNA. (A) Overlay of the 16 lowest-energy structures superimposed on the backbone of the structured parts of the protein and heavy atoms of RNA. The protein backbone is shown in gray, and heavy atoms are shown in orange (P atoms), yellow (C atoms for RNA), green (C atoms for protein), red (O atoms), and blue (N atoms). Only the ordered regions of the RRM (residues 116–193) and RNA (A4 to C8) are shown. (B) The structure of the complex is shown in ribbon (protein backbone) and stick (RNA) representation. The color scheme is the same as in A. Important protein side-chains involved in RNA interactions are represented as sticks. (C) Stereo view of the structure of the SRSF1 pseudo-RRM in complex with RNA. (D and E) Two views of the molecular recognition of the 5′-GGA-3′ RNA sequence by the pseudo-RRM. Protein–RNA interactions involving G5, G6, and A7 nucleotides are shown. Color schemes are as in A. Hydrogen bonds are represented by purple dashed lines.

SRSF1 Pseudo-RRM Uses α-Helix 1 to Recognize GGA.

The pseudo-RRM in complex with RNA adopted a canonical β1α1β2β3α2β4 fold that is unchanged compared with the non–ligand-bound (apo) form of the protein (47). The RNA structure adopted an extended conformation with all nucleotides in the C2′-endo sugar conformation, except for G5, which had a C3′-endo conformation (Fig. 2). The mode of binding is very unusual for an RRM, because the RNA-binding site is centered on α-helix 1, and the β-sheet surface does not contact the RNA (Fig. 2 B and C). The pseudo-RRM recognizes only three of the eight nucleotides, G5, G6, and A7, which wrap the bottom of α-helix 1. The N-terminal part of this helix, comprising Ser133, Trp134, and Gln135, forms an ideal surface for binding three nucleotides, with G5 stacking on the Trp134 side-chain, G6 stacking on the main-chain, and A7 stacking over the surface formed by the Ser133 and Gln135 side-chains (Fig. 2 C and D). Additionally, the side-chains of Trp134 and Gln135 interact with the sugar rings of G5 and G6, respectively. All these contacts provide affinity for the three nucleotides and organize the GGA triplet into an arch with two consecutive bases perpendicular to one another, but they do not confer sequence specificity. Sequence specificity is achieved by other features of the pseudo-RRM. The Watson–Crick edges of G5 and G6 are recognized by several hydrogen bonds formed with the main-chain of Ala150 (in the β2-strand) and the side-chains of Lys138 and Asp139 (in the α-helix 1), respectively (Fig. 2D). In addition, the Arg117 side-chain in the inter-RRM linker contacts the N7s of both guanines. Finally, A7’s N6 amino group is hydrogen-bonded with Asp136 from α-helix 1, and its H2 interacts with the His183 side-chain (in the α2–β4 loop) (Fig. 2D). No specific contact is observed for A1, U2, G3, A4, and C8. Overall, the structure revealed that the SRSF1 pseudo-RRM recognizes a 5′-GGA-3′ motif, with a majority of the contacts to the RNA involving the highly conserved S133WQDLKD139 sequence within α-helix 1.

In good agreement with the structure, the domain binds 5′-UGAAGGAC-3′ and 5′-AGGAC-3′ RNAs with almost the same affinity (Kd of 0.8 and 1 μM, respectively) (Fig. 1 and SI Appendix, Fig. S3). Moreover, substitutions of either G5 or G6 by adenines led to no detectable binding measured by ITC, and substitution of A7 by a guanine or a cytosine decreased the affinity by a factor of two or three, respectively (SI Appendix, Fig. S3). The invariant SWQDLKD motif located in α-helix 1 of pseudo-RRMs represents a sequence signature of these domains (43). Our structure clearly explains this conservation, considering that, except for the leucine that anchors α-helix 1 into the domain hydrophobic core, each residue of this heptapeptide contacts the RNA (Fig. 2). We measured a five- to 10-fold decrease in the RNA-binding affinity when we mutated each of these residues individually (SI Appendix, Fig. S4).

Isolated SRSF1 Pseudo-RRM Regulates Splicing Events in Cells.

We next investigated whether the RNA-binding specificity we determined for SRSF1 pseudo-RRM is relevant in vivo in the context of the full-length protein. We transfected human cells with several variants of SRSF1 and analyzed their effect on the splicing of insulin-like growth factor 1 (IGF1) exon 5, which has an SRSF1-dependent purine-rich ESE containing a GGA motif (Fig. 3A) (48). As previously reported (48), exon 5 inclusion increased in the presence of the SRSF1 WT protein (Fig. 3 BE). Each single amino acid substitution introduced into the full-length protein and expected to affect the binding of SRSF1 pseudo-RRM to RNA severely reduced the splicing enhancement of IGF1 exon 5 (Fig. 3 BE, lanes 8–13 and 15). Such large changes in splicing were surprising, considering that the mutations tested decreased RNA-binding affinity only by five- to 10-fold (SI Appendix, Fig. S4). This effect might originate from the overexpression of these protein variants resulting in nuclear concentrations close to their Kd. Conversely, mutation of Tyr149, which is not involved in the interaction with RNA, did not have a significant effect on splicing (Fig. 3 BE, lane 14). These results indicate that, in the context of the full-length protein, SRSF1 pseudo-RRM uses the mode of RNA interaction shown in our structure. In good agreement with the dominant function of the pseudo-RRM in the recognition of purine-rich ESEs by SRSF1, deletion of the pseudo-RRM (ΔRRM2) prevented this splicing enhancement, whereas deletion of the canonical RRM1 or the RS domain still allowed exon 5 inclusion, although less efficiently than in the full-length protein (Fig. 3 BE, lanes 1–5). However, the increase in the percentage of exon 6 inclusion observed in the absence of the RS domain shows that this part of the protein also is involved in the regulation of IGF1 alternative splicing (Fig. 3E, lane 5). More surprisingly, unlike RRM1, the isolated RRM2 could enhance exon 5 splicing to nearly the same extent as SRSF1 after deletion of the RS domain (ΔRS) and more than RRM1-deleted SRSF1 (ΔRRM1) (Fig. 3 BE, lanes 1–7), showing that the pseudo-RRM can activate splicing as a single domain in vivo.

Fig. 3.

Fig. 3.

Effect of SRSF1 mutations on IGF1 exon 5 splicing in cells. (A) Schematic representation of the IGF1 minigene used for the cotransfection experiments, with the location of primers (arrows) used to amplify the mRNA isoforms containing either exon 5 or exon 6. Thicker boxes, thinner boxes, and lines represent exons, UTRs, and introns, respectively. (B) Western blots showing the relative expression levels of the different T7-tagged versions of SRSF1, with β-catenin levels used as a reference. (C) RT-PCR gels showing the levels of exon 5 and exon 6 mRNA isoforms upon overexpression of either WT SRSF1 or various mutant versions in HeLa cells. GAPDH was used as a reference for loading. (D and E) Graphs showing the relative band intensities of the exon 5 (D) or exon 6 (E) mRNA isoforms upon overexpression of various SRSF1 versions. n = 4; error bars indicate SD.

In contrast to the effect observed with IGF1 exon 5, SRSF1 promotes FAS/CD95 exon 6 skipping, which gives rise to an antiapoptotic isoform (Fig. 4A) (49). To understand better the mode of action of SRSF1 pseudo-RRM in splicing, we investigated its role in splicing of FAS exon 6. As shown in Fig. 4, the isolated pseudo-RRM also could repress splicing of FAS exon 6, even more efficiently than the full-length protein (Fig. 4 BD). Together with our data for IGF1, these results demonstrate that the isolated pseudo-RRM can recapitulate the effects of full-length SRSF1 either as an activator or as a repressor of exon inclusion, depending on the pre-mRNA substrate. Similar to our observations for IGF1 exon 5, mutating Lys138 or Asp139 in the SWQDLKD motif of the domain impaired the ability of the isolated RRM to promote FAS exon 6 skipping (Fig. 4 BD). These data indicate that binding of the SRSF1 pseudo-RRM to RNA via α-helix 1 is required for its function as a regulator of alternative splicing.

Fig. 4.

Fig. 4.

Effect of SRSF1 mutations on FAS exon 6 splicing in cells. (A) Schematic representation of FAS exon 6 minigene. Exon 6 inclusion or skipping generates mRNA isoforms encoding a pro- or an antiapoptotic form of the Fas/CD95 receptor, respectively. The location of primers used to amplify transcripts derived from the FAS minigene used in cotransfection experiments is shown. (B) Western blots showing the relative expression levels of the different T7-tagged versions of SRSF1. (C) RT-PCR gel showing the levels of exon 6 mRNA isoforms upon overexpression of either full-length SRSF1 or WT and various mutant versions of the SRSF1 pseudo-RRM in HeLa cells. The black arrow indicates the junction between two initially distant lanes present in the same gel. (D) Graph showing the percentages of exon 6 inclusion in FAS mRNAs upon overexpression of various SRSF1 versions. Error bars indicate SD.

To determine whether the pseudo-RRM can regulate splicing of additional SRSF1 target genes, we analyzed the splicing patterns of 18 endogenous SRSF1 targets. Among these, MKNK2 and MSTR1/RON were shown previously to be regulated by SRSF1, whereas the remaining splicing events correspond to targets identified by RNA-seq in HeLa cells. We performed semiquantitative radioactive RT-PCR on RNA from HeLa cells upon two- to threefold overexpression of either full-length SRSF1 or the pseudo-RRM only (Fig. 5A). To enrich for cells expressing SRSF1 or the pseudo-RRM, we cotransfected GFP to enable sorting of the cells by flow cytometry. We observed that seven of the 18 splicing events responded to expression of the pseudo-RRM or the full-length protein to a similar extent (Fig. 5 BD and SI Appendix, Fig. S5). Moreover, the pseudo-RRM promoted exon skipping or exon inclusion, depending on the particular SRSF1 splicing target. These results suggest that SRSF1 pseudo-RRM is sufficient for the regulation of a substantial fraction of SRSF1 splicing targets.

Fig. 5.

Fig. 5.

Effect of SRSF1 pseudo-RRM expression on alternative splicing of endogenous SRSF1 target genes. (A) SRSF1 levels in control (pCGT) and in SRSF1- and RRM2-overexpressing HeLa cells were assessed by radioactive RT-PCR, followed by native PAGE and autoradiography. SRSF1 levels were normalized to ACTB mRNA and expressed as the fold change compared with control cells (n = 3). ACTB mRNA was used as a loading control. Error bars indicate SEM. (B) Splicing of known SRSF1 target genes (MKNK2, RON) was analyzed upon SRSF1 and RRM2 overexpression in cells from A, as described above. Representative gels are shown in SI Appendix, Fig. S5A. The ratio of each splicing isoform was quantified, and the relative level of exon inclusion was plotted (n = 3; t test ***P < 0.0005, **P < 0.005). Error bars indicate SD. (C and D) RT-PCR analysis of SRSF1 target genes expressed in cells from A that respond to both RRM2 and full-length SRSF1 (C) or only to full-length SRSF1 (D). Primers located in the exons flanking the alternatively spliced exon (SI Appendix, Table S1) allow amplification of either the full-length isoform or the skipped-exon isoform. For representative gels see SI Appendix, Fig. S5 B and C. The ratio of each isoform was quantified, and the relative level of exon inclusion was plotted (n ≥ 3; t test ***P < 0.0005, **P < 0.005, *P < 0.01). Error bars indicate SD.

SRSF1 Pseudo-RRM Competes Against hnRNP A1.

We then tried to determine how the isolated SRSF1 pseudo-RRM regulates alternative splicing. Because of the relatively small size of the domain, it is unlikely that it would act on splicing by recruiting spliceosome components. Indeed, it was reported recently that both RRMs of SRSF1 are required to interact with U1-70K (32). A more tempting hypothesis could be that the pseudo-RRM regulates splicing by competing for RNA binding with one or several splicing factors. Previous studies indicated that hnRNP A1 is an efficient competitor of SRSF1 full-length protein (50, 51). HnRNP A1 recognizes AGG motifs sequence specifically (52). Because we showed that SRSF1 pseudo-RRM recognizes GGA, a sequence containing AGGA in principle could be recognized by both proteins. To test our hypothesis, we performed in vivo competition assays using IGF1 and FAS minigenes with AGGA elements (Fig. 6). Increasing the expression of hnRNP A1 promoted IGF1 exon 5 skipping (Fig. 6A, Upper, lanes 1–4) and, more surprisingly, FAS exon 6 inclusion (Fig. 6B, Upper, lanes 1–4), thus exhibiting an effect on FAS and IGF1 splicing opposite of that of SRSF1 (Fig. 6 A and B, Lower, lanes 1–4). In addition, the effects of hnRNP A1 on FAS and IGF1 splicing were efficiently reversed by overexpression of the SRSF1 pseudo-RRM alone (Fig. 6 A and B, Lower, lanes 5–8). Similarly, increasing the expression of hnRNP A1 prevented SRSF1 RRM2 from promoting IGF1 exon 5 inclusion and FAS exon 6 skipping (Fig. 6 A and B, Upper, lanes 5–8). These results suggest that the isolated SRSF1 pseudo-RRM can stimulate exon inclusion or exon skipping (depending on the substrate) by competing with hnRNP A1.

Fig. 6.

Fig. 6.

Effect on IGF1 and FAS splicing of SRSF1 pseudo-RRM and hnRNP A1 coexpression in cells. (A) RT-PCR gels showing the levels of IGF1 exon 5 and exon 6 mRNA isoforms upon coexpression of hnRNP A1 and SRSF1 pseudo-RRM in HeLa cells. (Upper) The SRSF1 pseudo-RRM was coexpressed with increasing amounts of hnRNP A1. (Lower) hnRNP A1 was coexpressed with increasing amounts of the SRSF1 pseudo-RRM. GAPDH was used as a reference for loading. The black arrow indicates the junction between two initially distant lanes present in the same gel. (B) As in A, RT-PCR gels show the levels of FAS exon 6 mRNA isoforms upon coexpression of hnRNP A1 and the SRSF1 pseudo-RRM.

Unusual Mode of Interaction of Pseudo-RRMs with RNA Is Conserved.

With the exception of R117 and H183, all of the residues of SRSF1 pseudo-RRM involved in sequence-specific recognition of GGA are fully conserved in the pseudo-RRMs of all known SR proteins containing two RRMs, including Drosophila B52 and yeast Npl3p (SI Appendix, Fig. S6). This conservation strongly suggests that all these domains have the same sequence specificity. To test this hypothesis, we performed affinity measurements by ITC and NMR titrations of various pseudo-RRMs in complex with 5′-AGGAC-3′ or 5′-UGAAGGAC-3′ RNA (SI Appendix, Figs. S7 and S8). Although all tested pseudo-RRMs interacted with these RNAs, their affinity was 2–15 times less than that of SRSF1 pseudo-RRM. The lower binding affinity of the pseudo-RRMs of SRSF4, -5, and -6 and Npl3p may result from the lack of conservation of Arg117 and His183 and/or from a different amino acid environment around the binding site (Fig. 2 and SI Appendix, Fig. S6). However, the NMR titration with the 5′-AGGAC-3′ RNA unambiguously showed that α-helix 1 remains the RNA-binding surface for all these pseudo-RRMs, as evidenced by the large chemical shift changes observed for the NMR signals corresponding to the tryptophan and glutamine residues in the conserved SWQDLKD motif and the alanine located in the β2-strand of these domains (Fig. 7A and SI Appendix, Fig. S8). In addition, none of the pseudo-RRMs that we tested interacted with the mutant 5′-AGAAC-3′ RNA, in which the central G6 was mutated to an adenine (underlined), supporting our observation above that the interaction is sequence specific (SI Appendix, Fig. S9). Taken together, these results indicate that this unusual mode of binding and the GGA sequence specificity can be generalized to all known pseudo-RRMs. In agreement with this conclusion, and similar to our observations with the corresponding full-length proteins, SRSF9’s pseudo-RRM also activated FAS exon 6 skipping, whereas SRSF5’s pseudo-RRM was less active (Fig. 7B). These results are consistent with the affinity measurements, considering that the affinity of SRSF5 for 5′-AGGAC-3′ is five- to 10-fold lower than that of the other two pseudo-RRMs (SI Appendix, Fig. S7).

Fig. 7.

Fig. 7.

Pseudo-RRMs use a conserved mode of interaction with RNA, centered on α-helix 1. (A) Selection of chemical-shift perturbations observed for all the tested pseudo-RRMs in the presence of 5′-AGGAC-3′ RNA. Close view of chemical-shift perturbations observed in SI Appendix, Fig. S8 and corresponding to residues for which the interaction with RNA is characteristic of the binding mode observed for SRSF1 pseudo-RRM. Titrations were performed at 40 °C in the NMR buffer. The peaks corresponding to the free and RNA-bound states (RNA:protein ratios of 0.3:1 and 1:1, respectively) are colored blue, orange, and red, respectively. All the recombinant proteins used in this figure were purified in the absence of the GB1 tag. Kd values determined by ITC measurements are shown for each protein. (B) SRSF9’s but not SRSF5′s pseudo-RRM induces FAS exon 6 skipping. RT-PCR gels (Top), Western blot analyses (Middle), and graph (Bottom) showing percentages of exon 6 inclusion in FAS mRNAs upon overexpression of various pseudo-RRM versions are shown. Error bars indicate SD. (C) Structures of the SRSF1 pseudo-RRM bound to the GG dinucleotide of the 5′-UGAAGGAC-3′ RNA and to the Trp-Gly-His tripeptide of SRPK1 are shown. The side-chains of Trp88 and His90 occupy the same sites as G6 and G5, respectively. The protein backbone is shown in gray, with the side-chains of SRSF1 and SRPK1 in green and black, respectively. Heavy atoms of RNA are in orange (P atoms), yellow (C atoms), red (O atoms), and blue (N atoms).

Discussion

Interaction of Pseudo-RRM with RNA, a Conserved Mode of Binding.

In this paper, we show that the pseudo-RRM is primarily responsible for the interaction of SRSF1 with purine-rich sequences, which are found in the ESEs targeted by SR proteins in vivo (18). In addition, our data reveal that all pseudo-RRMs we tested interact specifically with the 5′-GGA-3′ motif using the same mode of binding. In particular, the presence of two consecutive guanines is crucial, because their substitution abolished the interaction of the pseudo-RRM with RNA (SI Appendix, Figs. S3 and S9). Consistent with our results, all the consensus sequences obtained by SELEX for pseudo-RRM-containing SR proteins comprise at least one GG dinucleotide (18), and RNA sequences with the highest affinity for Npl3p also comprise this motif (53). This very similar RNA sequence specificity shared by all pseudo-RRMs could explain the functional redundancy observed in some cases for SR proteins (18). However, although the pseudo-RRM plays a crucial role in RNA recognition and alternative splicing regulation, it is likely that in some cases the specificity of SR proteins is modulated by the canonical RRM1 to accommodate longer or slightly different sequences.

α-Helix 1 of Pseudo-RRMs, a Binding Platform for Both Protein and RNA.

Although previous studies suggested the potential involvement of the α-helix 1 of SRSF1 pseudo-RRM in protein (42, 54) and RNA (40, 47) interactions, there was no rational as to how such a small surface could accommodate both types of polymers. These observations now can be explained by our structure and by the structure of SRPK1 (54) bound to the SRSF1 pseudo-RRM. To our surprise, we found that the mode of binding of the α-helix 1 is very similar for both the RNA and the protein. Remarkably, the side-chains of SRPK1, Trp88, and His90 occupy the same sites as the bases of G6 and G5, respectively. The backbones of both polymers follow the same path in both structures with the Gln135 side-chain being hydrogen bonded with the main-chain of both ligands (Fig. 7C). Note that the directionality of the polymers is different. In addition, three nucleotides are needed to replace two amino acids (Figs. 2D and 7C). This double involvement of α-helix 1 also implies that SRSF1 phosphorylation by SRPK1 and RNA binding must be mutually exclusive events. This information is important to understand better the mode of action of SRSF1 in cells, because the use of the same binding platform to interact with either RNA or protein partners most likely dictates sequential ligand recognition at different regulatory steps in the gene-expression pathway.

An Additional Mode of RNA Recognition for the Family of RRMs.

Although three subclasses of RRMs were identified in Eukarya (44), until now, structural studies have been conducted for only two of them. Although the canonical RRM binds RNA through its β-sheet surface (SI Appendix, Fig. S10) (52), the quasi-RRM uses a totally different recognition interface, involving the loop regions (SI Appendix, Fig. S10) (55). Our study reveals the structure of a member of the third subclass, the pseudo-RRM, bound to RNA and demonstrates the use of another RNA-binding surface of RRMs, which is centered on α-helix 1 (SI Appendix, Fig. S10). It illustrates the unusual RNA-binding variability and plasticity of this simple protein module. To date, only one structure of a bacterial RRM has been shown to interact with RNA using the α-helix 1 (56). However, this RRM is not a pseudo-RRM, because it lacks a SWQDLKD motif, and no sequence-specific contacts were observed with the α-helix 1 (SI Appendix, Fig. S11).

Pseudo-RRM, a Key Domain of SRSF1 for Splicing Regulation.

SR proteins are well known for promoting splice-site selection via their interaction with ESEs. However, their mode of action appears to be more diverse than anticipated, with a variety of possible mechanisms influencing splice-site selection. Recent genome-wide analysis performed in mammalian cells confirmed the idea that SR proteins can either promote or inhibit exon inclusion depending on their position on pre-mRNAs (57). Their binding to intronic sequences was reported primarily to promote exon skipping (58), but the effect of their interaction with exonic sequences seems to be more complex. It was proposed that the binding of SR proteins to alternative exons promotes their inclusion, whereas their interaction with a flanking exon causes skipping of the internal alternative exon (22, 59).

Here, we found that the isolated pseudo-RRM of SRSF1 is sufficient to modulate splicing of nearly half of the tested SRSF1 target genes (Fig. 5 and SI Appendix, Fig. S5). We propose that the domain either activates or inhibits splicing by competing with binding of the splicing repressor hnRNP A1 (50, 51) to a 5′-AGGA-3′ motif, which contains overlapping binding sites for both proteins (5′-AGG-3′ and 5′-GGA-3′ for hnRNP A1 and SRSF1 pseudo-RRM, respectively) (SI Appendix, Fig. S10). In the context of IGF1, the motif is located in alternative exon 5, and the pseudo-RRM activates inclusion of the targeted exon, whereas hnRNP A1 represses splicing (Figs. 6A and 8A). An opposite effect was observed with FAS, because SRSF1 pseudo-RRM promotes exon 6 skipping, and hnRNP A1 activates its inclusion (Fig. 6B). The effect observed with hnRNP A1 was surprising, because hnRNPs were shown primarily to mediate splicing repression from exonic positions (6062). Nevertheless, a study performed in parallel with our work recently reported the same effect of hnRNP A1 on FAS exon 6 splicing (63). The authors identified the binding site of this splicing factor on exon 5, close to the 5′ splice site. Remarkably, the binding sequence comprises a 5′-AGGA-3′ motif, which could explain the competition we observed between SRSF1 pseudo-RRM and hnRNP A1 on FAS alternative splicing (Fig. 8B). In addition, the 3′ splice site of FAS intron 5 also contains a 5′-AGGA-3′ sequence, and therefore SRSF1 binding could block recognition of this 3′ splice site by U2AF35, thus contributing to exon 6 skipping (Fig. 8B). In this context, the function of hnRNP A1 in 3′ splice-site proofreading and 3′ splice-site recognition (64) also could help explain the antagonism between SRSF1 and hnRNP A1 in the regulation of FAS alternative splicing. Finally, our data confirm that the effect of SR and hnRNP proteins on splicing is position dependent (65).

Fig. 8.

Fig. 8.

Models for splicing regulation by the SRSF1 pseudo-RRM. (A) Binding of the SRSF1 pseudo-RRM to IGF1 exon 5 promotes its inclusion. The domain interacts with an AGGA-containing ESE and prevents the recruitment of hnRNP A1 to the targeted exon. (B) The SRSF1 pseudo-RRM binds to FAS exon 5 close to the 5′ splice site and promotes the skipping of exon 6. HnRNP A1 competes with the binding of the SRSF1 pseudo-RRM to FAS exon 5 and activates exon 6 inclusion (63). Alternatively, the SRSF1 pseudo-RRM also could compete with the proofreading activity of hnRNP A1 at the 3′ splice site of FAS introns 5 (64).

Interestingly, our data obtained with the SRSF1 pseudo-RRM alone fit perfectly with the above-mentioned mode of action of SR proteins in splicing regulation (22). Indeed, the binding of SRSF1 pseudo-RRM to the alternative exon of IGF1 itself activates its inclusion (Fig. 8A), whereas interaction of the domain with the flanking exon 5 and/or with the 3′ splice site of intron 5 promotes skipping of FAS alternative exon 6 (Fig. 8B). We still need to understand the link between SRSF1 pseudo-RRM and the spliceosome machinery, but our data indicate that this domain can be crucial and even sufficient to regulate these types of splicing events. Finally, the activity of pseudo-RRMs in splicing is likely to be influenced by multiple additional parameters, including formation of RNA secondary structures or competition with other proteins targeting overlapping binding sites (57, 58), which may limit the accessibility to these domains of 5′-AGGA-3′ motifs that are not involved in splicing regulation.

Taken together, our results suggest that the pseudo-RRM can activate or repress splicing, depending on the position of the sequence bound by SRSF1 and hnRNP A1 (an AGGA motif located in an ESE or close to a splice site, respectively) (Fig. 8). Remarkably, and contrary to the usually invoked mechanism of action, in this competitive model, the RS domain of the SR proteins is dispensable. Finally, our detailed insights into the unusual mode of RNA binding and mode of action of SR proteins should help facilitate the design of molecules that can shift the balance of alternative splicing isoforms for therapeutic purposes.

Accession Codes.

We deposited the chemical shifts of SRSF1 RRM2–UGAAGGAC in the BioMagResBank under accession number 19248. We have deposited the coordinates of the SRSF1 RRM2–UGAAGGAC structures in the Protein Data Bank (PDB) under the PDB ID 2M8D.

Materials and Methods

Preparation of RNA–Protein Complexes.

We cloned in the pET28 expression vector all pseudo-RRM ORFs corresponding to amino acids 107–203 of SRSF1, 103–193 of B52, 193–282 of Npl3p, 97–178 of SRSF4, 101–183 of SRSF5, 103–189 of SRSF6, and 103–185 of SRSF9, as well as SRSF1 RRM1 (residues 1–97) and SRSF1 RRM1+2 (residues 1–203). These recombinant proteins were fused to an N-terminal 6×His tag and were overexpressed at 37 °C in Escherichia coli BL21 (DE3) codon plus cells in minimal M9 medium containing 1 g/L 15NH4Cl and 4 g/L glucose (for 15N-labeled protein) or 1 g/L 15NH4Cl and 2 g/L 13C-glucose (for 15N- and 13C-labeled protein). A GB1 tag was fused at the N-terminal extremity of the SRSF1 pseudo-RRM to increase the solubility and stability of the protein. As seen in Fig. 1 and SI Appendix, Fig. S8, we observed the same chemical shift perturbations upon RNA binding in the presence or absence of the GB1 tag, demonstrating that the tag does not influence the pseudo-RRM interaction with RNA.

Proteins were purified by two successive nickel affinity chromatography (Qiagen) steps, as previously described (66), dialyzed against NMR buffer (50 mM l-Glu, 50 mM l-Arg, 0.05% β-mercaptoethanol, and 20 mM NaH2PO4 at pH 5.5) and concentrated to 0.8 mM with a 10-kDa molecular mass cutoff Centricon device (Vivascience). The SRSF1 RRM1, RRM2, and RRM1+2 proteins used in SI Appendix, Fig. S1 were dialyzed against a buffer that allows solubility of the protein containing two RRMs at a concentration of 0.2 mM (150 mM KCl, 1.5 mM MgCl2, 0.2 mM EDTA, 50 mM l-Glu, 50 mM l-Arg, 0.05% β-mercaptoethanol, and 20 mM Na2HPO4 at pH 7).

WT and mutant RNA oligonucleotides were purchased from Dharmacon, deprotected according to the manufacturer’s instructions, desalted using a G-15–size exclusion column (Amersham), lyophilized, and resuspended in NMR buffer.

RNA-protein complexes used to solve structures were formed in NMR buffer at an RNA:protein ratio of 1:1, at a 0.8 mM concentration.

NMR Measurement

All NMR measurements were performed in NMR buffer at 313 K using Bruker AVIII-500 MHz and 700 MHz equipped with a cryoprobe and AVIII-600 MHz and Avance-900 MHz spectrometers. Data were processed using Topspin 2.0 (Bruker) and analyzed with Sparky (www.cgl.ucsf.edu/home/sparky/).

Protein sequence-specific backbone and side-chain assignments were achieved using 2D 1H-15N HSQC, 2D 1H-13C HSQC, 3D HNCA, 3D HNCO, 3D CBCACONH, 3D HcCH total correlation spectroscopy (TOCSY), 3D nuclear Overhauser effect spectroscopy (NOESY), 1H-15N HSQC, and 3D NOESY 1H-13C HSQC aliphatic (for review, see ref. 67). Aromatic proton assignments were performed using 2D 1H-1H TOCSY and 3D NOESY 1H-13C HSQC aromatic.

Resonance assignments of RNA in complex with SRSF1 pseudo-RRM were performed using 2D 1H-1H TOCSY, 2D 1H-1H NOESY, 2D 13C 1F-filtered 2F-filtered NOESY (68), and natural abundance 2D (13C-1H) HSQC in 100% D2O.

Intermolecular NOEs were obtained using 2D 1H-1H NOESY and 3D 13C 1F-edited 3F-filtered HSQC-NOESY (69) using unlabeled RNA and 15N-labeled and 15N- and 13C-labeled proteins, respectively. Intermolecular NOEs between imino protons of the RNA G5 and G6 nucleotides and SRSF1 protons were obtained using 2D 1H-1H NOESY at 298 K in H2O.

All NOESY spectra were recorded with a mixing time of 150 ms, the 3D TOCSY spectrum with a mixing time of 23 ms, and the 2D TOCSY with a mixing time of 50 ms.

Structure Calculation and Refinement.

AtnosCandid software (70, 71) was used to generate preliminary structures and a list of automatically assigned NOE distance constraints for SRSF1 pseudo-RRM in complex with RNA. Peak picking and NOE assignments were performed using 3D NOESY (15N- and 13C-edited) spectra. Additionally, intraprotein hydrogen-bond constraints were added based on hydrogen–deuterium exchange experiments on the amide protons. For these hydrogen bonds, the oxygen acceptors were identified based on preliminary structures calculated without hydrogen-bond constraints.

Seven iterations were performed, and 100 independent structures were calculated at each iteration step. Structures of the protein–RNA complexes were calculated with CYANA (71) by adding the manually assigned intramolecular RNA and RNA–protein intermolecular distance restraints. In addition, hydrogen-bond constraints between protein and RNA were used based on characteristic chemical-shift perturbations observed upon RNA binding for amide and carbonyl of the Ala150 backbone (using 2D 1H-15N HSQC and 3D HNCO spectra, respectively) and of carboxyl groups of the Asp136 and Asp139 side-chains (using the 3D HNCACO spectrum). Oxygen acceptors were identified as described above. For each CYANA run, 50 independent structures were calculated. These 50 structures were refined with the SANDER module of AMBER 9.0 (72) by simulated annealing run in implicit water using the ff99 force-field (73).

The best structures based on energy and NOE violations were analyzed with PROCHECK (74). The Ramachandran plot of the SRSF1 pseudo-RRM in complex with RNA indicates that 72.7% of the residues are in the most favored regions, 22.5% in the additional allowed regions, 3.8% in the generously allowed regions, and 1% in the disallowed regions. All of the figures showing structures were generated with MOLMOL (75).

ITC.

ITC experiments were performed on a VP-ITC instrument (Microcal), calibrated according to the manufacturer’s instructions. Protein and RNA samples were dialyzed against the NMR buffer. Concentrations of proteins and RNAs were determined using optical-density absorbance at 280 and 260 nm, respectively. Then 20 μM of all of the tested RNAs were titrated with 400 μM of recombinant protein by 40 injections of 6 μL every 5 min at 40 °C. Raw data were integrated, normalized for the molar concentration, and analyzed using Origin 7.0 software according to a 1:1 RNA:protein ratio binding model.

Cell Culture and Transfections.

We cultured HeLa cells in DMEM (Invitrogen) supplemented with 10% (vol/vol) FBS (Invitrogen). We used Fugene 6 transfection reagent (Invitrogen) to cotransfect 2.1 µg of pEGFP-IGF1 minigene (48) with 0.7 µg of pCG-T7::SRSF1 or its mutants, in 40–50% confluent HeLa cells grown on six-well plates, according to the manufacturer’s recommendations. For in vivo splicing assays performed with the FAS minigene, cells were cotransfected with 200 ng of FAS minigene plasmid and 1.5 μg of SRSF1 cDNA expression vectors (full-length or RRM2 derivatives). For the titration in Fig. 6A the following plasmid amounts were used: Upper, SRSF1 RRM2: 1 μg; hnRNP A1: 1, 2, and 4 μg; Lower, hnRNP A1: 1 μg; SRSF1 RRM2: 1, 2, and 4 μg. In Fig. 6B, Upper, SRSF1 RRM2: 2 μg; hnRNP A1: 0.5, 1, and 2 μg; Lower, hnRNP A1: 1 μg; SRSF1 RRM2: 1, 2, and 4 μg.

For splicing of endogenous SRSF1 targets, HeLa cells were transfected using FuGENE 6 (Promega) with 10 µg of pCG-T7-control, -SRFS1 or -RRM2 plasmids (76) together with 1 µg of pEGFP per 10-cm plate. Cells were trypsinized 48 h after transfection, and GFP-positive cells were sorted on a Becton Dickinson FACSAria IIU Cell Sorter.

In Vivo Splicing Assays.

After 48 h, total RNA was isolated using TRIzol reagent (Invitrogen) and treated with DNase I (Promega). Reverse transcription was performed using ImProm-II reverse transcriptase (Promega) and oligo dT primers.

For assays performed with the IGF1 minigene, we used the cDNA templates to perform radioactive PCRs using vector-specific (pEGFP) primers, as described (77). For assays involving the FAS minigene, we performed RT-PCRs as described (78). To calculate the IGF1 exon 5 or exon 6 inclusion levels, the α-32P-dCTP–labeled PCR-amplicons were separated by native PAGE, followed by phosphorimage analysis on a Fujifilm FLA-5100 instrument (Fuji Medical Systems USA, Inc.). We quantified the band intensities using MultiGauge software version 2.3 (Fujifilm) and normalized the values for the G+C content according to the DNA sequence. We further normalized the G+C corrected band intensities of individual isoforms in each lane to the corresponding intensities of GAPDH bands, as a control for loading. We then set the levels of exon 5 and exon 6 expression from the empty vector lane as 100 and represented the expression levels of these isoforms upon overexpression of WT-SRSF1 or its mutant forms as fold changes.

For splicing of endogenous SRSF1 targets, radioactive touchdown PCR with α-32P-dCTP was used to amplify endogenous transcripts with primers described in SI Appendix Table S2. PCR products were separated by 8% native PAGE, and bands were quantified with a phosphorimager (Fuji Image Reader FLA-5100). The signal intensity was normalized first to the GC content of the amplified transcript and then to the sum of different isoform mRNAs or to β-actin if only one band was detected. The changes were expressed as percentages of exon inclusion, using empty vector as a control.

Supplementary Material

Supporting Information

Acknowledgments

We thank Dr. Lenka Skrisovska for providing the Npl3 clone; Prof. Stuart Wilson and Dr. Guillaume Hautbergue for providing the SRSF1 clone; Prof. Jamal Tazi and Dr. Francois Juge for providing the B52 clone; Prof. Shern Chew for the IGF1 minigene; Markus Blatter for help with structure calculations; Dr. Mario Schubert, Dr. Christophe Maris, and Dr. Fred Damberger for help with NMR spectroscopy settings; Dr. Julien Boudet for advice during ITC measurements; and Prof. Giuseppe Biamonti and Dr. James Stevenin for useful discussions. We also thank the Schweizerischer Nationalfonds-National Centers of Competence in Research Structural Biology and the European Alternative Splicing Network (EURASNET) for financial support of F.H.-T.A., EURASNET, Consolider RNAREG, and Fundación Marcelino Botín for financial support of J.V., and the European Molecular Biology Organization for financial support of A.C., who also received a postdoctoral fellowship from the Fondation Suisse de Recherche sur les Maladies Musculaires. R.S., O.A., and A.R.K. were supported by National Institutes of Health Grant R37-GM42699.

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

Data deposition: The chemical shifts of SRSF1 RRM2–UGAAGGAC have been deposited in the BioMagResBank (BMRB), www.bmrb.wisc.edu (accession no. 19248). The coordinates of the SRSF1 RRM2–UGAAGGAC structures have been deposited in the Protein Data Bank (PDB), www.pdb.org (PDB ID code 2M8D).

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1303445110/-/DCSupplemental.

References

  • 1.Graveley BR. Sorting out the complexity of SR protein functions. RNA. 2000;6(9):1197–1211. doi: 10.1017/s1355838200000960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Chen M, Manley JL. Mechanisms of alternative splicing regulation: Insights from molecular and genomics approaches. Nat Rev Mol Cell Biol. 2009;10(11):741–754. doi: 10.1038/nrm2777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Huang Y, Steitz JA. SRprises along a messenger’s journey. Mol Cell. 2005;17(5):613–615. doi: 10.1016/j.molcel.2005.02.020. [DOI] [PubMed] [Google Scholar]
  • 4.Long JC, Caceres JF. The SR protein family of splicing factors: Master regulators of gene expression. Biochem J. 2009;417(1):15–27. doi: 10.1042/BJ20081501. [DOI] [PubMed] [Google Scholar]
  • 5.Zhong XY, Wang P, Han J, Rosenfeld MG, Fu XD. SR proteins in vertical integration of gene expression from transcription to RNA processing to translation. Mol Cell. 2009;35(1):1–10. doi: 10.1016/j.molcel.2009.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Tazi J, Bakkour N, Stamm S. Alternative splicing and disease. Biochim Biophys Acta. 2009;1792(1):14–26. doi: 10.1016/j.bbadis.2008.09.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Soret J, Gabut M, Tazi J. SR proteins as potential targets for therapy. Prog Mol Subcell Biol. 2006;44:65–87. doi: 10.1007/978-3-540-34449-0_4. [DOI] [PubMed] [Google Scholar]
  • 8.Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet. 2008;40(12):1413–1415. doi: 10.1038/ng.259. [DOI] [PubMed] [Google Scholar]
  • 9.Wang ET, et al. Alternative isoform regulation in human tissue transcriptomes. Nature. 2008;456(7221):470–476. doi: 10.1038/nature07509. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Wang GS, Cooper TA. Splicing in disease: Disruption of the splicing code and the decoding machinery. Nat Rev Genet. 2007;8(10):749–761. doi: 10.1038/nrg2164. [DOI] [PubMed] [Google Scholar]
  • 11.Irimia M, Blencowe BJ. Alternative splicing: Decoding an expansive regulatory layer. Curr Opin Cell Biol. 2012;24(3):323–332. doi: 10.1016/j.ceb.2012.03.005. [DOI] [PubMed] [Google Scholar]
  • 12.Schwerk C, Schulze-Osthoff K. Regulation of apoptosis by alternative pre-mRNA splicing. Mol Cell. 2005;19(1):1–13. doi: 10.1016/j.molcel.2005.05.026. [DOI] [PubMed] [Google Scholar]
  • 13.Bonnal S, et al. RBM5/Luca-15/H37 regulates Fas alternative splice site pairing after exon definition. Mol Cell. 2008;32(1):81–95. doi: 10.1016/j.molcel.2008.08.008. [DOI] [PubMed] [Google Scholar]
  • 14.Corsini L, et al. U2AF-homology motif interactions are required for alternative splicing regulation by SPF45. Nat Struct Mol Biol. 2007;14(7):620–629. doi: 10.1038/nsmb1260. [DOI] [PubMed] [Google Scholar]
  • 15.Förch P, et al. The apoptosis-promoting factor TIA-1 is a regulator of alternative pre-mRNA splicing. Mol Cell. 2000;6(5):1089–1098. doi: 10.1016/s1097-2765(00)00107-6. [DOI] [PubMed] [Google Scholar]
  • 16.Izquierdo JM, et al. Regulation of Fas alternative splicing by antagonistic effects of TIA-1 and PTB on exon definition. Mol Cell. 2005;19(4):475–484. doi: 10.1016/j.molcel.2005.06.015. [DOI] [PubMed] [Google Scholar]
  • 17.Cheng J, et al. Protection from Fas-mediated apoptosis by a soluble form of the Fas molecule. Science. 1994;263(5154):1759–1762. doi: 10.1126/science.7510905. [DOI] [PubMed] [Google Scholar]
  • 18.Bourgeois CF, Lejeune F, Stévenin J. Broad specificity of SR (serine/arginine) proteins in the regulation of alternative splicing of pre-messenger RNA. Prog Nucleic Acid Res Mol Biol. 2004;78:37–88. doi: 10.1016/S0079-6603(04)78002-2. [DOI] [PubMed] [Google Scholar]
  • 19.Jumaa H, Nielsen PJ. The splicing factor SRp20 modifies splicing of its own mRNA and ASF/SF2 antagonizes this regulation. EMBO J. 1997;16(16):5077–5085. doi: 10.1093/emboj/16.16.5077. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Buratti E, Stuani C, De Prato G, Baralle FE. SR protein-mediated inhibition of CFTR exon 9 inclusion: Molecular characterization of the intronic splicing silencer. Nucleic Acids Res. 2007;35(13):4359–4368. doi: 10.1093/nar/gkm444. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Gallego ME, Gattoni R, Stévenin J, Marie J, Expert-Bezançon A. The SR splicing factors ASF/SF2 and SC35 have antagonistic effects on intronic enhancer-dependent splicing of the beta-tropomyosin alternative exon 6A. EMBO J. 1997;16(7):1772–1784. doi: 10.1093/emboj/16.7.1772. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Han J, et al. SR proteins induce alternative exon skipping through their activities on the flanking constitutive exons. Mol Cell Biol. 2011;31(4):793–802. doi: 10.1128/MCB.01117-10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Jiang ZH, Zhang WJ, Rao Y, Wu JY. Regulation of Ich-1 pre-mRNA alternative splicing and apoptosis by mammalian splicing factors. Proc Natl Acad Sci USA. 1998;95(16):9155–9160. doi: 10.1073/pnas.95.16.9155. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Lemaire R, Winne A, Sarkissian M, Lafyatis R. SF2 and SRp55 regulation of CD45 exon 4 skipping during T cell activation. Eur J Immunol. 1999;29(3):823–837. doi: 10.1002/(SICI)1521-4141(199903)29:03<823::AID-IMMU823>3.0.CO;2-C. [DOI] [PubMed] [Google Scholar]
  • 25.Solis AS, Peng R, Crawford JB, Phillips JA, 3rd, Patton JG. Growth hormone deficiency and splicing fidelity: Two serine/arginine-rich proteins, ASF/SF2 and SC35, act antagonistically. J Biol Chem. 2008;283(35):23619–23626. doi: 10.1074/jbc.M710175200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.ten Dam GB, et al. Regulation of alternative splicing of CD45 by antagonistic effects of SR protein splicing factors. J Immunol. 2000;164(10):5287–5295. doi: 10.4049/jimmunol.164.10.5287. [DOI] [PubMed] [Google Scholar]
  • 27.Shen H, Green MR. A pathway of sequential arginine-serine-rich domain-splicing signal interactions during mammalian spliceosome assembly. Mol Cell. 2004;16(3):363–373. doi: 10.1016/j.molcel.2004.10.021. [DOI] [PubMed] [Google Scholar]
  • 28.Kohtz JD, et al. Protein-protein interactions and 5′-splice-site recognition in mammalian mRNA precursors. Nature. 1994;368(6467):119–124. doi: 10.1038/368119a0. [DOI] [PubMed] [Google Scholar]
  • 29.Wu JY, Maniatis T. Specific interactions between proteins implicated in splice site selection and regulated alternative splicing. Cell. 1993;75(6):1061–1070. doi: 10.1016/0092-8674(93)90316-i. [DOI] [PubMed] [Google Scholar]
  • 30.Amrein H, Hedley ML, Maniatis T. The role of specific protein-RNA and protein-protein interactions in positive and negative control of pre-mRNA splicing by Transformer 2. Cell. 1994;76(4):735–746. doi: 10.1016/0092-8674(94)90512-6. [DOI] [PubMed] [Google Scholar]
  • 31.Zhu J, Krainer AR. Pre-mRNA splicing in the absence of an SR protein RS domain. Genes Dev. 2000;14(24):3166–3178. doi: 10.1101/gad.189500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Cho S, et al. Interaction between the RNA binding domains of Ser-Arg splicing factor 1 and U1-70K snRNP protein determines early spliceosome assembly. Proc Natl Acad Sci USA. 2011;108(20):8233–8238. doi: 10.1073/pnas.1017700108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Manley JL, Krainer AR. A rational nomenclature for serine/arginine-rich protein splicing factors (SR proteins) Genes Dev. 2010;24(11):1073–1074. doi: 10.1101/gad.1934910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Ge H, Manley JL. A protein factor, ASF, controls cell-specific alternative splicing of SV40 early pre-mRNA in vitro. Cell. 1990;62(1):25–34. doi: 10.1016/0092-8674(90)90236-8. [DOI] [PubMed] [Google Scholar]
  • 35.Krainer AR, Conway GC, Kozak D. The essential pre-mRNA splicing factor SF2 influences 5′ splice site selection by activating proximal sites. Cell. 1990;62(1):35–42. doi: 10.1016/0092-8674(90)90237-9. [DOI] [PubMed] [Google Scholar]
  • 36.Karni R, et al. The gene encoding the splicing factor SF2/ASF is a proto-oncogene. Nat Struct Mol Biol. 2007;14(3):185–193. doi: 10.1038/nsmb1209. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Wang J, Takagaki Y, Manley JL. Targeted disruption of an essential vertebrate gene: ASF/SF2 is required for cell viability. Genes Dev. 1996;10(20):2588–2599. doi: 10.1101/gad.10.20.2588. [DOI] [PubMed] [Google Scholar]
  • 38.Longman D, Johnstone IL, Cáceres JF. Functional characterization of SR and SR-related genes in Caenorhabditis elegans. EMBO J. 2000;19(7):1625–1637. doi: 10.1093/emboj/19.7.1625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Chandler SD, Mayeda A, Yeakley JM, Krainer AR, Fu XD. RNA splicing specificity determined by the coordinated action of RNA recognition motifs in SR proteins. Proc Natl Acad Sci USA. 1997;94(8):3596–3601. doi: 10.1073/pnas.94.8.3596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Chiodi I, et al. RNA recognition motif 2 directs the recruitment of SF2/ASF to nuclear stress bodies. Nucleic Acids Res. 2004;32(14):4127–4136. doi: 10.1093/nar/gkh759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Dauksaite V, Akusjärvi G. The second RNA-binding domain of the human splicing factor ASF/SF2 is the critical domain controlling adenovirus E1A alternative 5′-splice site selection. Biochem J. 2004;381(Pt 2):343–350. doi: 10.1042/BJ20040408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Michlewski G, Sanford JR, Cáceres JF. The splicing factor SF2/ASF regulates translation initiation by enhancing phosphorylation of 4E-BP1. Mol Cell. 2008;30(2):179–189. doi: 10.1016/j.molcel.2008.03.013. [DOI] [PubMed] [Google Scholar]
  • 43.Birney E, Kumar S, Krainer AR. Analysis of the RNA-recognition motif and RS and RGG domains: Conservation in metazoan pre-mRNA splicing factors. Nucleic Acids Res. 1993;21(25):5803–5816. doi: 10.1093/nar/21.25.5803. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Cléry A, Blatter M, Allain FH. RNA recognition motifs: Boring? Not quite. Curr Opin Struct Biol. 2008;18(3):290–298. doi: 10.1016/j.sbi.2008.04.002. [DOI] [PubMed] [Google Scholar]
  • 45.Tacke R, Manley JL. The human splicing factors ASF/SF2 and SC35 possess distinct, functionally significant RNA binding specificities. EMBO J. 1995;14(14):3540–3551. doi: 10.1002/j.1460-2075.1995.tb07360.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Sanford JR, et al. Identification of nuclear and cytoplasmic mRNA targets for the shuttling protein SF2/ASF. PLoS ONE. 2008;3(10):e3369. doi: 10.1371/journal.pone.0003369. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Tintaru AM, et al. Structural and functional analysis of RNA and TAP binding to SF2/ASF. EMBO Rep. 2007;8(8):756–762. doi: 10.1038/sj.embor.7401031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Smith PJ, et al. An exonic splicing enhancer in human IGF-I pre-mRNA mediates recognition of alternative exon 5 by the serine-arginine protein splicing factor-2/alternative splicing factor. Endocrinology. 2002;143(1):146–154. doi: 10.1210/endo.143.1.8598. [DOI] [PubMed] [Google Scholar]
  • 49.Gonçalves V, Matos P, Jordan P. Antagonistic SR proteins regulate alternative splicing of tumor-related Rac1b downstream of the PI3-kinase and Wnt pathways. Hum Mol Genet. 2009;18(19):3696–3707. doi: 10.1093/hmg/ddp317. [DOI] [PubMed] [Google Scholar]
  • 50.Eperon IC, et al. Selection of alternative 5′ splice sites: Role of U1 snRNP and models for the antagonistic effects of SF2/ASF and hnRNP A1. Mol Cell Biol. 2000;20(22):8303–8318. doi: 10.1128/mcb.20.22.8303-8318.2000. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Mayeda A, Krainer AR. Regulation of alternative pre-mRNA splicing by hnRNP A1 and splicing factor SF2. Cell. 1992;68(2):365–375. doi: 10.1016/0092-8674(92)90477-t. [DOI] [PubMed] [Google Scholar]
  • 52.Ding J, et al. Crystal structure of the two-RRM domain of hnRNP A1 (UP1) complexed with single-stranded telomeric DNA. Genes Dev. 1999;13(9):1102–1115. doi: 10.1101/gad.13.9.1102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Deka P, Bucheli ME, Moore C, Buratowski S, Varani G. Structure of the yeast SR protein Npl3 and Interaction with mRNA 3′-end processing signals. J Mol Biol. 2008;375(1):136–150. doi: 10.1016/j.jmb.2007.09.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Ngo JC, et al. A sliding docking interaction is essential for sequential and processive phosphorylation of an SR protein by SRPK1. Mol Cell. 2008;29(5):563–576. doi: 10.1016/j.molcel.2007.12.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Dominguez C, Fisette JF, Chabot B, Allain FH. Structural basis of G-tract recognition and encaging by hnRNP F quasi-RRMs. Nat Struct Mol Biol. 2010;17(7):853–861. doi: 10.1038/nsmb.1814. [DOI] [PubMed] [Google Scholar]
  • 56.Hardin JW, Hu YX, McKay DB. Structure of the RNA binding domain of a DEAD-box helicase bound to its ribosomal RNA target reveals a novel mode of recognition by an RNA recognition motif. J Mol Biol. 2010;402(2):412–427. doi: 10.1016/j.jmb.2010.07.040. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Pandit S, et al. Genome-wide analysis reveals SR protein cooperation and competition in regulated splicing. Mol Cell. 2013;50(2):223–235. doi: 10.1016/j.molcel.2013.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Zhou Z, Fu XD. Regulation of splicing by SR proteins and SR protein-specific kinases. Chromosoma. 2013;122(3):191–207. doi: 10.1007/s00412-013-0407-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Sanford JR, et al. Splicing factor SFRS1 recognizes a functionally diverse landscape of RNA transcripts. Genome Res. 2009;19(3):381–394. doi: 10.1101/gr.082503.108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Chen CD, Kobayashi R, Helfman DM. Binding of hnRNP H to an exonic splicing silencer is involved in the regulation of alternative splicing of the rat beta-tropomyosin gene. Genes Dev. 1999;13(5):593–606. doi: 10.1101/gad.13.5.593. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Jacquenet S, et al. A second exon splicing silencer within human immunodeficiency virus type 1 tat exon 2 represses splicing of Tat mRNA and binds protein hnRNP H. J Biol Chem. 2001;276(44):40464–40475. doi: 10.1074/jbc.M104070200. [DOI] [PubMed] [Google Scholar]
  • 62.Rothrock CR, House AE, Lynch KW. HnRNP L represses exon splicing via a regulated exonic splicing silencer. EMBO J. 2005;24(15):2792–2802. doi: 10.1038/sj.emboj.7600745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Oh HK, et al. 2013. hnRNP A1 contacts exon 5 to promote exon 6 inclusion of apoptotic Fas gene. Apoptosis 18(7):825–835.
  • 64.Tavanez JP, Madl T, Kooshapur H, Sattler M, Valcárcel J. hnRNP A1 proofreads 3′ splice site recognition by U2AF. Mol Cell. 2012;45(3):314–329. doi: 10.1016/j.molcel.2011.11.033. [DOI] [PubMed] [Google Scholar]
  • 65.Erkelenz S, et al. Position-dependent splicing activation and repression by SR and hnRNP proteins rely on common mechanisms. RNA. 2013;19(1):96–102. doi: 10.1261/rna.037044.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Cléry A, et al. Molecular basis of purine-rich RNA recognition by the human SR-like protein Tra2-β1. Nat Struct Mol Biol. 2011;18(4):443–450. doi: 10.1038/nsmb.2001. [DOI] [PubMed] [Google Scholar]
  • 67.Sattler M, Schleucher J, Griesinger C. Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients. Prog NMR Spec. 1999;34:93–158. [Google Scholar]
  • 68.Peterson RD, Theimer CA, Wu H, Feigon J. New applications of 2D filtered/edited NOESY for assignment and structure elucidation of RNA and RNA-protein complexes. J Biomol NMR. 2004;28(1):59–67. doi: 10.1023/B:JNMR.0000012861.95939.05. [DOI] [PubMed] [Google Scholar]
  • 69.Lee W, Revington MJ, Arrowsmith C, Kay LE. A pulsed field gradient isotope-filtered 3D 13C HMQC-NOESY experiment for extracting intermolecular NOE contacts in molecular complexes. FEBS Lett. 1994;350(1):87–90. doi: 10.1016/0014-5793(94)00740-3. [DOI] [PubMed] [Google Scholar]
  • 70.Herrmann T, Güntert P, Wüthrich K. Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. J Mol Biol. 2002;319(1):209–227. doi: 10.1016/s0022-2836(02)00241-3. [DOI] [PubMed] [Google Scholar]
  • 71.Herrmann T, Güntert P, Wüthrich K. Protein NMR structure determination with automated NOE-identification in the NOESY spectra using the new software ATNOS. J Biomol NMR. 2002;24(3):171–189. doi: 10.1023/a:1021614115432. [DOI] [PubMed] [Google Scholar]
  • 72.Case DA, et al. The Amber biomolecular simulation programs. J Comput Chem. 2005;26(16):1668–1688. doi: 10.1002/jcc.20290. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Wang JM, Cieplak P, Kollman PA. How well does a restrained electrostatic potential (RESP) model perform in calculating conformational energies of organic and biological molecules? J Comput Chem. 2000;21(12):1049–1074. [Google Scholar]
  • 74.Laskowski RA, Rullmannn JA, MacArthur MW, Kaptein R, Thornton JM. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J Biomol NMR. 1996;8(4):477–486. doi: 10.1007/BF00228148. [DOI] [PubMed] [Google Scholar]
  • 75.Koradi R, Billeter M, Wuthrich K. 1996. MOLMOL: A program for display and analysis of macromolecular structures. J Mol Graph 14(1):51–55, 29–32. [DOI] [PubMed]
  • 76.Cáceres JF, Misteli T, Screaton GR, Spector DL, Krainer AR. Role of the modular domains of SR proteins in subnuclear localization and alternative splicing specificity. J Cell Biol. 1997;138(2):225–238. doi: 10.1083/jcb.138.2.225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Hua Y, Vickers TA, Baker BF, Bennett CF, Krainer AR. Enhancement of SMN2 exon 7 inclusion by antisense oligonucleotides targeting the exon. PLoS Biol. 2007;5(4):e73. doi: 10.1371/journal.pbio.0050073. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Corrionero A, Raker VA, Izquierdo JM, Valcárcel J. Strict 3′ splice site sequence requirements for U2 snRNP recruitment after U2AF binding underlie a genetic defect leading to autoimmune disease. RNA. 2011;17(3):401–411. doi: 10.1261/rna.2444811. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information
1303445110_sapp.pdf (5.2MB, pdf)

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES