Abstract
There is growing evidence that a substantial number of protein domains ascribed as DNA-binding also interact with RNA to regulate biological processes. Several recent studies have revealed that the Sox2 transcription factor binds RNA through its high mobility group box (HMGB) domain in vitro and in vivo. High conservation of this domain amongst members of the Sox family of transcription factors suggests that RNA-binding activity may be a general feature of these proteins. To address this hypothesis, we examined a subset of HMGB domains from human Sox family proteins for their ability to bind both DNA and RNA in vitro. We observed selective, high affinity interactions between Sox family HMGB domains and various model RNA elements including a four-way junction RNA, a hairpin RNA with an internal bulge, G-quadruplex RNA, and a fragment of the long non-coding RNA ES2, which is known to directly interact with Sox2. Importantly, the HMGB domains bind these RNA ligands significantly tighter than non-consensus dsDNA and in some cases with affinities rivaling their consensus dsDNA sequences. These data suggest that RNA-binding is a conserved feature of the Sox family of transcription factors with the potential to modulate unappreciated biological functions.
Keywords: Transcription factor, HMGB, RNA binding protein
Graphical Abstract

Increasingly, proteins previously recognized as only binding DNA have begun to be appreciated for their RNA-binding capabilities which are essential for biological processes.1, 2 Several large-scale surveys seeking to identify proteins that indirectly or directly interact with RNA have putatively found a number of DNA-binding proteins not classified as RNA-binding.3–5 Two classes of DNA-binding proteins frequently observed to be associated with RNA are chromatin remodeling proteins and transcription factors (TFs). In these surveys, high mobility group box (HMGB) DNA-binding domains have been detected across multiple independent studies,2, 6–8 leading to the hypothesis that a subset of HMGB domain-containing proteins known to site-specifically bind DNA also recognize and interact with RNA in the cell.
An important HMGB-containing TF found to interact with RNA is Sox2; these interactions are mediated directly by the HMGB domain.9–18 Sox2 is essential for embryonic development, maintenance of stem cell populations, induction of pluripotent stem cells and is considered an oncogene.19–22 The interaction of Sox2 with RNA is important for a variety of Sox2-driven biological processes.13, 15, 16, 18 Sox2 modulates alternative splicing decisions and also associates with a subset of these alternatively spliced transcripts.11, 18 Sox2-RNA interactions are also essential for Sox2 genomic localization; knockdown of Sox2’s interacting RNAs modifies it genomic localization with deleterious impacts on normal cellular processes.10, 15 Sox2-RNA interactions within cancer cells impact cellular decisions such as differentiation or by promoting tumorigenesis.12, 17 In vitro, Sox2 uses distinct residues when interacting with DNA or RNA and RNA binding is competitive with DNA.9 Together, these studies demonstrate Sox2 is an RNA-binding protein (RBP) and these interactions are an essential component of Sox2’s function.
Sox2 is a member of the larger family of Sox proteins, all of which contain the highly conserved HMGB DNA-binding domain, raising the question of whether other Sox protein family members directly bind RNA. In humans, 20 Sox family proteins have been identified, many of which are conserved throughout eukarya.23–25 While compelling evidence exists for RNA associations by Sox4, Sox9, Sox10, and Sox2126–29, their ability to directly interact with RNA has not yet been assessed. To address our question, we selected a representative set of HMGB domains from each Sox group (see below) within the Sox family and tested the affinity for consensus dsDNA, non-consensus dsDNA and four model RNAs. We find that most Sox HMGB domains examined directly interact with RNA in vitro with similar or slightly weaker affinities than measured for the consensus dsDNA but much higher than a non-consensus dsDNA ligand. These data strongly suggest that, like Sox2, a significant subset of human Sox proteins may interact with RNA in vivo.
All Sox family proteins possess the ~80 amino acid HMGB DNA-binding domain containing a core DNA recognition element, RPMNAFMVW, that is conserved with nearly 100% identity (cyan box, Figure 1).23, 30, 31 Alignment of Sox family HMGB domains shows that all members share at least ~50% sequence identity with the parent Sox protein, sex-determining region Y protein (SRY).31 Further, structures of Sox HMGB domains across the family, either in the unbound or DNA-bound states are highly similar, with all comprising three alpha helices forming an L-shaped wedge that inserts residues into the minor groove of B-form DNA to induce bends of ~70°.30, 32, 33 Recognition of DNA targets in different contexts is made possible by the flexible and malleable structure of the HMGB domain34—a property also attributed to many RBPs.35
Figure 1.

High conservation of amino acids across the Sox family HMGB domain supports their RNA-binding potential. Sequence alignment of Sox family and Lef1 HMGB domains and their categorization into distinct groups. Sox family core residues are within the cyan box (top left). The asterisk denotes the HMGB domains and sequences used in this study which include two non-native N-terminal GP residues. Red highlighted residues are involved in Sox2 RNA-binding.
Previously, we conducted a thorough alanine scan of the Sox2 HMGB domain, revealing that the DNA- and RNA-binding surfaces of the Sox2 HMGB domain significantly overlap, but residues that are only important for binding either DNA or RNA were also identified.9 Residues found to be important for Sox2 binding of RNA are highly conserved throughout the Sox family (red bars, Figure 1), suggesting RNA-binding potential for other Sox HMGB domains.
To support our hypothesis, we surveyed the literature and databases for biological properties that suggest Sox family proteins interact with RNA in vivo (Figure S1). First, evidence that uses experimental approaches that determine direct binding of proteins to RNA was considered, such as UV-crosslinking RNA immunoprecipitation (RIP) in vivo and/or in vitro biochemical binding assays. Evidence for Sox-RNA interactions using these approaches are separate from methods such as formaldehyde crosslinking RIP in vivo or non-crosslinking pull down assays that cannot distinguish between direct and indirect interactions with RNA. This analysis reveals that while Sox1 and Sox2 are the only Sox proteins with evidence for directly interacting with RNA,9, 18 Sox4, Sox9, Sox10 and Sox21 at a minimum, indirectly interact with RNA (Figure S1A and B).26–29
Sox family proteins interact with protein partners on DNA,36 and it is likely they also use protein partners when binding RNA, as evidenced by interactions between Sox2 and the DLX5-SMARCA4-EVF2 lncRNA RNP complex.10 Like Sox2, six other Sox proteins are components of RNP complexes, RNP condensates, or localize to sites where RNA processing occurs including the nucleolus (Sox15), nuclear speckles (SRY, Sox6, Sox7), paraspeckles (Sox9), lamp brush chromosome lateral loops (Sox9), and stress granules (Sox3) (Figure S1C).37–42 Sox2’s protein partners are highly enriched with proteins involved in RNA metabolism such as splicing.43 Examination of the protein-protein interaction database APID44 supports that Sox2-RNP complexes are an important component of Sox2 biology as ~30% (human) and ~20% (mouse) of Sox2’s known interacting protein partners are labeled as RBPs (human shown in Figure S1D). Using APID, thirteen other Sox proteins interact with proteins known to bind RNA (Figure S1D), suggesting Sox family proteins are components of RNP assemblies. Collectively, this supports our hypothesis that Sox family proteins bind RNA, although, these data do not demonstrate direct Sox-RNA interactions.
To determine whether human Sox family proteins directly bind RNA in vitro and thereby have the potential to directly interact with RNA in vivo, a subset of Sox-family HMGB domains were selected for examination. Human Sox family proteins are divided into nine distinct groups based upon sequence conservation within the HMGB domain and within each group the HMGB domain residues are at least ~70% conserved (Figure 1).23, 31 For this study, we chose nine human Sox HMGB domains—one HMGB domain from each Sox group (denoted by asterisks, Figure 1). To ensure that each protein in this analysis comprised the full HMGB domain, the sequence of each domain was defined by UniProt45 and at least three additional residues on the N- and C-termini were added (sequences of proteins and UniProt ID given in Table S1).
To determine if a non-Sox family sequence specific DNA-binding HMGB domain can bind RNA, we included Lef1 in this study.46 Lef1’s HMGB domain is similar in sequence and structure to the Sox family members HMGB domain (Figure 1), but is part of the TCF/Lef1 family of sequence specific TFs and their HMGB domains are nearly 100% sequence identical (Figure S2).47 Previously, TCF7 has been shown to directly interact with RNA in vitro through its HMGB domain48, suggesting Lef1 and the other TCF proteins may also directly interact with RNA through their HMGB domain.
We first validated the ability of the selected HMGB domains to bind both site-specific and non-site-specific dsDNAs by determining apparent dissociation constants (KD, app) for each protein-nucleic acid complex using fluorescence anisotropy (FA) (Figure 2). We further used protein-binding stoichiometric electrophoretic mobility shift assay (EMSA) (Figure 2), which are completed with nucleic acid ligand concentrations greater than the KD, app. The HMGB domains of nine Sox-family proteins and Lef1 were expressed as a maltose binding protein-HMGB domain fusion, which were cleaved with 3C protease and purified to homogeneity (Figure S3). Sox family proteins bind the same core consensus DNA motif (5’-TTGT) while allowing variations in recognition of the flanking base pairs immediately 5’- and 3’- to the core.30, 49 This core DNA motif is confirmed for multiple Sox proteins in vivo.50–52 However, many of these Sox HMBG domains have not been assessed using in vitro methods to determine their affinity for the core consensus dsDNA motif.
Figure 2.

Sox17 binds dsDNA. A. FA assay with dsDNA containing the Sox consensus motif, 5’ TTGT dsDNA. B. EMSA to measure protein:DNA stoichiometry with dsDNA containing the Sox consensus motif. C. FA assay with dsDNA without Sox consensus sequence. D. EMSA to measure protein:DNA stoichiometry with dsDNA without the Sox consensus motif. Fit FA binding curves of Sox17 HMGB domain against each ligand with KD, app obtained from first transition, N=at least 3, standard deviation reported.
The Sox2 FGF4 enhancer sequence containing the core TTGT sequence was used for all Sox proteins53, 54 and a separate consensus dsDNA sequence of 5’-CCTTTGAA was used for Lef1.46, 55
Representative binding curves for the Sox17 HMGB domain are shown in Figure 2A. All the HMGB domains bound the consensus dsDNA ligand with affinities ranging from 0.6 to 72 nM (Table 1), consistent with reports for those previously tested (Table S2).9, 49, 54, 56–59 Almost all were in the 1 nM range, with only two family members (Sox6 and Sox30) being significantly weaker. In the consensus dsDNA FA titrations, two distinct transitions were observed that correspond to 1:1 and 2:1 protein:dsDNA complexes. This was established by correlating each transition observed by FA to a protein-induced shift in an EMSA (Figure 2B and Figure S4); this behavior has been previously observed.9, 59 The first transition corresponds to high affinity binding to the core consensus sequence and is the KD, app reported in Table 1 for all HMGB domains. The second transition likely results from a lower affinity binding event corresponding to a second protein binding with KD2, app values given in Table S3. The total anisotropy change in our FA experiments are consistent with previous reports of proteins binding FAM labeled nucleic acids.60, 61 Importantly, these data demonstrate that most Sox family HMGB domains bind the core 5’-TTGT sequence with highly similar in vitro affinities.
Table 1.
Binding affinities of Sox- and Lef1-HMGB domains against the nucleic acid ligands.
| HMGB domain | Nucleic Acid Ligands, KD, app, nM | |||||
|---|---|---|---|---|---|---|
| Consensus dsDNA | Non-consensus dsDNA | ES2 lncRNA fragment | Stem loop w/ internal bulge | RNA 4-way junction | TERRA G-quad | |
| SRY | 0.6 ± 0.3 | 840 ± 130 | 6.9 ± 0.6 | 20 ± 1 | 4.6 ± 0.5 | 130 ± 80 |
| Sox2 | 0.6 ± 0.1 | 600 ± 20 | 2.8 ± 1.2 | 16 ± 0.8 | 5.2 ± 0.3 | 0.7 ± 0.5 |
| Sox6 | 31 ± 15 | 1200 ± 300 | 72 ± 7 | 290 ± 15 | 96 ± 4 | 34 ± 5 |
| Sox9 | 0.6 ± 0.2 | 970 ± 390 | 4.2 ± 2.0 | 28 ± 7 | 8.4 ± 0.1 | 1.2 ± 0.5 |
| Sox11 | 4.4 ± 1.9 | 1200 ± 600 | 100 ± 16 | 360 ± 30 | 68 ± 9 | 1.9 ± 1.3 |
| Sox15 | 1.2 ± 0.8 | 1200 ± 800 | 12 ± 6 | 260 ± 7 | 70 ± 11 | 2.1 ± 2.5 |
| Sox17 | 0.5 ± 0.2 | 310 ± 40 | 1.5 ± 0.7 | 6.9 ± 3.8 | 1.6 ± 0.2 | 0.2 ± 0.2 |
| Sox21 | 0.5 ± 0.2 | 1200 ± 300 | 3.2 ± 0.5 | 79 ± 17 | 23 ± 7 | 3.1 ± 2.5 |
| Sox30 | 72 ± 12 | ≥5000 | 910 ± 130 | 3100 ± 500 | 990 ± 120 | 460 ± 100 |
| Lef1 | 1.6 ± 0.3 | 1000 ± 300 | 61 ± 8 | 630 ± 120 | 89 ± 8 | 4.1 ± 3.2 |
Note: N=at least 3 with standard deviation reported.
The binding affinity for Sox17 (Figure 2C) and the other HMGB domains to dsDNA that does not contain the consensus core motif was measured to assess the specificity for core consensus dsDNA and an EMSA confirms this qualitatively weaker affinity (Figure 2D and Figure S5). All HMGB domains bound the non-consensus dsDNA with measured KD, apps of 300 – 1200 nM (Table 1), reflecting a >1000-fold greater affinity of consensus DNA for most domains. The exception was Sox6 which only discriminates by ~40-fold; this lower specificity could be due to lower affinity for the FGF4 enhancer sequence compared to the other Sox HMGB domains. Differences in affinity when bases at the 5’ and/or 3’ ends of the core TTGT motif are altered, were observed for SRY,57 which may explain Sox6’s observed lower affinity for FGF4.
As evidence for Sox proteins acting as RNA binders in vivo continues to grow, it is important to investigate if there are any family-wide RNA preferences. Since potential high affinity RNA-binding sites are unknown for Sox family HMGB domains, we developed a set of four model RNAs representing various structural and sequence motifs to assess binding (sequence and secondary structures shown in Figure S6 and Table S4). We selected a fragment of the ES2 lncRNA previously shown to bind the Sox2 HMGB domain with high affinity in vitro9 while the full length ES2 lncRNA associates with Sox2 in vivo.14 In addition, we previously found that the Sox2 HMGB domain bound small duplex and hairpin RNAs.9 Another study identified stem-loop RNA targets of Sox2 using SELEX.18 Other HMGB proteins directly bind similar RNAs, including TCF7’s interaction with double stranded RNA with an internal bulge48, HMGB1’s interaction with hairpin RNA62, and HMGD binding dsRNA stem loops containing an internal bulge.63 Thus, we included a small hairpin RNA with an internal bulge in this survey. Other HMGB proteins directly interact with RNA 4-way junctions64, 65, and therefore an RNA 4-way junction substrate was included. Finally, Sox2, Sox21, and other HMGB proteins directly and/or indirectly interact with G-quadruplex RNA.29, 66 Thus, the well-characterized TERRA G-quadruplex RNA was used in this study.67 The RNAs used in this study were 3’ fluorophore labeled and purified to homogeneity (Figure S7).
These RNA ligands were each assessed for binding to the panel of Sox family and Lef1 HMGB domains using the FA binding assay. As a representative, the Sox17 HMGB domain binding with the four RNAs is shown in Figure 3. The KD, app was measured by FA for each RNA under the same conditions as the previously described dsDNAs (Figure 3A, C, E, G) and EMSAs were used to assess whether multiple Sox17 HMGB domains bind the RNA ligands (Figure 3B, D, F, H). Sox17 binds these four RNAs with affinities ranging from 0.2 – 6.9 nM and multiple transitions are observed in the FA assays for all RNA ligands (Figure 3A, C, E, G). This is likely due to multiple Sox17 binding sites with a variety of affinities (Figure 3B, D, F, H). For example, at a 2:1 Sox17:lncRNA ES2 fragment ratio, three bound bands are present (Figure 3B and Figure S8). The RNA 4-way junction appears to accommodate up to four Sox17 molecules at a 3.25:1 ratio (Figure S9) and at a 0.25:1 ratio a second band is already present, suggesting these binding sites may have similar affinities (Figure 3F, S9). In the case of the smaller stem loop with internal bulge RNA and TERRA G-quadruplex, at 2:1 (Figure 3D, H) and 3:1 (Figure S10, S11) ratios respectively, the RNAs still only form two predominant species suggesting the RNAs can accommodate two HMGB domains or that the other potential sites are very low affinity. While the TERRA RNA is homogenous under denaturing conditions (Figure S7), under native conditions there is one predominate species and a lesser populated species (Figure 3H, Figure S11). This could be a result of unfolded and folded states, TERRA’s ability to bind either K+ and Na+ ions,69, 70 or to dimerize.71 Interestingly, Sox17 appears to prefer the predominate TERRA species, further suggesting Sox17’s selectivity of certain RNA features. Thus, Sox17 appears to be able to bind to a variety of RNA features with high affinity and to distinct sites within the same RNA.
Figure 3.

Sox17 binds various RNA ligands with high affinity. A. FA assay with lncRNA ES2 fragment. B. EMSA to determine protein:RNA stoichiometry with lncRNA ES2 fragment. C. FA assay with stem loop with internal bulge. D. EMSA with stem loop with internal bulge. E. FA assay with RNA 4-way junction. F. EMSA with RNA 4-way junction. G. FA assay with TERRA G-quadruplex RNA H. EMSA with TERRA G-quadruplex. Sfold68 predicted secondary structures of nucleic acid ligands used in this study are overlayed into each binding curve. Fit FA binding curves of Sox17 HMGB domain against each ligand with KD, app obtained from first transition, N= at least 3, standard deviation reported.
We also observed that almost all the HMGB domains directly interact with RNA ligands with a wide range of nM affinities (Table 1). These affinities are consistent with previous reports of Sox2’s affinity for RNA ligands, 10–100 nM,9 and are also within the range observed for other HMGB proteins, such as TFAM’s affinity to G-quadruplex RNA of ~1 nM.66 Importantly, most of the affinities for RNA were significantly tighter than those observed to non-consensus dsDNA and, in many cases, approximately the same or comparable as to the consensus dsDNA containing the Sox core motif (Table 1). The one exception to our observed general pattern of binding was Sox30, which displayed lower affinities for all RNAs tested in this study. It is not clear why Sox30 does not bind RNA with high affinity, but it should be noted that it is one of the least conserved members of the Sox family and is missing some of the conserved positively charged residues within the Sox HMGB domain that contribute to nucleic acid binding (Figure 1).
In addition to high affinity RNA binding, the HMGB domains displayed varying degrees of selectivity between the RNA substrates. Binding affinities of all RNA ligands for each HMGB domain from Table 1 were compared to each other and ranked from highest affinity (1) to lowest affinity (4) (Figure 4). This representation of data from Table 1 allows a direct and straightforward visualization to identify HMGB domains that exhibit divergent preferences amongst the RNA substrates. Most of the HMGB domains bind the TERRA G-quadruplex RNA substrate with the highest affinity. Selective recognition of this motif by Sox HMGB domains suggest interactions could occur in vivo. In contrast, the majority of the HMGB domains bound the stem-loop with an internal bulge the weakest except for SRY and Sox15. However, most Sox HMGB domains showed a strong preference for the longer ES2 lncRNA hairpin which contains several internal loops, suggesting that more complex RNA structures might be targeted by these proteins. As Sox proteins have diverse, yet distinct and highly specific tissue expression with separate genomic occupancies to regulate gene expression, differences in Sox-HMGB RNA selectivity may be important to promote RNA-binding in each Sox protein’s native cellular environment with their distinct transcriptomes, which merits further exploration.
Figure 4.

HMGB domains show selectivity for RNA ligands. Each protein’s affinity for the RNA ligand is ranked from highest affinity ligand to lowest affinity ligand using data from Table 1. SL w/ IB is a Stem Loop with an Internal Bulge.
The driving factors for selectivity are not yet known but likely arise from differences in each Sox protein’s HMGB domain that favor RNA structural features and/or sequence preferences. As our small RNA library was not normalized by size and electrostatics, known to be important factors in Sox HMGB-nucleic acid interactions, it is possible the size of the RNA impacts affinity by presenting multiple binding sites. For example, many of the HMGBs bound the smaller stem loop with internal bulge RNA (35 bases) with weaker affinities than to the more complex ES2 lncRNA (89 bases). However, the highest affinity RNA ligand overall was the TERRA G-quadruplex (27 bases). Together, these data suggest that RNA size is not the primary factor driving high affinities and RNA structure and/or sequence preferences dominate.
This survey of Sox-family and Lef1 HMGB domains’ ability to bind RNA in vitro serves to advance a developing understanding of DNA-binding domains that directly interact with RNA. Clear evidence for the Sox family as an RNA-binding family has emerged from several studies that indicate multiple Sox proteins directly and/or indirectly with RNA in vivo, localize to RNP complexes, and interact with RBPs. This study furthers this understanding by providing compelling evidence that nine Sox HMGB domains and the Lef1 HMGB domain directly interact with a variety of RNA substrates with affinities comparable to that of their consensus dsDNA targets. Notably, all the HMGB domains exhibit selectivity for distinct RNA ligands which could play a role in binding specific RNAs in the context of each protein’s cellular environment. While this study does not demonstrate RNA interactions in vivo or that these direct interactions impact Sox/Lef1 family driven biology, the collective evidence presented herein provides motivation for further exploration of unappreciated aspects of the biological functions of HMGB proteins.
Supplementary Material
ACKNOWLEDGEMENTs
We thank the Shared Instrument Pool (SIP) core facility (RRID SCR_018986), Department of Biochemistry, University of Colorado Boulder, for the use of the Typhoon FLA 9500 and centrifuges.
Funding Sources
This work was supported by a grant from the National Institutes of Health to D.S.W. and R.T.B. (R01 GM120347). D.J.H. was supported in part through the Biophysics Training Program, funded through the National Institutes of Health (T32 GM065103) and A.E.H. was partially supported through a University of Colorado undergraduate research opportunity program.
ABBREVIATIONS
- FA
Fluorescence anisotropy
- FGF4
Fibroblast growth factor 4
- HMGB
High mobility group box
- RIP
RNA immunoprecipitation
- RNP
Ribonucleoprotein particle
- Sox
Sex-determining region Y-Box
- TF
Transcription factor
Footnotes
Supporting Information.
The Supporting Information is available free of charge at (web address). Supporting Information contains materials and methods, supplementary Figures and Tables.
UniProt Accession IDs for proteins used in this study: SRY-Q05066; Sox2-P48431; Sox6-P35712; Sox9-P48436; Sox11-P35716; Sox15-O60248; Sox17-Q9H6I2; Sox21-Q9Y651; Sox30-O94993; Lef1-Q9UJU2.
The authors declare no competing financial interest.
REFERENCES
- [1].Hudson WH, and Ortlund EA (2014) The structure, function and evolution of proteins that bind DNA and RNA, Nat Rev Mol Cell Biol 15, 749–760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Hentze MW, Castello A, Schwarzl T, and Preiss T (2018) A brave new world of RNA-binding proteins, Nat Rev Mol Cell Biol 19, 327–341. [DOI] [PubMed] [Google Scholar]
- [3].Caudron-Herger M, Rusin SF, Adamo ME, Seiler J, Schmid VK, Barreau E, Kettenbach AN, and Diederichs S (2019) R-DeeP: Proteome-wide and Quantitative Identification of RNA-Dependent Proteins by Density Gradient Ultracentrifugation, Mol Cell 75, 184–199 e110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [4].Conrad T, Albrecht AS, de Melo Costa VR, Sauer S, Meierhofer D, and Orom UA (2016) Serial interactome capture of the human cell nucleus, Nat Commun 7, 11212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [5].He C, Sidoli S, Warneford-Thomson R, Tatomer DC, Wilusz JE, Garcia BA, and Bonasio R (2016) High-Resolution Mapping of RNA-Binding Regions in the Nuclear Proteome of Embryonic Stem Cells, Mol Cell 64, 416–430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Trendel J, Schwarzl T, Horos R, Prakash A, Bateman A, Hentze MW, and Krijgsveld J (2019) The Human RNA-Binding Proteome and Its Dynamics during Translational Arrest, Cell 176, 391–403 e319. [DOI] [PubMed] [Google Scholar]
- [7].Castello A, Fischer B, Frese CK, Horos R, Alleaume AM, Foehr S, Curk T, Krijgsveld J, and Hentze MW (2016) Comprehensive Identification of RNA-Binding Domains in Human Cells, Mol Cell 63, 696–710. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Urdaneta EC, Vieira-Vieira CH, Hick T, Wessels HH, Figini D, Moschall R, Medenbach J, Ohler U, Granneman S, Selbach M, and Beckmann BM (2019) Purification of cross-linked RNA-protein complexes by phenol-toluol extraction, Nat Commun 10, 990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].Holmes ZE, Hamilton DJ, Hwang T, Parsonnet NV, Rinn JL, Wuttke DS, and Batey RT (2020) The Sox2 transcription factor binds RNA, Nat Commun 11, 1805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Cajigas I, Chakraborty A, Lynam M, Swyter KR, Bastidas M, Collens L, Luo H, Ay F, and Kohtz JD (2021) Sox2-Evf2 lncRNA-mediated mechanisms of chromosome topological control in developing forebrain, Development 148. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Tung CL, Hou PH, Kao YL, Huang YW, Shen CC, Cheng YH, Wu SF, Lee MS, and Li C (2010) SOX2 modulates alternative splicing in transitional cell carcinoma, Biochem Biophys Res Commun 393, 420–425. [DOI] [PubMed] [Google Scholar]
- [12].Jiang Y, Jiang YY, Xie JJ, Mayakonda A, Hazawa M, Chen L, Xiao JF, Li CQ, Huang ML, Ding LW, Sun QY, Xu L, Kanojia D, Jeitany M, Deng JW, Liao LD, Soukiasian HJ, Berman BP, Hao JJ, Xu LY, Li EM, Wang MR, Bi XG, Lin DC, and Koeffler HP (2018) Co-activation of super-enhancer-driven CCAT1 by TP63 and SOX2 promotes squamous cancer progression, Nat Commun 9, 3619. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Guo X, Wang Z, Lu C, Hong W, Wang G, Xu Y, Liu Z, and Kang J (2018) LincRNA-1614 coordinates Sox2/PRC2-mediated repression of developmental genes in pluripotency maintenance, J Mol Cell Biol 10, 118–129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [14].Ng SY, Johnson R, and Stanton LW (2012) Human long non-coding RNAs promote pluripotency and neuronal differentiation by association with chromatin modifiers and transcription factors, EMBO J 31, 522–533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Ng SY, Bogu GK, Soh BS, and Stanton LW (2013) The long noncoding RNA RMST interacts with SOX2 to regulate neurogenesis, Mol Cell 51, 349–359. [DOI] [PubMed] [Google Scholar]
- [16].Jing R, Guo X, Yang Y, Chen W, Kang J, and Zhu S (2020) Long noncoding RNA Q associates with Sox2 and is involved in the maintenance of pluripotency in mouse embryonic stem cells, Stem Cells 38, 834–848. [DOI] [PubMed] [Google Scholar]
- [17].Zhuang W, Ge X, Yang S, Huang M, Zhuang W, Chen P, Zhang X, Fu J, Qu J, and Li B (2015) Upregulation of lncRNA MEG3 Promotes Osteogenic Differentiation of Mesenchymal Stem Cells From Multiple Myeloma Patients By Targeting BMP4 Transcription, Stem Cells 33, 1985–1997. [DOI] [PubMed] [Google Scholar]
- [18].Hou L, Wei Y, Lin Y, Wang X, Lai Y, Yin M, Chen Y, Guo X, Wu S, Zhu Y, Yuan J, Tariq M, Li N, Sun H, Wang H, Zhang X, Chen J, Bao X, and Jauch R (2020) Concurrent binding to DNA and RNA facilitates the pluripotency reprogramming activity of Sox2, Nucleic Acids Res 48, 3869–3887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Zhang S, and Cui W (2014) Sox2, a key factor in the regulation of pluripotency and neural differentiation, World J Stem Cells 6, 305–311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Wuebben EL, and Rizzino A (2017) The dark side of SOX2: cancer - a comprehensive overview, Oncotarget 8, 44917–44943. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Takahashi K, and Yamanaka S (2006) Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors, Cell 126, 663–676. [DOI] [PubMed] [Google Scholar]
- [22].Rizzino A, and Wuebben EL (2016) Sox2/Oct4: A delicately balanced partnership in pluripotent stem cells and embryogenesis, Biochim Biophys Acta 1859, 780–791. [DOI] [PubMed] [Google Scholar]
- [23].Schepers GE, Teasdale RD, and Koopman P (2002) Twenty pairs of sox: extent, homology, and nomenclature of the mouse and human sox transcription factor gene families, Dev Cell 3, 167–170. [DOI] [PubMed] [Google Scholar]
- [24].Phochanukul N, and Russell S (2010) No backbone but lots of Sox: Invertebrate Sox genes, Int J Biochem Cell Biol 42, 453–464. [DOI] [PubMed] [Google Scholar]
- [25].Guth SI, and Wegner M (2008) Having it both ways: Sox protein function between conservation and innovation, Cell Mol Life Sci 65, 3000–3018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Girardot M, Bayet E, Maurin J, Fort P, Roux P, and Raynaud P (2018) SOX9 has distinct regulatory roles in alternative splicing and transcription, Nucleic Acids Res 46, 9106–9118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Hu X, Li Q, and Zhang J (2019) The Long Noncoding RNA LINC00908 Facilitates Hepatocellular Carcinoma Progression Via Interaction With Sox-4, Cancer Manag Res 11, 8789–8797. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- [28].Stamou M, Ng SY, Brand H, Wang H, Plummer L, Best L, Havlicek S, Hibberd M, Khor CC, Gusella J, Balasubramanian R, Talkowski M, Stanton LW, and Crowley WF (2020) A Balanced Translocation in Kallmann Syndrome Implicates a Long Noncoding RNA, RMST, as a GnRH Neuronal Regulator, J Clin Endocrinol Metab 105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [29].Serikawa T, Spanos C, von Hacht A, Budisa N, Rappsilber J, and Kurreck J (2018) Comprehensive identification of proteins binding to RNA G-quadruplex motifs in the 5’ UTR of tumor-associated mRNAs, Biochimie 144, 169–184. [DOI] [PubMed] [Google Scholar]
- [30].Hou L, Srivastava Y, and Jauch R (2017) Molecular basis for the genome engagement by Sox proteins, Semin Cell Dev Biol 63, 2–12. [DOI] [PubMed] [Google Scholar]
- [31].Bowles J, Schepers G, and Koopman P (2000) Phylogeny of the SOX family of developmental transcription factors based on sequence and structural indicators, Dev Biol 227, 239–255. [DOI] [PubMed] [Google Scholar]
- [32].Remenyi A, Lins K, Nissen LJ, Reinbold R, Scholer HR, and Wilmanns M (2003) Crystal structure of a POU/HMG/DNA ternary complex suggests differential assembly of Oct4 and Sox2 on two enhancers, Genes Dev 17, 2048–2059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Murphy EC, Zhurkin VB, Louis JM, Cornilescu G, and Clore GM (2001) Structural basis for SRY-dependent 46-X,Y sex reversal: modulation of DNA bending by a naturally occurring point mutation, J Mol Biol 312, 481–499. [DOI] [PubMed] [Google Scholar]
- [34].Weiss MA (2001) Floppy SOX: mutual induced fit in hmg (high-mobility group) box-DNA recognition, Mol Endocrinol 15, 353–362. [DOI] [PubMed] [Google Scholar]
- [35].Jarvelin AI, Noerenberg M, Davis I, and Castello A (2016) The new (dis)order in RNA regulation, Cell Commun Signal 14, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Kamachi Y, Uchikawa M, and Kondoh H (2000) Pairing SOX off: with partners in the regulation of embryonic development, Trends Genet 16, 182–187. [DOI] [PubMed] [Google Scholar]
- [37].Thul PJ, Akesson L, Wiking M, Mahdessian D, Geladaki A, Blal HA, Alm T, Asplund A, Bjork L, Breckels LM, Backstrom A, Danielsson F, Fagerberg L, Fall J, Gatto L, Gnann C, Hober S, Hjelmare M, Johansson F, Lee S, Lindskog C, Mulder J, Mulvey CM, Nilsson P, Oksvold P, Rockberg J, Schutten R, Schwenk JM, Sivertsson A, Sjostedt E, Skogs M, Stadler C, Sullivan DP, Tegel H, Winsnes C, Zhang C, Zwahlen M, Mardinoglu A, Ponten F, von Feilitzen K, Lilley KS, Uhlen M, and Lundberg E (2017) A subcellular map of the human proteome, Science 356. [DOI] [PubMed] [Google Scholar]
- [38].Sato Y, Yano S, Ewis AA, and Nakahori Y (2011) SRY interacts with ribosomal proteins S7 and L13a in nuclear speckles, Cell Biol Int 35, 449–452. [DOI] [PubMed] [Google Scholar]
- [39].Ohe K, Lalli E, and Sassone-Corsi P (2002) A direct role of SRY and SOX proteins in pre-mRNA splicing, P Natl Acad Sci USA 99, 1146–1151. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [40].Hata K, Nishimura R, Muramatsu S, Matsuda A, Matsubara T, Amano K, Ikeda F, Harley VR, and Yoneda T (2008) Paraspeckle protein p54(nrb) links Sox9-mediated transcription with RNA processing during chondrogenesis in mice, J Clin Invest 118, 3098–3108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [41].Penrad-Mobayed M, Perrin C, L’Hote D, Contremoulins V, Lepesant JA, Boizet-Bonhoure B, Poulat F, Baudin X, and Veitia RA (2018) A role for SOX9 in post-transcriptional processes: insights from the amphibian oocyte, Sci Rep-Uk 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [42].Markmiller S, Soltanieh S, Server KL, Mak R, Jin WH, Fang MY, Luo EC, Krach F, Yang DJ, Sen A, Fulzele A, Wozniak JM, Gonzalez DJ, Kankel MW, Gao FB, Bennett EJ, Lecuyer E, and Yeo GW (2018) Context-Dependent and Disease-Specific Diversity in Protein Interactions within Stress Granules, Cell 172, 590–+. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Samudyata, Amaral PP, Engstrom PG, Robson SC, Nielsen ML, Kouzarides T, and Castelo-Branco G (2019) Interaction of Sox2 with RNA binding proteins in mouse embryonic stem cells, Exp Cell Res 381, 129–138. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [44].Alonso-Lopez D, Campos-Laborie FJ, Gutierrez MA, Lambourne L, Calderwood MA, Vidal M, and De Las Rivas J (2019) APID database: redefining protein-protein interaction experimental evidences and binary interactomes, Database-Oxford. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [45].Bateman A, and Martin MJ, and Orchard S, and Magrane M, and Agivetova R, and Ahmad S, and Alpi E, and Bowler-Barnett EH, and Britto R, and Bursteinas B, and Bye-A-Jee H, and Coetzee R, and Cukura A, and Da Silva A, and Denny P, and Dogan T, and Ebenezer T, and Fan J, and Castro LG, and Garmiri P, and Georghiou G, and Gonzales L, and Hatton-Ellis E, and Hussein A, and Ignatchenko A, and Insana G, and Ishtiaq R, and Jokinen P, and Joshi V, and Jyothi D, and Lock A, and Lopez R, and Luciani A, and Luo J, and Lussi Y, and Mac-Dougall A, and Madeira F, and Mahmoudy M, and Menchi M, and Mishra A, and Moulang K, and Nightingale A, and Oliveira CS, and Pundir S, and Qi GY, and Raj S, and Rice D, and Lopez MR, and Saidi R, and Sampson J, and Sawford T, and Speretta E, and Turner E, and Tyagi N, and Vasudev P, and Volynkin V, and Warner K, and Watkins X, and Zaru R, and Zellner H, and Bridge A, and Poux S, and Redaschi N, and Aimo L, and Argoud-Puy G, and Auchincloss A, and Axelsen K, and Bansal P, and Baratin D, and Blatter MC, and Bolleman J, and Boutet E, and Breuza L, and Casals-Casas C, and de Castro E, and Echioukh KC, and Coudert E, and Cuche B, and Doche M, and Dornevil D, and Estreicher A, and Famiglietti ML, and Feuermann M, and Gasteiger E, and Gehant S, and Gerritsen V, and Gos A, and Gruaz-Gumowski N, and Hinz U, and Hulo C, and Hyka-Nouspikel N, and Jungo F, and Keller G, and Kerhornou A, and Lara V, and Le Mercier P, and Lieberherr D, and Lombardot T, and Martin X, and Masson P, and Morgat A, and Neto TB, and Paesano S, and Pedruzzi, and Pilbout S, and Pourcel L, and Pozzato M, and Pruess M, and Rivoire C, and Sigrist C, and Sonesson K, and Stutz A, and Sundaram S, and Tognolli M, and Verbregue L, and Wu CH, and Arighi CN, and Arminski L, and Chen CM, and Chen YX, and Garavelli JS, and Huang HZ, and Laiho K, and McGarvey P, and Natale DA, and Ross K, and Vinayaka CR, and Wang QH, and Wang YQ, and Yeh LS, and Zhang J, and Consortium U (2021) UniProt: the universal protein knowledgebase in 2021, Nucleic Acids Research 49, D480–D489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46].Travis A, Amsterdam A, Belanger C, and Grosschedl R (1991) Lef-1, a Gene Encoding a Lymphoid-Specific with Protein, an Hmg Domain, Regulates T-Cell Receptor-Alpha Enhancer Function, Gene Dev 5, 880–894. [DOI] [PubMed] [Google Scholar]
- [47].Hrckulak D, Kolar M, Strnad H, and Korinek V (2016) TCF/LEF Transcription Factors: An Update from the Internet Resources, Cancers 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [48].Park MW, Choi KH, and Jeong S (2005) Inhibition of the DNA binding by the TCF-1 binding RNA aptamer, Biochem Biophys Res Commun 330, 11–17. [DOI] [PubMed] [Google Scholar]
- [49].Mertin S, McDowall SG, and Harley VR (1999) The DNA-binding specificity of SOX9 and other SOX proteins, Nucleic Acids Research 27, 1359–1364. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [50].Fufa TD, Harris ML, Watkins-Chow DE, Levy D, Gorkin DU, Gildea DE, Song LY, Safi A, Crawford GE, Sviderskaya EV, Bennett DC, Mccallion AS, Loftus SK, and Pavan WJ (2015) Genomic analysis reveals distinct mechanisms and functional classes of SOX10-regulated genes in melanocytes, Hum Mol Genet 24, 5433–5450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [51].Kuo PY, Leshchenko VV, Fazzari MJ, Perumal D, Gellen T, He T, Iqbal J, Baumgartner-Wennerholm S, Nygren L, Zhang F, Zhang W, Suh KS, Goy A, Yang DT, Chan WC, Kahl BS, Verma AK, Gascoyne RD, Kimby E, Sander B, Ye BH, Melnick AM, and Parekh S (2015) High-resolution chromatin immunoprecipitation (ChIP) sequencing reveals novel binding targets and prognostic role for SOX11 in mantle cell lymphoma, Oncogene 34, 1231–1240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [52].Aksoy I, Jauch R, Chen JX, Dyla M, Divakar U, Bogu GK, Teo R, Ng CKL, Herath W, Sun LL, Hutchins AP, Robson P, Kolatkar PR, and Stanton LW (2013) Oct4 switches partnering from Sox2 to Sox17 to reinterpret the enhancer code and specify endoderm, Embo Journal 32, 938–953. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [53].Yuan HB, Corbi N, Basilico C, and Dailey L (1995) Developmental-Specific Activity of the Fgf-4 Enhancer Requires the Synergistic Action of Sox2 and Oct-3, Gene Dev 9, 2635–2645. [DOI] [PubMed] [Google Scholar]
- [54].Scaffidi P, and Bianchi ME (2001) Spatially precise DNA bending is an essential activity of the Sox2 transcription factor, J Biol Chem 276, 47296–47302. [DOI] [PubMed] [Google Scholar]
- [55].Giese K, and Grosschedl R (1993) Lef-1 Contains an Activation Domain That Stimulates Transcription Only in a Specific Context of Factor-Binding Sites, Embo Journal 12, 4667–4676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [56].Sanchez-Moreno I, Coral-Vazquez R, Mendez P, and Canto P (2008) Full-length SRY protein is essential for DNA binding, Mol Hum Reprod 14, 325–330. [DOI] [PubMed] [Google Scholar]
- [57].Giese K, Pagel J, and Grosschedl R (1994) Distinct DNA-Binding Properties of the High-Mobility Group Domain of Murine and Human Sry Sex-Determining Factors, P Natl Acad Sci USA 91, 3368–3372. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [58].Connor F, Cary PD, Read CM, Preston NS, Driscoll PC, Denny P, Crane-Robinson C, and Ashworth A (1994) DNA binding and bending properties of the post-meiotically expressed Sry-related protein Sox-5, Nucleic Acids Res 22, 3339–3346. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].Moosa MM, Tsoi PS, Choi KJ, Ferreon ACM, and Ferreon JC (2018) Direct Single-Molecule Observation of Sequential DNA Bending Transitions by the Sox2 HMG Box, Int J Mol Sci 19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Anderson BJ, Larkin C, Guja K, and Schildbach JF (2008) Using fluorophore-labeled oligonucleotides to measure affinities of protein-DNA interactions, Methods Enzymol 450, 253–272. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [61].LeTilly V, and Royer CA (1993) Fluorescence anisotropy assays implicate protein-protein interactions in regulating trp repressor DNA binding, Biochemistry-Us 32, 7753–7758. [DOI] [PubMed] [Google Scholar]
- [62].Yu R, Yang D, Lei S, Wang X, Meng X, Xue B, and Zhu H (2015) HMGB1 Promotes Hepatitis C Virus Replication by Interaction with Stem-Loop 4 in the Viral 5’ Untranslated Region, J Virol 90, 2332–2344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [63].Arimondo PB, Gelus N, Hamy F, Payet D, Travers A, and Bailly C (2000) The chromosomal protein HMG-D binds to the TAR and RBE RNA of HIV-1, FEBS Lett 485, 47–52. [DOI] [PubMed] [Google Scholar]
- [64].Brown TA, Tkachuk AN, and Clayton DA (2015) Mitochondrial Transcription Factor A (TFAM) Binds to RNA Containing 4-Way Junctions and Mitochondrial tRNA, PLoS One 10, e0142436. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [65].Genzor P, and Bortvin A (2015) A Unique HMG-Box Domain of Mouse Maelstrom Binds Structured RNA but Not Double Stranded DNA, Plos One 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [66].Lyonnais S, Tarres-Soler A, Rubio-Cosials A, Cuppari A, Brito R, Jaumot J, Gargallo R, Vilaseca M, Silva C, Granzhan A, Teulade-Fichou MP, Eritja R, and Sola M (2017) The human mitochondrial transcription factor A is a versatile G-quadruplex binding protein, Sci Rep-Uk 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [67].Martadinata H, and Phan AT (2013) Structure of Human Telomeric RNA (TERRA): Stacking of Two G-Quadruplex Blocks in K+ Solution, Biochemistry-Us 52, 2176–2183. [DOI] [PubMed] [Google Scholar]
- [68].Ding Y, Chan CY, and Lawrence CE (2004) Sfold web server for statistical folding and rational design of nucleic acids, Nucleic Acids Res 32, W135–141. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [69].Xu Y, Kaminaga K, and Komiyama M (2008) G-quadruplex formation by human telomeric repeats-containing RNA in Na+ solution, J Am Chem Soc 130, 11179–11184. [DOI] [PubMed] [Google Scholar]
- [70].Zhang AY, and Balasubramanian S (2012) The kinetics and folding pathways of intramolecular G-quadruplex nucleic acids, J Am Chem Soc 134, 19297–19308. [DOI] [PubMed] [Google Scholar]
- [71].Collie GW, Parkinson GN, Neidle S, Rosu F, De Pauw E, and Gabelica V (2010) Electrospray mass spectrometry of telomeric RNA (TERRA) reveals the formation of stable multimeric G-quadruplex structures, J Am Chem Soc 132, 9328–9334. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
