Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2026 Feb 21.
Published in final edited form as: Cell. 2025 Dec 19;189(3):832–852.e24. doi: 10.1016/j.cell.2025.11.026

Nuclear speckle proteins form intrinsic and MALAT1-dependent microphases

Min Kyung Shinn 1,2, Dylan T Tomares 1,2,9, Vicky Liu 1,2,9, Avnika Pant 1,2, Yuanxin Qiu 2,3, Andreas Vitalis 4, You Jin Song 5,6, Yuna Ayala 7, Kiersten M Ruff 1,2, Gregory W Strout 8, Matthew D Lew 2,3, Kannanganattu V Prasanth 5,6, Rohit V Pappu 1,2,10,*
PMCID: PMC12922802  NIHMSID: NIHMS2139662  PMID: 41421357

SUMMARY

Pre-mRNA processing components in nuclear speckles encompass one or more folded RNA recognition motifs (RRMs) and disordered regions with specific sequence grammars. Such proteins include serine/arginine-rich splicing factors (SRSFs) and transactive response DNA binding protein (TDP)-43. The SRSFs and TDP-43 are unique archetypes of block copolymers encoding specific patterns of inter-domain homotypic and heterotypic attractions and repulsions. The interplay of these interactions drives microphase separation and the formation of ordered, size-limited assemblies. Microphases of SRSFs and TDP-43 are 23–45 nm in diameter, each comprising tens of molecules. Sub-micron-scale assemblies of SRSFs in cells are consistent with being clusters of microphases. The speckle-associated regulatory long non-coding RNA (lncRNA) metastasis-associated lung adenocarcinoma transcript 1 (MALAT1) binds specifically and preferentially to SRSF1 microphases, while destabilizing TDP-43 microphases. In protein mixtures, the interactions between microphases drive the formation of micron-scale double-emulsion structures with core-shell organization. Our findings show how interactions involving copolymers featuring folded domains and disordered regions drive the formation of microphases.

Graphical Abstract

graphic file with name nihms-2139662-f0001.jpg

In brief

In nuclear speckles, sequence-encoded inter-domain attractive and repulsive interactions drive proteins to form hierarchical structures, including size-limited microphases, clusters of microphases, micron-scale double emulsions in protein mixtures, and microphases stabilized by preferential and protein-specific binding to the lncRNA MALAT1.

INTRODUCTION

In metazoan cells, nuclear speckles function as hubs that regulate key steps of gene expression.1-8 Speckles are enriched in RNA-binding proteins that contribute to pre-mRNA splicing.8-13 Splicing factor activities are regulated by polyadenylated RNAs and regulatory long non-coding RNAs (lncRNAs) that are enriched in the transcriptomes of speckles.7,14-16

It has been proposed that nuclear speckles form via macrophase separation or liquid-liquid phase separation (LLPS) of individual components.17 Drivers of macrophase separation feature uniformly attractive interactions18,19 (Figure 1A), and macrophase sizes are orders of magnitude larger than the nanoscale dimensions of the underlying macromolecules, encompassing millions of molecules. At equilibrium, a macrophase would be a single dense phase that coexists with the dilute phase.20 However, equilibrium macrophases are not typically observed in cells. Control over size can arise through active processes,21,22 modulation by solid particles,23 confinement effects,24,25 or dynamical arrest.26 Super-resolution structured illumination microscopy has shown that nuclear speckle components are organized inhomogeneously into distinct territories, forming size-limited sub-micron-scale structures.27-33 Based on this, we conjectured that nuclear speckles might be emulsions of microphases formed by block copolymer-like molecules.

Figure 1. SRSF1 is an unconventional block copolymer.

Figure 1.

(A) Schematic of a linear flexible homopolymer that undergoes macrophase separation or LLPS featuring uniformity of attractions (blue lines). Above csat, a single micron-scale dense phase should form that coexists with the dilute phase. Inset shows a schematic of how millions of molecules would be homogeneously mixed within dense phases.

(B) Schematic of a conventional di-block copolymer featuring disordered A- (purple) and B-blocks (green). Homotypic inter-block interactions are attractive, whereas heterotypic interactions are repulsive.

(C) Sequence architecture of hnRNP-A1, which forms macrophases in vitro and in cells.

(D) Space-filling model of hnRNP-A1. Residues are colored based on residue type, with blue, red, green, and white spheres signifying basic, acidic, polar, and hydrophobic residues, respectively.

(E) Sequence architecture of SRSF1.

(F) Space-filling model of SRSF1.

(G) Computed ΔXY values for pairs of domains within hnRNP-A1.

(H) Computed ΔXY values for pairs of SRSF1 domains.

(I) Sequence architecture of SRSF1Δ213–247.

(J) Space-filling model of SRSF1Δ213–247.

(K) Computed ΔXY values for pairs of SRSF1Δ213–247 domains.

See also Figure S1.

Conventional block copolymers comprise disordered blocks that are mutually incompatible with one another34-42 (Figure 1B). The homotypic inter-block interactions are attractive, and heterotypic inter-block interactions are repulsive36,40-43 (Figure 1B). At finite concentrations, the homotypic inter-block attractions can drive phase separation. These attractions compete against heterotypic inter-block repulsions that oppose growth into macrophases. The result is the formation of size-stabilized assemblies known as microphases that are defined by system-specific nanoscale morphologies and internal ordering.36,41,44

Here, we show that speckle-associated serine/arginine-rich splicing factor (SRSF)1, SRSF3, SRSF5, SRSF7, and TDP (transactive response DNA binding protein)-43 are unconventional block copolymers that undergo microphase separation. The regulatory lncRNA MALAT1 (metastasis-associated lung adenocarcinoma transcript 1) binds specifically and preferentially to remodel SRSF1 microphases.45

RESULTS

Establishing the block copolymeric nature of SRSF1

Multi-domain RNA recognition motif (RRM)-containing proteins, such as hnRNP-A1 (Figures 1C and 1D) and SRSF1 (Figures 1D and 1F), have distinct RRMs and intrinsically disordered regions (IDRs). Folded RRMs are pre-organized by intra-domain microphase separation46 and have protein-specific physicochemical properties (Figures S1A-S1F). The sequence grammars of the IDRs are also different.47-49 IDR1 in SRSF1 and the low complexity domain (LCD) of hnRNP-A1 (A1-LCD) are enriched in Gly-rich patches, high non-random Gly content, and the presence of arginine-glycine (RG)-motifs (Figure S1G). However, the two regions differ in the Phe, Asn, and aromatic contents (enriched in A1-LCD), and the extent of linear segregation of Gly residues (high in IDR1 of SRSF1). The IDR2 of SRSF1 shows non-random enrichments of Ser/Arg residues (Frac S, Frac R, R patch, S patch).

Next, we used atomistic simulations to characterize homotypic and heterotypic inter-domain interactions for hnRNP-A1 (Figure 1G) and SRSF1 (Figure 1H). For each pair of domains X and Y, we extracted an inter-domain interaction parameter, ΔXY, where 1ΔXY+1 (see STAR Methods). Negative values of ΔXY signify attractions, whereas positive values reflect repulsions. The magnitudes of ΔXY quantify the strengths of interactions. If ΔXY0, then interactions between X and Y are nearly ideal. Note that each element in the inter-domain interaction matrix is a grand sum over the collection of inter-residue attractions and repulsions, and changes at the residue level directly affect the calculated ΔXY.50

For hnRNP-A1, all inter-domain interactions are either attractive or nearly ideal (ΔXY<0 or ΔXY0) (Figure 1G). This explains why it drives macrophase separation.51,52 For SRSF1, the homotypic inter-domain interactions are either strongly repulsive (RRM1-RRM1, IDR1-IDR1, IDR2-IDR2) or nearly ideal (RRM2-RRM2) (Figure 1H). Heterotypic IDR-RRM interactions (IDR1-RRM1, IDR1-RRM2, IDR2-RRM1, and IDR2-RRM2) are attractive, whereas heterotypic IDR1-IDR2, and RRM1-RRM2 interactions are repulsive. The pattern of homotypic repulsions combined with heterotypic attractions and repulsions makes SRSF1 an unconventional tetra-block copolymer (compare Figure 1H to Figure 1B). Cellular studies have used SRSF1Δ213–247, featuring an internal deletion within IDR245,53,54 (Figures 1I and 1J). The interactions for SRSF1Δ213–247 are qualitatively similar to those of SRSF1, although the inter-IDR repulsions are weaker. The interactions for SRSF1Δ213–247 may be viewed as an interpolation of interaction patterns of SRSF1 and SRSF9, which is another tetrablock SRSF (Figures 1H and 1K versus Figures S1H-S1J).

Threshold concentrations for phase separation

We measured right-angle light scattering (RALS) in vitro to assess the existence of a threshold concentration (cμ) for phase separation of SRSF1 and SRSF1Δ213–247.55 For microphase-forming systems, the existence of a threshold concentration signifies a system in the strong segregation limit.44 We performed RALS measurements over a 100-fold concentration range in aqueous buffers (10% glycerol, 20 mM HEPES, pH 7.4, 50 mM KCl, 5 mM MgCl2) and observed discontinuities at 0.57 ± 0.03 μM for SRSF1 and 0.45 ± 0.05 μM for SRSF1Δ213–247 (Figures 2A, S2A, and S2B).45 The measured cμ increased with increasing [KCl] (Figure S2C). To compare sequence-intrinsic interactions, all subsequent measurements for all systems were performed using identical solution conditions. Measurements of time-dependent fluorescence intensities of pyrene, a probe that partitions into dense phases if the microenvironments are distinct from those of dilute phases,56 yielded estimates of cμ that were consistent with RALS (Figures S2D and S2E).

Figure 2. SRSF1 and SRSF1Δ213–247 form microphases.

Figure 2.

(A) cμ of SRSF1 = 0.57 ± 0.03 μM and for SRSF1Δ213–247 = 0.45 ± 0.05. Legends show the mean (d¯) ± standard deviation (SD) and median (d~) values.

(B) QF-DEEM images at three different magnifications of the microphases formed by SRSF1.

(C) QF-DEEM images at three different magnifications of SRSF1Δ213–247 microphases.

(D and E) Histograms of diameters of microphases constructed from 1,516 images for SRSF1 and (E) 753 images for SRSF1Δ213–247 (mean ± SD).

(F and G) Show results from TRFQ experiments (mean ± SD, see STAR Methods). Estimates of nμ, values of slopes, and unreliability index (Φ) are shown.

(H) Histogram of Rg values for SRSF1 from atomistic simulations.

(I) Histogram of Rg,μ values from atomistic simulations of microphases comprising 24 SRSF1 molecules (mean ± SD).

(J) Profiles of gD(r). Legend provides annotation of color coding, and the inset is a zoomed-in version.

(K) Representative snapshot from simulations showing 24 SRSF1 molecules within a microphase (see Video S1).

(L) Histogram of Rg values for SRSF1Δ213–247 extracted from atomistic simulations.

(M) Histogram of Rg,μ values from simulations of microphases comprising 48 SRSF1Δ213–247 molecules (mean ± SD).

(N) Profiles of gD(r).

(O) Representative snapshot from simulations showing 48 SRSF1Δ213–247 molecules within a microphase.

See also Figure S2.

Structural characterization reveals the formation of microphases

We used quick freeze deep-etch electron microscopy (QF-DEEM) for structural characterization of microphases. Protein samples were flash-frozen against a polished copper block cooled by liquid He at 4 K57 and fractured at 170 K to optimize etching, deposition, and imaging. This avoided drying, wicking, or exposure to air-water interfaces.58 SRSF1 formed spheroidal nanoscale structures (Figure 2B) whereas SRSF1Δ213–247 formed biconcave-disc-like structures (Figure 2C). We analyzed images spanning a large field of view to quantify the diameters of 1,516 SRSF1 (Figure S2F; see STAR Methods) and 753 SRSF1Δ213–247 microphases. The mean and median values for the diameters are 23.6 ± 6.0 and 22.4 nm for SRSF1 and 41.0 ± 8.7 and 42.5 nm for SRSF1Δ213–247 (Figure 2D).

Next, we estimated the number of protein molecules (nμ) per microphase by measuring time-resolved fluorescence quenching (TRFQ) of pyrene in the presence of cetylpyridinium chloride as a quencher.59 We estimated nμ of ~23 for SRSF1 and ~50 for SRSF1Δ213–247 microphases (Figures 2F and 2G), corresponding to apparent packing fractions of p ≈ 0.29 and p ≈ 0.1 for SRSF1 and SRSF1Δ213–247 microphases, respectively.

We performed atomistic simulations, using the ABSINTH implicit solvation model and forcefield paradigm,60,61 of individual SRSF1 molecules to obtain a mean radius of gyration (Rg,SRSF1) of 2.7 ± 0.2 nm (Figure 2H). We also performed atomistic simulations of 24 SRSF1 molecules to characterize the internal organization within microphases. The mean radius of gyration of the simulated microphase (Rg,μ) was 9.8 ± 0.2 nm (Figure 2I). The spatial organization of domain sites within simulated microphases was analyzed in terms of radial distribution functions, gD(r) (Figure 2J). For a distance rRg,SRSF1 of an SRSF1 molecule, the relative density of IDR2 within the interior is the lowest, and that of IDR1 is the highest of the four domains. The densities of the two RRMs proximal to the center-of-mass lie between those of the two IDRs. Beyond a length scale of 8 nm, there is a crossover, and the density of IDR2 is higher than that of IDR1 (Figure 2J, inset). Due to confinement imposed by the spheroidal envelope of the microphase, all gD(r) profiles converge and decay to zero at the edge of the envelope (Figure 2K; Video S1).

From atomistic simulations, the mean Rg,SRSF1Δ213247 is 2.6 ± 0.2 nm (Figure 2L), and the Rg,μ of SRSF1Δ213–247 assemblies was found to be 12.2 ± 0.3 nm (Figure 2M; Video S2). The gD(r) profiles show three distinct layers. First, there is preferential accumulation of all domains at short-range r<Rg,SRSF1Δ213247 (Figure 2N) with the highest density of IDR1 and IDR2 and depletion of RRM2 near the center-of-mass. Second, there is a hierarchical accumulation of domains on the intermediate length scale, with the highest density being for RRM1, followed by IDR1, RRM2, and IDR2 in that order. Third, the densities converge at a radial distance of 15 nm and decay to zero at the edge of the microphase.

We also characterized the phase behavior of SRSF1ΔIDR2, a construct that lacks the entirety of IDR2. The threshold concentration was cμ=0.44±0.08μM (Figure S2G). QF-DEEM revealed a polydisperse distribution of spheroidal assemblies defined by mean and median diameters of 33.6 ± 13.7 and 28.8 nm, respectively (Figures S2H and S2I).

Microphases associate to form sub-micron-scale clusters

As protein concentrations increase, the abundance of microphases also increases. This enables short-range attractions that drive clustering of microphases, such that condensed counterions are shared among microphases within a cluster62 (Figure 3A). The buildup of charge within clusters of microphases engenders long-range repulsions,63,64 thereby stabilizing the sizes of clusters.

Figure 3. Microphases associate to form sub-micron-scale clusters.

Figure 3.

(A) An accumulation of surface charge shown by blue + marks will draw a counterion cloud (dashed red envelope) around each microphase (solid blue circles) to enable clustering via short-range attractions and long-range repulsions.

(B) Distributions of cluster sizes extracted from MRPS measurements of SRSF1Δ213–247 above cμ. Solid lines and shaded regions show mean and SD, respectively.

(C) Confocal microscopy showing the positive staining with the ANEPPS dye.

(D) Images from confocal microscopy, collected at concentrations above cμ. Schematic depicts how microphases form clusters and networks of clusters.

(E) Box and whisker plot from analysis of QF-DEEM images yielded estimates for the number of microphases within a ~380 nm sized cluster. The box extends from the 25th to 75th percentiles. The line in the middle of the box is at the median value of 23 microphases per cluster. The whiskers extend from the minimum to the maximum number of microphases detected in the probe spheres.

(F) Single-molecule microscopy with photoactivatable Janelia Fluor 549-labeled SRSF1 showing that clusters are composed of multiple microphases. The image is produced as a maximum projection of 200 frames collected, where the color bar shows the fluorescence intensity.

(G) Histograms of fluorophores with trajectories of less than 50 nm (slow) or more than 50 nm (fast) over each 20 ms frame for 20,000 frames each, with an inset for a zoomed-in field of view of the outlined area. The histograms are represented as heatmaps mapped onto the image.

(H) Excess variance computed over bins that quantify the displacements of slow-versus fast-moving molecules.

(I) Confocal microscopy images of AF488-SRSF1Δ213–247 showing destabilization and loss of clusters and microphases with increasing concentrations of L-arginine.

(J) RALS data, shown at two concentrations of L-arginine.

(K) Confocal microscopy images AF488-SRSF1Δ213–247 showing loss of clustering in the presence of increased v/v% of PEG8000.

(L) Microphase separation of SRSF1Δ213–247 is preserved in the presence of 10% v/v of PEG8000.

(M) Image of GFP-tagged SRSF1Δ213–247 bodies in X. laevis GVs.

(N) Merge of images collected simultaneously via bright-field microscopy and the intensity in the GFP channel shows that the SRSF1Δ213–247 bodies are proximal to Cajal bodies in the GVs.

(O) CDF of the diameters of SRSF1Δ213–247 positive bodies remains invariant at different levels of the injected mRNA. Latrunculin A (Lat-A) treatments (orange dotted) to oocytes injected with 100 ng/μL (orange) mRNA did not result in an increase in the size of the intra-GV SRSF1Δ213–247 positive bodies.

See also Figure S3.

We measured the size distributions of SRSF1Δ213–247 at concentrations above cμ using microfluidic resistive pulse sensing (MRPS) (see STAR Methods).50,65 Over a 5-fold range of concentrations, the MRPS measurements reveal the presence of clusters of sizes between 358.9 ± 20.4 and 386.1 ± 26.3 nm. While the sizes of clusters stayed essentially the same, their abundance increased with SRSF1Δ213–247 concentration (Figure 3B).

The buildup of charge at interfaces should be measurable as an electric field. We assessed this using the di-4-ANEPPS (di-4-aminoaphthylethenylpyridinium)66 dye, which fluoresces in response to a change in the presence of a local electric field. Using confocal microscopy, we detected strong fluorescence signals when we incubated the dye with SRSF1Δ213–247 at concentrations above cμ (Figure 3C).

We used confocal microscopy for direct imaging of the clusters (Figures 3D and S3A). SRSF1Δ213–247 molecules formed clusters of ~400 nm in diameter, equivalent to sizes measured via MRPS. Cluster sizes did not change as SRSF1Δ213–247 concentrations were increased; the abundance of clusters increased with increasing SRSF1Δ213–247 concentration (Figures 3D and S3A). We estimated a median number of ~23 microphases within a cluster of diameter ~380 nm (Figures 3E and S3B).

To study the internal dynamics of molecules within microphases and clusters, we used single-molecule localization microscopy67 and measured the blinking of SRSF1Δ213–247 labeled with the photoactivatable Janelia Fluor 549. Labeled molecules were mixed with unlabeled molecules at a 1:50 molar ratio. The maximum projection of 2 × 104 frames (20 ms per frame) collected over 6.7 min onto a single image showed the presence of clusters, mirroring observations from confocal imaging (Figure 3F).

We utilized single-particle tracking photoactivated localization microscopy (sptPALM)68 to track motions of SRSF1Δ213–247 molecules (Figure S3D). The cumulative distribution function (CDF) of displacements showed that the probability of finding molecules with displacements that are less than or equal to 50 nm is 0.5 (Figure S3E). The CDF allowed for the classification of molecules as being slow-moving (Video S3) within a given frame. The displacements of slow-moving molecules were less than 50 nm. This length scale is equivalent to the size of a single SRSF1Δ213–247 microphase (Figure S3F). Molecules were defined to be fast-moving if the displacements were larger than 50 nm (Video S4). Comparing the histograms for the two sets of displacements, we observed that there were more slow molecules, implying that SRSF1Δ213–247 molecules were mostly confined within microphases (Figure 3G). Analysis of the excess variance of displacements across all bins within a mask showed prominent outliers, with some bins featuring a strong localization of fluorophores within a ~50 nm region, corresponding to the sizes of individual microphases (Figures 3H and S3G). In fluorescence recovery after photobleaching (FRAP) of Alexa Fluor 488-labeled SRSF1Δ213–247 clusters, the fluorescence intensity did not recover for at least 120 s after photobleaching (Figure S3C).

Disruption of microphases and clusters

Addition of 500 mM L-arginine69-71 abrogated microphases (Figures 3I and 3J). These results are concordant with previous observations72 showing an increase in solubility in the presence of an octameric peptide of RS repeats (RSRSRSRS) or a mixture of arginine and glutamate (Arg/Glu) in solution.

The addition of high-molecular-weight polyethylene glycol (PEG8000) destabilized clustering without disrupting microphase separation (Figures 3K and 3L). Low-molecular-weight excipients such as ethylene glycol were ineffective at disrupting clustering (Figure S3H). PEG polymers exhibit long-range repulsions from positively charged surfaces73 and shielding happens on the length scale of microphases.

SRSF1Δ213–247 forms size-limited clusters in cells

We leveraged the large sizes of the Xenopus laevis oocytes to quantify the sizes of SRSF1Δ213–247 assemblies in germinal vesicles (GVs). We expressed GFP-tagged SRSF1Δ213–247 by microinjection of different mRNA concentrations into oocytes. Following the expression, the GVs were extracted and imaged using confocal microscopy.74 We observed more than 200 SRSF1-positive bodies in each GV (Figure 3M). Some of the SRSF1Δ213–247-positive bodies were attached to a larger non-fluorescent body as observed by bright-field microscopy (Figures 3N and S3I). This is consistent with nuclear speckles being attached to Cajal bodies.75 Irrespective of the injected mRNA concentration, the sizes of SRSF1Δ213–247-positive bodies were narrowly distributed with a mean diameter of ~300–400 nm (Figure 3O). The sizes of SRSF1Δ213–247-positive bodies are consistent with the clusters of microphases that were observed in vitro (Figures 3B and 3D).

Nucleoli and histone locus bodies fuse and coarsen upon incubation of X. laevis oocytes with latrunculin A (Lat-A), which disrupts nuclear actin.74 In contrast, the size distributions of SRSF1Δ213–247-positive bodies did not change with Lat-A treatment (Figure 3O). Therefore, the uniform size distributions observed for SRSF1Δ213–247-positive bodies in GVs appear to be intrinsic properties of clusters of SRSF1Δ213–247 microphases.

SRSF3, SRSF5, and SRSF7 also form microphases

Next, we studied SRSF3, SRSF5, and SRSF7 as representatives of different architectures76 (Figures 4A, S4A, and S4B). In HeLa cells, SRSF1 is the most abundant of the SRSFs, followed by SRSF3 and SRSF7. SRSF5 is of relatively low abundance.77 The C-terminal IDRs are enriched in arginine and serine (red box in Figure S4C). SRSF7 has the longest SR-rich IDR. Additionally, the IDR1 of SRSF1 and the 38-residue linker between the RRM and C-terminal IDR of SRSF7 have sequence grammars that are shared with prion-like domains.18

Figure 4. Microphase separation is a shared feature of SRSFs.

Figure 4.

(A) Sequence architectures of SRSF1, SRSF3, SRSF5, and SRSF7.

(B–D) Inter-domain ΔXY coefficients for SRSF3 (B), SRSF5 (C), and SRSF7 (D).

(E) QF-DEEM images of 5 μM SRSF3 (top), 10 μM SRSF (middle), and 2 μM SRSF7 at two different magnifications (left) versus (right).

(F) Histograms of diameters (in nm) constructed from 1,446 images for SRSF3 (top), 1,656 images for SRSF5 (middle), and 656 images for SRSF7 (bottom) (mean (d¯) ± SD and median (d~)).

(G) Sub-micron-scale clusters formed by SRSF1 (top left), SRSF7 (top right), SRSF3 (bottom left), and SRSF5 (bottom left).

(H–J) Results from microinjections and expression of 200 ng/μL mRNAs of mCherry-tagged SRSF3 (H), SRSF5 (I), and SRSF7 (J).

(K) CDFs of size distributions of bodies formed by SRSF3, SRSF5, and SRSF7.

(L) Bright-field (top), fluorescence microscopy (middle), and merged (bottom) images obtained from injections of different amounts (shown across the columns) of mCherry-tagged mRNAs of SRSF7.

(M) CDF showing the evolution of sizes formed by SRSF7 bodies at different injection levels.

See also Figure S4.

Inter-domain homotypic interactions in SRSF3 are nearly ideal (RRM-RRM) or repulsive (IDR-IDR), whereas IDR-RRM interactions are attractive (Figure 4B). The inter-domain interaction patterns for SRSF5 are qualitatively similar to those of SRSF1. However, the homotypic inter-IDR2 repulsions are stronger in SRSF5, and IDR2-RRM2 attractions are weaker (Figure 4C). The presence of the linker introduces a new degree of freedom into SRSF7 when compared with SRSF3 (Figure 4D). Thus, SRSF5 is a tetra-block system, whereas SRSF3 and SRSF7 are di- and triblock systems, respectively.

The measured values of cμ were 1.0 ± 0.14 μM for SRSF7, 2.0 ± 0.14 μM for SRSF3, and 9.5 ± 0.71 μM for SRSF5 (Figures S4D-S4G). The fluorescence lifetimes of pyrene corroborated the cμ values, with the rate constants plateauing to protein-specific values of 1.01 ± 0.1, 0.93 ± 0.17, and 1.08 ± 0.17 ns−1 for SRSF7, SRSF3, and SRSF5, respectively (Figure S4H). QF-DEEM imaging showed that all three systems formed spheroidal assemblies (Figure 4E). The mean and median diameters were found to be 39.4 ± 8.0 nm and 39.4 nm for SRSF3, 26.6 ± 7.8 nm and 24.5 nm for SRSF5, and 29.1 ± 11.1 nm and 26.0 nm for SRSF7 (Figure 4F).

The TRFQ assay yielded estimates of nμ that are highly unreliable (Figures S4I-S4K). Hence, we used the distribution of Rg values of individual molecules, the size distributions of the spheroidal assemblies obtained from QF-DEEM, and an assumption of equivalent packing fractions to that of SRSF1 assemblies to obtain estimates of nμ, which were ~93, ~14, and ~8 for microphases of SRSF3, SRSF5, and SRSF7, respectively.

MRPS measurements (Figures S4M-S4P) showed that SRSF7, SRSF3, and SRSF5 formed clusters that were ~275–300 nm in diameter. These sizes remained invariant over a range of protein concentrations, although the abundance of clusters increased with increasing concentration. Confocal imaging using fluorescently labeled SRSF3, SRSF7, and SRSF5 showed the formation of sub-micron-scale clusters for each protein (Figure 4G).

We injected mRNAs expressing mCherry-tagged SRSF3, SRSF5, and SRSF7 into the X. laevis oocytes. At 200 ng/μL mRNA, SRSF3 and SRSF5 formed de novo nuclear bodies that were not adjacent to nuclear speckles or Cajal bodies (Figures 4H and 4I). At the same mRNA concentration, SRSF7 formed bodies that are adjacent to nuclear speckles (Figure 4J). The CDFs of the size distributions of bodies formed by SRSF3 and SRSF5 show a sharp rise (Figure 4K). This is consistent with the bodies having uniform size distributions. The sizes of the bodies are also consistent with estimates from MRPS measurements in vitro (Figure S4L).

We quantified how the sizes of SRSF7 bodies varied with mRNA concentration. At 50 ng/μL, SRSF7 forms bodies that are adjacent to Cajal bodies (Figure 4L). However, above 1,000 ng/μL, SRSF7 forms macrophases that are distributed across the nucleoplasm (Figures 4L and 4M). These data are consistent with high protein levels and possibility of interactions with other nuclear components affecting the overall phase behavior of SRSF7 in GVs.

Double emulsions form in mixtures of microphases

Given the results observed for SRSF7 expression in GVs, we investigated the possibility of microphases being remodeled via heterotypic protein interactions. We computed the normalized interaction parameters ΔXY for pairs of domains from SRSF1Δ213–247 and SRSF3 (Figure 5A). The inter-RRM interactions are nearly ideal; the interactions between SRSF1-IDR1 and SRSF3-RRM and that of the SRSF3-IDR with SRSF1-RRM1 are equivalently attractive; the interactions between SRSF1Δ213–247-IDR2 and SRSF3-RRM are also attractive; the inter-IDR interactions are different, albeit strongly repulsive. Equivalent interaction matrices were computed for mixtures of SRSF1Δ213–247 with SRSF5 and SRSF7 (Figures S5A and S5B).

Figure 5. Microphase modulation in mixtures and by chimeras.

Figure 5.

(A) Interaction matrices for pairs of domains drawn from SRSF1Δ213–247 and SRSF3.

(B) QF-DEEM images of nanoscale structures of microphases formed in mixtures of SRSF1Δ213–247 with SRSF3.

(C) Histogram of diameters (in nm) constructed from the analysis of 1,252 distinct images (mean (d¯) ± SD and median (d~)).

(D–F) Fluorescence microscopy images of sub-micron-scale structures formed in mixtures of SRSF1Δ213–247 and SRSF3 (D), SRSF5 (E), and SRSF7 (F). In (D)–(F), yellow lines represent line scans, and magenta arrows point to spherical caps at interfaces. Rightmost column shows normalized fluorescence intensities in SRSF1Δ213–247 (green) and SRSFx (cyan) channels.

(G) Impact of each of SRSF3, SRSF5, and SRSF7 on cμ of SRSF1Δ213–247.

(H) Matrix of ΔXY for ch-SRSF1.

(I) Data from three independent RALS measurements for ch-SRSF1 yield a cth value (mean ± SD) shown in the inset. Data shown are from a single measurement.

(J) Fluorescence microscopy images of macrophases formed at different concentrations of ch-SRSF1.

(K) ΔXY for ch-hnRNP-A1.

(L) Data from a single RALS measurement for ch-hnRNP-A1.

(M) Fluorescence microscopy images of ch-hnRNP-A1 collected using samples that comprise 1% of molecules with an AF488 label at the N terminus.

(N and O) Results of the injection of mRNAs of mCherry-tagged ch-SRSF1 (N) and ch-hnRNP-A1 (O) into X. laevis. The images on the left are from bright-field microscopy, those in the middle are from fluorescence microscopy, and the rightmost image shows the merge of the two. ch-hnRNP-A1 forms size-limited assemblies that co-localize with Cajal bodies (O), see CDFs in (P), ch-SRSF1 forms bodies of heterogeneous sizes (N).

See also Figure S5.

We used QF-DEEM to analyze the nanoscale structures formed in mixtures of 2 μM SRSF1Δ213–247, which is above its intrinsic cμ, and 1 μM SRSF3, which is below its cμ (Figure 4E). The images show biconcave discs and spheroids (Figure 5B). The mean and median sizes were 33.7 ± 13.3 nm and 37.9 nm, respectively (Figure 5C). The biconcave structures were akin to those of the SRSF1Δ213–247 microphases. The spheroids are likely a result of SRSF3 molecules adhering to the interface of pre-formed SRSF1Δ213–247 microphases.

The size-limiting behavior of SRSF1Δ213–247 microphases implies that their interfacial free energies are likely to be low or vanishing.22 Adhesive interactions with an excipient can alter the interfacial free energy and remodel microphases.78 We explored this possibility using confocal imaging in binary mixtures of SRSF1Δ213–247 with each of SRSF3 (Figure 5D), SRSF5 (Figure 5E), or SRSF7 (Figure 5F). In all cases, the concentration of SRSF1Δ213–247 was above its cμ, whereas those of the other SRSFs were below their cμ. The micron-scale organizations observed in all three binary mixtures were double emulsions with core-shell architectures (Figure S5C). Line scans show that the core is enriched in SRSF1Δ213–247 and the shell is enriched in the other SRSF (Figures 5D-5F). Sub-micron-scale caps that become generators of the double emulsion structures were also visible (see arrows in Figures 5D-5F).

We used RALS to measure the threshold concentration (cth) of SRSF1Δ213–247 for phase separation in the presence of different amounts of SRSF3, SRSF5, or SRSF7. The threshold concentration rises to ~1 μM at a 0.05:1 molar ratio of SRSF7-to-SRSF1Δ213–247, a 0.5:1 ratio of SRSF3-to-SRSF1Δ213–247, and a 4.5:1 ratio of SRSF5-to-SRSF1Δ213–247 (Figure 5G).

Investigations of chimeric proteins

IDRs such as A1-LCD drive macrophase separation.18,79 The driving forces for macrophase separation can be enhanced by replacing all aromatic residues within this sequence with tyrosine (allY).79 We used QF-DEEM to investigate structures of allY macrophases. We observed micron-scale droplets defined by interiors featuring nanoscale inhomogeneities and percolated hub-like entities80 (Figure S5F). Injection of mRNA for mCherry-tagged full-length hnRNP-A1 into X. laevis oocytes showed that the expressed protein localizes to the dense fibrillar and granular components of nucleoli (Figure S5G), which show characteristics of macrophases.81,82

Next, we designed two chimeric proteins, ch-SRSF1 and ch-hnRNP-A1. In ch-SRSF1, the IDR2 of SRSF1 was replaced with the 129-residue-long A1-LCD(Δhexa).52,83 In ch-SRSF1, the repulsions of IDR2 are replaced by attractions, the repulsions involving IDR1 are diminished, and so are the attractions involving RRM1 and RRM2 (Figure 5H). Using RALS, we observed a sharp discontinuity at a concentration (cth) of 1.5 ± 0.4 μM (Figure 5I). Above cth, there is a concentration-dependent increase in the abundance, fusion, and growth of micron-scale puncta (Figure 5J). Thus, ch-SRSF1 drives macrophase separation. Imaging disordered, fluid-like macrophases using QF-DEEM is challenging due to their fragility. However, in one field of view, we observed a micron-scale body (Figure S5H). The interiors within this one-off image of a micron-scale punctum show finer, nanoscale structures.

To generate ch-hnRNP-A1, we replaced the A1-LCD with IDR2Δ, which is IDR2 of SRSF1Δ213–247. This replacement enhances the attractions of the C-terminal IDR with RRM1 and RRM2 when compared with the parent protein (Figure 5K), and C-terminal attractions are replaced by repulsions. RALS measurements show a non-linear, albeit monotonic increase in scattering intensity. Confocal microscopy showed the presence of sparse sub-micron-scale assemblies. The ch-hnRNP-A1 chimera exhibits behaviors that are concordant with the weak segregation limit44 whereby only small, size-limited clusters are formed.

We also injected mRNAs of mCherry-tagged ch-SRSF1 and ch-hnRNP-A1 into X. laevis oocytes. ch-SRSF1 formed a range of de novo micron-scale bodies that are distinct from nuclear speckles or Cajal bodies (Figure 5N), whereas ch-hnRNP-A1 co-localized to Cajal bodies (Figure 5O). Thus, while ch-SRSF1 forms bodies with a heterogeneous distribution of sizes consistent with macrophase separation, ch-hnRNP-A1 forms submicron-scale clusters that are size-limited (Figure 5P).

TDP-43 forms microphases

TDP-43 colocalizes with sites of active splicing in neurons.84 In addition to two RRMs, TDP-43 comprises a folded N-terminal oligomerization domain (NTD),85,86 and a prion-like C-terminal LCD87 (Figure 6A). The two RRMs (Figures 6B, S6A, and S6B) are different from those of RRMs in SRSFs and hnRNP-A1. The computed inter-domain interaction coefficients (Figure 6C) show that prominent attractions are homotypic, involving the LCD and NTD, and heterotypic, involving the LCD and each of the other domains (RRM1, RRM2, and NTD). All other interactions are either repulsive (RRM2-RRM2, RRM2-NTD) or nearly ideal (RRM1-RRM1, RRM1-NTD, RRM1-RRM2). Thus, TDP-43 has the architecture of an “inverse bolaamphiphile,” similar to Matrin-3, which forms spherical and wormlike microphases.50 Flexible bolaamphiphiles comprise hydrophilic ends connected by hydrophobic chains.88 An inverse bolaamphiphile has the opposite orientation.

Figure 6. TDP-43 forms microphases and double emulsions in mixtures.

Figure 6.

(A) Sequence architecture of TDP-43.

(B) Space-filling model of TDP-43. Color coding is identical to that of space-filling models in Figure 1.

(C) Computed ΔXY coefficients for TDP-43.

(D) Representative RALS trace of TDP-43. Results are shown from a single representative measurement (solid circles) and spread over three technical replicates across different protein preps (pale envelope). cμ value is shown as mean ± SD.

(E) QF-DEEM images 0.7 μM TDP-43.

(F) Histogram of diameters from analysis of 1,326 microphases (mean (d¯) ± SD and median (d~)).

(G) Results from TRFQ measurements performed over a range of TDP-43 concentrations.

(H) Histogram of Rg,μ values of microphases comprising 12 TDP-43 molecules.

(I) Profiles of gD(r).

(J) Representative snapshot from simulations showing organization of 12 TDP-43 molecules within a microphase.

(K) Representative fluorescence microscopy image of a double emulsion formed in a binary mixture of 1 μM TDP-43 and 1 μM SRSF1Δ213–247. Rightmost panel shows the line scan of normalized fluorescence intensities in the TDP-43 (cyan) and SRSF1Δ213–247 (green) channels. Magenta arrows point to spherical caps.

See also Figure S6.

RALS measurements yielded a cμ of 0.18 ± 0.08 μM for TDP-43 (Figure 6D). The rate constants determined from pyrene fluorescence lifetime measurements decreased with increasing TDP-43 concentration until cμ, beyond which the values of the rate constants plateaued at ~0.93 ± 0.10 ns (Figure S6C). QF-DEEM imaging at 0.7 μM TDP-43 showed biconcave structures (Figure 6E) with mean and median diameters of 33.7 ± 9.5 nm and 33.7 nm, respectively (Figure 6F). Each TDP-43 microphase accommodates ~13 TDP-43 molecules per microphase as inferred from TRFQ experiments (Figure 6G). Note that previous reports suggested that TDP-43 forms oligomers comprising ~12 molecules.89 The packing fraction of TDP-43 in the biconcave structures is 0.07.

Atomistic simulations using 12 TDP-43 molecules that were restrained to be within an envelope of radius 15 nm showed assemblies with a mean Rg,μ of 8.3 ± 0.2 nm (Figure 6H). The radial distribution profiles show accumulation of the NTD within the microphase interior (Figures 6I and 6J). This is followed by RRM2, RRM1, and the LCD. At intermediate length scales, between 6 and 8 nm, the density of the LCD becomes the highest, followed by the NTD, and the two RRMs (Video S5). Confocal imaging showed weak clustering at concentrations above cμ (Figure S6D).

In binary mixtures with SRSFΔ213–247, all domains of TDP-43 make attractive interactions with IDR1 and IDR2 of SRSFΔ213–247 (Figure S6E). Interactions between RRM2 of TDP-43 and RRM1 of SRSFΔ213–247 are repulsive, as are interactions between RRM2 of SRSFΔ213–247 and both the NTD and RRM2 of TDP-43 (Figure S6E). The attractions enable adhesive interactions at interfaces between microphases. However, the domains cannot mix and form homogeneous structures because of the repulsions. Accordingly, the interface expands, and the spreading effect leads to core-and-shell-like double emulsions with TDP-43 predominantly localizing to the shell (Figure 6K).

Effects of MALAT1 on microphases

MALAT1 is a key lncRNA that regulates splicing activity within nuclear speckles.45,90 It is upregulated in cancer cells91-93 where it accumulates in the nucleus and localizes to nuclear speckles.27,90,94 MALAT1 serves as a hub that interacts with splicing factors, such as U1 small nuclear ribonucleoprotein particle (snRNP)95 and transcriptionally active chromatin.96 Human MALAT1 is ~7,000 nt long,94 and undergoes further processing to produce a 6,900 nt-long transcript and a tRNA-like small RNA of 61 nt.97 MALAT1 is predicted to comprise 200 evolutionarily conserved helices, including a 3′ triple helix,98 pseudoknots, tetraloops, internal loops, and long-range intramolecular interactions.99

We produced MALAT1 using in vitro transcription and generated 3′-labeled MALAT1 with Alexa Fluor 647 for fluorescence assays. Using fluorescence correlation spectroscopy (FCS), the hydrodynamic radius (Rh) of MALAT1 was found to be 6.8 ± 0.7 nm (Figure S7A). The reported Rh of BunVL, a single-stranded RNA of similar length, is ~9.2 nm in solutions where the ionic strength is more than two times lower than what we use in our measurements.100 Our Rh measurements are consistent with studies showing that compaction is a universal property of mRNAs and lncRNAs.101

MALAT1 can bind to proteins as dispersed monomers or within microphases. In the polyphasic linkage formalism,102-104 if MALAT1 is the ligand, and SRSF1 or SRSF1Δ213–247 is the macromolecule, then the threshold concentration for microphase separation in the presence of the ligand (cμ,L) can be written as: cμ,L=cμ,0(PμP0). Here, cμ,0 is measured in the absence of ligand, Pμ is the binding polynomial of the ligand to macromolecules in the microphase, and P0 is the binding polynomial outside the microphase. If Pμ=P0, then cμ,L=cμ,0; if Pμ<P0, then the affinity of the ligand is higher for the macromolecule in the microphase, and cμ,L<cμ,0; finally, if Pμ>P0, then the affinity of the ligand is higher for the macromolecule in the dispersed phase, and cμ,L>cμ,0.

We measured cμ,L for SRSF1 and SRSF1Δ213–247 as a function of MALAT1 concentration. Increasing amounts of MALAT1 lead to a lowering of cμ (Figure 7A). Thus, MALAT1 binds preferentially to SRSF1 and SRSF1Δ213–247 microphases. Changes to cμ saturate above a 5:1 molar ratio of MALAT1(in nucleotides)-to-protein. The effects of MALAT1 on lowering the cμ values are similar for SRSF1 and SRSF1Δ213–247. Thus, the region spanning residues 213–247 does not contribute to preferential binding of MALAT1 to SRSF1 microphases. The number of SRSF1 molecules per microphase (nμ) decreased from ~24 in the absence of MALAT1 to ~12 in the presence of MALAT1 (Figure 7B). Likewise, for SRSF1Δ213–247, nμ decreased from ~50 in the absence of MALAT1 to ~12 in the presence of MALAT1. Thus, upon preferential binding to microphases, MALAT1 enables the release of either ~12 SRSF1 molecules or ~37 SRSFΔ213–247 molecules. This release of positively charged proteins provides an entropic driving force that enhances microphase separation. The release of protein molecules also creates room for the incorporation of MALAT1 into microphases.

Figure 7. MALAT1 binds specifically and preferentially to SRSF1 microphases.

Figure 7.

(A) In the presence of MALAT1, the cμ of SRSF1 and SRSF1Δ213–247 decreases in a dose-dependent manner.

(B) TRFQ experiments, performed over a range of SRSF1 and SRSFΔ213–247 concentrations at a molar ratio of SRSF1-to-MALAT1 at 1-to-5 × 10 −4, yielded an estimate of ~12 protein molecules per microphase (mean ± SD for slope and nμ).

(C) MALAT1 colocalizes with sub-micron-scale clusters of SRSF1 (top row) and SRSF1Δ213–247 (bottom row).

(D) QF-DEEM images of SRSF1Δ213–247-MALAT1 microphases.

(E) Histograms collected across 1,510 MALAT1 bound microphases of SRSF1Δ213–247 show their diameters in nm.

(F) Comparative impact of MALAT1 versus MALAT1scr on the cμ values of SRSF1Δ213–247.

(G) MALAT1scr colocalizes with clusters of microphases formed by SRSF1Δ213–247.

(H) MALAT1 does not affect cμ of TDP-43 at a molar ratio of TDP43-to-MALAT1 at 1-to-5 × 10−4, but at high RNA-to-protein ratios, it binds preferentially to the dilute phase.

(I) QF-DEEM images of TDP-43 + MALAT1 microphases.

(J) Histograms of microphase sizes in nm of 818 images of TDP-43 in the presence of MALAT1. Mean (d¯) ± SD and median (d~).

See also Figure S7.

Confocal images show colocalization of MALAT1 with the clusters of microphases formed by SRSF1 and SRSF1Δ213–247 (Figure 7C). In the presence of MALAT1, the QF-DEEM imaging shows spheroidal SRSF1Δ213–247 microphases (Figure 7D). From the size distributions (Figure 7E), the mean and median values are 36.4 ± 12.0 nm and 34.0 nm, respectively. Thus, preferential binding of MALAT1 remodels SRSF1Δ213–247 microphases.

MALAT1 interacts with SRSF1 through hepta-nucleotide AGAAGAA motifs.105 There are five such motifs within a 500-nt stretch. Using computations, we generated a distribution of 104 sequences by randomly scrambling the RNA sequence within the 500-nt stretch. Regions that are 5′ and 3′ to this stretch were fixed to be those of wild-type MALAT1. Within the ensemble of scrambled sequences, the occurrence of GAA, AGAA, and AAG sub-motifs within MALAT1 is highly non-random (Figure S7B). This is suggestive of a selection bias for these motifs. To test the specificity of MALAT1 effects, we selected a scrambled sequence, MALAT1scr. In MALAT1scr, the probabilities of finding GAA, AGAA, and AAG motifs were significantly lower than for the wild-type MALAT1. The nucleotide composition and length of the sequence of the MALAT1scr are identical to those of the wild-type. Across a range of MALAT1scr concentrations, excepting large molar excesses, MALAT1scr has a negligible effect on the measured cμ of SRSF1Δ213–247 (Figure 7F). However, confocal imaging shows that MALAT1scr colocalizes with SRSF1Δ213–247 clusters (Figure 7G). Together, these data show that, beyond colocalization, cognate binding sites need to be present in MALAT1 to enable preferential binding.

To further assess the binding specificity, we measured the impact of MALAT1 on the microphase separation of TDP-43 (Figure 7H). The cμ of TDP-43 is not affected for RNA-to-protein ratios that are below 10-to-1. However, in the presence of more than a10-fold excess of MALAT1, the cμ of TDP-43 shifts upward. Thus, excess MALAT1 destabilizes microphase separation of TDP-43 (Figure 7H). This is reminiscent of reentrant phase behaviors that result from excesses of RNA versus protein.106,107 QF-DEEM showed that the structures formed by TDP-43 were preserved in the presence of MALAT1 (Figure 7I). This contrasts with the remodeling of biconcave structures formed by SRSF1Δ213–247. The effect of MALAT1 is evident in the increased sizes of the TDP-43 microphases, with the mean and median sizes being 43.6 ± 11.7 nm and 42.9 nm, respectively.

DISCUSSION

SRSFs and TDP-43 are functional copolymers of ordered and disordered domains (FRODOs) that undergo microphase separation. For individual FRODOs in solution, the structures observed in vitro constitute a well-defined hierarchy. Individual proteins that are 4–6 nm in diameter form microphases that are 25–40 nm in diameter. The sizes of microphases are stabilized by the interplay of intermolecular attractions and repulsions. As concentrations increase beyond cμ, microphases associate to form sub-micron-scale clusters that are ~300–400 nm in diameter. The sizes of clusters are stabilized by the interplay of inter-microphase short-range attractions and long-range repulsions.64 Microphase separation of FRODOs involves a combination of site-specific interactions of folded domains, distributed multivalent interactions among IDRs and surfaces of folded domains, and overall solubility considerations.

In mixtures of microphase-separating proteins, there is an interplay between two types of inter-domain attractions and repulsions: those involving proteins of the same kind and those involving different proteins. Adhesive interactions at microphase interfaces generate spherical caps with both proteins. However, repulsions drive spreading over mixing, thus giving rise to double emulsion structures through polymer-polymer phase separation36,40,78 that maximizes inter-protein and inter-block attractions while minimizing repulsions. Thus, we conjecture that the demixing phenomenon reported recently for TDP-43 in stress granules108 might be a generic consequence of interactions at the level of microphases. The broader implication would be that stress granules are also emulsions of microphases.

Chromatin is also a multi-block copolymer, and organization into distinct territories might be a manifestation of microphase separation.37,39 The binding of transcription factors, such as TFEB, to target genomic loci lowers the interfacial tensions of condensates, thereby suppressing coarsening and driving binding-induced microphase separation.109,110 Bridging-induced phase separation also generates size-limited DNA-cohesin clusters.111 Binding of different RRM-containing proteins to the 5′, middle, and 3′ regions of NEAT1 (nuclear enriched abundant transcript 1) has been proposed to generate regions of different hydrophobicity along the lncRNA, thereby converting NEAT1 into a triblock copolymer.112 An intriguing, albeit uncorroborated possibility is that the binding of RRM-containing proteins drives an unfolding of the lncRNA. If so, the intrinsic microphase separation of paraspeckle proteins would be destabilized, and microphases would be realized by the generation of a block copolymer that forms on a very different length scale.112 This scenario contrasts with our observations showing that preferential and specific binding of MALAT1 to SRSF1 microphases releases the proteins from pre-formed microphases. Within nucleoli, Yao et al. showed that fibrillarin (FBL) forms distinct territories within dense fibrillar components (DFCs).113 These FBL-dense regions are likely to be clusters of microphases. As a preliminary assessment, we computed interaction matrices for the DFC proteins FBL (Figure S7C) and nucleolin (NCL) (Figure S7D). The computations suggest that both proteins are bolaamphiphiles,88 thereby making a case for their microphase separation within nucleoli.

What is the relevance of microphase separation in nuclear speckles? On average, there are likely to be ~106 copies of SRSF1 in a cell.77 The estimated nuclear concentration would be 1–10 μM, which is above the cμ value that we measured in vitro. Thus, microphase separation seems unavoidable within nuclei. Also, the effective molarity (ceff) within SRSF1 microphases, estimated as ceff=(pφnμvμ)×(1NA), would be ~9.6 mM. Here, p=0.29 is the packing parameter, φ=0.005 is the volume fraction of each monomer in the microphase, nμ=24 is the number of molecules, vμ=6.86×1024m3 is the volume of each microphase, and NA=6.02×1023 is Avogadro’s number. This high local concentration will help increase the splicing efficiency by driving efficient spliceosome assembly.6,15,114 MALAT1 binds specifically and preferentially to SRSF1 microphases, enhancing the driving forces for microphase separation. This likely contributes to the interplay between regulation of splicing and RNA polymerase II (RNA Pol II) mediated transcription. Indeed, Guo et al.115 showed that phosphorylation of the C-terminal domain (CTD) switches RNA Pol II association from Mediator condensates to nuclear speckles. Specifically, the phosphorylated CTD (pCTD) is incorporated into condensates formed by splicing factors such as SRSF1 and SRSF2.115

Song et al.45 showed that the pCTD binds preferentially to SRSF1 microphases. In ternary mixtures with MALAT1 and SRSF1, pCTD displaces MALAT1 from SRSF1 microphases.45 Thus, the recruitment of SRSF1 microphases by MALAT1, which stabilizes the microphases while releasing a fraction of the SRSF1 molecules from microphases, can lead to the assembly of microphases near transcription sites of speckle-associated genes. The stronger preferential binding of the pCTD likely displaces MALAT1 from SRSF1 microphases.

Limitations of the study

Currently, we lack knowledge about how post-translational modifications such as serine phosphorylation 90,116,117 impact the intrinsic and MALAT1-mediated microphase separation of SRSFs. We also lack direct probes of microphase separation in live cells that have the resolution of QF-DEEM to investigate microphase functions in cells, although recent methods show promise.118,119

STAR★METHODS

METHOD DETAILS

Generation of DNA constructs

pGEMT-MALAT1 was cloned by amplifying the MALAT1 cDNA from pCMV-MALAT1 and ligating it into the pGEM-T easy vector (Promega, USA) according to the manufacturer’s protocol. pGEMT-MALAT1scr was generated by Gibson cloning with a synthesized gene block (Genscript, USA) for the 500-nucleotide region.128 For Gibson assembly reactions, the insert DNA and plasmid DNA were incubated at a 3:1 molar ratio with the latter fixed at 50 ng and were combined with 2X Gibson Master Mix (final volume: 20 μL). The mixture was incubated at 50°C for one hour then cooled on ice. This mixture containing the assembled plasmid was used to transform into DH5a E. coli cells. DNA concentration was measured by absorbance at 260 nm. pET28-SRSF1Δ213-247 was cloned by subcloning SRSF1 cDNA from YFP-SRSF1 into the pET28 vector by restriction digestion and ligation. pET28-SRSF1 was cloned by sub-cloning SRSF1 from pSMT3-SRSF1. pET28-SRSF3 (Uniprot: P84103), pET28-SRSF5 (Uniprot: Q13243), pET28-SRSF7 (Uniprot: Q16629), pET28-ch-SRSF1, and pET28-ch-hnRNPA1 were generated by gene synthesis (GenScript, USA). pET28-SRSF1ΔIDR2 was generated by site-directed mutagenesis. The listed pET28 constructs were transformed into the DH5α and BL21-CodonPlus (DE3)-RIPL cells.

Protein purification and labeling

SRSF1, SRSF1Δ213-247, SRSF1ΔIDR2, SRSF3, SRSF5, SRSF7, ch-SRSF1, and ch-hnRNPA1

The His6-SUMO N-terminally tagged constructs were expressed in BL21-CodonPlus(DE3)-RIPL E. coli cells (New England Biolabs, USA) grown in LB broth (Millipore Sigma, USA), supplemented with kanamycin, spectinomycin, and chloramphenicol. Cultures were incubated at 37°C in a shaker incubator at 220 rpm until OD600 of 0.4 was reached. Cultures were then induced with 0.35 mM Isopropyl b-D-1-thiogalactopyranoside (IPTG) and incubated for 6-8 additional hours at 23°C. Cells were harvested by centrifugation at 4000 rpm for 30 minutes, washed in Lysis Buffer (10% glycerol, 20 mM HEPES (pH 8.0), 1.25 M NaCl), and stored as pellets at −80°C.

For lysis, the cell pellet was gently resuspended in 5 mL of cold Lysis Buffer per gram of pellet with one cOmplete protease inhibitor tablet (Roche, Switzerland). 1 mg/mL of lysozyme (Roche, Switzerland) was added and incubated at 4°C for 1 hour with gentle rocking. The solution was then sonicated (Branson 550 with an L102C horn attachment) using five sets of the following 20-round cycle: 1 second on / 2 second off at 30% power. The solution was then centrifuged for 30 minutes at 10,000 rpm.

Supernatant was incubated with equilibrated 4 mL HisPur Ni-NTA resin (Thermofisher Scientific, USA) at 4°C for one hour. The solution was then poured into a manual column, and the flow-through fractions were collected. The resin was then washed with 30 mL lysis buffer and then eluted by Elution buffer (10% glycerol, 20 mM HEPES (pH 8.0), 1.25 M NaCl, 300 mM Imidazole) in 2 mL fractions. The fractions were confirmed by SDS-PAGE. Fractions containing His6-SUMO-SRSF1 were pooled and the SUMO tags were cleaved during an overnight dialysis in the presence of 1:50 molar ratio of His6-tagged Ulp1 protease to His6-SUMO-SRSF in Cleavage Buffer (10% glycerol, 20 mM HEPES (pH 8.0), 1.25 M NaCl, 1 mM DTT (dithiothreitol)). The dialysate was incubated with 4 mL of Ni-NTA resin equilibrated with lysis buffer. Flow-through was collected, and the resin was eluted with elution buffer in 5 mL fractions. The presence of SRSF was confirmed in the flow-through and His6-SUMO and His6-Ulp1 in the elution fractions. Flow-through fractions were pooled and diluted 25-fold in dilution buffer (10% glycerol, 20 mM HEPES (pH 8.0)) to lower the total NaCl concentration. This solution was further purified using a HiTrap Heparin HP 5 mL column (Cytiva, USA) on a continuous gradient with Buffer A (10% glycerol, 20 mM HEPES (pH 8.0), 50 mM NaCl), and Buffer B (10% glycerol, 20 mM HEPES (pH 8.0), 1.25 M NaCl) by FPLC (ÄKTA Pure, Cytiva, USA). Peak fractions were confirmed by SDS-PAGE, pooled, and concentrated in Amicon Ultra 3 kDa MWCO (molecular weight cut-off) concentrator columns (Millipore Sigma, USA). Concentrated protein was aliquoted into 100 μL, flash frozen in liquid N2, and stored at −80°C. Concentration of protein was determined by absorbance at 280 nm. Predicted extinction coefficients129 for the constructs used to determine protein concentration are as follows: 27850 M−1cm−1 (SRSF1), 24870 M−1cm−1 (SRSF1Δ213-247), 23505 M−1cm−1 (SRSF1ΔIDR2), 16960 M−1cm−1 (SRSF3), 12950 M−1cm−1 (SRSF5), 21890 M−1cm−1 (SRSF7), 33935 M−1cm−1 (ch-SRSF1), 13075 M−1cm−1 (ch-hnRNPA1). Table S1 includes data for the ratio of absorbance measured at 260 nm to 280 nm. Values close to 0.6 confirm the presence of minimal nucleic acid contaminants (Figure S7F).

SDS-PAGE was performed after each purification step for all constructs. Gel images of the purified products are shown in Figure S7G.

TDP-43

The His6-SUMO N-terminally tagged construct was expressed in BL21(DE3) E. coli cells (New England Biolabs, USA) from the SUMO-TDP43 plasmid (pET28b(+)-SUMO-TDP43). grown in LB broth (Millipore Sigma, USA), supplemented with kanamycin. Cultures were incubated at 37°C in a shaker incubator at 220 rpm until OD600 of 0.4 was reached. Cultures were then induced with 0.35 mM Isopropyl b-D-1-thiogalactopyranoside (IPTG) for 6-8 additional hours at 23°C. Cells were harvested by centrifugation at 4000 rpm for 30 minutes, washed with Lysis Buffer (10% glycerol, 20 mM HEPES (pH 8.0), 1.25 M NaCl), and stored as pellets at −80°C.

For lysis, the cell pellet was gently resuspended in 5 mL of cold Lysis Buffer per gram of pellet with one cOmplete protease inhibitor tablet (Roche, Switzerland). 1 mg/mL of lysozyme (Roche, Switzerland) and 10 μg/mL DNase I were added and incubated at 4°C for 1 hour with gentle rocking. The solution was then sonicated (Branson 550 with an L102C horn attachment) for 2 minutes using 30 seconds on/off cycle at 50% power. The solution was centrifuged for 30 minutes at 10,000 rpm.

Supernatant was incubated with equilibrated 4 mL HisPur Ni-NTA resin (Thermofisher Scientific, USA) at 4°C for one hour. Then, the solution was poured into a manual column, and flow-through fractions were collected. The resin was then washed with 30 mL Wash buffer (10% glycerol, 20 mM HEPES, (pH 8.0), 1.25 M NaCl, 50 mM Imidazole) and the eluted by Elution buffer (10% glycerol, 20 mM HEPES, (pH 8.0), 1.25 M NaCl, 300 mM Imidazole) in 2 mL fractions. The fractions were confirmed by SDS-PAGE. Fractions containing His6-SUMO-TDP-43 were pooled and cleaved of His6-SUMO tags during an overnight dialysis in the presence of 1:50 molar ratio of His6-tagged Ulp1 protease to His6-SUMO-TDP-43 in Cleavage Buffer (10% glycerol, 20 mM HEPES (pH 8.0), 1.25 M NaCl, 1 mM DTT (dithiothreitol)). The dialysate was incubated with 4 mL of Ni-NTA resin equilibrated with lysis buffer. Flow-through was collected, and the resin was eluted with elution buffer in 5 mL fractions. The presence of TDP-43 was confirmed in the flow-through and His6-SUMO and His6-Ulp1 in the elution fractions. Flow-through fractions were pooled and diluted 25-fold in Dilution buffer (10% glycerol, 20 mM HEPES (pH 8.0)) to lower the total NaCl concentration. This solution was further purified using a HiTrap Heparin HP 5 mL column (Cytiva, USA) on a continuous gradient with Buffer A (10% glycerol, 20 mM HEPES (pH 8.0), 50 mM NaCl), and Buffer B (10% glycerol, 20 mM HEPES (pH 8.0), 1.25 M NaCl) by FPLC (ÄKTA Pure, Cytiva, USA). Peak fractions were confirmed by SDS-PAGE, pooled, and concentrated in Amicon Ultra 3 kDa MWCO concentrator columns (Millipore Sigma, USA). Concentrated protein was aliquoted into 100 μL, flash frozen in liquid N2, and stored at −80°C. Concentration of protein was determined by absorbance at 280 nm. Predicted extinction coefficients129 for the constructs used to determine protein concentration are as follows: TDP-43 is 44920 M−1cm−1.

Labeling proteins with fluorophores

For covalent conjugation of all protein constructs, including the SRSFs, TDP-43, ch-SRSF1, and ch-hnRNPA1, with NHS (h-hydroxysuccinimide ester) AlexaFluor 488 or AlexaFluor 405 (Thermofisher Scientific, USA), 1 mg of protein was dialyzed into Labeling Buffer (10% v/v glycerol, 20 mM HEPES (pH 8.3), 1.25 M NaCl) using Pierce Microdialysis Plates (ThermoFisher Scientific, USA). The dialysate was combined with NHS-Alexa Fluor dye (dissolved in DMSO) at a dye:protein molar ratio of 4:1. This mixture was incubated under gentle rocking for one hour at 4°C in the absence of light. The mixture was then dialyzed extensively into experimental buffer (10% (v/v) glycerol, 20 mM HEPES (pH 7.4), 50 mM KCl, 5 mM MgCl2) to remove unincorporated dye. Concentrations of protein and AlexaFluor 488 were determined by absorbance measurements (ε280=27850M-1cm-1, ε495=73,000M-1cm-1). Labeling efficiency was determined as the molar ratio of dye to protein and ranged between 0.4–0.6. Labeled protein was aliquoted into single-use volumes (2 μL), flash frozen in liquid N2, and stored at −80°C.

In vitro transcription and 3’ RNA labelling

For all RNA reagents, plasmids containing the desired RNA transcript were linearized via incubation with 5% v/v restriction enzymes (MALAT1 and MALAT1scr – SalI) and 10% v/v CutSmart Buffer (New England Biolabs, USA) at 37°C for 4 hours. Linearization was confirmed by gel electrophoresis of DNA samples in a 1% agarose gel with non-digested DNA as controls. Linearized plasmids were purified using a PCR clean-up kit (IBI Scientific, USA) as per the manufacturer’s manual. In vitro transcription was performed for each transcript using the mMESSAGE mMACHINE Transcription Kit (ThermoFisher, USA) with either SP6 for the mRNAs that were microinjected into X. laevis oocytes or Hi-T7 polymerase for the in vitro experiments performed using MALAT1 and MALAT1scr . The purity and molecular weight of RNA were confirmed using gel electrophoresis (Figure 7G). Confirmed RNA transcripts were aliquoted into single use volumes (2 μL), flash frozen with liquid N2, and stored at −80°C. MALAT1 and MALAT1scr transcripts were labelled with AlexaFluor 647 hydrazide as described previously.49

QUANTIFICATION AND STATISTICAL ANALYSIS

In each of the sections that follow, we describe the methods obtaining and analyzing data from in vitro measurements based on right angle light scattering, time-resolved fluorescence quenching, single molecule imaging and tracking, fluorescence correlation spectroscopy, microfluidic resistive pulse sensing, and analysis of QF-DEEM images. The extraction of quantitative information from each of these measurements and the statistical analyses are customized for each method and are described in the paragraphs that describe the methods used to deploy the measurements and analyze the data. In general, for each measurement, our data are reported for at least three biological replicates and at least three technical replicates. In most cases, our data exceed these minimal thresholds. The sections that follow also describe the methods and analysis of data from measurements performed by imaging GVs of Xenopus oocytes following injections and incubation of mRNAs. The sections below also describe the methods used for computing various parameters and the details of the atomistic simulations. Each of these sections have self-contained details regarding the methods of quantification of the information provided in the main text and supplemental figures, and the statistical analyses used in the quantifications.

Right angle light scattering (RALS)

All constructs were dialyzed against the experimental buffer (10% v/v glycerol, 20 mM HEPES (pH 7.4), 50 mM KCl, 5 mM MgCl2) using Pierce Microdialysis Plates (ThermoFisher Scientific, USA). The intensity of static right-angle light scattering at 320 nm (I) was measured at each titration point using PTI Quantamaster (Horiba Scientific, Japan). The intersection of linear fits to I versus [protein] (log-log plot) identified discontinuity points in the concentration dependence of the scattering intensity, which is indicative of an assembly formation and/or phase transition. For each experiment, we report results using at least two biological and three technical replicates. Note that the RALS data collected at a series of different concentrations are proof of reversibility of the microphases because the intensities are recovered by increasing or decreasing the concentration.

Optimal linear fits were determined using a modified jackknife approach.130 Each dataset was subjected to two independent series of linear fits, with one series of fits for the low concentration arm (LCA) and the second series of fits for the high concentration arm (HCA). For each arm, the fitting procedure was initiated with a linear fit to the four lowest or highest concentration data points, respectively, and the root mean square error (RMSE) of each fit was recorded. Following these initial fits, the next highest or lowest concentration data point was added to the respective LCA or HCA data set, the expanded data sets were re-fit, and the new RMSEs were recorded. This process was continued, expanding the fitted dataset by one data point at a time, until all the points in the full dataset were included in both the LCA and HCA linear fits. The intersections of each pair of LCA and HCA best fits were recorded, and the average intersection point for all best fits, for all trials at a given protein concentration, were determined. This is the value reported as the threshold concentration, cth, or as cμ for microphase separation. The presence of a discontinuity in light scattering intensity, measured over a range of protein concentrations, indicates the formation of a distinct phase defined by large assemblies.130-132

Fluorescence lifetime measurements

Samples were prepared with noted concentration of protein and a constant concentration of 20 nM pyrene. The 340 nm LED was used as an excitation source with a 365 nm long pass filter in an EasyLife X Lifetime Fluorescence Spectrometer (Horiba Scientific, Japan). The fluorescence decay curves were recorded using the EasyLife X software and fitted to the following equation in Prism 10 software (GraphPad, USA): I(t)=I0ekdecayt where I(t) and I0 are fluorescence intensities at time t and at t=0, respectively. Pyrene fluorescence lifetime was quantified as the inverse of the rate constant kdecay:

Time-resolved Fluorescence Quenching (TRFQ)

Samples were prepared with noted concentration of protein and a constant concentration of 20 nM pyrene and 0.4 μM cetylpyridinium chloride. The 340 nm LED was used as an excitation source with a 365 nm long pass filter in an EasyLife X Lifetime Fluorescence Spectrometer (Horiba Scientific, Japan).

The fluorescence decay curves were recorded using the EasyLife X software and fitted by Equation 1 in Prism 10 software (GraphPad, USA).

I(t)=I(0)exp(tτ0R[1exp(tτQ)]) (Equation 1)

I(t) and I(0) are fluorescence intensities at time t and zero, respectively. τ0, τQ, and R are three parameters representing the results of the nonlinear fit. τ0 is pyrene lifetime in the absence of quencher. τQ is pyrene lifetime in the presence of quencher. In the presence of quencher, the fluorescence decay is described by the Equation 1 with kQ=1τQ being the rate constant for inter microphase quenching. The extracted parameter R, which is equal to cQcμ, is the ratio of the quencher concentration cQ to the concentration of microphases, cμ. The number of molecules per microphase, nμ, can be calculated using the following relationship (Equation 2):

nμ=R[protein]cμcQ (Equation 2)

The pyrene concentration was chosen to be less than 0.05 × quencher concentration which avoids the formation of pyrene excimers. Individual solutions at a given protein concentration were prepared by dilution from the solution with the largest protein concentration. For each solution condition, the fluorescence decay curves were recorded and analyzed to determine nμ. The plots of the number of molecules per microphase, nμ, were generated over the protein concentrations and linearly fit to obtain the y-intercept as an estimated number of molecules per microphase at cμ. The reliability index, Φ, of the estimate was determined as a product of the standard error of the mean of the estimate of nμ, extracted from the linear regression, and the slope of the line of best fit. Small values of Φ signal high reliability in the estimation of nμ, whereas large values of Φ imply high unreliability. For constructs with high unreliability, nμ values were estimated assuming the identical packing fraction as SRSF1 of 0.29. The packing fraction was calculated using Equation 3:

p=nμ(Rg3Rμ3) (Equation 3)

In Equation 3, Rg is the average of radius of gyration of a single molecule obtained from atomistic simulations. Rμ is the median radius of microphases, which is d¯2, the median diameter of microphases divided by 2, obtained from the size distributions of microphases determined from QF-DEEM images, and nμ is the number of molecules within a microphase.

Fluorescence Confocal Microscopy

Confocal microscopy imaging performed on the Eclipse Ti2 microscope (Nikon, Japan) with a Yokogawa CSU X1 disk module and a LunF laser launch equipped with 405 nm, 488 nm, 561 nm, and 647 nm lasers, using a 60X, 1.4 NA Apo oil immersion objective (Nikon, Japan) and an Orca Flash 4.0 CMOS camera (Hamamatsu, Japan). All images were captured at room temperature using NIS-Elements software (Nikon, Japan) and saved as 16-bit ‘.nd2’ files. Images within a data set were taken with identical imaging parameters ensuring that signal was not saturated. No averaging, binning, and projecting were used. All images of purified SRSF1 and RNA constructs shown are representative crops of one or a few entities (e.g., a condensate) where the brightness and contrast have been optimized. Line scans were obtained using ImageJ and plotted using Prism 10 (GraphPad, USA). Samples were prepared with 1:250 molar ratio of labeled-to-unlabeled protein and 1:50 labeled-to-unlabeled RNA and imaged in silicone wells mounted on a coverslip.

Single-molecule imaging of SRSF1 with PA-JF-549

We covalently labeled SRSF1 protein with photoactivatable Janelia Fluor® 549 (PA-JF-549) NHS ester as described above for AlexaFluor labeling. For imaging, we mixed 1 μM of unlabeled SRSF1 with labeled SRSF1 at the molar ratio of 1:50, such that each microphase is labeled, on average, with an individual fluorophore based on the determined number of SRSF1 molecules per microphase. A reduced dilution of 1:100 was applied for larger cluster imaging to mitigate the background signal. For each sample, 15 μL of solution was placed in a silicon well (Grace Bio-Labs, USA) adhered to a cover slip.

Single-molecule microscopy was performed with a home-built wide-field epi-fluorescence microscope, equipped with an oil-immersion objective (UPLSAPO100XO, NA 1.4, Olympus, Japan), a dichroic mirror (Semrock Di01-R488/561), and an emission filter (Semrock FF01-593/46). Activation (405 nm) and excitation (561 nm) lasers shared the same optical path to illuminate the sample or activate photoactivatable fluorophore, respectively. The PA-JF-549 was activated under constant low intensity (~50 W·m−2) activation wavelength. Autofluorescence and crosstalk were not observed with 405 nm activation in the emission channel. A high peak intensity (~4088 W·m−2) of 561 nm laser was used to excite activated PA-JF-549. Single-molecule images and trajectories were acquired at 50 Hz for 20,000 frames for an average of ~300 photons per molecule in each 20 ms frame. Imaging was performed in triplicate from three independent preparations.

Single-molecule image analysis: localization and tracking

Single-molecule blinking and movements were observed within the acquired image stacks. Direct projections were created by summarizing all pixel information over time on finding either the maximum value (maximum projection) or the standard deviation (std projection) of the temporal intensity profile of each pixel. The maximum projection indicates the spatial distribution and emission intensity of PA-JF-549, and the std projection indicates the switching rate between the “fluorescent” and “dark” states. Both projections were used to evaluate and optimize activation/excitation control and labeled / unlabeled protein mixture ratio for observation of single molecule behaviors.

The localization and tracking of individual molecules were performed as described80 by applying the Robust Statistical Estimation algorithm (RoSE).133 All position estimates were subsequently classified and grouped into molecular displacements. This localization algorithm enables accurate position measurements of molecules on each frame, further improving the temporal resolution. Single molecule localization depends on the sequential detection and position estimation of individual molecules in each frame. A maximum likelihood estimator was used to fit a standard point spread function to the images and measure the sub-pixel lateral positions and brightness of each molecule. The localization algorithm has ~13 nm precision in measuring lateral position of each molecule. This theoretical limit on precision is evaluated from the average of the measured fluorescence signal (300 photons) and background (9 photons).

The dim emitters were filtered out and their positions were grouped into emitter displacements by connecting estimates between two consecutive frames. Two estimates on separate frames were classified as the same emitter if: 1) they were the closest pair of estimates between the frames, and 2) their distance, d, was within the confinement radius of 200 nm. All distances were computed in Euclidean form. We classified the speed of movement of each emitter by applying a threshold to the displacement, with displacements of d < 50 nm categorized as slow and those above 50 nm as fast. This length scale was chosen because it provided the clearest segregation of the displacement of the emitters. The 2D histograms of fast- and slow-moving emitters were generated by counting the number of fast and slow molecules originating from each binned 25-by-25 nm2 region on the spatial coordinates.

Excess variance of displacements across all bins within a mask were calculated as described in Equation 4:

Excess variance=Variance of bin valuesMean of bin valuese (Equation 4)

Fluorescence Correlation Spectroscopy (FCS)

Two samples were prepared in 8-well plates (0.17 ± 0.005 mm thickness) in the experimental buffer, assumed to have the same viscosity as water: (1) a mixture of 5 nM unlabeled MALAT1 and 0.05 nM labeled MALAT1 and (2) 0.05 nM free dye. MALAT1 was labeled with AlexaFluor 647 hydrazide. The free AlexaFluor 647 hydrazide was used to determine the size of the beam waist, and the measured diffusion time, extracted from the autocorrelation function, was used to estimate the hydrodynamic radius (Rh) of MALAT1 per the Stokes-Einstein Equation. The diffusion coefficient of AlexaFluor 647 hydrazide with the instrument setup used here is 3.3 x 10−10 m2s−1. All data were collected on a Confocor II LSM system (Carl Zeiss-Evotec, Jena, Germany) with a 40x water-immersion objective. Samples were excited at 633 nm, and emission was collected with a 650 nm cut-on long pass filter. 10 replicates of 10-second fluorescence intensity autocorrelation function traces were collected and analyzed with Zeiss Confocor II FCS software. To extract the diffusion time, the autocorrelation function G(t) was fit to Equation 5 assuming a single-component system:

G(t)=1+1N1(1+tτD)(1+tτDr02z02)12 (Equation 5)

where τD is the diffusion time, N is the number of fluorophores in the confocal detection volume, t is the correlation time, r0 is the resolution in the x-y plane, and z0 is the resolution in the z plane.

The Stokes-Einstein equation, as written out in Equations 6 and 7, was used to estimate the particle hydrodynamic radius Rh:

Rh=kBTτD6πηDrefτD,ref (Equation 6)
τD=r024D (Equation 7)

Here, kB=1.38×1020m2gs2K1 is the Boltzmann’s constant, T=296K is the temperature in Kelvin, η=0.94gm1s1 is the viscosity of the buffer, assumed to be the same as that of water, and D is the diffusion coefficient.

Surface Electrostatic Potentials (SEPs)

The RRM structures were extracted from full-length structures of each protein from AlphaFold120,134,135 (SRSF1: AF-Q07955-F1-v4; SRSF3: AF-P84103-F1-v4; SRSF5: AF-Q13243-F1-v4; SRSF7: AF-Q16629-F1-v4; TDP-43: AF-Q13148-F1-v4, hnRNP-A1: AF-P09651-F1-v4). Site-specific surface electrostatic potentials (SEPs) of RRMs were determined using “Coulombic Surface Coloring” on the solvent-excluded molecular surfaces generated by UCSF Chimera.125 The following default parameters were used in generating the molecular surfaces by a rolling probe sphere: probe radius of 1.4 Å to approximately represent a water molecule, vertex density of 2.0 vertices per Å2, and line width of 1.0 pixel in mesh surface representation. The SEP values from each vertex of the surfaces were recorded and plotted as scatterplots.

Hydrophobicity

Hydrophobicity values were determined along the RRM sequences using a window size of 9 according to the Kyte-Doolittle scale136 using ProtParam of ExPASy.126 The hydrophobicity values were plotted as scatterplots in Figures S1E, S4B, and S6B.

Generation of biconcave disc geometry

The biconcave discs were generated as a continuous surface from a quartic equation (Equation 5) using the Mathematica package (Wolfram, USA) script from Kuchel et al.,127 while varying the diameter of the biconcave disc (d), thickness of the dent at the center (b), and maximum height at the rim (h) as noted in each figure caption. The Mathematica notebook is included in supplemental information. In brief, the quartic equation used was:

(x2+y2+z2)2+P(x2+y2)+Qz2+R=0 (Equation 8)

where P and Q are coefficients, R is the constant term, and x, y, and z are coordinates. Here, P, Q, and R, are defined as follows:

P=d22+h22(d2b21)h22(d2b21)(1b2h2)12Q=Pd2b2+b24(d4b41)R=Pd24d416

Microfluidics Resistive Pulse Sensing

Microfluidics Resistive Pulse Sensing (MRPS) measurements were performed on an nCS1 instrument (Spectradyne, USA). This technique measures the change in electrical resistance as particles of different size pass through a constriction of defined resistance.65 The abundance and the size of particles are determined based on the transit times and the magnitudes of the electrical signals, respectively.

Samples were prepared at indicated concentrations in experimental buffer (10 % v/v glycerol, 20 mM HEPES, pH 7.4, 5 mM MgCl2, and 50 mM KCl). 5 μL of each sample was measured using C-2000 cartridges, with a measurement range of 250–2000 nm. At least three acquisitions were collected and combined for analysis, with each acquisition being collected over 10 minutes. Control experiments were performed with buffer, in the absence of protein, where no significant abundance of particles was detected. The particle size distributions from the triplicate experiments were averaged and the resulting curve was smoothed using the non-linear least squares locally weighted scatterplot smoothing (LOWESS) method as implemented in Prism (GraphPad, Boston, MA, USA). The algorithm in Prism uses the number of neighboring points, input by the user, to create a smooth curve. We set the number of neighboring points to ten. This was deemed reasonable since each MRPS measurement generates 200-300 independent points.

The particle size distributions were fitted to Lorentzian distributions to accommodate the presence of low abundances away from the center of the peak using nonlinear least squares regression, set to converge when five iterations in a row change the sum-of-squares by less than 0.0001% (Equation 9):

L(d)=A1+(dcd)2w (Equation 9)

where A is the particle abundance at the center of the distribution, cd is the center of the distribution, w is the width of the distribution, and d is the particle diameter. The cd, which is located at the peak of the Lorentzian, were plotted as the diameter of the clusters (Figure S4L).

Average fitted parameters for Figure 3B are as follows. 0.5 μM: A = 5167 ± 321, cd = 358.9 ± 20.4, w = 74.9 ± 14.0; 1.0 μM: A = 8011 ± 347, cd = 345.7 ± 25.8, w = 83.5 ± 14.1; 1.5 μM: A = 7098 ± 269, cd = 386.1 ± 26.3, w = 79.1 ± 10.6; 2.5 μM: A = 12099 ± 327, cd = 378.4 ± 43.4, w = 84.6 ± 11.8.

Isolation and image analysis of fluorescent protein-positive bodies from Xenopus laevis oocytes

Harvested oocytes were manually disrupted from clustering, then subjected to collagenase digestion (gentle rocking) for 2 hours at 18°C. Oocytes were stored in ND96 Buffer (5 mM HEPES, 96 mM NaCl, 2 mM KCl, 1.8 mM CaCl2, 1 mM MgCl2) that was filter-sterilized and supplemented with 2.5 mM sodium pyruvate (ThermoFisher, USA) and 1X penicillin-streptomycin (MilliporeSigma, USA) at 18°C. Healthy stage VI oocytes were selected and injected using freshly pulled microneedles (Drummond, USA) and an injector (Nanoinject II, Drummond, USA). A total of 23 nL of mRNA in ddH2O (typically at a total mass of 20 ng) encoding for noted constructs were injected into each oocyte. Injected oocytes were stored individually in wells in a 48-well polystyrene SterileTissue Culture Plates (Fisher Scientific, USA) supplemented with ND-96 buffer at 18°C for at least 18 hours to allow expression and localization of exogenous protein. Immediately prior to imaging, the germinal vesicles were manually dissected in mineral oil and mounted on a glass slide with 6 μL of mineral oil. A 22 x 22mm glass coverslip was overlaid onto the sample and then immediately imaged. This procedure was carried out for at least two separate harvests of oocytes and similar results were obtained across these biological replicates. All images shown are of a single Z-slice that is representative and is at least 2 mm above the coverslip; brightness and contrast have been optimized.

The diameters of fluorescent protein-positive bodies expressing the noted mRNA-encoded constructs were determined using Fiji. The recorded.nd2 files were split into each fluorescent and brightfield channels for analysis. The fluorescent channel images was converted to 8-bit from 16-bit and turned into a binary image by adjusting the intensity threshold to 99%. Using the “Analyze particle” and the “Particle manager” tools, the mask for the fluorescent protein-positive bodies was used to obtain the area. The area was assumed to be a perfect circle and used to determine the diameter of the SRSF1-positive bodies as d=2sqrt(areaπ). Apparent bodies smaller than 32 pixels (0.10835 mm/pixel) were omitted due to falling below resolution limits. At least 100 fluorescent protein-positive bodies from at least three independent GVs were analyzed for each construct.

Quick-freeze deep etch electron microscopy (QF-DEEM)

Samples for freeze-fracture were prepared in the experimental buffer (10 % v/v glycerol, 20 mM HEPES, pH 7.4, 5 mM MgCl2, and 50 mM KCl) at a concentration above the cμ, except for allY A1-LCD. The allY A1-LCD was dialyzed against 20 mM HEPES (pH 7.0), 1 M NaCl and diluted to a final concentration at 20 mM HEPES (pH 7.0), 150 mM NaCl final at approximately 20°C. The saturation concentration (csat) for allY A1-LCD under the final solution condition is 50 μM.79 3 μL drops of sample were added onto a glass chip supported on boiled and fixed (2% glutaraldehyde) egg white prior to rapidly pressing the sample onto a liquid helium-cooled copper block polished to a mirror finish. Frozen samples were transferred to liquid nitrogen for storage prior to replica generation.

Replicas of each sample were generated using a Leica EM ACE900 Freeze Fracture System. In brief, quick-frozen samples were transferred from liquid nitrogen to the ACE900 Freeze Fracture System. The surface was fractured at −115 to −120°C using a cold knife held at −185°C. Samples were then etched at −104°C for 3 minutes prior to shadowing with 4-5 nm Pt/C at an angle of 24°. The replica was supported with 8 nm of carbon deposited at an angle of 85° for a total of 12 nm thickness. Replicas were washed in 48 wt% HF to remove the glass support prior to washing several times with deionized water. Cleaned replica were transferred to copper grids for TEM imaging (JEOL JEM-1400 Plus) at 120 kV.

QF-DEEM Image Analysis

Identification of microphases from.tif files was performed as follows:

  1. Particle (microphase) identification in a rectangular bounding box using a difference of Gaussians method

  2. Fitting to ellipses within bounding boxes

  3. Estimating the number of particles within a probe circle of a defined radius.

A Gaussian filter was applied on each image with a standard deviation of σ=1 and σ=3. Both operations smooth the image, which when subtracted, identifies boundaries (“bounding box”) of particles. Images which have even intensity produce consistent detections while images with uneven intensity may require different thresholds to be applied for sections of image. Particles within a defined range of area (e.g., 100–500 pixel2 for 10,000x magnification) are selected for further processing. The detected particles within the bounding box are fitted to an ellipse and scaled by (ab)12, where a is the semimajor and b is the semi-minor axis, respectively, of the fitted ellipse. Then, a circular ansatz is applied to the fitted ellipses. Particles were excluded if: 1) the fitted ellipse eccentricity was greater than 0.6; or the area of the fitted ellipse covered less than 40% of the area of the bounding box. These two criteria in combination allowed us to achieve high sensitivity in detecting areas that were circular and confidently fitted within the bounding boxes.

The average number of particles found within a probe circle of diameter of 380 nm was determined using a sweep approach by generating a circular mask. If a particle was at the edge of the probe circle, it was still considered as detected within the circular mask. Particles are detected within the probe circle as it scans over the image at a 100-pixel interval. To ensure the entirety of the probe circle remains within the image, the scan was constrained to the region of an image that excludes 100 pixels along the edge of the image. A box plot of the number of particles found in all non-empty sweeps are recorded.

Fluorescence Recovery after Photobleaching (FRAP)

FRAP was performed on the LSM 980 with Airyscan 2 (Zeiss, Germany) using a 63X high NA oil immersion objective with 488 nm excitation laser at 0.2% laser power and electron gain of 650. All images were captured at room temperature using ZEN Connect. The image was colored with Fire Lookup Table using ImageJ.137 The fluorescence intensity was plotted using Prism 10 (GraphPad, USA). Samples were prepared with 1:250 molar ratio of labeled-to-unlabeled AlexaFluor 488-SRSF1 and imaged in silicone wells mounted on a coverslip.

Atomistic simulations of individual molecules and pairs of molecules

All-atom Metropolis Monte Carlo (MC) simulations were performed using the ABSINTH implicit solvent model and forcefield paradigm as made available in the CAMPARI simulation package (http://campari.sourceforge.net).60,61 The simulations utilized the abs_3.5_opls.prm parameter set in conjunction with optimized parameters for neutralizing and excess Na+ and Cl− ions.138 Each simulation was performed using spherical droplets with a diameter of 200 Å and explicit ions to mimic a concentration of 1.7 mM NaCl. Ensembles corresponding to a simulation temperature of 340 K were used in the analysis reported in this work. This choice reflects recent calibrations, which show that better concordance with experimental data and improved sampling efficiency are achieved at the higher temperature.18,139

For simulations involving the folded domains, we used the predicted structures from AlphaFold (Uniprot: Q07955 (SRSF1), P84103 (SRSF3), Q13243 (SRSF5), Q16629 (SRSF7), Q13148 (TDP-43), P09651-2 (hnRNP-A1)).120,134 Conformations of the RRMs were initially equilibrated to ensure consistency of bond lengths and bond angles with the ABSINTH model, as was done in previous work.140,141 For simulations involving the IDRs, a single simulation was performed at 20 K to build the IDRs. Then the end PDB structures from these simulations were used to generate ten distinct full-length starting structures to seed ten independent simulations, each initiated using a distinct random seed. The backbone flexibility of folded domains was limited by applying restraints on the starting structure,139 whereas the sidechain degrees of freedom were unrestrained. To free the simulations of biases from the initial coordinates, a short, high-temperature (500 K) simulation comprising 2×104 MC steps was performed. The final structures from each of the high-temperature simulations were then used as the input structures for ten independent simulations. The temperature for the production runs was set to be 340 K. Each simulation comprised a total of 4.0×107 MC steps that combined the full spectrum of translational, rotational, pivot, local, and concerted moves.61,139,142 Conformations pertaining to the first 5×106 MC steps were discarded as equilibration. Simulation results were analyzed to extract quantities of interest including Rg values using the MDTraj143 and SOURSOP packages.124

In simulations with pairs of domains designated as X and Y, the goal was to extract pairwise inter-domain interaction coefficients. These serve as proxies of normalized second virial coefficients. To achieve this, each simulation comprised two domains X and Y that were either a pair of RRMs, a pair of IDRs, or an RRM and an IDR. Simulations were performed in large spherical droplets. Each droplet is over an order of magnitude larger than the dimensions of an individual domain represents an ultra-dilute system for a pair of domains. In a finite unbiased simulation, most of the inter-domain distances sampled would correspond to dissociated molecules. To overcome this problem, we applied an inter-domain distance restraint, modeled using a harmonic potential. Pairs of alpha carbon atoms located close to the centroid of each molecule were restrained to be within a shallow harmonic well, defined by a spring constant of 10 kcal/mol-Å and an equilibrium distance of 75 Å. Note that no restraints were applied on any other inter-residue or inter-atomic distances. From the ensemble of equilibrated configurations for each of the distinct pairs of domains, extracted from ten independent simulations for each pair, we quantified statistics for inter-residue distances using MDTraj143 and SOURSOP.124 These statistics were used to compute the ensemble-averaged distances RXY(ij) between pairs of residues i and j, where the former is located on domain X and the latter on domain Y. To calibrate the observed pattern of inter-domain, inter-residue distances against a suitable prior, we performed reference simulations to model the expectations for an ideal, non-interacting pair of the domains of interest. In this non-interacting prior, the domains freely sample the available simulation volume, with the presence of the weak restraint as described above. These reference simulations were performed using the inverse power potentials introduced by Tran et al.144 In these simulations, the non-bonded interactions are such that all terms, excepting the Lennard-Jones repulsions were switched off. The exponent for the repulsive exponent was set to six, which corresponds to Maxwellian particles, and the ε was set to 0.0001. This choice was based on calibrations, which showed for IDRs, irrespective of the sequence, an exponent of six and ε of 0.0001, with all other non-bonded interactions switched off, the ensemble-averaged intramolecular, internal distance profiles showed fractal behavior whereby the distances between pairs of residues i and jRijji0.5. This is the scaling expected for a Flory random coil.140,141,145 Inter-domain, inter-residue distances extracted from simulations using pairs of domains based on the inverse-power potential with an exponent of six and ε of 0.0001 were used to extract ensemble-averaged distances denoted as RXY,ideal(ij).

Generation of computed interaction matrices

For each pair of domains, X and Y, the computed sets of ensemble-averaged, inter-domain, inter-residue distances denoted as RXY(ij) and RXY,ideal(ij), were used to compute excess interaction coefficients as follows:

ΔXY(ij)=RXY(ij)RXY,ideal(ij)RXY,ideal(ij) (Equation 10)

The parameter defined in Equation 10 varies between −1 and +1. Next, we used the excess interaction coefficient for each pair of residues across the domains X and Y, as defined in Equation 10, to compute the excess interaction coefficient ΔXY using:

ΔXY=sgn(Δ)i=1NXj=1NY[ΔXY(ij)]2NXNY (Equation 11)

Here, ΔXY is the grand sum (defined as the sum over all the elements) of the upper or lower triangular matrix, where each element of the matrix is ΔXY(ij) as defined in Equation 10. The signum or sign function sgn is such that, the sgn(Δgn(Δ) is set t if Δ if Δ) is if Δ if Δ) is s if Δ if Δ) is set to −1n function sgn is such that,NX and NY refer to the numbers of residues within domains X and Y, respectively. This procedure yields values for ΔXY that are normalized and signed such that 1ΔXY+1.

Simulations of internal organization within microphases of SRSF1, SRSF1Δ213-247, and TDP-43

All-atom Markov Chain Metropolis Monte Carlo (MC) simulations were performed using the ABSINTH implicit solvent model and forcefield paradigm as made available in the CAMPARI simulation package (http://campari.sourceforge.net).60,61 The simulations utilized the abs_3.5_opls.prm parameter set.138 Each simulation was performed using spherical droplets with a diameter of 300 Å (for SRSF1 and TDP-43) or 400 Å (for SRSF1Δ213-247), which are consistent with QF-DEEM observations. Ensembles corresponding to a temperature of 340 K were used in the analysis reported in this work. Copies of a molecule of a resulting conformation from a single molecule simulation (see above) were placed on a cuboid grid using a 4x3x2 lattice for 24 molecules of SRSF1, a 4x4x3 lattice for 48 molecules of SRSF1Δ213-247, and a 4x3x1 lattice for 12 molecules of TDP-43. On each lattice, the spacing between the molecules was 90 Å. The cuboids were then placed in a spherical droplet as a boundary determined by the mean diameter of microphases observed in the QF-DEEM images, which were as follows: 300 Å for SRSF1, 400 Å for SRSF1Δ213-247, and 300 Å for TDP-43. To free the system of biases from the initial coordinates, ten replicates of a short simulation of 1×104 MC steps, allowing only for 3D rigid body moves of the molecule, were performed to generate distinct starting structures for independent simulations. These rigid body moves enable a loss of memory of the original placements of molecules on a grid. The final structures from each of the rigid-body simulations were then used as the input structures for 50 independent simulations for TDP-43, 100 independent simulations for SRSF1, and 200 independent simulations for SRSF1Δ217-243. These numbers account for the total number of atoms in each system. Each simulation was initiated using a different random seed. For the folded domains, the backbone flexibility was limited by applying restraints on the starting structure, whereas the sidechain degrees of freedom were unrestrained. Each simulation comprised a total of 5.0×105 MC steps that combined 10% of rigid body moves and 90% of the full spectrum of translational, rotational, pivot, local, and concerted moves. Conformations pertaining to the first 1×103 MC steps were discarded as equilibration.

For each microphase, the radius of gyration of the system, Rg,μ, was determined from the eigenvalues (λx, λy, λz) of the gyration tensor as follows:

Rg,μ=λx2+λy2+λz2 (Equation 12)

Generation of gD(r) profiles

The radial distribution profiles, gD(r), for each domain D were computed from atomistic simulations of microphases. For each domain D, we chose a fixed number of sites, with the number of sites scaling proportional to the Rg of each domain: 10 each for SRSF1 RRMs, 15 for SRSF1 IDR1, 25 for SRSF1 IDR2, 15 for SRSF1 IDR2Δ213-247, and 5 for each domain in TDP-43. At a 5 Å interval from the center-of-mass of the microphase, the relative probability of observing a site of a specific domain D was determined by referencing to a prior of 4πpr2Δ of obserp is the packing fraction as described in the time-resolved fluorescence quenching (TRFQ) section of STAR Methods. The packing fractions are 0.29 for SRSF1, 0.1 for SRSF1Δ213-247, and 0.07 for TDP-43. Then, the gD(r) quantifies the density of observing a domain D relative to the prior. The analysis was performed over the last 80 frames of each simulation and repeated for each replicate. Mean values across all frames and replicates are plotted as solid circles and lines. The samples were then pooled and bootstrap analysis across the 800 samples for each system was performed using sampling with replacement, whereby 102 distinct samples were generated by randomly choosing 200 data points from the pool of samples. Each bootstrap trial is shown as a lighter colored line on the same plot.

Sequence feature comparison by NARDINI+

The current version of NARDINI+48,49 uses the protein sequences from the human proteome from Swissprot Homo sapiens database (May 2015, 20882 entries).146 All IDR sequences within the SRSFs and TDP-43 were analyzed in terms of 90 sequence features that quantify to compositional biases and the binary patterning of pairs of residue types with respect to one another.48,49,147

Non-random binary patterns were extracted using the NARDINI algorithm.47 This helps quantify if pairs of residue types are non-randomly distributed along the linear sequence. The non-randomness is quantified in terms of z-scores, whereby deviations from z-scores of zero quantify the magnitude and type of non-randomness. A positive z-score implies non-random segregation of the pairs of residue types in question, whereas a negative z-score implies non-random mixing of the pairs of residue types. For this analysis, we followed the approach of Cohan et al.,47 and grouped residues as follows: pol (S, T, N, Q, C, H), hyd (I, L, M, V), pos (K, R), neg (E, D), aro (F, W, Y), ala (A), pro (P), and gly (G).

Compositional biases were determined by analyzing 54 parameters, that include the fractions of each amino acid type (20 features); the fractions of positive, negative, polar, aliphatic, and aromatic residues (5 features); the ratios of Arg to Lys and Glu to Asp (2 features); the fraction of residues promoting chain expansion (FCE), the fraction of charged residues (FCR), the net charge per residue (NCPR), the fraction of disorder promoting residues, the hydrophobicity, the isoelectric point, and the polyproline-II propensity (7 features). Most compositional features were quantified using localCIDER.123 The “RG Patch” feature is defined as the fraction of the sequence that is made up of patches of an Arginine-Glycine (RG) pair. Here, a patch had to have at least four occurrences of the given residue or two occurrences of RG and could not extend past two interruptions. For the various compositional features, z-scores for each IDR were calculated using the mean and standard deviation of the human IDRome. Z-scores greater than zero indicate an enrichment of the compositional feature compared to the human IDRome, and z-scores less than zero indicate a depletion of the compositional feature compared to the human IDRome.

Supplementary Material

Video S1. Simulation movie for SRSF1, related to Figure 2J. Movie from a representative atomistic simulation of 24 SRSF1 molecules within a spherical boundary of 30 nm in diameter over 8000 frames at an interval of 100 frames. The domains are colored according to the scheme used in Figure 2J.
Download video file (4.5MB, mp4)
Video S2. Simulation movie for SRSF1Δ213–247, related to Figure 2N. Movie from a representative atomistic simulation of 48 SRSF1Δ213-247 molecules within a spherical boundary of 40 nm in diameter over 8000 frames at an interval of 100 frames. The domains are colored according to the scheme used in Figure 2N.
Download video file (3.8MB, mp4)
Video S3. Single-molecule tracking of slow-moving SRSF1 molecules, related to Figure 3G. Blue circles indicate localized SRSF1 molecules in each 20 ms frame, while blue lines represent single-molecule trajectories across frames. Older trajectories gradually fade over time. Raw frames containing slow dynamics (displacements less than 50 nm per 20 ms) are concatenated together. Empty frames are omitted.
Download video file (5.8MB, mp4)
Video S4. Single-molecule tracking of fast-moving SRSF1 molecules, related to Figure 3G. Orange circles indicate localized SRSF1 molecules in each 20 ms frame, while orange lines represent single-molecule trajectories across frames. Older trajectories gradually fade over time. Raw frames containing fast dynamics (displacements over 50 nm per 20 ms) are concatenated together. Empty frames are omitted.
Download video file (7.7MB, mp4)
Video S5. Simulation movie for TDP-43, related to Figure 6I. Movie from a representative atomistic simulation of 12 TDP-43 molecules within a spherical boundary of 30 nm in diameter over 8000 frames at an interval of 100 frames. The domains are colored according to the scheme used in Figure 6I.
Download video file (3.9MB, mp4)
Supplementary Table S1: Ratio of absorbance at 260 nm (A260) to absorbance at 280 nm (A280). Related to Figure S7F.
7

Supplemental information can be found online at https://doi.org/10.1016/j.cell.2025.11.026.

KEY RESOURCES TABLE

REAGENT or RESOURCE SOURCE IDENTIFIER
Chemicals and other resources
Kanamycin Sigma 1355006
Spectinomycin Fisher Scientific ICN15899302
Chloramphenicol Gold Biotechnology C-105
LB broth (Lennox) Sigma L3022
NHS AlexaFluor 488 ThermoFisher Sci A20000
NHS AlexaFluor 405 ThermoFisher Sci A30100
AlexaFluor 647 Hydrazide ThermoFisher Sci A20502
PA JaneliaFluor 549, NHS ester Tocris 6149
HEPES Sigma 54457
NaCl Sigma S3014
KCl Sigma P9541
MgCl2 Sigma M8266
Glycerol Fisher Sci G33-1
Dithiothreitol Gold Biotechnology DTT10
HisPur Ni-NTA resin ThermoFisher Sci 88221
Imidazole Sigma I202
Pyrene Sigma 82648
Cetylpyridinium chloride Sigma C0732
Critical Commercial Assays
Fisher Pure link quick plasmid miniprep kit Fisher K210011
T7 mMESSAGE mMACHINE Transcription Kit Fisher M0255A
Monarch RNA clean up NEB T2040L
Biological samples
Stage VI Xenopus laevis oocytes Jonathan Silva Laboratory at Washington University in St. Louis N/A
Deposited data
HeLa cell protein abundances Nagaraj et al.77 N/A
AlphaFold structures Jumper et al.120 SRSF1: AF-Q07955-F1-v4 TDP43: AF-Q13148-F1-v4 SRSF3: AF-P84103-F1-v4 SRSF5: AF-Q13243-F1-v4 SRSF7: AF-Q16629-F1-v4; SRSF9: AF-Q13242-F1-v4; hnRNPA1: AF-P09651-F1-v4
Bacterial strains
DH5α strain E. coli cells New England Biolabs C2987I
BL21-CodonPlus(DE3)-RIPL cells New England Biolabs 230280
Recombinant DNA
pSMT3-SRSF1 Fargason et al.72 Addgene: 201056
pET28-SRSF1 This study N/A
pET28-SRSF1Δ213-247 This study N/A
pET28-SRSF1ΔIDR2 This study N/A
pET28-SRSF3 This study Uniprot: P84103 (Human)
pET28-SRSF5 This study Uniprot: Q13243 (Human)
pET28-SRSF7 This study Uniprot: Q16629 (Human)
pET28-ch-SRSF1 This study N/A
pET28-ch-hnRNPA1 This study N/A
pET28b(+)-SUMO-TDP43 Grese et al.121 N/A
pDEST17-A1LCD_deltahexa Martin et al.18 Addgene: 169714
pGEMT-MALAT1scr This study N/A
Software and Algorithms
Prism GraphPad https://www.graphpad.com/
ZEN Zeiss https://www.zeiss.com/microscopy/us/products/software/zeiss-zen.html
FIJI Schindelin et al.122 https://imagej.net/software/fiji/
NARDINI+ Cohan et al.47 and King et al.49 https://www.github.cco/mshinn23/nardini; https://github.com/kierstenruff/RUFF_KING_Grammars_of_IDRs_using_NARDINI-
localCIDER Holehouse et al.123 http://pappulab.github.io/localCIDER/
SOURSOP Lalmansingh et al.124 https://github.com/holehouse-lab/soursop
CAMPARI Vitalis and Pappu60 https://campari.sourceforge.net/
UCSF Chimera Pettersen et al.125 https://www.rbvi.ucsf.edu/chimera
ProtScale (Expasy) Gasteiger et al.126 https://web.expasy.org/protscale/
Custom Python 3 scripts for DEEM analysis This study https://www.github.com/Pappulab/microphase-assemblies/
Custom Python 3 scripts for gD(r) analysis This study http://doi.org/10.5281/zenodo.17096527
Custom Python 3 scripts for interaction matrix construction This study http://doi.org/10.5281/zenodo.17096527
Custom Python 3 scripts for single-molecule trajectory analysis This study http://doi.org/10.5281/zenodo.17096527
Mathematica notebook Kuchel et al.127 http://doi.org/10.5281/zenodo.17096527

Highlights.

  • Proteins with RRMs and IDRs are defined by inter-domain attractions and repulsions

  • Interplay of attractions and repulsions yields 25–40 nm protein-specific microphases

  • MALAT1 binds preferentially to microphases by releasing positively charged proteins

  • Microphases form clusters in cells and micron-scale core-shell emulsions in mixtures

ACKNOWLEDGMENTS

We thank Matthew King for advice on the cloning of SRSF1; Harshani Pathirana for advice on purification of TDP-43; Jared Lalmansingh for help with image analysis; Yiyang Chen for discussions of single-molecule experiments; Michael Vahey for the use of the spinning-disk confocal microscope; Yifan Dai and Michael Chen for help with the ANEPPS measurements; and Gaurav Chauhan, Souradeep Ghosh, Artem Rumyantsev, and Andrea Soranno for discussions of microphase separation. This project was supported by funds from the US National Institutes of Health through grants K99GM152778 to M.K.S., R01NS121114 to R.V.P., R35GM124858 to M.D.L., T32EB019944 to Y.J.S., and R01GM132458 to K.V.P.; the St. Jude Children’s Research Hospital research collaborative on the biology and biophysics of RNP granules to R.V.P.; the US Air Force Office of Scientific Research (grant FA9550-20-1-0241) to R.V.P.; the US National Science Foundation (grant MCB-2227268 to R.V.P. and Center for Quantitative Cell Biology to K.V.P.); the Cancer Center at Illinois seed grant to K.V.P.; and ARPA-H to K.V.P. QF-DEEM images were obtained using a JEOL JEM-1400 120kV transmission electron microscope in the Washington University Center for Cellular Imaging (WUCCI), which is supported by the Washington University School of Medicine, the Children’s Discovery Institute of Washington University and St. Louis Children’s Hospital (CDI-CORE-2015-505 and CDI-CORE-2019-813), and the Foundation for Barnes-Jewish Hospital (3770 and 4642).

Footnotes

RESOURCE AVAILABILITY

Lead contact

Requests for further information and resources should be directed to and will be fulfilled by the lead contact, Rohit V. Pappu (pappu@wustl.edu).

Materials availability

Plasmids generated in this study are available from the lead contact upon request.

Data and code availability

The collection of all QF-DEEM images obtained for each of the constructs, computational data, and code used for analyses is available via http://doi.org/10.5281/zenodo.17096527. The trajectory and PDB file data for the construction of interaction matrices are available upon request from the lead contact.

DECLARATION OF INTERESTS

R.V.P. is a member of the scientific advisory board of and a shareholder in Dewpoint Therapeutics.

REFERENCES

  • 1.Biggiogera M, and Fakan S (1998). Fine structural specific visualization of RNA on ultrathin sections. J. Histochem. Cytochem 46, 389–395. 10.1177/002215549804600313. [DOI] [PubMed] [Google Scholar]
  • 2.Galganski L, Urbanek MO, and Krzyzosiak WJ (2017). Nuclear speckles: molecular organization, biological function and role in disease. Nucleic Acids Res. 45, 10350–10368. 10.1093/nar/gkx759. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Spector DL, and Lamond AI (2011). Nuclear speckles. Cold Spring Harb. Perspect. Biol 3, a000646. 10.1101/cshperspect.a000646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Chen Y, and Belmont AS (2019). Genome organization around nuclear speckles. Curr. Opin. Genet. Dev 55, 91–99. 10.1016/j.gde.2019.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Hall LL, Smith KP, Byron M, and Lawrence JB (2006). Molecular anatomy of a speckle. Anat. Rec. A Discov. Mol. Cell. Evol. Biol 288, 664–675. 10.1002/ar.a.20336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Bhat P, Chow A, Emert B, Ettlin O, Quinodoz SA, Strehle M, Takei Y, Burr A, Goronzy IN, Chen AW, et al. (2024). Genome organization around nuclear speckles drives mRNA splicing efficiency. Nature 629, 1165–1173. 10.1038/s41586-024-07429-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Chaturvedi P, and Belmont AS (2024). Nuclear speckle biology: At the cross-roads of discovery and functional analysis. Curr. Opin. Cell Biol 91, 102438. 10.1016/j.ceb.2024.102438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Smith KP, Hall LL, and Lawrence JB (2020). Nuclear hubs built on RNAs and clustered organization of the genome. Curr. Opin. Cell Biol 64, 67–76. 10.1016/j.ceb.2020.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Mintz PJ, Patterson SD, Neuwald AF, Spahr CS, and Spector DL (1999). Purification and biochemical characterization of interchromatin granule clusters. EMBO J. 18, 4308–4320. 10.1093/emboj/18.15.4308. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Saitoh N, Spahr CS, Patterson SD, Bubulya P, Neuwald AF, and Spector DL (2004). Proteomic analysis of interchromatin granule clusters. Mol. Biol. Cell 15, 3876–3890. 10.1091/mbc.e04-03-0253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Rappsilber J, Ryder U, Lamond AI, and Mann M (2002). Large-scale proteomic analysis of the human spliceosome. Genome Res. 12, 1231–1245. 10.1101/gr.473902. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wahl MC, Will CL, and Lührmann R (2009). The spliceosome: design principles of a dynamic RNP machine. Cell 136, 701–718. 10.1016/j.cell.2009.02.009. [DOI] [PubMed] [Google Scholar]
  • 13.Zhou Z, Licklider LJ, Gygi SP, and Reed R (2002). Comprehensive proteomic analysis of the human spliceosome. Nature 419, 182–185. 10.1038/nature01031. [DOI] [PubMed] [Google Scholar]
  • 14.Barutcu AR, Wu M, Braunschweig U, Dyakov BJA, Luo Z, Turner KM, Durbic T, Lin ZY, Weatheritt RJ, Maass PG, et al. (2022). Systematic mapping of nuclear domain-associated transcripts reveals speckles and lamina as hubs of functionally distinct retained introns. Mol. Cell 82, 1035–1052.e9. 10.1016/j.molcel.2021.12.010. [DOI] [PubMed] [Google Scholar]
  • 15.Wu J, Xiao Y, Liu Y, Wen L, Jin C, Liu S, Paul S, He C, Regev O, and Fei J (2024). Dynamics of RNA localization to nuclear speckles are connected to splicing efficiency. Sci. Adv 10, eadp7727. 10.1126/sciadv.adp7727. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Bogolyubova I, Bogolyubov D, and Parfenov V (2009). Localization of poly(A)(+) RNA and mRNA export factors in interchromatin granule clusters of two-cell mouse embryos. Cell Tissue Res. 338, 271–281. 10.1007/s00441-009-0860-4. [DOI] [PubMed] [Google Scholar]
  • 17.Banani SF, Lee HO, Hyman AA, and Rosen MK (2017). Biomolecular condensates: organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol 18, 285–298. 10.1038/nrm.2017.7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Martin EW, Holehouse AS, Peran I, Farag M, Incicco JJ, Bremer A, Grace CR, Soranno A, Pappu RV, and Mittag T (2020). Valence and patterning of aromatic residues determine the phase behavior of prion-like domains. Science 367, 694–699. 10.1126/science.aaw8653. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Brangwynne CP, Tompa P, and Pappu RV (2015). Polymer physics of intracellular phase transitions. Nat. Phys 11, 899–904. 10.1038/nphys3532. [DOI] [Google Scholar]
  • 20.Berry J, Weber SC, Vaidya N, Haataja M, and Brangwynne CP (2015). RNA transcription modulates phase transition-driven nuclear body assembly. Proc. Natl. Acad. Sci. USA 112, E5237–E5245. 10.1073/pnas.1509317112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Zwicker D, Seyboldt R, Weber CA, Hyman AA, and Jülicher F (2017). Growth and division of active droplets provides a model for protocells. Nat. Phys 13, 408–413. 10.1038/nphys3984. [DOI] [Google Scholar]
  • 22.Wurtz JD, and Lee CF (2018). Chemical-Reaction-Controlled Phase Separated Drops: Formation, Size Selection, and Coarsening. Phys. Rev. Lett 120, 078102. 10.1103/PhysRevLett.120.078102. [DOI] [PubMed] [Google Scholar]
  • 23.Folkmann AW, Putnam A, Lee CF, and Seydoux G (2021). Regulation of biomolecular condensates by interfacial protein clusters. Science 373, 1218–1224. 10.1126/science.abg7071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Rosowski KA, Sai T, Vidal-Henriquez E, Zwicker D, Style RW, and Dufresne ER (2020). Elastic ripening and inhibition of liquid–liquid phase separation. Nat. Phys 16, 422–425. 10.1038/s41567-019-0767-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Lee DSW, Choi C-H, Sanders DW, Beckers L, Riback JA, Brangwynne CP, and Wingreen NS (2023). Size distributions of intracellular condensates reflect competition between coalescence and nucleation. Nat. Phys 19, 586–596. 10.1038/s41567-022-01917-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Ranganathan S, and Shakhnovich EI (2020). Dynamic metastable long-living droplets formed by sticker-spacer proteins. eLife 9, e56159. 10.7554/eLife.56159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Fei J, Jadaliha M, Harmon TS, Li ITS, Hua B, Hao Q, Holehouse AS, Reyer M, Sun Q, Freier SM, et al. (2017). Quantitative analysis of multilayer organization of proteins and RNA in nuclear speckles at super resolution. J. Cell Sci 130, 4180–4192. 10.1242/jcs.206854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Ahn EY, DeKelver RC, Lo MC, Nguyen TA, Matsuura S, Boya-pati A, Pandit S, Fu XD, and Zhang DE (2011). SON controls cell-cycle progression by coordinated regulation of RNA splicing. Mol. Cell 42, 185–198. 10.1016/j.molcel.2011.03.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Sharma A, Takata H, Shibahara K, Bubulya A, and Bubulya PA (2010). Son is essential for nuclear speckle organization and cell cycle progression. Mol. Biol. Cell 21, 650–663. 10.1091/mbc.e09-02-0126. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ilık IA, Malszycki M, Lübke AK, Schade C, Meierhofer D, and Aktas T (2020). SON and SRRM2 are essential for nuclear speckle formation. eLife 9, e60579. 10.7554/eLife.60579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Dopie J, Sweredoski MJ, Moradian A, and Belmont AS (2020). Tyramide signal amplification mass spectrometry (TSA-MS) ratio identifies nuclear speckle proteins. J. Cell Biol 219, e201910207. 10.1083/jcb.201910207. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Zhang M, Gu Z, Guo S, Sun Y, Ma S, Yang S, Guo J, Fang C, Shu L, Ge Y, et al. (2024). SRRM2 phase separation drives assembly of nuclear speckle subcompartments. Cell Rep. 43, 113827. 10.1016/j.celrep.2024.113827. [DOI] [PubMed] [Google Scholar]
  • 33.Paul S, Arias MA, Wen L, Liao SE, Zhang J, Wang X, Regev O, and Fei J (2024). RNA molecules display distinctive organization at nuclear speckles. iScience 27, 109603. 10.1016/j.isci.2024.109603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Latham AP, Zhu L, Sharon DA, Ye S, Willard AP, Zhang X, and Zhang B (2025). Microphase Separation Produces Interfacial Environment within Diblock Biomolecular Condensates. eLife 12, RP90750. 10.7554/elife.90750.3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Remini L, Segers M, Palmeri J, Walter J-C, Parmeggiani A, and Carlon E (2024). Chromatin structure from high resolution microscopy: Scaling laws and microphase separation. Phys. Rev. E 109, 024408. 10.1103/PhysRevE.109.024408. [DOI] [PubMed] [Google Scholar]
  • 36.Bates FS, and Fredrickson GH (1990). Block Copolymer Thermodynamics: Theory and Experiment. Annu. Rev. Phys. Chem 41, 525–557. 10.1146/annurev.pc.41.100190.002521. [DOI] [PubMed] [Google Scholar]
  • 37.Adame-Arana O, Bajpai G, Lorber D, Volk T, and Safran S (2023). Regulation of chromatin microphase separation by binding of protein complexes. eLife 12, e82983. 10.7554/eLife.82983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Panagiotopoulos AZ (2024). Solvent Selectivity Controls Micro-Versus Macrophase Separation in Multiblock Chains. Macromolecules 57, 8253–8261. 10.1021/acs.macromol.4c01214. [DOI] [Google Scholar]
  • 39.Hilbert L, Sato Y, Kuznetsova K, Bianucci T, Kimura H, Jülicher F, Honigmann A, Zaburdaev V, and Vastenhouw NL (2021). Transcription organizes euchromatin via microphase separation. Nat. Commun 12, 1360. 10.1038/s41467-021-21589-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Bates FS (1991). Polymer-Polymer Phase Behavior. Science 251, 898–905. 10.1126/science.251.4996.898. [DOI] [PubMed] [Google Scholar]
  • 41.Leibler L (1980). Theory of Microphase Separation in Block Copolymers. Macromolecules 13, 1602–1617. 10.1021/ma60078a047. [DOI] [Google Scholar]
  • 42.Hassouneh W, Zhulina EB, Chilkoti A, and Rubinstein M (2015). Elastin-like Polypeptide Diblock Copolymers Self-Assemble into Weak Micelles. Macromolecules 48, 4183–4195. 10.1021/acs.macromol.5b00431. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Navarro LA, Ryan JJ, Dzuricky M, Gradzielski M, Chilkoti A, and Zauscher S (2021). Microphase Separation of Resilin-like and Elastin-like Diblock Copolypeptides in Concentrated Solutions. Bio-macromolecules 22, 3827–3838. 10.1021/acs.biomac.1c00672. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Bates CM, and Bates FS (2017). 50th Anniversary Perspective: Block Polymers—Pure Potential. Macromolecules 50, 3–22. 10.1021/acs.macromol.6b02355. [DOI] [Google Scholar]
  • 45.Song YJ, Shinn MK, Bangru S, Wang Y, Sun Q, Hao Q, Chaturvedi P, Freier SM, Perez-Pinera P, Nelson ER, et al. (2024). Chromatin-associated lncRNA-splicing factor condensates regulate hypoxia responsive RNA processing of genes pre-positioned near nuclear speckles. Preprint at bioRxiv. 10.1101/2024.10.31.621310. [DOI] [Google Scholar]
  • 46.Ushakova AS, Govorun EN, and Khokhlov AR (2006). Globules of amphiphilic macromolecules. J. Phys.: Condens. Matter 18, 915–930. 10.1088/0953-8984/18/3/010. [DOI] [Google Scholar]
  • 47.Cohan MC, Shinn MK, Lalmansingh JM, and Pappu RV (2022). Uncovering Non-random Binary Patterns Within Sequences of Intrinsically Disordered Proteins. J. Mol. Biol 434, 167373. 10.1016/j.jmb.2021.167373. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Patil A, Strom AR, Paulo JA, Collings CK, Ruff KM, Shinn MK, Sankar A, Cervantes KS, Wauer T, St Laurent JD, et al. (2023). A disordered region controls cBAF activity via condensation and partner recruitment. Cell 186, 4936–4955.e26. 10.1016/j.cell.2023.08.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.King MR, Ruff KM, Lin AZ, Pant A, Farag M, Lalmansingh JM, Wu T, Fossat MJ, Ouyang W, Lew MD, et al. (2024). Macromolecular condensation organizes nucleolar sub-phases to set up a pH gradient. Cell 187, 1889–1906.e24. 10.1016/j.cell.2024.02.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Sprunger ML, Shinn MK, Talir SK, Lee K, Pappu RV, and Jackrel ME (2025). Matrin-3 forms spherical and wormlike assemblies that are modulated by RNA binding and ALS/FTD-associated mutations. Mol. Cell 85, 3640–3660.e9. 10.1016/j.molcel.2025.08.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Martin EW, Thomasen FE, Milkovic NM, Cuneo MJ, Grace CR, Nourse A, Lindorff-Larsen K, and Mittag T (2021). Interplay of folded domains and the disordered low-complexity domain in mediating hnRNPA1 phase separation. Nucleic Acids Res. 49, 2931–2945. 10.1093/nar/gkab063. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Molliex A, Temirov J, Lee J, Coughlin M, Kanagaraj AP, Kim HJ, Mittag T, and Taylor JP (2015). Phase separation by low complexity domains promotes stress granule assembly and drives pathological fibrillization. Cell 163, 123–133. 10.1016/j.cell.2015.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Phair RD, and Misteli T (2000). High mobility of proteins in the mammalian cell nucleus. Nature 404, 604–609. 10.1038/35007077. [DOI] [PubMed] [Google Scholar]
  • 54.Tripathi V, Song DY, Zong X, Shevtsov SP, Hearn S, Fu XD, Dundr M, and Prasanth KV (2012). SRSF1 regulates the assembly of pre-mRNA processing factors in nuclear speckles. Mol. Biol. Cell 23, 3694–3706. 10.1091/mbc.E12-03-0206. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Kratochví1 P (1987). Classical Light Scattering from Polymer Solutions (Elsevier; ). [Google Scholar]
  • 56.Piñeiro L, Novo M, and Al-Soufi W (2015). Fluorescence emission of pyrene in surfactant solutions. Adv. Colloid Interface Sci 215, 1–12. 10.1016/j.cis.2014.10.010. [DOI] [PubMed] [Google Scholar]
  • 57.Severs NJ (2007). Freeze-fracture electron microscopy. Nat. Protoc 2, 547–576. 10.1038/nprot.2007.55. [DOI] [PubMed] [Google Scholar]
  • 58.Heuser JE (2011). The origins and evolution of freeze-etch electron microscopy. J. Electron Microsc. (Tokyo) 60, S3–S29. 10.1093/jmicro/dfr044. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Pisárcik M, Devínsky F, and Pupák M (2015). Determination of micelle aggregation numbers of alkyltrimethylammonium bromide and sodium dodecyl sulfate surfactants using time-resolved fluorescence quenching. Open Chem. 13, 922–931. 10.1515/chem-2015-0103. [DOI] [Google Scholar]
  • 60.Vitalis A, and Pappu RV (2009). ABSINTH: A new continuum solvation model for simulations of polypeptides in aqueous solutions. J. Comput. Chem 30, 673–699. 10.1002/jcc.21005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Radhakrishnan A, Vitalis A, Mao AH, Steffen AT, and Pappu RV (2012). Improved atomistic Monte Carlo simulations demonstrate that poly-L-proline adopts heterogeneous ensembles of conformations of semi-rigid segments interrupted by kinks. J. Phys. Chem. B 116, 6862–6871. 10.1021/jp212637r. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Angelini TE, Liang H, Wriggers W, and Wong GCL (2003). Like-charge attraction between polyelectrolytes induced by counterion charge density waves. Proc. Natl. Acad. Sci. USA 100, 8634–8637. 10.1073/pnas.1533355100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Sweatman MB, Fartaria R, and Lue L (2014). Cluster formation in fluids with competing short-range and long-range interactions. J. Chem. Phys 140, 124508. 10.1063/1.4869109. [DOI] [PubMed] [Google Scholar]
  • 64.Sweatman MB, and Lue L (2019). The Giant SALR Cluster Fluid: A Review. Adv. Theor. Simul 2, 1900025. 10.1002/adts.201900025. [DOI] [Google Scholar]
  • 65.Hoffmann C, Ruff KM, Edu IA, Shinn MK, Tromm JV, King MR, Pant A, Ausserwöger H, Morgan JR, Knowles TPJ, et al. (2025). Synapsin condensation is governed by sequence-encoded molecular grammars. J. Mol. Biol 437, 168987. 10.1016/j.jmb.2025.168987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Asamoah OK, Wuskell JP, Loew LM, and Bezanilla F (2003). A fluorometric approach to local electric field measurements in a voltage-gated ion channel. Neuron 37, 85–97. 10.1016/s0896-6273(02)01126-1. [DOI] [PubMed] [Google Scholar]
  • 67.Moerner WE (1997). Those blinking single molecules. Science 277, 1059–1060. 10.1126/science.277.5329.1059. [DOI] [Google Scholar]
  • 68.Manley S, Gillette JM, Patterson GH, Shroff H, Hess HF, Betzig E, and Lippincott-Schwartz J (2008). High-density mapping of single-molecule trajectories with photoactivated localization microscopy. Nat. Methods 5, 155–157. 10.1038/nmeth.1176. [DOI] [PubMed] [Google Scholar]
  • 69.Alberti S, Saha S, Woodruff JB, Franzmann TM, Wang J, and Hyman AA (2018). A User’s Guide for Phase Separation Assays with Purified Proteins. J. Mol. Biol 430, 4806–4820. 10.1016/j.jmb.2018.06.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Boke E, Ruer M, Wühr M, Coughlin M, Lemaitre R, Gygi SP, Alberti S, Drechsel D, Hyman AA, and Mitchison TJ (2016). Amyloid-like Self-Assembly of a Cellular Compartment. Cell 166, 637–650. 10.1016/j.cell.2016.06.051. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Brudar S, and Hribar-Lee B (2023). The Effect of Arginine on the Phase Stability of Aqueous Hen Egg-White Lysozyme Solutions. Int. J. Mol. Sci 24, 1197. 10.3390/ijms24021197. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Fargason T, De Silva NIU, Powell E, Zhang Z, Paul T, Shariq J, Zaharias S, and Zhang J (2023). Peptides that Mimic RS repeats modulate phase separation of SRSF1, revealing a reliance on combined stacking and electrostatic interactions. eLife 12, e84412. 10.7554/eLife.84412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Sheth SR, and Leckband D (1997). Measurements of attractive forces between proteins and end-grafted poly(ethylene glycol) chains. Proc. Natl. Acad. Sci. USA 94, 8399–8404. 10.1073/pnas.94.16.8399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Feric M, and Brangwynne CP (2013). A nuclear F-actin scaffold stabilizes ribonucleoprotein droplets against gravity in large cells. Nat. Cell Biol 15, 1253–1259. 10.1038/ncb2830. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Cáceres JF, Misteli T, Screaton GR, Spector DL, and Krainer AR (1997). Role of the modular domains of SR proteins in subnuclear localization and alternative splicing specificity. J. Cell Biol 138, 225–238. 10.1083/jcb.138.2.225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Howard JM, and Sanford JR (2015). The RNAissance family: SR proteins as multifaceted regulators of gene expression. Wiley Interdiscip. Rev. RNA 6, 93–110. 10.1002/wrna.1260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Nagaraj N, Wisniewski JR, Geiger T, Cox J, Kircher M, Kelso J, Pääbo S, and Mann M (2011). Deep proteome and transcriptome mapping of a human cancer cell line. Mol. Syst. Biol 7, 548. 10.1038/msb.2011.81. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Zhao Y, Li B, Chen X, Zhou Y, Song T, and Shi W (2023). Liquid–Liquid Phase Separation at Double Emulsion Interfaces: Equilibrium Structures and Dynamic Pathways. Macromolecules 56, 8834–8844. 10.1021/acs.macromol.3c01157. [DOI] [Google Scholar]
  • 79.Bremer A, Farag M, Borcherds WM, Peran I, Martin EW, Pappu RV, and Mittag T (2022). Deciphering how naturally occurring sequence features impact the phase behaviours of disordered prion-like domains. Nat. Chem 14, 196–207. 10.1038/s41557-021-00840-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Wu T, King MR, Qiu Y, Farag M, Pappu RV, and Lew MD (2025). Single-fluorogen imaging reveals distinct environmental and structural features of biomolecular condensates. Nat. Phys 21, 778–786. 10.1038/s41567-025-02827-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Brangwynne CP, Mitchison TJ, and Hyman AA (2011). Active liquid-like behavior of nucleoli determines their size and shape in Xenopus laevis oocytes. Proc. Natl. Acad. Sci. USA 108, 4334–4339. 10.1073/pnas.1017150108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Feric M, Vaidya N, Harmon TS, Mitrea DM, Zhu L, Richardson TM, Kriwacki RW, Pappu RV, and Brangwynne CP (2016). Coexisting Liquid Phases Underlie Nucleolar Subcompartments. Cell 165, 1686–1697. 10.1016/j.cell.2016.04.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Das T, Zaidi FK, Farag M, Ruff KM, Mahendran TS, Singh A, Gui X, Messing J, Taylor JP, Banerjee PR, et al. (2025). Tunable metastability of condensates reconciles their dual roles in amyloid fibril formation. Mol. Cell 85, 2230–2245.e7. 10.1016/j.mol-cel.2025.05.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Casafont I, Bengoechea R, Tapia O, Berciano MT, and Lafarga M (2009). TDP-43 localizes in mRNA transcription and processing sites in mammalian neurons. J. Struct. Biol 167, 235–241. 10.1016/j.jsb.2009.06.006. [DOI] [PubMed] [Google Scholar]
  • 85.Hutten S, Usluer S, Bourgeois B, Simonetti F, Odeh HM, Fare CM, Czuppa M, Hruska-Plochan M, Hofweber M, Polymenidou M, et al. (2020). Nuclear Import Receptors Directly Bind to Arginine-Rich Dipeptide Repeat Proteins and Suppress Their Pathological Interactions. Cell Rep. 33, 108538. 10.1016/j.celrep.2020.108538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Wang J, Choi J-M, Holehouse AS, Lee HO, Zhang X, Jahnel M, Maharana S, Lemaitre R, Pozniakovsky A, Drechsel D, et al. (2018). A Molecular Grammar Governing the Driving Forces for Phase Separation of Prion-like RNA Binding Proteins. Cell 174, 688–699.e16. 10.1016/j.cell.2018.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Conicella AE, Zerze GH, Mittal J, and Fawzi NL (2016). ALS Mutations Disrupt Phase Separation Mediated by alpha-Helical Structure in the TDP-43 Low-Complexity C-Terminal Domain. Structure 24, 1537–1549. 10.1016/j.str.2016.07.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Fuhrhop J-H, and Wang T (2004). Bolaamphiphiles. Chem. Rev 104, 2901–2937. 10.1021/cr030602b. [DOI] [PubMed] [Google Scholar]
  • 89.Fang Y-S, Tsai K-J, Chang Y-J, Kao P, Woods R, Kuo P-H, Wu C-C, Liao J-Y, Chou S-C, Lin V, et al. (2014). Full-length TDP-43 forms toxic amyloid oligomers that are present in frontotemporal lobar dementia-TDP patients. Nat. Commun 5, 4824. 10.1038/ncomms5824. [DOI] [PubMed] [Google Scholar]
  • 90.Tripathi V, Ellis JD, Shen Z, Song DY, Pan Q, Watt AT, Freier SM, Bennett CF, Sharma A, Bubulya PA, et al. (2010). The Nuclear-Retained Noncoding RNA MALAT1 Regulates Alternative Splicing by Modulating SR Splicing Factor Phosphorylation. Mol. Cell 39, 925–938. 10.1016/j.molcel.2010.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Malakar P, Shilo A, Mogilevsky A, Stein I, Pikarsky E, Nevo Y, Benyamini H, Elgavish S, Zong X, Prasanth KV, et al. (2017). Long Noncoding RNA MALAT1 Promotes Hepatocellular Carcinoma Development by SRSF1 Upregulation and mTOR Activation. Cancer Res. 77, 1155–1167. 10.1158/0008-5472.Can-16-1508. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Jadaliha M, Zong X, Malakar P, Ray T, Singh DK, Freier SM, Jensen T, Prasanth SG, Karni R, Ray PS, et al. (2016). Functional and prognostic significance of long non-coding RNA MALAT1 as a metastasis driver in ER negative lymph node negative breast cancer. Oncotarget 7, 40418–40436. 10.18632/oncotarget.9622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Gutschner T, Hä mmerle M, and Diederichs S (2013). MALAT1 — a paradigm for long noncoding RNA function in cancer. J. Mol. Med. (Berl) 91, 791–801. 10.1007/s00109-013-1028-y. [DOI] [PubMed] [Google Scholar]
  • 94.Hutchinson JN, Ensminger AW, Clemson CM, Lynch CR, Lawrence JB, and Chess A (2007). A screen for nuclear transcripts identifies two linked noncoding RNAs associated with SC35 splicing domains. BMC Genomics 8, 39. 10.1186/1471-2164-8-39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Engreitz JM, Sirokman K, McDonel P, Shishkin AA, Surka C, Russell P, Grossman SR, Chow AY, Guttman M, and Lander ES (2014). RNA-RNA interactions enable specific targeting of noncoding RNAs to nascent Pre-mRNAs and chromatin sites. Cell 159, 188–199. 10.1016/j.cell.2014.08.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.West JA, Davis CP, Sunwoo H, Simon MD, Sadreyev RI, Wang PI, Tolstorukov MY, and Kingston RE (2014). The long noncoding RNAs NEAT1 and MALAT1 bind active chromatin sites. Mol. Cell 55, 791–802. 10.1016/j.molcel.2014.07.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Wilusz JE, Freier SM, and Spector DL (2008). 3′ end processing of a long nuclear-retained noncoding RNA yields a tRNA-like cytoplasmic RNA. Cell 135, 919–932. 10.1016/j.cell.2008.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Wilusz JE, JnBaptiste CK, Lu LY, Kuhn CD, Joshua-Tor L, and Sharp PA (2012). A triple helix stabilizes the 3′ ends of long noncoding RNAs that lack poly(A) tails. Genes Dev. 26, 2392–2407. 10.1101/gad.204438.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 99.McCown PJ, Wang MC, Jaeger L, and Brown JA (2019). Secondary Structural Model of Human MALAT1 Reveals Multiple Structure-Function Relationships. Int. J. Mol. Sci 20, 5610. 10.3390/ijms20225610. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Borodavka A, Singaram SW, Stockley PG, Gelbart WM, Ben-Shaul A, and Tuma R (2016). Sizes of Long RNA Molecules Are Determined by the Branching Patterns of Their Secondary Structures. Biophys. J 111, 2077–2085. 10.1016/j.bpj.2016.10.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Lai WC, Kayedkhordeh M, Cornell EV, Farah E, Bellaousov S, Rietmeijer R, Salsi E, Mathews DH, and Ermolenko DN (2018). mRNAs and lncRNAs intrinsically form secondary structures with short end-to-end distances. Nat. Commun 9, 4328. 10.1038/s41467-018-06792-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Wyman J, and Gill SJ (1980). Ligand-linked phase changes in a biological system: applications to sickle cell hemoglobin. Proc. Natl. Acad. Sci. USA 77, 5239–5242. 10.1073/pnas.77.9.5239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Ruff KM, Dar F, and Pappu RV (2021). Polyphasic linkage and the impact of ligand binding on the regulation of biomolecular condensates. Biophys. Rev 2, 021302. 10.1063/5.0050059. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Ruff KM, Dar F, and Pappu RV (2021). Ligand effects on phase separation of multivalent macromolecules. Proc. Natl. Acad. Sci. USA 118, e2017184118. 10.1073/pnas.2017184118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Wang X, Juan L, Lv J, Wang K, Sanford JR, and Liu Y (2011). Predicting sequence and structural specificities of RNA binding regions recognized by splicing factor SRSF1. BMC Genomics 12, S8. 10.1186/1471-2164-12-S5-S8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Banerjee PR, Milin AN, Moosa MM, Onuchic PL, and Deniz AA (2017). Reentrant Phase Transition Drives Dynamic Substructure Formation in Ribonucleoprotein Droplets. Angew. Chem. Int. Ed 56, 11354–11359. 10.1002/anie.201703191. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Henninger JE, Oksuz O, Shrinivas K, Sagi I, LeRoy G, Zheng MM, Andrews JO, Zamudio AV, Lazaris C, Hannett NM, et al. (2021). RNA-Mediated Feedback Control of Transcriptional Condensates. Cell 184, 207–225.e24. 10.1016/j.cell.2020.11.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Yan X, Kuster D, Mohanty P, Nijssen J, Pombo-García K, Garcia Morato J, Rizuan A, Franzmann TM, Sergeeva A, Ly AM, et al. (2025). Intra-condensate demixing of TDP-43 inside stress granules generates pathological aggregates. Cell 188, 4123–4140.e18. 10.1016/j.cell.2025.04.039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 109.Wang Z, Yang C, Guan D, Li J, and Zhang H (2023). Cellular proteins act as surfactants to control the interfacial behavior and function of biological condensates. Dev. Cell 58, 919–932.e5. 10.1016/j.devcel.2023.04.004. [DOI] [PubMed] [Google Scholar]
  • 110.Shinn MK, and Pappu RV (2023). Soaping up transcriptional condensates. Dev. Cell 58, 915–916. 10.1016/j.devcel.2023.05.002. [DOI] [PubMed] [Google Scholar]
  • 111.Ryu J-K, Bouchoux C, Liu HW, Kim E, Minamino M, de Groot R, Katan AJ, Bonato A, Marenduzzo D, Michieletto D, et al. (2021). Bridging-induced phase separation induced by cohesin SMC protein complexes. Sci. Adv 7, eabe5905. 10.1126/sciadv.abe5905. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Yamazaki T, Yamamoto T, Yoshino H, Souquere S, Nakagawa S, Pierron G, and Hirose T (2021). Paraspeckles are constructed as block copolymer micelles. EMBO J. 40, e107270. 10.15252/embj.2020107270. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Yao R-W, Xu G, Wang Y, Shan L, Luan P-F, Wang Y, Wu M, Yang L-Z, Xing Y-H, Yang L, et al. (2019). Nascent Pre-rRNA Sorting via Phase Separation Drives the Assembly of Dense Fibrillar Components in the Human Nucleolus. Mol. Cell 76, 767–783.e11. 10.1016/j.molcel.2019.08.014. [DOI] [PubMed] [Google Scholar]
  • 114.Ghaemi Z, Peterson JR, Gruebele M, and Luthey-Schulten Z (2020). An in-silico human cell model reveals the influence of spatial organization on RNA splicing. PLoS Comput. Biol 16, e1007717. 10.1371/journal.pcbi.1007717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Guo YE, Manteiga JC, Henninger JE, Sabari BR, Dall’Agnese A, Hannett NM, Spille J-H, Afeyan LK, Zamudio AV, Shrinivas K, et al. (2019). Pol II phosphorylation regulates a switch between transcriptional and splicing condensates. Nature 572, 543–548. 10.1038/s41586-019-1464-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Greig JA, Nguyen TA, Lee M, Holehouse AS, Posey AE, Pappu RV, and Jedd G (2020). Arginine-Enriched Mixed-Charge Domains Provide Cohesion for Nuclear Speckle Condensation. Mol. Cell 77, 1237–1250.e4. 10.1016/j.molcel.2020.01.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Ghosh G, and Adams JA (2011). Phosphorylation mechanism and structure of serine-arginine protein kinases. FEBS J. 278, 587–597. 10.1111/j.1742-4658.2010.07992.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Wu M, Xu G, Han C, Luan P-F, Xing Y-H, Nan F, Yang L-Z, Huang Y, Yang Z-H, Shan L, et al. (2021). lncRNA SLERT controls phase separation of FC/DFCs to facilitate Pol I transcription. Science 373, 547–555. 10.1126/science.abf6582. [DOI] [PubMed] [Google Scholar]
  • 119.Wu W, Yan Z, Nguyen TC, Bouman Chen Z, Chien S, and Zhong S (2019). Mapping RNA-chromatin interactions by sequencing with iM-ARGI. Nat. Protoc 14, 3243–3272. 10.1038/s41596-019-0229-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, et al. (2021). Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589. 10.1038/s41586-021-03819-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Grese ZR, Bastos AC, Mamede LD, French RL, Miller TM, and Ayala YM (2021). Specific RNA interactions promote TDP-43 multivalent phase separation and maintain liquid properties. EMBO Rep. 22, e53632. 10.15252/embr.202153632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Schindelin J, Arganda-Carreras I, Frise E, Kaynig V, Longair M, Pietzsch T, Preibisch S, Rueden C, Saalfeld S, Schmid B, et al. (2012). Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682. 10.1038/nmeth.2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Holehouse AS, Das RK, Ahad JN, Richardson MOG, and Pappu RV (2017). CIDER: Resources to Analyze Sequence-Ensemble Relationships of Intrinsically Disordered Proteins. Biophys. J 112, 16–21. 10.1016/j.bpj.2016.11.3200. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 124.Lalmansingh JM, Keeley AT, Ruff KM, Pappu RV, and Holehouse AS (2023). SOURSOP: A Python Package for the Analysis of Simulations of Intrinsically Disordered Proteins. J. Chem. Theor. Comput 19, 5609–5620. 10.1021/acs.jctc.3c00190. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, Morris JH, and Ferrin TE (2021). UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci. 30, 70–82. 10.1002/pro.3943. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 126.Gasteiger E, Gattiker A, Hoogland C, Ivanyi I, Appel RD, and Bairoch A (2003). ExPASy: The proteomics server for in-depth protein knowledge and analysis. Nucleic Acids Res. 31, 3784–3788. 10.1093/nar/gkg563. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Kuchel PW, Cox CD, Daners D, Shishmarev D, and Galvosas P (2021). Surface model of the human red blood cell simulating changes in membrane curvature under strain. Sci. Rep 11, 13712. 10.1038/s41598-021-92699-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Gibson DG, Young L, Chuang R-Y, Venter JC, Hutchison CA, and Smith HO (2009). Enzymatic assembly of DNA molecules up to several hundred kilobases. Nat. Methods 6, 343–345. 10.1038/nmeth.1318. [DOI] [PubMed] [Google Scholar]
  • 129.Philo JS (2023). SEDNTERP: a calculation and database utility to aid interpretation of analytical ultracentrifugation and light scattering data. Eur. Biophys. J 52, 233–266. 10.1007/s00249-023-01629-0. [DOI] [PubMed] [Google Scholar]
  • 130.Posey AE, Ruff KM, Harmon TS, Crick SL, Li A, Diamond MI, and Pappu RV (2018). Profilin reduces aggregation and phase separation of huntingtin N-terminal fragments by preferentially binding to soluble monomers and oligomers. J. Biol. Chem 293, 3734–3746. 10.1074/jbc.RA117.000357. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Petsev DN, Wu X, Galkin O, and Vekilov PG (2003). Thermodynamic Functions of Concentrated Protein Solutions from Phase Equilibria. J. Phys. Chem. B 107, 3921–3926. 10.1021/jp0278317. [DOI] [Google Scholar]
  • 132.Sedlák M, and Amis EJ (1992). Concentration and molecular weight regime diagram of salt-free polyelectrolyte solutions as studied by light scattering. J. Chem. Phys 96, 826–834. 10.1063/1.462468. [DOI] [Google Scholar]
  • 133.Mazidi H, Lu J, Nehorai A, and Lew MD (2018). Minimizing Structural Bias in Single-Molecule Super-Resolution Microscopy. Sci. Rep 8, 13133. 10.1038/s41598-018-31366-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Varadi M, Anyango S, Deshpande M, Nair S, Natassia C, Yordanova G, Yuan D, Stroe O, Wood G, Laydon A, et al. (2022). Alpha-Fold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models. Nucleic Acids Res. 50, D439–D444. 10.1093/nar/gkab1061. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 135.Varadi M, Bertoni D, Magana P, Paramval U, Pidruchna I, Radhakrishnan M, Tsenkov M, Nair S, Mirdita M, Yeo J, et al. (2024). Alpha-Fold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Res. 52, D368–D375. 10.1093/nar/gkad1011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 136.Kyte J, and Doolittle RF (1982). A simple method for displaying the hydropathic character of a protein. J. Mol. Biol 157, 105–132. 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
  • 137.Schneider CA, Rasband WS, and Eliceiri KW (2012). NIH Image to ImageJ: 25 years of image analysis. Nat. Methods 9, 671–675. 10.1038/nmeth.2089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Mao AH, and Pappu RV (2012). Crystal lattice properties fully determine short-range interaction parameters for alkali and halide ions. J. Chem. Phys 137, 064104. 10.1063/1.4742068. [DOI] [PubMed] [Google Scholar]
  • 139.Mittal A, Holehouse AS, Cohan MC, and Pappu RV (2018). Sequence-to-Conformation Relationships of Disordered Regions Tethered to Folded Domains of Proteins. J. Mol. Biol 430, 2403–2421. 10.1016/j.jmb.2018.05.012. [DOI] [PubMed] [Google Scholar]
  • 140.Lyle N, Das RK, and Pappu RV (2013). A quantitative measure for protein conformational heterogeneity. J. Chem. Phys 139, 121907. 10.1063/1.4812791. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 141.Meng W, Lyle N, Luan B, Raleigh DP, and Pappu RV (2013). Experiments and simulations show how long-range contacts can form in expanded unfolded proteins with negligible secondary structure. Proc. Natl. Acad. Sci. USA 110, 2123–2128. 10.1073/pnas.1216979110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 142.Mittal A, Lyle N, Harmon TS, and Pappu RV (2014). Hamiltonian Switch Metropolis Monte Carlo Simulations for Improved Conformational Sampling of Intrinsically Disordered Regions Tethered to Ordered Domains of Proteins. J. Chem. Theor. Comput 10, 3550–3562. 10.1021/ct5002297. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143.McGibbon RT, Beauchamp KA, Harrigan MP, Klein C, Swails JM, Hernández CX, Schwantes CR, Wang LP, Lane TJ, and Pande VS (2015). MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories. Biophys. J 109, 1528–1532. 10.1016/j.bpj.2015.08.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 144.Tran HT, Wang X, and Pappu RV (2005). Reconciling observations of sequence-specific conformational propensities with the generic polymeric behavior of denatured proteins. Biochemistry 44, 11369–11380. 10.1021/bi050196l. [DOI] [PubMed] [Google Scholar]
  • 145.Peran I, Holehouse AS, Carrico IS, Pappu RV, Bilsel O, and Raleigh DP (2019). Unfolded states under folding conditions accommodate sequence-specific conformational preferences with random coillike dimensions. Proc. Natl. Acad. Sci. USA 116, 12301–12310. 10.1073/pnas.1818206116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 146.Coudert E, Gehant S, de Castro E, Pozzato M, Baratin D, Neto T, Sigrist CJA, Redaschi N, and Bridge A; UniProt Consortium (2023). Annotation of biologically relevant ligands in UniProtKB using ChEBI. Bioinformatics 39, btac793. 10.1093/bioinformatics/btac793. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 147.Zarin T, Strome B, Peng G, Pritšsanac I, Forman-Kay JD, and Moses AM (2021). Identifying molecular features that are associated with biological function of intrinsically disordered protein regions. eLife 10, e60220. 10.7554/eLife.60220. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Video S1. Simulation movie for SRSF1, related to Figure 2J. Movie from a representative atomistic simulation of 24 SRSF1 molecules within a spherical boundary of 30 nm in diameter over 8000 frames at an interval of 100 frames. The domains are colored according to the scheme used in Figure 2J.
Download video file (4.5MB, mp4)
Video S2. Simulation movie for SRSF1Δ213–247, related to Figure 2N. Movie from a representative atomistic simulation of 48 SRSF1Δ213-247 molecules within a spherical boundary of 40 nm in diameter over 8000 frames at an interval of 100 frames. The domains are colored according to the scheme used in Figure 2N.
Download video file (3.8MB, mp4)
Video S3. Single-molecule tracking of slow-moving SRSF1 molecules, related to Figure 3G. Blue circles indicate localized SRSF1 molecules in each 20 ms frame, while blue lines represent single-molecule trajectories across frames. Older trajectories gradually fade over time. Raw frames containing slow dynamics (displacements less than 50 nm per 20 ms) are concatenated together. Empty frames are omitted.
Download video file (5.8MB, mp4)
Video S4. Single-molecule tracking of fast-moving SRSF1 molecules, related to Figure 3G. Orange circles indicate localized SRSF1 molecules in each 20 ms frame, while orange lines represent single-molecule trajectories across frames. Older trajectories gradually fade over time. Raw frames containing fast dynamics (displacements over 50 nm per 20 ms) are concatenated together. Empty frames are omitted.
Download video file (7.7MB, mp4)
Video S5. Simulation movie for TDP-43, related to Figure 6I. Movie from a representative atomistic simulation of 12 TDP-43 molecules within a spherical boundary of 30 nm in diameter over 8000 frames at an interval of 100 frames. The domains are colored according to the scheme used in Figure 6I.
Download video file (3.9MB, mp4)
Supplementary Table S1: Ratio of absorbance at 260 nm (A260) to absorbance at 280 nm (A280). Related to Figure S7F.
7

RESOURCES