Abstract
Hyperphosphorylation of the C-terminal domain (CTD) of the RPB1 subunit of human RNA polymerase (Pol) II is essential for transcriptional elongation and mRNA processing1–3. The CTD contains 52 heptapeptide repeats of the consensus sequence YSPTSPS. The highly repetitive nature and abundant possible phosphorylation sites of the CTD exert special constraints on the kinases that catalyse its hyperphosphorylation. Positive transcription elongation factor b (P-TEFb)—which consists of CDK9 and cyclin T1—is known to hyperphosphorylate the CTD and negative elongation factors to stimulate Pol II elongation1,4,5. The sequence determinant on P-TEFb that facilitates this action is currently unknown. Here we identify a histidine-rich domain in cyclin T1 that promotes the hyperphosphorylation of the CTD and stimulation of transcription by CDK9. The histidine-rich domain markedly enhances the binding of P-TEFb to the CTD and functional engagement with target genes in cells. In addition to cyclin T1, at least one other kinase—DYRK1A6—also uses a histidine-rich domain to target and hyperphosphorylate the CTD. As a low-complexity domain, the histidine-rich domain also promotes the formation of phase-separated liquid droplets in vitro, and the localization of P-TEFb to nuclear speckles that display dynamic liquid properties and are sensitive to the disruption of weak hydrophobic interactions. The CTD—which in isolation does not phase separate, despite being a low-complexity domain—is trapped within the cyclin T1 droplets, and this process is enhanced upon pre-phosphorylation by CDK7 of transcription initiation factor TFIIH1–3. By using multivalent interactions to create a phase-separated functional compartment, the histidine-rich domain in kinases targets the CTD into this environment to ensure hyperphosphorylation and efficient elongation of Pol II.
Among all transcription-related cyclins, cyclin (CYC) T—which includes T1 (CYCT1) and T2—has the longest C-terminal regions. In CYCT1, the N-terminal region is structured and contains the cyclin-box repeats required for binding and activating CDK9, whereas the C-terminal region has only a few isolated motifs and is mostly unstructured (Fig. 1a).
To determine whether the CYCT1 C-terminal region is important for regulating CDK9 activity, we performed in vitro kinase reactions to examine affinity-purified CDK9–CYCT1–Flag heterodimers that contained various truncated forms of CYCT1 with progressively shortened C termini to phosphorylate the CTD (Fig. 1a and Extended Data Fig. 1a). A mixture of glutathione S-transferase (GST)–CTD52 (RPB1 CTD containing all 52 repeats) and GST–CTD9 (CTD containing 9 consensus repeats) was used in all reactions as the kinase substrates.
The various CYCT1 truncations did not substantially affect the ability of the associated CDK9 to autophosphorylate by producing the ATP-dependent mobility shift (Fig. 1a), nor did the truncations decrease the phosphorylation of CTD9. However, upon truncation to a position at and beyond 533, CDK9 became largely unable to produce the hyper-phosphorylated CTD52 (hereafter, IIo) as revealed by the anti-phospho-Ser5 antibody 3E8 (Fig. 1a). A similar pattern was also detected with the anti-phospho-Ser2 antibody 3E10 (Extended Data Fig. 1b). Thus, a region around position 533 in CYCT1 promoted the hyperphosphorylation by CDK9 of CTD52.
To determine whether the CYCT1 C-terminal truncations affect the transcriptional activity of P-TEFb, we used the Gal4-tethering system to test fusion proteins containing the Gal4 DNA-binding domain attached to the various truncated forms of CYCT1 to activate luciferase expression from the HIV-1 promoter containing Gal4-binding sites (Fig. 1b). Correlating with the kinase results, progressive CYCT1 C-terminal truncations to and beyond position 533 markedly reduced P-TEFb transcriptional activity (Fig. 1b).
The coiled-coil domain (CCD, amino acids 379–430) and the histidine-rich domain (HRD, amino acids 480–550) exist around position 533 (Fig. 1a). In kinase reactions, only the deletion of HRD (ΔHRD) but not CCD (ΔCCD) blocked the hyperphosphorylation by CDK9 of CTD52 (Fig. 1c). Consistently, HRD but not CCD was required for P-TEFb transcriptional activity in the Gal4-tethering assay (Fig. 1d). Dependence on HRD for Tat/P-TEFb-activated HIV-1 elongation has previously been proposed7, and here confirmed by our finding that wild-type CYCT1 but not CYCT1ΔHRD could effectively rescue the RNAi-knockdown of endogenous CYCT1 expression to produce long (promoter-distal) but not short (promoter-proximal) viral transcripts (Fig. 1e).
The importance of the HRD for the transcriptional activity of P-TEFb was generalized to cellular genes, as CYCT1ΔHRD produced significantly less mRNA from four representative immediate early genes—FOS, JUNB, MYC and EGR1—as well as HSP70–1 (also known as HSPA1A) under both basal and heat-shock conditions (Extended Data Fig. 2), all of which require P-TEFb for optimal transcription. Co-immunoprecipitation analysis (Fig. 1f and Extended Data Fig. 2e) shows that the decreased activity of CYCT1ΔHRD was not due to any substantial change in binding to the major P-TEFb partners, including BRD4 and subunits of the super elongation complex and the 7SK small nuclear ribonucleoprotein particle that regulate P-TEFb activity1,4,5.
What is the mechanism by which the HRD promotes the activity of P-TEFb? The first hint came from the finding that wild-type CYCT1 in HeLa nuclei was more resistant to salt extraction, as compared to CYCT1ΔHRD (Fig. 2a), which suggests that the HRD promoted the retention of CYCT1 in the nucleus. To determine more precisely the location and dynamics of this retention, we performed fluorescence recovery after photobleaching (FRAP) analysis of bindings of Halo-tagged wild-type CYCT1 and CYCT1 HRD to a gene array activated by reverse tetracycline-controlled transactivator (Fig. 2b). This array contains about 200 copies of an integrated transgene marked by YFP–Lac repressor bound to the lac operator, which enables FRAP to be performed at this spot8. We observed significantly faster FRAP recovery for CYCT1ΔHRD on the array than for wild-type CYCT1 (Fig. 2c). Further quantitative analysis (Extended Data Fig. 3a, b and Supplementary Information) revealed that this was mainly due to longer apparent residence time of wild-type CYCT1 (τoff ~ 56 s), as compared to CYCT1ΔHRD (τoff ~ 4 s).
To generalize the FRAP finding to endogenous genes (active and inactive) and cross-validate it using an orthogonal technique, we performed single-particle tracking in U2OS cells of Halo-tagged wild-type CYCT1 and CYCT1ΔHRD proteins at 95 Hz to determine whether the HRD affects CYCT1 diffusion dynamics and bound fractions9,10 (Fig. 2d, Extended Data Fig. 3c–e, Supplementary Video 1 and Supplementary Information). Consistent with the HRD-facilitated association of CYCT1 with chromatin and the transcriptional machinery, CYCT1ΔHRD diffused faster and showed a smaller bound fraction (8.8%) than did wild-type CYCT1 (13.0%). Next, we conducted single-particle tracking using a longer exposure time (500 ms; Supplementary Video 2) to blur out fast-diffusing molecules and focus on bound molecules that were presumably located at predominantly activated genes. We observed a wide distribution of CYCT1 binding events that probably encompassed both specific and non-specific interactions (Fig. 2e). Because this distribution did not fit well with models that assumed one or two single rate-limiting steps, we could not reliably attribute a single residence time. Nevertheless, consistent with the FRAP result, wild-type CYCT1 showed binding events that were significantly more stable than those of CYCT1ΔHRD (Fig. 2e). Together, the single-particle tracking and FRAP results underscore the key role of the HRD in promoting the binding of P-TEFb to activated genes to phosphorylate CTD52.
What could be the direct target of the HRD on activated genes? Consistent with a previous report that a CYCT1 mutant that lacked the HRD failed to bind the CTD7, wild-type GST–CYCT1 precipitated more Pol II from HeLa nuclear extracts than did GST–CYCT1ΔHRD (Fig. 2f). Moreover, GST–CTD pulled down P-TEFb containing wild-type CYCT1, but not CYCT1ΔHRD (Fig. 2g). Finally, a direct and HRD-dependent interaction was detected between a recombinant CYCT1 C-terminal fragment and mCherry–CTD in both the GST pulldown and glycerol gradient formats (Extended Data Fig. 8a, b). Thus, P-TEFb uses the HRD to directly target the CTD.
In addition to P-TEFb, CDK7– CYCH in TFIIH is also a CTD kinase for Pol II to clear the promoter during transcription initiation1. Unlike CYCT1, CYCH lacks a long C-terminal region and HRD. Compared to CDK9–CYCT1–Flag, affinity-purified CDK7–CYCH–Flag produced a markedly lower level of CTD52 IIo in kinase reactions (Fig. 3a), which was confirmed with recombinant CAK (CDK7–CYCH–MAT1) and P-TEFb (Millipore) in a time-and dosage-dependent manner (Extended Data Fig. 4a, b).
Appending a CYCT1 C-terminal fragment (amino acids 401–630) containing the HRD to CYCH markedly increased hyperphosphorylation by CDK7 of CTD52 (Fig. 3b), and partially rescued the CYCT1 knockdown to support HIV-1 transcription (Fig. 3c). However, when the HRD was deleted from the CYCH +CYCT1 chimaera, both the production of CTD52 IIo and the rescue of CYCT1 knockdown were mostly abolished (Fig. 3b, c), which indicates the importance of the HRD to these processes. Consistently, the chimaera, but not wild-type CYCH, was precipitated by GST–CTD (Fig. 3d).
We next investigated whether other CTD kinases also use an HRD to target Pol II for hyperphosphorylation. We noticed that the kinase DYRK1A, a CTD kinase that controls transcription of selected growth-related genes6 and that is associated with Down syndrome, also contains an HRD (Fig. 3e). The rest of DYRK1A and CYCT1 are non-homologous. In kinase reactions, wild-type DYRK1A and DYRK1AΔHRD autophosphorylated to a similar extent, but only wild-type DYRK1A efficiently hyperphosphorylated CTD52 (Fig. 3e). In a co-immunoprecipitation assay employing three different anti-RPB1 antibodies, wild-type DYRK1A precipitated more Pol II than did DYRK1AΔHRD (Fig. 3f and Extended Data Fig. 4c). The precipitation of DCAF7, which binds DYRK1A N terminus11, was unaffected by DYRK1AΔHRD. Thus, in addition to CYCT1, at least one other kinase also requires a functional HRD to efficiently bind and hyper-phosphorylate CTD52.
The HRD is a low-complexity domain owing to the overrepresentation of only a single amino acid. This domain, including a central cluster of multiple consecutive histidines, is highly conserved in vertebrate CYCT1 (Extended Data Fig. 5a). Using the prediction program IUPred12 (http://iupred.enzim.hu), we found that the human CYCT1 HRD displays the highest disorder tendency in a broader intrinsically disordered region (IDR) that lacks well-defined structure13 and overlaps with the CYCT1 C-terminal region (Extended Data Fig. 5b). Recently, the IDRs—especially those containing a low-complexity domain—have been shown to promote liquid–liquid phase separation, which probably drives the formation of intracellular membrane-less organelles14 for compartmentalized biochemical reactions13.
In vitro, phase separation is reversible13 and influenced by several parameters (for example, temperature, ionic strength, post-translational modifications and so on), which enable liquid droplets to form upon reaching a certain threshold and to quickly dissemble when pushed in the opposite direction15. To determine whether T1-IDR (amino acids 462–654)—the longest, HRD-containing CYCT1 IDR—could phase separate in an HRD-dependent manner, we purified the GFP–T1-IDR and GFP–T1-IDRΔHRD fusions from Escherichia coli (Extended Data Fig. 5c). At 150 mM NaCl, the protein solutions remained translucent. When lowered to 37.5 mM, a typical concentration used to induce phase separation in vitro16,17, the wild-type GFP–T1-IDR solution immediately turned opaque, whereas GFP–T1-IDRΔHRD showed no change (Extended Data Fig. 5e).
Under a microscope, wild-type GFP–T1-IDR spontaneously formed micrometre-sized, spherical droplets, whereas GFP–T1-IDRΔHRD produced only a low level of irregular aggregates (Fig. 4a). Similarly, the GFP fusion that contained the longest IDR (amino acids 491–686) of DYRK1A also formed droplets in an HRD-dependent manner (Extended Data Fig. 6). Notably, the GFP–T1-IDR droplets quickly disappeared once NaCl returned to 150 mM (Extended Data Fig. 5f). Furthermore, 1,6-hexanediol, a compound that is known to perturb weak hydrophobic interactions to disassemble structures that exhibit liquid-like properties18,19, completely blocked droplet formation (Extended Data Fig. 5g). Finally, when nine histidines within the histidine cluster in CYCT1 HRD were changed to alanines (Extended Data Fig. 7a), the resulting GFP–T1-IDR(9A) mutant formed only tiny droplets (Fig. 4b), which indicates that the histidines are essential for phase separation. Functionally, the mutation decreased phosphorylation by P-TEFb of CTD52, and the transcriptional activity of P-TEFb (Extended Data Fig. 7c, d).
In cells, both ectopically expressed CYCT1–Flag or eGFP–CYCT1 and endogenous CYCT1 displayed a punctuated staining pattern inside the nuclei (Fig. 4c–e and Extended Data Fig. 5h). This has previously been attributed to the localization of P-TEFb to the nuclear speckles enriched with transcription and splicing factors20,21. Importantly, HRD and its histidine cluster were needed to target CYCT1 to the speckles (Fig. 4c and Extended Data Figs. 5h, 7e), and the same has also been reported for DYRK1A22. Mirroring its disruption of the GFP–T1-IDR droplets in vitro, 1,6-hexanediol also quickly dissembled the CYCT1 nuclear speckles (Fig. 4d). Finally, when performing time-lapse phase-contrast imaging of cells expressing eGFP–CYCT1, we observed multiple cells in which CYCT1 speckles displayed at least one fusion event within the period of a few minutes, demonstrating their dynamic and liquid-like properties (Fig. 4e).
Following previous findings23,24, we asked whether the HRDcontaining IDR could recruit the CTD into phase-separated droplets, given their demonstrated direct interaction. Indeed, although the CTD itself is a low-complexity sequence, recombinant mCherry–CTD23 (Extended Data Fig. 5d) alone did not phase separate, but it was readily incorporated into droplets when incubated together with GFP–T1-IDR (Fig. 4f).
During transcription, the CTD is phosphorylated first by CAK in TFIIH and then by P-TEFb1,4,5. Pre-phosphorylation by CAK (Extended Data Fig. 8c) not only enhanced the incorporation of mCherry–CTD into the GFP–T1-IDR droplets, but also promoted phase separation overall by producing bigger and brighter droplets (Fig. 4f). Consistently, the pre-phosphorylated GST–CTD precipitated more GFP–T1-IDR (Extended Data Fig. 8d). Underscoring the physiological relevance of these observations, hyperphosphorylated Pol IIo is known to preferentially localize in the nuclear speckles25.
Although both CTD hyperphosphorylation by CDK9 and HRD-mediated droplet formation were largely inhibited by 2.5% 1,6-hexanediol, the HRD–CTD binding (and CDK9–CYCT1 interaction) was not substantially inhibited until 1,6-hexanedil was at 10% (Fig. 4g–i). This important difference suggests that phase separation, which is caused by weak, multivalent interactions among the HRDs and is easily disrupted by 1,6-hexanediol, is critical for the hyperphosphorylation of the CTD. On the other hand, the relatively drug-resistant HRD–CTD binding is probably key to the recruitment of the CTD to CDK9 in droplets, but is insufficient to establish the optimal environment for hyperphosphorylation (Extended Data Fig. 9).
In summary, our data indicate that in at least two kinases—CYCT1 of P-TEFb and DYRK1A—the HRD, a low-complexity domain of previously unknown function, promotes the hyperphosphorylation of the CTD by targeting the CTD as well as inducing phase separation in vitro and in cells. The phase-separated droplets and speckles compartmentalize the kinase and substrate to enable highly efficient reactions, which results in the hyperphosphorylation of the CTD and robust transcriptional elongation and RNA processing.
FUS and TAF15, which are two proteins that contain low-complexity domain and that are active in transcription initiation, have previously been shown to form phase-separated hydrogels that trap the CTD23,24. Additionally, the bidirectionally transcribed enhancers and resulting antisense transcripts have been proposed to control initiation in part through phase separation26. What was unknown until now is whether any transcription factors and the CTD are involved in droplet formation after initiation1. Our finding that phase separation is induced by CYCT1 of P-TEFb (a well-defined transcription elongation factor) and DYRK1A (a probable gene-specific elongation factor6) has expanded the regulatory roles of phase separation to the next stage of the transcription cycle. Furthermore, these studies show that some key ini\tiation and elongation factors that phase separate should no longer be viewed as passive passengers waiting to be picked up by the CTD. Rather, they have active roles in recruiting Pol II through multivalent interactions to their droplets and/or speckles that function as hubs where much of transcription and RNA processing is dynamically controlled.
Extended Data
Supplementary Material
Acknowledgements
We thank S. McKnight, M. Geyer, J. Hurley and their colleagues for providing the various expression plasmids, and U. Schulze-Gahmen for technical help. This work was supported by the National Institutes of Health grant R01AI041757 to Q.Z. and the California Institute of Regenerative Medicine grant LA1–08013 to X.D.
Footnotes
Competing interests The authors declare no competing interests.
Reviewer information Nature thanks J. Lis, D. Taatjes and the other anonymous reviewer(s) for their contribution to the peer review of this work.
Data availability
Uncropped scans of all western blots are provided in Supplementary Figure 1. The raw slowSPT and spaSPT data are freely available in Spot-On readable CSV and Matlab formats in the form of single-molecule trajectories at Zenodo (https://zenodo.org/record/1215836). The Spot-On Matlab code is available, together with a step-by-step guide, at Gitlab (https://gitlab.com/tjian-darzacq-lab/spot-on-matlab). All other data are available from the corresponding author on reasonable request.
Online content
Any Methods, including any statements of data availability and Nature Research reporting summaries, along with any additional references and Source Data files, are available in the online version of the paper at https://doi.org/10.1038/s41586-018-0174-3.
Additional information
Extended data is available for this paper at https://doi.org/10.1038/s41586-018-0174-3.
Supplementary information is available for this paper at https://doi.org/10.1038/s41586-018-0174-3.
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Harlen KM & Churchman LS The code and beyond: transcription regulation by the RNA polymerase II carboxy-terminal domain. Nat.source Rev. Mol. Cell Biol 18, 263–273 (2017). [DOI] [PubMed] [Google Scholar]
- 2.Eick D & Geyer M The RNA polymerase II carboxy-terminal domain (CTD) code. Chem. Rev 113, 8456–8490 (2013). [DOI] [PubMed] [Google Scholar]
- 3.Zaborowska J, Eglo S & Murphy S The pol II CTD: new twists in the tail. Nat. Struct. Mol. Biol 23, 771–777 (2016). [DOI] [PubMed] [Google Scholar]
- 4.Zhou Q, Li T & Price DH RNA polymerase II elongation control. Annu. Rev. Biochem 81, 119–143 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Kwak H & Lis JT Control of transcriptional elongation. Annu. Rev. Genet 47, 483–508 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Di Vona C et al. Chromatin-wide profling of DYRK1A reveals a role as a gene-specifc RNA polymerase II CTD kinase. Mol. Cell 57, 506–520 (2015). [DOI] [PubMed] [Google Scholar]
- 7.Taube R, Lin X, Irwin D, Fujinaga K & Peterlin BM Interaction between P-TEFb and the C-terminal domain of RNA polymerase II activates transcriptional elongation from sites upstream or downstream of target genes. Mol. Cell. Biol 22, 321–331 (2002). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Darzacq X et al. In vivo dynamics of RNA polymerase II transcription. Nat. Struct. Mol. Biol 14, 796–806 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Hansen AS, Pustova I, Cattoglio C, Tjian R & Darzacq X CTCF and cohesion regulate chromatin loop stability with distinct dynamics. eLife 6, e25776 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hansen AS et al. Robust model-based analysis of single-particle tracking experiments with Spot-On. eLife 7, e33125 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Xiang J et al. DYRK1A regulates Hap1–Dcaf7/WDR68 binding with implication for delayed growth in Down syndrome. Proc. Natl Acad. Sci. USA 114, E1224–E1233 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Dosztányi Z, Csizmok V, Tompa P & Simon I IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. Bioinformatics 21, 3433–3434 (2005). [DOI] [PubMed] [Google Scholar]
- 13.Mitrea DM & Kriwacki RW Phase separation in biology; functional organization of a higher order. Cell Commun. Signal 14, 1 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Courchaine EM, Lu A & Neugebauer KM Droplet organelles? EMBO J 35, 1603–1612 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Banani SF, Lee HO, Hyman AA & Rosen MK Biomolecular condensates: organizers of cellular biochemistry. Nat. Rev. Mol. Cell Biol 18, 285–298 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Lin Y, Protter DS, Rosen MK & Parker R Formation and maturation of phase-separated liquid droplets by RNA-binding proteins. Mol. Cell 60, 208–219 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Ying Y et al. Splicing activation by Rbfox requires self-aggregation through its tyrosine-rich domain. Cell 170, 312–323.e310 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Strom AR et al. Phase separation drives heterochromatin domain formation. Nature 547, 241–245 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Molliex A et al. Phase separation by low complexity domains promotes stress granule assembly and drives pathological fbrillization. Cell 163, 123–133 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Herrmann CH & Mancini MA The Cdk9 and cyclin T subunits of TAK/P-TEFb localize to splicing factor-rich nuclear speckle regions. J. Cell Sci 114, 1491–1503 (2001). [DOI] [PubMed] [Google Scholar]
- 21.Marcello A et al. Recruitment of human cyclin T1 to nuclear bodies through direct interaction with the PML protein. EMBO J 22, 2156–2166 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Salichs E, Ledda A, Mularoni L, Albà MM & de la Luna S Genome-wide analysis of histidine repeats reveals their role in the localization of human proteins to the nuclear speckles compartment. PLoS Genet 5, e1000397 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kato M et al. Cell-free formation of RNA granules: low complexity sequence domains form dynamic fbers within hydrogels. Cell 149, 753–767 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kwon I et al. Phosphorylation-regulated binding of RNA polymerase II to fbrous polymers of low-complexity domains. Cell 155, 1049–1060 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Mortillaro MJ et al. A hyperphosphorylated form of the large subunit of RNA polymerase II is associated with splicing complexes and the nuclear matrix. Proc. Natl Acad. Sci. USA 93, 8253–8257 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hnisz D, Shrinivas K, Young RA, Chakraborty AK & Sharp PA A phase separation model for transcriptional control. Cell 169, 13–23 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.