Abstract
The post-translational regulation of protein function is involved in most cellular processes. As such, synthetic biology tools that operate at this level provide opportunities for manipulating cellular states. Here, we deploy a proximity-triggered protein trans-splicing technology to enable the time-resolved synthesis of target proteins from pre-made parts. The modularity of the strategy allows for the addition or removal of various control elements as a function of the splicing reaction, in the process permitting the cellular location and/or activity state of starting materials and products to be differentiated. The approach is applied to a diverse set of proteins, including the kinase oncofusions BCR-ABL and DNAJ-PKAc where dynamic cellular phosphorylation events are dissected, revealing distinct phases of signaling and identifying molecular players connecting the oncofusion to cancer transformation as novel therapeutic targets of cancer cells. We envision that the tools and control strategies developed herein will allow the activity of both naturally occurring and designer proteins to be harnessed for basic and applied research.
Introduction
Alterations in cellular phenotypes are driven by dynamic changes in the activity of proteins involved in signal transduction and gene regulation processes1,2. Understanding how such changes propagate through associated molecular pathways requires tools that operate on relevant timescales, which can often be in the minutes to hours range3,4. Inducible gene expression approaches such as Cre-Lox and Tet-On/Off systems may not have the temporal resolution to address changes that occur quickly5,6, and while small molecule probes can allow for extremely rapid and specific perturbation (typically inhibition) of a system3,7, the availability of such tool compounds is limited by the druggability of specific targets8. This issue of generalizability extends to strategies based on the incorporation of unnatural photocaged amino acids9,10 which, while offering exquisite temporal control, require that the protein in question have the requisite structure-activity characteristics needed for such an approach11. Thus, there remains a need for broadly applicable control tools that allow seamless post-translational manipulation of protein structure and function, and that are flexible with respect to the type of control elements one might want to deploy in each context.
In principle, conditional protein trans-splicing (CPS) systems offer an attractive route to the post-translational manipulation of protein function12–17. These technologies harness split inteins, which upon complementation, support the ligation of flanking polypeptides (‘exteins’) and in the process remove themselves from the final splice product (Extended Data Fig. 1a). Recent studies into the way intein fragments associate and fold have led to the design of a CPS approach based on the use of genetically ‘caged’ split inteins18,19. In this system, partial sequences from the complementary intein fragments are appended via linkers such that the native N- and C-terminal split inteins (IntN and IntC, respectively) become reversibly trapped in an intramolecular folded state that blocks complementation. Protein trans-splicing is induced upon removal of the extensions by proteolysis18, or by bringing the caged split inteins into proximity19. The latter scenario involves a domain swapping rearrangement within the heterodimeric complex, a process that triggers protein splicing (Extended Data Fig. 1b). Critically, the final splice product carries no extraneous protein domains associated with the splicing apparatus, a feature that can, in principle, be leveraged to install transient functional control elements within the starting materials.
In the present study, we incorporate caged split inteins and various control elements into devices that allow post-translational assembly of intracellular proteins (Fig. 1a and Supplementary Table 1, 2). We show that this strategy permits both the cellular localization and catalytic activity of target proteins to be altered in response to a small molecule trigger. In the case of oncogenic fusion proteins, we demonstrate that this allows dynamic cellular phosphorylation events to be observed, revealing distinct phases of signaling.
Fig. 1 |. CAGE modules regulate the subcellular location of splice products via control elements.

a, Schematic of the post-translational activation of a target protein via proximity-triggered CPS using CAGE modules that tightly control the inactive split constructs by appending control elements. b, Schematic of the subcellular translocation of mVenus (green) after proximity-triggered CPS with CAGEC-RFP (red). The control elements fused to the CAGE modules or exteins regulate the subcellular locations of the starting constructs and the splice product. c, d, e, f, Left – immunoblots of the mVenus-RFP splice product in co-transfected HEK293T cells used for live cell imaging (right). Right – live cell fluorescence imaging of mVenus and RFP localization in HEK293T cells co-expressing mVenus-CAGEN and CAGEC-RFP, treated with either DMSO or rapalog (100 nM) for 10 hours. mVenus (green) and RFP (red) were visualized by their intrinsic fluorescence, and Hoechst 33342 (blue) was used as a nuclear marker. Scale bars, 10 μm. ‘Caax’ in (c), (f) and ‘LMNA’ in (d) were fused to the C-terminus of mVenus-CAGEN, and ‘NLS’ in (e), (f) was fused to the C-terminus of CAGEC-RFP. Immunoblots and live cell fluorescence images shown in (c-f) are representative of three independent biological repeats. CAGEN refers to ‘NrdJ-1NCage-FKBP’, and CAGEC refers to ‘FRB-NrdJ-1CCage’ in (c-f).
Results
CAGE regulates splice products by altering localization
Two different caged split inteins were integrated into our system, each responsive to a non-toxic version of rapamycin (rapalog) via an induced proximity mechanism (Extended Data Fig. 1c and Supplementary Table 1, 2). In addition to the previously described conditional version of the Npu split intein19 (Extended Data Fig. 1d), which requires a cysteine residue at the splice junction to function, we also incorporated an inducible version of the NrdJ-1 split intein into the reaction manifold. The latter requires a serine at the splice junction and was thus expected to increase the range of targets amenable to the approach. A rapalog-inducible CPS system based on the NrdJ-1 split intein was developed following structure-guided optimization of the appended caging sequences on the IntN and IntC fragments (Extended Data Fig. 2a–e and Supplementary Table 3). Briefly, the responsiveness of this rapalog-inducible CPS system was tuned by manipulating the caging strength through charge-swap mutations in key salt bridges between the appended caging unit and the NrdJ-1 split intein. Hereafter, we refer to the caged split inteins and associated FKBP and FRB heterodimerization domains as CAGEN and CAGEC. In addition to allowing rapalog-dependent induction of protein trans-splicing in mammalian cells, the NrdJ-1 CAGE system was also promiscuous with respect to extein sequences around the splice junction (Extended Data Fig. 2f, g), a property expected to expand the number of potential insertion sites of the CAGE system in target proteins.
With two complementary CAGE systems in hand, we next asked whether the intrinsic bond making, and bond breaking features of protein trans-splicing could be harnessed to manipulate cellular protein localization. To test this, model fluorescent proteins mVenus and RFP were fused to CAGEN and CAGEC, respectively, along with a series of cellular localization elements whose position within the constructs was expected to result in a change in cellular localization of mVenus upon splicing (Fig. 1b). Consistent with our designs, rapalog treatment of HEK293T cells co-expressing cognate pairs of CAGE constructs led to a splicing-dependent redistribution of mVenus signal from either the plasma membrane, nuclear envelope, or cytosol (Fig. 1c–e and Supplementary Fig. 1, 2). Notably, protein trans-splicing was strongly reduced when the two CAGE constructs were compartmentalized in the cell such that rapalog-induced proximity was prevented (Fig. 1f and Supplementary Fig. 1).
Encouraged by the results with the model fluorescent proteins, we next applied the CAGE system to control the assembly and cellular localization of more challenging protein targets, namely the oncofusion transcription factors, AML1-ETO20, MLL-AF921 and NUP98-HOXA922. These fusion proteins result from chromosomal translocations and are associated with various types of leukemia23,24. In each case, CAGE constructs were designed to allow the assembly and nuclear localization of the proteins from pre-made fragments in response to rapalog treatment (Fig. 2a and Supplementary Table 1, 2). Notably, use of the NrdJ-1 CAGE system permitted the native amino acid sequences at the fusion junctions to be used in the traceless assembly of the oncofusion proteins, whereas the Npu CAGE system required the incorporation of a 2–3 amino acid ‘scar’ to facilitate the slicing reaction (Extended Data Fig. 3a–h and Extended Data Fig. 4a, g). Gratifyingly, we observed dose and time-dependent generation of all three oncofusions upon rapalog treatment of cells expressing the relevant NrdJ-1 and Npu CAGE pairs (Fig. 2b, Extended Data Fig. 3f–h, and Extended Data Fig. 4b–d, h–j). Moreover, as per our designs, we observed nuclear localization of all three oncofusion proteins due to removal of the nuclear export sequences present within the starting materials (Fig. 2c, Extended Data Fig. 3i, and Extended Data Fig. 4e, f, k, l). In the case of the AML1-ETO generated by a NrdJ-1 CAGE system, a series of genomic and proteomics studies were performed to confirm that the splice product binds to known genomic targets of the oncofusion as shown by ChIP-qPCR25–27 (Fig. 2d, Extended Data Fig. 5a, and Supplementary Table 5 and Supplementary Data 1), and associates with known nuclear protein partners linked to transcriptional regulation25,28,29 (Fig. 2e, f, Extended Data Fig. 5b–d, and Supplementary Data 2). Collectively, these data indicate that the CAGE system can be used to control the assembly and cellular localization of target proteins including large oncofusion proteins associated with dysregulation of gene expression in cancer.
Fig. 2 |. Conditional generation of oncofusion transcription factors.

a, Schematic showing use of control elements to achieve differential subcellular compartmentalization of A-CAGEN and CAGEC-B constructs and the A/B splice product. Oncofusion transcription factors AML1-ETO, MLL-AF9, and NUP98-HOXA9 are generated by proximity-triggered CPS with this strategy. b, Immunoblots of splice products (SP, HA-tagged) generated in HEK293T cells co-expressing CAGEN (Myc-tag) and CAGEC (HA-tag) constructs. Cells were treated with DMSO or rapalog (100 nM) for indicated time-points. Left – AML1-ETO. Middle – MLL-AF9. Right – NUP98-HOXA9. ‘anti-HA (dark)’ indicates longer exposure to the immunoblots against anti-HA bands. c, Immunoblots of the HA-tagged AML1-ETO splice product generated in HEK293T cells co-expressing AML1-CAGEN (Myc-tag) and CAGEC-ETO (HA-tag) treated with rapalog (100 nM) for 18 hours followed by a subcellular fraction to separate cytosolic and nuclear proteins. Tubulin and histone H4 serve as cytosolic and nuclear markers, respectively. d, Genomic binding of HA-tagged AML1-ETO splice product. Stable HEK293T cells expressing AML1-CAGEN and CAGEC-ETO constructs were treated with DMSO or rapalog (100 nM) for 18 hours. ChIP-qPCR was then used to amplify chromatin derived from immunoprecipitations with anti-HA antibody (n=6 independent experiments, mean and s.d.). Statistical significance was determined by two-way student’s t-tests: *P < 0.05; **P < 0.005; ***P < 0.001. e, Protein binding to the AML1-ETO splice product determined by anti-HA co-immunoprecipitation. Stable HEK293T cells expressing AML1-CAGEN and CAGEC-ETO constructs were treated with DMSO or rapalog (100 nM) for 18 hours. Nuclei were isolated and an anti-HA co-immunoprecipitation performed. Proteins known to interact with the native AML1-ETO fusion were probed using the indicted antibodies. H4 serves as a negative control. f, Quantitative proteomics analysis of co-enriched proteins following anti-HA immunoprecipitation from nuclei isolated from stable HEK293T cells expressing AML1-CAGEN and CAGEC-ETO constructs and treated with DMSO or rapalog. The green box in the waterfall plot indicates the top 20% of enrichment following the induction of PTS. All data points represent the mean ratios (n=2, biological replicate runs). Data in (b), (c), and (e) are representative of n=3 independent experiments. In panels (b-f), CAGEN refers to ‘NrdJ-1NCage-FKBP-NES’, and CAGEC refers to ‘NES-FRB-NrdJ-1CCage’.
CAGE activates BCR-ABL upon protein trans-splicing
Next, we asked whether our approach could be used to control the activity of an enzyme. To explore this, we turned to the oncofusion protein, BCR-ABL, a constitutively active protein tyrosine kinase that drives chronic myeloid leukemia (CML) and that has served as a paradigm for the development of targeted cancer therapy30. This oncofusion results from a translocation between chromosomes 9 and 21, also known as the Philadelphia chromosome31, resulting in part of the breakpoint cluster region (BCR) protein being linked to the Abelson 1 (ABL1) protein tyrosine kinase. Critically, the fusion results in the loss of a key N-terminal autoregulatory region within ABL1, resulting in de-repression of its kinase activity and, ultimately, mitogenesis32. The nature of BCR-ABL activation presented an interesting challenge for extending the scope of our CAGE system since simply splitting the oncoprotein at the fusion junction (as was done in the previous examples) would not eliminate aberrant kinase activity. As a potential solution to this problem, we conceived an intramolecular, allosteric control strategy in which the CAGE apparatus is effectively nested between the ABL1 autoregulatory region and the remainder of the protein (Fig. 3a, c and Extended Data Fig. 6a). We anticipated that this would keep the kinase in the “off-state” until the splicing reaction, which would simultaneously remove this autoregulatory control element whilst ligating the BCR region. To test this idea, we generated a series of constructs of the general form ABLauto-linker-CAGEC-ABLC with a Npu CAGE system, where the length of the autoregulatory region (ABLauto) and the adjacent linker were varied (Extended Data Fig. 6b and Supplementary Table 1, 2). Screening of these candidates led to the identification of an optimal arrangement that exhibited comparable levels of basal kinase activity, as judged by global phosphotyrosine (pTyr) levels, to the wild-type control protein (Extended Data Fig. 6c, d). Further examination revealed that the ABLauto region in the construct was subject to N-myristoylation (Extended Data Fig. 6f, g), which is known to be critical for allosteric maintenance of basal ABL kinase activity33. Importantly, autoregulation of ABL kinase activity was lost in a mutant version of the CAGE construct, unable to accept the myristoyl modification (Fig. 3b).
Fig. 3 |. Post-translational generation of BCR-ABL allows dissection of dynamic cellular phosphorylation events.

a, Left – crystal structure of ABL1 kinase (PDB: 2FO0). The unstructured cap region which connects the SH3 domain to the N-terminal myristoyl group is shown as a cartoon. The myristoyl modification binds to an allosteric site in the C-lobe of the kinase domain. Right – schematic depicting the design of the autoinhibited CAGEC-ABL1 construct. b, Immunoblots of the basal tyrosine phosphorylation levels in HEK293T cells expressing wild-type or G2A mutant of ABL1 and CAGEC-ABL1 constructs. c, Schematic of proximity-triggered CPS between complementary BCR-CAGEN and CAGEC-ABL1 constructs to generate the BCR-ABL oncofusion. Molecular weights of the splice product and CAGE constructs are indicated. The CAGEC-ABL1 construct is kept in an inactive state prior to splicing through use of a N-myristoyl control element. d, Immunoblots of BCR-ABL (FLAG-tag) splice product (SP) and global tyrosine phosphorylation levels in HEK293T cells co-expressing BCR-CAGEN (Myc-tag) and CAGEC-ABL1 (FLAG-tag). Cells were treated with DMSO or rapalog (100 nM) or rapalog and Imatinib (10 μM) for 18 hours prior to analysis. ‘anti-FLAG (dark)’ indicates longer exposure to the immunoblots against anti-FLAG bands. e, Immunoblots of the tyrosine phosphorylation levels in HEK293T cells co-expressing BCR-CAGEN and CAGEC-ABL1. Cells were treated with rapalog (100 nM) for indicated time-points before analysis with the indicated antibodies. Data in (b), (d), and (e) are representative of n=3 independent experiments. CAGEN refers to ‘NpuNCage-FKBP,’ and CAGEC refers to ‘ABLauto-FRB-NpuCCage’.
We next asked whether the generation of BCR-ABL by proximity-induced CPS would lead to the activation of protein tyrosine kinase activity. Gratifyingly, upon co-expression of the complementary BCR- and ABL1 CAGE constructs, we observed rapalog-dependent generation of the BCR-ABL splice product accompanied by a clear increase in global pTyr levels (Fig. 3d and Extended Data Fig. 6e, h, i). Importantly, this increase was not observed in the presence of the ABL inhibitor Imatinib (Fig. 3d). From this, we conclude that the observed increase in global phosphorylation is due to the generation of BCR-ABL via proximity-triggered CPS. To explore cellular pTyr dynamics upon acquisition of onco-kinase activity, we carried out time-resolved quantitative phospho-proteomics (Supplementary Fig. 3a, b). This analysis revealed 39 sites in 37 proteins whose pTyr levels reproducibly increased over 6 hours in a manner that satisfied our stringency criteria (Extended Data Fig. 7a, b, Supplementary Fig. 3c, and Supplementary Data 3). This list included multiple pTyr sites within BCR-ABL itself, as well as pTyr sites within several receptor protein tyrosine kinases and cyclin dependent kinases. Broadly speaking, the identified pTyr sites could be divided into those that reach a maximum within 2 hours of initiation, versus those that accumulate more slowly (Extended Data Fig. 7b). Among the former group we were surprised to see that several proteins, such as the cytoskeletal signaling protein paxillin (pY118), exhibited an acute increase in phosphorylation which then decreased at later timepoints. This differential dynamic behavior was verified for representative examples using immunoblotting analysis (Fig. 3e and Extended Data Figure 7c). Notably, use of a Tet-On system for inducible BCR-ABL expression in HEK293T cells did not lead to comparable changes in pTyr dynamics at the earlier timepoints following doxycycline treatment (Extended Data Fig. 7d).
Time-resolved DNAJ-PKAc activation deconvolutes signaling
Given the success with BCR-ABL, we were keen to determine whether our CAGE system could be applied to another oncofusion involving a protein kinase. For this, we chose the DNAJ-PKAc fusion protein which is the key driver in fibrolamellar hepatocellular carcinoma (FL-HCC), a rare cancer of the liver34,35. This oncofusion results from a deletion in chromosome 19, resulting in the in-frame fusion of the J domain of heat shock protein DNAJB1 and most of the catalytic subunit of cAMP-dependent protein kinase (PRKACA)34. Reminiscent of BCR-ABL, the genetic fusion results in loss of the first 14 residues from PRKACA which, notably, includes an autoregulatory N-myristoylation modification36 (Extended Data Fig. 8a, b). While DNAJ-PKAc is regarded as the central player in FL-HCC oncogenesis, many questions remain regarding how cellular signaling is altered by this chimeric protein during cell transformation37–40. Consequently, we imagined that the ability to control the synthesis of the chimera using our CAGE system could provide new insights into these molecular mechanisms.
Informed by the crystal structure of DNAJ-PKAc41,42, we designed a NrdJ-1 CAGE system in which the splicing junction was located within the unstructured linker region immediately following the N-terminal alpha-helix of PRKACA (Fig. 4a and Extended Data Fig. 8b–d). A similar regulation strategy was adopted to that successfully employed in the BCR-ABL system. Thus, the CAGE apparatus was nested between the autoregulatory region (PKAcauto) and PRKACA (Extended Data Fig. 8c, d). Metabolic labeling studies confirmed that the PKAcauto control element in the CAGEC-PRKACA construct was N-myristoylated (Extended Data Fig. 8e, f). Immunoblotting using an antibody against phospho-PKA substrates revealed that HEK293T cells co-expressing the complementary DNAJB1- and PRKACA CAGE constructs did not have increased levels of phosphorylation compared to control cells lacking the CPS apparatus (Extended Data Fig. 8g). By contrast, rapalog-induced generation of the DNAJ-PKAc splice product resulted in a change in the global PKA phosphorylation signature similar to that of positive control cells expressing the oncofusion kinase (Extended Data Fig. 8g, h).
Fig. 4 |. Dynamic cellular phosphorylation events induced by post-translational generation of DNAJ-PKAc.

a, Schematic showing DNAJ-PKAc generated by proximity-triggered CPS between complementary DNAJB1-CAGEN and CAGEC-PRKACA constructs. Molecular weights of the splice product and the CAGE constructs are indicated, and the N-myristate tail in CAGEC-PRKACA is proposed to be involved in autoregulation. b, Summary of the dynamic protein phosphorylation phases downstream of DNAJ-PKAc activation as revealed using the CAGE system. c, Left – DNAJ-PKAc splice product (SP) generation in AML12 cells co-expressing DNAJB1-CAGEN and CAGEC-PRKACA. Cells were treated with DMSO, rapalog (100 nM), or rapalog with SGK1 inhibitor (GS-9007, SGK1i, 5 μM) for 6 hours, followed by immunoblotting using the indicated antibodies. ‘anti-HA (dark)’ indicates longer exposure to the immunoblots against anti-HA bands. Right – the bar charts showing quantified pERK levels of indicated treatments normalized by the ERK intensity (n=4 independent experiments, means and s.e.m.). d, e, Left – representative wells for non-transformed (d) and DNAJ-PKAc-expressing (e) AML12 cells treated with DMSO or GS-9007 (5 μM) for 10 days and stained with crystal violet. Right – clonogenic cell growth of non-transformed (d) and DNAJ-PKAc-expressing (e) AML12 cells quantified by the area covered by colonies and normalized by the average colony area of cells in the DMSO treatment of each plate (n=24 independent experiments, means and s.d.). CAGEN refers to ‘NrdJ-1NCage-FKBP,’ and CAGEC refers to ‘PKAcauto-FRB-NrdJ-1CCage’. In panels (c-e), statistical significance was determined by two-way student’s t-tests: *P < 0.05; **P < 0.005; ***P < 0.001.
Next, we imported the DNAJ-PKAc CAGE system into AML12 cells, non-transformed murine hepatocytes that have previously been used for studying this oncofusion protein37,38. As anticipated, the DNAJ-PKAc product was generated in a time and dose-dependent manner upon rapalog treatment of a stable AML12 cell line co-expressing the CAGE constructs with ERK phosphorylation level changes not observed in non-transformed AML12 cells (Extended Data Fig. 8i–k). Co-immunoprecipitation studies confirmed that the DNAJ-PKAc splice product associates with the known interaction partner, chaperonin heat shock protein 70 (HSP70)37 (Extended Data Fig. 8l). We then carried out time-dependent quantitative phosphoproteomics analyses in both HEK293T and AML12 cells following induction of protein splicing. Analysis of the HEK293T dataset revealed 245 phosphopeptides from 230 proteins whose phosphorylation levels reproducibly changed over 12 hours, as exemplified by the phosphorylation at PRKACA (Supplementary Fig. 3c, Supplementary Fig. 4a, b, and Supplementary Data 4). Analysis of the dataset using the Kinase Library platform43 revealed that the basophilic motif and the AGC (PKA, PKG and PKC) kinase family target sites are enriched in the UP-phosphorylated group, whereas the DOWN-phosphorylated group align with proline-directed motif sites (Supplementary Fig. 4c, d and Supplementary Data 6). Broadly similar results were obtained from the AML12 cell line, in this case 138 phosphopeptides from 127 proteins reproducibly changed during the time course (Supplementary Fig. 5a–d and Supplementary Data 5, 7).
To further explore the signaling events downstream of DNAJ-PKAc generation, we grouped the proteins in the datasets based on phosphorylation dynamics (Supplementary Fig. 6 and Supplementary Fig. 7). All clusters were characterized by maximal changes of phosphorylation levels, either increased or decreased, at the specific time points after triggering splicing; 1 hour (cluster 1), 2 hours (cluster 2), 6 hours (cluster 3), and 12 hours (cluster 4). GO analysis indicated distinct biological roles for protein members in each cluster (Supplementary Fig. 8a, b). In the mouse liver cells, proteins in cluster 1 are linked to mRNA splicing/receptor tyrosine kinases-associated signaling (phosphorylated) or RNA metabolism (dephosphorylated), cluster 2 associates with mitotic cell transition/protein localization (phosphorylated) or RNA Pol II transcription (dephosphorylated), cluster 3 is linked to MAPK signaling (phosphorylated), and the top terms in cluster 4 are regulation of protein/calcium ion transport (phosphorylated) or mitotic cell cycle/Rho GTPases signaling/chromosome organization (dephosphorylated), respectively (Fig. 4b). Notably, the dynamic phosphorylation events downstream of DNAJ-PKAc appear to be context-dependent – the GO terms associated with each cluster are not the same between the HEK293T and AML12 cells two cell types, a divergence highlighted by principal component analysis (PCA) of the phosphoproteomics datasets (Supplementary Fig. 9).
Finally, we wondered if our inducible DNAJ-PKAc system could provide insights into the molecular player(s) involved in up-regulation of the MAPK/ERK signaling pathway, which is a hallmark of FL-HCC37,38. With this in mind, we were particularly intrigued by the finding that phosphorylation levels of Serum and glucocorticoid-regulated kinase 1 (SGK1) increased at early time points upon induction of DNAJ-PKAc in AML12 cells (Supplementary Fig. 7). SGK1, a member of the AGC kinase family, is overexpressed in many cancers where it affects cellular signaling pathways, including MAPK/ERK activation44–46. Interestingly, phosphorylation at serine 401 on SGK1 identified in our time-resolved phospho-proteomic analysis and in AML12 cells stably expressing DNAJ-PKAc was previously found to be essential for full SGK1 activity47 (Extended Data Fig. 9a–d), suggesting this protein as a candidate for linking DNAJ-PKAc to activation of the MAPK/ERK pathway. Consistent with this idea, elevated ERK phosphorylation was not observed in DNAJ-PKAc-activated AML12 cells treated with the SGK1 inhibitor, GS-900748,49 (Fig. 4c). Furthermore, compared to the non-transformed AML12 cells, the growth of AML12 cells stably expressing DNAJ-PKAc and their ERK phosphorylation were selectively inhibited by GS-9007 (Fig. 4d, e and Extended Data Fig. 10a–c). Overexpression of SGK1 selectively increases the ERK phosphorylation and the growth of AML12 cells stably expressing DNAJ-PKAc (Extended Data Fig. 10d–g). Together, these results implicate SGK1 in connecting the activity of the DNAJ-PKAc oncofusion with elevated MAPK signaling in this early FL-HCC model (Extended Data Fig. 10h).
Discussion
In this study, we have developed strategies for controlling protein function at that post-translational level using caged split inteins. The platform has been successfully applied to multiple targets, including five different native oncofusions of differing sizes and functions, illustrating the generality of the method. The temporal control afforded by our strategy allowed us to follow changes in cellular phosphorylation following the generation of the kinase oncofusions. In the case of BCR-ABL, different phases of pTyr dynamics were uncovered. Perhaps most remarkable was the observation of transient phosphorylation events that occur in the first few hours after initiating splicing, including in receptor tyrosine kinases (RTKs). The fact that we also observed delayed tyrosine phosphorylation of known substrates of these RTKs, including various nuclear ribonucleoproteins, illustrates how our approach can provide insights into the cascade of signaling events emanating from the onset of BCR-ABL kinase activity.
Application of the CAGE system to DNAJ-PKAc has provided insights into the signaling pathways perturbed by this oncofusion. Our phospho-proteomics studies indicate that many basophilic motif sites and AGC kinase targets, including serine 11 in WT-PRKACA, undergo phosphorylation following DNAJ-PKAc generation. However, it is worth noting that early transient phosphorylation targets were not solely dominated by this single motif and kinase targets. This suggests that DNAJ-PKAc may contribute to oncogenesis not only through its kinase activity but also by deregulating signaling landscapes through other mechanisms37,39. Secondly, our research has revealed distinct temporal phases of phosphorylation following DNAJ-PKAc generation, including MAPK signaling, which provides insight into the underlying molecular connections between the oncofusion and cancer transformation. With respect to this, we provide evidence that the kinase SGK1 acts as upstream intermediate connecting DNAJ-PKAc activity to the MAPK pathway. Lastly, our research revealed multiple dephosphorylation events after DNAJ-PKAc generation in the HEK293T and AML12 cell lines. Notably, we observed the early transient phosphorylation of several phosphatase regulatory subunits after DNAJ-PKAc generation, suggesting potential mechanistic links that could be explored in future studies.
The CAGE system described herein possesses several key attributes that can be exploited for controlling protein function. Firstly, since it works at the post-translational level, target fusion proteins can be generated in a dose and time-dependent manner from premade parts. As demonstrated above, this provides opportunities for studying acute changes in cellular physiology upon protein gain of function. Secondly, the method is traceless (NrdJ-1 CAGE) or near-scarless (Npu CAGE), meaning that the entire CPS apparatus, as well as any other appended control elements, is removed from the final splice product. This distinguishes the approach from more traditional induced proximity strategies based on chemical or optogenetic inputs where the assembly apparatus is retained in the final complex50,51. Lastly, the fact that amide bonds are both broken and formed during protein trans-splicing provides opportunities for building additional control into the system. In this study, we demonstrated this through the transient introduction of multiple subcellular localization factors and autoregulatory domains with the CAGE constructs. However, we envision that this approach can be extended to other control elements, such as protein domains that bind to other proteins or nucleic acids52,53. More generally, the system provides a platform for the ‘metathesis’ of protein molecules; splicing results in the simultaneous exchange of multiple functional domains through both covalent and non-covalent interactions (Supplementary Fig. 10). This capability should engender the approach to a wide range of synthetic biology applications.
Methods
Materials
Alkynyl myristic acid (YnMyr, CAS No. 82909-47-5) was obtained from Click Chemistry Tools. Azo biotin-azide (CAS No. 1339202-33-3) and Imatinib (Cas No.152459-95-5) were obtained from Sigma Aldrich, rapamycin (Cas No.53123-88-9) from ApexBio (Huston, TX), AP21967 (rapalog) from Takara, Doxycycline (Cas No.24390-14-5) from Stem Cell Technologies, and SGK1 inhibitor GS-9007 (Cas No.1426214-51-8) was obtained from Cayman Chemical. All commercially obtained chemicals were dissolved in DMSO and used without further purification.
Cell culture
HEK293T (CRL-3216) and AML12 (CRL-2254) cell lines were purchased from ATCC. HEK293T and stably expressed HEK293T cells were propagated in DMEM (Thermo Fisher) supplemented with 10% v/v fetal bovine serum (FBS, Atlanta Biologicals), 2 mM L-glutamine (Life Technologies), and 1% v/v penicillin-streptomycin (5,000 U/mL, Life Technologies). AML12 and stably expressed AML12 cells were propagated in DMEM/F-12 (Gibco) supplemented with 10% v/v FBS, 0.1% v/v gentamicin (50 mg/mL, Sigma Aldrich), 1% v/v ITS-G (100x, 1.0 mg/mL recombinant human insulin, 0.55 mg/mL human transferrin, 0.67 μg/mL sodium selenite, Gibco), and 0.04 μg/mL dexamethasone (Sigma Aldrich). AML12 and stable AML12 cells were switched to serum-free media for 24 hours before the experiments and treated with the rapalog under the same condition37. All cell lines were grown at 37 °C in a 5% CO2 humid incubator.
Transfection
Each plate (6-well or 10 cm, Fisher Scientific) of HEK293T cells at ~70% confluency was transfected with plasmids encoding ‘POIN-CAGEN’ and ‘CAGEC-POIC’ with Lipofectamine 2000 (Thermo Fischer) according to the manufacturer’s protocol - ‘POI’ = Protein of Interest. After 6 hours, the media was aspirated and replaced with fresh media. Transfection was performed for 24 or 48 hours in an incubator at 37 °C in 5% CO2.
Lentiviral production and generation of stable cell lines
Lenti-X 293T cells (Takara, 632180) were seeded in 6-well plates (Fisher Scientific) at 1.0 × 106 cells per well and transfected after 24 hours with transfer plasmid (POIN-CAGEN_IRES2_EGFP or CAGEC-POIC_IRES2_mCherry in pEF1α backbone, 4 μg) and packaging vectors (pMD2.G 1.2 μg and psPAX 3.6 μg) using Lipofectamine 2000. Following overnight transfection, media was exchanged and allowed to incubate for an additional 24 hours. The viral collection was performed at 24, 48, and 72 hours. Viral media was filtered with a 0.45 μm PES syringe filter (Membrane Solutions), and Polybrene (Sigma #H9268) was added to a concentration of 8 μg/mL before infection of target cell lines.
HEK293T and AML12 cell lines were infected with the 24-hour or 48-hour viral harvests. After 24 hours, cells were allowed to recover by exchanging the media. Serial infections were performed for co-viral transduction of the two CAGE constructs; cells were initially selected with Blasticidin S (20 μg/mL final concentration to HEK293T cells and 2 μg/mL to AML12 cells, Gibco) after the first viral infection, then another virus was added to the cells for the second infection. Then, double-positive cells were isolated from the mix via FACS after being cultured with Blasticidin S.
SGK1-targeting shRNA vector refers to a pLKO.1 vector incorporating TRC Clone ID TRCN0000022885 shRNA oligo generated following the addgene protocol (Addgene 10878). Lentiviruses were generated by following the method above. Non-transformed and DNAJ-PKAc-expressing AML12 cells were infected with the 24-hour viral harvests. After 24 hours, cells were allowed to recover by exchanging the media and selected with Puromycin (2 μg/mL to AML12 cells, Gibco).
BCR-ABL Tet-On HEK293T cells were generated by transfection with a Piggybac transposon plasmid encoding for BCR-ABL under the control of a doxycycline-inducible promoter and a Piggybac transposase plasmid (a gift from the Kadoch lab, Dana Farber Cancer Institute) (2:1 ratio). Lipofectamine 3000 (Thermo Fischer) was used for the transfection according to the manufacturer’s protocol, transformed cells were selected by being cultured with 10 μg/mL Blasticidin S after 30 hours of transfection.
Fluorescence-activated cell sorting (FACS)
After co-viral transduction followed by selection with Blasticidin S, cells were collected after trypsin treatment (Trypsin-EDTA 0.05%, Gibco), washed twice with FACS buffer (2% FBS in PBS), and resuspended in FACS buffer at a 1 × 106 cells/mL concentration. Cells were filtered through the cell strainer (40 μm, Fisher Scientific) and sorted using a FACSAria Fusion cell sorter (BD Bioscience) at the Princeton Flow Cytometry Resource Facility. Cells expressing both GFP and mCherry were collected as single clones in 96-well plates or in a common tube and expanded in DMEM media until used for testing CPS modules in cells. The GFP was excited by the 488 nm laser, with emitted fluorescence detected through a 530/30 bandpass filter, and the mCherry was excited by the 561 nm laser, and emission was detected through a 610/20 bandpass filter. The cell sorter was equipped with a 100-micron nozzle, and the sheath pressure was maintained at 20 psi.
Cloning
All cloning was performed using the Phusion High-Fidelity DNA polymerase (New England Biolabs) and primers from IDT or Sigma Aldrich. Plasmids of the general forms ‘POIN-CAGEN’ and ‘CAGEC-POIC’ were cloned into vector backbones using Gibson Assembly (New England Biolabs) following the manufacturer’s instructions. The encoded constructs were controlled by the CMV promoter (strong expression when transiently transfected, pCMV5 vector backbone) and sv40 promoter (weak expression when transfected, pSG5 vector backbone). Point mutations were installed using inverse PCR. For generating lentiviral vectors for constitutive expression of CAGE constructs, the genes incorporating POIN-CAGEN with IRES2_EGFP or CAGEC-POIC with IRES2_mCherry were cloned into the 3rd generation lentiviral vector via restriction enzyme digestions using ClaI and MluI (New England Biolabs) followed by ligation using T4 DNA ligase (New England Biolabs). Plasmid sequences were designed with SnapGene software and verified by Sanger sequencing (Genewiz) using universal primers. Amino acid sequences for the encoded proteins are provided in Supplementary Table 2. In addition, the primer sequences used for point mutations of caging sequence in NrdJ-1 CAGE modules, with the structure of NrdJ-1 predicted by AlphaFold2, were provided in Supplementary Table 3.
Live cell imaging
HEK293T cells co-expressing ‘mVenus-CAGEN’ and ‘CAGEC-RFP’ were seeded onto a 35 mm dish with a 14 mm glass diameter (MatTek) and treated with either rapalog or DMSO for 8–10 hours. Time-resolved fluorescence imaging was performed with rapalog (100 nM) treatment for desired timepoints, and all treatments were finished simultaneously. Next, the cells were stained with Hoechst 33342 (1:2,000 dilution, Invitrogen) for 5 minutes and washed twice with Live Cell Imaging Solution (Thermo Fisher). The stained cells were imaged using a Nikon CSU-21 spinning disk confocal microscope with NIS Elements (version 5.2) software. Excitation of mVenus was performed using a 488 nm laser, RFP was excited using a 561 nm laser, and Hoechst was excited using a 405 nm laser. Images were captured with a 60x Plan Apo λ Oil objective and a Hamamatsu ORCA-Flash4.0 sCMOS camera.
Subcellular fractionation
1.0 × 107 HEK293T cells transfected with plasmids encoding ‘POIN-CAGEN’ and ‘CAGEC-POIC’ and treated with DMSO or rapalog (100 nM) were lysed by hypotonic lysis in 1 mL RSB buffer (10 mM Tris, 15 mM NaCl, 1.5 mM MgCl2, EDTA-free Halt protease inhibitor cocktail, pH 7.6) for 10 minutes on ice. The crude nuclei were isolated by centrifugation at 400 g for 5 minutes at 4 °C. After collecting the supernatant as a cytosolic fraction, the nuclei were resuspended in 1 mL RSB buffer, homogenized with 10 strokes of a loose pestle Dounce homogenizer, and pelleted at 400 g for 5 minutes at 4 °C. The nuclei were resuspended in 1 mL RSB buffer and centrifuged at 700 g for 5 minutes at 4 °C. Finally, the nuclei were resuspended in 300 μL of lysis buffer (Tris 20 mM, NaCl 150 mM, 0.5% v/v SDS, EDTA-free Halt protease inhibitor cocktail, pH 7.6), sonicated (Fisher Scientific FB-505, Amplitude: 25%, No pulse, Time: 15 seconds × 2 times), and insoluble debris cleared by centrifugation at 17,000 g for 10 minutes at 4 °C. Alternatively, the isolated nuclei were resuspended in desired buffers for other experiments, such as ChIP or co-immunoprecipitation assays.
Metabolic labeling of cells with YnMyr and N-myristoylated protein enrichment
Enrichment of N-myristoylated proteins in cells treated with the analog probe was performed using a protocol adapted from the literature54 with minor modifications, as described below. HEK293T cells transfected with the plasmid encoding ‘ABL1-HA’, ‘CAGEC-ABL1-HA’, or ‘CAGEC-PRKACA-HA’ were incubated with culture media containing YnMyr (20 μM) for 24 hours before cell lysis. After treatment, cells were washed with cold PBS (2 x) and lysed on ice using lysis buffer (Tris 20 mM, NaCl 150 mM, 1% v/v SDS, EDTA-free Halt protease inhibitor cocktail, pH 7.6) with sonication (Fisher Scientific FB-505, Amplitude: 30%, No pulse, Time: 8 seconds × 2 times). Lysates were centrifuged at 17,000 g for 10 minutes to remove insoluble material. The supernatant was collected, and protein concentration was measured using the BCA assay (Thermo Fisher). A click mixture containing azido-azo biotin, Tris(2-carboxyethyl)phosphine (TCEP), Tris(benzyltriazolylmethyl)amine (TBTA), and CuSO4 was prepared by combining the reagents in that order. The click mixture was then added to the diluted lysate and the reaction allowed to proceed for 1 hour at room temperature in the dark. The final reaction conditions were as follows: 1 mg/mL protein, 1% SDS, 50 mM Tris pH 8.0, 0.1 mM Azo biotin-azide (stock solution 5 mM in DMSO), 1 mM TCEP (stock solution 50 mM in ddH2O, prepared fresh), 0.2 mM TBTA (stock solution 10 mM in DMSO), 1 mM CuSO4 (stock solution 50 mM in ddH2O, prepared fresh). The samples were vortex-mixed briefly with the addition of methanol (3 volumes), chloroform (0.75 volume), and ddH2O (2 volumes), then centrifuged at 17,000 g for 10 minutes to pellet precipitated proteins. After removing both the top and bottom layers, protein pellets were washed twice with methanol (2.5 volumes) and air-dried. After protein precipitation, the pellet was resuspended in PBS containing 1% SDS supplement with EDTA-free Halt protease inhibitor. Once the pellet was completely dissolved, the samples were diluted with 1% SDS in PBS to 1 mg/mL protein concentration.
Streptavidin agarose resin (Thermo Scientific) was washed with 1% v/v SDS PBS (2 x). Typically, 50 μL of resin slurry was used for 1 mg lysate. The agarose beads were added to the samples, and the affinity enrichment carried out overnight at 4 °C or 2 hours at room temperature. The supernatant was removed, and the beads were washed with 1% SDS in PBS (3 x) followed by PBS (2 x). Enriched proteins were eluted by either heating the beads with 2X Laemmli buffer containing 4% beta-mercaptoethanol (βMe) at 95 °C for 10 minutes or by incubating the beads with 25 mM of Sodium dithionite (Sigma Aldrich) and 1% SDS in PBS at room temperature for 30 minutes (2 x). Samples were then prepared for SDS-PAGE analysis below.
Immunoblotting
Cells were harvested by scraping, pelleted by centrifugation, washed twice with PBS, and lysed in 8 M urea, 50 mM NH4HCO3 pH 8.0 or TBS (50 mM Tris, 150 mM NaCl, pH 7.6) containing 1% SDS supplement with EDTA-free Halt protease inhibitor and phosphatase inhibitor cocktails (Thermo Scientific) with sonication (Amplitude: 20%, No pulse, Time: 6 seconds) at 4 °C. Cells were then cleared by centrifugation, and the supernatant was diluted into 4x Laemmli buffer containing 8% beta-mercaptoethanol (βMe) as reducing agents. Samples were prepared for SDS-PAGE by heating to 95 °C for 10 minutes, cooled to room temperature, resolved on 6%, 8%, and 4–12% Bis-Tris protein gels, and transferred onto nitrocellulose membranes by standard immunoblotting methods. Membranes were blocked in 3% BSA in TBS containing 0.1% tween-20 (TBST) and probed with primary antibodies (see Supplementary Table 4). After the overnight incubation of membranes with primary antibodies, the secondary donkey anti-rabbit and donkey anti-mouse (LI-COR) were used at 1:10,000 dilution in 3% BSA-containing TBST and incubated for 1 hour prior to washing and imaging on a LI-COR infrared scanner processed with Odyssey Infrared Imaging Software (version 3.0). Whole protein bands were stained via Ponceau S staining solution (Thermo Fisher). Densitometry measurements were performed with ImageJ software. Graphs were generated in GraphPad Prism with analysis algorithms indicated in the figure legends.
Time- and dose-dependent rapalog treatment studies or doxycycline treatments (200 ng/mL) were performed in cells 24 hours after transient transfection of plasmids encoding ‘POIN-CAGEN’ and ‘CAGEC-POIC’ or stable cells. The indicated concentration of rapalog (stock solution 1 mM in DMSO) or equivalent DMSO in appropriate media was added to cells in 6-well plates or 10 cm dishes. Following the indicated incubation, cells were lysed in 8 M urea, 50 mM NH4HCO3 pH 8.0, or TBS containing 1% SDS supplement with EDTA-free Halt protease inhibitor and phosphatase inhibitor cocktails (Thermo Scientific) and processed for immunoblotting as indicated above.
ChIP sample preparation
HEK293T cells stably expressing the genes encoding ‘AML1N-CAGEN’ and ‘CAGEC-ETOC’ were treated with DMSO or rapalog (100 nM) followed by fixing the cells in 1% formaldehyde for 8 minutes. After quenching the excess formaldehyde with Glycine (~ 0.125 M final, from 2.5 M stock), cells were washed with cold PBS (2 x) and lysed by following the subcellular fractionation protocol to isolate the nuclei. Then, the nuclear pellets were resuspended in high salt buffer (50 mM HEPES-NaOH, 500 mM NaCl, 1 mM EDTA, 0.1% Sodium deoxycholate, 1% Triton-X, 0.05% SDS, EDTA-free Halt protease inhibitor, pH 7.5) and sonicated (Amplitude: 25%, Pulse: 15 seconds on / 45 seconds off, Time: 2 minutes – 8 cycles) at 4 °C. The samples were centrifuged at 17,000 g for 10 minutes to remove insoluble material. Each ChIP was performed with 1 million cells, and chromatin samples were incubated overnight at 4 °C with anti-HA magnetic beads (Pierce, 88836) washed with high salt buffer (3 x) before incubation. The immobilized protein-DNA complexes were washed 3 times with high salt buffer, 2 times with low salt buffer (10 mM Tris-Cl, 250 mM LiCl, 1 mM EDTA, 0.5 % NP40, 0.5% sodium deoxycholate, pH 8.0), and 1 time with TE buffer (10 mM Tris-Cl, 1 mM EDTA, pH 8.0). Elution and reverse-crosslinking were carried out in the elution & de-crosslink buffer (50 mM Tris-Cl, 10 mM EDTA, 1% SDS, pH 8.0) supplemented with RNAse A and Proteinase K at 65 °C for 4 hours. Inputs without affinity pulldown with antibodies were also reverse-crosslinked by the same method. The supernatants were collected and purified via PCR purification and gel extract columns (Tomas Scientific), and DNA was eluted from columns with TE buffer.
ChIP-qPCR and data analysis
ChIP samples and inputs (1%) were used for real-time quantitative polymerase chain reaction (RT-qPCR) to get cycle threshold (Ct) values. Quantification of PCR was performed with PowerUp™ SYBR™ Green master mix (Thermo Fisher) using a ViiA 7 Real-Time PCR system (Thermo Fisher). Ct values from n=6 (three independent measurements from two independent experiments) of each sample were normalized by input Ct values, and fold enrichment was calculated with standard deviations after normalization with the average values of DMSO-treated conditions. In addition, P values were calculated by univariate two-sided t-test with the values of rapalog and DMSO-treated conditions. The qPCR primer sequences used in this study are provided in Supplementary Table 5, and Ct values of ChIP-qPCR experiments are provided in Supplementary Data 1.
Co-immunoprecipitation assay (AML1-ETO)
10 cm plates of HEK293T cells stably expressing the genes encoding ‘AML1N-CAGEN’ and ‘CAGEC-ETOC’ were treated with DMSO or rapalog (100 nM) for 18 hours and washed with cold PBS (2 x). After hypotonic lysis, nuclei were isolated by following the subcellular fractionation protocol. Then, isolated nuclei samples were resuspended in 450 μL modified buffer A (20 mM HEPES, 1.5 mM MgCl2, 150 mM KCl, 0.1% Triton-X, 0.05% SDS, EDTA-free Halt protease inhibitor cocktail, pH 7.5) and benzonase nuclease (4 μL, Millipore) was then added to each condition for 1–2 hours at room temperature (Benzonase nuclease digestion time varies by enzyme batch and must be determined for each experiment. 3 μL of the digest was taken every 30 minutes, and these aliquots were run on a 1% agarose gel, and the digestion efficiency was visualized with ethidium bromide staining). Once digestion was completed, the samples were spun at 1.3 kg for 5 minutes at 4 °C, and the supernatants were collected (fraction S1). 550 μL of modified buffer A was added to the pellets, and each sample was rotated end-over-end at 4 °C for 1–2 hours. The samples were spun at 13,000g for 5 minutes at 4 °C, and the supernatants were collected (fraction S2). Fraction S1 and S2 are combined in a Lo-bind Eppendorf tube after centrifugation (13,000 g for 5 minutes), and the protein concentration of each sample was measured using the BCA assay (Thermo Fisher) and diluted with modified buffer A to 0.8 mg/mL protein concentration. 35 μL of anti-HA magnetic beads (Pierce, 88836) were pre-washed twice with modified buffer B (20 mM HEPES, 1.5 mM MgCl2, 150 mM KCl, 0.3% Triton-X, 0.1% SDS, EDTA-free Halt protease inhibitor, pH 7.5) and added to the sample. The HA-IP was performed overnight with end-over-end rotation at 4 °C. The remaining samples were kept as inputs. The beads were washed sequentially with 4x modified buffer B and 1x ddH2O for 2 minutes each rotation. 120 μL of 1x Laemmli buffer (no reducing agents) was added, and the beads were heated at 98 °C for 10 minutes. After collecting the eluents and adding reducing agents (DTT, 10 mM final), samples were run on a Bis-Tris gel for immunoblotting with indicated antibodies (see Supplementary Table 4).
Co-immunoprecipitation assay (DNAJ-PKAc)
15 cm plates of AML12 cells stably expressing the genes encoding ‘DNAJB1-CAGEN’ and ‘CAGEC-PRKACA’ were treated with DMSO or rapalog (100 nM) for 24 hours and washed with cold PBS (2 x). Cells were then lysed in TBS (50 mM Tris, 150 mM NaCl, pH 7.6) containing 1% Triton-X supplement with EDTA-free Halt protease inhibitor cocktails (Thermo Scientific) with sonication (Amplitude: 20%, Pulse: 1 second on / 1 second off, Time: 30 seconds) at 4 °C. Lysates were centrifuged at 17,000 g for 10 minutes to remove insoluble material, and protein concentration was measured by BCA assay. Lysates were then diluted to 2 mg/mL protein concentration and transferred to a Lo-bind Eppendorf tube. Next, 20 μL of anti-HA magnetic beads (Pierce, 88836) were pre-washed twice with the lysis buffer above and added to the sample. The HA-IP was performed overnight with end-over-end rotation at 4 °C. The remained samples were kept as inputs. The beads were washed sequentially with 5 times lysis buffer containing 0.05% SDS and 2 times ddH2O for 2 minutes each rotation. Next, 75 μL of S-TRAP elution buffer (TEAB 50 mM, 5% SDS, pH 7.1) was added, and the beads were heated at 45 °C for 10 minutes. After collecting the eluents and diluted into 4x Laemmli buffer containing 8% beta-mercaptoethanol (βMe) as reducing agents, samples were run on a Bis-Tris gel for immunoblotting.
SILAC cell culture
SILAC labeling was performed by growing cells for at least eight passages in lysine- and arginine-free SILAC medium (DMEM, Invitrogen) supplemented with 10% dialyzed fetal calf serum (Thermo Fisher), 2 mM L-glutamine, and 1% P/S. “Light” and “heavy” media were supplemented with natural lysine and arginine (Sigma Aldrich, 0.4 mM final), and 13C-, 15N-labeled lysine and arginine (Cambridge Isotope Laboratories, 0.4 mM final), respectively.
Proteomics sample preparation for quantitative AP-MS (S-TRAP)
Quantitative AP-MS of AML1-ETO splice product with SILAC proteomics was performed with “heavy” and “light” labeled HEK293T cells stably expressing ‘AML1N-CAGEN’ and ‘CAGEC-ETOC’ constructs55. SILAC-labeled cells, grown to 70% confluency in two 10 cm cell plates each, were incubated with DMSO alone (light cells) or rapalog (100 nM, heavy cells) for 18 hours in SILAC DMEM. All cells were then washed with cold PBS (2 x) and lysed by following the subcellular fractionation protocol to isolate the nuclei. Then, each heavy and light isolated nuclei samples were benzonase nuclease digested by following the co-immunoprecipitation protocol described above. After combining fractions S1 and S2 of each sample, protein concentration was measured using the BCA assay (Thermo Fisher) and diluted with modified buffer A to 0.8 mg/mL protein concentration. For SILAC label-swap experiments, “light” HEK293T cells were incubated with rapalog, and “heavy” cells were incubated with DMSO and processed as above. 40 μL of anti-HA magnetic beads were pre-washed twice with modified buffer B and added per 1 mL sample (total 2 mL per condition) in a Lo-bind Eppendorf tube. The HA-IP was performed overnight with end-over-end rotation at 4 °C. Then, the beads were washed sequentially with 4x modified buffer B and 1x LC-MS grade H2O for 2 minutes each rotation. 100 μL of S-TRAP elution buffer (TEAB 50 mM, 5% SDS, pH 7.1) was added, and the beads were heated at 45 °C for 10 minutes. After collecting the eluent, beads were rinsed again with 100 μL of S-TRAP elution buffer and the eluants combined. Eluents from heavy (+) and light (−) co-immunoprecipitated samples were mixed 1:1 v/v ratio and processed for the S-TRAP sample preparation approach.
The SILAC mixture of co-enriched proteins in S-TRAP elution buffer was digested by following the manufacturer’s protocol (ProtiFi). In brief, proteins were reduced by TCEP (20 mM final) with orbital shaking for 10 minutes at 95 °C. After cooling to room temperature, samples were then alkylated by adding chloroacetamide (40 mM final) with orbital shaking in the dark for 30 minutes at room temperature. Aqueous phosphoric acid was added to a final concentration of ~1.2% v/v. 1 volume of sample solution was then mixed with 6 volumes of S-TRAP binding buffer (90% methanol + 10% 1 M TEAB, pH 7.1) followed by centrifugation at 10,000 g for 2.5 minutes. The supernatants were loaded on to an S-TRAP micro spin column and passed through by centrifugation (1,800 g for 1 minute) multiple times. The columns were then washed by adding S-TRAP binding buffer and centrifugation three times. Proteins captured on the columns were then digested by adding 20 μL of digestion buffer (50 mM TEAB, pH 8.5) containing 1 μg of sequencing grade trypsin (Promega) 1 hour at 47 °C. Following trypsinization, proteolyzed peptides were eluted by adding buffers and centrifuging (3,000 g for 1 minute) sequentially with 40 μL digestion buffer, 40 μL of 0.2% formic acid in LC-MS grade water, 40 μL of formic acid in 40% acetonitrile, and 40 μL of formic acid in 60% acetonitrile. All eluted peptides were combined and dried using a SpeedVac (Thermo Fisher). Dried peptides were dissolved in 0.1% formic acid (pH 3) for proteomic analysis.
Phosphopeptides enrichment from BCR-ABL or DNAJ-PKAc activated mammalian cells
Transfected HEK293T cells or stable AML12 cells were treated with rapalog (100 nM) for each desired timepoint; 0 minutes, 30–60 minutes, 120 minutes, 360 minutes, and 720 minutes (DNAJ-PKAc only). After finishing all treatments simultaneously, cells were washed twice with cold PBS and lysed in 1 mL of 8 M urea, 25 mM NH4HCO3 pH 8.0 supplement with EDTA-free Halt protease and phosphatase inhibitor cocktails per plate with sonication (Amplitude: 30%, No pulse, Time: 10 seconds × 2 times) at 4 °C. Lysates were centrifuged at 17,000 g for 10 minutes to remove insoluble material, and protein concentration was measured by BCA assay. Lysates were then diluted to 4 mg/mL protein concentration with 8 M Urea buffer, followed by disulfide reduction with dithiothreitol (DTT) (10 mM, 40 minutes, 50 °C), alkylation (iodoacetamide, 30 mM, 30 min, room temperature, protected from light) and quenching (DTT, 5mM, 10 minutes, room temperature). The proteome solution was diluted 4-fold with ammonium bicarbonate solution (25 mM, pH 8.0), CaCl2 added (1 mM), and digested with sequencing grade trypsin (~1:100 enzyme/protein ratio; Thermo Fisher) at 37 °C while rotating overnight. Peptide digestion reactions were stopped by acidification to pH 2–3 with 2% formic acid, and peptides were then desalted with Sep-Pak tC18 cartridges (50–100 mg, Waters) and then lyophilized.
The lyophilized peptides from each condition were re-dissolved in Binding/Equilibration buffer (150 μL per 3 mg peptides) and applied to TiO2 spin tips (3 mg peptides per tip) for phosphopeptide enrichment according to the manufacturer’s protocol (High-Select™ TiO2 Phosphopeptide Enrichment Kit, A32993, Thermo Scientific). Eluted phosphopeptides were transferred to Lo-bind Eppendorf tubes, and samples were immediately dried in a SpeedVac. To enrich any remaining phosphopeptides not captured using the TiO2 column the flow-throughs were passed over PHOS-Select IMAC (P9740, Sigma Aldrich) according to the manufacturer’s protocol. After drying the eluents, phosphopeptides from IMAC were combined with the TiO2-enriched peptides per fraction.
In the BCR-ABL studies, the enriched phosphopeptides were further used for immuno-affinity-based enrichment of phosphotyrosine peptides using a protocol adapted from reference56 with minor modifications, as described below. In brief, dried phosphopeptides from each fraction were re-dissolved in 800 μL cold IAP buffer (50 mM Tris-HCl, 150 mM NaCl, 1% w/v n-Octyl-β-D-glucopyranoside (EMD Millipore), EDTA-free Halt protease inhibitor and phosphatase inhibitor cocktails, pH 7.4). The peptide mixtures were agitated in a shaker for 30 minutes to dissolve the peptides thoroughly. Meanwhile, 200 μL of 25% anti-phosphotyrosine antibody (PY99) agarose beads (sc-7020 AC, Santa Cruz) were pre-washed twice with cold IAP buffer and added to the sample. The pY-IP was performed overnight with end-over-end rotation at 4 °C. After centrifugation (1,500 g for 1 minute at 4 °C), the beads were washed sequentially with 3x IAP buffer and 2x LC-MS grade H2O for 2 minutes each rotation. 200 μL of 0.15% trifluoroacetic acid (TFA) in H2O was added, and the beads were agitated on a shaker for 15 minutes at room temperature to elute the phosphotyrosine peptides. Elution was repeated three times, and eluents were combined and desalted with ZipTip C18 tips (100 μL, Millipore) and dried in a SpeedVac.
TMT labeling of enriched phosphopeptides
Phosphopeptides from different timepoint fractions (three biological replicates) were separately labeled with 16-plex isobaric tandem mass tags (TMTpro™ 16plex Label Reagent Set, A44521, Thermo Fisher) according to the manufacturer’s protocol. In brief, 0.5 mg of TMTpro label reagents were equilibrated to room temperature, reconstituted with 20 μL of anhydrous acetonitrile (Optima grade), and added to dried peptides dissolved in 100 μL of TEAB (100 mM, pH 8.5). Labeling was carried out at room temperature for 2 hours with end-to-end rotation and quenched with 5 μL of 5% hydroxylamine (Sigma Aldrich) for 15 minutes. Finally, labeled peptides were combined into a new protein LoBind tube (Eppendorf) and dried in a SpeedVac before processing by the Princeton Proteomics and Mass Spectrometry Core Facility.
After drying, TMT-labeled peptides were re-dissolved in 300 μL of 0.1% TFA in water and fractionated into 8 fractions using the High pH Reversed-Phase Peptide Fractionation Kit (84868, Pierce). Fractions 1, 4, and 7 were combined as sample 1. Fractions 2 and 6 were combined as sample 2. Fractions 3, 5, and 8 were combined as sample 3. Three combined samples were dried in a SpeedVac and resuspended in 5% acetonitrile/water (0.1% formic acid, pH 3) for proteomics analysis.
Proteomic LC-MS/MS runs
Proteomics samples were injected on to an Easy-nLC 1200 UPLC system employed a 45 cm × 100 μm (inner diameter) nano-capillary column packed with 1.9 μm C18-AQ resin (Dr. Maisch, Germany) mated to a metal emitter in-line with an Orbitrap Fusion Lumos (Thermo Scientific, USA). The column temperature was 45 °C, and a two-hour gradient method (SILAC) and a three-hour gradient method (TMT) were employed at a flow rate of 360 nL/min.
For running SILAC samples, the mass spectrometer was operated in data-dependent mode with a 120,000 resolution MS1 scan (positive mode, profile mode, AGC target of 4 × 105, maximum IT of 50 ms, scan range of 375–1500 m/z) in the Orbitrap, followed by HCD fragmentation in the ion trap with 35% collision energy. A dynamic exclusion list was invoked to exclude previously fragmented peptides for 60 s, and a maximum cycle time of 3 s was used. The isolation window for precursor ions was set to 1.6 m/z in the quadrupole. The ion trap was operated in Rapid mode with an AGC target of 1 × 104 and a maximum IT of 54 ms.
For running TMT labeled samples, the mass spectrometer was operated in data-dependent mode with synchronous precursor selection (SPS) – MS3 method with a 120,000 resolution MS1 scan (positive mode, profile mode, maximum IT of 5 × 103, scan range of 400–1500 m/z) in the Orbitrap followed by CID fragmentation in the ion trap with 30% collision energy for MS2 and HCD fragmentation in the Orbitrap (50,000 resolution) with 55% collision energy for MS3. The MS3 scan range was set to 110–500 with an injection time of 200 ms. A dynamic exclusion list was invoked to exclude previously sequenced peptides for 60 s, and a maximum cycle time of 2.5 s was used. The isolation window for precursor ions was set to 0.7 m/z in the quadrupole. The ion trap was operated in Turbo mode.
Proteomics data processing and analysis – AML1-ETO (SILAC qAP-MS, MaxQuant)
Data processing was performed using MaxQuant (v.2.3.1.0) software with default parameters unless otherwise noted. Replicates were analyzed in parallel with the match between runs selected. The UniProt human proteome reference database was used (proteome ID UP000005640), and Trypsin/P was selected as the enzyme. Methionine oxidation (+15.9949) and N-terminal acetylation (+42.0106) were specified as variable modifications and cysteine carbamidomethylation (+57.02146) as a static modification. For protein quantification, re-quantify was enabled, and the minimum label ratio count was set to one. Only proteins quantified in both experimental replicates were reported. Contaminants, including mitochondrial proteins, were excluded. Final outputs were filtered to achieve a < 1% false discovery rate (FDR).
The SILAC ratios of each protein from two biologically independent experiments were combined and converted to Log2 values, and P values were calculated by a univariate two-sided t-test and converted to −Log10 values. The waterfall plot was plotted with the median Log2 SILAC ratios of each protein. The proteins that passed the filter (rapalog-to-DMSO ratio > 2, P < 0.005) were considered significantly enriched proteins with the affinity pulldown of AML1-ETO splice product. Data are summarized in Supplementary Data 2.
Proteomic data processing – BCR-ABL (TMT tyrosine phosphoproteomics, MaxQuant)
Data processing was performed using MaxQuant (v.2.3.1.0) software with default parameters unless otherwise noted. ‘Reporter ion MS3’ was chosen as a type for quantification, data were searched against the UniProt human proteome reference database (proteome ID UP000005640), and Trypsin/P was selected as the enzyme with a maximum of 3 missed cleavages. For processing TMTpro (16-plex) labeled samples in MaxQuant, parameters were imported with the correction of reporter ion isotopic distribution values provided from the TMTpro reagent set (Lot# WK334325). Each TMT 12-plex experiment was loaded as three fractions and three technical repeat runs. Methionine oxidation (+15.9949) and phosphorylation (+79.9663) on serine, threonine, and tyrosine were specified as variable modifications (maximum modification in the peptide set to 3), and cysteine carbamidomethylation (+57.02146) as a static modification. The maximum peptide mass was set to 6,000 Da, and the minimum label ratio count was set to one and quantified using both unique and razor peptides. Final outputs were filtered to achieve a < 1% false discovery rate (FDR).
Proteomic data processing – DNAJ-PKAc (TMT phosphoproteomics, Proteome Discoverer)
The data were processed using Proteome Discoverer (v.2.5.0.400) software (Thermo Fisher) with customized workflows modified from a default version processing workflow ‘PWF_Tribird_TMTpro_Quan_SPS_MS3_SequestHT_Percolator’ and a consensus workflow ‘CWF_Comprehensive_Enhanced Annotation_Reporter_Quan.’ For searching data, the UniProt human (proteome ID UP000005640) or mouse (proteome ID UP000000589) proteome reference database was used with default parameters of the workflows unless otherwise noted below, and reporter ion isotropic distribution values were obtained from the TMTpro reagent set (Lot# XE341501). In the processing workflow, trypsin was selected as the enzyme with a maximum of 2 missed cleavages, and variable modifications of methionine oxidation (+15.995) and phosphorylation (+79.966) on serine, threonine, and tyrosine were specified. Cysteine carbamidomethylation (+57.021) and TMTpro reagents labeling on the N-terminus and lysine (+304.207) were set as static modifications. Site localization probability values were calculated using the ‘IMP-ptmRS’ node. In the consensus workflow, the site probability threshold was set to 70 in the ‘PSM Grouper’ node. The reporter co-isolation threshold was set to 90 in the ‘Reporter Ions Quantifier’ node, and motif information was obtained by adding the ‘Modification Sites’ node. Final outputs were filtered to achieve a <1% false discovery rate (FDR).
Quantitative phosphoproteomics data analysis
The reporter ion intensities of identified phosphotyrosine peptides (BCR-ABL) or phosphopeptides (DNAJ-PKAc) were added up for each timepoint fraction from all LC-MS/MS runs, followed by consolidation with the same phosphosites (Supplementary Fig. 2b). Relative phosphorylation levels were calculated using the maximum TMT channel intensity, set to 100, within each protein phosphosite. Only phosphorylation dynamics of the phosphosites were counted for subsequent analyses when satisfying; (i) non-zero TMT ion intensities shown in at least 2 timepoint fractions in each biological replicate and (ii) notable changes between the values at a specific timepoint having average maximum value and the values at 0 minutes (P < 0.1 for BCR-ABL and P < 0.06 for DNAJ-PKAc, calculated by a univariate two-sided t-test). Data are summarized in Supplementary Data 3 (BCR-ABL, HEK293T), Supplementary Data 4 (DNAJ-PKAc, HEK293T), and Supplementary Data 5 (DNAJ-PKAc, AML12).
For clustering the protein phosphosites based on their phosphorylation dynamics, relative phosphorylation levels calculated by the method above were re-normalized to fold changes by the average value at 0 hours, and these re-normalized values were then converted to log2 values. The maximum fold change allowed in the dataset is set to 20 before log2 conversion. Then, phosphorylation sites were grouped statically and visualized as a heatmap using the Python and Seaborn package (v.0.11.2). Dynamic phosphorylation curves of phosphosites were plotted using the Prism software by the fold changes. Gene ontology analysis was performed by uploading protein lists to the Metascape website (https://metascape.org) and performing enrichment analysis57. Enriched sequence motifs of phosphosites were analyzed by uploading peptide sequences to the pLogo generator (https://plogo.uconn.edu) and performing enrichment analysis58. pSer/Thr sites identified in DNAJ-PKAc datasets (Supplementary Data 6, 7) were further analyzed by the Kinase Library platform43 to predict the clustered phosphorylation-site motifs and the kinase family targets (https://kinase-library.phosphosite.org/site), and if more than 4 kinases scored higher than 99%, the top 4 ranked candidates from each substrate prediction result were considered. Principal component analysis (PCA) was performed using the R-package (BiocGenerics) with the log2 average fold-change values in each time point of the proteins in HEK293T (P < 0.06, n = 801) and AML12 (P < 0.06, n = 602) as inputs. Other graphs were generated in GraphPad Prism, with analysis algorithms indicated in the figure legends.
Immunoprecipitation of AML1-ETO splice product from mammalian cells
10 cm of HEK293T cells transfected with plasmids encoding ‘FLAG-AMLN-CAGEN’ and ‘CAGEC-ETOC’ were treated with rapalog (100 nM) for 18 hours. Cells were washed twice with cold PBS and lysed in 350 μL of 8 M urea, 25 mM NH4HCO3 pH 8.0 supplement with Roche cOmplete EDTA-free protease inhibitors. After sonication (Amplitude: 35%, Pulse: 1 second on / 1 second off, Time: 15 seconds) at 4 °C, lysates were centrifuged at 17,000 g for 10 minutes to remove insoluble material and then diluted ten times with 25 mM NH4HCO3 to 0.8 M Urea. 150 μL of 100% Anti-DYKDDDDK G1 affinity resin (GenScript) was prewashed with cold PBS (3 x) and then added to the 11 mL of lysates for FLAG-IP with end-over-end rotation at 4 °C overnight. After removing supernatants, the resins were washed sequentially with 8× 0.8 M Urea in PBS and 2x PBS. 250 μL of 2X Laemmli buffer supplement with 5 mM of DTT was added, and the beads were heated at 95 °C for 10 minutes, and the eluent collected. The splice product AML1-ETO was separated by 8% Bis-Tris SDS-PAGE gel, which was then stained with Coomassie stain until the band (~100 kDa) was just visible. The product bands were isolated in gel pieces and subjected to in-gel reduction, alkylation, and overnight LysC digestion by following the standard protocol59. Peptides eluted from in-gel tryptic digestions were dried in a SpeedVac and were dissolved in 5% acetonitrile/water (0.1% formic acid, pH 3) for proteomics analysis (below).
Direct MS1 and MS2 analysis of splice junction peptide in AML1-ETO splice product
A tryptic peptide of the splice junction in AML1-ETO splice product was initially validated by manual inspection of chromatograms and MS1 spectra with theoretical m/z values of peptide candidates; m/z = 519.517 (z = 4) and m/z = 692.353 (z = 3)
Validated MS1 ions were further analyzed by “MS2 run only” mode to identify the splice junction peptide with the desired b- and y- ions. The LC-MS/MS experiment was performed with an Easy-nLC 1200 UPLC system employing a 45 cm × 75 μm (inner diameter) nano-capillary column packed with 1.9 μm C18-AQ resin (Dr. Maisch, Germany) mated to a metal emitter in-line with an Orbitrap Fusion Lumos (Thermo Scientific, USA). The column temperature was 45 °C, and a one-hour gradient method was run at a flow rate of 300 nL/min. The mass spectrometer was operated to scan MS2 from MS1 of the inclusion list with a 60,000 resolution (positive mode, profile mode, AGC target of 4 × 105, maximum IT of 118 ms) in the Orbitrap, followed by HCD fragmentation in the ion trap with 30% collision energy. The isolation window for precursor ions was set to 0.4 m/z in the quadrupole.
Raw files were processed with Thermo XCalibur Qual Browser, and MS2 peaks were manually annotated.
Direct MS2 and MS3 analysis of phosphorylated Serine 401 SGK1 peptide
A TMTpro-labeled tryptic peptide of phosphorylated Serine 401 in SGK1 was initially identified by the phosphoproteomics dataset summarized in Supplementary Data 7, with theoretical m/z value of the peptide candidate; m/z = 635.691 (z = 3)
Validated MS1 ion was further analyzed by “Targeted Mass” mode with synchronous precursor selection (SPS) – MS3 method with a 120,000 resolution MS1 scan (positive mode, profile mode, maximum IT of 5 × 103) in the Orbitrap followed by CID fragmentation in the ion trap with 30% collision energy for MS2 and HCD fragmentation in the Orbitrap (50,000 resolution) with 55% collision energy for MS3. The MS3 scan range was set to 110–500 with an injection time of 200 ms. A dynamic exclusion list was invoked to exclude previously sequenced peptides for 60 s, and a maximum cycle time of 2.5 s was used. The isolation window for precursor ions was set to 0.7 m/z in the quadrupole. The ion trap was operated in Turbo mode.
Raw files were processed with Thermo XCalibur Qual Browser, and MS2 and MS3 peaks were manually annotated. Bar graphs were generated in GraphPad Prism, with analysis algorithms indicated in the figure legends.
Clonogenic assay of cells
Non-transformed and DNAJ-PKAc-expressing AML12 cells were seeded at 1,000 cells/well in 12-well plates. DMSO or GS-9007 (5 μM) was added to normal growth media the following day for inhibitor tests. Media with the drug was refreshed every 4 days. To test the impact of SGK1 overexpression, non-DNAJ-PKAc-expressing and DNAJ-PKAc-expressing AML12 cells, either non-transformed or overexpressing SGK1, were seeded at 500 cells per well in 12-well plates, and the media was refreshed every 4 days. After 10 days of treatment, equivalent to at least 6 potential cell divisions, cells were rinsed in PBS and fixed in 4% paraformaldehyde in PBS for 20 minutes. Next, cells were stained with 0.1% crystal violet in 10% methanol, washed three times with water, and dried at room temperature for image capture using HP OfficeJet 3830 scanner. Finally, colonies were quantified using ImageJ with masking (convert to a binary image first, then Image > Adjust > Threshold) and particle analysis (Analyze > Analyze particles) tools. Bar graphs were generated in GraphPad Prism, with analysis algorithms indicated in the figure legends.
Extended Data
Extended Data Fig. 1.

CAGE modules generate the splice products in live cells by non-toxic rapamycin analog treatment.
Extended Data Fig. 2.

Engineering the CAGE toolbox via charge-swap mutations of the NrdJ-1 cages.
Extended Data Fig. 3.

AML1-ETO splice product generated from co-expressed CAGE modules in cells.
Extended Data Fig. 4.

MLL-AF9 and NUP98-HOXA9 splice products generated from co-expressed CAGE modules in cells.
Extended Data Fig. 5.

Multi-omics validation of the scarless AML1-ETO splice product.
Extended Data Fig. 6.

Design of CAGEC-ABL1 with appended native autoregulatory domain for inducible tyrosine phosphorylation in cells upon BCR-ABL activation.
Extended Data Fig. 7.

Dynamic tyrosine phosphorylation events following BCR-ABL activation.
Extended Data Fig. 8.

DNAJB1-CAGEN and CAGEC-PRKACA for inducible kinase activation.
Extended Data Fig. 9.

MS2 and MS3 analysis of phosphorylated Serine 401 SGK1 peptide.
Extended Data Fig. 10.

Pharmacologically inhibiting or overexpressing SGK1 in DNAJ-PKAc-expressing AML12 cells.
Supplementary Material
Supplementary Information is available in the online version of this paper.
Acknowledgements
We thank members of the Muir lab for their many helpful comments and insights. We thank Saw Kyin and Henry H. Shwe at the Princeton University Proteomics and Mass Spectrometry Core Facility. We thank Christina J. DeCoste and Katherine Rittenbach at the Princeton University Flow Cytometry Resource Facility for cell sorting services funded by Rutgers Cancer Institute of New Jersey P30CA072720-5921. We thank Gary S. Laevsky and Sha Wang at the Princeton University Confocal Imaging Facility. We also thank Jennifer M. Miller at the Princeton University Genomics Core Facility. Research reported in this publication was supported by NIH grant R01GM086868 (T.W.M).
Footnotes
Extended Data is available for this paper.
Competing interests
Authors declare that they have no competing interests.
Data and materials availability
All relevant data are included in the manuscript and Supplementary Information. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD044530. FASTA files were obtained from the UniProt Human (Proteome ID: UP000005640) and Mouse (Proteome ID: UP000000589) Protein Databases (https://www.uniprot.org). Protein structures for ABL1 kinase (PDB: 2FO0), DNAJ-PKAc (PDB: 4WB7), mouse PRKACA (PDB: 4DFX), and PKA tetrameric complex (PDB: 3TNP) were obtained from the RCSB Protein Data Bank (https://www.rcsb.org). The cell lines generated are available upon request.
References
- 1.Allis CD & Jenuwein T The molecular hallmarks of epigenetic control. Nat Rev Genet 17, 487–500 (2016). [DOI] [PubMed] [Google Scholar]
- 2.Lemmon MA & Schlessinger J Cell signaling by receptor tyrosine kinases. Cell 141, 1117–1134 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Jaeger MG & Winter GE Fast-acting chemical tools to delineate causality in transcriptional control. Mol Cell 81, 1617–1630 (2021). [DOI] [PubMed] [Google Scholar]
- 4.Toettcher JE, Voigt CA, Weiner OD & Lim WA The promise of optogenetics in cell biology: interrogating molecular circuits in space and time. Nat Methods 8, 35–38 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Saez E, No D, West A & Evans RM Inducible gene expression in mammalian cells and transgenic mice. Curr Opin Biotechnol 8, 608–616 (1997). [DOI] [PubMed] [Google Scholar]
- 6.Sauer B & Henderson N Site-specific DNA recombination in mammalian cells by the Cre recombinase of bacteriophage P1. Proc Natl Acad Sci U S A 85, 5166–5170 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Arrowsmith CH, et al. The promise and peril of chemical probes. Nat Chem Biol 11, 536–541 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Makley LN & Gestwicki JE Expanding the number of ‘druggable’ targets: non-enzymes and protein-protein interactions. Chem Biol Drug Des 81, 22–32 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Wu N, Deiters A, Cropp TA, King D & Schultz PG A genetically encoded photocaged amino acid. J Am Chem Soc 126, 14306–14307 (2004). [DOI] [PubMed] [Google Scholar]
- 10.Arbely E, Torres-Kolbus J, Deiters A & Chin JW Photocontrol of tyrosine phosphorylation in mammalian cells via genetic encoding of photocaged tyrosine. J Am Chem Soc 134, 11912–11915 (2012). [DOI] [PubMed] [Google Scholar]
- 11.Wang J, et al. Time-resolved protein activation by proximal decaging in living systems. Nature 569, 509–513 (2019). [DOI] [PubMed] [Google Scholar]
- 12.Mootz HD & Muir TW Protein splicing triggered by a small molecule. J Am Chem Soc 124, 9044–9045 (2002). [DOI] [PubMed] [Google Scholar]
- 13.Schwartz EC, Saez L, Young MW & Muir TW Post-translational enzyme activation in an animal via optimized conditional protein splicing. Nat Chem Biol 3, 50–54 (2007). [DOI] [PubMed] [Google Scholar]
- 14.Tyszkiewicz AB & Muir TW Activation of protein splicing with light in yeast. Nat Methods 5, 303–305 (2008). [DOI] [PubMed] [Google Scholar]
- 15.Wong S, Mosabbir AA & Truong K An Engineered Split Intein for Photoactivated Protein Trans-Splicing. PLoS One 10, e0135965 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Yao Z, et al. Split Intein-Mediated Protein Ligation for detecting protein-protein interactions and their inhibition. Nat Commun 11, 2440 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Slomovic S & Collins JJ DNA sense-and-respond protein modules for mammalian cells. Nat Methods 12, 1085–1090 (2015). [DOI] [PubMed] [Google Scholar]
- 18.Gramespacher JA, Stevens AJ, Nguyen DP, Chin JW & Muir TW Intein Zymogens: Conditional Assembly and Splicing of Split Inteins via Targeted Proteolysis. J Am Chem Soc 139, 8074–8077 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Gramespacher JA, Burton AJ, Guerra LF & Muir TW Proximity Induced Splicing Utilizing Caged Split Inteins. J Am Chem Soc 141, 13708–13712 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Mulloy JC, et al. The AML1-ETO fusion protein promotes the expansion of human hematopoietic stem cells. Blood 99, 15–23 (2002). [DOI] [PubMed] [Google Scholar]
- 21.Rao RC & Dou Y Hijacked in cancer: the KMT2 (MLL) family of methyltransferases. Nat Rev Cancer 15, 334–346 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ahn JH, et al. Phase separation drives aberrant chromatin looping and cancer development. Nature 595, 591–595 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mitelman F, Johansson B & Mertens F The impact of translocations and gene fusions on cancer causation. Nat Rev Cancer 7, 233–245 (2007). [DOI] [PubMed] [Google Scholar]
- 24.Mulloy JC, Wunderlich M, Zheng Y & Wei J Transforming human blood stem and progenitor cells: a new way forward in leukemia modeling. Cell Cycle 7, 3314–3319 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Sun XJ, et al. A stable transcription factor complex nucleated by oligomeric AML1-ETO controls leukaemogenesis. Nature 500, 93–97 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ben-Ami O, et al. Addiction of t(8;21) and inv(16) acute myeloid leukemia to native RUNX1. Cell Rep 4, 1131–1143 (2013). [DOI] [PubMed] [Google Scholar]
- 27.Li Y, et al. Genome-wide studies identify a novel interplay between AML1 and AML1/ETO in t(8;21) acute myeloid leukemia. Blood 127, 233–242 (2016). [DOI] [PubMed] [Google Scholar]
- 28.Wang J, Hoshino T, Redner RL, Kajigaya S & Liu JM ETO, fusion partner in t(8;21) acute myeloid leukemia, represses transcription by interaction with the human N-CoR/mSin3/HDAC1 complex. Proc Natl Acad Sci U S A 95, 10860–10865 (1998). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Yang G, Thompson MA, Brandt SJ & Hiebert SW Histone deacetylase inhibitors induce the degradation of the t(8;21) fusion oncoprotein. Oncogene 26, 91–101 (2007). [DOI] [PubMed] [Google Scholar]
- 30.Wong S & Witte ON The BCR-ABL story: bench to bedside and back. Annu Rev Immunol 22, 247–306 (2004). [DOI] [PubMed] [Google Scholar]
- 31.Nowell PC Discovery of the Philadelphia chromosome: a personal perspective. J Clin Invest 117, 2033–2035 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Nagar B, et al. Structural basis for the autoinhibition of c-Abl tyrosine kinase. Cell 112, 859–871 (2003). [DOI] [PubMed] [Google Scholar]
- 33.Hantschel O, et al. A myristoyl/phosphotyrosine switch regulates c-Abl. Cell 112, 845–857 (2003). [DOI] [PubMed] [Google Scholar]
- 34.Honeyman JN, et al. Detection of a recurrent DNAJB1-PRKACA chimeric transcript in fibrolamellar hepatocellular carcinoma. Science 343, 1010–1014 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Graham RP, et al. DNAJB1-PRKACA is specific for fibrolamellar carcinoma. Mod Pathol 28, 822–829 (2015). [DOI] [PubMed] [Google Scholar]
- 36.Bastidas AC, et al. Role of N-terminal myristylation in the structure and regulation of cAMP-dependent protein kinase. J Mol Biol 422, 215–229 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Turnham RE, et al. An acquired scaffolding function of the DNAJ-PKAc fusion contributes to oncogenic signaling in fibrolamellar carcinoma. Elife 8(2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Dinh TA, et al. Hotspots of Aberrant Enhancer Activity in Fibrolamellar Carcinoma Reveal Candidate Oncogenic Pathways and Therapeutic Vulnerabilities. Cell Rep 31, 107509 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Zhang JZ, et al. Phase Separation of a PKA Regulatory Subunit Controls cAMP Compartmentation and Oncogenic Signaling. Cell 182, 1531–1544 e1515 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Kastenhuber ER, et al. DNAJB1-PRKACA fusion kinase interacts with beta-catenin and the liver regenerative response to drive fibrolamellar hepatocellular carcinoma. Proc Natl Acad Sci U S A 114, 13076–13084 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Cheung J, et al. Structural insights into mis-regulation of protein kinase A in human tumors. Proc Natl Acad Sci U S A 112, 1374–1379 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Zhang P, et al. Structure and allostery of the PKA RIIbeta tetrameric holoenzyme. Science 335, 712–716 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Johnson JL, et al. An atlas of substrate specificities for the human serine/threonine kinome. Nature 613, 759–766 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Amato R, et al. IL-2 signals through Sgk1 and inhibits proliferation and apoptosis in kidney cancer cells. J Mol Med (Berl) 85, 707–721 (2007). [DOI] [PubMed] [Google Scholar]
- 45.Meng F, Yamagiwa Y, Taffetani S, Han J & Patel T IL-6 activates serum and glucocorticoid kinase via p38alpha mitogen-activated protein kinase pathway. Am J Physiol Cell Physiol 289, C971–981 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Won M, et al. Protein kinase SGK1 enhances MEK/ERK complex formation through the phosphorylation of ERK2: implication for the positive regulatory role of SGK1 on the ERK function during liver regeneration. J Hepatol 51, 67–76 (2009). [DOI] [PubMed] [Google Scholar]
- 47.Chen W, et al. Regulation of a third conserved phosphorylation site in SGK1. J Biol Chem 284, 3453–3460 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Halland N, et al. Discovery of N-[4-(1H-Pyrazolo[3,4-b]pyrazin-6-yl)-phenyl]-sulfonamides as Highly Active and Selective SGK1 Inhibitors. ACS Med Chem Lett 6, 73–78 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Castel P, et al. PDK1-SGK1 Signaling Sustains AKT-Independent mTORC1 Activation and Confers Resistance to PI3Kalpha Inhibition. Cancer Cell 30, 229–242 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Spencer DM, Wandless TJ, Schreiber SL & Crabtree GR Controlling signal transduction with synthetic ligands. Science 262, 1019–1024 (1993). [DOI] [PubMed] [Google Scholar]
- 51.Levskaya A, Weiner OD, Lim WA & Voigt CA Spatiotemporal control of cell signalling using a light-switchable protein interaction. Nature 461, 997–1001 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Taverna SD, Li H, Ruthenburg AJ, Allis CD & Patel DJ How chromatin-binding modules interpret histone modifications: lessons from professional pocket pickers. Nat Struct Mol Biol 14, 1025–1040 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Qi LS, et al. Repurposing CRISPR as an RNA-guided platform for sequence-specific control of gene expression. Cell 152, 1173–1183 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Thinon E, et al. Global profiling of co- and post-translationally N-myristoylated proteomes in human cells. Nat Commun 5(2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Mann M Functional and quantitative proteomics using SILAC. Nat Rev Mol Cell Biol 7, 952–958 (2006). [DOI] [PubMed] [Google Scholar]
- 56.Zoumaro-Djayoon AD, Heck AJR & Munoz J Targeted analysis of tyrosine phosphorylation by immuno-affinity enrichment of tyrosine phosphorylated peptides prior to mass spectrometric analysis. Methods 56, 268–274 (2012). [DOI] [PubMed] [Google Scholar]
- 57.Zhou Y, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun 10, 1523 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.O’Shea JP, et al. pLogo: a probabilistic approach to visualizing sequence motifs. Nat Methods 10, 1211–1212 (2013). [DOI] [PubMed] [Google Scholar]
- 59.Shevchenko A, Tomas H, Havlis J, Olsen JV & Mann M In-gel digestion for mass spectrometric characterization of proteins and proteomes. Nat Protoc 1, 2856–2860 (2006). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are included in the manuscript and Supplementary Information. The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the dataset identifier PXD044530. FASTA files were obtained from the UniProt Human (Proteome ID: UP000005640) and Mouse (Proteome ID: UP000000589) Protein Databases (https://www.uniprot.org). Protein structures for ABL1 kinase (PDB: 2FO0), DNAJ-PKAc (PDB: 4WB7), mouse PRKACA (PDB: 4DFX), and PKA tetrameric complex (PDB: 3TNP) were obtained from the RCSB Protein Data Bank (https://www.rcsb.org). The cell lines generated are available upon request.
