Abstract
Using an evolved pyrrolysyl-tRNA synthetase-tRNAPyl pair, a Se-alkylselenocysteine was genetically incorporated into histone H3 with a high protein expression yield. Quantitative oxidative elimination of Se-alkylselenocysteine followed by Michael addition reactions with various thiol nucleophiles generated biologically active mimics of H3 with posttranslational modifications including lysine methylation, lysine acetylation, and serine phosphorylation.
Epigenetic changes are crucial for the development and differentiation of the various cell types in an organism and typically involve postreplicational modifications to DNA(1) and posttranslational modifications to proteins that are closely associated with DNA.(2) Among posttranslational modifications of proteins, those on histones are probably the most diversified.(3) Modifications of histones are central to the regulation of chromatin dynamics and regulate biological processes such as replication, repair, transcription, and genome stability. The importance and complexity of histone modifications has led to the coin of the term “the histone code”. In order to decipher the histone code, recently several methods have been introduced to synthesize histones with posttranslational modifications. Native chemical ligation and its derivative, expressed protein ligation, have been combined together with solid phase peptide synthesis to synthesize acetylated and ubiquitinated histones.(4, 5) Although elegant, the synthesis generally takes lengthy steps and also yields only small quantity of histones. Another method to synthesize histones with posttranslational modifications is to site-specifically incorporate modified amino acids directly into histones at amber mutation sites during protein translation using evolved pyrroly-syl-tRNA synthetase (PylRS)-tRNAPyl pairs.(6–8) High purity histones can be obtained in large quantities this way, but each time a different PylRS-tRNAPyl pair needs to be evolved for a specific modification. In a more diversified approach, dehydroalanine (Dha) precursors can be genetically installed at designated sites followed by reactions with thiol-containing molecules to install histones with different posttranslational mimics.(9, 10) Using the genetic incorporation of phenylselenocysteine (PheSec) followed by oxidative elimination to generate Dha that then undertook Michael additions with a series of cysteamine derivatives, Schultz and coworkers showed that histones with acetylated and methylated lysine mimics could be synthesized.(9) We have been primarily focusing on the Dha-based approach for its easy access to histones with multiple types of modifications, among which N,N-dimethyllysine and N,N,N-trimethyllysine have not been directly installed into histones through the genetic NAA incorporation approach. Along this line, Schultz’s method of oxidatively eliminating PheSec in a histone protein to install Dha is straightforward. However the genetic incorporation of PheSec has relatively low efficiency. For wild type histone H3, its expression in E. coli can reach to 300 mg/L in LB medium. But the expression of H3 incorporated with PheSec could only reach to 15 mg/L in LB medium under the same conditions. The low solubility of PheSec (up to 1 mM)(9) at the physiological pH also excludes the possibility of improving its incorporation level by increasing its concentration. For these reasons, we have been searching alternative methods to incorporate Dha into histones for the site-specific installation of posttranslational mimics on them.
We previously showed that an evolved M. mazei PylRS, mKRS1 together with tRNAPyl allows the specific incorporation of Nε-Cbz-lysine(11) (1 in Figure 1) at an amber mutation site of a protein in E. coli.(7) This mutant enzyme also shows high substrate promiscuity and recognizes 2 and 3 shown in Figure 1. When growing BL21 cells transformed with two plasmids pEVOL-mKRS1-pylT and pET-H3K9TAG that carry genes coding tRNAPyl, mKRS1 with an additional Y384F mutation and Xenopus laevis H3 with an amber mutation at its K9 position in LB medium supplemented with 1, 2, or 3, overexpression of H3 was observed. The expression levels for all three conditions are above 100 mg/L. Given that the expression level of wild type H3 at the same condition was 300 mg/L, these expression levels represented more than 40% of that of wild type H3 and are significantly better than the incorporation level of PheSec. When neither 2 nor 3 was not provided in the medium, only a trace amount of H3 could be observed. As shown in Table 1, the purified proteins all showed the expected molecular weight (MW) when analyzed by electrospray ionization mass spectrometry (ESI-MS). To independently confirm the incorporation of 3, 3 was also genetically incorporated at the Q204 position of GFPUV. The purified protein was digested by Asp-N protease and the target fragment N-DNHYLSTXSALSK-C (X denotes 3) was analysed and confirmed by tandem mass spectrometry (SI Figure 2). Importantly, the isotope distribution pattern observed for the 3-containing peptide fragments matches the theoretical selenium isotope distributions (SI Figure 3), further confirming the incorporation of 3.
Figure 1.
The site-specific incorporation of 1, 2, and 3 at the K9 position of H3. Proteins were expressed in BL21 cells transformed with pEVOL-mKRS1-pylT and pET-H3K9TAG in LB medium supplemented with 2 mM 1, 2 mM 2, or 1.5 mM 3.
Table 1.
Theoretical and detected molecular weights of various H3 proteins.[h]
| Proteins[a] | Theoretic MWavg (Da) | Detected MWavg (Da) |
|---|---|---|
| H3K9-2 | 16423[b], 16465[c] | 16423[b], 16466[c] |
| H3K9-3 | 16470[b], 16512[c] | 16470[b], 16513[c] |
| H3K9Dha | 16228[d], 16244[e] | 16229[d], 16244[e] |
| 16270[f], 16286[g] | 16287[g] | |
| H3K9AcsK | 16347[d], 16363[e] | 16363[e] |
| 16389[f], 16405[g] | ||
| H3K9m1sK | 16319[d], 16335[e] | 16318[d], 16334[e] |
| 16361[f], 16377[g] | 16361[f], 16377[g] | |
| H3K9m2sK | 16333[d], 16349[e] | 16332[d], 16348[e] |
| 16375[f], 16391[g] | 16391[g] | |
| H3K9m3sK | 16348[d], 16364[e] | 16348[d], 16364[e] |
| 16390[f], 16406[g] | ||
| H3K9pC | 16342[d], 16358[e] | 16341[d], 16357[e] |
| 16384[f], 16400[g] |
Please see main text for the definitions of these abbreviations.
A full-length protein without M1.
A full-length protein without M1 but containing the N-terminal acetylation.
A full-length protein without M1 but containing one oxidized methionine residue.
A full-length protein without M1 but containing two oxidized methionine residues.
A full-length protein without M1 but containing one oxidized methionine residues and the N-terminal acetylation.
A full-length protein without M1 but containing two oxidized methionine residues and the N-terminal acetylation.
ESI-MS and FT-ICR-MS spectra of these proteins are shown as SI Figures 4–10 and 15–18.
It was previously demonstrated by Davis and coworkers that an S-alkylcysteine in a protein is susceptible to oxidative elimination to generate Dha in the presence of O-mesitylenesulfonylhydroxylamine (MSH).(10) To convert 2 in H3K9-2 to Dha under forcing conditions, H3K9-2 in 8 M urea, pH 8.0 was reacted with 1000 eq. of MSH for 0.5 h at room temperature, but no conversion was observed from the ESI-MS analysis of the final reaction product. When the reaction time was prolonged, unexpected products were observed, indicating that MSH reacted with other amino acids in the protein. Since H3K9-2 is fully denatured, the structure rigidity is not a factor that may influence the conversion.(12) At this stage, we suspect the somewhat bulky long side chain in 2 may inhibit attack from MSH.
To convert 3 in H3K9-3 to Dha, we tried two oxidative reagents, sodium periodate and H2O2. Although it was reported that sodium periodate can effectively convert PheSec in peptides to Dha, it apparently did not work on H3K9-3.(13) Only adducts with single and multiple oxygen atom(s) added to the protein were detected. On the other hand, oxidative elimination of H3K9-3 using H2O2 proceeded smoothly. In the presence of 100 eq. of H2O2, 3 in H3K9-3 was efficiently converted to Dha in 1 h at room temperature. ESI-MS analysis of the final product also indicated the oxidation of the two methionine residues in H3K9-3 to methionine sulfoxide (SI Figure 9). The detected molecular mass (16244 Da and 16287 Da) agreed well with the calculated mass of H3 with Dha incorporated at its K9 position (H3K9Dha) that also has two oxidized methionine residues (16244 Da and 16286 Da). There is also one peak at 12229 Da that matched the mass of H3K9Dha with only one oxidized methionine residue (calculated mass: 16228 Da). Similar methionine oxidations were observed with the H2O2 oxidation of PheSec-containing H3 and could be circumvented by replacing methionine residues with leucine residues.(9)
With an effective strategy to install the Dha precursor in H3 at hand, we then explored conditions to synthesize histone mimics with various posttranslational modifications (Scheme 1). To generate the acetylated histone mimic, H3K9Dha was incubated together with 1500 eq. of N-acetylcysteamine at 25 °C for 1 h. ESI-MS analysis of the final product (H3K9AcsK) indicated the quantitative conversion of Dha to N-acetylthialysine (AcsK) (Table 1). Top down tandem MS analysis of the final product also confirmed the site-specific installation of AcsK at the K9 position (SI Figures 11–12).(14, 15) This top down tandem MS analysis also confirmed an additional oxidation at M120 (SI Figures 13–14). We performed similar reactions to convert Dha in H3K9Dha to N-methylthialysine (m1sK), N,N-dimethylthialysine (m2sK), and N,N,N-trimethylthialysine (m3sK) by reacting H3K9Dha with the corresponding cysteamine derivatives to afford H3K9m1sK, H3K9m2sK, and H3K9m3sK. The final products were analysed by Fourier transform ion cyclotron resonance (FT-ICR) MS. As shown in Table 1, the detected mass agreed well with the calculated mass. Finally a phosphocysteine (pC)-containing histone analog was made by reacting H3K9Dha with 1000 eq. of thiophosphate for 1 h at room temperature. FT-ICR-MS analysis of the final product H3K9pC indicated a quantitative conversion. This chemically installed pC was stable at the physiological pH. Storing the final product in a 4 °C fridge for a week did not cause any significant degradation.
Scheme 1.
Oxidative elimination reactions to generate Dha that undergoes Michael addition reactions to form different posttranslational modification mimics.
To demonstrate that m1sK, m2sK, and m3sK can closely mimic their naturally counterparts, Ne-methyllysine, Ne,Ne-dimethyllysine, and Ne,Ne,Ne-trimethyllysine, H3K9m1sK, H3K9m2sK, and H3K9m3sK were immunoprecipitated by heterochromatin protein 1, HP-1β that specifically binds to H3 with mono-, di-, or trimethylated lysine at K9 but not wild type H3. The immunoprecipitates were then probed by anti-H3 antibody that specifically recognizes the C-terminal domain of H3. All three proteins were pulled out by HP-1β and detected by anti-H3 antibody (Figure 2). As a control, wild type H3 was not immunoprecipitated by HP-1β at all. To demonstrate N-acetylthialysine closely mimics N-acetyllysine, H3K9AcsK was probed by anti-H3K9Ac antibody that was raised against H3 with N-acetyllysine at K9 in a Western blot analysis. The intense detected band for H3K9AcsK proved the specific interaction between H3K9AcsK and anti-H3K9Ac antibody (Figure 2C). A similar assay for wild type H3 did not give any detectable signal. All these experiments clearly demonstrated that our histone mimics with various degrees of methylation and acetylation are biologically active analogs of their natural counterparts and thus are practically useful.
Figure 2.
(A) Immunoprecipitation assay of wild type H3, H3K9m1sK, and H3K9m2sK using HP-1β. Each protein was incubated with HP-1β and anti-HP-1β antibody and pulled down by immobilized protein A resins that bind Fc region of anti-HP-1β. PD represents pull down. (B) Immunoprecipitation assay of wild type H3 and H3K9m3sK using HP-1β. (C) Western blot analysis of wild type H3 and H3K9AcsK probed by anti-H3K9Ac antibody.
In conclusion, we have developed a general method to synthesize biologically active histone mimics with posttranslational modifications including acetylation, methylation and phosphorylation in large quantities and high purity. Compound 3 could be readily synthesized in multi-gram scale and genetically incorporated into representative histone H3 with protein expression yield up to 126 mg/L. The high expression yield reaches more than 40% of that of the natural histones and is among the highest yields that have been achieved so far with mutated histones.(9) Oxidative conversion of 3 to dehydroalanine and subsequent Michael addition reactions with readily available thiol nucleophiles could be conveniently done under mild non-denaturing conditions in near quantitative yield for each step to install many desired posttranslational modifications in histones. The nature of the amber mutation incorporation approach makes it possible to specifically install post-translational modifications at any desired position of a target histone of interest. We anticipate the application of our method in the histone biology research area will facilitate addressing fundamental questions such as how posttranslational modifications affect the chromatin structure, how these modifications regulate the interactions between chromatin and other non-chromatin regulatory proteins, and how modifications on chromatin cross regulate each others. One limitation with our approach is the lack of stereo-control on the newly formed unnatural amino acid mimic from Michael addition of Dha by thiol nucleophiles. We however still deem the strategy to have considerable utility given its methodological simplicity and versatility. In addition, reports have also proven that similarly modified proteins from Dha are indeed biologically active.(9, 10)
Supplementary Material
Acknowledgments
Funding
This work was supported in part by the National Institute of Health (grant 1R01CA161158 to W.R.L.), the Welch Foundation (grant A-1715 to W.R.L.), and National Science Foundation (grant DBI0821700 to D.H.R.).
Footnotes
ASSOCIATED CONTENT
Supporting Information. Synthesis of 2 and 3, Plasmid constructions, protein expression, ESI-MS and FT-ICR-MS analysis of proteins, and supplementary figures. This material is available free of charge via the Internet at http://pubs.acs.org.
References
- 1.Jones PA, Takai D. Science. 2001;293:1068–1070. doi: 10.1126/science.1063852. [DOI] [PubMed] [Google Scholar]
- 2.Turner BM. Cell. 2002;111:285–291. doi: 10.1016/s0092-8674(02)01080-2. [DOI] [PubMed] [Google Scholar]
- 3.Kouzarides T. Cell. 2007;128:693–705. doi: 10.1016/j.cell.2007.02.005. [DOI] [PubMed] [Google Scholar]
- 4.Shogren-Knaak M, Ishii H, Sun JM, Pazin MJ, Davie JR, Peterson CL. Science. 2006;311:844–847. doi: 10.1126/science.1124000. [DOI] [PubMed] [Google Scholar]
- 5.McGinty RK, Kim J, Chatterjee C, Roeder RG, Muir TW. Nature. 2008;453:812–816. doi: 10.1038/nature06906. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Neumann H, Peak-Chew SY, Chin JW. Nat Chem Biol. 2008;4:232–234. doi: 10.1038/nchembio.73. [DOI] [PubMed] [Google Scholar]
- 7.Wang YS, Wu B, Wang Z, Huang Y, Wan W, Russell WK, Pai PJ, Moe YN, Russell DH, Liu WR. Mol Biosyst. 2010;6:1557–1560. doi: 10.1039/c002155e. [DOI] [PubMed] [Google Scholar]
- 8.Groff D, Chen PR, Peters FB, Schultz PG. Chembiochem. 2010;11:1066–1068. doi: 10.1002/cbic.200900690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Guo J, Wang J, Lee JS, Schultz PG. Angew Chem Int Ed. 2008;47:6399–6401. doi: 10.1002/anie.200802336. [DOI] [PubMed] [Google Scholar]
- 10.Bernardes GJ, Chalker JM, Errey JC, Davis BG. J Am Chem Soc. 2008;130:5052–5053. doi: 10.1021/ja800800p. [DOI] [PubMed] [Google Scholar]
- 11.Yanagisawa T, Ishii R, Fukunaga R, Kobayashi T, Sakamoto K, Yokoyama S. Chem Biol. 2008;15:1187–1197. doi: 10.1016/j.chembiol.2008.10.004. [DOI] [PubMed] [Google Scholar]
- 12.Chalker JM, Gunnoo SB, Boutureira O, Gerstberger SC, Fernandez-Gonzalez M, Bernardes GJL, Griffin L, Hailu H, Schofield CJ, Davis BG. Chem Sci. 2011;2:1666–1676. [Google Scholar]
- 13.Zhu Y, van der Donk WA. Org Lett. 2001;3:1189–1192. doi: 10.1021/ol015648a. [DOI] [PubMed] [Google Scholar]
- 14.Reid GE, McLuckey SA. J Mass Spectrom. 2002;37:663–675. doi: 10.1002/jms.346. [DOI] [PubMed] [Google Scholar]
- 15.Zinnel NF, Pai PJ, Russell DH. Anal Chem. 2012;84:3390–3397. doi: 10.1021/ac300193s. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.



