Abstract
We report the synthesis and genetic encoding of a recently discovered posttranslational modification, 2-hydroxyisobutryl-lysine, to the genetic code of E. coli. The production of homogeneous proteins containing this amino acid will facilitate the study of modification in full-length proteins.
The number and complexity of identified lysine post-translational modifications (PTMs) continues to grow. Historically the most commonly observed protein modifications are located at lysine side chains, such as methylation or acetylation. In these examples the charge state of a lysine side chain is altered, which in turn can give rise to changes in interactions with DNA and other biomolecules.1, 2 In the case ofepigenetic control of gene expression is vital, and malfunction of these systems can be a hallmark of disesase.3
In addition to acetylation and methylation, it has been reported that lysine residues can be modified by malonylation,4 propionylation and butyrylation,5 succinylation,6 and crotonylation.7 These modifications are derived from intracellular acyl-CoA metabolites and provide wide spectrum of epigenetic control of gene expression. The extent to which these modifications are actively added and removed by enzymes is a current interest in deciphering the “histone code”. Recently proteomics profiling revealed a new lysine modification that was identified as 2-hydroxyisobutyryl lysine (Khib)(1, Scheme 1).8 This modification appears to be conserved throughout evolution, appearing in human, mouse, Drosophila, and yeast cells.8 Intriguingly, unlike other histone marks which occur mainly at the N-terminus of histone proteins, Khib is seen distributed throughout the globular protein structure, and includes many lysine residues that have not been previously reported to be modified. Moreover, in comparison to other lysine PTMs, Khib is quite large and has the unique potential to serve as a hydrogen bond donor, raising the possibility of significant structural changes and alternative intermolecular interactions mediated by this modification. All of these point to an important role in chromatin function.
Scheme 1.
Synthesis of Khib.
Critical to the study of PTMs and is the ability to produce pure protein samples containing homogeneous compositions of a modified residue. Lysine modifications in particular may offer the ability to generate designer chromatin9 to study the effects of PTMs either individually or in concert. Modified amino acids can be added to proteins using combinations of peptide synthesis and expressed protein ligation.10 This approach has the ability to precisely add PTMs, but it is technically challenging and cannot be scaled easily. As an alternative, some amino acids containing PTMs have also been added to the genetic codes of both prokaryotes and eukaroyotes, allowing biosynthetic production of target proteins containing site-specific modifications. Indeed most lysine modifications have been incorporated using the pyrrolysine translational machinery.11-13 These in vivo production methods are very adaptable to biochemical laboratories. Moreover, biosynthetic production of proteins containing PTMs opens the door to more sophisticated experiments such as phage display,14 incorporation of isotopic labels,15, 16 and a wide variety of in vivo experiments. Towards these goals we describe the synthesis and addition of 2-hydroxyisobutyryl-lysine to the genetic code of E. coli.
The synthesis of Khib was performed from lysine by first preparing the copper complex, which allows for selective acylation of the ε-nitrogen (Scheme 1). This complex was then treated with 2-hydroxyisobutryl-O-succinimide ester and the copper removed by chelating agent to generate the final product. This synthesis is simple and can provide gram-scale quantities of amino acid for protein expression studies.
We decided to test whether Khib is a substrate for several variants of the pyrrolysyl-tRNA synthetase (PylRS) derived from either Methanosarcina barkeri (Mb) or Methanosarcina mazei (Mm) (see ESI). These included the wild-type enzymes and variants that have been shown to have relaxed substrate specificity towards other, larger unnatural amino acids. The screen utilized an expression plasmid for superfolder green fluorescent protein (sfGFP) containing an amber stop codon, TAG, in place of the codon for Y151. The plasmid also contains the gene encoding the Mm-pyrrolysyl tRNA (pylT). Incorporation of unnatural amino acid leads to production of full-length protein and a corresponding increase in cellular fluorescence. Using a plate-based screening assay we first examined fluorescence in the presence and absence of Khib for any observable differences. As a positive control, we also used Nε-(tertbutyloxycarbonyl)-L-lysine (BocLys, (2), Scheme 1), which is a known substrate for PylRS. Among the five variants we screened the most observable fluorescence difference was obtained using wild-type Mm PylRS. While the observable fluorescence was weak in comparison to BocLys, it did show clear differences when compared to controls (see ESI, Figure S2). Variants with larger active sites did not appear to accept Khib as substrate.
We chose to perform medium-scale expression of sfGFP in the presence and absence of 5mM Khib and purified the resulting His-tagged proteins proteins using Ni2+ affinity chromatography. As shown in Figure 1, we observed robust protein expression (~10mg/L) only the presence of Khib, indicating that this amino acid can serve as a substrate, without further evolution of PylRS. No protein is seen in the absence of Khib, verifying that endogenous amino acids are not substrates for Mm PylRS. In order to verify the position and identity of the mutation, the gel slice of the produced protein was excised and subjected to in-gel tryptic digest.17 Upon examining the tryptic fragments by LC/MS/MS, the spectrum of the expected fragment was trapped in a +2 charge state (Figure 2). Fragment masses of this ion are consistent with site-specific incorporation of Khib at the correct position, in place of Y151. No masses that correspond to the same fragment containing other natural amino acids at position 151 were seen. In addition to tryptic peptide analysis, the protein samples were analyzed by ESI-MS on intact protein which also confirms incorporation of the amino acid (see ESI, Figure S3). Interestingly, we did not observe masses corresponding to a lysine residue at position 151 (or a resulting tryptic fragment), which would be indicative of active deacylation of Khib in E. coli. Removal of other lysine PTMs has been previously observed and ascribed to bacterial sirtuins1312, and can be prevented by the use of a nicotinamde enzyme inhibitor. It is possible that Khib residues are not a substrate for these enzymes at all or when in the context of this mutation position in sfGFP.
Figure 1.

Production of sfGFP containing 1 at position 151. No protein is produced in the absence of added unnatural amino acid.
Figure 2.

MS/MS spectrum of tryptic fragment of sfGFP bearing Khib at position 151.
Conclusions
In conclusion, we have demonstrated that the pyrrolysyl-tRNA synthetase (PylRS) has a suitably relaxed substrate specificity to accommodate 2-hydroxyisobutyryl-lysine (Khib). This, coupled with a quick and high-yielding synthesis of this important amino acid makes this technology accessible to anyone working in the field of protein chemistry. We anticipate the ability to produce homogeneous samples of proteins containing Khib will facilitate the study of the histone code and chromatin function.
Supplementary Material
Acknowledgments
This work was funded in part by NIH/NIGMS (GM084396).
Footnotes
Electronic Supplementary Information (ESI) available: Full procedures for chemical synthesis and protein expression and purification are described the ESI.
Notes and references
- 1.Chi P, Allis CD, Wang GG. Nat. Rev. Cancer. 2010;10:457–469. doi: 10.1038/nrc2876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Walsh CT, Garneau-Tsodikova S, Gatto GJ., Jr Angew. Chem. Int. Ed Engl. 2005;44:7342–7372. doi: 10.1002/anie.200501023. [DOI] [PubMed] [Google Scholar]
- 3.Timmermann S, Lehrmann H, Polesskaya A, Harel-Bellan A. Cell Mol. Life Sci. 2001;58:728–736. doi: 10.1007/PL00000896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Xie Z, Dai J, Dai L, Tan M, Cheng Z, Wu Y, Boeke JD, Zhao Y. Mol. Cell. Proteomics. 2012;11:100–107. doi: 10.1074/mcp.M111.015875. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Chen Y, Sprung R, Tang Y, Ball H, Sangras B, Kim SC, Falck JR, Peng J, Gu W, Zhao Y. Mol. Cell. Proteomics. 2007;6:812–819. doi: 10.1074/mcp.M700021-MCP200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Park J, Chen Y, Tishkoff DX, Peng C, Tan M, Dai L, Xie Z, Zhang Y, Zwaans BM, Skinner ME, Lombard DB, Zhao Y. Mol. Cell. 2013;50:919–930. doi: 10.1016/j.molcel.2013.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tan M, Luo H, Lee S, Jin F, Yang JS, Montellier E, Buchou T, Cheng Z, Rousseaux S, Rajagopal N, Lu Z, Ye Z, Zhu Q, Wysocka J, Ye Y, Khochbin S, Ren B, Zhao Y. Cell. 2011;146:1016–1028. doi: 10.1016/j.cell.2011.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Dai L, Peng C, Montellier E, Lu Z, Chen Y, Ishii H, Debernardi A, Buchou T, Rousseaux S, Jin F, Sabari BR, Deng Z, Allis CD, Ren B, Khochbin S, Zhao Y. Nat. Chem. Biol. 2014;10:365–370. doi: 10.1038/nchembio.1497. [DOI] [PubMed] [Google Scholar]
- 9.Fierz B, Muir TW. Nat. Chem. Biol. 2012;8:417–427. doi: 10.1038/nchembio.938. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Muir TW, Sondhi D, Cole PA. Proc. Natl. Acad. Sci. U. S. A. 1998;95:6705–6710. doi: 10.1073/pnas.95.12.6705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Wilkins BJ, Hahn LE, Heitmuller S, Frauendorf H, Valerius O, Braus GH, Neumann H. ACS Chem. Biol. ASAP; 2015. [DOI] [PubMed] [Google Scholar]
- 12.Neumann H, Peak-Chew SY, Chin JW. Nat. Chem. Biol. 2008;4:232–234. doi: 10.1038/nchembio.73. [DOI] [PubMed] [Google Scholar]
- 13.Gattner MJ, Vrabel M, Carell T. Chem. Commun. (Camb) 2013;49:379–381. doi: 10.1039/c2cc37836a. [DOI] [PubMed] [Google Scholar]
- 14.Tian F, Tsao ML, Schultz PG. J. Am. Chem. Soc. 2004;126:15962–15963. doi: 10.1021/ja045673m. [DOI] [PubMed] [Google Scholar]
- 15.Castaneda C, Liu J, Chaturvedi A, Nowicka U, Cropp TA, Fushman D. J Am Chem Soc. 2011;133:17855–68. doi: 10.1021/ja207220g. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Castaneda CA, Liu J, Kashyap TR, Singh RK, Fushman D, Cropp TA. Chem Commun (Camb) 2011;47:2026–8. doi: 10.1039/c0cc04868b. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Shevchenko A, Tomas H, Havlis J, Olsen JV, Mann M. Nat Protoc. 2006;1:2856–60. doi: 10.1038/nprot.2006.468. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.

