Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Jan 15.
Published in final edited form as: Bioorg Med Chem Lett. 2019 Dec 4;30(2):126876. doi: 10.1016/j.bmcl.2019.126876

Enhancing the Incorporation of Lysine Derivatives into Proteins with Methylester Forms of Unnatural Amino Acids

Han Zhou 1, Jenny W Cheung 2, Tomaya Carpenter 1, Stacey Jones Jr 1, Nhu H Luong 1, Nhi C Tran 1, Savannah E Jacobs 1, Sahan A Galbada Liyanage 2, T Ashton Cropp 2,3,*, Jun Yin 1,*
PMCID: PMC7644286  NIHMSID: NIHMS1546657  PMID: 31836447

Abstract

We have improved the incorporation of L- and D-forms of unnatural amino acid (UAA) Nε-thiaprolyl-L-lysine (ThzK) into ubiquitin (UB) and green fluorescent protein (GFP) by 2–6 folds with the use of the methylester forms of the UAAs in E coli cell culture. We also improved the yields of UAA-incorporated UB and GFP with the methylester forms of Nε-Boc-L-Lysine (BocK) and Nε-propargyl-L-Lysine (PrK) by 2–5 folds compared to their free acid forms. Our work demonstrated that using methylester-capped UAAs for protein expression is a useful strategy to enhance the yields of UAA-incorporated proteins.

Keywords: Unnatural amino acid, methyl ester, ubiquitin, aminoacyl-tRNA synthetase, protein expression

Graphical abstract

graphic file with name nihms-1546657-f0004.jpg


Unnatural amino acid (UAA) incorporation into proteins via genetic code expansion has greatly expanded the structural diversity and chemical reactivity of the functionalities on a protein scaffold. 15 UAA-incorporated proteins have become powerful tools for biological studies - they can generate precise posttranslational modification patterns6, or act as chemical probes to enable photocrosslinking7, biorthogonal labeling8, photocaging9 or enzymatic profiling capacities10 on the proteins. Currently, more than 150 UAAs have been incorporated into proteins with engineered tRNA-tRNA synthetase (RS) pairs5, 1113. Enhancing the yield of UAA-incorporated proteins would save efforts on UAA synthesis for protein expression and expand the use of UAA incorporation as a powerful tool of chemical biology. Here we took the incorporation of Nε-L- thiaprolyl-L-lysine (L-ThzK, 1a), and Nε-D-thiaprolyl-L-lysine (D-ThzK, 2a) into ubiquitin (UB) and green fluorescent protein (GFP) as a case study (Scheme 1A) and found supplying the E. coli cell cultures with the methylester forms of the UAAs would significantly enhance the yields of UAA-incorporated proteins. We also showed that the methylester forms of UAAs are effective in enhancing the incorporation of Nε-Boc-L-Lysine (BocK) and Nε-propargyl-L-Lysine (PrK) into UB and GFP. Our work demonstrates that optimizing the cellular uptake of UAAs by methylester capping could be a useful approach to enhance the yields of UAA-incorporated proteins and make them more accessible for the study of biological problems.

Scheme 1.

Scheme 1.

(A) Incorporating L/D-ThzK in the free acid forms (1a/2a) or in the methylester forms (1b/2b) into ubiquitin (UB). (B) Synthetic scheme for L/D-ThzK-OH 1a/2a and L/D-ThzK-OMe 1b/2b.

Chin and co-workers recently engineered pyrrolysyl (Pyl) tRNA synthetase (RS) from Methanosarcina barkeri (Mb) for the incorporation of L-ThzK UAA. 14 After the incorporation, the thiaprolyl ring in the ThzK side chain can be opened by treatment with methoxyamine to generate a Cys-conjugated Lys side chain. The 1,2-aminiothiol functionality on the Cys residue can be a reactive handle for conjugation with cyano-derivatized probes or with C-terminal thioester of proteins through expressed protein ligation. 1415 The same study showed that D-ThzK can be incorporated into proteins with wild type (wt) Mb PylRS. Subsequently, variants of PylRS were developed that would incorporate δ-thiol-L-Lys into proteins to enable the synthesis of protein conjugates with isopeptide linkages at specific Lys residues. 16 We rationalized that enhancing the incorporation of L/D-ThzK UAA into proteins would enable the synthesis of protein conjugates to study protein posttranslational modification by UB or UB-like proteins. 1718

Instead of preparing ThzK from Nε-Boc-L-Lys-OMe 5, 14 we chose N-Boc-Lys(Cbz)-OH 3, a cheaper starting material to begin our synthesis of the ThzK UAA (Scheme 1B). We first methylated 3 to generate protected Lys methylester 4 and then deprotected Cbz group on 4 by hydrogenation to afford 5. We then coupled 5 with L- or D-form of Boc-thiazolidine-4-carboxylic acid 6 to afford Boc and methylester protected ThzK (7a and 7b). We either removed all the protecting groups on 7a and 7b to generate the free acid forms of L/Z-ThzK-OH (1a and 2a) or only removed the Boc group to generate the methylester forms of ThzK (ThzK-OMe, 1b and 2b). Through this procedure, we achieved an overall yield of 25% for the synthesis of ThzK-OMe. All ThzK UAAs were purified by HPLC and their identities confirmed by ESI and NMR (Supplementary Figures S1S4). Free acid and methylester forms of the UAA were stable after HPLC purification and lyophilization. They could be stored in the 4 °C cold box for weeks with no apparent decomposition.

To test the incorporation efficiency, we first used the engineered Mb L-ThzKRS14 to express UB with L-ThzK incorporated at the K11 position (Scheme 1A). We supplied the cell culture with 1 mM L-ThzK-OH (1a) and found 1 liter of cell culture would yield ~6 mg of UB with L-ThzK incorporated after purifying the 6×His-tagged UB by a Ni-NTA affinity column (Figure 1A). We determined protein concentration in the elution by Bio-Rad protein assay19. We also attempted the incorporation of D-ThzK-OH (2a) into UB with wt Mb PylRS and found a similar yield of UB expression (Figure 1B). The expression and purification of UB with the K11 amber codon in the absence of UAA is shown in Supplementary Figure S5.

Figure 1.

Figure 1.

Incorporating L-ThzK, D-ThzK, Bock, PrK into UB at the K11 position with the free acid or methylester forms of the UAA. (A) UB expression with Mb L-ThzKRS and L-ThzK-OH. The PAGE gel was stained with Coomassie blue to analyze samples from the flow through of the Ni-NTA column (F), once wash by lysis buffer (L), twice by wash buffer (W1 and W2), and twice by elution buffer (E1 and E2). (B) UB expression with wt Mb PyIRS and D-ThzK-OH. (C) UB expression with wt Mb PyIRS and Bock-OH. (D) UB expression with wt Mb PyIRS and Prk-OH. (E) UB expression with Mb L-ThzKRS and L-Thzk-OMe. (F) UB expression with wt Mb PyIRS and D-Thzk-OMe. (G) UB expression with wt Mb PyIRS and Bock-OMe. (H) UB expression with wt Mb PyIRS and Prk-OMe. 1.0 mM of free acid or methylester forms of L/D-Thzk and PrK were used for UB expression in (A), (B), (D), (E), (F), and (H), and 0.1 mM of free acid or methylester forms of Bock UAA was used for UB expression in (C) and (G).

We suspected the L- and D-ThzK-OH in their free carboxylic acid forms could have poor membrane permeability to the E coli cell and so they would not be readily available for protein expression inside the cells. We thus tried methylester forms of L- and D-ThzK-OMe (1b and 2b) for ThzK incorporation into K11 of UB. We supplied 1 mM L-ThzK-OMe to the cell culture to compare the efficiency of incorporation with the same concentration of ThzK-OH with the coexpression of Mb L-ThzKRS. We found we could express 33 mg (± 3.0 mg) of L-ThzK-incorporated UB from 1 liter of culture among three trials (Figure 1E). In contrast, the yield of L-ThzK incorporation into UB with L-ThzK-OH was only 6.0 mg (± 1.5 mg) among three trails (Figure 1A). We found similar improvements when we compared the yields of D-ThzK incorporation into UB with D-ThzK-OMe and D-ThzK-OH in the presence of Mb PylRS. The methyl ester form of UAA yielded 37 mg (± 3.0 mg) of UB from 1 liter of culture, while the free acid form yielded 6.3 mg (± 1.5 mg) of UB (Figure 1B and 1F). The correct incorporation of ThzK into UB using L- or D-ThzK-OMe was confirmed by MALDI mass spectrometry comparing the difference of the molecular weights (MW) of UB with either ThzK or Lys incorporated (Supplementary Figures S6, S7, and S8).

We then tested if the methylester forms of other UAAs such as BocK and PrK would enhance their incorporation into UB with wt Mb PylRS. We found the yield of BocK incorporation into UB had a high yield of more than 50 mg per liter with 1 mM of either BocK-OH or BocK-OMe. However, when we lowered the concentration of the UAA, the methylester form of BocK showed a clear advantage in enhancing the yield of BocK-incorporated proteins. At 0.1 mM, BocK-OMe yielded 14 mg of UB (± 1.5 mg) from 1 liter of culture, while BocK-OH yielded 4.0 mg of UB (± 1.0 mg) (Figure 1C and 1G). When the UAA concentrations were lowered to 0.01 mM, BocK-OMe yielded 10 mg of UB (± 1.5 mg) from 1 liter of culture, while BocK-OH yielded 1.0 mg of UB (± 0.5 mg) (Supplementary Figure S9). The incorporation of BocK into to UB with the methylester form of the UAA was confirmed by MALDI mass spectrometry (Supplementary Figure S10).

We also prepared PrK-OH and PrK-OMe to compare their efficiency of incorporation into UB with wt Mb PylRS. We found 1 mM PrK-OM yielded 31 mg UB (± 1.5 mg) from 1 liter of culture while same concertation of PrK-OH yielded 15 mg UB (± 1.5 mg) from 1 liter of culture (Figure 1D and 1H). The incorporation of PrK into UB with the methylester form of the UAA was confirmed by MALDI mass spectrometry (Supplementary Figure S11). Thus, the methylester form of PrK also enhanced the yield of its incorporation into proteins.

Besides UB, we used sfGFP as a model system to compare the yields of protein expression with free acid or methylester forms of the UAAs. After the induction of sfGFP expression, we analyzed the yields of protein expression by polyacrylamide gel electrophoresis (PAGE) and Coomassie blue staining of the gels (Figure 2). The incorporation efficiency of either L- or D-ThzK-OH into position 151 of sfGFP at 2 mM of the UAA was low when wt PylRS of the Methanosarcina mazei (Mm) or Mb origin were used for protein expression (Figure 2A and 2B). When we used the engineered L-ThzKRS of Mb origin for incorporation, the yield of L/D-ThzK-OH incorporation into GFP was still low as judged by the weak bands of sfGFP on the PAGE gel (Figure 2C). When 2 mM BocK-OH was used, there is a decent yield of sfGFP with the coexpression of Mm or Mb PylRS (Figure 2A and 2B). BocK UAA is known for its high efficiency of incorporation into various proteins by wt PylRS, so such a result was expected. 20 Correspondingly, the sfGFP fluorescence from the cell culture showed high intensity with BocK incorporation into sfGFP by wt Mm or Mb PylRS. In contrast, the fluorescence intensities associated with L- or D-ThzK-OH incorporation were low with either wt PylRS or engineered L-ThzKRS (Figure 2D).

Figure 2.

Figure 2.

Comparing the yield of BocK and ThzK incorporation into sfGFP with wt Mm and Mb PyIRS and ThzKRS. (A) Using wt Mm PyIRS to incorporate various forms of UAA into GFP. Efficiency of incorporation was evaluated based on the intensity of the GFP band on the PAGE gel analyzing the crude cell lysates. (B) Using wt Mb PyIRS to incorporate various forms of UAA into GFP. (C) Using Mb L-ThzKRS to measure the yield of GFP expression with various UAA. (D) The intensities of fluorescence signals from cells expressing GFP using various UAA and tRNA synthetase pairs. The results are the average of three trials. In (A) to (D), 2 mM of UAAs were used for protein expression, (−) sign designates the control expression in absence of the UAAs, and M is the protein marker.

When we performed ThzK incorporation into sfGFP with 2 mM L- or D-ThzK-OMe, we found higher sfGFP expression with the presence of ThzKRS compared to the cell cultures with the free acid forms of L/D-ThzK as suggested by the Coomassie blue staining of the PAGE gel of the crude cell lysates (Figure 2C). sfGFP fluorescence from the cell culture with L- or D-ThzK-OMe was also significantly higher than the corresponding culture using the free acids of UAA (Figure 2D). Out of three trials, L-ThzK-OMe gave 2-fold stronger fluorescence signal than the free acid with either wt Mb or Mm PylRS or engineered ThzKRS for incorporation, and D-ThzK-OMe gave 3–5-fold stronger fluorescence signal than the corresponding free acid with wt or engineered PylRS.

BocK-OH and BocK-OMe did not show as large a difference in the yields of sfGFP expression when they were supplied at high concentrations (2 mM) in the cell culture. We thus varied the concentrations of the two forms of BocK UAA in the lower range of concentrations for sfGFP expression in the presence of wt Mb PylRS. We found the yield of sfGFP expression with BocK-OMe was at least 4-fold higher than with Boc-OH when the UAA concentrations were varied between 0.01 and 0.075 mM (Supplementary Figure S12A). When the BocK UAA concentrations were greater than 0.1 mM, the difference in sfGFP expression became less, but still, the methylester form of BocK gave a higher yield of sfGFP than the free acid form up to 0.5 mM of the UAAs (Supplementary Figure S12B). The difference in the yields of sfGFP with BocK-OH and BocK-OMe was confirmed by PAGE analysis of the cell lysates (Supplementary Figure S12C). These results suggest that for a UAA such as BocK, which can be incorporated into proteins with high efficiency in a free acid form, the same yield of protein can be achieved with a lower concentration of the UAA in the methylester form. Thus, UAA material can be saved for large scale protein expressions by being supplied as a methylester. This could be an advantage for using the methylester forms of the UAAs for incorporation since many UAAs are not readily available and require multistep synthesis to prepare.

Our work showed that the yield of L/D-ThzK incorporation into UB could be increased by as much as 6-fold with the use of the methylester form of the UAA. We also observed a similar enhancement in yields of UAA-incorporated proteins with the methylester forms of BocK and PrK comparing to their free acid forms. With the use of the same tRNA synthetases and E coli strains for incorporation, the enhancement of the yield is likely due to the improved membrane permeability of the methylester forms of the UAA to E coli cells comparing to the free acid forms. Previously Wang and co-workers demonstrated the beneficial effect of the acetoxymethyl ester form of a Tyr UAA analog in enhancing its incorporation into GFP in mammalian HEK293 cells. 21 Liu’s group reported the use of methylester form of L-ThzK to achieve good yield of UB expression in E. coli cells. 2223 Here we carried out a detailed comparison of the yields of L- and D-ThzK incorporation between the methylester and free acid forms of the UAA and found ThzK methylester could enhance UAA incorporation by ~6 fold. We also found the incorporation of BocK and Prk UAA were benefitted from using their methylester forms. We thus suggest UAA methylesters could enhance the yield of UAA-incorporated proteins and expand their use as chemical probes with designed functionalities.

Supplementary Material

1

Acknowledgments

This work was supported by grants from NSF (1420193 to J.Y. and 1710460 to J.Y. and T.A.C.) and NIH (R01GM104498 to J.Y.). We thank Jason Chin of Medical Research Council Laboratory of Molecular Biology, Cambridge, UK, for kindly sharing the Mb L-ThzKRS plasmid and helpful discussions.

Footnotes

Supplementary data

Supplementary data (experimental procedures for UAA synthesis and protein expression) associated with this article can be found online.

Declaration of interests

☒ The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

☐The authors declare the following financial interests/personal relationships which may be considered as potential competing interests:

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Lang K; Chin JW, Cellular incorporation of unnatural amino acids and bioorthogonal labeling of proteins. Chem Rev 2014, 114 (9), 4764–806. [DOI] [PubMed] [Google Scholar]
  • 2.Liu CC; Schultz PG, Adding new chemistries to the genetic code. Annu Rev Biochem 2010, 79, 413–44. [DOI] [PubMed] [Google Scholar]
  • 3.Wang L; Schultz PG, Expanding the genetic code. Angewandte Chemie (International ed. in English) 2004, 44 (1), 34–66. [DOI] [PubMed] [Google Scholar]
  • 4.Nguyen TA; Cigler M; Lang K, Expanding the Genetic Code to Study Protein-Protein Interactions. Angewandte Chemie (International ed. in English) 2018, 57 (44), 14350–14361. [DOI] [PubMed] [Google Scholar]
  • 5.Dumas A; Lercher L; Spicer CD; Davis BG, Designing logical codon reassignment - Expanding the chemistry in biology. Chem Sci 2015, 6 (1), 50–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Chen H; Venkat S; McGuire P; Gan Q; Fan C, Recent Development of Genetic Code Expansion for Posttranslational Modification Studies. Molecules (Basel, Switzerland) 2018, 23 (7). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Pham ND; Parker RB; Kohler JJ, Photocrosslinking approaches to interactome mapping. Current opinion in chemical biology 2013, 17 (1), 90–101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Lee KJ; Kang D; Park HS, Site-Specific Labeling of Proteins Using Unnatural Amino Acids. Molecules and cells 2019, 42 (5), 386–396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Wu N; Deiters A; Cropp TA; King D; Schultz PG, A genetically encoded photocaged amino acid. Journal of the American Chemical Society 2004, 126 (44), 14306–7. [DOI] [PubMed] [Google Scholar]
  • 10.Huguenin-Dezot N; Alonzo DA; Heberlig GW; Mahesh M; Nguyen DP; Dornan MH; Boddy CN; Schmeing TM; Chin JW, Trapping biosynthetic acyl-enzyme intermediates with encoded 2,3-diaminopropionic acid. Nature 2019, 565 (7737), 112–117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Chin JW, Expanding and reprogramming the genetic code of cells and animals. Annual review of biochemistry 2014, 83, 379–408. [DOI] [PubMed] [Google Scholar]
  • 12.Wang L, Genetically encoding new bioreactivity. New biotechnology 2017, 38 (Pt A), 16–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Wang L, Engineering the Genetic Code in Cells and Animals: Biological Considerations and Impacts. Acc Chem Res 2017, 50 (11), 2767–2775. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Nguyen DP; Elliott T; Holt M; Muir TW; Chin JW, Genetically encoded 1,2-aminothiols facilitate rapid and site-specific protein labeling via a bio-orthogonal cyanobenzothiazole condensation. J Am Chem Soc 2011, 133 (30), 11418–21. [DOI] [PubMed] [Google Scholar]
  • 15.Li X; Fekner T; Ottesen JJ; Chan MK, A pyrrolysine analogue for site-specific protein ubiquitination. Angewandte Chemie (International ed. in English) 2009, 48 (48), 9184–7. [DOI] [PubMed] [Google Scholar]
  • 16.Virdee S; Kapadnis PB; Elliott T; Lang K; Madrzak J; Nguyen DP; Riechmann L; Chin JW, Traceless and site-specific ubiquitination of recombinant proteins. J Am Chem Soc 2011, 133 (28), 10708–11. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Braxton CN; Quartner E; Pawloski W; Fushman D; Cropp TA, Ubiquitin Chains Bearing Genetically Encoded Photo-Cross-Linkers Enable Efficient Covalent Capture of (Poly)ubiquitin-Binding Domains. Biochemistry 2019, 58 (7), 883–886. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Castaneda C; Liu J; Chaturvedi A; Nowicka U; Cropp TA; Fushman D, Nonenzymatic assembly of natural polyubiquitin chains of any linkage composition and isotopic labeling scheme. J Am Chem Soc 2011, 133 (44), 17855–68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Bradford MM, A rapid and sensitive method for the quantitation of microgram quantities of protein utilizing the principle of protein-dye binding. Anal Biochem 1976, 72, 248–54. [DOI] [PubMed] [Google Scholar]
  • 20.Mukai T; Kobayashi T; Hino N; Yanagisawa T; Sakamoto K; Yokoyama S, Adding l-lysine derivatives to the genetic code of mammalian cells with engineered pyrrolysyl-tRNA synthetases. Biochem Biophys Res Commun 2008, 371 (4), 818–22. [DOI] [PubMed] [Google Scholar]
  • 21.Takimoto JK; Xiang Z; Kang JY; Wang L, Esterification of an unnatural amino acid structurally deviating from canonical amino acids promotes its uptake and incorporation into proteins in mammalian cells. ChemBioChem 2010, 11 (16), 2268–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Bi X; Pasunooti KK; Lescar J; Liu CF, Thiazolidine-Masked alpha-Oxo Aldehyde Functionality for Peptide and Protein Modification. Bioconjug Chem 2017, 28 (2), 325–329. [DOI] [PubMed] [Google Scholar]
  • 23.Bi X; Pasunooti KK; Tareq AH; Takyi-Williams J; Liu CF, Genetic incorporation of 1,2-aminothiol functionality for site-specific protein modification via thiazolidine formation. Organic & biomolecular chemistry 2016, 14 (23), 5282–5. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES